Fast Multipole Methods for Incompressible Flow Simulation

Similar documents
Partial Differential Equations. Examples of PDEs

Fast Multipole Methods

Fast Multipole Methods for The Laplace Equation. Outline

19 th INTERNATIONAL CONGRESS ON ACOUSTICS MADRID, 2-7 SEPTEMBER 2007 HIGH FREQUENCY ACOUSTIC SIMULATIONS VIA FMM ACCELERATED BEM

Fast Multipole Methods: Fundamentals & Applications. Ramani Duraiswami Nail A. Gumerov

Course Requirements. Course Mechanics. Projects & Exams. Homework. Week 1. Introduction. Fast Multipole Methods: Fundamentals & Applications

An adaptive fast multipole boundary element method for the Helmholtz equation

Multipole-Based Preconditioners for Sparse Linear Systems.

General Solution of the Incompressible, Potential Flow Equations

A Fast, Parallel Potential Flow Solver

FastMMLib: a generic Fast Multipole Method library. Eric DARRIGRAND. IRMAR Université de Rennes 1

Fast Multipole BEM for Structural Acoustics Simulation

A Fast N-Body Solver for the Poisson(-Boltzmann) Equation

Efficient calculation for evaluating vast amounts of quadrupole sources in BEM using fast multipole method

Lecture 11: CMSC 878R/AMSC698R. Iterative Methods An introduction. Outline. Inverse, LU decomposition, Cholesky, SVD, etc.

A Robust Preconditioned Iterative Method for the Navier-Stokes Equations with High Reynolds Numbers

Fast multipole boundary element method for the analysis of plates with many holes

An implementation of the Fast Multipole Method without multipoles

The adaptive cross approximation accelerated boundary element method for bubble dynamics

Green s Functions, Boundary Integral Equations and Rotational Symmetry

Fast Multipole Methods: Fundamentals & Applications. Ramani Duraiswami Nail A. Gumerov

On the choice of abstract projection vectors for second level preconditioners

Transactions on Modelling and Simulation vol 18, 1997 WIT Press, ISSN X

Fluid Dynamics: Theory, Computation, and Numerical Simulation Second Edition

Continuum Mechanics Lecture 5 Ideal fluids

Lecture 8: Boundary Integral Equations

Open-source finite element solver for domain decomposition problems

Boundary Value Problems - Solving 3-D Finite-Difference problems Jacob White

APPROXIMATING GAUSSIAN PROCESSES

Preconditioners for the incompressible Navier Stokes equations

Domain decomposition on different levels of the Jacobi-Davidson method

ME equations. Cylindrical symmetry. Bessel functions 1 kind Bessel functions 2 kind Modifies Bessel functions 1 kind Modifies Bessel functions 2 kind

A far-field based T-matrix method for three dimensional acoustic scattering

Fluid Animation. Christopher Batty November 17, 2011

Linear Solvers. Andrew Hazel

A Solenoidal Basis Method For Efficient Inductance Extraction Λ

Solving Symmetric Indefinite Systems with Symmetric Positive Definite Preconditioners

arxiv: v2 [math.na] 1 Oct 2016

Karhunen-Loève Approximation of Random Fields Using Hierarchical Matrix Techniques

Solving Large Nonlinear Sparse Systems

SPARSE SOLVERS POISSON EQUATION. Margreet Nool. November 9, 2015 FOR THE. CWI, Multiscale Dynamics

ABSTRACT MATRICES WITH APPLICATIONS TO THE FAST MULTIPOLE METHOD. Zhihui Tang, Doctor of Philosophy, 2004

Multigrid absolute value preconditioning

arxiv: v1 [math.na] 28 Apr 2011

Computational Methods CMSC/AMSC/MAPL 460. Vectors, Matrices, Linear Systems, LU Decomposition, Ramani Duraiswami, Dept. of Computer Science

FEM-FEM and FEM-BEM Coupling within the Dune Computational Software Environment

A Domain Decomposition Based Jacobi-Davidson Algorithm for Quantum Dot Simulation

Nested Krylov methods for shifted linear systems

Finite and Boundary Element Methods in Acoustics

Lecture 18 Classical Iterative Methods

Breaking Computational Barriers: Multi-GPU High-Order RBF Kernel Problems with Millions of Points

9.1 Preconditioned Krylov Subspace Methods

Week 2 Notes, Math 865, Tanveer

A Crash Course on Fast Multipole Method (FMM)

INTRODUCTION TO FINITE ELEMENT METHODS ON ELLIPTIC EQUATIONS LONG CHEN

A Fast Solver for the Stokes Equations. Tejaswin Parthasarathy CS 598APK, Fall 2017

A dual BIE approach for large-scale modelling of 3-D electrostatic problems with the fast multipole boundary element method

PDE Solvers for Fluid Flow

Spherical harmonics based aggregation in the multilevel fast multipole algorithm

Master Thesis Literature Review: Efficiency Improvement for Panel Codes. TU Delft MARIN

Rapid evaluation of electrostatic interactions in multi-phase media

Fast solvers for steady incompressible flow

An Overview of Fluid Animation. Christopher Batty March 11, 2014

Efficient Augmented Lagrangian-type Preconditioning for the Oseen Problem using Grad-Div Stabilization

Fast algorithms for dimensionality reduction and data visualization

AMS526: Numerical Analysis I (Numerical Linear Algebra for Computational and Data Sciences)

SOLVING SPARSE LINEAR SYSTEMS OF EQUATIONS. Chao Yang Computational Research Division Lawrence Berkeley National Laboratory Berkeley, CA, USA

MATH 590: Meshfree Methods

OUTLINE ffl CFD: elliptic pde's! Ax = b ffl Basic iterative methods ffl Krylov subspace methods ffl Preconditioning techniques: Iterative methods ILU

Review of Some Fast Algorithms for Electromagnetic Scattering

Mathematics and Computer Science

Fast numerical methods for solving linear PDEs

AN INDEPENDENT LOOPS SEARCH ALGORITHM FOR SOLVING INDUCTIVE PEEC LARGE PROBLEMS

Kinematics of fluid motion

(1) u i = g(x i, x j )q j, i = 1, 2,..., N, where g(x, y) is the interaction potential of electrostatics in the plane. 0 x = y.

APPLICATION OF THE SPARSE CARDINAL SINE DECOMPOSITION TO 3D STOKES FLOWS

NUMERICAL SIMULATION OF THE FLOW AROUND A SQUARE CYLINDER USING THE VORTEX METHOD

LEAST-SQUARES FINITE ELEMENT MODELS

Fast Iterative Solution of Saddle Point Problems

2 CAI, KEYES AND MARCINKOWSKI proportional to the relative nonlinearity of the function; i.e., as the relative nonlinearity increases the domain of co

Point Vortices in a Periodic Box

Hybrid Cross Approximation for the Electric Field Integral Equation

MATH 590: Meshfree Methods

ELECTROMAGNETIC SCATTERING BY MIXED CONDUCTING/DIELECTRIC OBJECTS USING HIGHER-ORDER MOM

Is My CFD Mesh Adequate? A Quantitative Answer

ON THE GENERALIZED DETERIORATED POSITIVE SEMI-DEFINITE AND SKEW-HERMITIAN SPLITTING PRECONDITIONER *

Preconditioning. Noisy, Ill-Conditioned Linear Systems

Department of Computer Science, University of Illinois at Urbana-Champaign

Vortex motion. Wasilij Barsukow, July 1, 2016

AA214B: NUMERICAL METHODS FOR COMPRESSIBLE FLOWS

VORTEX METHOD APPLICATION FOR AERODYNAMIC LOADS ON ROTOR BLADES

FastBEM Acoustics. Verification Manual , Advanced CAE Research, LLC (ACR) Cincinnati, Ohio, USA All Rights Reserved

FEM/FMBEM coupling for acoustic structure interaction and acoustic design sensitivity analysis with sound-absorbing materials

An alternative approach to integral equation method based on Treftz solution for inviscid incompressible flow

AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 23: GMRES and Other Krylov Subspace Methods; Preconditioning

2.29 Numerical Fluid Mechanics Fall 2011 Lecture 7

Computational Methods CMSC/AMSC/MAPL 460. Linear Systems, Matrices, LU Decomposition, Ramani Duraiswami, Dept. of Computer Science

Finding Rightmost Eigenvalues of Large, Sparse, Nonsymmetric Parameterized Eigenvalue Problems

Domain decomposition for the Jacobi-Davidson method: practical strategies

An H-LU Based Direct Finite Element Solver Accelerated by Nested Dissection for Large-scale Modeling of ICs and Packages

Transcription:

Fast Multipole Methods for Incompressible Flow Simulation Nail A. Gumerov & Ramani Duraiswami Institute for Advanced Computer Studies University of Maryland, College Park Support of NSF awards 0086075 and 0219681 is gratefully acknowledged

Fast Multipole Methods Complex geometric features (e.g., aircraft, submarine, or turbine geometries), physics (high <, wake-structure interactions etc.) all tend to increase problem sizes Many simulations involve several million variables Most large problems boil down to solution of linear systems or performing several matrix-vector products Regular product requires O(N 2 ) time and O(N 2 ) memory The FMM is a way to accelerate the products of particular dense matrices with vectors Do this using O(N) memory FMM achieves product in O(N) or O(N log N) time and memory Combined with iterative solution methods, can allow solution of problems hitherto unsolvable

Fast Multipole Methods To speed up a matrix vector product i Φ ij u i = v j Key idea: Let Φ ij =φ( x i, y j ) Translate elements corresponding to different x i to common location, x * φ (x i,y j )= l=1 p β l ψ l (x i -x * ) Ψ l (y j -x * ) Achieves a separation of variables by translation The number of terms retained, p, is only related to accuracy v j = j=1n u i l=1 p β l ψ l (x i ) Ψ l (y j ) = l=1 p β l Ψ l (y j ) i=1n u i ψ l (x i ) Can evaluate p sums over i A l = j=1n u i ψ l (x i ) Requires Np operations Then evaluate an expression of the type S(x i )= l=1 p A l β l ψ l (x i ) i=1,,m

Outline Incompressible Fluid problems where FMM can be used Boundary integral and particle formulations Potential flow Stokes Flow Vortex Element Methods Component of Navier-Stokes Solvers via Generalized Helmholtz decomposition Multi-sphere simulations of multiphase flow Laplace Stokes Helmholtz Numerical Analysis issues in developing efficient FMM Towards black-box FMM solvers

Basic Equations FMM & Fluid Mechanics n 2 1

Helmholtz Decomposition Key to integral equation and particle methods

Potential Flow Crum, 1979 Knowledge of the potential is sufficient to compute velocity and pressure Need a fast solver for the Laplace equation Applications panel methods for subsonic flow, water waves, bubble dynamics, Boschitsch et al, 1999

BEM/FMM Solution Laplace s Equation Lohse, 2002 Jaswon/Symm (60s) Hess & Smith (70s), Korsmeyer et al 1993, Epton & Dembart 1998, Boschitsch & Epstein 1999

Stokes Flow Green s function (Ladyzhenskaya 1969, Pozrikidis 1992) Motion of spermatozoa Cummins et al 1988 Integral equation formulation Stokes flow simulations remain a very important area of research MEMS, bio-fluids, emulsions, etc. BEM formulations (Tran-Cong & Phan-Thien 1989, Pozrikidis 1992) FMM (Kropinski 2000 (2D), Power 2000 (3D)) Cherax quadricarinatus. MEMS force calculations (Aluru & White, 1998

Rotational Flows and VEM For rotational flows Vorticity released at boundary layer or trailing edge and advected with the flow Simulated with vortex particles Especially useful where flow is mostly irrotational Fast calculation of Biot-Savart integrals (x 2,y 2,z 2 ) l n Γ (circulation strength) x Evaluation point (x 1,y 1,z 1 ) (Far field) (where y is the mid point of the filament)

Vorticity formulations of NSE Problems with boundary conditions for this equation (see e.g., Gresho, 1991) Divergence free and curl-free components are linked only by boundary conditions Splitting is invalid unless potentials are consistent on boundary Recently resolved by using the generalized Helmholtz decomposition (Kempka et al, 1997; Ingber & Kempka, 2001) This formulation uses a kinematically consistent Helmholtz decomposition in terms of boundary integrals When widely adopted will need use of boundary integrals, and hence the FMM Preliminary results in Ingber & Kempka, 2001

Generalized Helmholtz Decomposition Helmholtz decomposition leaves too many degrees of freedom Way to achieve decomposition valid on boundary and in domain, with consistent values is to use the GHD D is the domain dilatation (zero for incompressible flow) Requires solution of a boundary integral equation as part of the solution => role for the FMM in such formulations

Multi Sphere Problems So far we have seen FMM for boundary integral problems Can also use the FMM in problems involving many spheres Simulations of multiphase flow, effective media (porous media), dusty gases, slurries, and the like Key is a way to enforce proper boundary/continuity conditions on the spheres Translation theorems provide an ideal way to do this With FMM one can easily simulate 10 4-10 5 particles on desktops and 10 6-10 7 particles on supercomputers Sangani & Bo (1996), Gumerov & Duraiswami (2002,2003) Stokesian Dynamics (Brady/Bossis 1988; Kim/Karilla 1991)

Fast Multipole Methods Matrix-Vector Multiplication Middleman and Single Level Methods Multilevel FMM (MLFMM) Adaptive MLFMM Fast Translations Some computational results Conclusions

Iterative Methods To solve linear systems of equations; Simple iteration methods; Conjugate gradient or similar methods; We use Krylov subspace methods (GMRES): Preconditioners; Research is ongoing. Efficiency of these methods depends on efficiency of the matrix-vector multiplication.

Matrix-Vector Multiplication

FMM Works with Influence Matrices

Examples of Influence Matrices exp

Complexity of Standard Method Standard algorithm Sources N M Evaluation Points Total number of operations: O(NM)

Goal of FMM Reduce complexity of matrixvector multiplication (or field evaluation) by trading exactness for speed Evaluate to arbitrary accuracy

Five Key Stones of FMM Factorization Error Bound Translation Space Partitioning Data Structure

Degenerate Kernel: Factorization O(pN) operations: O(pM) operations: Total Complexity: O(p(N+M))

Factorization (Example) Scalar Product in d-dimensional space: O(dM) operations: O(dN) operations Total Complexity: O(d(N+M))

Middleman Algorithm Standard algorithm Sources Evaluation Points Middleman algorithm Sources Evaluation Points N M N M Total number of operations: O(NM) Total number of operations: O(N+M)

Non-Degenerate Kernel: Factorization Truncation Number Error Bound: Middleman Algorithm Applicability:

Factorization Problem: Usually there is no factorization available that provides a uniform approximation of the kernel in the entire computational domain.

Far and Near Field Expansions Far Field: Near Field: Far Field R c x i - x * r c x i - x * x i Ω Ω Near Field R S x * y y S: Singular (also Multipole, Outer Far Field ), R: Regular (also Local, Inner Near Field )

Example of S and R expansions (3D Laplace) z O θ φ r r y x Spherical Coordinates: Spherical Harmonics:

S and R Expansions (3D Helmholtz) z O θ φ r r y x Spherical Coordinates: Spherical Bessel Functions Spherical Hankel Functions

Idea of a Single Level FMM Standard algorithm Sources Evaluation Points SLFMM Sources Evaluation L groups Points K groups N M N M Total number of operations: O(NM) Needs Translation! Total number of operations: O(N+M+KL)

Space Partitioning Potentials due to sources in these spatial domains Φ (n) 1 (y) Φ (n) 2 (y) Φ (n) 3 (y) Ε 1 Ε 2 Ε 3 n n n

Single Level Algorithm 1. For each box build Far Field representation of the potential due to all sources in the box. 2. For each box build Near Field representation of the potential due to all sources outside the neighborhood. 3. Total potential for evaluation points belonging to this box is a direct sum of potentials due to sources in its neighborhood and the Near Field expansion of other sources near the box center.

The SLFMM requires S R-translation:

S R-translation Also Far-to-Local, Outer-to-Inner, Multipole-to-Local Ω 2i x *2 y R R 2 S R c x i - x *1 (S R) x *1 Ω 1 i x i Ω 1i R 2 = min{ x *2 - x *1 -R c x i - x *1,r c x i - x *2 }

S R-translation Operator S R-Translation Coefficients S R-Translation Matrix

S R-translation Operators for 3D Laplace and Helmholtz equations

Complexity of SLFMM We have p 2 terms and P(p) translation cost: F(K) For M~N: 0 K opt Additional Middleman complexity Κ

Idea of Multilevel FMM Source Data Hierarchy Evaluation Data Hierarchy S S S R N R R M Level 5 Level 2 Level 3 Level 4 Level 2 Level 3 Level 4 Level 5

Complexity of MLFMM Definitions: Upward Pass: Going Up on SOURCE Hierarchy Downward Pass: Going Down on EVALUATION Hierarchy Not factorial!

Hierarchical Spatial Domains Ε 1 Ε 2 Ε 3 Ε 4

Upward Pass. Step 1. S-expansion S-expansion valid in Ω Ε 3 Ω y y Ω x i x c (n,l) x * R x i S-expansion valid in E 3 (n,l)

Upward Pass. Step 2. S S-translation. Build potential for the parent box (find its S-expansion).

Downward Pass. Step 1. S R-translation Level 2: Level 3:

Downward Pass. Step 2. R R-translation

Final Summation Contribution of near sources (calculated directly) Contribution of far sources (represented by R-expansion) y j

Adaptive MLFMM Goal of computations:

Idea of Adaptation Source Point Evaluation Point 3 2 2 2 3 2 2 3 2 2 Box Level 3 4 3 Each evaluation box in this picture contains not more than 3 sources in the neighborhood. Very important for particle methods with concentrations of particles 2

We have implemented MLFMM General MLFMM shell software: Arbitrary dimensionality; Variable sizes of neighborhoods; Variable clustering parameter; Regular and Adaptive versions; User Specified basis functions, and translation operators; Efficient data structures using bit interleaving; Technical Report #1 is available online (visit the authors home pages). http://www.umiacs.umd.edu/~ramani/pubs/umiacs-tr-2003-28.pdf

Optimal choice of the clustering parameter is important 100 Max Level=8 5 65536 3 CPU Time (s) 10 1 7 16384 6 4096 7 6 5 6 4 5 4 3 2 1 5 2 0.1 N=1024 3 4 Regular Mesh, d=2 1 10 100 1000 Number of Points in the Smallest Box

Speed of computation 1000 Straightforward CPU Time (s) 100 10 1 y=cx 2 y=bx FMM (s=4) Setting FMM FMM/(a*log(N)) 0.1 Middleman Regular Mesh, d=2, k=1, Reduced S R 0.01 1000 10000 100000 1000000 Number of Points

Error analysis

Complexity of Translation For 3D Laplace and Helmholtz series have p 2 terms; Translation matrices have p 4 elements; Translation performed by direct matrix-vector multiplication has complexity O(p 4 ); Can be reduced to O(p 3 ); Can be reduced to O(p 2 log 2 p); Can be reduced to O(p 2 ) (?).

Rotation-Coaxial Translation Decomposition Yields O(p 3 ) Method z z p 4 y x y x p3 p 3 p 3 y y z x x Coaxial Translation Rotation z

FFT-based Translation for 3D Laplace Equation Yields O(p 2 log 2 p) Method Translation Matrices are Toeplitz and Hankel Multiplication can be performed using FFT (Elliot & Board 1996; Tang et al 2003)

Sparse Matrix Decompositions Can Result in O(p 2 ) Methods Laplace and Helmholtz: Helmholtz 3D: D is a sparse matrix

Interaction of Multiple Spherical Particles Helmholtz (acoustical scattering) Laplace (potential incompressible flow) k 0

Multipole Solution 1) Reexpand solution near the center of each sphere, and satisfy boundary conditions 2) Solve linear system to determine the expansion coefficients

Computational Methods Used Computable on 1 GHz, 1 GB RAM Desktop PC MLFMM Method Multipole Iterative Multipole Straightforward BEM 10 0 10 1 10 2 10 3 10 4 10 5 Number of Spheres 10 6

Comparisons with BEM HRTF (db) 12 9 6 3 0-3 60 o 30 o θ 1 = 0 o 90 o 60 180 o BEM MultisphereHelmholtz 30 o o 90 o 120 o -6 120 o 150 o -9 150 o Three Spheres, ka 1 =3.0255. -12-180 -90 0 90 180 Angle φ 1 (deg) BEM discretization with 5400 elements

Convergence of the Iterative Procedure M ax Absolute Error 1.E+02 1.E+01 1.E+00 1.E-01 1.E-02 Iterations with Reflection Method 3D Helmholtz Equation, MLFMM 100 Spheres ka = 4.8 1.E-03 1.6 2.8 1.E-04 0 5 10 15 20 25 30 35 Iteration # For Laplace equation convergence is much faster

Multiple Scattering from 100 spheres ka=1.6 ka=2.8 ka=4.8 FMM also used here for visualization

Various Configurations 343 spheres in a regular grid 1000 randomly placed spheres

Truncation Errors

Some Observations for Laplace Equation (limit at small k) Relative errors below 1% can be achieved even for p~1; Reflection method converges very fast; Number of iterations can be 2-10 for very high accuracy. CPU time on 1GHz, 1GB RAM PC for 1000 spheres ~ 1 min.

Some projects of students in our course related to incompressible flow 1). Jun Shen (Mechanical Engineering, UMD): 2D simulation of vortex flow (particle method + MLFMM)

Project 2- Calculate vortex induced forces 2). Jayanarayanan Sitaraman (Alfred Gessow Rotorcraft Center, UMD): Fast Multipole Methods For 3-D Biot-Savart Law Calculations (free vortex methods) Error, M=3200, N=3200 10-5 10-6 p 6 7 Break-even point: MLFMM is faster then straightforward for N=M>1300

Conclusions We developed a shell and framework for MLFMM which can be used for many problems, including simulations of potential and vortex flows and multiparticle motion. MLFMM shows itself as very efficient method for solution of fluid dynamics problems; MLFMM enables computation of large problems (sometimes even on desktop PC); Research is ongoing. Problems: Efficient choice of parameters for MLFMM; Efficient translation algorithms; Efficient iterative procedures; More work on error bounds is needed; Etc. CSCAMM Meeting on FMM in November 2003

Thank You!