An advanced ILU preconditioner for the incompressible Navier-Stokes equations

Similar documents
Preconditioners for the incompressible Navier Stokes equations

A comparison of preconditioners for incompressible Navier Stokes solvers

DELFT UNIVERSITY OF TECHNOLOGY

The solution of the discretized incompressible Navier-Stokes equations with iterative methods

Efficient Solvers for the Navier Stokes Equations in Rotation Form

Fast Iterative Solution of Saddle Point Problems

A Robust Preconditioned Iterative Method for the Navier-Stokes Equations with High Reynolds Numbers

Efficient Augmented Lagrangian-type Preconditioning for the Oseen Problem using Grad-Div Stabilization

Mathematics and Computer Science

9.1 Preconditioned Krylov Subspace Methods

A simple iterative linear solver for the 3D incompressible Navier-Stokes equations discretized by the finite element method

A decade of fast and robust Helmholtz solvers

Scientific Computing with Case Studies SIAM Press, Lecture Notes for Unit VII Sparse Matrix

Preconditioning Techniques for Large Linear Systems Part III: General-Purpose Algebraic Preconditioners

OUTLINE ffl CFD: elliptic pde's! Ax = b ffl Basic iterative methods ffl Krylov subspace methods ffl Preconditioning techniques: Iterative methods ILU

Comparison of Fixed Point Methods and Krylov Subspace Methods Solving Convection-Diffusion Equations

A Review of Preconditioning Techniques for Steady Incompressible Flow

ANALYSIS OF AUGMENTED LAGRANGIAN-BASED PRECONDITIONERS FOR THE STEADY INCOMPRESSIBLE NAVIER-STOKES EQUATIONS

Fast solvers for steady incompressible flow

DELFT UNIVERSITY OF TECHNOLOGY

AMS526: Numerical Analysis I (Numerical Linear Algebra for Computational and Data Sciences)

Department of Computer Science, University of Illinois at Urbana-Champaign

AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 23: GMRES and Other Krylov Subspace Methods; Preconditioning

Preface to the Second Edition. Preface to the First Edition

DELFT UNIVERSITY OF TECHNOLOGY

Splitting Iteration Methods for Positive Definite Linear Systems

Unsteady Incompressible Flow Simulation Using Galerkin Finite Elements with Spatial/Temporal Adaptation

ITERATIVE METHODS FOR SPARSE LINEAR SYSTEMS

Iterative Methods and Multigrid

DELFT UNIVERSITY OF TECHNOLOGY

The amount of work to construct each new guess from the previous one should be a small multiple of the number of nonzeros in A.

c 2011 Society for Industrial and Applied Mathematics

Lecture 18 Classical Iterative Methods

ON THE ROLE OF COMMUTATOR ARGUMENTS IN THE DEVELOPMENT OF PARAMETER-ROBUST PRECONDITIONERS FOR STOKES CONTROL PROBLEMS

SOLVING SPARSE LINEAR SYSTEMS OF EQUATIONS. Chao Yang Computational Research Division Lawrence Berkeley National Laboratory Berkeley, CA, USA

On the Preconditioning of the Block Tridiagonal Linear System of Equations

Numerical behavior of inexact linear solvers

On the accuracy of saddle point solvers

Solving Large Nonlinear Sparse Systems

An Efficient Low Memory Implicit DG Algorithm for Time Dependent Problems

Iterative methods for Linear System

A TAXONOMY AND COMPARISON OF PARALLEL BLOCK MULTI-LEVEL PRECONDITIONERS FOR THE INCOMPRESSIBLE NAVIER STOKES EQUATIONS

Incomplete LU Preconditioning and Error Compensation Strategies for Sparse Matrices

Solving Ax = b, an overview. Program

A Taxonomy and Comparison of Parallel Block Multi-level Preconditioners for the Incompressible Navier Stokes Equations

Contents. Preface... xi. Introduction...

Finding Rightmost Eigenvalues of Large, Sparse, Nonsymmetric Parameterized Eigenvalue Problems

Iterative Methods for Sparse Linear Systems

Solving Symmetric Indefinite Systems with Symmetric Positive Definite Preconditioners

FEniCS Course. Lecture 6: Incompressible Navier Stokes. Contributors Anders Logg André Massing

AN AUGMENTED LAGRANGIAN APPROACH TO LINEARIZED PROBLEMS IN HYDRODYNAMIC STABILITY

Iterative methods for Linear System of Equations. Joint Advanced Student School (JASS-2009)

Iterative Solution methods

Iterative Methods for Linear Systems of Equations

Preconditioning Techniques Analysis for CG Method

1. Introduction. Consider the Navier Stokes equations ηu t ν 2 u + (u grad) u + grad p = f div u = 0 (1.1)

The flexible incomplete LU preconditioner for large nonsymmetric linear systems. Takatoshi Nakamura Takashi Nodera

IDR(s) A family of simple and fast algorithms for solving large nonsymmetric systems of linear equations

Solving Sparse Linear Systems: Iterative methods

Solving Sparse Linear Systems: Iterative methods

arxiv: v4 [math.na] 1 Sep 2018

Algebraic Multigrid as Solvers and as Preconditioner

M.A. Botchev. September 5, 2014

ETNA Kent State University

Challenges for Matrix Preconditioning Methods

REORDERING EFFECTS ON PRECONDITIONED KRYLOV METHODS IN AMR SOLUTIONS OF FLOW AND TRANSPORT

A parallel block multi-level preconditioner for the 3D incompressible Navier Stokes equations

FEM and Sparse Linear System Solving

DELFT UNIVERSITY OF TECHNOLOGY

Linear Solvers. Andrew Hazel

Lecture 8: Fast Linear Solvers (Part 7)

Adaptive preconditioners for nonlinear systems of equations

c 2006 Society for Industrial and Applied Mathematics

Boundary Value Problems - Solving 3-D Finite-Difference problems Jacob White

7.4 The Saddle Point Stokes Problem

Fine-grained Parallel Incomplete LU Factorization

A DISSERTATION. Extensions of the Conjugate Residual Method. by Tomohiro Sogabe. Presented to

Incomplete Cholesky preconditioners that exploit the low-rank property

NEWTON-GMRES PRECONDITIONING FOR DISCONTINUOUS GALERKIN DISCRETIZATIONS OF THE NAVIER-STOKES EQUATIONS

Stabilization and Acceleration of Algebraic Multigrid Method

Journal of Computational and Applied Mathematics. Optimization of the parameterized Uzawa preconditioners for saddle point matrices

Indefinite and physics-based preconditioning

A Modified SSOR Preconditioner for Sparse Symmetric Indefinite Linear Systems of Equations

CME342 Parallel Methods in Numerical Analysis. Matrix Computation: Iterative Methods II. Sparse Matrix-vector Multiplication.

BLOCK ILU PRECONDITIONED ITERATIVE METHODS FOR REDUCED LINEAR SYSTEMS

Multigrid absolute value preconditioning

Structure preserving preconditioner for the incompressible Navier-Stokes equations

CLASSICAL ITERATIVE METHODS

DELFT UNIVERSITY OF TECHNOLOGY

Multifrontal Incomplete Factorization for Indefinite and Complex Symmetric Systems

Multilevel low-rank approximation preconditioners Yousef Saad Department of Computer Science and Engineering University of Minnesota

Combination Preconditioning of saddle-point systems for positive definiteness

Chapter 7 Iterative Techniques in Matrix Algebra

CONVERGENCE BOUNDS FOR PRECONDITIONED GMRES USING ELEMENT-BY-ELEMENT ESTIMATES OF THE FIELD OF VALUES

An Efficient Low Memory Implicit DG Algorithm for Time Dependent Problems

Lecture 10 Preconditioning, Software, Parallelisation

Since the early 1800s, researchers have

Lecture 11: CMSC 878R/AMSC698R. Iterative Methods An introduction. Outline. Inverse, LU decomposition, Cholesky, SVD, etc.

Uncertainty Quantification: Does it need efficient linear algebra?

1. Fast Iterative Solvers of SLE

Transcription:

An advanced ILU preconditioner for the incompressible Navier-Stokes equations M. ur Rehman C. Vuik A. Segal Delft Institute of Applied Mathematics, TU delft The Netherlands Computational Methods with Applications, August 19-25, Harrachov 2007, Czech Republic

Outline Introduction 1 Introduction 2 3 4 5

Introduction The incompressible Navier-Stokes equation ν 2 u + u. u + p = f in Ω.u = 0 in Ω. u is the fluid velocity p is the pressure field ν > 0 is the kinematic viscosity coefficient ( 1/Re). Ω R 2 is a bounded domain with the boundary condition: u = w on Ω D, ν u n np = 0 on Ω N.

Finite element discretization Weak formulation X = (H 1 E (Ω))d, X = (H 1 0 (Ω))d, Find u X and p M M = L 2 (Ω) Z Z ν u : vdω + Ω Z (u. u).vdω Ω Ω Z q(.u)dω = 0, Ω Z p(.v)dω = f.vdω, Ω q M v X

Finite element discretization Discrete weak formulation X h = (H 1 E (Ω))d, X h = (H 1 0 (Ω))d, M h = L 2 (Ω) Find u h X h and p h M h Z Z Z Z ν u h : v h dω + (u h. u h ).v h dω p h (.v h )dω = f.v h dω, v h X h, Ω Ω Ω Ω Z q h (.u h )dω = 0 q h M h. Ω Matrix notation Au + N(u) + B T p = f Bu = 0.

Linearization Introduction Stokes problem ν 2 u + p = f.u = 0 Picard s method ν u (k+1) + (u (k). )u (k+1) + p (k+1) = f.u (k+1) = 0 Newton s method ν u k+1 + u k+1. u k + u k. u k+1 + p k+1 = f + u k. u k,.u k+1 = 0.

Linear system Introduction Matrix form after linearization [ ] [ ] F B T u = B 0 p [ ] f 0 or Ax = b F R n n, B R m n, f R n and m n Sparse linear system, Symmetric(Stokes problem), nonsymmetric indefinite otherwise. Saddle point problem having large number of zeros on the main diagonal

Direct methods To solve Ax = b, factorize A into upper U and lower L triangular matrices (LUx = b) First solve Ly = b, then Ux = y Classical iterative methods Methods based on matrix splitting, generates sequence of iterations x k+1 = M 1 (Nx k + b) = Qx k + s where A = M N Jacobi, Gauss Seidel, SOR, SSOR

Krylov subspace methods Find the approximate solution x n = x 0 + c, where c is a linear combination of basis functions of Krylov subspace K n (A, b), where K n = b, Ab, A 2 b,..., A n 1 b of dimension n. CGNR [Paige and Saunders - 1975] QMR [Freund and Nachtigal - 1991] CGS [Sonneveld - 1989] Bi-CGSTAB [van der Vorst - 1992] GMRES [Saad and Schultz - 1986] GMRESR [van der Vorst and Vuik - 1994] GCR [Eisenstat, Elman and Schultz - 1986] matrix-vector multiplications, good convergence properties, optimal and short recurrence Convergence depends strongly on eigenvalues distribution clustered around 1 or away from 0.

Preconditioner for the Navier-Stokes equations Definition A linear system Ax = b is transformed into P 1 Ax = P 1 b such that Eigenvalues of P 1 A are more clustered than A P A Pz = r cheap to compute Several approaches, we will discuss here Block triangular preconditioners Incomplete LU factorization

Preconditioners for the Navier-Stokes equations Block triangular preconditioners» F B T B 0 =» F B T P t = 0 S»»» I 0 F 0 I F 1 B T BF 1 I 0 S 0 I {z }, S = BF 1 B T (Schur complement matrix) Subsystem solve Sz 2 = r 2, Fz 1 = r 1 B T z 2 In practice F 1 and S 1 are expensive. F 1 is obtained by an approximate solve S is first approximated and then solved inexactly

Preconditioners for the Navier-Stokes equations Well-known approximations to Schur complement Pressure convection diffusion (PCD) [Kay, Login and Wathen, 2002] S A pfp 1 Q p Least squares commutator (LSC) [Elman, Howle, Shadid, Silvester and Tuminaro, 2002] S (BQ 1 B T )(BQ 1 FQ 1 B T ) 1 (BQ 1 B T ) Augmented Lagrangian approach (AL) [Benzi and Olshanskii, 2006] Convergence independent of the mesh size and mildly dependent on Reynolds number Require iterative solvers (Multigrid) for the (1,1) and (2,2) blocks Require extra operators

Preconditioners for the Navier-Stokes equations Incomplete LU preconditioners A = LD 1 U + R, (LD 1 U) i,j = a i,j for (i, j) S, where R consist of dropped entries that are absent in the index set S(i, j). [Meijerink and van der Vorst, 1977] - dropping based on position, S = {(i, j) a ij 0} (positional dropping) - dropping based on numerical size (Threshold dropping) Simple to implement, Computation is inexpensive Inaccuracies and instabilities,

Preconditioners for the Navier-Stokes equations Pivoting Prevent zero diagonal, small pivots A priori estimation of the memory required to store the matrix a difficult task A priori reordering/renumbering Improve profile and bandwidth of the matrix Minimizes dropped entries in ILU Well-known renumbering schemes Cuthill McKee renumbering (CMK) [Cuthill McKee - 1969] Sloan renumbering [Sloan - 1986] Minimum degree renumbering (MD) [Tinney and Walker - 1967] [Dutto-1993, Benzi-1997, Duff and Meurant-1989, Wille-2004, Chow and Saad - 1997]

Preconditioners for the Navier-Stokes equations ILUPACK Devloped by Matthias Bollhöfer and his team. Gives robust and stable ILU preconditioner Static reordering [RCM, AMD etc] Scaling, pivoting Inverse traingular factors are kept bounded. The above steps are perfomed recursively Krylov method is applied to solve the preconditioned system Matthias Bollhöfer, Yousef Saad. Multilevel Preconditioners Constructed From Inverse-Based ILUs, SIAM Journal on Scientific Computing, 27, 5(2005), 1627-1650

Preconditioners for the Navier-Stokes equations New priori ordering scheme Two Steps: Renumbering of grid points: Grid points are renumbered with Sloan or Cuthill McKee algorithms Reordering of unknowns p-last ordering, first all the velocity unknowns are ordered followed by pressure unknowns. Usually it produces a large profile but avoids breakdown of LU decomposition. p-last per node ordering, The velocity unknowns are ordered followed by pressure unknowns per node (Optimal profile but breakdown of ILU may occur, therefore pivoting required)

Preconditioners for the Navier-Stokes equations p-last per level reordering Levels? 1 Level 5 0.8 0.6 Level 4 0.4 0.2 0 Level 3 0.2 0.4 Level 2 0.6 0.8 Level 1 1 1 0.8 0.6 0.4 0.2 0 0.2 0.4 0.6 0.8 1 Q2 Q1 finite element subdivision

Preconditioners for the Navier-Stokes equations p-last per level reordering First we take all the velocities of level 1, then all pressures of level 1. Next we do the same for level 2, and repeat this process for all nodes. The profile is hardly enlarged Zero pivots becomes nonzero, therefore no pivoting required Choice of first level: The first level may be defined as a point, or even a line in R 2 or a surface in R 3.

Preconditioners for the Navier-Stokes equations p-last per level reordering Remark: The ILU decomposition does not breakdown if there is at least one nonzero connection between a velocity and pressure unknown. In each level, velocity unknowns must be followed by pressure unknowns.

Preconditioners for the Navier-Stokes equations Some features of SILU preconditioner 1 Fill-in based on the connectivity in the finite element grid 2 Extra-fill in 3 Lumping of positve off-diagonal entries 4 Artificial compressibility

Flow domains Channel flow The Poiseuille channel flow in a square domain ( 1, 1) 2 with a parabolic inflow boundary condition and the natural outflow condition having the analytic solution: u = 1 y 2 ; v = 0; p = 2νx Backward facing step Q2-Q1 finite element discretization [Taylor, Hood - 1973] Q2-P1 finite element discretization [Crouzeix, Raviart - 1973]

Introduction Numerical experiments Renumbering/Reordering used in direct methods The reordering methods helps in minimizing storage in band and envelope storage scheme. Bandwidth(A)=maxi {βi (A), 1 i n} Pn i=1 βi (A) Profile(A)= 16 16 channel flow with Q2-Q1 discretization 0 0 0 100 100 100 200 200 200 300 300 300 400 400 400 500 500 500 600 600 0 100 200 300 400 500 600 Profile = 52195, Bandwidth = 570 p last ordering with lexicographic numbering 600 0 100 200 300 400 500 600 Profile =31222, Bandwidth = 212 p last per level ordering with Sloan renumbering 0 100 200 300 400 500 600 Profile =47468, Bandwidth =160 p last per level ordering with Cuthill McKee renumbering

Numerical experiments Renumbering/Reordering used in direct methods Profile and bandwidth reduction in the backward facing step with Q2-Q1 discretization Grid Profile reduction Bandwidth reduction - Sloan Cuthill-McKee Sloan Cuthill-McKee 4 12 0.37 0.61 0.18 0.17 8 24 0.28 0.54 0.13 0.08 16 48 0.26 0.5 0.11 0.04 32 96 0.25 0.48 0.06 0.02

Numerical experiments Stokes Problem in a square domain with BiCGSTAB, accuracy = 10 6, Sloan renumbering Q2 Q1 Q2 P1 Grid size p-last p-last per level p-last p-last per level 16 16 36(0.11) 25(0.09) 44(0.14) 34(0.13) 32 32 90(0.92) 59(0.66) 117(1.08) 75(0.80) 64 64 255(11.9) 135(6.7) 265(14) 165(9.0) 128 128 472(96) 249(52) 597(127) 407(86)

Numerical experiments Convergence of the ILU preonconditioned Bi-CGSTAB for the Stokes Problem in a backward facing domain with an accuracy = 10 6 Grid Q2 Q1 Q2 P1 - Sloan Cuthill-McKee Sloan Cuthill-McKee - Iter. Iter. Iter. Iter. 8x24 9 15 29 97 16x48 22 32 40 288 32x96 59 65 73 1300 64x192 172 285 330 1288

Numerical experiments Effect of grid increase(left) and Reynolds number(right) on inner iterations for the Navier-Stokes backward facing step problem with accuracy = 10 2 using the p-last-level reordering 10 4 10 3 No. of accumulated inner iterations 10 3 10 2 Bi CGSTAB, Re =100 Bi CGSTAB, Re=10 GMRESR, Re = 100 GMRESR, Re = 10 Average inner iterations 10 2 32x96 Q2 Q1 32x96 Q2 P1 10 1 8x24 16x48 32x96 64x192 Grid size 10 1 50 100 150 200 250 300 350 400 Reynolds number

Numerical experiments Comparison of the preconditioners using MG solver for (1,1),(2,2) blocks of PCD and LSC preconditioner with Bi-CGSTAB and accuracy = 10 4 ( IFISS) Grid PCD SILU LSC Re=100 Iter. Mflops Iter. Mflops Iter. Mflops 8 24 40 3.7 9 0.6 24 4 16 48 36 15.3 13 3.9 19 14.9 32 96 39 70.9 21 27.5 13 44.4 64 192 61 458 55 297 13 185 64 192 grid with increasing Re Re = 200 48 259 17 241 Re = 300 50 269 19 270 Re = 400 48 259 29 412 Extra-fillin: 16 iterations, 155 flops

Numerical experiments Comparison with ILUPACK-Stokes Problem in a backward facing domain with an accuracy = 10 6, Q2-Q1 elements No. of GMRES(20) iterations 10 3 10 2 10 1 SILU(0) ILUPACK(0) SILU(1) ILUPACK(1) CPU time(s) 10 2 10 1 10 0 10 1 SILU(0) ILUPACK(0) SILU(1) ILUPACK(1) 10 0 10 2 8x24 16x48 32x96 64x192 8x24 16x48 32x96 64x192 Grid size Grid size 64x192 grid ( 8 iterations, const. time(s) = 90, solver time(s)=7, gain factor =6)

Numerical experiments Comparison with ILUPACK-Stokes Problem in a backward facing domain with an accuracy = 10 6, Q2-Q1 elements No. of GMRES(20) iterations 10 3 10 2 10 1 SILU(0) ILUPACK(0) SILU(1) ILUPACK(1) CPU time(s) 10 2 10 1 10 0 10 1 SILU(0) ILUPACK(0) SILU(1) ILUPACK(1) 10 0 10 2 8x24 16x48 32x96 64x192 8x24 16x48 32x96 64x192 Grid size Grid size 64x192 grid ( 8 iterations, const. time(s) = 90, solver time(s)=7, gain factor =6)

Introduction A new scheme for the renumbering of grid points and reordering of unknowns is introduced that prevents the break down of the ILU preconditioner and leads to faster convergence of Krylov subspace methods. Improves profile and bandwidth of a matrix Sloan with p-last per level reordering leads to best results for the Taylor Hood and Crouzeix Raviart elements. Since the block preconditioners are independent of the grid size and weakly dependent of the Reynolds number there performance can be better than the S ILU preconditioners for large grid sizes and large Reynolds numbers Varying stretched grids Testing the preconditioner for the problems with high Reynolds number (SUPG implementation) Use of SILU preconditioner in 3D

Thank you for your attention!