The Deflation Accelerated Schwarz Method for CFD

Similar documents
On deflation and singular symmetric positive semi-definite matrices

On the choice of abstract projection vectors for second level preconditioners

Master Thesis Literature Study Presentation

C. Vuik 1 R. Nabben 2 J.M. Tang 1

DELFT UNIVERSITY OF TECHNOLOGY

Robust Preconditioned Conjugate Gradient for the GPU and Parallel Implementations

The Conjugate Gradient Method

Various ways to use a second level preconditioner

THEORETICAL AND NUMERICAL COMPARISON OF PROJECTION METHODS DERIVED FROM DEFLATION, DOMAIN DECOMPOSITION AND MULTIGRID METHODS

Preface to the Second Edition. Preface to the First Edition

The solution of the discretized incompressible Navier-Stokes equations with iterative methods

A Domain Decomposition Based Jacobi-Davidson Algorithm for Quantum Dot Simulation

A Robust Preconditioned Iterative Method for the Navier-Stokes Equations with High Reynolds Numbers

Solving Large Nonlinear Sparse Systems

A parallel block-preconditioned GCR method for incompressible flow problems

APPLIED NUMERICAL LINEAR ALGEBRA

Domain decomposition on different levels of the Jacobi-Davidson method

Arnoldi Methods in SLEPc

2.29 Numerical Fluid Mechanics Spring 2015 Lecture 9

Multilevel Preconditioning of Graph-Laplacians: Polynomial Approximation of the Pivot Blocks Inverses

Chapter 7 Iterative Techniques in Matrix Algebra

Preconditioners for the incompressible Navier Stokes equations

A PRECONDITIONER FOR THE HELMHOLTZ EQUATION WITH PERFECTLY MATCHED LAYER

Contents. Preface... xi. Introduction...

Iterative Helmholtz Solvers

Numerical Methods in Matrix Computations

Convergence Behavior of a Two-Level Optimized Schwarz Preconditioner

Least-Squares Spectral Collocation with the Overlapping Schwarz Method for the Incompressible Navier Stokes Equations

arxiv: v2 [math.na] 1 Feb 2013

OUTLINE ffl CFD: elliptic pde's! Ax = b ffl Basic iterative methods ffl Krylov subspace methods ffl Preconditioning techniques: Iterative methods ILU

Overlapping Schwarz preconditioners for Fekete spectral elements

A MULTIGRID ALGORITHM FOR. Richard E. Ewing and Jian Shen. Institute for Scientic Computation. Texas A&M University. College Station, Texas SUMMARY

Linear Solvers. Andrew Hazel

DELFT UNIVERSITY OF TECHNOLOGY

Parallel Algorithms for Solution of Large Sparse Linear Systems with Applications

Solving Ax = b, an overview. Program

33 RASHO: A Restricted Additive Schwarz Preconditioner with Harmonic Overlap

Numerical behavior of inexact linear solvers

AMS526: Numerical Analysis I (Numerical Linear Algebra for Computational and Data Sciences)

20. A Dual-Primal FETI Method for solving Stokes/Navier-Stokes Equations

M.A. Botchev. September 5, 2014

Parallel sparse linear solvers and applications in CFD

Iterative methods for Linear System of Equations. Joint Advanced Student School (JASS-2009)

1 Number Systems and Errors 1

Preconditioned inverse iteration and shift-invert Arnoldi method

DELFT UNIVERSITY OF TECHNOLOGY

Efficient Augmented Lagrangian-type Preconditioning for the Oseen Problem using Grad-Div Stabilization

Fast solvers for steady incompressible flow

Indefinite and physics-based preconditioning

Structure preserving preconditioner for the incompressible Navier-Stokes equations

Domain decomposition for the Jacobi-Davidson method: practical strategies

7.4 The Saddle Point Stokes Problem

ITERATIVE METHODS FOR SPARSE LINEAR SYSTEMS

SOLVING MESH EIGENPROBLEMS WITH MULTIGRID EFFICIENCY

AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 23: GMRES and Other Krylov Subspace Methods; Preconditioning

Introduction to Applied Linear Algebra with MATLAB

A finite-volume algorithm for all speed flows

NUMERICAL METHODS FOR ENGINEERING APPLICATION

Stabilization and Acceleration of Algebraic Multigrid Method

Algebraic Multigrid Preconditioners for Computing Stationary Distributions of Markov Processes

PARTITION OF UNITY FOR THE STOKES PROBLEM ON NONMATCHING GRIDS

Algebraic Multigrid as Solvers and as Preconditioner

WHEN studying distributed simulations of power systems,

Semi-implicit Krylov Deferred Correction Methods for Ordinary Differential Equations

Poisson Equation in 2D

CONVERGENCE BOUNDS FOR PRECONDITIONED GMRES USING ELEMENT-BY-ELEMENT ESTIMATES OF THE FIELD OF VALUES

2 CAI, KEYES AND MARCINKOWSKI proportional to the relative nonlinearity of the function; i.e., as the relative nonlinearity increases the domain of co

1. Fast Solvers and Schwarz Preconditioners for Spectral Nédélec Elements for a Model Problem in H(curl)

ADDITIVE SCHWARZ FOR SCHUR COMPLEMENT 305 the parallel implementation of both preconditioners on distributed memory platforms, and compare their perfo

A Parallel Scalable PETSc-Based Jacobi-Davidson Polynomial Eigensolver with Application in Quantum Dot Simulation

Lab 1: Iterative Methods for Solving Linear Systems

DELFT UNIVERSITY OF TECHNOLOGY

An advanced ILU preconditioner for the incompressible Navier-Stokes equations

J. FRANK AND C. VUIK the symmetric positive denite preconditioner K is chosen such that the spectrum of K,1 A is more clustered or has a smaller condi

Domain Decomposition Preconditioners for Spectral Nédélec Elements in Two and Three Dimensions

Short title: Total FETI. Corresponding author: Zdenek Dostal, VŠB-Technical University of Ostrava, 17 listopadu 15, CZ Ostrava, Czech Republic

On solving linear systems arising from Shishkin mesh discretizations

Generalized AOR Method for Solving System of Linear Equations. Davod Khojasteh Salkuyeh. Department of Mathematics, University of Mohaghegh Ardabili,

NUMERICAL COMPUTATION IN SCIENCE AND ENGINEERING

Iterative Solvers in the Finite Element Solution of Transient Heat Conduction

AMS526: Numerical Analysis I (Numerical Linear Algebra)

Incomplete LU Preconditioning and Error Compensation Strategies for Sparse Matrices

AMS Mathematics Subject Classification : 65F10,65F50. Key words and phrases: ILUS factorization, preconditioning, Schur complement, 1.

Multigrid Methods and their application in CFD

Acceleration of a Domain Decomposition Method for Advection-Diffusion Problems

AMS526: Numerical Analysis I (Numerical Linear Algebra)

Multigrid and Domain Decomposition Methods for Electrostatics Problems

Construction of a New Domain Decomposition Method for the Stokes Equations

Lecture 3: Inexact inverse iteration with preconditioning

Newton-Multigrid Least-Squares FEM for S-V-P Formulation of the Navier-Stokes Equations

Inexact inverse iteration with preconditioning

Preface to Second Edition... vii. Preface to First Edition...

A SPARSE APPROXIMATE INVERSE PRECONDITIONER FOR NONSYMMETRIC LINEAR SYSTEMS

Contents. Preface for the Instructor. Preface for the Student. xvii. Acknowledgments. 1 Vector Spaces 1 1.A R n and C n 2

Preconditioned Locally Minimal Residual Method for Computing Interior Eigenpairs of Symmetric Operators

A THEORETICAL INTRODUCTION TO NUMERICAL ANALYSIS

Some Geometric and Algebraic Aspects of Domain Decomposition Methods

Comparative Analysis of Mesh Generators and MIC(0) Preconditioning of FEM Elasticity Systems

A decade of fast and robust Helmholtz solvers

Scientific Computing with Case Studies SIAM Press, Lecture Notes for Unit VII Sparse Matrix

Transcription:

The Deflation Accelerated Schwarz Method for CFD J. Verkaik 1, C. Vuik 2,, B.D. Paarhuis 1, and A. Twerda 1 1 TNO Science and Industry, Stieltjesweg 1, P.O. Box 155, 2600 AD Delft, The Netherlands 2 Delft University of Technology, Faculty of Electrical Engineering, Mathematics and Computer Science, Mekelweg 4, 2628 CK Delft, The Netherlands Phone: +31 15 27 85530, Fax: +31 15 27 87209 c.vuik@ewi.tudelft.nl. Abstract. Accurate simulation of glass melting furnaces requires the solution of very large linear algebraic systems of equations. To solve these equations efficiently a Schwarz domain decomposition (multi-block) method can be used. However, it can be observed that the convergence of the Schwarz method deteriorates when a large number of subdomains is used. This is due to small eigenvalues arising from the domain decomposition which slow down the convergence. Recently, a deflation approach was proposed to solve this problem using constant approximate eigenvectors. This paper generalizes this view to piecewise linear vectors and results for two CFD problems are presented. It can be observed that the number of iterations and wall clock time decrease considerably. The reason for this is that the norm of the initial residual is much smaller and the rate of convergence is higher. Keywords: efficiency; computational fluid dynamics; domain decomposition; deflation; Krylov subspace acceleration. 1 Introduction Simulation by Computational Fluid Dynamics (CFD) is important for the design, optimization and trouble shooting of glass melting furnaces. It gives engineers in the glass industry great insight into the occurring transport phenomena. At TNO Science and Industry, a CFD simulation package called GTM-X is being developed for simulating gas- and oil-fired glass-furnaces. This is a complete model for simulating glass furnaces, describing the glass melt and combustion space simultaneously, and predicting the effects on melting performance and glass quality. A domain decomposition (DD) approach is applied within GTM-X, for which the spatial domain is decomposed into subdomains (blocks). A DD (or multiblock) approach has several important advantages for simulating glass-melting Corresponding author. V.S. Sunderam et al. (Eds.): ICCS 2005, LNCS 3514, pp. 868 875, 2005. c Springer-Verlag Berlin Heidelberg 2005

The Deflation Accelerated Schwarz Method for CFD 869 furnaces. Since a glass-furnace geometry is often complex, this enables us to easily describe the geometry using blocks (subdomains). Furthermore, parallel computing can be done and variables for each block can be simultaneously computed on different processors. This is an advantage, because simulation of glass-melting furnaces often results in very large computation times. A disadvantage of multi-block solvers is that the convergence behavior deteriorates significantly when a large number of blocks is used. This is especially the case for solving elliptic equations. In [11], a reason for this problem is given which relates the loss of convergence to the presence of small eigenvalues arising from the domain decomposition. These eigenvalues can be eliminated by applying a deflation operator using constant vectors for approximating the corresponding eigenvectors. The authors of [11] present convergence rates for solving Poisson s equation which are independent on the number of blocks. In this paper we extend and apply this idea to solve general linear systems of equations. The deflation method is implemented in GTM-X and both GCR and CG Krylov subspace acceleration is used to solve the resulting linear system of equations. Both constant (CD) and combined constant and linear deflation (CLD) vectors are considered. The research in this paper focusses on solving the singular pressure-correction system in the SIMPLE method, which is used in GTM-X to solve the Navier-Stokes equations. However, the solver can be used for general linear systems and is therefore applicable to a wide range of problems. 2 Description of the Mathematical Model Besides the incompressible Navier-Stokes equations and the energy equation, GTM-X has dedicated models for turbulence, combustion, glass melting and chemical vaporization. The user can apply different models in different domains. The equations arising from the physical models are discretised with the finite volume method. Several schemes can be used for discretization: upwind, central, TVD higher-order schemes, and blending of schemes can be done using deferred correction. The grid is boundary fitting and colocated, meaning that all variables are located in the volume cell centers. The discretised nonlinear Navier-Stokes equations are solved by the SIMPLE method ([6, 1, 12]), using pressure-weighted (Rhie & Chow) interpolation to exclude checkerboard pressure modes. The SIMPLE method is an iterative method in which the system in each iteration (outer iteration) is splitted up into linear systems for the pseudo-velocities and pressure-correction. So-called SIMPLE Stabilization Iterations (SSI) are applied, which can be seen as additional outer iterations for solving the pressure system without solving the velocities. Linear systems of equations are solved with a domain multi-block approach (inner iteration). In GTM-X, an additive Schwarz DD method with minimal overlap is applied, in combination with inaccurate subdomain solutions. Stone s SIP-solver [7], which is based on an incomplete LU decomposition, is used for obtaining the subdomain

870 J. Verkaik et al. solutions. Local grid refinement at block level can be done, as well as parallel computing using MPI libraries. Furthermore, solutions on coarse grids can be used as starting solutions for finer grids by using interpolation. The reader is referred to [10] for more details on the code. 3 Deflation and Domain Decomposition Consider a decomposition of the entire computational domain consisting of n grid points into L nonoverlapping subdomains. Discretization of the partial differential equations to be solved and grouping together the unknowns per subdomain, results in the block system A 11 A 1L....... A L1... A LL y 1.. y L = b 1.. b L, which we will denote simply as Ay = b. The diagonal blocks of A are the coefficients for the interior of the subdomains, the off-diagonal blocks represent coupling across subdomain boundaries. This system is solved with the deflation method combined with a Krylov subspace method. Let the matrix Z be n m, where m n. Furthermore, let Z be of rank m, i.e. Z has linear independent columns. The columns of Z, the so-called deflation vectors, span the deflation subspace, i.e., the space that approximates the eigenspace belonging to the smallest eigenvalues and which is to be projected out of the residual. To do this we define the projectors P = I AZE 1 Z T, Q = I ZE 1 Z T A, where the m m matrix E is defined as E = Z T AZ, andi is the identity matrix of appropriate size. To solve Ay = b with the deflation technique, we write y =(I Q)y + Qy, and since (I Q)y = Z(Z T AZ) 1 Z T Ay = ZE 1 Z T b, (1) can be computed immediately, we only need to compute Qy. In light of the identity AQ = PA we can solve the system PAỹ = P b (2) for ỹ, premultiply this by Q and add it to (1). Since PAZ = 0, this system is obviously singular, and therefore no unique solution exists. However, it can be shown that Qy is unique [9]. In [2] and [8 p. 147], it is noted that a positive semidefinite system can be solved as long as it is consistent, i.e., as long as its right-hand side does not have components in the null space (column space of Z) of PA. This assumption holds for (2), because

The Deflation Accelerated Schwarz Method for CFD 871 Subdomain 1 Subdomain 2 Fig. 1. Linear deflation vectors for the 1-D example of two subdomains with 4 grid cells per subdomain the same projection is applied to both sides of Ay = b. In [8 p. 147], it is shown that a Krylov subspace method could be used to solve (2), because the null-space never enters the iteration, and therefore the corresponding zero-eigenvalues do not influence the convergence. The reader is referred to [4] for more details on the spectrum of PA. When system (2) is solved, this involves the computation of E 1, for example statements like f = E 1 g have to be evaluated. In [4 Section 3], it is shown that deflation works correctly as long as E 1 is computed with a high accuracy. Since we never determine E 1 explicitly, this means that Ef = g is solved by a direct method, for example by an LU factorization. Deflation can also be combined with preconditioning. Suppose K is a suitable preconditioner of A, then solving Ay = b can be replaced by: solve ỹ from the left-preconditioned system K 1 PAỹ = K 1 P b, andformqỹ, or solve ỹ from the right-preconditioned system PAK 1 ỹ = P b, andformqk 1 ỹ. Both systems can be solved by a one s favorite Krylov subspace solver, for example by the GCR method. In our implementation we take K the block Gauss-Jacobi preconditioner (block diagonal of A), corresponding to the additive Schwarz iteration. This preconditioner lends itself well for parallel computing. We will refer to the deflation method combined with Krylov subspace acceleration in a domain decomposition context as the deflated Krylov-Schwarz method. 4 Approximating Eigenvectors by Constant and Linear Functions Many authors [3, 5, 11] consider constant deflation vectors for approximating the eigenvectors corresponding to the small eigenvalues which slow down the convergence. For each subdomain, exactly one deflation vector is defined having elements that are constant in the grid points on the corresponding subdomain, and zero elements in the grid points on the other subdomains. We will refer to this as CD deflation. Generalization to the 2-D and 3-D case is straightforward: for a subdomain, we simply take the elements of the deflation vector to be constant in the grid nodes.

872 J. Verkaik et al. This idea is generalized to approximating the eigenvectors using piecewise linear vectors in the grid directions 1. By this we augment the space of constant vectors with linear vectors. Figure 1 shows the linear deflation vectors. A linear deflation vector is defined as a polynomial of degree one on each subdomain and zero on the other subdomains. If the deflation vectors consist of both the constant and the linear vectors, we will denote it by Constant Linear Deflation (CLD). For the case of Figure 1, we have two deflation vectors per subdomain: one constant and one linear. Generalization to 2-D and 3-D is not so straightforward anymore, compared to CD deflation. However, it appears that for the 2-D case three vectors are connected to each subdomain: one constant, and one linear vector in each of the two grid directions. For the 3-D case, it appears that we need four deflation vectors: one constant and one linear vector in each of the three grid directions. When the coefficient matrix A is singular then the deflation vectors has to be chosen such that Z T AZ is non-singular, since break-down can occur with a direct solution method. Two options seem satisfactory to overcome this problem. The first one is to remove a constant deflation vector for one subdomain; the second is to adjust one entry in a constant deflation vector for one subdomain. 5 Numerical Experiments The deflated GCR method is compared to the SIP method for solving the singular pressure-correction system arising in the SIMPLE method applied to the stationary incompressible Navier-Stokes equations. Two test cases are considered: a buoyancy-driven cavity flow for a high Rayleigh number, and a glass tank. In both test cases the stationary energy equation is solved besides the Navier-Stokes equations. Furthermore, the flow is assumed to be laminar and buoyant. One constant deflation vector is removed in order to overcome the singularity. Furthermore, one SIP iteration is used for obtaining the subdomain solution and the GCR algorithm is truncated for one search direction. In this paper we will restrict ourselves to the results for the inner iterations. The first test case is a 2-D buoyancy-driven cavity flow in a unit square consisting of a 60 60 uniform grid and a decomposition of 4 4 subdomains, see Figure 2. The Rayleigh number is chosen to be large, i.e. Ra = 1.0 10 6, meaning that the flow is highly buoyant but still laminar. Figure 3 shows the residuals for solving the pressure-correction system. Clearly, the CLD deflation performs best, followed by CD deflation and the GCR without deflation. The SIP method performs very disappointing for this case. Note that a large jump in the initial residual can be observed for CLD deflation. 1 In this paper we let deflation vectors in the coordinate directions out of consideration. However, numerical experiments show that choosing linear vectors in grid- or coordinate directions can result in different convergence behavior.

The Deflation Accelerated Schwarz Method for CFD 873 Fig. 2. Velocity field for the buoyancy-driven cavity flow In the second test case, a glass tank is considered having dimensions 7 [m] 3 [m] 1 [m], see Figure 4. Glass fractions are injected (left of figure) from above and melted glass with desired properties leaves the tank at the outlet (right of figure). For this case the grid is uniform consisting of 10,500 grid cells, and the domain is decomposed into 18 subdomains. Figure 5 shows the results for the inner iteration residuals. Clearly, CLD deflation performs best concerning convergence behavior. 10 1 10 0 10 1 10 2 residual b Ay (k) 2 / b 2 10 3 10 4 10 5 10 6 10 7 SIP(1) 10 8 GCR CDGCR CLDGCR 10 9 0 5 10 15 20 25 30 iteration number Fig. 3. The inner iteration residuals for the SIP and deflated GCR method considering the buoyancy-driven test case (GCR: GCR without deflation; CDGCR: GCR with CD deflation; CLDGCR: GCR with CLD deflation)

874 J. Verkaik et al. Fig. 4. Velocity field for the glass tank test case 10 1 10 0 10 1 residual b Ay (k) 2 / b 2 10 2 10 3 10 4 10 5 10 6 10 7 SIP(1) GCR CDGCR CLDGCR 10 8 0 5 10 15 20 25 30 iteration number Fig. 5. The residuals of the inner iterations for the SIP and deflated GCR method considering the glas tank test case (GCR: GCR without deflation; CDGCR: GCR with CD deflation; CLDGCR: GCR with CLD deflation) 6 Conclusions From the experiments presented in this paper, we conclude that the deflation accelerated Schwarz method is a very efficient technique to solve linear systems of equations arising from domain decomposition. It seems that the combination of constant and linear deflation vectors is most efficient. A large jump in the norm of the initial residual can be observed as well as a higher convergence rate. Moreover, it should be noted that the deflation method can be implemented in existing software with relatively low effort.

The Deflation Accelerated Schwarz Method for CFD 875 References 1. J.H. Ferziger and M. Perić. Computational Methods for Fluid Dynamics. Springer, Heidelberg, second edition, 1999. 2. E.F. Kaasschieter. Preconditioned conjugate gradients for solving singular systems. J. Comp. Applied Math., 41:265 275, 1988. 3. L. Mansfield. On the conjugate gradient solution of the Schur complement system obtained from domain decomposition. SIAM J. Numer. Anal., 7(6):1612 1620, 1990. 4. R. Nabben and C. Vuik. A comparison of deflation and coarse grid correction applied to porous media flow. Report 03-10, Delft University of Technology, Department of Applied Mathematical Analysis, Delft, 2003. 5. R.A. Nicolaides. Deflation of conjugate gradients with applications to boundary value problems. SIAM J. Numer. Anal., 24(2):355 365, 1987. 6. S.V. Patankar. Numerical Heat Transfer and Fluid Flow. McGraw-Hill, New York, 1980. 7. H.L. Stone. Iterative solution of implicit approximations of multi-dimensional partial differential equations. SIAM J. Numer. Anal., 5:530 558, 1968. 8. H.A. van der Vorst. Iterative Krylov Methods for Large Linear systems. Cambridge University Press, Cambridge, 2003. 9. F. Vermolen, C. Vuik, and A. Segal. Deflation in preconditioned conjugate gradient methods for finite element problems. In M. K ri zek, P. Neittaanmäki, R. Glowinski, and S. Korotov, editors, Conjugate Gradient and Finite Element Methods, pages 103 129. Springer, Berlin, 2004. 10. R.L. Verweij. Parallel Computing for furnace simulations using domain decomposition. PhD thesis, Delft University of Technology, Delft. 11. C. Vuik and J. Frank. Coarse grid acceleration of a parallel block preconditioner. Future Generation Computer Systems, 17:933 940, 2001. 12. P. Wesseling. Principles of Computational Fluid Dynamics. Springer, Heidelberg, 2001.