EQUATIONS WITH LOW VISCOSITY HOWARD C. ELMAN UMIACS-TR November 1996

Similar documents
NAVIER-STOKES OPERATORS HOWARD C. ELMAN UMIACS-TR November Revised July 1996

UNIVERSITY OF MARYLAND PRECONDITIONERS FOR THE DISCRETE STEADY-STATE NAVIER-STOKES EQUATIONS CS-TR #4164 / UMIACS TR # JULY 2000

A Review of Preconditioning Techniques for Steady Incompressible Flow

A Robust Preconditioned Iterative Method for the Navier-Stokes Equations with High Reynolds Numbers

Fast solvers for steady incompressible flow

Solving Large Nonlinear Sparse Systems

A MULTIGRID ALGORITHM FOR. Richard E. Ewing and Jian Shen. Institute for Scientic Computation. Texas A&M University. College Station, Texas SUMMARY

Efficient Solvers for the Navier Stokes Equations in Rotation Form

DELFT UNIVERSITY OF TECHNOLOGY

Fast Iterative Solution of Saddle Point Problems

UNIVERSITY OF MARYLAND NAVIER-STOKES EQUATIONS CS-TR #4073 / UMIACS TR #99-66 OCTOBER 1999

Numerische Mathematik

Efficient Augmented Lagrangian-type Preconditioning for the Oseen Problem using Grad-Div Stabilization

Mathematics and Computer Science

Iterative Methods for Problems in Computational Fluid. Howard C. Elman. University of Maryland. David J. Silvester. University of Manchester

20. A Dual-Primal FETI Method for solving Stokes/Navier-Stokes Equations

1. Introduction. Consider the Navier Stokes equations ηu t ν 2 u + (u grad) u + grad p = f div u = 0 (1.1)

Preconditioned GMRES Revisited

Structure preserving preconditioner for the incompressible Navier-Stokes equations

7.4 The Saddle Point Stokes Problem

MODIFIED STREAMLINE DIFFUSION SCHEMES FOR CONVECTION-DIFFUSION PROBLEMS. YIN-TZER SHIH AND HOWARD C. ELMAN y

Linear Solvers. Andrew Hazel

CONVERGENCE OF GAUGE METHOD FOR INCOMPRESSIBLE FLOW CHENG WANG AND JIAN-GUO LIU

A Least-Squares Finite Element Approximation for the Compressible Stokes Equations

Domain decomposition on different levels of the Jacobi-Davidson method

UNIVERSITY OF MARYLAND A MULTIGRID METHOD ENHANCED BY KRYLOV SUBSPACE ITERATION FOR DISCRETE HELMHOLTZ EQUATIONS CS-TR #4029 / UMIACS TR #99-36

ON THE GENERALIZED DETERIORATED POSITIVE SEMI-DEFINITE AND SKEW-HERMITIAN SPLITTING PRECONDITIONER *

Finding Rightmost Eigenvalues of Large, Sparse, Nonsymmetric Parameterized Eigenvalue Problems


The Best Circulant Preconditioners for Hermitian Toeplitz Systems II: The Multiple-Zero Case Raymond H. Chan Michael K. Ng y Andy M. Yip z Abstract In

Preface to the Second Edition. Preface to the First Edition

PALADINS: Scalable Time-Adaptive Algebraic Splitting and Preconditioners for the Navier-Stokes Equations

A Finite Element Method for an Ill-Posed Problem. Martin-Luther-Universitat, Fachbereich Mathematik/Informatik,Postfach 8, D Halle, Abstract

The amount of work to construct each new guess from the previous one should be a small multiple of the number of nonzeros in A.

Solving Symmetric Indefinite Systems with Symmetric Positive Definite Preconditioners

Iterative Methods for Solving A x = b

The solution of the discretized incompressible Navier-Stokes equations with iterative methods

XIAO-CHUAN CAI AND MAKSYMILIAN DRYJA. strongly elliptic equations discretized by the nite element methods.

1e N

Numerical behavior of inexact linear solvers

A Review of Solution Techniques for Unsteady Incompressible Flow

Preconditioners for the incompressible Navier Stokes equations

Lecture 18 Classical Iterative Methods

Multigrid absolute value preconditioning

Institute for Advanced Computer Studies. Department of Computer Science. Iterative methods for solving Ax = b. GMRES/FOM versus QMR/BiCG

for Finite Element Simulation of Incompressible Flow Arnd Meyer Department of Mathematics, Technical University of Chemnitz,

PRECONDITIONING TECHNIQUES FOR NEWTON S METHOD FOR THE INCOMPRESSIBLE NAVIER STOKES EQUATIONS

A simple iterative linear solver for the 3D incompressible Navier-Stokes equations discretized by the finite element method

MULTIGRID PRECONDITIONING FOR THE BIHARMONIC DIRICHLET PROBLEM M. R. HANISCH

ETNA Kent State University

c 2006 Society for Industrial and Applied Mathematics

Luca F. Pavarino. Dipartimento di Matematica Pavia, Italy. Abstract

Fast Solvers for Unsteady Incompressible Flow

Numerische Mathematik

M.A. Botchev. September 5, 2014

Mathematics Research Report No. MRR 003{96, HIGH RESOLUTION POTENTIAL FLOW METHODS IN OIL EXPLORATION Stephen Roberts 1 and Stephan Matthai 2 3rd Febr

DELFT UNIVERSITY OF TECHNOLOGY

Preconditioning for Nonsymmetry and Time-dependence

Lecture Note III: Least-Squares Method

Eect of boundary vorticity discretization on explicit stream-function vorticity calculations

Linear Regression and Its Applications

On the accuracy of saddle point solvers

The Conjugate Gradient Method

2.29 Numerical Fluid Mechanics Spring 2015 Lecture 9

Application of turbulence. models to high-lift airfoils. By Georgi Kalitzin

Linear Algebra (part 1) : Vector Spaces (by Evan Dummit, 2017, v. 1.07) 1.1 The Formal Denition of a Vector Space

AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 23: GMRES and Other Krylov Subspace Methods; Preconditioning

Multigrid Approaches to Non-linear Diffusion Problems on Unstructured Meshes

ON ORTHOGONAL REDUCTION TO HESSENBERG FORM WITH SMALL BANDWIDTH

Termination criteria for inexact fixed point methods

INTERGRID OPERATORS FOR THE CELL CENTERED FINITE DIFFERENCE MULTIGRID ALGORITHM ON RECTANGULAR GRIDS. 1. Introduction

Preconditioning a mixed discontinuous nite element method for radiation diusion

Solving Sparse Linear Systems: Iterative methods

Solving Sparse Linear Systems: Iterative methods

Partial Differential Equations

ON A GENERAL CLASS OF PRECONDITIONERS FOR NONSYMMETRIC GENERALIZED SADDLE POINT PROBLEMS

The Deflation Accelerated Schwarz Method for CFD

Journal of Computational and Applied Mathematics. Multigrid method for solving convection-diffusion problems with dominant convection

Fraction-free Row Reduction of Matrices of Skew Polynomials

Key words. Convection-Diffusion, Domain Decomposition, Preconditioning, Spectral Element Method

The antitriangular factorisation of saddle point matrices

OUTLINE 1. Introduction 1.1 Notation 1.2 Special matrices 2. Gaussian Elimination 2.1 Vector and matrix norms 2.2 Finite precision arithmetic 2.3 Fact

Finite element approximation on quadrilateral meshes

CONVERGENCE BOUNDS FOR PRECONDITIONED GMRES USING ELEMENT-BY-ELEMENT ESTIMATES OF THE FIELD OF VALUES

Numerical Methods in Matrix Computations

Chebyshev semi-iteration in Preconditioning

1. Introduction Let f(x), x 2 R d, be a real function of d variables, and let the values f(x i ), i = 1; 2; : : : ; n, be given, where the points x i,

ANALYSIS OF AUGMENTED LAGRANGIAN-BASED PRECONDITIONERS FOR THE STEADY INCOMPRESSIBLE NAVIER-STOKES EQUATIONS

Unsteady Incompressible Flow Simulation Using Galerkin Finite Elements with Spatial/Temporal Adaptation

An advanced ILU preconditioner for the incompressible Navier-Stokes equations

n i,j+1/2 q i,j * qi+1,j * S i+1/2,j

Largest Bratu solution, lambda=4

ADDITIVE SCHWARZ FOR SCHUR COMPLEMENT 305 the parallel implementation of both preconditioners on distributed memory platforms, and compare their perfo

A JACOBI-DAVIDSON ITERATION METHOD FOR LINEAR EIGENVALUE PROBLEMS. GERARD L.G. SLEIJPEN y AND HENK A. VAN DER VORST y

CLASSICAL ITERATIVE METHODS

An Efficient Low Memory Implicit DG Algorithm for Time Dependent Problems

Iterative Methods for Linear Systems

Bindel, Fall 2016 Matrix Computations (CS 6210) Notes for

A NOTE ON THE LADYŽENSKAJA-BABUŠKA-BREZZI CONDITION

9.1 Preconditioned Krylov Subspace Methods

Transcription:

PRECONDITIONING FOR THE STEADY-STATE NAVIER-STOKES EQUATIONS WITH LOW VISCOSITY HOWARD C. ELMAN Report CS-TR-372 UMIACS-TR-96-82 November 996 Abstract. We introduce a preconditioner for the linearized Navier-Stokes equations that is eective when either the discretization mesh size or the viscosity approaches zero. For constant coecient problems with periodic boundary conditions, we show that the preconditioning yields a system with a single eigenvalue equal to one, so that performance is independent of both viscosity and mesh size. For other boundary conditions, we demonstrate empirically that convergence depends only mildly on these parameters and we give a partial analysis of this phenomenon. We also show that some expensive subsidiary computations required by the new method can be replaced by inexpensive approximate versions of these tasks based on iteration, with virtually no degradation of performance. Department of Computer Science and Institute for Advanced Computer Studies, University of Maryland, College Park, MD 2742, e-mail: elman@cs.umd.edu. This work was supported by U. S. National Science Foundation under grant DMS-942333.

PRECONDITIONING FOR THE STEADY-STATE NAVIER-STOKES EQUATIONS WITH LOW VISCOSITY HOWARD C. ELMAN Abstract. We introduce a preconditioner for the linearized Navier-Stokes equations that is eective when either the discretization mesh size or the viscosity approaches zero. For constant coecient problems with periodic boundary conditions, we show that the preconditioning yields a system with a single eigenvalue equal to one, so that performance is independent of both viscosity and mesh size. For other boundary conditions, we demonstrate empirically that convergence depends only mildly on these parameters and we give a partial analysis of this phenomenon. We also show that some expensive subsidiary computations required by the new method can be replaced by inexpensive approximate versions of these tasks based on iteration, with virtually no degradation of performance. Key words. Navier-Stokes, Oseen, preconditioning, iterative methods. AMS(MOS) subject classications. Primary: 65F, 65N2 secondary 5A6. Introduction. This paper introduces a methodology for preconditioning the discrete steady-state incompressible Navier-Stokes equations () u + (u grad) u + grad p = f div u = in subject to suitable boundary conditions on @, where is an open bounded domain in IR 2 or IR 3. The vector eld u represents the velocity in, p represents pressure, and the scalar is the viscosity, which is inversely proportional to the Reynolds number. We will develop the preconditioners for the linearized version of () known as the Oseen equations, which can be written as (2) u + (w grad) u + grad p = f div u = where w is given such that div w =. These equations arise from a nonlinear iteration essentially of the form u (m) +(u (m ) grad) u (m) +grad p (m) = f div u (m) =, see [7]. Discretization of (2) yields a linear system of equations (3) F B B T u p = f where u and p now represent discrete versions of velocity and pressure, respectively. Here F = A + N where A consists of a set of uncoupled discrete Laplace operators, corresponding to diusion, and N is a discrete convection operator. We are interested in convergence behavior of iterative methods applied to (3) as either or the discretization mesh width h tend to zero. For small it is necessary for the discretization to be ne enough to resolve features such as boundary layers [3] typically when is decreased, h will also be reduced proportionally. We are concerned only with Department of Computer Science and Institute for Advanced Computer Studies, University of Maryland, College Park, MD 2742, e-mail: elman@cs.umd.edu. This work was supported by U. S. National Science Foundation under grant DMS-942333.

values of for which stable steady-state solutions exist for example, it is shown in [2] that values of = on the order of to several thousand are feasible. Let A denote the coecient matrix of (3). We will consider preconditioners of the form (4) Q = F BT X : It is easily conrmed that (5) AQ = I BF BF B T X so that the eigenvalues of AQ are fg [ (BF B T X ): Let M denote the diagonal of the pressure mass matrix associated with the discretization for nite dierences, a natural analogue is M = h d I for a uniform grid of width h in d dimensions. It was shown in [5] that for the choice X = M, the eigenvalues of BF B T X are bounded independent of h. (See [2, 22, 25, 29] for analogous results for the Stokes equations.) Therefore, use of this preconditioner in conjunction with Krylov subspace methods such as GMRES [24] leads to asymptotic convergence rates that are independent of the mesh size. Similar asymptotic bounds were also obtained in [] for a dierent class of preconditioners based on the symmetric part of A, where the preconditioning entails solution of the Stokes equations. The convergence properties of these approaches depend on the viscosity, and in general convergence rates deteriorate as decreases. For example, the results in [5] yield eigenvalues that are contained in a box in the complex plane of the form [c 2 c 2 ] i [c 3 c 4 ] where fc j g are independent of h and, and in experimental results iteration counts increase roughly like =. (See also x4.) Our concern here is to develop alternative choices for X for which the sensitivity to is less pronounced. Our starting point is an observation derived from [6]. Let G and K be two matrices of dimensions n p n u with n u n p and such that both are of full rank n p. The matrix K T (GK T ) G can then be viewed as an operator from range(k T ) to itself, and it is trivial to see that this is in fact the identity operator. To apply this to (3), rst assume for simplicity that B is of full rank (this assumption will be eliminated below) and let G = BF and K = B. Our observation is then or, equivalently, B T (BF B T ) BF = I on range(b T ) (6) B T (BF B T ) B = F on range(f B T ): Suppose for the moment that (7) range(b T ) range(f B T ): 2

Then the equality of (6) can be postmultiplied by B T, and premultiplying the result by B yields Equivalently, That is, for the choice (BB T ) (BF B T ) (BB T ) = BF B T : (BF B T ) = (BB T ) (BF B T ) (BB T ) : (8) X = (BB T ) (BF B T ) (BB T ) the eigenvalues of the preconditioned operator (5) are identically and one step of GMRES will produce the solution. In the rest of this paper, we examine the use of the preconditioner (4) for solving the discrete two-dimensional Oseen equations, using X dened by (8) as well as some computationally less expensive variants. Because of the presence of the high-order discrete operator BF B T in (8), we will refer to the combination of (4) and (8) as the \BFBt preconditioner." Cf. [2] for other approaches to the problem of approximating the action of the inverse of BF B T. For our analysis, we restrict our attention to a \marker-and-cell" (MAC) nite-dierence operator [5], which we outline in x2. In x3, we show that in the case where (2) is given with constant \wind" w and periodic boundary conditions, (7) holds. (In fact the two spaces are identical.) Consequently, in this case the discussion of the previous paragraph represents a complete analysis. In x4, we examine through a series of numerical experiments the extent to which these results reect the behavior of the preconditioner in more realistic scenarios, that is, for Dirichlet boundary conditions or non-constant wind. Our observations are that the convergence behavior of Krylov subspace methods is independent of for the Dirichlet problem with constant wind and mildly dependent on for variable wind convergence also depends mildly on the mesh size h, with iteration counts increasing in proportion to h =2. These conclusions also hold for variants of the BFBt preconditioner designed to keep computational costs low. In particular, use of Q with (8) in an iteration entails two Poisson solves on the pressure space and (for any X) the solution of a set of convection-diusion equations on the velocity space. We show that these computations can be approximated using inner iterations with little degradation of performance of the outer iteration. In x5 we make some summarizing remarks, and in an appendix we present a partial analysis of the behavior of the BFBt preconditioner for the Stokes problem. 2. Finite dierence discretization. We briey describe the MAC nite difference scheme. Assume that is the rectangular region ( ) ( ), divided into a uniform n n grid of cells of width h = =n. Let u = (u v) T denote the velocity eld and w = (a b) T the wind. The discrete velocities and pressures are dened on a staggered grid in which the discrete values of u lie in the centers of the cell boundaries orthogonal to the x-axis, the discrete values of v lie in the center of the cell boundaries orthogonal to the y-axis, and the discrete pressures lie in the cell centers. An example is shown in Fig.. The matrix of (3) contains three block rows, the rst two of which come from the momentum equations for the individual components of the discrete velocity eld and 3

Velocity u Velocity v Pressure p Fig.. Staggered grid for the MAC nite dierence discretization. the last to the incompressibility constraint. F and B have the form F = F F 2 B = (B B 2 ) where F i = A i + N i. The submatrices are dened as follows. Let jk denote the value of a mesh function at the point (jh kh) 2. The form of the indices (j k) depends on the mesh function to which they correspond in particular, they need not be integers. The rst block row of (3) is dened by (9) [ u] jk [A u] jk h (4u 2 jk u j k u j+k u jk+ u jk ) [au x ] jk [N (x) u] jk 2h a j+=2k u j+k a j =2k u j k [bu y ] jk [N (y) u] jk 2h b jk+=2 u jk+ b jk =2 u jk [p x ] jk [B T p] jk h p j+=2k p j =2k : The second block row (associated with v) is dened analogously. The discrete convection operator N = N (x) + N (y) represents a second-order approximation to [w grad u + div (u w)] 2 this is a skew-self-adjoint version of the rst convection term of (2). The second term is treated analogously, and the resulting matrix N is skew-symmetric. The discrete incompressibility constraint is () [u x + v y ] jk [Bu] jk = u h j+=2k u j =2k + v h jk+=2 v jk =2 : We will discuss the treatment of boundary conditions in Sections 3 and 4. 3. Fourier analysis. Suppose (2) is posed with constant wind w and periodic boundary conditions u(x ) = u(x ) u( y) = u( y): For the discretization, indexing in (9) and () is done in mod n arithmetic. In particular, mesh points on the left and right (or top and bottom) boundaries are This form of the convection operator leads to the skew-self-adjoint weak form used, for example, in [4, p. 53], [26, p. 25]. 4

from S? to itself. 5 identied, and if (j k) corresponds to a pressure mesh point next to the boundary (e.g., j = =2) then one of its \neighboring" points is next to the opposite boundary (j = n =2). The discrete versions of u, v and p each contain n 2 components, and therefore each of () F F 2 B B 2 B T B T 2 is a square matrix of order n 2. We also dene a discrete convection-diusion operator F p on the pressure space using the rst three terms of (9). That is, F p A + N where A and N = N (x) +N (y) are specied exactly as in (9) and (j k) now corresponds to indices for grid functions in the pressure space. An analogous idea is used in the denition of \distributive relaxation" schemes for multigrid methods applied to the Stokes and Navier-Stokes equations, see [3, 3]. F p is also dened using periodic boundary conditions for the discrete pressures. It is then straightforward to prove the following lemma by direct calculation. Lemma 3.. If w is a constant vector then F B T = B T F p. We will assume for the rest of this section that the wind w is constant on. The preconditioned operator can then be analyzed using Fourier techniques of the type described in [4]. It turns out that F = F 2 = F p in this case and in some of the discussion we will refer to them collectively as F. Consider the discrete exponential mesh functions f (st) j s t < ng where (2) h i (st) = jk e2isx j e 2ity k x j = j=n y k = k=n j k < n: These make up an orthogonal basis for IC n2. The matrices associated with the periodic problem satisfy F (st) = 4 sin 2 (sh) + sin 2 (th) + i w sin(2sh) + w 2 sin(2th) h 2 h (st) B (st) = h e 2ish (st) B T (st) = h e 2ish (st) B (st) 2 = h e 2ith (st) B T (st) 2 = h e 2ith (st) : (3) That is, the mesh functions of (2) constitute an orthogonal basis of eigenvectors for each of these matrices. The particular choice s = t = leads to (), with an associated eigenvalue equal to for all the matrices of (). Let S denote the space generated by () and let S? denote the orthogonal complement of S. It can be veried that all eigenvalues of F are nonzero for other combinations of s and t, so that this matrix represents a nonsingular operator from S? to itself. We can also extend F to an operator on all of IC n2 by dening it to be on S?. Therefore, it makes sense to consider the operators (4) F B T B T F p BF B T = B F BT + B 2F 2 B T 2 : Lemma 3.2. The Schur complement BF B T of (4) is a nonsingular operator

Proof. It follows immediately from (3) that BF B T (st) = st (st) where st = 4 sin 2 (sh) + sin 2 (th) 4 sin 2 (sh) + sin 2 (th) + i h (w sin(2sh) + w 2 sin(2th)) : This eigenvalue is nonzero if s 6= or t 6=. Theorem 3.3. Let X be given by (8). If BF B T, X and (the identity) I are viewed as operators from S? to itself, then X is simply an alternative representation of BF B T, and BF B T X = I. Proof. It follows from Lemma 3. that F B T = B T Fp. Consequently, range(b T ) = range(b T F p ) = range(f B T ) so that (7) holds. The assertion follows from the discussion of x. 4. Experiments on Dirichlet problems. The results of x3 show that (8) de- nes a perfect preconditioner for constant wind and periodic boundary conditions. In this section, we examine the performance of the BFBt preconditioner and some variants on more realistic problems with Dirichlet boundary conditions u = g on @. For most of the experiments, the Oseen equations are posed on = ( ) ( ) and discretized with the MAC scheme. This is dened as in x2 except when the discrete operators refer to grid indices outside (for example, in components of the rst discrete momentum equation centered at the values of u next to the bottom of @). Linear extrapolation is used in these cases see the denition of T E in xa.2 for details. We consider two sets of coecients, constant w = ( 2) and w = a circular vortex. In the latter case w is the image of (5) (2y( x 2 ) 2x( y 2 )) under the linear mapping from ( ) ( ) to. Rather than impose explicit boundary conditions, we eliminate the discrete velocities on the boundary from the system and use a normally distributed random vector with mean and variance for the right hand side f of (3). We also demonstrate that the BFBt methodology is not restricted to nite dierences with a set of experiments for a nite element discretization of the driven cavity problem. Both discretizations satisfy an inf-sup condition [4]. Our results are in the form of iteration counts for various combinations of and h. We restrict our attention to values of such that for the smallest mesh size considered, wh 2 is of order, so that the discretizations are reasonably accurate. Unless otherwise indicated, the iterative solver is GMRES with right-oriented preconditioning the initial guess for all tests is u =, p =, and the stopping criterion is kf Ax k k 2 kfk 2 6 where f and x k denote the right hand side and iterate for the block system (3). We now present the results for the BFBt preconditioner. For comparison we show analogous iteration counts for the preconditioner (4) where X = M is the diagonal 6

Table Iterations of GMRES for constant wind w = ( 2) with nite dierence discretization. X = X BF Bt X = M h /6 /32 /64 /6 /32 /64 = 9 2 2 = = 8 5 34 34 33 = =3 9 3 88 87 83 = =5 9 44 45 39 Table 2 Iterations of GMRES for w = circular vortex with nite dierence discretization. X = X BF Bt X = M h /6 /32 /64 /6 /32 /64 = 8 2 = = 4 8 9 9 8 = =3 4 7 2 47 46 43 = =5 6 8 23 79 77 73 of the scaled mass matrix as described in [5] X = (h2 I) for the MAC scheme. For the moment we are ignoring any issues of cost. 2 We present four sets of results. For the MAC discretization, Table shows the number of iterations for convergence of GMRES for constant wind, and Table 2 shows the results for the circular vortex. Table 3 shows analogous statistics for one example of a dierent discretization consisting of bilinear nite elements for both velocities and pressures, with the pressure grid of width 2h and streamline upwinding for the velocities. 3 To show the eects of a suboptimal Krylov subspace iteration, Table 4 shows the iteration counts when the BFBt preconditioner is combined with quasiminimum residual (QMR) iteration without look-ahead [], which (in contrast to GMRES) has a xed cost per step. As dened, the BFBt preconditioner requires several costly subsidiary computations. For each step of GMRES, the preconditioning entails the action of Q where Q is given by (4). It can be seen from the factorization Q = F I I I B T that computing the action of Q entails computing the action of X performing a matrix-vector product by B T and I X 2 All computations were performed in MATLAB on either a Sun SPARC-2 workstation or a DEC-Alpha 2 4/275 workstation. For both preconditioners, the action of F was computed using Gaussian elimination. For the BFBt preconditioner the action of (BB T ) was computed by direct methods using the pseudo-inverse of BB T, except when h = =64 for the MAC discretization in that case, this computation was done using a multigrid iteration in which the relative residual is forced to be less than 8. 3 The test problem is slightly dierent here. It is posed on = ( ) ( ) with w as in (5), f = in (), and boundary conditions u = v = when x = or y =, and u =, v = when y =. See [5] for more details. Some of the entries in Table 3 are taken from [5]. 7

Table 3 Iterations of GMRES for w = circular vortex with bilinear nite element discretization. X = X BF Bt X = M h 2/6 2/32 2/64 2/6 2/32 2/64 = 7 9 2 22 2 = = 2 5 32 36 35 = =3 3 5 7 44 56 64 = =5 5 7 9 48 72 97 Table 4 Iterations of QMR with nite dierence discretization and BFBt preconditioning. w = ( 2) w = circular vortex h /6 /32 /64 /6 /32 /64 = 9 3 5 9 2 5 = = 9 3 7 2 7 2 = =3 4 4 2 27 = =5 2 3 6 2 28 computing the action of F. BB T is a discrete Poisson operator on the pressure space (see xa.2 for a derivation for the MAC scheme), so that for the BFBt preconditioner, computing the action of X entails solving two discrete Poisson equations. These together with the convectiondiusion solves are potentially expensive operations, and the BFBt preconditioner is signicantly more costly than when X = M, a diagonal matrix. We now examine what happens when less costly computations based on inner iteration are used in place of these three operations. We consider only nite dierences here although the same methodologies are applicable to other discretizations. These versions of the preconditioner also require less storage, and in particular this enabled us to further explore some trends in the data with a ner mesh size h = =28 and smaller viscosity parameter = = (used only with this ne mesh). We rst consider the eect of replacing the Poisson solves with approximations derived from one step of V-cycle multigrid [9, Ch. ]. 4 That is, the modied BFBt preconditioner uses X = S MG (BF B T ) S MG where S MG is the multigrid approximation to (BB T ). The computational costs of this algorithm are of the same order of magnitude as when X = M. (We are still solving the convection-diusion equations exactly except when h = =28 in this case, we use an iterative method based on relaxation and force the relative residual norm to be less than 8.) The iteration counts for the two sets of benchmark problems are shown in Tables 5 and 6. Next, we consider the eect of also replacing the convection-diusion solves with approximate solutions derived from iterative methods. We will comment on the choice of method below. The performance of any such method will depend on the relative 4 The multigrid computation used damped Jacobi smoothing with optimal smoothing parameter = 4=5, one pre-smoothing and one post-smoothing step, and bilinear interpolation for prolongation. 8

Table 5 Iterations of GMRES with modied BFBt preconditioning using multigrid for the Poisson equation. Wind w = ( 2), nite dierence discretization. X = X BF Bt=MG h /6 /32 /64 /28 = 2 5 9 = = 2 3 7 22 = =3 2 2 5 2 = =5 3 3 4 8 = = 4 Table 6 Iterations of GMRES with modied BFBt preconditioning using multigrid for the Poisson equation. Wind w = circular vortex, nite dierence discretization. X = X BF Bt=MG h /6 /32 /64 = 2 5 = = 4 6 2 = =3 9 2 24 = =5 2 24 27 amount of convection and diusion in the problem, i.e., the value of [7]. Therefore, rather than use a xed number of iterations, we perform the inner iteration until the stopping criterion (6) kw F v k k 2 kwk 2 = 2 is satised, where w represents the right hand side for each convection-diusion equation and v = is the initial guess. The number of these iterations may vary from step to step of the outer GMRES iteration, so that we are no longer using a xed preconditioner Q instead we have a series of operators Q k that vary with the GM- RES step. This may cause diculties for GMRES, but these can be avoided with a \exible" variant of GMRES (FGMRES) designed for this situation [23], which we use in these tests. The two algorithms are mathematically equivalent when Q k is xed. The results for constant wind are shown in Table 7, and for the circular vortex in Table 8. We also present the performance using the scaled mass matrix. A dash \{" indicates failure to converge this was due to divergence of the iteration for the convection-diusion equation and is an artifact of inaccuracy of the discretization. We highlight some trends displayed by these data as follows:. In all tests with the constant coecient problem (Tables, 4, and 7, left, and Table 5), iteration counts with the BFBt preconditioner are independent of the viscosity indeed, in some cases they actually decrease with. For the circular vortex, however, the iteration counts appear not to be independent of, instead exhibiting some growth as decreases (Tables 2 and 8, left, Table 4, right, and Table 6). 2. In all cases, the counts with BFBt preconditioning increase slowly with h. 3. The iteration counts for X = M are independent of h but they grow roughly linearly with = these trends are consistent with the analysis of [5]. 9

Table 7 Iterations of FGMRES with modied BFBt preconditioning using multigrid for the Poisson equation and iteration for the convection-diusion equation. Wind w = ( 2), nite dierence discretization. X = X BF Bt=MG=Iter X = M h /6 /32 /64 /28 /6 /32 /64 /28 = 3 6 2 2 2 2 = = 2 4 7 22 35 34 33 32 = =3 2 3 5 2 88 85 85 = =5 { 3 4 8 { 85 4 42 = = 5 Table 8 Iterations of FGMRES with modied BFBt preconditioning using multigrid for the Poisson equation and iteration for the convection-diusion equation. Wind w = circular vortex, nite dierence discretization. X = X BF Bt=MG=Iter X = M h /6 /32 /64 /28 /6 /32 /64 /28 = 3 6 9 2 2 2 = = 4 6 2 25 9 9 9 8 = =3 9 2 24 3 5 45 44 43 = =5 { 28 27 34 { 95 73 73 = = 37 55 The counts are smaller for the circular vortex than for the constant wind we have no explanation for this. However, the same qualitative patterns are present for both problems. In practical situations it is often desired to compute solutions of a xed accuracy for a variety of values of by letting h and simultaneously with respect to this criterion, the BFBt preconditioner requires signicantly fewer iterations as the viscosity decreases. 4. The results for bilinear nite elements (Table 3) are qualitatively the same as for nite dierences. 5. The iteration counts of QMR (Table 4) are only slightly higher than those of GMRES and they display the same patterns. This was also true for X = M (results not shown). 6. Comparison of Tables 5 and 6 with Tables and 2 shows that replacing the exact Poisson solve with one multigrid step leads to little increase in iteration counts (less than 25% for h = =64). The same trends with respect to and h are evident, and these carry over to h = =28. Note that the multigrid method used here is not necessarily an optimal choice, just a simple one. 7. Approximate solution of the convection-diusion problem produces little degradation in performance of either preconditioner, even though the tolerance in (6) is very mild. We return to the rst two items on this list, the dependence of iteration counts with BFBt preconditioning on the parameters (for the circular vortex) and h. To try to get a feeling for this dependence, we plot the behavior of the solver along with graphs of functions that might model this behavior. In particular, Figure 2 plots iteration counts as a function of using the entries of Table 8 for h = =28,

Fig. 2. Comparison of iteration counts with various functions of, for h = =28. 6 5 nu 4 Iterations 3 nu /2 2 2 3 4 5 6 7 8 9 /nu together with graphs of the functions and =2 the latter curves were shifted to make the gure easy to view. These results suggest that dependence on is like O( =2 ), although it is dicult to make a precise statement with this limited amount of data. Similarly, Figure 3 plots iteration counts as a function of h using the entries from Table 8 for = =3, and these are compared with h and h =2. These results typify the behavior for all the values in the table and suggest that the dependence on h is of order h =2 see also xa.2. We comment briey on the convection-diusion solvers. For simple ows, it is fairly easy to construct relaxation strategies that follow the ow and converge rapidly [7], but for more complex ows, especially with recirculations, this is a more dicult task. In these tests, we used a horizontal -line SOR iteration with relaxation parameter given by the optimal choice for the constant coecient problem, which can be computed analytically [8]. We used this for the circular wind only for convenience of coding, and convergence was slow for these problems. Many other options for solving this problem (such as multigrid) are available. We also ran some tests with another inner iteration (using a \multidirectional relaxation") and found the number of outer iterations not to depend signicantly on the choice of the method used here. This points to the importance of the convection-diusion problem for the BFBt preconditioner it is critical that that the approximate solution to this subproblem be computed eciently for the complete computation to be inexpensive. 5. Concluding remarks. We summarize this study as follows. The main new result is that the performance of the BFBt preconditioner for solving the steadystate Oseen equations with Dirichlet boundary conditions depends very mildly on the viscosity. This stands in contrast to other preconditioning methods, where performance deteriorates more dramatically as. Its performance depends on the mesh size h with iterations apparently growing in proportion to h =2. If a series of Dirichlet problems with decreasing viscosity are to be solved in such a way that h is proportional to, then we expect the iteration counts to grow like O( =2 ). This conclusion also appears to hold for \inexact" versions of the BFBt preconditioner in which the expensive subsidiary computations, solution of the Poisson equation

Fig. 3. Comparison of iteration counts with various functions of h, for = =3. 2 8 h 6 4 Iterations 2 h /2 2 4 6 8 2 4 /h and convection-diusion equation, are replaced by iterative solves with mild stopping criteria. The eectiveness of these variants of the BFBt preconditioner depends on having ecient methods for the convection-diusion equation. This issue is also a critical one for multigrid methods for the Navier-Stokes equations [3, 27, 28, 3]. It is dicult to make a simple comparison between the methods proposed here and multigrid most reported studies show a dependence on viscosity (or Reynolds number) but not on mesh size [27, 28, 3]. An advantage of the BFBt preconditioner is that it can be applied easily to systems arising from mixed nite elements with dierent grids for velocities and pressure (as shown in Table 3). If the goal is to solve a xed Dirichlet problem on a sequence of ner meshes, then there are other methods whose performance is less dependent on the mesh size [5, ]. This is true in particular for the Stokes equations, where several methods exist whose convergence rate is independent of h [, 2, 22, 25]. Finally, we note that the boundary conditions are very important here. Fourier analysis is often used as a guideline in behavior of numerical methods [2], and there are numerous examples of cases where the results for periodic boundary conditions are predictive of performance for Dirichlet conditions [4]. However, here there is a qualitative dierence in the two types of problems. See [8] for other examples of the eects of boundary conditions on iterative methods. A. Appendix: Partial analysis for the preconditioned Stokes problem. We show that the minimum eigenvalue for the BFBt-preconditioned Stokes operator is no smaller than one and give a partial analysis that suggests why its maximum eigenvalue is proportional to h. A.. Lower bound. Suppose the coecient matrix of (3) is a discrete Stokes operator, i.e., it is derived from =, w = in (2), with Dirichlet boundary conditions. Therefore, F = A. Let the singular value decomposition of B be denoted B = UV T = [U u ] 2 V T V T 2 = U V T :

The columns of U span range(b), the columns of V span range(b T ), and u is parallel to the discrete hydrostatic pressure p, which determines the null space S of B T. It follows that BB T = U 2 U T : As in x3, we will treat B T as an operator dened on S? on this space BBT and BAB T represent nonsingular operators whose inverses are given by the matrix pseudo-inverses (BB T ) = U 2 U T (BAB T ) = U (V T AV ) U T : We are interested in the minimum eigenvalue of the generalized eigenvalue problem (7) (BA B T )p = (BB T ) (BAB T ) (BB T )p: The Schur complement matrix on the left is S BA B T = U V T A V U T and the preconditioning matrix on the right is Consequently, X = U (V T AV ) U T : (8) S = X + U hv T A V (V T AV ) i U T : Using this splitting of S, we can prove the following result. Theorem A.. The minimum eigenvalue of the preconditioned generalized Stokes operator with Dirichlet boundary conditions is =. Proof. We will show that V T A V (V T AV ) is positive semi-denite. Let D = V T A V = D D 2 D 2 D 22 where D ij = Vi T AV j. D is symmetric positive-denite and standard analysis gives D = I D 2 D I D D 22 D 2 D D 2 Letting C D D 22 D 2 D D 2, this in turn implies that D = But D = V T A V, so that D D D 2CD 2D D 2CD CD 2D CD I D D 2 I V T A V = [D ] = D + D D 2C D D 2D : : : 3

Fig. 4. Maximum eigenvalue for the Stokes problem with BFBt preconditioner. 6 5 5.5 4 max(lambda) 3 2.8 2.68.8 5 5 2 25 3 35 /h Therefore, inf x6= (x V T A V x) (x (V AV ) x) (x D = inf + D D 2CD D 2D x) x6= (x D = inf x6= x) (x D + D 2 CD D 2x) : (x D x) It follows that V T A V (V T AV ) is positive semi-denite, so that (8) yields inf q2s? (q Sq) (q Xq) : A.2. Largest eigenvalue. The experiments described in x4 show that convergence rates for the BFBt preconditioner depend on the mesh size. Figure 4 plots the maximum eigenvalue of (7) as a function of h, where the computations were for h = =4, =8, =6 and =32. The results indicate that the maximum eigenvalue is of magnitude O(h ). By analogy with the standard convergence bounds for symmetric positive denite problems, this suggests that the iteration counts will increase in proportion to h =2, which is consistent with the results of x4. To provide some insight into this maximum eigenvalue, we write (q Sq) (q Xq) = (q Sq) (q (h 2 I)q) (q (h 2 I)q) (q Xq) where h 2 I is playing the role of the mass matrix for the nite dierence discretization. It is well-known [9] that the rst quotient on the right is bounded by a constant p 2. We would like a bound for the second quotient of the form or equivalently, (9) sup q2s? (q (h 2 I)q) (q (BB T ) (BAB T ) (BB T )q) c h sup q2s? (q BAB T q) (q (BB T ) 2 q) c h 3 : 4

The matrices comprising (9) have a tensor product structure derived from onedimensional operators. Let the interval [ ] be divided into n equally spaced subintervals of width h = =n, where the jth interval is [x j x j ] with x j = jh and midpoint ^x j = (j =2)h. An example with n = 8 is shown below. ^x ^x 2 ^x 3 ^x 4 ^x 5 ^x 6 ^x 7 ^x 8 x x x 2 x 3 x 4 x 5 x 6 x 7 x 8 Consider the following three (scaled) nite dierence approximations to the onedimensional Laplace operator u : T D = B @ 2 2... 2 2 C A T E = B @ 3 2... 2 3 C A T N = B @ 2... 2 T D and T E are derived from Dirichlet boundary conditions (u(), u() given), and T N is derived from Neumann conditions (u (), u () given). T D is dened at the cell boundaries fx j g n and is of order n T j= E and T N are dened at the cell centers f^x j g n j= and are of order n. In all cases, the discrete Laplace operator at points next to the boundaries depends on the boundary conditions. T E uses linear extrapolation C A : u(^x ) = u(x ) + 2u() u(^x n+ ) = u(x n ) + 2u() T N approximates the Neumann boundary conditions as u () [u(^x ) u(^x )]=h u () [u(^x n+ ) u(^x n )]=h: Note that (2) T E = T N + 2E E = B @... C A : We will also use the dierence operator B D = B @...... C A which has dimensions n (n ) and can be viewed as mapping cell boundary grid functions to cell centered functions. It is easily veried that (2) B D B T D = T N B D T D B T D = T 2 N : Recall that for matrices X and Y where X has dimensions rs, the tensor product of X and Y is [6, pp. 239] X Y = B @ x Y x s Y..... x r Y x rs Y 5 C A :

Letting I r denote the identity matrix of order r, it is straightforward to show that A = The identities (2) then imply I n T D + T E I n I n T E + T D I n B = [I n hb D hb D I n ] : BB T = h 2 (I n T N + T N I n ) (BB T ) 2 = h 4 (I n T 2 N + 2 T N T N + T 2 N I n) BAB T = h 2 (I n T 2 N + T E T N + T N T E + T 2 N I n): Consequently, relation (9) is equivalent to (22) sup q2s? (q (I n T 2 N + T E T N + T N T E + T 2 N I n)q) (q (I n T 2 N + 2 T N T N + T 2 N I n)q) c h : Remark. This derivation shows that BB T is a scaled discrete Poisson operator on the pressure space with Neuman boundary conditions. The matrices in (22) dier only by the presence of T E in the cross-terms of the numerator, i.e., the numerator contains submatrices derived from Dirichlet conditions whereas the denominator comes exclusively from Neumann conditions. To get a feeling for why this dierence in boundary conditions leads to an inequality of the form (22), consider the generalized eigenvalue problem for the one-dimensional operators, (23) T N x = T E x: It is evident from (2) that there is an eigenvalue = of multiplicity n 2 for which every eigenvector x satises x = x n =. Moreover, T N x = for the constant vector x, so that is also an eigenvalue. If any x satises (23) with 6= and at least one of x 6= or x n 6=, then the rst and last equations (of (23)) imply 6=. From the rst equation we have (24) x 2 x = 2 x : The assumption 6= together with the (interior) equations k = 2 : : : n imply x k+ x k = x k x k = 2 x 2 k n where the second equality comes from (24). An inductive argument then yields (25) x k = (2k ) Finally, the last equation of (23) implies x 2 k n: x n = 2 (x n x n ) = x : Equating this expression for x n with the one from (25) yields = =n = h. Therefore, we have the following result. 6

Theorem A.2. The eigenvalues of the generalized eigenvalue problem (23) are = of multiplicity n 2, =, and = h. Consequently, sup x (x T E x) (x T N x) = h where the supremum is taken over all vectors orthogonal to the constant vector. Remark. The results of xa. apply in general to any discretization for which the null space is spanned by constant pressures. The discussion of xa.2 applies only for the MAC scheme. Acknowledgements. The author acknowledges some helpful comments from David Silvester and Robert Pego. REFERENCES [] J. H. Bramble and J. E. Pasciak. A preconditioning technique for indenite systems resulting from mixed approximations of elliptic problems. Math. Comp., 5:{7, 988. [2] J. H. Bramble and J. E. Pasciak. A domain decomposition technique for Stokes problems. Applied Num. Math., pages 25{26, 989/99. [3] A. Brandt and N. Dinar. Multigrid solutions to elliptic ow problems. In S. V. Parter, editor, Numerical Methods for Partial Dierential Equations, pages 53{47. Academic Press, New York, 979. [4] T. F. Chan and H. C. Elman. Fourier analysis of iterative methods for elliptic problems. SIAM Review, 3:2{49, 989. [5] H. Elman and D. Silvester. Fast nonsymmetric iterations and preconditioning for Navier-Stokes equations. SIAM J. Sci. Comput., 7:33{46, 996. [6] H. C. Elman. Perturbation of Eigenvalues of Preconditioned Navier-Stokes Operators. Technical Report UMIACS-TR-95-, Institute for Advanced Computer Studies, University of Maryland, 995. To appear in SIAM J. Matr. Anal. Appl. [7] H. C. Elman and M. P. Chernesky. Ordering eects on relaxation methods applied to the discrete one-dimensional convection-diusion equation. SIAM J. Numer. Anal., 3:268{29, 993. [8] H. C. Elman and G. H. Golub. Iterative methods for cyclically reduced non-self-adjoint linear systems. Math. Comp., 54:67{7, 99. [9] M. Fortin and R. Pierre. Stability analysis of discrete generalised Stokes problems. Numer. Meth. Partial Di. Eq., 8:33{323, 992. [] R. Freund and N. M. Nachtigal. QMR: a quasi-minimal residual method for non-hermitian linear systems. Numer. Math., 6:35{339, 99. [] G. H. Golub and A. J. Wathen. An Iteration for Indenite Systems and its Application to the Navier-Stokes Equations. Technical Report AM-95-9, Mathematics Department, University of Bristol, 995. To appear in SIAM J. Sci. Stat. Comput. [2] P. M. Gresho, D. K. Gartling, J. R. Torczynski, K. A Clie, K. H. Winters, T. J. Garratt, A. Spence, and J. W. Goodrich. Is the steady viscous incompressible two-dimensional ow over a backward-facing step at Re=8 stable? Int. J. Numer. Meth. Fluids, 7:5{54, 993. [3] P. M. Gresho and R. L. Lee. Don't suppress the wiggles { they're telling you something. Computers and Fluids, 9:223{253, 98. [4] M. Gunzburger. Finite Element Methods for Viscous Incompressible Flows. Academic Press, San Diego, 989. [5] F. H. Harlow and J. E. Welch. Numerical calculation of time-dependent viscous incompressible ow of uid with free surface. The Physics of Fluids, 8:282{289, 965. [6] R. A. Horn and C. R. Johnson. Topics in Matrix Analysis. Cambridge University Press, New York, 99. [7] O. A. Karakashian. On a Galerkin-Lagrange multiplier method for the stationary Navier-Stokes equations. SIAM. J. Numer. Anal., 9:99{923, 982. [8] T. A. Manteuel and S. V. Parter. Preconditioning and boundary conditions. SIAM J. Numer. Anal., 27:656{694, 99. 7

[9] S. F. McCormick, editor. Multigrid Methods. SIAM, Philadelphia, 987. [2] M. F. Murphy and A. J. Wathen. On Preconditioners for the Oseen Equations. Technical Report AM 95-7, Department of Mathematics, University of Bristol, 995. [2] R. D. Richtmyer and K. W. Morton. Dierence Methods for Initial Value Problems. John Wiley & Sons, New York, second edition, 967. [22] T. Rusten and R. Winther. A preconditioned iterative method for saddle point problems. SIAM J. Matr. Anal. Appl., 3:887{94, 992. [23] Y. Saad. A exible inner-outer preconditioned GMRES algorithm. SIAM J. Sci. Comput., 4:46{469, 993. [24] Y. Saad and M. H. Schultz. GMRES: A generalized minimal residual algorithm for solving nonsymmetric linear systems. SIAM J. Sci. Stat. Comput., 7:856{869, 986. [25] D. Silvester and A. Wathen. Fast iterative solution of stabilized Stokes systems part II: using block preconditioners. SIAM J. Numer. Anal., 3:352{367, 994. [26] R. Temam. Navier-Stokes Equations. North-Holland, Amsterdam, 979. [27] M. C. Thompson and J. H. Ferziger. An adaptive multigrid technique for the incompressible Navier-Stokes equations. J. Comput. Phys., 82:94{2, 989. [28] S. P. Vanka. Block-implicit multigrid solution of Navier-Stokes in primitive variables. J. Comput. Phys., 65:38{58, 986. [29] R. Verfurth. A combined conjugate gradient-multigrid algorithm for the numerical solution of the Stokes problem. IMA J. Numer. Anal., 4:44{455, 984. [3] G. Wittum. Multi-grid methods for the Stokes and Navier-Stokes equations. Numer. Math., 54:543{564, 989. 8