MULTIGRID METHODS FOR NEARLY SINGULAR LINEAR EQUATIONS AND EIGENVALUE PROBLEMS

Similar documents
Geometric Multigrid Methods

SPECTRAL PROPERTIES OF THE LAPLACIAN ON BOUNDED DOMAINS

SOLVING MESH EIGENPROBLEMS WITH MULTIGRID EFFICIENCY

Kasetsart University Workshop. Multigrid methods: An introduction

MULTIGRID PRECONDITIONING IN H(div) ON NON-CONVEX POLYGONS* Dedicated to Professor Jim Douglas, Jr. on the occasion of his seventieth birthday.

Preconditioned Locally Minimal Residual Method for Computing Interior Eigenpairs of Symmetric Operators

10.6 ITERATIVE METHODS FOR DISCRETIZED LINEAR EQUATIONS

Multigrid Methods and their application in CFD

Computational Linear Algebra

Multigrid and Domain Decomposition Methods for Electrostatics Problems

A SHORT NOTE COMPARING MULTIGRID AND DOMAIN DECOMPOSITION FOR PROTEIN MODELING EQUATIONS

CLASSICAL ITERATIVE METHODS

A Generalized Eigensolver Based on Smoothed Aggregation (GES-SA) for Initializing Smoothed Aggregation Multigrid (SA)

Finite Elements. Colin Cotter. February 22, Colin Cotter FEM

Spectral element agglomerate AMGe

31. Successive Subspace Correction method for Singular System of Equations

Adaptive algebraic multigrid methods in lattice computations

VARIATIONAL AND NON-VARIATIONAL MULTIGRID ALGORITHMS FOR THE LAPLACE-BELTRAMI OPERATOR.

Chapter 2 Finite Element Spaces for Linear Saddle Point Problems

An Algebraic Multigrid Method for Eigenvalue Problems

NEW A PRIORI FEM ERROR ESTIMATES FOR EIGENVALUES

INTERGRID OPERATORS FOR THE CELL CENTERED FINITE DIFFERENCE MULTIGRID ALGORITHM ON RECTANGULAR GRIDS. 1. Introduction

The Algebraic Multigrid Projection for Eigenvalue Problems; Backrotations and Multigrid Fixed Points. Sorin Costiner and Shlomo Ta'asan

Constrained Minimization and Multigrid

INF-SUP CONDITION FOR OPERATOR EQUATIONS

INTRODUCTION TO MULTIGRID METHODS

Multigrid absolute value preconditioning

Solving Symmetric Indefinite Systems with Symmetric Positive Definite Preconditioners

A Finite Element Method Using Singular Functions for Poisson Equations: Mixed Boundary Conditions

LECTURE 1: SOURCES OF ERRORS MATHEMATICAL TOOLS A PRIORI ERROR ESTIMATES. Sergey Korotov,

A Least-Squares Finite Element Approximation for the Compressible Stokes Equations

arxiv: v1 [math.na] 11 Jul 2011

Lecture Note III: Least-Squares Method

A Note on Inverse Iteration

New Multigrid Solver Advances in TOPS

Multigrid Methods for Saddle Point Problems

Theory of PDE Homework 2

AMS526: Numerical Analysis I (Numerical Linear Algebra)

Simple Examples on Rectangular Domains

Functional Analysis Review

b i (x) u + c(x)u = f in Ω,

Second Order Elliptic PDE

MULTIGRID METHODS FOR NONLINEAR PROBLEMS: AN OVERVIEW

A Robust Approach to Minimizing H(div)-Dominated Functionals in an H 1 -Conforming Finite Element Space

Solving PDEs with Multigrid Methods p.1

ITERATIVE METHODS FOR NONLINEAR ELLIPTIC EQUATIONS

Analysis Preliminary Exam Workshop: Hilbert Spaces

arxiv: v2 [math.na] 16 Nov 2016

Applied Numerical Mathematics

Optimal multilevel preconditioning of strongly anisotropic problems.part II: non-conforming FEM. p. 1/36

1. Fast Iterative Solvers of SLE

Introduction to Multigrid Method

A MULTIGRID ALGORITHM FOR. Richard E. Ewing and Jian Shen. Institute for Scientic Computation. Texas A&M University. College Station, Texas SUMMARY

Multilevel Preconditioning of Graph-Laplacians: Polynomial Approximation of the Pivot Blocks Inverses

A Recursive Trust-Region Method for Non-Convex Constrained Minimization

A posteriori error estimates for non conforming approximation of eigenvalue problems

Notes on Multigrid Methods

AMG for a Peta-scale Navier Stokes Code

Math 350 Fall 2011 Notes about inner product spaces. In this notes we state and prove some important properties of inner product spaces.

Math Introduction to Numerical Analysis - Class Notes. Fernando Guevara Vasquez. Version Date: January 17, 2012.

Eigenvalues and Eigenfunctions of the Laplacian

08a. Operators on Hilbert spaces. 1. Boundedness, continuity, operator norms

Bootstrap AMG. Kailai Xu. July 12, Stanford University

A MULTILEVEL SUCCESSIVE ITERATION METHOD FOR NONLINEAR ELLIPTIC PROBLEMS

Numerische Mathematik

A Domain Decomposition Based Jacobi-Davidson Algorithm for Quantum Dot Simulation

DS-GA 1002 Lecture notes 0 Fall Linear Algebra. These notes provide a review of basic concepts in linear algebra.

From Completing the Squares and Orthogonal Projection to Finite Element Methods

Weak Formulation of Elliptic BVP s

Properties of Matrices and Operations on Matrices

Section 3.9. Matrix Norm

Efficient smoothers for all-at-once multigrid methods for Poisson and Stokes control problems

6 Inner Product Spaces

XIAO-CHUAN CAI AND MAKSYMILIAN DRYJA. strongly elliptic equations discretized by the nite element methods.

Chapter 7 Iterative Techniques in Matrix Algebra

Iterative Methods and Multigrid

FIRST-ORDER SYSTEM LEAST SQUARES (FOSLS) FOR PLANAR LINEAR ELASTICITY: PURE TRACTION PROBLEM

Linear Algebra Massoud Malek

The Conjugate Gradient Method

Local Mesh Refinement with the PCD Method

FIRST-ORDER SYSTEM LEAST SQUARES (FOSLS) FOR GEOMETRICALLY NONLINEAR ELASTICITY

A Perron-type theorem on the principal eigenvalue of nonsymmetric elliptic operators

Preconditioning in H(div) and Applications

Appendix A Functional Analysis

STEEPEST DESCENT AND CONJUGATE GRADIENT METHODS WITH VARIABLE PRECONDITIONING

and finally, any second order divergence form elliptic operator

Stabilization and Acceleration of Algebraic Multigrid Method

Domain decomposition on different levels of the Jacobi-Davidson method

Adaptive methods for control problems with finite-dimensional control space

EQUIVALENCE OF TOPOLOGIES AND BOREL FIELDS FOR COUNTABLY-HILBERT SPACES

ENERGY NORM A POSTERIORI ERROR ESTIMATES FOR MIXED FINITE ELEMENT METHODS

University of Illinois at Urbana-Champaign. Multigrid (MG) methods are used to approximate solutions to elliptic partial differential

Eigenvectors and Hermitian Operators

Chapter 6: Orthogonality

Spectral Theorem for Self-adjoint Linear Operators

6. Iterative Methods for Linear Systems. The stepwise approach to the solution...

Friedrich symmetric systems

Multigrid Method ZHONG-CI SHI. Institute of Computational Mathematics Chinese Academy of Sciences, Beijing, China. Joint work: Xuejun Xu

Aspects of Multigrid

Math 290, Midterm II-key

Transcription:

SIAM J. NUMER. ANAL. c 997 Society for Industrial and Applied Mathematics Vol. 34, No., pp. 78 200, February 997 008 MULTIGRID METHODS FOR NEARLY SINGULAR LINEAR EQUATIONS AND EIGENVALUE PROBLEMS ZHIQIANG CAI, JAN MANDEL, AND STEVE MCCORMICK Abstract. The purpose of this paper is to develop a convergence theory for multigrid methods applied to nearly singular linear elliptic partial differential equations of the type produced from a positive definite system by a shift with the identity. One of the important aspects of this theory is that it allows such shifts to vary anywhere in the multigrid scheme, enabling its application to a wider class of eigenproblem solvers. The theory is first applied to a method for computing eigenvalues and eigenvectors that consists of multigrid iterations with zero right-hand side and updating the shift from the Rayleigh quotient before every cycle. It is then applied to the Rayleigh quotient multigrid (RQMG) method, which is a more direct multigrid procedure for solving eigenproblems. Local convergence of the multigrid V-cycle and global convergence for a full multigrid version of both methods is obtained. Key words. eigenvalue problem, multigrid, Rayleigh quotient, singular equations AMS subject classifications. 65N55, 65F0, 65N30 PII. S0648275942639. Introduction. In this paper, we consider the solution of the generalized eigenvalue problem based in an abstract finite-dimensional Hilbert space V with inner product (, ): find λ Rand 0 u V such that (.) Au = λbu. Here for simplicity A and B are assumed to be linear continuous symmetric positive definite operators defined on V. We will consider two multigrid approaches for finding the smallest eigenvalue for (.) based on a sequence of subspaces. One uses multigrid as an inner loop solver for an outer loop inverse iteration type process, which has been studied by many authors (cf. the early wor in [] and [7]). The other is the Rayleigh quotient multigrid (RQMG) method [5, 8], which is a more direct approach based on minimizing the Rayleigh quotient at each stage of the multigrid processing. To our nowledge, this is the first theory for methods lie RQMG where the shifts are allowed to vary within multigrid cycles and be close to eigenvalues. Filling this gap is the main purpose of this paper. We will analyze convergence of these two multigrid methods by developing and applying a general convergence theory for singular or nearly singular linear problems: given f V and a scalar µ R, find 0 u V such that (.2) (A µb)u = f. Received by the editors April 4, 994; accepted for publication (in revised form) March 27, 995. http://www.siam.org/journals/sinum/34-/263.html Department of Mathematics, Purdue University, West Lafayette, IN 47907-395 (zcai@math. purdue.edu). The research of this author was supported by National Science Foundation grant DMS-969792. Center for Computational Mathematics, University of Colorado at Denver, Denver, CO 8027-3364. The research of this author was supported by National Science Foundation grant ASC-9243. Program in Applied Mathematics, University of Colorado at Boulder, Boulder, CO 80309-0526. The research of this author was supported by Air Force Office of Scientific Research grant AFOSR- 9-056 and by National Science Foundation grant DMS-870469. 78

MULTIGRID FOR LINEAR EQUATIONS AND EIGENPROBLEMS 79 Here we restrict ourselves to theoretical analyses because of the numerical results available in [5] for RQMG, the principal target of this paper. Previous convergence results for multigrid algorithms applied to (.2) were obtained by Ban []. In order to establish norm estimates for the rate of convergence, the shift µ was assumed to be bounded away from the smallest eigenvalue of (.) in []. In contrast, our analysis uses an error decomposition into the eigenspace associated with the smallest eigenvalue of (.) and its orthogonal complement. We will not attempt to solve (.2) in the usual sense; instead, our aim is to preserve the approximate magnitude of the components in the eigenspace of the smallest eigenvalue of (.) while attenuating error components in its orthogonal complement. In our analysis, the shift µ will be allowed to vary in a small neighborhood of the smallest eigenvalue of (.). The outline of the remainder of this paper is as follows. In section 2, we formulate the problems, establish notation, and define a multigrid algorithm for the nearly singular problem. In section 3, we develop a convergence theory for this multigrid algorithm. The theory is first applied in section 4 to a method for computing eigenvalues and eigenvectors that uses multigrid as an inner loop solver for an outer loop inverse iteration type process. It is then applied in section 5 to RQMG. The final section develops global convergence results for full multigrid V-cycle versions of both methods. 2. Preliminaries. Let V be a real linear space, on which are given inner products a(, ) and(, ), with corresponding induced norms denoted by and. Let b(, ) be a continuous, symmetric, positive definite bilinear form on V V. Consider the eigenvalue problem: find λ Rand 0 u V such that (2.) a(u, v) =λb(u, v) v V. If (2.) corresponds to the eigenvalue problem for a self-adjoint elliptic partial differential operator, it will typically admit an infinite set of nondecreasing eigenvalues. Without loss of such generality, let the (possibly multiple) eigenvalues of (2.) satisfy (2.2) 0 <λ <λ 2 <. In particular, we note that λ is the minimum of the Rayleigh quotient over V : (2.3) λ = inf RQ(u) 0 u V where the Rayleigh quotient is defined by (2.4) a(u, u) RQ(u) = b(u, u). We will consider multigrid methods for finding the smallest eigenvalue for (2.), based on a sequence of finite-dimensional subspaces. To this end, let V 0 V V be a nested family of finite-dimensional subspaces of V. Let (, ) be a given inner product on V, its induced norm, and h =2 h 0 the mesh parameter associated with V, h 0 > 0. Then the corresponding finite-dimensional problem (2.) on V is as follows: find λ Rand 0 u V such that (2.5) a(u,v)=λ b(u,v) v V.

80 ZHIQIANG CAI, JAN MANDEL, AND STEVE MCCORMICK Let the (possibly multiple) eigenvalues of (2.5) satisfy (2.6) 0 <λ <λ 2 < <λ max. Note that λ is the minimum of the Rayleigh quotient over V : λ = inf RQ(u ). 0 u V In order to analyze convergence of the multigrid algorithm for the eigenvalue problem (2.5), we will first study the behavior of multigrid applied to the following singular or nearly singular problem: given a source term f l in the dual space (V l ) and a scalar λ R, find 0 u l V l such that (2.7) a(u l,v) λb(u l,v)=f l (v) v V l. The shift λ is assumed to satisfy (2.8) By this we mean that (2.9) λ λ λ l + O(h 0 /l), λ λ λ l + Ch 0 /l, l. l for some constant C independent of h 0 and l. This condition, which allows for shifts that are arbitrarily close to λ l, can be guaranteed for the eigenproblem solvers we consider, as we show in the last section. However, since λ = λ l is allowed, problem (2.7) may not have a solution. This is acceptable because our real interest is eigenproblems: we will attempt to solve (2.7) not in the strict sense, but only in that the approximation is correct up to the eigencomponents belonging to λ l. The reader is strongly advised to eep in mind that all of the following estimates allow for λ to change any time during multigrid processing, provided it continues to satisfy the bounds in (2.8). This allowance for a floating shift is one of the ey distinguishing points of the theory developed here, and it is just what enables treatment of the nonlinear scheme RQMG below. Unfortunately, it is also what dramatically complicates our exposition. For any u, v V, define the bilinear forms c λ (, ) andc(, )onv V by c λ (u, v) =a(u, v) λb(u, v) and c(u, v) =a(u, v) λ b(u, v). Fix {0,,,l}and let q (, ) be a bilinear form on V V. We will use this form to define the smoothing step in the multigrid algorithm defined below. Define the operators A,B,C λ,c,q :V V by requiring for all u and v in V. Note that (A u, v) =a(u, v), (B u, v) =b(u, v), (Cλu, v) =c λ (u, v), (C u, v) =c(u, v), (Q u, v) =q (u, v) C λ = A λb and C = A λ B.

MULTIGRID FOR LINEAR EQUATIONS AND EIGENPROBLEMS 8 In practice, the operator Q will be constructed to approximately invert Cλ in some wea sense. The basic idea is that we want Q to satisfy the condition that it is an adequate approximate inverse of Cλ on oscillatory vectors in V. We will be precise about this condition in section 3.3. Denote the eigenspace associated with λ by and its a-orthogonal complement by V = {u V : a(u, v) =λ b(u, v) v V } V 2 = {u V : a(u, v) =0 v V }. For any u V, define the a-orthogonal projection operators P P2 : V V2 by and a(p u, v) =a(u, v) v V : V V and a(p 2 u, v) =a(u, v) v V 2. Consider the following multigrid algorithm for solving (2.7); more precisely, this algorithm attempts to reduce the error in V2 l only while eeping the V l approximation component essentially unchanged. Note that the multigrid algorithm as it is posed here is based on a direct solver for the coarsest grid problem (l = 0). Later we will allow for approximate solvers. MG/ALGORITHM. Let an initial approximation u l,0 V l and a right-hand side f l be given. Then the new approximation u l,2 = MG/ l (u l,0,f l )is defined recursively as follows: (a) If l =0, then compute u l,2 V l so that (2.0) Pu l l, 2 = Pu l l, 0 and c λ (u l, 2,v)=f l (v) v V2. l (b) If l>0, then perform the following:. Coarse-grid correction step. Denote the current residual by r l,0 (v) = f l (v) c λ (u l,0,v)and define f l (V l ) as its restriction to V l : f l (v) =r l, 0 (v) v V l. Then set u l, = u l,0 + MG/ l (0, f l ). 2. Smoothing step. Define w l V l by (2.) q l (w l,v)=r l, (v) v V l, where r l, (v) =f l (v) c λ (u l,,v). Then set u l,2 = u l, + w l. We have introduced this abstract form of the multigrid algorithm for two basic reasons. First, we want a general scheme that controls the potential instabilities found in many approaches to nearly singular equations, namely, those caused by shifts that get too close to coarse grid eigenvalues. Our coarsest grid solver in (2.0) and the smoothers we allow in section 3.3 prevent such difficulties. Second, this abstract algorithm applies naturally to RQMG, as we show in section 5.

82 ZHIQIANG CAI, JAN MANDEL, AND STEVE MCCORMICK 3. Convergence of a multigrid method for nearly singular linear equations. In this section, we will analyze convergence of multigrid for the nearly singular linear problem (2.7). Our analysis uses an error decomposition introduced in section 3.2, which is based on an interpretation of solution and error that taes into account the objective of using (2.7) to solve the eigenvalue problem (2.5) with = l. We have therefore not attempted to solve (2.7) in the usual sense; instead, our aim is to preserve the approximate magnitude of the components in the eigenspace associated with λ l while attenuating error components in its a-orthogonal complement. Based on this error decomposition, we establish a smoothing property of relaxation in section 3.3 and a reduced approximation property of the discretization in section 3.4 which are combined in section 3.5. A V-cycle estimate will be developed in section 3.7 that follows from the recursive estimates developed in section 3.6. Again, the reader is strongly advised to eep in mind that the shift λ is floating in the sense that it is allowed to vary anywhere in the multigrid process, provided it remains within the bounds expressed by (2.8). 3.. Preliminaries and assumptions. Fix {0,,,l}. For any u V, the a-orthogonal projections P and P2 yield the unique decomposition (3.) u = P u + P 2 u. Remar 3.. Note that V and V 2 are also b-orthogonal: (3.2) b(u, v) =0 u V v V2. In this paper, C>0will denote a generic constant that does not depend on the number of levels l or any of the mesh sizes h. Assumption 3.. The norms and are uniformly equivalent on all V, that is, for any v V, (3.3) C v v C v. Assumption 3.2. Assume that the following approximation properties hold: () For any u V, there exists v V such that (3.4) u v Ch u. (2) For any v V, there exists u V such that (3.5) v u Ch v. Remar 3.2. Under the usual full regularity assumptions and subspace properties, standard finite element theory (cf. [9]) concludes that for any 0 u V that minimizes RQ(u) overv there exists v V l such that u v Ch l u. (Note that C here depends on λ.) Property (3.4) then follows using the triangle inequality and this estimate with l = and l =. An analogous argument based on a corresponding estimate can be used to establish (3.5). The eigenvalue estimate in our first lemma below also follows from standard finite element theory, but we include its proof here to show that the estimate is actually a consequence of our general assumptions.

MULTIGRID FOR LINEAR EQUATIONS AND EIGENPROBLEMS 83 LEMMA 3. (eigenvalue approximation property). λ λ λ + Ch 2. Proof. The left inequality follows from noting that λ i is the minimum of RQ(u i ) over V i, i =,, and that V V. To prove the right inequality, note for any u V and v V that a(u v, u v) =a(u v, u) a(u, v)+a(v, v) = λ b(u v, u) λ b(u, v)+λ b(v, v) = λ b(u v, u v)+(λ λ )b(v, v) (λ λ )b(v, v) Hence, λ a(v, v). = λ λ λ λ λ u v 2 v 2. The lemma then follows from Assumption 3.2 and the fact that λ is bounded by a constant independent of h. Remar 3.3. A similar argument shows that Together with (2.8), this implies that λ λ λ + Ch 2. (3.6) 0 λ λ Ch 0 /l. LEMMA 3.2 (norm equivalence). It holds for any v V2 that (3.7) c(v, v) v 2 Cc(v, v) and for all sufficiently small h 0 that (3.8) c λ (v, v) v 2 Cc λ (v, v). Proof. It suffices to prove (3.8) since (3.7) may be proved in a similar way. The left inequality follows immediately from the fact that the bilinear form b(, ) is nonnegative. To prove the right inequality, note that since V2 is the a-orthogonal complement of V, then a(v, v) λ 2b(v, v). Thus, c λ (v, v) ( λ ) λ a(v, v). 2 Standard mini-max arguments (cf. [9]) show that λ 2 λ 2 λ 0 2, from which follows c λ (v, v) λ 2 λ λ 0 a(v, v). 2 The proof is concluded using (2.8).

84 ZHIQIANG CAI, JAN MANDEL, AND STEVE MCCORMICK We may thus define new norms on the spaces V 2 ( =0,,,l), which are uniformly equivalent to the norm, by v c = c(v, v) v V2. Remar 3.4. c is a seminorm on V. It is in fact a norm provided λ λ. 3.2. Error decomposition. The following concept of error decomposition will be the basis of our analysis. Because we consider algorithms for the nearly singular problem (2.7) with the ultimate objective of solving (2.5) with = l, the usual concepts of solution and error are not really relevant here. Indeed, we will not attempt to solve the problem per se but only to attenuate error components in the subspace V2 l while preserving the approximate magnitude of the components in V l. The purpose of this section is to mae this precise. The main point to note here is that we cannot measure error in the usual direct sense because λ and hence what we even mean by the solution are floating: the error component in V l is more or less well defined because λ<<λ l 2, so we will use a direct error norm to measure it. The error component in V l is elusive, however, so instead of a direct measure we will use a residual norm that is easier to pinpoint and bound. LEMMA 3.3 (an invariant subspace property). Let f2 be a continuous linear functional on V that vanishes on V. Then for sufficiently small h 0 there exists a unique U V2 such that (3.9) c λ (U, v)=f 2(v) v V. Proof. It follows from the definition of the subspace V2 and from (2.8) that the bilinear form c λ (, ) is positive definite on V2. By the Lax Milgram lemma there exists a unique solution U V2 of the variational problem c λ (U, v)=f2(v) v V2. The proof is completed by noting that U is orthogonal to V with respect to the bilinear form c λ (, ). Let u be an approximation to the solution of the following problem: find u V such that (3.0) c λ (u,v)=f (v) v V for a given functional f : V R. We then define the residual r as the functional on V given by (3.) r(v) =f (v) c λ (u, v). Suppose that we are given a decomposition (3.2) f = f + f 2, where f 2 vanishes on V. By Lemma 3.3, there exists a U V 2 such that (3.9) holds. Write Then u = u + u 2, u = P u, u 2 = P 2 u. r(v) =f (v) c λ (u,v)+f 2 (v) c λ (u 2,v)=r (v)+c λ (e 2,v),

MULTIGRID FOR LINEAR EQUATIONS AND EIGENPROBLEMS 85 where (3.3) e 2 = U u 2 and (3.4) r = r + r 2 with (3.5) (3.6) r (v) =f(v) c λ (u,v), r 2 (v) =f2(v) c λ (u 2,v)=c λ (e 2,v). We define the size of the error to be the pair ( ) r, e 2 c where r is defined as the functional norm r (v) (3.7) r = sup. v V v Remar 3.5. Note that the decomposition (3.2), and therefore the definition of the size of the error, is not unique. We allow this freedom to accommodate a general theory but will specify the decomposition later to suit our purposes. 3.3. Smoothing properties of relaxation. Recall that the smoothing step in the linear multigrid algorithm is defined as the replacement of the current approximation u V by û = u + w, with the correction w V defined using q as a preconditioner: (3.8) q (w, v) =r(v) v V, where r is the residual functional defined in (3.). Also recall that Q : V V is the operator induced by the form q. Below we use (Q ) T to denote its adjoint in (, ). Here we assume that r is decomposed according to (3.4) for some given decomposition of f according to (3.2). To analyze the smoother, we will need two additional assumptions. Assumption 3.3 (properties of the forms a and b). Assume for all u, v V that (3.9) (3.20) (3.2) (3.22) a(u, v) Ch 2 u v, a(u, u) C u 2, b(u, v) C u v, b(u, u) C u 2. Remar 3.6. Let ρ( ) denote spectral radius; then (3.9) (3.22) are equivalent to the respective inequalities (3.23) ρ(a ) Ch 2, ρ( (A ) ) C, ρ(b ) C.

86 ZHIQIANG CAI, JAN MANDEL, AND STEVE MCCORMICK In view of Assumption 3., we also have the equivalent respective assumptions (3.24) A Ch 2, (A ) C, B C, which are naturally satisfied by a wide class of discrete elliptic problems. Assumption 3.4 (smoother). Assume that w V is defined uniquely by (3.8) and that (3.25) w Ch 2 r. Further, suppose there exists a constant σ>0 such that (3.26) Q +(Q ) T Cλ σ ρ(a ) (Q ) T Q. Remar 3.7. Property (3.25) is equivalent to the inequality (3.27) (Q ) Ch 2. Let Q = D L, where D is self-adjoint in the inner product (, ) andd and L decompose C λ according to C λ = D L (L ) T. Assume that (3.28) C h 2 I D σρ(a )I and (L ) T L ( η)(d ) 2 for some appropriate constant η>0. (Self-adjoint operators E and F are said to satisfy the relation E F if E F is nonnegative definite.) Then (3.26) and (3.27) are easily verified for this choice of Q. This shows that Assumption 3.4 is natural for Gauss Seidel relaxation applied to linear equations that satisfy (3.28), which is easily verified for Poisson s equation on a uniform grid, for example. For further discussion, see [6]. Note also that the choice Q = ω D satisfies (3.26) and (3.27) under these assumptions provided ω<, which corresponds to damped Jacobi relaxation. LEMMA 3.4 (properties of the smoother). (i) Define z V as the solution of the problem (3.29) q (z, v) =r (v) v V. Then (3.30) z Ch r. (ii) Define y V as the solution of the problem (3.3) q (y, v) =c λ (e 2,v) v V. Then (3.32) P y Ch e 2 c and, for sufficiently small h 0, (3.33) e 2 P2 y 2 c ( +C(λ λ )+Ch 2 λ λ ) e 2 2 c σ ( + C(λ λ )) ρ(a ) C λe 2 2.

MULTIGRID FOR LINEAR EQUATIONS AND EIGENPROBLEMS 87 Proof. (i) Bound (3.30) immediately follows from (3.9) and (3.25). (ii) We first prove (3.32). From (3.6) we have r 2(v) =c λ (e 2,v). It therefore follows from (3.7) and (3.9) that Hence, r2(v) e 2 c v c e 2 c v Ch e 2 c v. (3.34) r 2 Ch e 2 c. Write y = P y and y 2 = P2 y. Using the fact that y V and relations (3.2), (3.2), (3.25), and (3.34), we get y = λ b(y,y ) λ b(y, y) C y Ch2 r2 Ch e 2 c, which is just (3.32). To prove (3.33), note that First note that by (3.26) we have e 2 y =(I (Q ) C λ)e 2. (I (Q ) Cλ) T Cλ(I (Q ) Cλ) = Cλ Cλ ( (Q ) T +(Q ) ) Cλ +Cλ(Q ) T Cλ(Q ) Cλ (3.35) =Cλ Cλ(Q ) T ( (Q ) T +Q Cλ) (Q ) Cλ Cλ σ ρ(a ) (C λ) 2. This bound and the inequality c λ (e 2,e 2) c(e 2,e 2) imply that c λ (e 2 y, e 2 y) c λ (e 2,e 2) σ ρ(a ) C λe 2 2 e 2 2 c σ ρ(a ) C λe 2 2. On the other hand, by (3.32) and (3.7) we have c(e 2 y 2,e 2 y 2 ) =c λ (e 2 y 2,e 2 y 2 )+(λ λ )b(e 2 y 2,e 2 y 2 ) =c λ (e 2 y, e 2 y) c λ (y,y )+(λ λ )b(e 2 y 2,e 2 y 2 ) c λ (e 2 y, e 2 y)+ λ λ λ a(y,y )+ λ λ λ a(e 2 y 2,e 2 y 2 ) 2 c λ (e 2 y, e 2 y)+ch 2 λ λ e 2 2 c + C(λ λ ) e 2 y 2 2 c. We can thus conclude from the two inequalities above that e 2 y 2 2 c ( + Ch2 λ λ ) e 2 2 c ρ(a ) C λ e 2 2, C(λ λ ) which with (2.8) gives (3.33) for all sufficiently small h 0. σ

88 ZHIQIANG CAI, JAN MANDEL, AND STEVE MCCORMICK 3.4. Properties of coarse grid correction. Define the projection operator S : V V2 by c(s u, v) =c(u, v) u V v V2. Further, define the projection onto the c-orthogonal complement of V2 by (3.36) Remar 3.8. For any v V,wehave T = I S. (3.37) v 2 c = S v 2 c + T v 2 c and, consequently, (3.38) S v c v c and T v c v c. LEMMA 3.5 (stability of S ). (3.39) S v C v v V. Proof. This bound follows immediately from (3.7) and (3.38). LEMMA 3.6 (eigenvector approximation properties). (3.40) and P v Ch v v V 2 (3.4) P2 v Ch v v V. Proof. Let v V2 and choose w V, guaranteed by (3.4) to satisfy P v w Ch P v. Then a(p v, P v) =a(v, P v) =a(v, P v w) v P v w Ch v P v, which establishes (3.40). A similar argument proves (3.4). LEMMA 3.7 (eigenvector approximation property). P S v Ch v c v V2. Proof. This is an immediate consequence of (3.40), (3.39), and (3.7). The following standard approximation assumption follows from H 2 -elliptic regularity of the bilinear form a(, ). See, for example, [6]. Assumption 3.5 (standard approximation property). There exists a constant δ>0 such that for any v V, there exists w V satisfying v w 2 δ ρ(a ) A v 2. THEOREM 3. (reduced approximation property). There exists a constant δ > 0 such that for sufficiently small h 0, (3.42) T v 2 c δ ρ(a ) C λv 2 v V 2.

MULTIGRID FOR LINEAR EQUATIONS AND EIGENPROBLEMS 89 Proof. Let v V 2. According to the Cauchy Schwarz inequality, Assumption 3., (3.20), and (3.8), we have Hence (C λv, v) C λv v C C λv v C C λv (C λv, v) 2. (C λv, v) 2 C C λ v, which together with (3.2), (3.20), and (3.8) implies that A v C λv + λ B v (3.43) C λv + Cλ(C λv, v) 2 C C λv. Let P : V V be the a-orthogonal projection operator. Then, by Assumption 3.5 and (3.43), there exists a constant δ>0 such that (3.44) v Pv 2 δ ρ(a ) A v 2 C ρ(a ) C λv 2. Let u V be such that (3.5) holds for v replaced by P Pv.Then Hence P Pv 2 = a(p P P Pv, Pv)=a(P Pv, v)=a(p Pv u v Ch P Pv v. Pv Ch v Ch A v Ch C λv, where we used (3.43). Now from this, (3.7), (3.44), and (3.23) we have Pv 2 c 2( v Pv 2 + P T v 2 c v P 2 C ρ(a ) C λv 2. Pv 2 ) Pv u, v) This completes the proof. 3.5. Combined smoothing and approximation properties. LEMMA 3.8 (reduced approximation property). Let w V be the solution of the problem Then for sufficiently small h 0, with q (w, v) =r (v)+c λ (e 2,v) v V. e 2 P2 w c {( + C(λ λ ))( S e 2 2 c + β 2 T e 2 2 ) +Ch 2 λ λ e 2 2 c} 2 +Ch r β 2 = σ δ <, where σ is given in (3.26) and δ is given in (3.42).

90 ZHIQIANG CAI, JAN MANDEL, AND STEVE MCCORMICK Proof. Let z and y be the solutions of the problems (3.29) and (3.3), respectively, so that w = z + y. The triangle inequality and (3.7) then imply that e 2 P 2 w c e 2 P 2y c + z. The proof of the lemma now follows from (3.33), (3.30), Theorem 3., and (3.37). Before we estimate the error in the multigrid algorithm, we need to estimate the error quantities just before the coarse grid correction. To this end, recall in the correction step of MG/ that the functional f (V ) is the restriction of the residual functional r, 0 to V : Now f (v) =r, 0 (v) =f (v) c λ (u,0,v) v V. u, 0 = P u, 0 and u, 0 2 = P 2 u, 0. A given decomposition of f according to (3.2) yields a decomposition of r, 0 according to (3.4) (3.6): where r, 0 (v) =r, 0 (v)+r, 0 2 (v) v V, r, 0 (v) =f(v) c λ (u, 0,v), r, 0 2 (v) =c λ (e,0 2,v), e,0 and U 2 is the solution of the problem (3.9). For any v V, 2 = U 2 u, 0 r, 0 2 (v) =c(e, 0 2,v) (λ λ )b(e, 0 2,v) =c(e, 0 2,P2 v) (λ λ )b(e, 0 2,v)+c(e, 0 2,P v) =c(s e, 0 2,P2 v) (λ λ )b(e, 0 2,v)+c(e, 0 2,P v). We are therefore led to choose 2, (3.45) f (v) =r, 0 (v) (λ λ )b(e, 0 2,v)+c(e, 0 2,P v) and (3.46) f 2 (v) =c(s e,0 2,P 2 v). This provides a legitimate coarse grid decomposition according to (3.2) because the functional f2 vanishes on V and f = f + f 2. Let U 2 V 2 be the solution of the problem (3.47) c λ (U 2,v)=f 2 (v) v V 2. Then r, 0 (v) =f (v) c λ (u,0,v) has a corresponding decomposition according to (3.4) (3.6): r, 0 (v) =f (v) c λ (u,0,v) and r,0 2 (v)=c λ (U 2 u,0 2,v).

MULTIGRID FOR LINEAR EQUATIONS AND EIGENPROBLEMS 9 LEMMA 3.9 (initial coarse grid error estimates). satisfies The initial coarse grid error (3.48) (3.49) (3.50) u, 0 =0,U 2 =e,0 2,r,0 i =f i,i=,2, U 2 c ( + C(λ λ )) S e, 0 2 c for sufficiently small h 0, U 2 S e, 0 2 c C(λ λ ) e, 0 2 c, and (3.5) r, 0 = f r, 0 +(C(λ λ )+Ch ) e, 0 2 c. Proof. Relations (3.48) follow immediately from the definitions of MG/, e, 0 i, and r, 0 i, i =,2. Bound (3.49) follows from using (3.47), (3.46), the Cauchy Schwarz inequality, (3.38), (3.2), (3.20), and (3.7), to conclude that U2 2 c = c λ (U2, U2 )+(λ λ )b(u2,u2 ) =c(s e, 0 2, U2 )+(λ λ )b(u2,u2 ) S e, 0 2 c U2 c + C(λ λ ) U2 2 c. By (3.46) and (3.47), we have c(u 2 S e, 0 2,v)=(λ λ )b(s e, 0 2,v) v V 2. Choosing v = U2 S e, 0 2, then (3.50) follows from (3.2), (3.20), (3.7), and (3.38). To prove (3.5), let v V and u V be arbitrary. Then from (3.45), the Cauchy Schwarz inequality, (3.2), and (3.7), we have f (v) r, 0 (v) +(λ λ ) b(e,0 2,v) + c(e,0 2,P v) r, 0 (v) + C(λ λ ) e,0 2 v + c(e,0 2,P v u) r, 0 (v) + C(λ λ ) e,0 2 v + C e,0 2 P v u. Choosing u V according to (3.5) with P v in place of v, it follows from (3.8) that f (v) r, 0 (v) + C(λ λ ) e,0 2 v + Ch e,0 2 c P v. Now since λ λ 0 = C, then (3.2) implies that P v = λ b(p v, P v) λ b(v, v) C v. The above two inequalities imply (3.5). 3.6. Recursive estimate. For convenience in what follows, we define the relation ( ) ( ) p r, p,q,r,s R, q s

92 ZHIQIANG CAI, JAN MANDEL, AND STEVE MCCORMICK to mean that p r and q s. For=0,,..., l, with β and ε i, i =,2,3 given, let ( ) ε E ε 2 (3.52) =. ε 3 THEOREM 3.2 (recursive estimate). Assume that there is a constant C 0 such that MG/ satisfies (3.53) with ( u, 2 u, 0 e, 2 2 c ) β E ( r, 0 e, 0 2 c ) (3.54) 0 ε i C 0, i =,2,3 and (3.55) >β β>0. Then MG/ satisfies ( u, 2 u, 0 (3.56) e, 2 2 c ) ( ) r, 0 E, e, 0 2 c with (3.57) ε i ε i ε i + Ch + C(λ λ ),i=,2,3 and (3.58) β β β + Ch + C(λ λ ). Proof. From (3.53), (3.49), (3.5), and (3.38) we have (3.59) u,2 u,0 ε r, 0 + ε 2 ( + C(λ λ )) e, 0 2 c and (3.60) e,2 2 c ε 3 r, 0 + β ( + C(λ λ )) S e, 0 2 c. Letting u, 2 = P u, 2 and u, 2 2 = P 2 u, 2, it then follows from (3.48) and (3.4) that (3.6) P u, 2 u, 2 u, 0 and (3.62) P 2 u, 2 Ch u, 2 u, 0.

MULTIGRID FOR LINEAR EQUATIONS AND EIGENPROBLEMS 93 Now (3.3), (3.7), and (3.49) (noting that λ λ C) imply that u, 2 2 = U2 e, 2 2 C( U2 c + e, 2 2 c ) C( e, 0 2 c + e, 2 2 c ), which together with (3.40) gives (3.63) P u, 2 2 Ch ( e, 0 2 c + e, 2 2 c ). By definition, u, = u, 0 + u, 2. For any v V, decomposing r, (v) =f (v) c λ (u,,v) according to (3.4) (3.6) yields r, (v) =f (v) c λ (u,,v) =r, 0 (v) c λ (P u, 2,v) =r, 0 (v)+(λ λ )b(p u,2,v). Hence by the Cauchy Schwarz inequality, (3.2), and (3.20) we get (3.64) Letting ũ = P u, 2 2 P 2 u, 2, then (3.65) r, r, 0 + C λ λ P u, 2. ũ P u, 2 2 + P 2 u, 2 and because u, 2 2 P 2 u, 2 2 = P u, 2 2 and P 2 u, 2 2 P 2 u, 2 = P 2 u, 2, we obtain (3.66) e, 2 = e, 0 2 P 2 u, 2 =(e, 0 2 U2 )+(U2 u,2 2 )+(u,2 2 P2u,2 2 ) +(P2 u, 2 2 P2 u, 2 ) =(e, 0 2 U2 )+e,2 2 +ũ. From this, (3.38), and (3.50) we get (3.67) e, 2 c T e, 0 2 c + S e, 0 2 U2 c + e,2 2 c + ũ c ( + C(λ λ )) e, 0 2 c + e, 2 2 c + ũ. We can similarly conclude that (3.68) and S e, 2 c C(λ λ ) e, 0 2 c + e, 2 2 c + ũ (3.69) T e, 2 c T e, 0 2 c + ũ. To estimate the size of e, 2 2 = U2 P2 u, 2 = e, 2 P2 w, where w V solves q (w, v) =r(v)+c λ (e, 2,v) v V, note that Lemma 3.8 implies that ( ) e, 2 2 c ( + C(λ λ )) 2 S e, 2 2 c + β 2 T e, 2 2 2 c +Ch λ λ 2 e, 2 c + Ch r,.

94 ZHIQIANG CAI, JAN MANDEL, AND STEVE MCCORMICK By using (3.59) (3.69), (3.54), and (3.55), a lengthy computation shows that e, 2 2 c ε 3 r, 0 + β e, 0 2 c, with ε 3 and β satisfying (3.57). It remains to estimate u, 2 u, 0. We have from (3.30) and (3.32) that u, 2 u, 0 u, 2 u, + u, u, 0 ( ) Ch r, + e, 2 c + P u, 2 + P u, 2 2. It thus follows from (3.64), (3.67), (3.6), (3.63), (3.65), (3.59), and (3.60) that u, 2 u, 0 (ε + Ch ) r, 0 +(ε 2 +C(λ λ )+Ch ) e, 0 2 c, which completes the proof. 3.7. Linear V-cycle estimate. THEOREM 3.3 (linear V-cycle estimate). Suppose that MG/ 0 satisfies ( ) ( ) u 0, 2 u 0, 0 r 0, 0 E 0 (3.70) e 0, 2 2 c e 0, 0 2 c independently of h 0 and of the choice of the decomposition f 0 = f 0 + f2 0 for some positive constants ε 0 i, >β 0 β>0. Then for sufficiently small h 0 and λ λ, MG/ l satisfies ( u l, 2 u l, 0 ) ( r l, 0 e l, 2 2 E l ) c e l, 0 2 c with (3.7) ε 0 i ε l i ε 0 i + Ch 0 + C(λ λ )l, i =,2,3, and (3.72) β 0 β l β 0 + Ch 0 + C(λ λ )l. Proof. This follows directly from recursion on Theorem 3.2. LEMMA 3.0 (direct solver). Let MG/ 0 be defined by (2.0). Then (3.70) holds with ε 0 = ε 0 2 = β 0 =0and ε 0 3 a constant independent of h 0. Proof. Relations (2.0) are equivalent to assuming that u 0, 2 = u 0, 0 + u 0, 2 2, where u 0, 2 2 V2 0 is defined by (3.73) Clearly then, c λ (u 0, 2 2,v)=f 0 (v) v V 0 2. u 0, 2 u 0, 0 =0, that is, ε 0 = ε 0 2 = 0. Let f 0 = f 0 + f2 0 be a decomposition according to (3.2) and U 0 V2 0 be the solution of (3.50) with =. Thene 0,2 2 =U 0 u 0,2 2 satisfies c λ (e 0, 2 2,e 0,2 2 )=f 0 (e 0,2 2 )=r 0 (e 0,2 2 ),

MULTIGRID FOR LINEAR EQUATIONS AND EIGENPROBLEMS 95 which together with (3.7), (3.8), and (3.20) implies that e 0, 2 2 2 c Cc λ (e 0,2 2,e 0,2 2 )=Cr 0 (e 0,2 2 ) C r 0 e 0, 2 2 c. Hence, β 0 = 0 and ε 0 3 is a constant independent of h 0. This completes the proof. Remar 3.9. Since (3.70) holds with β 0 = 0, it certainly holds with β 0 = β. LEMMA 3. (approximate solver). Let MG/ 0 in the definition of MG/ be replaced by a mapping such that for any f 0, M 0 : f 0 u 0, 2 (3.74) u 0, 2 f 0 (v) C sup 0 v V 0 v and for any f 0 2 that vanishes on V 0, (3.75) P 0 2 M 0 f 0 2 U 0 c β 0 U 0 c, where U 0 is defined by (3.9) with =0. Then (3.70) holds with ε i C, i =,2,3, and β 0 from (3.75). Proof. Wehave u 0, 2 u 0, 0 = u 0, 2 = P 0 u 0, 2 P 0 M 0 f 0 + P 0 M 0 f 0 2. From (3.74) and the fact that f 0 = r 0, 0 we conclude that P 0 M 0 f 0 C r 0, 0. It follows from (3.74) and the definition of U 0 that P 0 M 0 f2 0 f2 0 (v) C sup C 0 v V 0 v sup 0 v V 0 2 f 0 2 (v) v C U 0 c. The above inequalities imply that ε 0 i C for i =,2. Since e 0, 2 2 = U 0 u 0, 2 2 = U 0 P 0 2 M 0 f 0, from (3.75) and (3.74) we have e 0, 2 2 c U 0 P 0 2M 0 f 0 2 c + P 0 2M 0 f 0 c β 0 U 0 c + C r 0, 0. This completes the proof. 4. Convergence of a linearized multigrid method for eigenvalue problems. Here we apply the theory developed in the previous section to a linearized multigrid method for solving the eigenvalue problem (2.5) with = l. This method uses an outer loop iterative process, which replaces the given approximate eigenfunction u l old V l by the new approximation u l new V l that is computed by applying MG/ to the problem (4.) (4.2) λ = RQ(u l ), a(u l,v) λb(u l,v)=0 v V l. Assume for simplicity that MG/ uses an exact coarsest grid solver.

96 ZHIQIANG CAI, JAN MANDEL, AND STEVE MCCORMICK Here we are considering a conventional linearized multigrid method applied to the eigenvalue problem, so the shift λ is now considered to be fixed throughout each multigrid cycle, allowed to change only between cycles via (4.). However, the reader should note that changes could be allowed anywhere within the multigrid cycles, and the proof of our next theorem would still apply. We will use this observation in proving our theorem on RQMG. The reader should also note that this linearized algorithm relies on the fact that no multigrid cycle will produce the exact solution of (4.2) (i.e., we will show that u l new 0). LMG/ALGORITHM. Given an initial approximation u l old V l to an eigenvector for (2.5) belonging to λ l such that u l old =, then the new approximation ul new = LMG/ l (u l old ) is defined as follows:. MG inner solver. Perform one step of MG/ l applied to (4.2) using the initial approximation u l,0 = u l old and the shift λ = RQ(ul old ): 2. Normalization. Set u l, 2 = MG/ l (u l, 0, 0). u l new = ul, 2 u l, 2. To analyze the convergence of LMG/, we define the error in an approximation u l = u l + u l 2,u l i =Pl i ul,i=,2,to an eigenvector of (2.5) belonging to λ l by (4.3) e l = ul 2 u l. Note that λ l λ = RQ(u l, 0 ). THEOREM 4. (convergence of linearized multigrid). LMG/ l converges according to the estimate (4.4) e l new c (Ch 0 + β) e l old c for sufficiently small h 0. Proof. First note that the definition of λ yields c λ (u l, 0,u l, 0 )=a(u l, 0,u l, 0 ) λb(u l, 0,u l, 0 )=0. Thus writing u l, 0 = u l, 0 + u l, 0 0 2, where ul, i = Pi lul, 0, i =,2, then c λ (u l, 0,ul, 0 )+c λ(u l, 0 2,ul, 0 2 )=0, which can be rewritten u l, 0 2 2 c = η u l, 0 2 +(λ λ )b(u l, 0 2,ul, 0 2 ). Letting η =(λ λ l )/λ l, then from this relation and (2.8) we can conclude for sufficiently small h 0 that (4.5) and that u l, 0 2 2 c Cη u l, 0 2 (4.6) u l, 0 2 η ul, 0 2 2 c.

MULTIGRID FOR LINEAR EQUATIONS AND EIGENPROBLEMS 97 Now let f l (v) =f l (v)=f l 2(v)=0 v V l. Then f l 2certainly vanishes on V l, and noting that U = 0 in (3.9) with = l, we apply Theorem 3.3, the Cauchy Schwarz inequality, and (4.5) to conclude that u l, 2 u l, 0 f εl (v) c λ (u sup,v) 0 v V v l l, 0 l, 0 +ε l 2 U u l, 0 2 c =(λ λ l )ε l b(u sup,v) +ε l 0 v V v 2 u l, 0 2 c ( l ) ε l C(λ λ l ) u l, 0 + ul, 0 2 c (4.7) C ηε l u l, 0. Similarly, we apply Theorem 3.3 and (4.6) to obtain (4.8) u l, 2 2 c = U u l, 2 2 c Cε l η u l, 0 + β l u l, 0 2 c ( C ηε l + β l) u l, 0 2 c. From (4.7) we can conclude that (4.9) u l, 2 0 2 ul, ul, u l, 0 ( C ηε l) u l, 0. Bounds (4.8) and (4.9) now yield ( C ηε e l l + β l ) (4.0) new c C ηε l e l old c. The proof now follows from (3.6). Remar 4.. The theory here for MG/ assumes an exact coarsest grid solver. In fact, the theory holds if we assume only that the coarsest grid solver is effective enough that it does not contaminate the estimates in (3.7) and (3.72); see Lemma 3.. In any event, this theorem shows that the LMG/ worst case convergence factor is arbitrarily close to the multigrid factor for standard well-posed linear elliptic equations provided (2.8) holds. We will show in section 6 that (2.8) can be guaranteed by the use of a full multigrid or nested iteration process. 5. Convergence of RQMG for eigenvalue problems. In this section, we apply the theory of section 3 to RQMG, which is a direct multigrid method for (2.5) involving minimization of the Rayleigh quotient over level corrections to the finegrid eigenvector approximation. The approximate eigenvalue is the Rayleigh quotient of the eigenvector approximation incorporating all current corrections, so it changes during each level correction. The main point in establishing RQMG convergence is simply to recognize that the proof of Theorem 4. made no use of the fact that the shift λ was assumed to be fixed in the multigrid cycles. Following is the definition of RQMG/, which (as MG was) is posed with an exact coarsest grid solver and a general smoother. (Again, approximate coarsest grid solvers can be treated as in Lemma 3..) To accommodate recursion, we have defined RQMG/ in terms of a given vector u V, which should be interpreted as the finest level eigenvector approximation.

98 ZHIQIANG CAI, JAN MANDEL, AND STEVE MCCORMICK RQMG/ALGORITHM. Given an initial correction u l, 0 V l for the approximate eigenvector u V l, then the new correction u l, 2 = RQMG/ l (u l, 0,u)is defined as follows: (a) If l =0, then compute u l, 2 V l so that u l, 2 = u l, 0 and RQ(u + u l, 2 ) = min v V2 l RQ(u + u l, 0 + v). (b) If l>0, then perform the following:. Coarse-grid correction step. Set u l, = u l, 0 +RQMG/ l (0, u+u l, 0 ). 2. Smoothing step. Letλ=RQ(u + u l, ) and define w l V l by q l (w l,v)= c λ (u+u l,,v), v V l. Then set u l, 2 = u l, + w l. Remar 5.. The RQMG coarsest grid correction step posed here is in the spirit of our analysis in the sense that the V 0 error component is guaranteed to be unchanged. This is in contrast to the significant changes that can happen in the more natural RQMG scheme of determining an optimal correction from the coarsest grid, which would be much more difficult to analyze. However, the coarsest grid solver used here is no less practical because it can be implemented using projections onto V 0 and V2 0, which are obtained naturally in the full multigrid process. That is, the eigenvector approximation computed initially on the coarse grid can be used to implement the coarsest grid RQMG solver efficiently. THEOREM 5. (RQMG convergence). Suppose that the current eigenvector approximation u l old is in V l \{0}.Letu l, 2 = RQMG/ l (0, u l old ). (Here we assume that any nonzero initial correction has already been added into u = u l old.) Define the new eigenvector approximation by u l new = u l old + ul, 2. Define the errors e l old and el new in the respective approximations u l old and ul new as in (4.3). Then RQMG/ l converges according to the estimate (5.) e l new c (Ch 0 + β) e l old c. Proof. The coarsest grid solver satisfies the assumptions of Lemma 3.. Indeed, setting to zero the Fréchet derivative of RQ(u l old + u0, 0 + w) with respect to v V2 0, we get c λ (w, v) =c λ (u 0 old,v) v V2 0. Hence ( ) ( ) ( ) u 0, 2 u 0, 0 0 r 0,0 = E 0, U 0 u 0, 2 2 c 0 U 0 u 0, 0 2 c where ε 0 i =0,i=,2,3, and β 0 = β. This proof is now completed by noting that the proof of Theorem 4. did not rely on fixing the shift λ, so it applies here without modification. This theorem confirms that RQMG converges linearly with a per cycle error reduction factor that is bounded below one uniformly in l, which is the usual sense of multigrid optimality. Of course, Rayleigh quotient iteration is cubically convergent when exact linear solvers are used. It should therefore come as no surprise that the overall efficiency of a Rayleigh quotient iterative method is dictated by the linear solver efficiency. For the elliptic case considered here, RQMG is optimal in this sense, as the numerical results in [5] demonstrate.

MULTIGRID FOR LINEAR EQUATIONS AND EIGENPROBLEMS 99 6. Full multigrid for eigenvalue problems. The theory developed in this paper applies to the eigenvalue problem (2.) but only in a local sense: the multigrid convergence factors are guaranteed to be optimal only in a relatively small region about the eigenspace. More precisely, if λ ( λ ) is the current approximation to the eigenvalue on level l, then to obtain this optimality we need an estimate lie (2.8). Our purpose in this section is to show that this localness can be ensured by a full multigrid procedure, which is an outer loop iteration that uses the basic multigrid cycle first on coarser grids to obtain good initial guesses to the fine grid problem. We will use this scheme to guarantee that the error on each level satisfies (6.) e l c Ch l, which will suffice to prove (2.8) and, therefore, the optimality of the relevant multigrid solver. The full multigrid procedure will be posed in terms of a generic multigrid method for solving the level l eigenvalue problem defined by (2.5). We write this generic scheme in the form u l new = EMG/ l (u l old), which we assume converges in the sense of Theorems 4. and 5. with a factor bounded above by the constant γ Ch 0 +β<, provided (2.8) is satisfied and h 0 is sufficiently small. The following definition of the full multigrid process uses ν cycles of EMG on progressively finer levels but maes no use of any initial approximation. FMG/ALGORITHM. The final approximation u l = FMG/ l is defined as follows: (a) If l =0, then compute u l V l so that the error satisfies (2.8) and (6.) for l =0. (b) If l>0, then perform the following:. Coarse grid solution. Set u l = FMG/ l. 2. Multigrid step. Perform ν steps of EMG/ applied to (2.5) using the initial approximation u l, 0 = u l : u l, µ = EMG/ l (u l, µ ), µ =,2,,ν. Note that the coarsest grid (l = 0) step can be done by computing an accurate approximation to u 0. This follows from an estimate for the error measure e 0 c that we obtain in the proof of our last theorem. (See (6.2) and (6.3) below.) THEOREM 6. (global convergence). Let λ = RQ(u l ). Then, for sufficiently small h 0 and large ν, bounds (2.8) and (6.) are satisfied for all l 0. Proof. The case l = 0 is true by definition. Suppose for induction purposes that (2.8) and (6.) are true for l. Writing u l = u l + u l 2, u l i V i l, i =,2, first note that RQ(u l ) λ = c(ul,u l )+c(u l 2,u l 2) b(u l,ul )+b(ul 2,ul 2 ) (6.2) (λ l λ )+λ l u l 2 2 c u l 2 Ch 2 l + λ l e l 2 c, where e l = ul 2 u l as in (4.3). With the analogous decomposition ul = u l + u l 2,

200 ZHIQIANG CAI, JAN MANDEL, AND STEVE MCCORMICK u l i V l i, i =,2 and defining u = Pu l l, then by (3.8) and (3.5) it follows that P l 2u l c P l ul inf u V l ul u P lul P lul 2 ul u + u l 2 u u l 2 u l u + C u l 2 c u l u l u C u l 2 c u l u + C e l u = l c ul u C e l u l c Ch l. Ch l For sufficiently small h 0, we therefore have that (6.3) P l 2u l c P l ul Ch l. Bounds (2.8) and (6.) for l now follow from (6.2), (6.3), and the definition of γ that implies e l c γ ν P l 2u l c P l ul. REFERENCES [] R. BANK, Analysis of a multilevel inverse iteration procedure for eigenvalue problems, SIAM J. Numer. Anal., 9 (982), pp. 886 898. [2] A. BRANDT, S. MCCORMICK, AND J. RUGE, Multigrid methods for differential eigenproblems, SIAM J. Sci. Statist. Comput., 4 (983), pp. 244-260. [3] W. HACKBUSCH, Multigrid solutions to linear and nonlinear eigenvalue problems for integral and differential equations, Rostoc. Math. Kolloq., 25 (984), pp. 79 98. [4] W. HACKBUSCH, On the computation of approximate eigenvalues and eigenfunctions by means of a multigrid method, SIAM J. Numer. Anal., 6 (979), pp. 20 25. [5] J. MANDEL AND S. MCCORMICK, A multilevel variational method for Au = λbu on composite grids, J. Comput. Phys., 80 (989), pp. 442 450. [6] J. MANDEL, S. MCCORMICK, AND R. BANK, Variational multigrid theory, in Multigrid Methods, S. McCormic, ed., SIAM Frontiers in Applied Math., 3 (987), pp. 3 78. [7] S. MCCORMICK, A mesh refinement method for Ax = λbx, Math. Comp., 36 (98), pp. 485 498. [8] S. MCCORMICK, Multilevel adaptive methods for elliptic eigenproblems: A two-level convergence theory, SIAM J. Numer. Anal., 3 (994), pp. 73 745. [9] G. STRANG AND G. J. FIX, An Analysis of the Finite Element Method, Prentice Hall, Englewood Cliffs, NJ, 973.