Rayleigh-Ritz majorization error bounds with applications to FEM and subspace iterations

Similar documents
arxiv:math/ v6 [math.na] 6 Oct 2009

MAJORIZATION FOR CHANGES IN ANGLES BETWEEN SUBSPACES, RITZ VALUES, AND GRAPH LAPLACIAN SPECTRA

Majorization for Changes in Angles Between Subspaces, Ritz values, and graph Laplacian spectra

MAJORIZATION FOR CHANGES IN ANGLES BETWEEN SUBSPACES, RITZ VALUES, AND GRAPH LAPLACIAN SPECTRA

Majorization for Changes in Ritz Values and Canonical Angles Between Subspaces (Part I and Part II)

ETNA Kent State University

ANGLES BETWEEN SUBSPACES AND THE RAYLEIGH-RITZ METHOD. Peizhen Zhu. M.S., University of Colorado Denver, A thesis submitted to the

Principal Angles Between Subspaces and Their Tangents

ANGLES BETWEEN INFINITE-DIMENSIONAL SUBSPACES. Abram Jujunashvili. M.S., Tbilisi State University, Tbilisi, Georgia, 1978

Last Time. Social Network Graphs Betweenness. Graph Laplacian. Girvan-Newman Algorithm. Spectral Bisection

Math 408 Advanced Linear Algebra

Angles Between Infinite Dimensional Subspaces with Applications to the Rayleigh-Ritz and Alternating Projectors Methods

Linear Algebra Massoud Malek

Chapter 3 Transformations

Eigenvalue Estimation with the Rayleigh-Ritz and Lanczos methods

NEW A PRIORI FEM ERROR ESTIMATES FOR EIGENVALUES

HOMEWORK PROBLEMS FROM STRANG S LINEAR ALGEBRA AND ITS APPLICATIONS (4TH EDITION)

Conceptual Questions for Review

Using the Karush-Kuhn-Tucker Conditions to Analyze the Convergence Rate of Preconditioned Eigenvalue Solvers

Final Exam, Linear Algebra, Fall, 2003, W. Stephen Wilson

Solution of eigenvalue problems. Subspace iteration, The symmetric Lanczos algorithm. Harmonic Ritz values, Jacobi-Davidson s method

Assignment 1 Math 5341 Linear Algebra Review. Give complete answers to each of the following questions. Show all of your work.

The Eigenvalue Problem: Perturbation Theory

E2 212: Matrix Theory (Fall 2010) Solutions to Test - 1

ANGLES BETWEEN INFINITE DIMENSIONAL SUBSPACES WITH APPLICATIONS TO THE RAYLEIGH-RITZ AND ALTERNATING PROJECTORS METHODS

Is there life after the Lanczos method? What is LOBPCG?

Angles Between Infinite Dimensional Subspaces

LARGE SPARSE EIGENVALUE PROBLEMS. General Tools for Solving Large Eigen-Problems

Homework 2. Solutions T =

Vectors To begin, let us describe an element of the state space as a point with numerical coordinates, that is x 1. x 2. x =

LARGE SPARSE EIGENVALUE PROBLEMS

Solution of eigenvalue problems. Subspace iteration, The symmetric Lanczos algorithm. Harmonic Ritz values, Jacobi-Davidson s method

Diagonalizing Matrices

AMS526: Numerical Analysis I (Numerical Linear Algebra)

Exercise Sheet 1.

Rational Krylov methods for linear and nonlinear eigenvalue problems

Foundations of Matrix Analysis

Throughout these notes we assume V, W are finite dimensional inner product spaces over C.

Mathematical foundations - linear algebra

Math 118, Fall 2014 Final Exam

Preconditioned Locally Minimal Residual Method for Computing Interior Eigenpairs of Symmetric Operators

Contents. Preface for the Instructor. Preface for the Student. xvii. Acknowledgments. 1 Vector Spaces 1 1.A R n and C n 2

LINEAR ALGEBRA QUESTION BANK

Ma/CS 6b Class 23: Eigenvalues in Regular Graphs

Linear Algebra Final Exam Study Guide Solutions Fall 2012

Compound matrices and some classical inequalities

Linear Algebra: Matrix Eigenvalue Problems

APPLICATIONS The eigenvalues are λ = 5, 5. An orthonormal basis of eigenvectors consists of

Math Linear Algebra Final Exam Review Sheet

1. Select the unique answer (choice) for each problem. Write only the answer.

Numerical Methods I Eigenvalue Problems

Review of similarity transformation and Singular Value Decomposition

Computational Methods. Eigenvalues and Singular Values

Math 224, Fall 2007 Exam 3 Thursday, December 6, 2007

x 3y 2z = 6 1.2) 2x 4y 3z = 8 3x + 6y + 8z = 5 x + 3y 2z + 5t = 4 1.5) 2x + 8y z + 9t = 9 3x + 5y 12z + 17t = 7

1. General Vector Spaces

Index. for generalized eigenvalue problem, butterfly form, 211

TBP MATH33A Review Sheet. November 24, 2018

NEW ESTIMATES FOR RITZ VECTORS

MATH 23a, FALL 2002 THEORETICAL LINEAR ALGEBRA AND MULTIVARIABLE CALCULUS Solutions to Final Exam (in-class portion) January 22, 2003

Mathematical foundations - linear algebra

I. Multiple Choice Questions (Answer any eight)

Test 3, Linear Algebra

Math 315: Linear Algebra Solutions to Assignment 7

Math 307 Learning Goals. March 23, 2010

Implementation of a preconditioned eigensolver using Hypre

Applied Linear Algebra in Geoscience Using MATLAB

Final Review Written by Victoria Kala SH 6432u Office Hours R 12:30 1:30pm Last Updated 11/30/2015

MATH 31 - ADDITIONAL PRACTICE PROBLEMS FOR FINAL

RITZ VALUE BOUNDS THAT EXPLOIT QUASI-SPARSITY

Fall, 2003 CIS 610. Advanced geometric methods. Homework 3. November 11, 2003; Due November 25, beginning of class

Properties of Transformations

Week Quadratic forms. Principal axes theorem. Text reference: this material corresponds to parts of sections 5.5, 8.2,

Applied Linear Algebra in Geoscience Using MATLAB

1 Linear Algebra Problems

Math 302 Outcome Statements Winter 2013

DS-GA 1002 Lecture notes 10 November 23, Linear models

APPLIED NUMERICAL LINEAR ALGEBRA

Eigenvalues and Eigenvectors

8. Diagonalization.

Linear Algebra Review

Chap 3. Linear Algebra

1. Foundations of Numerics from Advanced Mathematics. Linear Algebra

NORMS ON SPACE OF MATRICES

Final A. Problem Points Score Total 100. Math115A Nadja Hempel 03/23/2017

Analysis Preliminary Exam Workshop: Hilbert Spaces

Linear Algebra Practice Problems

Ir O D = D = ( ) Section 2.6 Example 1. (Bottom of page 119) dim(v ) = dim(l(v, W )) = dim(v ) dim(f ) = dim(v )

Math Theory of Partial Differential Equations Lecture 3-2: Spectral properties of the Laplacian. Bessel functions.

Econ Slides from Lecture 7

Math 4A Notes. Written by Victoria Kala Last updated June 11, 2017

Toward the Optimal Preconditioned Eigensolver: Locally Optimal Block Preconditioned Conjugate Gradient Method

Math 307 Learning Goals

1. What is the determinant of the following matrix? a 1 a 2 4a 3 2a 2 b 1 b 2 4b 3 2b c 1. = 4, then det

Iterative methods for symmetric eigenvalue problems

Elementary linear algebra

arxiv: v1 [math.na] 5 Oct 2018

SPRING 2006 PRELIMINARY EXAMINATION SOLUTIONS

MR. YATES. Vocabulary. Quadratic Cubic Monomial Binomial Trinomial Term Leading Term Leading Coefficient

OHSx XM511 Linear Algebra: Solutions to Online True/False Exercises

Transcription:

1 Rayleigh-Ritz majorization error bounds with applications to FEM and subspace iterations Merico Argentati and Andrew Knyazev (speaker) Department of Applied Mathematical and Statistical Sciences Center for Computational Mathematics University Colorado Denver (Downtown Campus) Householder Symposium XVII Zeuthen, Germany June 5, 2008 Supported by the National Science Foundation

2 Abstract Given two subspaces X and Y of the same finite dimension, such that X is A-invariant, the absolute changes in the Ritz values of A with respect to X compared to the Ritz values with respect to Y represent the absolute eigenvalue approximation error. A recent paper [1] bounds the error in terms of the principal angles between X and Y using weak majorization. We improve and extend this bound, and derive several new related results. We present our Rayleigh-Ritz majorization error bound in the context of the finite element method (FEM). We derive a new majorization-type convergence rate bound of subspace iterations and combine it with the previous result to obtain a similar bound for the block Lanczos method. This presentation is based on [2]. A corresponsing result where neither X nor Y is A-invariant can be found in [3]. The case of infinite dimensional subspaces is considered in [4].

3 References. [1] M. E. Argentati, A. V. Knyazev, C. C. Paige, and I. Panayotov, Bounds on changes in Ritz values for a perturbed invariant subspace of a Hermitian matrix. SIMAX 30 (2008), 548-559. [2] A. V. Knyazev and M. E. Argentati, Rayleigh-Ritz majorization error bounds with applications to FEM and subspace iterations, http://arxiv.org/abs/math/0701784. [3] A. V. Knyazev and M. E. Argentati, Majorization for Changes in Angles Between Subspaces, Ritz values, and graph Laplacian spectra, SIMAX 29 (2006), 15-32. [4] A. V. Knyazev, A. Jujunashvili, and M. E. Argentati, Angles Between Infinite Dimensional Subspaces with Applications to the Rayleigh-Ritz and Alternating Projectors Methods, http://arxiv.org/abs/0705.1023.

4 CONTENTS 1. Majorization (in matrix algebra) 2. The Rayleigh Ritz method 3. Principal angles between subspaces 4. Traditional bounds on the change in Ritz values 5. Majorization bounds for Ritz values 6. Improved sine-based majorization bounds 7. Bounds for the largest (or smallest) eigenvalues 8. Application to the FEM 9. Convergence rate bounds for subspace iterations 10. Convergence rate bounds for the block Lanczos method

5 Majorization For vector x = [x 1,...,x n ], we use x [x 1,...,x n] to denote x with its elements rearranged in descending order, while x [x 1,...,x n] denotes x with its elements rearranged in ascending order. x denotes the vector x with the absolute value of its components. We say that x R n is weakly majorized by y R n, written x w y, if k x i i=1 k i=1 y i, 1 k n, (1) while x is (strongly) majorized by y, written x y, if (1) holds together with n n x i = y i. (2) i=1 i=1

6 Majorization in Matrix Algebra Let Λ(A) (for Hermitian A) and S(A) be, respectively, vectors of eigenvalues and singular values (both nonincreasing): Lidskiǐ theorem: Λ(A + B) Λ(B) Λ(A) for Hermitian A and B Gelfand-Naǐmark theorem: log S(AB) log S(B) log S(A) for general A and B, where we add zeros to the vectors of singular values if necessary to match the sizes [KA08] Generalized pinching inequality [KA08]: [S(A H 1 BC 1 ), S(A H 2 BC 2 )] w S ( A 1 A H1 + A 2A H2 B C 1 C1 H + C 2C2 H )

7 The Rayleigh Ritz Method Let A be Hermitian and X be a subspace We define an operator P X A X on X, where P X is the orthogonal projection onto X and P X A X denotes the restriction of P X A to X. The eigenvalues Λ(P X A X ) are called Ritz values. We have Λ(P X A X ) = Λ(X H AX) where X is a matrix with orthonormal columns that span X

8 Principal Angles Between Subspaces Let subspaces X and Y C n have orthonormal bases given by the columns of the matrices X and Y The principal angles, arranged in descending order, are denoted by Θ(X, Y) = Θ (X, Y), and defined using cosθ(x, Y) = S (X H Y ) The definition is symmetric: Θ(X, Y) = Θ(Y, X) if dimx = dimy and describes the angles between subspaces. If dimx < dimy, the angles Θ(X, Y) are from X to Y gap(x, Y) = P X P Y 2 = sin(θ max (X, Y)).

9 Traditional bounds for Ritz values Let dimx = dimy, then [KA06] max Λ ((P X A) X ) Λ ((P Y A) Y ) (λ max λ min ) max sinθ(x, Y), where λ min and λ max are the smallest and largest eigenvalues of the A, respectively. If in addition one of the subspaces is A-invariant then [KA08] max Λ ((P X A) X ) Λ ((P Y A) Y ) (λ max λ min ) max sin 2 Θ(X, Y). A similar bound with the min does not hold and the inequality Λ ((P X A) X ) Λ ((P Y A) Y ) (λ max λ min ) sinθ(x, Y) is WRONG! Majorization is necessary to analyse low-rank perturbations of the trial subspace, leading to many zero angles in Θ(X, Y).

10 Majorization bounds for Ritz values Let dimx = dimy, then from the Lidskiǐ theorem we can obtain [KA05] Λ ( X H AX ) Λ ( Y H AY ) w Λ ( X H AX Y H AY ) w 2 (λmax λ min ) sinθ(x, Y). The constant multiplier 2 is artificial. In reality [KA06] Λ ((P X A) X ) Λ ((P Y A) Y ) w (λ max λ min ) sinθ(x, Y). If in addition one of the subspaces is A-invariant then [KA08] Λ ((P X A) X ) Λ ((P Y A) Y ) w (λ max λ min ) sin 2 Θ(X, Y). Conjectured and proved only for a particular case in [AKPP08].

11 Improved sine-based bounds Substitutions of the space X + Y and the operator (P X+Y A) X+Y for H and A improve the constant λ max λ min λ max(x+y) λ min(x+y), using the spread of the spectrum of the operator (P X+Y A) X+Y. Spread vector ] Spr (X+Y) = [λ i (X+Y) λ i (X+Y), i = 1,...,dimX where λ 1(X+Y)... and λ 1 (X+Y)... are the dimx largest and smallest eigenvalues of (P X+Y A) X+Y. Our CONJECTURES: Λ ((P X A) X ) Λ ((P Y A) Y ) w Spr (X+Y) sinθ(x, Y); if in addition one of the subspaces is A-invariant then Λ ((P X A) X ) Λ ((P Y A) Y ) w Spr (X+Y) sin 2 Θ(X, Y) (3) We PROVE (3) for the largest (or smallest) eigenvalues only [KA08].

12 Bounds for the largest/smallest eigenvalues X represents the largest eigenvalues of A > 0 then Spr (X+Y) Λ ((P X A) X ) so 0 Λ ((P X A) X ) Λ ((P Y A) Y ) w Λ ((P X A) X ) sin 2 Θ(X, Y). New: 0 log Λ ((P X A) X ) log Λ ((P Y A) Y ) w log cos 2 Θ(X, Y), a relative multiplicative error bound, which implies 0 Λ ((P XA) X ) Λ ((P Y A) Y ) 1 w tan 2 Θ(X, Y). For X representing the smallest eigenvalues of A > 0 this gives 0 Λ ((P YA) Y ) Λ ((P X A) X ) 1 w tan 2 Θ A (X, Y). The sine- and tangent-based bounds do not follow from each other!

13 Application to the FEM The results are generalized to dimx dimy with truncated Λ ((P Y A) Y ) in infinite dimensional Hilbert spaces. For the clamped membrane vibration problem, denoting the two smallest eigenvalues Λ ((P X A) X ) = [λ 2, λ 1 ] of A =, let the corresponding eigenfunctions v 1 and v 2 in X be approximated by the FEM subspace Y such that sin Θ(v 1 + v 2, Y) = h, but sinθ(v 1 v 2, Y) = h 2, both in Ḣ 1 (Ω), where Ω denotes the polygonal membrane. Denoting the eigenvalues of the FEM discrete negative Laplacian by Λ dimx ((P Y A) Y ) = [λ h 2, λ h 1], we get error bounds for the trace: 0 λ h 1 + λ h 2 λ 1 λ 2 λ 1 h 2, and the product 1 λh 1 λ 1 λ h 2 λ 2 1 + h 2. Compared to the standard bound, the gain in the constant is the factor of 2.

14 Convergence bounds of subspace iterations Let F be invariant on both X and on X, and (P X F) X be invertible. Assume dimx = dimy and Θ(X, Y) < π/2. Then dim(f Y) = dimy and ( ) tanθ(x, F Y) ( ( log log S ((P X F) X ) 1) ) S ((P tan Θ(X, Y) X F) X ), which implies, by applying the exponential function to both sides, ( tanθ(x, F Y) w S ((P X F) X ) 1) S ((P X F) X )tan Θ(X, Y). If F = f(a), combining with our RR error bounds, we obtain Λ ((P X A) X ) Λ ((P F Y A) F Y ) ( ) f (Λ ((P w (λ max λ min ) X A) X )) 2 f (Λ ((P X A) X )) tan 2 Θ(X, Y).

15 Convergence bounds of the block Lanczos Let dimx = dimx 0 = m, operator A be Hermitian, and let the A-invariant subspace X correspond to the contiguous set of the largest eigenvalues of A. Let Y = X 0 + AX 0 + + A k X 0, then where 0 Λ ((P XA) X ) Λ dimx ((P Y A) Y ) Λ dimx ((P Y A) Y ) λ min w [σ m,...,σ 1] tan 2 Θ(X, X 0 ). ( ) σi = T λm+1 + λ min 2λ i 2 k λ m+1 λ min i = 1,...,m, and where T k is the kth Chebyshev polynomial of the first kind. Ideally, we would like to have the σ s on the left-hand side, as this would imply all previously known bounds!

16 Conclusions Majorization is a powerful tool that gives elegant and general error bounds for eigenvalues approximated by the Rayleigh-Ritz method. We discover several new results of this kind, including multiplicative bounds for relative errors. We apply majorization, apparently for first time, in the contexts of FEM error bounds and convergence rate bounds for subspace iterations and the block Lanczos method. Our initial results are promising and expected to lead to further development of the majorization technique for the theory of eigenvalue computations