On the Ritz values of normal matrices

Similar documents
c 2009 Society for Industrial and Applied Mathematics

ECS231 Handout Subspace projection methods for Solving Large-Scale Eigenvalue Problems. Part I: Review of basic theory of eigenvalue problems

Arnoldi Methods in SLEPc

Solution of eigenvalue problems. Subspace iteration, The symmetric Lanczos algorithm. Harmonic Ritz values, Jacobi-Davidson s method

On prescribing Ritz values and GMRES residual norms generated by Arnoldi processes

LARGE SPARSE EIGENVALUE PROBLEMS

Solution of eigenvalue problems. Subspace iteration, The symmetric Lanczos algorithm. Harmonic Ritz values, Jacobi-Davidson s method

Approximating the matrix exponential of an advection-diffusion operator using the incomplete orthogonalization method

A HARMONIC RESTARTED ARNOLDI ALGORITHM FOR CALCULATING EIGENVALUES AND DETERMINING MULTIPLICITY

Preconditioned inverse iteration and shift-invert Arnoldi method

The Lanczos and conjugate gradient algorithms

Last Time. Social Network Graphs Betweenness. Graph Laplacian. Girvan-Newman Algorithm. Spectral Bisection

Block Iterative Eigensolvers for Sequences of Dense Correlated Eigenvalue Problems

DELFT UNIVERSITY OF TECHNOLOGY

Solving large sparse eigenvalue problems

LARGE SPARSE EIGENVALUE PROBLEMS. General Tools for Solving Large Eigen-Problems

An Asynchronous Algorithm on NetSolve Global Computing System

EIGIFP: A MATLAB Program for Solving Large Symmetric Generalized Eigenvalue Problems

Index. for generalized eigenvalue problem, butterfly form, 211

Convergence Theory for Iterative Eigensolvers

Computation of Smallest Eigenvalues using Spectral Schur Complements

of dimension n 1 n 2, one defines the matrix determinants

On the Modification of an Eigenvalue Problem that Preserves an Eigenspace

ETNA Kent State University

Eigenvalue Problems CHAPTER 1 : PRELIMINARIES

Structured Krylov Subspace Methods for Eigenproblems with Spectral Symmetries

Matrix Functions and their Approximation by. Polynomial methods

Krylov subspace projection methods

APPLIED NUMERICAL LINEAR ALGEBRA

PERTURBED ARNOLDI FOR COMPUTING MULTIPLE EIGENVALUES

ON ORTHOGONAL REDUCTION TO HESSENBERG FORM WITH SMALL BANDWIDTH

HARMONIC RAYLEIGH RITZ EXTRACTION FOR THE MULTIPARAMETER EIGENVALUE PROBLEM

The quadratic eigenvalue problem (QEP) is to find scalars λ and nonzero vectors u satisfying

Numerical Methods I Eigenvalue Problems

Ritz Value Bounds That Exploit Quasi-Sparsity

Block Krylov-Schur Method for Large Symmetric Eigenvalue Problems

A CHEBYSHEV-DAVIDSON ALGORITHM FOR LARGE SYMMETRIC EIGENPROBLEMS

Preliminary/Qualifying Exam in Numerical Analysis (Math 502a) Spring 2012

4.8 Arnoldi Iteration, Krylov Subspaces and GMRES

ARPACK. Dick Kachuma & Alex Prideaux. November 3, Oxford University Computing Laboratory

Krylov Subspaces. Lab 1. The Arnoldi Iteration

Iterative methods for symmetric eigenvalue problems

13-2 Text: 28-30; AB: 1.3.3, 3.2.3, 3.4.2, 3.5, 3.6.2; GvL Eigen2

Finding eigenvalues for matrices acting on subspaces

Computational Linear Algebra

FEM and sparse linear system solving

Finding Rightmost Eigenvalues of Large, Sparse, Nonsymmetric Parameterized Eigenvalue Problems

Numerical Methods - Numerical Linear Algebra

Eigenvalue problems III: Advanced Numerical Methods

Matrix Algorithms. Volume II: Eigensystems. G. W. Stewart H1HJ1L. University of Maryland College Park, Maryland

Scientific Computing with Case Studies SIAM Press, Lecture Notes for Unit VII Sparse Matrix

PDEs, Matrix Functions and Krylov Subspace Methods

Key words. linear equations, polynomial preconditioning, nonsymmetric Lanczos, BiCGStab, IDR

ABSTRACT. Professor G.W. Stewart

Probabilistic upper bounds for the matrix two-norm

Numerical Methods in Matrix Computations

An Arnoldi method with structured starting vectors for the delay eigenvalue problem

Here is an example of a block diagonal matrix with Jordan Blocks on the diagonal: J

ANY FINITE CONVERGENCE CURVE IS POSSIBLE IN THE INITIAL ITERATIONS OF RESTARTED FOM

Quantum Computing Lecture 2. Review of Linear Algebra

Motivation: Sparse matrices and numerical PDE's

MULTIGRID ARNOLDI FOR EIGENVALUES

ELA

Linear Algebra Massoud Malek

2 Two-Point Boundary Value Problems

DS-GA 1002 Lecture notes 0 Fall Linear Algebra. These notes provide a review of basic concepts in linear algebra.

Solving large Hamiltonian eigenvalue problems

MATH 20F: LINEAR ALGEBRA LECTURE B00 (T. KEMP)

A reflection on the implicitly restarted Arnoldi method for computing eigenvalues near a vertical line

DELFT UNIVERSITY OF TECHNOLOGY

Introduction to Arnoldi method

The Nullspace free eigenvalue problem and the inexact Shift and invert Lanczos method. V. Simoncini. Dipartimento di Matematica, Università di Bologna

The singular value of A + B and αa + βb

Application of Lanczos and Schur vectors in structural dynamics

Linear Programming. Chapter Introduction

Inexactness and flexibility in linear Krylov solvers

Restarting Arnoldi and Lanczos algorithms

On the solution of large Sylvester-observer equations

Elementary linear algebra

FINDING RIGHTMOST EIGENVALUES OF LARGE SPARSE NONSYMMETRIC PARAMETERIZED EIGENVALUE PROBLEMS

Krylov Subspaces. The order-n Krylov subspace of A generated by x is

NORMS ON SPACE OF MATRICES

EVALUATION OF ACCELERATION TECHNIQUES FOR THE RESTARTED ARNOLDI METHOD

A hybrid reordered Arnoldi method to accelerate PageRank computations

Lecture 3: QR-Factorization

A comparison of solvers for quadratic eigenvalue problems from combustion

Comparing iterative methods to compute the overlap Dirac operator at nonzero chemical potential

Compound matrices and some classical inequalities

Exponentials of Symmetric Matrices through Tridiagonal Reductions

Linear Algebra M1 - FIB. Contents: 5. Matrices, systems of linear equations and determinants 6. Vector space 7. Linear maps 8.

AMS526: Numerical Analysis I (Numerical Linear Algebra)

ON RESTARTING THE ARNOLDI METHOD FOR LARGE NONSYMMETRIC EIGENVALUE PROBLEMS

Report Number 09/32. A Hamiltonian Krylov-Schur-type method based on the symplectic Lanczos process. Peter Benner, Heike Faßbender, Martin Stoll

Hessenberg eigenvalue eigenvector relations and their application to the error analysis of finite precision Krylov subspace methods

A mathematicians way into STRONGnet

Math 504 (Fall 2011) 1. (*) Consider the matrices

LINEAR ALGEBRA BOOT CAMP WEEK 4: THE SPECTRAL THEOREM

The skew-symmetric orthogonal solutions of the matrix equation AX = B

Matrix balancing and robust Monte Carlo algorithm for evaluating dominant eigenpair

Parallel Numerics, WT 2016/ Iterative Methods for Sparse Linear Systems of Equations. page 1 of 1

Transcription:

On the Ritz values of normal matrices Zvonimir Bujanović Faculty of Science Department of Mathematics University of Zagreb June 13, 2011 ApplMath11 7th Conference on Applied Mathematics and Scientific Computing

Contents 1 Convergence of the restarted Arnoldi method 2 Interlacing for normal matrices 3 Characterization of Ritz values using a Cauchy matrix On the Ritz values of normal matrices

Contents 1 Convergence of the restarted Arnoldi method 2 Interlacing for normal matrices 3 Characterization of Ritz values using a Cauchy matrix On the Ritz values of normal matrices Convergence of the restarted Arnoldi method

The Arnoldi method Task: determine a few eigenvalues λ 1,λ 2,... of a large, sparse matrix A C n n. The Arnoldi method: choose an initial vector v; iteratively build an m-dimensional Krylov subspace K m (v;a) = span{v,av,...,a m 1 v}; this is represented as AV m = V m H m + (β m v m+1 )em; approximate eigs of A with the Ritz values θ 1,θ 2,...,θ m (the eigenvalues of H m ). On the Ritz values of normal matrices Convergence of the restarted Arnoldi method

The Arnoldi method Task: determine a few eigenvalues λ 1,λ 2,... of a large, sparse matrix A C n n. The Arnoldi method: choose an initial vector v; iteratively build an m-dimensional Krylov subspace K m (v;a) = span{v,av,...,a m 1 v}; this is represented as AV m = V m H m + (β m v m+1 )em; approximate eigs of A with the Ritz values θ 1,θ 2,...,θ m (the eigenvalues of H m ). On the Ritz values of normal matrices Convergence of the restarted Arnoldi method

The restarted Arnoldi method The restarted Arnoldi method (ARPACK, Matlab eigs): 1 initialization: build K 1 (v;a),k 2 (v;a),...,k m p (v;a); 2 augmentation: build K m p+1 (v;a),k m p+2 (v;a),...,k m (v;a); 3 restart: choose ṽ = (A σ 1 I)(A σ 2 I)...(A σ p I)v; ṽ is such that K m p (ṽ;a) is already computed; set v = ṽ, go to 2. Roots σ 1,...,σ p are called shifts. Exact shifts = unwanted Ritz values from K m (v;a). Methods of restart: Sorensen (IRAM), Stewart (Krylov Schur) On the Ritz values of normal matrices Convergence of the restarted Arnoldi method

The restarted Arnoldi method The restarted Arnoldi method (ARPACK, Matlab eigs): 1 initialization: build K 1 (v;a),k 2 (v;a),...,k m p (v;a); 2 augmentation: build K m p+1 (v;a),k m p+2 (v;a),...,k m (v;a); 3 restart: choose ṽ = (A σ 1 I)(A σ 2 I)...(A σ p I)v; ṽ is such that K m p (ṽ;a) is already computed; set v = ṽ, go to 2. Roots σ 1,...,σ p are called shifts. Exact shifts = unwanted Ritz values from K m (v;a). Methods of restart: Sorensen (IRAM), Stewart (Krylov Schur) On the Ritz values of normal matrices Convergence of the restarted Arnoldi method

The restarted Arnoldi method The restarted Arnoldi method (ARPACK, Matlab eigs): 1 initialization: build K 1 (v;a),k 2 (v;a),...,k m p (v;a); 2 augmentation: build K m p+1 (v;a),k m p+2 (v;a),...,k m (v;a); 3 restart: choose ṽ = (A σ 1 I)(A σ 2 I)...(A σ p I)v; ṽ is such that K m p (ṽ;a) is already computed; set v = ṽ, go to 2. Roots σ 1,...,σ p are called shifts. Exact shifts = unwanted Ritz values from K m (v;a). Methods of restart: Sorensen (IRAM), Stewart (Krylov Schur) On the Ritz values of normal matrices Convergence of the restarted Arnoldi method

Convergence: Hermitian matrices Theorem (Sorensen) Let: A Hermitian; Shifts = smallest p Ritz values. Then under some mild conditions: Ritz values converge to m p largest eigenvalues. The main tool: Cauchy interlacing property λ j+(n m) θ j λ j monotonicity of Ritz values: θ (1) j θ (2) j θ (3) j... bounded from above: θ (k) j λ j restarting preserves wanted Ritz values On the Ritz values of normal matrices Convergence of the restarted Arnoldi method

Convergence: Hermitian matrices Theorem (Sorensen) Let: A Hermitian; Shifts = smallest p Ritz values. Then under some mild conditions: Ritz values converge to m p largest eigenvalues. The main tool: Cauchy interlacing property λ j+(n m) θ j λ j monotonicity of Ritz values: θ (1) j θ (2) j θ (3) j... bounded from above: θ (k) j λ j restarting preserves wanted Ritz values On the Ritz values of normal matrices Convergence of the restarted Arnoldi method

Convergence: the general case Embree: Convergence fails for a whole class of matrices, any dimension. The wanted Ritz vectors are removed from the Krylov subspace. Example: 105 105 matrix A On the Ritz values of normal matrices Convergence of the restarted Arnoldi method

Convergence: the general case Embree: Convergence fails for a whole class of matrices, any dimension. The wanted Ritz vectors are removed from the Krylov subspace. Example: 105 105 matrix A, 10-dim. Krylov subspace On the Ritz values of normal matrices Convergence of the restarted Arnoldi method

Convergence: the general case Embree: Convergence fails for a whole class of matrices, any dimension. The wanted Ritz vectors are removed from the Krylov subspace. Example: 105 105 non-normal matrix A, 10-dim. Krylov subspace On the Ritz values of normal matrices Convergence of the restarted Arnoldi method

Contents 1 Convergence of the restarted Arnoldi method 2 Interlacing for normal matrices 3 Characterization of Ritz values using a Cauchy matrix On the Ritz values of normal matrices Interlacing for normal matrices

Reduction to the Hermitian case Definition Let ϕ [0,2π. For a C, denote a (ϕ) = 1 2 ( e iϕ a + e iϕ a). For a matrix A, denote: A (ϕ) = 1 2 ( e iϕ A + e iϕ A ). On the Ritz values of normal matrices Interlacing for normal matrices

Restarting the Arnoldi method and A (ϕ) Interlacing does not hold for θ (ϕ). Interlacing for θ (A arbitrary): λ (ϕ) (n m)+j θ j λ (ϕ) j. θ θ (ϕ) < 1 2 r. H = X AX, X + = [X x], H + = X + AX +, then θ j+1,+ θ j θ j,+. Restart preserves θ (ϕ), does not preserve θ (no such shifts exist!) On the Ritz values of normal matrices Interlacing for normal matrices

Restarting the Arnoldi method and A (ϕ) Interlacing does not hold for θ (ϕ). Interlacing for θ (A arbitrary): λ (ϕ) (n m)+j θ j λ (ϕ) j. θ θ (ϕ) < 1 2 r. H = X AX, X + = [X x], H + = X + AX +, then θ j+1,+ θ j θ j,+. Restart preserves θ (ϕ), does not preserve θ (no such shifts exist!) On the Ritz values of normal matrices Interlacing for normal matrices

Contents 1 Convergence of the restarted Arnoldi method 2 Interlacing for normal matrices 3 Characterization of Ritz values using a Cauchy matrix

Geometry of the Ritz values Let A C n n. When does a set {θ 1,...,θ m } of complex numbers represent Ritz values for A, using any m dimensional subspace? A normal, Krylov subspaces.

Geometry of the Ritz values Notation: Λ = (λ 1,...,λ n ), Θ = (θ 1,...,θ m ) Cauchy matrix P(Λ,Θ) C m n, C(Λ,Θ) = 1 1 1 λ 1 θ 1 λ 2 θ... 1 λ n θ 1 1 1 1 λ 1 θ 2 λ 2 θ... 2 λ n θ 2. 1 1 1 λ 1 θ m λ 2 θ m... λ n θm P(Λ,Θ) i,j = (λ i θ j ) m q=1 q j Cm n λ i θ q 2 x = (x 1,...,x n ) > 0 if each x i 0 and at least one x i > 0. x = (x 1,...,x n ) 0 if all x i > 0.

Geometry of the Ritz values Notation: Λ = (λ 1,...,λ n ), Θ = (θ 1,...,θ m ) Cauchy matrix P(Λ,Θ) C m n, C(Λ,Θ) = 1 1 1 λ 1 θ 1 λ 2 θ... 1 λ n θ 1 1 1 1 λ 1 θ 2 λ 2 θ... 2 λ n θ 2. 1 1 1 λ 1 θ m λ 2 θ m... λ n θm P(Λ,Θ) i,j = (λ i θ j ) m q=1 q j Cm n λ i θ q 2 x = (x 1,...,x n ) > 0 if each x i 0 and at least one x i > 0. x = (x 1,...,x n ) 0 if all x i > 0.

Geometry of the Ritz values Notation: Λ = (λ 1,...,λ n ), Θ = (θ 1,...,θ m ) Cauchy matrix P(Λ,Θ) C m n, C(Λ,Θ) = 1 1 1 λ 1 θ 1 λ 2 θ... 1 λ n θ 1 1 1 1 λ 1 θ 2 λ 2 θ... 2 λ n θ 2. 1 1 1 λ 1 θ m λ 2 θ m... λ n θm P(Λ,Θ) i,j = (λ i θ j ) m q=1 q j Cm n λ i θ q 2 x = (x 1,...,x n ) > 0 if each x i 0 and at least one x i > 0. x = (x 1,...,x n ) 0 if all x i > 0.

Geometry of the Ritz values Theorem (B.) Let A C n n be normal with eigenvalues Λ = (λ 1,..., λ n ). Let Θ = (θ 1,...,θ m ) denote an m-tuple of distinct complex numbers such that λ i θ j, for all i = 1,...,n and j = 1,...,m. The following claims are equivalent: (a) There exists a vector v C n such that θ 1,...,θ m are Ritz values for A from K m (v;a). (b) Linear system P(Λ,Θ)ˆv = 0 has a solution ˆv > 0. (c) Linear system C(Λ,Θ)w = 0 has a solution w > 0. The main idea of the proof: The minimum π(a)v for π P m is obtained at π = κ(h m). The proof is constructive.

Geometry of the Ritz values Theorem (B.) Let A C n n be normal with eigenvalues Λ = (λ 1,..., λ n ). Let Θ = (θ 1,...,θ m ) denote an m-tuple of distinct complex numbers such that λ i θ j, for all i = 1,...,n and j = 1,...,m. The following claims are equivalent: (a) There exists a vector v C n such that θ 1,...,θ m are Ritz values for A from K m (v;a). (b) Linear system P(Λ,Θ)ˆv = 0 has a solution ˆv > 0. (c) Linear system C(Λ,Θ)w = 0 has a solution w > 0. The main idea of the proof: The minimum π(a)v for π P m is obtained at π = κ(h m). The proof is constructive.

Feasibility problem in LP Linear programming: solving the existence Ax = b,x > 0 is as hard as optimizing c τ x max, under condition Ax b.

The dual problem Theorem The following is equivalent to (a)(b)(c): (d) For each u C m there exists an eigenvalue λ i such that ) Re ( m j=1 u j λ i θ j 0. Farkas lemma: for M R n m it holds that: (i) Either Mv = 0 for some v 0, or u M > 0 for some u R m. (ii) Either Mv = 0 for some v > 0, or u M 0 for some u R m.

Geometry of the Ritz values

Geometry of the Ritz values

Geometry of the Ritz values

Geometry of the Ritz values

Corollary: Interlacing for the Hermitian matrices Corollary (compare: Parlett, The Symmetric Eigenvalue Problem) Suppose A is Hermitian. Let θ 1 >... > θ m denote the Ritz values from a m-dimensional Krylov subspace; θ j λ i. Then for all j = 1,...,m 1 there exists an eigenvalue λ such that θ j+1 < λ < θ j.

Corollary: Ritz values = convex combination of eigenvalues Using P(Λ,Θ)ˆv = 0 we get: θ j = n i=1 λ i ( ) P i,j = m q=1 λ i θ q 2 ˆv i 2 q j P i,j P 1,j + P 2,j +... + P n,j.

Restarting Arnoldi: an example An unwanted Ritz value cannot equal the extreme λ. Can it equal the second largest eigenvalue? λ 1 = 8, λ 2 = 4. 4 A = 3 2 2 3 3.9 8. Is there an initial vector v such that for K 3 (v;a) it holds θ 3 λ 2?

Restarting Arnoldi: an example An unwanted Ritz value cannot equal the extreme λ. Can it equal the second largest eigenvalue? λ 1 = 8, λ 2 = 4. 4 A = 3 2 2 3 3.9 8. Is there an initial vector v such that for K 3 (v;a) it holds θ 3 λ 2?

Restarting Arnoldi: an example θ = λ 2 + 10 6 = 4 + 10 6.

Restarting Arnoldi: an example Optimization (θ 1,θ 2 max,θ 3 λ 2 ): v = The Ritz values from K 3 (v;a) are: 0.775693250142234 0.028238213050217 0.028273977339263 0.629795237727870 0.007818295736434. θ 1 = 4.183227620474041 + 0.692098306609705i, θ 2 = 4.183227620474041 0.692098306609705i, θ 3 = 4.000000000000762. Restarting to dimension 2 erases the component in direction u 2.

Bibliography D. C. Sorensen Implicit Application of Polynomial Filters in a k-step Arnoldi Method, SIAM J. Matrix Anal. Appl. (vol. 13, 1992.) G. W. Stewart A Krylov Schur Algorithm for Large Eigenproblems, SIAM J. Matrix Anal. Appl. (vol. 23, 2001.) M. Embree The Arnoldi Eigenvalue Iteration with Exact Shifts Can Fail, SIAM J. Matrix Anal. Appl. (vol. 31, 2009.) S. M. Malamud Inverse spectral problem for normal matrices and the Gauss Lucas theorem, Transactions of the American Mathematical Society (vol. 357, 2005.) J. F. Queiró, A. L. Duarte Imbedding conditions for normal matrices, Linear Algebra and its Applications (vol. 430, 2009.) Z. Bujanović Krylov Type Methods for Large Scale Eigenvalue Computations, PhD Thesis, Univ. of Zagreb (2011.)