Inverse Iteration on Defective Matrices*

Similar documents
Bounds for Eigenvalues of Tridiagonal Symmetric Matrices Computed. by the LR Method. By Gene H. Golub

In particular, if A is a square matrix and λ is one of its eigenvalues, then we can find a non-zero column vector X with

1-/5 i-y. A-3i. b[ - O o. x-f -' o -^ ^ Math 545 Final Exam Fall 2011

MATH 304 Linear Algebra Lecture 20: The Gram-Schmidt process (continued). Eigenvalues and eigenvectors.

Generalized Eigenvectors and Jordan Form

In this section again we shall assume that the matrix A is m m, real and symmetric.

Decompositions Of Modules And Matrices

NORMS ON SPACE OF MATRICES

Affine iterations on nonnegative vectors

SYMMETRICOMPLETIONS AND PRODUCTS OF SYMMETRIC MATRICES

ELEMENTARY LINEAR ALGEBRA

Markov Chains and Stochastic Sampling

Singular Value Inequalities for Real and Imaginary Parts of Matrices

Putzer s Algorithm. Norman Lebovitz. September 8, 2016

1 Matrices and Systems of Linear Equations

arxiv: v1 [math.ac] 7 Feb 2009

4.2. ORTHOGONALITY 161

On Weird and Pseudoperfect

. = V c = V [x]v (5.1) c 1. c k

A Characterization of Distance-Regular Graphs with Diameter Three

The Cayley-Hamilton Theorem and the Jordan Decomposition

Solving Homogeneous Systems with Sub-matrices

1 Matrices and Systems of Linear Equations. a 1n a 2n

Linear Algebra March 16, 2019

MTH 309 Supplemental Lecture Notes Based on Robert Messer, Linear Algebra Gateway to Mathematics

NOTES II FOR 130A JACOB STERBENZ

AMS526: Numerical Analysis I (Numerical Linear Algebra)

APPH 4200 Physics of Fluids

NOTICE WARNING CONCERNING COPYRIGHT RESTRICTIONS: The copyright law of the United States (title 17, U.S. Code) governs the making of photocopies or

Linear Algebra: Matrix Eigenvalue Problems

JUST THE MATHS UNIT NUMBER 9.9. MATRICES 9 (Modal & spectral matrices) A.J.Hobson

Examples include: (a) the Lorenz system for climate and weather modeling (b) the Hodgkin-Huxley system for neuron modeling

Eigenvalues and Eigenvectors

Jordan Normal Form. Chapter Minimal Polynomials

Iterative Solution of a Matrix Riccati Equation Arising in Stochastic Control

Synopsis of Numerical Linear Algebra

Fast Matrix Multiplication*

Chapter 1 Vector Spaces

Math 504 (Fall 2011) 1. (*) Consider the matrices

D-MATH Algebra I HS 2013 Prof. Brent Doran. Exercise 11. Rings: definitions, units, zero divisors, polynomial rings

Lecture Note 7: Iterative methods for solving linear systems. Xiaoqun Zhang Shanghai Jiao Tong University

Department of Mathematics California State University, Los Angeles Master s Degree Comprehensive Examination in. NUMERICAL ANALYSIS Spring 2015

Generalized eigenspaces

Eigenvalues and Eigenvectors

5 Compact linear operators

Criteria for existence of semigroup homomorphisms and projective rank functions. George M. Bergman

Ma/CS 6b Class 23: Eigenvalues in Regular Graphs

The goal of this chapter is to study linear systems of ordinary differential equations: dt,..., dx ) T

1.3 Convergence of Regular Markov Chains

Matrix Algorithms. Volume II: Eigensystems. G. W. Stewart H1HJ1L. University of Maryland College Park, Maryland

j=1 u 1jv 1j. 1/ 2 Lemma 1. An orthogonal set of vectors must be linearly independent.

1 Last time: least-squares problems

Quantum integers and cyclotomy

G/N. Let QQ denote the total quotient sheaf of G, and let K = T(X, QL). Let = T(X, (QÖ)/Ö ), the group of Cartier divisors, let 5ß =

CHAPTER 3. Matrix Eigenvalue Problems

x 3y 2z = 6 1.2) 2x 4y 3z = 8 3x + 6y + 8z = 5 x + 3y 2z + 5t = 4 1.5) 2x + 8y z + 9t = 9 3x + 5y 12z + 17t = 7

Spanning and Independence Properties of Finite Frames

Fundamentals of Engineering Analysis (650163)

Jordan Canonical Form Homework Solutions

FUNCTION BASES FOR TOPOLOGICAL VECTOR SPACES. Yılmaz Yılmaz

A NOTE ON A BASIS PROBLEM

Edexcel GCE A Level Maths Further Maths 3 Matrices.

On the Skeel condition number, growth factor and pivoting strategies for Gaussian elimination

ELEMENTARY LINEAR ALGEBRA

UMIACS-TR July CS-TR 2721 Revised March Perturbation Theory for. Rectangular Matrix Pencils. G. W. Stewart.

Kaczmarz algorithm in Hilbert space

Decompositions of frames and a new frame identity

G1110 & 852G1 Numerical Linear Algebra

Computational Linear Algebra

Foundations of Matrix Analysis

THE S 1 -EQUIVARIANT COHOMOLOGY RINGS OF (n k, k) SPRINGER VARIETIES

ON THE MATRIX EQUATION XA AX = X P

Chapter 5. Linear Algebra

A linear algebra proof of the fundamental theorem of algebra

A NOTE ON QUASI ISOMETRIES II. S.M. Patel Sardar Patel University, India

A connection between number theory and linear algebra

A linear algebra proof of the fundamental theorem of algebra

Jim Lambers MAT 610 Summer Session Lecture 2 Notes

AN ELEMENTARY PROOF OF THE SPECTRAL RADIUS FORMULA FOR MATRICES

Notes on the matrix exponential

REFRESHER. William Stallings

JORDAN AND RATIONAL CANONICAL FORMS

Sums of diagonalizable matrices

Numerical Methods - Numerical Linear Algebra

a 11 x 1 + a 12 x a 1n x n = b 1 a 21 x 1 + a 22 x a 2n x n = b 2.

ALGEBRAIC GEOMETRY COURSE NOTES, LECTURE 2: HILBERT S NULLSTELLENSATZ.

Citation Osaka Journal of Mathematics. 44(4)

THE VALUATION STRUCTURE OF HOMOMORPHIC IMAGES OF PRÜFER DOMAINS

PERMANENTLY WEAK AMENABILITY OF REES SEMIGROUP ALGEBRAS. Corresponding author: jabbari 1. Introduction

arxiv: v1 [math.rt] 7 Oct 2014

AMS526: Numerical Analysis I (Numerical Linear Algebra)

Homotopy and homology groups of the n-dimensional Hawaiian earring

COMPLETE CLASS RESULTS FOR LINEAR REGRESSION OVER THE MULTI-DIMENSIONAL CUBE

A Note on Eigenvalues of Perturbed Hermitian Matrices

COMBINATORIAL DIMENSION OF FRACTIONAL CARTESIAN PRODUCTS

e j = Ad(f i ) 1 2a ij/a ii

Solvability of Linear Matrix Equations in a Symmetric Matrix Variable

Computing Eigenvalues and/or Eigenvectors;Part 2, The Power method and QR-algorithm

Structure of rings. Chapter Algebras

Jordan Journal of Mathematics and Statistics (JJMS) 8(3), 2015, pp THE NORM OF CERTAIN MATRIX OPERATORS ON NEW DIFFERENCE SEQUENCE SPACES

Transcription:

MATHEMATICS OF COMPUTATION, VOLUME 31, NUMBER 139 JULY 1977, PAGES 726-732 Inverse Iteration on Defective Matrices* By Nai-fu Chen Abstract. Very often, inverse iteration is used with shifts to accelerate convergence to an eigenvector. In this paper, it is shown that, if the eigenvalue belongs to a nonlinear elementary divisor, the vector sequences may diverge even when the shift sequences converge to the eigenvalue. The local behavior is discussed through a 2 X 2 example, and a sufficient condition for the convergence of the vector sequence is given. Introduction. If an accurate approximation a to an eigenvalue X of a matrix B is available, then inverse iteration is an attractive technique for computing the associated eigenvector. We choose an arbitrary unit vector vq and a fixed shift a. Then for / = 1, 2,... we solve (,) (B- ofywj = Vj_v Vj = wy/uwyll, where is the users' preferred vector norm. Semisimple case. If X is a simple eigenvalue with unit eigenvector x, if a is close enough to X, and if v0 is not an unfortunate choice, then the vector sequence {u} converges linearly to x and the convergence factor is very favorable. This well-known result holds also for multiple eigenvalue X provided that: (i) the dimension of X's eigenspace is equal to X's algebraic multiplicity (i.e. linear elementary divisors), (ii) the spectral projection of v0 into X's eigenspace is not zero (i.e. the starting vector is not deficient in x). If a is known to equal X to within working precision of the computer, then only one or two steps of the iteration are necessary. Defective Case. Wilkinson [3] and Varah [2] pointed out that the situation is not so nice if X has generalized eigenvectors of grade higher than one, i.e., when X belongs to a nonlinear elementary divisor. In exact arithmetic the iteration converges not linearly, but harmonically like 1 // as /'. Even worse is the fact that except for very special choices of u0, the vectors v2 and v3 will be poorer approximations than Uj! Variable Shifts. Inverse iteration can also be used with variable shifts. If the sequence of shifts {a} converges to X as /, then the vector sequence generated Received January 20, 1976; revised June 21, 1976. AMS (MOS) subject classifications (1970). Primary 65F15; Secondary 15A18. Key words and phrases. Eigenvector, eigenvalue, nonlinear divisors, convergence, inverse iterations, shifts. "This work was partially supported by the Office of Naval Research under Contract No. N00014-69-A-0200-1017. This work was performed under the auspices of the Energy Research and Development Administration while the author was visiting Lawrence Berkeley Laboratory. Copyright 1977, American Mathematical Society 726

INVERSE ITERATION ON DEFECTIVE MATRICES 727 by (B - of)w- = - u._, (the - sign is for convenience) (2) V, = Wy/IIWyll converges to x whenever X is a simple eigenvalue. However, when X has eigenvector of grade higher than one, the situation is again complicated; and the shifts can make things worse. In fact, we shall prove the following surprising result. The sequence {v-} generated by (2) may fail to converge to x even though the sequence { a- - X } converges monotonically to 0 as j. The Construction. There is no loss of generality in studying 'o r TV- '»,o o, (for analogous results on n x n matrices, see Chen [1]). Note that ex = (0) is the eigenvector, and any other linearly independent vector is an eigenvector of grade 2. We observe that (TV- oi)w - ex yields w = re, - e2 for any t, if a = 0, a 1ei, if a = 0. This shows that ex is a fixed point of the iteration (2) provided that o = 0. We are now ready to make our perverse construction. In fact, there are many ways to construct such a perverse sequence of shifts. We first observe the following facts: Let Op j = 1, 2.be the sequence of shifts, \- = a^1, and let t-, t, be normalizing factors to keep v, of unit length; then the vectors generated by inverse iteration with the above shifts satisfy vf = x/tv - o/)"»(tv- 0j_!/)"! (TV- a, /)"» v0 = 7-;.(/ + %rnw + Zf_tN) (/ + %,N)v0 = r'f(i + «! + %2 + + ^AOiv So if we write s;. = "Lk=1 k, then ">=t;(o îk Our goal to construct a perverse sequence is accomplished if there exists a finite number z such that z is a limit point of the infinite set {s}. This is so because u = u'/\\u'\\, where " -a >>-aoc)- r +,r>» -(:> would be a limit vector of the vector sequence {uy}. If u0 = et, then w = e,. Hence the vector sequence cannot converge to the eigenvector. So, we define a sequence of complex shifts a- = r ' with n > ö;- > - tt, r- > 0, by the rule below:

728 NAI-FU CHEN Figure 1 77ie sequence of shifts on the complex plane (i) al = rxei6^ so that Re(o-j) = 0, (ii) for / = 3, 5, 7,..., (equivalently, solve ri = rj_2/2, Bj is such that Re^r1) = Re(oj~_l2) 2, for 0, in the appropriate quadrant), cos 0,_2 = 4 cos 0- (iii) a;. = af_, for / = 2, 4, 6,... It is clear that the sequence thus generated is monotone decreasing to zero, the eigenvalue, because o. = lo.-_.jl = aj /2m_1 where/ = 2m, m = 1,2,.... Also, z = 4 Re(a "') is a finite limit point of the set {sas- = 2 =1 f-} because 2m z Urn %k = lim (É! + Ç,) + (.3 +.3) + ' + (?2m_1 + S2m_!) fe=l -*oo ^^ /TJ-tVOO * = lim 2{Re(oTl) + %Re(oT*) + + 04 T_1 Re(a "l)} = 4Re(o -'). The sequence we have constructed accomplishes our goal.

INVERSE ITERATION ON DEFECTIVE MATRICES 729 Local Behavior. This peculiar behavior of inverse iteration on defective matrices can be better understood through the local picture of our 2x2 matrix TV. For simplicity, we keep all quantities real and let s = sin 0 4=- 0. (TV-a/)w = -Q yields w= fe-i o=0, [(l-f/ooe, +te2], o 4=0, where t = tan y, c = cos y (thus y is the angle between (cs) and ex). Let y be the angle between w and e,, and t' = tan y. Then the iteration function for a typical step is given by (3) t' = *a(0 = r/(l + i/o). The fact that $0(0) = 1> o" = 0, corresponds to the harmonic convergence of the fixed shift sequence. If we study inverse iteration with shifts {ay} yielding vectors {vf}, then in applying (3), we have the following correspondence: {a} close to X corresponds to a small, {u} close to ex corresponds to t small. In the following diagram, we demonstrate how w compares with v as an approximation to the eigenvector. Each point (a, t) on the diagram represents one step of inverse iteration with a shift a and the vector v whose component ratio is t. We say that w is a better approximation if \t'\ < \t\ and worse if r' > r : Figure 2 Behavior of inverse iteration applied to TV

730 NAI-FU CHEN The above diagram clearly demonstrates that no matter how good our approximation to the eigenvector (when t is small) or to the eigenvalue (when o is small) or both, inverse iteration can still give a much worse approximation in exact arithmetic. local behavior of inverse iteration on defective matrices is precisely the source of difficulty. A Sufficient Condition. This Experience with various traditional shift strategies, even in the presence of multiple eigenvalues, indicates that inverse iteration usually converges, though very slowly. The following theorem relates how the shifts converge to the eigenvalue and gives a sufficient condition that the generated vectors converge to the eigenvector: Theorem. Let C be a general complex matrix and let the shifts a-, / = 1,2,..., converge to X, an eigenvalue of C. Let n be the dimension of the largest Jordan block corresponding to X. Write a;. - X = ^e-16/', r- > 0. If there exists <p and an integer k such that (see Figure 3) (4) 10-0,1 <., V n forj>k, ' Mn - 1) then {» } converge to an eigenvector. Figure 3 The angle condition for % = rj1 e' ' = o7l in the complex plane where X = 0 In other words, if the a- (and thus f = oj~l) ultimately fall into a cone of size specified in (4) in the complex plane, then the shifts will accelerate convergence. Proof. Without loss of generality, we can consider C to be the n x n Jordan

INVERSE ITERATION ON DEFECTIVE MATRICES 731 block /O 1 0 0 0 0 10 0 0 0 1 o\ v. and we study the vector sequence from vk on. With %. = a- 1, and r-, t- normalizing factors, we have where "*+/ - TfiC- ok+ji)-1 (C-ok+lI)-1vk a-tk+ijrl \-l. ('-$*+! o-1»* 0 1 0/ (/+w^+--*+s;i^"1k = r;.(/ + _?,(/)/ + d2(/)/2 + + d^o)/"-1)»* = r;s»(/k. (5) W) = (\ dx(j) d2(j) 0 1 //,(/) Vo 0 _l(/)\ <*,(/) 1 y where d (j) = sum of (/+'_1) terms of the form (,,- * * *,-.), ^ < iv i2,, i, < fc+/. Hence vk+ is just a linear combination of the columns of the matrix Sk(/). To show that {vk+ } converges to et, it is sufficient to show that?,_,(/)/?;(/) /»-oofor /= 1,...,«- 1 (with d0(/) = 1). Lemma d,(/) > l^+/?,_,(/). /Voo/ We first make the following observations: let > 0 as a = <e to, k. = ('».' Xl)"-(r_.«x') = ^ je, (i.e.,rx = rxi xr' "X»,_+«,a + "-+*,P«* ay ^1^2 Ky rye be two of the (i+\ 1) terms in d,(j); then by (4) /-*-^l<4^t)<i' " - /'<? Further

732 NAI-FU CHEN (i) \ax + a > \ax\ because ax, a plane, and (ii) If fly + flv = re''9, then /0-0 < make an angle less than 7r/2 in the complex Now, fr +.-i?/_i(/) *s one ^ ^e terrns 0I"^/(/)> and by repeated application of (ii) above, we know l«g(l*+/<i -i(/))-w<ir/4. vr/4. Similarly, the rest of cf;(/), dt(j) - %k+)dl_l(j), satisfies Hence largua) - fc+/d;_,(/))- /0 < ir/4. \dl(i)\ = \d,(j) - èk+id,_1(j) > II*+/<*,_i(/)l by (i). D We can now complete the proof of the theorem: i-i(/) d,(i) < à,-iu) + **+,<*,_,(/)! (by lemma) l Sfc+/ yfc+/ > 0 as / - D Corollary. In the real case, if {a} eventually converge to the eigenvalue from one side, then v- converges to an eigenvector as j. Remark. Experience with Rayleigh quotient iteration shows that in fact the Rayleigh quotients do eventually converge to the eigenvalue from one side in the real case. As for the complex case, the Rayleigh quotients also eventually fall into a cone with angle less than 7r/4(«- 1) as required in (4). We examine the behavior of Rayleigh quotients in another communication. Acknowledgment. his many suggestions and comments. Department of Mathematics University of Southern California University Park Los Angeles, California 90007 The author is very grateful to Professor Beresford Parlett for 1. N. CHEN, The Rayleigh Quotient Iteration for Non-Normal Matrices, Ph. D. Dissertation, Electronic Research Laboratory Memorandum No. ERL-M548, University of California, Berkeley, 197S. 2. J. M. VARAH, The Computation of Bounds for the Invariant Subspaces of a General Matrix Operator, Ph. D. Dissertation, Stanford University, 1967. 3. J. H. WILKINSON, "Inverse iteration in theory and in practice," Symposia Mathematica, vol. 10, Academic Press, London, 1972, pp. 361-379. MR 51 #2268.