arxiv: v2 [math.oc] 31 Jul 2017
|
|
- Annabel Wilcox
- 5 years ago
- Views:
Transcription
1 An unconstrained framework for eigenvalue problems Yunho Kim arxiv: v [math.oc] 3 Jul 07 May 0, 08 Abstract In this paper, we propose an unconstrained framework for eigenvalue problems in both discrete and continuous settings. We begin our discussion to solve a generalized eigenvalue problem Ax = λbx with two N N real symmetric matrices A, B via minimizing a proposed functional whose nonzero critical points x R N solve the eigenvalue problem and whose local minimizers are indeed global minimizers. Inspired by the properties of the proposed functional to be minimized, we provide analysis on convergence of various algorithms either to find critical points or local minimizers. Using the same framework, we will also present an eigenvalue problem for differential operators in the continuous setting. It will be interesting to see that this unconstrained framework is designed to find the smallest eigenvalue through matrix addition and multiplication and that a solution x R N and the matrix B can compute the corresponding eigenvalue λ without using A in the case of Ax = λbx. At the end, we will present a few numerical experiments which will confirm our analysis. Introduction Given an N N matrix A, the eigenvalue problem of our interest is to find an eigenvalue and its corresponding eigenvector of A, that is, to solve Ax = λx for x and λ. This is one of the most fundamental problems in mathematics with applications to all other fields of science. Especially, one may be interested in estimating the largest and the smallest eigenvalues. If A is symmetric and positive definite, then finding the smallest eigenvalue of A is the same as finding the largest eigenvalue of A, which usually involves solving systems of linear equations of type Ax = b. Then, we ask ourselves a question: Is it possible to compute the smallest eigenvalue of A through only basic matrix operations such as multiplication and addition, without solving Ax = b? Our interest of estimating the smallest eigenvalue and its corresponding eigenvector extends to the following infinite dimensional application, as well. On a compact manifold M, eigenvalues of the Laplacian reveals important structures of M, which makes understanding the eigenvalues of on M very important. This has interesting applications. For example, in image processing there are a few interesting works e.g. [6], [], [8] to distinguish reconstructed objects from point cloud data by evaluating on the surfaces of the objects. We can even consider general self-adjoint linear elliptic operators and find their eigenvalues and eigenfunctions. With these theoretical and numerical points of view in mind, our main discussion will be concentrated on finding the smallest eigenvalue and a corresponding eigenvector of a nonzero symmetric matrix, which leads us to begin with the Department of Mathematical Sciences, UNIST, Ulsan, South Korea, yunhokim@unist.ac.kr
2 following well-known constrained problem: given a symmetric and positive definite matrix A, min x R N x, Ax subject to x =. There have been a large number of works to solve by the name of inverse iteration methods. In particular, we would like to mention the work [] by J.E. Dennis and R.A. Tapia, which surveys historical developments of inverse and shifted inverse iterations and of Rayleigh quotient iteration, and which approaches the listed methods from the viewpoint of the Newton s method. The unconstrained version of analyzed in [] is min x, Ax + x R N x. The authors of [] explained why the inverse and the shifted inverse Rayleigh quotient iterations are fast and effective by showing the equivalence between the inverse Rayleigh quotient iteration and the Newton s method for, and also between the shifted inverse Rayleigh quotient iteration and the Newton s method for the shifted version of, when the given matrix A is symmetric and invertible. We refer to [], and references therein, any interested reader in the developments of the inverse and shifted inverse Rayleigh quotient iteration methods. There are, however, a few disadvantageous features of the functional in that we paid attention to. First of all, [] considered only nonsingular matrices for just as all other conventional methods do. Second of all, in the simplest case when A is symmetric and positive definite, if 0 < < λ, then the zero vector is the only critical point of the functional in and, even if λ, the critical points of the functional in are only the eigenvalues of A less than. Hence, our goal is of two folds: extending existing theories to singular matrices, and removing the additional limitations imposed by the parameter in. Noting that the functional in contains the term x = x + x, we believed that the factor x +, even though this is convex, is not desirable because the factor tries to push the norm x towards 0 during minimization. Therefore, our analysis begins without this factor. The rest of this manuscript is organized as follows. In Section, we present our unconstrained framework for solving a generalized eigenvalue problem in a finite dimensional space, where we propose an appropriate functional to be minimized and analyze its interesting properties. We, then, apply the gradient descent method and the Newton s method to the proposed minimization problem for solving eigenvalue problems and provide analysis for convergence either to a global minimizer or to a nonzero critical point. Moreover, we present a few variants of our approach for quantitative analysis of the error between a true eigenvector and an estimated one. In Section 3, we present the same unconstrained framework for eigenvalue problems in an infinite dimensional space such as finding eigenfunctions of self-adjoint differential operators, which show universality of our unconstrained framework. In Section 4, we present numerical aspects of our proposed method confirming the theoretical results obtained in the previous sections.
3 A generalized eigenvalue problem on a finite dimensional space First of all, we will consider a generalized eigenvalue problem Ax = λbx with two N N real symmetric matrices A, B, where B is positive definite. Notice that it becomes the usual eigenvalue problem Ax = λx when B is the identity matrix I. Here and in what follows, M M N R, Sym N R, Sym N,p R mean, respectively, the set of M N real matrices, the set of N N real symmetric matrices, the set of N N real symmetric and positive definite matrices. The set of N N real matrices will be simply denoted by M N R. The case of complex matrices will be mentioned later. Moreover, we consider x R N as an N column vector and for x, y R N, x, y will be denoted by y T x and x = x, x. For A M M N R, the operator norm of A will be denoted by A op, which is A op = sup Ax. x = Note that A op is the largest singular value of A. We also denote by B for B Sym N,p R the matrix Q ΛQ T, where QΛQ T is a diagonalization of B and Λ = diag λ,..., λ N when Λ = diagλ,..., λ N. Given A Sym N R, B Sym N,p R, we define a functional F A,B : R N R by F A,B x = x, Ax + x, Bx x, Bx 3 and propose the the following unconstrained problem min F A,B x. 4 x R N When B = I, we will simply drop the subscript B by writing F A, instead of F A,B. Then, we can see, by a change of variables, y = Bx, that 3 becomes F A,B x = y, Cy + y y = F C y with C = B A B Sym N R, which means that analyzing the functional F A,B is equivalent to analyzing F C. Note that F A,B and F C are both differentiable at x 0 and that F A,B x = F C Bx B implies the sets of nonzero critical points of F A,B and F C are equivalent up to the change of variables: y = Bx. More precisely, F A,B x = Ax + Bx and F C y = Cy + y Bx, x y imply that we can solve Ax = λbx by finding nonzero critical points of F C. Therefore, we will begin our discussion with F A for A Sym N R and investigate the minimization problem min F A x, 5 x R N which is equivalent to min x R N x, Ax + x. 3
4 Lemma. Let λ R be the smallest eigenvalue of A. For > max0, λ, the set of nonzero critical points of F A is { x R N : Ax = λx for some λ R with x = }, + λ and min F A x = x R N + λ. Proof. As was noted above, for x 0 0, F A x 0 = 0 Ax 0 = x 0 x 0, which implies that x 0 is a nonzero critical point of F A if and only if x 0 is an eigenvector of A corresponding to the eigenvalue λ 0 = x 0 with x 0 = +λ 0. In addition, F A x 0 = x 0 = + λ 0. Therefore, the set of nonzero critical points of F A is { x R N : Ax = λx for some λ R with x = }. + λ Due to the choice of > max0, λ, it is easy to see that F A is bounded from below and that a global minimizer x of F A exists and is an eigenvector of A corresponding to an eigenvalue λ with x = +λ. Since we can easily see that λ = λ and F A x = + λ = min F A x, x R N min F A x = x R N + λ. When dealing with F A, we will assume > max0, λ, where λ is the smallest eigenvalue of A if no condition on is stated. Theorem. Any local minimizer x of F A is a global minimizer. Proof. First of all, 0 is not a local minimizer of F A because for any nonzero θ R N, we have F A tθ F A 0 lim = < 0. t 0 + t θ Suppose that x is a local minimizer of F A. Lemma says that x is an eigenvector of A corresponding to an eigenvalue λ with x = and that +λ F A x = + λ + λ = min F Ax, x R 4
5 where λ is the smallest eigenvalue of A. We may diagonalize A such that A = QΛQ T, where Λ is a diagonal matrix with nondecreasing diagonal entries λ λ N and Q is an orthogonal matrix having x x as the jth column for some j implying λ = λ j. Then, it suffices to show that λ j = λ, which implies that x is a global minimizer. Suppose that λ j > λ. With y = Q T x, we have F A x = F A Qy = F Λ y. 6 For k =,,..., N, we set e k to be the k th column of the identity matrix I M N R. Then, we can see that x being a local minimizer of F A is equivalent to +λ j e j being a local minimizer of F Λ and that F Λ y = y, Λy + y y = N k= λ k + yk y, 7 where y = [y y N ] T. Moreover, +λ e is a global minimizer of F Λ. Since λ j > λ, e j is orthogonal to e and we can consider F Λ on the subspace spanned by {e, e j } by defining H : R R by Ha, b = F Λ ae + be j = λ + a + λ j + b a + b. Then, H0, +λ j exists and must be positive semidefinite, i.e., [ det ] H 0, 0. + λ j However, at a, b = 0, +λ, we obtain that det Ha, b = det + λ b ab a +b 3 a +b 3 ab + λ j a a +b 3 a +b 3 = λ λ j + λ j < 0, which is a contradiction. Therefore, λ = λ j = λ, i.e., any local minimizer x of F A is a global minimizer. In addition, we may be able to find all the eigenvalues and their corresponding eigenvectors of A. Corollary. Let λ λ N be the eigenvalues of A. Let {x,..., x k } be an orthonormal set of eigenvectors of A corresponding to the eigenvalues λ,..., λ k. We consider the following problem min F A x subject to x, x i = 0, i =,,..., k. 8 x R N Then, any local minimizer x of 8 is a global minimizer corresponding to the eigenvalue λ k+ with x = +λ k+. Proof. With a diagonalization QΛQ T of A, where Λ = diagλ, λ,..., λ N and the first k columns of Q are x,..., x k, and y = Q T x, we have that F A x = F Λ y and x = y, so 8 is equivalent to min{f Λ y : y = [0 0 y k+ y N ] T R N }. 9 5
6 Let Λ k be the last n k n k block of Λ, i.e., Λ k = diagλ k+,..., λ N. Then, 9 is equivalent to min z R N k F Λk z. Theorem applies to F Λk and we are done. Even though F A is not convex, we have seen that all local minimizers of F A are global minimizers, which is a rare case for nonconvex functionals, and that nonzero critical points of F A are eigenvectors of A. Hence, one can expect that any algorithm either to minimize the functional F A or to find a critical point of F A will work. For example, if we set Gx = x, then the conjugate functional G is G y = sup x R N x, y Gx = { 0, if y,, if y >, and Gx = G x = sup y R N x, y G y = sup y x, y. Then, 5 becomes min x R N, x, Ax + x x, y. y In fact, the constraint on y is y =, so we have min x R N, y = F A x, y = x, Ax + x x, y. 0 Since the functional F A, y is convex and quadratic in x for any fixed y, we may consider an algorithm such as. x = arg min x R N F A x, y,. Update y to be x x. 3. Iterate the above procedure until it converges. This algorithm is exactly the inverse power method A + I x k+ = x k x k. However, 0 is a constrained problem and the above algorithm requires solving a system of linear equation at every iteration, not to mention, the rate of convergence is linear. Therefore, we want to consider an algorithm that satisfies either one of the following two: the rate of convergence is linear, yet the algorithm applies only matrix addition and multiplication, the rate of convergence is faster than linear if we need to solve systems of linear equations.. The gradient descent method As for the first algorithm, we will analyze the gradient descent method for the minimization problem 5 with stepsize α k > 0: with x 0 0, x k+ = x k α k F A x k = x k α k Ax k + x k x k x k. 6
7 Let {q,..., q N } be an orthonormal basis for R N consisting of unit eigenvectors of A corresponding to the eigenvalues λ λ N, respectively. If x k = µ k, q + + µ k,n q N, then x k+ = µ k+, q + + µ k+,n q N = = N µ 0,i Π k j=0 [ α j λ i + i= N i= [ µ k,i α k λ i + ] q i x k x j ] q i. For simplicity, we assume a fixed stepsize 0 < α k = α < λ N + for all k N. Then, always converges to a critical point of F A. In fact, it converges to a global minimizer with probability if an initial point x 0 0 is chosen randomly. Theorem below is given in a general form. Theorem. A sequence {x k } generated by with x 0 = µ 0, q + + µ 0,N q N 0 converges to a critical point x of F A which is an eigenvector of A corresponding the eigenvalue λ l with x =, + λ l where l = min{j {,..., N} : µ 0,j 0}. More precisely, x is Proof. For any x 0, we get + λ µ 0,m q m. l µ m:λm=λl 0,m m:λ m=λ l x Ax, x F A x, = x x + x = λ + x, 3 where λ λ λ N. Then, α < λ N + implies x k > α for all k N since x k+ x k x k α F A x k, xk αλ + x k + α > α. x k Moreover, the line segment connecting x k and x k+ for k N lies entirely in {x R N : x α}. To see this, we take 0 < t < and observe that for k N, x k tα F x k x k tαλ + x k + tα > α tαλ + + t > α. Noting that F A x = A + I I xxt x x and A+I, I xxt x are positive semidefinite with A+I op λ N + and I xxt x op, we can see that for x α, F A x op maxλ N +, α α, 7
8 from which we obtain that for each k, F A x k+ F A x k + F A x k, x k+ x k + α x k+ x k = F A x k α F Ax k, which implies F A x k α F Ax k F A x k+. Note that for any K, α K K x k+ x k = k= k= F A x k α F Ax min x R N F A x. 4 Since F A is coercive and F A x k F A x < for all k, {x k } must be a bounded sequence in {x R N : x α}. Choosing a convergent subsequence {x kn } to x, we know from 4 that F A x = 0, i.e., x is an eigenvector of A corresponding to an eigenvalue λ i for some i N with norm x = +λ i. Knowing that {F A x k } is a decreasing and bounded sequence, we can easily derive that any subsequential limit x of {x k } satisfies F A x = F A x and A x = λ i x with norm x = +λ i. Hence, we can conclude that lim x k =. k + λ i Setting l = min{j {,..., N} : µ 0,j 0}, we can see from that λ i λ l. Suppose λ i > λ l. We note that for all k N, α λ l + > λ l + x k λ N + + α x k α x k > 0 and that as k, α λ l + αλ l λ i >. x k From, we can see that [ µ 0,l Π k j=0 α λ l + ] as k. x j This is a contradiction because {x k } is a bounded sequence. Therefore, λ i = λ l. Moreover, for λ p > λ l, we have α λ p + αλ p λ l 0, as k, x k implying [ µ 0,p Π k j=0 α λ p + ] 0 as k. x j 8
9 Hence, referring to, we can see that [ x k+ µ 0,m q m Π k j=0 α λ l + ] 0 as k. x j m:λ m=λ l In addition, convergence of the norm x k implies that Π j=0 [ α λ l + ] = x j + λ l µ m:λm=λl 0,m. Therefore, x k converges to + λ µ 0,m q m. l µ m:λm=λl 0,m m:λ m=λ l Now, going back to the generalized eigenvalue problem Ax = λbx, 5 via minimizing F A,B in 4, we realize that even though 4 is equivalent to 5, the gradient descent method we discussed above is applicable to F C with C = B A B to find a critical point of F A,B. That means that not only do we need to compute B, but also we need to invert it. However, it turns out that applying the gradient descent method directly to F A,B to solve 4 finds solutions of 5, as well. When solving 4, we will use µ A,j, µ B,j to denote the j th smallest eigenvalues of A and B, respectively. Moreover, we assume that µ B, = and that either µ A, 0 or µ A,N µ A, < 0 is true. Note that these assumptions are not restrictions, but simplifications. In relation to 5, when dealing with F A,B, the parameter will be assumed to satisfy > max0, µ A,. We set {r,..., r N } to be an orthonormal set of eigenvectors of C corresponding to the eigenvalues λ λ N. We also set q j = B r j for j N. Then, it is easy to see that {q,..., q N } is an orthonormal basis for R N with respect to the inner product x, y B := x T By. Theorem 3. If we choose 0 < α < then with x 0 0, the following procedure µ A,N + µ B,N 3, x k+ = x k α F A,B x k, k = 0,,,..., 6 produces a sequence {x k } converging to x with Bx, x = solution pair of 5. +λ, where x, λ is a Proof. If B = I, then this theorem is the same as Theorem. Hence, we will assume that µ B,N >. In addition, since the proof mimics that of Theorem with minor differences in 9
10 detail, we will emphasize only those minor differences. Let {x k } be the sequence generated by 6. First of all, we note that for any x k R N, Ax k, Bx k µ A,N µ B,N and = µ B, Bx k, Bx k µ B,N. 7 Bx k, x k Bx k, x k This implies that if 0 < α < µ A,N +µ B,N 3, then for 0 < ɛ := µ B,N, [ Axk, Bx k α Bx k, x k + Bx k, Bx k ] αµ A,N + µ B,N > ɛ. 8 Bx k, x k We can see by 7 and 8 that for k = 0,,,..., Bx k+, x k+ Bx k+, x k Bx k, x k = Bx k, x k ɛ + α xk α F A,B x k, Bx k Bxk, x k = Bx k, x k ɛ + Bx k, x k αɛ + α. 9 This proves that Bx k, x k α for k, which can be improved further as follows: for k, we have resulting in Hence, Bx k, x k ɛ + Bx k, x k αɛ + α Bx k, x k ɛ + α + ɛ, k Bx k+, x k+ α + ɛ ɛ l + α ɛ k. lim inf Bx k, x k α + ɛ k ɛ > α ɛ = αµ B,N and there exists K N such that k K implies l=0 Bx k, x k > αµ B,N. Moreover, we can estimate Bx k + tx k+ x k, x k + tx k+ x k for t 0, as follows: for k >,,..., and 0 < t <, Bx k + tx k+ x k, x k + tx k+ x k = Bx k tα F A x k Bxk Bxk tα F A,B x k, xk tα F A,B x k, Bx k = Bx k Bxk, x k [ [ Axk, Bx k = Bxk, x k tα Bx k, x k + Bx k, Bx k ] + tα Bx k, Bx k ] Bx k, x k Bx k, x k Bx k, x k tɛ + tα = Bx k, x k + tα Bx k, x k ɛ { Bx k, x k, if α Bx k, x k ɛ, Bx k, x k ɛ + α, if α < Bx k, x k ɛ. where ɛ = ɛ. Noting that α < Bx k, x k ɛ implies Bxk, x k ɛ + α > α ɛ ɛ + α = α ɛ > αµ B,N, 0
11 we have that for k K, Therefore, we have min Bx k + tx k+ x k, x k + tx k+ x k > αµ B,N. t [0,] {x k + tx k+ x k : t [0, ], k K} {x R N : Bx, x αµ B,N }. Since Bx B [I Bx,x and A + B op α and Bx Bx Bx T ] B is positive semidefinite and B [I Bx Bx, x T ] Bx Bx op B, α, F A,B x = A + B B [I Bx Bx, x Bx T ] Bx B, Bx we can see that F A,B x op α. The rest of the proof for convergence of the sequence {x k } k>k to x satisfying Bx, x = Bx = + λ, where x, λ is a solution pair of 5, is omitted due to the similarity of that in Theorem. In Theorem 3 above, convergence to a nonzero critical point of F A,B is confirmed. However, if we can efficiently deal with B, then we can guarantee to find a global minimizer of F A,B by considering the gradient descent method with respect to a different inner product. Corollary. If we generate a sequence {x k } by x k+ = x k αb F A,B x k, k = 0,,..., 0 with a randomly chosen x 0 0 and 0 < α < µ A,N +, then the sequence converges to x, where x, λ is a solution pair of 5 satisfying and Bx, x = Bx = + λ λ = min{λ : A λb is singular}. In fact, 0 is the gradient descent of F A,B with respect to the inner product x, y B := x T By. Proof. First of all, as we mentioned, 5 is equivalent to B Ax = λx, and to Cy = λy, with C = B A B, y = Bx.
12 Therefore, x, λ is a solution pair of 5 if and only if x is an eigenvector of B A corresponding to the eigenvalue λ if and only if Bx is an eigenvector of C corresponding to the eigenvalue λ. Note that since y, Cy = x,ax x x for x = B y, and min y = B y µb, =, λ = min y, Cy min µ A, B y y = y = { µ A,, if µ A, < 0, 0, if µ A, 0., Since > max0, µ A, implies > max0, λ, we can apply Theorem to F C y = y, Cy + y. Moreover, since λ N, the largest eigenvalue of C, is at most µ A,N 0 < α < µ A,N + λ N + and that a sequence {y k} generated by > 0, we know that y k+ = y k α F C y k, k = 0,,,..., with y 0 0 chosen at random, converges to y, an eigenvector of C corresponding to the smallest eigenvalue λ = λ with norm y = +λ. On the other hand, with y k = Bx k, k = 0,,..., we have which implies that 0 is nothing but F C y k = B F A,B x k, y k+ = y k α F C y k. Hence, the sequence generated by 0, with a randomly chosen x 0 0, converges to x, where x, λ is a solution pair of 5 satisfying Bx, x = + λ and λ = λ = min{λ : A λb is singular}. In addition, with respect to the inner product x, y B := x T By, we note that B F A,B x = B Ax + x, x, x B which is the gradient of F A,B with respect to the inner product, B because F A,B x = x, B Ax B + x B x B, Note also that B A is self-adjoint with respect to, B. Therefore, 0 is the gradient descent of F A,B with respect to, B. In general, with a nonsymmetrix square matrix E, minimizing the functional F E does not guarantee to find an eigenvector of E. However, Theorem 3 and Corollary make it possible to find eigenvectors of E in certain cases when E decomposes into E = B A with a symmetric matrix A and a symmetric positive definite matrix B. Moreover, if it is easy to compute B, then Corollary applies to find a global minimizer of F E with a better stepsize. We can also find subsequent eigenvectors in the same way as presented in Theorem.
13 Corollary 3. Let x, λ,..., x m, λ m be solution pairs of 5 where λ λ m are the first m smallest ones in {λ R : A λb is singular}. We consider the following problem min F A,B x subject to x, x k B = 0, k =,,..., m. x R N Then, any local minimizer x of F A,B in the subspace orthogonal to {x,..., x m } with respect to the inner product, B, is a solution with Bx, x = and +λ λ = min{λ R : A λb is singular} \ {λ,..., λ m }.. The Newton s method As for the second algorithm with a faster rate of convergence, we will analyze the Newton s method to find nonzero critical points of 5, which are eigenvectors of A. Since the functional F A is continuously twice differentiable at x 0, if we apply the Newton s method, we will generate a sequence {x k } by x k+ = x k F A x k F A x k, k = 0,,..., with an initial x 0 0 unless F x k is singular. We can observe that becomes for k = 0,,,..., [ x k+ = A + I + xk xk T ] xk x k x k x k x k x k. Hence, we propose the following scheme: with an initial guess x 0 0, for k = 0,,,..., compute y k and x k+ by y k = x k x k, and [ A + I + xk xk T ] x k+ = y k. 3 x k x k x k x k We wrote 3 in the given form to enhance its similarity either to the inverse iteration or to the Rayleigh quotient iteration. As for convergence, we will show that convergence of the norm x k is equivalent to convergence of x k. Theorem 4. Let A Sym N R have eigenvalues λ λ N. Let > max0, λ, and 0 < x 0 +λ j for any j N.. Suppose that a sequence {x k } k= can be generated by 3, i.e., x k is computable for all k N, and that x k converges to η > 0. If x k +λ j for any j N and for all k, then there exists i 0 N such that η = +λ i0 and x k converges to an eigenvector x of A corresponding to the eigenvalue λ i0 with x = +λ i0.. On the other hand, suppose that we generate a sequence {x k } k0 k= for some k 0 N by 3 and x k0 satisfies x k0 = +λ i, where λ i for some i N is an eigenvalue of A with multiplicity. Let q i be a unit eigenvector of A corresponding to λ i. If x k0 is not a critical point of F A with qi T x k 0 > 0, then x k0+ is an eigenvector of A corresponding to the eigenvalue λ i. If qi T x k 0 +λj +λ i for j < i, then x k0+ is a critical point of F A, i.e., an eigenvector of A corresponding to the eigenvalue λ i with norm x k0+ = +λ i. However, if qi T x k 0 = +λj +λ i for some j < i, then the system becomes singular and we may not compute x k0+ uniquely. In any case, the algorithm terminates in k 0 + iterations. 3
14 Proof. It suffices to consider the case that A is a diagonal matrix with diagonal entries λ λ λ N. Then, 3 becomes + λ j x k+,j = y k,j x k+ x k x k yt k y k+, j =,,..., N, 4 where x k+ = [ ] T x k+, x k+,n and yk = x k x k. Let {x k} be a sequence generated by 4 with x k +λ j for any j N and for all k N {0}. Firstly, we consider the case that x k converges to η > 0. Since x k is computable for all k N, we can see from 4 that x k+ x k yt k y k+ 0, k 0. By setting J 0 := {j {,,..., N} : x 0,j 0}, we know that for k 0, x k,j 0 if and only if j J 0. We will now prove lim sup k y T k y k+ = by contradiction. Suppose that lim sup yk T y k+ <. k Then, there exists ɛ < with lim sup k yk T y k+ = ɛ. Given δ 0, ɛ, we may choose l N so that k l implies x k η < δ and x k x k+ δ < and yk T y k+ < ɛ + δ. 5 We also choose J J 0 satisfying η + λ J From 4, we see that for k l, η = min + λ j. j J 0 x k+,j x k+ δ ɛ x k,j η + λ J + δ + λ J x k. 6 If η + λ J < ɛ, then by choosing δ 0, ɛ satisfying η + λ J + δ we can see from 6 that lim k x k+,j x k+ η min + λ j j J 0 + λ J < δ ɛ, =, which is impossible. Hence, ɛ i.e., η ɛ + λ or η ɛ + λ, where λ = min j J0 λ j and λ = max j J0 λ j. Suppose that η ɛ +λ. For any 0 < δ < ɛ +λ < ɛ, we can see from 5 that for each j J 0, k l implies x k + λ j < η + δ + λ j ɛ + λ j + λ + δ 4 + λ j < 0,
15 and This results in, for k l, y T k y k+ = x k x k+ yt k y k+ > δ yt k y k+ > 0. xk x k+ N yt k y k+ j= δ yt k y k+ j J 0 y k,j x k + λj yk,j x k + λj < 0. This implies that ɛ 0, i.e., η 0, which is a contradiction, i.e., η Hence, we must have η ɛ +λ. Again with 0 < δ < ɛ +λ for k l, and for each j J 0, x k + λ j > η δ + λ j ɛ + λ j + λ δ which implies that for k l, since ɛ + δ <, y T k y k+ = + λ j ɛ +λ is not possible. < ɛ, we can also see that > ɛ + λ j ɛ + λ j > 0, + λ + λ xk x k+ N yt k y k+ j= δ yt k y k+ j J 0 y k,j x k + λj yk,j x k + λj > 0. Hence, we have If we extract a subsequence y kn of 3 and knowing that 0 ɛ <. 7 such that y T k n y kn+ ɛ as n, then using the form we can see that lim inf n yk T n+ x kn+ Ax k n+ = lim inf n yk T [ n+ x kn+ A + I + x kn yt k n+ay kn+ + x kn x kn λ + + ɛ η η ɛ η η + ɛ ɛ + λ. xkn yt k n+ay kn+ λ xkn T ] x kn+ = yt k n+ y k n x kn+ x kn x kn + x kn yt k n y kn+ = yt k y n+ k n x kn+ 5
16 Therefore, we obtain ɛ + λ η + ɛ ɛ + λ. 8 However, this is a contradiction since 7 implies + ɛ ɛ < ɛ, i.e., η ɛ +λ is not possible, either. Therefore, we conclude that ɛ < is impossible, i.e., lim sup k yk T y k+ =. We can now show that η = +λ i0 for some i 0 N. Suppose that η +λ j for any j N. By considering a subsequence {y kn } with lim n y T k n y kn+ =, it is easy to see using 4 that = lim n max j N x kn x kn+ yt k n y = lim kn+ n <, η + λj N j= y k n,j x kn + λj which is a contradiction. Hence, we have that η = +λ i0 for some i 0 N. Next, we will show that x k converges to an eigenvector x of A corresponding to λ i0 with norm x = η = +λ i0. Firstly, we show that there exists j J 0 such that λ j = λ i0. As above, by choosing a subsequence y kn such that yk T n y kn+ as n, it is easy to see that there must be j J 0 with λ j = λ i0. That is, {j J 0 : λ j = λ i0 }. Hence, without loss of generality we will say that i 0 J 0. Let k 0 N be such that k k 0 implies Then, for k k 0, and for λ j λ i0, + λ i 0 λ j λ i0 < min. x k λ j λ i0 3 + λi0 x k + λj x k Moreover, since we have that for j N, <. x [ K+,j x K+ = Π K x k+ x k yt k y k+ ] x0,j k=0 + λj, K 0, 9 x 0 x k if we choose j for which λ j λ i0, we can see that Since x K+,j x K+,i0 x K+,j = x K+,i0 = x0,j x 0,i0 Π K k=k 0 + λi0 K k0+ x k + λj x k Π k0 k=0 Π k0 k=0 + λi0 + λj x k x k + λi0 + λj x k x k x 0,j x 0,i0 x 0,j 0 as K. x 0,i0 for λ j = λ i0, we can also see that x K+ η as K implies x K+,i0 λ j=λ i0 x0,j as K. x 0,i0 + λ i0 6
17 Let m i0 = x0,j λj=λi0 x 0,i0. Then, as K, This implies that y K+,i0 m i 0 as K. In addition, noting that for all k, and lim k N j= x K+,i0. 30 m i 0 + λ i0 yk T y k+ = x k x k+ N y yt k,j k y k+ j= x k + λj, y k,j x k + λ j =, we know that lim k x k x k+ yt k y k+ = 0. Hence, not only do we have lim sup k y T k y k+ =, but also we can obtain that lim k yt k y k+ =. In fact, since x k,j = x k,i0 x0,j x 0,i0 for λ j = λ i0, we can see that y T k y k+ = and that lim k y T k y k+ = implies x0,j x k,i0 x k+,i0 + x k x k+ x 0,i0 λ j=λ i0 lim x k,i 0 x k+,i0 =. k m i 0 + λ i0 Together with 30, we know that lim k x k,i0 exists and is either m i0 Letting x,i0 = lim k x k,i0, we have that x k converges to x, where x,j = { x 0,j x 0,i0 x,i0, for λ j = λ i0, 0, for λ j λ i0. λ j λ i0 x k,j x k+,j, +λ i0 or m i0 +λ i0. Note that x is an eigenvector of A corresponding to λ i0 with norm x = η = +λ i0. This finishes the first part of the theorem. For the second part of the theorem, we generate a sequence {x k } k0 k= for some k 0 N and suppose that x k0 satisfies x k0 = +λ i, where λ i for some i N is an eigenvalue of A with multiplicity. If x k0 is not a critical point of F A and x k0,i 0, then we have x k0,i < +λ i and x k0 yt k 0 x k0+ =, which turns 4 into λ λ λ N x k0+,. x k0+,n 7 = λ i x k0+,. x k0+,n.
18 Since λ i is of multiplicity, there exists a unique solution x k0+ = α e i, where e i is the standard basis element in R N with e i,j = δ ij and α = x k0,i+λ i. Note that α > +λ i and x k0+ is an eigenvector of A corresponding to the eigenvalue λ i, and yet is not a critical point of F A. Since x k0+ satisfies 4 with y k0+ = ± e i, we can see that for j N, { λ j + α x k 0+,j = δ ij α x k 0+,i, if y k0+ = e i, λj + α x k 0+,j = δ ij + α x k 0+,i, if y k0+ = e i. 3 Note that α = +λ j is equivalent to x k0,i = +λj +λ i. Since x k0,i < +λ i, it is possible to have α = +λ j only if j < i. Hence, if x k0,i +λj +λ i for j < i, then α +λ j for j < i, and 3 is nonsingular and has a unique solution { ± +λ x k0+,j = i, if j = i, 0, if j i. depending on y k0+ = ± e i. That is, x k0+ is a critical point of F A, an eigenvector of A corresponding to the eigenvalue λ i with norm x k0+ = and the algorithm terminates. +λ i On the other hand, if x k0,i = +λj +λ i for some j < i, then α = +λ j 3 becomes singular and the algorithm terminates. and the system Remark. It is very interesting to note that in both of the gradient descent method and the Newton s method discussed above, the convergence of a generated sequence {x k } is confirmed by the convergence of the sequence of their norms { x k }, which hardly happens, in general. So, it would probably be worth further investigation in a subsequent work..3 Some variants of the proposed framework We have seen in the previous sections how to solve the generalized eigenvalue problem 5 Ax = λbx with A Sym N R, B Sym N,p R in an unconstrained framework. In this section, we will proceed our discussion on some variants, inspired by 3, of our proposed framework including nonsymmetric cases, as well. When x k converges to an eigenvalue x of A corresponding to an eigenvalue λ via 3, we know that x k converges to λ, hence 3 becomes with ỹ = A λ I + + λ ỹỹ T x = ỹ x x. Hence, we may consider the following procedure. One Step Eigenvector Estimation. Given A Sym N R, and an eigenvalue λ of A, and > 0 with λ, we choose x 0 uniformly at random from S N and solve for x, A λi + + λx 0 x T 0 x = x 0. 3 In the case of 0 being an eigenvalue of A, by setting λ = 0, 3 turns into A+x 0 x T 0 x = x 0, which means that 3 is equivalent to A λi + x 0 x T 0 x = x 0, 0. 8
19 Before proceeding our discussion, we want to mention a work [3] of G. Peters and J.H. Wilkinson, which was further explained in []. In [3], the authors discussed an idea of computing an approximate eigenvector x λ when an approximate eigenvalue λ is given, i.e., when A λi is very ill-conditioned, or near singular, by considering A λi + x i p T x = x i, 33 with a random vector p, inspired by the inverse iteration, i.e., by A λix i+ = xi x. i The authors noticed that A λi + x i p T can be well-conditioned and the solution to 33 is nothing but a constant multiple of the solution to A λix = x i, but provided reasons why 33 is not in their favor simply because A λi + x i p T changes its form at every iteration making computations inefficient. However, with λ fixed, even though a limit exists for the inverse iteration, the convergence is still linear. Further discussions can be found in [4], and the references therein, in relation to the shifted inverse iteration and the Rayleigh quotient iteration. On the other hand, our concern is if we can make full use of the nonsingular system 3 to analyze quantitatively the error in eigenvector estimation regardless of the multiplicities of the corresponding eigenvalues helping understand the Newton s method. So, we will provide a series of results for the rest of this section. Proposition. Suppose that λ has multiplicity. With probability, the equation 3 has a unique nonzero solution x that is an eigenvector of A corresponding to the eigenvalue λ. Proof. Let q be a unit eigenvector of A corresponding to λ. Note that if we choose x 0 S N uniformly at random, then we have q T x 0 0 with probability. Moreover, if q T x 0 0, then A λi + + λx 0 x T 0 z = 0 implies x T 0 z = 0. Hence, we have Az = λz. Since λ has multiplicity, z = aq for some a R. In addtion, 0 = z T x 0 = aq T x 0 implies a = 0. That is, z = 0. Hence, A λi + + λx 0 x T 0 is nonsingular and there exists a unique nonzero solution x to 3. By multiplying 3 by q T, we have + λx T 0 x =, which implies that x also satisfies A λi x = 0. If the multiplicity of an eigenvalue λ is greater than, then 3 becomes singular and Proposition does not apply. However, when the multiplicity m > is known, we can construct another nonsingular system. Corollary 4. Suppose that an eigenvalue λ of A has multiplicity m >. We choose x 0,..., x m uniformly at random from S N and set an N m matrix X 0 = [x 0 x m ]. With probability, the equation A λi + + λx 0 X T 0 X = X 0 has a unique nonzero solution X = [ x 0 x m ], an N m matrix, whose columns x 0,..., x m constitute a basis for the eigenspace corresponding to the eigenvalue λ. Moreover, Proposition below says that, regardless of an eigenvalue s multiplicity and of the symmetry of a matrix, a good estimate of the eigenvalue guarantees a good estimate of a corresponding eigenvector through the nonsingular linear system 3. Proposition. Let A M N R be diagonalizable. Suppose that λ is an eigenvalue of A. Let be such that > 0 and λ. 9
20 . If N A λi, the null space of A λi, is of dimension, then choosing x 0 S N uniformly at random, 3 has a unique nonzero solution x with probability, which is an eigenvector of A corresponding to λ.. If N A λi has dimension greater that, then choosing x 0 S N uniformly at random, we can see, with probability, that for λ close enough to λ, has a unique nonzero solution x λ satisfying A λi + + λx 0 x T 0 x = x 0 34 η λ λ < x λ x < η λ λ for some 0 < η < η <, where x N A λi is uniquely determined by x 0 and η, η do not depend on λ. Proof. The first part can be proven in the same way as we proved Proposition with a choice of x 0 S N satisfying q T x 0 0 and q T x 0 0, where q and q are unit vectors spanning N A λi and N A T λi, respectively. Note that A λi for 0 < λ λ < min λj λ λ j λ, is invertible and that if x 0 S N is chosen uniformly at random, then x 0 is not orthogonal to N A λi with probability. Moreover, since R N = eigenspace λ V with V = λ j λ eigenspaceλ j, representing x 0 = q 0 + r 0 uniquely in R N, where q 0 eigenspace λ = N A λi and r 0 V, we have q0 T x 0 0 with probability. We also note that for any 0 < δ < min λ λj λ j λ, there exists K > 0 such that sup λ [ λ δ, λ+δ] A λi V < K, 35 where A λi V is the operator norm of the restriction A λi : V V, which is not difficult to see since V is invariant under A λi and A λi : V V is invertible and A λi V is continuous in λ for λ [ λ δ, λ + δ]. So, we will fix 0 < δ < min λ λj λ j λ, + λ and consider A λi as being restricted to V for λ [ λ δ, λ + δ]. Firstly, with such an x 0, we will confirm that for 0 < λ λ < δ, A λix = x 0 + λx 0 x T 0 x 36 has a unique solution x λ. If there is a solution x λ, then letting α λ = + λx T 0 x λ, we can see that 36 becomes A λix λ = α λ x 0, i.e., x λ = α λ A λi x 0. Hence, the unique existence of α λ confirms the unique existence of x λ. Note that α λ must satisfy α λ = α λ + λx T 0 A λi x 0 α λ = + + λx T 0 A λi x 0. Together with 35, it is not difficult to see that for λ [ λ δ, λ + δ], r 0 A λi V + δ < A λi r 0 < K r 0. 0
21 Since A λi q 0 = λ λ q 0, we have x T 0 A λi x 0 = λ λ qt 0 x 0 + x T 0 A λi r 0 qt 0 x 0 λ λ A λi r 0 > qt 0 x 0 λ λ K r 0. Noting that + λ > + λ for 0 < λ λ < δ, we can see that if 0 < λ λ < δ 0 := min δ, qt 0 x 0 β where β = max6k r 0, K r λ, then + λxt 0 A λi x 0 > and α λ exists and we can easily see that the unique nonzero solution x λ to 36 is represented as x λ = α λ A λi x 0 =, A λi x λx T 0 A λi x 0 and that α λ λ λ + λq 0 T x 0 q0 T x λx T 0 A λi r 0 = [λ λ + + λx T 0 A λi r λq0 T x 0] + λq 0 T x λ λ 0 < q0 T x K + λ + δ 0 r 0 + λq 0 T x 0 + λ δ 0 q0 T x 0 λ λ + λ λ λ. + δ 0 K r 0 Since + λ δ 0 > + λ / and + λ + δ 0 < 3 + λ / and δ 0 qt 0 x 6K r 0, we obtain that for 0 < λ λ < δ 0, implying that where + λ δ 0 q T 0 x 0 λ λ + λ + δ 0 K r 0 > + λq T 0 x 0 4 is determined by x 0. If we set x = α λ λ λ + λq 0 T x < ω λ λ, 0 ω = 4 qt 0 x K3 + λ / r 0 + λq T 0 x 0 + λx T 0 q0q 0, then for 0 < λ λ < δ 0, A λi x 0 x λ x = + + λx T 0 A λi x 0 + λx T 0 q 0 q 0 αλ λ λ + λx T 0 q q 0 + αλ A λi r 0 0 < η λ λ where η = ω q 0 + K r 0 + ωδ + λx 0. T 0 q0
22 On the the hand, we let z 0 V be such that z 0 = A λi r 0, i.e., A λz 0 = r 0 and set y = z 0 zt 0 q0 q 0 q 0 and y 0 := y y. Then, it is not difficult to see that yt 0 z 0 0 and y0 T q 0 = 0 and y T 0 A λi r 0 y T 0 z 0 = λ λy T 0 A λ A λi r 0 K r 0 λ λ, which implies that there exists δ > 0 such that for 0 < λ λ < δ, we have y T 0 A λi r 0 > yt 0 z 0 and α λ > λ λ + λx T 0 q 0. i.e., x λ x y0 T x λ x = α λ y T A λi r 0 > η λ λ, with η = y 4 T 0 z0 + λx > 0. Therefore, 0 < λ λ < minδ0, δ T impilies 0 q0 η λ < x λ x < η λ. From the analysis about 3, we can see a similarity between the Newton s methd and the Rayleigh quotient iteration. When generating a sequence {x k } via A µ k I + + µ k y k y T k x k+ = y k, where y k = x k x k, we can see that is obtained with µ k = x k and the Rayleigh quotient method is obtained with µ k = x k,ax k x k. The same is true for solving Ax = λbx, i.e., if we use µ k = x k B, where x k B = x k, x k B, we have the Newton s method, [ A + B + Bxk Bxk T ] x k+ = Bx k, x k B x k B x k B x k B x k B whereas if we use µ k = x k,ax k x k,x k B, then we have the Rayleigh quotient method. We will present numerical experiments towards the end of this paper showing an interesting feature of our proposed framework, that is, our proposed framework tends to find the smallest eigenvalues due to the philosophy of our proposed framework where F A,B is to be minimized, whereas the Rayleigh quotient method tends to find the largest eigenvalues when both methods begin with a random initial point x 0. Remark. We would like to mention that the same framework applies to solving eigenvalue problems involving complex Hermitian matrices and to finding singular values of A M M N R. When it comes to singular values of A, we may consider either A T A or AA T in place of A since λ > 0 is a singular value of A if and only if λ > 0 is that of A T A or AA T. 3 An eigenvalue problem on infinite dimensional spaces It is interesting to see that the same framework as 3 applies to eigenvalue problems on infinite dimensional spaces such as the Sturm-Liouville eigenvalue problem, the eigenvalue problem of self-adjoint elliptic operators, etc. We will present one such application of finding an eigenfunction of a self-adjoint uniformly elliptic operator corresponding to the smallest eigenvalue, which is an infinite dimensional version of a real symmetric and positive definite matrix.
23 3. Symmetric uniformly elliptic operators Let Ω be a bounded open subset of R d with Lipschitz boundary Ω. We will denote by L a symmetric uniformly elliptic operator defined by Lu = d i a i,j j u + cu, 37 i,j= where a i,j, c L Ω, i, j =,..., d, are such that a i,j x = a j,i x a.e., and cx 0 a.e., and there exists 0 < α β < such that for a.e. x Ω and for ξ R d, α ξ d a i,j xξ i ξ j β ξ. i,j= The problem that we are interested in is to find ϕ H0 Ω that solves { Lϕ = λϕ, in Ω, ϕ = 0, on Ω. 38 It is known that the eigenvalues λ, λ,... of L are nonnegative and we may have them in an increasing order, that is, 0 < λ < λ λ 3. Therefore, a natural question to ask is to find the smallest eigenvalue λ of L and its corresponding eigenfuction. Definition. ϕ H 0 Ω is a weak solution of 38 if for any ψ C 0 Ω, d Ω i,j= a i,j x j ϕx i ψxdx + cx λϕxψxdx = 0. Ω Then, it is natural from the discussion in the previous sections that we want to define a functional F L with > 0 by F L u = + d a i,j x j ux i uxdx + Ω i,j= Ω ux dx and solve the following minimization problem Ω ux dx Ω cx ux dx 39 min F L u 40 u H0 Ω and investigate the relationship between 38 and 40. The existence of a minimizer of the problem 40 is obvious by the standard method using the compact embedding theorem by Rellich-Kondrachov. Moreover, we can observe the same characteristics of 39 as those of 3. For theorems and lemmas that follow, we will omit their proofs because they are essentially the same as what we presented in the finite dimensional case. 3
24 Lemma. The set of nonzero critical points of F L is { ϕ H0 Ω : Lϕ = λϕ in Ω and ϕ L Ω = }. + λ and where λ > 0 is the smallest eigenvalue of L. min F L u = u H0 Ω + λ, Theorem 5. Any local minimizer of 39 is a global minimizer. 3.. Corresponding parabolic PDEs Corresponding to the uniformly elliptic PDEs of the form 38, we will consider the following parabolic PDE: for T 0,, we solve u t = Lu ut u in Ω T := Ω 0, T ], u = 0 on Ω T := Ω [0, T ], 4 u0 = u 0 0 in H0 Ω, where ut = Ω ux, t dx. Due to the condition c 0 in Ω, L is positive definite. In the context, we will use,, for both inner products in L Ω and in R N for N N. Note that the partial differential equation in 4 is the formal gradient flow of a functional F L in 39, i.e., u t = F Lu. Theorem 6. The problem 4 has a unique weak solution u for any T 0,. Moreover, if ψ is an eigenfunction of L corresponding to the smallest eigenvalue λ > 0 with ψ =, and if u 0, ψ 0, then the solution u satisfies ut + λ v in L Ω as t for some v {±ψ }. Before proving Theorem 6, we will present an ODE version of 4, which will be used when proving Theorem 6. For A Sym N,p R, as we did previously, if we consider a diagonalization of A, A = QΛ N Q T, where Q is an orthogonal matrix and Λ N = diagλ,..., λ N, 0 < λ = = λ p < λ p+ λ N for some p < N, we have F A x = F ΛN y, with y = Q T x. 4 The gradient descent flow associated with the functional F ΛN d dt Φ Nt = F ΛN Φ N t, and we are interested in the existence of a solution of 43 below, which is to solve on 0, is { d dt Φ Nt = F ΛN Φ N t = Λ N + IΦ N t + Φ N 0 R N, φ j 0 0 for some j p, Φ N t Φ Nt, 43 where Φ N = Φ N t = [φ t φ N t] T and Φ N t = 4 N k= φ k t.
25 Theorem 7. There exist a unique solution Φ N C [0, of 43 and v R N such that Φ N t v as t, where v is an eigenvector of Λ N corresponding to the eigenvalue λ with v = +λ v, e j 0. Note that { e,..., e N } is the standard basis for R N. Proof. The existence and uniqueness of a solution Φ N on [0, is easily guaranteed by the theory of ODEs once we establish a lower bound ω > 0 for Φ N t, t [0,. Due to φ j 0 0, i.e., Φ N 0 > 0, a solution exists and is unique on [0, ɛ] with ɛ <<. Then, by setting T max = sup{t 0, : a solution Φ N exists and is unique on [0, T ]}, we know that T max ɛ. Note that for t 0, T max and for k N, we have d dt φ kt = λ k + φ kt + Φ N t φ kt > λ N + φ kt, resulting in φ k t 0 for t 0 if and only if φ k 0 0. Let ω = min +λ N, Φ N 0. Suppose {t 0, T max : Φ N t ω}. Then, T ω = inf{t 0, T max : Φ N t ω} exists. Since there exists δ > 0 such that ω < Φ N t < 3 ω on T ω δ, T ω, we have d dt Φ Nt 3 λ N + Φ N t > 0 on T ω δ, T ω, that is, Φ N t is increasing in T ω δ, T ω and lim t Tω Φ N t > ω = Φ N T ω. This is a contradiction. Therefore, we have inf Φ N t > ω, t 0,T max which eventually implies that T max =. In addition, we note that d d dt F Λ N Φ N = dt Φ N, Λ N Φ N + Φ N Φ N Φ N = dφ N dt and that since F ΛN Φ N is bounded below, there exists y R such that implying that lim dφ N = 0 dt t and lim F Λ t N Φ N t = y 44 ΛN lim Φ N + Φ N = 0. t Φ N That is, for each k =,,..., N, φ lim λ k + t Φ N t k t =
26 Since we have ΛN 0 = lim Φ N + ΛN Φ N lim + IΦ N, t Φ N t lim t k= N λ k + φ kt =, 46 which implies that there exist k and {t n } such that t n as n and Unless lim n φ k t n = lim n φ k t n > 0. +λ k, there exists k k such that lim n φ k t n > 0 by taking a subsequence of {t n } if necessary. Then, 45 implies λ k = λ k and lim Φ Nt n =. n + λ k We may repeat this process finitely many times to have lim n {k:λ k =λ k } φ kt n = + λ k, lim n {k:λ k λ k } φ kt n = 0. Suppose that there exist l and {s n } such that λ l λ k and s n as n and n {k:λ k =λ l } lim n φ l s n > 0. Then, as was done above, with a subsequence of {s n } if necessary, we have lim n Φ N s n = +λ l and lim φ ks n = + λ l, lim φ ks n = 0 n and 44 implies that {k:λ k λ l } lim F Λ n N Φ N t n = + λ k = y = lim F Λ n N Φ N s n = + λ l, which is a contradiction. Therefore, we conclude that lim t Φ N t = lim t {k:λ k =λ k } φ kt = + λ k, lim t {k:λ k λ k } +λ k φ kt = 0. We will now claim that λ k = λ. Suppose that λ k > λ, i.e., k > p. Then, since +λ k, there exists T > 0 such that for t > T, +λ +λ k +λ +λ k > Φ N t < + λ + λ k + λ + λ k, and 6
27 and we have d dt p φ kt = k= λ + + Φ N t p φ λ k λ p kt φ + λ + λ kt k for t > T, which results in Φ N t as t. This is a contradiction. Therefore, we N have λ k = λ and lim t k=p+ φ k t = 0 and lim Φ N t = lim t t k= k= Lastly, we will claim that there exists v R N such that v = Note that for k =,..., p, if φ k 0 0, then Φ N t v as t. k= p φ kt = + λ. 47 +λ, and v, e j 0, and d φ k t dt φ kt = λ + + Φ N t for t 0, implying that for T 0,, T lnφ kt lnφ k0 = λ + + dt =: ΨT. 0 Φ N t From 47, we can see that which implies p φ kt = e ΨT k= Hence, for k p, p φ k0 as T, + λ k= Ψ = lim Ψt = ln t + λ φ kt v k as t, ln p k= φ k0. where v k = φ k 0e Ψ. Noting that φ k tφ k 0 > 0 for all t 0 unless φ k 0 = 0, we can easily see that φ k t v k as t. If we set v = [v v p 0 0] T, then v is an eigenvector of Λ N corresponding to λ and v, e j = v j 0 since φ j 0 0 for some j p, and Φ N t v as t and v = +λ. We will now give a proof of Theorem 6. 7
28 Proof. Theorem 6 The proof is inspired by the Galerkin s method. Note that we can find an orthonormal basis {ψ k } k N for L Ω such that ψ k is an eigenfunction of L corresponding to the k th smallest eigenvalue λ k with ψ k H0 Ω. Then, we let V N, N N, be the subspace of L Ω spanned by {ψ,..., ψ N } and let P N be the projection of L Ω onto V N. Suppose that u 0 H0 Ω is given and satisfies u 0, ψ 0. We consider u t = Lu ut u in Ω T, u = 0 on Ω T, 48 u0 = P N u 0. If u N L [0, T ]; H 0 Ω with d dt u N L [0, T ]; H Ω is a solution of 48, then for k =,..., N, we have and d dt u N, ψ k = λ k + u N, ψ k + u N, ψ k u N t d dt u N, ψ k = λ k + u N, ψ k + u N, ψ k u N t with u N 0, ψ k = P N u 0, ψ k. Considering φ k t = u N t, ψ k, k =,..., N, we can see that φ,..., φ N solve 43. Since Φ N = [φ φ N ] T satisfies that for t [0,, u N t Φ N t > ω = min, Φ N λ N Therefore, the existence of a solution to 48 can be obtained by a linear system of ODEs 43 given in the Appendix below, with the initial condition and setting Φ N 0 = [ P N u 0, ψ P N u 0, ψ N ] T, u N x, t = φ tψ x + + φ N tψ N x. Then, u N is a weak solution of 48 such that u N t + λ v in L Ω as t, where φ,..., φ N satisfy 43 and v is either ψ or ψ. Suppose now that u N,, u N, are two such solutions of 48. Let v N = u N, u N,. Then, we have v N = Lv N fu N, fu N,, 50 t where fux, t = ux, t ux,t ut. Since f on L Ω is Lipschitz with a Lipschitz constant µ = ω on {u : u ω} and the two solutions u N,, u N, satisfy 49 for all t 0, by taking the inner product on L Ω with v N on both sides of 50, we have which implies d dt v Nt, v N t Lv N t, v N t + µ v N t, v N t, d dt v N, v N µ v N, v N Lv N, v N
29 Therefore, v N 0 = 0 implies u N, = u N, for a.e. x, t Ω [0, T ] for any T 0,. In fact, we know that u N C [0, ; H 0 Ω as well as u N L [0, ; H 0 Ω, d dt u N L [0, ; H Ω. We now solve 4. Firstly, we fix k 0 N so that u0 < P k0 u 0 with λ k0 < λ k0+ and obtain the solution u N of 48 with N > k 0. We may write u N as u N t = φ,n tψ + + φ N,N tψ N. Let ϕ,n t = φ,n t + + φ k t and ϕ 0,N,N t = φ k t ,N φ N,N t. Then, we have This implies that dϕ,n dt dϕ,n dt λ k0 + ϕ,n + Φ N t ϕ,n, 5 λ k0+ + ϕ,n + Φ N t ϕ,n. 53 ϕ,n t ϕ,n 0e λ k 0 +t+ t 0 Φ N s ds ϕ,n 0e λ k 0 ++t+ t 0 Φ N s ds ϕ,n t i.e., ϕ,n t ϕ,n t ϕ,n 0 ϕ,n 0 eλ k 0 + λ k0 t > e λ k 0 + λ k0 t 54 due to ϕ,n 0 > u0 > ϕ,n 0. Let M = max +λ, u 0, M = min λ k0 +, u 0. Note that dϕ,n dt λ + ϕ,n + implies ϕ,n t Φ N t < M. And 54 implies Φ N t ϕ,n ϕ,n t < M e λ k 0 + λ k0 t. 55 Since we already saw that φ j,n t, j =,,..., N, exist for all t 0,, if we suppose t 0 = inf{t 0, ϕ,n t = M } <, then from 5 and 54, we have Φ N t 0 = ϕ,n t 0 + ϕ,n t 0 < M + e λ k 0 + λ k0 t 0 < M. λ k0 + Therefore, there exists δ > 0 such that for t t 0 δ, t 0, Φ N t < 9 λ k0 +,
30 which implies that dϕ,n t λ k0 + ϕ,n t + dt Φ N ϕ,n t > 0 on t 0 δ, t 0. This is a contradiction since ϕ,n t > M for t [0, t 0. Therefore, we end up with Note that 56 implies That is, the solution u N for N > k 0 satisfies M < ϕ,n t < M for t inf t 0 Φ N t M inf u N t M. 57 t 0 We fix T 0, and consider a sequence of solutions {u N t} N>k0, where u N is the solution of 48 with u N 0 = P N u 0. Using v = u N u M with N, M > k 0 in place of v N in 5, and noting that we may choose ω = M for µ = ω, we obtain u N t u M t e µt u N 0 u M 0 e µt u N 0 u M 0 for t [0, T ], 58 which implies that for any m N, {φ m,n t} N>k0 converges uniformly to φ m, t in [0, T ] and Φ N t Φ t uniformly on [0, T ] as N, where φ m,n t = u N t, ψ m and Φ t = m= φ m, t. Hence, for all m, we have In addition, for k 0 < m N, if we consider d dt φ m, = λ m + φ m, + Φ φ m,. ω m,n t = φ m,nt + + φ N,Nt, then we can obtain, by the same argument for 55, ω m,n t < M e λm λ k 0 t, t 0 59 implying φ m,n t < M e λm λ k t 0 on [0, T ] and, eventually, on 0,. By a slight modification on 55, we can see that even for each m k 0, there exists ζ m > 0 such that φ m,n t < ζ m M e λm λt on [0,, i.e., with η k0 = max k=,...,k0 ζ k, { η k0 M e λm λt, m k 0, φ m,n t < M e λm λ k t 60 0, m > k 0. This implies that { N } λ m φ m,n m= Hence, by defining N N converges uniformly to u x, t = λ m φ m, in [0, T ] as N. m= φ m, tψ m x, m= 30
31 we can easily see that u L [0, T ]; H 0 Ω and d dt u = m= and that for a.e. t [0, T ], and for each v H 0 Ω, Ω t u x, tvxdx + Ω dφ m, tψ m x L [0, T ]; H Ω dt d Ω i,j= u xvxdx u t a i,j x j u x i vxdx + cxu xvxdx Ω Ω u xvxdx = 0 with u 0 = u 0 in L Ω. Therefore, u is a weak solution of 4 for any T 0,. Next, using the functional in 39, we define ft for t 0, by Note that ft = F L u = + ft = d a ij x j u x, t i u x, tdx + Ω i,j= Ω u x, t dx λ m φ m, t + m= Ω u x, t dx Ω φ m, t Φ t. m=, cx u x, t dx Since φ m, t decays exponentially to 0 in 0, as m, we can see that df dt t = = m= λ m φ m, t + φ m, t φ m, t Φ t dφ m, t. dt m= dφm, Therefore, f is non-increasing and bounded below, i.e., a = lim t ft exists. From the fact that for all N N, Φ N t as t, + λ together with 60, we can see that Φ t + λ as t. which eventually implies that for each m N, dφm, dt t 0 as t, i.e., lim λ m + t Φ t dt t φ m, t = 0. 6 Moreover, since we have φ m, t max k=,...,k0 {M ζ k e λ λt } for all m with ζ = and inf t 0 Φ t M, 3
32 we finally obtain, together with 6, which implies that lim t φ, t = for some v {±ψ }. u t 4 Numerical experiments lim Φ t = t λ +, λ +, and λ + v in L Ω as t We will now present a few numerical experiments to confirm those analyzed properties of our proposed framework. 4. The gradient descent method In Figure,, 3 and 4, we show the first 5 eigenfunctions of the Laplace operator corresponding to their eigenvalues in the increasing order on various domains that are subsets of,, in R. We solved 4 using the FDM Finite Difference Method on uniform grids of size 8 8 including the boundaries, i.e., we represented,, as a uniform grid of size 8 8 and set the interior of each domain to have value and the complement of the interior to have value 0. As was seen that the solution u, t to 4 converges to an eigenfunction as t, corresponding to the smallest eigenvalue, no eigenvalue estimate was necessary. We set dt = t n+ t n = 0.7 h and dx = dy = h = 0.05 = 80, and used utn ut n < 0 3 ut n as a stopping criterion for all the numerical simulations. When the algorithm stops, ut n is set to be the eigenvalue corresponding to the eigenfunction u, t n, Using MATLAB, at the n th iteration, representing ut n as an 8 8 matrix, ut n+ is computed by where ut n+ = ut n + dt ut n ut n, 6 ut n ut n = 4 circshiftut n, [0, ] + 4 circshiftut n, [0, ] + 4 circshiftut n, [, 0] + 4 circshiftut n, [, 0] ut n. When considering various domains Ω, we create a mask, which is an 8 8 matrix representing Ω, and multiply ut n, ut n+ and ut n by the mask in 6, which makes this method extremely useful since we can deal with various domains by defining the mask matrix only. 3
33 Figure : The first 5 eigenfunctions of the Laplace operator on an L shape domain, which is a union of three unit squares in R, i.e.,,, \ [0,, 0], corresponding to their eigenvalues in the increasing order. The last figure visualizes the domain, a uniform grid of size 8 8 including the boundary. The 8th and 9th eigenfunctions correspond to the same eigenvalue The 8th and 9th eigenfunctions correspond to the same eigenvalue , The 3rd and 4th eigenfunctions correspond to the same eigenvalue To have rough estimates of eigenvalues and eigenfunctions, we can solve 4 on coarser grids. For example, since the L shape domain in Fig. has a simple structure and there are many previous works e.g. [3], [3], [4] on computing eigenvalues and eigenvectors of the L shape domain in the literature, we indeed observed that 4 on a uniform grid of size computed rough estimates of eigenvalues and eigenfunctions fast. One thing that we would like to emphasize in Fig. is that our method provides as accurate results as possible on uniform grids. To see this, we would like to take a closer look at the 3rd and 4th eigenfunctions and corresponding eigenvalues. It is known that ux, y = sinπx sinπy for x, y,, \ [0,, 0] is an eigenfunction corresponding to the eigenvalue π. If we discretize this eigenfunction on a uniform grid of size 8 8 to have u d 3x i, y j = sinπx i sinπy j and compare it with the 3rd eigenfunction u 3 that we computed in Fig., then we have ũ d 3 ũ 3 = max ũ d 3x i, y j ũ 3 x i, y j 0 4, i,j 8 where ũ d 3, ũ 3 are the L normalizations of u d 3, u 3, and i,j d ũ d 3x i, y j ũ d 3x i, y j λ 3 0 4, where d is the discrete Laplace operator on the uniform grid and λ 3 is the computed eigenvalue λ 3 = u 3. The same is observed for the 4th eigenfunction, as well. As for the 8th and 9th eigenfunctions, we know that the corresponding eigenspace is spanned by the two simpler eigenfunctions sinπx sinπy and sinπx sinπy than the 8th and 9th eigenfunctions u 8, u 9 in Fig.. The true eigenvalue is 5π. By setting ũ d 8, ũ d 9 to be the discretized and L normalized eigenfunctions from the true simpler ones, we computed the projections v 8, v 9 of u 8, u 9 onto the eigenspace spanned by {ũ d 8, ũ d 9} and computed u 8 v 8 and u 9 v 9 and observed that the L norms are about 0 4 implying that {u 8, u 9 } is indeed a basis for the same eigenspace. After normalizing the projections v 8, v 9 by their L 33
34 norms, we also observed i,j d ṽ 8 x i, y j ṽ 8 x i, y j λ 8 0 4, where λ 8 is the computed eigenvalue u 8, and same for ṽ 9. However, when we compare computed eigenfunctions with true eigenfunctions of the form sinnπx sinmπy on the fixed uniform grid of size 8 8, we observed that the accuracy of computed eigenvalues is still preserved as we increase n, m, but the accuracy of computed eigenfunctions deteriorates. Figure : The first 5 eigenfunctions of the Laplace operator on an annulus domain corresponding to their eigenvalues in the increasing order. The last figure visualizes the domain. It s clear which eigenfunctions should correspond to the same eigenvalues. As was shown in Fig., orthogonal eigenfunctions corresponding to the same eigenvalues that are rotations of each other can be expected when the domain is rotationally invariant. However, due to the uniform grid that we use and depending on the rotated angle, we observed that estimated eigenvalues in Fig. can be slightly different for such eigenfunctions unlike the L shape domain in Fig.. In Fig. 3 and Fig. 4, we computed eigenfunctions of the Laplace operator on other domains. Especially, we created an domain with no symmetry in Fig. 4. Figure 3: The first 5 eigenfunctions of the Laplace operator on a domain, visualized in the last, corresponding to their eigenvalues in the increasing order. Since we provided a generalized eigenvalue problem in the previous work [?] as an application, we can also see how to apply the Newton s method 3 to solve Ax = λbx by 34
35 Figure 4: The first 5 eigenfunctions of the Laplace operator on a domain, visualized in the last, corresponding to their eigenvalues in the increasing order. minimizing F A,B, where A, B are symmetric and B is positive definite and F A,B is given by F A,B x = x, Ax + x, Bx x, Bx, with some > 0 making A + B positive definite. We note that minimizing F A,B by the Newton s method with x B := x T Bx generates a sequence {x k } satisfying [ A + B + Bxk Bxk T ] x k+ = Bx k, 63 x k B x k B x k B x k B x k B which can be rewritten as [ Bxk Bxk T ] A λ k B + + λ k x k+ = Bx k 64 x k B x k B x k B with λ k = x k B. Then, depending on how to update λ k in 64, either by λ k = x k B or by λ k = xt k Ax k x T k Bx, we end up with either 63 or the type of the Rayleigh k Quotient Iteration. However, when we tested the two different eigenvalue update rules above with randomly selected positive definite matrices A, B, we observed numerically that the update rule λ = x B found small eigenvalues much more often than the update rule λ = xt Ax x T Bx. In found the true smallest eigenvalues much more often fact, we noticed that λ = x B than λ = xt Ax x T Bx as can be seen in Fig. 5, which is interesting to confirm that λ = x B is from minimizing the functional F A,B trying to find the smallest eigenvalue. We also performed the same experiment as in Fig. 5 with different sizes, i.e., A i s and B i s are of size shown in blue and of size shown in red in Fig. 6. Solid lines indicate results using the update rule λ = x B and lines with circular dots indicate results using the other update rule λ = xt Ax x T Bx λ = x B interesting to observe that the update rule λ = xt Ax. Fig. 6 clearly shows that the updated rule tends to find smaller eigenvalues than the update rule λ = xt Ax. It is x T Bx never found the smallest eigenvalues. x T Bx Secondly, we will perform the same numerical experiments as was done with the gradient descent method: finding eigenfunctions of the Laplace operator on various domains. Especially, we will show a comparison result with the L shape experiment. When computing a few eigenfunctions corresponding to their eigenvalues in the increasing order, a drawback of 35
36 Figure 5: 00 trials with random positive definite pairs A i, B i, i =,,..., 00. A i s and B i s are of size 0 0. For each pair A i, B i, we test the two eigenvalue update rules 000 times starting from random vectors. The red and blue colors indicate the two update rules: λ = x B in red and λ = xt Ax in blue. Top Row: For each i =,..., 00, we plot x T Bx the number of times each update rule finds the true smallest eigenvalue λ min. We counted the number of eigenvalues computed whose difference from λ min is less than 0 3. Middle Row: For each i =,..., 00, we plot the maximum eigenvalue among the 000 times trials. Bottom Row: For each i =,..., 00, we plot the mean eigenvalue among the 000 times trials. the gradient descent method, besides its rate of convergence, is that one needs to apply any type of the Gram-Schmidt process at every iteration to make sure that the next eigenfunction being searched is orthogonal to the previous ones. This slows down the whole process of computing the first n eigenfunctions quite a bit as n increases. On the other hand, if an estimated eigenvalue λ k = u k at the point u k is close enough to a true eigenvalue, then the Newton s method would generate a convergent sequence to a nearby critical point, i.e., a corresponding eigenfunction, which implies that it may be unnecessary to apply the Gram-Schmidt process. Hence, when computing many eigenfunctions, the Newton s method can speed up the total time spent significantly if we can find good starting points. Without any fast and efficient ideas combined, we would like to compare the total time spent by the gradient descent method with that by the Newton s method. to find the first n eigenfunctions. We computed the first 00 eigenfunctions of the Laplace operator on the L shape domain, 0, [0, 0, on a uniform grid of size 8 8 using the gradient descent method and the Newton s method, separately. Having computed the first k unit eigenfunctions ϕ,..., ϕ k an knowing that the gradient descent method provides convergence to a global minimizer, we can find u 0 satisfying the constraints ϕ j, u 0 = 0, j =,,..., k and u0 + u 0 < ɛ 65 u 0 with ɛ = 0. or 0.0, in which case u 0 is close to the k + st smallest eigenvalue, and consider u 0 as a good starting point for both the gradient descent method and the Newton s 36
37 Figure 6: The same experiments as in Fig. 5 with different matrix sizes. The red and blue colors indicate the two different sizes: in blue and in red. Solid lines are from the update rule λ = x B and the lines with circular dots are from the update rule λ = xt Ax x T Bx. Top Row: We counted the number of times that the true smallest eigenvalues were computed among the 000 times trials for each random positive definite pair A i, B i, i =,..., 00. Only the counts using the update rule λ = x B are shown because the update rule λ = xt Ax x T Bx never found the smallest eigenvalues. Middle Row: The maximum eigenvalue among the 000 times trials for each A i, B i. Bottom Row: The mean eigenvalue among the 000 times trials for each A i, B i. method to find the k+ st eigenfunction for comparison. Then, we measure the time spent for the gradient descent method to converge starting from u 0 under the orthogonality constraints and also the time spent for the Newton s method to converge starting from the same u 0 without any constraints. In Fig. 7, we present the amount of time elapsed for both cases: the gradient descent method with the orthogonality constraints vs the Newton s method 3 without any constraints. We can observe in this experiment that a linear increase in time due to the linear increase in the number of constraints for the gradient descent method is clear and that good starting points even for the gradient method can reduce the computational time significantly. In Fig. 8, we compare the Newton s method 3 with the Rayleigh quotient iteration with the same starting points used in Fig. 7. As was expected, the gradient descent method with the orthogonality constraints takes much more time to converge, depending on the starting point and on the number of constraints k. On the other hand, the time spent for the Newton s method to compute each eigenfunction depends only on the starting point, showing that almost the same amount of time is required for convergence in computing each eigenfunction. Moreover, the Newton s method 3 presents a simliar rate of convergence as the Rayleigh quotient iteration. In addition, the absence of the orthogonality constraints doesn t allow the Newton s method to find orthogonal eigenfunctions corresponding to the same eigenvalues of multiplicity greater than. However, their linear independence is guar- 37
SPECTRAL THEOREM FOR COMPACT SELF-ADJOINT OPERATORS
SPECTRAL THEOREM FOR COMPACT SELF-ADJOINT OPERATORS G. RAMESH Contents Introduction 1 1. Bounded Operators 1 1.3. Examples 3 2. Compact Operators 5 2.1. Properties 6 3. The Spectral Theorem 9 3.3. Self-adjoint
More informationDS-GA 1002 Lecture notes 0 Fall Linear Algebra. These notes provide a review of basic concepts in linear algebra.
DS-GA 1002 Lecture notes 0 Fall 2016 Linear Algebra These notes provide a review of basic concepts in linear algebra. 1 Vector spaces You are no doubt familiar with vectors in R 2 or R 3, i.e. [ ] 1.1
More informationMath 108b: Notes on the Spectral Theorem
Math 108b: Notes on the Spectral Theorem From section 6.3, we know that every linear operator T on a finite dimensional inner product space V has an adjoint. (T is defined as the unique linear operator
More informationMultiparameter eigenvalue problem as a structured eigenproblem
Multiparameter eigenvalue problem as a structured eigenproblem Bor Plestenjak Department of Mathematics University of Ljubljana This is joint work with M Hochstenbach Będlewo, 2932007 1/28 Overview Introduction
More informationEigenvalues and Eigenvectors
/88 Chia-Ping Chen Department of Computer Science and Engineering National Sun Yat-sen University Linear Algebra Eigenvalue Problem /88 Eigenvalue Equation By definition, the eigenvalue equation for matrix
More informationMath 489AB Exercises for Chapter 2 Fall Section 2.3
Math 489AB Exercises for Chapter 2 Fall 2008 Section 2.3 2.3.3. Let A M n (R). Then the eigenvalues of A are the roots of the characteristic polynomial p A (t). Since A is real, p A (t) is a polynomial
More informationLecture 8 : Eigenvalues and Eigenvectors
CPS290: Algorithmic Foundations of Data Science February 24, 2017 Lecture 8 : Eigenvalues and Eigenvectors Lecturer: Kamesh Munagala Scribe: Kamesh Munagala Hermitian Matrices It is simpler to begin with
More informationIn particular, if A is a square matrix and λ is one of its eigenvalues, then we can find a non-zero column vector X with
Appendix: Matrix Estimates and the Perron-Frobenius Theorem. This Appendix will first present some well known estimates. For any m n matrix A = [a ij ] over the real or complex numbers, it will be convenient
More informationChapter 3 Transformations
Chapter 3 Transformations An Introduction to Optimization Spring, 2014 Wei-Ta Chu 1 Linear Transformations A function is called a linear transformation if 1. for every and 2. for every If we fix the bases
More informationSymmetric matrices and dot products
Symmetric matrices and dot products Proposition An n n matrix A is symmetric iff, for all x, y in R n, (Ax) y = x (Ay). Proof. If A is symmetric, then (Ax) y = x T A T y = x T Ay = x (Ay). If equality
More informationLinear Algebra: Matrix Eigenvalue Problems
CHAPTER8 Linear Algebra: Matrix Eigenvalue Problems Chapter 8 p1 A matrix eigenvalue problem considers the vector equation (1) Ax = λx. 8.0 Linear Algebra: Matrix Eigenvalue Problems Here A is a given
More informationNORMS ON SPACE OF MATRICES
NORMS ON SPACE OF MATRICES. Operator Norms on Space of linear maps Let A be an n n real matrix and x 0 be a vector in R n. We would like to use the Picard iteration method to solve for the following system
More informationChapter 4 Euclid Space
Chapter 4 Euclid Space Inner Product Spaces Definition.. Let V be a real vector space over IR. A real inner product on V is a real valued function on V V, denoted by (, ), which satisfies () (x, y) = (y,
More informationEigenvalues and Eigenfunctions of the Laplacian
The Waterloo Mathematics Review 23 Eigenvalues and Eigenfunctions of the Laplacian Mihai Nica University of Waterloo mcnica@uwaterloo.ca Abstract: The problem of determining the eigenvalues and eigenvectors
More informationNonlinear Programming Algorithms Handout
Nonlinear Programming Algorithms Handout Michael C. Ferris Computer Sciences Department University of Wisconsin Madison, Wisconsin 5376 September 9 1 Eigenvalues The eigenvalues of a matrix A C n n are
More information1. General Vector Spaces
1.1. Vector space axioms. 1. General Vector Spaces Definition 1.1. Let V be a nonempty set of objects on which the operations of addition and scalar multiplication are defined. By addition we mean a rule
More informationLINEAR ALGEBRA BOOT CAMP WEEK 4: THE SPECTRAL THEOREM
LINEAR ALGEBRA BOOT CAMP WEEK 4: THE SPECTRAL THEOREM Unless otherwise stated, all vector spaces in this worksheet are finite dimensional and the scalar field F is R or C. Definition 1. A linear operator
More information1. Nonlinear Equations. This lecture note excerpted parts from Michael Heath and Max Gunzburger. f(x) = 0
Numerical Analysis 1 1. Nonlinear Equations This lecture note excerpted parts from Michael Heath and Max Gunzburger. Given function f, we seek value x for which where f : D R n R n is nonlinear. f(x) =
More informationMATH 423 Linear Algebra II Lecture 33: Diagonalization of normal operators.
MATH 423 Linear Algebra II Lecture 33: Diagonalization of normal operators. Adjoint operator and adjoint matrix Given a linear operator L on an inner product space V, the adjoint of L is a transformation
More informationLinear Algebra Review
Chapter 1 Linear Algebra Review It is assumed that you have had a course in linear algebra, and are familiar with matrix multiplication, eigenvectors, etc. I will review some of these terms here, but quite
More informationMATH 583A REVIEW SESSION #1
MATH 583A REVIEW SESSION #1 BOJAN DURICKOVIC 1. Vector Spaces Very quick review of the basic linear algebra concepts (see any linear algebra textbook): (finite dimensional) vector space (or linear space),
More informationLecture 7. Econ August 18
Lecture 7 Econ 2001 2015 August 18 Lecture 7 Outline First, the theorem of the maximum, an amazing result about continuity in optimization problems. Then, we start linear algebra, mostly looking at familiar
More informationJordan Normal Form. Chapter Minimal Polynomials
Chapter 8 Jordan Normal Form 81 Minimal Polynomials Recall p A (x) =det(xi A) is called the characteristic polynomial of the matrix A Theorem 811 Let A M n Then there exists a unique monic polynomial q
More informationUniversity of Colorado at Denver Mathematics Department Applied Linear Algebra Preliminary Exam With Solutions 16 January 2009, 10:00 am 2:00 pm
University of Colorado at Denver Mathematics Department Applied Linear Algebra Preliminary Exam With Solutions 16 January 2009, 10:00 am 2:00 pm Name: The proctor will let you read the following conditions
More informationA Brief Outline of Math 355
A Brief Outline of Math 355 Lecture 1 The geometry of linear equations; elimination with matrices A system of m linear equations with n unknowns can be thought of geometrically as m hyperplanes intersecting
More informationj=1 u 1jv 1j. 1/ 2 Lemma 1. An orthogonal set of vectors must be linearly independent.
Lecture Notes: Orthogonal and Symmetric Matrices Yufei Tao Department of Computer Science and Engineering Chinese University of Hong Kong taoyf@cse.cuhk.edu.hk Orthogonal Matrix Definition. Let u = [u
More informationSymmetric Matrices and Eigendecomposition
Symmetric Matrices and Eigendecomposition Robert M. Freund January, 2014 c 2014 Massachusetts Institute of Technology. All rights reserved. 1 2 1 Symmetric Matrices and Convexity of Quadratic Functions
More informationThe goal of this chapter is to study linear systems of ordinary differential equations: dt,..., dx ) T
1 1 Linear Systems The goal of this chapter is to study linear systems of ordinary differential equations: ẋ = Ax, x(0) = x 0, (1) where x R n, A is an n n matrix and ẋ = dx ( dt = dx1 dt,..., dx ) T n.
More informationMATH 5720: Unconstrained Optimization Hung Phan, UMass Lowell September 13, 2018
MATH 57: Unconstrained Optimization Hung Phan, UMass Lowell September 13, 18 1 Global and Local Optima Let a function f : S R be defined on a set S R n Definition 1 (minimizers and maximizers) (i) x S
More informationGQE ALGEBRA PROBLEMS
GQE ALGEBRA PROBLEMS JAKOB STREIPEL Contents. Eigenthings 2. Norms, Inner Products, Orthogonality, and Such 6 3. Determinants, Inverses, and Linear (In)dependence 4. (Invariant) Subspaces 3 Throughout
More informationPreliminary/Qualifying Exam in Numerical Analysis (Math 502a) Spring 2012
Instructions Preliminary/Qualifying Exam in Numerical Analysis (Math 502a) Spring 2012 The exam consists of four problems, each having multiple parts. You should attempt to solve all four problems. 1.
More informationHere is an example of a block diagonal matrix with Jordan Blocks on the diagonal: J
Class Notes 4: THE SPECTRAL RADIUS, NORM CONVERGENCE AND SOR. Math 639d Due Date: Feb. 7 (updated: February 5, 2018) In the first part of this week s reading, we will prove Theorem 2 of the previous class.
More informationReal Analysis Problems
Real Analysis Problems Cristian E. Gutiérrez September 14, 29 1 1 CONTINUITY 1 Continuity Problem 1.1 Let r n be the sequence of rational numbers and Prove that f(x) = 1. f is continuous on the irrationals.
More informationSeptember 26, 2017 EIGENVALUES AND EIGENVECTORS. APPLICATIONS
September 26, 207 EIGENVALUES AND EIGENVECTORS. APPLICATIONS RODICA D. COSTIN Contents 4. Eigenvalues and Eigenvectors 3 4.. Motivation 3 4.2. Diagonal matrices 3 4.3. Example: solving linear differential
More informationOptimization Theory. A Concise Introduction. Jiongmin Yong
October 11, 017 16:5 ws-book9x6 Book Title Optimization Theory 017-08-Lecture Notes page 1 1 Optimization Theory A Concise Introduction Jiongmin Yong Optimization Theory 017-08-Lecture Notes page Optimization
More informationw T 1 w T 2. w T n 0 if i j 1 if i = j
Lyapunov Operator Let A F n n be given, and define a linear operator L A : C n n C n n as L A (X) := A X + XA Suppose A is diagonalizable (what follows can be generalized even if this is not possible -
More informationThe Eigenvalue Problem: Perturbation Theory
Jim Lambers MAT 610 Summer Session 2009-10 Lecture 13 Notes These notes correspond to Sections 7.2 and 8.1 in the text. The Eigenvalue Problem: Perturbation Theory The Unsymmetric Eigenvalue Problem Just
More informationLinear Algebra 1. M.T.Nair Department of Mathematics, IIT Madras. and in that case x is called an eigenvector of T corresponding to the eigenvalue λ.
Linear Algebra 1 M.T.Nair Department of Mathematics, IIT Madras 1 Eigenvalues and Eigenvectors 1.1 Definition and Examples Definition 1.1. Let V be a vector space (over a field F) and T : V V be a linear
More informationNumerical Methods for Solving Large Scale Eigenvalue Problems
Peter Arbenz Computer Science Department, ETH Zürich E-mail: arbenz@inf.ethz.ch arge scale eigenvalue problems, Lecture 2, February 28, 2018 1/46 Numerical Methods for Solving Large Scale Eigenvalue Problems
More informationChapter 7 Iterative Techniques in Matrix Algebra
Chapter 7 Iterative Techniques in Matrix Algebra Per-Olof Persson persson@berkeley.edu Department of Mathematics University of California, Berkeley Math 128B Numerical Analysis Vector Norms Definition
More informationIterative Reweighted Minimization Methods for l p Regularized Unconstrained Nonlinear Programming
Iterative Reweighted Minimization Methods for l p Regularized Unconstrained Nonlinear Programming Zhaosong Lu October 5, 2012 (Revised: June 3, 2013; September 17, 2013) Abstract In this paper we study
More informationIr O D = D = ( ) Section 2.6 Example 1. (Bottom of page 119) dim(v ) = dim(l(v, W )) = dim(v ) dim(f ) = dim(v )
Section 3.2 Theorem 3.6. Let A be an m n matrix of rank r. Then r m, r n, and, by means of a finite number of elementary row and column operations, A can be transformed into the matrix ( ) Ir O D = 1 O
More informationScientific Computing: An Introductory Survey
Scientific Computing: An Introductory Survey Chapter 4 Eigenvalue Problems Prof. Michael T. Heath Department of Computer Science University of Illinois at Urbana-Champaign Copyright c 2002. Reproduction
More informationEigenvalue Problems. Eigenvalue problems occur in many areas of science and engineering, such as structural analysis
Eigenvalue Problems Eigenvalue problems occur in many areas of science and engineering, such as structural analysis Eigenvalues also important in analyzing numerical methods Theory and algorithms apply
More informationLinear Algebra 2 Spectral Notes
Linear Algebra 2 Spectral Notes In what follows, V is an inner product vector space over F, where F = R or C. We will use results seen so far; in particular that every linear operator T L(V ) has a complex
More informationComputational Methods. Eigenvalues and Singular Values
Computational Methods Eigenvalues and Singular Values Manfred Huber 2010 1 Eigenvalues and Singular Values Eigenvalues and singular values describe important aspects of transformations and of data relations
More informationEigenvalues and eigenvectors
Chapter 6 Eigenvalues and eigenvectors An eigenvalue of a square matrix represents the linear operator as a scaling of the associated eigenvector, and the action of certain matrices on general vectors
More informationOctober 4, 2017 EIGENVALUES AND EIGENVECTORS. APPLICATIONS
October 4, 207 EIGENVALUES AND EIGENVECTORS. APPLICATIONS RODICA D. COSTIN Contents 4. Eigenvalues and Eigenvectors 3 4.. Motivation 3 4.2. Diagonal matrices 3 4.3. Example: solving linear differential
More informationSymmetric and self-adjoint matrices
Symmetric and self-adjoint matrices A matrix A in M n (F) is called symmetric if A T = A, ie A ij = A ji for each i, j; and self-adjoint if A = A, ie A ij = A ji or each i, j Note for A in M n (R) that
More informationMath113: Linear Algebra. Beifang Chen
Math3: Linear Algebra Beifang Chen Spring 26 Contents Systems of Linear Equations 3 Systems of Linear Equations 3 Linear Systems 3 2 Geometric Interpretation 3 3 Matrices of Linear Systems 4 4 Elementary
More informationLecture Notes on PDEs
Lecture Notes on PDEs Alberto Bressan February 26, 2012 1 Elliptic equations Let IR n be a bounded open set Given measurable functions a ij, b i, c : IR, consider the linear, second order differential
More informationAMS526: Numerical Analysis I (Numerical Linear Algebra)
AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 16: Eigenvalue Problems; Similarity Transformations Xiangmin Jiao Stony Brook University Xiangmin Jiao Numerical Analysis I 1 / 18 Eigenvalue
More informationUniversity of Colorado Denver Department of Mathematical and Statistical Sciences Applied Linear Algebra Ph.D. Preliminary Exam January 23, 2015
University of Colorado Denver Department of Mathematical and Statistical Sciences Applied Linear Algebra PhD Preliminary Exam January 23, 2015 Name: Exam Rules: This exam lasts 4 hours and consists of
More informationMatrices and Vectors. Definition of Matrix. An MxN matrix A is a two-dimensional array of numbers A =
30 MATHEMATICS REVIEW G A.1.1 Matrices and Vectors Definition of Matrix. An MxN matrix A is a two-dimensional array of numbers A = a 11 a 12... a 1N a 21 a 22... a 2N...... a M1 a M2... a MN A matrix can
More informationSPRING 2006 PRELIMINARY EXAMINATION SOLUTIONS
SPRING 006 PRELIMINARY EXAMINATION SOLUTIONS 1A. Let G be the subgroup of the free abelian group Z 4 consisting of all integer vectors (x, y, z, w) such that x + 3y + 5z + 7w = 0. (a) Determine a linearly
More informationEigenvalues and Eigenvectors
Chapter 1 Eigenvalues and Eigenvectors Among problems in numerical linear algebra, the determination of the eigenvalues and eigenvectors of matrices is second in importance only to the solution of linear
More informationExamples include: (a) the Lorenz system for climate and weather modeling (b) the Hodgkin-Huxley system for neuron modeling
1 Introduction Many natural processes can be viewed as dynamical systems, where the system is represented by a set of state variables and its evolution governed by a set of differential equations. Examples
More informationNumerical Methods I Eigenvalue Problems
Numerical Methods I Eigenvalue Problems Aleksandar Donev Courant Institute, NYU 1 donev@courant.nyu.edu 1 MATH-GA 2011.003 / CSCI-GA 2945.003, Fall 2014 October 2nd, 2014 A. Donev (Courant Institute) Lecture
More informationFundamentals of Engineering Analysis (650163)
Philadelphia University Faculty of Engineering Communications and Electronics Engineering Fundamentals of Engineering Analysis (6563) Part Dr. Omar R Daoud Matrices: Introduction DEFINITION A matrix is
More informationExercise Solutions to Functional Analysis
Exercise Solutions to Functional Analysis Note: References refer to M. Schechter, Principles of Functional Analysis Exersize that. Let φ,..., φ n be an orthonormal set in a Hilbert space H. Show n f n
More information08a. Operators on Hilbert spaces. 1. Boundedness, continuity, operator norms
(February 24, 2017) 08a. Operators on Hilbert spaces Paul Garrett garrett@math.umn.edu http://www.math.umn.edu/ garrett/ [This document is http://www.math.umn.edu/ garrett/m/real/notes 2016-17/08a-ops
More informationEigenvalues and Eigenvectors
Contents Eigenvalues and Eigenvectors. Basic Concepts. Applications of Eigenvalues and Eigenvectors 8.3 Repeated Eigenvalues and Symmetric Matrices 3.4 Numerical Determination of Eigenvalues and Eigenvectors
More informationAnalysis Preliminary Exam Workshop: Hilbert Spaces
Analysis Preliminary Exam Workshop: Hilbert Spaces 1. Hilbert spaces A Hilbert space H is a complete real or complex inner product space. Consider complex Hilbert spaces for definiteness. If (, ) : H H
More informationIterative methods for positive definite linear systems with a complex shift
Iterative methods for positive definite linear systems with a complex shift William McLean, University of New South Wales Vidar Thomée, Chalmers University November 4, 2011 Outline 1. Numerical solution
More informationQuadratic forms. Here. Thus symmetric matrices are diagonalizable, and the diagonalization can be performed by means of an orthogonal matrix.
Quadratic forms 1. Symmetric matrices An n n matrix (a ij ) n ij=1 with entries on R is called symmetric if A T, that is, if a ij = a ji for all 1 i, j n. We denote by S n (R) the set of all n n symmetric
More informationEigenvectors and Hermitian Operators
7 71 Eigenvalues and Eigenvectors Basic Definitions Let L be a linear operator on some given vector space V A scalar λ and a nonzero vector v are referred to, respectively, as an eigenvalue and corresponding
More informationTheorem 1. ẋ = Ax is globally exponentially stable (GES) iff A is Hurwitz (i.e., max(re(σ(a))) < 0).
Linear Systems Notes Lecture Proposition. A M n (R) is positive definite iff all nested minors are greater than or equal to zero. n Proof. ( ): Positive definite iff λ i >. Let det(a) = λj and H = {x D
More informationChapter 3. Matrices. 3.1 Matrices
40 Chapter 3 Matrices 3.1 Matrices Definition 3.1 Matrix) A matrix A is a rectangular array of m n real numbers {a ij } written as a 11 a 12 a 1n a 21 a 22 a 2n A =.... a m1 a m2 a mn The array has m rows
More informationPart 3: Trust-region methods for unconstrained optimization. Nick Gould (RAL)
Part 3: Trust-region methods for unconstrained optimization Nick Gould (RAL) minimize x IR n f(x) MSc course on nonlinear optimization UNCONSTRAINED MINIMIZATION minimize x IR n f(x) where the objective
More informationLecture 7: Positive Semidefinite Matrices
Lecture 7: Positive Semidefinite Matrices Rajat Mittal IIT Kanpur The main aim of this lecture note is to prepare your background for semidefinite programming. We have already seen some linear algebra.
More informationTOEPLITZ OPERATORS. Toeplitz studied infinite matrices with NW-SE diagonals constant. f e C :
TOEPLITZ OPERATORS EFTON PARK 1. Introduction to Toeplitz Operators Otto Toeplitz lived from 1881-1940 in Goettingen, and it was pretty rough there, so he eventually went to Palestine and eventually contracted
More informationSecond Order Optimization Algorithms I
Second Order Optimization Algorithms I Yinyu Ye Department of Management Science and Engineering Stanford University Stanford, CA 94305, U.S.A. http://www.stanford.edu/ yyye Chapters 7, 8, 9 and 10 1 The
More informationA Perron-type theorem on the principal eigenvalue of nonsymmetric elliptic operators
A Perron-type theorem on the principal eigenvalue of nonsymmetric elliptic operators Lei Ni And I cherish more than anything else the Analogies, my most trustworthy masters. They know all the secrets of
More informationMath 489AB Exercises for Chapter 1 Fall Section 1.0
Math 489AB Exercises for Chapter 1 Fall 2008 Section 1.0 1.0.2 We want to maximize x T Ax subject to the condition x T x = 1. We use the method of Lagrange multipliers. Let f(x) = x T Ax and g(x) = x T
More informationBoolean Inner-Product Spaces and Boolean Matrices
Boolean Inner-Product Spaces and Boolean Matrices Stan Gudder Department of Mathematics, University of Denver, Denver CO 80208 Frédéric Latrémolière Department of Mathematics, University of Denver, Denver
More information(a) If A is a 3 by 4 matrix, what does this tell us about its nullspace? Solution: dim N(A) 1, since rank(a) 3. Ax =
. (5 points) (a) If A is a 3 by 4 matrix, what does this tell us about its nullspace? dim N(A), since rank(a) 3. (b) If we also know that Ax = has no solution, what do we know about the rank of A? C(A)
More informationLINEAR ALGEBRA SUMMARY SHEET.
LINEAR ALGEBRA SUMMARY SHEET RADON ROSBOROUGH https://intuitiveexplanationscom/linear-algebra-summary-sheet/ This document is a concise collection of many of the important theorems of linear algebra, organized
More informationFunctional Analysis. Franck Sueur Metric spaces Definitions Completeness Compactness Separability...
Functional Analysis Franck Sueur 2018-2019 Contents 1 Metric spaces 1 1.1 Definitions........................................ 1 1.2 Completeness...................................... 3 1.3 Compactness......................................
More informationCS 246 Review of Linear Algebra 01/17/19
1 Linear algebra In this section we will discuss vectors and matrices. We denote the (i, j)th entry of a matrix A as A ij, and the ith entry of a vector as v i. 1.1 Vectors and vector operations A vector
More informationSuppose that the approximate solutions of Eq. (1) satisfy the condition (3). Then (1) if η = 0 in the algorithm Trust Region, then lim inf.
Maria Cameron 1. Trust Region Methods At every iteration the trust region methods generate a model m k (p), choose a trust region, and solve the constraint optimization problem of finding the minimum of
More informationSPECTRAL THEOREM FOR SYMMETRIC OPERATORS WITH COMPACT RESOLVENT
SPECTRAL THEOREM FOR SYMMETRIC OPERATORS WITH COMPACT RESOLVENT Abstract. These are the letcure notes prepared for the workshop on Functional Analysis and Operator Algebras to be held at NIT-Karnataka,
More informationc c c c c c c c c c a 3x3 matrix C= has a determinant determined by
Linear Algebra Determinants and Eigenvalues Introduction: Many important geometric and algebraic properties of square matrices are associated with a single real number revealed by what s known as the determinant.
More information1. Diagonalize the matrix A if possible, that is, find an invertible matrix P and a diagonal
. Diagonalize the matrix A if possible, that is, find an invertible matrix P and a diagonal 3 9 matrix D such that A = P DP, for A =. 3 4 3 (a) P = 4, D =. 3 (b) P = 4, D =. (c) P = 4 8 4, D =. 3 (d) P
More informationMatrix Theory. A.Holst, V.Ufnarovski
Matrix Theory AHolst, VUfnarovski 55 HINTS AND ANSWERS 9 55 Hints and answers There are two different approaches In the first one write A as a block of rows and note that in B = E ij A all rows different
More informationMath 102, Winter Final Exam Review. Chapter 1. Matrices and Gaussian Elimination
Math 0, Winter 07 Final Exam Review Chapter. Matrices and Gaussian Elimination { x + x =,. Different forms of a system of linear equations. Example: The x + 4x = 4. [ ] [ ] [ ] vector form (or the column
More informationMathematical Methods wk 2: Linear Operators
John Magorrian, magog@thphysoxacuk These are work-in-progress notes for the second-year course on mathematical methods The most up-to-date version is available from http://www-thphysphysicsoxacuk/people/johnmagorrian/mm
More informationACM 104. Homework Set 5 Solutions. February 21, 2001
ACM 04 Homework Set 5 Solutions February, 00 Franklin Chapter 4, Problem 4, page 0 Let A be an n n non-hermitian matrix Suppose that A has distinct eigenvalues λ,, λ n Show that A has the eigenvalues λ,,
More informationGeometric Aspects of Sturm-Liouville Problems I. Structures on Spaces of Boundary Conditions
Geometric Aspects of Sturm-Liouville Problems I. Structures on Spaces of Boundary Conditions QINGKAI KONG HONGYOU WU and ANTON ZETTL Abstract. We consider some geometric aspects of regular Sturm-Liouville
More informationMath 5630: Iterative Methods for Systems of Equations Hung Phan, UMass Lowell March 22, 2018
1 Linear Systems Math 5630: Iterative Methods for Systems of Equations Hung Phan, UMass Lowell March, 018 Consider the system 4x y + z = 7 4x 8y + z = 1 x + y + 5z = 15. We then obtain x = 1 4 (7 + y z)
More informationMathematical Optimisation, Chpt 2: Linear Equations and inequalities
Mathematical Optimisation, Chpt 2: Linear Equations and inequalities Peter J.C. Dickinson p.j.c.dickinson@utwente.nl http://dickinson.website version: 12/02/18 Monday 5th February 2018 Peter J.C. Dickinson
More informationThe following definition is fundamental.
1. Some Basics from Linear Algebra With these notes, I will try and clarify certain topics that I only quickly mention in class. First and foremost, I will assume that you are familiar with many basic
More information1. Introduction. Consider the following quadratically constrained quadratic optimization problem:
ON LOCAL NON-GLOBAL MINIMIZERS OF QUADRATIC OPTIMIZATION PROBLEM WITH A SINGLE QUADRATIC CONSTRAINT A. TAATI AND M. SALAHI Abstract. In this paper, we consider the nonconvex quadratic optimization problem
More informationIMPORTANT DEFINITIONS AND THEOREMS REFERENCE SHEET
IMPORTANT DEFINITIONS AND THEOREMS REFERENCE SHEET This is a (not quite comprehensive) list of definitions and theorems given in Math 1553. Pay particular attention to the ones in red. Study Tip For each
More informationLecture Notes for Math 524
Lecture Notes for Math 524 Dr Michael Y Li October 19, 2009 These notes are based on the lecture notes of Professor James S Muldowney, the books of Hale, Copple, Coddington and Levinson, and Perko They
More informationOctober 25, 2013 INNER PRODUCT SPACES
October 25, 2013 INNER PRODUCT SPACES RODICA D. COSTIN Contents 1. Inner product 2 1.1. Inner product 2 1.2. Inner product spaces 4 2. Orthogonal bases 5 2.1. Existence of an orthogonal basis 7 2.2. Orthogonal
More informationA PROJECTED HESSIAN GAUSS-NEWTON ALGORITHM FOR SOLVING SYSTEMS OF NONLINEAR EQUATIONS AND INEQUALITIES
IJMMS 25:6 2001) 397 409 PII. S0161171201002290 http://ijmms.hindawi.com Hindawi Publishing Corp. A PROJECTED HESSIAN GAUSS-NEWTON ALGORITHM FOR SOLVING SYSTEMS OF NONLINEAR EQUATIONS AND INEQUALITIES
More informationIterative solvers for linear equations
Spectral Graph Theory Lecture 23 Iterative solvers for linear equations Daniel A. Spielman November 26, 2018 23.1 Overview In this and the next lecture, I will discuss iterative algorithms for solving
More informationHOMEWORK PROBLEMS FROM STRANG S LINEAR ALGEBRA AND ITS APPLICATIONS (4TH EDITION)
HOMEWORK PROBLEMS FROM STRANG S LINEAR ALGEBRA AND ITS APPLICATIONS (4TH EDITION) PROFESSOR STEVEN MILLER: BROWN UNIVERSITY: SPRING 2007 1. CHAPTER 1: MATRICES AND GAUSSIAN ELIMINATION Page 9, # 3: Describe
More informationMath Linear Algebra Final Exam Review Sheet
Math 15-1 Linear Algebra Final Exam Review Sheet Vector Operations Vector addition is a component-wise operation. Two vectors v and w may be added together as long as they contain the same number n of
More informationMath Matrix Algebra
Math 44 - Matrix Algebra Review notes - (Alberto Bressan, Spring 7) sec: Orthogonal diagonalization of symmetric matrices When we seek to diagonalize a general n n matrix A, two difficulties may arise:
More information