Numerical methods for matrix functions

Similar documents
Lecture 2: Computing functions of dense matrices

Krylov Subspace Methods for the Evaluation of Matrix Functions. Applications and Algorithms

Linear Algebra review Powers of a diagonalizable matrix Spectral decomposition

Lecture 10 - Eigenvalues problem

7 Planar systems of linear ODE

Linear Algebra Review (Course Notes for Math 308H - Spring 2016)

Matrix functions and their approximation. Krylov subspaces

Linear Algebra review Powers of a diagonalizable matrix Spectral decomposition

November 18, 2013 ANALYTIC FUNCTIONAL CALCULUS

AMS526: Numerical Analysis I (Numerical Linear Algebra)

Matrix Functions and their Approximation by. Polynomial methods

Chapter 7. Canonical Forms. 7.1 Eigenvalues and Eigenvectors

Linear Algebra & Geometry why is linear algebra useful in computer vision?

Definition (T -invariant subspace) Example. Example

Jordan normal form notes (version date: 11/21/07)

Linear Algebra in Actuarial Science: Slides to the lecture

Generalized eigenvector - Wikipedia, the free encyclopedia

Eigenvalue Problems and Singular Value Decomposition

Math Ordinary Differential Equations

MTH5112 Linear Algebra I MTH5212 Applied Linear Algebra (2017/2018)

Ortho-Skew and Ortho-Sym Matrix Trigonometry 1

The Eigenvalue Shift Technique and Its Eigenstructure Analysis of a Matrix

Math 102, Winter Final Exam Review. Chapter 1. Matrices and Gaussian Elimination

ACM/CMS 107 Linear Analysis & Applications Fall 2016 Assignment 4: Linear ODEs and Control Theory Due: 5th December 2016

Eigenvalue and Eigenvector Problems

CS 246 Review of Linear Algebra 01/17/19

Nonconstant Coefficients

Computing Real Logarithm of a Real Matrix

Matrix-valued functions

Math 240 Calculus III

j=1 u 1jv 1j. 1/ 2 Lemma 1. An orthogonal set of vectors must be linearly independent.

Lecture 4: Numerical solution of ordinary differential equations

Ir O D = D = ( ) Section 2.6 Example 1. (Bottom of page 119) dim(v ) = dim(l(v, W )) = dim(v ) dim(f ) = dim(v )

Numerical Methods I Eigenvalue Problems

MATHEMATICS 217 NOTES

Solutions Chapter 9. u. (c) u(t) = 1 e t + c 2 e 3 t! c 1 e t 3c 2 e 3 t. (v) (a) u(t) = c 1 e t cos 3t + c 2 e t sin 3t. (b) du

A brief introduction to ordinary differential equations

Linear Algebra and ODEs review

MATH 5524 MATRIX THEORY Problem Set 4

Linear Algebra & Geometry why is linear algebra useful in computer vision?

Solution of Linear Equations

Numerical Methods - Numerical Linear Algebra

Linear Algebra 1. M.T.Nair Department of Mathematics, IIT Madras. and in that case x is called an eigenvector of T corresponding to the eigenvalue λ.

Eigenvalues. Matrices: Geometric Interpretation. Calculating Eigenvalues

Department of Mathematics, University of California, Berkeley. GRADUATE PRELIMINARY EXAMINATION, Part A Spring Semester 2015

CAAM 335 Matrix Analysis

AMS526: Numerical Analysis I (Numerical Linear Algebra)

Lecture Notes 6: Dynamic Equations Part C: Linear Difference Equation Systems

M.A. Botchev. September 5, 2014

Quantum Computing Lecture 2. Review of Linear Algebra

Eigenvalues, Eigenvectors and the Jordan Form

Practical Linear Algebra: A Geometry Toolbox

Computing the pth Roots of a Matrix. with Repeated Eigenvalues

Math 240 Calculus III

What is A + B? What is A B? What is AB? What is BA? What is A 2? and B = QUESTION 2. What is the reduced row echelon matrix of A =

= A(λ, t)x. In this chapter we will focus on the case that the matrix A does not depend on time (so that the ODE is autonomous):

Linear Algebra Review. Vectors

EC Control Engineering Quiz II IIT Madras

Topic 15 Notes Jeremy Orloff

AN ELEMENTARY PROOF OF THE SPECTRAL RADIUS FORMULA FOR MATRICES

EXAM. Exam 1. Math 5316, Fall December 2, 2012

j=1 x j p, if 1 p <, x i ξ : x i < ξ} 0 as p.

Lecture Summaries for Linear Algebra M51A

Linear Algebra March 16, 2019

Course Notes: Week 1

1 Last time: least-squares problems

Matrix Solutions to Linear Systems of ODEs

Linear Algebra. Min Yan

MTH 464: Computational Linear Algebra

EECS 275 Matrix Computation

Example: Filter output power. maximization. Definition. Eigenvalues, eigenvectors and similarity. Example: Stability of linear systems.

MIDTERM REVIEW AND SAMPLE EXAM. Contents

0.1 Rational Canonical Forms

MATH 425-Spring 2010 HOMEWORK ASSIGNMENTS

State will have dimension 5. One possible choice is given by y and its derivatives up to y (4)

Spring 2019 Exam 2 3/27/19 Time Limit: / Problem Points Score. Total: 280

INHOMOGENEOUS LINEAR SYSTEMS OF DIFFERENTIAL EQUATIONS

Vectors and matrices: matrices (Version 2) This is a very brief summary of my lecture notes.

Linear Algebra Review

Notes on Linear Algebra and Matrix Theory

AMS526: Numerical Analysis I (Numerical Linear Algebra for Computational and Data Sciences)

Eigenvalues and Eigenvectors

MATH 583A REVIEW SESSION #1

Jordan Normal Form and Singular Decomposition

Lecture 1 Systems of Linear Equations and Matrices

Mathematical Foundations

Introduction to Matrix Algebra

Sensitivity analysis of the differential matrix Riccati equation based on the associated linear differential system

The matrix sign function

18.06 Problem Set 8 - Solutions Due Wednesday, 14 November 2007 at 4 pm in

Polynomials. p n (x) = a n x n + a n 1 x n 1 + a 1 x + a 0, where

Lecture notes: Applied linear algebra Part 2. Version 1

12x + 18y = 30? ax + by = m

Math 108b: Notes on the Spectral Theorem

Review of Linear Algebra

LinGloss. A glossary of linear algebra

MAT Linear Algebra Collection of sample exams

Mathematical Methods for Engineers and Scientists 1

A Divide-and-Conquer Algorithm for Functions of Triangular Matrices

Lecture 6: Lies, Inner Product Spaces, and Symmetric Matrices

Transcription:

Numerical methods for matrix functions SF2524 - Matrix Computations for Large-scale Systems Lecture 13 Numerical methods for matrix functions 1 / 26

Reading material Lecture notes online Numerical methods for matrix functions (Further reading: Nicholas Higham - Functions of Matrices) (Further reading: Golub and Van Loan - Matrix computations) Agenda Block D Matrix functions Lecture 13: Defintions Lecture 13: General methods Lecture 14: Matrix exponential (underlying expm(a) in matlab) Lecture 14: Matrix square root, matrix sign function Lecture 15: Krylov methods for f (A)b Lecture 15: Exponential integrators Numerical methods for matrix functions 2 / 26

Functions of matrices Matrix functions (or functions of matrices) will in this block refer to a certain class of functions f : C n n C n n that are consistent extensions of scalar functions. Simplest examples If f (t) = b 0 + b 1 t + + b m t m it is natural to define f (A) = b 0 I + b 1 A + + b m A m. If f (t) = α+t β+t it is natural to define f (A) = (αi + A) 1 (βi + A) = (βi + A)(αI + A) 1. Not matrix functions: f (A) = det(a), f (A) = A Numerical methods for matrix functions 3 / 26

Definitions Definition encountered in earlier courses (maybe) Consider an analytic function f : C C, with a Taylor expansion with expansion point µ = 0 f (z) = f (0) + f (0) z +. 1! The matrix function f (A) is defined as f (A) := i=0 f (i) (0) i! A i = f (0)I + f (0) A +. 1! In this course we are more careful. Essentially equivalent definitions: Taylor series: Definition 4.1.1 Jordan based: Definition 4.1.3 Cauchy integral: Definition 4.1.4 Numerical methods for matrix functions 4 / 26

Applications The most well-known non-trivial matrix function Consider the linear autonomous ODE y (t) = Ay(t), y(0) = y 0 The matrix exponential (expm(a) in matlab) is the function that satisfies More generally, the solution to y(t) = exp(ta)y 0 y (t) = Ay(t) + f (t) satisfies t y(t) = exp(ta)y 0 + exp(a(t s))f (s) ds 0 For some problems much better than traditional time-stepping methods. Numerical methods for matrix functions 5 / 26

Trigonometric matrix functions and square roots Suppose y(t) R n satisfies y (x) + Au(x) = 0 y(0) = y 0, y (0) = y 0. The solution is explicitly given by y(t) = cos( At)y 0 + ( A) 1 sin( (A)t)y 0. Numerical methods for matrix functions 6 / 26

Matrix logarithm in Markov chains The transition probability matrix P(t) is related to the transition intensity matrix Q with P(t) = exp(qt) were Q satisfies certain properties. Inverse problem: Given P(1) is there Q such that the properties are satisfied. Method: Compute and check properties. Q = log(p(0)) Numerical methods for matrix functions 7 / 26

Further applications in Solving the Riccati equation (in control theory) Study of stability of time-delay systems Orthogonal procrustes problems Geometric mean Numerical methods for differential equations See youtube video from Gene Golub summer school: https://www.youtube.com/watch?v=uxwmyr0lqak Numerical methods for matrix functions 8 / 26

Definitions of matrix functions PDF lecture notes section 4.1 Numerical methods for matrix functions 9 / 26

Polynomials If p(z) = a 0 + a 1 z + a p z p, then the matrix function extension is p(a) = a 0 I + a 1 A + a p A p Taylor expansion of scalar function f (z) with expansion point µ f (z) = i=0 f (i) (µ) (z µ) i. i! Definition (Taylor definition) The Taylor definition with expansion point µ C of the matrix function associated with f (z) is given by f (A) = i=0 f (i) (µ) (A µi ) i. i! Numerical methods for matrix functions 10 / 26

When is the infinite sum f (A) = i=0 f (i) (µ) (A µi ) i. (1) i! finite? Theorem (Convergence of Taylor definition) Suppose f (z) is analytic in D(µ, r) and suppose r > A µi. Let f (A) be (1) and A µi γ := < 1. r Then, there exists a constant C > 0 independent of N such that f (A) N i=0 f (i) (µ) (A µi ) i Cγ N 0 as N. i! Consequence: f (A) finite if f entire function Proof on black board Numerical methods for matrix functions 11 / 26

Simple properties: f (z) = g(z) + h(z) f (A) = g(a) + h(a) f (z) = g(z)h(z) f (A) = g(a)h(a) = h(a)g(a) f (V 1 XV ) = V 1 f (X )V ( ) 1 f (t 1 ) f ( t... ) =... t n f (tn) t 1 f (t 1 ) f (... ) =... tn f (t n ) [ ] [ ] A 0 f (A) 0 f ( ) ( ) 0 B 0 f (B) Note g(a)g(b) g(b)g(a) unless AB = BA Numerical methods for matrix functions 12 / 26

Jordan form definition Use ( ) with Jordan decomposition A = VJV 1 : f (A) = f (VJV 1 ) = Vf (J)V 1 Use ( ): 1 f (J) = f ( J... f (J 1 ) ) =... f (Jq) J q What is the matrix function of a Jordan block? λ 1...... J i =... 1 λ Numerical methods for matrix functions 13 / 26

Example: f (J) Example in Matlab: and p(z) = z 4.For this case we have s 1 0 A = s 1 s p(λ) p 1 (λ) 2 p (λ) p(j) = 0 p(λ) p (λ). 0 0 p(λ) Can be formalized (proof in PDF lecture notes)... Numerical methods for matrix functions 14 / 26

Definition (Jordan canonical form (JCF) definition) Suppose A C n n and let X and J 1,..., J q be the JCF. The JCF-definition of the matrix function f (A) is given by f (A) := X diag(f 1,..., F q )X 1, (2) where f (λ i ) F i = f (J i ) := f (λ i ) f (n i 1) (λ i ) (n i 1)! 1!.......... f (λ i ) 1! f (λ i ) Cn i n i. (3) Show specialization when eigenvalues distinct Numerical methods for matrix functions 15 / 26

Cauchy integral definition From complex analysis: Cauchy integral formula f (x) = 1 f (z) 2iπ z x dz. where Γ encircles x counter-clockwise. Replace x with A: Definition (Cauchy integral definition) Suppose f is analytic inside and on a simple, closed, piecewise-smooth curve Γ, which encloses the eigenvalues of A once counter-clockwise. The Cauchy integral definition of matrix functions is given by f (A) := 1 2iπ Γ Γ f (z)(zi A) 1 dz. * example in lecture notes * Numerical methods for matrix functions 16 / 26

Equivalence of definitions We have learned about Definition 1: Taylor definition Definition 2: Jordan form definition Definition 3: Cauchy integral definition Slightly different different definition domains. Theorem (Equivalence of the matrix function definitions) Suppose f is an entire function and suppose A C n n. Then, the matrix function definitions are equivalent. Which definition valid for and f (x) = x with A = f (x) = x with A = [ ] 0 1? 0 4 [ ] 3 1? 0 4 Numerical methods for matrix functions 17 / 26

General methods PDF lecture notes section 4.2 Numerical methods for matrix functions 18 / 26

General methods for matrix functions: Today: Truncated Taylor series (4.2.1) Today: Eigenvalue-eigenvector approach (4.2.2) Today: Schur-Parlett method (4.2.3) Lecture 15: Krylov methods for f (A)b (4.4) Numerical methods for matrix functions 19 / 26

Truncated Taylor series (naive approach) First approach based on truncting Taylor series: f (A) F N = N i=0 f (i) (µ) (A µi ) i i! Properties Can be very slow if Taylor series converges slowly We need N 1 matrix-matrix multiplications. Complexity O(Nn 3 ) We need access to the derivatives The truncated Taylor series is mostly for theoretical purposes. Numerical methods for matrix functions 20 / 26

Eigenvalue-eigenvector approach If we have distinct eigenvalues or symmetric matrix: f (λ 1 ) f (A) = V... V 1 f (λn) where V = [v 1,..., v n ] are the eigenvectors. Main properties Requires computation of eigenvalues and eigenvectors: Complexity essentially O(n 3 ) Requires only the function value in the eigenvalues Can be numerically unstable If A is symmetric V 1 = V T. Conclusion: Can be used for numerical computations if reliability is not important. Numerical methods for matrix functions 21 / 26

Schur-Parlett method We know how to compute a Schur factorization A = QTQ where Q orthogonal and T upper triangular f (A) = f (QTQ ) = Qf (T )Q. Schur-Parlett method: Compute a Schur factorization Q, T Compute f (T ) where T triangular Compute f (A) = Qf (T )Q. What is f (T ) for a triangular matrix? Numerical methods for matrix functions 22 / 26

f (T ) where T triangular Note: f (T ) commutes with T : f (T )T = Tf (T ). * On black board: two-by-two example. Generalization derivation * Numerical methods for matrix functions 23 / 26

Theorem (Computation of one element of f (T )) Suppose T C n n is an upper triangular matrix with distinct eigenvalues. Then, for any i and any j > i, s f ij = t jj t ii where s = t ij (f jj f ii ) + j 1 k=i+1 t ik f kj f ik t kj. F : j + + + + + 0 + + + + + i 0 0 + + + + f ij 0 0 0 + + + + 0 0 0 0 + + + + 0 0 0 0 0 + + + 0 0 0 0 0 0 + + 0 0 0 0 0 0 0 + T : j + + + + + + + + 0 + + + + + + + i 0 0 + + + + + + 0 0 0 + + + + + 0 0 0 0 + + + + 0 0 0 0 0 + + + 0 0 0 0 0 0 + + 0 0 0 0 0 0 0 + Numerical methods for matrix functions 24 / 26

Repeat sub-column by sub-column. * On blackboard * * Matlab simulation * Numerical methods for matrix functions 25 / 26

Main properties Schur-Parlett (simplified) Requires the computation of a Schur-decomposition (O(n 3 )) which is often the dominating computational cost. The only usage of f : f (λ i ), i = 1,..., n Only works when eigenvalues distinct Numerical cancellation can occur when eigenvalues close: Can repaired with the full version of Schur-Parlett by using f (i) (z). Numerical methods for matrix functions 26 / 26