Approximation of functions of large matrices. Part I. Computational aspects. V. Simoncini

Similar documents
Recent advances in approximation using Krylov subspaces. V. Simoncini. Dipartimento di Matematica, Università di Bologna.

Adaptive rational Krylov subspaces for large-scale dynamical systems. V. Simoncini

Error Estimation and Evaluation of Matrix Functions

The Nullspace free eigenvalue problem and the inexact Shift and invert Lanczos method. V. Simoncini. Dipartimento di Matematica, Università di Bologna

The rational Krylov subspace for parameter dependent systems. V. Simoncini

Error bounds for Lanczos approximations of rational functions of matrices

Exponential integration of large systems of ODEs

CONVERGENCE ANALYSIS OF PROJECTION METHODS FOR THE NUMERICAL SOLUTION OF LARGE LYAPUNOV EQUATIONS

On the numerical solution of large-scale linear matrix equations. V. Simoncini

ADAPTIVE RATIONAL KRYLOV SUBSPACES FOR LARGE-SCALE DYNAMICAL SYSTEMS. 1. Introduction. The time-invariant linear system

Valeria Simoncini and Daniel B. Szyld. Report October 2009

EXTENDED KRYLOV SUBSPACE FOR PARAMETER DEPENDENT SYSTEMS

Bounds for the Matrix Condition Number

APPROXIMATION OF THE LINEAR THE BLOCK SHIFT-AND-INVERT KRYLOV SUBSPACE METHOD

EXTENDED KRYLOV SUBSPACE FOR PARAMETER DEPENDENT SYSTEMS

Iterative solvers for saddle point algebraic linear systems: tools of the trade. V. Simoncini

Comparing Leja and Krylov approximations of large scale matrix exponentials

Introduction. Chapter One

z k, z C, (1.1) , for k 2. j! j=0

RD-Rational Approximations of the Matrix Exponential

Order reduction numerical methods for the algebraic Riccati equation. V. Simoncini

Last Time. Social Network Graphs Betweenness. Graph Laplacian. Girvan-Newman Algorithm. Spectral Bisection

CONVERGENCE ANALYSIS OF THE EXTENDED KRYLOV SUBSPACE METHOD FOR THE LYAPUNOV EQUATION

Rational Krylov approximation of matrix functions: Numerical methods and optimal pole selection

The ReLPM Exponential Integrator for FE Discretizations of Advection-Diffusion Equations

C M. A two-sided short-recurrence extended Krylov subspace method for nonsymmetric matrices and its relation to rational moment matching

Computational methods for large-scale matrix equations and application to PDEs. V. Simoncini

LARGE SPARSE EIGENVALUE PROBLEMS. General Tools for Solving Large Eigen-Problems

LARGE SPARSE EIGENVALUE PROBLEMS

u(t) = Au(t), u(0) = B, (1.1)

ETNA Kent State University

Solution of eigenvalue problems. Subspace iteration, The symmetric Lanczos algorithm. Harmonic Ritz values, Jacobi-Davidson s method

Qualifying Examination

Comparing Leja and Krylov Approximations of Large Scale Matrix Exponentials

Spectral Properties of Saddle Point Linear Systems and Relations to Iterative Solvers Part I: Spectral Properties. V. Simoncini

Computational methods for large-scale matrix equations and application to PDEs. V. Simoncini

Solutions for Math 225 Assignment #4 1

Computational methods for large-scale matrix equations: recent advances and applications. V. Simoncini

Iterative Methods for Linear Systems of Equations

FEM and sparse linear system solving

Preliminary/Qualifying Exam in Numerical Analysis (Math 502a) Spring 2012

RECURSION RELATIONS FOR THE EXTENDED KRYLOV SUBSPACE METHOD

Implementation of exponential Rosenbrock-type integrators

MATH 5640: Functions of Diagonalizable Matrices

Exponential integrators for semilinear parabolic problems

Approximating the matrix exponential of an advection-diffusion operator using the incomplete orthogonalization method

DEFLATED RESTARTING FOR MATRIX FUNCTIONS

Structured Preconditioners for Saddle Point Problems

On the Superlinear Convergence of MINRES. Valeria Simoncini and Daniel B. Szyld. Report January 2012

M.A. Botchev. September 5, 2014

Solution of eigenvalue problems. Subspace iteration, The symmetric Lanczos algorithm. Harmonic Ritz values, Jacobi-Davidson s method

Efficient Wavefield Simulators Based on Krylov Model-Order Reduction Techniques

Linköping University Post Print. A numerical solution of a Cauchy problem for an elliptic equation by Krylov subspaces

4.8 Arnoldi Iteration, Krylov Subspaces and GMRES

Block Krylov Space Solvers: a Survey

Shift-Invert Lanczos Method for the Symmetric Positive Semidefinite Toeplitz Matrix Exponential

c 2005 Society for Industrial and Applied Mathematics

MULTIGRID ARNOLDI FOR EIGENVALUES

Matrix Algorithms. Volume II: Eigensystems. G. W. Stewart H1HJ1L. University of Maryland College Park, Maryland

IDR(s) Master s thesis Goushani Kisoensingh. Supervisor: Gerard L.G. Sleijpen Department of Mathematics Universiteit Utrecht

Krylov methods for the computation of matrix functions

Krylov Space Solvers

The Lanczos and conjugate gradient algorithms

Recent computational developments in Krylov Subspace Methods for linear systems. Valeria Simoncini and Daniel B. Szyld

Karhunen-Loève Approximation of Random Fields Using Hierarchical Matrix Techniques

Accurate evaluation of divided differences for polynomial interpolation of exponential propagators

Eigenvalue Problems CHAPTER 1 : PRELIMINARIES

AMS526: Numerical Analysis I (Numerical Linear Algebra)

Charles University Faculty of Mathematics and Physics DOCTORAL THESIS. Krylov subspace approximations in linear algebraic problems

From Completing the Squares and Orthogonal Projection to Finite Element Methods

Algorithms that use the Arnoldi Basis

Structured Preconditioners for Saddle Point Problems

Recycling Bi-Lanczos Algorithms: BiCG, CGS, and BiCGSTAB

Linear Algebra Massoud Malek

ON ORTHOGONAL REDUCTION TO HESSENBERG FORM WITH SMALL BANDWIDTH

RESIDUAL, RESTARTING AND RICHARDSON ITERATION FOR THE MATRIX EXPONENTIAL

Ritz Value Bounds That Exploit Quasi-Sparsity

Efficient Solution of Parabolic Equations by Krylov Approximation Methods

Structured Krylov Subspace Methods for Eigenproblems with Spectral Symmetries

1. Introduction. Many applications in science and engineering require the evaluation of expressions of the form (1.1)

Nonlinear Programming Algorithms Handout

ILAS 2017 July 25, 2017

A short course on: Preconditioned Krylov subspace methods. Yousef Saad University of Minnesota Dept. of Computer Science and Engineering

Assignment 1 Math 5341 Linear Algebra Review. Give complete answers to each of the following questions. Show all of your work.

Applied Linear Algebra in Geoscience Using MATLAB

Research Article MINRES Seed Projection Methods for Solving Symmetric Linear Systems with Multiple Right-Hand Sides

Arnoldi Methods in SLEPc

DELFT UNIVERSITY OF TECHNOLOGY

An Arnoldi Gram-Schmidt process and Hessenberg matrices for Orthonormal Polynomials

The EVSL package for symmetric eigenvalue problems Yousef Saad Department of Computer Science and Engineering University of Minnesota

Sensitivity of Gauss-Christoffel quadrature and sensitivity of Jacobi matrices to small changes of spectral data

Preface to the Second Edition. Preface to the First Edition

ETNA Kent State University

Finding Rightmost Eigenvalues of Large, Sparse, Nonsymmetric Parameterized Eigenvalue Problems

Three-Dimensional Transient Electromagnetic Modeling Using Rational Krylov Methods

Solving Ill-Posed Cauchy Problems in Three Space Dimensions using Krylov Methods

SOLVING ILL-POSED LINEAR SYSTEMS WITH GMRES AND A SINGULAR PRECONDITIONER

Krylov subspace projection methods

The quadratic eigenvalue problem (QEP) is to find scalars λ and nonzero vectors u satisfying

ANALYSIS OF SOME KRYLOV SUBSPACE APPROXIMATIONS TO THE MATRIX EXPONENTIAL OPERATOR

Transcription:

Approximation of functions of large matrices. Part I. Computational aspects V. Simoncini Dipartimento di Matematica, Università di Bologna and CIRSA, Ravenna, Italy valeria@dm.unibo.it 1

The problem Given A R n n, v R n and f sufficiently smooth function, approximate x = f(a)v A large dimension, v = 1 A sym. pos. (semi)def., or A positive real f(a)v vs f(a) Projection-type methods K approximation space, m = dim(k) V R n m s.t. K = range(v ) x = f(a)v x m = V f(v AV )(V v) 2

The problem Given A R n n, v R n and f sufficiently smooth function, approximate A large dimension, v = 1 x = f(a)v A sym. pos. (semi)def., or A positive real f(a)v vs f(a) Projection-type methods K approximation space, m = dim(k) V R n m s.t. K = range(v ) x = f(a)v x m = V f(v AV )(V v) 3

Numerical approximation. f(a)v x Important issues: Role of f in the approximation quality Role of A in the approximation quality Efficiency? Measures/Estimates of accuracy? 4

Standard Krylov subspace approximation K = K m (A, v) For H m = V AV, v = V e 1 and V V = I m : x m = V m f(h m )e 1 Polynomial approximation: x m = p m 1 (A)v (p m 1 interpolates f at eigenvalues of H m ) (alternatives: other polyns or interpolation points; e.g., Moret & Novati, 01-04 ) Note: Procedure valid for A symmetric and nonsymmetric Numerical and theoretical results since mid 80s. 5

Typical convergence estimates in K m Approximation of f(λ) = exp( λ) (Hochbruck & Lubich 97) A sym. semidef. σ(a) [0, 4ρ], x m = V m exp( H m )e 1, f(a)v x m 10e m2 /(5ρ), 4ρ m 2ρ f(a)v x m 10 ( eρ ) m ρ e ρ, m 2ρ m see also Tal-Ezer 89, Druskin & Knizhnerman 89, Stewart & Leyk 96 Approximation of f(λ) = λ 1/2, exp( λ),... : f(a)v V m f( H m )e 1 = O(exp( 2m λmin λ max )) 6

Typical convergence estimates in K m Approximation of f(λ) = exp( λ) (Hochbruck & Lubich 97) A sym. semidef. σ(a) [0, 4ρ], x m = V m exp( H m )e 1, f(a)v x m 10e m2 /(5ρ), 4ρ m 2ρ f(a)v x m 10 ( eρ ) m ρ e ρ, m 2ρ m see also Tal-Ezer 89, Druskin & Knizhnerman 89, Stewart & Leyk 96 Approximation of f(λ) = λ 1/2, exp( λ),... : f(a)v V m f( H m )e 1 = O(exp( 2m λmin λ max )) 7

...When things are not so easy exp( A)v V m exp( H m )e 1 A R 400 400, A = 10 5 10 0 10 2 10 4 10 6 error norm 10 8 10 10 10 12 10 14 0 20 40 60 80 100 120 140 Krylov subspace dimension exp( A)v V m exp( H m )e 1 10e m2 /(5ρ), where σ(a) [0, 4ρ] 4ρ m 2ρ 8

Acceleration Procedures: Shift-Invert Lanczos A symmetric pos. (semi)def. Choose γ s.t. (I + γa) is invertible, and construct K = K m ((I + γa) 1, v), van den Eshof-Hochbruck 06, Moret-Novati 04 with T m = V (I + γa) 1 V, v = V e 1 and V V = I m x m = V m f( 1 γ (T 1 m I m ))e 1 Rational approximation: x m = p m 1 ((I + γa) 1 )v Choice of γ: γ = 1/ λ min λ max (Moret, tr 2005) 9

Acceleration Procedures: Extended Krylov For A nonsingular, K = K m1 (A, v) + K m2 (A 1, A 1 v), Druskin-Knizhnerman 1998, A sym. Note: K = A (m 2 1) K m1 +m 2 1(A, v) Algorithm (augmentation-style) - Fix m 2 m 1 - Run m 2 steps of Inverted Lanczos - Run m 1 steps of Standard Lanczos + orth. 10

Extended Krylov: a new implementation m 1 = m 2 = m not fixed a priori K = K m (A, v) + K m (A 1, A 1 v) Arnoldi-type recurrence: - U 1 [v, A 1 v] + orth - U j+1 [AU j (:, 1), A 1 U j (:, 2)] + orth j = 1, 2,... Recurrence to cheaply compute T m = UmAU m, U m = [U 1,..., U m ] Compute x m = U m f(t m )e 1 Simoncini, 2007 11

Extended Krylov: Convergence theory I f satisfying f(z) = 0 (with convenient measure dµ(ζ)) 1 dµ(ζ), z C\], 0] z ζ Druskin-Knizhnerman 1998: A sym: x x m = O(m 2 exp( 2m 4 λ min λ max )) 12

Extended Krylov: Convergence theory II A new approximation result for nonsingular A: Let f = f 1 + f 2, a [0, ) f 1 (z) f 2 (z) m 1 k=0 m 1 k=0 γ 1,k F 1,k (z) c 1 Φ 1 ( a) m, γ 2,k F 2,k (z 1 ) c 2 Φ 2 ( 1 a ) m, Φ 1, F 1,k conformal mapping and Faber Polynomial w.r.to W(A) Φ 2, F 2,k conformal mapping and Faber Polynomial w.r.to W(A 1 ) From this, for A symmetric: x x m = O(exp( 2m 4 λ min λ max )) Currently working on A nonsymmetric 13

Convergence rate. A R 400 400 symmetric. f(λ) = λ 1/2 10 0 10 0 10 2 10 2 10 4 10 4 norm of error 10 6 norm of error 10 6 10 8 10 8 10 10 10 10 10 12 10 12 0 5 10 15 20 25 30 dimension of approximation space 10 14 0 5 10 15 20 25 30 dimension of approximation space σ(a) = [0.01, 0.9] σ(a) = [1, 50] Uniform spectral distribution 14

Experiment. A R 400 400 normal. f(λ) = λ 1/2 10 0 ellipse a1=0.097, a2=0.01, c=0.1 10 2 true a= sqrt(l 1 l n ) a optimal 10 4 norm of error 10 6 10 8 10 10 10 12 10 14 0 5 10 15 20 25 30 35 40 45 50 dimension of approximation space σ(a) on an elliptic curve in C + with center on real axis 15

Log-uniform spectral distribution. f (3) (z) = exp( z), f (5) (z) = z 1/2, f (7) (z) = z 1/4 case 3 1 case 5 1 case 7 1 space dim. at convergence 400 300 200 100 space dim. at convergence 400 300 200 100 space dim. at convergence 400 300 200 100 0 10 0 10 5 case 3 2 0 10 0 10 5 case 5 2 0 10 0 shift invert Lanczos(0.1) 10 5 SI Lanczos(γ) Standard Krylov case Extended 7 2 Krylov space dim. at convergence 400 300 200 100 space dim. at convergence 400 300 200 100 space dim. at convergence 400 300 200 100 0 0 0 10 0 10 0 10 0 cases *-1: λ min = 10 1 cases *-2: λ min = 10 4 x-axis: λ max = 10 k 16

SI-Lanczos and dependence on parameter γ 10 0 f(x) = exp( sqrt(x)), λ min = 1e 1, λ max = 1e6 10 0 f(x) = exp( sqrt(x)), λ min = 1e 4, λ max = 1e3 10 2 Lanczos 10 1 10 2 Lanczos 10 3 SI(0.01) 10 4 10 6 10 8 EK 10 4 10 5 10 6 10 7 SI(γ) EK SI(0.1) 10 10 SI(0.1)SI(1) SI(100) SI(10) SI(0.01) SI(γ) 10 8 10 9 0 50 100 150 200 250 300 10 10 0 50 100 150 200 250 300 Log-uniform spectral distribution 17

Comparisons: CPU Time in Matlab A R 4900 4900 : L(u) = 1 10 u xx 100u yy, in [0, 1] 2, hom.b.c. σ(a) [9.6 10 2, 1.96 10 6 ], f(λ) = λ 1/2 Method space dim. CPU Time Standard Krylov 185 16.02 Rational (Zolotarev) 0.50 SI-Lanczos(0.001) 62 1.00 SI-Lanczos (1e-5) 49 0.60 SI-Lanczos (γ=2e-5) 33 0.32 Extended Krylov 32 0.20 No reorthogonalization. Direct solves. Tol = 10 12. 18

A nonsymmetric matrix. f(z) = z A R 900 900 : L(u) = 100u xx u yy + 10xu x in [0, 1] 2, hom.b.c. 10 2 10 0 10 2 norm of error 10 4 SI Arnoldi(1e 3) 10 6 10 8 Extended Krylov SI Arnoldi(2e 5) SI Arnoldi(1e 5) 10 10 0 10 20 30 40 50 60 70 80 90 100 dimension of approximation space σ(a) R, λ min = 9.2 10 2, λ max = 3.6 10 5 Full orthogonalization 19

Conclusions and work in progress Great potential of using f(a)v in application problems Exploit low cost of using A instead of f(a) Further developments in acceleration techniques The case of A nonsymmetric (preliminary encouraging tests) New implementation of the Extended Krylov method Improved convergence bounds for A symmetric New convergence results for A nonsymmetric (to be completed) Performance: - Does not depend on parameters - Competitive for A sym, and nonsym. (preliminary tests) 20

Conclusions and work in progress Great potential of using f(a)v in application problems Exploit low cost of using A instead of f(a) Further developments in acceleration techniques The case of A nonsymmetric (preliminary encouraging tests) New implementation of the Extended Krylov method Improved convergence bounds for A symmetric New convergence results for A nonsymmetric (to be completed) Performance: - Does not depend on parameters - Competitive for A sym, and nonsym. (preliminary tests) 21