Matching moments and matrix computations

Similar documents
Moments, Model Reduction and Nonlinearity in Solving Linear Algebraic Problems

On the Vorobyev method of moments

Sensitivity of Gauss-Christoffel quadrature and sensitivity of Jacobi matrices to small changes of spectral data

Lanczos tridiagonalization, Krylov subspaces and the problem of moments

Golub-Kahan iterative bidiagonalization and determining the noise level in the data

On the interplay between discretization and preconditioning of Krylov subspace methods

Efficient Estimation of the A-norm of the Error in the Preconditioned Conjugate Gradient Method

Composite convergence bounds based on Chebyshev polynomials and finite precision conjugate gradient computations

Krylov subspace methods from the analytic, application and computational perspective

Principles and Analysis of Krylov Subspace Methods

O jednom starém článku a jeho mnohých souvislostech On an old article and its many connections

Efficient Estimation of the A-norm of the Error in the Preconditioned Conjugate Gradient Method

Contribution of Wo¹niakowski, Strako²,... The conjugate gradient method in nite precision computa

Gene H. Golub and Gérard Meurant Matrices, Moments and Quadrature with Applications

Key words. conjugate gradients, normwise backward error, incremental norm estimation.

Lanczos tridigonalization and Golub - Kahan bidiagonalization: Ideas, connections and impact

ON EFFICIENT NUMERICAL APPROXIMATION OF THE BILINEAR FORM c A 1 b

On numerical stability in large scale linear algebraic computations

On the properties of Krylov subspaces in finite precision CG computations

Introduction. Chapter One

The Lanczos and conjugate gradient algorithms

On Numerical Stability in Large Scale Linear Algebraic Computations

High performance Krylov subspace method variants

Orthogonal polynomials

CME 302: NUMERICAL LINEAR ALGEBRA FALL 2005/06 LECTURE 0

On a residual-based a posteriori error estimator for the total error

On prescribing Ritz values and GMRES residual norms generated by Arnoldi processes

AN ITERATIVE METHOD WITH ERROR ESTIMATORS

On the loss of orthogonality in the Gram-Schmidt orthogonalization process

Reduced Synchronization Overhead on. December 3, Abstract. The standard formulation of the conjugate gradient algorithm involves

Analysis of Krylov subspace methods

ERROR ESTIMATION IN PRECONDITIONED CONJUGATE GRADIENTS

On the influence of eigenvalues on Bi-CG residual norms

Error Estimation in Preconditioned Conjugate Gradients. September 15, 2004

On solving linear systems arising from Shishkin mesh discretizations

ON ORTHOGONAL REDUCTION TO HESSENBERG FORM WITH SMALL BANDWIDTH

Introduction to Numerical Analysis

ESTIMATES OF THE TRACE OF THE INVERSE OF A SYMMETRIC MATRIX USING THE MODIFIED CHEBYSHEV ALGORITHM

1. Introduction. Let µ(t) be a distribution function with infinitely many points of increase in the interval [ π, π] and let

Generalized MINRES or Generalized LSQR?

Key words. GMRES method, convergence bounds, worst-case GMRES, ideal GMRES, field of values

CHARLES UNIVERSITY, PRAGUE Faculty of Mathematics and Physics Department of Numerical Mathematics ON SOME OPEN PROBLEMS IN KRYLOV SUBSPACE METHODS

AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 23: GMRES and Other Krylov Subspace Methods; Preconditioning

LARGE SPARSE EIGENVALUE PROBLEMS. General Tools for Solving Large Eigen-Problems

Why Gaussian quadrature in the complex plane?

LARGE SPARSE EIGENVALUE PROBLEMS

1 Conjugate gradients

The Conjugate Gradient Method

Numerical behavior of inexact linear solvers

CALCULATION OF GAUSS-KRONROD QUADRATURE RULES. 1. Introduction A(2n+ 1)-point Gauss-Kronrod integration rule for the integral.

6.4 Krylov Subspaces and Conjugate Gradients

AMSC 600 /CMSC 760 Advanced Linear Numerical Analysis Fall 2007 Krylov Minimization and Projection (KMP) Dianne P. O Leary c 2006, 2007.

Martin Tůma From Moments to Modern Iterative Methods - Historical Connections and Inspirations

Non-stationary extremal eigenvalue approximations in iterative solutions of linear systems and estimators for relative error

Large-scale eigenvalue problems

Total least squares. Gérard MEURANT. October, 2008

Karhunen-Loève Approximation of Random Fields Using Hierarchical Matrix Techniques

ANY FINITE CONVERGENCE CURVE IS POSSIBLE IN THE INITIAL ITERATIONS OF RESTARTED FOM

Charles University Faculty of Mathematics and Physics DOCTORAL THESIS. Krylov subspace approximations in linear algebraic problems

Lecture 9: Krylov Subspace Methods. 2 Derivation of the Conjugate Gradient Algorithm

Key words. cyclic subspaces, Krylov subspaces, orthogonal bases, orthogonalization, short recurrences, normal matrices.

Solution of eigenvalue problems. Subspace iteration, The symmetric Lanczos algorithm. Harmonic Ritz values, Jacobi-Davidson s method

A stable variant of Simpler GMRES and GCR

Stabilization and Acceleration of Algebraic Multigrid Method

How Rutishauser may have found the qd and LR algorithms, t

Finite-choice algorithm optimization in Conjugate Gradients

On the accuracy of saddle point solvers

The Conjugate Gradient Method for Solving Linear Systems of Equations

arxiv: v1 [hep-lat] 2 May 2012

M.A. Botchev. September 5, 2014

INTRODUCTION TO MULTIGRID METHODS

Numerical Analysis Preliminary Exam 10 am to 1 pm, August 20, 2018

A Method for Constructing Diagonally Dominant Preconditioners based on Jacobi Rotations

Applied Mathematics 205. Unit V: Eigenvalue Problems. Lecturer: Dr. David Knezevic

Implicitly Defined High-Order Operator Splittings for Parabolic and Hyperbolic Variable-Coefficient PDE Using Modified Moments

Optimal Left and Right Additive Schwarz Preconditioning for Minimal Residual Methods with Euclidean and Energy Norms

Lecture 9 Approximations of Laplace s Equation, Finite Element Method. Mathématiques appliquées (MATH0504-1) B. Dewals, C.

Solving Constrained Rayleigh Quotient Optimization Problem by Projected QEP Method

Contents. I Basic Methods 13

Lecture Note 7: Iterative methods for solving linear systems. Xiaoqun Zhang Shanghai Jiao Tong University

A Chebyshev-based two-stage iterative method as an alternative to the direct solution of linear systems

Introduction to Iterative Solvers of Linear Systems

Chebyshev semi-iteration in Preconditioning

Conjugate Gradient Method

PDEs, Matrix Functions and Krylov Subspace Methods

c 2005 Society for Industrial and Applied Mathematics

APPLIED NUMERICAL LINEAR ALGEBRA

Chasing the Bulge. Sebastian Gant 5/19/ The Reduction to Hessenberg Form 3

Lecture 18 Classical Iterative Methods

ETNA Kent State University

Department of Mathematics California State University, Los Angeles Master s Degree Comprehensive Examination in. NUMERICAL ANALYSIS Spring 2015

Numerical Methods in Matrix Computations

Algebraic Multigrid as Solvers and as Preconditioner

Elements of linear algebra

Electronic Transactions on Numerical Analysis Volume 50, 2018

Solving large sparse Ax = b.

The Nullspace free eigenvalue problem and the inexact Shift and invert Lanczos method. V. Simoncini. Dipartimento di Matematica, Università di Bologna

EECS 275 Matrix Computation

On the Superlinear Convergence of MINRES. Valeria Simoncini and Daniel B. Szyld. Report January 2012

AMS526: Numerical Analysis I (Numerical Linear Algebra for Computational and Data Sciences)

Transcription:

Matching moments and matrix computations Jörg Liesen Technical University of Berlin and Petr Tichý Czech Academy of Sciences and Zdeněk Strakoš Charles University in Prague and Czech Academy of Sciences http://www.karlin.mff.cuni.cz/ strakos Householder Symposium XVIII, Granlibakken, June 2011

Thanks I wish to give my thanks to many authors and collaborators whose work and help has enabled the presented view. Gene Golub, Gérard Meurant, Claude Brezinski, Jan Papež, Tomáš Gergelits. Z. Strakoš 2

CG matrix formulation of the Gauss Ch. q. Ax = b, x 0 ω(λ), T n y n = r 0 e 1 ω (n) (λ), ξ ζ n i=1 (λ) 1 dω(λ) ω (n) i ( θ (n) i ) 1 x n = x 0 + W n y n ω (n) (λ) ω(λ) Z. Strakoš 3

Distribution function ω(λ) λ i, s i are the eigenpairs of A, ω i = (s i, w 1 ) 2, w 1 = r 0 / r 0 1 ω N 0 ω 4 ω 3 ω 2 ω 1 L λ 1 λ 2 λ...... 3 Hestenes and Stiefel (1952). λ N U Z. Strakoš 4

CG and Gauss-Ch. quadrature errors U L (λ) 1 dω(λ) = n i=1 ω (n) i ( θ (n) i ) 1 + Rn (f) x x 0 2 A r 0 2 = n-th Gauss quadrature + x x n 2 A r 0 2 With x 0 = 0, b A 1 b = n 1 j=0 γ j r j 2 + r na 1 r n. CG : model reduction matching 2n moments; Golub, Meurant, Reichel, Boley, Gutknecht, Saylor, Smolarski,..., Meurant and S (2006), Golub and Meurant (2010), S and Tichý (2011) Z. Strakoš 5

Outline 1. History 2. Presence 3. Future Z. Strakoš 6

1 History Brouncker and Wallis (1655), Euler (1737, 1748,... ), Gauss (1814), Jacobi (1826, 1850, 1857), Chebyshev (1855, 1859,... ), Christoffel (1858, 1877), Heine (1878, 1881), Markov (1884,... ), Stieltjes (1884 1894) see Perron (1913), Szegö (1839), Gantmacher and Krein (1941), Shohat and Tamarkin (1943), Krein (1951), Akhiezer (1965), Gautschi (1981, 2004), Brezinski(1991), Van Assche (1993), Kjeldsen (1993), Gutknecht (1994)... Hilbert (1906), F. Riesz (1909), Hellinger and Toeplitz (1910, 1914), von Neumann (1927, 1932), Wintner (1929), Krylov (1931), Lanczos (1950, 1952), Hestenes and Stiefel (1952), Rutishauser (1953), Henrici (1958), Stiefel (1958), Householder (1953, 1964)... Z. Strakoš 7

1 Continued fraction corresponding to ω(λ) F N (λ) λ γ 1 λ γ 2 1 δ2 2 δ3 2... λ γ 3... λ γ N 1 δ2 N λ γ N The entries γ 1,...,γ N and δ 2,...,δ N represent coefficients of the Stieltjes recurrence (1893-4), see Chebyshev (1855), Brouncker (1655), Wallis (1656), with the Jacobi matrix T N (Toeplitz and Hellinger (1914), Simon (2006), S (2011), Liesen and S (201?).) The nth convergent F n (λ). Z. Strakoš 8

1 Partial fraction decomposition w 1(λI A) 1 w 1 = U L dω(µ) N λ µ = j=1 ω j = R N(λ) λ λ j P N (λ) = F N(λ), The denominator P n (λ) corresponding to the nth convergent F n (λ) of F N (λ), n = 1, 2,... is the nth orthogonal polynomial in the sequence determined by ω(λ) ; see Chebyshev (1855). Similarly, e T 1 (λi T n ) 1 e 1 = U L dω (n) (µ) λ µ = n j=1 ω (n) j λ λ j = R n(λ) P n (λ) = F n(λ), Z. Strakoš 9

1 Minimal partial realization Consider the expansion w 1(λI A) 1 w 1 = F N (λ) = 2n l=1 ξ l 1 λ l + O ( 1 ) λ 2n+1. Using the same expansion of the nth convergent F n (λ) of F N (λ), e T 1 (λi T n ) 1 e 1 = F n (λ) = 2n l=1 ξ l 1 λ l + O ( 1 ) λ 2n+1, where the moments in the numerators are identical due to the Gauss-Ch. quadrature property. F n (λ) approximates F N (λ) with the error proportional to λ (2n+1). Z. Strakoš 10

1 Minimal partial realization - roots Christoffel (1858), Chebyshev (1859), Stieltjes (1894); Euler (1737), translation published by Wyman and Wyman (1985). The minimal partial realization was rediscovered in the engineering literature by Kalman (1979). The works of Hestenes and Stiefel (1952), Vorobyev (1958, 1965) (see Brezinski (1991,... )), were not studied and recalled. The links with Chebyshev and Stieltjes were pointed out by Gragg (1974), Gragg and Lindquist (1983), Bultheel and Van Barel (1997). S (2011), S and Tichý (2011), S and Liesen (201?). Z. Strakoš 11

2 Presence A natural shift from the moment matching background to computing, technical issues and algorithmic developments. Example: The bound x x n A 2 ( κ(a) 1 κ(a) + 1 ) n x x 0 A should not be used in connection with the behaviour of CG unless κ(a) is really small or unless the (very special) distribution of eigenvalues makes it relevant. In particular, one should be very careful while using it as a part of the composed bound in the presence of the large outlying eigenvalues. Z. Strakoš 12

2 Axelsson (1976), Jennings (1977) p. 72:... it may be inferred that rounding errors... affects the convergence rate when large outlying eigenvalues are present. Z. Strakoš 13

2 Mathematical model of FP CG CG in finite precision arithmetic can be seen as the exact arithmetic CG for the problem with the slightly modified distribution function with larger support, i.e., with single eigenvalues replaced by tight clusters. Paige (1971-80), Greenbaum (1989), Parlett (1990), S (1991), Greenbaum and S (1992), Notay (1993),..., Druskin, Knizhnermann, Zemke, Wülling, Meurant,... Recent review and update in Meurant and S, Acta Numerica (2006). Fundamental consequence: In FP computations, the composed convergence bounds eliminating in exact arithmetic large outlying eigenvalues at the cost of one iteration per eigenvalue do not, in general, work. Z. Strakoš 14

2 : Composed bounds 10 0 10 5 10 5 10 0 10 5 10 10 10 10 10 15 50 100 150 200 10 15 50 100 150 200 Composed bounds with varying number of outliers (left) and the failure of the composed bounds in FP CG (right), Gergelits (2011). Z. Strakoš 15

3 PDE discretization and matrix computations u = 32(η 1 η 2 1 + η 2 η 2 2) on the unit square with zero Dirichlet boundary conditions. Galerkin finite element method (FEM) discretization with linear basis functions on a regular triangular grid with the mesh size h = 1/(m + 1), where m is the number of inner nodes in each direction. Discrete solution u h = N ζ j φ j. j=1 Up to a small inaccuracy proportional to machine precision, (u u (n) h ) 2 = (u u h ) 2 + (u h u (n) h ) 2 = (u u h ) 2 + x x n 2 A. Z. Strakoš 16

3 Solution and discretization error Exact solution u of the PDE x 10 4 Discretisation error u u h 3 0.9 0.8 0.7 0.6 2 0.5 0.4 0.3 1 0.2 0.1 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 Exact solution u of the Poisson model problem (left) and the discretization error u u h (right). Z. Strakoš 17

3 Algebraic and total error x 10 4 Algebraic error u h u CG h with c=1 and α=3 x 10 4 Total error u u CG h with c=1 and α=3 10 6 8 4 6 2 4 0 2 2 0 4 2 6 4 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 Algebraic error u h u (n) h (left) and the total error u u (n) h (right) when CG is stopped with b Ax n /( b + A x n ) < 1e 03. Z. Strakoš 18

3 The point (u u (n) h ) 2 = (u u h ) 2 + x x n 2 A = 5.8444e 03 + 1.4503e 05. Message? PDE: Matrix computations do not provide exact results. Algebra: Error should be evaluated in the function space. Both: Local distribution of the discretization and the algebraic error can be very different. Liesen and S (2011), Liesen and S (201?). Z. Strakoš 19

3 One can see the 1D analogy The discretization error (left), the algebraic and the total error (right), Papež (2011). Z. Strakoš 20

Ideas and people Euclid,..., Brouncker and Wallis (1655-56): Three term recurences (for numbers) Euler (1737, 1748),..., Brezinski (1991), Khrushchev (2008) Gauss (1814), Jacobi (1826), Christoffel (1858, 1857),..., Gautschi (1981, 2004) Chebyshev (1855, 1859), Markov (1884), Stieltjes (1884, 1893-94): Orthogonal polynomials Hilbert (1906),..., Von Neumann (1927, 1932), Wintner (1929) Krylov (1931), Lanczos (1950), Hestenes and Stiefel (1952), Rutishauser (1953), Henrici (1958),..., Vorobyev (1958, 1965), Golub and Welsh (1968),..., Laurie (1991-2001),... Gordon (1968), Schlesinger and Schwartz (1966), Reinhard (1979),... Paige (1971), Reid (1971), Greenbaum (1989),... Gragg (1974), Kalman (1979), Gragg, Lindquist (1983), Gallivan, Grimme, Van Dooren (1994),... Euler, Christoffel, Chebyshev (Markov), Stieltjes! Z. Strakoš 21

Thank you for your kind patience Z. Strakoš 22