Towards high performance IRKA on hybrid CPU-GPU systems

Size: px
Start display at page:

Download "Towards high performance IRKA on hybrid CPU-GPU systems"

Transcription

1 Towards high performance IRKA on hybrid CPU-GPU systems Jens Saak in collaboration with Georg Pauer (OVGU/MPI Magdeburg) Kapil Ahuja, Ruchir Garg (IIT Indore) Hartwig Anzt, Jack Dongarra (ICL Uni Tennessee Knoxville) Edmond Chow (Georgia Tech) 17th GAMM Workshop on Applied and Numerical Linear Algebra Cologne, Germany September 7-8, 2017

2 Introduction Projection based Model Order Reduction (MOR) Linear time-invariant (LTI) system: E ẋ(t) = A x(t) + B u(t) y(t) = C x(t) E r = W T EV A r = W T AV B r = W T B C r = CV x(t) R n, x r(t) R r, u R p, y, ỹ R q Reduced order model (ROM): E r ẋ r (t) = A r x r (t) + B r u(t) ỹ(t) = C r x r (t) Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 2/15

3 Introduction Description of Today s Focus Problem MOR of LTI systems often employs the rational Krylov subspace method (RKSM) of some flavor: Moment Matching IRKA Balanced Truncation (V, W span rational Krylov subspaces), (iterative rational Krylov algorithm) (computing the Gramians via ADI or RKSM) Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 3/15

4 Introduction Description of Today s Focus Problem MOR of LTI systems often employs the rational Krylov subspace method (RKSM) of some flavor: Moment Matching IRKA Balanced Truncation (V, W span rational Krylov subspaces), (iterative rational Krylov algorithm) (computing the Gramians via ADI or RKSM) IRKA in a nutshell e.g.[gugercin/antoulas/beattie 08] 1. Guess initial ROM or initial shifts, 2. Compute span V = span{(σ 1 E A) 1 B,..., (σ r E A) 1 B}, span W = span{(σ 1 E T A T ) 1 C T,..., (σ r E T A T ) 1 C T }. 3. Update ROM, set σ = Λ(A r, E r ), repeat until σ has converged. Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 3/15

5 Introduction Description of Today s Focus Problem MOR of LTI systems often employs the rational Krylov subspace method (RKSM) of some flavor: Moment Matching IRKA Balanced Truncation Dominant computation in all these methods Sequences of Shifted Linear Systems (SLS) (V, W span rational Krylov subspaces), (iterative rational Krylov algorithm) (computing the Gramians via ADI or RKSM) (A + σ j E) X j = Y j, for j I N, with real data A R n n, E R n n, Y j R n m, m n, but possibly complex shifts σ j C. Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 3/15

6 Introduction Description of Today s Focus Problem MOR of LTI systems often employs the rational Krylov subspace method (RKSM) of some flavor: Moment Matching IRKA Balanced Truncation Dominant computation in all these methods Sequences of Shifted Linear Systems (SLS) (V, W span rational Krylov subspaces), (iterative rational Krylov algorithm) (computing the Gramians via ADI or RKSM) P j (A + σ j E) X j = P j Y j, for j I N, with real data A R n n, E R n n, Y j R n m, m n, but possibly complex shifts σ j C. Preconditioner P j R n n usually needs update for every j. Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 3/15

7 Introduction Strategies for the switched linear Systems Sequences of Shifted Linear Systems (SLS) P j (A + σ j E) X j = P j Y j, for j I N, We aim at using iterative solvers on hybrid CPU-GPU systems. Efficiency improving strategies 1. recycle / realign preconditioner P j 2. recycle solvers / subspaces 3. solve shifted linear systems in parallel Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 4/15

8 Introduction Strategies for the switched linear Systems Sequences of Shifted Linear Systems (SLS) P j (A + σ j E) X j = P j Y j, for j I N, We aim at using iterative solvers on hybrid CPU-GPU systems. Efficiency improving strategies 1. recycle / realign preconditioner P j 2. recycle solvers / subspaces extending [Anzt/Chow/S./Dongarra 16] 3. solve shifted linear systems in parallel Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 4/15

9 Introduction Strategies for the switched linear Systems Sequences of Shifted Linear Systems (SLS) P j (A + σ j E) X j = P j Y j, for j I N, We aim at using iterative solvers on hybrid CPU-GPU systems. Efficiency improving strategies 1. recycle / realign preconditioner P j 2. recycle solvers / subspaces extending [Anzt/Chow/S./Dongarra 16] following [Ahuja/de Sturler/Gugercin/Chang 12] 3. solve shifted linear systems in parallel Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 4/15

10 Agenda 1. Introduction Projection based Model Order Reduction (MOR) Description of Today s Focus Problem Strategies for the switched linear Systems 2. Reusing and Realigning Preconditioners Prior Work and Our Approach The Real aka. Symmetric Case The Non-Symmetric aka. Complex Case Structure Exploiting Preconditioner Experiments 3. Recycling Solvers and Subspaces Background Preliminary Results 4. Parallel Solution of the Shifted Systems Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 5/15

11 Reusing and Realigning Preconditioners Prior Work and Our Approach 1. reuse preconditioner for small matrix changes, i.e. similar shifts 2. update incomplete factorization preconditioner [Bellavia/De Simone/di Serafino/Morini 11, Benzi/Bertaccini 03, Bertaccini 04, Calgaro/Chehab/Saad 10, Duintjer Tebbens/Tüma 07] Main Issues reuses reduce efficiency of the preconditioner updates have limited potential for parallel execution Goal highly parallel preconditioner update and application Jens Saak, Towards hybrid CPU-GPU IRKA 6/15

12 Reusing and Realigning Preconditioners The Real aka. Symmetric Case [Anzt/Chow/S ] Preconditioned iterative solution of Sequences of Shifted Linear Systems (SLS) P j (A + σ j E) X j = P j Y j, for j I N, for A, E symmetric and σ j real. Idea: Minimize A U T U via fixed point approach [Chow/et al. 15] Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 7/15

13 Reusing and Realigning Preconditioners The Real aka. Symmetric Case [Anzt/Chow/S ] Preconditioned iterative solution of Sequences of Shifted Linear Systems (SLS) P j (A + σ j E) X j = P j Y j, for j I N, for A, E symmetric and σ j real. Idea: Minimize A U T U via fixed point approach [Chow/et al. 15] Initialization If σ j σ j 1 is small enough, then P j P j 1 is small P j 1 good initial guess for P j. Use [Chow/Patel 15, Chow/Anzt/Dongarra 15] for (asynchronous) iterative computation of incomplete factorization preconditioners. Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 7/15

14 Reusing and Realigning Preconditioners The Real aka. Symmetric Case Time [ms] Speedup Number of sweeps Number of sweeps IC Test example: Oberwolfach Collection Steel Cooling Benchmark[Benner/S. 05] here using the FEniCS reimplementation by Maximilian Behr Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 8/15

15 Reusing and Realigning Preconditioners The Non-Symmetric aka. Complex Case Consider Ax = b for A C n n, x, b C n. With A r, A i R n n, x r, x i, b r, b i R n write A = A r + ıa i, x = x r + ıx i, b = b r + ıb i. Idea e.g.[day/heroux 01, Benzi/Bertaccini 08] Rewrite to real linear system [ ] [ ] [ ] Ar A i xr br = A i A r x i b i and employ block preconditioner. Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 9/15

16 Reusing and Realigning Preconditioners Structure Exploiting Preconditioner In our special case A j,r = A + Re(σ j ) E, A j,i = Im(σ j ) E Let Then A j Ṽ j = W j [ ] 0 Im(σ J j = j ). Im(σ j ) 0 [ ] [ ] Aj,r A j,i Vj,r = A j,i A j,r V j,i [ Wj,r W j,i (I 2 A j,r + J j E)Ṽj = W j ] Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 10/15

17 Reusing and Realigning Preconditioners Structure Exploiting Preconditioner In our special case A j,r = A + Re(σ j ) E, A j,i = Im(σ j ) E Let Then A j Ṽ j = W j [ ] 0 Im(σ J j = j ). Im(σ j ) 0 [ ] [ ] Aj,r A j,i Vj,r = A j,i A j,r V j,i [ Wj,r W j,i (I 2 A j,r + J j E)Ṽj = W j ( ) Idea: P := (I 2 A j,r ) 1 = I 2 A 1 j,r ] Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 10/15

18 Reusing and Realigning Preconditioners Structure Exploiting Preconditioner In our special case A j,r = A + Re(σ j ) E, A j,i = Im(σ j ) E Let [ ] counter diagonal blocks 0 Im(σ J j = are: j ). Im(σ j ) 0 Then ± Im(σ j) (A + Re(σ j) E) 1 E = ± Im(σj) Re(σ (A + Re(σj) j) E) 1 (A + Re(σ j) E A) [ ] [ ] [ ] A j Ṽ j = W = ± Im(σj) ( Aj,r A In j j,i (A + Vj,r Re(σ j) E) Wj,r Re(σ j) = 1 A ). A j,i A j,r V j,i W j,i (I 2 A j,r + J j E)Ṽj = W j ( ) Idea: P := (I 2 A j,r ) 1 = I 2 A 1 j,r Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 10/15

19 Reusing and Realigning Preconditioners FEniCS Convect. Diffusion on Unit Square 1 n = 9801 P3-GMRES iterations for SLS (tol=10 8, maxiter=250) ı /10 2 / / ı no prec block ILU(0) block ILU(1) block ILU(2) block ILU(3) block \ full ILU(0) model data by Maximilian Behr Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 11/15

20 Recycling Solvers and Subspaces Recall IRKA needs: span V = span{(σ 1 E A) 1 B,..., (σ k E A) 1 B}, span W = span{(σ 1 E T A T ) 1 C T,..., (σ k E T A T ) 1 C T }. Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 12/15

21 Recycling Solvers and Subspaces Recall IRKA needs: span V = span{(σ 1 E A) 1 B,..., (σ k E A) 1 B}, span W = span{(σ 1 E T A T ) 1 C T,..., (σ k E T A T ) 1 C T }. Idea [Ahuja/de Sturler/Gugercin/Chang 12] Need to solve with dual operators Use BiCG Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 12/15

22 Recycling Solvers and Subspaces Recall IRKA needs: span V = span{(σ 1 E A) 1 B,..., (σ k E A) 1 B}, span W = span{(σ 1 E T A T ) 1 C T,..., (σ k E T A T ) 1 C T }. Idea [Ahuja/de Sturler/Gugercin/Chang 12] Need to solve with dual operators Use BiCG MIMO systems (i.e., p, q > 1) repeatedly solve with different right hand sides update subspace, i.e. recycle Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 12/15

23 Recycling Solvers and Subspaces Recall IRKA needs: span V = span{(σ 1 E A) 1 B,..., (σ k E A) 1 B}, span W = span{(σ 1 E T A T ) 1 C T,..., (σ k E T A T ) 1 C T }. Idea [Ahuja/de Sturler/Gugercin/Chang 12] Need to solve with dual operators Use BiCG MIMO systems (i.e., p, q > 1) repeatedly solve with different right hand sides update subspace, i.e. recycle upon convergence: σ (i) j σ (i+1) j employ deflation, i.e., recycle Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 12/15

24 Recycling Solvers and Subspaces Recall IRKA needs: span V = span{(σ 1 E A) 1 B,..., (σ k E A) 1 B}, span W = span{(σ 1 E T A T ) 1 C T,..., (σ k E T A T ) 1 C T }. Idea [Ahuja/de Sturler/Gugercin/Chang 12] Need to solve with dual operators Use BiCG MIMO systems (i.e., p, q > 1) repeatedly solve with different right hand sides update subspace, i.e. recycle upon convergence: σ (i) j σ (i+1) j employ deflation, i.e., recycle RBiCG for IRKA Current focus with Kapil Ahuja, Ruchir Garg and Georg Pauer: CUDA accelerated version Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 12/15

25 Recycling Solvers and Subspaces Preliminary Results MATLAB results [Ahuja/de Sturler/Gugercin/Chang 12] iteration count time(s) n σ r drop IRKA tol steps BiCG RBiCG Ratio BiCG RBiCG Ratio Test example: Oberwolfach Collection Steel Cooling Benchmark[Benner/S. 05] avilable from: 20Profiles%20% %29 Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 13/15

26 Next: Parallel Solution of the Shifted Systems Observation: Iterative solver needs sparse matrix vector product (A + σ j E)x j = y j. memory bandwidth bounded. Idea: Define: X = [x 1,..., x r ], Y = [y 1,..., y r ] and Σ = diag(σ) Then Y = AX + EXΣ, and sparse matrix vector product sparse-dense matrix matrix product 2r sparse matrix sweeps become 2 sparse matrix sweeps Requires short recurrence method (e.g. BiCG) to limit memory usage Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 14/15

27 Conclusions & Outlook Conclusions Outlook analysis of the problem structure gives rise to 3 promising implementation candidates Preconditioner realignment and Subspace recycling can even be combined proper HPC implementation pending comparison of the strategies Master Thesis Georg Pauer Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 15/15

28 Conclusions & Outlook Conclusions Outlook analysis of the problem structure gives rise to 3 promising implementation candidates Preconditioner realignment and Subspace recycling can even be combined proper HPC implementation pending comparison of the strategies Master Thesis Georg Pauer Thank you for your attention! Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 15/15

Updating incomplete factorization preconditioners for model order reduction

Updating incomplete factorization preconditioners for model order reduction DOI 10.1007/s11075-016-0110-2 ORIGINAL PAPER Updating incomplete factorization preconditioners for model order reduction Hartwig Anzt 1 Edmond Chow 2 Jens Saak 3 Jack Dongarra 1,4,5 Received: 18 September

More information

A Newton-Galerkin-ADI Method for Large-Scale Algebraic Riccati Equations

A Newton-Galerkin-ADI Method for Large-Scale Algebraic Riccati Equations A Newton-Galerkin-ADI Method for Large-Scale Algebraic Riccati Equations Peter Benner Max-Planck-Institute for Dynamics of Complex Technical Systems Computational Methods in Systems and Control Theory

More information

MARCH 24-27, 2014 SAN JOSE, CA

MARCH 24-27, 2014 SAN JOSE, CA MARCH 24-27, 2014 SAN JOSE, CA Sparse HPC on modern architectures Important scientific applications rely on sparse linear algebra HPCG a new benchmark proposal to complement Top500 (HPL) To solve A x =

More information

arxiv: v3 [math.na] 6 Jul 2018

arxiv: v3 [math.na] 6 Jul 2018 A Connection Between Time Domain Model Order Reduction and Moment Matching for LTI Systems arxiv:181.785v3 [math.na] 6 Jul 218 Manuela Hund a and Jens Saak a a Max Planck Institute for Dynamics of Complex

More information

Model order reduction of large-scale dynamical systems with Jacobi-Davidson style eigensolvers

Model order reduction of large-scale dynamical systems with Jacobi-Davidson style eigensolvers MAX PLANCK INSTITUTE International Conference on Communications, Computing and Control Applications March 3-5, 2011, Hammamet, Tunisia. Model order reduction of large-scale dynamical systems with Jacobi-Davidson

More information

Fine-Grained Parallel Algorithms for Incomplete Factorization Preconditioning

Fine-Grained Parallel Algorithms for Incomplete Factorization Preconditioning Fine-Grained Parallel Algorithms for Incomplete Factorization Preconditioning Edmond Chow School of Computational Science and Engineering Georgia Institute of Technology, USA SPPEXA Symposium TU München,

More information

MODEL REDUCTION BY A CROSS-GRAMIAN APPROACH FOR DATA-SPARSE SYSTEMS

MODEL REDUCTION BY A CROSS-GRAMIAN APPROACH FOR DATA-SPARSE SYSTEMS MODEL REDUCTION BY A CROSS-GRAMIAN APPROACH FOR DATA-SPARSE SYSTEMS Ulrike Baur joint work with Peter Benner Mathematics in Industry and Technology Faculty of Mathematics Chemnitz University of Technology

More information

Feeding of the Thousands. Leveraging the GPU's Computing Power for Sparse Linear Algebra

Feeding of the Thousands. Leveraging the GPU's Computing Power for Sparse Linear Algebra SPPEXA Annual Meeting 2016, January 25 th, 2016, Garching, Germany Feeding of the Thousands Leveraging the GPU's Computing Power for Sparse Linear Algebra Hartwig Anzt Sparse Linear Algebra on GPUs Inherently

More information

Max Planck Institute Magdeburg Preprints

Max Planck Institute Magdeburg Preprints Peter Benner Patrick Kürschner Jens Saak Real versions of low-rank ADI methods with complex shifts MAXPLANCKINSTITUT FÜR DYNAMIK KOMPLEXER TECHNISCHER SYSTEME MAGDEBURG Max Planck Institute Magdeburg Preprints

More information

Model Order Reduction via Matlab Parallel Computing Toolbox. Istanbul Technical University

Model Order Reduction via Matlab Parallel Computing Toolbox. Istanbul Technical University Model Order Reduction via Matlab Parallel Computing Toolbox E. Fatih Yetkin & Hasan Dağ Istanbul Technical University Computational Science & Engineering Department September 21, 2009 E. Fatih Yetkin (Istanbul

More information

Accelerating linear algebra computations with hybrid GPU-multicore systems.

Accelerating linear algebra computations with hybrid GPU-multicore systems. Accelerating linear algebra computations with hybrid GPU-multicore systems. Marc Baboulin INRIA/Université Paris-Sud joint work with Jack Dongarra (University of Tennessee and Oak Ridge National Laboratory)

More information

Iterative Methods for Linear Systems of Equations

Iterative Methods for Linear Systems of Equations Iterative Methods for Linear Systems of Equations Projection methods (3) ITMAN PhD-course DTU 20-10-08 till 24-10-08 Martin van Gijzen 1 Delft University of Technology Overview day 4 Bi-Lanczos method

More information

Contents. Preface... xi. Introduction...

Contents. Preface... xi. Introduction... Contents Preface... xi Introduction... xv Chapter 1. Computer Architectures... 1 1.1. Different types of parallelism... 1 1.1.1. Overlap, concurrency and parallelism... 1 1.1.2. Temporal and spatial parallelism

More information

Approximation of the Linearized Boussinesq Equations

Approximation of the Linearized Boussinesq Equations Approximation of the Linearized Boussinesq Equations Alan Lattimer Advisors Jeffrey Borggaard Serkan Gugercin Department of Mathematics Virginia Tech SIAM Talk Series, Virginia Tech, April 22, 2014 Alan

More information

9.1 Preconditioned Krylov Subspace Methods

9.1 Preconditioned Krylov Subspace Methods Chapter 9 PRECONDITIONING 9.1 Preconditioned Krylov Subspace Methods 9.2 Preconditioned Conjugate Gradient 9.3 Preconditioned Generalized Minimal Residual 9.4 Relaxation Method Preconditioners 9.5 Incomplete

More information

Accelerated Solvers for CFD

Accelerated Solvers for CFD Accelerated Solvers for CFD Co-Design of Hardware/Software for Predicting MAV Aerodynamics Eric de Sturler, Virginia Tech Mathematics Email: sturler@vt.edu Web: http://www.math.vt.edu/people/sturler Co-design

More information

Adaptive rational Krylov subspaces for large-scale dynamical systems. V. Simoncini

Adaptive rational Krylov subspaces for large-scale dynamical systems. V. Simoncini Adaptive rational Krylov subspaces for large-scale dynamical systems V. Simoncini Dipartimento di Matematica, Università di Bologna valeria@dm.unibo.it joint work with Vladimir Druskin, Schlumberger Doll

More information

Recycling Bi-Lanczos Algorithms: BiCG, CGS, and BiCGSTAB

Recycling Bi-Lanczos Algorithms: BiCG, CGS, and BiCGSTAB Recycling Bi-Lanczos Algorithms: BiCG, CGS, and BiCGSTAB Kapil Ahuja Thesis submitted to the Faculty of the Virginia Polytechnic Institute and State University in partial fulfillment of the requirements

More information

Accelerating Model Reduction of Large Linear Systems with Graphics Processors

Accelerating Model Reduction of Large Linear Systems with Graphics Processors Accelerating Model Reduction of Large Linear Systems with Graphics Processors P. Benner 1, P. Ezzatti 2, D. Kressner 3, E.S. Quintana-Ortí 4, Alfredo Remón 4 1 Max-Plank-Institute for Dynamics of Complex

More information

Krylov Subspace Methods for Nonlinear Model Reduction

Krylov Subspace Methods for Nonlinear Model Reduction MAX PLANCK INSTITUT Conference in honour of Nancy Nichols 70th birthday Reading, 2 3 July 2012 Krylov Subspace Methods for Nonlinear Model Reduction Peter Benner and Tobias Breiten Max Planck Institute

More information

BALANCING-RELATED MODEL REDUCTION FOR DATA-SPARSE SYSTEMS

BALANCING-RELATED MODEL REDUCTION FOR DATA-SPARSE SYSTEMS BALANCING-RELATED Peter Benner Professur Mathematik in Industrie und Technik Fakultät für Mathematik Technische Universität Chemnitz Computational Methods with Applications Harrachov, 19 25 August 2007

More information

Efficient Implementation of Large Scale Lyapunov and Riccati Equation Solvers

Efficient Implementation of Large Scale Lyapunov and Riccati Equation Solvers Efficient Implementation of Large Scale Lyapunov and Riccati Equation Solvers Jens Saak joint work with Peter Benner (MiIT) Professur Mathematik in Industrie und Technik (MiIT) Fakultät für Mathematik

More information

c 2017 Society for Industrial and Applied Mathematics

c 2017 Society for Industrial and Applied Mathematics SIAM J. MATRIX ANAL. APPL. Vol. 38, No. 2, pp. 401 424 c 2017 Society for Industrial and Applied Mathematics PRECONDITIONED MULTISHIFT BiCG FOR H 2 -OPTIMAL MODEL REDUCTION MIAN ILYAS AHMAD, DANIEL B.

More information

Moving Frontiers in Model Reduction Using Numerical Linear Algebra

Moving Frontiers in Model Reduction Using Numerical Linear Algebra Using Numerical Linear Algebra Max-Planck-Institute for Dynamics of Complex Technical Systems Computational Methods in Systems and Control Theory Group Magdeburg, Germany Technische Universität Chemnitz

More information

Notes on Some Methods for Solving Linear Systems

Notes on Some Methods for Solving Linear Systems Notes on Some Methods for Solving Linear Systems Dianne P. O Leary, 1983 and 1999 and 2007 September 25, 2007 When the matrix A is symmetric and positive definite, we have a whole new class of algorithms

More information

Lecture 17: Iterative Methods and Sparse Linear Algebra

Lecture 17: Iterative Methods and Sparse Linear Algebra Lecture 17: Iterative Methods and Sparse Linear Algebra David Bindel 25 Mar 2014 Logistics HW 3 extended to Wednesday after break HW 4 should come out Monday after break Still need project description

More information

Inexact Solves in Krylov-based Model Reduction

Inexact Solves in Krylov-based Model Reduction Inexact Solves in Krylov-based Model Reduction Christopher A. Beattie and Serkan Gugercin Abstract We investigate the use of inexact solves in a Krylov-based model reduction setting and present the resulting

More information

Preconditioned inverse iteration and shift-invert Arnoldi method

Preconditioned inverse iteration and shift-invert Arnoldi method Preconditioned inverse iteration and shift-invert Arnoldi method Melina Freitag Department of Mathematical Sciences University of Bath CSC Seminar Max-Planck-Institute for Dynamics of Complex Technical

More information

Hot-Starting NLP Solvers

Hot-Starting NLP Solvers Hot-Starting NLP Solvers Andreas Wächter Department of Industrial Engineering and Management Sciences Northwestern University waechter@iems.northwestern.edu 204 Mixed Integer Programming Workshop Ohio

More information

Accelerating Linear Algebra on Heterogeneous Architectures of Multicore and GPUs using MAGMA and DPLASMA and StarPU Schedulers

Accelerating Linear Algebra on Heterogeneous Architectures of Multicore and GPUs using MAGMA and DPLASMA and StarPU Schedulers UT College of Engineering Tutorial Accelerating Linear Algebra on Heterogeneous Architectures of Multicore and GPUs using MAGMA and DPLASMA and StarPU Schedulers Stan Tomov 1, George Bosilca 1, and Cédric

More information

Model Reduction for Linear Dynamical Systems

Model Reduction for Linear Dynamical Systems Summer School on Numerical Linear Algebra for Dynamical and High-Dimensional Problems Trogir, October 10 15, 2011 Model Reduction for Linear Dynamical Systems Peter Benner Max Planck Institute for Dynamics

More information

Multilevel Low-Rank Preconditioners Yousef Saad Department of Computer Science and Engineering University of Minnesota. Modelling 2014 June 2,2014

Multilevel Low-Rank Preconditioners Yousef Saad Department of Computer Science and Engineering University of Minnesota. Modelling 2014 June 2,2014 Multilevel Low-Rank Preconditioners Yousef Saad Department of Computer Science and Engineering University of Minnesota Modelling 24 June 2,24 Dedicated to Owe Axelsson at the occasion of his 8th birthday

More information

A connection between time domain model order reduction and moment matching for LTI systems

A connection between time domain model order reduction and moment matching for LTI systems Mathematical and Computer Modelling of Dynamical Systems Methods, Tools and Applications in Engineering and Related Sciences ISSN: 1387-3954 (Print) 1744-551 (Online) Journal homepage: http://www.tandfonline.com/loi/nmcm

More information

Incomplete LU Preconditioning and Error Compensation Strategies for Sparse Matrices

Incomplete LU Preconditioning and Error Compensation Strategies for Sparse Matrices Incomplete LU Preconditioning and Error Compensation Strategies for Sparse Matrices Eun-Joo Lee Department of Computer Science, East Stroudsburg University of Pennsylvania, 327 Science and Technology Center,

More information

Linear Solvers. Andrew Hazel

Linear Solvers. Andrew Hazel Linear Solvers Andrew Hazel Introduction Thus far we have talked about the formulation and discretisation of physical problems...... and stopped when we got to a discrete linear system of equations. Introduction

More information

Solving PDEs with CUDA Jonathan Cohen

Solving PDEs with CUDA Jonathan Cohen Solving PDEs with CUDA Jonathan Cohen jocohen@nvidia.com NVIDIA Research PDEs (Partial Differential Equations) Big topic Some common strategies Focus on one type of PDE in this talk Poisson Equation Linear

More information

AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 23: GMRES and Other Krylov Subspace Methods; Preconditioning

AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 23: GMRES and Other Krylov Subspace Methods; Preconditioning AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 23: GMRES and Other Krylov Subspace Methods; Preconditioning Xiangmin Jiao SUNY Stony Brook Xiangmin Jiao Numerical Analysis I 1 / 18 Outline

More information

AMS Mathematics Subject Classification : 65F10,65F50. Key words and phrases: ILUS factorization, preconditioning, Schur complement, 1.

AMS Mathematics Subject Classification : 65F10,65F50. Key words and phrases: ILUS factorization, preconditioning, Schur complement, 1. J. Appl. Math. & Computing Vol. 15(2004), No. 1, pp. 299-312 BILUS: A BLOCK VERSION OF ILUS FACTORIZATION DAVOD KHOJASTEH SALKUYEH AND FAEZEH TOUTOUNIAN Abstract. ILUS factorization has many desirable

More information

Model Reduction for Unstable Systems

Model Reduction for Unstable Systems Model Reduction for Unstable Systems Klajdi Sinani Virginia Tech klajdi@vt.edu Advisor: Serkan Gugercin October 22, 2015 (VT) SIAM October 22, 2015 1 / 26 Overview 1 Introduction 2 Interpolatory Model

More information

IDR(s) A family of simple and fast algorithms for solving large nonsymmetric systems of linear equations

IDR(s) A family of simple and fast algorithms for solving large nonsymmetric systems of linear equations IDR(s) A family of simple and fast algorithms for solving large nonsymmetric systems of linear equations Harrachov 2007 Peter Sonneveld en Martin van Gijzen August 24, 2007 1 Delft University of Technology

More information

Structure preserving Krylov-subspace methods for Lyapunov equations

Structure preserving Krylov-subspace methods for Lyapunov equations Structure preserving Krylov-subspace methods for Lyapunov equations Matthias Bollhöfer, André Eppler Institute Computational Mathematics TU Braunschweig MoRePas Workshop, Münster September 17, 2009 System

More information

Linear and Nonlinear Matrix Equations Arising in Model Reduction

Linear and Nonlinear Matrix Equations Arising in Model Reduction International Conference on Numerical Linear Algebra and its Applications Guwahati, January 15 18, 2013 Linear and Nonlinear Matrix Equations Arising in Model Reduction Peter Benner Tobias Breiten Max

More information

On the numerical solution of large-scale linear matrix equations. V. Simoncini

On the numerical solution of large-scale linear matrix equations. V. Simoncini On the numerical solution of large-scale linear matrix equations V. Simoncini Dipartimento di Matematica, Università di Bologna (Italy) valeria.simoncini@unibo.it 1 Some matrix equations Sylvester matrix

More information

Fine-grained Parallel Incomplete LU Factorization

Fine-grained Parallel Incomplete LU Factorization Fine-grained Parallel Incomplete LU Factorization Edmond Chow School of Computational Science and Engineering Georgia Institute of Technology Sparse Days Meeting at CERFACS June 5-6, 2014 Contribution

More information

FINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION

FINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION FINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION EDMOND CHOW AND AFTAB PATEL Abstract. This paper presents a new fine-grained parallel algorithm for computing an incomplete LU factorization. All nonzeros

More information

Solving Large Nonlinear Sparse Systems

Solving Large Nonlinear Sparse Systems Solving Large Nonlinear Sparse Systems Fred W. Wubs and Jonas Thies Computational Mechanics & Numerical Mathematics University of Groningen, the Netherlands f.w.wubs@rug.nl Centre for Interdisciplinary

More information

The new challenges to Krylov subspace methods Yousef Saad Department of Computer Science and Engineering University of Minnesota

The new challenges to Krylov subspace methods Yousef Saad Department of Computer Science and Engineering University of Minnesota The new challenges to Krylov subspace methods Yousef Saad Department of Computer Science and Engineering University of Minnesota SIAM Applied Linear Algebra Valencia, June 18-22, 2012 Introduction Krylov

More information

Optimal Approximation of Dynamical Systems with Rational Krylov Methods. Department of Mathematics, Virginia Tech, USA

Optimal Approximation of Dynamical Systems with Rational Krylov Methods. Department of Mathematics, Virginia Tech, USA Optimal Approximation of Dynamical Systems with Rational Krylov Methods Serkan Güǧercin Department of Mathematics, Virginia Tech, USA Chris Beattie and Virginia Tech. jointly with Thanos Antoulas Rice

More information

Block Bidiagonal Decomposition and Least Squares Problems

Block Bidiagonal Decomposition and Least Squares Problems Block Bidiagonal Decomposition and Least Squares Problems Åke Björck Department of Mathematics Linköping University Perspectives in Numerical Analysis, Helsinki, May 27 29, 2008 Outline Bidiagonal Decomposition

More information

Iterative Methods for Sparse Linear Systems

Iterative Methods for Sparse Linear Systems Iterative Methods for Sparse Linear Systems Luca Bergamaschi e-mail: berga@dmsa.unipd.it - http://www.dmsa.unipd.it/ berga Department of Mathematical Methods and Models for Scientific Applications University

More information

Chapter 7 Iterative Techniques in Matrix Algebra

Chapter 7 Iterative Techniques in Matrix Algebra Chapter 7 Iterative Techniques in Matrix Algebra Per-Olof Persson persson@berkeley.edu Department of Mathematics University of California, Berkeley Math 128B Numerical Analysis Vector Norms Definition

More information

Krylov Space Solvers

Krylov Space Solvers Seminar for Applied Mathematics ETH Zurich International Symposium on Frontiers of Computational Science Nagoya, 12/13 Dec. 2005 Sparse Matrices Large sparse linear systems of equations or large sparse

More information

TR A Comparison of the Performance of SaP::GPU and Intel s Math Kernel Library (MKL) for Solving Dense Banded Linear Systems

TR A Comparison of the Performance of SaP::GPU and Intel s Math Kernel Library (MKL) for Solving Dense Banded Linear Systems TR-0-07 A Comparison of the Performance of ::GPU and Intel s Math Kernel Library (MKL) for Solving Dense Banded Linear Systems Ang Li, Omkar Deshmukh, Radu Serban, Dan Negrut May, 0 Abstract ::GPU is a

More information

FINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION

FINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION FINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION EDMOND CHOW AND AFTAB PATEL Abstract. This paper presents a new fine-grained parallel algorithm for computing an incomplete LU factorization. All nonzeros

More information

1 Conjugate gradients

1 Conjugate gradients Notes for 2016-11-18 1 Conjugate gradients We now turn to the method of conjugate gradients (CG), perhaps the best known of the Krylov subspace solvers. The CG iteration can be characterized as the iteration

More information

Order reduction numerical methods for the algebraic Riccati equation. V. Simoncini

Order reduction numerical methods for the algebraic Riccati equation. V. Simoncini Order reduction numerical methods for the algebraic Riccati equation V. Simoncini Dipartimento di Matematica Alma Mater Studiorum - Università di Bologna valeria.simoncini@unibo.it 1 The problem Find X

More information

Iterative Methods for Solving A x = b

Iterative Methods for Solving A x = b Iterative Methods for Solving A x = b A good (free) online source for iterative methods for solving A x = b is given in the description of a set of iterative solvers called templates found at netlib: http

More information

Downloaded 07/31/12 to Redistribution subject to SIAM license or copyright; see

Downloaded 07/31/12 to Redistribution subject to SIAM license or copyright; see SIAM J SCI COMPUT Vol 34, No 4, pp A1925 A1949 c 2012 Society for Industrial and Applied Mathematics RECYCLING BICG WITH AN APPLICATION TO MODEL REDUCTION KAPIL AHUJA, ERIC DE STURLER, SERKAN GUGERCIN,

More information

M.A. Botchev. September 5, 2014

M.A. Botchev. September 5, 2014 Rome-Moscow school of Matrix Methods and Applied Linear Algebra 2014 A short introduction to Krylov subspaces for linear systems, matrix functions and inexact Newton methods. Plan and exercises. M.A. Botchev

More information

EIGIFP: A MATLAB Program for Solving Large Symmetric Generalized Eigenvalue Problems

EIGIFP: A MATLAB Program for Solving Large Symmetric Generalized Eigenvalue Problems EIGIFP: A MATLAB Program for Solving Large Symmetric Generalized Eigenvalue Problems JAMES H. MONEY and QIANG YE UNIVERSITY OF KENTUCKY eigifp is a MATLAB program for computing a few extreme eigenvalues

More information

Fast Frequency Response Analysis using Model Order Reduction

Fast Frequency Response Analysis using Model Order Reduction Fast Frequency Response Analysis using Model Order Reduction Peter Benner EU MORNET Exploratory Workshop Applications of Model Order Reduction Methods in Industrial Research and Development Luxembourg,

More information

Lab 1: Iterative Methods for Solving Linear Systems

Lab 1: Iterative Methods for Solving Linear Systems Lab 1: Iterative Methods for Solving Linear Systems January 22, 2017 Introduction Many real world applications require the solution to very large and sparse linear systems where direct methods such as

More information

arxiv: v1 [hep-lat] 19 Jul 2009

arxiv: v1 [hep-lat] 19 Jul 2009 arxiv:0907.3261v1 [hep-lat] 19 Jul 2009 Application of preconditioned block BiCGGR to the Wilson-Dirac equation with multiple right-hand sides in lattice QCD Abstract H. Tadano a,b, Y. Kuramashi c,b, T.

More information

Model Reduction for Dynamical Systems

Model Reduction for Dynamical Systems Otto-von-Guericke Universität Magdeburg Faculty of Mathematics Summer term 2015 Model Reduction for Dynamical Systems Lecture 8 Peter Benner Lihong Feng Max Planck Institute for Dynamics of Complex Technical

More information

High Performance Nonlinear Solvers

High Performance Nonlinear Solvers What is a nonlinear system? High Performance Nonlinear Solvers Michael McCourt Division Argonne National Laboratory IIT Meshfree Seminar September 19, 2011 Every nonlinear system of equations can be described

More information

FEM and sparse linear system solving

FEM and sparse linear system solving FEM & sparse linear system solving, Lecture 9, Nov 19, 2017 1/36 Lecture 9, Nov 17, 2017: Krylov space methods http://people.inf.ethz.ch/arbenz/fem17 Peter Arbenz Computer Science Department, ETH Zürich

More information

The rational Krylov subspace for parameter dependent systems. V. Simoncini

The rational Krylov subspace for parameter dependent systems. V. Simoncini The rational Krylov subspace for parameter dependent systems V. Simoncini Dipartimento di Matematica, Università di Bologna valeria.simoncini@unibo.it 1 Given the continuous-time system Motivation. Model

More information

An Asynchronous Algorithm on NetSolve Global Computing System

An Asynchronous Algorithm on NetSolve Global Computing System An Asynchronous Algorithm on NetSolve Global Computing System Nahid Emad S. A. Shahzadeh Fazeli Jack Dongarra March 30, 2004 Abstract The Explicitly Restarted Arnoldi Method (ERAM) allows to find a few

More information

ADI-preconditioned FGMRES for solving large generalized Lyapunov equations - A case study

ADI-preconditioned FGMRES for solving large generalized Lyapunov equations - A case study -preconditioned for large - A case study Matthias Bollhöfer, André Eppler TU Braunschweig Institute Computational Mathematics Syrene-MOR Workshop, TU Hamburg October 30, 2008 2 / 20 Outline 1 2 Overview

More information

On the Preconditioning of the Block Tridiagonal Linear System of Equations

On the Preconditioning of the Block Tridiagonal Linear System of Equations On the Preconditioning of the Block Tridiagonal Linear System of Equations Davod Khojasteh Salkuyeh Department of Mathematics, University of Mohaghegh Ardabili, PO Box 179, Ardabil, Iran E-mail: khojaste@umaacir

More information

Finite-choice algorithm optimization in Conjugate Gradients

Finite-choice algorithm optimization in Conjugate Gradients Finite-choice algorithm optimization in Conjugate Gradients Jack Dongarra and Victor Eijkhout January 2003 Abstract We present computational aspects of mathematically equivalent implementations of the

More information

arxiv: v1 [hep-lat] 2 May 2012

arxiv: v1 [hep-lat] 2 May 2012 A CG Method for Multiple Right Hand Sides and Multiple Shifts in Lattice QCD Calculations arxiv:1205.0359v1 [hep-lat] 2 May 2012 Fachbereich C, Mathematik und Naturwissenschaften, Bergische Universität

More information

Solving Ax = b, an overview. Program

Solving Ax = b, an overview. Program Numerical Linear Algebra Improving iterative solvers: preconditioning, deflation, numerical software and parallelisation Gerard Sleijpen and Martin van Gijzen November 29, 27 Solving Ax = b, an overview

More information

Balanced Truncation Model Reduction of Large and Sparse Generalized Linear Systems

Balanced Truncation Model Reduction of Large and Sparse Generalized Linear Systems Balanced Truncation Model Reduction of Large and Sparse Generalized Linear Systems Jos M. Badía 1, Peter Benner 2, Rafael Mayo 1, Enrique S. Quintana-Ortí 1, Gregorio Quintana-Ortí 1, A. Remón 1 1 Depto.

More information

Two-sided Eigenvalue Algorithms for Modal Approximation

Two-sided Eigenvalue Algorithms for Modal Approximation Two-sided Eigenvalue Algorithms for Modal Approximation Master s thesis submitted to Faculty of Mathematics at Chemnitz University of Technology presented by: Supervisor: Advisor: B.sc. Patrick Kürschner

More information

Multigrid absolute value preconditioning

Multigrid absolute value preconditioning Multigrid absolute value preconditioning Eugene Vecharynski 1 Andrew Knyazev 2 (speaker) 1 Department of Computer Science and Engineering University of Minnesota 2 Department of Mathematical and Statistical

More information

c 2015 Society for Industrial and Applied Mathematics

c 2015 Society for Industrial and Applied Mathematics SIAM J. SCI. COMPUT. Vol. 37, No. 2, pp. C169 C193 c 2015 Society for Industrial and Applied Mathematics FINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION EDMOND CHOW AND AFTAB PATEL Abstract. This paper

More information

The flexible incomplete LU preconditioner for large nonsymmetric linear systems. Takatoshi Nakamura Takashi Nodera

The flexible incomplete LU preconditioner for large nonsymmetric linear systems. Takatoshi Nakamura Takashi Nodera Research Report KSTS/RR-15/006 The flexible incomplete LU preconditioner for large nonsymmetric linear systems by Takatoshi Nakamura Takashi Nodera Takatoshi Nakamura School of Fundamental Science and

More information

AMS526: Numerical Analysis I (Numerical Linear Algebra)

AMS526: Numerical Analysis I (Numerical Linear Algebra) AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 23: GMRES and Other Krylov Subspace Methods Xiangmin Jiao SUNY Stony Brook Xiangmin Jiao Numerical Analysis I 1 / 9 Minimizing Residual CG

More information

Robust Preconditioned Conjugate Gradient for the GPU and Parallel Implementations

Robust Preconditioned Conjugate Gradient for the GPU and Parallel Implementations Robust Preconditioned Conjugate Gradient for the GPU and Parallel Implementations Rohit Gupta, Martin van Gijzen, Kees Vuik GPU Technology Conference 2012, San Jose CA. GPU Technology Conference 2012,

More information

Department of Computer Science, University of Illinois at Urbana-Champaign

Department of Computer Science, University of Illinois at Urbana-Champaign Department of Computer Science, University of Illinois at Urbana-Champaign Probing for Schur Complements and Preconditioning Generalized Saddle-Point Problems Eric de Sturler, sturler@cs.uiuc.edu, http://www-faculty.cs.uiuc.edu/~sturler

More information

Eigenvalue problems III: Advanced Numerical Methods

Eigenvalue problems III: Advanced Numerical Methods Eigenvalue problems III: Advanced Numerical Methods Sam Sinayoko Computational Methods 10 Contents 1 Learning Outcomes 2 2 Introduction 2 3 Inverse Power method: finding the smallest eigenvalue of a symmetric

More information

Jacobi-Davidson Eigensolver in Cusolver Library. Lung-Sheng Chien, NVIDIA

Jacobi-Davidson Eigensolver in Cusolver Library. Lung-Sheng Chien, NVIDIA Jacobi-Davidson Eigensolver in Cusolver Library Lung-Sheng Chien, NVIDIA lchien@nvidia.com Outline CuSolver library - cusolverdn: dense LAPACK - cusolversp: sparse LAPACK - cusolverrf: refactorization

More information

Inexact Solves in Interpolatory Model Reduction

Inexact Solves in Interpolatory Model Reduction Inexact Solves in Interpolatory Model Reduction Sarah Wyatt Thesis submitted to the Faculty of the Virginia Polytechnic Institute and State University in partial fulfillment of the requirements for the

More information

Preconditioned multishift BiCG for H 2 -optimal model reduction. Mian Ilyas Ahmad, Daniel B. Szyld, and Martin B. van Gijzen

Preconditioned multishift BiCG for H 2 -optimal model reduction. Mian Ilyas Ahmad, Daniel B. Szyld, and Martin B. van Gijzen Preconditioned multishift BiCG for H 2 -optimal model reduction Mian Ilyas Ahmad, Daniel B. Szyld, and Martin B. van Gijzen Report 12-06-15 Department of Mathematics Temple University June 2012. Revised

More information

Key words. conjugate gradients, normwise backward error, incremental norm estimation.

Key words. conjugate gradients, normwise backward error, incremental norm estimation. Proceedings of ALGORITMY 2016 pp. 323 332 ON ERROR ESTIMATION IN THE CONJUGATE GRADIENT METHOD: NORMWISE BACKWARD ERROR PETR TICHÝ Abstract. Using an idea of Duff and Vömel [BIT, 42 (2002), pp. 300 322

More information

Krylov-Subspace Based Model Reduction of Nonlinear Circuit Models Using Bilinear and Quadratic-Linear Approximations

Krylov-Subspace Based Model Reduction of Nonlinear Circuit Models Using Bilinear and Quadratic-Linear Approximations Krylov-Subspace Based Model Reduction of Nonlinear Circuit Models Using Bilinear and Quadratic-Linear Approximations Peter Benner and Tobias Breiten Abstract We discuss Krylov-subspace based model reduction

More information

Parallel programming practices for the solution of Sparse Linear Systems (motivated by computational physics and graphics)

Parallel programming practices for the solution of Sparse Linear Systems (motivated by computational physics and graphics) Parallel programming practices for the solution of Sparse Linear Systems (motivated by computational physics and graphics) Eftychios Sifakis CS758 Guest Lecture - 19 Sept 2012 Introduction Linear systems

More information

Review: From problem to parallel algorithm

Review: From problem to parallel algorithm Review: From problem to parallel algorithm Mathematical formulations of interesting problems abound Poisson s equation Sources: Electrostatics, gravity, fluid flow, image processing (!) Numerical solution:

More information

Iterative Methods and Multigrid

Iterative Methods and Multigrid Iterative Methods and Multigrid Part 3: Preconditioning 2 Eric de Sturler Preconditioning The general idea behind preconditioning is that convergence of some method for the linear system Ax = b can be

More information

Iterative Rational Krylov Algorithm for Unstable Dynamical Systems and Generalized Coprime Factorizations

Iterative Rational Krylov Algorithm for Unstable Dynamical Systems and Generalized Coprime Factorizations Iterative Rational Krylov Algorithm for Unstable Dynamical Systems and Generalized Coprime Factorizations Klajdi Sinani Thesis submitted to the Faculty of the Virginia Polytechnic Institute and State University

More information

Energy Consumption Evaluation for Krylov Methods on a Cluster of GPU Accelerators

Energy Consumption Evaluation for Krylov Methods on a Cluster of GPU Accelerators PARIS-SACLAY, FRANCE Energy Consumption Evaluation for Krylov Methods on a Cluster of GPU Accelerators Serge G. Petiton a and Langshi Chen b April the 6 th, 2016 a Université de Lille, Sciences et Technologies

More information

Preconditioners for the incompressible Navier Stokes equations

Preconditioners for the incompressible Navier Stokes equations Preconditioners for the incompressible Navier Stokes equations C. Vuik M. ur Rehman A. Segal Delft Institute of Applied Mathematics, TU Delft, The Netherlands SIAM Conference on Computational Science and

More information

Preface to the Second Edition. Preface to the First Edition

Preface to the Second Edition. Preface to the First Edition n page v Preface to the Second Edition Preface to the First Edition xiii xvii 1 Background in Linear Algebra 1 1.1 Matrices................................. 1 1.2 Square Matrices and Eigenvalues....................

More information

Incomplete Cholesky preconditioners that exploit the low-rank property

Incomplete Cholesky preconditioners that exploit the low-rank property anapov@ulb.ac.be ; http://homepages.ulb.ac.be/ anapov/ 1 / 35 Incomplete Cholesky preconditioners that exploit the low-rank property (theory and practice) Artem Napov Service de Métrologie Nucléaire, Université

More information

Multilevel low-rank approximation preconditioners Yousef Saad Department of Computer Science and Engineering University of Minnesota

Multilevel low-rank approximation preconditioners Yousef Saad Department of Computer Science and Engineering University of Minnesota Multilevel low-rank approximation preconditioners Yousef Saad Department of Computer Science and Engineering University of Minnesota SIAM CSE Boston - March 1, 2013 First: Joint work with Ruipeng Li Work

More information

Topics. The CG Algorithm Algorithmic Options CG s Two Main Convergence Theorems

Topics. The CG Algorithm Algorithmic Options CG s Two Main Convergence Theorems Topics The CG Algorithm Algorithmic Options CG s Two Main Convergence Theorems What about non-spd systems? Methods requiring small history Methods requiring large history Summary of solvers 1 / 52 Conjugate

More information

A POD Projection Method for Large-Scale Algebraic Riccati Equations

A POD Projection Method for Large-Scale Algebraic Riccati Equations A POD Projection Method for Large-Scale Algebraic Riccati Equations Boris Kramer Department of Mathematics Virginia Tech Blacksburg, VA 24061-0123 Email: bokr@vt.edu John R. Singler Department of Mathematics

More information

Algebraic Multigrid as Solvers and as Preconditioner

Algebraic Multigrid as Solvers and as Preconditioner Ò Algebraic Multigrid as Solvers and as Preconditioner Domenico Lahaye domenico.lahaye@cs.kuleuven.ac.be http://www.cs.kuleuven.ac.be/ domenico/ Department of Computer Science Katholieke Universiteit Leuven

More information