Towards high performance IRKA on hybrid CPU-GPU systems
|
|
- Scarlett Fleming
- 5 years ago
- Views:
Transcription
1 Towards high performance IRKA on hybrid CPU-GPU systems Jens Saak in collaboration with Georg Pauer (OVGU/MPI Magdeburg) Kapil Ahuja, Ruchir Garg (IIT Indore) Hartwig Anzt, Jack Dongarra (ICL Uni Tennessee Knoxville) Edmond Chow (Georgia Tech) 17th GAMM Workshop on Applied and Numerical Linear Algebra Cologne, Germany September 7-8, 2017
2 Introduction Projection based Model Order Reduction (MOR) Linear time-invariant (LTI) system: E ẋ(t) = A x(t) + B u(t) y(t) = C x(t) E r = W T EV A r = W T AV B r = W T B C r = CV x(t) R n, x r(t) R r, u R p, y, ỹ R q Reduced order model (ROM): E r ẋ r (t) = A r x r (t) + B r u(t) ỹ(t) = C r x r (t) Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 2/15
3 Introduction Description of Today s Focus Problem MOR of LTI systems often employs the rational Krylov subspace method (RKSM) of some flavor: Moment Matching IRKA Balanced Truncation (V, W span rational Krylov subspaces), (iterative rational Krylov algorithm) (computing the Gramians via ADI or RKSM) Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 3/15
4 Introduction Description of Today s Focus Problem MOR of LTI systems often employs the rational Krylov subspace method (RKSM) of some flavor: Moment Matching IRKA Balanced Truncation (V, W span rational Krylov subspaces), (iterative rational Krylov algorithm) (computing the Gramians via ADI or RKSM) IRKA in a nutshell e.g.[gugercin/antoulas/beattie 08] 1. Guess initial ROM or initial shifts, 2. Compute span V = span{(σ 1 E A) 1 B,..., (σ r E A) 1 B}, span W = span{(σ 1 E T A T ) 1 C T,..., (σ r E T A T ) 1 C T }. 3. Update ROM, set σ = Λ(A r, E r ), repeat until σ has converged. Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 3/15
5 Introduction Description of Today s Focus Problem MOR of LTI systems often employs the rational Krylov subspace method (RKSM) of some flavor: Moment Matching IRKA Balanced Truncation Dominant computation in all these methods Sequences of Shifted Linear Systems (SLS) (V, W span rational Krylov subspaces), (iterative rational Krylov algorithm) (computing the Gramians via ADI or RKSM) (A + σ j E) X j = Y j, for j I N, with real data A R n n, E R n n, Y j R n m, m n, but possibly complex shifts σ j C. Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 3/15
6 Introduction Description of Today s Focus Problem MOR of LTI systems often employs the rational Krylov subspace method (RKSM) of some flavor: Moment Matching IRKA Balanced Truncation Dominant computation in all these methods Sequences of Shifted Linear Systems (SLS) (V, W span rational Krylov subspaces), (iterative rational Krylov algorithm) (computing the Gramians via ADI or RKSM) P j (A + σ j E) X j = P j Y j, for j I N, with real data A R n n, E R n n, Y j R n m, m n, but possibly complex shifts σ j C. Preconditioner P j R n n usually needs update for every j. Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 3/15
7 Introduction Strategies for the switched linear Systems Sequences of Shifted Linear Systems (SLS) P j (A + σ j E) X j = P j Y j, for j I N, We aim at using iterative solvers on hybrid CPU-GPU systems. Efficiency improving strategies 1. recycle / realign preconditioner P j 2. recycle solvers / subspaces 3. solve shifted linear systems in parallel Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 4/15
8 Introduction Strategies for the switched linear Systems Sequences of Shifted Linear Systems (SLS) P j (A + σ j E) X j = P j Y j, for j I N, We aim at using iterative solvers on hybrid CPU-GPU systems. Efficiency improving strategies 1. recycle / realign preconditioner P j 2. recycle solvers / subspaces extending [Anzt/Chow/S./Dongarra 16] 3. solve shifted linear systems in parallel Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 4/15
9 Introduction Strategies for the switched linear Systems Sequences of Shifted Linear Systems (SLS) P j (A + σ j E) X j = P j Y j, for j I N, We aim at using iterative solvers on hybrid CPU-GPU systems. Efficiency improving strategies 1. recycle / realign preconditioner P j 2. recycle solvers / subspaces extending [Anzt/Chow/S./Dongarra 16] following [Ahuja/de Sturler/Gugercin/Chang 12] 3. solve shifted linear systems in parallel Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 4/15
10 Agenda 1. Introduction Projection based Model Order Reduction (MOR) Description of Today s Focus Problem Strategies for the switched linear Systems 2. Reusing and Realigning Preconditioners Prior Work and Our Approach The Real aka. Symmetric Case The Non-Symmetric aka. Complex Case Structure Exploiting Preconditioner Experiments 3. Recycling Solvers and Subspaces Background Preliminary Results 4. Parallel Solution of the Shifted Systems Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 5/15
11 Reusing and Realigning Preconditioners Prior Work and Our Approach 1. reuse preconditioner for small matrix changes, i.e. similar shifts 2. update incomplete factorization preconditioner [Bellavia/De Simone/di Serafino/Morini 11, Benzi/Bertaccini 03, Bertaccini 04, Calgaro/Chehab/Saad 10, Duintjer Tebbens/Tüma 07] Main Issues reuses reduce efficiency of the preconditioner updates have limited potential for parallel execution Goal highly parallel preconditioner update and application Jens Saak, Towards hybrid CPU-GPU IRKA 6/15
12 Reusing and Realigning Preconditioners The Real aka. Symmetric Case [Anzt/Chow/S ] Preconditioned iterative solution of Sequences of Shifted Linear Systems (SLS) P j (A + σ j E) X j = P j Y j, for j I N, for A, E symmetric and σ j real. Idea: Minimize A U T U via fixed point approach [Chow/et al. 15] Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 7/15
13 Reusing and Realigning Preconditioners The Real aka. Symmetric Case [Anzt/Chow/S ] Preconditioned iterative solution of Sequences of Shifted Linear Systems (SLS) P j (A + σ j E) X j = P j Y j, for j I N, for A, E symmetric and σ j real. Idea: Minimize A U T U via fixed point approach [Chow/et al. 15] Initialization If σ j σ j 1 is small enough, then P j P j 1 is small P j 1 good initial guess for P j. Use [Chow/Patel 15, Chow/Anzt/Dongarra 15] for (asynchronous) iterative computation of incomplete factorization preconditioners. Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 7/15
14 Reusing and Realigning Preconditioners The Real aka. Symmetric Case Time [ms] Speedup Number of sweeps Number of sweeps IC Test example: Oberwolfach Collection Steel Cooling Benchmark[Benner/S. 05] here using the FEniCS reimplementation by Maximilian Behr Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 8/15
15 Reusing and Realigning Preconditioners The Non-Symmetric aka. Complex Case Consider Ax = b for A C n n, x, b C n. With A r, A i R n n, x r, x i, b r, b i R n write A = A r + ıa i, x = x r + ıx i, b = b r + ıb i. Idea e.g.[day/heroux 01, Benzi/Bertaccini 08] Rewrite to real linear system [ ] [ ] [ ] Ar A i xr br = A i A r x i b i and employ block preconditioner. Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 9/15
16 Reusing and Realigning Preconditioners Structure Exploiting Preconditioner In our special case A j,r = A + Re(σ j ) E, A j,i = Im(σ j ) E Let Then A j Ṽ j = W j [ ] 0 Im(σ J j = j ). Im(σ j ) 0 [ ] [ ] Aj,r A j,i Vj,r = A j,i A j,r V j,i [ Wj,r W j,i (I 2 A j,r + J j E)Ṽj = W j ] Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 10/15
17 Reusing and Realigning Preconditioners Structure Exploiting Preconditioner In our special case A j,r = A + Re(σ j ) E, A j,i = Im(σ j ) E Let Then A j Ṽ j = W j [ ] 0 Im(σ J j = j ). Im(σ j ) 0 [ ] [ ] Aj,r A j,i Vj,r = A j,i A j,r V j,i [ Wj,r W j,i (I 2 A j,r + J j E)Ṽj = W j ( ) Idea: P := (I 2 A j,r ) 1 = I 2 A 1 j,r ] Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 10/15
18 Reusing and Realigning Preconditioners Structure Exploiting Preconditioner In our special case A j,r = A + Re(σ j ) E, A j,i = Im(σ j ) E Let [ ] counter diagonal blocks 0 Im(σ J j = are: j ). Im(σ j ) 0 Then ± Im(σ j) (A + Re(σ j) E) 1 E = ± Im(σj) Re(σ (A + Re(σj) j) E) 1 (A + Re(σ j) E A) [ ] [ ] [ ] A j Ṽ j = W = ± Im(σj) ( Aj,r A In j j,i (A + Vj,r Re(σ j) E) Wj,r Re(σ j) = 1 A ). A j,i A j,r V j,i W j,i (I 2 A j,r + J j E)Ṽj = W j ( ) Idea: P := (I 2 A j,r ) 1 = I 2 A 1 j,r Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 10/15
19 Reusing and Realigning Preconditioners FEniCS Convect. Diffusion on Unit Square 1 n = 9801 P3-GMRES iterations for SLS (tol=10 8, maxiter=250) ı /10 2 / / ı no prec block ILU(0) block ILU(1) block ILU(2) block ILU(3) block \ full ILU(0) model data by Maximilian Behr Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 11/15
20 Recycling Solvers and Subspaces Recall IRKA needs: span V = span{(σ 1 E A) 1 B,..., (σ k E A) 1 B}, span W = span{(σ 1 E T A T ) 1 C T,..., (σ k E T A T ) 1 C T }. Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 12/15
21 Recycling Solvers and Subspaces Recall IRKA needs: span V = span{(σ 1 E A) 1 B,..., (σ k E A) 1 B}, span W = span{(σ 1 E T A T ) 1 C T,..., (σ k E T A T ) 1 C T }. Idea [Ahuja/de Sturler/Gugercin/Chang 12] Need to solve with dual operators Use BiCG Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 12/15
22 Recycling Solvers and Subspaces Recall IRKA needs: span V = span{(σ 1 E A) 1 B,..., (σ k E A) 1 B}, span W = span{(σ 1 E T A T ) 1 C T,..., (σ k E T A T ) 1 C T }. Idea [Ahuja/de Sturler/Gugercin/Chang 12] Need to solve with dual operators Use BiCG MIMO systems (i.e., p, q > 1) repeatedly solve with different right hand sides update subspace, i.e. recycle Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 12/15
23 Recycling Solvers and Subspaces Recall IRKA needs: span V = span{(σ 1 E A) 1 B,..., (σ k E A) 1 B}, span W = span{(σ 1 E T A T ) 1 C T,..., (σ k E T A T ) 1 C T }. Idea [Ahuja/de Sturler/Gugercin/Chang 12] Need to solve with dual operators Use BiCG MIMO systems (i.e., p, q > 1) repeatedly solve with different right hand sides update subspace, i.e. recycle upon convergence: σ (i) j σ (i+1) j employ deflation, i.e., recycle Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 12/15
24 Recycling Solvers and Subspaces Recall IRKA needs: span V = span{(σ 1 E A) 1 B,..., (σ k E A) 1 B}, span W = span{(σ 1 E T A T ) 1 C T,..., (σ k E T A T ) 1 C T }. Idea [Ahuja/de Sturler/Gugercin/Chang 12] Need to solve with dual operators Use BiCG MIMO systems (i.e., p, q > 1) repeatedly solve with different right hand sides update subspace, i.e. recycle upon convergence: σ (i) j σ (i+1) j employ deflation, i.e., recycle RBiCG for IRKA Current focus with Kapil Ahuja, Ruchir Garg and Georg Pauer: CUDA accelerated version Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 12/15
25 Recycling Solvers and Subspaces Preliminary Results MATLAB results [Ahuja/de Sturler/Gugercin/Chang 12] iteration count time(s) n σ r drop IRKA tol steps BiCG RBiCG Ratio BiCG RBiCG Ratio Test example: Oberwolfach Collection Steel Cooling Benchmark[Benner/S. 05] avilable from: 20Profiles%20% %29 Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 13/15
26 Next: Parallel Solution of the Shifted Systems Observation: Iterative solver needs sparse matrix vector product (A + σ j E)x j = y j. memory bandwidth bounded. Idea: Define: X = [x 1,..., x r ], Y = [y 1,..., y r ] and Σ = diag(σ) Then Y = AX + EXΣ, and sparse matrix vector product sparse-dense matrix matrix product 2r sparse matrix sweeps become 2 sparse matrix sweeps Requires short recurrence method (e.g. BiCG) to limit memory usage Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 14/15
27 Conclusions & Outlook Conclusions Outlook analysis of the problem structure gives rise to 3 promising implementation candidates Preconditioner realignment and Subspace recycling can even be combined proper HPC implementation pending comparison of the strategies Master Thesis Georg Pauer Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 15/15
28 Conclusions & Outlook Conclusions Outlook analysis of the problem structure gives rise to 3 promising implementation candidates Preconditioner realignment and Subspace recycling can even be combined proper HPC implementation pending comparison of the strategies Master Thesis Georg Pauer Thank you for your attention! Jens Saak, saak@mpi-magdeburg.mpg.de Towards hybrid CPU-GPU IRKA 15/15
Updating incomplete factorization preconditioners for model order reduction
DOI 10.1007/s11075-016-0110-2 ORIGINAL PAPER Updating incomplete factorization preconditioners for model order reduction Hartwig Anzt 1 Edmond Chow 2 Jens Saak 3 Jack Dongarra 1,4,5 Received: 18 September
More informationA Newton-Galerkin-ADI Method for Large-Scale Algebraic Riccati Equations
A Newton-Galerkin-ADI Method for Large-Scale Algebraic Riccati Equations Peter Benner Max-Planck-Institute for Dynamics of Complex Technical Systems Computational Methods in Systems and Control Theory
More informationMARCH 24-27, 2014 SAN JOSE, CA
MARCH 24-27, 2014 SAN JOSE, CA Sparse HPC on modern architectures Important scientific applications rely on sparse linear algebra HPCG a new benchmark proposal to complement Top500 (HPL) To solve A x =
More informationarxiv: v3 [math.na] 6 Jul 2018
A Connection Between Time Domain Model Order Reduction and Moment Matching for LTI Systems arxiv:181.785v3 [math.na] 6 Jul 218 Manuela Hund a and Jens Saak a a Max Planck Institute for Dynamics of Complex
More informationModel order reduction of large-scale dynamical systems with Jacobi-Davidson style eigensolvers
MAX PLANCK INSTITUTE International Conference on Communications, Computing and Control Applications March 3-5, 2011, Hammamet, Tunisia. Model order reduction of large-scale dynamical systems with Jacobi-Davidson
More informationFine-Grained Parallel Algorithms for Incomplete Factorization Preconditioning
Fine-Grained Parallel Algorithms for Incomplete Factorization Preconditioning Edmond Chow School of Computational Science and Engineering Georgia Institute of Technology, USA SPPEXA Symposium TU München,
More informationMODEL REDUCTION BY A CROSS-GRAMIAN APPROACH FOR DATA-SPARSE SYSTEMS
MODEL REDUCTION BY A CROSS-GRAMIAN APPROACH FOR DATA-SPARSE SYSTEMS Ulrike Baur joint work with Peter Benner Mathematics in Industry and Technology Faculty of Mathematics Chemnitz University of Technology
More informationFeeding of the Thousands. Leveraging the GPU's Computing Power for Sparse Linear Algebra
SPPEXA Annual Meeting 2016, January 25 th, 2016, Garching, Germany Feeding of the Thousands Leveraging the GPU's Computing Power for Sparse Linear Algebra Hartwig Anzt Sparse Linear Algebra on GPUs Inherently
More informationMax Planck Institute Magdeburg Preprints
Peter Benner Patrick Kürschner Jens Saak Real versions of low-rank ADI methods with complex shifts MAXPLANCKINSTITUT FÜR DYNAMIK KOMPLEXER TECHNISCHER SYSTEME MAGDEBURG Max Planck Institute Magdeburg Preprints
More informationModel Order Reduction via Matlab Parallel Computing Toolbox. Istanbul Technical University
Model Order Reduction via Matlab Parallel Computing Toolbox E. Fatih Yetkin & Hasan Dağ Istanbul Technical University Computational Science & Engineering Department September 21, 2009 E. Fatih Yetkin (Istanbul
More informationAccelerating linear algebra computations with hybrid GPU-multicore systems.
Accelerating linear algebra computations with hybrid GPU-multicore systems. Marc Baboulin INRIA/Université Paris-Sud joint work with Jack Dongarra (University of Tennessee and Oak Ridge National Laboratory)
More informationIterative Methods for Linear Systems of Equations
Iterative Methods for Linear Systems of Equations Projection methods (3) ITMAN PhD-course DTU 20-10-08 till 24-10-08 Martin van Gijzen 1 Delft University of Technology Overview day 4 Bi-Lanczos method
More informationContents. Preface... xi. Introduction...
Contents Preface... xi Introduction... xv Chapter 1. Computer Architectures... 1 1.1. Different types of parallelism... 1 1.1.1. Overlap, concurrency and parallelism... 1 1.1.2. Temporal and spatial parallelism
More informationApproximation of the Linearized Boussinesq Equations
Approximation of the Linearized Boussinesq Equations Alan Lattimer Advisors Jeffrey Borggaard Serkan Gugercin Department of Mathematics Virginia Tech SIAM Talk Series, Virginia Tech, April 22, 2014 Alan
More information9.1 Preconditioned Krylov Subspace Methods
Chapter 9 PRECONDITIONING 9.1 Preconditioned Krylov Subspace Methods 9.2 Preconditioned Conjugate Gradient 9.3 Preconditioned Generalized Minimal Residual 9.4 Relaxation Method Preconditioners 9.5 Incomplete
More informationAccelerated Solvers for CFD
Accelerated Solvers for CFD Co-Design of Hardware/Software for Predicting MAV Aerodynamics Eric de Sturler, Virginia Tech Mathematics Email: sturler@vt.edu Web: http://www.math.vt.edu/people/sturler Co-design
More informationAdaptive rational Krylov subspaces for large-scale dynamical systems. V. Simoncini
Adaptive rational Krylov subspaces for large-scale dynamical systems V. Simoncini Dipartimento di Matematica, Università di Bologna valeria@dm.unibo.it joint work with Vladimir Druskin, Schlumberger Doll
More informationRecycling Bi-Lanczos Algorithms: BiCG, CGS, and BiCGSTAB
Recycling Bi-Lanczos Algorithms: BiCG, CGS, and BiCGSTAB Kapil Ahuja Thesis submitted to the Faculty of the Virginia Polytechnic Institute and State University in partial fulfillment of the requirements
More informationAccelerating Model Reduction of Large Linear Systems with Graphics Processors
Accelerating Model Reduction of Large Linear Systems with Graphics Processors P. Benner 1, P. Ezzatti 2, D. Kressner 3, E.S. Quintana-Ortí 4, Alfredo Remón 4 1 Max-Plank-Institute for Dynamics of Complex
More informationKrylov Subspace Methods for Nonlinear Model Reduction
MAX PLANCK INSTITUT Conference in honour of Nancy Nichols 70th birthday Reading, 2 3 July 2012 Krylov Subspace Methods for Nonlinear Model Reduction Peter Benner and Tobias Breiten Max Planck Institute
More informationBALANCING-RELATED MODEL REDUCTION FOR DATA-SPARSE SYSTEMS
BALANCING-RELATED Peter Benner Professur Mathematik in Industrie und Technik Fakultät für Mathematik Technische Universität Chemnitz Computational Methods with Applications Harrachov, 19 25 August 2007
More informationEfficient Implementation of Large Scale Lyapunov and Riccati Equation Solvers
Efficient Implementation of Large Scale Lyapunov and Riccati Equation Solvers Jens Saak joint work with Peter Benner (MiIT) Professur Mathematik in Industrie und Technik (MiIT) Fakultät für Mathematik
More informationc 2017 Society for Industrial and Applied Mathematics
SIAM J. MATRIX ANAL. APPL. Vol. 38, No. 2, pp. 401 424 c 2017 Society for Industrial and Applied Mathematics PRECONDITIONED MULTISHIFT BiCG FOR H 2 -OPTIMAL MODEL REDUCTION MIAN ILYAS AHMAD, DANIEL B.
More informationMoving Frontiers in Model Reduction Using Numerical Linear Algebra
Using Numerical Linear Algebra Max-Planck-Institute for Dynamics of Complex Technical Systems Computational Methods in Systems and Control Theory Group Magdeburg, Germany Technische Universität Chemnitz
More informationNotes on Some Methods for Solving Linear Systems
Notes on Some Methods for Solving Linear Systems Dianne P. O Leary, 1983 and 1999 and 2007 September 25, 2007 When the matrix A is symmetric and positive definite, we have a whole new class of algorithms
More informationLecture 17: Iterative Methods and Sparse Linear Algebra
Lecture 17: Iterative Methods and Sparse Linear Algebra David Bindel 25 Mar 2014 Logistics HW 3 extended to Wednesday after break HW 4 should come out Monday after break Still need project description
More informationInexact Solves in Krylov-based Model Reduction
Inexact Solves in Krylov-based Model Reduction Christopher A. Beattie and Serkan Gugercin Abstract We investigate the use of inexact solves in a Krylov-based model reduction setting and present the resulting
More informationPreconditioned inverse iteration and shift-invert Arnoldi method
Preconditioned inverse iteration and shift-invert Arnoldi method Melina Freitag Department of Mathematical Sciences University of Bath CSC Seminar Max-Planck-Institute for Dynamics of Complex Technical
More informationHot-Starting NLP Solvers
Hot-Starting NLP Solvers Andreas Wächter Department of Industrial Engineering and Management Sciences Northwestern University waechter@iems.northwestern.edu 204 Mixed Integer Programming Workshop Ohio
More informationAccelerating Linear Algebra on Heterogeneous Architectures of Multicore and GPUs using MAGMA and DPLASMA and StarPU Schedulers
UT College of Engineering Tutorial Accelerating Linear Algebra on Heterogeneous Architectures of Multicore and GPUs using MAGMA and DPLASMA and StarPU Schedulers Stan Tomov 1, George Bosilca 1, and Cédric
More informationModel Reduction for Linear Dynamical Systems
Summer School on Numerical Linear Algebra for Dynamical and High-Dimensional Problems Trogir, October 10 15, 2011 Model Reduction for Linear Dynamical Systems Peter Benner Max Planck Institute for Dynamics
More informationMultilevel Low-Rank Preconditioners Yousef Saad Department of Computer Science and Engineering University of Minnesota. Modelling 2014 June 2,2014
Multilevel Low-Rank Preconditioners Yousef Saad Department of Computer Science and Engineering University of Minnesota Modelling 24 June 2,24 Dedicated to Owe Axelsson at the occasion of his 8th birthday
More informationA connection between time domain model order reduction and moment matching for LTI systems
Mathematical and Computer Modelling of Dynamical Systems Methods, Tools and Applications in Engineering and Related Sciences ISSN: 1387-3954 (Print) 1744-551 (Online) Journal homepage: http://www.tandfonline.com/loi/nmcm
More informationIncomplete LU Preconditioning and Error Compensation Strategies for Sparse Matrices
Incomplete LU Preconditioning and Error Compensation Strategies for Sparse Matrices Eun-Joo Lee Department of Computer Science, East Stroudsburg University of Pennsylvania, 327 Science and Technology Center,
More informationLinear Solvers. Andrew Hazel
Linear Solvers Andrew Hazel Introduction Thus far we have talked about the formulation and discretisation of physical problems...... and stopped when we got to a discrete linear system of equations. Introduction
More informationSolving PDEs with CUDA Jonathan Cohen
Solving PDEs with CUDA Jonathan Cohen jocohen@nvidia.com NVIDIA Research PDEs (Partial Differential Equations) Big topic Some common strategies Focus on one type of PDE in this talk Poisson Equation Linear
More informationAMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 23: GMRES and Other Krylov Subspace Methods; Preconditioning
AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 23: GMRES and Other Krylov Subspace Methods; Preconditioning Xiangmin Jiao SUNY Stony Brook Xiangmin Jiao Numerical Analysis I 1 / 18 Outline
More informationAMS Mathematics Subject Classification : 65F10,65F50. Key words and phrases: ILUS factorization, preconditioning, Schur complement, 1.
J. Appl. Math. & Computing Vol. 15(2004), No. 1, pp. 299-312 BILUS: A BLOCK VERSION OF ILUS FACTORIZATION DAVOD KHOJASTEH SALKUYEH AND FAEZEH TOUTOUNIAN Abstract. ILUS factorization has many desirable
More informationModel Reduction for Unstable Systems
Model Reduction for Unstable Systems Klajdi Sinani Virginia Tech klajdi@vt.edu Advisor: Serkan Gugercin October 22, 2015 (VT) SIAM October 22, 2015 1 / 26 Overview 1 Introduction 2 Interpolatory Model
More informationIDR(s) A family of simple and fast algorithms for solving large nonsymmetric systems of linear equations
IDR(s) A family of simple and fast algorithms for solving large nonsymmetric systems of linear equations Harrachov 2007 Peter Sonneveld en Martin van Gijzen August 24, 2007 1 Delft University of Technology
More informationStructure preserving Krylov-subspace methods for Lyapunov equations
Structure preserving Krylov-subspace methods for Lyapunov equations Matthias Bollhöfer, André Eppler Institute Computational Mathematics TU Braunschweig MoRePas Workshop, Münster September 17, 2009 System
More informationLinear and Nonlinear Matrix Equations Arising in Model Reduction
International Conference on Numerical Linear Algebra and its Applications Guwahati, January 15 18, 2013 Linear and Nonlinear Matrix Equations Arising in Model Reduction Peter Benner Tobias Breiten Max
More informationOn the numerical solution of large-scale linear matrix equations. V. Simoncini
On the numerical solution of large-scale linear matrix equations V. Simoncini Dipartimento di Matematica, Università di Bologna (Italy) valeria.simoncini@unibo.it 1 Some matrix equations Sylvester matrix
More informationFine-grained Parallel Incomplete LU Factorization
Fine-grained Parallel Incomplete LU Factorization Edmond Chow School of Computational Science and Engineering Georgia Institute of Technology Sparse Days Meeting at CERFACS June 5-6, 2014 Contribution
More informationFINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION
FINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION EDMOND CHOW AND AFTAB PATEL Abstract. This paper presents a new fine-grained parallel algorithm for computing an incomplete LU factorization. All nonzeros
More informationSolving Large Nonlinear Sparse Systems
Solving Large Nonlinear Sparse Systems Fred W. Wubs and Jonas Thies Computational Mechanics & Numerical Mathematics University of Groningen, the Netherlands f.w.wubs@rug.nl Centre for Interdisciplinary
More informationThe new challenges to Krylov subspace methods Yousef Saad Department of Computer Science and Engineering University of Minnesota
The new challenges to Krylov subspace methods Yousef Saad Department of Computer Science and Engineering University of Minnesota SIAM Applied Linear Algebra Valencia, June 18-22, 2012 Introduction Krylov
More informationOptimal Approximation of Dynamical Systems with Rational Krylov Methods. Department of Mathematics, Virginia Tech, USA
Optimal Approximation of Dynamical Systems with Rational Krylov Methods Serkan Güǧercin Department of Mathematics, Virginia Tech, USA Chris Beattie and Virginia Tech. jointly with Thanos Antoulas Rice
More informationBlock Bidiagonal Decomposition and Least Squares Problems
Block Bidiagonal Decomposition and Least Squares Problems Åke Björck Department of Mathematics Linköping University Perspectives in Numerical Analysis, Helsinki, May 27 29, 2008 Outline Bidiagonal Decomposition
More informationIterative Methods for Sparse Linear Systems
Iterative Methods for Sparse Linear Systems Luca Bergamaschi e-mail: berga@dmsa.unipd.it - http://www.dmsa.unipd.it/ berga Department of Mathematical Methods and Models for Scientific Applications University
More informationChapter 7 Iterative Techniques in Matrix Algebra
Chapter 7 Iterative Techniques in Matrix Algebra Per-Olof Persson persson@berkeley.edu Department of Mathematics University of California, Berkeley Math 128B Numerical Analysis Vector Norms Definition
More informationKrylov Space Solvers
Seminar for Applied Mathematics ETH Zurich International Symposium on Frontiers of Computational Science Nagoya, 12/13 Dec. 2005 Sparse Matrices Large sparse linear systems of equations or large sparse
More informationTR A Comparison of the Performance of SaP::GPU and Intel s Math Kernel Library (MKL) for Solving Dense Banded Linear Systems
TR-0-07 A Comparison of the Performance of ::GPU and Intel s Math Kernel Library (MKL) for Solving Dense Banded Linear Systems Ang Li, Omkar Deshmukh, Radu Serban, Dan Negrut May, 0 Abstract ::GPU is a
More informationFINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION
FINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION EDMOND CHOW AND AFTAB PATEL Abstract. This paper presents a new fine-grained parallel algorithm for computing an incomplete LU factorization. All nonzeros
More information1 Conjugate gradients
Notes for 2016-11-18 1 Conjugate gradients We now turn to the method of conjugate gradients (CG), perhaps the best known of the Krylov subspace solvers. The CG iteration can be characterized as the iteration
More informationOrder reduction numerical methods for the algebraic Riccati equation. V. Simoncini
Order reduction numerical methods for the algebraic Riccati equation V. Simoncini Dipartimento di Matematica Alma Mater Studiorum - Università di Bologna valeria.simoncini@unibo.it 1 The problem Find X
More informationIterative Methods for Solving A x = b
Iterative Methods for Solving A x = b A good (free) online source for iterative methods for solving A x = b is given in the description of a set of iterative solvers called templates found at netlib: http
More informationDownloaded 07/31/12 to Redistribution subject to SIAM license or copyright; see
SIAM J SCI COMPUT Vol 34, No 4, pp A1925 A1949 c 2012 Society for Industrial and Applied Mathematics RECYCLING BICG WITH AN APPLICATION TO MODEL REDUCTION KAPIL AHUJA, ERIC DE STURLER, SERKAN GUGERCIN,
More informationM.A. Botchev. September 5, 2014
Rome-Moscow school of Matrix Methods and Applied Linear Algebra 2014 A short introduction to Krylov subspaces for linear systems, matrix functions and inexact Newton methods. Plan and exercises. M.A. Botchev
More informationEIGIFP: A MATLAB Program for Solving Large Symmetric Generalized Eigenvalue Problems
EIGIFP: A MATLAB Program for Solving Large Symmetric Generalized Eigenvalue Problems JAMES H. MONEY and QIANG YE UNIVERSITY OF KENTUCKY eigifp is a MATLAB program for computing a few extreme eigenvalues
More informationFast Frequency Response Analysis using Model Order Reduction
Fast Frequency Response Analysis using Model Order Reduction Peter Benner EU MORNET Exploratory Workshop Applications of Model Order Reduction Methods in Industrial Research and Development Luxembourg,
More informationLab 1: Iterative Methods for Solving Linear Systems
Lab 1: Iterative Methods for Solving Linear Systems January 22, 2017 Introduction Many real world applications require the solution to very large and sparse linear systems where direct methods such as
More informationarxiv: v1 [hep-lat] 19 Jul 2009
arxiv:0907.3261v1 [hep-lat] 19 Jul 2009 Application of preconditioned block BiCGGR to the Wilson-Dirac equation with multiple right-hand sides in lattice QCD Abstract H. Tadano a,b, Y. Kuramashi c,b, T.
More informationModel Reduction for Dynamical Systems
Otto-von-Guericke Universität Magdeburg Faculty of Mathematics Summer term 2015 Model Reduction for Dynamical Systems Lecture 8 Peter Benner Lihong Feng Max Planck Institute for Dynamics of Complex Technical
More informationHigh Performance Nonlinear Solvers
What is a nonlinear system? High Performance Nonlinear Solvers Michael McCourt Division Argonne National Laboratory IIT Meshfree Seminar September 19, 2011 Every nonlinear system of equations can be described
More informationFEM and sparse linear system solving
FEM & sparse linear system solving, Lecture 9, Nov 19, 2017 1/36 Lecture 9, Nov 17, 2017: Krylov space methods http://people.inf.ethz.ch/arbenz/fem17 Peter Arbenz Computer Science Department, ETH Zürich
More informationThe rational Krylov subspace for parameter dependent systems. V. Simoncini
The rational Krylov subspace for parameter dependent systems V. Simoncini Dipartimento di Matematica, Università di Bologna valeria.simoncini@unibo.it 1 Given the continuous-time system Motivation. Model
More informationAn Asynchronous Algorithm on NetSolve Global Computing System
An Asynchronous Algorithm on NetSolve Global Computing System Nahid Emad S. A. Shahzadeh Fazeli Jack Dongarra March 30, 2004 Abstract The Explicitly Restarted Arnoldi Method (ERAM) allows to find a few
More informationADI-preconditioned FGMRES for solving large generalized Lyapunov equations - A case study
-preconditioned for large - A case study Matthias Bollhöfer, André Eppler TU Braunschweig Institute Computational Mathematics Syrene-MOR Workshop, TU Hamburg October 30, 2008 2 / 20 Outline 1 2 Overview
More informationOn the Preconditioning of the Block Tridiagonal Linear System of Equations
On the Preconditioning of the Block Tridiagonal Linear System of Equations Davod Khojasteh Salkuyeh Department of Mathematics, University of Mohaghegh Ardabili, PO Box 179, Ardabil, Iran E-mail: khojaste@umaacir
More informationFinite-choice algorithm optimization in Conjugate Gradients
Finite-choice algorithm optimization in Conjugate Gradients Jack Dongarra and Victor Eijkhout January 2003 Abstract We present computational aspects of mathematically equivalent implementations of the
More informationarxiv: v1 [hep-lat] 2 May 2012
A CG Method for Multiple Right Hand Sides and Multiple Shifts in Lattice QCD Calculations arxiv:1205.0359v1 [hep-lat] 2 May 2012 Fachbereich C, Mathematik und Naturwissenschaften, Bergische Universität
More informationSolving Ax = b, an overview. Program
Numerical Linear Algebra Improving iterative solvers: preconditioning, deflation, numerical software and parallelisation Gerard Sleijpen and Martin van Gijzen November 29, 27 Solving Ax = b, an overview
More informationBalanced Truncation Model Reduction of Large and Sparse Generalized Linear Systems
Balanced Truncation Model Reduction of Large and Sparse Generalized Linear Systems Jos M. Badía 1, Peter Benner 2, Rafael Mayo 1, Enrique S. Quintana-Ortí 1, Gregorio Quintana-Ortí 1, A. Remón 1 1 Depto.
More informationTwo-sided Eigenvalue Algorithms for Modal Approximation
Two-sided Eigenvalue Algorithms for Modal Approximation Master s thesis submitted to Faculty of Mathematics at Chemnitz University of Technology presented by: Supervisor: Advisor: B.sc. Patrick Kürschner
More informationMultigrid absolute value preconditioning
Multigrid absolute value preconditioning Eugene Vecharynski 1 Andrew Knyazev 2 (speaker) 1 Department of Computer Science and Engineering University of Minnesota 2 Department of Mathematical and Statistical
More informationc 2015 Society for Industrial and Applied Mathematics
SIAM J. SCI. COMPUT. Vol. 37, No. 2, pp. C169 C193 c 2015 Society for Industrial and Applied Mathematics FINE-GRAINED PARALLEL INCOMPLETE LU FACTORIZATION EDMOND CHOW AND AFTAB PATEL Abstract. This paper
More informationThe flexible incomplete LU preconditioner for large nonsymmetric linear systems. Takatoshi Nakamura Takashi Nodera
Research Report KSTS/RR-15/006 The flexible incomplete LU preconditioner for large nonsymmetric linear systems by Takatoshi Nakamura Takashi Nodera Takatoshi Nakamura School of Fundamental Science and
More informationAMS526: Numerical Analysis I (Numerical Linear Algebra)
AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 23: GMRES and Other Krylov Subspace Methods Xiangmin Jiao SUNY Stony Brook Xiangmin Jiao Numerical Analysis I 1 / 9 Minimizing Residual CG
More informationRobust Preconditioned Conjugate Gradient for the GPU and Parallel Implementations
Robust Preconditioned Conjugate Gradient for the GPU and Parallel Implementations Rohit Gupta, Martin van Gijzen, Kees Vuik GPU Technology Conference 2012, San Jose CA. GPU Technology Conference 2012,
More informationDepartment of Computer Science, University of Illinois at Urbana-Champaign
Department of Computer Science, University of Illinois at Urbana-Champaign Probing for Schur Complements and Preconditioning Generalized Saddle-Point Problems Eric de Sturler, sturler@cs.uiuc.edu, http://www-faculty.cs.uiuc.edu/~sturler
More informationEigenvalue problems III: Advanced Numerical Methods
Eigenvalue problems III: Advanced Numerical Methods Sam Sinayoko Computational Methods 10 Contents 1 Learning Outcomes 2 2 Introduction 2 3 Inverse Power method: finding the smallest eigenvalue of a symmetric
More informationJacobi-Davidson Eigensolver in Cusolver Library. Lung-Sheng Chien, NVIDIA
Jacobi-Davidson Eigensolver in Cusolver Library Lung-Sheng Chien, NVIDIA lchien@nvidia.com Outline CuSolver library - cusolverdn: dense LAPACK - cusolversp: sparse LAPACK - cusolverrf: refactorization
More informationInexact Solves in Interpolatory Model Reduction
Inexact Solves in Interpolatory Model Reduction Sarah Wyatt Thesis submitted to the Faculty of the Virginia Polytechnic Institute and State University in partial fulfillment of the requirements for the
More informationPreconditioned multishift BiCG for H 2 -optimal model reduction. Mian Ilyas Ahmad, Daniel B. Szyld, and Martin B. van Gijzen
Preconditioned multishift BiCG for H 2 -optimal model reduction Mian Ilyas Ahmad, Daniel B. Szyld, and Martin B. van Gijzen Report 12-06-15 Department of Mathematics Temple University June 2012. Revised
More informationKey words. conjugate gradients, normwise backward error, incremental norm estimation.
Proceedings of ALGORITMY 2016 pp. 323 332 ON ERROR ESTIMATION IN THE CONJUGATE GRADIENT METHOD: NORMWISE BACKWARD ERROR PETR TICHÝ Abstract. Using an idea of Duff and Vömel [BIT, 42 (2002), pp. 300 322
More informationKrylov-Subspace Based Model Reduction of Nonlinear Circuit Models Using Bilinear and Quadratic-Linear Approximations
Krylov-Subspace Based Model Reduction of Nonlinear Circuit Models Using Bilinear and Quadratic-Linear Approximations Peter Benner and Tobias Breiten Abstract We discuss Krylov-subspace based model reduction
More informationParallel programming practices for the solution of Sparse Linear Systems (motivated by computational physics and graphics)
Parallel programming practices for the solution of Sparse Linear Systems (motivated by computational physics and graphics) Eftychios Sifakis CS758 Guest Lecture - 19 Sept 2012 Introduction Linear systems
More informationReview: From problem to parallel algorithm
Review: From problem to parallel algorithm Mathematical formulations of interesting problems abound Poisson s equation Sources: Electrostatics, gravity, fluid flow, image processing (!) Numerical solution:
More informationIterative Methods and Multigrid
Iterative Methods and Multigrid Part 3: Preconditioning 2 Eric de Sturler Preconditioning The general idea behind preconditioning is that convergence of some method for the linear system Ax = b can be
More informationIterative Rational Krylov Algorithm for Unstable Dynamical Systems and Generalized Coprime Factorizations
Iterative Rational Krylov Algorithm for Unstable Dynamical Systems and Generalized Coprime Factorizations Klajdi Sinani Thesis submitted to the Faculty of the Virginia Polytechnic Institute and State University
More informationEnergy Consumption Evaluation for Krylov Methods on a Cluster of GPU Accelerators
PARIS-SACLAY, FRANCE Energy Consumption Evaluation for Krylov Methods on a Cluster of GPU Accelerators Serge G. Petiton a and Langshi Chen b April the 6 th, 2016 a Université de Lille, Sciences et Technologies
More informationPreconditioners for the incompressible Navier Stokes equations
Preconditioners for the incompressible Navier Stokes equations C. Vuik M. ur Rehman A. Segal Delft Institute of Applied Mathematics, TU Delft, The Netherlands SIAM Conference on Computational Science and
More informationPreface to the Second Edition. Preface to the First Edition
n page v Preface to the Second Edition Preface to the First Edition xiii xvii 1 Background in Linear Algebra 1 1.1 Matrices................................. 1 1.2 Square Matrices and Eigenvalues....................
More informationIncomplete Cholesky preconditioners that exploit the low-rank property
anapov@ulb.ac.be ; http://homepages.ulb.ac.be/ anapov/ 1 / 35 Incomplete Cholesky preconditioners that exploit the low-rank property (theory and practice) Artem Napov Service de Métrologie Nucléaire, Université
More informationMultilevel low-rank approximation preconditioners Yousef Saad Department of Computer Science and Engineering University of Minnesota
Multilevel low-rank approximation preconditioners Yousef Saad Department of Computer Science and Engineering University of Minnesota SIAM CSE Boston - March 1, 2013 First: Joint work with Ruipeng Li Work
More informationTopics. The CG Algorithm Algorithmic Options CG s Two Main Convergence Theorems
Topics The CG Algorithm Algorithmic Options CG s Two Main Convergence Theorems What about non-spd systems? Methods requiring small history Methods requiring large history Summary of solvers 1 / 52 Conjugate
More informationA POD Projection Method for Large-Scale Algebraic Riccati Equations
A POD Projection Method for Large-Scale Algebraic Riccati Equations Boris Kramer Department of Mathematics Virginia Tech Blacksburg, VA 24061-0123 Email: bokr@vt.edu John R. Singler Department of Mathematics
More informationAlgebraic Multigrid as Solvers and as Preconditioner
Ò Algebraic Multigrid as Solvers and as Preconditioner Domenico Lahaye domenico.lahaye@cs.kuleuven.ac.be http://www.cs.kuleuven.ac.be/ domenico/ Department of Computer Science Katholieke Universiteit Leuven
More information