FAS and Solver Performance

Size: px
Start display at page:

Download "FAS and Solver Performance"

Transcription

1 FAS and Solver Performance Matthew Knepley Mathematics and Computer Science Division Argonne National Laboratory Fall AMS Central Section Meeting Chicago, IL Oct 05 06, 2007 M. Knepley (ANL) FAS AMS 07 1 / 27

2 Payoff Why should I care? 1 Optimal multilevel solvers are necessary 2 Processor flops are increasing much faster than bandwidth 3 Nonlinear algorithms can be efficient than linear algorithms 4 Presents an opportunity for numerical algebraic geometry M. Knepley (ANL) FAS AMS 07 2 / 27

3 Payoff Why should I care? 1 Optimal multilevel solvers are necessary 2 Processor flops are increasing much faster than bandwidth 3 Nonlinear algorithms can be efficient than linear algorithms 4 Presents an opportunity for numerical algebraic geometry M. Knepley (ANL) FAS AMS 07 2 / 27

4 Payoff Why should I care? 1 Optimal multilevel solvers are necessary 2 Processor flops are increasing much faster than bandwidth 3 Nonlinear algorithms can be efficient than linear algorithms 4 Presents an opportunity for numerical algebraic geometry M. Knepley (ANL) FAS AMS 07 2 / 27

5 Payoff Why should I care? 1 Optimal multilevel solvers are necessary 2 Processor flops are increasing much faster than bandwidth 3 Nonlinear algorithms can be efficient than linear algorithms 4 Presents an opportunity for numerical algebraic geometry M. Knepley (ANL) FAS AMS 07 2 / 27

6 Outline Simulation Basics 1 Simulation Basics 2 Newton-Multigrid 3 Machine Performance 4 FAS and Multigrid-Newton 5 Possible Extensions M. Knepley (ANL) FAS AMS 07 3 / 27

7 Simulation Basics Necessity Of Simulation Experiment are... Expensive Difficult Impossible Dangerous M. Knepley (ANL) FAS AMS 07 4 / 27

8 Simulation Basics Why Optimal Algorithms? The more powerful the computer, the greater the importance of optimality Example: Suppose Alg 1 solves a problem in time CN 2, N is the input size Suppose Alg 2 solves the same problem in time CN Suppose Alg 1 and Alg 2 are able to use 10,000 processors In constant time compared to serial, Alg1 can run a problem 100X larger Alg2 can run a problem 10,000X larger Alternatively, filling the machine s memory, Alg1 requires 100X time Alg2 runs in constant time M. Knepley (ANL) FAS AMS 07 5 / 27

9 Outline Newton-Multigrid 1 Simulation Basics 2 Newton-Multigrid 3 Machine Performance 4 FAS and Multigrid-Newton 5 Possible Extensions M. Knepley (ANL) FAS AMS 07 6 / 27

10 What Is Optimal? Newton-Multigrid I will define optimal as an O(N) solution algorithm These are generally hierarchical, so we need hierarchy generation assembly on subdomains restriction and prolongation M. Knepley (ANL) FAS AMS 07 7 / 27

11 Newton-Multigrid The Bratu Problem u + λe u = f in Ω (1) u = g on Ω (2) Also called the Solid-Fuel Ignition equation Can be treated as a nonlinear eigenvalue problem Has two solution branches until λ = 6.28 M. Knepley (ANL) FAS AMS 07 8 / 27

12 Newton s Method Newton-Multigrid 0 = F(u + δu) = F(u) + J(u)δu (3) so that u + δu = u J(u) 1 F(u) (4) Quadratic convergence J can be solved approximately (Dembo-Eisensat-Steihaug) M. Knepley (ANL) FAS AMS 07 9 / 27

13 Linear Multigrid Newton-Multigrid Smoothing (typically Gauss-Seidel) x new = S(x old, b) (5) Coarse-grid Correction J c δx c = R(b Jx old ) (6) x new = x old + R T δx c (7) M. Knepley (ANL) FAS AMS / 27

14 Newton-Multigrid Linear Convergence Convergence to r < 10 9 b using GMRES(30)/ILU Elements Iterations M. Knepley (ANL) FAS AMS / 27

15 Newton-Multigrid Linear Convergence Convergence to r < 10 9 b using GMRES(30)/MG Elements Iterations M. Knepley (ANL) FAS AMS / 27

16 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27

17 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27

18 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27

19 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27

20 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27

21 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27

22 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27

23 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27

24 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27

25 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27

26 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27

27 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27

28 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27

29 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27

30 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27

31 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27

32 Outline Machine Performance 1 Simulation Basics 2 Newton-Multigrid 3 Machine Performance 4 FAS and Multigrid-Newton 5 Possible Extensions M. Knepley (ANL) FAS AMS / 27

33 Machine Performance STREAM Benchmark Simple benchmark program measuring sustainable memory bandwidth Protoypical operation is Triad (WAXPY): w = y + αx Measures the memory bandwidth bottleneck (much below peak) Datasets outstrip cache Machine Peak (MF/s) Triad (MB/s) MF/MW Eq. MF/s Matt s Laptop (5.5%) Intel Core2 Quad (1.2%) Tesla 1060C * (0.8%) Table: Bandwidth limited machine performance M. Knepley (ANL) FAS AMS / 27

34 Machine Performance Analysis of Sparse Matvec (SpMV) Assumptions No cache misses No waits on memory references Notation m Number of matrix rows nz Number of nonzero matrix elements V Number of vectors to multiply We can look at bandwidth needed for peak performance ( V ) m nz + 6 V or achieveable performance given a bandwith BW byte/flop (8) Vnz BW Mflop/s (9) (8V + 2)m + 6nz Towards Realistic Performance Bounds for Implicit CFD Codes, Gropp, Kaushik, Keyes, and Smith. M. Knepley (ANL) FAS AMS / 27

35 Machine Performance Improving Serial Performance For a single matvec with 3D FD Poisson, Matt s laptop can achieve at most 1 (8 + 2) 1 bytes/flop( MB/s) = 151 MFlops/s, (10) which is a dismal 8.8% of peak. Can improve performance by Blocking Multiple vectors but operation issue limitations take over. M. Knepley (ANL) FAS AMS / 27

36 Machine Performance Improving Serial Performance For a single matvec with 3D FD Poisson, Matt s laptop can achieve at most 1 (8 + 2) 1 bytes/flop( MB/s) = 151 MFlops/s, (10) which is a dismal 8.8% of peak. Better approaches: Unassembled operator application (Spectral elements, FMM) N data, N 2 computation Nonlinear evaluation (Picard, FAS, Exact Polynomial Solvers) N data, N k computation M. Knepley (ANL) FAS AMS / 27

37 Outline FAS and Multigrid-Newton 1 Simulation Basics 2 Newton-Multigrid 3 Machine Performance 4 FAS and Multigrid-Newton 5 Possible Extensions M. Knepley (ANL) FAS AMS / 27

38 FAS and Multigrid-Newton Matrix-Free Smoothing 7/23/15, 8:50 AM = We can use point Jacobi x new i = x old i + J 1 ii (b i J T i x old ) (11) In the nonlinear case, J T i x old = e T i F (x)x old (12) which might be calculated automatically using AD. M. Knepley (ANL) FAS AMS / 27

39 FAS and Multigrid-Newton Nonlinear Gauss-Seidel If we have an initial guess u, b = 0 F(u), xi new = xi old + J 1 ii (b i Ji T x old ) (13) xi new = xi old + J 1 ii ( F i (u) F i (u) T x old ) (14) xi new = xi old J 1 ii (F i (u) + F i (u) T x old ) (15) xi new = xi old J 1 ii F i (u + x old ) (16) u new i = u old i J 1 ii F i (u old ) (17) This is just Newton s method on a single equation at a time... which is Nonlinear Gauss-Seidel. M. Knepley (ANL) FAS AMS / 27

40 FAS and Multigrid-Newton Nonlinear Gauss-Seidel If we have an initial guess u, b = 0 F(u), xi new = xi old + J 1 ii (b i Ji T x old ) (13) xi new = xi old + J 1 ii ( F i (u) F i (u) T x old ) (14) xi new = xi old J 1 ii (F i (u) + F i (u) T x old ) (15) xi new = xi old J 1 ii F i (u + x old ) (16) u new i = u old i J 1 ii F i (u old ) (17) This is just Newton s method on a single equation at a time... which is Nonlinear Gauss-Seidel. M. Knepley (ANL) FAS AMS / 27

41 FAS and Multigrid-Newton Nonlinear Gauss-Seidel If we have an initial guess u, b = 0 F(u), xi new = xi old + J 1 ii (b i Ji T x old ) (13) xi new = xi old + J 1 ii ( F i (u) F i (u) T x old ) (14) xi new = xi old J 1 ii (F i (u) + F i (u) T x old ) (15) xi new = xi old J 1 ii F i (u + x old ) (16) u new i = u old i J 1 ii F i (u old ) (17) This is just Newton s method on a single equation at a time... which is Nonlinear Gauss-Seidel. M. Knepley (ANL) FAS AMS / 27

42 FAS and Multigrid-Newton Nonlinear Multigrid Most authors just offer an ansatz with nonlinear smoothing x new = S(x old, b) (18) and coarse-grid correction F c (x c ) = F c ( x c ) + γr(b F(x old )) (19) x new = x old + 1 γ RT (x c x c ) (20) where x is an approximate solution. If F is a linear operator L, the correction reduces to L c (x c ) = L c ( x c ) + γr(b L(x old )) (21) L c (x c x c ) = γr(b L(x old )) (22) L c δx c = γrr (23) M. Knepley (ANL) FAS AMS / 27

43 FAS and Multigrid-Newton Nonlinear Multigrid Most authors just offer an ansatz with nonlinear smoothing and coarse-grid correction x new = S(x old, b) (18) F c (x c ) = F c ( x c ) + γr(b F(x old )) (19) x new = x old + 1 γ RT (x c x c ) (20) where x is an approximate solution. and the update becomes x new = x old + 1 γ RT δx c (21) x new = x old + R T ˆL 1 c Rr (22) M. Knepley (ANL) FAS AMS / 27

44 FAS and Multigrid-Newton Nonlinear Multigrid It is instructive to look at the alternate derivation of Barry Smith Begin with the nonlinear generalization F(u) = 0, for a correction and then using Taylor series we have the correction J c x c = R(b Jx old ) (23) J c x c = R(F(u) + Jx old ) (24) F(u old ) = F(u) + J(u old u) +... (25) F c (uc old + x c ) = F c (uc old ) + J c x c +... (26) F c (u old c + x c ) F c (uc old ) = RF(u old ) (27) F c (uc old + x c ) = F c (uc old ) RF(u old ) (28) M. Knepley (ANL) FAS AMS / 27

45 FAS and Multigrid-Newton Nonlinear Multigrid It is instructive to look at the alternate derivation of Barry Smith Begin with the nonlinear generalization F(u) = 0, for a correction and then using Taylor series and the same update J c x c = R(b Jx old ) (23) J c x c = R(F(u) + Jx old ) (24) F(u old ) = F(u) + J(u old u) +... (25) F c (uc old + x c ) = F c (uc old ) + J c x c +... (26) x new = x old + R T x c (27) M. Knepley (ANL) FAS AMS / 27

46 FAS and Multigrid-Newton Spectrum of Methods Newton-Multigrid 7/23/15, 8:51 AM FAS When does linearization happen? Which Jacobian entries are updated? M. Knepley (ANL) FAS AMS / 27

47 FAS and Multigrid-Newton Nonlinear Convergence Convergence to r < 10 9 r 0 using Newton/GMRES(30)/ILU Elements Iterations M. Knepley (ANL) FAS AMS / 27

48 FAS and Multigrid-Newton Nonlinear Convergence M. Knepley (ANL) FAS AMS / 27

49 Outline Possible Extensions 1 Simulation Basics 2 Newton-Multigrid 3 Machine Performance 4 FAS and Multigrid-Newton 5 Possible Extensions M. Knepley (ANL) FAS AMS / 27

50 Possible Extensions Polynomial Solvers A great opportunity exists for polynomial solvers Better performance Bandwidth considerations only intensify on multicore chips Petascale systems will need these improvements More robust Most practical engineering calculations are quadratic New algorithms Can multiple solutions speed up convergence? M. Knepley (ANL) FAS AMS / 27

51 Conclusions Possible Extensions Newton-Multigrid provides Good nonlinear solves Simple interface for software libraries Low computational efficiency Multigrid-FAS provides Good nonlinear solves Lower memory bandwidth and storage Potentially high computational efficiency Requires formation on small systems on the fly M. Knepley (ANL) FAS AMS / 27

52 Possible Extensions PETSc Resources Can download tarballs or clone a Mercurial repository Hyperlinked documentation Manual Manual pages for evey method HTML of all example code (linked to manual pages) FAQ Full support at petsc-maint@mcs.anl.gov M. Knepley (ANL) FAS AMS / 27

ERLANGEN REGIONAL COMPUTING CENTER

ERLANGEN REGIONAL COMPUTING CENTER ERLANGEN REGIONAL COMPUTING CENTER Making Sense of Performance Numbers Georg Hager Erlangen Regional Computing Center (RRZE) Friedrich-Alexander-Universität Erlangen-Nürnberg OpenMPCon 2018 Barcelona,

More information

Kasetsart University Workshop. Multigrid methods: An introduction

Kasetsart University Workshop. Multigrid methods: An introduction Kasetsart University Workshop Multigrid methods: An introduction Dr. Anand Pardhanani Mathematics Department Earlham College Richmond, Indiana USA pardhan@earlham.edu A copy of these slides is available

More information

Lecture 17: Iterative Methods and Sparse Linear Algebra

Lecture 17: Iterative Methods and Sparse Linear Algebra Lecture 17: Iterative Methods and Sparse Linear Algebra David Bindel 25 Mar 2014 Logistics HW 3 extended to Wednesday after break HW 4 should come out Monday after break Still need project description

More information

Indefinite and physics-based preconditioning

Indefinite and physics-based preconditioning Indefinite and physics-based preconditioning Jed Brown VAW, ETH Zürich 2009-01-29 Newton iteration Standard form of a nonlinear system F (u) 0 Iteration Solve: Update: J(ũ)u F (ũ) ũ + ũ + u Example (p-bratu)

More information

High Performance Nonlinear Solvers

High Performance Nonlinear Solvers What is a nonlinear system? High Performance Nonlinear Solvers Michael McCourt Division Argonne National Laboratory IIT Meshfree Seminar September 19, 2011 Every nonlinear system of equations can be described

More information

An Accelerated Block-Parallel Newton Method via Overlapped Partitioning

An Accelerated Block-Parallel Newton Method via Overlapped Partitioning An Accelerated Block-Parallel Newton Method via Overlapped Partitioning Yurong Chen Lab. of Parallel Computing, Institute of Software, CAS (http://www.rdcps.ac.cn/~ychen/english.htm) Summary. This paper

More information

Numerical Methods I Solving Nonlinear Equations

Numerical Methods I Solving Nonlinear Equations Numerical Methods I Solving Nonlinear Equations Aleksandar Donev Courant Institute, NYU 1 donev@courant.nyu.edu 1 MATH-GA 2011.003 / CSCI-GA 2945.003, Fall 2014 October 16th, 2014 A. Donev (Courant Institute)

More information

Minisymposia 9 and 34: Avoiding Communication in Linear Algebra. Jim Demmel UC Berkeley bebop.cs.berkeley.edu

Minisymposia 9 and 34: Avoiding Communication in Linear Algebra. Jim Demmel UC Berkeley bebop.cs.berkeley.edu Minisymposia 9 and 34: Avoiding Communication in Linear Algebra Jim Demmel UC Berkeley bebop.cs.berkeley.edu Motivation (1) Increasing parallelism to exploit From Top500 to multicores in your laptop Exponentially

More information

Numerical Programming I (for CSE)

Numerical Programming I (for CSE) Technische Universität München WT 1/13 Fakultät für Mathematik Prof. Dr. M. Mehl B. Gatzhammer January 1, 13 Numerical Programming I (for CSE) Tutorial 1: Iterative Methods 1) Relaxation Methods a) Let

More information

Linear Solvers. Andrew Hazel

Linear Solvers. Andrew Hazel Linear Solvers Andrew Hazel Introduction Thus far we have talked about the formulation and discretisation of physical problems...... and stopped when we got to a discrete linear system of equations. Introduction

More information

MULTIGRID METHODS FOR NONLINEAR PROBLEMS: AN OVERVIEW

MULTIGRID METHODS FOR NONLINEAR PROBLEMS: AN OVERVIEW MULTIGRID METHODS FOR NONLINEAR PROBLEMS: AN OVERVIEW VAN EMDEN HENSON CENTER FOR APPLIED SCIENTIFIC COMPUTING LAWRENCE LIVERMORE NATIONAL LABORATORY Abstract Since their early application to elliptic

More information

AIMS Exercise Set # 1

AIMS Exercise Set # 1 AIMS Exercise Set #. Determine the form of the single precision floating point arithmetic used in the computers at AIMS. What is the largest number that can be accurately represented? What is the smallest

More information

A Domain Decomposition Based Jacobi-Davidson Algorithm for Quantum Dot Simulation

A Domain Decomposition Based Jacobi-Davidson Algorithm for Quantum Dot Simulation A Domain Decomposition Based Jacobi-Davidson Algorithm for Quantum Dot Simulation Tao Zhao 1, Feng-Nan Hwang 2 and Xiao-Chuan Cai 3 Abstract In this paper, we develop an overlapping domain decomposition

More information

Multigrid Methods and their application in CFD

Multigrid Methods and their application in CFD Multigrid Methods and their application in CFD Michael Wurst TU München 16.06.2009 1 Multigrid Methods Definition Multigrid (MG) methods in numerical analysis are a group of algorithms for solving differential

More information

Partial Differential Equations

Partial Differential Equations Partial Differential Equations Introduction Deng Li Discretization Methods Chunfang Chen, Danny Thorne, Adam Zornes CS521 Feb.,7, 2006 What do You Stand For? A PDE is a Partial Differential Equation This

More information

6. Iterative Methods for Linear Systems. The stepwise approach to the solution...

6. Iterative Methods for Linear Systems. The stepwise approach to the solution... 6 Iterative Methods for Linear Systems The stepwise approach to the solution Miriam Mehl: 6 Iterative Methods for Linear Systems The stepwise approach to the solution, January 18, 2013 1 61 Large Sparse

More information

Newton additive and multiplicative Schwarz iterative methods

Newton additive and multiplicative Schwarz iterative methods IMA Journal of Numerical Analysis (2008) 28, 143 161 doi:10.1093/imanum/drm015 Advance Access publication on July 16, 2007 Newton additive and multiplicative Schwarz iterative methods JOSEP ARNAL, VIOLETA

More information

AMS526: Numerical Analysis I (Numerical Linear Algebra for Computational and Data Sciences)

AMS526: Numerical Analysis I (Numerical Linear Algebra for Computational and Data Sciences) AMS526: Numerical Analysis I (Numerical Linear Algebra for Computational and Data Sciences) Lecture 19: Computing the SVD; Sparse Linear Systems Xiangmin Jiao Stony Brook University Xiangmin Jiao Numerical

More information

Review: From problem to parallel algorithm

Review: From problem to parallel algorithm Review: From problem to parallel algorithm Mathematical formulations of interesting problems abound Poisson s equation Sources: Electrostatics, gravity, fluid flow, image processing (!) Numerical solution:

More information

An Introduction to Algebraic Multigrid (AMG) Algorithms Derrick Cerwinsky and Craig C. Douglas 1/84

An Introduction to Algebraic Multigrid (AMG) Algorithms Derrick Cerwinsky and Craig C. Douglas 1/84 An Introduction to Algebraic Multigrid (AMG) Algorithms Derrick Cerwinsky and Craig C. Douglas 1/84 Introduction Almost all numerical methods for solving PDEs will at some point be reduced to solving A

More information

Stabilization and Acceleration of Algebraic Multigrid Method

Stabilization and Acceleration of Algebraic Multigrid Method Stabilization and Acceleration of Algebraic Multigrid Method Recursive Projection Algorithm A. Jemcov J.P. Maruszewski Fluent Inc. October 24, 2006 Outline 1 Need for Algorithm Stabilization and Acceleration

More information

TOPS Contributions to PFLOTRAN

TOPS Contributions to PFLOTRAN TOPS Contributions to PFLOTRAN Barry Smith Matthew Knepley Mathematics and Computer Science Division Argonne National Laboratory TOPS Meeting at SIAM PP 08 Atlanta, Georgia March 14, 2008 M. Knepley (ANL)

More information

Course Notes: Week 1

Course Notes: Week 1 Course Notes: Week 1 Math 270C: Applied Numerical Linear Algebra 1 Lecture 1: Introduction (3/28/11) We will focus on iterative methods for solving linear systems of equations (and some discussion of eigenvalues

More information

Solving Ax = b, an overview. Program

Solving Ax = b, an overview. Program Numerical Linear Algebra Improving iterative solvers: preconditioning, deflation, numerical software and parallelisation Gerard Sleijpen and Martin van Gijzen November 29, 27 Solving Ax = b, an overview

More information

Numerical Methods I Non-Square and Sparse Linear Systems

Numerical Methods I Non-Square and Sparse Linear Systems Numerical Methods I Non-Square and Sparse Linear Systems Aleksandar Donev Courant Institute, NYU 1 donev@courant.nyu.edu 1 MATH-GA 2011.003 / CSCI-GA 2945.003, Fall 2014 September 25th, 2014 A. Donev (Courant

More information

Numerical solutions of nonlinear systems of equations

Numerical solutions of nonlinear systems of equations Numerical solutions of nonlinear systems of equations Tsung-Ming Huang Department of Mathematics National Taiwan Normal University, Taiwan E-mail: min@math.ntnu.edu.tw August 28, 2011 Outline 1 Fixed points

More information

Fine-Grained Parallel Algorithms for Incomplete Factorization Preconditioning

Fine-Grained Parallel Algorithms for Incomplete Factorization Preconditioning Fine-Grained Parallel Algorithms for Incomplete Factorization Preconditioning Edmond Chow School of Computational Science and Engineering Georgia Institute of Technology, USA SPPEXA Symposium TU München,

More information

PHYS 410/555 Computational Physics Solution of Non Linear Equations (a.k.a. Root Finding) (Reference Numerical Recipes, 9.0, 9.1, 9.

PHYS 410/555 Computational Physics Solution of Non Linear Equations (a.k.a. Root Finding) (Reference Numerical Recipes, 9.0, 9.1, 9. PHYS 410/555 Computational Physics Solution of Non Linear Equations (a.k.a. Root Finding) (Reference Numerical Recipes, 9.0, 9.1, 9.4) We will consider two cases 1. f(x) = 0 1-dimensional 2. f(x) = 0 d-dimensional

More information

Lehrstuhl für Informatik 10 (Systemsimulation)

Lehrstuhl für Informatik 10 (Systemsimulation) FRIEDRICH-ALEXANDER-UNIVERSITÄT ERLANGEN-NÜRNBERG INSTITUT FÜR INFORMATIK (MATHEMATISCHE MASCHINEN UND DATENVERARBEITUNG) Lehrstuhl für Informatik 10 (Systemsimulation) Comparison of two implementations

More information

Computers and Mathematics with Applications

Computers and Mathematics with Applications Computers and Mathematics with Applications 68 (2014) 1151 1160 Contents lists available at ScienceDirect Computers and Mathematics with Applications journal homepage: www.elsevier.com/locate/camwa A GPU

More information

PowerPoints organized by Dr. Michael R. Gustafson II, Duke University

PowerPoints organized by Dr. Michael R. Gustafson II, Duke University Part 3 Chapter 10 LU Factorization PowerPoints organized by Dr. Michael R. Gustafson II, Duke University All images copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display.

More information

A multigrid method for large scale inverse problems

A multigrid method for large scale inverse problems A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of Earth and Ocean Science University of British Columbia haber@cs.ubc.ca July 4, 2003 E.Haber: Multigrid

More information

Numerical Solution Techniques in Mechanical and Aerospace Engineering

Numerical Solution Techniques in Mechanical and Aerospace Engineering Numerical Solution Techniques in Mechanical and Aerospace Engineering Chunlei Liang LECTURE 3 Solvers of linear algebraic equations 3.1. Outline of Lecture Finite-difference method for a 2D elliptic PDE

More information

AMG for a Peta-scale Navier Stokes Code

AMG for a Peta-scale Navier Stokes Code AMG for a Peta-scale Navier Stokes Code James Lottes Argonne National Laboratory October 18, 2007 The Challenge Develop an AMG iterative method to solve Poisson 2 u = f discretized on highly irregular

More information

Notes for CS542G (Iterative Solvers for Linear Systems)

Notes for CS542G (Iterative Solvers for Linear Systems) Notes for CS542G (Iterative Solvers for Linear Systems) Robert Bridson November 20, 2007 1 The Basics We re now looking at efficient ways to solve the linear system of equations Ax = b where in this course,

More information

Preface to the Second Edition. Preface to the First Edition

Preface to the Second Edition. Preface to the First Edition n page v Preface to the Second Edition Preface to the First Edition xiii xvii 1 Background in Linear Algebra 1 1.1 Matrices................................. 1 1.2 Square Matrices and Eigenvalues....................

More information

Robust solution of Poisson-like problems with aggregation-based AMG

Robust solution of Poisson-like problems with aggregation-based AMG Robust solution of Poisson-like problems with aggregation-based AMG Yvan Notay Université Libre de Bruxelles Service de Métrologie Nucléaire Paris, January 26, 215 Supported by the Belgian FNRS http://homepages.ulb.ac.be/

More information

Universität Dortmund UCHPC. Performance. Computing for Finite Element Simulations

Universität Dortmund UCHPC. Performance. Computing for Finite Element Simulations technische universität dortmund Universität Dortmund fakultät für mathematik LS III (IAM) UCHPC UnConventional High Performance Computing for Finite Element Simulations S. Turek, Chr. Becker, S. Buijssen,

More information

1. Fast Iterative Solvers of SLE

1. Fast Iterative Solvers of SLE 1. Fast Iterative Solvers of crucial drawback of solvers discussed so far: they become slower if we discretize more accurate! now: look for possible remedies relaxation: explicit application of the multigrid

More information

Convergence Behavior of a Two-Level Optimized Schwarz Preconditioner

Convergence Behavior of a Two-Level Optimized Schwarz Preconditioner Convergence Behavior of a Two-Level Optimized Schwarz Preconditioner Olivier Dubois 1 and Martin J. Gander 2 1 IMA, University of Minnesota, 207 Church St. SE, Minneapolis, MN 55455 dubois@ima.umn.edu

More information

Toward less synchronous composable multilevel methods for implicit multiphysics simulation

Toward less synchronous composable multilevel methods for implicit multiphysics simulation Toward less synchronous composable multilevel methods for implicit multiphysics simulation Jed Brown 1, Mark Adams 2, Peter Brune 1, Matt Knepley 3, Barry Smith 1 1 Mathematics and Computer Science Division,

More information

Numerical Methods for Large-Scale Nonlinear Equations

Numerical Methods for Large-Scale Nonlinear Equations Slide 1 Numerical Methods for Large-Scale Nonlinear Equations Homer Walker MA 512 April 28, 2005 Inexact Newton and Newton Krylov Methods a. Newton-iterative and inexact Newton methods. Slide 2 i. Formulation

More information

Nonlinear Preconditioning in PETSc

Nonlinear Preconditioning in PETSc Nonlinear Preconditioning in PETSc Matthew Knepley PETSc Team Computation Institute University of Chicago Department of Molecular Biology and Physiology Rush University Medical Center Algorithmic Adaptivity

More information

- Part 4 - Multicore and Manycore Technology: Chances and Challenges. Vincent Heuveline

- Part 4 - Multicore and Manycore Technology: Chances and Challenges. Vincent Heuveline - Part 4 - Multicore and Manycore Technology: Chances and Challenges Vincent Heuveline 1 Numerical Simulation of Tropical Cyclones Goal oriented adaptivity for tropical cyclones ~10⁴km ~1500km ~100km 2

More information

Newton s Method and Efficient, Robust Variants

Newton s Method and Efficient, Robust Variants Newton s Method and Efficient, Robust Variants Philipp Birken University of Kassel (SFB/TRR 30) Soon: University of Lund October 7th 2013 Efficient solution of large systems of non-linear PDEs in science

More information

Nonlinear Preconditioning in PETSc

Nonlinear Preconditioning in PETSc Nonlinear Preconditioning in PETSc Matthew Knepley PETSc Team Computation Institute University of Chicago Challenges in 21st Century Experimental Mathematical Computation ICERM, Providence, RI July 22,

More information

Edwin van der Weide and Magnus Svärd. I. Background information for the SBP-SAT scheme

Edwin van der Weide and Magnus Svärd. I. Background information for the SBP-SAT scheme Edwin van der Weide and Magnus Svärd I. Background information for the SBP-SAT scheme As is well-known, stability of a numerical scheme is a key property for a robust and accurate numerical solution. Proving

More information

COURSE Iterative methods for solving linear systems

COURSE Iterative methods for solving linear systems COURSE 0 4.3. Iterative methods for solving linear systems Because of round-off errors, direct methods become less efficient than iterative methods for large systems (>00 000 variables). An iterative scheme

More information

Chapter Two: Numerical Methods for Elliptic PDEs. 1 Finite Difference Methods for Elliptic PDEs

Chapter Two: Numerical Methods for Elliptic PDEs. 1 Finite Difference Methods for Elliptic PDEs Chapter Two: Numerical Methods for Elliptic PDEs Finite Difference Methods for Elliptic PDEs.. Finite difference scheme. We consider a simple example u := subject to Dirichlet boundary conditions ( ) u

More information

Numerical Solution I

Numerical Solution I Numerical Solution I Stationary Flow R. Kornhuber (FU Berlin) Summerschool Modelling of mass and energy transport in porous media with practical applications October 8-12, 2018 Schedule Classical Solutions

More information

High-Performance Scientific Computing

High-Performance Scientific Computing High-Performance Scientific Computing Instructor: Randy LeVeque TA: Grady Lemoine Applied Mathematics 483/583, Spring 2011 http://www.amath.washington.edu/~rjl/am583 World s fastest computers http://top500.org

More information

Iterative Methods for Solving A x = b

Iterative Methods for Solving A x = b Iterative Methods for Solving A x = b A good (free) online source for iterative methods for solving A x = b is given in the description of a set of iterative solvers called templates found at netlib: http

More information

Scientific Computing II

Scientific Computing II Technische Universität München ST 008 Institut für Informatik Dr. Miriam Mehl Scientific Computing II Final Exam, July, 008 Iterative Solvers (3 pts + 4 extra pts, 60 min) a) Steepest Descent and Conjugate

More information

Solving PDEs with CUDA Jonathan Cohen

Solving PDEs with CUDA Jonathan Cohen Solving PDEs with CUDA Jonathan Cohen jocohen@nvidia.com NVIDIA Research PDEs (Partial Differential Equations) Big topic Some common strategies Focus on one type of PDE in this talk Poisson Equation Linear

More information

Parallel programming practices for the solution of Sparse Linear Systems (motivated by computational physics and graphics)

Parallel programming practices for the solution of Sparse Linear Systems (motivated by computational physics and graphics) Parallel programming practices for the solution of Sparse Linear Systems (motivated by computational physics and graphics) Eftychios Sifakis CS758 Guest Lecture - 19 Sept 2012 Introduction Linear systems

More information

An efficient multigrid solver based on aggregation

An efficient multigrid solver based on aggregation An efficient multigrid solver based on aggregation Yvan Notay Université Libre de Bruxelles Service de Métrologie Nucléaire Graz, July 4, 2012 Co-worker: Artem Napov Supported by the Belgian FNRS http://homepages.ulb.ac.be/

More information

arxiv: v1 [math.na] 7 May 2009

arxiv: v1 [math.na] 7 May 2009 The hypersecant Jacobian approximation for quasi-newton solves of sparse nonlinear systems arxiv:0905.105v1 [math.na] 7 May 009 Abstract Johan Carlsson, John R. Cary Tech-X Corporation, 561 Arapahoe Avenue,

More information

New Streamfunction Approach for Magnetohydrodynamics

New Streamfunction Approach for Magnetohydrodynamics New Streamfunction Approac for Magnetoydrodynamics Kab Seo Kang Brooaven National Laboratory, Computational Science Center, Building 63, Room, Upton NY 973, USA. sang@bnl.gov Summary. We apply te finite

More information

Parallel Numerical Linear Algebra

Parallel Numerical Linear Algebra Parallel Numerical Linear Algebra 1 QR Factorization solving overconstrained linear systems tiled QR factorization using the PLASMA software library 2 Conjugate Gradient and Krylov Subspace Methods linear

More information

On nonlinear adaptivity with heterogeneity

On nonlinear adaptivity with heterogeneity On nonlinear adaptivity with heterogeneity Jed Brown jed@jedbrown.org (CU Boulder) Collaborators: Mark Adams (LBL), Matt Knepley (UChicago), Dave May (ETH), Laetitia Le Pourhiet (UPMC), Ravi Samtaney (KAUST)

More information

Chapter 7 Iterative Techniques in Matrix Algebra

Chapter 7 Iterative Techniques in Matrix Algebra Chapter 7 Iterative Techniques in Matrix Algebra Per-Olof Persson persson@berkeley.edu Department of Mathematics University of California, Berkeley Math 128B Numerical Analysis Vector Norms Definition

More information

Algebraic Multigrid as Solvers and as Preconditioner

Algebraic Multigrid as Solvers and as Preconditioner Ò Algebraic Multigrid as Solvers and as Preconditioner Domenico Lahaye domenico.lahaye@cs.kuleuven.ac.be http://www.cs.kuleuven.ac.be/ domenico/ Department of Computer Science Katholieke Universiteit Leuven

More information

9. Iterative Methods for Large Linear Systems

9. Iterative Methods for Large Linear Systems EE507 - Computational Techniques for EE Jitkomut Songsiri 9. Iterative Methods for Large Linear Systems introduction splitting method Jacobi method Gauss-Seidel method successive overrelaxation (SOR) 9-1

More information

Constrained Minimization and Multigrid

Constrained Minimization and Multigrid Constrained Minimization and Multigrid C. Gräser (FU Berlin), R. Kornhuber (FU Berlin), and O. Sander (FU Berlin) Workshop on PDE Constrained Optimization Hamburg, March 27-29, 2008 Matheon Outline Successive

More information

Poisson Equation in 2D

Poisson Equation in 2D A Parallel Strategy Department of Mathematics and Statistics McMaster University March 31, 2010 Outline Introduction 1 Introduction Motivation Discretization Iterative Methods 2 Additive Schwarz Method

More information

Nonlinear Iterative Solution of the Neutron Transport Equation

Nonlinear Iterative Solution of the Neutron Transport Equation Nonlinear Iterative Solution of the Neutron Transport Equation Emiliano Masiello Commissariat à l Energie Atomique de Saclay /DANS//SERMA/LTSD emiliano.masiello@cea.fr 1/37 Outline - motivations and framework

More information

Boundary Value Problems - Solving 3-D Finite-Difference problems Jacob White

Boundary Value Problems - Solving 3-D Finite-Difference problems Jacob White Introduction to Simulation - Lecture 2 Boundary Value Problems - Solving 3-D Finite-Difference problems Jacob White Thanks to Deepak Ramaswamy, Michal Rewienski, and Karen Veroy Outline Reminder about

More information

Finite Element Solver for Flux-Source Equations

Finite Element Solver for Flux-Source Equations Finite Element Solver for Flux-Source Equations Weston B. Lowrie A thesis submitted in partial fulfillment of the requirements for the degree of Master of Science in Aeronautics Astronautics University

More information

CS 323: Numerical Analysis and Computing

CS 323: Numerical Analysis and Computing CS 323: Numerical Analysis and Computing MIDTERM #2 Instructions: This is an open notes exam, i.e., you are allowed to consult any textbook, your class notes, homeworks, or any of the handouts from us.

More information

Solving PDEs with Multigrid Methods p.1

Solving PDEs with Multigrid Methods p.1 Solving PDEs with Multigrid Methods Scott MacLachlan maclachl@colorado.edu Department of Applied Mathematics, University of Colorado at Boulder Solving PDEs with Multigrid Methods p.1 Support and Collaboration

More information

Altered Jacobian Newton Iterative Method for Nonlinear Elliptic Problems

Altered Jacobian Newton Iterative Method for Nonlinear Elliptic Problems Altered Jacobian Newton Iterative Method for Nonlinear Elliptic Problems Sanjay K Khattri Abstract We present an Altered Jacobian Newton Iterative Method for solving nonlinear elliptic problems Effectiveness

More information

Solving Boundary Value Problems (with Gaussians)

Solving Boundary Value Problems (with Gaussians) What is a boundary value problem? Solving Boundary Value Problems (with Gaussians) Definition A differential equation with constraints on the boundary Michael McCourt Division Argonne National Laboratory

More information

AMS526: Numerical Analysis I (Numerical Linear Algebra)

AMS526: Numerical Analysis I (Numerical Linear Algebra) AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 24: Preconditioning and Multigrid Solver Xiangmin Jiao SUNY Stony Brook Xiangmin Jiao Numerical Analysis I 1 / 5 Preconditioning Motivation:

More information

Solution of Matrix Eigenvalue Problem

Solution of Matrix Eigenvalue Problem Outlines October 12, 2004 Outlines Part I: Review of Previous Lecture Part II: Review of Previous Lecture Outlines Part I: Review of Previous Lecture Part II: Standard Matrix Eigenvalue Problem Other Forms

More information

Lecture 18 Classical Iterative Methods

Lecture 18 Classical Iterative Methods Lecture 18 Classical Iterative Methods MIT 18.335J / 6.337J Introduction to Numerical Methods Per-Olof Persson November 14, 2006 1 Iterative Methods for Linear Systems Direct methods for solving Ax = b,

More information

CME342 Parallel Methods in Numerical Analysis. Matrix Computation: Iterative Methods II. Sparse Matrix-vector Multiplication.

CME342 Parallel Methods in Numerical Analysis. Matrix Computation: Iterative Methods II. Sparse Matrix-vector Multiplication. CME342 Parallel Methods in Numerical Analysis Matrix Computation: Iterative Methods II Outline: CG & its parallelization. Sparse Matrix-vector Multiplication. 1 Basic iterative methods: Ax = b r = b Ax

More information

GPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic

GPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic GPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic Jan Verschelde joint work with Xiangcheng Yu University of Illinois at Chicago

More information

ITERATIVE METHODS FOR NONLINEAR ELLIPTIC EQUATIONS

ITERATIVE METHODS FOR NONLINEAR ELLIPTIC EQUATIONS ITERATIVE METHODS FOR NONLINEAR ELLIPTIC EQUATIONS LONG CHEN In this chapter we discuss iterative methods for solving the finite element discretization of semi-linear elliptic equations of the form: find

More information

Aggregation-based algebraic multigrid

Aggregation-based algebraic multigrid Aggregation-based algebraic multigrid from theory to fast solvers Yvan Notay Université Libre de Bruxelles Service de Métrologie Nucléaire CEMRACS, Marseille, July 18, 2012 Supported by the Belgian FNRS

More information

Math 5630: Iterative Methods for Systems of Equations Hung Phan, UMass Lowell March 22, 2018

Math 5630: Iterative Methods for Systems of Equations Hung Phan, UMass Lowell March 22, 2018 1 Linear Systems Math 5630: Iterative Methods for Systems of Equations Hung Phan, UMass Lowell March, 018 Consider the system 4x y + z = 7 4x 8y + z = 1 x + y + 5z = 15. We then obtain x = 1 4 (7 + y z)

More information

1. Nonlinear Equations. This lecture note excerpted parts from Michael Heath and Max Gunzburger. f(x) = 0

1. Nonlinear Equations. This lecture note excerpted parts from Michael Heath and Max Gunzburger. f(x) = 0 Numerical Analysis 1 1. Nonlinear Equations This lecture note excerpted parts from Michael Heath and Max Gunzburger. Given function f, we seek value x for which where f : D R n R n is nonlinear. f(x) =

More information

Lecture 11: CMSC 878R/AMSC698R. Iterative Methods An introduction. Outline. Inverse, LU decomposition, Cholesky, SVD, etc.

Lecture 11: CMSC 878R/AMSC698R. Iterative Methods An introduction. Outline. Inverse, LU decomposition, Cholesky, SVD, etc. Lecture 11: CMSC 878R/AMSC698R Iterative Methods An introduction Outline Direct Solution of Linear Systems Inverse, LU decomposition, Cholesky, SVD, etc. Iterative methods for linear systems Why? Matrix

More information

An Overly Simplified and Brief Review of Differential Equation Solution Methods. 1. Some Common Exact Solution Methods for Differential Equations

An Overly Simplified and Brief Review of Differential Equation Solution Methods. 1. Some Common Exact Solution Methods for Differential Equations An Overly Simplified and Brief Review of Differential Equation Solution Methods We will be dealing with initial or boundary value problems. A typical initial value problem has the form y y 0 y(0) 1 A typical

More information

A Differential Variational Inequality Approach for the Simulation of Heterogeneous Materials

A Differential Variational Inequality Approach for the Simulation of Heterogeneous Materials A Differential Variational Inequality Approach for the Simulation of Heterogeneous Materials Lei Wang 1, Jungho Lee 1, Mihai Anitescu 1, Anter El Azab 2, Lois Curfman McInnes 1, Todd Munson 1, Barry Smith

More information

A STUDY OF MULTIGRID SMOOTHERS USED IN COMPRESSIBLE CFD BASED ON THE CONVECTION DIFFUSION EQUATION

A STUDY OF MULTIGRID SMOOTHERS USED IN COMPRESSIBLE CFD BASED ON THE CONVECTION DIFFUSION EQUATION ECCOMAS Congress 2016 VII European Congress on Computational Methods in Applied Sciences and Engineering M. Papadrakakis, V. Papadopoulos, G. Stefanou, V. Plevris (eds.) Crete Island, Greece, 5 10 June

More information

Linear Algebra. PHY 604: Computational Methods in Physics and Astrophysics II

Linear Algebra. PHY 604: Computational Methods in Physics and Astrophysics II Linear Algebra Numerical Linear Algebra We've now seen several places where solving linear systems comes into play Implicit ODE integration Cubic spline interpolation We'll see many more, including Solving

More information

IMPLEMENTATION OF A PARALLEL AMG SOLVER

IMPLEMENTATION OF A PARALLEL AMG SOLVER IMPLEMENTATION OF A PARALLEL AMG SOLVER Tony Saad May 2005 http://tsaad.utsi.edu - tsaad@utsi.edu PLAN INTRODUCTION 2 min. MULTIGRID METHODS.. 3 min. PARALLEL IMPLEMENTATION PARTITIONING. 1 min. RENUMBERING...

More information

6. Multigrid & Krylov Methods. June 1, 2010

6. Multigrid & Krylov Methods. June 1, 2010 June 1, 2010 Scientific Computing II, Tobias Weinzierl page 1 of 27 Outline of This Session A recapitulation of iterative schemes Lots of advertisement Multigrid Ingredients Multigrid Analysis Scientific

More information

Parallel Polynomial Evaluation

Parallel Polynomial Evaluation Parallel Polynomial Evaluation Jan Verschelde joint work with Genady Yoffe University of Illinois at Chicago Department of Mathematics, Statistics, and Computer Science http://www.math.uic.edu/ jan jan@math.uic.edu

More information

Multigrid absolute value preconditioning

Multigrid absolute value preconditioning Multigrid absolute value preconditioning Eugene Vecharynski 1 Andrew Knyazev 2 (speaker) 1 Department of Computer Science and Engineering University of Minnesota 2 Department of Mathematical and Statistical

More information

ARock: an algorithmic framework for asynchronous parallel coordinate updates

ARock: an algorithmic framework for asynchronous parallel coordinate updates ARock: an algorithmic framework for asynchronous parallel coordinate updates Zhimin Peng, Yangyang Xu, Ming Yan, Wotao Yin ( UCLA Math, U.Waterloo DCO) UCLA CAM Report 15-37 ShanghaiTech SSDS 15 June 25,

More information

Scalable Non-Linear Compact Schemes

Scalable Non-Linear Compact Schemes Scalable Non-Linear Compact Schemes Debojyoti Ghosh Emil M. Constantinescu Jed Brown Mathematics Computer Science Argonne National Laboratory International Conference on Spectral and High Order Methods

More information

Multigrid Solvers in Space and Time for Highly Concurrent Architectures

Multigrid Solvers in Space and Time for Highly Concurrent Architectures Multigrid Solvers in Space and Time for Highly Concurrent Architectures Future CFD Technologies Workshop, Kissimmee, Florida January 6-7, 2018 Robert D. Falgout Center for Applied Scientific Computing

More information

Computational Linear Algebra

Computational Linear Algebra Computational Linear Algebra PD Dr. rer. nat. habil. Ralf Peter Mundani Computation in Engineering / BGU Scientific Computing in Computer Science / INF Winter Term 2017/18 Part 3: Iterative Methods PD

More information

Multigrid solvers for equations arising in implicit MHD simulations

Multigrid solvers for equations arising in implicit MHD simulations Multigrid solvers for equations arising in implicit MHD simulations smoothing Finest Grid Mark F. Adams Department of Applied Physics & Applied Mathematics Columbia University Ravi Samtaney PPPL Achi Brandt

More information

Integration of PETSc for Nonlinear Solves

Integration of PETSc for Nonlinear Solves Integration of PETSc for Nonlinear Solves Ben Jamroz, Travis Austin, Srinath Vadlamani, Scott Kruger Tech-X Corporation jamroz@txcorp.com http://www.txcorp.com NIMROD Meeting: Aug 10, 2010 Boulder, CO

More information

Mflop/s per Node. Nonlinear Iterations

Mflop/s per Node. Nonlinear Iterations Lagrange-Newton-Krylov-Schur-Schwarz: Methods for Matrix-free PDE-constrained Optimization D. Hovland Paul & Computer Science Division, Argonne National Laboratory Mathematics E. Keyes David & Statistics

More information

An Introduction to Numerical Continuation Methods. with Application to some Problems from Physics. Eusebius Doedel. Cuzco, Peru, May 2013

An Introduction to Numerical Continuation Methods. with Application to some Problems from Physics. Eusebius Doedel. Cuzco, Peru, May 2013 An Introduction to Numerical Continuation Methods with Application to some Problems from Physics Eusebius Doedel Cuzco, Peru, May 2013 Persistence of Solutions Newton s method for solving a nonlinear equation

More information

Bindel, Fall 2016 Matrix Computations (CS 6210) Notes for

Bindel, Fall 2016 Matrix Computations (CS 6210) Notes for 1 Iteration basics Notes for 2016-11-07 An iterative solver for Ax = b is produces a sequence of approximations x (k) x. We always stop after finitely many steps, based on some convergence criterion, e.g.

More information

A Scalable, Parallel Implementation of Weighted, Non-Linear Compact Schemes

A Scalable, Parallel Implementation of Weighted, Non-Linear Compact Schemes A Scalable, Parallel Implementation of Weighted, Non-Linear Compact Schemes Debojyoti Ghosh Emil M. Constantinescu Jed Brown Mathematics Computer Science Argonne National Laboratory SIAM Annual Meeting

More information