FAS and Solver Performance
|
|
- Mark Brown
- 5 years ago
- Views:
Transcription
1 FAS and Solver Performance Matthew Knepley Mathematics and Computer Science Division Argonne National Laboratory Fall AMS Central Section Meeting Chicago, IL Oct 05 06, 2007 M. Knepley (ANL) FAS AMS 07 1 / 27
2 Payoff Why should I care? 1 Optimal multilevel solvers are necessary 2 Processor flops are increasing much faster than bandwidth 3 Nonlinear algorithms can be efficient than linear algorithms 4 Presents an opportunity for numerical algebraic geometry M. Knepley (ANL) FAS AMS 07 2 / 27
3 Payoff Why should I care? 1 Optimal multilevel solvers are necessary 2 Processor flops are increasing much faster than bandwidth 3 Nonlinear algorithms can be efficient than linear algorithms 4 Presents an opportunity for numerical algebraic geometry M. Knepley (ANL) FAS AMS 07 2 / 27
4 Payoff Why should I care? 1 Optimal multilevel solvers are necessary 2 Processor flops are increasing much faster than bandwidth 3 Nonlinear algorithms can be efficient than linear algorithms 4 Presents an opportunity for numerical algebraic geometry M. Knepley (ANL) FAS AMS 07 2 / 27
5 Payoff Why should I care? 1 Optimal multilevel solvers are necessary 2 Processor flops are increasing much faster than bandwidth 3 Nonlinear algorithms can be efficient than linear algorithms 4 Presents an opportunity for numerical algebraic geometry M. Knepley (ANL) FAS AMS 07 2 / 27
6 Outline Simulation Basics 1 Simulation Basics 2 Newton-Multigrid 3 Machine Performance 4 FAS and Multigrid-Newton 5 Possible Extensions M. Knepley (ANL) FAS AMS 07 3 / 27
7 Simulation Basics Necessity Of Simulation Experiment are... Expensive Difficult Impossible Dangerous M. Knepley (ANL) FAS AMS 07 4 / 27
8 Simulation Basics Why Optimal Algorithms? The more powerful the computer, the greater the importance of optimality Example: Suppose Alg 1 solves a problem in time CN 2, N is the input size Suppose Alg 2 solves the same problem in time CN Suppose Alg 1 and Alg 2 are able to use 10,000 processors In constant time compared to serial, Alg1 can run a problem 100X larger Alg2 can run a problem 10,000X larger Alternatively, filling the machine s memory, Alg1 requires 100X time Alg2 runs in constant time M. Knepley (ANL) FAS AMS 07 5 / 27
9 Outline Newton-Multigrid 1 Simulation Basics 2 Newton-Multigrid 3 Machine Performance 4 FAS and Multigrid-Newton 5 Possible Extensions M. Knepley (ANL) FAS AMS 07 6 / 27
10 What Is Optimal? Newton-Multigrid I will define optimal as an O(N) solution algorithm These are generally hierarchical, so we need hierarchy generation assembly on subdomains restriction and prolongation M. Knepley (ANL) FAS AMS 07 7 / 27
11 Newton-Multigrid The Bratu Problem u + λe u = f in Ω (1) u = g on Ω (2) Also called the Solid-Fuel Ignition equation Can be treated as a nonlinear eigenvalue problem Has two solution branches until λ = 6.28 M. Knepley (ANL) FAS AMS 07 8 / 27
12 Newton s Method Newton-Multigrid 0 = F(u + δu) = F(u) + J(u)δu (3) so that u + δu = u J(u) 1 F(u) (4) Quadratic convergence J can be solved approximately (Dembo-Eisensat-Steihaug) M. Knepley (ANL) FAS AMS 07 9 / 27
13 Linear Multigrid Newton-Multigrid Smoothing (typically Gauss-Seidel) x new = S(x old, b) (5) Coarse-grid Correction J c δx c = R(b Jx old ) (6) x new = x old + R T δx c (7) M. Knepley (ANL) FAS AMS / 27
14 Newton-Multigrid Linear Convergence Convergence to r < 10 9 b using GMRES(30)/ILU Elements Iterations M. Knepley (ANL) FAS AMS / 27
15 Newton-Multigrid Linear Convergence Convergence to r < 10 9 b using GMRES(30)/MG Elements Iterations M. Knepley (ANL) FAS AMS / 27
16 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27
17 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27
18 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27
19 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27
20 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27
21 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27
22 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27
23 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27
24 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27
25 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27
26 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27
27 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27
28 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27
29 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27
30 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27
31 Newton-Multigrid Linear Multigrid Memory Access Memory J Processor M. Knepley (ANL) FAS AMS / 27
32 Outline Machine Performance 1 Simulation Basics 2 Newton-Multigrid 3 Machine Performance 4 FAS and Multigrid-Newton 5 Possible Extensions M. Knepley (ANL) FAS AMS / 27
33 Machine Performance STREAM Benchmark Simple benchmark program measuring sustainable memory bandwidth Protoypical operation is Triad (WAXPY): w = y + αx Measures the memory bandwidth bottleneck (much below peak) Datasets outstrip cache Machine Peak (MF/s) Triad (MB/s) MF/MW Eq. MF/s Matt s Laptop (5.5%) Intel Core2 Quad (1.2%) Tesla 1060C * (0.8%) Table: Bandwidth limited machine performance M. Knepley (ANL) FAS AMS / 27
34 Machine Performance Analysis of Sparse Matvec (SpMV) Assumptions No cache misses No waits on memory references Notation m Number of matrix rows nz Number of nonzero matrix elements V Number of vectors to multiply We can look at bandwidth needed for peak performance ( V ) m nz + 6 V or achieveable performance given a bandwith BW byte/flop (8) Vnz BW Mflop/s (9) (8V + 2)m + 6nz Towards Realistic Performance Bounds for Implicit CFD Codes, Gropp, Kaushik, Keyes, and Smith. M. Knepley (ANL) FAS AMS / 27
35 Machine Performance Improving Serial Performance For a single matvec with 3D FD Poisson, Matt s laptop can achieve at most 1 (8 + 2) 1 bytes/flop( MB/s) = 151 MFlops/s, (10) which is a dismal 8.8% of peak. Can improve performance by Blocking Multiple vectors but operation issue limitations take over. M. Knepley (ANL) FAS AMS / 27
36 Machine Performance Improving Serial Performance For a single matvec with 3D FD Poisson, Matt s laptop can achieve at most 1 (8 + 2) 1 bytes/flop( MB/s) = 151 MFlops/s, (10) which is a dismal 8.8% of peak. Better approaches: Unassembled operator application (Spectral elements, FMM) N data, N 2 computation Nonlinear evaluation (Picard, FAS, Exact Polynomial Solvers) N data, N k computation M. Knepley (ANL) FAS AMS / 27
37 Outline FAS and Multigrid-Newton 1 Simulation Basics 2 Newton-Multigrid 3 Machine Performance 4 FAS and Multigrid-Newton 5 Possible Extensions M. Knepley (ANL) FAS AMS / 27
38 FAS and Multigrid-Newton Matrix-Free Smoothing 7/23/15, 8:50 AM = We can use point Jacobi x new i = x old i + J 1 ii (b i J T i x old ) (11) In the nonlinear case, J T i x old = e T i F (x)x old (12) which might be calculated automatically using AD. M. Knepley (ANL) FAS AMS / 27
39 FAS and Multigrid-Newton Nonlinear Gauss-Seidel If we have an initial guess u, b = 0 F(u), xi new = xi old + J 1 ii (b i Ji T x old ) (13) xi new = xi old + J 1 ii ( F i (u) F i (u) T x old ) (14) xi new = xi old J 1 ii (F i (u) + F i (u) T x old ) (15) xi new = xi old J 1 ii F i (u + x old ) (16) u new i = u old i J 1 ii F i (u old ) (17) This is just Newton s method on a single equation at a time... which is Nonlinear Gauss-Seidel. M. Knepley (ANL) FAS AMS / 27
40 FAS and Multigrid-Newton Nonlinear Gauss-Seidel If we have an initial guess u, b = 0 F(u), xi new = xi old + J 1 ii (b i Ji T x old ) (13) xi new = xi old + J 1 ii ( F i (u) F i (u) T x old ) (14) xi new = xi old J 1 ii (F i (u) + F i (u) T x old ) (15) xi new = xi old J 1 ii F i (u + x old ) (16) u new i = u old i J 1 ii F i (u old ) (17) This is just Newton s method on a single equation at a time... which is Nonlinear Gauss-Seidel. M. Knepley (ANL) FAS AMS / 27
41 FAS and Multigrid-Newton Nonlinear Gauss-Seidel If we have an initial guess u, b = 0 F(u), xi new = xi old + J 1 ii (b i Ji T x old ) (13) xi new = xi old + J 1 ii ( F i (u) F i (u) T x old ) (14) xi new = xi old J 1 ii (F i (u) + F i (u) T x old ) (15) xi new = xi old J 1 ii F i (u + x old ) (16) u new i = u old i J 1 ii F i (u old ) (17) This is just Newton s method on a single equation at a time... which is Nonlinear Gauss-Seidel. M. Knepley (ANL) FAS AMS / 27
42 FAS and Multigrid-Newton Nonlinear Multigrid Most authors just offer an ansatz with nonlinear smoothing x new = S(x old, b) (18) and coarse-grid correction F c (x c ) = F c ( x c ) + γr(b F(x old )) (19) x new = x old + 1 γ RT (x c x c ) (20) where x is an approximate solution. If F is a linear operator L, the correction reduces to L c (x c ) = L c ( x c ) + γr(b L(x old )) (21) L c (x c x c ) = γr(b L(x old )) (22) L c δx c = γrr (23) M. Knepley (ANL) FAS AMS / 27
43 FAS and Multigrid-Newton Nonlinear Multigrid Most authors just offer an ansatz with nonlinear smoothing and coarse-grid correction x new = S(x old, b) (18) F c (x c ) = F c ( x c ) + γr(b F(x old )) (19) x new = x old + 1 γ RT (x c x c ) (20) where x is an approximate solution. and the update becomes x new = x old + 1 γ RT δx c (21) x new = x old + R T ˆL 1 c Rr (22) M. Knepley (ANL) FAS AMS / 27
44 FAS and Multigrid-Newton Nonlinear Multigrid It is instructive to look at the alternate derivation of Barry Smith Begin with the nonlinear generalization F(u) = 0, for a correction and then using Taylor series we have the correction J c x c = R(b Jx old ) (23) J c x c = R(F(u) + Jx old ) (24) F(u old ) = F(u) + J(u old u) +... (25) F c (uc old + x c ) = F c (uc old ) + J c x c +... (26) F c (u old c + x c ) F c (uc old ) = RF(u old ) (27) F c (uc old + x c ) = F c (uc old ) RF(u old ) (28) M. Knepley (ANL) FAS AMS / 27
45 FAS and Multigrid-Newton Nonlinear Multigrid It is instructive to look at the alternate derivation of Barry Smith Begin with the nonlinear generalization F(u) = 0, for a correction and then using Taylor series and the same update J c x c = R(b Jx old ) (23) J c x c = R(F(u) + Jx old ) (24) F(u old ) = F(u) + J(u old u) +... (25) F c (uc old + x c ) = F c (uc old ) + J c x c +... (26) x new = x old + R T x c (27) M. Knepley (ANL) FAS AMS / 27
46 FAS and Multigrid-Newton Spectrum of Methods Newton-Multigrid 7/23/15, 8:51 AM FAS When does linearization happen? Which Jacobian entries are updated? M. Knepley (ANL) FAS AMS / 27
47 FAS and Multigrid-Newton Nonlinear Convergence Convergence to r < 10 9 r 0 using Newton/GMRES(30)/ILU Elements Iterations M. Knepley (ANL) FAS AMS / 27
48 FAS and Multigrid-Newton Nonlinear Convergence M. Knepley (ANL) FAS AMS / 27
49 Outline Possible Extensions 1 Simulation Basics 2 Newton-Multigrid 3 Machine Performance 4 FAS and Multigrid-Newton 5 Possible Extensions M. Knepley (ANL) FAS AMS / 27
50 Possible Extensions Polynomial Solvers A great opportunity exists for polynomial solvers Better performance Bandwidth considerations only intensify on multicore chips Petascale systems will need these improvements More robust Most practical engineering calculations are quadratic New algorithms Can multiple solutions speed up convergence? M. Knepley (ANL) FAS AMS / 27
51 Conclusions Possible Extensions Newton-Multigrid provides Good nonlinear solves Simple interface for software libraries Low computational efficiency Multigrid-FAS provides Good nonlinear solves Lower memory bandwidth and storage Potentially high computational efficiency Requires formation on small systems on the fly M. Knepley (ANL) FAS AMS / 27
52 Possible Extensions PETSc Resources Can download tarballs or clone a Mercurial repository Hyperlinked documentation Manual Manual pages for evey method HTML of all example code (linked to manual pages) FAQ Full support at petsc-maint@mcs.anl.gov M. Knepley (ANL) FAS AMS / 27
ERLANGEN REGIONAL COMPUTING CENTER
ERLANGEN REGIONAL COMPUTING CENTER Making Sense of Performance Numbers Georg Hager Erlangen Regional Computing Center (RRZE) Friedrich-Alexander-Universität Erlangen-Nürnberg OpenMPCon 2018 Barcelona,
More informationKasetsart University Workshop. Multigrid methods: An introduction
Kasetsart University Workshop Multigrid methods: An introduction Dr. Anand Pardhanani Mathematics Department Earlham College Richmond, Indiana USA pardhan@earlham.edu A copy of these slides is available
More informationLecture 17: Iterative Methods and Sparse Linear Algebra
Lecture 17: Iterative Methods and Sparse Linear Algebra David Bindel 25 Mar 2014 Logistics HW 3 extended to Wednesday after break HW 4 should come out Monday after break Still need project description
More informationIndefinite and physics-based preconditioning
Indefinite and physics-based preconditioning Jed Brown VAW, ETH Zürich 2009-01-29 Newton iteration Standard form of a nonlinear system F (u) 0 Iteration Solve: Update: J(ũ)u F (ũ) ũ + ũ + u Example (p-bratu)
More informationHigh Performance Nonlinear Solvers
What is a nonlinear system? High Performance Nonlinear Solvers Michael McCourt Division Argonne National Laboratory IIT Meshfree Seminar September 19, 2011 Every nonlinear system of equations can be described
More informationAn Accelerated Block-Parallel Newton Method via Overlapped Partitioning
An Accelerated Block-Parallel Newton Method via Overlapped Partitioning Yurong Chen Lab. of Parallel Computing, Institute of Software, CAS (http://www.rdcps.ac.cn/~ychen/english.htm) Summary. This paper
More informationNumerical Methods I Solving Nonlinear Equations
Numerical Methods I Solving Nonlinear Equations Aleksandar Donev Courant Institute, NYU 1 donev@courant.nyu.edu 1 MATH-GA 2011.003 / CSCI-GA 2945.003, Fall 2014 October 16th, 2014 A. Donev (Courant Institute)
More informationMinisymposia 9 and 34: Avoiding Communication in Linear Algebra. Jim Demmel UC Berkeley bebop.cs.berkeley.edu
Minisymposia 9 and 34: Avoiding Communication in Linear Algebra Jim Demmel UC Berkeley bebop.cs.berkeley.edu Motivation (1) Increasing parallelism to exploit From Top500 to multicores in your laptop Exponentially
More informationNumerical Programming I (for CSE)
Technische Universität München WT 1/13 Fakultät für Mathematik Prof. Dr. M. Mehl B. Gatzhammer January 1, 13 Numerical Programming I (for CSE) Tutorial 1: Iterative Methods 1) Relaxation Methods a) Let
More informationLinear Solvers. Andrew Hazel
Linear Solvers Andrew Hazel Introduction Thus far we have talked about the formulation and discretisation of physical problems...... and stopped when we got to a discrete linear system of equations. Introduction
More informationMULTIGRID METHODS FOR NONLINEAR PROBLEMS: AN OVERVIEW
MULTIGRID METHODS FOR NONLINEAR PROBLEMS: AN OVERVIEW VAN EMDEN HENSON CENTER FOR APPLIED SCIENTIFIC COMPUTING LAWRENCE LIVERMORE NATIONAL LABORATORY Abstract Since their early application to elliptic
More informationAIMS Exercise Set # 1
AIMS Exercise Set #. Determine the form of the single precision floating point arithmetic used in the computers at AIMS. What is the largest number that can be accurately represented? What is the smallest
More informationA Domain Decomposition Based Jacobi-Davidson Algorithm for Quantum Dot Simulation
A Domain Decomposition Based Jacobi-Davidson Algorithm for Quantum Dot Simulation Tao Zhao 1, Feng-Nan Hwang 2 and Xiao-Chuan Cai 3 Abstract In this paper, we develop an overlapping domain decomposition
More informationMultigrid Methods and their application in CFD
Multigrid Methods and their application in CFD Michael Wurst TU München 16.06.2009 1 Multigrid Methods Definition Multigrid (MG) methods in numerical analysis are a group of algorithms for solving differential
More informationPartial Differential Equations
Partial Differential Equations Introduction Deng Li Discretization Methods Chunfang Chen, Danny Thorne, Adam Zornes CS521 Feb.,7, 2006 What do You Stand For? A PDE is a Partial Differential Equation This
More information6. Iterative Methods for Linear Systems. The stepwise approach to the solution...
6 Iterative Methods for Linear Systems The stepwise approach to the solution Miriam Mehl: 6 Iterative Methods for Linear Systems The stepwise approach to the solution, January 18, 2013 1 61 Large Sparse
More informationNewton additive and multiplicative Schwarz iterative methods
IMA Journal of Numerical Analysis (2008) 28, 143 161 doi:10.1093/imanum/drm015 Advance Access publication on July 16, 2007 Newton additive and multiplicative Schwarz iterative methods JOSEP ARNAL, VIOLETA
More informationAMS526: Numerical Analysis I (Numerical Linear Algebra for Computational and Data Sciences)
AMS526: Numerical Analysis I (Numerical Linear Algebra for Computational and Data Sciences) Lecture 19: Computing the SVD; Sparse Linear Systems Xiangmin Jiao Stony Brook University Xiangmin Jiao Numerical
More informationReview: From problem to parallel algorithm
Review: From problem to parallel algorithm Mathematical formulations of interesting problems abound Poisson s equation Sources: Electrostatics, gravity, fluid flow, image processing (!) Numerical solution:
More informationAn Introduction to Algebraic Multigrid (AMG) Algorithms Derrick Cerwinsky and Craig C. Douglas 1/84
An Introduction to Algebraic Multigrid (AMG) Algorithms Derrick Cerwinsky and Craig C. Douglas 1/84 Introduction Almost all numerical methods for solving PDEs will at some point be reduced to solving A
More informationStabilization and Acceleration of Algebraic Multigrid Method
Stabilization and Acceleration of Algebraic Multigrid Method Recursive Projection Algorithm A. Jemcov J.P. Maruszewski Fluent Inc. October 24, 2006 Outline 1 Need for Algorithm Stabilization and Acceleration
More informationTOPS Contributions to PFLOTRAN
TOPS Contributions to PFLOTRAN Barry Smith Matthew Knepley Mathematics and Computer Science Division Argonne National Laboratory TOPS Meeting at SIAM PP 08 Atlanta, Georgia March 14, 2008 M. Knepley (ANL)
More informationCourse Notes: Week 1
Course Notes: Week 1 Math 270C: Applied Numerical Linear Algebra 1 Lecture 1: Introduction (3/28/11) We will focus on iterative methods for solving linear systems of equations (and some discussion of eigenvalues
More informationSolving Ax = b, an overview. Program
Numerical Linear Algebra Improving iterative solvers: preconditioning, deflation, numerical software and parallelisation Gerard Sleijpen and Martin van Gijzen November 29, 27 Solving Ax = b, an overview
More informationNumerical Methods I Non-Square and Sparse Linear Systems
Numerical Methods I Non-Square and Sparse Linear Systems Aleksandar Donev Courant Institute, NYU 1 donev@courant.nyu.edu 1 MATH-GA 2011.003 / CSCI-GA 2945.003, Fall 2014 September 25th, 2014 A. Donev (Courant
More informationNumerical solutions of nonlinear systems of equations
Numerical solutions of nonlinear systems of equations Tsung-Ming Huang Department of Mathematics National Taiwan Normal University, Taiwan E-mail: min@math.ntnu.edu.tw August 28, 2011 Outline 1 Fixed points
More informationFine-Grained Parallel Algorithms for Incomplete Factorization Preconditioning
Fine-Grained Parallel Algorithms for Incomplete Factorization Preconditioning Edmond Chow School of Computational Science and Engineering Georgia Institute of Technology, USA SPPEXA Symposium TU München,
More informationPHYS 410/555 Computational Physics Solution of Non Linear Equations (a.k.a. Root Finding) (Reference Numerical Recipes, 9.0, 9.1, 9.
PHYS 410/555 Computational Physics Solution of Non Linear Equations (a.k.a. Root Finding) (Reference Numerical Recipes, 9.0, 9.1, 9.4) We will consider two cases 1. f(x) = 0 1-dimensional 2. f(x) = 0 d-dimensional
More informationLehrstuhl für Informatik 10 (Systemsimulation)
FRIEDRICH-ALEXANDER-UNIVERSITÄT ERLANGEN-NÜRNBERG INSTITUT FÜR INFORMATIK (MATHEMATISCHE MASCHINEN UND DATENVERARBEITUNG) Lehrstuhl für Informatik 10 (Systemsimulation) Comparison of two implementations
More informationComputers and Mathematics with Applications
Computers and Mathematics with Applications 68 (2014) 1151 1160 Contents lists available at ScienceDirect Computers and Mathematics with Applications journal homepage: www.elsevier.com/locate/camwa A GPU
More informationPowerPoints organized by Dr. Michael R. Gustafson II, Duke University
Part 3 Chapter 10 LU Factorization PowerPoints organized by Dr. Michael R. Gustafson II, Duke University All images copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
More informationA multigrid method for large scale inverse problems
A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of Earth and Ocean Science University of British Columbia haber@cs.ubc.ca July 4, 2003 E.Haber: Multigrid
More informationNumerical Solution Techniques in Mechanical and Aerospace Engineering
Numerical Solution Techniques in Mechanical and Aerospace Engineering Chunlei Liang LECTURE 3 Solvers of linear algebraic equations 3.1. Outline of Lecture Finite-difference method for a 2D elliptic PDE
More informationAMG for a Peta-scale Navier Stokes Code
AMG for a Peta-scale Navier Stokes Code James Lottes Argonne National Laboratory October 18, 2007 The Challenge Develop an AMG iterative method to solve Poisson 2 u = f discretized on highly irregular
More informationNotes for CS542G (Iterative Solvers for Linear Systems)
Notes for CS542G (Iterative Solvers for Linear Systems) Robert Bridson November 20, 2007 1 The Basics We re now looking at efficient ways to solve the linear system of equations Ax = b where in this course,
More informationPreface to the Second Edition. Preface to the First Edition
n page v Preface to the Second Edition Preface to the First Edition xiii xvii 1 Background in Linear Algebra 1 1.1 Matrices................................. 1 1.2 Square Matrices and Eigenvalues....................
More informationRobust solution of Poisson-like problems with aggregation-based AMG
Robust solution of Poisson-like problems with aggregation-based AMG Yvan Notay Université Libre de Bruxelles Service de Métrologie Nucléaire Paris, January 26, 215 Supported by the Belgian FNRS http://homepages.ulb.ac.be/
More informationUniversität Dortmund UCHPC. Performance. Computing for Finite Element Simulations
technische universität dortmund Universität Dortmund fakultät für mathematik LS III (IAM) UCHPC UnConventional High Performance Computing for Finite Element Simulations S. Turek, Chr. Becker, S. Buijssen,
More information1. Fast Iterative Solvers of SLE
1. Fast Iterative Solvers of crucial drawback of solvers discussed so far: they become slower if we discretize more accurate! now: look for possible remedies relaxation: explicit application of the multigrid
More informationConvergence Behavior of a Two-Level Optimized Schwarz Preconditioner
Convergence Behavior of a Two-Level Optimized Schwarz Preconditioner Olivier Dubois 1 and Martin J. Gander 2 1 IMA, University of Minnesota, 207 Church St. SE, Minneapolis, MN 55455 dubois@ima.umn.edu
More informationToward less synchronous composable multilevel methods for implicit multiphysics simulation
Toward less synchronous composable multilevel methods for implicit multiphysics simulation Jed Brown 1, Mark Adams 2, Peter Brune 1, Matt Knepley 3, Barry Smith 1 1 Mathematics and Computer Science Division,
More informationNumerical Methods for Large-Scale Nonlinear Equations
Slide 1 Numerical Methods for Large-Scale Nonlinear Equations Homer Walker MA 512 April 28, 2005 Inexact Newton and Newton Krylov Methods a. Newton-iterative and inexact Newton methods. Slide 2 i. Formulation
More informationNonlinear Preconditioning in PETSc
Nonlinear Preconditioning in PETSc Matthew Knepley PETSc Team Computation Institute University of Chicago Department of Molecular Biology and Physiology Rush University Medical Center Algorithmic Adaptivity
More information- Part 4 - Multicore and Manycore Technology: Chances and Challenges. Vincent Heuveline
- Part 4 - Multicore and Manycore Technology: Chances and Challenges Vincent Heuveline 1 Numerical Simulation of Tropical Cyclones Goal oriented adaptivity for tropical cyclones ~10⁴km ~1500km ~100km 2
More informationNewton s Method and Efficient, Robust Variants
Newton s Method and Efficient, Robust Variants Philipp Birken University of Kassel (SFB/TRR 30) Soon: University of Lund October 7th 2013 Efficient solution of large systems of non-linear PDEs in science
More informationNonlinear Preconditioning in PETSc
Nonlinear Preconditioning in PETSc Matthew Knepley PETSc Team Computation Institute University of Chicago Challenges in 21st Century Experimental Mathematical Computation ICERM, Providence, RI July 22,
More informationEdwin van der Weide and Magnus Svärd. I. Background information for the SBP-SAT scheme
Edwin van der Weide and Magnus Svärd I. Background information for the SBP-SAT scheme As is well-known, stability of a numerical scheme is a key property for a robust and accurate numerical solution. Proving
More informationCOURSE Iterative methods for solving linear systems
COURSE 0 4.3. Iterative methods for solving linear systems Because of round-off errors, direct methods become less efficient than iterative methods for large systems (>00 000 variables). An iterative scheme
More informationChapter Two: Numerical Methods for Elliptic PDEs. 1 Finite Difference Methods for Elliptic PDEs
Chapter Two: Numerical Methods for Elliptic PDEs Finite Difference Methods for Elliptic PDEs.. Finite difference scheme. We consider a simple example u := subject to Dirichlet boundary conditions ( ) u
More informationNumerical Solution I
Numerical Solution I Stationary Flow R. Kornhuber (FU Berlin) Summerschool Modelling of mass and energy transport in porous media with practical applications October 8-12, 2018 Schedule Classical Solutions
More informationHigh-Performance Scientific Computing
High-Performance Scientific Computing Instructor: Randy LeVeque TA: Grady Lemoine Applied Mathematics 483/583, Spring 2011 http://www.amath.washington.edu/~rjl/am583 World s fastest computers http://top500.org
More informationIterative Methods for Solving A x = b
Iterative Methods for Solving A x = b A good (free) online source for iterative methods for solving A x = b is given in the description of a set of iterative solvers called templates found at netlib: http
More informationScientific Computing II
Technische Universität München ST 008 Institut für Informatik Dr. Miriam Mehl Scientific Computing II Final Exam, July, 008 Iterative Solvers (3 pts + 4 extra pts, 60 min) a) Steepest Descent and Conjugate
More informationSolving PDEs with CUDA Jonathan Cohen
Solving PDEs with CUDA Jonathan Cohen jocohen@nvidia.com NVIDIA Research PDEs (Partial Differential Equations) Big topic Some common strategies Focus on one type of PDE in this talk Poisson Equation Linear
More informationParallel programming practices for the solution of Sparse Linear Systems (motivated by computational physics and graphics)
Parallel programming practices for the solution of Sparse Linear Systems (motivated by computational physics and graphics) Eftychios Sifakis CS758 Guest Lecture - 19 Sept 2012 Introduction Linear systems
More informationAn efficient multigrid solver based on aggregation
An efficient multigrid solver based on aggregation Yvan Notay Université Libre de Bruxelles Service de Métrologie Nucléaire Graz, July 4, 2012 Co-worker: Artem Napov Supported by the Belgian FNRS http://homepages.ulb.ac.be/
More informationarxiv: v1 [math.na] 7 May 2009
The hypersecant Jacobian approximation for quasi-newton solves of sparse nonlinear systems arxiv:0905.105v1 [math.na] 7 May 009 Abstract Johan Carlsson, John R. Cary Tech-X Corporation, 561 Arapahoe Avenue,
More informationNew Streamfunction Approach for Magnetohydrodynamics
New Streamfunction Approac for Magnetoydrodynamics Kab Seo Kang Brooaven National Laboratory, Computational Science Center, Building 63, Room, Upton NY 973, USA. sang@bnl.gov Summary. We apply te finite
More informationParallel Numerical Linear Algebra
Parallel Numerical Linear Algebra 1 QR Factorization solving overconstrained linear systems tiled QR factorization using the PLASMA software library 2 Conjugate Gradient and Krylov Subspace Methods linear
More informationOn nonlinear adaptivity with heterogeneity
On nonlinear adaptivity with heterogeneity Jed Brown jed@jedbrown.org (CU Boulder) Collaborators: Mark Adams (LBL), Matt Knepley (UChicago), Dave May (ETH), Laetitia Le Pourhiet (UPMC), Ravi Samtaney (KAUST)
More informationChapter 7 Iterative Techniques in Matrix Algebra
Chapter 7 Iterative Techniques in Matrix Algebra Per-Olof Persson persson@berkeley.edu Department of Mathematics University of California, Berkeley Math 128B Numerical Analysis Vector Norms Definition
More informationAlgebraic Multigrid as Solvers and as Preconditioner
Ò Algebraic Multigrid as Solvers and as Preconditioner Domenico Lahaye domenico.lahaye@cs.kuleuven.ac.be http://www.cs.kuleuven.ac.be/ domenico/ Department of Computer Science Katholieke Universiteit Leuven
More information9. Iterative Methods for Large Linear Systems
EE507 - Computational Techniques for EE Jitkomut Songsiri 9. Iterative Methods for Large Linear Systems introduction splitting method Jacobi method Gauss-Seidel method successive overrelaxation (SOR) 9-1
More informationConstrained Minimization and Multigrid
Constrained Minimization and Multigrid C. Gräser (FU Berlin), R. Kornhuber (FU Berlin), and O. Sander (FU Berlin) Workshop on PDE Constrained Optimization Hamburg, March 27-29, 2008 Matheon Outline Successive
More informationPoisson Equation in 2D
A Parallel Strategy Department of Mathematics and Statistics McMaster University March 31, 2010 Outline Introduction 1 Introduction Motivation Discretization Iterative Methods 2 Additive Schwarz Method
More informationNonlinear Iterative Solution of the Neutron Transport Equation
Nonlinear Iterative Solution of the Neutron Transport Equation Emiliano Masiello Commissariat à l Energie Atomique de Saclay /DANS//SERMA/LTSD emiliano.masiello@cea.fr 1/37 Outline - motivations and framework
More informationBoundary Value Problems - Solving 3-D Finite-Difference problems Jacob White
Introduction to Simulation - Lecture 2 Boundary Value Problems - Solving 3-D Finite-Difference problems Jacob White Thanks to Deepak Ramaswamy, Michal Rewienski, and Karen Veroy Outline Reminder about
More informationFinite Element Solver for Flux-Source Equations
Finite Element Solver for Flux-Source Equations Weston B. Lowrie A thesis submitted in partial fulfillment of the requirements for the degree of Master of Science in Aeronautics Astronautics University
More informationCS 323: Numerical Analysis and Computing
CS 323: Numerical Analysis and Computing MIDTERM #2 Instructions: This is an open notes exam, i.e., you are allowed to consult any textbook, your class notes, homeworks, or any of the handouts from us.
More informationSolving PDEs with Multigrid Methods p.1
Solving PDEs with Multigrid Methods Scott MacLachlan maclachl@colorado.edu Department of Applied Mathematics, University of Colorado at Boulder Solving PDEs with Multigrid Methods p.1 Support and Collaboration
More informationAltered Jacobian Newton Iterative Method for Nonlinear Elliptic Problems
Altered Jacobian Newton Iterative Method for Nonlinear Elliptic Problems Sanjay K Khattri Abstract We present an Altered Jacobian Newton Iterative Method for solving nonlinear elliptic problems Effectiveness
More informationSolving Boundary Value Problems (with Gaussians)
What is a boundary value problem? Solving Boundary Value Problems (with Gaussians) Definition A differential equation with constraints on the boundary Michael McCourt Division Argonne National Laboratory
More informationAMS526: Numerical Analysis I (Numerical Linear Algebra)
AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 24: Preconditioning and Multigrid Solver Xiangmin Jiao SUNY Stony Brook Xiangmin Jiao Numerical Analysis I 1 / 5 Preconditioning Motivation:
More informationSolution of Matrix Eigenvalue Problem
Outlines October 12, 2004 Outlines Part I: Review of Previous Lecture Part II: Review of Previous Lecture Outlines Part I: Review of Previous Lecture Part II: Standard Matrix Eigenvalue Problem Other Forms
More informationLecture 18 Classical Iterative Methods
Lecture 18 Classical Iterative Methods MIT 18.335J / 6.337J Introduction to Numerical Methods Per-Olof Persson November 14, 2006 1 Iterative Methods for Linear Systems Direct methods for solving Ax = b,
More informationCME342 Parallel Methods in Numerical Analysis. Matrix Computation: Iterative Methods II. Sparse Matrix-vector Multiplication.
CME342 Parallel Methods in Numerical Analysis Matrix Computation: Iterative Methods II Outline: CG & its parallelization. Sparse Matrix-vector Multiplication. 1 Basic iterative methods: Ax = b r = b Ax
More informationGPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic
GPU acceleration of Newton s method for large systems of polynomial equations in double double and quad double arithmetic Jan Verschelde joint work with Xiangcheng Yu University of Illinois at Chicago
More informationITERATIVE METHODS FOR NONLINEAR ELLIPTIC EQUATIONS
ITERATIVE METHODS FOR NONLINEAR ELLIPTIC EQUATIONS LONG CHEN In this chapter we discuss iterative methods for solving the finite element discretization of semi-linear elliptic equations of the form: find
More informationAggregation-based algebraic multigrid
Aggregation-based algebraic multigrid from theory to fast solvers Yvan Notay Université Libre de Bruxelles Service de Métrologie Nucléaire CEMRACS, Marseille, July 18, 2012 Supported by the Belgian FNRS
More informationMath 5630: Iterative Methods for Systems of Equations Hung Phan, UMass Lowell March 22, 2018
1 Linear Systems Math 5630: Iterative Methods for Systems of Equations Hung Phan, UMass Lowell March, 018 Consider the system 4x y + z = 7 4x 8y + z = 1 x + y + 5z = 15. We then obtain x = 1 4 (7 + y z)
More information1. Nonlinear Equations. This lecture note excerpted parts from Michael Heath and Max Gunzburger. f(x) = 0
Numerical Analysis 1 1. Nonlinear Equations This lecture note excerpted parts from Michael Heath and Max Gunzburger. Given function f, we seek value x for which where f : D R n R n is nonlinear. f(x) =
More informationLecture 11: CMSC 878R/AMSC698R. Iterative Methods An introduction. Outline. Inverse, LU decomposition, Cholesky, SVD, etc.
Lecture 11: CMSC 878R/AMSC698R Iterative Methods An introduction Outline Direct Solution of Linear Systems Inverse, LU decomposition, Cholesky, SVD, etc. Iterative methods for linear systems Why? Matrix
More informationAn Overly Simplified and Brief Review of Differential Equation Solution Methods. 1. Some Common Exact Solution Methods for Differential Equations
An Overly Simplified and Brief Review of Differential Equation Solution Methods We will be dealing with initial or boundary value problems. A typical initial value problem has the form y y 0 y(0) 1 A typical
More informationA Differential Variational Inequality Approach for the Simulation of Heterogeneous Materials
A Differential Variational Inequality Approach for the Simulation of Heterogeneous Materials Lei Wang 1, Jungho Lee 1, Mihai Anitescu 1, Anter El Azab 2, Lois Curfman McInnes 1, Todd Munson 1, Barry Smith
More informationA STUDY OF MULTIGRID SMOOTHERS USED IN COMPRESSIBLE CFD BASED ON THE CONVECTION DIFFUSION EQUATION
ECCOMAS Congress 2016 VII European Congress on Computational Methods in Applied Sciences and Engineering M. Papadrakakis, V. Papadopoulos, G. Stefanou, V. Plevris (eds.) Crete Island, Greece, 5 10 June
More informationLinear Algebra. PHY 604: Computational Methods in Physics and Astrophysics II
Linear Algebra Numerical Linear Algebra We've now seen several places where solving linear systems comes into play Implicit ODE integration Cubic spline interpolation We'll see many more, including Solving
More informationIMPLEMENTATION OF A PARALLEL AMG SOLVER
IMPLEMENTATION OF A PARALLEL AMG SOLVER Tony Saad May 2005 http://tsaad.utsi.edu - tsaad@utsi.edu PLAN INTRODUCTION 2 min. MULTIGRID METHODS.. 3 min. PARALLEL IMPLEMENTATION PARTITIONING. 1 min. RENUMBERING...
More information6. Multigrid & Krylov Methods. June 1, 2010
June 1, 2010 Scientific Computing II, Tobias Weinzierl page 1 of 27 Outline of This Session A recapitulation of iterative schemes Lots of advertisement Multigrid Ingredients Multigrid Analysis Scientific
More informationParallel Polynomial Evaluation
Parallel Polynomial Evaluation Jan Verschelde joint work with Genady Yoffe University of Illinois at Chicago Department of Mathematics, Statistics, and Computer Science http://www.math.uic.edu/ jan jan@math.uic.edu
More informationMultigrid absolute value preconditioning
Multigrid absolute value preconditioning Eugene Vecharynski 1 Andrew Knyazev 2 (speaker) 1 Department of Computer Science and Engineering University of Minnesota 2 Department of Mathematical and Statistical
More informationARock: an algorithmic framework for asynchronous parallel coordinate updates
ARock: an algorithmic framework for asynchronous parallel coordinate updates Zhimin Peng, Yangyang Xu, Ming Yan, Wotao Yin ( UCLA Math, U.Waterloo DCO) UCLA CAM Report 15-37 ShanghaiTech SSDS 15 June 25,
More informationScalable Non-Linear Compact Schemes
Scalable Non-Linear Compact Schemes Debojyoti Ghosh Emil M. Constantinescu Jed Brown Mathematics Computer Science Argonne National Laboratory International Conference on Spectral and High Order Methods
More informationMultigrid Solvers in Space and Time for Highly Concurrent Architectures
Multigrid Solvers in Space and Time for Highly Concurrent Architectures Future CFD Technologies Workshop, Kissimmee, Florida January 6-7, 2018 Robert D. Falgout Center for Applied Scientific Computing
More informationComputational Linear Algebra
Computational Linear Algebra PD Dr. rer. nat. habil. Ralf Peter Mundani Computation in Engineering / BGU Scientific Computing in Computer Science / INF Winter Term 2017/18 Part 3: Iterative Methods PD
More informationMultigrid solvers for equations arising in implicit MHD simulations
Multigrid solvers for equations arising in implicit MHD simulations smoothing Finest Grid Mark F. Adams Department of Applied Physics & Applied Mathematics Columbia University Ravi Samtaney PPPL Achi Brandt
More informationIntegration of PETSc for Nonlinear Solves
Integration of PETSc for Nonlinear Solves Ben Jamroz, Travis Austin, Srinath Vadlamani, Scott Kruger Tech-X Corporation jamroz@txcorp.com http://www.txcorp.com NIMROD Meeting: Aug 10, 2010 Boulder, CO
More informationMflop/s per Node. Nonlinear Iterations
Lagrange-Newton-Krylov-Schur-Schwarz: Methods for Matrix-free PDE-constrained Optimization D. Hovland Paul & Computer Science Division, Argonne National Laboratory Mathematics E. Keyes David & Statistics
More informationAn Introduction to Numerical Continuation Methods. with Application to some Problems from Physics. Eusebius Doedel. Cuzco, Peru, May 2013
An Introduction to Numerical Continuation Methods with Application to some Problems from Physics Eusebius Doedel Cuzco, Peru, May 2013 Persistence of Solutions Newton s method for solving a nonlinear equation
More informationBindel, Fall 2016 Matrix Computations (CS 6210) Notes for
1 Iteration basics Notes for 2016-11-07 An iterative solver for Ax = b is produces a sequence of approximations x (k) x. We always stop after finitely many steps, based on some convergence criterion, e.g.
More informationA Scalable, Parallel Implementation of Weighted, Non-Linear Compact Schemes
A Scalable, Parallel Implementation of Weighted, Non-Linear Compact Schemes Debojyoti Ghosh Emil M. Constantinescu Jed Brown Mathematics Computer Science Argonne National Laboratory SIAM Annual Meeting
More information