The Removal of Critical Slowing Down. Lattice College of William and Mary

Similar documents
Adaptive Multigrid for QCD. Lattice University of Regensburg

Multigrid Methods for Linear Systems with Stochastic Entries Arising in Lattice QCD. Andreas Frommer

Adaptive algebraic multigrid methods in lattice computations

Solving PDEs with Multigrid Methods p.1

hypre MG for LQFT Chris Schroeder LLNL - Physics Division

New Multigrid Solver Advances in TOPS

Case Study: Quantum Chromodynamics

A Jacobi Davidson Method with a Multigrid Solver for the Hermitian Wilson-Dirac Operator

arxiv: v1 [hep-lat] 19 Jul 2009

Hierarchically Deflated Conjugate Gradient

A Comparison of Adaptive Algebraic Multigrid and Lüscher s Inexact Deflation

Algebraic Multigrid as Solvers and as Preconditioner

1. Fast Iterative Solvers of SLE

Stabilization and Acceleration of Algebraic Multigrid Method

Multigrid Methods and their application in CFD

AMS526: Numerical Analysis I (Numerical Linear Algebra)

Conjugate Gradients: Idea

AMG for a Peta-scale Navier Stokes Code

Solving Symmetric Indefinite Systems with Symmetric Positive Definite Preconditioners

Preface to the Second Edition. Preface to the First Edition

Multigrid absolute value preconditioning

DDalphaAMG for Twisted Mass Fermions

Multigrid for Staggered Lattice Fermions

College of William & Mary Department of Computer Science

Spectral element agglomerate AMGe

ADAPTIVE ALGEBRAIC MULTIGRID

Iterative Methods and Multigrid

Preconditioned Locally Minimal Residual Method for Computing Interior Eigenpairs of Symmetric Operators

Robust solution of Poisson-like problems with aggregation-based AMG

Deflation for inversion with multiple right-hand sides in QCD

AN AGGREGATION MULTILEVEL METHOD USING SMOOTH ERROR VECTORS

The Conjugate Gradient Method

AMS526: Numerical Analysis I (Numerical Linear Algebra for Computational and Data Sciences)

Multigrid solvers for equations arising in implicit MHD simulations

A Novel Aggregation Method based on Graph Matching for Algebraic MultiGrid Preconditioning of Sparse Linear Systems

A Generalized Eigensolver Based on Smoothed Aggregation (GES-SA) for Initializing Smoothed Aggregation Multigrid (SA)

M.A. Botchev. September 5, 2014

Indefinite and physics-based preconditioning

A New Multilevel Smoothing Method for Wavelet-Based Algebraic Multigrid Poisson Problem Solver

OPERATOR-BASED INTERPOLATION FOR BOOTSTRAP ALGEBRAIC MULTIGRID

6. Multigrid & Krylov Methods. June 1, 2010

MULTI-LEVEL TECHNIQUES FOR THE SOLUTION OF THE KINETIC EQUATIONS IN CONDENSING FLOWS SIMON GLAZENBORG

Algebraic Multigrid Preconditioners for Computing Stationary Distributions of Markov Processes

ALGEBRAIC MULTILEVEL METHODS FOR GRAPH LAPLACIANS

Computational Linear Algebra

Least-Squares Finite Element Methods for Quantum Electrodynamics

A decade of fast and robust Helmholtz solvers

Iterative Methods and Multigrid

AMS526: Numerical Analysis I (Numerical Linear Algebra) Lecture 23: GMRES and Other Krylov Subspace Methods; Preconditioning

Adaptive Algebraic Multigrid for Lattice QCD Computations

Kasetsart University Workshop. Multigrid methods: An introduction

Contents. Preface... xi. Introduction...

Multi-grid Methods in Lattice Field Theory

Markov Chains and Web Ranking: a Multilevel Adaptive Aggregation Method

TWO-GRID DEFLATED KRYLOV METHODS FOR LINEAR EQUATIONS

Iterative methods for Linear System of Equations. Joint Advanced Student School (JASS-2009)

Introduction to Multigrid Methods

Iterative methods for Linear System

University of Illinois at Urbana-Champaign. Multigrid (MG) methods are used to approximate solutions to elliptic partial differential

Bootstrap AMG. Kailai Xu. July 12, Stanford University

Solving Ax = b, an overview. Program

Using an Auction Algorithm in AMG based on Maximum Weighted Matching in Matrix Graphs

On the choice of abstract projection vectors for second level preconditioners

Elliptic Problems / Multigrid. PHY 604: Computational Methods for Physics and Astrophysics II

arxiv: v1 [hep-lat] 28 Aug 2014 Karthee Sivalingam

A COMPARISON OF TWO-LEVEL PRECONDITIONERS BASED ON MULTIGRID AND DEFLATION

Boundary Value Problems - Solving 3-D Finite-Difference problems Jacob White

A fast method for the solution of the Helmholtz equation

Numerical Methods - Numerical Linear Algebra

Numerical behavior of inexact linear solvers

arxiv: v1 [hep-lat] 2 May 2012

Aspects of Multigrid

Progress Report for Year 2. College of William and Mary Algorithms and software

Lattice simulation of 2+1 flavors of overlap light quarks

Research Directions in Scalable Algorithms

Mixed action simulations: approaching physical quark masses

An Algebraic Multigrid Method for Eigenvalue Problems

Chapter 7 Iterative Techniques in Matrix Algebra

Chiral Fermions on the Lattice:

9.1 Preconditioned Krylov Subspace Methods

Parallel Numerics, WT 2016/ Iterative Methods for Sparse Linear Systems of Equations. page 1 of 1

Review: From problem to parallel algorithm

Course Notes: Week 1

The amount of work to construct each new guess from the previous one should be a small multiple of the number of nonzeros in A.

Aggregation-based algebraic multigrid

Numerical Methods I Non-Square and Sparse Linear Systems

Geometric Multigrid Methods

FEM and Sparse Linear System Solving

Fast solvers for steady incompressible flow

Spectral analysis of complex shifted-laplace preconditioners for the Helmholtz equation

Domain decomposition on different levels of the Jacobi-Davidson method

Algebraic Multi-Grid solver for lattice QCD on Exascale hardware: Intel Xeon Phi

Iterative Methods for Solving A x = b

Numerical Methods in Matrix Computations

An efficient multigrid solver based on aggregation

Introduction to Iterative Solvers of Linear Systems

A multigrid method for large scale inverse problems

The convergence of stationary iterations with indefinite splitting

Efficient implementation of the overlap operator on multi-gpus

LINEAR ALGEBRA 1, 2012-I PARTIAL EXAM 3 SOLUTIONS TO PRACTICE PROBLEMS

Transcription:

The Removal of Critical Slowing Down Lattice 2008 College of William and Mary Michael Clark Boston University James Brannick, Rich Brower, Tom Manteuffel, Steve McCormick, James Osborn, Claudio Rebbi 1

Inverting the Dirac Operator Required to invert the Dirac operator for QCD calculation Determinant: O(1) inversions, continuously changing U Observables: O(10) O(1000) inversions, constant U Direct solve infeasible Krylov solvers typically used Using CG on D Dx = D b e.g., BiCGstab on Dx = b Condition number 1/m (1/m 2 ) Critical slowing down in solver 2

Inverting the Dirac Operator: Recent Progress Eigenvector deflation strategies (Morgan/Wilcox, Orginos/Stathopoulos) Deflate Dirac operator by its low e-vectors With sufficient e-vectors, critical slowing down gone O(N 2 ) scaling in number of e-vectors required Large overhead finding e-vectors multiple rhs problems Local mode deflation (Lüscher) Deflate the Dirac operator only locally over blocks E-vectors locally co-linear ( Local coherence ) O(N)/O(N log N) cost Small setup overhead suitable for all problems Adaptive multigrid (Brannick/Brower/Clark/Osborn/Rebbi) Preconditioner for Dirac normal equations in 2d Critical slowing down gone How about 4d? 3

Intro to Multigrid: Failure of stationary solvers Iterative solvers (relaxation) effective on high frequency error Minimal effect on low frequency error Example: Free Laplace operator in 2d Ax = 0, x 0 random Gauss-Seidel relaxation Plot error = x 4

Intro to Multigrid: Failure of stationary solvers Iterative solvers (relaxation) effective on high frequency error Minimal effect on low frequency error Example: Free Laplace operator in 2d Ax = 0, x 0 random Gauss-Seidel relaxation Plot error = x 5

Intro to Multigrid: Failure of stationary solvers Iterative solvers (relaxation) effective on high frequency error Minimal effect on low frequency error Example: Free Laplace operator in 2d Ax = 0, x 0 random Gauss-Seidel relaxation Plot error = x 6

Intro to Multigrid: Failure of stationary solvers Iterative solvers (relaxation) effective on high frequency error Minimal effect on low frequency error Example: Free Laplace operator in 2d Ax = 0, x 0 random Gauss-Seidel relaxation Plot error = x 7

Intro to Multigrid: Coarse Grid Representation I Low frequency error components slow down solver Low frequency modes are smooth represent on coarse grid Fine low frequency Coarse high frequency Relax system on coarse grid Prolongate coarse correction to fine grid 2 grid scheme much better than relaxation Iterate until exact solve is possible (V-cycle) 8

Intro to Multigrid: Coarse Grid Representation II Define restriction operator R Define prolongation operator P Notice R = P Define coarse grid operator A c = P AP (Galerkin prescription) Equivalent to rediscretization in free theory 9

Intro to Multigrid: Geometric MG and QCD Classical geometric multigrid O(N log N) / O(N) work Removes critical slowing down Gauge field U is not geometrically smooth Dirac operator low modes oscillatory Geometric MG fails 20 years of failures applying MG to QCD MG requires low modes preserved in coarse space (1 P P )ψ 0 = 0 Adaptivity required for gauge fields 10

Adaptive Smooth Aggregation MG (αsa) Adaptively use (s)low modes to define required V-cycle (Brezina et al) Initial Algorithm setup: 1. Relax on homogenous problem Ax = 0, random x 0 resulting error vector is a representation of slow modes 2. Cut vector x into subsets to be aggregated (blocked) 3. This defines the prolongator such that (1 P P )x = 0 4. Define coarse grid operator A c = P AP 5. Relax on coarse grid A c x c = 0, x c 0 = P x, goto 2. This defines initial V-Cycle 11

Adaptive Smooth Aggregation MG (αsa) In general require more a single vector (Brezina et al) Now repeat setup process replacing relaxation with V-cycle Previously found errors components quickly reduced Error vector rich in new error components Augment V-Cycle to preserve additional vector space Cut vectors x 1, x 2 into blocks Orthonormalize blocks to define augmented prolongator Define new coarse grid operator Each vector corresponds to an extra dof per coarse site Add more vectors until satisfactory solver found Setup cost depends on number of vectors required 12

Example Setup: Gauge Laplacian 2d 128 128 lattice, m = 10 6 1e+05 1 1e-05 r 2 1e-10 1e-15 1e-20 0 20 40 60 80 100 N iter 13

Example Setup: Gauge Laplacian 2d 128 128 lattice, m = 10 6 1e+05 1 1e-05 r 2 1e-10 1e-15 1e-20 0 20 40 60 80 100 N iter 14

Example Setup: Gauge Laplacian 2d 128 128 lattice, m = 10 6 1e+05 1 1e-05 r 2 1e-10 1e-15 1e-20 0 20 40 60 80 100 N iter 15

Example Setup: Gauge Laplacian 2d 128 128 lattice, m = 10 6 1e+05 1 1e-05 r 2 1e-10 1e-15 1e-20 0 20 40 60 80 100 N iter 16

Example Setup: Gauge Laplacian 2d 128 128 lattice, m = 10 6 1e+05 1 1e-05 r 2 1e-10 1e-15 1e-20 0 20 40 60 80 100 N iter 17

QCD Adaptive Multigrid αsa designed for problems without an underlying geometry Uses algebraic strength of connection to block system QCD has regular geometric lattice with unitary connections Use regular geometric blocking strategy (e.g.,4 d N s N c ) Coarse operator looks like a Dirac operator Dirac operator is not Hermitian Positive Definite Multigrid convergence proof requires HPD operator Use normal equations A = D D Critical slowing down gone in 2d (PRL 100:041601) Coarse grid operator P (D D)P not nearest neighbour D D has squared condition number compared to D More vectors to capture null space, cost N 2 v Coarse operator very expensive in 4d 18

Multigrid on D Eigenvector decomposition of D = ψ λ λ ˆψ λ Consider deflation: bi-orthogonality requires left-right projection P = ( 1 ψ λ 1 ) λ ˆψ λ = ( 1 ψ λ ˆψ λ D ψ λ 1 ˆ ψ λ ) = ( 1 P (RDP ) 1 R ) Prolongation from right vectors, restriction from left vectors Left and right eigenvectors related through γ 5 : ˆψ λ = γ 5 ψ λ R = P γ 5 Leave chirality intact [γ 5, P ] = 0 γ 5 cancels out of correction Coarse grid operator defined as D c retains γ 5 Hermiticity D c is positive real if D is D c = P γ 5 DP 19

2d Results 128 128 lattice, β = 6, 10, ˆm = 0.001 0.5 MG setup run at lightest mass only D D-MG algorithm 4 4( 2) blocking, 3 levels, N v = 8 Under-relaxed MR relaxation Preconditioner for CG D-MG algorithm 4 4 blocking, 3 levels, N v = 4 Under-relaxed MR relaxation Preconditioner for BiCGstab Results Critical slowing down virtually gone Weak dependence on β D-MG superior to D D-MG Dirac Operator Applications Flops 12000 10000 8000 6000 4000 2000 0 4e+10 3e+10 2e+10 1e+10 0 Q=4, CG Q=4, MG-CG Gauge Length Q=4, MG-BiCGstab 0.001 0.01 0.1 1 mass gap MG-CG MG-BiCGstab 0.001 0.01 0.1 1 mass gap 20

4d Results Apply D-MG algorithm to full problem 16 3 32 lattice β = 6.0, m crit = 0.8049 Compare against CG, BiCGstab D-MG Algorithm 4 4 ( 3 2) blocking, 3 levels N v = 20, (c.f. Lüscher) Setup run at m crit Under-relaxed MR smoother Preconditioner for GCR(50) 21

4d Results 6000 5000 BiCGstab CG Dirac Operator Applications 4000 3000 2000 1000 0-0.81-0.805-0.8-0.795-0.79-0.785-0.78 mass 22

4d Results 6000 5000 BiCGstab CG MG-GCR Dirac Operator Applications 4000 3000 2000 1000 0-0.81-0.805-0.8-0.795-0.79-0.785-0.78 mass 23

4d Results 2000 1500 CG BiCGstab MG-GCR Time 1000 500 0-0.81-0.805-0.8-0.795-0.79-0.785-0.78 mass 24

Adaptive MG vs. Deflation (PRELIMINARY) 16 3 64 anisotropic Wilson lattice β = 5.5 m crit = 0.4180 Adaptive MG N v = 20 Eigen-vector deflation N v = 240 (Orginos and Stathopoulos) Dirac operator applications 4000 3000 2000 1000 CG Deflated CG MG-GCR m crit (setup) Order of magnitude reduction in N v Very similar naïve performance 0-0.43-0.42-0.41-0.4 mass Deflation actually 2 faster but 10 N v Expect MG to be increasingly competitive at large volumes 25

Conclusions and Future Work Now have a variety of algorithms to tame critical slowing down Eigenvector deflation Local mode deflation Adaptive Multigrid These have different ranges of applicability All suitable for multiple rhs calculations Local mode deflation also suitable HMC (MG also?) Staggered fermions No problems expected here Ideal since very large lattices Chiral fermions (overlap, domain wall) Essentially a highly indefinite Wilson operator D-MG requires reformulation 26

Local Mode Deflation vs. Multigrid MG is multiplicative cancellation (P Dc 1 P )Dx = (P Dc 1 P )b Local mode deflation is additive cancellation (1 DP Dc 1 P )Dx = (1 DP Dc 1 P )b Expect MG to require less accurate D 1 c Local mode deflation Dc 1 MG Dc 1 accuracy? solution accuracy 10 6 27