On nonlinear adaptivity with heterogeneity

Similar documents
Aspects of Multigrid

Adaptive algebraic multigrid methods in lattice computations

Kasetsart University Workshop. Multigrid methods: An introduction

Solving PDEs with Multigrid Methods p.1

Preface to the Second Edition. Preface to the First Edition

Computational Linear Algebra

Algebraic Multigrid as Solvers and as Preconditioner

1. Fast Iterative Solvers of SLE

Indefinite and physics-based preconditioning

Robust solution of Poisson-like problems with aggregation-based AMG

MULTIGRID METHODS FOR NONLINEAR PROBLEMS: AN OVERVIEW

Multigrid solvers for equations arising in implicit MHD simulations

Chapter 5. Methods for Solving Elliptic Equations

Multigrid absolute value preconditioning

Adaptive Multigrid for QCD. Lattice University of Regensburg

AMS526: Numerical Analysis I (Numerical Linear Algebra)

Multigrid Methods and their application in CFD

AMG for a Peta-scale Navier Stokes Code

A multigrid method for large scale inverse problems

Geometric Multigrid Methods for the Helmholtz equations

Contents. Preface... xi. Introduction...

Elliptic Problems / Multigrid. PHY 604: Computational Methods for Physics and Astrophysics II

INTERGRID OPERATORS FOR THE CELL CENTERED FINITE DIFFERENCE MULTIGRID ALGORITHM ON RECTANGULAR GRIDS. 1. Introduction

Solving Symmetric Indefinite Systems with Symmetric Positive Definite Preconditioners

Bootstrap AMG. Kailai Xu. July 12, Stanford University

6. Multigrid & Krylov Methods. June 1, 2010

Conjugate Gradients: Idea

New Multigrid Solver Advances in TOPS

MULTI-LEVEL TECHNIQUES FOR THE SOLUTION OF THE KINETIC EQUATIONS IN CONDENSING FLOWS SIMON GLAZENBORG

Today s class. Linear Algebraic Equations LU Decomposition. Numerical Methods, Fall 2011 Lecture 8. Prof. Jinbo Bi CSE, UConn

Stabilization and Acceleration of Algebraic Multigrid Method

Comparison of V-cycle Multigrid Method for Cell-centered Finite Difference on Triangular Meshes

Multigrid finite element methods on semi-structured triangular grids

Algebraic Coarse Spaces for Overlapping Schwarz Preconditioners

Spectral element agglomerate AMGe

Linear and Non-Linear Preconditioning

Multigrid and Domain Decomposition Methods for Electrostatics Problems

A SHORT NOTE COMPARING MULTIGRID AND DOMAIN DECOMPOSITION FOR PROTEIN MODELING EQUATIONS

Geometric Multigrid Methods

ITERATIVE METHODS FOR NONLINEAR ELLIPTIC EQUATIONS

FAS and Solver Performance

An efficient multigrid solver based on aggregation

On domain decomposition preconditioners for finite element approximations of the Helmholtz equation using absorption

A MULTIGRID ALGORITHM FOR. Richard E. Ewing and Jian Shen. Institute for Scientic Computation. Texas A&M University. College Station, Texas SUMMARY

Partial Differential Equations

Introduction to Multigrid Methods

A Recursive Trust-Region Method for Non-Convex Constrained Minimization

University of Illinois at Urbana-Champaign. Multigrid (MG) methods are used to approximate solutions to elliptic partial differential

Multigrid Methods for Linear Systems with Stochastic Entries Arising in Lattice QCD. Andreas Frommer

Alternative numerical method in continuum mechanics COMPUTATIONAL MULTISCALE. University of Liège Aerospace & Mechanical Engineering

Aggregation-based algebraic multigrid

Using an Auction Algorithm in AMG based on Maximum Weighted Matching in Matrix Graphs

Bindel, Fall 2016 Matrix Computations (CS 6210) Notes for

Introduction to Scientific Computing II Multigrid

An Introduction to Algebraic Multigrid (AMG) Algorithms Derrick Cerwinsky and Craig C. Douglas 1/84

Newton s Method and Efficient, Robust Variants

Constrained Minimization and Multigrid

Key words. preconditioned conjugate gradient method, saddle point problems, optimal control of PDEs, control and state constraints, multigrid method

NUMERICAL METHODS FOR ENGINEERING APPLICATION

The Removal of Critical Slowing Down. Lattice College of William and Mary

Scalable Domain Decomposition Preconditioners For Heterogeneous Elliptic Problems

2 CAI, KEYES AND MARCINKOWSKI proportional to the relative nonlinearity of the function; i.e., as the relative nonlinearity increases the domain of co

INTRODUCTION TO MULTIGRID METHODS

Research Article Evaluation of the Capability of the Multigrid Method in Speeding Up the Convergence of Iterative Methods

arxiv: v1 [math.na] 11 Jul 2011

Linear and Non-Linear Preconditioning

Chapter 7 Iterative Techniques in Matrix Algebra

Compact High Order Finite Difference Stencils for Elliptic Variable Coefficient and Interface Problems

Space-time Discontinuous Galerkin Methods for Compressible Flows

Numerical Solution I

arxiv: v1 [physics.comp-ph] 10 Aug 2015

An Introduction of Multigrid Methods for Large-Scale Computation

A High-Performance Parallel Hybrid Method for Large Sparse Linear Systems

Sparse Matrix Techniques for MCAO

Computational Astrophysics 1 Particle Mesh methods

A Space-Time Multigrid Solver Methodology for the Optimal Control of Time-Dependent Fluid Flow

Nonlinear Multigrid and Domain Decomposition Methods

Shifted Laplace and related preconditioning for the Helmholtz equation

Introduction to Multigrid Method

A Line search Multigrid Method for Large-Scale Nonlinear Optimization

K.S. Kang. The multigrid method for an elliptic problem on a rectangular domain with an internal conductiong structure and an inner empty space

Newton-Multigrid Least-Squares FEM for S-V-P Formulation of the Navier-Stokes Equations

Goal-oriented error control of the iterative solution of finite element equations

High Performance Nonlinear Solvers

Convergence Behavior of a Two-Level Optimized Schwarz Preconditioner

Numerical Programming I (for CSE)

A Comparison of Adaptive Algebraic Multigrid and Lüscher s Inexact Deflation

EFFICIENT MULTIGRID BASED SOLVERS FOR ISOGEOMETRIC ANALYSIS

On the Development of Implicit Solvers for Time-Dependent Systems

Principles of Systematic Upscaling Achi Brandt. 1 Introduction

Abstract. 1 Introduction

Multigrid Algorithms for High-Order Discontinuous Galerkin Discretizations of the Compressible Navier-Stokes Equations

Newton-Krylov-Schwarz Method for a Spherical Shallow Water Model

Hence a root lies between 1 and 2. Since f a is negative and f(x 0 ) is positive The root lies between a and x 0 i.e. 1 and 1.

A Generalized Eigensolver Based on Smoothed Aggregation (GES-SA) for Initializing Smoothed Aggregation Multigrid (SA)

Robust multigrid methods for nonsmooth coecient elliptic linear systems

Multigrid Method ZHONG-CI SHI. Institute of Computational Mathematics Chinese Academy of Sciences, Beijing, China. Joint work: Xuejun Xu

Two-level multiplicative domain decomposition algorithm for recovering the Lamé coefficient in biological tissues

Utilizing Adjoint-Based Techniques to Improve the Accuracy and Reliability in Uncertainty Quantification

Efficient domain decomposition methods for the time-harmonic Maxwell equations

Transcription:

On nonlinear adaptivity with heterogeneity Jed Brown jed@jedbrown.org (CU Boulder) Collaborators: Mark Adams (LBL), Matt Knepley (UChicago), Dave May (ETH), Laetitia Le Pourhiet (UPMC), Ravi Samtaney (KAUST) Copper Mountain Multigrid Conference, 2017-03-30 This talk: https://jedbrown.org/files/ 20170330-AdaptHeterogeneous.pdf

DD6 (1992) in Como, Italy

Plan: ruthlessly eliminate communication Eliminate, not aggregate and amortize Why? Enables pruning unnecessary work More scope for dynamic load balance Tolerance for high-frequency load imbalance From irregular computation or hardware error correction Local recovery despite global coupling Requirements Must retain optimal convergence with good constants Flexible, robust, and debuggable

Multigrid Preliminaries Multigrid is an O(n) method for solving algebraic problems by defining a hierarchy of scale. A multigrid method is constructed from: 1. a sequence of discretizations coarser approximations of problem, same or different equations constructed algebraically or geometrically 2. intergrid transfer operators residual restriction Ih H (fine to coarse) state restriction Îh H (fine to coarse) partial state interpolation IH h (coarse to fine, prolongation ) state reconstruction I h H (coarse to fine) 3. Smoothers (S) correct the high frequency error components Richardson, Jacobi, Gauss-Seidel, etc. Gauss-Seidel-Newton or optimization methods Compatible Monte Carlo,...

τ formulation of Full Approximation Scheme (FAS) classical formulation: coarse grid accelerates fine grid τ formulation: fine grid feeds back into coarse grid To solve Nu = f, recursively apply solve coarse problem for u H correction and post-smooth pre-smooth ũ h S h pre(u h 0,f h ) N H u H = I H h f h +N H Î H h }{{} ũh I H h Nh ũ h }{{} f H τh H u h S h post (ũ h + I hh(u ) H Î Hh ũh ),f h Ih H residual restriction Îh H solution restriction IH h solution interpolation f H = Ih H f h restricted forcing {Spre,S h post} h smoothing operations on the fine grid At convergence, u H = Îh H uh solves the τ-corrected coarse grid equation N H u H = f H + τh H, thus τ h H is the fine grid feedback that makes the coarse grid equation accurate. τh H is local and need only be recomputed where it becomes stale. Interpretation by Achi Brandt in 1977, many tricks followed

Segmental refinement: dependencies

Segmental refinement: parallel

Low communication MG red arrows can be removed by τ-fas with overlap blue arrows can also be removed, but then algebraic convergence stalls when discretization error is reached no simple way to check that discretization error is obtained if fine grid state is not stored, use compatible relaxation to complete prolongation P Segmental refinement by Achi Brandt (1977) 2-process case by Brandt and Diskin (1994)

Segmental refinement: no horizontal communication Adams, Brown, Knepley, Samtaney (SISC 2016) 27-point second-order stencil, manufactured analytic solution 5 SR levels: 16 3 cells/process local coarse grid Overlap = Base + (L l)increment Implementation requires even number of cells round down. FMG with V(2,2) cycles Solve times: Laplacian, u=(x 4 L 2 x 2 ), L=(2,1,1) (8 solves) Table: e SR / e FMG Base Increment 1 2 3 1 1.59 2.34 1.00 2 1.00 1.00 1.00 3 1.00 1.00 1.00 Time 1 F cycle w/ V(2,2), 128 3 cells/core, 8 solves non redundant CGS 25 1 F cycle w/ V(2,2), 128 3 cells/core, 8 solves SR, non redundant CGS V(2,2) cycles, 128 3 cells, rtol=10. 4, 8 solves, non redundant CGS 1 F cycle w/ V(2,2), N=32/core, 512 solves redundant CGS 1 F cycle w/ V(2,2), N=32/core, 512 solves non redundant CGS 20 1 F cycle w/ V(2,2), N=32/core, 512 solves SR, non redundant CGS 15 10 5 0 16 128 1024 8192 65536 # cores (Edison)

Reducing memory bandwidth Sweep through coarse grid with moving window Zoom in on new slab, construct fine grid window in-cache Interpolate to new fine grid, apply pipelined smoother (s-step) Compute residual, accumulate restriction of state and residual into coarse grid, expire slab from window

Connections to Fast Multipole Method / H -matrices [Yokota, Barba (2011)] Can evaluate solution of nearby problems without solving everywhere need τ correction from everywhere Segmental Refinement buffer regions separation criteria No need for Green s functions

Other uses of segmental refinement Compression of solutions, local decompression, resilience Transient adjoints Adjoint model runs backward-in-time, needs state from solution of forward model Status quo: hierarchical checkpointing Memory-constrained and requires computing forward model multiple times If forward model is stiff, each step has global dependence Compression via τ-fas accelerates recomputation, can be local Visualization and analysis Targeted visualization in small part of domain Interesting features emergent so can t predict where to look

Tolerances and FMG

Continental rifting Rifting Video

Nuclear fuel pellet-cladding mechanics [Williamson et al (2012)]

Model problem: p-laplacian with slip boundary conditions 2-dimensional model problem for power-law fluid cross-section ( u p 2 u ) f = 0, 1 p Singular or degenerate when u = 0 Regularized variant (η u) f = 0 η(γ) = (ε 2 + γ) p 2 2 γ(u) = 1 2 u 2 Friction boundary condition on one side of domain u n + A(x) u q 1 u = 0

Model problem: p-laplacian with slip boundary conditions p = 1.3 and q = 0.2, checkerboard coefficients {10 2,1} Friction coefficient A = 0 in center, 1 at corners

Model problem: p-laplacian with slip boundary conditions p = 1.3 and q = 0.2, checkerboard coefficients {10 2,1} Friction coefficient A = 0 in center, 1 at corners

Model problem: p-laplacian with slip boundary conditions p = 1.3 and q = 0.2, checkerboard coefficients {10 2,1} Friction coefficient A = 0 in center, 1 at corners

Model problem: p-laplacian with slip boundary conditions p = 1.3 and q = 0.2, checkerboard coefficients {10 2,1} Friction coefficient A = 0 in center, 1 at corners

Model problem: p-laplacian with slip boundary conditions p = 1.3 and q = 0.2, checkerboard coefficients {10 2,1} Friction coefficient A = 0 in center, 1 at corners

Model problem: p-laplacian with slip boundary conditions p = 1.3 and q = 0.2, checkerboard coefficients {10 2,1} Friction coefficient A = 0 in center, 1 at corners

τ corrections Plane strain elasticity, E = 1000,ν = 0.4 inclusions in E = 1,ν = 0.2 material, coarsen by 3 2. Solve initial problem everywhere and compute = AH Îh H uh Ih H Ah u h τ H h Change boundary conditions and solve FAS coarse problem N H ú H = I H f h h +N H Î H h }{{} ũh I H h Nh ũ h }{{} f H τh H Prolong, post-smooth, compute error e h = ú h (N h ) 1 f h Coarse grid with τ is nearly 10 better accuracy

τ corrections Plane strain elasticity, E = 1000,ν = 0.4 inclusions in E = 1,ν = 0.2 material, coarsen by 3 2. Solve initial problem everywhere and compute = AH Îh H uh Ih H Ah u h τ H h Change boundary conditions and solve FAS coarse problem N H ú H = I H f h h +N H Î H h }{{} ũh I H h Nh ũ h }{{} f H τh H Prolong, post-smooth, compute error e h = ú h (N h ) 1 f h Coarse grid with τ is nearly 10 better accuracy

τ adaptivity: an idea for heterogeneous media Applications with localized nonlinearities Subduction, rifting, rupture/fault dynamics Carbon fiber, biological tissues, fracture Frictional contact Adaptive methods fail for heterogeneous media τ adaptivity Rocks are rough, solutions are not smooth Cannot build accurate coarse space without scale separation Fine-grid work needed everywhere at first Then τ becomes accurate in nearly-linear regions Only visit fine grids in interesting places: active nonlinearity, drastic change of solution

Comparison to nonlinear domain decomposition ASPIN (Additive Schwarz preconditioned inexact Newton) Cai and Keyes (2003) More local iterations in strongly nonlinear regions Each nonlinear iteration only propagates information locally Many real nonlinearities are activated by long-range forces locking in granular media (gravel, granola) binding in steel fittings, crack propagation Two-stage algorithm has different load balancing τ adaptivity Nonlinear subdomain solves Global linear solve Minimum effort to communicate long-range information Nonlinearity sees effects as accurate as with global fine-grid feedback Fine-grid work always proportional to interesting changes

Nonlinear and matrix-free smoothing matrix-based smoothers require global linearization nonlinearity often more efficiently resolved locally nonlinear additive or multiplicative Schwarz nonlinear/matrix-free is good if C = (cost to evaluate residual at one point ) N (cost of global residual) 1 finite difference: C < 2 finite volume: C 2, depends on reconstruction finite element: C number of vertices per cell larger block smoothers help reduce C additive correction (Jacobi/Chebyshev/multi-stage) global evaluation, as good as C = 1 but, need to assemble corrector/scaling need spectral estimates or wave speeds

Outlook τ adaptivity: benefits of AMR without fine-scale smoothness Coarse-centric restructuring is a major interface change Algebraic coarsening? Nonlinear smoothers (and discretizations) Smooth in neighborhood of interesting fine-scale features Which discretizations can provide efficient matrix-free smoothers? Does there exist an efficient smoother based on element Neumann problems? Weakening data dependencies enables dynamic load balancing Reliability of error estimates for refreshing τ We want a coarse indicator for whether τ needs to change Phase fields can provide such information Thanks DOE Office of Advanced Scientific Computing Research