A multigrid method for large scale inverse problems

Similar documents
Parallelizing large scale time domain electromagnetic inverse problem

Computational methods for large distributed parameter estimation problems with possible discontinuities

ETNA Kent State University

A parallel method for large scale time domain electromagnetic inverse problems

ITERATIVE METHODS FOR NONLINEAR ELLIPTIC EQUATIONS

Computational methods for large distributed parameter estimation problems in 3D

1. Introduction. In this work we consider the solution of finite-dimensional constrained optimization problems of the form

Direct Current Resistivity Inversion using Various Objective Functions

Simulation based optimization

Inversion of 3D electromagnetic data in frequency and time domain using an inexact all-at-once approach

A multilevel, level-set method for optimizing eigenvalues in shape design problems

1. Introduction. In this work, we consider the solution of finite-dimensional constrained optimization problems of the form

Numerical Methods for PDE-Constrained Optimization

A Multilevel Method for Image Registration

An Introduction to Algebraic Multigrid (AMG) Algorithms Derrick Cerwinsky and Craig C. Douglas 1/84

Simultaneous estimation of wavefields & medium parameters

AMS526: Numerical Analysis I (Numerical Linear Algebra)

Inversion of 3D Electromagnetic Data in frequency and time domain using an inexact all-at-once approach

Multigrid Methods and their application in CFD

New Multigrid Solver Advances in TOPS

Full-Waveform Inversion with Gauss- Newton-Krylov Method

MULTIGRID METHODS FOR NONLINEAR PROBLEMS: AN OVERVIEW

A Sobolev trust-region method for numerical solution of the Ginz

Solving PDEs with Multigrid Methods p.1

Parameter Identification in Partial Differential Equations

Achieving depth resolution with gradient array survey data through transient electromagnetic inversion

Algorithms for Constrained Optimization

Constrained Minimization and Multigrid

Aspects of Multigrid

Partial Differential Equations

Adaptive algebraic multigrid methods in lattice computations

Kasetsart University Workshop. Multigrid methods: An introduction

On nonlinear adaptivity with heterogeneity

Iterative regularization of nonlinear ill-posed problems in Banach space

AMG for a Peta-scale Navier Stokes Code

Elliptic Problems / Multigrid. PHY 604: Computational Methods for Physics and Astrophysics II

Algebraic Multigrid as Solvers and as Preconditioner

4TE3/6TE3. Algorithms for. Continuous Optimization

Stabilization and Acceleration of Algebraic Multigrid Method

Numerical Analysis of Electromagnetic Fields

5 Handling Constraints

Constrained Optimization

Regular article Nonlinear multigrid methods for total variation image denoising

Chapter 5. Methods for Solving Elliptic Equations

1. Fast Iterative Solvers of SLE

Preconditioned all-at-once methods for large, sparse parameter estimation problems

3D Space Charge Routines: The Software Package MOEVE and FFT Compared

SF2822 Applied Nonlinear Optimization. Preparatory question. Lecture 9: Sequential quadratic programming. Anders Forsgren

Scientific Computing: An Introductory Survey

Scientific Computing: An Introductory Survey

The Lifted Newton Method and Its Use in Optimization

2.29 Numerical Fluid Mechanics Spring 2015 Lecture 9

On effective methods for implicit piecewise smooth surface recovery

A Novel Aggregation Method based on Graph Matching for Algebraic MultiGrid Preconditioning of Sparse Linear Systems

INTRODUCTION TO MULTIGRID METHODS

Inverse Problems and Optimal Design in Electricity and Magnetism

A Line search Multigrid Method for Large-Scale Nonlinear Optimization

Multigrid solvers for equations arising in implicit MHD simulations

Robust solution of Poisson-like problems with aggregation-based AMG

Nonlinear Least Squares

Newton s Method and Efficient, Robust Variants

Constrained optimization: direct methods (cont.)

FAS and Solver Performance

6. Multigrid & Krylov Methods. June 1, 2010

Infeasibility Detection and an Inexact Active-Set Method for Large-Scale Nonlinear Optimization

Adaptive Multigrid for QCD. Lattice University of Regensburg

Numerical Optimization of Partial Differential Equations

Direct Methods. Moritz Diehl. Optimization in Engineering Center (OPTEC) and Electrical Engineering Department (ESAT) K.U.

A Robust Preconditioner for the Hessian System in Elliptic Optimal Control Problems

Bootstrap AMG. Kailai Xu. July 12, Stanford University

Chapter 3 Numerical Methods

Fast algorithms for the inverse medium problem. George Biros University of Pennsylvania

Multi-source inversion of TEM data: with field applications to Mt. Milligan

Geometric Multigrid Methods for the Helmholtz equations

Optimization and Root Finding. Kurt Hornik

Outline. Scientific Computing: An Introductory Survey. Optimization. Optimization Problems. Examples: Optimization Problems

Adaptive and multilevel methods for parameter identification in partial differential equations

j=1 r 1 x 1 x n. r m r j (x) r j r j (x) r j (x). r j x k

Numerical Optimization Professor Horst Cerjak, Horst Bischof, Thomas Pock Mat Vis-Gra SS09

University of Illinois at Urbana-Champaign. Multigrid (MG) methods are used to approximate solutions to elliptic partial differential

Key words. preconditioned conjugate gradient method, saddle point problems, optimal control of PDEs, control and state constraints, multigrid method

Preface to the Second Edition. Preface to the First Edition

An Inexact Sequential Quadratic Optimization Method for Nonlinear Optimization

OPTIMAL ESTIMATION of DYNAMIC SYSTEMS

Algorithms for nonlinear programming problems II

MULTI-LEVEL TECHNIQUES FOR THE SOLUTION OF THE KINETIC EQUATIONS IN CONDENSING FLOWS SIMON GLAZENBORG

A note on accurate and efficient higher order Galerkin time stepping schemes for the nonstationary Stokes equations

hypre MG for LQFT Chris Schroeder LLNL - Physics Division

Iterative Rigid Multibody Dynamics A Comparison of Computational Methods

Trust-Region SQP Methods with Inexact Linear System Solves for Large-Scale Optimization


High Performance Nonlinear Solvers

An introduction to PDE-constrained optimization

The Removal of Critical Slowing Down. Lattice College of William and Mary

Multigrid Algorithms for Inverse Problems with Linear Parabolic PDE Constraints

Aggregation-based algebraic multigrid

Divergence Formulation of Source Term

Contents. Preface. 1 Introduction Optimization view on mathematical models NLP models, black-box versus explicit expression 3

FWI with Compressive Updates Aleksandr Aravkin, Felix Herrmann, Tristan van Leeuwen, Xiang Li, James Burke

Multigrid Acceleration of the Horn-Schunck Algorithm for the Optical Flow Problem

Transcription:

A multigrid method for large scale inverse problems Eldad Haber Dept. of Computer Science, Dept. of Earth and Ocean Science University of British Columbia haber@cs.ubc.ca July 4, 2003 E.Haber: Multigrid methods for full-space formulation

Collaborator Uri Ascher E.Haber: Multigrid methods for full-space formulation 1

Main Outline Preliminaries - all-at-once methods. Multigrid for the linearized system Discretization Smoothers The multigrid iteration Solving the nonlinear system using an inexact Newton SQP method Example Conclusions E.Haber: Multigrid methods for full-space formulation 2

Preliminaries In many applications we need to evaluate coefficients from noisy measured data Examples: Resistivity/hydrology measurements: σ φ k = q k k = 1...N Electromagnetics: µ 1 E jk iω j σe jk = iω j q k Discretize PDE plus BC and write forward problem as A(m)u = q In these problems we need to infer the model given measurements of the field u m E.Haber: Multigrid methods for full-space formulation 3

Formulation of the inverse problem We have: The PDE+BC Measured data A(m)u = q (physical process) b obs = Qu A priori information What are we after? Solve the differential problem A(m)u = q Fit the data, Qu b obs T ol Get a reasonable model: W (m m ref ) not too large E.Haber: Multigrid methods for full-space formulation 4

Formulation cont. There are two (major) ways to approach the problem Unconstrained: φ(m) = Qu b 2 + β W m 2 = QA(m) 1 q b 2 + β W m 2 Constrained or - simultaneous; all-at-once: L m,u,λ = Qu b 2 + β W m 2 + λ T (A(m)u q) λ - Vector of Lagrange multipliers. E.Haber: Multigrid methods for full-space formulation 5

All-at-once - Unconstrained: The general idea u A(m)u-q=0 m Constrained - Unconstrained E.Haber: Multigrid methods for full-space formulation 6

Linear systems which evolve from all-at-once methods The Gradients: L u = Q T (Qu b) + A(m) T λ = 0 L m = βw T W (m m 0 ) + G(m, u) T λ = 0 L λ = A(m)u q = 0 where G(m, u) = (A(m)u) m. Gauss-Newton iteration 1 : A 0 G Q T Q A T 0 δu δλ = 0 G T βw T W δm L u L λ L m 1 In the Gauss-Newton case we do not use second order information E.Haber: Multigrid methods for full-space formulation 7

The linearized system It is easy to see that the permuted KKT matrix A 0 G Q Q A T 0 0 G T βw T W corresponds to discretizing the differential operator (em ( )) 0 ((e m u)( )) δ(x xi ) (e m ( )) 0 0 ( u) T (e m )( ( )) β 2 plus BC This PDE system can be efficiently solved using multigrid methods E.Haber: Multigrid methods for full-space formulation 8

Ellipticity of the PDE system Assuming we have data everywhere, consider one mode δu δû δλ = δˆλ e ıθ x δm δ ˆm Denote σ = e m and obtain the symbol C = σ θ 2 0 ıσ( u) T θ 1 σ θ 2 0 0 ıσ( u) T θ β θ 2 Thus det(c) = σ 2 (β θ 6 + [( u) T θ] 2 ) The system is therefore elliptic for any β > 0 E.Haber: Multigrid methods for full-space formulation 9

Discretization of the system - I Care must be taken to ensure that the discrete version of the Euler-Lagrange equations is h- elliptic. Given m at cell centers, there are two possible approaches to the discretization of the forward problem Cell-centered Nodal 2 1.5 1 0 0 0 0.5 m O O 0 0 0 0 0.5 O O 1 0 0 0 1.5 2 2 1.5 1 0.5 0 0.5 1 1.5 2 E.Haber: Multigrid methods for full-space formulation 10

Discretization of the system I - cont. The continuous linearized Euler-Lagrange equations are (e m (δu)) ((e m u)δm) = g u δ(x xi )δu + (e m (δλ)) = g λ ( u) T (e m )( (δλ)) β 2 δm = g m If we use a cell-centered discretization for the forward problem, then the term ( u) T e m ( (δλ)) must be calculated using long differences in both u and λ In this case, for very small regularization parameter β, the h-ellipticity of the system can be lost on coarser grids. E.Haber: Multigrid methods for full-space formulation 11

Discretization of the system I - cont. m,u,λ u/ y u/ x ( u) [exp(m)] ( λ) Note - Compact discretization for the forward problem may still lead to non-compact discretization of the inverse problem! E.Haber: Multigrid methods for full-space formulation 12

Discretization of the system I - cont. To avoid the difficulty associated with long differencing when the regularization parameter is small, we may introduce an O(h 2 ) artificial regularization (similar to Levenberg-Marquardt for the reduced Hessian and to pressure correction techniques in CFD) (e m (δu)) ((e m u)δm) = g u δ(x xi )δu + (e m (δλ)) = g λ ( u) T (e m )( (δλ)) (β + ωh 2 ) 2 δm = g m E.Haber: Multigrid methods for full-space formulation 13

Discretization of the system II The problem of long differences can be solved by nodal discretization for u and λ while m remains at cell centers. In this case we use only short differences and therefore obtain an h-elliptic discretization even for very small regularization parameters. u/ x m u/ y ( u) [exp(m)] ( λ) E.Haber: Multigrid methods for full-space formulation 14

Discretization of the system and optimization We discretize the optimization problem rather than the Euler-Lagrange equations directly, thereby retaining KKT symmetry in the discrete necessary conditions. We obtain a discrete optimization problem, and use optimization techniques (e.g. variants of SQP [Nocedal & Wright 1999] ) E.Haber: Multigrid methods for full-space formulation 15

Smoothing I Cell-centered discretization In this case all unknowns are at the same point. For small regularization parameter values, the system is tightly coupled. We therefore use Collective Symmetric Gauss- Seidel relaxation. During a relaxation sweep invert a 3 3 block at each grid point. m u λ E.Haber: Multigrid methods for full-space formulation 16

Smoothing II Cell-node discretization The grid is staggered and we therefore use box relaxation. Box relaxation I [Vanka 1986] Box relaxation II - Eight-color box relaxation (in the spirit of red-black boxes) Box Eight color box m u,λ E.Haber: Multigrid methods for full-space formulation 17

The Multigrid iteration Prolongation and restriction Nodal unknowns - Trilinear interpolation and full-weight restriction. Cell unknowns - Trilinear interpolation and fullweight restriction. Coarse grid operator by Galerkin coarsening. We perform a classical W(2,2) cycle. E.Haber: Multigrid methods for full-space formulation 18

The Multigrid iteration in an Inexact Newton solver Reminder We are solving a nonlinear problem where at each iteration the linear system must be solved. Hδy = g In inexact Multigrid-Newton methods, we solve this system to a very rough tolerance (say 0.1 [Kelley 1998] ). This can be accomplished by one W(2,2) cycle per Inexact Newton iteration. Note The cost of calculating the Jacobian is negligible compared with one W-cycle. E.Haber: Multigrid methods for full-space formulation 19

The Inexact Newton solver, Optimization issues. We incorporate the method into an inexact Sequential Quadratic Programming (SQP) version [Heinkenshlus 1996]. Add a globalization strategy (using a line search and the l 1 merit function) [Flecher 1981]. Further stabilization is obtained by inexact projection to the constraint [Nocedal & Wright 1999] E.Haber: Multigrid methods for full-space formulation 20

Numerical experiments I - The linearized problem The Setting The PDE is e m u = q in the interval [ 1, 1] 3. Apply natural BC. We use cell-centered and nodal finite volume discretizations for u, where m is at cell centers. We measure u on a 6 3 -grid uniformly spaced in the interval [ 0.6, 0.6] (contaminated with noise) The fine grid has 32 3 cells which leads to roughly 10 5 unknowns E.Haber: Multigrid methods for full-space formulation 21

Numerical experiments I - The linearized problem To test the multigrid scheme, we solve the system which arises in the first iteration to accuracy of 10 6 Rate of convergence for different β s and different discretizations β Levels CC SCC CN 10 2 4 0.11 0.06 0.04 3 0.05 0.05 0.04 10 3 4 NC 0.09 0.04 3 0.35 0.09 0.04 10 4 4 NC 0.12 0.04 3 NC 0.12 0.04 CC - Cell Centered SCC - Stabilized Cell Centered CN - Cell-Node E.Haber: Multigrid methods for full-space formulation 22

Numerical experiments I - The linearized problem, cont. The stabilization of the cell-centered discretization is needed for small values of the regularization parameter Cell-Node performs better than Cell Centered (possible due to h-ellipticity effects) Using cell-node, the multigrid method is only slightly sensitive to the size of the regularization parameter E.Haber: Multigrid methods for full-space formulation 23

Solving the nonlinear problem In the second experiment we solve the nonlinear problem using inexact Newton. Number of nonlinear iterations and of W-cycles needed to solve the nonlinear problem when using an inexact Newton method. β Newton iterations SCC CN 10 1 8 8 8 10 2 8 9 8 10 3 14 17 14 10 4 17 21 17 Note that we manage to solve the entire nonlinear inverse problem in a total cost which is not much larger than that of solving a few forward problems. E.Haber: Multigrid methods for full-space formulation 24

Solving the nonlinear problem cont. The solutions Smooth training model True smooth model 10 10 20 20 30 30 40 40 50 50 60 20 40 60 80 60 20 40 60 80 2 3 4 5 2 3 4 5 Rough training model True rough model 10 10 20 20 30 30 40 40 50 50 60 20 40 60 80 60 20 40 60 80 2 3 4 5 2 3 4 5 Slices in the true model (lest) and the reconstructed model for z = 0.3, 0.55, 0.75. E.Haber: Multigrid methods for full-space formulation 25

Solving the nonlinear problem cont. Convergence of the iteration 10 0 10 1 10 2 10 3 g / g 0 10 4 10 5 A(m)u q / q 10 6 10 7 10 8 1 2 3 4 5 6 7 8 Relative gradient and residual of the constraint (PDE) as a function of iteration for β = 10 2. E.Haber: Multigrid methods for full-space formulation 26

Conclusions & some related questions Using an h-elliptic discretization, it is possible to solve the inverse problem in a cost which is not much larger than of a forward problem Compact discretizations for the forward problem do not necessarily lead to compact discretizations of the inverse problem. Related questions Incorporating a-priori information Robust statistics - choosing β using GCV, L- curve, etc. Incorporation of better SQP-type schemes Parallelism in the case of multi-experiments E.Haber: Multigrid methods for full-space formulation 27