Uncertainty quantification for Wavefield Reconstruction Inversion

Similar documents
Uncertainty quantification for Wavefield Reconstruction Inversion

Uncertainty quantification for inverse problems with a weak wave-equation constraint

Wavefield Reconstruction Inversion (WRI) a new take on wave-equation based inversion Felix J. Herrmann

Source estimation for frequency-domain FWI with robust penalties

Simultaneous estimation of wavefields & medium parameters

Comparison between least-squares reverse time migration and full-waveform inversion

Multiple Scenario Inversion of Reflection Seismic Prestack Data

FWI with Compressive Updates Aleksandr Aravkin, Felix Herrmann, Tristan van Leeuwen, Xiang Li, James Burke

A projected Hessian for full waveform inversion

Full-Waveform Inversion with Gauss- Newton-Krylov Method

Migration with Implicit Solvers for the Time-harmonic Helmholtz

Matrix Probing and Simultaneous Sources: A New Approach for Preconditioning the Hessian

Development of Stochastic Artificial Neural Networks for Hydrological Prediction

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 13: SEQUENTIAL DATA

STA414/2104 Statistical Methods for Machine Learning II

Sparsity-promoting migration with multiples

SUMMARY REVIEW OF THE FREQUENCY DOMAIN L2 FWI-HESSIAN

Probing the covariance matrix

Seismic data interpolation and denoising using SVD-free low-rank matrix factorization

Low-rank Promoting Transformations and Tensor Interpolation - Applications to Seismic Data Denoising

Structured tensor missing-trace interpolation in the Hierarchical Tucker format Curt Da Silva and Felix J. Herrmann Sept. 26, 2013

Towards full waveform inversion: A torturous path

Approximate- vs. full-hessian in FWI: 1D analytical and numerical experiments

Waveform inversion for attenuation estimation in anisotropic media Tong Bai & Ilya Tsvankin Center for Wave Phenomena, Colorado School of Mines

Strong Lens Modeling (II): Statistical Methods

Hierarchical sparse Bayesian learning for structural health monitoring. with incomplete modal data

The Inversion Problem: solving parameters inversion and assimilation problems

Variational Methods in Bayesian Deconvolution

Full waveform inversion of shot gathers in terms of poro-elastic parameters

Introduction to General and Generalized Linear Models

SIO223A A Brief Introduction to Geophysical Modeling and Inverse Theory.

MCMC Sampling for Bayesian Inference using L1-type Priors

Parameter estimation: A new approach to weighting a priori information

Introduction to Bayesian methods in inverse problems

Full-waveform inversion application in different geological settings Denes Vigh*, Jerry Kapoor and Hongyan Li, WesternGeco

Computational Methods Short Course on Image Quality

Application of matrix square root and its inverse to downward wavefield extrapolation

c 2016 Society for Industrial and Applied Mathematics

Dimension-Independent likelihood-informed (DILI) MCMC

Dynamic System Identification using HDMR-Bayesian Technique

SLIM. University of British Columbia

Outline. 1 Introduction. 2 Problem and solution. 3 Bayesian tracking model of group debris. 4 Simulation results. 5 Conclusions

σ(a) = a N (x; 0, 1 2 ) dx. σ(a) = Φ(a) =

Compressive Sensing Applied to Full-wave Form Inversion

STA 4273H: Sta-s-cal Machine Learning

Bayesian Inference: Principles and Practice 3. Sparse Bayesian Models and the Relevance Vector Machine

Wenyong Pan and Lianjie Huang. Los Alamos National Laboratory, Geophysics Group, MS D452, Los Alamos, NM 87545, USA

Fully Bayesian Deep Gaussian Processes for Uncertainty Quantification

SUMMARY INTRODUCTION. f ad j (t) = 2 Es,r. The kernel

Application of new Monte Carlo method for inversion of prestack seismic data. Yang Xue Advisor: Dr. Mrinal K. Sen

STA 4273H: Statistical Machine Learning

Scalable algorithms for optimal experimental design for infinite-dimensional nonlinear Bayesian inverse problems

Course on Inverse Problems

(Extended) Kalman Filter

Recent results in curvelet-based primary-multiple separation: application to real data

Overview Avoiding Cycle Skipping: Model Extension MSWI: Space-time Extension Numerical Examples

W011 Full Waveform Inversion for Detailed Velocity Model Building

Large Scale Bayesian Inference

STA 4273H: Statistical Machine Learning

Simultaneous Multi-frame MAP Super-Resolution Video Enhancement using Spatio-temporal Priors

PART I INTRODUCTION The meaning of probability Basic definitions for frequentist statistics and Bayesian inference Bayesian inference Combinatorics

Physics 403. Segev BenZvi. Numerical Methods, Maximum Likelihood, and Least Squares. Department of Physics and Astronomy University of Rochester

Chapter 3 Numerical Methods

Mitigating data gaps in the estimation of primaries by sparse inversion without data reconstruction

Youzuo Lin and Lianjie Huang

Optimization Tools in an Uncertain Environment

Compressive sampling meets seismic imaging

SUMMARY. (Sun and Symes, 2012; Biondi and Almomin, 2012; Almomin and Biondi, 2012).

Bayesian Lithology-Fluid Prediction and Simulation based. on a Markov Chain Prior Model

Bayesian reservoir characterization

Gaussian Process Approximations of Stochastic Differential Equations

Reliability Monitoring Using Log Gaussian Process Regression

SUMMARY. The main contribution of this paper is threefold. First, we show that FWI based on the Helmholtz equation has the advantage

Probabilistic Graphical Models

Full waveform inversion and the inverse Hessian

Supplementary Note on Bayesian analysis

Direct Current Resistivity Inversion using Various Objective Functions

ICES REPORT March Tan Bui-Thanh And Mark Andrew Girolami

Achieving depth resolution with gradient array survey data through transient electromagnetic inversion

MTTTS16 Learning from Multiple Sources

Automatic Differentiation Equipped Variable Elimination for Sensitivity Analysis on Probabilistic Inference Queries

Efficient Data Assimilation for Spatiotemporal Chaos: a Local Ensemble Transform Kalman Filter

Nonparametric Bayesian Methods (Gaussian Processes)

The Kalman Filter ImPr Talk

Course on Inverse Problems

Course on Inverse Problems Albert Tarantola

Seismic Modeling, Migration and Velocity Inversion

Microseismic Event Estimation Via Full Waveform Inversion

Introduction to Machine Learning

Fighting the Curse of Dimensionality: Compressive Sensing in Exploration Seismology

Bayesian Calibration of Simulators with Structured Discretization Uncertainty

Bayesian Networks: Construction, Inference, Learning and Causal Interpretation. Volker Tresp Summer 2014

Bayesian System Identification based on Hierarchical Sparse Bayesian Learning and Gibbs Sampling with Application to Structural Damage Assessment

Basic Sampling Methods

STA 4273H: Statistical Machine Learning

New Fast Kalman filter method

Recent Advances in Bayesian Inference Techniques

MEASUREMENT UNCERTAINTY AND SUMMARISING MONTE CARLO SAMPLES

Anisotropic anelastic seismic full waveform modeling and inversion: Application to North Sea offset VSP data

18 : Advanced topics in MCMC. 1 Gibbs Sampling (Continued from the last lecture)

Transcription:

Uncertainty quantification for Wavefield Reconstruction Inversion Zhilong Fang *, Chia Ying Lee, Curt Da Silva *, Felix J. Herrmann *, and Rachel Kuske * Seismic Laboratory for Imaging and Modeling (SLIM), University of British Columbia Department of Mathematics, University of British Columbia Abstract In this work, we propose a method to quantify the uncertainty of wavefield reconstruction inversion under the framework of Bayesian inference. Unlike the conventional method using the wave equation as the forward mapping, we involve the wave equation misfit in the posterior distribution and propose a new posterior distribution. The negative log-likelihood of the new distribution is less oscillatory than that of the conventional posterior distribution, and its Gauss-Newton Hessian is a diagonal matrix that can be generated without any additional computational cost. We use the diagonal Gauss-Newton Hessian to derive an approximate Gaussian distribution at the maximum likelihood point to quantify the uncertainty. This method makes the uncertainty quantification for WRI computationally tractable and is able to provide reasonable uncertainty analysis based on our numerical results.

Introduction Full-waveform inversion (FWI) (Tarantola and Valette, 1982a; Virieux and Operto, 2009) is a popular method in seismic exploration that uses the waveform information to reconstruct subsurface structure. However, conventional FWI suffers from nonlinearity and local minima due to the elimination of the PDE constraint in the optimization problem. Wavefield reconstruction inversion (WRI) (van Leeuwen and Herrmann, 2013; Peters et al., 2014) is a new approach of FWI that reduces the nonlinearity and lessens the effect of local minima. Instead of explicitly solving the wave equation, WRI relaxes the wave equation constraint by proposing a penalty formulation that explores both model and wavefield spaces. By exploiting this larger search space, WRI reduces the likelihood of reaching a local minimum. Moreover, compared to conventional FWI, whose Gauss-Newton Hessian is typically a dense matrix, the corresponding Gauss-Newton Hessian of WRI is a diagonal matrix and can be generated without any additional cost once wavefields are computed (van Leeuwen and Herrmann, 2013). Statistical inversion aims to obtain the posterior distribution of the model parameters given the distribution of the observed data and prior model information (Tarantola and Valette, 1982b) and quantify the uncertainty based on the posterior distribution. However, quantifying the uncertainty for conventional FWI is difficult. The forward mapping involves explicit PDE solves and is therefore very expensive computationally. Moreover the number of unknown parameters is also very large, which renders Markov chain Monte Carlo type methods intractable due to the large number of iterations needed to reach convergence. Moreover, due to the strong nonlinearity of the forward mapping, it is also difficult to generate an approximate distribution for the conventional FWI objective function to correctly quantify the uncertainty. In this work, we propose a new posterior distribution, in which we use the wave equation misfit of WRI. Compared to the conventional FWI, the negative log-likelihood of this new posterior distribution is less prone to local minima and the Gauss-Newton Hessian, which is diagonal, is easy to obtain. These properties allow us to derive a Gaussian distribution that approximates the true posterior distribution, and quantify uncertainty based on this approximate distribution. No additional computational cost related to sampling and estimating the Gauss-Newton Hessian is required, unlike for the standard FWI case. This makes the method computationally feasible to quantify the uncertainty of WRI. Numerical experiments illustrate that the uncertainty quantified by this method is reasonable. Posterior Distribution for WRI The deterministic wavefield reconstruction inversion is aimed at solving the following unconstrained optimization problem (1), min F(u,v) = P k u d 2 u,v 2 + λ 2 A (v)u q 2 2, (1) which aims to balance the PDE misfit A(v)u q with the data misfit Pu d, simultaneously over velocity models v and wavefields u, for all shots indexed by k = 1,2,...,N s and a few relevant frequencies indexed by l = 1,2,...,N f. P is the operator that restricts wavefields to receiver locations, and q represents sources. This joint optimization problem, as stated, is difficult to solve and requires a large amount of memory to store all the wavefields. One approach is to use the variable projection method (van Leeuwen and Herrmann, 2013) to reduce the number of parameters by finding optimal wavefields u for given velocity model v at each iteration. As a result, the original joint optimization problem (1) is reduced to the problem (2), min v F(u(v),v) = P k u (v) d 2 2 + λ 2 A (v)u (v) q 2 2, (2) and optimal wavefields u are generated by solving the following least-squares problem (3): ( ) ( ) λa λq u =. (3) P k d 77 th EAGE Conference & Exhibition 2015

In this work, we use the same idea from deterministic WRI to derive the posterior distribution of velocity models v given observed data d based on the Bayesian inference. The conventional statistical FWI defines the posterior distribution as the product of data misfit likelihood and prior distribution, which only considers the uncertainty of data. However, the forward modeling kernel also should contain uncertainty due to the fact that we do not fit the PDE exactly. Based on this observation, we derive a new posterior distribution of velocity models (Equation 4), ρ post (v,u(v)) exp( Pu (v) d 2 λ 2 A (v)u (v) q 2 v v noise prior 2 ), (4) prior which considers data misfit likelihood (the first term), wave equation misfit likelihood (the second term) and the prior distribution (the last term). Covariance matrices Σ noise, Σ and Σ prior reflect uncertainty level and statistics of data, wave equation and prior model, respectively. We use variable projection method to generate optimal wavefields u for fixed velocity model v by solving the following least-squares problem (5), ( λ/2 A /2 noise P k ) u = ( λ/2 q /2 noise d ). (5) By involving the wave equation misfit, the Gauss-Newton Hessian of the negative log-likelihood of Equation (4) is a diagonal matrix, which makes it less nonlinear than that of conventional statistical FWI (van Leeuwen and Herrmann, 2013). Quantify the Uncertainty The posterior probability distribution (4) is essential for the uncertainty quantification. However, quantifying statistical parameters based on Equation (4) is extremely expensive because of the huge computational cost of solving least-squares problem (5) and the high dimension of v. These properties renders traditional MCMC methods, which require an enormous amount of PDE solves (on the order of 10000 objective evaluations or more) in order to generate reasonable results, computationally intractable. An alternative method is to use a Gaussian distribution at the maximum likelihood point to approximate the original distribution, which is much cheaper to deal with than the posterior distribution (4). The approximate Gaussian distribution for conventional FWI is difficult to generate because the Gauss- Newton Hessian of the negative log-likelihood of the posterior distribution is dense and requires a large number of PDE solves to calculate. On the other hand, the Gauss-Newton Hessian of the negative log-likelihood of (4) is a diagonal matrix. Using the following Equations (6) and (7), g = H = 2λ 2 diag(conj(u )) A T v (A u q ), (6) 2λ 2 diag(conj(u )) A T A v v diag(u ), (7) we can compute the gradient and the diagonal Gauss-Newton Hessian and derive a quadratic approximation of the negative log-likelihood. Figure 1 shows the comparison of the negative log-likelihood (green line) and its quadratic approximation (blue line) at the maximum likelihood point (technically, the maximum a posteriori point or MAP) in four different random directions. We can observe that around the maximum likelihood point, the quadratic approximation matches the negative logarithm function quite well, which implies that the approximate Gaussian distribution N (v MAP H 1 MAP g MAP,H 1 MAP ) is a good approximation to the posterior distribution. Numerical Experiment We test our uncertainty quantification method for WRI on the BG compass model. The model size is 4.5km x 2km. The true and initial models are shown in the Figure 2. 91 sources are located at the surface every 50m and 451 receivers are located at the surface every 10m. Ten frequency bands ranging from 77 th EAGE Conference & Exhibition 2015

Figure 1: Comparisons of the true negative log-likelihood (green line) and the approximated quadratic function (blue line) in four different random directions dv with step-size α. {2,3,4} Hz to {29,30,31} Hz are used for the inversion. In this experiment, to simplify the problem, we assume that we know the noise-free mean of the data, which may not be available in practice, and generate it using a 9-point finite difference method. The standard deviation for the noise of data is 1, the standard deviation for the PDE is 1 and the penalty parameter λ is selected to be 100 to balance the data misfit and PDE misfit. We assume that we do not have any prior information so that we do not include the prior distribution term in (4) on our velocity models. (a) True model (b) Initial model Figure 2: The true velocity model (a) and the initial velocity model (b). We first solve the deterministic optimization problem to find the maximum likelihood point (Figure 3a). Then we calculate the gradient and Gauss-Newton Hessian at this point and form the approximate Gaussian distribution. The standard deviation (Figure 3b) and confidence intervals (Figure 4) are computed using our Gaussian approximation to the true distribution. From the computed standard deviation, we observe that the standard deviation at the deep and boundary areas of the model is higher than in other areas, which agrees with our intuition. We also observe similar phenomena in the confidence interval results that the velocity in deeper regions has a larger uncertainty compared to shallower regions. Both observations corresponds to the fact that the observed data is less sensitive to the velocity in deep and boundary area. In order to verify our uncertainty results, we invert five sets of data generated by adding noise according to the prescribed noise distribution above. These five results are shown in the Figure (4) with the initial model (blue line), true model (green line) and maximum likelihood point result (black line). All five of these results lie in the confidence interval, which indicates that our results are reasonable. Conclusion In this work, we propose a method to quantify the uncertainty of wavefield reconstruction inversion by introducing the wave equation misfit to the posterior distribution. This makes the negative log-likelihood of the posterior distribution less prone to local minima than in the FWI case and the diagonal GaussNewton Hessian is straightforward and efficient to generate. We use this Gauss-Newton Hessian to derive an approximate Gaussian distribution and quantify the uncertainty of the velocity model using this approximation. Since the Gauss-Newton Hessian is diagonal, we avoid solving a large number of PDEs that would otherwise have to be computed when estimating uncertainty for standard FWI. This method 77th EAGE Conference & Exhibition 2015

(a) Maximum likelihood point (b) Standard Deviation Figure 3: The Maximum likelihood point (a) and the standard deviation of the inversion result (b). At the deep part of the model and near the boundary, the velocity (a) is not well resolved since the observed data influences the inverted model much less in these regions than in the rest of the model, and these bad resolved results are reflected in larger standard deviation. At the shallow part, the standard deviation is smaller, which shows less uncertainty at this area. (a) x = 1000m (b) x = 2500m (c) x = 3500m Figure 4: Confidence interval (gray background) at lateral position x = 1000m, 2500m, and 3500m. Green line - the true velocity, black line the MAP result, blue line the initial velocity and others inversion result from noisy data obeying the specified noise distribution. All inversion results lie within the confidence interval we obtain. makes the uncertainty quantification for WRI computationally tractable and can provide reasonable uncertainty results based on the numerical experiments. In this work we assume we had noise-free data to compute confidence intervals as a proof of concept. In the future, we will consider the case of handling noisy data in our uncertainty quantification framework. Acknowledgment We thank all our sponsors of DNOISE II and SINBAD II for their continued support of this research.. References Peters, B., Herrmann, F.J. and van Leeuwen, T. [2014] Wave-equation based inversion with the penalty method: adjoint-state versus wavefield-reconstruction inversion. EAGE, doi:10.3997/2214-4609.20140704. Tarantola, A. and Valette, B. [1982a] Generalized nonlinear inverse problems solved using the least squares criterion. Reviews of Geophysics, 20(2), 219 232, ISSN 1944-9208, doi:10.1029/rg020i002p00219. Tarantola, A. and Valette, B. [1982b] Inverse problems = quest for information. Journal of Geophysics, 50, 159 170. van Leeuwen, T. and Herrmann, F.J. [2013] Mitigating local minima in full-waveform inversion by expanding the search space. Geophysical Journal International, 195, 661 667, doi:10.1093/gji/ggt258. Virieux, J. and Operto, S. [2009] An overview of full-waveform inversion in exploration geophysics. GEOPHYSICS, 74(6), WCC1 WCC26, doi:10.1190/1.3238367. 77th EAGE Conference & Exhibition 2015