Transdimensional Markov Chain Monte Carlo Methods. Jesse Kolb, Vedran Lekić (Univ. of MD) Supervisor: Kris Innanen

Similar documents
On the Limitation of Receiver Functions Method: Beyond Conventional Assumptions & Advanced Inversion Techniques

When one of things that you don t know is the number of things that you don t know

Monte Carlo in Bayesian Statistics

Bayesian Inference and MCMC

Large Scale Bayesian Inference

Bayes Nets: Sampling

Bayesian Regression Linear and Logistic Regression

Who was Bayes? Bayesian Phylogenetics. What is Bayes Theorem?

Bayesian Phylogenetics

Geophysical Journal International

Assessing Regime Uncertainty Through Reversible Jump McMC

Trans-dimensional inverse problems, model comparison and the evidence

Answers and expectations

Efficient adaptive covariate modelling for extremes

A Bayesian Approach to Phylogenetics

Bayesian Inverse Problems

On Bayesian Computation

pyblocxs Bayesian Low-Counts X-ray Spectral Analysis in Sherpa

Bayesian Methods and Uncertainty Quantification for Nonlinear Inverse Problems

Chapter 3: Maximum-Likelihood & Bayesian Parameter Estimation (part 1)

Bayesian room-acoustic modal analysis

ECO 513 Fall 2009 C. Sims HIDDEN MARKOV CHAIN MODELS

Markov-Chain Monte Carlo

Statistical Methods in Particle Physics Lecture 1: Bayesian methods

Doing Bayesian Integrals

Markov chain Monte Carlo methods in atmospheric remote sensing

Monte Carlo Markov Chains: A Brief Introduction and Implementation. Jennifer Helsby Astro 321

Markov Chain Monte Carlo

Markov Chain Monte Carlo methods

STA414/2104 Statistical Methods for Machine Learning II

Geophysical Journal International

Infer relationships among three species: Outgroup:

F denotes cumulative density. denotes probability density function; (.)

Lecture: Gaussian Process Regression. STAT 6474 Instructor: Hongxiao Zhu

Reminder of some Markov Chain properties:

Bayesian Inference in Astronomy & Astrophysics A Short Course

Forward Problems and their Inverse Solutions

ECE295, Data Assimila0on and Inverse Problems, Spring 2015

Leïla Haegel University of Geneva

Lecture 2: From Linear Regression to Kalman Filter and Beyond

Introduction to Bayesian Data Analysis

Uncertainty quantification for inverse problems with a weak wave-equation constraint

Bayesian Networks BY: MOHAMAD ALSABBAGH

An introduction to Markov Chain Monte Carlo techniques

STA 4273H: Statistical Machine Learning

References. Markov-Chain Monte Carlo. Recall: Sampling Motivation. Problem. Recall: Sampling Methods. CSE586 Computer Vision II

Fundamental Probability and Statistics

STA 4273H: Sta-s-cal Machine Learning

Uncertainty analysis for the integration of seismic and CSEM data Myoung Jae Kwon & Roel Snieder, Center for Wave Phenomena, Colorado School of Mines

Statistical Estimation of the Parameters of a PDE

Geophysical Journal International

CPSC 540: Machine Learning

MCMC and Gibbs Sampling. Kayhan Batmanghelich

Nested Sampling. Brendon J. Brewer. brewer/ Department of Statistics The University of Auckland

CS 343: Artificial Intelligence

Bayesian model selection for computer model validation via mixture model estimation

arxiv:astro-ph/ v1 14 Sep 2005

A Beginner s Guide to MCMC

MCMC: Markov Chain Monte Carlo

Thomas Bayes versus the wedge model: An example inference using a geostatistical prior function

Lecture 6: Markov Chain Monte Carlo

Introduction to Machine Learning

The University of Auckland Applied Mathematics Bayesian Methods for Inverse Problems : why and how Colin Fox Tiangang Cui, Mike O Sullivan (Auckland),

ST 740: Markov Chain Monte Carlo

Quantifying Uncertainty

Probabilistic surface reconstruction from multiple data sets: An example for the Australian Moho

A Review of Pseudo-Marginal Markov Chain Monte Carlo

BAYESIAN METHODS FOR VARIABLE SELECTION WITH APPLICATIONS TO HIGH-DIMENSIONAL DATA

Parameter estimation and forecasting. Cristiano Porciani AIfA, Uni-Bonn

Probabilistic modeling. The slides are closely adapted from Subhransu Maji s slides

an introduction to bayesian inference

Markov Networks.

The Metropolis-Hastings Algorithm. June 8, 2012

(5) Multi-parameter models - Gibbs sampling. ST440/540: Applied Bayesian Analysis

Probabilistic Graphical Networks: Definitions and Basic Results

Multimodal Nested Sampling

Bayesian Inference. Chapter 4: Regression and Hierarchical Models

Approximate Inference

Lecture 2: From Linear Regression to Kalman Filter and Beyond

MCMC Sampling for Bayesian Inference using L1-type Priors

SAMSI Astrostatistics Tutorial. More Markov chain Monte Carlo & Demo of Mathematica software

Bayesian Estimation of Input Output Tables for Russia

Computational Methods Short Course on Image Quality

Inadequacies of the Fisher Matrix in Gravitational-wave Parameter Estimation

Markov chain Monte Carlo methods for visual tracking

Learning the hyper-parameters. Luca Martino

Bayesian Methods for Machine Learning

Signal Modeling, Statistical Inference and Data Mining in Astrophysics

Lecture 6: Model Checking and Selection

Likelihood-free MCMC

Bayesian Inference. Chapter 4: Regression and Hierarchical Models

Stat 535 C - Statistical Computing & Monte Carlo Methods. Lecture 18-16th March Arnaud Doucet

Approximate inference in Energy-Based Models

Advanced Statistical Methods. Lecture 6

Probabilistic Geoacoustic Inversion in Complex Environments

Strong Lens Modeling (II): Statistical Methods

Lecture 3. G. Cowan. Lecture 3 page 1. Lectures on Statistical Data Analysis

Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation. EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016

Systematic uncertainties in statistical data analysis for particle physics. DESY Seminar Hamburg, 31 March, 2009

USING GEOSTATISTICS TO DESCRIBE COMPLEX A PRIORI INFORMATION FOR INVERSE PROBLEMS THOMAS M. HANSEN 1,2, KLAUS MOSEGAARD 2 and KNUD S.

Transcription:

Transdimensional Markov Chain Monte Carlo Methods Jesse Kolb, Vedran Lekić (Univ. of MD) Supervisor: Kris Innanen

Motivation for Different Inversion Technique Inversion techniques typically provide a single best-fit model while the fit of other models may be only slightly worse. Inversion algorithms often get stuck in local minima. Non-uniqueness is often uncharacterized. Dimensionality (e.g. number of layers) may be unknown prior to inversion. Uncertainty analysis is desired on individual features within a result.

Probability Density Bayes Theorem Incorporates prior knowledge and data to obtain posterior probability density function (PDF): p m d Likelihood PDF = p d m p(m) p(d) Prior PDF Prior Likelihood Posterior Posterior PDF Evidence

What is Markov Chain Monte Carlo? Markov chain: a process in which the next state only depends on the current state Monte Carlo: Using random numbers to estimate properties of a solution Example of 3-state Markov chain. Courtesy of Gareth Jones.

Metropolis-Hastings Algorithm Tests new set of model parameters, m, in order to sample the posterior probability distribution, p m d. Proposal distribution for m given current model, m : q m m Acceptance probability: α = min 1, p(m )p(d m )q(m m ) p(m)p(d m)q(m m) Example of progression of states. After Sambridge (2010).

Depth (km) Probability MCMC Results After Markov chain has converged, models are binned and the ratio of models in a given bin is proportional to posterior probability in that bin S-vel (km/s) V p /V s After Piana Agostinetti and Malinverno (2010)

Reversible-Jump (Transdimensional) MCMC Allows for jumps between parameter spaces of different dimensions Acceptance probability changes from To α = min α = min 1, p(m )p(d m )q(m m ) p(m)p(d m)q(m m) 1, p(m )p(d m )q(m m ) p(m)p(d m)q(m m) J Jumps between parameter spaces are commonly done using Birth-Death MCMC

RJMCMC for Receiver Function Deconvolution Receiver functions use P to S or S to P conversions from teleseismic waves to infer velocity structure Assumed relation between Parent (P) and Daughter (D) waveforms: D = P G + ε Receiver function, G, is parameterized as an unknown number of Gaussians at unknown lag times with unknown widths and amplitudes. Likelihood of observed daughter waveform given G: p D G = 1 (2π) n C D e P G D T C D 1 P G D

Correlation Noise Parameterization We parameterize the noise covariance matrix with hyperparameters σ, λ, and ω 0 : C Dij = σ 2 e λ t j t i cos (ω 0 λ t j t i ) Noise Param. 1 Noise Param. 2 Noise Param. 3 Real Noise Lag Time (s)

Why Correlation Matters

Process Start with initial model, G, of no Gaussians. Convolve P with G (forward model) and use to calculate likelihood. Propose new model by either birthing or killing a Gaussian, changing an existing Gaussian s amplitude, width, or location, or changing a noise hyperparameter. Calculate forward model, likelihood, and acceptance probability of new model. If acceptance probability is greater than a random number from 0 to 1, accept the model. Otherwise, reject it. Repeat proposal and accept/reject process until convergence and then continue, saving models afterwards.

PS MCMC results Wiener Filtering Waveform Pairs MCMC P D P D P D Time (s)

SP MCMC results Wiener Filtering Waveform Pairs P MCMC D P D P D Time (s)

SP Result Analysis Parent & Daughter Waveforms Location 1: Amplitudes 2: Amplitudes Ensemble Solution 1: Widths 2: Widths # of Gaussians Noise σ Posterior Probability

Spectral Hole Test Solve for G when using STF with spectral holes: D STF = P STF G + ε

Model Comparisons Can compare hypotheses H which may be parameterizations or forward models: With evidence, p d H : p m d, H = p d m, H p(m H) p(d H) p d H = p d m, H p m H dm Point approximations using evidence: AIC = 2log [p d m mle ] + 2k BIC = 2 log p d m mle + klog(n) DIC = E θ [ 2 log p d m + k eff Where m mle is maximum likelihood estimate of the model, k is number of parameters, and n is sample size

Noise Parameterization 1 vs. 3 Parameterization 3 has higher likelihoods and uses fewer Gaussians. X 10 6 X 10 6

Tomography MCMC with parameterization of unknown # of Voronoi nuclei with unknown velocities and locations was proposed (Sambridge et al., 1995). Allows for cells of varying shapes and sizes, and multiple parameterizations for the same structure. Voronoi Nuclei Movement of a Voronoi nucleus to a new location. The velocity structure in each of the teal cells has been changed. After Bodin et al. (2009).

Probability Depth (km) Depth (km) Depth (km) Joint Inversion Only RF Only SWD Joint (RF+SWD) Vs (km/s) p(discontinuity) Vs (km/s) p(discontinuity) Vs (km/s) p(discontinuity) After Bodin et al., (2012)

Probability Depth (km) Depth (km) Depth (km) Joint Inversion Mean Max Vs (km/s) Vs (km/s) p(discontinuity) After Bodin et al., (2012)

Probability Depth (km) Depth (km) Depth (km) Well-Constrained Features Joint Inversion Mean Max More uncertainty on these velocities Vs (km/s) Vs (km/s) p(discontinuity) After Bodin et al., (2012)

Probability Depth (km) Depth (km) Depth (km) Joint Inversion Mean Max Vs (km/s) Vs (km/s) p(discontinuity) Sharp increase in max. posterior model not necessarily a discontinuity After Bodin et al., (2012)

Conclusions Transdimensional MCMC: Is a data-driven approach allowing for dimensionality to be decided by the data; Results in an ensemble solution of models that can make uncertainty analysis simpler; Provides uncertainties on individual features of solutions; Can jump out of local minima and can sample nonunique solutions.

Acknowledgements Vedran Lekic Fellow transdimensionalers CREWES

References Agostinetti, N. Piana, and A. Malinverno. "Receiver function inversion by trans dimensional Monte Carlo sampling." Geophysical Journal International 181.2 (2010): 858-872. Bodin, Thomas, and Malcolm Sambridge. "Seismic tomography with the reversible jump algorithm." Geophysical Journal International 178.3 (2009): 1411-1436. Bodin, Thomas, et al. "Transdimensional inversion of receiver functions and surface wave dispersion." Journal of Geophysical Research: Solid Earth (1978 2012) 117.B2 (2012). Jones, Gareth. Finance Markov Chain Example State Space. Digital image. Wikipedia.org. Wikipedia, n.d. Web. <http://en.wikipedia.org/wiki/file:finance_markov_chain_example_state_sp ace.svg>. Sambridge, Malcolm (2010). Uncertainty in Transdimensional Inverse Problems. PDF File. Lecture Slides.

Happy Valentine s Day!