Bayesian PalaeoClimate Reconstruction from proxies:

Similar documents
STA 4273H: Statistical Machine Learning

Log Gaussian Cox Processes. Chi Group Meeting February 23, 2016

Lecture: Gaussian Process Regression. STAT 6474 Instructor: Hongxiao Zhu

MCMC Sampling for Bayesian Inference using L1-type Priors

Bayesian Regression Linear and Logistic Regression

Nonparametric Bayesian Methods (Gaussian Processes)

Large Scale Bayesian Inference

Fast Approximate Inverse Bayesian Inference in non-parametric Multivariate Regression

Creating Non-Gaussian Processes from Gaussian Processes by the Log-Sum-Exp Approach. Radford M. Neal, 28 February 2005

Contents. Part I: Fundamentals of Bayesian Inference 1

Markov chain Monte Carlo methods for visual tracking

Probabilistic Models of Past Climate Change

Approximate Bayesian Computation and Particle Filters

Bayesian Inference and MCMC

Lecture : Probabilistic Machine Learning

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 13: SEQUENTIAL DATA

PART I INTRODUCTION The meaning of probability Basic definitions for frequentist statistics and Bayesian inference Bayesian inference Combinatorics

Analysing geoadditive regression data: a mixed model approach

Disease mapping with Gaussian processes

When one of things that you don t know is the number of things that you don t know

STA 4273H: Statistical Machine Learning

Machine Learning Techniques for Computer Vision

STA 4273H: Sta-s-cal Machine Learning

An Introduction to Reversible Jump MCMC for Bayesian Networks, with Application

Spatial Statistics with Image Analysis. Outline. A Statistical Approach. Johan Lindström 1. Lund October 6, 2016

Lecture 2: From Linear Regression to Kalman Filter and Beyond

Human Pose Tracking I: Basics. David Fleet University of Toronto

Fast Likelihood-Free Inference via Bayesian Optimization

Variational Bayesian Logistic Regression

Consistent Downscaling of Seismic Inversions to Cornerpoint Flow Models SPE

Infer relationships among three species: Outgroup:

STA 4273H: Statistical Machine Learning

Bayesian Phylogenetics:

Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation. EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016

Dynamic System Identification using HDMR-Bayesian Technique

Probability and Information Theory. Sargur N. Srihari

Lecture 2: From Linear Regression to Kalman Filter and Beyond

ntopic Organic Traffic Study

Hastings-within-Gibbs Algorithm: Introduction and Application on Hierarchical Model

Detection ASTR ASTR509 Jasper Wall Fall term. William Sealey Gosset

Penalized Loss functions for Bayesian Model Choice

Gaussian Process Approximations of Stochastic Differential Equations

Bayesian Networks BY: MOHAMAD ALSABBAGH

Dimension Reduction. David M. Blei. April 23, 2012

Robotics. Lecture 4: Probabilistic Robotics. See course website for up to date information.

Chapter 4 - Fundamentals of spatial processes Lecture notes

Computer Vision Group Prof. Daniel Cremers. 3. Regression

Machine Learning. Probabilistic KNN.

Review. DS GA 1002 Statistical and Mathematical Models. Carlos Fernandez-Granda

Introduction to Probabilistic Graphical Models: Exercises

Default Priors and Effcient Posterior Computation in Bayesian

Bayesian Methods for Machine Learning

Supplemental Resource: Brain and Cognitive Sciences Statistics & Visualization for Data Analysis & Inference January (IAP) 2009

Spatio-temporal modeling of avalanche frequencies in the French Alps

Parameter Estimation. William H. Jefferys University of Texas at Austin Parameter Estimation 7/26/05 1

Lecture 5. G. Cowan Lectures on Statistical Data Analysis Lecture 5 page 1

Large-scale Ordinal Collaborative Filtering

Markov Chain Monte Carlo

Preliminary Statistics Lecture 2: Probability Theory (Outline) prelimsoas.webs.com

Advances and Applications in Perfect Sampling

Bayesian hierarchical modelling for data assimilation of past observations and numerical model forecasts

CPSC 540: Machine Learning

INTRODUCTION TO BAYESIAN INFERENCE PART 2 CHRIS BISHOP

Hybrid Dirichlet processes for functional data

Bayesian Calibration of Simulators with Structured Discretization Uncertainty

Multimodal Nested Sampling

ECE295, Data Assimila0on and Inverse Problems, Spring 2015

Basic Sampling Methods

PROBABILISTIC PROGRAMMING: BAYESIAN MODELLING MADE EASY

Beyond MCMC in fitting complex Bayesian models: The INLA method

Variability within multi-component systems. Bayesian inference in probabilistic risk assessment The current state of the art

Bayesian Inference. Chapter 1. Introduction and basic concepts

Bayesian inference J. Daunizeau

Efficient Likelihood-Free Inference

Bayesian inference J. Daunizeau

CSEP 573: Artificial Intelligence

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California

EE562 ARTIFICIAL INTELLIGENCE FOR ENGINEERS

A short introduction to INLA and R-INLA

Bayesian Learning. HT2015: SC4 Statistical Data Mining and Machine Learning. Maximum Likelihood Principle. The Bayesian Learning Framework

Linear Dynamical Systems

Bayesian statistics and modelling in ecology

Bayesian Models in Machine Learning

Uncertainty Quantification and Validation Using RAVEN. A. Alfonsi, C. Rabiti. Risk-Informed Safety Margin Characterization.

CosmoSIS Webinar Part 1

Machine Learning. Probability Basics. Marc Toussaint University of Stuttgart Summer 2014

Approximate Inference Part 1 of 2

PROBABILISTIC PROGRAMMING: BAYESIAN MODELLING MADE EASY. Arto Klami

Comparing Non-informative Priors for Estimation and Prediction in Spatial Models

Approximate Inference Part 1 of 2

Bayesian RL Seminar. Chris Mansley September 9, 2008

What is (certain) Spatio-Temporal Data?

A Bayesian Approach to Phylogenetics

MAT Inverse Problems, Part 2: Statistical Inversion

F denotes cumulative density. denotes probability density function; (.)

Monte Carlo in Bayesian Statistics

STAT 518 Intro Student Presentation

Probabilistic Graphical Models

BAYESIAN METHODS FOR VARIABLE SELECTION WITH APPLICATIONS TO HIGH-DIMENSIONAL DATA

Bayesian Estimation of Input Output Tables for Russia

Transcription:

Bayesian PalaeoClimate Reconstruction from proxies: Framework Bayesian modelling of space-time processes General Circulation Models

Space time stochastic process C = {C(x,t) = Multivariate climate at all locations x and at all times t} Eg 3 dims (Growing degree days, Mean temp coldest month, AET/PET) 14000 years at 20 year intervals on 50 x 50 grid = 5250000 dim random variable

Inference on C C! proxies But other influences and meas error Forward model Pr( proxy data C) modern and ancient Inverse model Pr(C proxy data) " Pr( proxy data C)Pr(C) dim(c) = 5250000 Sample from Pr(C proxy data)

Modules for Inference on C Decompose Pr( proxy data C) = Pr( proxy data old C)Pr( proxy data new C) Pr( proxy data old C) = Pr( pollen data old C)Pr(diatom data old C)...

Modules for Inference on C Decompose Pr( pollen data old C) = Pr( pollen data old,site1 C)! Pr( pollen data old,site2 C)! Pr( pollen data old,site3 C)!...

Prior Pr(C) Descriptive {C(x,t)} stochastically smooth eg Gaussian process eg Heavy tail Random Walk Physical {C(x,t)} satisfies GCM equations

Sampling the Palaeoclimate Samples of C(x,t) (sets of 5250000 random nums) = plausible equally likely stories of what happened consistent with data & theory from which can (eg) Construct space-time averages (eg W Europe, 500y) Time series at one location Research Dynamics, Extremes, Comparisons

Modelled Climate Histories; eg at one site summary MTCO t t = 20,40, 14000 700 marginal summaries Multi-modal Highest Posterior Density Regions

Sampled climate histories

Sampled climate histories

Sampled climate histories

Sampled climate histories

Sampled climate histories smooth

Modelled Climate Histories at one site MTCO t t = 20,40, 14000 700 marginal summaries Highest Posterior Density Regions Multi-modal Other summaries Eg Max change in 20 years

Alder response Alder percentage 100 100 50 50 0 0 0 2500 5000-40 -20 0 20 GDD5 MTCO Ireland, currently

Alder mean response parameter 20 0 MTCO -20-40 0 2500 5000 GDD5 High alder count! about (1600,6) But large noise parameters!

Multivar non para regression Response surfaces x 1 (c), x 2 (c),... Modern data, Zero-inf. Poisson Gaussian prior 2D climate

Zero Inflated Poisson 1D climate Latent x j (c);poisson! = e x(c) ; Pr(0)= " $ $ # ex(c) 1+e x(c) % ' ' & (

Climate inf, given counts y and x(c) count=hi count=lo Bimodal Likelihood of obs count, for every possible c

Climate inf, given counts y only count=hi count=lo Likelihood of obs count, for every possible c

Climate likelihoods, given counts y Implied climate likelihoods given data and climate resp surfaces Taxon A Taxon B Taxon C All taxa marginal joint 1D climate

Climate history; joint inference Implied joint climate likelihoods given data + Joint prior reflecting smooth climate 28 taxa at Depth 1 Depth 2 Depth 3 1D climate Regular depths! Irregular uncertain times

Why Bayes? Why? Need joint statements of uncertainty Multiple sources of uncertainty Weak priors eg MTCO at 10000BP Strong Priors eg stochastically smooth Flexible

Why Flexible? Non-normal Multi-modal Zero-Inflation Presence/Absence Hierarchical Missing data Constraints monotonicity in chronologies Stochastically smooth in space time

Why Bayes? Problems New, non-standard, software! Display and publication of data Solutions Use Monte Carlo, modularise, software " Bchron R software, Parnell 2008

Monte Carlo Generate multiple random copies { } at one site of (eg) C = c 1,c 2,...c t,... each probabilistically consistent jointly with data, information Hence form multiple random copies of c t! est marginal dist of c t " c t"20! est marginal dist of diff of max(c t,c t"20 )! est marginal dist of max

Monte Carlo Rejection Sampling 1 Generate multiple random copies of { c 1,c 2,...c t,...}from prior 2 Compute likelihood L(data c t ) 3 Reject c t with high prob if L(data c t ) is low low prob if L(data c t ) is high 4 Hence copies probabilistically consistent jointly with data and prior

Using joint prior information Chronology example: Age at depth 1 5000 ± 500 (Normal model, SD = 250) Age at depth 2 5300 ± 600 (SD = 150) Info : (Depth 2 > Depth 1)! (Age 1 > Age 2) Algorithm: rejection sampling reject if inverted

Using joint prior information Monte Carlo Samples Accept? Depth 1 Depth 2 1 4784 5565 Y 2 5050 5083 Y 3 5092 5297 Y 4 4690 5172 Y 5 4926 5260 Y 6 4924 5118 Y 7 5211 5438 Y With constraint: Age1 Age2 Without Age1 Age2 Mean 4901 5213 4901 5213 SD 196 97 250 150

Sampling, using joint information Post Dist [ c proxies]! = Model Prob[ proxies c] " Prior[c] Prior[c] - Joint prior for c = { c, c,... c,..} 1 2 1 2 Eg if c, c,... c denote climate at times 1,2,... t,... t then c will usually be more like c than 2 3 { c, c,... c,..} is stochastically smooth 1 2 t Prior: time series model eg Random Walk Prior ties things together t c 20

Posterior Dists Probabilistically consistent with data & info Random samples {,,...,..}from c = c1 c2 c t Post Dist [ c proxies]! Likelihood [ proxies c] " Prior[c] = Model Prob[ proxies c] " Prior[c] Inverse model! Forward model " Prior[c] Modules: Decomposing and Integrating

Modules Decomposing the Likelihood via Conditional Independence Typically : at least as an approximation Model Prob[ all proxies c] = Product of Probs[ each proxy type c] = Probs[ pollen c]! Probs[ diatoms c]... " separate modules!!!!

Modules One sample at a time With multivar count data y Compute Prob[ c counts] for all climates for each sample separately Fast approximations, no Monte carlo

Rejecting Climate Histories Algorithm in principle MCMC just efficiency Generate entirely random histories Reject with hi prob those that are improbable, given data&info Reject with lo prob those that are quite probable Accept the remainder

Temporal Smoothing Module Temporal smoothing module Generate random histories for each sample separately Reject with hi prob those that are not smooth Reject with lo prob those that are smooth Accept the remainder MCMC just efficiency

Multiple Cores in Space Sample space-time histories Random movies Consistent with data and models Reject movies with hi prob if with hi prob if not spatio-temporally smooth with lo prob if spatio-temporally smooth But Different and irregular depths Different, irregular and uncertain times

Chronology Module Known depths, uncertain dates Randomly generate dates for each sample consistent with depths & 14 C via rejection sampling (Round to nearest 20 years) info consistent with monotone order

Vision Multiple-proxy types Space-time reconstructions movies arbitrary resolution Noisy if weak signal One model Many modules GCM comparisons

GCM comparisons Different spatio-temporal scales Modelling dynamics Uncertainties