Lecture 14 Bayesian Models for Spatio-Temporal Data

Similar documents
Lecture 13 Fundamentals of Bayesian Inference

Gaussian Multiscale Spatio-temporal Models for Areal Data

NRCSE. Statistics, data, and deterministic models

Statistics Introduction to Probability

Spatio-temporal precipitation modeling based on time-varying regressions

Statistics Introduction to Probability

Gibbs Sampling in Linear Models #2

Parameter Estimation in the Spatio-Temporal Mixed Effects Model Analysis of Massive Spatio-Temporal Data Sets

Multivariate Normal & Wishart

Name: Climate Date: EI Niño Conditions

Topic Models. Brandon Malone. February 20, Latent Dirichlet Allocation Success Stories Wrap-up

Stefan Liess University of Minnesota Saurabh Agrawal, Snigdhansu Chatterjee, Vipin Kumar University of Minnesota

Lecture 16: Mixtures of Generalized Linear Models

The Climate System and Climate Models. Gerald A. Meehl National Center for Atmospheric Research Boulder, Colorado

El Niño: How it works, how we observe it. William Kessler and the TAO group NOAA / Pacific Marine Environmental Laboratory

The Wishart distribution Scaled Wishart. Wishart Priors. Patrick Breheny. March 28. Patrick Breheny BST 701: Bayesian Modeling in Biostatistics 1/11

Introduction to Bayesian Statistics with WinBUGS Part 4 Priors and Hierarchical Models

New Zealand Climate Update No 223, January 2018 Current climate December 2017

Making rating curves - the Bayesian approach

5.2 Expounding on the Admissibility of Shrinkage Estimators

New Zealand Climate Update No 221, October 2017 Current climate October 2017

PMR Learning as Inference

New Zealand Climate Update No 222, November 2017 Current climate November 2017

Default Priors and Effcient Posterior Computation in Bayesian

Percentage of normal rainfall for August 2017 Departure from average air temperature for August 2017

A Fully Nonparametric Modeling Approach to. BNP Binary Regression

Percentage of normal rainfall for April 2018 Departure from average air temperature for April 2018

ENSO Cycle: Recent Evolution, Current Status and Predictions. Update prepared by Climate Prediction Center / NCEP 24 September 2012

ENSO Cycle: Recent Evolution, Current Status and Predictions. Update prepared by Climate Prediction Center / NCEP 5 August 2013

Chapter 8: Sampling distributions of estimators Sections

2015 Record breaking temperature anomalies

ENSO: Recent Evolution, Current Status and Predictions. Update prepared by: Climate Prediction Center / NCEP 30 October 2017

Gibbs Sampling in Endogenous Variables Models

ENSO Cycle: Recent Evolution, Current Status and Predictions. Update prepared by Climate Prediction Center / NCEP 23 April 2012

Chart Discussion: Fri-09-Nov-2018 (Harvey Stern) Chart Discussion Latest Chart & SAT PIC

ENSO Cycle: Recent Evolution, Current Status and Predictions. Update prepared by Climate Prediction Center / NCEP July 26, 2004

Module 22: Bayesian Methods Lecture 9 A: Default prior selection

Simple Mathematical, Dynamical Stochastic Models Capturing the Observed Diversity of the El Niño Southern Oscillation (ENSO)

Bayesian Methods for Machine Learning

Bayesian linear regression

Appearance of solar activity signals in Indian Ocean Dipole (IOD) phenomena and monsoon climate pattern over Indonesia

ENSO Cycle: Recent Evolution, Current Status and Predictions. Update prepared by Climate Prediction Center / NCEP 15 July 2013

ENSO Cycle: Recent Evolution, Current Status and Predictions. Update prepared by Climate Prediction Center / NCEP 11 November 2013

Exploring Climate Patterns Embedded in Global Climate Change Datasets

MDA WEATHER SERVICES AG WEATHER OUTLOOK. Kyle Tapley-Senior Agricultural Meteorologist May 22, 2014 Chicago, IL

Bayesian spatial hierarchical modeling for temperature extremes

Stat260: Bayesian Modeling and Inference Lecture Date: February 10th, Jeffreys priors. exp 1 ) p 2

STAT 518 Intro Student Presentation

Statistics 220 Bayesian Data Analysis

Spatial Statistics with Image Analysis. Outline. A Statistical Approach. Johan Lindström 1. Lund October 6, 2016

Summary STK 4150/9150

Switching Regime Estimation

ENSO UPDATE By Joseph D Aleo, CCM

ENSO: Recent Evolution, Current Status and Predictions. Update prepared by: Climate Prediction Center / NCEP 9 November 2015

El Niño / Southern Oscillation

Models for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data

Climate Change: the Uncertainty of Certainty

Bayesian Hierarchical Model Characterization of Model Error in Ocean Data Assimilation and Forecasts

Name: Date: Hour: Comparing the Effects of El Nino & La Nina on the Midwest (E4.2c)

Hierarchical Bayesian Spatio-Temporal Models for Population Spread

Statistical Approaches to Combining Models and Observations

November 2002 STA Random Effects Selection in Linear Mixed Models

Chart Discussion: Fri-16-Nov-2018 (Harvey Stern) Chart Discussion Latest Chart & SAT PIC

NIWA Outlook: March-May 2015

Issue Overview: El Nino and La Nina

ENSO Cycle: Recent Evolution, Current Status and Predictions. Update prepared by Climate Prediction Center / NCEP 25 February 2013

Geostatistical Modeling for Large Data Sets: Low-rank methods

THE INFLUENCE OF CLIMATE TELECONNECTIONS ON WINTER TEMPERATURES IN WESTERN NEW YORK INTRODUCTION

Bayesian spatial quantile regression

ENSO Outlook by JMA. Hiroyuki Sugimoto. El Niño Monitoring and Prediction Group Climate Prediction Division Japan Meteorological Agency

Lab 12: El Nino Southern Oscillation

The Matrix Reloaded: Computations for large spatial data sets

Recap on Data Assimilation

New Zealand Climate Update No 226, April 2018 Current climate March 2018

AnuMS 2018 Atlantic Hurricane Season Forecast

NIWA Outlook: April June 2019

Bayesian Statistics. Debdeep Pati Florida State University. April 3, 2017

Approximate Bayesian computation for spatial extremes via open-faced sandwich adjustment

Minimum Message Length Analysis of the Behrens Fisher Problem

Seasonal Climate Watch January to May 2016

Chart Discussion: Fri-24-Aug-2018 (Harvey Stern) Last Week s Rainfall

An Introduction to Physical-Statistical Modelling Using Bayesian Methods

Seasonal Climate Outlook for South Asia (June to September) Issued in May 2014

Bayesian Linear Models

ALASKA REGION CLIMATE OUTLOOK BRIEFING. July 20, 2018 Rick Thoman National Weather Service Alaska Region

Bayesian Linear Models

A new Hierarchical Bayes approach to ensemble-variational data assimilation

South & South East Asian Region:

Statistical techniques for data analysis in Cosmology

Climate Outlook for Pacific Islands for May - October 2015

Forward Problems and their Inverse Solutions

Bayesian nonparametric estimation of finite population quantities in absence of design information on nonsampled units

Pattern Recognition and Machine Learning. Bishop Chapter 2: Probability Distributions

Part 6: Multivariate Normal and Linear Models

Preferred spatio-temporal patterns as non-equilibrium currents

Lesson IV. TOPEX/Poseidon Measuring Currents from Space

Bayesian Regression (1/31/13)

The Matrix Reloaded: Computations for large spatial data sets

lecture 10 El Niño and the Southern Oscillation (ENSO) Part I sea surface height anomalies as measured by satellite altimetry

COMP2610/COMP Information Theory

Transcription:

Lecture 14 Bayesian Models for Spatio-Temporal Data Dennis Sun Stats 253 August 13, 2014

Outline of Lecture 1 Recap of Bayesian Models 2 Empirical Bayes 3 Case 1: Long-Lead Forecasting of Sea Surface Temperatures 4 Case 2: Modeling and Forecasting the Eurasian Dove Invasion 5 Case 3: Mediterranean Surface Vector Winds 6 Wrapping Up the Course

Recap of Bayesian Models Where are we? 1 Recap of Bayesian Models 2 Empirical Bayes 3 Case 1: Long-Lead Forecasting of Sea Surface Temperatures 4 Case 2: Modeling and Forecasting the Eurasian Dove Invasion 5 Case 3: Mediterranean Surface Vector Winds 6 Wrapping Up the Course

Recap of Bayesian Models Bayesian Models Bayesian models differ from frequentist models only in that the parameters θ are random. This allows us to stack priors to create hierarchical models. The Gibbs sampler is a universal algorithm that allows us to efficiently sample from the posterior in hierarchical models.

Recap of Bayesian Models Example: Rater Model

Recap of Bayesian Models Computations The Gibbs sampler provides samples from the posterior. We can use these samples to estimate the posterior distribution (e.g., histogram) or the posterior mean. JAGS will automatically simulate from the posterior.

Recap of Bayesian Models Results for the Rater Model I put priors on γ k and δ l so that I could do inference on them. Posterior of gamma, student 1 Posterior of gamma, student 2 Frequency 0 500 1000 1500 Frequency 0 500 1000 1500 3 2 1 0 2.5 1.5 0.5 0.5 1.0 Posterior of gamma, student 3 Posterior of gamma, student 4 Frequency 0 500 1000 1500 Frequency 0 500 1000 1500 1 0 1 2 1.0 0.0 0.5 1.0 1.5 2.0 2.5

Empirical Bayes Where are we? 1 Recap of Bayesian Models 2 Empirical Bayes 3 Case 1: Long-Lead Forecasting of Sea Surface Temperatures 4 Case 2: Modeling and Forecasting the Eurasian Dove Invasion 5 Case 3: Mediterranean Surface Vector Winds 6 Wrapping Up the Course

Empirical Bayes What would a frequentist do? Remember: In the Bayesian framework, to perform inference on a parameter, you must put a prior on it. Otherwise, you must specify its value beforehand. Example: Suppose there are 3 sections of a class taught by different professors. Let y ij denote the final exam score of student j in class i. y ij θi N(θ i, σ 2 ) (instructor effect) θ i N(µ, τ 2 ) Maybe we can instead try to estimate hyperparameters such as µ and τ 2 from the data, then use N(ˆµ, ˆτ 2 ) as the prior. What would our estimates of the parameters be then?

Empirical Bayes Empirical Bayes The idea of estimating hyperparameters from the data is called empirical Bayes. It is ultimately a frequentist method because we don t need to specify a prior on the hyperparameters we estimate! We get hierarchical models without subjective priors! Is this too good to be true? What are the challenges of doing this in practice?

Case 1: Long-Lead Forecasting of Sea Surface Temperatures Where are we? 1 Recap of Bayesian Models 2 Empirical Bayes 3 Case 1: Long-Lead Forecasting of Sea Surface Temperatures 4 Case 2: Modeling and Forecasting the Eurasian Dove Invasion 5 Case 3: Mediterranean Surface Vector Winds 6 Wrapping Up the Course

Case 1: Long-Lead Forecasting of Sea Surface Temperatures Problem Setup El Niño is characterized by warmer sea surface temperatures (SST) in the equatorial Pacific Ocean. Therefore, to predict El Niño, one needs to forecast the SST several months in advance. We observe the average monthly SST at different locations in the Pacific: def z t = (z t (s 1 ),..., z t (s m )) Want to forecast SSTs τ months in advance: z T +τ. This analysis is taken from Berliner, Wikle, and Cressie (2000).

Case 1: Long-Lead Forecasting of Sea Surface Temperatures First Model: Linear Dynamical Model State model: y t+τ = Φy t + ɛ t ɛ t N(0, Σ) Data model: z t = Ay t + δ t δ t N(0, σ 2 I) A contains the first k principal components of the empirical covariance matrix over the spatial locations. y t represents weights on those PCs. Unknown parameters are Φ, Σ, σ 2. Need to put priors on all of these: vec(φ) N(vec(0.9I), 100I) ( ) Σ 1 1 Wishart 100(k 1) I, k 1 σ 2 InverseGaussian(0.1, 100)

Case 1: Long-Lead Forecasting of Sea Surface Temperatures First Model: Linear Dynamical Model 1997 1998 2.5% and 97.5% quantiles of posterior

Case 1: Long-Lead Forecasting of Sea Surface Temperatures Second Model: Non-Linear Model State model: y t+τ = Φ t y t + ɛ t ɛ t N(0, Σ) Data model: z t = Ay t + δ t δ t N(0, σ 2 I) Allow Φ t = Φ(I t, J t ) to vary with time. I t classifies the current regime as cool, normal, or warm. Obtained by thresholding the Southern Oscillation Index (SOI): 0 if SOI t < low threshold I t = 1 if SOI t in between 2 if SOI t > upper threshold J t is obtained by similarly thresholding a latent process W t : W t β, τ 2 N(x T t β, τ 2 )

Case 1: Long-Lead Forecasting of Sea Surface Temperatures Second Model: Non-Linear Model 1997 1998 2.5% and 97.5% quantiles of posterior

Case 2: Modeling and Forecasting the Eurasian Dove Invasion Where are we? 1 Recap of Bayesian Models 2 Empirical Bayes 3 Case 1: Long-Lead Forecasting of Sea Surface Temperatures 4 Case 2: Modeling and Forecasting the Eurasian Dove Invasion 5 Case 3: Mediterranean Surface Vector Winds 6 Wrapping Up the Course

Case 2: Modeling and Forecasting the Eurasian Dove Invasion Problem Setup The Eurasian Collared Dove (ECD) was first observed in North America in the 80s and are now spreading quickly throughout the continent. They pose a threat to native ecosystems, so we would like to forecast their spread. Observe z t (s i ): number of doves observed at location s i. This analysis is taken from Hooten, Wikle, Dorazio, and Royle (2007).

Case 2: Modeling and Forecasting the Eurasian Dove Invasion

Case 2: Modeling and Forecasting the Eurasian Dove Invasion The Model Data model: z t (s i ) y t (s i ), π Bin(y t (s i ), π) y t λ t Pois(Hλ t ) State model: λ t = B(α)G(λ t 1 ; θ)λ t 1 π is the probability of observing an animal. Not estimable from this data alone, but the authors estimated it using data collected on a related species. G is a diagonal matrix that models growth, while B models dispersion. The authors go on to put priors on α and θ.

Case 2: Modeling and Forecasting the Eurasian Dove Invasion Results Posterior means for years in sample

Case 2: Modeling and Forecasting the Eurasian Dove Invasion Results Posterior means for future years

Case 3: Mediterranean Surface Vector Winds Where are we? 1 Recap of Bayesian Models 2 Empirical Bayes 3 Case 1: Long-Lead Forecasting of Sea Surface Temperatures 4 Case 2: Modeling and Forecasting the Eurasian Dove Invasion 5 Case 3: Mediterranean Surface Vector Winds 6 Wrapping Up the Course

Case 3: Mediterranean Surface Vector Winds Problem Setup The goal is to predict wind speeds and directions at different locations over the Mediterranean. (x t (s i ), y t (s i )) is the vector indicating the wind speed in the x- and y-directions. We assume that x t and y t are noisy measurements of underlying states u t and v t.

Case 3: Mediterranean Surface Vector Winds Data on February 1, 2005

Case 3: Mediterranean Surface Vector Winds A Physical State Model The state model is movtivated by the Rayleigh friction equations: u t fv = 1 p ρ 0 x γu v t + fu = 1 p ρ 0 y γv. where u, v are the east-west and north-south components, p is the sea-level pressure, and the rest are (unknown) parameters. An approximate solution to these equations is given by f p u ρ 0(f 2 + γ 2 ) y γ p ρ 0(f 2 + γ 2 ) x 2 γ u ρ 0(f 2 + γ 2 ) t 1 2 p ρ 0(f 2 + γ 2 ) x t f p v ρ 0(f 2 + γ 2 ) x γ p ρ 0(f 2 + γ 2 ) y 2 γ v ρ 0(f 2 + γ 2 ) t 1 ρ 0(f 2 + γ 2 ) We can discretize this as follows: u t = a 1 D y p t + a 2 D x p t + a 3 u t 1 + a 4 D x p t 1 v t = b 1 D x p t + b 2 D y p t + b 3 v t 1 + b 4 D y p t 1 2 p y t

Case 3: Mediterranean Surface Vector Winds Results for February 2, 2005

Wrapping Up the Course Where are we? 1 Recap of Bayesian Models 2 Empirical Bayes 3 Case 1: Long-Lead Forecasting of Sea Surface Temperatures 4 Case 2: Modeling and Forecasting the Eurasian Dove Invasion 5 Case 3: Mediterranean Surface Vector Winds 6 Wrapping Up the Course

Wrapping Up the Course A Few Last Thoughts The analysis of spatial and temporal data is really the analysis of correlated data. Model the mean function, use spatial and temporal methods to model the residual. There are two main ways to capture correlations: model the correlation directly (e.g., kriging) and via autoregressions. Many methods that work well for time series (e.g., Kalman filter) break down in space because the data are no longer ordered. Thanks for a great quarter!