Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Similar documents
Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

On Gaussian Process Models for High-Dimensional Geostatistical Datasets

Hierarchical Modeling for Spatio-temporal Data

Bayesian Modeling and Inference for High-Dimensional Spatiotemporal Datasets

Nearest Neighbor Gaussian Processes for Large Spatial Data

Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets

Hierarchical Modeling for Multivariate Spatial Data

Hierarchical Modelling for Univariate Spatial Data

Modelling Multivariate Spatial Data

Hierarchical Modelling for Univariate Spatial Data

Bayesian Dynamic Modeling for Space-time Data in R

Introduction to Geostatistics

Bayesian Linear Models

Spatial bias modeling with application to assessing remotely-sensed aerosol as a proxy for particulate matter

Principles of Bayesian Inference

Fusing point and areal level space-time data. data with application to wet deposition

Model Assessment and Comparisons

Principles of Bayesian Inference

Bayesian Linear Models

Lecture 23. Spatio-temporal Models. Colin Rundel 04/17/2017

spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models

Hierarchical Modeling for non-gaussian Spatial Data

Bayesian Linear Regression

Hierarchical Modelling for non-gaussian Spatial Data

Hierarchical Modelling for Multivariate Spatial Data

Introduction to Spatial Data and Models

Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields

Hierarchical Modeling for Univariate Spatial Data

Introduction to Spatial Data and Models

Gaussian predictive process models for large spatial data sets.

Parameter Estimation in the Spatio-Temporal Mixed Effects Model Analysis of Massive Spatio-Temporal Data Sets

An Additive Gaussian Process Approximation for Large Spatio-Temporal Data

Bayesian hierarchical space time model applied to highresolution hindcast data of significant wave height

Hierarchical Modeling for Univariate Spatial Data

Gaussian processes for spatial modelling in environmental health: parameterizing for flexibility vs. computational efficiency

A full scale, non stationary approach for the kriging of large spatio(-temporal) datasets

A Note on the comparison of Nearest Neighbor Gaussian Process (NNGP) based models

A Complete Spatial Downscaler

Principles of Bayesian Inference

Climatography of the United States No

Spatio-temporal modelling of daily air temperature in Catalonia

Data Integration Model for Air Quality: A Hierarchical Approach to the Global Estimation of Exposures to Ambient Air Pollution

Bayesian Hierarchical Modelling: Incorporating spatial information in water resources assessment and accounting

Geostatistical Modeling for Large Data Sets: Low-rank methods

arxiv: v4 [stat.me] 14 Sep 2015

Bayesian SAE using Complex Survey Data Lecture 4A: Hierarchical Spatial Bayes Modeling

Spatial smoothing using Gaussian processes

NRCSE. Misalignment and use of deterministic models

Bayesian hierarchical modelling for data assimilation of past observations and numerical model forecasts

A short introduction to INLA and R-INLA

Bayesian Linear Models

Spatio-temporal precipitation modeling based on time-varying regressions

Nonstationary spatial process modeling Part II Paul D. Sampson --- Catherine Calder Univ of Washington --- Ohio State University

Hierarchical Modeling for Spatial Data

On Some Computational, Modeling and Design Issues in Bayesian Analysis of Spatial Data

A Divide-and-Conquer Bayesian Approach to Large-Scale Kriging

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

A STATISTICAL TECHNIQUE FOR MODELLING NON-STATIONARY SPATIAL PROCESSES

Statistics for extreme & sparse data

Climatography of the United States No

Climatography of the United States No

Space-time downscaling under error in computer model output

Some notes on efficient computing and setting up high performance computing environments

Statistícal Methods for Spatial Data Analysis

Introduction to Spatial Data and Models

Models for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Space-Time Data fusion Under Error in Computer Model Output: An Application to Modeling Air Quality

Wrapped Gaussian processes: a short review and some new results

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Efficient Posterior Inference and Prediction of Space-Time Processes Using Dynamic Process Convolutions

Bruno Sansó. Department of Applied Mathematics and Statistics University of California Santa Cruz bruno

Climatography of the United States No

Climatography of the United States No

Practical Bayesian Modeling and Inference for Massive Spatial Datasets On Modest Computing Environments

Climatography of the United States No

Climatography of the United States No

Climatography of the United States No

Transcription:

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes Andrew O. Finley 1 and Sudipto Banerjee 2 1 Department of Forestry & Department of Geography, Michigan State University, Lansing Michigan, U.S.A. 2 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. March 8, 2013 1 NEON Applied Bayesian Regression Spatio-temporal Workshop

Outline 1 Opportunities and challenges in spatio-temporal data analysis 2 Typical examples 3 Dynamic space-time modeling 4 Dimension reduction and predictive spatial process models 5 Data analysis 6 Extensions to multivariate setting 2 NEON Applied Bayesian Regression Spatio-temporal Workshop

Opportunities and challenges in spatio-temporal data analysis Data sets often exhibit: Challenges in spatio-temporal data analysis missingness and misalignment among outcomes space- and time-varying impact of covariates complex residual dependence structures nonstationarity among multiple outcomes across locations unknown time and perhaps space lags between outcomes and covariates 3 NEON Applied Bayesian Regression Spatio-temporal Workshop

Opportunities and challenges in spatio-temporal data analysis Data sets often exhibit: Challenges in spatio-temporal data analysis missingness and misalignment among outcomes space- and time-varying impact of covariates complex residual dependence structures nonstationarity among multiple outcomes across locations unknown time and perhaps space lags between outcomes and covariates We seek modeling frameworks that: incorporate many sources of space and time indexed data accommodate structured residual dependence propagate uncertainty through to predictions scale to effectively exploit information in massive datasets 3 NEON Applied Bayesian Regression Spatio-temporal Workshop

Typical examples Discrete-time, continuous-space settings Data is viewed as arising from a time series of spatial processes. For example: US Environmental Protection Agency s Air Quality System which reports pollutants mean, minimum, and maximum at 8 and 24 hour intervals soil sensor array that measures moisture and CO 2 at 1 minute intervals climate model outputs of weather variables generated on hourly or daily intervals remotely sensed landuse/landcover change recorded at annual or decadal time steps 4 NEON Applied Bayesian Regression Spatio-temporal Workshop

Typical examples NCDC weather station data from North-eastern US 1530 stations that recorded temperature between 1/1/2000 1/1/2005. The outcome variable is mean monthly temperature. Northing (km) 0 1000 2000 3000 Elevation (m) 1000 800 600 400 200 0 0 1000 2000 3000 Easting (km) 5 NEON Applied Bayesian Regression Spatio-temporal Workshop

Typical examples NCDC weather station data from North-eastern US 1530 stations that recorded temperature between 1/1/2000 1/1/2005. The outcome variable is mean monthly temperature. Northing (km) 0 1000 2000 3000 Jan 2000 Temperature degrees C 20 10 0 10 20 0 1000 2000 3000 5 NEON Applied Bayesian Regression Spatio-temporal Workshop

Typical examples NCDC weather station data from North-eastern US 1530 stations that recorded temperature between 1/1/2000 1/1/2005. The outcome variable is mean monthly temperature. Northing (km) 0 1000 2000 3000 Feb 2000 Temperature degrees C 20 10 0 10 20 0 1000 2000 3000 5 NEON Applied Bayesian Regression Spatio-temporal Workshop

Typical examples NCDC weather station data from North-eastern US 1530 stations that recorded temperature between 1/1/2000 1/1/2005. The outcome variable is mean monthly temperature. Northing (km) 0 1000 2000 3000 Mar 2000 Temperature degrees C 20 10 0 10 20 0 1000 2000 3000 5 NEON Applied Bayesian Regression Spatio-temporal Workshop

Typical examples NCDC weather station data from North-eastern US 1530 stations that recorded temperature between 1/1/2000 1/1/2005. The outcome variable is mean monthly temperature. Northing (km) 0 1000 2000 3000 Apr 2000 Temperature degrees C 20 10 0 10 20 0 1000 2000 3000 5 NEON Applied Bayesian Regression Spatio-temporal Workshop

Typical examples NCDC weather station data from North-eastern US 1530 stations that recorded temperature between 1/1/2000 1/1/2005. The outcome variable is mean monthly temperature. Northing (km) 0 1000 2000 3000 May 2000 Temperature degrees C 20 10 0 10 20 0 1000 2000 3000 5 NEON Applied Bayesian Regression Spatio-temporal Workshop

Typical examples NCDC weather station data from North-eastern US 1530 stations that recorded temperature between 1/1/2000 1/1/2005. The outcome variable is mean monthly temperature. Northing (km) 0 1000 2000 3000 Jun 2000 Temperature degrees C 20 10 0 10 20 0 1000 2000 3000 5 NEON Applied Bayesian Regression Spatio-temporal Workshop

Typical examples NCDC weather station data from North-eastern US 1530 stations that recorded temperature between 1/1/2000 1/1/2005. The outcome variable is mean monthly temperature. Northing (km) 0 1000 2000 3000 Jul 2000 Temperature degrees C 20 10 0 10 20 0 1000 2000 3000 5 NEON Applied Bayesian Regression Spatio-temporal Workshop

Typical examples NCDC weather station data from North-eastern US 1530 stations that recorded temperature between 1/1/2000 1/1/2005. The outcome variable is mean monthly temperature. Northing (km) 0 1000 2000 3000 Aug 2000 Temperature degrees C 20 10 0 10 20 0 1000 2000 3000 5 NEON Applied Bayesian Regression Spatio-temporal Workshop

Typical examples NCDC weather station data from North-eastern US Of the n N t = 1530 station 61 months = 93,330 possible observations, 15,107 had missing outcomes. Station count 0 200 400 600 800 1000 0 20 40 60 80 100 Percent missing temperature observations 6 NEON Applied Bayesian Regression Spatio-temporal Workshop

Dynamic space-time modeling Dynamic space-time modeling West and Harrison (1997); Tonellato; (1997); Stroud et al. (2001); Gelfand et al. (2005);... Measurement equation: Regression specification + space-time process + serially and spatially uncorrelated zero-centered Gaussian disturbances as measurement error Transition equation: Markovian dependence in time temporal component: akin to usual state-space model space-time component: modeled using Gaussian processes 7 NEON Applied Bayesian Regression Spatio-temporal Workshop

Dynamic space-time modeling Dynamic space-time model y t (s) = x t (s) β t + u t (s) + ɛ t (s), t = 1, 2,..., N t 8 NEON Applied Bayesian Regression Spatio-temporal Workshop

Dynamic space-time modeling Dynamic space-time model y t (s) = x t (s) β t + u t (s) + ɛ t (s), β t = β t 1 + η t, η t iid N(0, Σ η ) ; t = 1, 2,..., N t 8 NEON Applied Bayesian Regression Spatio-temporal Workshop

Dynamic space-time modeling Dynamic space-time model y t (s) = x t (s) β t + u t (s) + ɛ t (s), β t = β t 1 + η t, u t (s) = u t 1 (s) + w t (s), η t iid N(0, Σ η ) ; t = 1, 2,..., N t w t (s) ind GP (0, C t (, θ t )), 8 NEON Applied Bayesian Regression Spatio-temporal Workshop

Dynamic space-time modeling Dynamic space-time model y t (s) = x t (s) β t + u t (s) + ɛ t (s), β t = β t 1 + η t, u t (s) = u t 1 (s) + w t (s), ɛ t (s) ind N(0, τ 2 t ). η t iid N(0, Σ η ) ; Assume β 0 N(m 0, Σ 0 ) and u 0 (s) 0 t = 1, 2,..., N t w t (s) ind GP (0, C t (, θ t )), 8 NEON Applied Bayesian Regression Spatio-temporal Workshop

Dynamic space-time modeling Consider settings where the number of locations is large but the number of time points is manageable. For any S = {s 1, s 2,..., s n } R d. Then, w t = (w t (s 1 ), w t (s 2 ),..., w t (s n )) N(0, C t (θ t )), where C t (θ t ) = {C t (s i, s j ; θ t )}. 9 NEON Applied Bayesian Regression Spatio-temporal Workshop

Dynamic space-time modeling Consider settings where the number of locations is large but the number of time points is manageable. For any S = {s 1, s 2,..., s n } R d. Then, w t = (w t (s 1 ), w t (s 2 ),..., w t (s n )) N(0, C t (θ t )), where C t (θ t ) = {C t (s i, s j ; θ t )}. Model parameter estimation requires iterative matrix decomposition for C t (θ t ) 1 and determinant, i.e., computation of order O(n 3 ) for every MCMC iteration! Conducting full Bayesian inference is computationally onerous! 9 NEON Applied Bayesian Regression Spatio-temporal Workshop

Dimension reduction and predictive spatial process models Modeling large spatial datasets Covariance tapering (Furrer et al. 2006; Zhang and Du 2007; Du et al. 2009; Kaufman et al., 2009) Spectral domain: (Fuentes 2007; Paciorek 2007) Approximate using MRFs: INLA (Rue et al. 2009) Approximations using cond. indep. (Vecchia 1988; Stein et al. 2004) Low-rank approaches (Wahba 1990; Higdon 2002; Lin et al. 2000; Kamman & Wand, 2003; Paciorek 2007; Rasmussen & Williams 2006; Tokdar et al. 2007, 2011; Stein 2007, 2008; Cressie & Johannesson 2008; Banerjee et al. 2008, 2011; Finley et al. 2008 2013) 10 NEON Applied Bayesian Regression Spatio-temporal Workshop

Dimension reduction and predictive spatial process models Predictive process models Wahba (1990), Lin et al. (2000), Rasmussen and Williams (2006), Tokdar (2007, 2011) and Banerjee, Finley et al. (2008 2011). Parent process: w t (s) GP (0, C t ( ; θ t )) 11 NEON Applied Bayesian Regression Spatio-temporal Workshop

Dimension reduction and predictive spatial process models Predictive process models Wahba (1990), Lin et al. (2000), Rasmussen and Williams (2006), Tokdar (2007, 2011) and Banerjee, Finley et al. (2008 2011). Parent process: w t (s) GP (0, C t ( ; θ t )) Knots : S = {s 1, s 2,..., s n} with n << n w t = (w t (s 1), w t (s 2),..., w t (s n )) N(0, C t (θ t )), where C t (θ t ) = { C t (s i, s j ; θ t) } w t (s) = E[w t (s) w t ] = c t (s; θ t ) C t (θ t ) 1 w t, where c t (s; θ t ) = { C t (s, s j ; θ t) } Replace u t (s) by ũ t (s) = ũ t 1 (s) + w t (s). Now we only have order O(n 3 ) complexity for every MCMC iteration. 11 NEON Applied Bayesian Regression Spatio-temporal Workshop

y Dimension reduction and predictive spatial process models y 0.5 0 2 1 1.5 1.5 3.5 2.5 3 2 3.5 1.5 3 6 5.5 4.5 5 5.5 3.5 4 3.5 2 2.5 2.5 2 3 4 4.5 3.5 1.5 3 2.5 2.5 5 2 2 Illustration of biased estimates of variance parameters from the predictive process (synthetic data). Parent process surface Predictive process surface 0.0 0.2 0.4 0.6 0.8 1.0 4 2 0 2 4 6 0.0 0.2 0.4 0.6 0.8 1.0 0.5 2.5 0 1 2 3 4 5 6 0.0 0.2 0.4 0.6 0.8 1.0 x 0.0 0.2 0.4 0.6 0.8 1.0 x 12 NEON Applied Bayesian Regression Spatio-temporal Workshop

Dimension reduction and predictive spatial process models Predictive process model estimates (synthetic data) σ 2 τ 2 True parameter values σ 2 =5 and τ 2 =5. Increasing knot intensity reduces bias. 13 NEON Applied Bayesian Regression Spatio-temporal Workshop

Dimension reduction and predictive spatial process models Bias-adjusted predictive process w ɛ (s) ind N( w(s), C(s, s; θ) c(s; θ) C (θ) 1 c(s; θ)). 14 NEON Applied Bayesian Regression Spatio-temporal Workshop

Dimension reduction and predictive spatial process models Bias-adjusted predictive process w ɛ (s) ind N( w(s), C(s, s; θ) c(s; θ) C (θ) 1 c(s; θ)). Rectified predictive process model estimates (synthetic data) σ 2 τ 2 14 NEON Applied Bayesian Regression Spatio-temporal Workshop

Dimension reduction and predictive spatial process models Bias-adjusted predictive process w ɛ (s) ind N( w(s), C(s, s; θ) c(s; θ) C (θ) 1 c(s; θ)). Rectified predictive process model estimates (synthetic data) σ 2 τ 2 So, in our current setting replace u t (s) by ũ t (s) = ũ t 1 (s) + w t (s) + ɛ t (s), where again, ɛ(s) ind N(0, C t (s, s; θ) c t (s; θ t ) C t (θ t ) 1 c t (s; θ t )). 14 NEON Applied Bayesian Regression Spatio-temporal Workshop

Dimension reduction and predictive spatial process models Hierarchical dynamic predictive process models N t N t p (θ t) IG ( τt 2 ) a τ, b τ N(β0 m 0, Σ 0) IW (Σ η r η, Υ η) t=1 t=1 N t N t N(β t β t 1, Σ η) N(w t 0, C t (θ t)) t=1 t=1 N t N(ũ t ũ t 1 + c t(θ t) C t (θ t) 1 w t, D t) t=1 N t N(y t X tβ t + ũ t, τt 2 I n). t=1 15 NEON Applied Bayesian Regression Spatio-temporal Workshop

Dimension reduction and predictive spatial process models Selection of knots How do we choose S Knot selection Regular grid? More knots near locations we have sampled more? formal spatial design paradigm: maximize information metrics geometric considerations: space-filling designs; various clustering algorithms adaptive modeling of knots using point processes, see, e.g., Guhaniyogi et al. (2011) 16 NEON Applied Bayesian Regression Spatio-temporal Workshop

Dimension reduction and predictive spatial process models Selection of knots How do we choose S Knot selection Regular grid? More knots near locations we have sampled more? formal spatial design paradigm: maximize information metrics geometric considerations: space-filling designs; various clustering algorithms adaptive modeling of knots using point processes, see, e.g., Guhaniyogi et al. (2011) In practice: compare performance of estimation of range and smoothness by varying knot intensity usually inference is quite robust to S. more important to capture loss of variability due to low rank approximation 16 NEON Applied Bayesian Regression Spatio-temporal Workshop

Data analysis Candidate models: NCDC weather station data analysis non-spatial, i.e., ũ t s = 0, but temporally dependent β t s full model with spatial predictive process using 5, 10, 25, and 50 knot intensities Assessment: model fit minimum posterior predictive loss approach predictive performance based on 1000 holdout observations RMSPE and CI coverage rate 17 NEON Applied Bayesian Regression Spatio-temporal Workshop

Data analysis Elevation covariate Northing (km) 0 1000 2000 3000 Elevation (m) 1000 800 600 400 200 0 0 1000 2000 3000 Easting (km) 18 NEON Applied Bayesian Regression Spatio-temporal Workshop

Data analysis Knot locations Northing (km) 0 1000 2000 3000 Stations Predictive process knots 0 1000 2000 3000 Easting (km) 19 NEON Applied Bayesian Regression Spatio-temporal Workshop

Data analysis Dynamic estimation of the intercept β 0,t (25 knot model) Beta0 5 0 5 10 15 20 25 2000 2001 2002 2003 2004 2005 Years 20 NEON Applied Bayesian Regression Spatio-temporal Workshop

Data analysis Dynamic estimation of the impact of elevation β 1,t (25 knot model) Beta1 0.006 0.0045 0.003 2000 2001 2002 2003 2004 2005 Years 21 NEON Applied Bayesian Regression Spatio-temporal Workshop

Data analysis Non-spatial Predictive process knots 25 Σ η [1,1] 24.41 (17.97, 36.65) 23.61 (16.83, 35.45) Σ η [2,1] 0.001 (-0.050, 0.054) 0.001 (-0.050, 0.055) Σ η [2,2] 0.002 (0.001, 0.002) 0.002 (0.001, 0.002) Σ η parameter credible intervals, 50 (2.5, 97.5) percentiles, for the non-spatial and 25 knot predictive process models. 22 NEON Applied Bayesian Regression Spatio-temporal Workshop

Data analysis Dynamic estimation of spatial range θ 1 (25 knot model) Spatial range (km) 200 700 1200 2000 2001 2002 2003 2004 2005 Years 23 NEON Applied Bayesian Regression Spatio-temporal Workshop

Data analysis Dynamic estimation of partial sill θ 2 (25 knot model) sigmaˆ2 0 1 2 3 4 5 6 7 2000 2001 2002 2003 2004 2005 Years 24 NEON Applied Bayesian Regression Spatio-temporal Workshop

Data analysis Dynamic estimation of nugget τ 2 t (25 knot model) tauˆ2 0.03 0.05 0.07 2000 2001 2002 2003 2004 2005 Years 25 NEON Applied Bayesian Regression Spatio-temporal Workshop

Data analysis Fitted values Non-spatial model Full model (25 knots) Mean 20 10 0 10 20 30 Mean 20 10 0 10 20 30 20 10 0 10 20 30 Observed Temp. 20 10 0 10 20 30 Observed Temp. 26 NEON Applied Bayesian Regression Spatio-temporal Workshop

Data analysis Predictive process knots Non-spatial 5 10 25 50 G 699174 282 162 75 80 P 699036 8333 6829 6022 6030 D 1398210 8616 6992 6097 6110 RMSPE 8.4 0.54 0.45 0.31 0.29 CI cover % 96 97 97 96 97 Using a squared error loss function, G lack of fit, P predictive variance, D = G + P lower is better (Gelfand and Ghosh, 1998). 27 NEON Applied Bayesian Regression Spatio-temporal Workshop

Data analysis Cross-validation and temporal imputation Temp. 10 0 10 20 30 2000 2001 2002 2003 2004 2005 Years Temp. 10 0 10 20 2000 2001 2002 2003 2004 2005 Years 28 NEON Applied Bayesian Regression Spatio-temporal Workshop

Data analysis 2003 2004 28 NEON Applied Bayesian Regression Spatio-temporal Workshop

Extensions to multivariate setting Extending this model to consider m outcome variables, e.g., temperature and precipitation, at each location. Simply choose your favorite approach to modeling w t (s) = (w t,1 (s), w t,2 (s),..., w t,m (s)). LMC w t (s) Multivariate Matérn Constructive Kernel Convolution Characterize Latent Dimension 29 NEON Applied Bayesian Regression Spatio-temporal Workshop

Summary Extensions to multivariate setting Challenge to meet modeling needs: predictive process models for large datasets use model-based adjustment to compensate for over-smoothing Future work: extend to multivariate response and spatially-varying coefficients make these models available in the R package spbayes This work was supported by NSF Grant DMS 0706870. 30 NEON Applied Bayesian Regression Spatio-temporal Workshop