Spatial smoothing using Gaussian processes

Similar documents
Gaussian processes for spatial modelling in environmental health: parameterizing for flexibility vs. computational efficiency

Nonstationary Covariance Functions for Gaussian Process Regression

Spatial Modelling Using a New Class of Nonstationary Covariance Functions

What s for today. Continue to discuss about nonstationary models Moving windows Convolution model Weighted stationary model

NONSTATIONARY GAUSSIAN PROCESSES FOR REGRESSION AND SPATIAL MODELLING

A Framework for Daily Spatio-Temporal Stochastic Weather Simulation

arxiv: v4 [stat.me] 14 Sep 2015

Model Selection for Gaussian Processes

Anomaly Detection and Removal Using Non-Stationary Gaussian Processes

STA 4273H: Sta-s-cal Machine Learning

Nonstationary spatial process modeling Part II Paul D. Sampson --- Catherine Calder Univ of Washington --- Ohio State University

A STATISTICAL TECHNIQUE FOR MODELLING NON-STATIONARY SPATIAL PROCESSES

Hierarchical Modelling for Univariate Spatial Data

Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models

Handbook of Spatial Statistics Chapter 2: Continuous Parameter Stochastic Process Theory by Gneiting and Guttorp

STATISTICAL MODELS FOR QUANTIFYING THE SPATIAL DISTRIBUTION OF SEASONALLY DERIVED OZONE STANDARDS

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Overview of Spatial Statistics with Applications to fmri

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

State Space Representation of Gaussian Processes

Hierarchical Modeling for Univariate Spatial Data

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

An Introduction to GAMs based on penalized regression splines. Simon Wood Mathematical Sciences, University of Bath, U.K.

A Generalized Convolution Model for Multivariate Nonstationary Spatial Processes

A Fused Lasso Approach to Nonstationary Spatial Covariance Estimation

Estimation of non-stationary spatial covariance structure

Outline Lecture 2 2(32)

Measurement Error in Spatial Modeling of Environmental Exposures

Spatial bias modeling with application to assessing remotely-sensed aerosol as a proxy for particulate matter

Integrated Likelihood Estimation in Semiparametric Regression Models. Thomas A. Severini Department of Statistics Northwestern University

Basics of Point-Referenced Data Models

A Generalized Convolution Model for Multivariate Nonstationary Spatial Processes

Spatial smoothing over complex domain

Student-t Process as Alternative to Gaussian Processes Discussion

Multi-resolution models for large data sets

Modelling with smooth functions. Simon Wood University of Bath, EPSRC funded

Hierarchical Modelling for Univariate Spatial Data

random fields on a fine grid

Nonparametric Bayesian Methods (Gaussian Processes)

Power EP. Thomas Minka Microsoft Research Ltd., Cambridge, UK MSR-TR , October 4, Abstract

ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS

Machine Learning Lecture 3

Spatial Statistics with Image Analysis. Lecture L02. Computer exercise 0 Daily Temperature. Lecture 2. Johan Lindström.

Implementing an anisotropic and spatially varying Matérn model covariance with smoothing filters

Gaussian Process Regression

Disease mapping with Gaussian processes

Journal of Multivariate Analysis. Nonstationary modeling for multivariate spatial processes

A new covariance function for spatio-temporal data analysis with application to atmospheric pollution and sensor networking

Inversion Base Height. Daggot Pressure Gradient Visibility (miles)

Multivariate Gaussian Random Fields with SPDEs

Introduction to Spatial Data and Models

Constrained Gaussian processes: methodology, theory and applications

Cross Validation and Maximum Likelihood estimations of hyper-parameters of Gaussian processes with model misspecification

STATISTICAL INTERPOLATION METHODS RICHARD SMITH AND NOEL CRESSIE Statistical methods of interpolation are all based on assuming that the process

Bayesian non-parametric model to longitudinally predict churn

arxiv: v1 [stat.me] 3 Dec 2014

Lecture 1c: Gaussian Processes for Regression

Neutron inverse kinetics via Gaussian Processes

Spatial Process Estimates as Smoothers: A Review

Introduction to Spatial Data and Models

6.435, System Identification

Geostatistics for Seismic Data Integration in Earth Models

Generative Models and Stochastic Algorithms for Population Average Estimation and Image Analysis

STA414/2104 Statistical Methods for Machine Learning II

Nearest Neighbor Gaussian Processes for Large Spatial Data

Statistical Geometry Processing Winter Semester 2011/2012

Econ 2148, fall 2017 Gaussian process priors, reproducing kernel Hilbert spaces, and Splines

Geostatistical Modeling for Large Data Sets: Low-rank methods

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012

Low-rank methods and predictive processes for spatial models

STA 4273H: Statistical Machine Learning

A short introduction to INLA and R-INLA

Flexible Spatio-temporal smoothing with array methods

Flexible Gaussian Processes via Convolution

Some Topics in Convolution-Based Spatial Modeling

Spatio-Temporal Modeling using Kernel Convolution of. Order Based Dependent Dirichlet Process

Fixed-domain Asymptotics of Covariance Matrices and Preconditioning

Lecture 3: Statistical Decision Theory (Part II)

FuncICA for time series pattern discovery

VARIANCE ESTIMATION AND KRIGING PREDICTION FOR A CLASS OF NON-STATIONARY SPATIAL MODELS

The Matrix Reloaded: Computations for large spatial data sets

Modeling Real Estate Data using Quantile Regression

A Complete Spatial Downscaler

A full scale, non stationary approach for the kriging of large spatio(-temporal) datasets

Introduction to Gaussian Processes

Practical Bayesian Optimization of Machine Learning. Learning Algorithms

Spatial Statistics with Image Analysis. Outline. A Statistical Approach. Johan Lindström 1. Lund October 6, 2016

Spatially-Varying Covariance Functions for Nonstationary Spatial Process Modeling

Statistícal Methods for Spatial Data Analysis

Multi-task Learning with Gaussian Processes, with Applications to Robot Inverse Dynamics

A Divide-and-Conquer Bayesian Approach to Large-Scale Kriging

Paul Sampson Doris Damian Peter Guttorp NRCSE. T e c h n i c a l R e p o r t S e r i e s. NRCSE-TRS No January 18, 2001

Introduction to Gaussian Process

2.7 The Gaussian Probability Density Function Forms of the Gaussian pdf for Real Variates

The Bayesian approach to inverse problems

CSci 8980: Advanced Topics in Graphical Models Gaussian Processes

Accounting for Complex Sample Designs via Mixture Models

GAUSSIAN PROCESS REGRESSION

Data are collected along transect lines, with dense data along. Spatial modelling using GMRFs. 200m. Today? Spatial modelling using GMRFs

Transcription:

Spatial smoothing using Gaussian processes Chris Paciorek paciorek@hsph.harvard.edu August 5, 2004 1

OUTLINE Spatial smoothing and Gaussian processes Covariance modelling Nonstationary covariance modelling Eamples with simulated and real data 2

SPATIAL DATA 37 38 39 40 41 latgridvals 38 40 42 44 46 48 109 108 107 106 105 104 103 102 90 85 80 longridvals 75 70 109 37 38 39 40 41 108 107 106 38 40 42 44 46 48 105 104 103 102 Colorado annual precipitation 90 85 80 75 monthly PM10 70 3

SPATIAL DATA AND MODELS Features of spatial data highly interactive, comple surfaces high df:n; heterogeneous coverage possible nonstationarity spatial model is often a component of a larger model spatiotemporal model with covariates possibly replicated Some standard models Thin-plate splines Basis function models with penalized coefficients Gaussian processes (kriging or Bayesian) 4

GAUSSIAN PROCESS DISTRIBUTION Infinite-dimensional joint distribution for f(), X : Eample: f( ) a spatial process, X = R 2 f( ) GP(µ( ), C(, )) Finite-dimensional marginals are normal Types of covariance functions, C(f( i ), f( j )): stationary, isotropic stationary, anisotropic nonstationary X j j i i 5

GAUSSIAN PROCESS DISTRIBUTION Infinite-dimensional joint distribution for f(), X : Eample: f( ) a spatial process, X = R 2 f( ) GP(µ( ), C(, )) Finite-dimensional marginals are normal Types of covariance functions, C(f( i ), f( j )): stationary, isotropic stationary, anisotropic nonstationary X j j i i 6

GAUSSIAN PROCESS DISTRIBUTION Infinite-dimensional joint distribution for f(), X : Eample: f( ) a spatial process, X = R 2 f( ) GP(µ( ), C(, )) Finite-dimensional marginals are normal Types of covariance functions, C(f( i ), f( j )): stationary, isotropic stationary, anisotropic nonstationary X j j i i 7

CORRELATION MODELLING R(f( i ), f( j )) = R ( ) τij 2 τ 2 ij = ( i j ) T ( i j ) ρ 2 Correlation 0.0 0.4 0.8 ρ = 0.2 ρ = 0.04 0.0 0.2 0.4 Distance Correlation scale 0.0 0.4 0.8 0.0 0.4 0.8 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0 4 0 2 4 4 0 2 4 Nonstationarity: correlation a function of location 37 38 39 40 41 109 107 105 103 5.0 5.5 6.0 6.5 7.0 7.5 37 38 39 40 41 109 107 105 103 5.0 5.5 6.0 6.5 7.0 7.5 8

NONSTATIONARY COVARIANCE spatial deformation (Sampson, Guttorp & co-workers) mitures of Gaussian processes (Fuentes) or splines (Wood et al.) kernel convolution method (Higdon, Swall, and Kern) R NS (f( i ), f( j )) = c ij R P k i (u)k j (u)du Guaranteed positive definite Gaussian kernels: ( ) k i (u) ep (u i ) T Σ 1 i (u i ) R NS (f( i ), f( j )) = c ij ep f( ) GP(µ, σ 2 R NS (, )) ( ( i j ) T ( Σi +Σ j 2 ) 1 (i j ) 9

NONSTATIONARY GPS IN 2D Kernel Structure Sample Function y 0.0 0.2 0.4 0.6 0.8 1.0 1.0 z 0.5 0.0 0.5 1.0 1.0 0.8 0.6 y 0.4 0.2 0.2 0.4 1.0 0.8 0.6 0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.0 10

GENERALIZING THE HSK KERNEL METHOD Squared eponential form: ( ( ) ) 2 τij ep c ρ ij ep ( ( ) 1 ( i j ) T Σi +Σ j (i 2 j )) Theorem 1: if R(τ ) is positive definite for R P, P = 1, 2,..., then R NS (f( i ), f( j )) = c ij R ( Q ij ) is positive definite for R P, P = 1, 2,... Theorem 2: Smoothness properties of original stationary correlation retained Specific case of Matérn nonstationary covariance ( ) ν ( ) 1 2 ντij 2 2 ντij K 2 Γ(ν)2 ν 1 ρ ν ρ 1 Γ(ν)2 ν 1 ( 2 νqij ) ν Kν ( 2 νqij ) More fleible form, differentiability not constrained, possible asymptotic advantages 11

A BASIC BAYESIAN SPATIAL MODEL Model: Y i N(f( i ), η 2 ), i R 2 f( ) GP(µ, σ 2 R NS (, ; ν, Σ ( ) )) Let R NS be the nonstationary Matérn correlation Kernels (Σ ) constructed based on stationary GP priors Define multiple kernel matrices, Σ, X Smoothly-varying (element-wise) in covariate space Positive definite MCMC sampling of all parameters including those determining Σ ( ) 12

1 EMPIRICAL ASSESSMENT - SIMULATED FUNCTIONS 0.0 0.0 0.0 0.2 0.4 0.6 0.8 1.0 2 2 1.0 0.8 0.6 0.4 0.2 0.2 0.41 0.6 0.81.0 z 6 4 0.4 0.6 0.6 1 0.8 1.0 0.4 2 0.8 1.0 z 0.5 0.0 0.5 0.3 0.5 0.7 0.9 1 krige tps GAM sgp nsgp fks nn 1.0 0.9 0.8 0.7 0.6 0.5 0.4 0.3 13 2 test MSE 0.02 0.08 0.14 krige tps GAM sgp nsgp fks nn 2 0.0 0.2 0.4 0.6 0.8 1.0 test MSE 0.02 0.06 0.10 0.14

COLORADO PRECIPITATION CHARACTERIZATION 37 38 39 40 41 109 108 107 106 105 104 103 102 14

COLORADO PRECIPITATION SURFACES 109 107 105 103 109 107 105 103 37 38 39 40 41 5.0 5.5 6.0 6.5 7.0 7.5 37 38 39 40 41 stationary form nonstationary form 15 5.0 5.5 6.0 6.5 7.0 7.5

EMPIRICAL ASSESSMENT - COLORADO PRECIPITATION krige krige2 GAM GAM2 sgp nsgp fks nn test MSE 0.05 0.10 0.15 0.20 0.25 0.30 Comments surface is comple relative to number of obs; high df:n ratio bias-variance tradeoff simple two-region nonstationary models offer little improvement 16

FUTURE WORK Further implementation of nonstationary model: Approaches for constraining hyperparameters for better fits and miing Computationally tractable approaches for multiple realizations Ad hoc approaches for fitting the nonstationary covariance structure criteria for choosing a spatial smoother based on data spacing, df:n, signal to noise ratio Fourier basis representation for spatial processes for fast Bayesian estimation with non-normal data or complicated models 17

NONSTATIONARY GPS IN 1D 0.1 0.3 0.5 Kernel standard deviation y 2.0 1.0 0.0 Some sample functions y 1 0 1 2 0.0 0.2 0.4 0.6 0.8 1.0 Some kernels 0.0 0.4 0.8 0.0 0.4 0.8 0.0 0.4 0.8 0.0 0.2 0.4 0.6 0.8 1.0 y 1.5 0.5 0.5 1.5 0.0 0.4 0.8 y 2.0 1.0 0.0 1.0 0.0 0.4 0.8 18

SMOOTHLY-VARYING KERNEL MATRICES Spectral decomposition (R P ): Σ = Γ T Λ Γ in R 2, stationary GP priors on unnormalized eigenvector coordinates (a, b ) and on logarithm of second eigenvalue (λ,2 ) efficient parameterizations of Φ( ) {a( ), b( ), λ 2 ( )} using basis function approimation to a stationary GP a, b λ 2 λ 1 19

DEGREE OF SMOOTHING ρ = 0.2 ρ = 0.04 y 1 0 1 2 y 1 0 1 2 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.1 0.2 0.3 0.4 0.5 0.6 0.7 y 2 1 0 1 2 y 2 1 0 1 2 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.1 0.2 0.3 0.4 0.5 0.6 0.7 20