Geostatistical Modeling for Large Data Sets: Low-rank methods

Similar documents
Low-rank methods and predictive processes for spatial models

A full scale, non stationary approach for the kriging of large spatio(-temporal) datasets

Gaussian predictive process models for large spatial data sets.

Hierarchical Modeling for Spatio-temporal Data

On Gaussian Process Models for High-Dimensional Geostatistical Datasets

Nonparametric Bayesian Methods

Karhunen-Loeve Expansion and Optimal Low-Rank Model for Spatial Processes

Chapter 4 - Fundamentals of spatial processes Lecture notes

Hierarchical Modelling for Univariate Spatial Data

Parameter Estimation in the Spatio-Temporal Mixed Effects Model Analysis of Massive Spatio-Temporal Data Sets

Space-time data. Simple space-time analyses. PM10 in space. PM10 in time

A Note on the comparison of Nearest Neighbor Gaussian Process (NNGP) based models

An Additive Gaussian Process Approximation for Large Spatio-Temporal Data

STATISTICAL INTERPOLATION METHODS RICHARD SMITH AND NOEL CRESSIE Statistical methods of interpolation are all based on assuming that the process

An Introduction to Spatial Statistics. Chunfeng Huang Department of Statistics, Indiana University

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Hierarchical Modeling for Multivariate Spatial Data

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Hierarchical Modelling for Univariate Spatial Data

Bayesian Modeling and Inference for High-Dimensional Spatiotemporal Datasets

spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Hierarchical Modeling for Univariate Spatial Data

Groundwater permeability

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Latent Gaussian Processes and Stochastic Partial Differential Equations

Regionalization of Multiscale Spatial Processes Using a Criterion for Spatial Aggregation Error. Christopher K. Wikle

Hierarchical Modelling for Multivariate Spatial Data

Bayesian Dynamic Modeling for Space-time Data in R

Bayesian SAE using Complex Survey Data Lecture 4A: Hierarchical Spatial Bayes Modeling

Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields

A Divide-and-Conquer Bayesian Approach to Large-Scale Kriging

Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets

Asymptotic Multivariate Kriging Using Estimated Parameters with Bayesian Prediction Methods for Non-linear Predictands

Models for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data

Bayesian spatial hierarchical modeling for temperature extremes

Hierarchical Low Rank Approximation of Likelihoods for Large Spatial Datasets

A Framework for Daily Spatio-Temporal Stochastic Weather Simulation

A Bayesian Spatio-Temporal Geostatistical Model with an Auxiliary Lattice for Large Datasets

Chapter 4 - Fundamentals of spatial processes Lecture notes

CBMS Lecture 1. Alan E. Gelfand Duke University

Nearest Neighbor Gaussian Processes for Large Spatial Data

Spatial Backfitting of Roller Measurement Values from a Florida Test Bed

Nonstationary spatial process modeling Part II Paul D. Sampson --- Catherine Calder Univ of Washington --- Ohio State University

COVARIANCE APPROXIMATION FOR LARGE MULTIVARIATE SPATIAL DATA SETS WITH AN APPLICATION TO MULTIPLE CLIMATE MODEL ERRORS 1

Model Selection for Geostatistical Models

Hierarchical Modelling for Univariate and Multivariate Spatial Data

Learning gradients: prescriptive models

A full-scale approximation of covariance functions for large spatial data sets

Statistical modeling of MODIS cloud data using the spatial random effects model

A Multi-resolution Gaussian process model for the analysis of large spatial data sets.

ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS

MA 575 Linear Models: Cedric E. Ginestet, Boston University Mixed Effects Estimation, Residuals Diagnostics Week 11, Lecture 1

BAYESIAN MODEL FOR SPATIAL DEPENDANCE AND PREDICTION OF TUBERCULOSIS

Linear Regression. In this problem sheet, we consider the problem of linear regression with p predictors and one intercept,

Spatial Extremes in Atmospheric Problems

Some notes on efficient computing and setting up high performance computing environments

Short Questions (Do two out of three) 15 points each

Fast Direct Methods for Gaussian Processes

Hierarchical Matrices. Jon Cockayne April 18, 2017

Gaussian Multiscale Spatio-temporal Models for Areal Data

TAMS39 Lecture 2 Multivariate normal distribution

A BAYESIAN SPATIO-TEMPORAL GEOSTATISTICAL MODEL WITH AN AUXILIARY LATTICE FOR LARGE DATASETS

A Sequential Split-Conquer-Combine Approach for Analysis of Big Spatial Data

Gaussian Process Regression Model in Spatial Logistic Regression

Modified Linear Projection for Large Spatial Data Sets

SPATIO-TEMPORAL METHODS IN CLIMATOLOGY. Christopher K. Wikle Department of Statistics, University of Missouri Columbia, USA

Introduction to Geostatistics

Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models

Spatial smoothing using Gaussian processes

Multivariate modelling and efficient estimation of Gaussian random fields with application to roller data

Econ 2148, fall 2017 Gaussian process priors, reproducing kernel Hilbert spaces, and Splines

Gaussian processes for spatial modelling in environmental health: parameterizing for flexibility vs. computational efficiency

Overview of Spatial Statistics with Applications to fmri

*Department Statistics and Operations Research (UPC) ** Department of Economics and Economic History (UAB)

A Generalized Convolution Model for Multivariate Nonstationary Spatial Processes

Summary STK 4150/9150

Basics of Point-Referenced Data Models

On fixed effects estimation in spline-based semiparametric regression for spatial data

Tutorial on Fixed Rank Kriging (FRK) of CO 2 data. M. Katzfuss, The Ohio State University N. Cressie, The Ohio State University

Spatial Statistics with Image Analysis. Lecture L02. Computer exercise 0 Daily Temperature. Lecture 2. Johan Lindström.

Lecture 14 Bayesian Models for Spatio-Temporal Data

Handbook of Spatial Statistics Chapter 2: Continuous Parameter Stochastic Process Theory by Gneiting and Guttorp

Bruno Sansó. Department of Applied Mathematics and Statistics University of California Santa Cruz bruno

Cross-covariance Functions for Tangent Vector Fields on the Sphere

A general science-based framework for dynamical spatio-temporal models

Regularized Estimation of High Dimensional Covariance Matrices. Peter Bickel. January, 2008

Kriging Luc Anselin, All Rights Reserved

A Probability Review

Climate Change: the Uncertainty of Certainty

Fixed rank filtering for spatio-temporal data

Sparse Covariance Selection using Semidefinite Programming

A HIERARCHICAL MAX-STABLE SPATIAL MODEL FOR EXTREME PRECIPITATION

A multi-resolution Gaussian process model for the analysis of large spatial data sets.

Analysing geoadditive regression data: a mixed model approach

Douglas Nychka, Soutir Bandyopadhyay, Dorit Hammerling, Finn Lindgren, and Stephan Sain. October 10, 2012

Karhunen-Loève Approximation of Random Fields Using Hierarchical Matrix Techniques

Experiences with Model Reduction and Interpolation

Approaches for Multiple Disease Mapping: MCAR and SANOVA

MIT Spring 2015

Transcription:

Geostatistical Modeling for Large Data Sets: Low-rank methods Whitney Huang, Kelly-Ann Dixon Hamil, and Zizhuang Wu Department of Statistics Purdue University February 22, 2016

Outline Motivation Low-rank methods

Gaussian process (GP) geostatistics Model: where Y (s) = µ(s) + η(s) + ε(s), s S R d µ(s) = E [Y (s)] = X(s) T β {η(s)} s S GP (0, C (, )) ε(s) iid N(0, τ 2 ) s S

Likelihood When the process Y (s) is observed at n locations, say, s 1, s n ie, Y = (Y (s 1 ),, Y (s n )) T Log-likelihood: where l n 1 2 log Σ Y 1 2 (Y XT β) T [Σ Y ] 1 (Y Xβ) Σ Y = [Cov {Y (s i ), Y (s j )}] i,j=1,,n [ ] = C (s i, s j ) + τ 2 1 {si =s j } i,j=1,,n

Big n Problem in geostatistics Modern environmental instruments have produced a wealth of space time data n is big Evaluation of the likelihood function involves factorizing large covariance matrices that generally requires O(n 3 ) operations O(n 2 ) memory Modeling strategies are needed to deal with large spatial data set parameter estimation MLE, Bayesian spatial interpolation Kriging multivariate spatial data (np np), spatio-temporal data (nt nt)

Big n Problem in geostatistics Modern environmental instruments have produced a wealth of space time data n is big Evaluation of the likelihood function involves factorizing large covariance matrices that generally requires O(n 3 ) operations O(n 2 ) memory Modeling strategies are needed to deal with large spatial data set parameter estimation MLE, Bayesian spatial interpolation Kriging multivariate spatial data (np np), spatio-temporal data (nt nt)

Big n Problem in geostatistics Modern environmental instruments have produced a wealth of space time data n is big Evaluation of the likelihood function involves factorizing large covariance matrices that generally requires O(n 3 ) operations O(n 2 ) memory Modeling strategies are needed to deal with large spatial data set parameter estimation MLE, Bayesian spatial interpolation Kriging multivariate spatial data (np np), spatio-temporal data (nt nt)

Outline Motivation Low-rank methods

Low rank approximation Goal: want to approximate Y = X T β + η + ε into a lower-dimensional structure Hierarchical Representation (assume zero mean) Y = η + ε, ε N n (0, Σ ε ) η = AZ + ξ, ξ N n (0, Σ ξ ) Z N r (0, Σ Z ) Z = (Z 1,, Z r ) T, r n A is the (n r) expansion matrix that maps Z to η Σ ε and Σ ξ are (usually assumed to be) diagonal η 0 = A 0 Z + ξ 0 η(s 0 ) = r j=1 a(s 0) T Z j + ξ(s 0 )

Low rank approximation Goal: want to approximate Y = X T β + η + ε into a lower-dimensional structure Hierarchical Representation (assume zero mean) Y = η + ε, ε N n (0, Σ ε ) η = AZ + ξ, ξ N n (0, Σ ξ ) Z N r (0, Σ Z ) Z = (Z 1,, Z r ) T, r n A is the (n r) expansion matrix that maps Z to η Σ ε and Σ ξ are (usually assumed to be) diagonal η 0 = A 0 Z + ξ 0 η(s 0 ) = r j=1 a(s 0) T Z j + ξ(s 0 )

Low rank approximation Goal: want to approximate Y = X T β + η + ε into a lower-dimensional structure Hierarchical Representation (assume zero mean) Y = η + ε, ε N n (0, Σ ε ) η = AZ + ξ, ξ N n (0, Σ ξ ) Z N r (0, Σ Z ) Z = (Z 1,, Z r ) T, r n A is the (n r) expansion matrix that maps Z to η Σ ε and Σ ξ are (usually assumed to be) diagonal η 0 = A 0 Z + ξ 0 η(s 0 ) = r j=1 a(s 0) T Z j + ξ(s 0 )

Low rank approximation Goal: want to approximate Y = X T β + η + ε into a lower-dimensional structure Hierarchical Representation (assume zero mean) Y = η + ε, ε N n (0, Σ ε ) η = AZ + ξ, ξ N n (0, Σ ξ ) Z N r (0, Σ Z ) Z = (Z 1,, Z r ) T, r n A is the (n r) expansion matrix that maps Z to η Σ ε and Σ ξ are (usually assumed to be) diagonal η 0 = A 0 Z + ξ 0 η(s 0 ) = r j=1 a(s 0) T Z j + ξ(s 0 )

Computational savings using low rank approximation To carry out parameter estimation and spatial interpolation one need to compute ( AΣZ A T + V ) 1 where V = Σ ε + Σ ξ Sherman Morrison Woodbury formula (A + BCD) 1 = A 1 A 1 B ( C 1 + DA 1 B ) 1 DA 1 In low rank approximation, we have ( AΣZ A T + V ) 1 = V 1 V 1 A ( Σ 1 Z + AT V 1 A ) 1 A T V 1

Low rank approximation: expansion matrix A η(s 1 ) η(s 2 ) η(s n ) }{{} η A 11 A 12 A 1r A 21 A 22 A 2r A n1 A n2 A nr }{{} A Z 1 Z 2 Z r }{{} Z Question How to choose A? How to determine Z = (Z 1,, Z r ) T? choose r knots s 1,, s r S and let Z = (Z (s 1 ),, Z (s r )) T

Low rank approximation: expansion matrix A η(s 1 ) η(s 2 ) η(s n ) }{{} η A 11 A 12 A 1r A 21 A 22 A 2r A n1 A n2 A nr }{{} A Z 1 Z 2 Z r }{{} Z Question How to choose A? How to determine Z = (Z 1,, Z r ) T? choose r knots s 1,, s r S and let Z = (Z (s 1 ),, Z (s r )) T

Choice of A Orthogonal basis functions: eg, Fourier, orthogonal polynomial, etc Karhune Loéve expansion: Cov(η(s 1 ), η(s 2 )) = j=1 λ jφ j (s 1 )φ j (s 2 ) where Cov(η(s 1 ), η(s 2 ))φ j (s 1 ) ds 1 = λ j φ j (s 2 ) S Empirical orthogonal functions (EOFs) Nonorthogonal Basis Functions: represent the spatial process as combination of kernel functions η(s) = r a(s, s j )Z j + ξ(s) j=1

Fixed Rank Kriging (Cressie & Johannesson 08) Fixed-rank kriging outlines how kriging (ie, spatial best linear unbiased predictor) can be applied in the low-rank parameterization of a spatial process A: (multiresolutional) non orthogonal basis functions ( ) a(s) T = {1 ( s s1(l) /ρ 1(l)) 2 } 2 +,, {1 ( s sr(l) /ρ r(l)) 2 } 2 + Z = (Z (s 1 ),, Z (s r )) T, Σ Z to be estimated from data The fixed rank kriging is equivalent to the following low rank model Y (s) = X (s) T β + r a(s s j(l) )Z j + ε(s) + ξ(s) j=1

Multiresolutional basis functions Figure: courtesy of Cressie & Johannesson 08

Nonstationary covariance function Figure: courtesy of Cressie & Johannesson 08

Gaussian Predictive Process (Banerjee et al 08) Y (s) = X (s) T β + η(s) + ε(s), η GP(0, C(, ; θ) Reasoning: Given Z, we seek a linear combination a(s) T Z such that [ 2 E η (s) a (s) Z] T ds S is minimized In predictive process one let Z = η = {η(s i )}p i=1 N p(0, C (θ)) [ ] p where C (θ) = C(si, s j ; θ) and i,j=1 ] j=1,,p A = [C(s i, s j ; θ) [C ] 1 i=1,,n

Covaraince function approximation Figure: courtesy of Banerjee et al 08

Covaraince function approximation cont d Figure: courtesy of Banerjee et al 08

Discussion

References S Banerjee, A E Gelfand, A O Finley, and H Sang Gaussian Predictive Process Models for Large Spatial Data Sets JRSSB, 70:825 848, 2008 N A C Cressie, and G Johannesson Fixed Rank Kriging for Very Large Spatial Data Sets JRSSB, 70:209 226, 2008 C K Wikle Low-rank representations for spatial processes, Chapter 8 of Handbook of Spatial Statistics Chapman & Hall/CRC, 2010