Gaussian predictive process models for large spatial data sets.


1 Gaussian predictive process models for large spatial data sets. Sudipto Banerjee, Alan E. Gelfand, Andrew O. Finley, and Huiyan Sang. Presenters: Halley Brantley and Chris Krut. September 28, 2015.

2 Overview. Recap: Gaussian process; spatial regression. Univariate predictive process. Multivariate Gaussian processes. Linear model of coregionalization. Multivariate predictive process. Extensions to non-Gaussian and space-time data.

3 Gaussian Process Definition. $Y(s)$ is a Gaussian process with mean function $\mu(s)$ and covariance function $H(s, s') = \mathrm{cov}(Y(s), Y(s'))$ if, for every finite set of locations $s_1, \dots, s_n$, the vector $\tilde{Y} = (Y(s_1), \dots, Y(s_n))^T$ satisfies $\tilde{Y} \sim MVN_n(\tilde{\mu}, H)$ (1), where $\tilde{\mu} = (\mu(s_1), \dots, \mu(s_n))^T$ and $H$ is the matrix with entries $\{H\}_{ij} = H(s_i, s_j; \phi)$. To be a valid covariance function, $H(s, s'; \phi)$ must be positive semidefinite in the sense that every covariance matrix $H$ it generates is positive semidefinite: $v^T H v \geq 0$ for all $v$ [2, Pg. 80].

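4 Spatial Regression Model. $Y(s) = X(s)^T \beta + w(s) + \varepsilon(s)$ (2). $X(s)$ is a vector of covariates and $\beta$ is the corresponding vector of coefficients. $\varepsilon(s)$ is an independent process (the nugget effect). $w(s)$ is a spatial random effect: $w(s) \sim GP(0, C(s, s'; \theta))$ with $C(s, s'; \theta) = \sigma^2 \rho(s, s'; \theta)$. For the observed data, $Y \sim N(X\beta, \Sigma_Y)$ with $\Sigma_Y = C(\theta) + \tau^2 I$.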

5 Computational Challenges. Fitting the above model requires determinants and inverses of large ($n \times n$) covariance matrices. Computational cost grows quickly with matrix size (cubically in $n$ for the dense factorizations typically used), and memory limitations also create problems. Much work has been done on fitting models to large spatial data sets.
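To make the bottleneck concrete, here is a minimal sketch of the dense likelihood evaluation for the spatial regression model on slide 4. It assumes an exponential covariance $\sigma^2 \exp(-d/\phi)$; the function names `exp_cov` and `dense_loglik` and all numerical values are illustrative choices, not taken from the slides.

```python
import numpy as np

def exp_cov(a, b, sigma2=1.0, phi=2.0):
    """Exponential covariance sigma2 * exp(-d / phi) between rows of a and b (assumed form)."""
    d = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=-1)
    return sigma2 * np.exp(-d / phi)

def dense_loglik(y, X, beta, coords, sigma2, phi, tau2):
    """Gaussian log-likelihood with Sigma_Y = C(theta) + tau2 * I, using a dense Cholesky."""
    n = len(y)
    Sigma = exp_cov(coords, coords, sigma2, phi) + tau2 * np.eye(n)
    L = np.linalg.cholesky(Sigma)              # the O(n^3) step
    r = np.linalg.solve(L, y - X @ beta)       # r = L^{-1} (y - X beta)
    logdet = 2.0 * np.sum(np.log(np.diag(L)))  # log det(Sigma_Y)
    return -0.5 * (n * np.log(2.0 * np.pi) + logdet + r @ r)

# Small synthetic example (all values hypothetical).
rng = np.random.default_rng(0)
n = 300
coords = rng.uniform(0, 10, size=(n, 2))
X = np.column_stack([np.ones(n), rng.normal(size=n)])
beta = np.array([1.0, 0.5])
Sigma = exp_cov(coords, coords) + 0.1 * np.eye(n)
y = X @ beta + np.linalg.cholesky(Sigma) @ rng.normal(size=n)
ll = dense_loglik(y, X, beta, coords, sigma2=1.0, phi=2.0, tau2=0.1)
print(ll)
```

Every likelihood evaluation inside an MCMC or optimization loop repeats the Cholesky step, which is why the cost becomes prohibitive as $n$ grows.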

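6 Univariate Predictive Process. $Y(s) = X(s)^T \beta + \tilde{w}(s) + \varepsilon(s)$ (3). Given knots $S^* = \{s^*_1, \dots, s^*_m\}$, let $w^* = (w(s^*_1), \dots, w(s^*_m))^T$, with covariance matrix $C^*(\theta) = [C(s^*_i, s^*_j; \theta)]_{i,j=1}^m$. The predictive process is $\tilde{w}(s_0) = E(w(s_0) \mid w^*) = c^T(s_0; \theta) C^{*-1}(\theta) w^*$, where $c^T(s; \theta) = (C(s, s^*_1; \theta), \dots, C(s, s^*_m; \theta))$. Hence $\tilde{w}(s) \sim GP(0, c^T(s; \theta) C^{*-1}(\theta) c(s'; \theta))$. Advantage: we now work with $m \times m$ matrices instead of $n \times n$.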
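The construction on this slide can be sketched in a few lines. The example assumes an exponential covariance and a $5 \times 5$ grid of knots; both choices, and the names `exp_cov`, `sites`, and `knots`, are illustrative assumptions rather than anything prescribed by the slides.

```python
import numpy as np

def exp_cov(a, b, sigma2=1.0, phi=2.0):
    """Exponential covariance sigma2 * exp(-d / phi) between rows of a and b (assumed form)."""
    d = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=-1)
    return sigma2 * np.exp(-d / phi)

rng = np.random.default_rng(1)
n, m = 500, 25
sites = rng.uniform(0, 10, size=(n, 2))             # observed locations s_1..s_n
g = np.linspace(0, 10, 5)
knots = np.array([(a, b) for a in g for b in g])    # knots s*_1..s*_m on a 5 x 5 grid

C_star = exp_cov(knots, knots)                      # m x m matrix C*(theta)
c = exp_cov(sites, knots)                           # n x m, row i is c(s_i; theta)^T

# Draw w* ~ N(0, C*) and evaluate the predictive process at the sites.
w_star = np.linalg.cholesky(C_star) @ rng.normal(size=m)
w_tilde = c @ np.linalg.solve(C_star, w_star)       # c^T C*^{-1} w*

# Evaluating at the knots themselves returns w* (interpolation property).
w_tilde_knots = exp_cov(knots, knots) @ np.linalg.solve(C_star, w_star)

# Induced covariance c^T C*^{-1} c: an n x n matrix of rank at most m.
cov_tilde = c @ np.linalg.solve(C_star, c.T)
print(w_tilde.shape, cov_tilde.shape)
```

Only the $m \times m$ matrix `C_star` is ever factorized; the $n \times n$ matrix `cov_tilde` is formed here purely for inspection and would not be built in a real fit.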

7 Properties. $\tilde{w}(s_0)$ minimizes $E\{[w(s_0) - f(w^*)]^2 \mid w^*\}$ over all real-valued functions $f(w^*)$. At a knot, $\tilde{w}(s^*_j) = c^T(s^*_j; \theta) C^{*-1}(\theta) w^* = w(s^*_j)$, so the predictive process interpolates the process $w(s)$ at the knots. Let $w_a = (w(s_1), \dots, w(s_n), w(s^*_1), \dots, w(s^*_m))$ and let $p(w_a \mid Y)$ be the posterior distribution of $w_a$ with all other parameters fixed.
1. $p(w_a \mid Y) \propto p(w_a)\, p(Y \mid w)$, since $p(Y \mid w) = p(Y \mid w_a)$.
2. The posterior for the predictive process model replaces $p(Y \mid w)$ with $q(Y \mid w^*)$.
3. We want to preserve $q(Y \mid w_a) = q(Y \mid w^*)$.
4. The authors claim the predictive process model corresponds to the density which minimizes the reverse Kullback-Leibler divergence between the posteriors $q(w_a \mid Y)$ and $p(w_a \mid Y)$.

8 Knot Selection. In addition to a covariance function, the predictive process requires a set of knots $S^*$: both the number of knots $m$ and their locations must be specified. Choosing the knots to be all observed spatial locations reduces to the original model. The number of knots balances performance against computational complexity. The authors consider modifications to a standard grid of knots (close pairs, infill).

9 To compare performance, compare the covariance function of the parent process with that of the predictive process. 200 locations are generated uniformly over a $[0, 10] \times [0, 10]$ rectangle. Knots form a $10 \times 10$ equally spaced grid. Matérn covariance with $\sigma^2 = 1$, range parameter $\phi = 2$, and four values of $\nu$. Covariances for 2,000 of the roughly 40,000 distance pairs are plotted for the predictive process.

10 Covariances of $w(s)$ against distance (line) and covariances of $\tilde{w}(s)$ against distance (points): (a) smoothness parameter 0.5; (b) smoothness parameter 1; (c) smoothness parameter 1.5; (d) smoothness parameter 5. See Figure 1, Pg. 831.

11 Alternative Scenario. Exponential covariance: set $\nu = 0.5$. Choose 4 values of the range parameter.

12 Covariances of $w(s)$ against distance (line) and covariances of $\tilde{w}(s)$ against distance (points): (a) range parameter 2; (b) range parameter 4; (c) range parameter 6; (d) range parameter 12. See Figure 2, Pg. 832.
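A rough numerical analogue of this comparison can be sketched as follows. It uses 200 uniform locations, a $10 \times 10$ knot grid, and the four range values listed in the caption, but it assumes the parameterization $\exp(-d/\phi)$ for the exponential covariance, which may differ from the one used in the paper. The distance cut-off of 1.0 used to split "near" and "far" pairs is an arbitrary illustrative choice.

```python
import numpy as np

def exp_cov(a, b, sigma2=1.0, phi=2.0):
    """Exponential covariance sigma2 * exp(-d / phi) (assumed parameterization)."""
    d = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=-1)
    return sigma2 * np.exp(-d / phi)

rng = np.random.default_rng(2)
sites = rng.uniform(0, 10, size=(200, 2))
g = np.linspace(0, 10, 10)
knots = np.array([(a, b) for a in g for b in g])    # 10 x 10 grid, m = 100

dist = np.linalg.norm(sites[:, None, :] - sites[None, :, :], axis=-1)
iu = np.triu_indices(200, k=1)                      # distinct pairs of sites

summary = {}
for phi in (2.0, 4.0, 6.0, 12.0):
    C = exp_cov(sites, sites, phi=phi)              # parent process covariance
    c = exp_cov(sites, knots, phi=phi)
    C_tilde = c @ np.linalg.solve(exp_cov(knots, knots, phi=phi), c.T)
    gap = np.abs(C - C_tilde)[iu]
    near = gap[dist[iu] < 1.0].mean()               # mean discrepancy, short distances
    far = gap[dist[iu] >= 1.0].mean()               # mean discrepancy, longer distances
    var_gap = np.diag(C - C_tilde)                  # variance lost by the predictive process
    summary[phi] = (near, far, var_gap.min(), var_gap.max())
    print(f"phi={phi:5.1f}  near={near:.4f}  far={far:.4f}  max variance gap={var_gap.max():.4f}")
```

The diagonal of `C - C_tilde` is a conditional variance and so is never negative: the predictive process cannot have larger variance than the parent process at any location.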

13 Take-Aways 1. Covariance functions agree better at larger distances, especially when increasing smoothness and range. May need dense knots. Knot selection with a packed subset (instead of just a grid) may improve results.

14 Lattice plus close pairs configuration: starts with a regular $k \times k$ lattice of knots, then intensifies this grid by randomly choosing $m$ of the lattice points and placing an additional knot close to each of them. Lattice plus infill design: starts with knots on a regular $k \times k$ lattice, then intensifies the grid by placing a more finely spaced lattice within $m$ randomly chosen cells of the original lattice.
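The two designs can be generated along the following lines. This is only a sketch under stated assumptions: the offset radius for close pairs, the reflection used to keep points inside the domain, and the choice of edge midpoints plus the cell centre as the "finer lattice" are illustrative decisions, not the authors' exact construction. The function names are hypothetical.

```python
import numpy as np

def lattice(k, lo=0.0, hi=10.0):
    """Regular k x k lattice of knots on [lo, hi]^2."""
    g = np.linspace(lo, hi, k)
    return np.array([(x, y) for x in g for y in g])

def lattice_plus_close_pairs(k, m, radius, rng, lo=0.0, hi=10.0):
    """Add one extra knot within `radius` of each of m randomly chosen lattice points."""
    base = lattice(k, lo, hi)
    chosen = rng.choice(len(base), size=m, replace=False)
    angle = rng.uniform(0.0, 2.0 * np.pi, size=m)
    r = rng.uniform(0.2 * radius, radius, size=m)   # keep offsets away from zero
    extra = base[chosen] + np.column_stack([r * np.cos(angle), r * np.sin(angle)])
    extra = lo + np.abs(extra - lo)                 # reflect points that fall below lo
    extra = hi - np.abs(hi - extra)                 # reflect points that fall above hi
    return np.vstack([base, extra])

def lattice_plus_infill(k, m, rng, lo=0.0, hi=10.0):
    """Refine m randomly chosen cells with a half-spacing lattice."""
    g = np.linspace(lo, hi, k)
    base = lattice(k, lo, hi)
    cells = rng.choice((k - 1) ** 2, size=m, replace=False)
    extra = []
    for cell in cells:
        i, j = divmod(int(cell), k - 1)
        xs = (g[i], 0.5 * (g[i] + g[i + 1]), g[i + 1])
        ys = (g[j], 0.5 * (g[j] + g[j + 1]), g[j + 1])
        for a, x in enumerate(xs):
            for b, y in enumerate(ys):
                if a != 1 and b != 1:
                    continue                        # cell corners are already lattice knots
                extra.append((x, y))
    extra = np.unique(np.array(extra), axis=0)      # adjacent cells share edge midpoints
    return np.vstack([base, extra])

rng = np.random.default_rng(0)
cp = lattice_plus_close_pairs(8, 10, 0.3, rng)
infill = lattice_plus_infill(8, 10, rng)
print(cp.shape, infill.shape)
```

Duplicate knots must be avoided in either design, since repeated locations make the knot covariance matrix singular.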

15 Simulation 1. Simulate $Y(s)$ at 3000 irregularly scattered locations $s$: $Y(s) = x^T(s)\beta + w(s) + \epsilon(s)$. See Figure 3a, Pg. 838.

16 See Figures 3a,3b,3c Pg. 838.

17 See Table 1, Pg. 837

18 See Table 2, Pg. 839

19 See Table 3, Pg. 839

20 Take-Aways 2. Estimation is more sensitive to the number of knots than to the underlying design. Close pair designs appear to improve estimation of the shorter ranges, as seen for $\lambda_2$ with 256 knots. Predictions are much more robust (little change as the number of knots increases).

21 Simulation 2. 15,000 locations (vs. 3000 in Simulation 1). Non-stationary random field; the full model is computationally infeasible. Divide the domain into 3 regions, each with a different intercept.

22 Simulated sites, OLS residuals, and knots: see Figure 4ab. Spatial residuals: see Figure 4c, Pg. 840.

23 See Table 4, Pg. 841

24 Take-Aways 3. Higher knot density gives better estimation. Spatial residuals are smoother and illustrate regional anisotropy.

25 Application. Forest biomass and other variables related to current carbon stocks are important for quantifying the ecological and economic viability of forest landscapes. We want to know how biomass changes across the landscape (as a continuous surface) and how homogeneous it is across the region, which calls for an interpolated surface.

26 Data. Point-referenced biomass (log-transformed) data observed at 9500 locations (USDA). $Y(s)$: biomass from trees. $X_1(s)$: the cross-sectional area of all stems above 1.37 m from the ground (basal area). $X_2(s)$: number of tree stems (stem density) at that location. See Figure 5. Spatially varying coefficient model: $Y(s) = x^T(s)\beta(s) + \epsilon(s)$. Predictive process model: $Y = X\beta + Z^T C^T(\theta) C^{*-1}(\theta) w^* + \epsilon$.

27 See Table 5 Pg. 844.

28 See Figure 6, Pg. 845 Posterior (mean) estimates of spatial surfaces from the spatially varying coefficients model: (a) intercept parameter (b) basal area parameter (c) stem density (d) (log-) biomass

29 Bivariate Gaussian Process. $(w_1(s), w_2(s))^T \sim MVGP_2(0, \Gamma_w(s, s'))$, with cross-covariance function
$$\Gamma_w(s, s') = \begin{pmatrix} \mathrm{cov}(w_1(s), w_1(s')) & \mathrm{cov}(w_1(s), w_2(s')) \\ \mathrm{cov}(w_2(s), w_1(s')) & \mathrm{cov}(w_2(s), w_2(s')) \end{pmatrix}.$$
For observed locations $s_1, \dots, s_n$, the covariance matrix induced by $\Gamma_w(s, s')$ is $2n \times 2n$.

30 Multivariate Gaussian Process. $(w_1(s), \dots, w_k(s))^T \sim MVGP_k(0, \Gamma_w(s, s'))$, with cross-covariance function
$$\Gamma_w(s, s') = \begin{pmatrix} \mathrm{cov}(w_1(s), w_1(s')) & \cdots & \mathrm{cov}(w_1(s), w_k(s')) \\ \vdots & \ddots & \vdots \\ \mathrm{cov}(w_k(s), w_1(s')) & \cdots & \mathrm{cov}(w_k(s), w_k(s')) \end{pmatrix}.$$
For observed locations $s_1, \dots, s_n$, the covariance matrix induced by $\Gamma_w(s, s')$ is $kn \times kn$.

31 Multivariate Spatial Regression. $Y(s) = X(s)^T \beta + w(s) + \varepsilon(s)$ (4). Linear Model of Coregionalization [1]: $w(s) = (w_1(s), \dots, w_k(s))^T = A(s) v(s)$ with $v(s) = (v_1(s), \dots, v_k(s))^T$, where the $v_j(s)$ are independent $GP(0, \rho_j(s, s'))$. Hence $v(s) \sim MVGP_k(0, \Gamma_v(s, s'))$ with $\Gamma_v(s, s') = \mathrm{diag}([\rho_i(s, s')]_{i=1}^k)$, and $\Gamma_w(s, s') = A(s)\Gamma_v(s, s')A^T(s')$. Since $\Gamma_w(s, s) = A(s)A^T(s)$, we can take $A(s)$ to be lower triangular.
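As an illustration of the coregionalization construction, the sketch below builds the full $kn \times kn$ covariance matrix for $k = 2$. It assumes a spatially constant lower-triangular $A$ and exponential correlation functions $\rho_j$; both are simplifying assumptions for the example, and the numerical values are arbitrary.

```python
import numpy as np

def exp_corr(a, b, phi):
    """Exponential correlation exp(-d / phi) between rows of a and b (assumed form)."""
    d = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=-1)
    return np.exp(-d / phi)

rng = np.random.default_rng(3)
n, k = 40, 2
sites = rng.uniform(0, 10, size=(n, 2))
A = np.array([[1.0, 0.0],
              [0.6, 0.8]])                  # lower triangular, constant in s (assumption)
phis = [1.0, 4.0]                           # one range per latent process v_j

R = [exp_corr(sites, sites, p) for p in phis]   # each n x n

# Gamma_w(s_i, s_j) = A diag(rho_1, ..., rho_k) A^T = sum_l rho_l(s_i, s_j) a_l a_l^T,
# where a_l is the l-th column of A. The Kronecker product places the k x k block
# for the pair (s_i, s_j) at block position (i, j), i.e. site-major ordering.
Sigma_w = sum(np.kron(R[l], np.outer(A[:, l], A[:, l])) for l in range(k))

print(Sigma_w.shape)                        # (kn, kn)
print(Sigma_w[:k, :k])                      # Gamma_w(s_1, s_1), which equals A A^T
```

Because each $\rho_j(s, s) = 1$, the diagonal blocks equal $A A^T$, which is the within-site covariance of the $k$ components.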

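32 Multivariate Predictive Process. $Y(s) = X(s)^T \beta + \tilde{w}(s) + \varepsilon(s)$ (5), where $\tilde{w}(s) = \mathrm{cov}(w(s), w^*)\,\mathrm{var}^{-1}(w^*)\,w^* = C^T(s; \theta) C^{*-1}(\theta) w^*$. Here $C^T(s; \theta) = [\Gamma_w(s, s^*_1; \theta), \dots, \Gamma_w(s, s^*_m; \theta)]$ is a $k \times mk$ matrix (so $C(s; \theta)$ is $mk \times k$), and $C^*(\theta) = [\Gamma_w(s^*_i, s^*_j; \theta)]_{i,j=1}^m$ is an $mk \times mk$ matrix.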

33 Additional Computational Savings. Collect the knot-level factors into the block-diagonal matrix $A^* = \mathrm{BlockDiag}(A(s^*_1), A(s^*_2), \dots, A(s^*_m))$, and let $\Sigma^*_v = [\Gamma_v(s^*_i, s^*_j)]_{i,j=1}^m$, where each block is diagonal: $\Gamma_v(s^*_i, s^*_j) = \mathrm{diag}(\rho_1(s^*_i, s^*_j), \rho_2(s^*_i, s^*_j), \dots, \rho_k(s^*_i, s^*_j))$.

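34 Additional Computational Savings. $C^* = A^* \Sigma^*_v A^{*T}$, so $C^{*-1} = A^{*-T} \Sigma^{*-1}_v A^{*-1}$. Moreover, $\Sigma^*_v = P^T H P$, where $H = \mathrm{BlockDiag}(H_1, \dots, H_k)$ with $H_i = [\rho_i(s^*_j, s^*_{j'})]_{j,j'=1}^m$, and $P$ is a permutation matrix, so $P^{-1} = P^T$. Inverting $\Sigma^*_v$ therefore only requires inverting $k$ matrices of size $m \times m$.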
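The permutation structure can be checked numerically. The sketch below assumes exponential correlations and, for simplicity, a spatially constant lower-triangular $A$, so that $A^* = I_m \otimes A$; the index conventions (site-major ordering for $\Sigma^*_v$, component-major ordering for $H$) are assumptions made for the example.

```python
import numpy as np

def exp_corr(a, b, phi):
    """Exponential correlation exp(-d / phi) between rows of a and b (assumed form)."""
    d = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=-1)
    return np.exp(-d / phi)

rng = np.random.default_rng(4)
m, k = 6, 3
knots = rng.uniform(0, 10, size=(m, 2))
phis = [1.0, 3.0, 6.0]
H_blocks = [exp_corr(knots, knots, p) for p in phis]    # H_1..H_k, each m x m

# Sigma_v*: site-major ordering, row index i*k + a for knot i and component a.
Sigma_v = np.zeros((m * k, m * k))
for a in range(k):
    Sigma_v[a::k, a::k] = H_blocks[a]

# H: component-major ordering, row index a*m + i, hence block diagonal.
H = np.zeros((m * k, m * k))
for a in range(k):
    H[a * m:(a + 1) * m, a * m:(a + 1) * m] = H_blocks[a]

# Permutation matrix mapping site-major to component-major ordering.
P = np.zeros((m * k, m * k))
for i in range(m):
    for a in range(k):
        P[a * m + i, i * k + a] = 1.0

# Invert Sigma_v* via k inverses of size m x m.
H_inv = np.zeros_like(H)
for a in range(k):
    H_inv[a * m:(a + 1) * m, a * m:(a + 1) * m] = np.linalg.inv(H_blocks[a])
Sigma_v_inv = P.T @ H_inv @ P

# C* and its inverse, with a constant lower-triangular A (assumption).
A = np.array([[1.0, 0.0, 0.0],
              [0.5, 1.2, 0.0],
              [-0.3, 0.4, 0.9]])
A_star = np.kron(np.eye(m), A)
A_star_inv = np.kron(np.eye(m), np.linalg.inv(A))
C_star = A_star @ Sigma_v @ A_star.T
C_star_inv = A_star_inv.T @ Sigma_v_inv @ A_star_inv

print(np.allclose(Sigma_v, P.T @ H @ P))
print(np.allclose(C_star_inv @ C_star, np.eye(m * k)))
```

The saving comes from never factorizing the full $mk \times mk$ matrix: only the $k$ blocks $H_i$ and the small $k \times k$ factors $A(s^*_j)$ are inverted.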

35 General Framework. Spatial mixed model: $Y(s) = X^T(s)\beta + Z^T(s) w(s) + \varepsilon(s)$. $Y(s)$ is a $q \times 1$ vector of responses at location $s$. $X^T(s) = \mathrm{diag}(x_1^T(s), \dots, x_q^T(s))$. $Z^T(s)$ is a $q \times k$ design matrix. $w(s)$ is a $k \times 1$ vector of spatial effects. $\beta$ is a vector of coefficients of length $p = \sum_{l=1}^{q} p_l$. Predictive process model: $Y(s) = X^T(s)\beta + Z^T(s)\tilde{w}(s) + \varepsilon(s)$.

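36 Implementation. $Y = X\beta + Z^T C^T(\theta) C^{*-1}(\theta) w^* + \varepsilon$, with $\varepsilon \sim N(0, I_n \otimes \Psi)$. $Y = [Y(s_i)]_{i=1}^n$ is an $nq \times 1$ vector of responses. $X = [X^T(s_i)]_{i=1}^n$ is an $nq \times p$ matrix of covariates. $Z^T = \mathrm{BlockDiag}(Z^T(s_1), \dots, Z^T(s_n))$ is an $nq \times nk$ design matrix. $C^T(\theta) = [\Gamma_w(s_i, s^*_j; \theta)]_{i,j=1}^{n,m}$ is $nk \times mk$; $w^*$ and $C^*$ are from the predictive process. After marginalizing out $w^*$: $f(Y \mid \Omega) = MVN(X\beta,\; Z^T C^T(\theta) C^{*-1}(\theta) C(\theta) Z + I_n \otimes \Psi)$.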

37 Sherman-Woodbury-Morrison Formula.
1. $(A + UCV)^{-1} = A^{-1} - A^{-1}U(C^{-1} + VA^{-1}U)^{-1}VA^{-1}$ [4].
2. $\det(A + UWV^T) = \det(W^{-1} + V^T A^{-1} U)\,\det(W)\,\det(A)$ [3].
Likelihood calculations for $Y$ involve computing the determinant and inverse of $Z^T C^T(\theta) C^{*-1}(\theta) C(\theta) Z + I_n \otimes \Psi$. Using these identities, the computations are in terms of $mk \times mk$ matrices instead of $nq \times nq$.
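Both identities can be exercised on a generic "diagonal plus low rank" matrix. In the sketch below, `U` stands in for $Z^T C^T(\theta)$, `W` for $C^{*-1}(\theta)$, and the vector `d` for the diagonal of $I_n \otimes \Psi$ (taking $\Psi$ diagonal, which is an assumption of the example). The matrices are random placeholders, and the function name `lowrank_solve_logdet` is hypothetical.

```python
import numpy as np

def lowrank_solve_logdet(d, U, W, y):
    """Solve (diag(d) + U W U^T) x = y and return (x, log det) using only r x r solves."""
    Dinv_y = y / d
    Dinv_U = U / d[:, None]
    K = np.linalg.inv(W) + U.T @ Dinv_U                        # W^{-1} + U^T D^{-1} U, r x r
    x = Dinv_y - Dinv_U @ np.linalg.solve(K, U.T @ Dinv_y)     # Woodbury identity
    logdet = (np.linalg.slogdet(K)[1]                          # matrix determinant lemma
              + np.linalg.slogdet(W)[1]
              + np.sum(np.log(d)))
    return x, logdet

rng = np.random.default_rng(5)
N, r = 400, 15
U = rng.normal(size=(N, r))
B = rng.normal(size=(r, r))
W = B @ B.T + np.eye(r)                    # symmetric positive definite
d = rng.uniform(0.5, 2.0, size=N)
y = rng.normal(size=N)

x, logdet = lowrank_solve_logdet(d, U, W, y)

# Dense reference computation, for comparison only.
Sigma = np.diag(d) + U @ W @ U.T
x_ref = np.linalg.solve(Sigma, y)
logdet_ref = np.linalg.slogdet(Sigma)[1]
print(np.allclose(x, x_ref), np.isclose(logdet, logdet_ref))
```

The low-rank route costs on the order of $N r^2$ operations rather than $N^3$, which is the source of the savings when $mk \ll nq$.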

38 Extensions: Non-Gaussian and Spatio-Temporal Data.
1. Non-Gaussian data (binary, count, categorical): binomial data (probit and logistic models) and count data. Assume we have an appropriate transformation $\eta(s) = g(E(Y(s))) = X^T(s)\beta + w(s)$, with $g(\cdot)$ known. In general you cannot marginalize out $w(s)$, and its full conditional is not available. Clever trick: writing $\eta(s) = g(E(Y(s))) = X^T(s)\beta + w(s) + \varepsilon(s)$, the added term $\varepsilon(s)$ produces full conditionals for $w(s)$ which are multivariate normal.
2. Space-time data: knots must now be specified over space and time ($D \times S$). The predictive process model extends naturally to this case.

39 [1] S. Banerjee, B. P. Carlin, and A. E. Gelfand. Hierarchical Modeling and Analysis for Spatial Data. Monographs on Statistics and Applied Probability (101).
[2] Carl Edward Rasmussen. Gaussian Processes for Machine Learning.
[3] Wikipedia. Matrix determinant lemma. Wikipedia, the free encyclopedia. [Online; accessed 16-September-2015].
[4] Wikipedia. Woodbury matrix identity. Wikipedia, the free encyclopedia, 2015. [Online; accessed 16-September-2015].


More information

Nonparametric Bayesian Methods - Lecture I

Nonparametric Bayesian Methods - Lecture I Nonparametric Bayesian Methods - Lecture I Harry van Zanten Korteweg-de Vries Institute for Mathematics CRiSM Masterclass, April 4-6, 2016 Overview of the lectures I Intro to nonparametric Bayesian statistics

More information

Multivariate Gaussian Random Fields with SPDEs

Multivariate Gaussian Random Fields with SPDEs Multivariate Gaussian Random Fields with SPDEs Xiangping Hu Daniel Simpson, Finn Lindgren and Håvard Rue Department of Mathematics, University of Oslo PASI, 214 Outline The Matérn covariance function and

More information

Covariance function estimation in Gaussian process regression

Covariance function estimation in Gaussian process regression Covariance function estimation in Gaussian process regression François Bachoc Department of Statistics and Operations Research, University of Vienna WU Research Seminar - May 2015 François Bachoc Gaussian

More information

Bayesian Linear Models

Bayesian Linear Models Bayesian Linear Models Sudipto Banerjee September 03 05, 2017 Department of Biostatistics, Fielding School of Public Health, University of California, Los Angeles Linear Regression Linear regression is,

More information

Statistícal Methods for Spatial Data Analysis

Statistícal Methods for Spatial Data Analysis Texts in Statistícal Science Statistícal Methods for Spatial Data Analysis V- Oliver Schabenberger Carol A. Gotway PCT CHAPMAN & K Contents Preface xv 1 Introduction 1 1.1 The Need for Spatial Analysis

More information

arxiv: v1 [stat.me] 28 Dec 2017

arxiv: v1 [stat.me] 28 Dec 2017 A Divide-and-Conquer Bayesian Approach to Large-Scale Kriging Raarshi Guhaniyogi, Cheng Li, Terrance D. Savitsky 3, and Sanvesh Srivastava 4 arxiv:7.9767v [stat.me] 8 Dec 7 Department of Applied Mathematics

More information

Hierarchical Modeling for Univariate Spatial Data

Hierarchical Modeling for Univariate Spatial Data Univariate spatial models Spatial Domain Hierarchical Modeling for Univariate Spatial Data Sudipto Banerjee and Andrew O. Finley 2 Biostatistics, School of Public Health, University of Minnesota, Minneapolis,

More information

Fusing space-time data under measurement error for computer model output

Fusing space-time data under measurement error for computer model output for computer model output (vjb2@stat.duke.edu) SAMSI joint work with Alan E. Gelfand and David M. Holland Introduction In many environmental disciplines data come from two sources: monitoring networks

More information

(Multivariate) Gaussian (Normal) Probability Densities

(Multivariate) Gaussian (Normal) Probability Densities (Multivariate) Gaussian (Normal) Probability Densities Carl Edward Rasmussen, José Miguel Hernández-Lobato & Richard Turner April 20th, 2018 Rasmussen, Hernàndez-Lobato & Turner Gaussian Densities April

More information

x. Figure 1: Examples of univariate Gaussian pdfs N (x; µ, σ 2 ).

x. Figure 1: Examples of univariate Gaussian pdfs N (x; µ, σ 2 ). .8.6 µ =, σ = 1 µ = 1, σ = 1 / µ =, σ =.. 3 1 1 3 x Figure 1: Examples of univariate Gaussian pdfs N (x; µ, σ ). The Gaussian distribution Probably the most-important distribution in all of statistics

More information

Journal of Statistical Software

Journal of Statistical Software JSS Journal of Statistical Software April 2007, Volume 19, Issue 4. http://www.jstatsoft.org/ spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models Andrew O.

More information

STAT 518 Intro Student Presentation

STAT 518 Intro Student Presentation STAT 518 Intro Student Presentation Wen Wei Loh April 11, 2013 Title of paper Radford M. Neal [1999] Bayesian Statistics, 6: 475-501, 1999 What the paper is about Regression and Classification Flexible

More information

Cross-sectional space-time modeling using ARNN(p, n) processes

Cross-sectional space-time modeling using ARNN(p, n) processes Cross-sectional space-time modeling using ARNN(p, n) processes W. Polasek K. Kakamu September, 006 Abstract We suggest a new class of cross-sectional space-time models based on local AR models and nearest

More information

Model Selection for Geostatistical Models

Model Selection for Geostatistical Models Model Selection for Geostatistical Models Richard A. Davis Colorado State University http://www.stat.colostate.edu/~rdavis/lectures Joint work with: Jennifer A. Hoeting, Colorado State University Andrew

More information

Spatial Lasso with Application to GIS Model Selection. F. Jay Breidt Colorado State University

Spatial Lasso with Application to GIS Model Selection. F. Jay Breidt Colorado State University Spatial Lasso with Application to GIS Model Selection F. Jay Breidt Colorado State University with Hsin-Cheng Huang, Nan-Jung Hsu, and Dave Theobald September 25 The work reported here was developed under

More information

Probabilistic & Unsupervised Learning

Probabilistic & Unsupervised Learning Probabilistic & Unsupervised Learning Gaussian Processes Maneesh Sahani maneesh@gatsby.ucl.ac.uk Gatsby Computational Neuroscience Unit, and MSc ML/CSML, Dept Computer Science University College London

More information

9.2 Support Vector Machines 159

9.2 Support Vector Machines 159 9.2 Support Vector Machines 159 9.2.3 Kernel Methods We have all the tools together now to make an exciting step. Let us summarize our findings. We are interested in regularized estimation problems of

More information

Using Estimating Equations for Spatially Correlated A

Using Estimating Equations for Spatially Correlated A Using Estimating Equations for Spatially Correlated Areal Data December 8, 2009 Introduction GEEs Spatial Estimating Equations Implementation Simulation Conclusion Typical Problem Assess the relationship

More information

Linear Regression (9/11/13)

Linear Regression (9/11/13) STA561: Probabilistic machine learning Linear Regression (9/11/13) Lecturer: Barbara Engelhardt Scribes: Zachary Abzug, Mike Gloudemans, Zhuosheng Gu, Zhao Song 1 Why use linear regression? Figure 1: Scatter

More information

A full-scale approximation of covariance functions for large spatial data sets

A full-scale approximation of covariance functions for large spatial data sets A full-scale approximation of covariance functions for large spatial data sets Huiyan Sang Department of Statistics, Texas A&M University, College Station, USA. Jianhua Z. Huang Department of Statistics,

More information

Bayesian Linear Models

Bayesian Linear Models Bayesian Linear Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Department of Forestry & Department of Geography, Michigan State University, Lansing Michigan, U.S.A. 2 Biostatistics, School of Public

More information

Approaches for Multiple Disease Mapping: MCAR and SANOVA

Approaches for Multiple Disease Mapping: MCAR and SANOVA Approaches for Multiple Disease Mapping: MCAR and SANOVA Dipankar Bandyopadhyay Division of Biostatistics, University of Minnesota SPH April 22, 2015 1 Adapted from Sudipto Banerjee s notes SANOVA vs MCAR

More information

COVARIANCE APPROXIMATION FOR LARGE MULTIVARIATE SPATIAL DATA SETS WITH AN APPLICATION TO MULTIPLE CLIMATE MODEL ERRORS 1

COVARIANCE APPROXIMATION FOR LARGE MULTIVARIATE SPATIAL DATA SETS WITH AN APPLICATION TO MULTIPLE CLIMATE MODEL ERRORS 1 The Annals of Applied Statistics 2011, Vol. 5, No. 4, 2519 2548 DOI: 10.1214/11-AOAS478 Institute of Mathematical Statistics, 2011 COVARIANCE APPROXIMATION FOR LARGE MULTIVARIATE SPATIAL DATA SETS WITH

More information

Gaussian processes. Chuong B. Do (updated by Honglak Lee) November 22, 2008

Gaussian processes. Chuong B. Do (updated by Honglak Lee) November 22, 2008 Gaussian processes Chuong B Do (updated by Honglak Lee) November 22, 2008 Many of the classical machine learning algorithms that we talked about during the first half of this course fit the following pattern:

More information

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models Thomas Kneib Department of Mathematics Carl von Ossietzky University Oldenburg Sonja Greven Department of

More information

Bayesian Linear Regression

Bayesian Linear Regression Bayesian Linear Regression Sudipto Banerjee 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. September 15, 2010 1 Linear regression models: a Bayesian perspective

More information

CS Lecture 19. Exponential Families & Expectation Propagation

CS Lecture 19. Exponential Families & Expectation Propagation CS 6347 Lecture 19 Exponential Families & Expectation Propagation Discrete State Spaces We have been focusing on the case of MRFs over discrete state spaces Probability distributions over discrete spaces

More information

CPSC 540: Machine Learning

CPSC 540: Machine Learning CPSC 540: Machine Learning MCMC and Non-Parametric Bayes Mark Schmidt University of British Columbia Winter 2016 Admin I went through project proposals: Some of you got a message on Piazza. No news is

More information

Bayesian inference & process convolution models Dave Higdon, Statistical Sciences Group, LANL

Bayesian inference & process convolution models Dave Higdon, Statistical Sciences Group, LANL 1 Bayesian inference & process convolution models Dave Higdon, Statistical Sciences Group, LANL 2 MOVING AVERAGE SPATIAL MODELS Kernel basis representation for spatial processes z(s) Define m basis functions

More information

Spatial Backfitting of Roller Measurement Values from a Florida Test Bed

Spatial Backfitting of Roller Measurement Values from a Florida Test Bed Spatial Backfitting of Roller Measurement Values from a Florida Test Bed Daniel K. Heersink 1, Reinhard Furrer 1, and Mike A. Mooney 2 1 Institute of Mathematics, University of Zurich, CH-8057 Zurich 2

More information

Hilbert Space Methods for Reduced-Rank Gaussian Process Regression

Hilbert Space Methods for Reduced-Rank Gaussian Process Regression Hilbert Space Methods for Reduced-Rank Gaussian Process Regression Arno Solin and Simo Särkkä Aalto University, Finland Workshop on Gaussian Process Approximation Copenhagen, Denmark, May 2015 Solin &

More information

BAYESIAN PREDICTIVE PROCESS MODELS FOR HISTORICAL PRECIPITATION DATA OF ALASKA AND SOUTHWESTERN CANADA. Peter Vanney

BAYESIAN PREDICTIVE PROCESS MODELS FOR HISTORICAL PRECIPITATION DATA OF ALASKA AND SOUTHWESTERN CANADA. Peter Vanney BAYESIAN PREDICTIVE PROCESS MODELS FOR HISTORICAL PRECIPITATION DATA OF ALASKA AND SOUTHWESTERN CANADA By Peter Vanney RECOMMENDED: Dr. Ronald Barry Dr. Scott Goddard Dr. Margaret Short Advisory Committee

More information