Hierarchical Modelling for Univariate and Multivariate Spatial Data

Similar documents
Hierarchical Modeling for Multivariate Spatial Data

Hierarchical Modelling for Multivariate Spatial Data

Hierarchical Modelling for Univariate Spatial Data

Hierarchical Modeling for Univariate Spatial Data

Hierarchical Modelling for Univariate Spatial Data

Hierarchical Modeling for non-gaussian Spatial Data

Hierarchical Modelling for non-gaussian Spatial Data

Hierarchical Modelling for non-gaussian Spatial Data

Introduction to Spatial Data and Models

Models for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data

Multivariate spatial modeling

Introduction to Spatial Data and Models

Gaussian predictive process models for large spatial data sets.

Introduction to Spatial Data and Models

spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models

Nearest Neighbor Gaussian Processes for Large Spatial Data

Introduction to Spatial Data and Models

Hierarchical Modeling for Spatial Data

Introduction to Geostatistics

Modelling Multivariate Spatial Data

A Framework for Daily Spatio-Temporal Stochastic Weather Simulation

Bayesian data analysis in practice: Three simple examples

Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Mela. P.

Bayesian spatial quantile regression

Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets

CBMS Lecture 1. Alan E. Gelfand Duke University

Analysis of Marked Point Patterns with Spatial and Non-spatial Covariate Information

Geostatistical Modeling for Large Data Sets: Low-rank methods

Chapter 4 - Fundamentals of spatial processes Lecture notes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Basics of Point-Referenced Data Models

Bayesian Linear Regression

Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields

Multivariate Spatial Process Models. Alan E. Gelfand and Sudipto Banerjee

Bayesian Linear Models

Chapter 4 - Fundamentals of spatial processes Lecture notes

Lecture 19. Spatial GLM + Point Reference Spatial Data. Colin Rundel 04/03/2017

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

MCMC algorithms for fitting Bayesian models

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Hierarchical Modeling for Spatio-temporal Data

On Gaussian Process Models for High-Dimensional Geostatistical Datasets

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA

Point-Referenced Data Models

Introduction to Bayes and non-bayes spatial statistics

Metropolis-Hastings Algorithm

Wrapped Gaussian processes: a short review and some new results

Hierarchical Modeling and Analysis for Spatial Data

Asymptotic Multivariate Kriging Using Estimated Parameters with Bayesian Prediction Methods for Non-linear Predictands

A Fully Nonparametric Modeling Approach to. BNP Binary Regression

Journal of Statistical Software

Bayesian non-parametric model to longitudinally predict churn

Part 8: GLMs and Hierarchical LMs and GLMs

A Note on the comparison of Nearest Neighbor Gaussian Process (NNGP) based models

Bayesian spatial hierarchical modeling for temperature extremes

Empirical Bayes methods for the transformed Gaussian random field model with additive measurement errors

Bayesian inference & process convolution models Dave Higdon, Statistical Sciences Group, LANL

Hastings-within-Gibbs Algorithm: Introduction and Application on Hierarchical Model

Principles of Bayesian Inference

Karhunen-Loeve Expansion and Optimal Low-Rank Model for Spatial Processes

Spatial Misalignment

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

Markov Chain Monte Carlo

Principles of Bayesian Inference

Generating gridded fields of extreme precipitation for large domains with a Bayesian hierarchical model

Practicum : Spatial Regression

Modeling Real Estate Data using Quantile Regression

Gaussian processes for spatial modelling in environmental health: parameterizing for flexibility vs. computational efficiency

Lecture 23. Spatio-temporal Models. Colin Rundel 04/17/2017

Hierarchical Linear Models

Econ 2148, fall 2017 Gaussian process priors, reproducing kernel Hilbert spaces, and Splines

Bayesian linear regression

Bayesian SAE using Complex Survey Data Lecture 4A: Hierarchical Spatial Bayes Modeling

Bruno Sansó. Department of Applied Mathematics and Statistics University of California Santa Cruz bruno

Approaches for Multiple Disease Mapping: MCAR and SANOVA

Comparing Non-informative Priors for Estimation and Prediction in Spatial Models

ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS

A short introduction to INLA and R-INLA

Model Selection for Geostatistical Models

Principles of Bayesian Inference

Point process with spatio-temporal heterogeneity

Default Priors and Effcient Posterior Computation in Bayesian

Bayesian Dynamic Modeling for Space-time Data in R

variability of the model, represented by σ 2 and not accounted for by Xβ

spbayes: an R package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models

Linear Regression. Data Model. β, σ 2. Process Model. ,V β. ,s 2. s 1. Parameter Model

Gaussian Processes 1. Schedule

Analysing geoadditive regression data: a mixed model approach

BAYESIAN KRIGING AND BAYESIAN NETWORK DESIGN

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012

Web Appendices: Hierarchical and Joint Site-Edge Methods for Medicare Hospice Service Region Boundary Analysis

CPSC 540: Machine Learning

Bayesian Inference. Chapter 4: Regression and Hierarchical Models

1 Data Arrays and Decompositions

Gibbs Sampling in Linear Models #2

Linear Methods for Prediction

Bayesian Linear Models

Generalized Linear Models. Kurt Hornik

Statistics for analyzing and modeling precipitation isotope ratios in IsoMAP

Transcription:

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 1/4 Hierarchical Modelling for Univariate and Multivariate Spatial Data Sudipto Banerjee sudiptob@biostat.umn.edu University of Minnesota

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 2/4 Introduction Researchers in diverse areas such as climatology, ecology, environmental health, and real estate marketing are increasingly faced with the task of analyzing data that are:

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 2/4 Introduction Researchers in diverse areas such as climatology, ecology, environmental health, and real estate marketing are increasingly faced with the task of analyzing data that are: highly multivariate, with many important predictors and response variables,

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 2/4 Introduction Researchers in diverse areas such as climatology, ecology, environmental health, and real estate marketing are increasingly faced with the task of analyzing data that are: highly multivariate, with many important predictors and response variables, geographically referenced, and often presented as maps, and

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 2/4 Introduction Researchers in diverse areas such as climatology, ecology, environmental health, and real estate marketing are increasingly faced with the task of analyzing data that are: highly multivariate, with many important predictors and response variables, geographically referenced, and often presented as maps, and temporally correlated, as in longitudinal or other time series structures.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 2/4 Introduction Researchers in diverse areas such as climatology, ecology, environmental health, and real estate marketing are increasingly faced with the task of analyzing data that are: highly multivariate, with many important predictors and response variables, geographically referenced, and often presented as maps, and temporally correlated, as in longitudinal or other time series structures. motivates hierarchical modeling and data analysis for complex spatial (and spatiotemporal) data sets.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 3/4 Spatial Domain y 0 2 4 6 8 10 0 2 4 6 8 10 x

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 4/4 Algorithmic Modelling Spatial surface observed at finite set of locations S = {s 1,s 2,...,s n } Tessellate the spatial domain (usually with data locations as vertices) Fit an interpolating polynomial: f(s) = i w i (S;s)f(s i ) Interpolate by reading off f(s 0 ). Issues: Sensitivity to tessellations Choices of multivariate interpolators Numerical error analysis

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 5/4 What is a spatial process? x x Y(s 1 ) Y(s 2 ) Y(s n ) x x

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 6/4 Mean, Covariance and Error Spatial model: Y (s) = µ(s) + w(s) + ǫ(s) Mean: µ(s) = x T (s)β Covariance: w(s) GP(0,σ 2 ρ( )) Cov(w(s),w(s )) = σ 2 ρ(φ; s s ) Error: ǫ(s) iid N(0,τ 2 ) For S = {s 1,...,s n } let w = [w(s i )]. Then w MV N(0,σ 2 H(φ)); H(φ) = [ρ(φ; s i s j )]

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 7/4 The exponential decay model H(φ) must be p.d. ρ( ) must be somewhat special ρ(φ;t) = E X [e iφtx ], X is symmetric about 0.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 7/4 The exponential decay model H(φ) must be p.d. ρ( ) must be somewhat special ρ(φ;t) = E X [e iφtx ], X is symmetric about 0. Example: exponential decay model ρ(φ;t) = exp( φt) if t 0,

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 7/4 The exponential decay model H(φ) must be p.d. ρ( ) must be somewhat special ρ(φ;t) = E X [e iφtx ], X is symmetric about 0. Example: exponential decay model ρ(φ;t) = exp( φt) if t 0, Effective range: t 0 = arg ρ(φ;t) = 0.05 3/φ.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 7/4 The exponential decay model H(φ) must be p.d. ρ( ) must be somewhat special ρ(φ;t) = E X [e iφtx ], X is symmetric about 0. Example: exponential decay model ρ(φ;t) = exp( φt) if t 0, Effective range: t 0 = arg ρ(φ;t) = 0.05 3/φ. Spatial variance components: The nugget τ 2 is a nonspatial effect variance The partial sill (σ 2 ) is a spatial effect variance. The sill is σ 2 + τ 2.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 8/4 Hierarchical modelling First stage: y β, w,τ 2 N(Xβ + w,τ 2 I)

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 8/4 Hierarchical modelling First stage: y β, w,τ 2 N(Xβ + w,τ 2 I) Second stage: w σ 2,φ N(0,σ 2 H(φ))

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 8/4 Hierarchical modelling First stage: y β, w,τ 2 N(Xβ + w,τ 2 I) Second stage: w σ 2,φ N(0,σ 2 H(φ)) Third stage: Priors on Ω = (β,τ 2,σ 2,φ).

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 8/4 Hierarchical modelling First stage: y β, w,τ 2 N(Xβ + w,τ 2 I) Second stage: w σ 2,φ N(0,σ 2 H(φ)) Third stage: Priors on Ω = (β,τ 2,σ 2,φ). Marginalized likelihood: y Ω N(Xβ,σ 2 H(φ) + τ 2 I)

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 8/4 Hierarchical modelling First stage: y β, w,τ 2 N(Xβ + w,τ 2 I) Second stage: w σ 2,φ N(0,σ 2 H(φ)) Third stage: Priors on Ω = (β,τ 2,σ 2,φ). Marginalized likelihood: y Ω N(Xβ,σ 2 H(φ) + τ 2 I) Note: Spatial process parametrizes Σ: y = Xβ + ǫ, ǫ N (0, Σ), Σ = σ 2 H(φ) + τ 2 I

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 9/4 Bayesian Computations Choice: Fit [y Ω] [Ω] or [y β, w,τ 2 ] [w σ 2,φ] [Ω].

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 9/4 Bayesian Computations Choice: Fit [y Ω] [Ω] or [y β, w,τ 2 ] [w σ 2,φ] [Ω]. Conditional model: conjugate full conditionals for σ 2, τ 2 and w. Easier to program.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 9/4 Bayesian Computations Choice: Fit [y Ω] [Ω] or [y β, w,τ 2 ] [w σ 2,φ] [Ω]. Conditional model: conjugate full conditionals for σ 2, τ 2 and w. Easier to program. Marginalized model: need Metropolis or Slice sampling for σ 2, τ 2 and φ. Harder to program. But, reduced parameter space faster convergence σ 2 H(φ) + τ 2 I is more stable than σ 2 H(φ).

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 9/4 Bayesian Computations Choice: Fit [y Ω] [Ω] or [y β, w,τ 2 ] [w σ 2,φ] [Ω]. Conditional model: conjugate full conditionals for σ 2, τ 2 and w. Easier to program. Marginalized model: need Metropolis or Slice sampling for σ 2, τ 2 and φ. Harder to program. But, reduced parameter space faster convergence σ 2 H(φ) + τ 2 I is more stable than σ 2 H(φ). But what about H 1 (φ)?? EXPENSIVE!.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 10/4 Where are the w s? Interest often lies in the spatial surface w y.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 10/4 Where are the w s? Interest often lies in the spatial surface w y. They are recovered from [w y,x] = [w Ω, y,x] [Ω y,x]dω using posterior samples:

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 10/4 Where are the w s? Interest often lies in the spatial surface w y. They are recovered from [w y,x] = [w Ω, y,x] [Ω y,x]dω using posterior samples: Obtain Ω (1),...,Ω (G) [Ω y,x]

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 10/4 Where are the w s? Interest often lies in the spatial surface w y. They are recovered from [w y,x] = [w Ω, y,x] [Ω y,x]dω using posterior samples: Obtain Ω (1),...,Ω (G) [Ω y,x] For each Ω (g), draw w (g) [w Ω (g), y,x]

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 10/4 Where are the w s? Interest often lies in the spatial surface w y. They are recovered from [w y,x] = [w Ω, y,x] [Ω y,x]dω using posterior samples: Obtain Ω (1),...,Ω (G) [Ω y,x] For each Ω (g), draw w (g) [w Ω (g), y,x] NOTE: With Gaussian likelihoods [w Ω, y, X] is also Gaussian. With other likelihoods this may not be easy and often the conditional updating scheme is preferred.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 11/4 Residual plot: [w(s) y] w mean of Y Easting Northing

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 12/4 Another look: [w(s) y] w Easting Northing

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 13/4 Another look: [w(s) y] w Easting Northing

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 14/4 Spatial Prediction Often we need to predict Y (s) at a new set of locations { s 0,..., s m } with associated predictor matrix X. Sample from predictive distribution: [ỹ y,x, X] = [ỹ, Ω y,x, X]dΩ = [ỹ y, Ω,X, X] [Ω y,x]dω, [ỹ y, Ω,X, X] is multivariate normal. Sampling scheme: Obtain Ω (1),...,Ω (G) [Ω y,x] For each Ω (g), draw ỹ (g) [ỹ y, Ω (g),x, X].

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 15/4 Prediction: Summary of [Y (s) y] y Easting Northing

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 16/4 Colorado data illustration Modelling temperature: 507 locations in Colorado. Simple spatial regression model: Y (s) = x T (s)β + w(s) + ǫ(s) w(s) GP(0,σ 2 ρ( ;φ,ν)); ǫ(s) iid N(0,τ 2 ) Parameters 50% (2.5%,97.5%) Intercept 2.827 (2.131,3.866) [Elevation] -0.426 (-0.527,-0.333) Precipitation 0.037 (0.002,0.072) σ 2 0.134 (0.051, 1.245) φ 7.39E-3 (4.71E-3, 51.21E-3) Range 278.2 (38.8, 476.3) τ 2 0.051 (0.022, 0.092)

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 17/4 Temperature residual map Northing 4100 4200 4300 4400 4500 300 200 100 0 100 200 300 Easting

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 18/4 Elevation map Latitude 37 38 39 40 41 108 106 104 102 Longitude

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 19/4 Residual map with elev. as covariate Northing 4100 4200 4300 4400 4500 300 200 100 0 100 200 300 Easting

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 20/4 Spatial Generalized Linear Models Often data sets preclude Gaussian modelling: Y (s) may not even be continuous

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 20/4 Spatial Generalized Linear Models Often data sets preclude Gaussian modelling: Y (s) may not even be continuous Example: Y (s) is a binary or count variable

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 20/4 Spatial Generalized Linear Models Often data sets preclude Gaussian modelling: Y (s) may not even be continuous Example: Y (s) is a binary or count variable rainfall was measurable or not

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 20/4 Spatial Generalized Linear Models Often data sets preclude Gaussian modelling: Y (s) may not even be continuous Example: Y (s) is a binary or count variable rainfall was measurable or not number of insurance claims by residents of a single family home at s

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 20/4 Spatial Generalized Linear Models Often data sets preclude Gaussian modelling: Y (s) may not even be continuous Example: Y (s) is a binary or count variable rainfall was measurable or not number of insurance claims by residents of a single family home at s price is high or low for home at location s

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 20/4 Spatial Generalized Linear Models Often data sets preclude Gaussian modelling: Y (s) may not even be continuous Example: Y (s) is a binary or count variable rainfall was measurable or not number of insurance claims by residents of a single family home at s price is high or low for home at location s Replace Gaussian likelihood by exponential family member

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 20/4 Spatial Generalized Linear Models Often data sets preclude Gaussian modelling: Y (s) may not even be continuous Example: Y (s) is a binary or count variable rainfall was measurable or not number of insurance claims by residents of a single family home at s price is high or low for home at location s Replace Gaussian likelihood by exponential family member Diggle Tawn and Moyeed (1998)

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 21/4 Spatial GLM (contd.) First stage: Y (s i ) are conditionally independent given β and w(s i ), so f(y(s i ) β,w(s i ),γ) equals h(y(s i ),γ) exp (γ[y(s i )η(s i ) ψ(η(s i ))]) where g(e(y (s i ))) = η(s i ) = x T (s i )β + w(s i ) (canonical link function) and γ is a dispersion parameter.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 21/4 Spatial GLM (contd.) First stage: Y (s i ) are conditionally independent given β and w(s i ), so f(y(s i ) β,w(s i ),γ) equals h(y(s i ),γ) exp (γ[y(s i )η(s i ) ψ(η(s i ))]) where g(e(y (s i ))) = η(s i ) = x T (s i )β + w(s i ) (canonical link function) and γ is a dispersion parameter. Second stage: Model w(s) as a Gaussian process: w N(0,σ 2 H(φ))

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 21/4 Spatial GLM (contd.) First stage: Y (s i ) are conditionally independent given β and w(s i ), so f(y(s i ) β,w(s i ),γ) equals h(y(s i ),γ) exp (γ[y(s i )η(s i ) ψ(η(s i ))]) where g(e(y (s i ))) = η(s i ) = x T (s i )β + w(s i ) (canonical link function) and γ is a dispersion parameter. Second stage: Model w(s) as a Gaussian process: w N(0,σ 2 H(φ)) Third stage: Priors and hyperpriors.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 21/4 Spatial GLM (contd.) First stage: Y (s i ) are conditionally independent given β and w(s i ), so f(y(s i ) β,w(s i ),γ) equals h(y(s i ),γ) exp (γ[y(s i )η(s i ) ψ(η(s i ))]) where g(e(y (s i ))) = η(s i ) = x T (s i )β + w(s i ) (canonical link function) and γ is a dispersion parameter. Second stage: Model w(s) as a Gaussian process: w N(0,σ 2 H(φ)) Third stage: Priors and hyperpriors. No process for Y (s), only a valid joint distribution

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 21/4 Spatial GLM (contd.) First stage: Y (s i ) are conditionally independent given β and w(s i ), so f(y(s i ) β,w(s i ),γ) equals h(y(s i ),γ) exp (γ[y(s i )η(s i ) ψ(η(s i ))]) where g(e(y (s i ))) = η(s i ) = x T (s i )β + w(s i ) (canonical link function) and γ is a dispersion parameter. Second stage: Model w(s) as a Gaussian process: w N(0,σ 2 H(φ)) Third stage: Priors and hyperpriors. No process for Y (s), only a valid joint distribution Not sensible to add a pure error term ǫ(s)

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 22/4 Spatial GLM: comments We are modelling with spatial random effects

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 22/4 Spatial GLM: comments We are modelling with spatial random effects Introducing these in the transformed mean encourages means of spatial variables at proximate locations to be close to each other

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 22/4 Spatial GLM: comments We are modelling with spatial random effects Introducing these in the transformed mean encourages means of spatial variables at proximate locations to be close to each other Marginal spatial dependence is induced between, say, Y (s) and Y (s ), but observed Y (s) and Y (s ) need not be close to each other

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 22/4 Spatial GLM: comments We are modelling with spatial random effects Introducing these in the transformed mean encourages means of spatial variables at proximate locations to be close to each other Marginal spatial dependence is induced between, say, Y (s) and Y (s ), but observed Y (s) and Y (s ) need not be close to each other Second stage spatial modelling is attractive for spatial explanation in the mean

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 22/4 Spatial GLM: comments We are modelling with spatial random effects Introducing these in the transformed mean encourages means of spatial variables at proximate locations to be close to each other Marginal spatial dependence is induced between, say, Y (s) and Y (s ), but observed Y (s) and Y (s ) need not be close to each other Second stage spatial modelling is attractive for spatial explanation in the mean First stage spatial modelling more appropriate to encourage proximate observations to be close.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 23/4 Binary spatial regression: Home prices Here we illustrate a non-gaussian model for point-referenced spatial data: Data: Observations are home values (based on recent real estate sales) at 50 locations in Baton Rouge, Louisiana, USA.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 23/4 Binary spatial regression: Home prices Here we illustrate a non-gaussian model for point-referenced spatial data: Data: Observations are home values (based on recent real estate sales) at 50 locations in Baton Rouge, Louisiana, USA. The response Y (s) is a binary variable, with { 1 if price is high (above the median) Y (s) = 0 if price is low (below the median)

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 23/4 Binary spatial regression: Home prices Here we illustrate a non-gaussian model for point-referenced spatial data: Data: Observations are home values (based on recent real estate sales) at 50 locations in Baton Rouge, Louisiana, USA. The response Y (s) is a binary variable, with { 1 if price is high (above the median) Y (s) = 0 if price is low (below the median) Observed covariates include the house s age and total living area

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 24/4 Binary spatial regression: Home prices We fit a generalized linear model where Y (s i ) Bernoulli(p(s i )), logit(p(s i )) = x T (s i )β + w(s i )

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 24/4 Binary spatial regression: Home prices We fit a generalized linear model where Y (s i ) Bernoulli(p(s i )), logit(p(s i )) = x T (s i )β + w(s i ) Assume vague priors for β, a Uniform(0, 10) prior for φ, and an Inverse Gamma(0.1, 0.1) prior for σ 2.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 24/4 Binary spatial regression: Home prices We fit a generalized linear model where Y (s i ) Bernoulli(p(s i )), logit(p(s i )) = x T (s i )β + w(s i ) Assume vague priors for β, a Uniform(0, 10) prior for φ, and an Inverse Gamma(0.1, 0.1) prior for σ 2. The WinBUGS code: for (i in 1:N) { Y[i] dbern(p[i]) logit(p[i]) <- w[i] mu[i] <- beta[1]+beta[2]*livingarea[i]/1000+beta[3]*age[i] } for (i in 1:3) beta[i] dnorm(0.0,0.001) w[1:n] spatial.exp(mu[], x[], y[], spat.prec, phi, 1) phi dunif(0.1,10) spat.prec dgamma(0.1, 0.1) sigmasq <- 1/spat.prec

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 25/4 Image plot of w(s) surface Latitude 30.40 30.45 30.50 30.55 91.15 91.10 91.05 91.00 Longitude

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 25/4 Image plot of w(s) surface Latitude 30.40 30.45 30.50 30.55 91.15 91.10 91.05 91.00 Longitude

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 25/4 Image plot of w(s) surface Latitude 30.40 30.45 30.50 30.55 91.15 91.10 91.05 91.00 Longitude

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 26/4 Posterior parameter estimates Parameter estimates (posterior medians and upper and lower.025 percentiles): Parameter 50% (2.5%, 97.5%) β 1 (intercept) 1.096 ( 4.198, 0.4305) β 2 (living area) 0.659 ( 0.091, 2.254) β 3 (age) 0.009615 ( 0.8653, 0.7235) φ 5.79 (1.236, 9.765) σ 2 1.38 (0.1821, 6.889) The covariate effects are generally uninteresting, though living area seems to have a marginally significant effect on price class.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 27/4 Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 27/4 Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location Examples:

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 27/4 Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location Examples: Environmental monitoring stations yield measurements on ozone, NO, CO, PM 2.5, etc.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 27/4 Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location Examples: Environmental monitoring stations yield measurements on ozone, NO, CO, PM 2.5, etc. In atmospheric modeling at a given site we observe surface temperature, precipitation and wind speed

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 27/4 Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location Examples: Environmental monitoring stations yield measurements on ozone, NO, CO, PM 2.5, etc. In atmospheric modeling at a given site we observe surface temperature, precipitation and wind speed In real estate modeling for an individual property we observe selling price and total rental income

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 27/4 Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location Examples: Environmental monitoring stations yield measurements on ozone, NO, CO, PM 2.5, etc. In atmospheric modeling at a given site we observe surface temperature, precipitation and wind speed In real estate modeling for an individual property we observe selling price and total rental income We anticipate dependence between measurements

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 27/4 Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location Examples: Environmental monitoring stations yield measurements on ozone, NO, CO, PM 2.5, etc. In atmospheric modeling at a given site we observe surface temperature, precipitation and wind speed In real estate modeling for an individual property we observe selling price and total rental income We anticipate dependence between measurements at a particular location

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 27/4 Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location Examples: Environmental monitoring stations yield measurements on ozone, NO, CO, PM 2.5, etc. In atmospheric modeling at a given site we observe surface temperature, precipitation and wind speed In real estate modeling for an individual property we observe selling price and total rental income We anticipate dependence between measurements at a particular location across locations

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 28/4 Multivariate spatial regression Each location contains m spatial regressions Y k (s) = µ k (s) + w k (s) + ǫ k (s), k = 1,...,m. Mean: µ(s) = [µ k (s)] m k=1 = [xt k (s)β k] m k=1 Cov: w(s) = [w k (s)] m k=1 MV GP(0, Γ w(, )) Γ w (s, s ) = [Cov(w k (s),w k (s ))] m k,k =1 Error: ǫ(s) = [ǫ k (s)] m k=1 MV N(0, Ψ) Ψ is an m m p.d. matrix, e.g. usually Diag(τ 2 k )m k=1.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 29/4 Multivariate Gaussian Process w(s) MV GP(0, Γ w ( )) with Γ w (s, s ) = [Cov(w k (s),w k (s ))] m k,k =1 Example: with m = 2 ( Γ w (s, s Cov(w 1 (s),w 1 (s )) Cov(w 1 (s),w 2 (s )) ) = Cov(w 2 (s),w 1 (s )) Cov(w 2 (s),w 2 (s )) ) For finite set of locations S = {s 1,...,s n }: V ar ([w(s i )] n i=1) = Σ w = [Γ w (s i, s j )] n i,j=1

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 30/4 contd. Properties: Γ w (s, s) = Γ T w(s, s ) lim s s Γ w (s, s ) is p.d. and Γ w (s, s) = V ar(w(s)). For sites in any finite collection S = {s 1,...,s n }: n i=1 n j=1 u T i Γ w (s i, s j )u j 0 for all u i, u j R m. Any valid Γ W must satisfy the above conditions. The last property implies that Σ w is p.d. In complete generality: Γ w (s, s ) need not be symmetric. Γ w (s, s ) need not be p.d. for s s.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 31/4 Modelling cross-covariances Moving average or kernel convolution of a process: Let Z(s) GP(0,ρ( )). Use kernels to form: w j (s) = κ j (u)z(s + u)du = κ j (s s )Z(s )ds Γ w (s s ) has (i,j)-th element: [Γ w (s s )] i,j = κ i (s s +u)κ j (u )ρ(u u )dudu Convolution of Covariance Functions: ρ 1,ρ 2,...ρ m are valid covariance functions. Form: [Γ w (s s )] i,j = ρ i (s s t)ρ j (t)dt.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 32/4 Constructive approach Let v k (s) GP(0,ρ k (s, s )), for k = 1,...,m be m independent GP s with unit variance. Form the simple multivariate process v(s) = [v k (s)] m k=1 : v(s) MV GP(0, Γ v (, )) with Γ v (s, s ) = Diag(ρ k (s, s )) m k=1. Assume w(s) = A(s)v(s) arises as a space-varying linear transformation of v(s). Then: Γ w (s, s ) = A(s)Γ v (s, s )A T (s ) is a valid cross-covariance function.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 33/4 contd. When s = s, Γ v (s, s) = I m, so: Γ w (s, s) = A(s)A T (s) A(s) identifies with any square-root of Γ w (s, s). Can be taken as lower-triangular (Cholesky). A(s) is unknown! Should we first model A(s) to obtain Γ w (s, s)? Or should we model Γ w (s, s ) first and derive A(s)? A(s) is completely determined from within-site associations.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 34/4 contd. If A(s) = A: w(s) is stationary when v(s) is. Γ w (s, s ) is symmetric. Γ v (s, s ) = ρ(s, s )I m Γ w = ρ(s, s )AA T Last specification is called intrinsic and leads to separable models: Σ w = H(φ) Λ; Λ = AA T

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 35/4 Hierarchical modelling Let y = [Y(s i )] n i=1 and w = [W(s i)] n i=1.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 35/4 Hierarchical modelling Let y = [Y(s i )] n i=1 and w = [W(s i)] n i=1. First stage: y β, w, Ψ MV N(Xβ + w,i Ψ)

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 35/4 Hierarchical modelling Let y = [Y(s i )] n i=1 and w = [W(s i)] n i=1. First stage: y β, w, Ψ MV N(Xβ + w,i Ψ) Second stage: w θ MV N(0, Σ w (Φ)) where Σ w (Φ) = [Γ W (s i, s j ; Φ)] n i,j=1.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 35/4 Hierarchical modelling Let y = [Y(s i )] n i=1 and w = [W(s i)] n i=1. First stage: y β, w, Ψ MV N(Xβ + w,i Ψ) Second stage: w θ MV N(0, Σ w (Φ)) where Σ w (Φ) = [Γ W (s i, s j ; Φ)] n i,j=1. Third stage: Priors on Ω = (β, Ψ, Φ).

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 35/4 Hierarchical modelling Let y = [Y(s i )] n i=1 and w = [W(s i)] n i=1. First stage: y β, w, Ψ MV N(Xβ + w,i Ψ) Second stage: w θ MV N(0, Σ w (Φ)) where Σ w (Φ) = [Γ W (s i, s j ; Φ)] n i,j=1. Third stage: Priors on Ω = (β, Ψ, Φ). Marginalized likelihood: y β,θ, Ψ MV N(Xβ, Σ w (Φ) + I Ψ)

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 36/4 Bayesian Computations Choice: Fit as [y Ω] [Ω] or as [y β, w, Ψ] [w Φ] [Ω].

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 36/4 Bayesian Computations Choice: Fit as [y Ω] [Ω] or as [y β, w, Ψ] [w Φ] [Ω]. Conditional model: Conjugate distributions are available for Ψ and other variance parameters. Easy to program.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 36/4 Bayesian Computations Choice: Fit as [y Ω] [Ω] or as [y β, w, Ψ] [w Φ] [Ω]. Conditional model: Conjugate distributions are available for Ψ and other variance parameters. Easy to program. Marginalized model: need Metropolis or Slice sampling for most variance-covariance parameters. Harder to program. But reduced parameter space (no w s) results in faster convergence Σ w (Φ) + I Ψ is more stable than Σ w (Φ).

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 36/4 Bayesian Computations Choice: Fit as [y Ω] [Ω] or as [y β, w, Ψ] [w Φ] [Ω]. Conditional model: Conjugate distributions are available for Ψ and other variance parameters. Easy to program. Marginalized model: need Metropolis or Slice sampling for most variance-covariance parameters. Harder to program. But reduced parameter space (no w s) results in faster convergence Σ w (Φ) + I Ψ is more stable than Σ w (Φ). But what about Σ 1 w (Φ)?? Matrix inversion is EXPENSIVE O(n 3 ).

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 37/4 Recovering the w s? Interest often lies in the spatial surface w y.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 37/4 Recovering the w s? Interest often lies in the spatial surface w y. They are recovered from [w y,x] = [w Ω, y,x] [Ω y,x]dω using posterior samples:

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 37/4 Recovering the w s? Interest often lies in the spatial surface w y. They are recovered from [w y,x] = [w Ω, y,x] [Ω y,x]dω using posterior samples: Obtain Ω (1),...,Ω (G) [Ω y,x]

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 37/4 Recovering the w s? Interest often lies in the spatial surface w y. They are recovered from [w y,x] = [w Ω, y,x] [Ω y,x]dω using posterior samples: Obtain Ω (1),...,Ω (G) [Ω y,x] For each Ω (g), draw w (g) [w Ω (g), y,x]

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 37/4 Recovering the w s? Interest often lies in the spatial surface w y. They are recovered from [w y,x] = [w Ω, y,x] [Ω y,x]dω using posterior samples: Obtain Ω (1),...,Ω (G) [Ω y,x] For each Ω (g), draw w (g) [w Ω (g), y,x] NOTE: With Gaussian likelihoods [w Ω, y, X] is also Gaussian. With other likelihoods this may not be easy and often the conditional updating scheme is preferred.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 38/4 Spatial Prediction Often we need to predict Y (s) at a new set of locations { s 0,..., s m } with associated predictor matrix X. Sample from predictive distribution: [ỹ y,x, X] = [ỹ, Ω y,x, X]dΩ = [ỹ y, Ω,X, X] [Ω y,x]dω, [ỹ y, Ω,X, X] is multivariate normal. Sampling scheme: Obtain Ω (1),...,Ω (G) [Ω y,x] For each Ω (g), draw ỹ (g) [ỹ y, Ω (g),x, X].

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 39/4 The Big N problem The multivariate spatial regression model: Y(s) = X T (s)β + w(s) + ǫ(s) with m spatial regressions at each s. Fitting a fully Bayes model over sites S = {s 1,...,s n } involves matrix decompositions and determinants for the mn mn Σ w (Φ). When Φ is estimated this must be done in each MCMC iteration. This can be prohibitive when n is large and is known as the Big-N problem in geostatistics. Approach: Replace w(s) by a process w(s) that will somehow project the model into a smaller subspace.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 40/4 The Kriging Equation Consider knots S = {s 1,...,s p} with p << n. Predict w(s) based upon S at s 0 : w(s 0 ) = E[w(s 0 ) w ] = Cov(w(s 0 ), w )V ar(w ) 1 w ; w = [w(s i)] p i=1, Cov(w(s 0 ), w ) = [Γ w (s 0, s 1; Φ),...,Γ w (s 0, s p; Φ)], V ar(w ) = [Γ w (s i,s j; Φ)] p i,j=1.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 41/4 The Predictive Process For generic s, w(s) MV GP(0, Γ w(, )) Γ w(s, s ) = Cov(w(s), w )V ar(w ) 1 Cov T (w(s), w ). We can write w(s) = Z(s)w Z(s) = Cov(w(s), w )V ar(w ) 1 is an m mp matrix. Process realizations over S = s 1,...,s n : w = [ w(s i )] n i=1 = Zw ; MV N(0, ZΣ 1 w Z T ) with Z = [Z(s i )] n i=1 being mn mp. Realization is singular as soon as n exceeds m.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 42/4 Reducing the dimension We fit the reduced model: Y(s) = X T (s)β + w(s) + ǫ(s) = X T (s)β + Z(s)w + ǫ(s). Computations involve Σ w (mp mp) instead of Σ w (np np). Can make a huge difference with p << n. Concerns: How many knots will suffice? How do we choose the knots? Impact of knot selection in practical data analysis?

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 43/4 Attractions Greater flexibility with knots compared to other methods. Does not have to be on a regular grid as for spectral methods and other likelihood approximation methods.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 43/4 Attractions Greater flexibility with knots compared to other methods. Does not have to be on a regular grid as for spectral methods and other likelihood approximation methods. Derives from the original process. No need to specify kernels or other covariance functions.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 43/4 Attractions Greater flexibility with knots compared to other methods. Does not have to be on a regular grid as for spectral methods and other likelihood approximation methods. Derives from the original process. No need to specify kernels or other covariance functions. Adapts seamlessly to spatiotemporal and other settings helps being a model rather than an approximation!

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 43/4 Attractions Greater flexibility with knots compared to other methods. Does not have to be on a regular grid as for spectral methods and other likelihood approximation methods. Derives from the original process. No need to specify kernels or other covariance functions. Adapts seamlessly to spatiotemporal and other settings helps being a model rather than an approximation! Given S you cannot do better in a (reverse entropy) Kullback-Leibler paradigm with any process of the form H(s)w.

Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 43/4 Attractions Greater flexibility with knots compared to other methods. Does not have to be on a regular grid as for spectral methods and other likelihood approximation methods. Derives from the original process. No need to specify kernels or other covariance functions. Adapts seamlessly to spatiotemporal and other settings helps being a model rather than an approximation! Given S you cannot do better in a (reverse entropy) Kullback-Leibler paradigm with any process of the form H(s)w. When S = S we recover the original model.