Multivariate spatial modeling

Similar documents
Hierarchical Modelling for Univariate and Multivariate Spatial Data

Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Mela. P.

Hierarchical Modelling for Multivariate Spatial Data

Hierarchical Modeling for Multivariate Spatial Data

Models for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data

Nearest Neighbor Gaussian Processes for Large Spatial Data

Approaches for Multiple Disease Mapping: MCAR and SANOVA

Modelling Multivariate Spatial Data

Bayesian Areal Wombling for Geographic Boundary Analysis

Hierarchical Modeling for Univariate Spatial Data

Analysis of Marked Point Patterns with Spatial and Non-spatial Covariate Information

Hierarchical Modelling for Univariate Spatial Data

Hierarchical Modeling for Spatio-temporal Data

Multivariate Spatial Process Models. Alan E. Gelfand and Sudipto Banerjee

Bayesian Linear Regression

Spatio-Temporal Threshold Models for Relating UV Exposures and Skin Cancer in the Central United States

Gaussian predictive process models for large spatial data sets.

Multivariate Survival Analysis

On Gaussian Process Models for High-Dimensional Geostatistical Datasets

Gibbs Sampling in Linear Models #2

Generalized logit models for nominal multinomial responses. Local odds ratios

Areal data models. Spatial smoothers. Brook s Lemma and Gibbs distribution. CAR models Gaussian case Non-Gaussian case

Hierarchical Modeling for Spatial Data

Modelling geoadditive survival data

Areal Unit Data Regular or Irregular Grids or Lattices Large Point-referenced Datasets

Bayesian Linear Models

Marginal Survival Modeling through Spatial Copulas

Bayesian data analysis in practice: Three simple examples

CTDL-Positive Stable Frailty Model

Example using R: Heart Valves Study

Principles of Bayesian Inference

Stat260: Bayesian Modeling and Inference Lecture Date: February 10th, Jeffreys priors. exp 1 ) p 2

Chapter 4 - Fundamentals of spatial processes Lecture notes

Package SimSCRPiecewise

Part 6: Multivariate Normal and Linear Models

Hierarchical Modelling for Univariate Spatial Data

Spatial Survival Analysis via Copulas

Multivariate Normal & Wishart

The linear model is the most fundamental of all serious statistical models encompassing:

Bayesian Linear Models

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Lecture 3. Truncation, length-bias and prevalence sampling

Introduction to Geostatistics

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Hypothesis Testing. Econ 690. Purdue University. Justin L. Tobias (Purdue) Testing 1 / 33

MCMC algorithms for fitting Bayesian models

Bayesian spatial quantile regression

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian Linear Models

PROBABILITY DISTRIBUTIONS. J. Elder CSE 6390/PSYC 6225 Computational Modeling of Visual Perception

Hierarchical Modeling for non-gaussian Spatial Data

Motivation Scale Mixutres of Normals Finite Gaussian Mixtures Skew-Normal Models. Mixture Models. Econ 690. Purdue University

Hierarchical Linear Models

CBMS Lecture 1. Alan E. Gelfand Duke University

Basics of Point-Referenced Data Models

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Practical considerations for survival models

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California

Hierarchical models. Dr. Jarad Niemi. August 31, Iowa State University. Jarad Niemi (Iowa State) Hierarchical models August 31, / 31

Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets

Frailty Modeling for clustered survival data: a simulation study

Spatial Misalignment

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang

Statistical Inference and Methods

Cross-covariance Functions for Tangent Vector Fields on the Sphere

Bayesian linear regression

Part 7: Hierarchical Modeling

Modeling Real Estate Data using Quantile Regression

Principles of Bayesian Inference

Local Likelihood Bayesian Cluster Modeling for small area health data. Andrew Lawson Arnold School of Public Health University of South Carolina

Longitudinal breast density as a marker of breast cancer risk

Reliability Monitoring Using Log Gaussian Process Regression

Introduction to Spatial Data and Models

Markov Chain Monte Carlo (MCMC)

STA 4273H: Statistical Machine Learning

Lecture 5 Models and methods for recurrent event data

Pattern Recognition and Machine Learning. Bishop Chapter 2: Probability Distributions

Spatial Smoothing in Stan: Conditional Auto-Regressive Models

Bayesian Inference for Regression Parameters

Making rating curves - the Bayesian approach

Sampling bias in logistic models

Bayesian inference & process convolution models Dave Higdon, Statistical Sciences Group, LANL

spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models

Disease mapping with Gaussian processes

Using Estimating Equations for Spatially Correlated A

Hierarchical Modelling for non-gaussian Spatial Data

A Fully Nonparametric Modeling Approach to. BNP Binary Regression

Bayesian Multivariate Logistic Regression

Metropolis-Hastings Algorithm

Physician Performance Assessment / Spatial Inference of Pollutant Concentrations

Markov Random Fields

Introduction to Spatial Data and Models

Bayesian Inference in a Normal Population

Part 8: GLMs and Hierarchical LMs and GLMs

Hierarchical Modeling and Analysis for Spatial Data

Point process with spatio-temporal heterogeneity

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 2: PROBABILITY DISTRIBUTIONS

Generalized common spatial factor model

GAUSSIAN PROCESS REGRESSION

Bayesian Hierarchical Models

Transcription:

Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location Chapter 7: Multivariate Spatial Modeling p. 1/21

Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location Examples: Chapter 7: Multivariate Spatial Modeling p. 1/21

Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location Examples: Environmental monitoring stations yield measurements on ozone, NO, CO, PM 2.5, etc. Chapter 7: Multivariate Spatial Modeling p. 1/21

Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location Examples: Environmental monitoring stations yield measurements on ozone, NO, CO, PM 2.5, etc. In atmospheric modeling at a given site we observe surface temperature, precipitation and wind speed Chapter 7: Multivariate Spatial Modeling p. 1/21

Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location Examples: Environmental monitoring stations yield measurements on ozone, NO, CO, PM 2.5, etc. In atmospheric modeling at a given site we observe surface temperature, precipitation and wind speed In real estate modeling for an individual property we observe selling price and total rental income Chapter 7: Multivariate Spatial Modeling p. 1/21

Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location Examples: Environmental monitoring stations yield measurements on ozone, NO, CO, PM 2.5, etc. In atmospheric modeling at a given site we observe surface temperature, precipitation and wind speed In real estate modeling for an individual property we observe selling price and total rental income We anticipate dependence between measurements Chapter 7: Multivariate Spatial Modeling p. 1/21

Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location Examples: Environmental monitoring stations yield measurements on ozone, NO, CO, PM 2.5, etc. In atmospheric modeling at a given site we observe surface temperature, precipitation and wind speed In real estate modeling for an individual property we observe selling price and total rental income We anticipate dependence between measurements at a particular location Chapter 7: Multivariate Spatial Modeling p. 1/21

Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location Examples: Environmental monitoring stations yield measurements on ozone, NO, CO, PM 2.5, etc. In atmospheric modeling at a given site we observe surface temperature, precipitation and wind speed In real estate modeling for an individual property we observe selling price and total rental income We anticipate dependence between measurements at a particular location across locations Chapter 7: Multivariate Spatial Modeling p. 1/21

Basic issues Y(s) denotes a p 1 vector of random variables at s Chapter 7: Multivariate Spatial Modeling p. 2/21

Basic issues Y(s) denotes a p 1 vector of random variables at s We seek to model Y(s) : s D, again specifying a finite dimensional distribution for Y = (Y(s 1 ),...,Y(s n )) Chapter 7: Multivariate Spatial Modeling p. 2/21

Basic issues Y(s) denotes a p 1 vector of random variables at s We seek to model Y(s) : s D, again specifying a finite dimensional distribution for Y = (Y(s 1 ),...,Y(s n )) Crucial object: the cross-covariance, C(s, s ) = Cov(Y(s), Y(s )) a p p matrix that need not be symmetric, i.e., cov(y j (s),y j (s )) need not equal cov(y j (s),y j (s )) Chapter 7: Multivariate Spatial Modeling p. 2/21

Basic issues Y(s) denotes a p 1 vector of random variables at s We seek to model Y(s) : s D, again specifying a finite dimensional distribution for Y = (Y(s 1 ),...,Y(s n )) Crucial object: the cross-covariance, C(s, s ) = Cov(Y(s), Y(s )) a p p matrix that need not be symmetric, i.e., cov(y j (s),y j (s )) need not equal cov(y j (s),y j (s )) C(s, s ) is not positive definite except in a limiting sense: C(s, s) is the covariance matrix associated with Y(s). Chapter 7: Multivariate Spatial Modeling p. 2/21

Basic issues Y(s) denotes a p 1 vector of random variables at s We seek to model Y(s) : s D, again specifying a finite dimensional distribution for Y = (Y(s 1 ),...,Y(s n )) Crucial object: the cross-covariance, C(s, s ) = Cov(Y(s), Y(s )) a p p matrix that need not be symmetric, i.e., cov(y j (s),y j (s )) need not equal cov(y j (s),y j (s )) C(s, s ) is not positive definite except in a limiting sense: C(s, s) is the covariance matrix associated with Y(s). our primary focus: Gaussian processes and valid specification for C(s, s ) Chapter 7: Multivariate Spatial Modeling p. 2/21

Separable models A popular specification is the separable model C(s, s ) = ρ(s, s ) T where ρ is a valid (univariate) correlation function and T is a p p positive definite matrix Chapter 7: Multivariate Spatial Modeling p. 3/21

Separable models A popular specification is the separable model C(s, s ) = ρ(s, s ) T where ρ is a valid (univariate) correlation function and T is a p p positive definite matrix T is the non-spatial or local" covariance matrix Chapter 7: Multivariate Spatial Modeling p. 3/21

Separable models A popular specification is the separable model C(s, s ) = ρ(s, s ) T where ρ is a valid (univariate) correlation function and T is a p p positive definite matrix T is the non-spatial or local" covariance matrix ρ controls spatial association based upon proximity Chapter 7: Multivariate Spatial Modeling p. 3/21

Separable models A popular specification is the separable model C(s, s ) = ρ(s, s ) T where ρ is a valid (univariate) correlation function and T is a p p positive definite matrix T is the non-spatial or local" covariance matrix ρ controls spatial association based upon proximity Easy to verify that Σ Y = H T, where H ij = ρ(s i, s j ) and is the Kronecker product. Chapter 7: Multivariate Spatial Modeling p. 3/21

Separable models A popular specification is the separable model C(s, s ) = ρ(s, s ) T where ρ is a valid (univariate) correlation function and T is a p p positive definite matrix T is the non-spatial or local" covariance matrix ρ controls spatial association based upon proximity Easy to verify that Σ Y = H T, where H ij = ρ(s i, s j ) and is the Kronecker product. Σ Y is positive definite since H and T are Chapter 7: Multivariate Spatial Modeling p. 3/21

Separable models A popular specification is the separable model C(s, s ) = ρ(s, s ) T where ρ is a valid (univariate) correlation function and T is a p p positive definite matrix T is the non-spatial or local" covariance matrix ρ controls spatial association based upon proximity Easy to verify that Σ Y = H T, where H ij = ρ(s i, s j ) and is the Kronecker product. Σ Y is positive definite since H and T are Σ Y is convenient since Σ Y = H p T n and Σ 1 Y = H 1 T 1. Chapter 7: Multivariate Spatial Modeling p. 3/21

Application: Bivariate spatial regression A single covariate X(s) and a univariate response Y (s) Chapter 7: Multivariate Spatial Modeling p. 4/21

Application: Bivariate spatial regression A single covariate X(s) and a univariate response Y (s) To enable joint estimation and prediction, treat this as a bivariate process, ( ) X(s) Z(s) = N(µ(s),T) Y (s) Chapter 7: Multivariate Spatial Modeling p. 4/21

Application: Bivariate spatial regression A single covariate X(s) and a univariate response Y (s) To enable joint estimation and prediction, treat this as a bivariate process, ( ) X(s) Z(s) = N(µ(s),T) Y (s) Simplifying assumptions: Chapter 7: Multivariate Spatial Modeling p. 4/21

Application: Bivariate spatial regression A single covariate X(s) and a univariate response Y (s) To enable joint estimation and prediction, treat this as a bivariate process, ( ) X(s) Z(s) = N(µ(s),T) Y (s) Simplifying assumptions: separable cross-covariance for Z(s) Chapter 7: Multivariate Spatial Modeling p. 4/21

Application: Bivariate spatial regression A single covariate X(s) and a univariate response Y (s) To enable joint estimation and prediction, treat this as a bivariate process, ( ) X(s) Z(s) = N(µ(s),T) Y (s) Simplifying assumptions: separable cross-covariance for Z(s) µ(s) = (µ 1,µ 2 ) is coordinate-free. Chapter 7: Multivariate Spatial Modeling p. 4/21

Application: Bivariate spatial regression A single covariate X(s) and a univariate response Y (s) To enable joint estimation and prediction, treat this as a bivariate process, ( ) X(s) Z(s) = N(µ(s),T) Y (s) Simplifying assumptions: separable cross-covariance for Z(s) µ(s) = (µ 1,µ 2 ) is coordinate-free. Then p(y (s) X(s)) N(β 0 + β 1 X(s),σ 2 ), where β 0 = µ 2 T 12 T 11 µ 1, β 1 = T 12 T 11, and σ 2 = T 22 T 2 12 T 11 Chapter 7: Multivariate Spatial Modeling p. 4/21

Bivariate spatial regression (cont d) Rearrangement of the components of Z to Z = (X(s 1 ),X(s 2 ),...,X(s n ),Y (s 1 ),Y (s 2 ),...,Y (s n )) yields ( X Y ) N (( µ 1 1 µ 2 1 ), T H (φ) ), Chapter 7: Multivariate Spatial Modeling p. 5/21

Bivariate spatial regression (cont d) Rearrangement of the components of Z to Z = (X(s 1 ),X(s 2 ),...,X(s n ),Y (s 1 ),Y (s 2 ),...,Y (s n )) yields ( X Y ) N (( µ 1 1 µ 2 1 ), T H (φ) Priors: Wishart for T 1, vague but proper normal for (µ 1,µ 2 ), uniform or other suitable choice for φ ), Chapter 7: Multivariate Spatial Modeling p. 5/21

Bivariate spatial regression (cont d) Rearrangement of the components of Z to Z = (X(s 1 ),X(s 2 ),...,X(s n ),Y (s 1 ),Y (s 2 ),...,Y (s n )) yields ( X Y ) N (( µ 1 1 µ 2 1 ), T H (φ) Priors: Wishart for T 1, vague but proper normal for (µ 1,µ 2 ), uniform or other suitable choice for φ Full conditionals for Gibbs sampler: again Wishart for T 1, bivariate normal for (µ 1,µ 2 ); nonconjugate for φ (so need Metropolis sampling here) ), Chapter 7: Multivariate Spatial Modeling p. 5/21

Example with dew-shrub data 1129 locations with UTM coordinates Chapter 7: Multivariate Spatial Modeling p. 6/21

Example with dew-shrub data 1129 locations with UTM coordinates Y (s) : shrub density (% of land cover) at location s Chapter 7: Multivariate Spatial Modeling p. 6/21

Example with dew-shrub data 1129 locations with UTM coordinates Y (s) : shrub density (% of land cover) at location s X(s) : dew duration (minutes from 8 am) at location s Chapter 7: Multivariate Spatial Modeling p. 6/21

Example with dew-shrub data 1129 locations with UTM coordinates Y (s) : shrub density (% of land cover) at location s X(s) : dew duration (minutes from 8 am) at location s Perform an illustrative Bayesian analysis assuming separability and assuming an exponential correlation function, ρ(h;φ) = e φh Chapter 7: Multivariate Spatial Modeling p. 6/21

Example with dew-shrub data 1129 locations with UTM coordinates Y (s) : shrub density (% of land cover) at location s X(s) : dew duration (minutes from 8 am) at location s Perform an illustrative Bayesian analysis assuming separability and assuming an exponential correlation function, ρ(h;φ) = e φh conjugate priors for µ, T as above; prior for φ has infinite variance and suggests a range (3/φ) of 125 km, roughly half the maximum pairwise distance in the region Chapter 7: Multivariate Spatial Modeling p. 6/21

Example with dew-shrub data 1129 locations with UTM coordinates Y (s) : shrub density (% of land cover) at location s X(s) : dew duration (minutes from 8 am) at location s Perform an illustrative Bayesian analysis assuming separability and assuming an exponential correlation function, ρ(h;φ) = e φh conjugate priors for µ, T as above; prior for φ has infinite variance and suggests a range (3/φ) of 125 km, roughly half the maximum pairwise distance in the region (µ 1,µ 2,T 11,T 12,T 22 ) updated directly; φ updated via Metropolis Chapter 7: Multivariate Spatial Modeling p. 6/21

Example with dew-shrub data 1129 locations with UTM coordinates Y (s) : shrub density (% of land cover) at location s X(s) : dew duration (minutes from 8 am) at location s Perform an illustrative Bayesian analysis assuming separability and assuming an exponential correlation function, ρ(h;φ) = e φh conjugate priors for µ, T as above; prior for φ has infinite variance and suggests a range (3/φ) of 125 km, roughly half the maximum pairwise distance in the region (µ 1,µ 2,T 11,T 12,T 22 ) updated directly; φ updated via Metropolis (β 0,β 1,σ 2 ) samples automatically determined as functions of the others Chapter 7: Multivariate Spatial Modeling p. 6/21

Dew and shrub data sites 128100 128150 128200 128250 128300 1031350 1031400 1031450 1031500 1031550 Eastings Northings 128100 128150 128200 128250 128300 1031350 1031400 1031450 1031500 1031550 Chapter 7: Multivariate Spatial Modeling p. 7/21

Parameter estimation, dew-shrub data Parameter 2.5% 50% 97.5% µ 1 73.12 73.89 74.67 µ 2 5.20 5.38 5.572 T 11 95.10 105.22 117.69 T 12 4.46 2.42 0.53 T 22 5.56 6.19 6.91 φ 0.01 0.03 0.21 β 0 5.72 7.08 8.46 β 1 0.04 0.02 0.01 σ 2 5.58 6.22 6.93 T 12 / T 11 T 22 0.17 0.10 0.02 surprising significant negative association between dew duration and shrub density! (also emerges with a nonspatial model) Chapter 7: Multivariate Spatial Modeling p. 8/21

Benefits and limitations of separability Benefits: Easy interpretation (decomposition of variance structure) Substantial computational benefits Chapter 7: Multivariate Spatial Modeling p. 9/21

Benefits and limitations of separability Benefits: Easy interpretation (decomposition of variance structure) Substantial computational benefits Limitations: Symmetry in cross-covariance matrix Imposes same spatial range for every measurement (only one correlation function) Chapter 7: Multivariate Spatial Modeling p. 9/21

Benefits and limitations of separability Benefits: Easy interpretation (decomposition of variance structure) Substantial computational benefits Limitations: Symmetry in cross-covariance matrix Imposes same spatial range for every measurement (only one correlation function) One possible solution: Coregionalization models!... Chapter 7: Multivariate Spatial Modeling p. 9/21

Linear Model of Coregionalization For point referenced data, assume Y(s) = Aw(s) where w(s) = (w 1 (s),w 2 (s),...,w p (s)). Chapter 7: Multivariate Spatial Modeling p. 10/21

Linear Model of Coregionalization For point referenced data, assume Y(s) = Aw(s) where w(s) = (w 1 (s),w 2 (s),...,w p (s)). p independent spatial processes with stationary correlation functions ρ j (s s ),j = 1, 2,...,p Chapter 7: Multivariate Spatial Modeling p. 10/21

Linear Model of Coregionalization For point referenced data, assume Y(s) = Aw(s) where w(s) = (w 1 (s),w 2 (s),...,w p (s)). p independent spatial processes with stationary correlation functions ρ j (s s ),j = 1, 2,...,p If ρ j = ρ for all j separable case with AA = T Chapter 7: Multivariate Spatial Modeling p. 10/21

Linear Model of Coregionalization For point referenced data, assume Y(s) = Aw(s) where w(s) = (w 1 (s),w 2 (s),...,w p (s)). p independent spatial processes with stationary correlation functions ρ j (s s ),j = 1, 2,...,p If ρ j = ρ for all j separable case with AA = T In general, the cross covariance matrix is C(s s ) = p j=1 ρ j (s s )a j a j where a j is the jth column of A. Chapter 7: Multivariate Spatial Modeling p. 10/21

Linear Model of Coregionalization For point referenced data, assume Y(s) = Aw(s) where w(s) = (w 1 (s),w 2 (s),...,w p (s)). p independent spatial processes with stationary correlation functions ρ j (s s ),j = 1, 2,...,p If ρ j = ρ for all j separable case with AA = T In general, the cross covariance matrix is C(s s ) = p j=1 ρ j (s s )a j a j where a j is the jth column of A. Approach is constructive" and hence immediately valid, still stationary, and provides a distinct covariance function for each component Chapter 7: Multivariate Spatial Modeling p. 10/21

Linear Model of Coregionalization More frisky: Y(s) = A(s)w(s) = Spatially varying LMC! Chapter 7: Multivariate Spatial Modeling p. 11/21

Linear Model of Coregionalization More frisky: Y(s) = A(s)w(s) = Spatially varying LMC! model A(s) model T(s) = A(s)A (s) Chapter 7: Multivariate Spatial Modeling p. 11/21

Linear Model of Coregionalization More frisky: Y(s) = A(s)w(s) = Spatially varying LMC! model A(s) model T(s) = A(s)A (s) Possibilities for T(s): Chapter 7: Multivariate Spatial Modeling p. 11/21

Linear Model of Coregionalization More frisky: Y(s) = A(s)w(s) = Spatially varying LMC! model A(s) model T(s) = A(s)A (s) Possibilities for T(s): T(s) = g(x(s)) T Chapter 7: Multivariate Spatial Modeling p. 11/21

Linear Model of Coregionalization More frisky: Y(s) = A(s)w(s) = Spatially varying LMC! model A(s) model T(s) = A(s)A (s) Possibilities for T(s): T(s) = g(x(s)) T T(s) is a spatial process (e.g., T 1 (s) is a spatial Wishart process) Chapter 7: Multivariate Spatial Modeling p. 11/21

Other Approaches Moving average or kernel convolution of a process: Y j (s) = k j (u)z(s + u)du = k j (s s )Z(s )ds where Z(s) is a univariate spatial process and k j are kernel functions, j = 1, 2,...,p. Yields the cross covariance C ij (s s ) = k i (s s + u)k j (u )ρ(u u )dudu Chapter 7: Multivariate Spatial Modeling p. 12/21

Other Approaches Moving average or kernel convolution of a process: Y j (s) = k j (u)z(s + u)du = k j (s s )Z(s )ds where Z(s) is a univariate spatial process and k j are kernel functions, j = 1, 2,...,p. Yields the cross covariance C ij (s s ) = k i (s s + u)k j (u )ρ(u u )dudu Convolution of Covariance Functions: Suppose C 1,C 2,...C p are valid covariance functions. Define C ij (s) = C i (s t)c j (t)dt. Then the p p matrix C(s) = {C ij (s)} is a valid cross covariance function. Chapter 7: Multivariate Spatial Modeling p. 12/21

Multivariate Areal Data Models Now areal units (e.g., counties) instead of points Chapter 7: Multivariate Spatial Modeling p. 13/21

Multivariate Areal Data Models Now areal units (e.g., counties) instead of points Need to model dependence within and across units Chapter 7: Multivariate Spatial Modeling p. 13/21

Multivariate Areal Data Models Now areal units (e.g., counties) instead of points Need to model dependence within and across units As in univariate case, often handled using spatial random effects φ ij, where again i = 1,...,n indexes region but now j = 1,...,p indexes variable (e.g., cancer type) within region Chapter 7: Multivariate Spatial Modeling p. 13/21

Multivariate Areal Data Models Now areal units (e.g., counties) instead of points Need to model dependence within and across units As in univariate case, often handled using spatial random effects φ ij, where again i = 1,...,n indexes region but now j = 1,...,p indexes variable (e.g., cancer type) within region Suppose we observe Y i = (Y i1,y i2,...y ip ), Then g(e(y ij )) = x T ij β j + φ ij, where φ i = (φ i1,...,φ ip ) and φ = (φ 1,...,φ n ). Link function g useful for modeling rate (e.g., Poisson) or survival (e.g., Weibull or Cox model) data. Chapter 7: Multivariate Spatial Modeling p. 13/21

Univariate vs. Multivariate Modeling We can combine CAR and traditional models to handle a broad range of problems. Chapter 7: Multivariate Spatial Modeling p. 14/21

Univariate vs. Multivariate Modeling We can combine CAR and traditional models to handle a broad range of problems. Example: For breast cancer control model in county i, let Y 1i = observed age-adjusted mortality rate / 100,000 Y 2i = observed age-adjusted incidence rate / 100,000 Y 3i = observed percent of late diagnoses Y 4i = % of surveyed women over the age of 40 who have not had a mammogram in two years Chapter 7: Multivariate Spatial Modeling p. 14/21

Univariate vs. Multivariate Modeling We can combine CAR and traditional models to handle a broad range of problems. Example: For breast cancer control model in county i, let Y 1i = observed age-adjusted mortality rate / 100,000 Y 2i = observed age-adjusted incidence rate / 100,000 Y 3i = observed percent of late diagnoses Y 4i = % of surveyed women over the age of 40 who have not had a mammogram in two years ind Assume Y ki N(θ ki,σki 2 ), k = 1,...,4, i = 1,...,I = 87, and set σki 2 = σ2 k /n ki, where n ki is the number of persons at risk for event k in county i. Chapter 7: Multivariate Spatial Modeling p. 14/21

Univariate vs. Multivariate Modeling Writing θ k = (θ k1,...,θ ki ), let θ k CAR(λ k ). Chapter 7: Multivariate Spatial Modeling p. 15/21

Univariate vs. Multivariate Modeling Writing θ k = (θ k1,...,θ ki ), let θ k CAR(λ k ). We then borrow strength across cancer indicators: λ k iid G(a,b) and σ 2 k iid IG(a,b), where G and IG denote the gamma and inverse gamma distributions, respectively. Chapter 7: Multivariate Spatial Modeling p. 15/21

Univariate vs. Multivariate Modeling Writing θ k = (θ k1,...,θ ki ), let θ k CAR(λ k ). We then borrow strength across cancer indicators: λ k iid G(a,b) and σ 2 k iid IG(a,b), where G and IG denote the gamma and inverse gamma distributions, respectively. Parameter of interest: For a set of weights α k such that 4 k=1 α k = 1, form the breast cancer control variable, η i = 4 k=1 α k θ ki. Chapter 7: Multivariate Spatial Modeling p. 15/21

Univariate vs. Multivariate Modeling A sample from the posterior distribution of η = (η 1,...,η I ), p(η y), is easily obtained in WinBUGS. Chapter 7: Multivariate Spatial Modeling p. 16/21

Univariate vs. Multivariate Modeling A sample from the posterior distribution of η = (η 1,...,η I ), p(η y), is easily obtained in WinBUGS. Using α k = 1/4 for all k, compare the posterior medians of the η i with naive averages: Chapter 7: Multivariate Spatial Modeling p. 16/21

Univariate vs. Multivariate Modeling A sample from the posterior distribution of η = (η 1,...,η I ), p(η y), is easily obtained in WinBUGS. Using α k = 1/4 for all k, compare the posterior medians of the η i with naive averages: Note smoothed control variates cover a narrower range, and clarify the overall spatial pattern in the state! Chapter 7: Multivariate Spatial Modeling p. 16/21

Multivariate CAR (MCAR) models Again, local or neighbor idea, conditioning, CAR Two strategies: Chapter 7: Multivariate Spatial Modeling p. 17/21

Multivariate CAR (MCAR) models Again, local or neighbor idea, conditioning, CAR Two strategies: multivariate CAR (MCAR): p(φ i φ j,j i) Chapter 7: Multivariate Spatial Modeling p. 17/21

Multivariate CAR (MCAR) models Again, local or neighbor idea, conditioning, CAR Two strategies: multivariate CAR (MCAR): p(φ i φ j,j i) two-fold CAR: p(φ ij φ (ij) ) Chapter 7: Multivariate Spatial Modeling p. 17/21

Multivariate CAR (MCAR) models Again, local or neighbor idea, conditioning, CAR Two strategies: multivariate CAR (MCAR): p(φ i φ j,j i) two-fold CAR: p(φ ij φ (ij) ) For MCAR, p(φ i φ j i, Σ i ) = N j B ij φ j, Σ i, i = 1,...,n Chapter 7: Multivariate Spatial Modeling p. 17/21

Multivariate CAR (MCAR) models Again, local or neighbor idea, conditioning, CAR Two strategies: multivariate CAR (MCAR): p(φ i φ j,j i) two-fold CAR: p(φ ij φ (ij) ) For MCAR, p(φ i φ j i, Σ i ) = N j B ij φ j, Σ i, i = 1,...,n As earlier, Brook s Lemma yields p(φ), improper, etc. Chapter 7: Multivariate Spatial Modeling p. 17/21

Multivariate CAR (MCAR) models Again, local or neighbor idea, conditioning, CAR Two strategies: multivariate CAR (MCAR): p(φ i φ j,j i) two-fold CAR: p(φ ij φ (ij) ) For MCAR, p(φ i φ j i, Σ i ) = N j B ij φ j, Σ i, i = 1,...,n As earlier, Brook s Lemma yields p(φ), improper, etc. Simplification: B ij = b ij I,b ij = w ij /w i+, Σ i = ( 1 w i+ )Σ Chapter 7: Multivariate Spatial Modeling p. 17/21

Multivariate CAR (MCAR) models Again, local or neighbor idea, conditioning, CAR Two strategies: multivariate CAR (MCAR): p(φ i φ j,j i) two-fold CAR: p(φ ij φ (ij) ) For MCAR, p(φ i φ j i, Σ i ) = N j B ij φ j, Σ i, i = 1,...,n As earlier, Brook s Lemma yields p(φ), improper, etc. Simplification: B ij = b ij I,b ij = w ij /w i+, Σ i = ( 1 w i+ )Σ To make proper, add ρ or perhaps ρ j, j = 1,...,p Chapter 7: Multivariate Spatial Modeling p. 17/21

Application to spatial frailty modeling From the NCI s SEER (Surveillance, Epidemiology, and End Results) database (seer.cancer.gov), we can obtain information on patients suffering from possibly multiple cancers by county for several U.S. states. Chapter 7: Multivariate Spatial Modeling p. 18/21

Application to spatial frailty modeling From the NCI s SEER (Surveillance, Epidemiology, and End Results) database (seer.cancer.gov), we can obtain information on patients suffering from possibly multiple cancers by county for several U.S. states. For instance, cancers of 5 GI-related organs: colorectal, gall bladder, pancreas, small intestine, and stomach. Chapter 7: Multivariate Spatial Modeling p. 18/21

Application to spatial frailty modeling From the NCI s SEER (Surveillance, Epidemiology, and End Results) database (seer.cancer.gov), we can obtain information on patients suffering from possibly multiple cancers by county for several U.S. states. For instance, cancers of 5 GI-related organs: colorectal, gall bladder, pancreas, small intestine, and stomach. t ijk, the time to death/censoring for patient k with primary cancer j in Iowa county i for diagnosis years 1992-98 Chapter 7: Multivariate Spatial Modeling p. 18/21

Application to spatial frailty modeling From the NCI s SEER (Surveillance, Epidemiology, and End Results) database (seer.cancer.gov), we can obtain information on patients suffering from possibly multiple cancers by county for several U.S. states. For instance, cancers of 5 GI-related organs: colorectal, gall bladder, pancreas, small intestine, and stomach. t ijk, the time to death/censoring for patient k with primary cancer j in Iowa county i for diagnosis years 1992-98 δ ijk, the corresponding death indicator Chapter 7: Multivariate Spatial Modeling p. 18/21

Application to spatial frailty modeling From the NCI s SEER (Surveillance, Epidemiology, and End Results) database (seer.cancer.gov), we can obtain information on patients suffering from possibly multiple cancers by county for several U.S. states. For instance, cancers of 5 GI-related organs: colorectal, gall bladder, pancreas, small intestine, and stomach. t ijk, the time to death/censoring for patient k with primary cancer j in Iowa county i for diagnosis years 1992-98 δ ijk, the corresponding death indicator x ijk, a vector of patient-specific covariates Chapter 7: Multivariate Spatial Modeling p. 18/21

Application to spatial frailty modeling From the NCI s SEER (Surveillance, Epidemiology, and End Results) database (seer.cancer.gov), we can obtain information on patients suffering from possibly multiple cancers by county for several U.S. states. For instance, cancers of 5 GI-related organs: colorectal, gall bladder, pancreas, small intestine, and stomach. t ijk, the time to death/censoring for patient k with primary cancer j in Iowa county i for diagnosis years 1992-98 δ ijk, the corresponding death indicator x ijk, a vector of patient-specific covariates z ijk = ( z ijk1,z ijk2,...,z ijkn ) T, where zijkl = 1 if patient ijk suffers from cancer type l, and 0 if not. Chapter 7: Multivariate Spatial Modeling p. 18/21

Application to spatial frailty modeling Hierarchical model: The likelihood is p i=1 n sij { ( )} δijk j=1 k=1 h tijk ; x ijk ( ) ( )} exp { H 0i tijk exp x T ijk β + zt ijk θ + φ ij, where h ( t ijk ; x ijk ) = h0i ( tijk ) exp (x T ijk β + zt ijk θ + φ ij H 0i ( tijk ) = tijk 0 h 0i (u) du. φ i = (φ i1,φ i2,...,φ in ) T ( ) T and Φ = φ T 1,...,φ T p MCAR (α, Λ). ) Chapter 7: Multivariate Spatial Modeling p. 19/21

Application to spatial frailty modeling We take flat priors on β and θ, and model the county-specific baseline hazard semiparametrically Chapter 7: Multivariate Spatial Modeling p. 20/21

Application to spatial frailty modeling We take flat priors on β and θ, and model the county-specific baseline hazard semiparametrically Alternate approach: Random coefficients model: x T ijk β + zt ijk θ i, β flat, but now Θ ( θ T 1,...,θ T p ) T MCAR (α, Λ). Chapter 7: Multivariate Spatial Modeling p. 20/21

Application to spatial frailty modeling We take flat priors on β and θ, and model the county-specific baseline hazard semiparametrically Alternate approach: Random coefficients model: x T ijk β + zt ijk θ i, β flat, but now Θ ( θ T 1,...,θ T p ) T MCAR (α, Λ). Next slide maps the fitted spatially varying coefficients Chapter 7: Multivariate Spatial Modeling p. 20/21

Application to spatial frailty modeling We take flat priors on β and θ, and model the county-specific baseline hazard semiparametrically Alternate approach: Random coefficients model: x T ijk β + zt ijk θ i, β flat, but now Θ ( θ T 1,...,θ T p ) T MCAR (α, Λ). Next slide maps the fitted spatially varying coefficients Not residuals, but the effects of the presence of the primary cancer on the death rate Chapter 7: Multivariate Spatial Modeling p. 20/21

Application to spatial frailty modeling We take flat priors on β and θ, and model the county-specific baseline hazard semiparametrically Alternate approach: Random coefficients model: x T ijk β + zt ijk θ i, β flat, but now Θ ( θ T 1,...,θ T p ) T MCAR (α, Λ). Next slide maps the fitted spatially varying coefficients Not residuals, but the effects of the presence of the primary cancer on the death rate Strong spatial pattern: SW Iowa counties have high fitted values for pancreatic and stomach cancer, while SE counties have high rates of colorectal and small intestinal cancer. Chapter 7: Multivariate Spatial Modeling p. 20/21

Fitted spatial effects, Iowa SEER data Colorectal Cancer Stomach Cancer 0.15 0.20 0.20 0.25 0.25 0.30 0.30-0.35 0.35-0.40 0.40 0.45 0.45-0.50 1.0-1.05 1.05 1.10 1.10 1.15 1.15 1.20 1.20 1.25 1.25 1.30 1.30-1.35 Pancreas Cancer Small Intestine Cancer 1.55 1.60 1.60 1.65 1.65 1.70 1.70 1.75 1.75 1.80 1.80 1.85 1.85-1.90 0.20 0.25 0.25 0.30 0.30 0.35 0.35 0.40 0.40 0.45 0.45 0.50 0.50-0.55 Chapter 7: Multivariate Spatial Modeling p. 21/21