Areal Unit Data Regular or Irregular Grids or Lattices Large Point-referenced Datasets

Similar documents
Areal data models. Spatial smoothers. Brook s Lemma and Gibbs distribution. CAR models Gaussian case Non-Gaussian case

Lecture 18. Models for areal data. Colin Rundel 03/22/2017

Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models

CSC 412 (Lecture 4): Undirected Graphical Models

Asymptotic standard errors of MLE

Summary STK 4150/9150

Spatial Autocorrelation (2) Spatial Weights

Markov Random Fields

Web Appendices: Hierarchical and Joint Site-Edge Methods for Medicare Hospice Service Region Boundary Analysis

Basics of Geographic Analysis in R

Markov Random Fields

Spatial Analysis 2. Spatial Autocorrelation

Lecture 7 Autoregressive Processes in Space

Part 6: Multivariate Normal and Linear Models

Areal data. Infant mortality, Auckland NZ districts. Number of plant species in 20cm x 20 cm patches of alpine tundra. Wheat yield

Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets

Introduction to Spatial Data and Models

Introduction to Graphical Models

Models for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data

Bayesian SAE using Complex Survey Data Lecture 4A: Hierarchical Spatial Bayes Modeling

An Introduction to Spatial Statistics. Chunfeng Huang Department of Statistics, Indiana University

Classification. Chapter Introduction. 6.2 The Bayes classifier

Multivariate spatial modeling

Introduction to Spatial Data and Models

MATH 829: Introduction to Data Mining and Analysis Graphical Models I

Markov random fields. The Markov property

Statistics 352: Spatial statistics. Jonathan Taylor. Department of Statistics. Models for discrete data. Stanford University.

Chapter 3 - Temporal processes

Outline. Overview of Issues. Spatial Regression. Luc Anselin

Introduction. Spatial Processes & Spatial Patterns

MAP Examples. Sargur Srihari

Review: Directed Models (Bayes Nets)

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA

Simultaneous Multi-frame MAP Super-Resolution Video Enhancement using Spatio-temporal Priors

Approaches for Multiple Disease Mapping: MCAR and SANOVA

Probabilistic Graphical Models

Spatial and Environmental Statistics

Bayesian Image Segmentation Using MRF s Combined with Hierarchical Prior Models

STAT 518 Intro Student Presentation

Part 8: GLMs and Hierarchical LMs and GLMs

EXAM IN STATISTICAL MACHINE LEARNING STATISTISK MASKININLÄRNING

Introduction. Semivariogram Cloud

Econometría 2: Análisis de series de Tiempo

Bayesian spatial hierarchical modeling for temperature extremes

Nearest Neighbor Gaussian Processes for Large Spatial Data

MCMC Sampling for Bayesian Inference using L1-type Priors

Bayesian inference for general Gaussian graphical models with application to multivariate lattice data

A spatio-temporal model for extreme precipitation simulated by a climate model

Undirected Graphical Models: Markov Random Fields

Spatial Statistics with Image Analysis. Outline. A Statistical Approach. Johan Lindström 1. Lund October 6, 2016

Point-Referenced Data Models

Spatial autocorrelation: robustness of measures and tests

Massachusetts Institute of Technology Department of Electrical Engineering and Computer Science Algorithms For Inference Fall 2014

Layered Synthesis of Latent Gaussian Trees

Stat260: Bayesian Modeling and Inference Lecture Date: February 10th, Jeffreys priors. exp 1 ) p 2

Chris Bishop s PRML Ch. 8: Graphical Models

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Mela. P.

Spatial Smoothing in Stan: Conditional Auto-Regressive Models

Types of Spatial Data

Bayesian Areal Wombling for Geographic Boundary Analysis

Gaussian Processes 1. Schedule

The Nature of Geographic Data

Dale L. Zimmerman Department of Statistics and Actuarial Science, University of Iowa, USA

Spatial inference. Spatial inference. Accounting for spatial correlation. Multivariate normal distributions

A fast sampler for data simulation from spatial, and other, Markov random fields

A = {(x, u) : 0 u f(x)},

Temporal vs. Spatial Data

ECE 592 Topics in Data Science

Pattern Theory and Its Applications

Machine Learning, Fall 2009: Midterm

CMPE 58K Bayesian Statistics and Machine Learning Lecture 5

Image segmentation combining Markov Random Fields and Dirichlet Processes

Permutation-invariant regularization of large covariance matrices. Liza Levina

Probabilistic Graphical Models

A Bayesian Perspective on Residential Demand Response Using Smart Meter Data

BAYESIAN MODEL FOR SPATIAL DEPENDANCE AND PREDICTION OF TUBERCULOSIS

An Introduction to Pattern Statistics

Hierarchical Modeling for Spatio-temporal Data

An EM algorithm for Gaussian Markov Random Fields

Hidden Markov Models for precipitation

Practicum : Spatial Regression

Chapter 4 - Fundamentals of spatial processes Lecture notes

Spatial Misalignment

Probabilistic Graphical Models Lecture Notes Fall 2009

Global Spatial Autocorrelation Clustering

Modelling geoadditive survival data

Multinomial Data. f(y θ) θ y i. where θ i is the probability that a given trial results in category i, i = 1,..., k. The parameter space is

GIST 4302/5302: Spatial Analysis and Modeling

Part 7: Hierarchical Modeling

Research Division Federal Reserve Bank of St. Louis Working Paper Series

Graphical Models and Independence Models

Introduction to Spatial Statistics and Modeling for Regional Analysis

On prediction and density estimation Peter McCullagh University of Chicago December 2004

Gaussian predictive process models for large spatial data sets.

Switching Regime Estimation

Analysing geoadditive regression data: a mixed model approach

Basics of Point-Referenced Data Models

Nonresponse weighting adjustment using estimated response probability

Markov Chain Monte Carlo

Transcription:

Areal Unit Data Regular or Irregular Grids or Lattices Large Point-referenced Datasets Is there spatial pattern? Chapter 3: Basics of Areal Data Models p. 1/18

Areal Unit Data Regular or Irregular Grids or Lattices Large Point-referenced Datasets Is there spatial pattern? Do we want to smooth the data? Chapter 3: Basics of Areal Data Models p. 1/18

Areal Unit Data Regular or Irregular Grids or Lattices Large Point-referenced Datasets Is there spatial pattern? Do we want to smooth the data? Inference for new areal units? Chapter 3: Basics of Areal Data Models p. 1/18

Areal Unit Data Regular or Irregular Grids or Lattices Large Point-referenced Datasets Is there spatial pattern? Do we want to smooth the data? Inference for new areal units? Descriptive/algorithmic vs. Model-based Chapter 3: Basics of Areal Data Models p. 1/18

Areal unit data Actual Transformed SIDS Rates <2.0 2.0-3.0 3.0-3.5 >3.5 Chapter 3: Basics of Areal Data Models p. 2/18

Proximity matrices W, entries w ij (with w ii = 0). Choices for w ij : w ij = 1 if i,j share a common boundary (possibly a common vertex) w ij is an inverse distance between units w ij = 1 if distance between units is K w ij = 1 for m nearest neighbors. Chapter 3: Basics of Areal Data Models p. 3/18

Proximity matrices W, entries w ij (with w ii = 0). Choices for w ij : w ij = 1 if i,j share a common boundary (possibly a common vertex) w ij is an inverse distance between units w ij = 1 if distance between units is K w ij = 1 for m nearest neighbors. W is typically symmetric, but need not be Chapter 3: Basics of Areal Data Models p. 3/18

Proximity matrices W, entries w ij (with w ii = 0). Choices for w ij : w ij = 1 if i,j share a common boundary (possibly a common vertex) w ij is an inverse distance between units w ij = 1 if distance between units is K w ij = 1 for m nearest neighbors. W is typically symmetric, but need not be W : standardize row i by w i+ = j w ij (so matrix is now row stochastic, but probably no longer symmetric). Chapter 3: Basics of Areal Data Models p. 3/18

Proximity matrices W, entries w ij (with w ii = 0). Choices for w ij : w ij = 1 if i,j share a common boundary (possibly a common vertex) w ij is an inverse distance between units w ij = 1 if distance between units is K w ij = 1 for m nearest neighbors. W is typically symmetric, but need not be W : standardize row i by w i+ = j w ij (so matrix is now row stochastic, but probably no longer symmetric). W elements often called weights ; interpretation Chapter 3: Basics of Areal Data Models p. 3/18

Proximity matrices W, entries w ij (with w ii = 0). Choices for w ij : w ij = 1 if i,j share a common boundary (possibly a common vertex) w ij is an inverse distance between units w ij = 1 if distance between units is K w ij = 1 for m nearest neighbors. W is typically symmetric, but need not be W : standardize row i by w i+ = j w ij (so matrix is now row stochastic, but probably no longer symmetric). W elements often called weights ; interpretation Could also define first-order neighbors W (1), second-order neighbors W (2), etc. Chapter 3: Basics of Areal Data Models p. 3/18

Measures of spatial association Moran s I: essentially an areal covariogram" I = n i j w ij(y i Ȳ )(Y j Ȳ ) ( i j w ij) i (Y i Ȳ )2 Chapter 3: Basics of Areal Data Models p. 4/18

Measures of spatial association Moran s I: essentially an areal covariogram" I = n i j w ij(y i Ȳ )(Y j Ȳ ) ( i j w ij) i (Y i Ȳ )2 Geary s C: essentially an areal variogram" C = (n 1) i j w ij(y i Y j ) 2 ( i j w ij) i (Y i Ȳ )2 Chapter 3: Basics of Areal Data Models p. 4/18

Measures of spatial association Moran s I: essentially an areal covariogram" I = n i j w ij(y i Ȳ )(Y j Ȳ ) ( i j w ij) i (Y i Ȳ )2 Geary s C: essentially an areal variogram" C = (n 1) i j w ij(y i Y j ) 2 ( i j w ij) i (Y i Ȳ )2 Both are asymptotically normal if Y i are i.i.d.; Moran has mean 1/(n 1) 0, Geary has mean 1 Chapter 3: Basics of Areal Data Models p. 4/18

Measures of spatial association Moran s I: essentially an areal covariogram" I = n i j w ij(y i Ȳ )(Y j Ȳ ) ( i j w ij) i (Y i Ȳ )2 Geary s C: essentially an areal variogram" C = (n 1) i j w ij(y i Y j ) 2 ( i j w ij) i (Y i Ȳ )2 Both are asymptotically normal if Y i are i.i.d.; Moran has mean 1/(n 1) 0, Geary has mean 1 Significance testing by comparing to a collection of say 1000 random permutations of the Y i Chapter 3: Basics of Areal Data Models p. 4/18

Measures of spatial association (cont d) <503 504-525 526-562 >563 Figure 1: Choropleth map of 1999 average verbal SAT scores, lower 48 U.S. states. Chapter 3: Basics of Areal Data Models p. 5/18

Measures of spatial association (cont d) For these data, the spatial.cor function in S+SpatialStats gives a Moran s I of 0.5833, with associated standard error estimate 0.0920 very strong evidence against H 0 : no spatial correlation Chapter 3: Basics of Areal Data Models p. 6/18

Measures of spatial association (cont d) For these data, the spatial.cor function in S+SpatialStats gives a Moran s I of 0.5833, with associated standard error estimate 0.0920 very strong evidence against H 0 : no spatial correlation spatial.cor also gives a Geary s C of 0.3775, with associated standard error estimate 0.1008 again, very strong evidence against H 0 (departure from 1) Chapter 3: Basics of Areal Data Models p. 6/18

Measures of spatial association (cont d) For these data, the spatial.cor function in S+SpatialStats gives a Moran s I of 0.5833, with associated standard error estimate 0.0920 very strong evidence against H 0 : no spatial correlation spatial.cor also gives a Geary s C of 0.3775, with associated standard error estimate 0.1008 again, very strong evidence against H 0 (departure from 1) Warning: These data have not been adjusted for covariates, such as the proportion of students who take the exam (Midwestern colleges have historically relied on the ACT, not the SAT; only the best and brightest students in these states would bother taking the SAT) Chapter 3: Basics of Areal Data Models p. 6/18

Measures of spatial association (cont d) For these data, the spatial.cor function in S+SpatialStats gives a Moran s I of 0.5833, with associated standard error estimate 0.0920 very strong evidence against H 0 : no spatial correlation spatial.cor also gives a Geary s C of 0.3775, with associated standard error estimate 0.1008 again, very strong evidence against H 0 (departure from 1) Warning: These data have not been adjusted for covariates, such as the proportion of students who take the exam (Midwestern colleges have historically relied on the ACT, not the SAT; only the best and brightest students in these states would bother taking the SAT) the map, I, and C all motivate the search for spatial covariates! Chapter 3: Basics of Areal Data Models p. 6/18

Correlogram (via Moran s I) rho(d) -1.0-0.5 0.0 0.5 1.0 0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 distance Replace w ij with w (1) ij taken from W (1) I (1) Chapter 3: Basics of Areal Data Models p. 7/18

Correlogram (via Moran s I) rho(d) -1.0-0.5 0.0 0.5 1.0 0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 distance Replace w ij with w (1) ij taken from W (1) I (1) Replace w ij with w (2) ij taken from W (2) I (2), etc. Chapter 3: Basics of Areal Data Models p. 7/18

Correlogram (via Moran s I) rho(d) -1.0-0.5 0.0 0.5 1.0 0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 distance Replace w ij with w (1) ij taken from W (1) I (1) Replace w ij with w (2) ij taken from W (2) I (2), etc. Plot I (r) vs. r; an initial decline in r followed by variation around 0 spatial pattern! Chapter 3: Basics of Areal Data Models p. 7/18

Correlogram (via Moran s I) rho(d) -1.0-0.5 0.0 0.5 1.0 0.0 0.2 0.4 0.6 0.8 1.0 1.2 1.4 distance Replace w ij with w (1) ij taken from W (1) I (1) Replace w ij with w (2) ij taken from W (2) I (2), etc. Plot I (r) vs. r; an initial decline in r followed by variation around 0 spatial pattern! spatial analogue of the temporal lag autocorrelation plot Chapter 3: Basics of Areal Data Models p. 7/18

Rasterized binary data map NORTH SOUTH land use classification non-forest forest Chapter 3: Basics of Areal Data Models p. 8/18

Binary data correlogram logodds 0 1 2 3 N 0 1 2 3 NW 0 50 100 150 200 0 20 40 60 distance 0 1 2 3 NE 0 1 2 3 E 0 20 40 60 0 10 20 30 40 50 A version for a binary map, using two-way tables and log odds ratios at the pixel level Chapter 3: Basics of Areal Data Models p. 9/18

Binary data correlogram logodds 0 1 2 3 N 0 1 2 3 NW 0 50 100 150 200 0 20 40 60 distance 0 1 2 3 NE 0 1 2 3 E 0 20 40 60 0 10 20 30 40 50 A version for a binary map, using two-way tables and log odds ratios at the pixel level Note strongest pattern is to the north (N), but in no direction are the values 0 even at 40 km Chapter 3: Basics of Areal Data Models p. 9/18

Spatial smoothers To smooth Y i, replace with Ŷi = i w ijy j w i+ Chapter 3: Basics of Areal Data Models p. 10/18

Spatial smoothers To smooth Y i, replace with Ŷi = i w ijy j w i+ More generally, we could include the value actually observed for unit i, and revise our smoother to (1 α)y i + αŷi For 0 < α < 1, this is a linear (convex) combination in shrinkage" form Chapter 3: Basics of Areal Data Models p. 10/18

Spatial smoothers To smooth Y i, replace with Ŷi = i w ijy j w i+ More generally, we could include the value actually observed for unit i, and revise our smoother to (1 α)y i + αŷi For 0 < α < 1, this is a linear (convex) combination in shrinkage" form Finally, we could try model-based smoothing, i.e., based on E(Y i Data), i.e., the mean of the predictive distribution. Smoothers then emerge as byproducts of the hierarchical spatial models we use to explain the Y i s Chapter 3: Basics of Areal Data Models p. 10/18

Markov random fields Consider Y = (Y 1,Y 2,...,Y n ) and the set {p(y i y j, j i)} Chapter 3: Basics of Areal Data Models p. 11/18

Markov random fields Consider Y = (Y 1,Y 2,...,Y n ) and the set {p(y i y j, j i)} We know p(y 1,y 2,...y n ) determines {p(y i y j,j i)} (the set of full conditional distributions) Chapter 3: Basics of Areal Data Models p. 11/18

Markov random fields Consider Y = (Y 1,Y 2,...,Y n ) and the set {p(y i y j, j i)} We know p(y 1,y 2,...y n ) determines {p(y i y j,j i)} (the set of full conditional distributions) Does {p(y i y j,j i)} determine p(y 1,y 2,...y n )??? Chapter 3: Basics of Areal Data Models p. 11/18

Markov random fields Consider Y = (Y 1,Y 2,...,Y n ) and the set {p(y i y j, j i)} We know p(y 1,y 2,...y n ) determines {p(y i y j,j i)} (the set of full conditional distributions) Does {p(y i y j,j i)} determine p(y 1,y 2,...y n )??? If the full conditionals are compatible, then by Brook s Lemma, p(y 1,...,y n ) = p(y 1 y 2,...,y n ) p(y 10 y 2,...,y n ) p(y 2 y 10,y 3,...,y n ) p(y 20 y 10,y 3,...,y n )... p(y n y 10,...,y n 1,0 ) p(y n0 y 10,...,y n 1,0 ) p(y 10,...,y n0 ) If left side is proper, the fact that it integrates to 1 determines the normalizing constant! Chapter 3: Basics of Areal Data Models p. 11/18

Local modeling Suppose we specify the full conditionals such that p(y i y j,j i) = p(y i y j i ), where i is the set of neighbors of cell (region) i. When does {p(y i y j i )} determine p(y 1,y 2,...y n )? Chapter 3: Basics of Areal Data Models p. 12/18

Local modeling Suppose we specify the full conditionals such that p(y i y j,j i) = p(y i y j i ), where i is the set of neighbors of cell (region) i. When does {p(y i y j i )} determine p(y 1,y 2,...y n )? Def n: a clique is a set of cells such that each element is a neighbor of every other element Chapter 3: Basics of Areal Data Models p. 12/18

Local modeling Suppose we specify the full conditionals such that p(y i y j,j i) = p(y i y j i ), where i is the set of neighbors of cell (region) i. When does {p(y i y j i )} determine p(y 1,y 2,...y n )? Def n: a clique is a set of cells such that each element is a neighbor of every other element Def n: a potential function of order k is a function of k arguments that is exchangeable in these arguments Chapter 3: Basics of Areal Data Models p. 12/18

Local modeling Suppose we specify the full conditionals such that p(y i y j,j i) = p(y i y j i ), where i is the set of neighbors of cell (region) i. When does {p(y i y j i )} determine p(y 1,y 2,...y n )? Def n: a clique is a set of cells such that each element is a neighbor of every other element Def n: a potential function of order k is a function of k arguments that is exchangeable in these arguments Def n: p(y 1,...,y n ) is a Gibbs distribution if, as a function of the y i, it is a product of potentials on cliques Chapter 3: Basics of Areal Data Models p. 12/18

Local modeling Suppose we specify the full conditionals such that p(y i y j,j i) = p(y i y j i ), where i is the set of neighbors of cell (region) i. When does {p(y i y j i )} determine p(y 1,y 2,...y n )? Def n: a clique is a set of cells such that each element is a neighbor of every other element Def n: a potential function of order k is a function of k arguments that is exchangeable in these arguments Def n: p(y 1,...,y n ) is a Gibbs distribution if, as a function of the y i, it is a product of potentials on cliques Example: For binary data, k = 2, we might take Q(y i,y j ) = I(y i = y j ) = y i y j + (1 y i )(1 y j ) Chapter 3: Basics of Areal Data Models p. 12/18

Local modeling Cliques of size 1 independence Chapter 3: Basics of Areal Data Models p. 13/18

Local modeling Cliques of size 1 independence Cliques of size 2 pairwise difference form p(y 1,y 2,...y n ) exp 1 2τ 2 (y i y j ) 2 I(i j) and therefore p(y i y j,j i) = N( j i y j /m i,τ 2 /m i ), where m i is the number of neighbors of i i,j Chapter 3: Basics of Areal Data Models p. 13/18

Local modeling Cliques of size 1 independence Cliques of size 2 pairwise difference form p(y 1,y 2,...y n ) exp 1 2τ 2 (y i y j ) 2 I(i j) and therefore p(y i y j,j i) = N( j i y j /m i,τ 2 /m i ), where m i is the number of neighbors of i Hammersley-Clifford Theorem: If we have a Markov Random Field (i.e., {p(y i y j i )} uniquely determine p(y 1,y 2,...y n )), then the latter is a Gibbs distribution i,j Chapter 3: Basics of Areal Data Models p. 13/18

Local modeling Cliques of size 1 independence Cliques of size 2 pairwise difference form p(y 1,y 2,...y n ) exp 1 2τ 2 (y i y j ) 2 I(i j) and therefore p(y i y j,j i) = N( j i y j /m i,τ 2 /m i ), where m i is the number of neighbors of i Hammersley-Clifford Theorem: If we have a Markov Random Field (i.e., {p(y i y j i )} uniquely determine p(y 1,y 2,...y n )), then the latter is a Gibbs distribution Geman and Geman result : If we have a joint Gibbs distribution, then we have a Markov Random Field i,j Chapter 3: Basics of Areal Data Models p. 13/18

Conditional autoregressive (CAR) model Gaussian (autonormal) case p(y i y j,j i) = N j b ij y j,τ 2 i Using Brook s Lemma we can obtain { p(y 1,y 2,...y n ) exp 1 } 2 y D 1 (I B)y where B = {b ij } and D is diagonal with D ii = τ 2 i. Chapter 3: Basics of Areal Data Models p. 14/18

Conditional autoregressive (CAR) model Gaussian (autonormal) case p(y i y j,j i) = N j b ij y j,τ 2 i Using Brook s Lemma we can obtain { p(y 1,y 2,...y n ) exp 1 } 2 y D 1 (I B)y where B = {b ij } and D is diagonal with D ii = τ 2 i. suggests a multivariate normal distribution with µ Y = 0 and Σ Y = (I B) 1 D Chapter 3: Basics of Areal Data Models p. 14/18

Conditional autoregressive (CAR) model Gaussian (autonormal) case p(y i y j,j i) = N j b ij y j,τ 2 i Using Brook s Lemma we can obtain { p(y 1,y 2,...y n ) exp 1 } 2 y D 1 (I B)y where B = {b ij } and D is diagonal with D ii = τ 2 i. suggests a multivariate normal distribution with µ Y = 0 and Σ Y = (I B) 1 D D 1 (I B) symmetric requires b ij τ 2 i = b ji τ 2 j for all i,j Chapter 3: Basics of Areal Data Models p. 14/18

CAR Model (cont d) Returning to W, let b ij = w ij /w i+ and τ 2 i = τ 2 /w i+, so p(y 1,y 2,...y n ) exp { 1 } 2τ 2y (D w W)y where D w is diagonal with (D w ) ii = w i+ and thus p(y 1,y 2,...y n ) exp 1 2τ 2 w ij (y i y j ) 2 i j Intrinsic autoregressive (IAR) model! Chapter 3: Basics of Areal Data Models p. 15/18

CAR Model (cont d) Returning to W, let b ij = w ij /w i+ and τ 2 i = τ 2 /w i+, so p(y 1,y 2,...y n ) exp { 1 } 2τ 2y (D w W)y where D w is diagonal with (D w ) ii = w i+ and thus p(y 1,y 2,...y n ) exp 1 2τ 2 w ij (y i y j ) 2 i j Intrinsic autoregressive (IAR) model! Joint distribution is improper since (D w W)1 = 0, so requires a constraint say, i y i = 0 Chapter 3: Basics of Areal Data Models p. 15/18

CAR Model (cont d) Returning to W, let b ij = w ij /w i+ and τ 2 i = τ 2 /w i+, so p(y 1,y 2,...y n ) exp { 1 } 2τ 2y (D w W)y where D w is diagonal with (D w ) ii = w i+ and thus p(y 1,y 2,...y n ) exp 1 2τ 2 w ij (y i y j ) 2 i j Intrinsic autoregressive (IAR) model! Joint distribution is improper since (D w W)1 = 0, so requires a constraint say, i y i = 0 Not a data model a random effects model! Chapter 3: Basics of Areal Data Models p. 15/18

CAR Model Issues With τ 2 unknown, appropriate exponent for τ 2? Hodges et al.: n I, where I is the # of islands Chapter 3: Basics of Areal Data Models p. 16/18

CAR Model Issues With τ 2 unknown, appropriate exponent for τ 2? Hodges et al.: n I, where I is the # of islands Proper version: replace D w W by D w ρw, and choose ρ so that Σ y = (D w ρw) 1 exists! This in turn implies Y i Y j i N(ρ j w ijy j,τ 2 /m i ) Chapter 3: Basics of Areal Data Models p. 16/18

CAR Model Issues With τ 2 unknown, appropriate exponent for τ 2? Hodges et al.: n I, where I is the # of islands Proper version: replace D w W by D w ρw, and choose ρ so that Σ y = (D w ρw) 1 exists! This in turn implies Y i Y j i N(ρ j w ijy j,τ 2 /m i ) To ρ or not to ρ? (that is the question) Chapter 3: Basics of Areal Data Models p. 16/18

CAR Model Issues With τ 2 unknown, appropriate exponent for τ 2? Hodges et al.: n I, where I is the # of islands Proper version: replace D w W by D w ρw, and choose ρ so that Σ y = (D w ρw) 1 exists! This in turn implies Y i Y j i N(ρ j w ijy j,τ 2 /m i ) To ρ or not to ρ? (that is the question) Advantages: Chapter 3: Basics of Areal Data Models p. 16/18

CAR Model Issues With τ 2 unknown, appropriate exponent for τ 2? Hodges et al.: n I, where I is the # of islands Proper version: replace D w W by D w ρw, and choose ρ so that Σ y = (D w ρw) 1 exists! This in turn implies Y i Y j i N(ρ j w ijy j,τ 2 /m i ) To ρ or not to ρ? (that is the question) Advantages: makes distribution proper Chapter 3: Basics of Areal Data Models p. 16/18

CAR Model Issues With τ 2 unknown, appropriate exponent for τ 2? Hodges et al.: n I, where I is the # of islands Proper version: replace D w W by D w ρw, and choose ρ so that Σ y = (D w ρw) 1 exists! This in turn implies Y i Y j i N(ρ j w ijy j,τ 2 /m i ) To ρ or not to ρ? (that is the question) Advantages: makes distribution proper adds parametric flexibility Chapter 3: Basics of Areal Data Models p. 16/18

CAR Model Issues With τ 2 unknown, appropriate exponent for τ 2? Hodges et al.: n I, where I is the # of islands Proper version: replace D w W by D w ρw, and choose ρ so that Σ y = (D w ρw) 1 exists! This in turn implies Y i Y j i N(ρ j w ijy j,τ 2 /m i ) To ρ or not to ρ? (that is the question) Advantages: makes distribution proper adds parametric flexibility ρ = 0 interpretable as independence Chapter 3: Basics of Areal Data Models p. 16/18

CAR Models with ρ parameter Disadvantages: why should we expect Y i to be a proportion of average of neighbors a sensible spatial interpretation? Chapter 3: Basics of Areal Data Models p. 17/18

CAR Models with ρ parameter Disadvantages: why should we expect Y i to be a proportion of average of neighbors a sensible spatial interpretation? calibration of ρ as a correlation, e.g., ρ = 0.80 yields 0.1 Moran s I 0.15, ρ = 0.90 yields 0.2 Moran s I 0.25, ρ = 0.99 yields Moran s I 0.5 Chapter 3: Basics of Areal Data Models p. 17/18

CAR Models with ρ parameter Disadvantages: why should we expect Y i to be a proportion of average of neighbors a sensible spatial interpretation? calibration of ρ as a correlation, e.g., ρ = 0.80 yields 0.1 Moran s I 0.15, ρ = 0.90 yields 0.2 Moran s I 0.25, ρ = 0.99 yields Moran s I 0.5 So, used with random effects, scope of spatial pattern may be limited Chapter 3: Basics of Areal Data Models p. 17/18

Comments The CAR specifies Σ 1 Y, not Σ Y (as in the point-level modeling case), so does not directly model association Chapter 3: Basics of Areal Data Models p. 18/18

Comments The CAR specifies Σ 1 Y, not Σ Y (as in the point-level modeling case), so does not directly model association (Σ 1 Y ) ii = 1/τ 2 i ; (Σ 1 Y ) ij = 0 cond l independence Chapter 3: Basics of Areal Data Models p. 18/18

Comments The CAR specifies Σ 1 Y, not Σ Y (as in the point-level modeling case), so does not directly model association (Σ 1 Y ) ii = 1/τ 2 i ; (Σ 1 Y ) ij = 0 cond l independence Prediction at new sites is ad hoc, in that if p(y 0 y 1,y 2,...y n ) = N( j w 0j y j /w 0+,τ 2 /w 0+ ) then p(y 0,y 1,...y n ) well-defined but not CAR Chapter 3: Basics of Areal Data Models p. 18/18

Comments The CAR specifies Σ 1 Y, not Σ Y (as in the point-level modeling case), so does not directly model association (Σ 1 Y ) ii = 1/τ 2 i ; (Σ 1 Y ) ij = 0 cond l independence Prediction at new sites is ad hoc, in that if p(y 0 y 1,y 2,...y n ) = N( j w 0j y j /w 0+,τ 2 /w 0+ ) then p(y 0,y 1,...y n ) well-defined but not CAR Non-Gaussian case: For binary data, the autologistic: p(y i y j,j i) exp φ w ij I(y i = y j ) j Chapter 3: Basics of Areal Data Models p. 18/18