Introduction to Spatial Data and Models

Similar documents
Introduction to Spatial Data and Models

Introduction to Geostatistics

Introduction to Spatial Data and Models

Basics of Point-Referenced Data Models

Hierarchical Modelling for Univariate Spatial Data

CBMS Lecture 1. Alan E. Gelfand Duke University

Hierarchical Modelling for Univariate Spatial Data

Point-Referenced Data Models

Hierarchical Modeling for Spatio-temporal Data

Handbook of Spatial Statistics Chapter 2: Continuous Parameter Stochastic Process Theory by Gneiting and Guttorp

Models for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data

Introduction to Spatial Data and Models

Spatial Statistics with Image Analysis. Lecture L02. Computer exercise 0 Daily Temperature. Lecture 2. Johan Lindström.

Hierarchical Modeling for Multivariate Spatial Data

Statistícal Methods for Spatial Data Analysis

Hierarchical Modeling for Univariate Spatial Data

Hierarchical Modelling for Univariate and Multivariate Spatial Data

Hierarchical Modeling for Univariate Spatial Data

Introduction. Spatial Processes & Spatial Patterns

spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models

Hierarchical Modeling and Analysis for Spatial Data

Hierarchical Modelling for non-gaussian Spatial Data

A Framework for Daily Spatio-Temporal Stochastic Weather Simulation

Chapter 4 - Fundamentals of spatial processes Lecture notes

REML Estimation and Linear Mixed Models 4. Geostatistics and linear mixed models for spatial data

Hierarchical Modelling for Multivariate Spatial Data

Hierarchical Nearest-Neighbor Gaussian Process Models for Large Geo-statistical Datasets

Practicum : Spatial Regression

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Bayesian Linear Models

Nearest Neighbor Gaussian Processes for Large Spatial Data

Positive Definite Functions on Spheres

Non-gaussian spatiotemporal modeling

Hierarchical Modeling for non-gaussian Spatial Data

Gaussian predictive process models for large spatial data sets.

Overview of Spatial Statistics with Applications to fmri

What s for today. All about Variogram Nugget effect. Mikyoung Jun (Texas A&M) stat647 lecture 4 September 6, / 17

What s for today. Introduction to Space-time models. c Mikyoung Jun (Texas A&M) Stat647 Lecture 14 October 16, / 19

An Introduction to Spatial Statistics. Chunfeng Huang Department of Statistics, Indiana University

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

PRODUCING PROBABILITY MAPS TO ASSESS RISK OF EXCEEDING CRITICAL THRESHOLD VALUE OF SOIL EC USING GEOSTATISTICAL APPROACH

Fluvial Variography: Characterizing Spatial Dependence on Stream Networks. Dale Zimmerman University of Iowa (joint work with Jay Ver Hoef, NOAA)

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Principles of Bayesian Inference

Kriging Luc Anselin, All Rights Reserved

Karhunen-Loeve Expansion and Optimal Low-Rank Model for Spatial Processes

Introduction. Semivariogram Cloud

7 Geostatistics. Figure 7.1 Focus of geostatistics

Bayesian Linear Models

Improving Spatial Data Interoperability

Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields

Gaussian processes for spatial modelling in environmental health: parameterizing for flexibility vs. computational efficiency

Covariance function estimation in Gaussian process regression

ON THE USE OF NON-EUCLIDEAN ISOTROPY IN GEOSTATISTICS

Spatio-Temporal Geostatistical Models, with an Application in Fish Stock

Types of Spatial Data

Areal Unit Data Regular or Irregular Grids or Lattices Large Point-referenced Datasets

Chapter 4 - Fundamentals of spatial processes Lecture notes

Interpolation of Spatial Data

E(x i ) = µ i. 2 d. + sin 1 d θ 2. for d < θ 2 0 for d θ 2

Kriging models with Gaussian processes - covariance function estimation and impact of spatial sampling

Spatial and Environmental Statistics

Lecture 9: Introduction to Kriging

Principles of Bayesian Inference

Predictive spatio-temporal models for spatially sparse environmental data. Umeå University

ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS

Estimation of non-stationary spatial covariance structure

A new covariance function for spatio-temporal data analysis with application to atmospheric pollution and sensor networking

The ProbForecastGOP Package

Assessing the covariance function in geostatistics

Investigation of Monthly Pan Evaporation in Turkey with Geostatistical Technique

Spatial analysis is the quantitative study of phenomena that are located in space.

Spatial-Temporal Modeling of Active Layer Thickness

Chapter 3 - Temporal processes

Introduction to Bayes and non-bayes spatial statistics

Bayesian Transgaussian Kriging

Bayesian Linear Regression

Bayesian Thinking in Spatial Statistics

Gaussian Process Regression

Chapter 1. Summer School GEOSTAT 2014, Spatio-Temporal Geostatistics,

Part 5: Spatial-Temporal Modeling

Model Assessment and Comparisons

Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Mela. P.

Generating Spatial Correlated Binary Data Through a Copulas Method

Extreme Value Analysis and Spatial Extremes

1 Isotropic Covariance Functions

Cross-covariance Functions for Tangent Vector Fields on the Sphere

Overview of Statistical Analysis of Spatial Data

Influence of parameter estimation uncertainty in Kriging: Part 2 Test and case study applications

Derivatives of Spatial Variances of Growing Windows and the Variogram 1

Statistical Models for Monitoring and Regulating Ground-level Ozone. Abstract

On prediction and density estimation Peter McCullagh University of Chicago December 2004

Multivariate Gaussian Random Fields with SPDEs

Space-time data. Simple space-time analyses. PM10 in space. PM10 in time

Nonstationary cross-covariance models for multivariate processes on a globe

Paper Review: NONSTATIONARY COVARIANCE MODELS FOR GLOBAL DATA

Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models

Non-stationary Cross-Covariance Models for Multivariate Processes on a Globe

Discrete time processes

Transcription:

Introduction to Spatial Data and Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Department of Forestry & Department of Geography, Michigan State University, Lansing Michigan, U.S.A. 2 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. March 9, 2009 1

Researchers in diverse areas such as climatology, ecology, environmental health, and real estate marketing are increasingly faced with the task of analyzing data that are: 2 ENAR 2009 Hierarchical Modeling and Analysis

Researchers in diverse areas such as climatology, ecology, environmental health, and real estate marketing are increasingly faced with the task of analyzing data that are: highly multivariate, with many important predictors and response variables, 2 ENAR 2009 Hierarchical Modeling and Analysis

Researchers in diverse areas such as climatology, ecology, environmental health, and real estate marketing are increasingly faced with the task of analyzing data that are: highly multivariate, with many important predictors and response variables, geographically referenced, and often presented as maps, and 2 ENAR 2009 Hierarchical Modeling and Analysis

Researchers in diverse areas such as climatology, ecology, environmental health, and real estate marketing are increasingly faced with the task of analyzing data that are: highly multivariate, with many important predictors and response variables, geographically referenced, and often presented as maps, and temporally correlated, as in longitudinal or other time series structures. 2 ENAR 2009 Hierarchical Modeling and Analysis

Researchers in diverse areas such as climatology, ecology, environmental health, and real estate marketing are increasingly faced with the task of analyzing data that are: highly multivariate, with many important predictors and response variables, geographically referenced, and often presented as maps, and temporally correlated, as in longitudinal or other time series structures. motivates hierarchical modeling and data analysis for complex spatial (and spatiotemporal) data sets. 2 ENAR 2009 Hierarchical Modeling and Analysis

Type of spatial data point-referenced data, where Y (s) is a random vector at a location s R r, where s varies continuously over D, a fixed subset of R r that contains an r-dimensional rectangle of positive volume; 3 ENAR 2009 Hierarchical Modeling and Analysis

Type of spatial data point-referenced data, where Y (s) is a random vector at a location s R r, where s varies continuously over D, a fixed subset of R r that contains an r-dimensional rectangle of positive volume; areal data, where D is again a fixed subset (of regular or irregular shape), but now partitioned into a finite number of areal units with well-defined boundaries; 3 ENAR 2009 Hierarchical Modeling and Analysis

Type of spatial data point-referenced data, where Y (s) is a random vector at a location s R r, where s varies continuously over D, a fixed subset of R r that contains an r-dimensional rectangle of positive volume; areal data, where D is again a fixed subset (of regular or irregular shape), but now partitioned into a finite number of areal units with well-defined boundaries; point pattern data, where now D is itself random; its index set gives the locations of random events that are the spatial point pattern. Y (s) itself can simply equal 1 for all s D (indicating occurrence of the event), or possibly give some additional covariate information (producing a marked point pattern process). 3 ENAR 2009 Hierarchical Modeling and Analysis

Exploration of spatial data First step in analyzing data 4 ENAR 2009 Hierarchical Modeling and Analysis

Exploration of spatial data First step in analyzing data First Law of Geography: Mean + Error 4 ENAR 2009 Hierarchical Modeling and Analysis

Exploration of spatial data First step in analyzing data First Law of Geography: Mean + Error Mean: first-order behavior 4 ENAR 2009 Hierarchical Modeling and Analysis

Exploration of spatial data First step in analyzing data First Law of Geography: Mean + Error Mean: first-order behavior Error: second-order behavior (covariance function) 4 ENAR 2009 Hierarchical Modeling and Analysis

Exploration of spatial data First step in analyzing data First Law of Geography: Mean + Error Mean: first-order behavior Error: second-order behavior (covariance function) EDA tools examine both first and second order behavior 4 ENAR 2009 Hierarchical Modeling and Analysis

Exploration of spatial data First step in analyzing data First Law of Geography: Mean + Error Mean: first-order behavior Error: second-order behavior (covariance function) EDA tools examine both first and second order behavior Preliminary displays: Simple locations to surface displays 4 ENAR 2009 Hierarchical Modeling and Analysis

Exploration of spatial data First Law of Geography = + data mean error 5 ENAR 2009 Hierarchical Modeling and Analysis

Exploration of spatial data Scallops Sites 6 ENAR 2009 Hierarchical Modeling and Analysis

Deterministic surface interpolation Spatial surface observed at finite set of locations S = {s 1, s 2,..., s n } Tessellate the spatial domain (usually with data locations as vertices) Fit an interpolating polynomial: f(s) = i w i (S ; s)f(s i ) Interpolate by reading off f(s 0 ). Issues: Sensitivity to tessellations Choices of multivariate interpolators Numerical error analysis 7 ENAR 2009 Hierarchical Modeling and Analysis

Scallops data: image and contour plots Latitude 39.0 39.5 40.0 40.5 73.5 73.0 72.5 72.0 Longitude 8 ENAR 2009 Hierarchical Modeling and Analysis

Scallops data: image and contour plots Drop-line scatter plot 9 ENAR 2009 Hierarchical Modeling and Analysis

11 11.211.411.611.8 12 12.2 12.4 logsp Introduction to spatial data and models 38.04 Scallops data: image and contour plots Surface plot 38.02 38 Latitude 37.98 37.96-121.34-121.32 Longitude -121.3-121.28 10 ENAR 2009 Hierarchical Modeling and Analysis

Scallops data: image and contour plots Image contour plot Latitude 37.96 37.98 38.00 38.02 38.04 121.36 121.34 121.32 121.30 121.28 121.26 Longitude 11 ENAR 2009 Hierarchical Modeling and Analysis

Scallops data: image and contour plots Locations form patterns 128100 128150 128200 128250 128300 1031350 1031400 1031450 1031500 1031550 Eastings Northings 128100 128150 128200 128250 128300 1031350 1031400 1031450 1031500 1031550 12 ENAR 2009 Hierarchical Modeling and Analysis

0 2 4 6 8 10 12 14 Shrub Density Introduction to spatial data and models Scallops data: image and contour plots Surface features 1031550 1031500 1031450 Northings 1031400 1031350 128100 128150 128200 128250 128300 Eastings 13 ENAR 2009 Hierarchical Modeling and Analysis

Scallops data: image and contour plots Interesting plot arrangements N S UTM coordinates 4879050 4879100 4879150 4879200 456300 456320 456340 456360 456380 456400 456420 456440 E W UTM coordinates 14 ENAR 2009 Hierarchical Modeling and Analysis

Elements of point-level modelling Point-level modelling refers to modelling of spatial datacollected at locations referenced by coordinates (e.g., lat-long, Easting-Northing). 15 ENAR 2009 Hierarchical Modeling and Analysis

Elements of point-level modelling Point-level modelling refers to modelling of spatial datacollected at locations referenced by coordinates (e.g., lat-long, Easting-Northing). Fundamental concept: Data from a spatial process {Y (s) : s D}, where D is a fixed subset in Euclidean space. 15 ENAR 2009 Hierarchical Modeling and Analysis

Elements of point-level modelling Point-level modelling refers to modelling of spatial datacollected at locations referenced by coordinates (e.g., lat-long, Easting-Northing). Fundamental concept: Data from a spatial process {Y (s) : s D}, where D is a fixed subset in Euclidean space. Example: Y (s) is a pollutant level at site s 15 ENAR 2009 Hierarchical Modeling and Analysis

Elements of point-level modelling Point-level modelling refers to modelling of spatial datacollected at locations referenced by coordinates (e.g., lat-long, Easting-Northing). Fundamental concept: Data from a spatial process {Y (s) : s D}, where D is a fixed subset in Euclidean space. Example: Y (s) is a pollutant level at site s Conceptually: Pollutant level exists at all possible sites 15 ENAR 2009 Hierarchical Modeling and Analysis

Elements of point-level modelling Point-level modelling refers to modelling of spatial datacollected at locations referenced by coordinates (e.g., lat-long, Easting-Northing). Fundamental concept: Data from a spatial process {Y (s) : s D}, where D is a fixed subset in Euclidean space. Example: Y (s) is a pollutant level at site s Conceptually: Pollutant level exists at all possible sites Practically: Data will be a partial realization of a spatial process observed at {s 1,..., s n } 15 ENAR 2009 Hierarchical Modeling and Analysis

Elements of point-level modelling Point-level modelling refers to modelling of spatial datacollected at locations referenced by coordinates (e.g., lat-long, Easting-Northing). Fundamental concept: Data from a spatial process {Y (s) : s D}, where D is a fixed subset in Euclidean space. Example: Y (s) is a pollutant level at site s Conceptually: Pollutant level exists at all possible sites Practically: Data will be a partial realization of a spatial process observed at {s 1,..., s n } Statistical objectives: Inference about the process Y (s); predict at new locations. 15 ENAR 2009 Hierarchical Modeling and Analysis

Stationary Gaussian processes Suppose our spatial process has a mean, µ (s) = E (Y (s)), and that the variance of Y (s) exists for all s D. 16 ENAR 2009 Hierarchical Modeling and Analysis

Stationary Gaussian processes Suppose our spatial process has a mean, µ (s) = E (Y (s)), and that the variance of Y (s) exists for all s D. Strong stationarity: If for any given set of sites, and any displacement h, the distribution of (Y (s 1 ),..., Y (s n )) is the same as, (Y (s 1 + h),..., Y (s n + h)). Weak stationarity: Constant mean µ(s) = µ, and Cov(Y (s), Y (s + h)) = C(h): the covariance depends only upon the displacement (or separation) vector. 16 ENAR 2009 Hierarchical Modeling and Analysis

Stationary Gaussian processes Suppose our spatial process has a mean, µ (s) = E (Y (s)), and that the variance of Y (s) exists for all s D. Strong stationarity: If for any given set of sites, and any displacement h, the distribution of (Y (s 1 ),..., Y (s n )) is the same as, (Y (s 1 + h),..., Y (s n + h)). Weak stationarity: Constant mean µ(s) = µ, and Cov(Y (s), Y (s + h)) = C(h): the covariance depends only upon the displacement (or separation) vector. Strong stationarity implies weak stationarity 16 ENAR 2009 Hierarchical Modeling and Analysis

Stationary Gaussian processes Suppose our spatial process has a mean, µ (s) = E (Y (s)), and that the variance of Y (s) exists for all s D. Strong stationarity: If for any given set of sites, and any displacement h, the distribution of (Y (s 1 ),..., Y (s n )) is the same as, (Y (s 1 + h),..., Y (s n + h)). Weak stationarity: Constant mean µ(s) = µ, and Cov(Y (s), Y (s + h)) = C(h): the covariance depends only upon the displacement (or separation) vector. Strong stationarity implies weak stationarity The process isgaussian if Y = (Y (s 1 ),..., Y (s n )) has a multivariate normal distribution. 16 ENAR 2009 Hierarchical Modeling and Analysis

Stationary Gaussian processes Suppose our spatial process has a mean, µ (s) = E (Y (s)), and that the variance of Y (s) exists for all s D. Strong stationarity: If for any given set of sites, and any displacement h, the distribution of (Y (s 1 ),..., Y (s n )) is the same as, (Y (s 1 + h),..., Y (s n + h)). Weak stationarity: Constant mean µ(s) = µ, and Cov(Y (s), Y (s + h)) = C(h): the covariance depends only upon the displacement (or separation) vector. Strong stationarity implies weak stationarity The process isgaussian if Y = (Y (s 1 ),..., Y (s n )) has a multivariate normal distribution. For Gaussian processes, strong and weak stationarity are equivalent. 16 ENAR 2009 Hierarchical Modeling and Analysis

Stationary Gaussian processes Variograms Suppose we assume E[Y (s + h) Y (s)] = 0 and define E[Y (s + h) Y (s)] 2 = V ar (Y (s + h) Y (s)) = 2γ (h). This is sensible if the left hand side depends only upon h. Then we say the process is intrinsically stationary. 17 ENAR 2009 Hierarchical Modeling and Analysis

Stationary Gaussian processes Variograms Suppose we assume E[Y (s + h) Y (s)] = 0 and define E[Y (s + h) Y (s)] 2 = V ar (Y (s + h) Y (s)) = 2γ (h). This is sensible if the left hand side depends only upon h. Then we say the process is intrinsically stationary. γ(h) is called the semivariogram and 2γ(h) is called the variogram. 17 ENAR 2009 Hierarchical Modeling and Analysis

Stationary Gaussian processes Variograms Suppose we assume E[Y (s + h) Y (s)] = 0 and define E[Y (s + h) Y (s)] 2 = V ar (Y (s + h) Y (s)) = 2γ (h). This is sensible if the left hand side depends only upon h. Then we say the process is intrinsically stationary. γ(h) is called the semivariogram and 2γ(h) is called the variogram. Note that intrinsic stationarity defines only the first and second moments of the differences Y (s + h) Y (s). It says nothing about the joint distribution of a collection of variables Y (s 1 ),..., Y (s n ), and thus provides no likelihood. 17 ENAR 2009 Hierarchical Modeling and Analysis

Stationary Gaussian processes Intrinsic Stationarity and Ergodicity Relationship between γ(h) and C(h): 2γ(h) = V ar(y (s + h)) + V ar(y (s)) 2Cov(Y (s + h), Y (s)) = C(0) + C(0) 2C(h) = 2[C(0) C(h)]. 18 ENAR 2009 Hierarchical Modeling and Analysis

Stationary Gaussian processes Intrinsic Stationarity and Ergodicity Relationship between γ(h) and C(h): 2γ(h) = V ar(y (s + h)) + V ar(y (s)) 2Cov(Y (s + h), Y (s)) = C(0) + C(0) 2C(h) = 2[C(0) C(h)]. Easy to recover γ from C. The converse needs the additional assumption of ergodicity: lim u C(u) = 0. 18 ENAR 2009 Hierarchical Modeling and Analysis

Stationary Gaussian processes Intrinsic Stationarity and Ergodicity Relationship between γ(h) and C(h): 2γ(h) = V ar(y (s + h)) + V ar(y (s)) 2Cov(Y (s + h), Y (s)) = C(0) + C(0) 2C(h) = 2[C(0) C(h)]. Easy to recover γ from C. The converse needs the additional assumption of ergodicity: lim u C(u) = 0. So lim u γ(u) = C(0), and we can recover C from γ as long as this limit exists. C(h) = lim γ(u) γ(h). u 18 ENAR 2009 Hierarchical Modeling and Analysis

Isotropy When γ(h) or C(h) depends upon the separation vector only through the distance h, we say that the process is isotropic. In that case, we write γ( h ) or C( h ). Otherwise we say that the process is anisotropic. 19 ENAR 2009 Hierarchical Modeling and Analysis

Isotropy When γ(h) or C(h) depends upon the separation vector only through the distance h, we say that the process is isotropic. In that case, we write γ( h ) or C( h ). Otherwise we say that the process is anisotropic. If the process is intrinsically stationary and isotropic, it is also called homogeneous. 19 ENAR 2009 Hierarchical Modeling and Analysis

Isotropy When γ(h) or C(h) depends upon the separation vector only through the distance h, we say that the process is isotropic. In that case, we write γ( h ) or C( h ). Otherwise we say that the process is anisotropic. If the process is intrinsically stationary and isotropic, it is also called homogeneous. Isotropic processes are popular because of their simplicity, interpretability, and because a number of relatively simple parametric forms are available as candidates for C (and γ). Denoting h by t for notational simplicity, the next two tables provide a few examples... 19 ENAR 2009 Hierarchical Modeling and Analysis

Isotropy Some common isotropic variograms model Variogram, { γ(t) τ Linear γ(t) = 2 + σ 2 t if t > 0 0 otherwise τ 2 + σ 2 if t 1/φ Spherical γ(t) = τ 2 + σ 2 [ 3 2 φt 1 2 (φt)3] if 0 < t 1/φ { 0 otherwise τ Exponential γ(t) = 2 + σ 2 (1 exp( φt)) if t > 0 { 0 otherwise Powered τ γ(t) = 2 + σ 2 (1 exp( φt p )) if t > 0 exponential { 0 otherwise Matérn τ γ(t) = 2 + σ 2 [ 1 (1 + φt) e φt] if t > 0 at ν = 3/2 0 o/w 20 ENAR 2009 Hierarchical Modeling and Analysis

Isotropy γ(t) = Examples: Spherical Variogram τ 2 + σ 2 if t 1/φ τ 2 + σ 2 [ 3 2 φt 1 2 (φt)3] if 0 < t 1/φ 0 if t = 0. 21 ENAR 2009 Hierarchical Modeling and Analysis

Isotropy γ(t) = Examples: Spherical Variogram τ 2 + σ 2 if t 1/φ τ 2 + σ 2 [ 3 2 φt 1 2 (φt)3] if 0 < t 1/φ 0 if t = 0. While γ(0) = 0 by definition, γ(0 + ) lim t 0 + γ(t) = τ 2 ; this quantity is the nugget. 21 ENAR 2009 Hierarchical Modeling and Analysis

Isotropy γ(t) = Examples: Spherical Variogram τ 2 + σ 2 if t 1/φ τ 2 + σ 2 [ 3 2 φt 1 2 (φt)3] if 0 < t 1/φ 0 if t = 0. While γ(0) = 0 by definition, γ(0 + ) lim t 0 + γ(t) = τ 2 ; this quantity is the nugget. lim t γ(t) = τ 2 + σ 2 ; this asymptotic value of the semivariogram is called the sill. (The sill minus the nugget, σ 2 in this case, is called the partial sill.) 21 ENAR 2009 Hierarchical Modeling and Analysis

Isotropy γ(t) = Examples: Spherical Variogram τ 2 + σ 2 if t 1/φ τ 2 + σ 2 [ 3 2 φt 1 2 (φt)3] if 0 < t 1/φ 0 if t = 0. While γ(0) = 0 by definition, γ(0 + ) lim t 0 + γ(t) = τ 2 ; this quantity is the nugget. lim t γ(t) = τ 2 + σ 2 ; this asymptotic value of the semivariogram is called the sill. (The sill minus the nugget, σ 2 in this case, is called the partial sill.) Finally, the value t = 1/φ at which γ(t) first reaches its ultimate level (the sill) is called the range, R 1/φ. 21 ENAR 2009 Hierarchical Modeling and Analysis

Isotropy Examples: Spherical Variogram 0.0 0.2 0.4 0.6 0.8 1.0 1.2 0.0 0.5 1.0 1.5 2.0 b) spherical; a0 = 0.2, a1 = 1, R = 1 22 ENAR 2009 Hierarchical Modeling and Analysis

Isotropy Some common isotropic covariograms Model Covariance function, C(t) Linear C(t) does not exist 0 if t 1/φ Spherical C(t) = σ 2 [ 1 3 2 φt + 1 2 (φt)3] if 0 < t 1/φ { τ 2 + σ 2 otherwise σ Exponential C(t) = 2 exp( φt) if t > 0 { τ 2 + σ 2 otherwise Powered σ C(t) = 2 exp( φt p ) if t > 0 exponential { τ 2 + σ 2 otherwise Matérn σ C(t) = 2 (1 + φt) exp( φt) if t > 0 at ν = 3/2 τ 2 + σ 2 otherwise 23 ENAR 2009 Hierarchical Modeling and Analysis

Isotropy Notes on exponential model C(t) = { τ 2 + σ 2 if t = 0 σ 2 exp( φt) if t > 0. 24 ENAR 2009 Hierarchical Modeling and Analysis

Isotropy Notes on exponential model C(t) = { τ 2 + σ 2 if t = 0 σ 2 exp( φt) if t > 0. We define the effective range, t 0, as the distance at which this correlation has dropped to only 0.05. Setting exp( φt 0 ) equal to this value we obtain t 0 3/φ, since log(0.05) 3. 24 ENAR 2009 Hierarchical Modeling and Analysis

Isotropy Notes on exponential model C(t) = { τ 2 + σ 2 if t = 0 σ 2 exp( φt) if t > 0. We define the effective range, t 0, as the distance at which this correlation has dropped to only 0.05. Setting exp( φt 0 ) equal to this value we obtain t 0 3/φ, since log(0.05) 3. Finally, the form of C(t) shows why the nugget τ 2 is often viewed as a nonspatial effect variance, and the partial sill (σ 2 ) is viewed as a spatial effect variance. 24 ENAR 2009 Hierarchical Modeling and Analysis

Isotropy The Matèrn Correlation Function Much of statistical modelling is carried out through correlation functions rather than variograms 25 ENAR 2009 Hierarchical Modeling and Analysis

Isotropy The Matèrn Correlation Function Much of statistical modelling is carried out through correlation functions rather than variograms The Matèrn is a very versatile family: { σ 2 C(t) = 2 ν 1 Γ(ν) (2 νtφ) ν K ν (2 (ν)tφ) if t > 0 τ 2 + σ 2 if t = 0 K ν is the modified Bessel function of order ν (computationally tractable) 25 ENAR 2009 Hierarchical Modeling and Analysis

Isotropy The Matèrn Correlation Function Much of statistical modelling is carried out through correlation functions rather than variograms The Matèrn is a very versatile family: { σ 2 C(t) = 2 ν 1 Γ(ν) (2 νtφ) ν K ν (2 (ν)tφ) if t > 0 τ 2 + σ 2 if t = 0 K ν is the modified Bessel function of order ν (computationally tractable) ν is a smoothness parameter (a fractal) controlling process smoothness 25 ENAR 2009 Hierarchical Modeling and Analysis

Variogram model fitting How do we select a variogram? Can the data really distinguish between variograms? 26 ENAR 2009 Hierarchical Modeling and Analysis

Variogram model fitting How do we select a variogram? Can the data really distinguish between variograms? Empirical Variogram: γ(t) = 1 2 N(t) s i,s j N(t) (Y (s i ) Y (s j )) 2 where N(t) is the number of points such that s i s j = t and N(t) is the number of points in N(t). 26 ENAR 2009 Hierarchical Modeling and Analysis

Variogram model fitting How do we select a variogram? Can the data really distinguish between variograms? Empirical Variogram: γ(t) = 1 2 N(t) s i,s j N(t) (Y (s i ) Y (s j )) 2 where N(t) is the number of points such that s i s j = t and N(t) is the number of points in N(t). Grid up the t space into intervals I 1 = (0, t 1 ), I 2 = (t 1, t 2 ), and so forth, up to I K = (t K 1, t K ). Representing t values in each interval by its midpoint, we define: N(t k ) = {(s i, s j ) : s i s j I k }, k = 1,..., K. 26 ENAR 2009 Hierarchical Modeling and Analysis

Variogram model fitting Empirical variogram: scallops data Gamma(d) 0 1 2 3 4 5 6 0.0 0.5 1.0 1.5 2.0 distance 27 ENAR 2009 Hierarchical Modeling and Analysis

Variogram model fitting Empirical variogram: scallops data Parametric Semivariograms gamma(d) 0 2 4 6 8 Exponential Gaussian Cauchy Spherical Bessel-J0 0.0 0.5 1.0 1.5 2.0 2.5 3.0 distance Bessel Mixtures - Random Weights gamma(d) 0 2 4 6 8 Two Three Four Five 0.0 0.5 1.0 1.5 2.0 2.5 3.0 distance Bessel Mixtures - Random Phi s gamma(d) 0 2 4 6 8 Two Three Four Five 0.0 0.5 1.0 1.5 2.0 2.5 3.0 distance 28 ENAR 2009 Hierarchical Modeling and Analysis