Basics of Point-Referenced Data Models

Size: px
Start display at page:

Download "Basics of Point-Referenced Data Models"

Transcription

1 Basics of Point-Referenced Data Models Basic tool is a spatial process, {Y (s), s D}, where D R r Chapter 2: Basics of Point-Referenced Data Models p. 1/45

2 Basics of Point-Referenced Data Models Basic tool is a spatial process, {Y (s), s D}, where D R r Note that time series follows this approach with r = 1; we will usually have r = 2 or 3 Chapter 2: Basics of Point-Referenced Data Models p. 1/45

3 Basics of Point-Referenced Data Models Basic tool is a spatial process, {Y (s), s D}, where D R r Note that time series follows this approach with r = 1; we will usually have r = 2 or 3 We begin with essentials of point-level data modeling, including stationarity, isotropy, and variograms key elements of the "Matheron school" Chapter 2: Basics of Point-Referenced Data Models p. 1/45

4 Basics of Point-Referenced Data Models Basic tool is a spatial process, {Y (s), s D}, where D R r Note that time series follows this approach with r = 1; we will usually have r = 2 or 3 We begin with essentials of point-level data modeling, including stationarity, isotropy, and variograms key elements of the "Matheron school" Then we add the spatial (typically Gaussian) process modeling that enables likelihood (and Bayesian) inference in these settings. Chapter 2: Basics of Point-Referenced Data Models p. 1/45

5 Scallops catch sites, NY/NJ coast, USA Chapter 2: Basics of Point-Referenced Data Models p. 2/45

6 Scallops sites with log-catch contours Chapter 2: Basics of Point-Referenced Data Models p. 3/45

7 Image plot with log-catch contours Latitude Longitude Chapter 2: Basics of Point-Referenced Data Models p. 4/45

8 Stationarity Suppose our spatial process has a mean, µ (s) = E (Y (s)), and that the variance of Y (s) exists for all s D. The process is said to be strictly stationary (also called strongly stationary) if, for any given n 1, any set of n sites {s 1,...,s n } and any h R r, the distribution of (Y (s 1 ),...,Y (s n )) is the same as that of (Y (s 1 + h),...,y (s n + h)). Chapter 2: Basics of Point-Referenced Data Models p. 5/45

9 Stationarity Suppose our spatial process has a mean, µ (s) = E (Y (s)), and that the variance of Y (s) exists for all s D. The process is said to be strictly stationary (also called strongly stationary) if, for any given n 1, any set of n sites {s 1,...,s n } and any h R r, the distribution of (Y (s 1 ),...,Y (s n )) is the same as that of (Y (s 1 + h),...,y (s n + h)). A less restrictive condition is given by weak stationarity (also called second-order stationarity): A process is weakly stationary if µ (s) µ and Cov (Y (s),y (s + h)) = C (h) for all h R r such that s and s + h both lie within D. Chapter 2: Basics of Point-Referenced Data Models p. 5/45

10 Notes on Stationarity Weak stationarity implies that the covariance relationship between the values of the process at any two locations can be summarized by a covariance function C (h) (sometimes called a covariogram), and this function depends only on the separation vector h. Chapter 2: Basics of Point-Referenced Data Models p. 6/45

11 Notes on Stationarity Weak stationarity implies that the covariance relationship between the values of the process at any two locations can be summarized by a covariance function C (h) (sometimes called a covariogram), and this function depends only on the separation vector h. Note that with all variances assumed to exist, strong stationarity implies weak stationarity. Chapter 2: Basics of Point-Referenced Data Models p. 6/45

12 Notes on Stationarity Weak stationarity implies that the covariance relationship between the values of the process at any two locations can be summarized by a covariance function C (h) (sometimes called a covariogram), and this function depends only on the separation vector h. Note that with all variances assumed to exist, strong stationarity implies weak stationarity. The converse is not true in general, but it does hold for Gaussian processes Chapter 2: Basics of Point-Referenced Data Models p. 6/45

13 Variograms Suppose we assume E[Y (s + h) Y (s)] = 0 and define E[Y (s + h) Y (s)] 2 = V ar (Y (s + h) Y (s)) = 2γ (h). This expression makes sense when the left-hand side depends only on h (so that the right-hand side can be written at all), and not the particular choice of s. If this is the case, we say the process is intrinsically stationary. Chapter 2: Basics of Point-Referenced Data Models p. 7/45

14 Variograms Suppose we assume E[Y (s + h) Y (s)] = 0 and define E[Y (s + h) Y (s)] 2 = V ar (Y (s + h) Y (s)) = 2γ (h). This expression makes sense when the left-hand side depends only on h (so that the right-hand side can be written at all), and not the particular choice of s. If this is the case, we say the process is intrinsically stationary. The function 2γ (h) is then called the variogram, and γ (h) is called the semivariogram. Chapter 2: Basics of Point-Referenced Data Models p. 7/45

15 Variograms Suppose we assume E[Y (s + h) Y (s)] = 0 and define E[Y (s + h) Y (s)] 2 = V ar (Y (s + h) Y (s)) = 2γ (h). This expression makes sense when the left-hand side depends only on h (so that the right-hand side can be written at all), and not the particular choice of s. If this is the case, we say the process is intrinsically stationary. The function 2γ (h) is then called the variogram, and γ (h) is called the semivariogram. Note that intrinsic stationarity requires only the first and second moments of the differences Y (s + h) Y (s). It says nothing about the joint distribution of a collection of variables Y (s 1 ),...,Y (s n ), and thus provides no likelihood. Chapter 2: Basics of Point-Referenced Data Models p. 7/45

16 Relationship between C(h) and γ(h) 2γ(h) = V ar (Y (s + h) Y (s)) = V ar(y (s + h)) + V ar(y (s)) 2Cov(Y (s + h),y (s)) = C(0) + C(0) 2C (h) = 2 [C (0) C (h)]. Chapter 2: Basics of Point-Referenced Data Models p. 8/45

17 Relationship between C(h) and γ(h) 2γ(h) = V ar (Y (s + h) Y (s)) = V ar(y (s + h)) + V ar(y (s)) 2Cov(Y (s + h),y (s)) = C(0) + C(0) 2C (h) = 2 [C (0) C (h)]. Thus, γ (h) = C (0) C (h). Chapter 2: Basics of Point-Referenced Data Models p. 8/45

18 Relationship between C(h) and γ(h) 2γ(h) = V ar (Y (s + h) Y (s)) = V ar(y (s + h)) + V ar(y (s)) 2Cov(Y (s + h),y (s)) = C(0) + C(0) 2C (h) = 2 [C (0) C (h)]. Thus, γ (h) = C (0) C (h). So given C, we are able to determine γ. But what about the converse: in general, can we recover C from γ?... Chapter 2: Basics of Point-Referenced Data Models p. 8/45

19 Relationship between C(h) and γ(h) In γ (h) = C (0) C (h), we can add or subtract a constant on the right side, so C (h) is not identified. Chapter 2: Basics of Point-Referenced Data Models p. 9/45

20 Relationship between C(h) and γ(h) In γ (h) = C (0) C (h), we can add or subtract a constant on the right side, so C (h) is not identified. If the spatial process is ergodic, then C (h) 0 as h, where h denotes the length of the h vector. Chapter 2: Basics of Point-Referenced Data Models p. 9/45

21 Relationship between C(h) and γ(h) In γ (h) = C (0) C (h), we can add or subtract a constant on the right side, so C (h) is not identified. If the spatial process is ergodic, then C (h) 0 as h, where h denotes the length of the h vector. Taking the limit of both sides of γ (h) = C (0) C (h) as h, we then have that lim h γ (h) = C (0). Chapter 2: Basics of Point-Referenced Data Models p. 9/45

22 Relationship between C(h) and γ(h) In γ (h) = C (0) C (h), we can add or subtract a constant on the right side, so C (h) is not identified. If the spatial process is ergodic, then C (h) 0 as h, where h denotes the length of the h vector. Taking the limit of both sides of γ (h) = C (0) C (h) as h, we then have that lim h γ (h) = C (0). Thus, using the dummy variable u to avoid confusion, C (h) = C(0) γ(h) = lim u γ (u) γ (h). So if lim h γ (h) exists, C (h) is well defined. Chapter 2: Basics of Point-Referenced Data Models p. 9/45

23 Relationship between C(h) and γ(h) In γ (h) = C (0) C (h), we can add or subtract a constant on the right side, so C (h) is not identified. If the spatial process is ergodic, then C (h) 0 as h, where h denotes the length of the h vector. Taking the limit of both sides of γ (h) = C (0) C (h) as h, we then have that lim h γ (h) = C (0). Thus, using the dummy variable u to avoid confusion, C (h) = C(0) γ(h) = lim u γ (u) γ (h). So if lim h γ (h) exists, C (h) is well defined. And then, intrinsic stationarity implies weak (second-order) stationarity. Thus weak stationarity implies intrinsic stationarity, but the converse is not true in general. Chapter 2: Basics of Point-Referenced Data Models p. 9/45

24 Isotropy If the semivariogram γ (h) depends upon the separation vector only through its length h, then we say that the process is isotropic; if not, we say it is anisotropic. Chapter 2: Basics of Point-Referenced Data Models p. 10/45

25 Isotropy If the semivariogram γ (h) depends upon the separation vector only through its length h, then we say that the process is isotropic; if not, we say it is anisotropic. For an isotropic process, γ (h) is a real-valued function of a univariate argument, and can be written as γ ( h ). Chapter 2: Basics of Point-Referenced Data Models p. 10/45

26 Isotropy If the semivariogram γ (h) depends upon the separation vector only through its length h, then we say that the process is isotropic; if not, we say it is anisotropic. For an isotropic process, γ (h) is a real-valued function of a univariate argument, and can be written as γ ( h ). If the process is intrinsically stationary and isotropic, it is also called homogeneous. Chapter 2: Basics of Point-Referenced Data Models p. 10/45

27 Isotropy If the semivariogram γ (h) depends upon the separation vector only through its length h, then we say that the process is isotropic; if not, we say it is anisotropic. For an isotropic process, γ (h) is a real-valued function of a univariate argument, and can be written as γ ( h ). If the process is intrinsically stationary and isotropic, it is also called homogeneous. Isotropic processes are popular because of their simplicity, interpretability, and because a number of relatively simple parametric forms are available as candidates for C (and γ). Denoting h by t for notational simplicity, the next two tables provide a few examples... Chapter 2: Basics of Point-Referenced Data Models p. 10/45

28 Some common isotropic covariograms Model Covariance function, C(t) Linear C(t) does not exist 0 if t 1/φ Spherical C(t) = σ 2 [ 1 2 3φt + 2 1(φt)3] if 0 < t 1/φ { τ 2 + σ 2 otherwise σ 2 exp( φt) if t > 0 Exponential C(t) = τ 2 + σ 2 otherwise { Powered σ 2 exp( φt p ) if t > 0 C(t) = exponential τ 2 + σ 2 otherwise Matérn at ν = 3/2 C(t) = { σ 2 (1 + φt) exp( φt) if t > 0 τ 2 + σ 2 otherwise Chapter 2: Basics of Point-Referenced Data Models p. 11/45

29 Some common isotropic variograms model Linear γ(t) = Spherical γ(t) = Exponential γ(t) = Powered exponential Matérn at ν = 3/2 Variogram, { γ(t) γ(t) = γ(t) = { { { τ 2 + σ 2 t if t > 0 0 otherwise τ 2 + σ 2 if t 1/φ τ 2 + σ 2 [ 3 2 φt 1 2 (φt)3] if 0 < t 1/φ 0 otherwise τ 2 + σ 2 (1 exp( φt)) if t > 0 0 otherwise τ 2 + σ 2 (1 exp( φt p )) if t > 0 0 otherwise τ 2 + σ 2 [ 1 (1 + φt)e φt] if t > 0 0 o/w Chapter 2: Basics of Point-Referenced Data Models p. 12/45

30 Example: Spherical semivariogram γ(t) = τ 2 + σ 2 if t 1/φ τ 2 + σ 2 [ 3 2 φt 1 2 (φt)3] if 0 < t 1/φ 0 otherwise While γ(0) = 0 by definition, γ(0 + ) lim t 0 + γ(t) = τ 2 ; this quantity is called the nugget. Chapter 2: Basics of Point-Referenced Data Models p. 13/45

31 Example: Spherical semivariogram γ(t) = τ 2 + σ 2 if t 1/φ τ 2 + σ 2 [ 3 2 φt 1 2 (φt)3] if 0 < t 1/φ 0 otherwise While γ(0) = 0 by definition, γ(0 + ) lim t 0 + γ(t) = τ 2 ; this quantity is called the nugget. lim t γ(t) = τ 2 + σ 2 ; this asymptotic value of the semivariogram is called the sill. (The sill minus the nugget, σ 2 in this case, is called the partial sill.) Chapter 2: Basics of Point-Referenced Data Models p. 13/45

32 Example: Spherical semivariogram γ(t) = τ 2 + σ 2 if t 1/φ τ 2 + σ 2 [ 3 2 φt 1 2 (φt)3] if 0 < t 1/φ 0 otherwise While γ(0) = 0 by definition, γ(0 + ) lim t 0 + γ(t) = τ 2 ; this quantity is called the nugget. lim t γ(t) = τ 2 + σ 2 ; this asymptotic value of the semivariogram is called the sill. (The sill minus the nugget, σ 2 in this case, is called the partial sill.) Finally, the value t = 1/φ at which γ(t) first reaches its ultimate level (the sill) is called the range, R 1/φ. (Both R and φ are sometimes referred to as the "range," but φ is more naturally called the decay parameter.) Chapter 2: Basics of Point-Referenced Data Models p. 13/45

33 3 common semivariogram models Linear; tau2 = 0.2, sig2 = Spherical; tau2 = 0.2, sig2 = 1, phi = Expo; tau2 = 0.2, sig2 = 1, phi = 2 Note that for the linear model (left panel), γ (t) as t, and so this semivariogram does not correspond to a weakly stationary process (although it is intrinsically stationary). Chapter 2: Basics of Point-Referenced Data Models p. 14/45

34 3 common semivariogram models Linear; tau2 = 0.2, sig2 = Spherical; tau2 = 0.2, sig2 = 1, phi = Expo; tau2 = 0.2, sig2 = 1, phi = 2 Note that for the linear model (left panel), γ (t) as t, and so this semivariogram does not correspond to a weakly stationary process (although it is intrinsically stationary). The nugget is τ 2, but the sill and range are both infinite. Chapter 2: Basics of Point-Referenced Data Models p. 14/45

35 Notes on the exponential model The sill is only reached asymptotically, meaning that strictly speaking, the range R = 1/φ is infinite. Chapter 2: Basics of Point-Referenced Data Models p. 15/45

36 Notes on the exponential model The sill is only reached asymptotically, meaning that strictly speaking, the range R = 1/φ is infinite. To define the notion of an "effective range", for t > 0, C(t) = lim u γ(u) γ(t) = τ 2 + σ 2 [ τ 2 + σ 2 (1 exp( φt)) ] = σ 2 exp( φt) However, with γ (h) = C (0) C (h), since γ(0 + ) = τ 2, by convention we (and other textbook authors) set C(0) = C(0 + ) = τ 2 + σ 2. Chapter 2: Basics of Point-Referenced Data Models p. 15/45

37 Notes on the exponential model Thus we have C(t) = { τ 2 + σ 2 if t = 0 σ 2 exp( φt) if t > 0. Then the correlation between two points distance t apart is exp( φt); note that exp( φt) = 1 for t = 0 + and exp( φt) = 0 for t =, as expected. Chapter 2: Basics of Point-Referenced Data Models p. 16/45

38 Notes on the exponential model Thus we have C(t) = { τ 2 + σ 2 if t = 0 σ 2 exp( φt) if t > 0. Then the correlation between two points distance t apart is exp( φt); note that exp( φt) = 1 for t = 0 + and exp( φt) = 0 for t =, as expected. We define the effective range, t 0, as the distance at which this correlation has dropped to only Setting exp( φt 0 ) equal to this value we obtain t 0 3/φ, since log(0.05) 3. Chapter 2: Basics of Point-Referenced Data Models p. 16/45

39 Notes on the exponential model Thus we have C(t) = { τ 2 + σ 2 if t = 0 σ 2 exp( φt) if t > 0. Then the correlation between two points distance t apart is exp( φt); note that exp( φt) = 1 for t = 0 + and exp( φt) = 0 for t =, as expected. We define the effective range, t 0, as the distance at which this correlation has dropped to only Setting exp( φt 0 ) equal to this value we obtain t 0 3/φ, since log(0.05) 3. Finally, the form of C(t) shows why the nugget τ 2 is often viewed as a nonspatial effect variance, and the partial sill (σ 2 ) is viewed as a spatial effect variance. Chapter 2: Basics of Point-Referenced Data Models p. 16/45

40 The Matèrn Correlation Function The Matèrn is a very versatile family: { σ 2 C(t) = 2 ν 1 Γ(ν) (2 νtφ) ν K ν (2 (ν)tφ) if t > 0 τ 2 + σ 2 if t = 0 K ν is the modified Bessel function of order ν (computationally tractable in C/C++ or geor) Chapter 2: Basics of Point-Referenced Data Models p. 17/45

41 The Matèrn Correlation Function The Matèrn is a very versatile family: { σ 2 C(t) = 2 ν 1 Γ(ν) (2 νtφ) ν K ν (2 (ν)tφ) if t > 0 τ 2 + σ 2 if t = 0 K ν is the modified Bessel function of order ν (computationally tractable in C/C++ or geor) ν is a smoothness parameter: Chapter 2: Basics of Point-Referenced Data Models p. 17/45

42 The Matèrn Correlation Function The Matèrn is a very versatile family: { σ 2 C(t) = 2 ν 1 Γ(ν) (2 νtφ) ν K ν (2 (ν)tφ) if t > 0 τ 2 + σ 2 if t = 0 K ν is the modified Bessel function of order ν (computationally tractable in C/C++ or geor) ν is a smoothness parameter: ν = 1/2 exponential; ν Gaussian; ν = 3/2 convenient closed form for C(t),γ(t) Chapter 2: Basics of Point-Referenced Data Models p. 17/45

43 The Matèrn Correlation Function The Matèrn is a very versatile family: { σ 2 C(t) = 2 ν 1 Γ(ν) (2 νtφ) ν K ν (2 (ν)tφ) if t > 0 τ 2 + σ 2 if t = 0 K ν is the modified Bessel function of order ν (computationally tractable in C/C++ or geor) ν is a smoothness parameter: ν = 1/2 exponential; ν Gaussian; ν = 3/2 convenient closed form for C(t),γ(t) in two-dimensions, the greatest integer in ν indicates the number of times process realizations will be mean-square differentiable. Chapter 2: Basics of Point-Referenced Data Models p. 17/45

44 Some covariance function theory To be a valid covariance function the function must be positive definite Chapter 2: Basics of Point-Referenced Data Models p. 18/45

45 Some covariance function theory To be a valid covariance function the function must be positive definite Whether a function is positive definite or not can depend upon dimension Chapter 2: Basics of Point-Referenced Data Models p. 18/45

46 Some covariance function theory To be a valid covariance function the function must be positive definite Whether a function is positive definite or not can depend upon dimension Bochner s Theorem: C is a valid covariance function if and only if it is the characteristic function of a random variable that is symmetric about 0, i.e., C(h) = cos(w T h)g(dw). Chapter 2: Basics of Point-Referenced Data Models p. 18/45

47 Some covariance function theory To be a valid covariance function the function must be positive definite Whether a function is positive definite or not can depend upon dimension Bochner s Theorem: C is a valid covariance function if and only if it is the characteristic function of a random variable that is symmetric about 0, i.e., C(h) = cos(w T h)g(dw). Fourier transform, spectral distribution, spectral density Chapter 2: Basics of Point-Referenced Data Models p. 18/45

48 Some covariance function theory To be a valid covariance function the function must be positive definite Whether a function is positive definite or not can depend upon dimension Bochner s Theorem: C is a valid covariance function if and only if it is the characteristic function of a random variable that is symmetric about 0, i.e., C(h) = cos(w T h)g(dw). Fourier transform, spectral distribution, spectral density inversion formula could be used to check if C(h) is valid Chapter 2: Basics of Point-Referenced Data Models p. 18/45

49 Constructing valid covariance functions One can construct valid covariance function by using properties of characteristic functions: multiplying valid covariance functions (corresponds to summing independent random variables) Chapter 2: Basics of Point-Referenced Data Models p. 19/45

50 Constructing valid covariance functions One can construct valid covariance function by using properties of characteristic functions: multiplying valid covariance functions (corresponds to summing independent random variables) mixing covariance functions (corresponds to mixing distributions) Chapter 2: Basics of Point-Referenced Data Models p. 19/45

51 Constructing valid covariance functions One can construct valid covariance function by using properties of characteristic functions: multiplying valid covariance functions (corresponds to summing independent random variables) mixing covariance functions (corresponds to mixing distributions) convolving covariance functions (if C 1 and C 2 are valid then C 12 (s) = C 1 (s u)c 2 (u)du is valid). Chapter 2: Basics of Point-Referenced Data Models p. 19/45

52 Variogram model fitting How does one choose a good parametric variogram model? Historically, one plots the empirical semivariogram, γ(t) = 1 2N(t) (s i,s j ) N(t) [Y (s i ) Y (s j )] 2, where N(t) is the set of pairs such that s i s j = t, and N(t) is the number of pairs in this set. Chapter 2: Basics of Point-Referenced Data Models p. 20/45

53 Variogram model fitting How does one choose a good parametric variogram model? Historically, one plots the empirical semivariogram, γ(t) = 1 2N(t) (s i,s j ) N(t) [Y (s i ) Y (s j )] 2, where N(t) is the set of pairs such that s i s j = t, and N(t) is the number of pairs in this set. Usually need to grid up the t-space into intervals I 1 = (0,t 1 ),...,I K = (t K 1,t K ) for 0 < t 1 < < t K. Represent each interval by its midpoint, and redefine N(t k ) = {(s i, s j ) : s i s j I k }, k = 1,...,K. Chapter 2: Basics of Point-Referenced Data Models p. 20/45

54 Empirical variogram: scallops data Gamma(d) distance Chapter 2: Basics of Point-Referenced Data Models p. 21/45

55 Variogram model fitting (cont d) This method of moments estimate (analogue of the usual sample variance s 2 ) may not be so great: Chapter 2: Basics of Point-Referenced Data Models p. 22/45

56 Variogram model fitting (cont d) This method of moments estimate (analogue of the usual sample variance s 2 ) may not be so great: It will be sensitive to outliers Chapter 2: Basics of Point-Referenced Data Models p. 22/45

57 Variogram model fitting (cont d) This method of moments estimate (analogue of the usual sample variance s 2 ) may not be so great: It will be sensitive to outliers A sample average of squared differences can be badly behaved. Chapter 2: Basics of Point-Referenced Data Models p. 22/45

58 Variogram model fitting (cont d) This method of moments estimate (analogue of the usual sample variance s 2 ) may not be so great: It will be sensitive to outliers A sample average of squared differences can be badly behaved. It uses data differences, rather than the data itself. Chapter 2: Basics of Point-Referenced Data Models p. 22/45

59 Variogram model fitting (cont d) This method of moments estimate (analogue of the usual sample variance s 2 ) may not be so great: It will be sensitive to outliers A sample average of squared differences can be badly behaved. It uses data differences, rather than the data itself. The components of the sum will be dependent within and across bins, and N(t k ) will vary across bins. Chapter 2: Basics of Point-Referenced Data Models p. 22/45

60 Variogram model fitting (cont d) This method of moments estimate (analogue of the usual sample variance s 2 ) may not be so great: It will be sensitive to outliers A sample average of squared differences can be badly behaved. It uses data differences, rather than the data itself. The components of the sum will be dependent within and across bins, and N(t k ) will vary across bins. Informally, one plots γ(t), and then an appropriately shaped theoretical variogram is fit by eye or by trial and error to choose the nugget, sill, and range. Chapter 2: Basics of Point-Referenced Data Models p. 22/45

61 Variogram model fitting (cont d) This method of moments estimate (analogue of the usual sample variance s 2 ) may not be so great: It will be sensitive to outliers A sample average of squared differences can be badly behaved. It uses data differences, rather than the data itself. The components of the sum will be dependent within and across bins, and N(t k ) will vary across bins. Informally, one plots γ(t), and then an appropriately shaped theoretical variogram is fit by eye or by trial and error to choose the nugget, sill, and range. Traditional but more formal fitting uses least squares, weighted least squares or generalized least squares Chapter 2: Basics of Point-Referenced Data Models p. 22/45

62 Variogram model fitting (cont d) This method of moments estimate (analogue of the usual sample variance s 2 ) may not be so great: It will be sensitive to outliers A sample average of squared differences can be badly behaved. It uses data differences, rather than the data itself. The components of the sum will be dependent within and across bins, and N(t k ) will vary across bins. Informally, one plots γ(t), and then an appropriately shaped theoretical variogram is fit by eye or by trial and error to choose the nugget, sill, and range. Traditional but more formal fitting uses least squares, weighted least squares or generalized least squares For us, a (likelihood or Bayesian) estimation problem! Chapter 2: Basics of Point-Referenced Data Models p. 22/45

63 Anisotropy Here association depends upon the separation vector between locations (i.e., direction and distance). Chapter 2: Basics of Point-Referenced Data Models p. 23/45

64 Anisotropy Here association depends upon the separation vector between locations (i.e., direction and distance). Example: geometric anisotropy, where c(s s ) = σ 2 ρ((s s ) T B(s s )). Chapter 2: Basics of Point-Referenced Data Models p. 23/45

65 Anisotropy Here association depends upon the separation vector between locations (i.e., direction and distance). Example: geometric anisotropy, where c(s s ) = σ 2 ρ((s s ) T B(s s )). B is positive definite with ρ a valid correlation function in R r (e.g., exponential). Chapter 2: Basics of Point-Referenced Data Models p. 23/45

66 Anisotropy Here association depends upon the separation vector between locations (i.e., direction and distance). Example: geometric anisotropy, where c(s s ) = σ 2 ρ((s s ) T B(s s )). B is positive definite with ρ a valid correlation function in R r (e.g., exponential). We omit the range/decay parameter φ since we now use B; when r = 2 this means three range parameters" instead of one. Chapter 2: Basics of Point-Referenced Data Models p. 23/45

67 Anisotropy Here association depends upon the separation vector between locations (i.e., direction and distance). Example: geometric anisotropy, where c(s s ) = σ 2 ρ((s s ) T B(s s )). B is positive definite with ρ a valid correlation function in R r (e.g., exponential). We omit the range/decay parameter φ since we now use B; when r = 2 this means three range parameters" instead of one. Contours of constant association arising from c above are elliptical. In particular, the contour corresponding to ρ =.05 provides the effective range in each spatial direction. Chapter 2: Basics of Point-Referenced Data Models p. 23/45

68 Anisotropy (cont d) We can extend geometric anisotropy to product geometric anisotropy, e.g. c(s s ) = σ 2 ρ 1 ((s s ) T B 1 (s s ))ρ 2 ((s s ) T B 2 (s s )), a valid covariance function provided ρ 1 and ρ 2 are valid. Chapter 2: Basics of Point-Referenced Data Models p. 24/45

69 Anisotropy (cont d) We can extend geometric anisotropy to product geometric anisotropy, e.g. c(s s ) = σ 2 ρ 1 ((s s ) T B 1 (s s ))ρ 2 ((s s ) T B 2 (s s )), a valid covariance function provided ρ 1 and ρ 2 are valid. Both geometric anisotropy and product geometric anisotropy are special cases of range anisotropy (Zimmerman, 1993); suggests we might also define Chapter 2: Basics of Point-Referenced Data Models p. 24/45

70 Anisotropy (cont d) We can extend geometric anisotropy to product geometric anisotropy, e.g. c(s s ) = σ 2 ρ 1 ((s s ) T B 1 (s s ))ρ 2 ((s s ) T B 2 (s s )), a valid covariance function provided ρ 1 and ρ 2 are valid. Both geometric anisotropy and product geometric anisotropy are special cases of range anisotropy (Zimmerman, 1993); suggests we might also define sill anisotropy: Given a variogram γ(h), lim c γ(ch/ h ) depends upon h Chapter 2: Basics of Point-Referenced Data Models p. 24/45

71 Anisotropy (cont d) We can extend geometric anisotropy to product geometric anisotropy, e.g. c(s s ) = σ 2 ρ 1 ((s s ) T B 1 (s s ))ρ 2 ((s s ) T B 2 (s s )), a valid covariance function provided ρ 1 and ρ 2 are valid. Both geometric anisotropy and product geometric anisotropy are special cases of range anisotropy (Zimmerman, 1993); suggests we might also define sill anisotropy: Given a variogram γ(h), lim c γ(ch/ h ) depends upon h nugget anisotropy: Given a variogram γ(h), lim c 0 γ(ch/ h ) depends upon h. Chapter 2: Basics of Point-Referenced Data Models p. 24/45

72 Exploration of Spatial Data First step in analyzing data Chapter 2: Basics of Point-Referenced Data Models p. 25/45

73 Exploration of Spatial Data First step in analyzing data First Law of Geostatistics : Data = Mean + Error Chapter 2: Basics of Point-Referenced Data Models p. 25/45

74 Exploration of Spatial Data First step in analyzing data First Law of Geostatistics : Data = Mean + Error Mean: first-order behavior Chapter 2: Basics of Point-Referenced Data Models p. 25/45

75 Exploration of Spatial Data First step in analyzing data First Law of Geostatistics : Data = Mean + Error Mean: first-order behavior Error: second-order behavior (covariance function) Chapter 2: Basics of Point-Referenced Data Models p. 25/45

76 Exploration of Spatial Data First step in analyzing data First Law of Geostatistics : Data = Mean + Error Mean: first-order behavior Error: second-order behavior (covariance function) EDA tools examine both first and second order behavior Chapter 2: Basics of Point-Referenced Data Models p. 25/45

77 Exploration of Spatial Data First step in analyzing data First Law of Geostatistics : Data = Mean + Error Mean: first-order behavior Error: second-order behavior (covariance function) EDA tools examine both first and second order behavior Preliminary displays: From simple location maps to surface, image, and contour plots Chapter 2: Basics of Point-Referenced Data Models p. 25/45

78 First Law of Geostatistics = + data mean error Chapter 2: Basics of Point-Referenced Data Models p. 26/45

79 Arrangement of plots N S UTM coordinates E W UTM coordinates Chapter 2: Basics of Point-Referenced Data Models p. 27/45

80 Drop-line scatter plot Chapter 2: Basics of Point-Referenced Data Models p. 28/45

81 Longitude Surface plot Chapter 2: Basics of Point-Referenced Data Models p. 29/ Latitude logsp

82 Image-contour plot Latitude Longitude Chapter 2: Basics of Point-Referenced Data Models p. 30/45

83 EDA for assessing anisotropy Directional semivariogram: Choose angle classes η i ± ǫ, i = 1,...,L where ǫ is the halfwidth of the angle class and L is the number of angle classes Chapter 2: Basics of Point-Referenced Data Models p. 31/45

84 EDA for assessing anisotropy Directional semivariogram: Choose angle classes η i ± ǫ, i = 1,...,L where ǫ is the halfwidth of the angle class and L is the number of angle classes Example: Take ǫ = 22.5 and L = 4, so we obtain the four cardinal directions measured counterclockwise from the x-axis, 0, 45, 90, and 135 Chapter 2: Basics of Point-Referenced Data Models p. 31/45

85 EDA for assessing anisotropy Directional semivariogram: Choose angle classes η i ± ǫ, i = 1,...,L where ǫ is the halfwidth of the angle class and L is the number of angle classes Example: Take ǫ = 22.5 and L = 4, so we obtain the four cardinal directions measured counterclockwise from the x-axis, 0, 45, 90, and 135 For a given angle class, the Matheron empirical semivariogram provides a directional semivariogram for angle η i. Chapter 2: Basics of Point-Referenced Data Models p. 31/45

86 EDA for assessing anisotropy Directional semivariogram: Choose angle classes η i ± ǫ, i = 1,...,L where ǫ is the halfwidth of the angle class and L is the number of angle classes Example: Take ǫ = 22.5 and L = 4, so we obtain the four cardinal directions measured counterclockwise from the x-axis, 0, 45, 90, and 135 For a given angle class, the Matheron empirical semivariogram provides a directional semivariogram for angle η i. Theoretically, all types of anisotropy can be assessed from these directional semivariograms; however, in practice determining whether the sill, nugget, and/or range varies with direction can be difficult. Chapter 2: Basics of Point-Referenced Data Models p. 31/45

87 EDA for assessing anisotropy Rose diagram: Created from the directional semivariograms to evaluate geometric anisotropy Chapter 2: Basics of Point-Referenced Data Models p. 32/45

88 EDA for assessing anisotropy Rose diagram: Created from the directional semivariograms to evaluate geometric anisotropy At an arbitrarily selected γ, for a directional semivariogram at angle η, the distance d at which the directional semivariogram attains γ can be interpolated. Chapter 2: Basics of Point-Referenced Data Models p. 32/45

89 EDA for assessing anisotropy Rose diagram: Created from the directional semivariograms to evaluate geometric anisotropy At an arbitrarily selected γ, for a directional semivariogram at angle η, the distance d at which the directional semivariogram attains γ can be interpolated. The rose diagram is a plot of angle η and corresponding distance d in polar coordinates. Chapter 2: Basics of Point-Referenced Data Models p. 32/45

90 EDA for assessing anisotropy Rose diagram: Created from the directional semivariograms to evaluate geometric anisotropy At an arbitrarily selected γ, for a directional semivariogram at angle η, the distance d at which the directional semivariogram attains γ can be interpolated. The rose diagram is a plot of angle η and corresponding distance d in polar coordinates. If an elliptical contour describes the extremities of the rose diagram reasonably well, then the process exhibits geometric anisotropy; a circular contour suggests isotropy. Chapter 2: Basics of Point-Referenced Data Models p. 32/45

91 EDA for assessing anisotropy Panel A below shows directional semivariograms for the 1990 scallop data (the semivariogram points are connected only to aid comparison). Possible conclusions: the variability in the 45 direction (parallel to the coastline) is significantly less than in the other three directions; Chapter 2: Basics of Point-Referenced Data Models p. 33/45

92 EDA for assessing anisotropy Panel A below shows directional semivariograms for the 1990 scallop data (the semivariogram points are connected only to aid comparison). Possible conclusions: the variability in the 45 direction (parallel to the coastline) is significantly less than in the other three directions; the variability perpendicular to the coastline (135 ) is very erratic, possibly exhibiting sill anisotropy. Chapter 2: Basics of Point-Referenced Data Models p. 33/45

93 EDA for assessing anisotropy Panel A below shows directional semivariograms for the 1990 scallop data (the semivariogram points are connected only to aid comparison). Possible conclusions: the variability in the 45 direction (parallel to the coastline) is significantly less than in the other three directions; the variability perpendicular to the coastline (135 ) is very erratic, possibly exhibiting sill anisotropy. Panel B below shows the rose diagram for the 1990 scallop data, using the γ contour of 4.5. It is approximately elliptical, oriented parallel to the coastline ( 45 ) possible geometric anisotropy! Chapter 2: Basics of Point-Referenced Data Models p. 33/45

94 EDA for assessing anisotropy A B Gamma (dist) Degrees h, N-S Distance (km) h, E-W Chapter 2: Basics of Point-Referenced Data Models p. 34/45

95 EDA for assessing anisotropy A B Gamma (dist) Degrees h, N-S Distance (km) h, E-W Warning: it is dangerous to read too much significance and interpretation into directional variograms. No sample sizes (and thus no assessments of variability) are attached to these pictures. Directional variograms from truly isotropic data often exhibit differences of the magnitudes seen above. Chapter 2: Basics of Point-Referenced Data Models p. 34/45

96 Empirical semivariogram contour plots Also called contour maps of the grouped variogram values, or isarithmic plots of the semivariogram. For each of the ( N 2) pairs of sites in R 2, calculate h x and h y, the separation distances along each axis. Chapter 2: Basics of Point-Referenced Data Models p. 35/45

97 Empirical semivariogram contour plots Also called contour maps of the grouped variogram values, or isarithmic plots of the semivariogram. For each of the ( N 2) pairs of sites in R 2, calculate h x and h y, the separation distances along each axis. Since the sign of h y depends upon the order of the sites, insist h y 0. (Take ( h x, h y ) when h y < 0.) Chapter 2: Basics of Point-Referenced Data Models p. 35/45

98 Empirical semivariogram contour plots Also called contour maps of the grouped variogram values, or isarithmic plots of the semivariogram. For each of the ( N 2) pairs of sites in R 2, calculate h x and h y, the separation distances along each axis. Since the sign of h y depends upon the order of the sites, insist h y 0. (Take ( h x, h y ) when h y < 0.) Aggregate these pairs into rectangular bins B ij and compute the empirical semivariogram for B ij as γ ij = 1 2N Bij {(k,l):(s k s l ) B ij } (Y (s k ) Y (s l )) 2, where N Bij is the number of sites in bin B ij. Chapter 2: Basics of Point-Referenced Data Models p. 35/45

99 Empirical semivariogram contour plots Labeling the center of the (i,j)th bin by (x i,y j ), a three dimensional plot of γ ij versus (x i,y j ) yields an empirical semivariogram surface. A contour plot of a smoothed version of this surface produces the ESC plot. Chapter 2: Basics of Point-Referenced Data Models p. 36/45

100 Empirical semivariogram contour plots Labeling the center of the (i,j)th bin by (x i,y j ), a three dimensional plot of γ ij versus (x i,y j ) yields an empirical semivariogram surface. A contour plot of a smoothed version of this surface produces the ESC plot. The ESC plot can be used to assess departures from isotropy: isotropy is depicted by circular contours, while geometric anisotropy is implied by elliptical contours. Chapter 2: Basics of Point-Referenced Data Models p. 36/45

101 Empirical semivariogram contour plots Labeling the center of the (i,j)th bin by (x i,y j ), a three dimensional plot of γ ij versus (x i,y j ) yields an empirical semivariogram surface. A contour plot of a smoothed version of this surface produces the ESC plot. The ESC plot can be used to assess departures from isotropy: isotropy is depicted by circular contours, while geometric anisotropy is implied by elliptical contours. A rose diagram traces only one arbitrarily selected contour of this plot. Chapter 2: Basics of Point-Referenced Data Models p. 36/45

102 Empirical semivariogram contour plots Labeling the center of the (i,j)th bin by (x i,y j ), a three dimensional plot of γ ij versus (x i,y j ) yields an empirical semivariogram surface. A contour plot of a smoothed version of this surface produces the ESC plot. The ESC plot can be used to assess departures from isotropy: isotropy is depicted by circular contours, while geometric anisotropy is implied by elliptical contours. A rose diagram traces only one arbitrarily selected contour of this plot. A possible drawback to the ESC plot is the occurrence of sparse counts in extreme bins. However, these bins may be trimmed before smoothing if desired. Chapter 2: Basics of Point-Referenced Data Models p. 36/45

103 ESC plot of the 1993 scallop data h, N-S h, E-W Plot is overlaid on the bin centers and their counts. Chapter 2: Basics of Point-Referenced Data Models p. 37/45

104 ESC plot of the 1993 scallop data h, N-S h, E-W Plot is overlaid on the bin centers and their counts. Suggests geometric anisotropy! Chapter 2: Basics of Point-Referenced Data Models p. 37/45

105 ESC plot of the 1993 scallop data h, N-S h, E-W Plot is overlaid on the bin centers and their counts. Suggests geometric anisotropy! The ESC values in the h y 0 row provides an alternative to the usual 0 directional semivariogram. Chapter 2: Basics of Point-Referenced Data Models p. 37/45

106 Classical spatial prediction (Kriging) Named by Matheron (1963) in honor of D.G. Krige, a South African mining engineer whose seminal work on empirical methods for geostatistical data inspired the general approach Chapter 2: Basics of Point-Referenced Data Models p. 38/45

107 Classical spatial prediction (Kriging) Named by Matheron (1963) in honor of D.G. Krige, a South African mining engineer whose seminal work on empirical methods for geostatistical data inspired the general approach Optimal spatial prediction: given observations of a random field Y = (Y (s 1 ),...,Y (s n )), predict the variable Y at a site s 0 where it has not been observed Chapter 2: Basics of Point-Referenced Data Models p. 38/45

108 Classical spatial prediction (Kriging) Named by Matheron (1963) in honor of D.G. Krige, a South African mining engineer whose seminal work on empirical methods for geostatistical data inspired the general approach Optimal spatial prediction: given observations of a random field Y = (Y (s 1 ),...,Y (s n )), predict the variable Y at a site s 0 where it has not been observed Under squared error loss, the best linear prediction minimizes E[Y (s 0 ) ( l i Y (s i ) + δ 0 )] 2 (the kriging equations) over δ 0 and l i. Chapter 2: Basics of Point-Referenced Data Models p. 38/45

109 Classical spatial prediction (Kriging) Named by Matheron (1963) in honor of D.G. Krige, a South African mining engineer whose seminal work on empirical methods for geostatistical data inspired the general approach Optimal spatial prediction: given observations of a random field Y = (Y (s 1 ),...,Y (s n )), predict the variable Y at a site s 0 where it has not been observed Under squared error loss, the best linear prediction minimizes E[Y (s 0 ) ( l i Y (s i ) + δ 0 )] 2 (the kriging equations) over δ 0 and l i. Under intrinsic stationarity and adopting unbiasedness, δ 0 drops out. So, with an estimate of γ, one immediately obtains the ordinary kriging estimate. Chapter 2: Basics of Point-Referenced Data Models p. 38/45

110 Classical spatial prediction (Kriging) Named by Matheron (1963) in honor of D.G. Krige, a South African mining engineer whose seminal work on empirical methods for geostatistical data inspired the general approach Optimal spatial prediction: given observations of a random field Y = (Y (s 1 ),...,Y (s n )), predict the variable Y at a site s 0 where it has not been observed Under squared error loss, the best linear prediction minimizes E[Y (s 0 ) ( l i Y (s i ) + δ 0 )] 2 (the kriging equations) over δ 0 and l i. Under intrinsic stationarity and adopting unbiasedness, δ 0 drops out. So, with an estimate of γ, one immediately obtains the ordinary kriging estimate. No distributional assumptions are required for the Y (s i ). Chapter 2: Basics of Point-Referenced Data Models p. 38/45

111 Difficulties Limitation of a constant mean so introduce a mean surface, and then we have universal kriging Chapter 2: Basics of Point-Referenced Data Models p. 39/45

112 Difficulties Limitation of a constant mean so introduce a mean surface, and then we have universal kriging But what if mean surface unknown? variogram unknown? If we put estimates of both of these into the kriging equations, we fail to take into account the uncertainty in these estimates Chapter 2: Basics of Point-Referenced Data Models p. 39/45

113 Difficulties Limitation of a constant mean so introduce a mean surface, and then we have universal kriging But what if mean surface unknown? variogram unknown? If we put estimates of both of these into the kriging equations, we fail to take into account the uncertainty in these estimates So, we turn to Gaussian processes and likelihoodbased methods, and work with the covariance function directly. We now redo the calculations in this setting... Chapter 2: Basics of Point-Referenced Data Models p. 39/45

114 Kriging with Gaussian processes Given covariate values x(s i ), i = 0, 1,...,n, suppose Y = Xβ + ǫ, where ǫ N (0, Σ). Chapter 2: Basics of Point-Referenced Data Models p. 40/45

115 Kriging with Gaussian processes Given covariate values x(s i ), i = 0, 1,...,n, suppose Y = Xβ + ǫ, where ǫ N (0, Σ). For a spatial covariance structure having no nugget effect, we specify Σ as Σ = σ 2 H (φ) where (H (φ)) ij = ρ(φ;d ij ), where d ij = s i s j, the distance between s i and s j, and ρ is a valid correlation function on R r. Chapter 2: Basics of Point-Referenced Data Models p. 40/45

116 Kriging with Gaussian processes Given covariate values x(s i ), i = 0, 1,...,n, suppose Y = Xβ + ǫ, where ǫ N (0, Σ). For a spatial covariance structure having no nugget effect, we specify Σ as Σ = σ 2 H (φ) where (H (φ)) ij = ρ(φ;d ij ), where d ij = s i s j, the distance between s i and s j, and ρ is a valid correlation function on R r. For a model having a nugget effect, we instead set Σ = σ 2 H (φ) + τ 2 I, where τ 2 is the nugget effect variance. Chapter 2: Basics of Point-Referenced Data Models p. 40/45

117 Kriging with Gaussian processes We seek the function f (y) that minimizes [ the ] mean-squared prediction error, E (Y (s 0 ) f (y)) 2 y, i.e., we work with the conditional distribution of Y (s 0 ) y Chapter 2: Basics of Point-Referenced Data Models p. 41/45

118 Kriging with Gaussian processes We seek the function f (y) that minimizes [ the ] mean-squared prediction error, E (Y (s 0 ) f (y)) 2 y, i.e., we work with the conditional distribution of Y (s 0 ) y But this expression equals { } E (Y (s 0 ) E[Y (s 0 ) y]) 2 y + {E[Y (s 0 ) y] f (y)} 2, since the expectation of the cross-product term is 0. Chapter 2: Basics of Point-Referenced Data Models p. 41/45

119 Kriging with Gaussian processes We seek the function f (y) that minimizes [ the ] mean-squared prediction error, E (Y (s 0 ) f (y)) 2 y, i.e., we work with the conditional distribution of Y (s 0 ) y But this expression equals { } E (Y (s 0 ) E[Y (s 0 ) y]) 2 y + {E[Y (s 0 ) y] f (y)} 2, since the expectation of the cross-product term is 0. Since the second term is nonnegative, we have [ ] { } E (Y (s 0 ) f (y)) 2 y E (Y (s 0 ) E[Y (s 0 ) y]) 2 y for any function f (y). Chapter 2: Basics of Point-Referenced Data Models p. 41/45

120 Kriging with Gaussian processes Equality holds above f (y) = E[Y (s 0 ) y], so it must be that the predictor f(y) that minimizes the error is the conditional expectation of Y (s 0 ) given the data. Chapter 2: Basics of Point-Referenced Data Models p. 42/45

121 Kriging with Gaussian processes Equality holds above f (y) = E[Y (s 0 ) y], so it must be that the predictor f(y) that minimizes the error is the conditional expectation of Y (s 0 ) given the data. This result is quite intuitive from a Bayesian point of view, since this f(y) is just the posterior mean of Y (s 0 )! Chapter 2: Basics of Point-Referenced Data Models p. 42/45

122 Kriging with Gaussian processes Equality holds above f (y) = E[Y (s 0 ) y], so it must be that the predictor f(y) that minimizes the error is the conditional expectation of Y (s 0 ) given the data. This result is quite intuitive from a Bayesian point of view, since this f(y) is just the posterior mean of Y (s 0 )! Now consider estimation of this best predictor, first in the wildly unrealistic situation in which all the population parameters (β,σ 2,φ, and τ 2 ) are known. Suppose ( Y 1 Y 2 ) N (( µ 1 µ 2 ), ( Ω 11 Ω 12 Ω 21 Ω 22 )) where Ω 21 = Ω T 12. Chapter 2: Basics of Point-Referenced Data Models p. 42/45,

123 Kriging with Gaussian processes Then p (Y 1 Y 2 ) is normal with mean and variance E[Y 1 Y 2 ] = µ 1 + Ω 12 Ω 1 22 (Y 2 µ 2 ) and V ar[y 1 Y 2 ] = Ω 11 Ω 12 Ω 1 22 Ω 21. Chapter 2: Basics of Point-Referenced Data Models p. 43/45

124 Kriging with Gaussian processes Then p (Y 1 Y 2 ) is normal with mean and variance E[Y 1 Y 2 ] = µ 1 + Ω 12 Ω 1 22 (Y 2 µ 2 ) and V ar[y 1 Y 2 ] = Ω 11 Ω 12 Ω 1 22 Ω 21. In our framework, Y 1 = Y (s 0 ) and Y 2 = y, meaning that Ω 11 = σ 2 + τ 2, Ω 12 = γ T, and Ω 22 = Σ = σ 2 H (φ) + τ 2 I, where γ T = ( σ 2 ρ(φ;d 01 ),...,σ 2 ρ(φ;d 0n ) ). Chapter 2: Basics of Point-Referenced Data Models p. 43/45

125 Kriging with Gaussian processes Then p (Y 1 Y 2 ) is normal with mean and variance E[Y 1 Y 2 ] = µ 1 + Ω 12 Ω 1 22 (Y 2 µ 2 ) and V ar[y 1 Y 2 ] = Ω 11 Ω 12 Ω 1 22 Ω 21. In our framework, Y 1 = Y (s 0 ) and Y 2 = y, meaning that Ω 11 = σ 2 + τ 2, Ω 12 = γ T, and Ω 22 = Σ = σ 2 H (φ) + τ 2 I, where γ T = ( σ 2 ρ(φ;d 01 ),...,σ 2 ρ(φ;d 0n ) ). Substituting these values into the mean and variance formulae above, we obtain E [Y (s 0 ) y] = x T 0 β + γ T Σ 1 (y Xβ), and V ar[y (s 0 ) y] = σ 2 + τ 2 γ T Σ 1 γ. Chapter 2: Basics of Point-Referenced Data Models p. 43/45

126 Kriging with Gaussian processes Next, consider how these answers are modified in the more realistic scenario where the model parameters are unknown. We modify f(y) to f (y) = x T β ( ) 0 + γ T Σ 1 y X β, ( T where γ = ˆσ 2 ρ(ˆφ;d 01 ),..., ˆσ 2 ρ(ˆφ;d 0n )), Σ = ˆσ 2 H(ˆφ), and β = β ( 1 WLS = X X) T Σ 1 X T Σ 1 y. Chapter 2: Basics of Point-Referenced Data Models p. 44/45

Introduction to Spatial Data and Models

Introduction to Spatial Data and Models Introduction to Spatial Data and Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry

More information

Introduction to Spatial Data and Models

Introduction to Spatial Data and Models Introduction to Spatial Data and Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Department of Forestry & Department of Geography, Michigan State University, Lansing Michigan, U.S.A. 2 Biostatistics,

More information

CBMS Lecture 1. Alan E. Gelfand Duke University

CBMS Lecture 1. Alan E. Gelfand Duke University CBMS Lecture 1 Alan E. Gelfand Duke University Introduction to spatial data and models Researchers in diverse areas such as climatology, ecology, environmental exposure, public health, and real estate

More information

Point-Referenced Data Models

Point-Referenced Data Models Point-Referenced Data Models Jamie Monogan University of Georgia Spring 2013 Jamie Monogan (UGA) Point-Referenced Data Models Spring 2013 1 / 19 Objectives By the end of these meetings, participants should

More information

Introduction to Geostatistics

Introduction to Geostatistics Introduction to Geostatistics Abhi Datta 1, Sudipto Banerjee 2 and Andrew O. Finley 3 July 31, 2017 1 Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore,

More information

Models for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data

Models for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data Hierarchical models for spatial data Based on the book by Banerjee, Carlin and Gelfand Hierarchical Modeling and Analysis for Spatial Data, 2004. We focus on Chapters 1, 2 and 5. Geo-referenced data arise

More information

Handbook of Spatial Statistics Chapter 2: Continuous Parameter Stochastic Process Theory by Gneiting and Guttorp

Handbook of Spatial Statistics Chapter 2: Continuous Parameter Stochastic Process Theory by Gneiting and Guttorp Handbook of Spatial Statistics Chapter 2: Continuous Parameter Stochastic Process Theory by Gneiting and Guttorp Marcela Alfaro Córdoba August 25, 2016 NCSU Department of Statistics Continuous Parameter

More information

Introduction to Spatial Data and Models

Introduction to Spatial Data and Models Introduction to Spatial Data and Models Researchers in diverse areas such as climatology, ecology, environmental health, and real estate marketing are increasingly faced with the task of analyzing data

More information

Chapter 4 - Fundamentals of spatial processes Lecture notes

Chapter 4 - Fundamentals of spatial processes Lecture notes TK4150 - Intro 1 Chapter 4 - Fundamentals of spatial processes Lecture notes Odd Kolbjørnsen and Geir Storvik January 30, 2017 STK4150 - Intro 2 Spatial processes Typically correlation between nearby sites

More information

Hierarchical Modeling for Univariate Spatial Data

Hierarchical Modeling for Univariate Spatial Data Hierarchical Modeling for Univariate Spatial Data Geography 890, Hierarchical Bayesian Models for Environmental Spatial Data Analysis February 15, 2011 1 Spatial Domain 2 Geography 890 Spatial Domain This

More information

Statistícal Methods for Spatial Data Analysis

Statistícal Methods for Spatial Data Analysis Texts in Statistícal Science Statistícal Methods for Spatial Data Analysis V- Oliver Schabenberger Carol A. Gotway PCT CHAPMAN & K Contents Preface xv 1 Introduction 1 1.1 The Need for Spatial Analysis

More information

Chapter 4 - Fundamentals of spatial processes Lecture notes

Chapter 4 - Fundamentals of spatial processes Lecture notes Chapter 4 - Fundamentals of spatial processes Lecture notes Geir Storvik January 21, 2013 STK4150 - Intro 2 Spatial processes Typically correlation between nearby sites Mostly positive correlation Negative

More information

ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS

ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS Richard L. Smith Department of Statistics and Operations Research University of North Carolina Chapel Hill, N.C.,

More information

REML Estimation and Linear Mixed Models 4. Geostatistics and linear mixed models for spatial data

REML Estimation and Linear Mixed Models 4. Geostatistics and linear mixed models for spatial data REML Estimation and Linear Mixed Models 4. Geostatistics and linear mixed models for spatial data Sue Welham Rothamsted Research Harpenden UK AL5 2JQ December 1, 2008 1 We will start by reviewing the principles

More information

Hierarchical Modelling for Univariate Spatial Data

Hierarchical Modelling for Univariate Spatial Data Hierarchical Modelling for Univariate Spatial Data Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department

More information

Introduction. Semivariogram Cloud

Introduction. Semivariogram Cloud Introduction Data: set of n attribute measurements {z(s i ), i = 1,, n}, available at n sample locations {s i, i = 1,, n} Objectives: Slide 1 quantify spatial auto-correlation, or attribute dissimilarity

More information

E(x i ) = µ i. 2 d. + sin 1 d θ 2. for d < θ 2 0 for d θ 2

E(x i ) = µ i. 2 d. + sin 1 d θ 2. for d < θ 2 0 for d θ 2 1 Gaussian Processes Definition 1.1 A Gaussian process { i } over sites i is defined by its mean function and its covariance function E( i ) = µ i c ij = Cov( i, j ) plus joint normality of the finite

More information

Practicum : Spatial Regression

Practicum : Spatial Regression : Alexandra M. Schmidt Instituto de Matemática UFRJ - www.dme.ufrj.br/ alex 2014 Búzios, RJ, www.dme.ufrj.br Exploratory (Spatial) Data Analysis 1. Non-spatial summaries Numerical summaries: Mean, median,

More information

Spatial Statistics with Image Analysis. Lecture L02. Computer exercise 0 Daily Temperature. Lecture 2. Johan Lindström.

Spatial Statistics with Image Analysis. Lecture L02. Computer exercise 0 Daily Temperature. Lecture 2. Johan Lindström. C Stochastic fields Covariance Spatial Statistics with Image Analysis Lecture 2 Johan Lindström November 4, 26 Lecture L2 Johan Lindström - johanl@maths.lth.se FMSN2/MASM2 L /2 C Stochastic fields Covariance

More information

Kriging Luc Anselin, All Rights Reserved

Kriging Luc Anselin, All Rights Reserved Kriging Luc Anselin Spatial Analysis Laboratory Dept. Agricultural and Consumer Economics University of Illinois, Urbana-Champaign http://sal.agecon.uiuc.edu Outline Principles Kriging Models Spatial Interpolation

More information

Geostatistics in Hydrology: Kriging interpolation

Geostatistics in Hydrology: Kriging interpolation Chapter Geostatistics in Hydrology: Kriging interpolation Hydrologic properties, such as rainfall, aquifer characteristics (porosity, hydraulic conductivity, transmissivity, storage coefficient, etc.),

More information

Hierarchical Modelling for Univariate Spatial Data

Hierarchical Modelling for Univariate Spatial Data Spatial omain Hierarchical Modelling for Univariate Spatial ata Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A.

More information

Fluvial Variography: Characterizing Spatial Dependence on Stream Networks. Dale Zimmerman University of Iowa (joint work with Jay Ver Hoef, NOAA)

Fluvial Variography: Characterizing Spatial Dependence on Stream Networks. Dale Zimmerman University of Iowa (joint work with Jay Ver Hoef, NOAA) Fluvial Variography: Characterizing Spatial Dependence on Stream Networks Dale Zimmerman University of Iowa (joint work with Jay Ver Hoef, NOAA) March 5, 2015 Stream network data Flow Legend o 4.40-5.80

More information

Lecture 9: Introduction to Kriging

Lecture 9: Introduction to Kriging Lecture 9: Introduction to Kriging Math 586 Beginning remarks Kriging is a commonly used method of interpolation (prediction) for spatial data. The data are a set of observations of some variable(s) of

More information

Types of Spatial Data

Types of Spatial Data Spatial Data Types of Spatial Data Point pattern Point referenced geostatistical Block referenced Raster / lattice / grid Vector / polygon Point Pattern Data Interested in the location of points, not their

More information

Spatial Backfitting of Roller Measurement Values from a Florida Test Bed

Spatial Backfitting of Roller Measurement Values from a Florida Test Bed Spatial Backfitting of Roller Measurement Values from a Florida Test Bed Daniel K. Heersink 1, Reinhard Furrer 1, and Mike A. Mooney 2 1 Institute of Mathematics, University of Zurich, CH-8057 Zurich 2

More information

PRODUCING PROBABILITY MAPS TO ASSESS RISK OF EXCEEDING CRITICAL THRESHOLD VALUE OF SOIL EC USING GEOSTATISTICAL APPROACH

PRODUCING PROBABILITY MAPS TO ASSESS RISK OF EXCEEDING CRITICAL THRESHOLD VALUE OF SOIL EC USING GEOSTATISTICAL APPROACH PRODUCING PROBABILITY MAPS TO ASSESS RISK OF EXCEEDING CRITICAL THRESHOLD VALUE OF SOIL EC USING GEOSTATISTICAL APPROACH SURESH TRIPATHI Geostatistical Society of India Assumptions and Geostatistical Variogram

More information

Modeling and Interpolation of Non-Gaussian Spatial Data: A Comparative Study

Modeling and Interpolation of Non-Gaussian Spatial Data: A Comparative Study Modeling and Interpolation of Non-Gaussian Spatial Data: A Comparative Study Gunter Spöck, Hannes Kazianka, Jürgen Pilz Department of Statistics, University of Klagenfurt, Austria hannes.kazianka@uni-klu.ac.at

More information

Spatio-Temporal Geostatistical Models, with an Application in Fish Stock

Spatio-Temporal Geostatistical Models, with an Application in Fish Stock Spatio-Temporal Geostatistical Models, with an Application in Fish Stock Ioannis Elmatzoglou Submitted for the degree of Master in Statistics at Lancaster University, September 2006. Abstract Geostatistics

More information

A Framework for Daily Spatio-Temporal Stochastic Weather Simulation

A Framework for Daily Spatio-Temporal Stochastic Weather Simulation A Framework for Daily Spatio-Temporal Stochastic Weather Simulation, Rick Katz, Balaji Rajagopalan Geophysical Statistics Project Institute for Mathematics Applied to Geosciences National Center for Atmospheric

More information

7 Geostatistics. Figure 7.1 Focus of geostatistics

7 Geostatistics. Figure 7.1 Focus of geostatistics 7 Geostatistics 7.1 Introduction Geostatistics is the part of statistics that is concerned with geo-referenced data, i.e. data that are linked to spatial coordinates. To describe the spatial variation

More information

Hierarchical Modelling for Univariate and Multivariate Spatial Data

Hierarchical Modelling for Univariate and Multivariate Spatial Data Hierarchical Modelling for Univariate and Multivariate Spatial Data p. 1/4 Hierarchical Modelling for Univariate and Multivariate Spatial Data Sudipto Banerjee sudiptob@biostat.umn.edu University of Minnesota

More information

Covariance function estimation in Gaussian process regression

Covariance function estimation in Gaussian process regression Covariance function estimation in Gaussian process regression François Bachoc Department of Statistics and Operations Research, University of Vienna WU Research Seminar - May 2015 François Bachoc Gaussian

More information

Index. Geostatistics for Environmental Scientists, 2nd Edition R. Webster and M. A. Oliver 2007 John Wiley & Sons, Ltd. ISBN:

Index. Geostatistics for Environmental Scientists, 2nd Edition R. Webster and M. A. Oliver 2007 John Wiley & Sons, Ltd. ISBN: Index Akaike information criterion (AIC) 105, 290 analysis of variance 35, 44, 127 132 angular transformation 22 anisotropy 59, 99 affine or geometric 59, 100 101 anisotropy ratio 101 exploring and displaying

More information

Overview of Spatial Statistics with Applications to fmri

Overview of Spatial Statistics with Applications to fmri with Applications to fmri School of Mathematics & Statistics Newcastle University April 8 th, 2016 Outline Why spatial statistics? Basic results Nonstationary models Inference for large data sets An example

More information

Introduction to Spatial Data and Models

Introduction to Spatial Data and Models Introduction to Spatial Data and Models Sudipto Banerjee and Andrew O. Finley 2 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry

More information

BAYESIAN KRIGING AND BAYESIAN NETWORK DESIGN

BAYESIAN KRIGING AND BAYESIAN NETWORK DESIGN BAYESIAN KRIGING AND BAYESIAN NETWORK DESIGN Richard L. Smith Department of Statistics and Operations Research University of North Carolina Chapel Hill, N.C., U.S.A. J. Stuart Hunter Lecture TIES 2004

More information

What s for today. All about Variogram Nugget effect. Mikyoung Jun (Texas A&M) stat647 lecture 4 September 6, / 17

What s for today. All about Variogram Nugget effect. Mikyoung Jun (Texas A&M) stat647 lecture 4 September 6, / 17 What s for today All about Variogram Nugget effect Mikyoung Jun (Texas A&M) stat647 lecture 4 September 6, 2012 1 / 17 What is the variogram? Let us consider a stationary (or isotropic) random field Z

More information

Spatial smoothing using Gaussian processes

Spatial smoothing using Gaussian processes Spatial smoothing using Gaussian processes Chris Paciorek paciorek@hsph.harvard.edu August 5, 2004 1 OUTLINE Spatial smoothing and Gaussian processes Covariance modelling Nonstationary covariance modelling

More information

Bayesian Transgaussian Kriging

Bayesian Transgaussian Kriging 1 Bayesian Transgaussian Kriging Hannes Müller Institut für Statistik University of Klagenfurt 9020 Austria Keywords: Kriging, Bayesian statistics AMS: 62H11,60G90 Abstract In geostatistics a widely used

More information

Non-gaussian spatiotemporal modeling

Non-gaussian spatiotemporal modeling Dec, 2008 1/ 37 Non-gaussian spatiotemporal modeling Thais C O da Fonseca Joint work with Prof Mark F J Steel Department of Statistics University of Warwick Dec, 2008 Dec, 2008 2/ 37 1 Introduction Motivation

More information

Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields

Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields 1 Introduction Jo Eidsvik Department of Mathematical Sciences, NTNU, Norway. (joeid@math.ntnu.no) February

More information

What s for today. Introduction to Space-time models. c Mikyoung Jun (Texas A&M) Stat647 Lecture 14 October 16, / 19

What s for today. Introduction to Space-time models. c Mikyoung Jun (Texas A&M) Stat647 Lecture 14 October 16, / 19 What s for today Introduction to Space-time models c Mikyoung Jun (Texas A&M) Stat647 Lecture 14 October 16, 2012 1 / 19 Space-time Data So far we looked at the data that vary over space Now we add another

More information

Interpolation of Spatial Data

Interpolation of Spatial Data Michael L. Stein Interpolation of Spatial Data Some Theory for Kriging With 27 Illustrations Springer Contents Preface vii 1 Linear Prediction 1 1.1 Introduction 1 1.2 Best linear prediction 2 Exercises

More information

A new covariance function for spatio-temporal data analysis with application to atmospheric pollution and sensor networking

A new covariance function for spatio-temporal data analysis with application to atmospheric pollution and sensor networking A new covariance function for spatio-temporal data analysis with application to atmospheric pollution and sensor networking György Terdik and Subba Rao Tata UofD, HU & UofM, UK January 30, 2015 Laboratoire

More information

Introduction. Spatial Processes & Spatial Patterns

Introduction. Spatial Processes & Spatial Patterns Introduction Spatial data: set of geo-referenced attribute measurements: each measurement is associated with a location (point) or an entity (area/region/object) in geographical (or other) space; the domain

More information

Statistics for analyzing and modeling precipitation isotope ratios in IsoMAP

Statistics for analyzing and modeling precipitation isotope ratios in IsoMAP Statistics for analyzing and modeling precipitation isotope ratios in IsoMAP The IsoMAP uses the multiple linear regression and geostatistical methods to analyze isotope data Suppose the response variable

More information

Space-time data. Simple space-time analyses. PM10 in space. PM10 in time

Space-time data. Simple space-time analyses. PM10 in space. PM10 in time Space-time data Observations taken over space and over time Z(s, t): indexed by space, s, and time, t Here, consider geostatistical/time data Z(s, t) exists for all locations and all times May consider

More information

Spatial and Environmental Statistics

Spatial and Environmental Statistics Spatial and Environmental Statistics Dale Zimmerman Department of Statistics and Actuarial Science University of Iowa January 17, 2019 Dale Zimmerman (UIOWA) Spatial and Environmental Statistics January

More information

Introduction. Chapter 1

Introduction. Chapter 1 Chapter 1 Introduction In this book we will be concerned with supervised learning, which is the problem of learning input-output mappings from empirical data (the training dataset). Depending on the characteristics

More information

Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models

Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models Technical Vignette 5: Understanding intrinsic Gaussian Markov random field spatial models, including intrinsic conditional autoregressive models Christopher Paciorek, Department of Statistics, University

More information

Time Series Models and Inference. James L. Powell Department of Economics University of California, Berkeley

Time Series Models and Inference. James L. Powell Department of Economics University of California, Berkeley Time Series Models and Inference James L. Powell Department of Economics University of California, Berkeley Overview In contrast to the classical linear regression model, in which the components of the

More information

Bayesian Thinking in Spatial Statistics

Bayesian Thinking in Spatial Statistics Bayesian Thinking in Spatial Statistics Lance A. Waller Department of Biostatistics Rollins School of Public Health Emory University 1518 Clifton Road NE Atlanta, GA 30322 E-mail: lwaller@sph.emory.edu

More information

Time Series 2. Robert Almgren. Sept. 21, 2009

Time Series 2. Robert Almgren. Sept. 21, 2009 Time Series 2 Robert Almgren Sept. 21, 2009 This week we will talk about linear time series models: AR, MA, ARMA, ARIMA, etc. First we will talk about theory and after we will talk about fitting the models

More information

Exploring the World of Ordinary Kriging. Dennis J. J. Walvoort. Wageningen University & Research Center Wageningen, The Netherlands

Exploring the World of Ordinary Kriging. Dennis J. J. Walvoort. Wageningen University & Research Center Wageningen, The Netherlands Exploring the World of Ordinary Kriging Wageningen University & Research Center Wageningen, The Netherlands July 2004 (version 0.2) What is? What is it about? Potential Users a computer program for exploring

More information

Gaussian predictive process models for large spatial data sets.

Gaussian predictive process models for large spatial data sets. Gaussian predictive process models for large spatial data sets. Sudipto Banerjee, Alan E. Gelfand, Andrew O. Finley, and Huiyan Sang Presenters: Halley Brantley and Chris Krut September 28, 2015 Overview

More information

x. Figure 1: Examples of univariate Gaussian pdfs N (x; µ, σ 2 ).

x. Figure 1: Examples of univariate Gaussian pdfs N (x; µ, σ 2 ). .8.6 µ =, σ = 1 µ = 1, σ = 1 / µ =, σ =.. 3 1 1 3 x Figure 1: Examples of univariate Gaussian pdfs N (x; µ, σ ). The Gaussian distribution Probably the most-important distribution in all of statistics

More information

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012 Gaussian Processes Le Song Machine Learning II: Advanced Topics CSE 8803ML, Spring 01 Pictorial view of embedding distribution Transform the entire distribution to expected features Feature space Feature

More information

Investigation of Monthly Pan Evaporation in Turkey with Geostatistical Technique

Investigation of Monthly Pan Evaporation in Turkey with Geostatistical Technique Investigation of Monthly Pan Evaporation in Turkey with Geostatistical Technique Hatice Çitakoğlu 1, Murat Çobaner 1, Tefaruk Haktanir 1, 1 Department of Civil Engineering, Erciyes University, Kayseri,

More information

A kernel indicator variogram and its application to groundwater pollution

A kernel indicator variogram and its application to groundwater pollution Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session IPS101) p.1514 A kernel indicator variogram and its application to groundwater pollution data Menezes, Raquel University

More information

Chapter 1. Summer School GEOSTAT 2014, Spatio-Temporal Geostatistics,

Chapter 1. Summer School GEOSTAT 2014, Spatio-Temporal Geostatistics, Chapter 1 Summer School GEOSTAT 2014, Geostatistics, 2014-06-19 sum- http://ifgi.de/graeler Institute for Geoinformatics University of Muenster 1.1 Spatial Data From a purely statistical perspective, spatial

More information

Karhunen-Loeve Expansion and Optimal Low-Rank Model for Spatial Processes

Karhunen-Loeve Expansion and Optimal Low-Rank Model for Spatial Processes TTU, October 26, 2012 p. 1/3 Karhunen-Loeve Expansion and Optimal Low-Rank Model for Spatial Processes Hao Zhang Department of Statistics Department of Forestry and Natural Resources Purdue University

More information

University of California, Los Angeles Department of Statistics. Geostatistical data: variogram and kriging

University of California, Los Angeles Department of Statistics. Geostatistical data: variogram and kriging University of California, Los Angeles Department of Statistics Statistics 403 Instructor: Nicolas Christou Geostatistical data: variogram and kriging Let Z(s) and Z(s + h) two random variables at locations

More information

Conditional Distribution Fitting of High Dimensional Stationary Data

Conditional Distribution Fitting of High Dimensional Stationary Data Conditional Distribution Fitting of High Dimensional Stationary Data Miguel Cuba and Oy Leuangthong The second order stationary assumption implies the spatial variability defined by the variogram is constant

More information

Laplace Distribution Based Stochastic Models

Laplace Distribution Based Stochastic Models Laplace Distribution Based Stochastic Models Anastassia Baxevani Department of Mathematics and Statistics University of Cyprus results based on work of/with Podgorski, Wegener, Aberg, Bolin, Walin July

More information

STATISTICAL INTERPOLATION METHODS RICHARD SMITH AND NOEL CRESSIE Statistical methods of interpolation are all based on assuming that the process

STATISTICAL INTERPOLATION METHODS RICHARD SMITH AND NOEL CRESSIE Statistical methods of interpolation are all based on assuming that the process 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 STATISTICAL INTERPOLATION METHODS

More information

Spatial analysis is the quantitative study of phenomena that are located in space.

Spatial analysis is the quantitative study of phenomena that are located in space. c HYON-JUNG KIM, 2016 1 Introduction Spatial analysis is the quantitative study of phenomena that are located in space. Spatial data analysis usually refers to an analysis of the observations in which

More information

An Introduction to Pattern Statistics

An Introduction to Pattern Statistics An Introduction to Pattern Statistics Nearest Neighbors The CSR hypothesis Clark/Evans and modification Cuzick and Edwards and controls All events k function Weighted k function Comparative k functions

More information

Comparing Non-informative Priors for Estimation and Prediction in Spatial Models

Comparing Non-informative Priors for Estimation and Prediction in Spatial Models Environmentrics 00, 1 12 DOI: 10.1002/env.XXXX Comparing Non-informative Priors for Estimation and Prediction in Spatial Models Regina Wu a and Cari G. Kaufman a Summary: Fitting a Bayesian model to spatial

More information

Gaussian processes for spatial modelling in environmental health: parameterizing for flexibility vs. computational efficiency

Gaussian processes for spatial modelling in environmental health: parameterizing for flexibility vs. computational efficiency Gaussian processes for spatial modelling in environmental health: parameterizing for flexibility vs. computational efficiency Chris Paciorek March 11, 2005 Department of Biostatistics Harvard School of

More information

Dale L. Zimmerman Department of Statistics and Actuarial Science, University of Iowa, USA

Dale L. Zimmerman Department of Statistics and Actuarial Science, University of Iowa, USA SPATIAL STATISTICS Dale L. Zimmerman Department of Statistics and Actuarial Science, University of Iowa, USA Keywords: Geostatistics, Isotropy, Kriging, Lattice Data, Spatial point patterns, Stationarity

More information

Lecture 2: ARMA(p,q) models (part 2)

Lecture 2: ARMA(p,q) models (part 2) Lecture 2: ARMA(p,q) models (part 2) Florian Pelgrin University of Lausanne, École des HEC Department of mathematics (IMEA-Nice) Sept. 2011 - Jan. 2012 Florian Pelgrin (HEC) Univariate time series Sept.

More information

Bayesian Transformed Gaussian Random Field: A Review

Bayesian Transformed Gaussian Random Field: A Review Bayesian Transformed Gaussian Random Field: A Review Benjamin Kedem Department of Mathematics & ISR University of Maryland College Park, MD (Victor De Oliveira, David Bindel, Boris and Sandra Kozintsev)

More information

Introduction to Bayes and non-bayes spatial statistics

Introduction to Bayes and non-bayes spatial statistics Introduction to Bayes and non-bayes spatial statistics Gabriel Huerta Department of Mathematics and Statistics University of New Mexico http://www.stat.unm.edu/ ghuerta/georcode.txt General Concepts Spatial

More information

Multivariate spatial modeling

Multivariate spatial modeling Multivariate spatial modeling Point-referenced spatial data often come as multivariate measurements at each location Chapter 7: Multivariate Spatial Modeling p. 1/21 Multivariate spatial modeling Point-referenced

More information

Geostatistics: Kriging

Geostatistics: Kriging Geostatistics: Kriging 8.10.2015 Konetekniikka 1, Otakaari 4, 150 10-12 Rangsima Sunila, D.Sc. Background What is Geostatitics Concepts Variogram: experimental, theoretical Anisotropy, Isotropy Lag, Sill,

More information

A Process over all Stationary Covariance Kernels

A Process over all Stationary Covariance Kernels A Process over all Stationary Covariance Kernels Andrew Gordon Wilson June 9, 0 Abstract I define a process over all stationary covariance kernels. I show how one might be able to perform inference that

More information

spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models

spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models spbayes: An R Package for Univariate and Multivariate Hierarchical Point-referenced Spatial Models Andrew O. Finley 1, Sudipto Banerjee 2, and Bradley P. Carlin 2 1 Michigan State University, Departments

More information

Extreme Value Analysis and Spatial Extremes

Extreme Value Analysis and Spatial Extremes Extreme Value Analysis and Department of Statistics Purdue University 11/07/2013 Outline Motivation 1 Motivation 2 Extreme Value Theorem and 3 Bayesian Hierarchical Models Copula Models Max-stable Models

More information

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA JAPANESE BEETLE DATA 6 MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA Gauge Plots TuscaroraLisa Central Madsen Fairways, 996 January 9, 7 Grubs Adult Activity Grub Counts 6 8 Organic Matter

More information

Regression, Ridge Regression, Lasso

Regression, Ridge Regression, Lasso Regression, Ridge Regression, Lasso Fabio G. Cozman - fgcozman@usp.br October 2, 2018 A general definition Regression studies the relationship between a response variable Y and covariates X 1,..., X n.

More information

ON THE USE OF NON-EUCLIDEAN ISOTROPY IN GEOSTATISTICS

ON THE USE OF NON-EUCLIDEAN ISOTROPY IN GEOSTATISTICS Johns Hopkins University, Dept. of Biostatistics Working Papers 12-5-2005 ON THE USE OF NON-EUCLIDEAN ISOTROPY IN GEOSTATISTICS Frank C. Curriero Department of Environmental Health Sciences and Biostatistics,

More information

1 Isotropic Covariance Functions

1 Isotropic Covariance Functions 1 Isotropic Covariance Functions Let {Z(s)} be a Gaussian process on, ie, a collection of jointly normal random variables Z(s) associated with n-dimensional locations s The joint distribution of {Z(s)}

More information

Nonstationary spatial process modeling Part II Paul D. Sampson --- Catherine Calder Univ of Washington --- Ohio State University

Nonstationary spatial process modeling Part II Paul D. Sampson --- Catherine Calder Univ of Washington --- Ohio State University Nonstationary spatial process modeling Part II Paul D. Sampson --- Catherine Calder Univ of Washington --- Ohio State University this presentation derived from that presented at the Pan-American Advanced

More information

Asymptotic Multivariate Kriging Using Estimated Parameters with Bayesian Prediction Methods for Non-linear Predictands

Asymptotic Multivariate Kriging Using Estimated Parameters with Bayesian Prediction Methods for Non-linear Predictands Asymptotic Multivariate Kriging Using Estimated Parameters with Bayesian Prediction Methods for Non-linear Predictands Elizabeth C. Mannshardt-Shamseldin Advisor: Richard L. Smith Duke University Department

More information

Basics in Geostatistics 2 Geostatistical interpolation/estimation: Kriging methods. Hans Wackernagel. MINES ParisTech.

Basics in Geostatistics 2 Geostatistical interpolation/estimation: Kriging methods. Hans Wackernagel. MINES ParisTech. Basics in Geostatistics 2 Geostatistical interpolation/estimation: Kriging methods Hans Wackernagel MINES ParisTech NERSC April 2013 http://hans.wackernagel.free.fr Basic concepts Geostatistics Hans Wackernagel

More information

STATISTICS OF OBSERVATIONS & SAMPLING THEORY. Parent Distributions

STATISTICS OF OBSERVATIONS & SAMPLING THEORY. Parent Distributions ASTR 511/O Connell Lec 6 1 STATISTICS OF OBSERVATIONS & SAMPLING THEORY References: Bevington Data Reduction & Error Analysis for the Physical Sciences LLM: Appendix B Warning: the introductory literature

More information

Areal Unit Data Regular or Irregular Grids or Lattices Large Point-referenced Datasets

Areal Unit Data Regular or Irregular Grids or Lattices Large Point-referenced Datasets Areal Unit Data Regular or Irregular Grids or Lattices Large Point-referenced Datasets Is there spatial pattern? Chapter 3: Basics of Areal Data Models p. 1/18 Areal Unit Data Regular or Irregular Grids

More information

Why experimenters should not randomize, and what they should do instead

Why experimenters should not randomize, and what they should do instead Why experimenters should not randomize, and what they should do instead Maximilian Kasy Department of Economics, Harvard University Maximilian Kasy (Harvard) Experimental design 1 / 42 project STAR Introduction

More information

Geostatistics for Gaussian processes

Geostatistics for Gaussian processes Introduction Geostatistical Model Covariance structure Cokriging Conclusion Geostatistics for Gaussian processes Hans Wackernagel Geostatistics group MINES ParisTech http://hans.wackernagel.free.fr Kernels

More information

Model Selection for Geostatistical Models

Model Selection for Geostatistical Models Model Selection for Geostatistical Models Richard A. Davis Colorado State University http://www.stat.colostate.edu/~rdavis/lectures Joint work with: Jennifer A. Hoeting, Colorado State University Andrew

More information

1. Fundamental concepts

1. Fundamental concepts . Fundamental concepts A time series is a sequence of data points, measured typically at successive times spaced at uniform intervals. Time series are used in such fields as statistics, signal processing

More information

Kernel-based Approximation. Methods using MATLAB. Gregory Fasshauer. Interdisciplinary Mathematical Sciences. Michael McCourt.

Kernel-based Approximation. Methods using MATLAB. Gregory Fasshauer. Interdisciplinary Mathematical Sciences. Michael McCourt. SINGAPORE SHANGHAI Vol TAIPEI - Interdisciplinary Mathematical Sciences 19 Kernel-based Approximation Methods using MATLAB Gregory Fasshauer Illinois Institute of Technology, USA Michael McCourt University

More information

State Space Representation of Gaussian Processes

State Space Representation of Gaussian Processes State Space Representation of Gaussian Processes Simo Särkkä Department of Biomedical Engineering and Computational Science (BECS) Aalto University, Espoo, Finland June 12th, 2013 Simo Särkkä (Aalto University)

More information

Correlation model for spatially distributed ground-motion intensities

Correlation model for spatially distributed ground-motion intensities EARTHQUAKE ENGINEERING AND STRUCTURAL DYNAMICS Earthquake Engng Struct. Dyn. 2009; 38:1687 1708 Published online 28 April 2009 in Wiley InterScience (www.interscience.wiley.com)..922 Correlation model

More information

Discrete time processes

Discrete time processes Discrete time processes Predictions are difficult. Especially about the future Mark Twain. Florian Herzog 2013 Modeling observed data When we model observed (realized) data, we encounter usually the following

More information

An Introduction to Spatial Statistics. Chunfeng Huang Department of Statistics, Indiana University

An Introduction to Spatial Statistics. Chunfeng Huang Department of Statistics, Indiana University An Introduction to Spatial Statistics Chunfeng Huang Department of Statistics, Indiana University Microwave Sounding Unit (MSU) Anomalies (Monthly): 1979-2006. Iron Ore (Cressie, 1986) Raw percent data

More information

ELEMENTS OF PROBABILITY THEORY

ELEMENTS OF PROBABILITY THEORY ELEMENTS OF PROBABILITY THEORY Elements of Probability Theory A collection of subsets of a set Ω is called a σ algebra if it contains Ω and is closed under the operations of taking complements and countable

More information

Geostatistics for Seismic Data Integration in Earth Models

Geostatistics for Seismic Data Integration in Earth Models 2003 Distinguished Instructor Short Course Distinguished Instructor Series, No. 6 sponsored by the Society of Exploration Geophysicists European Association of Geoscientists & Engineers SUB Gottingen 7

More information

arxiv: v4 [stat.me] 14 Sep 2015

arxiv: v4 [stat.me] 14 Sep 2015 Does non-stationary spatial data always require non-stationary random fields? Geir-Arne Fuglstad 1, Daniel Simpson 1, Finn Lindgren 2, and Håvard Rue 1 1 Department of Mathematical Sciences, NTNU, Norway

More information