Introduction to applied geostatistics. Short version. Overheads

Size: px
Start display at page:

Download "Introduction to applied geostatistics. Short version. Overheads"

Transcription

1 Introduction to applied geostatistics Short version Overheads Department of Earth Systems Analysis International Institute for Geo-information Science & Earth Observation (ITC) < March 21, 2007

2 Introduction to applied geostatistics 1 Topic: Resources There are many resources, at various mathematical levels, some aimed at particular applications. These lists are not comprehensive but should be good starting points: Texts Web pages Computer programmes

3 Introduction to applied geostatistics 2 Texts: Mathematical Chilès, J.-P. and Delfiner, P., Geostatistics: modeling spatial uncertainty. Wiley series in probability and statistics. John Wiley & Sons, New York. Christakos, G., Modern spatiotemporal geostatistics. Oxford University Press, New York. Cressie, N., Statistics for spatial data. John Wiley & Sons, New York. Ripley, B.D., Spatial statistics. John Wiley & Sons, New York.

4 Introduction to applied geostatistics 3 Texts: In the context of a particular application field Davis, J.C., Statistics and data analysis in geology. John Wiley & Sons, New York. Fotheringham, A.S., Brunsdon, C. and Charlton, M., Quantitative geography : perspectives on spatial data analysis. Sage Publications, London ; Thousand Oaks, Calif. Stein, A., Meer, F.v.d. and Gorte, B.G.F. (Editors), Spatial statistics for remote sensing. Kluwer Academic, Dordrecht. Kitanidis, P.K., Introduction to geostatistics : applications to hydrogeology. Cambridge University Press, Cambridge, England.

5 Introduction to applied geostatistics 4 Texts: Application-oriented but mathematical Webster, R., and Oliver, M. A., Geostatistics for environmental scientists. Wiley & Sons, Chichester. Goovaerts, P., Geostatistics for natural resources evaluation. Oxford University Press, Oxford and New York. Isaaks, E.H. and Srivastava, R.M., An introduction to applied geostatistics. Oxford University Press, New York.

6 Introduction to applied geostatistics 5 Texts: Emphasis on computational methods Venables, W.N. & Ripley, B.D., Modern applied statistics with S, 4 th edition. Springer-Verlag, New York. Deutsch, C. V., & Journel, A. G., GSLIB: Geostatistical software library and user s guide. Oxford University Press, Oxford.

7 Introduction to applied geostatistics 6 Web pages R: R spatial projects: gstat: gslib: GEOEAS: ILWIS: ArcGIS Geostatistical Analyst: http: // Geostatistical analysis tutor [Colorado (USA) School of Mines]:

8 Introduction to applied geostatistics 7 Computer programmes ILWIS 3.3 (ITC) R open-source environment for statistical computing and visualisation; includes several relevant libraries, including * gstat, by Pebesma * spatial, by Ripley * geor, by Ribeiro & Diggle * spdep, by Rowlingson & Diggle * spatstat, by Baddeley & Turner (point pattern analysis) * sp, underlying spatial data structures (used by others) ArcGIS Geostatistical Analyst (ESRI) [requires ArcGIS base] PCRaster + gstat (Utrecht) [free] GeoEAS, GSLIB, Variowin, VESPER...

9 Introduction to applied geostatistics 8 Topic: Introduction to Spatial Analysis 1. Concepts of space: geographic and feature spaces 2. What is special about spatial data? 3. Key concepts in spatial analysis 4. Measuring spatial correlation

10 Introduction to applied geostatistics 9 What is space? A set of n continuous dimensions; dimension i has range [x imin x imax ] Points are mathematical n-dimensional vectors: x = (x 1, x 2,, x n ) Depending on how we choose the axes, we can speak of both geographic and feature spaces...

11 Introduction to applied geostatistics 10 Feature space This space is not geographic space, but rather a mathematical space formed by any set of variables: Axes are the range of each variable Coordinates are values of variables, possibly transformed or combined Not included in the common use of the term spatial data or analysis But the observation may be related in this space and we often plot variables in this space, e.g. 2-D scatterplots This is the space in which univariate, bivariate, or multivariate analysis are carried out.

12 Introduction to applied geostatistics 11 Geographic space Axes are 1-d lines One-dimensional: coordinates are on a line with respect to some origin (0): (x 1 ) = x Two-dimensional: coordinates are on a grid with respect to some origin (0, 0): (x 1, x 2 ) = (x, y) = (E, N) Three-dimensional: coordinates are grid and elevation from a reference elevation: (x 1, x 2, x 3 ) = (x, y, z) = (E, N, H) Must transform latitude-longitude to grid coordinates in some 2-d projection; distortions occur over large areas Can work directly with geographic coordinates, but not as a grid

13 Introduction to applied geostatistics 12 What is special about spatial data? (1) 1. The location of a sample is an intrinsic part of its definition. 2. All data sets from a given area are implicitly related by their coordinates models of spatial structure 3. Values at sample points can not be assumed to be independent 4. That is, there may be a spatial structure to the data Classical statistics assumes independence, at least within sampling strata Major implications for sampling design and statistical inference 5. Data values may be related to their coordinates spatial trend

14 Introduction to applied geostatistics 13 Key Concepts Spatial dependence: the value of a variable at a point in space is related to its value at nearby points; knowing the value of these points allows us to predict (with some degree of certainty) the value at the chosen point Spatial structure: the nature of the spatial relation: how far, and in what directions, is the spatial dependence? How does the dependence vary with distance and direction between points? Support of a sample: the physical dimensions it represents (n.b. may try to predict to coarser or finer resolutions)

15 Introduction to applied geostatistics 14 Topic: Exploratory spatial data analysis Since spatial data were collected at known points in geographic space, we should visualise them in that space. Distribution of sample points Postplots (values vs. locations): where are which values? Geographic postplots: with images, landuse maps etc. as background: do there appear to be any explanation for the distribution of values? Spatial structure: range, direction, strength... Is there anisotropy? In what direction(s)? Do there seem to be several populations with distinct geographic distribution?

16 Introduction to applied geostatistics 15 Point distribution This shows how sample points are distributed in space. What was the sampling plan? Random or clustered? Are some areas over or under sampled?

17 Introduction to applied geostatistics 16 Example: Walker Lake: Distribution of points All points

18 Introduction to applied geostatistics 17 The Postplot: distribution of values in space The so-called postplot shows how the data values are distributed in space. Are values of closeby points similar to each other, or do the values appear to be random? Does there appear to be a trend? Are there distinct clusters of high or low values? Is there any directional difference in clustering? (anisotropy)

19 Introduction to applied geostatistics 18 Meuse Distribution of Log(Cadmium) in soils

20 Introduction to applied geostatistics 19 Geographic postplot This shows the postplot against a background that may explain the distribution of samples or values. Examples: land cover or land use geologic or soil units structural geology

21 Introduction to applied geostatistics 20 Meuse Log(Cadmium) on a false-colour composite

22 Introduction to applied geostatistics 21 Topic: Spatial correlation 1. What is spatial auto-correlation? 2. Evidence of spatial correlation 3. Computing spatial correlation and covariance 4. Summarizing and visualising spatial covariance; the empirical variogram Topics for later units: 1. modelling spatial correlation 2. predicting using the modelled structure

23 Introduction to applied geostatistics 22 Spatial Correlation Question: are nearby points in geographic space also nearby in feature space? That is, does knowing the value of some variable at some location give us information on the value at nearby locations? The concept of correlation between variables can be applied to correlation within a variable, using distance to model the relation

24 Introduction to applied geostatistics 23 Covariance and Correlation Recall: for two non-spatial variables X and Y : Sample covariance: s XY = 1 n 1 n (x i x) (y i y) i=1 Sample correlation coefficient: the covariance normalized by sample standard deviations; range [ ]: r XY = s XY s X s Y = (xi x) (y i y) (xi x) 2 (y i y) 2 Can we extend this idea to a single variable, which is then correlated with itself?

25 Introduction to applied geostatistics 24 Auto-correlation We want to apply the idea of correlation to one variable (auto-correlation); the prefix auto- means self, here referring to the single variable. Here, the correlation is controlled by some other dimension: time if the variable is collected as a time series space if the variable is collected at points in space So we will get a measure of how much the variable is correlated to itself, considering the other factor (time or space).

26 Introduction to applied geostatistics 25 Auto-covariance The spatial auto-covariance is computed within the same variable, using pairs of observations. Each pair of observations (x i, x j ) has a covariance, showing how they jointly differ from the variable s mean x: (x i x)(x j x) There are (n (n 1))/2 point pairs for which this can be calculated This is a large number! For example, with 200 points this is 19,900 point pairs.

27 Introduction to applied geostatistics 26 Modelling the auto-covariance By themselves the individual auto-covariances are not usefull; they just quantify the covariance of each point pair. We need to summarize the individual covariances as a covariance function of spatial separation Theory: the covariance depends only on the separation between point. If we can model this function we can then predict the covariance between any two locations in space.

28 Introduction to applied geostatistics 27 Semivariances It is easier to model semivariances than covariances: Each pair of observation points has a semivariance, usually symbolized by the Greek letter gamma, i.e. γ, defined as: γ(x i, x j ) = 1 2 [z(x i) z(x j )] 2 Each point pair is separated by a known distance, so... We can plot the semivariances against distance as a variogram cloud, with (n (n 1))/2 points in the graph Can also summarize in a variogram (The semi refers to the factor 1/2, because there are two ways to compute for the same point pair)

29 Introduction to applied geostatistics 28 The gstat package of R We illustrate the concepts of spatial correlation with the gstat package of the R environment and the meuse example data set. The meuse data frame has coördinates in fields x and y; these are used to promote the object to class SpatialPointsDataFrame. > # view package information > library(help=gstat) > # load the package > library(gstat) >?meuse > # load sample data > data(meuse) > # as loaded is a data frame > summary(meuse) > # promote to class SpatialPointsDataFrame > coordinates(meuse) <- ~ x+y > # now has explicit coordinates > summary(meuse)

30 Introduction to applied geostatistics 29 > summary(meuse) Object of class SpatialPointsDataFrame Coordinates: min max x y Is projected: NA proj4string : [NA] Number of points: 155 Data attributes: cadmium copper lead zinc elev Min. : 0.20 Min. : 14.0 Min. : 37.0 Min. : 113 Min. : st Qu.: st Qu.: st Qu.: st Qu.: 198 1st Qu.: 7.55 Median : 2.10 Median : 31.0 Median :123.0 Median : 326 Median : 8.18 Mean : 3.25 Mean : 40.3 Mean :153.4 Mean : 470 Mean : rd Qu.: rd Qu.: rd Qu.: rd Qu.: 674 3rd Qu.: 8.96 Max. :18.10 Max. :128.0 Max. :654.0 Max. :1839 Max. :10.52 dist om ffreq soil lime landuse dist.m Min. : Min. : :84 1:97 0:111 W :50 Min. : 10 1st Qu.: st Qu.: :48 2:46 1: 44 Ah :39 1st Qu.: 80 Median : Median : :23 3:12 Am :22 Median : 270 Mean : Mean : 7.48 Fw :10 Mean : 290 3rd Qu.: rd Qu.: 9.00 Ab : 8 3rd Qu.: 450 Max. : Max. :17.00 (Other):25 Max. :1000 NA s : 2.00 NA s : 1

31 Introduction to applied geostatistics 30 The empirical variogram To summarize the variogram cloud, compute average semivariance at various separations ( lags ); this is the empirical variogram γ(h) = m(h) 1 [z(x i ) z(x j )] 2 2m(h) i=1 m(h) is the number of point pairs separated by vector h In practice, we have to define the set of vectors in each bin (to have enough points); that is, we collect a distance range into one bin. (Note: there are other ways to estimate the variogram from the variogram cloud; in particular so-called robust estimators.)

32 Introduction to applied geostatistics 31 Example of an experimental variogram > (v <- variogram(log(cadmium)~1, data=meuse)) np dist gamma np are the number of point pairs in the bin; dist is the average separation of these pairs; gamma is the average semivariance in the bin.

33 Introduction to applied geostatistics 32 Plotting the experimental variogram This can be plotted as semivariance gamma against average separation dist, along with the number of points that contributed to each estimate np: > plot(v, plot.numbers=t) (Note: gstat defaults to 15 equally-spaced bins and a maximum distance of 1/3 of the maximum separation. These can be over-ridden with the width= and cutoff= arguments, respectively; or explicit bin limits can be set with the boundaries= argument.)

34 Introduction to applied geostatistics 33 Default variogram of Log(Cd) semivariance distance

35 Introduction to applied geostatistics 34 Features of the experimental variogram Later we will look at fitting a theoretical model to the experimental variogram; but even without a model we can notice some features, which we define here only qualitatively: Sill: maximum semi-variance * represents variability in the absence of spatial dependence Range: separation between point-pairs at which the sill is reached * distance at which there is no evidence of spatial dependence Nugget: semi-variance as the separation approaches zero * represents variability at a point that can t be explained by spatial structure In the previous slide, we can estimate the sill 1.9, the range 1200 m, and the nugget 0.5 i.e. 25% of the sill.

36 Introduction to applied geostatistics 35 Defining the bins (1) Distance interval, specifying the centres. E.g. (0, 100, 200,...) means intervals of [ ], [ ],... All point pairs whose separation is in the interval are used to estimate γ(h) for h as the interval centre Narrow intervals: more resolution but fewer point pairs for each sample > v <- variogram(log(cadmium)~1, meuse, boundaries = seq(50, 2050, by = 100)) > plot(v, pl=t) > par(mfrow = c(2, 3)) # show all six plots together > for (bw in seq(20, 220, by = 40)) { v<-variogram(log(cadmium)~1, meuse, width=bw) plot(v$dist, v$gamma, xlab=paste("bin width", bw))

37 Introduction to applied geostatistics 36 Variograms of Log(Cd) with different bin widths v$gamma v$gamma v$gamma bin width 20 bin width 60 bin width 100 v$gamma v$gamma v$gamma bin width 140 bin width 180 bin width 220

38 Introduction to applied geostatistics 37 Defining the bins (2) Each bin should have > 100 point pairs; > 300 is much more reliable > v <- variogram(log(cadmium)~1, meuse, width=20) > plot(v, plot.numbers=t) > v$np [1] [16] [31] [46] [61] [76] > v <- variogram(log(cadmium)~1, meuse, width=120) > v$np [1] > plot(v, plot.numbers=t)

39 Introduction to applied geostatistics 38 Topic: Modelling the variogram From the empirical variogram we now derive a variogram model which expresses semivariance as a function of separation vector. The model allows us to: Infer the characteristics of the underlying process from the functional form and its parameters; Compute the semi-variance between any point-pair, separated by any vector which is used in an optimal interpolator ( kriging ) to predict at unsampled locations.

40 Introduction to applied geostatistics 39 A variogram model, with parameters

41 Introduction to applied geostatistics 40 Authorized variogram models Only some functional forms can be used to model the variogram (theoretical and mathematical constraints) The permitted forms are called authorized models Simplest: The exponential model; sill c, effective range 3a γ(h) = c{1 e ( h a ) } E.g. if the effective range is estimated as 120, the parameter a is 40. Another common model: The Spherical model; sill c, range a γ(h) = c { 3h 2a 1 2 ( ) } h 3 a : h < a c : h a

42 Introduction to applied geostatistics 41 Graphs of authorized variogram models Linear with sill variogram model Circular variogram model Spherical variogram model semivariance sill nugget range semivariance sill nugget range semivariance sill nugget range separation distance separation distance separation distance Pentaspherical variogram model Exponential variogram model Gaussian variogram model semivariance sill nugget range semivariance sill nugget range semivariance sill nugget range separation distance separation distance separation distance

43 Introduction to applied geostatistics 42 MComparaison Mseparation Msemivariance MExponential MSpherical MGaussian MPentaspherical MCircular MLinear-with-sill distance of variogram models Comparaison of variogram models m0m0m 2M2M 4M4M 6M6M 8M8M m0.0 M0.0 M0.2 M0.4 M0.6 M0.8 M1.0 Msill Mnugget Mrange semivariance sill Exponential Gaussian Circular Spherical Pentaspherical Linear-with-sill 0.2 nugget 0.0 range separation distance 6 8 Models vary considerably, from origin to range

44 Introduction to applied geostatistics 43 Comparaison of models available in gstat > show.vgms()

45 Introduction to applied geostatistics 44 Choosing a model (1) The empirical variogram should be one realization of a random process. So, what do we expect from the process that is supposed to be responsible for the spatial structure represented in the variogram? Exponential: First-order autoregressive process: values are random but with dependency on the nearest neighbour; boundaries according to a Poisson process Gaussian: as exponential, but with strong close-range dependency, very smooth at each point.

46 Introduction to applied geostatistics 45 Choosing a model (1) continued Spherical, circular, pentaspherical: Patches of similar values; patches have similar size range) with transition zones (overlap of processes); These differ mainly in the shoulder transition to the sill

47 Introduction to applied geostatistics 46 Choosing a model (2) Which has been successfully applied with this kind of data? (This is evidence for the nature of this kind of process) What do we expect from the supposed process? if we have some other evidence of its spatial behaviour. For example, a Gaussian model might be expected for a phenomenon which physically must be very continuous, e.g. the surface of a ground-water table. Visual estimate of functional form from the variogram (Fit various models, pick the statistically-best fit)

48 Introduction to applied geostatistics 47 Fitting the model Once a model form is selected, then the model parameters must be adjusted for a best fit of the experimental variogram. By eye, adjusting parameters for good-looking fit * Hard to judge the relative value of each point * This is all that s possible in ILWIS Automatically, looking for the best fit according to some objective criterion * Various criteria possible in gstat In both cases, favour sections of the variogram with more pairs and at shorter ranges (because it is a local interpolator). Mixed: adust by eye, evaluate statistically; or vice versa

49 Introduction to applied geostatistics 48 Fitting a variogram model in gstat We ve decided on a spherical + nugget model: > # Calculate the experimental variogram and display it > v1 <- variogram(log(cadmium)~1, meuse); plot(v1, plot.numbers=t) > # Fit by eye, display fit > m1 <- vgm(1.4, "Sph", 1200, 0.5); plot(v1, plot.numbers=t, model=m1) > # Let gstat adjust the parameters, display fit > m2 <- fit.variogram(v1, m1); m2 model psill range 1 Nug Sph > plot(v1, plot.numbers=t, model=m2) > # Fix the nugget, fit only the sill of spherical model > m2a <- fit.variogram(v1,m1,fit.sills=c(f,t),fit.range=f); m2a model psill range 1 Nug Sph In this case, the eyeball did a pretty good job...

50 Introduction to applied geostatistics semivariance semivariance distance distance By eye: c 0 = 0.5, c 1 = 1.4, a = 1200; total sill c 0 + c 1 = 1.9 Automatic: c 0 = 0.548, c 1 = 1.340, a = 1149; total sill c 0 + c 1 = The total sill was almost unchanged; gstat raised the nugget and lowered the partial sill of the spherical model a bit; the range was shortened by 51 m.

51 Introduction to applied geostatistics 50 What sample size to fit a variogram model? Can t use non-spatial formulas for sample size, because spatial samples are correlated, and each sample is used multiple times in the variogram estimate Stochastic simulation from an assumed random field with a known variogram suggests: 1. < 50 points: not at all reliable to 150 points: more or less acceptable 3. > 250 points: almost certaintly reliable More points are needed to estimate an anisotropic variogram. This is very worrying for many environmental datasets (soil cores, vegetation plots,... ) especially from short-term fieldwork, where sample sizes of are typical. Should variograms even be attempted on such small samples?

52 Introduction to applied geostatistics 51 Topic: Approaches to spatial prediction This is the prediction of the value of some variable at an unsampled point, based on the values at the sampled points. This is often called interpolation, but strictly speaking: Interpolation: prediction is only for points that are geographically inside the (convex hull of the) sample set; Extrapolation: prediction outside this geographic area (Note: same usage as in feature-space predictions)

53 Introduction to applied geostatistics 52 A taxomomy of spatial prediction methods Strata divide area to be mapped into homogeneous strata; predict within each stratum from all samples in that stratum Global predictors: use all samples to predict at all points; also called regional predictors; Local predictors: use only nearby samples to predict at each point Mixed predictors: some of structure is explained by strata or globally, some locally

54 Introduction to applied geostatistics 53 Which approach is best? No theoretical answer Depends on how well the approach models the true spatial structure, and this is unknown (but we may have prior evidence) Should correspond with what we know about the process that created the spatial structure

55 Introduction to applied geostatistics 54 Polynomial trend surfaces A global predictor which models a regional trend The value of a variable at each point depends only on its coödinates and parameters of a fitted surface This is modelled with a smooth function of position, z = f (x, y) = f (E, N) for grid coördinates; this is called the trend surface Simple form (plane, 1 st order): z = β 0 + β x E + β y N Higher-order surfaces may also be fitted (beware of fitting the noise!)

56 Introduction to applied geostatistics 55 Fitting trend surfaces The trend surface is predicted by linear regression with coödinates as the predictor variables and the response variable to be predicted, using data from all sample points. All samples participate equally in the prediction We can measure the goodness of fit of the trend surface to the sample by the residual sum of squares The same cautions as in feature-space regression analysis! Ordinary Least Squares (OLS) is often used but is not really correct, since it ignores possible correlation among closely-spaced samples; better is Generalised Least Squares (GLS)

57 Introduction to applied geostatistics 56 Predictions of 1st and 2nd order Trend Surfaces in the study area M1st M M M M M M M M M M M M-2.0 M-1.5 M-1.0 M-0.5 M0.0 M0.5 M1.0 M1.5 and 2nd order trend surfaces, study area M xmxm ymym MTS1 MTS2 1st and 2nd order trend surfaces, study area y x

58 Introduction to applied geostatistics 57 Predictions of 1st and 2nd order Trend Surfaces in the bounding box -4M1st M M M M M M M M M M M M-4M-4M -3M-3M-3M -2M-2M-2M -1M-1M-1M 0M0M0M 1M1M1M 2M2M2M 3M3M3M 4M4M4M and 2nd order trend surfaces, bounding box -4M xmxm ymym MTS1 MTS2 1st and 2nd order trend surfaces, bounding box y x

59 Introduction to applied geostatistics 58 Approaches to prediction: Local predictors No strata No regional trend Value of the variable is predicted from nearby samples * Example: concentrations of soil constituents (e.g. salts, pollutants) * Example: vegetation density

60 Introduction to applied geostatistics 59 Local Predictors Each interpolator has its own assumptions, i.e. theory of spatial variability Nearest neighbour (Thiessen polygons) Average within a radius Average of the n nearest neighbours Distance-weighted average within a radius Distance-weighted average of n nearest neighbours... Optimal weighting Kriging

61 Introduction to applied geostatistics 60 Local predictor: Nearest neighbour (Thiessen polygons) Predict each point from its single nearest sample point Conceptually-simple, makes the minimal assumptions about spatial structure No way to estimate prediction variances, ignores other nearby information Maps show abrupt discontinuities at boundaries, so don t look very realistic But may be a more accurate predictor than poorly-modelled predictors

62 Introduction to applied geostatistics 61 Local predictor: Average within a radius Use the set of all neighbouring sample points within some radius r Predict by averaging : ˆx 0 = 1 n n i=1 x i, d(x 0, x i ) r Although we can calculate prediction variances from the neighbours, these assume no spatial structure closer than the radius Problem: How do we select a radius?

63 Introduction to applied geostatistics 62 Local predictors: Distance-weighted average Inverse of distance to some set of n nearest-neighbours: ˆx 0 = n i=1 x n i d(x 0, x i ) / i=1 1 d(x 0, x i ) Inverse of distance to some set of n nearest-neighbours, to some power k ˆx 0 = n i=1 x i k d(x 0, x i ) k/ i=1 1 d(x 0, x i ) k Implicit theory of spatial structure (a power model), but this is not testable Can select all points within some limiting distance (radius), or some fixed number of nearest points, or... How to select radius or number and power?

64 Introduction to applied geostatistics 63 Inverse distance in gstat The idw method is used. There is no model of spatial variability, so there is no way to estimate a prediction variance. > kid <- idw(log(cadmium) ~ 1, meuse, meuse.grid) [inverse distance weighted interpolation] > levelplot(var1.pred ~ x+y, as.data.frame(kid), aspect="iso") The weights are computed only from the inverse distance; they do not account for spatial structure nor for the relative positions of the sample points. Compare inverse distance (linear) to Ordinary Kriging with a spherical model (range = 1150 m): OK gives a much smoother map.

65 Introduction to applied geostatistics 64 Inverse distance y x y x Ordinary kriging y x y x

66 Introduction to applied geostatistics 65 Approaches to prediction: Mixed predictors For situations where there is both long-range structure (trend) or strata and local structure * Example: Particle size in the soil: strata (rock type), trend (distance from a river), and local variation in depositional or weathering processes One approach: model strata or global trend, subtract from each value, then model residuals Regression Kriging. Another approach: model everything together Universal Kriging or Kriging with External Drift

67 Introduction to applied geostatistics 66 Topic: Ordinary Kriging The theory of regionalised variables leads to an optimal interpolation method, in the sense that the prediction variance is minimized. This is based on the theory of random functions, and requires certain assumptions.

68 Introduction to applied geostatistics 67 Kriging A Best Linear Unbiased Predictor (BLUP) that satisfies a certain optimality criterion (so it s best with respect to the criterion) It is only optimal with respect to the chosen model and the chosen optimality criterion Based on the theory of random processes, with covariances depending only on separation (i.e. a variogram model) Theory developed several times (Kolmogorov 1930 s, Wiener 1949) but current practise dates back to Matheron (1963), formalizing the practical work of the mining engineer D G Krige (RSA). * Should really be written as krigeing (Fr. krigeage) but it s too late for that.

69 Introduction to applied geostatistics 68 What is so special about kriging? Predicts at any point as the weighted average of the values at sampled points * as for inverse distance (to a power) Weights given to each sample point are optimal, given the spatial covariance structure as revealed by the variogram model (in this sense it is best ) So, the prediction is only as good as the model of spatial structure! The prediction error at each point is automatically generated as part of the process of computing the weights.

70 Introduction to applied geostatistics 69 How do we use Kriging? 1. Sample, preferably at different resolutions 2. Calculate the experimental variogram 3. Model the variogram with one or more authorized functions N.b. the variogram model may already be known from other studies or theoretical considereations 4. Apply the kriging system of equations, with the variogram model of spatial dependence, at each point to be predicted Predictions are often at each point on a regular grid (e.g. a raster map) 5. Calculate the variance of each prediction; this is based only on the sample point locations, not their data values.

71 Introduction to applied geostatistics 70 OK in gstat The krige method is used with a variogram model: # compute experimental variogram v <- variogram(log(cadmium) ~ 1, meuse) # estimated model m <- vgm(1.4, "Sph", 1200, 0.5) # fitted model m.f <- fit.variogram(v, m) data(meuse.grid); coordinates(meuse.grid) <- ~ x +y # interpolation grid kr <- krige(log(cadmium)~ 1, loc=meuse, newdata=meuse.grid, model=m.f) [using ordinary kriging] # visualize interpolation; note aspect option to get correct geometry levelplot(var1.pred ~ x+y, as.data.frame(kr), aspect="iso") # visualize prediction error levelplot(var1.var ~ x+y, as.data.frame(kr), aspect="iso") Note the model specification (model=m.f); this gives the assumed covariance structure with which to compute the optimal weights.

72 Introduction to applied geostatistics 71 Ordinary kriging (OK) results for Meuse log(cd) y y x x

73 Introduction to applied geostatistics 72 Kriging prediction errors for Meuse log(cd) y y x x

74 Introduction to applied geostatistics 73 How realistic are maps made by Ordinary Kriging? The resulting surface is smooth and shows no noise, no matter if there is a nugget effect in the variogram model So the field is the best at each point taken separately, but taken as a whole is not a realistic map The sample points are predicted exactly; they are assumed to be without error, again even if there is a nugget effect in the variogram model

75 Introduction to applied geostatistics 74 Non-parametric geostatistics A non-parametric statistic is one that does not assume any underlying data distribution. For example: a mean is an estimate of a parameter of location of some assumed distribution (e.g.mid-point of normal, expected proportion of success in a binomial,... ) a median is simply the value at which half the samples are smaller and half larger, without knowing anything about the distribution underlying the process which produced the sample. In geostatistics, non-parametric refers to methods that make no assumptions about the distribution of the data values, only about spatial structure.

76 Introduction to applied geostatistics 75 Non-parametric geostatistics: Motivation (1) There is some positive motivation... In some applications, we may be most interested in finding areas with values above a certain threshold (e.g. polluted areas), and not really care if we get accurate predictions in other areas (as long as we are sure they are below the threshold) So the form of the distribution is not important, just whether a value is above or below some threshold. In these applications, we often want a probability that an interpolated point exceeds the threshold; this is directly useful for probabilistic decision-making * e.g. whether or not to clean up a polluted site

77 Introduction to applied geostatistics 76 Non-parametric geostatistics: Motivation (2)... and there is also some negative motivation: The outlier problem: a dataset may contain a few very high values These can make the area mean arbitrarily high (n.b. not the median) These contribute a disproportionate amount to the total variance as well These can make the experimental semivariogram unreliable for typical values and useless for unusual values: * the point-pairs where the outliers are included will have very high semivariances * these contribute disproportionately to the average semivariance in a bin... *... so that the variogram is very difficult to model

78 Introduction to applied geostatistics 77 E.g. a random sample of 15 N(10, 1) variates with one outlier at 100 (i.e. 10x the expected value) replacing the last value: 1. Without outlier: x = 9.95, s 2 x = With outlier: x = 15.98, s 2 x = The one point with value 100 accounts for (100 x) 2 /15 = 470 of the variance, i.e. 470/540 = 87% of it So, it will make semi-variances of point pairs involving this point much higher than others; these can be seen in the variogram cloud Note: the median is only slightly affected: if the outlier replaces a value above the median, the next-highest value is now the median

79 Introduction to applied geostatistics 78 Solutions to the outlier problem 1. Ignore (assume that they represent a different population and remove from the dataset before further analysis) under-estimation, can t find hot spots 2. Set to some arbitrary maximum, nearer the bulk of the population; same problem 3. Transform the variable to logarithms for modelling; transform back for the final maps and estimates Good solution if the whole distribution is lognormal Not optimal if the aim is just to bring some outliers closer (i.e. the rest of the distribution is not lognormal) 4. Transform to indicator variables, interpolate by Indicator Kriging (IK)

80 Introduction to applied geostatistics 79 Lognormal Kriging 1. Transform the data to their (natural) logarithms: y( x) i = log z( x i ); this should be approximately normally distributed 2. Model and interpolate with the transformed variable (OK, block kriging, UK, KED, trend surfaces... ) 3. Optional: Back-transform to original units of measure Back-transformation is not required if we don t care about the original variable, e.g. if the logarithm itself is a useful index. (Back-transformation of prediction variances is only possible for SK.)

81 Introduction to applied geostatistics 80 Indicator kriging This is a simple non-parametric (also called distribution-free) method ofinterpolation. It is used primarily to estimate the probability of exceeding some pre-defined threshold value. It can also be used to estimate an entire cumulative probability distribution (CDF). Note that there are other non-parametric methods, e.g. disjunctive kriging; these have a much more difficult theory.

82 Introduction to applied geostatistics 81 Distribution-free estimates So far we have assumed an approximately normal or lognormal distribution of the target spatially-correlated random variable. But this may be demonstrably not true. A non-parametric approach does not attempt to fit a distribution to the data, but rather works directly with the experimental CDF, by dividing it into sample quantiles. To work with these, we introduce the idea of indicator variables.

83 Introduction to applied geostatistics 82 Indicator variables Binary variables: Take one of the values {1, 0} depending on whether the point is in or out of the set; i.e. if it does or does not meet some criterion * These are suitable for binary nominal variables, e.g. { urban, not urban }; { land use changed, land use did not change } A continuous variable can be converted to an indicator z t by a threshold or cut-off value x t : z t = 1 x x t * e.g. x t = 350 to cut-off at 350 mg kg -1 * Formally: I( x i, z t ) = 1 iff Z( x i ) z t ; 0 otherwise * By convention 1 indicates values below the threshold (to model the CDF); inverting reverses the sense

84 Introduction to applied geostatistics 83 Setting up indicators in gstat > mind <- as.data.frame(meuse)[c("x","y","cadmium")]; str(mind) data.frame : 155 obs. of 3 variables: $ x : num $ y : num $ cadmium: num > attach(mind) > quantile(cadmium, seq(0,1,.1)) 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% > for (q in seq(.1,.9,.1)) mind <- + cbind(mind, as.numeric(cadmium<=quantile(cadmium,q))) > names(mind)[4:12] <- paste("q",seq(1:9),sep="") > mind$q5[1:30] [1] So field q5 of data frame mind contains a 1 if the corresponding value of field cadmium is 2.10, the fifth decile (i.e. the median).

85 Introduction to applied geostatistics 84 Indicator map Every sample point is either 1 ( in ) or 0 ( out ); a binary map No measure of how far in or out Prepare a series of indicator maps, with increasing thresholds, to visualise the cumulative sample distribution A common strategy is to divide the range of the sample values into quartiles or deciles and prepare an indicator for each The proportion of 1 s will increase with increasing quantile.

86 Introduction to applied geostatistics 85 The Indicator variogram Compute as for a parametric variogram; every sample point has either value 1 (below the cutoff, in the set) or 0. The semivariance of each point pair is either 0 (both above or below; both out or in) or 0.5 (one above, one below; one out, one in). For a quantized continuous variable, each indicator variable (quantile) might well have different spatial structure Variograms near the two ends of the CDF have few 1 s or 0 s (depending on the end), so few point-pairs will have semivariance 0.5 hard to model (fluctuates) Model as for parametric variogram; however the total sill must be < 0.5 (generally it s a lot lower)

87 Introduction to applied geostatistics 86 Probability kriging using indicator variables 1. Calculate the indicator at the required threshold 2. Calculate the empirical variogram for that indicator (not the median) (May have to use a threshold closer to the median if there are too few 1 s so that the variogram is erratic) 3. Model the variogram 4. Solve the kriging system at each point to be predicted, using Simple Kriging (SK) with the quantile proportion as the expected value (e.g., in the 6th decile, 0.6 of the values are expected to be 1 s) Note! this is only true if the original sampling scheme was unbiased! If not, also estimate the mean (use OK). 5. If necessary, limit the results to the range [0... 1] 6. This may be interepreted as the probability that the point does not exceed the threshold

88 Introduction to applied geostatistics 87 Indicator kriging in gstat > # convert to spatial object > coordinates(mind) <- ~ x +y > #compute the variogram for the 90th percentile > vq9 <- variogram(q9~1, mind) > plot(vq9, plot.numbers=t) > mq9 <- vgm(0.05,"sph",500,0.04) > plot(vq9, plot.numbers=t, model=mq9) > mq9f <- fit.variogram(vq9, mq9) > plot(vq9, plot.numbers=t, model=mq9f) > # erratic around sill, leads to very short range variogram > mq9f model psill range 1 Nug Sph > # krige this quantile; note expected proportion of 1 s is known > k9 <- krige(q9~1, mind, meuse.grid, beta=0.9, model=mq9f) [using simple kriging] > levelplot(var1.pred~x+y, as.data.frame(k9), aspect="iso") > # this is the probability of being *below* the cutoff of 8.26ppm

89 Introduction to applied geostatistics Indicator variogram (9th decile); semivariance semivariance Estimated model distance distance Fitted model; note unrealistic nugget semivariance y Probability < 8.26mg kg distance x 0.2 0

90 Introduction to applied geostatistics 89

91 Introduction to applied geostatistics 90 Summary: Advantages of IK Makes no assumption about the theoretical distribution of the data values, yet still give realistic probability estimates Outlier-resistent: these can not increase the estimate or prediction variances of an indicator arbitrarily; for data values they only affect one quantile Simple Kriging is used at each quantile, which improves the estimate.

92 Introduction to applied geostatistics 91 Summary: Disadvantages of IK Variograms may be difficult to model, especially at the highest and lowest quantiles (few pairs with different 0/1 values)

Practical 12: Geostatistics

Practical 12: Geostatistics Practical 12: Geostatistics This practical will introduce basic tools for geostatistics in R. You may need first to install and load a few packages. The packages sp and lattice contain useful function

More information

Practical 12: Geostatistics

Practical 12: Geostatistics Practical 12: Geostatistics This practical will introduce basic tools for geostatistics in R. You may need first to install and load a few packages. The packages sp and lattice contain useful function

More information

Data Break 8: Kriging the Meuse RiverBIOS 737 Spring 2004 p.1/27

Data Break 8: Kriging the Meuse RiverBIOS 737 Spring 2004 p.1/27 Data Break 8: Kriging the Meuse River BIOS 737 Spring 2004 Data Break 8: Kriging the Meuse RiverBIOS 737 Spring 2004 p.1/27 Meuse River: Reminder library(gstat) Data included in gstat library. data(meuse)

More information

GRAD6/8104; INES 8090 Spatial Statistic Spring 2017

GRAD6/8104; INES 8090 Spatial Statistic Spring 2017 Lab #4 Semivariogram and Kriging (Due Date: 04/04/2017) PURPOSES 1. Learn to conduct semivariogram analysis and kriging for geostatistical data 2. Learn to use a spatial statistical library, gstat, in

More information

PRODUCING PROBABILITY MAPS TO ASSESS RISK OF EXCEEDING CRITICAL THRESHOLD VALUE OF SOIL EC USING GEOSTATISTICAL APPROACH

PRODUCING PROBABILITY MAPS TO ASSESS RISK OF EXCEEDING CRITICAL THRESHOLD VALUE OF SOIL EC USING GEOSTATISTICAL APPROACH PRODUCING PROBABILITY MAPS TO ASSESS RISK OF EXCEEDING CRITICAL THRESHOLD VALUE OF SOIL EC USING GEOSTATISTICAL APPROACH SURESH TRIPATHI Geostatistical Society of India Assumptions and Geostatistical Variogram

More information

Geostatistics: Kriging

Geostatistics: Kriging Geostatistics: Kriging 8.10.2015 Konetekniikka 1, Otakaari 4, 150 10-12 Rangsima Sunila, D.Sc. Background What is Geostatitics Concepts Variogram: experimental, theoretical Anisotropy, Isotropy Lag, Sill,

More information

COMPARISON OF DIGITAL ELEVATION MODELLING METHODS FOR URBAN ENVIRONMENT

COMPARISON OF DIGITAL ELEVATION MODELLING METHODS FOR URBAN ENVIRONMENT COMPARISON OF DIGITAL ELEVATION MODELLING METHODS FOR URBAN ENVIRONMENT Cahyono Susetyo Department of Urban and Regional Planning, Institut Teknologi Sepuluh Nopember, Indonesia Gedung PWK, Kampus ITS,

More information

7 Geostatistics. Figure 7.1 Focus of geostatistics

7 Geostatistics. Figure 7.1 Focus of geostatistics 7 Geostatistics 7.1 Introduction Geostatistics is the part of statistics that is concerned with geo-referenced data, i.e. data that are linked to spatial coordinates. To describe the spatial variation

More information

Spatiotemporal Analysis of Environmental Radiation in Korea

Spatiotemporal Analysis of Environmental Radiation in Korea WM 0 Conference, February 25 - March, 200, Tucson, AZ Spatiotemporal Analysis of Environmental Radiation in Korea J.Y. Kim, B.C. Lee FNC Technology Co., Ltd. Main Bldg. 56, Seoul National University Research

More information

A MultiGaussian Approach to Assess Block Grade Uncertainty

A MultiGaussian Approach to Assess Block Grade Uncertainty A MultiGaussian Approach to Assess Block Grade Uncertainty Julián M. Ortiz 1, Oy Leuangthong 2, and Clayton V. Deutsch 2 1 Department of Mining Engineering, University of Chile 2 Department of Civil &

More information

The Proportional Effect of Spatial Variables

The Proportional Effect of Spatial Variables The Proportional Effect of Spatial Variables J. G. Manchuk, O. Leuangthong and C. V. Deutsch Centre for Computational Geostatistics, Department of Civil and Environmental Engineering University of Alberta

More information

GEOSTATISTICAL ANALYSIS OF SPATIAL DATA. Goovaerts, P. Biomedware, Inc. and PGeostat, LLC, Ann Arbor, Michigan, USA

GEOSTATISTICAL ANALYSIS OF SPATIAL DATA. Goovaerts, P. Biomedware, Inc. and PGeostat, LLC, Ann Arbor, Michigan, USA GEOSTATISTICAL ANALYSIS OF SPATIAL DATA Goovaerts, P. Biomedware, Inc. and PGeostat, LLC, Ann Arbor, Michigan, USA Keywords: Semivariogram, kriging, spatial patterns, simulation, risk assessment Contents

More information

Lecture 5 Geostatistics

Lecture 5 Geostatistics Lecture 5 Geostatistics Lecture Outline Spatial Estimation Spatial Interpolation Spatial Prediction Sampling Spatial Interpolation Methods Spatial Prediction Methods Interpolating Raster Surfaces with

More information

11/8/2018. Spatial Interpolation & Geostatistics. Kriging Step 1

11/8/2018. Spatial Interpolation & Geostatistics. Kriging Step 1 (Z i Z j ) 2 / 2 (Z i Zj) 2 / 2 Semivariance y 11/8/2018 Spatial Interpolation & Geostatistics Kriging Step 1 Describe spatial variation with Semivariogram Lag Distance between pairs of points Lag Mean

More information

Spatial Data Mining. Regression and Classification Techniques

Spatial Data Mining. Regression and Classification Techniques Spatial Data Mining Regression and Classification Techniques 1 Spatial Regression and Classisfication Discrete class labels (left) vs. continues quantities (right) measured at locations (2D for geographic

More information

Index. Geostatistics for Environmental Scientists, 2nd Edition R. Webster and M. A. Oliver 2007 John Wiley & Sons, Ltd. ISBN:

Index. Geostatistics for Environmental Scientists, 2nd Edition R. Webster and M. A. Oliver 2007 John Wiley & Sons, Ltd. ISBN: Index Akaike information criterion (AIC) 105, 290 analysis of variance 35, 44, 127 132 angular transformation 22 anisotropy 59, 99 affine or geometric 59, 100 101 anisotropy ratio 101 exploring and displaying

More information

SPATIAL-TEMPORAL TECHNIQUES FOR PREDICTION AND COMPRESSION OF SOIL FERTILITY DATA

SPATIAL-TEMPORAL TECHNIQUES FOR PREDICTION AND COMPRESSION OF SOIL FERTILITY DATA SPATIAL-TEMPORAL TECHNIQUES FOR PREDICTION AND COMPRESSION OF SOIL FERTILITY DATA D. Pokrajac Center for Information Science and Technology Temple University Philadelphia, Pennsylvania A. Lazarevic Computer

More information

Exploring the World of Ordinary Kriging. Dennis J. J. Walvoort. Wageningen University & Research Center Wageningen, The Netherlands

Exploring the World of Ordinary Kriging. Dennis J. J. Walvoort. Wageningen University & Research Center Wageningen, The Netherlands Exploring the World of Ordinary Kriging Wageningen University & Research Center Wageningen, The Netherlands July 2004 (version 0.2) What is? What is it about? Potential Users a computer program for exploring

More information

Soil Moisture Modeling using Geostatistical Techniques at the O Neal Ecological Reserve, Idaho

Soil Moisture Modeling using Geostatistical Techniques at the O Neal Ecological Reserve, Idaho Final Report: Forecasting Rangeland Condition with GIS in Southeastern Idaho Soil Moisture Modeling using Geostatistical Techniques at the O Neal Ecological Reserve, Idaho Jacob T. Tibbitts, Idaho State

More information

Spatial Interpolation & Geostatistics

Spatial Interpolation & Geostatistics (Z i Z j ) 2 / 2 Spatial Interpolation & Geostatistics Lag Lag Mean Distance between pairs of points 1 y Kriging Step 1 Describe spatial variation with Semivariogram (Z i Z j ) 2 / 2 Point cloud Map 3

More information

ENGRG Introduction to GIS

ENGRG Introduction to GIS ENGRG 59910 Introduction to GIS Michael Piasecki October 13, 2017 Lecture 06: Spatial Analysis Outline Today Concepts What is spatial interpolation Why is necessary Sample of interpolation (size and pattern)

More information

A Short Note on the Proportional Effect and Direct Sequential Simulation

A Short Note on the Proportional Effect and Direct Sequential Simulation A Short Note on the Proportional Effect and Direct Sequential Simulation Abstract B. Oz (boz@ualberta.ca) and C. V. Deutsch (cdeutsch@ualberta.ca) University of Alberta, Edmonton, Alberta, CANADA Direct

More information

Advanced analysis and modelling tools for spatial environmental data. Case study: indoor radon data in Switzerland

Advanced analysis and modelling tools for spatial environmental data. Case study: indoor radon data in Switzerland EnviroInfo 2004 (Geneva) Sh@ring EnviroInfo 2004 Advanced analysis and modelling tools for spatial environmental data. Case study: indoor radon data in Switzerland Mikhail Kanevski 1, Michel Maignan 1

More information

Mapping Precipitation in Switzerland with Ordinary and Indicator Kriging

Mapping Precipitation in Switzerland with Ordinary and Indicator Kriging Journal of Geographic Information and Decision Analysis, vol. 2, no. 2, pp. 65-76, 1998 Mapping Precipitation in Switzerland with Ordinary and Indicator Kriging Peter M. Atkinson Department of Geography,

More information

Spatial Analysis II. Spatial data analysis Spatial analysis and inference

Spatial Analysis II. Spatial data analysis Spatial analysis and inference Spatial Analysis II Spatial data analysis Spatial analysis and inference Roadmap Spatial Analysis I Outline: What is spatial analysis? Spatial Joins Step 1: Analysis of attributes Step 2: Preparing for

More information

I don t have much to say here: data are often sampled this way but we more typically model them in continuous space, or on a graph

I don t have much to say here: data are often sampled this way but we more typically model them in continuous space, or on a graph Spatial analysis Huge topic! Key references Diggle (point patterns); Cressie (everything); Diggle and Ribeiro (geostatistics); Dormann et al (GLMMs for species presence/abundance); Haining; (Pinheiro and

More information

2.6 Two-dimensional continuous interpolation 3: Kriging - introduction to geostatistics. References - geostatistics. References geostatistics (cntd.

2.6 Two-dimensional continuous interpolation 3: Kriging - introduction to geostatistics. References - geostatistics. References geostatistics (cntd. .6 Two-dimensional continuous interpolation 3: Kriging - introduction to geostatistics Spline interpolation was originally developed or image processing. In GIS, it is mainly used in visualization o spatial

More information

REN R 690 Lab Geostatistics Lab

REN R 690 Lab Geostatistics Lab REN R 690 Lab Geostatistics Lab The objective of this lab is to try out some basic geostatistical tools with R. Geostatistics is used primarily in the resource and environmental areas for estimation, uncertainty

More information

University of California, Los Angeles Department of Statistics

University of California, Los Angeles Department of Statistics University of California, Los Angeles Department of Statistics Statistics C173/C273 Instructor: Nicolas Christou Working with gstat - An example We will use the data from the Meuse (Dutch Maas) river.

More information

Geostatistical Interpolation: Kriging and the Fukushima Data. Erik Hoel Colligium Ramazzini October 30, 2011

Geostatistical Interpolation: Kriging and the Fukushima Data. Erik Hoel Colligium Ramazzini October 30, 2011 Geostatistical Interpolation: Kriging and the Fukushima Data Erik Hoel Colligium Ramazzini October 30, 2011 Agenda Basics of geostatistical interpolation Fukushima radiation Database Web site Geoanalytic

More information

Conditional Distribution Fitting of High Dimensional Stationary Data

Conditional Distribution Fitting of High Dimensional Stationary Data Conditional Distribution Fitting of High Dimensional Stationary Data Miguel Cuba and Oy Leuangthong The second order stationary assumption implies the spatial variability defined by the variogram is constant

More information

Introduction. Semivariogram Cloud

Introduction. Semivariogram Cloud Introduction Data: set of n attribute measurements {z(s i ), i = 1,, n}, available at n sample locations {s i, i = 1,, n} Objectives: Slide 1 quantify spatial auto-correlation, or attribute dissimilarity

More information

Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display.

Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 15. SPATIAL INTERPOLATION 15.1 Elements of Spatial Interpolation 15.1.1 Control Points 15.1.2 Type of Spatial Interpolation 15.2 Global Methods 15.2.1 Trend Surface Models Box 15.1 A Worked Example

More information

Investigation of Monthly Pan Evaporation in Turkey with Geostatistical Technique

Investigation of Monthly Pan Evaporation in Turkey with Geostatistical Technique Investigation of Monthly Pan Evaporation in Turkey with Geostatistical Technique Hatice Çitakoğlu 1, Murat Çobaner 1, Tefaruk Haktanir 1, 1 Department of Civil Engineering, Erciyes University, Kayseri,

More information

Chapter 1. Summer School GEOSTAT 2014, Spatio-Temporal Geostatistics,

Chapter 1. Summer School GEOSTAT 2014, Spatio-Temporal Geostatistics, Chapter 1 Summer School GEOSTAT 2014, Geostatistics, 2014-06-19 sum- http://ifgi.de/graeler Institute for Geoinformatics University of Muenster 1.1 Spatial Data From a purely statistical perspective, spatial

More information

Empirical Bayesian Kriging

Empirical Bayesian Kriging Empirical Bayesian Kriging Implemented in ArcGIS Geostatistical Analyst By Konstantin Krivoruchko, Senior Research Associate, Software Development Team, Esri Obtaining reliable environmental measurements

More information

BAYESIAN MODEL FOR SPATIAL DEPENDANCE AND PREDICTION OF TUBERCULOSIS

BAYESIAN MODEL FOR SPATIAL DEPENDANCE AND PREDICTION OF TUBERCULOSIS BAYESIAN MODEL FOR SPATIAL DEPENDANCE AND PREDICTION OF TUBERCULOSIS Srinivasan R and Venkatesan P Dept. of Statistics, National Institute for Research Tuberculosis, (Indian Council of Medical Research),

More information

Concepts and Applications of Kriging. Eric Krause Konstantin Krivoruchko

Concepts and Applications of Kriging. Eric Krause Konstantin Krivoruchko Concepts and Applications of Kriging Eric Krause Konstantin Krivoruchko Outline Introduction to interpolation Exploratory spatial data analysis (ESDA) Using the Geostatistical Wizard Validating interpolation

More information

GEOINFORMATICS Vol. II - Stochastic Modelling of Spatio-Temporal Phenomena in Earth Sciences - Soares, A.

GEOINFORMATICS Vol. II - Stochastic Modelling of Spatio-Temporal Phenomena in Earth Sciences - Soares, A. STOCHASTIC MODELLING OF SPATIOTEMPORAL PHENOMENA IN EARTH SCIENCES Soares, A. CMRP Instituto Superior Técnico, University of Lisbon. Portugal Keywords Spacetime models, geostatistics, stochastic simulation

More information

Concepts and Applications of Kriging. Eric Krause

Concepts and Applications of Kriging. Eric Krause Concepts and Applications of Kriging Eric Krause Sessions of note Tuesday ArcGIS Geostatistical Analyst - An Introduction 8:30-9:45 Room 14 A Concepts and Applications of Kriging 10:15-11:30 Room 15 A

More information

Transiogram: A spatial relationship measure for categorical data

Transiogram: A spatial relationship measure for categorical data International Journal of Geographical Information Science Vol. 20, No. 6, July 2006, 693 699 Technical Note Transiogram: A spatial relationship measure for categorical data WEIDONG LI* Department of Geography,

More information

CREATION OF DEM BY KRIGING METHOD AND EVALUATION OF THE RESULTS

CREATION OF DEM BY KRIGING METHOD AND EVALUATION OF THE RESULTS CREATION OF DEM BY KRIGING METHOD AND EVALUATION OF THE RESULTS JANA SVOBODOVÁ, PAVEL TUČEK* Jana Svobodová, Pavel Tuček: Creation of DEM by kriging method and evaluation of the results. Geomorphologia

More information

Acceptable Ergodic Fluctuations and Simulation of Skewed Distributions

Acceptable Ergodic Fluctuations and Simulation of Skewed Distributions Acceptable Ergodic Fluctuations and Simulation of Skewed Distributions Oy Leuangthong, Jason McLennan and Clayton V. Deutsch Centre for Computational Geostatistics Department of Civil & Environmental Engineering

More information

Tricks to Creating a Resource Block Model. St John s, Newfoundland and Labrador November 4, 2015

Tricks to Creating a Resource Block Model. St John s, Newfoundland and Labrador November 4, 2015 Tricks to Creating a Resource Block Model St John s, Newfoundland and Labrador November 4, 2015 Agenda 2 Domain Selection Top Cut (Grade Capping) Compositing Specific Gravity Variograms Block Size Search

More information

Advances in Locally Varying Anisotropy With MDS

Advances in Locally Varying Anisotropy With MDS Paper 102, CCG Annual Report 11, 2009 ( 2009) Advances in Locally Varying Anisotropy With MDS J.B. Boisvert and C. V. Deutsch Often, geology displays non-linear features such as veins, channels or folds/faults

More information

Highly Robust Variogram Estimation 1. Marc G. Genton 2

Highly Robust Variogram Estimation 1. Marc G. Genton 2 Mathematical Geology, Vol. 30, No. 2, 1998 Highly Robust Variogram Estimation 1 Marc G. Genton 2 The classical variogram estimator proposed by Matheron is not robust against outliers in the data, nor is

More information

An Introduction to Spatial Statistics. Chunfeng Huang Department of Statistics, Indiana University

An Introduction to Spatial Statistics. Chunfeng Huang Department of Statistics, Indiana University An Introduction to Spatial Statistics Chunfeng Huang Department of Statistics, Indiana University Microwave Sounding Unit (MSU) Anomalies (Monthly): 1979-2006. Iron Ore (Cressie, 1986) Raw percent data

More information

Modeling of Atmospheric Effects on InSAR Measurements With the Method of Stochastic Simulation

Modeling of Atmospheric Effects on InSAR Measurements With the Method of Stochastic Simulation Modeling of Atmospheric Effects on InSAR Measurements With the Method of Stochastic Simulation Z. W. LI, X. L. DING Department of Land Surveying and Geo-Informatics, Hong Kong Polytechnic University, Hung

More information

Concepts and Applications of Kriging

Concepts and Applications of Kriging Esri International User Conference San Diego, California Technical Workshops July 24, 2012 Concepts and Applications of Kriging Konstantin Krivoruchko Eric Krause Outline Intro to interpolation Exploratory

More information

ENVIRONMENTAL MONITORING Vol. II - Geostatistical Analysis of Monitoring Data - Mark Dowdall, John O Dea GEOSTATISTICAL ANALYSIS OF MONITORING DATA

ENVIRONMENTAL MONITORING Vol. II - Geostatistical Analysis of Monitoring Data - Mark Dowdall, John O Dea GEOSTATISTICAL ANALYSIS OF MONITORING DATA GEOSTATISTICAL ANALYSIS OF MONITORING DATA Mark Dowdall Norwegian Radiation Protection Authority, Environmental Protection Unit, Polar Environmental Centre, Tromso, Norway John O Dea Institute of Technology,

More information

Nonstationary models for exploring and mapping monthly precipitation in the United Kingdom

Nonstationary models for exploring and mapping monthly precipitation in the United Kingdom INTERNATIONAL JOURNAL OF CLIMATOLOGY Int. J. Climatol. 3: 39 45 (21) Published online 16 March 29 in Wiley InterScience (www.interscience.wiley.com) DOI: 1.12/joc.1892 Nonstationary models for exploring

More information

Spatial analysis. 0 move the objects and the results change

Spatial analysis. 0 move the objects and the results change 0 Outline: Roadmap 0 What is spatial analysis? 0 Transformations 0 Introduction to spatial interpolation 0 Classification of spatial interpolation methods 0 Interpolation methods 0 Areal interpolation

More information

Concepts and Applications of Kriging

Concepts and Applications of Kriging 2013 Esri International User Conference July 8 12, 2013 San Diego, California Technical Workshop Concepts and Applications of Kriging Eric Krause Konstantin Krivoruchko Outline Intro to interpolation Exploratory

More information

Estimation of direction of increase of gold mineralisation using pair-copulas

Estimation of direction of increase of gold mineralisation using pair-copulas 22nd International Congress on Modelling and Simulation, Hobart, Tasmania, Australia, 3 to 8 December 2017 mssanz.org.au/modsim2017 Estimation of direction of increase of gold mineralisation using pair-copulas

More information

Combining geological surface data and geostatistical model for Enhanced Subsurface geological model

Combining geological surface data and geostatistical model for Enhanced Subsurface geological model Combining geological surface data and geostatistical model for Enhanced Subsurface geological model M. Kurniawan Alfadli, Nanda Natasia, Iyan Haryanto Faculty of Geological Engineering Jalan Raya Bandung

More information

POPULAR CARTOGRAPHIC AREAL INTERPOLATION METHODS VIEWED FROM A GEOSTATISTICAL PERSPECTIVE

POPULAR CARTOGRAPHIC AREAL INTERPOLATION METHODS VIEWED FROM A GEOSTATISTICAL PERSPECTIVE CO-282 POPULAR CARTOGRAPHIC AREAL INTERPOLATION METHODS VIEWED FROM A GEOSTATISTICAL PERSPECTIVE KYRIAKIDIS P. University of California Santa Barbara, MYTILENE, GREECE ABSTRACT Cartographic areal interpolation

More information

Automatic Determination of Uncertainty versus Data Density

Automatic Determination of Uncertainty versus Data Density Automatic Determination of Uncertainty versus Data Density Brandon Wilde and Clayton V. Deutsch It is useful to know how various measures of uncertainty respond to changes in data density. Calculating

More information

The ProbForecastGOP Package

The ProbForecastGOP Package The ProbForecastGOP Package April 24, 2006 Title Probabilistic Weather Field Forecast using the GOP method Version 1.3 Author Yulia Gel, Adrian E. Raftery, Tilmann Gneiting, Veronica J. Berrocal Description

More information

An Introduction to Spatial Autocorrelation and Kriging

An Introduction to Spatial Autocorrelation and Kriging An Introduction to Spatial Autocorrelation and Kriging Matt Robinson and Sebastian Dietrich RenR 690 Spring 2016 Tobler and Spatial Relationships Tobler s 1 st Law of Geography: Everything is related to

More information

4th HR-HU and 15th HU geomathematical congress Geomathematics as Geoscience Reliability enhancement of groundwater estimations

4th HR-HU and 15th HU geomathematical congress Geomathematics as Geoscience Reliability enhancement of groundwater estimations Reliability enhancement of groundwater estimations Zoltán Zsolt Fehér 1,2, János Rakonczai 1, 1 Institute of Geoscience, University of Szeged, H-6722 Szeged, Hungary, 2 e-mail: zzfeher@geo.u-szeged.hu

More information

Correcting Variogram Reproduction of P-Field Simulation

Correcting Variogram Reproduction of P-Field Simulation Correcting Variogram Reproduction of P-Field Simulation Julián M. Ortiz (jmo1@ualberta.ca) Department of Civil & Environmental Engineering University of Alberta Abstract Probability field simulation is

More information

Report on Kriging in Interpolation

Report on Kriging in Interpolation Tabor Reedy ENVS421 3/12/15 Report on Kriging in Interpolation In this project I explored use of the geostatistical analyst extension and toolbar in the process of creating an interpolated surface through

More information

University of California, Los Angeles Department of Statistics. Geostatistical data: variogram and kriging

University of California, Los Angeles Department of Statistics. Geostatistical data: variogram and kriging University of California, Los Angeles Department of Statistics Statistics 403 Instructor: Nicolas Christou Geostatistical data: variogram and kriging Let Z(s) and Z(s + h) two random variables at locations

More information

Anomaly Density Estimation from Strip Transect Data: Pueblo of Isleta Example

Anomaly Density Estimation from Strip Transect Data: Pueblo of Isleta Example Anomaly Density Estimation from Strip Transect Data: Pueblo of Isleta Example Sean A. McKenna, Sandia National Laboratories Brent Pulsipher, Pacific Northwest National Laboratory May 5 Distribution Statement

More information

Using linear and non-linear kriging interpolators to produce probability maps

Using linear and non-linear kriging interpolators to produce probability maps To be presented at 2001 Annual Conference of the International Association for Mathematical Geology, Cancun, Mexico, September, IAMG2001. Using linear and non-linear kriging interpolators to produce probability

More information

A robust statistically based approach to estimating the probability of contamination occurring between sampling locations

A robust statistically based approach to estimating the probability of contamination occurring between sampling locations A robust statistically based approach to estimating the probability of contamination occurring between sampling locations Peter Beck Principal Environmental Scientist Image placeholder Image placeholder

More information

Gstat: multivariable geostatistics for S

Gstat: multivariable geostatistics for S DSC 2003 Working Papers (Draft Versions) http://www.ci.tuwien.ac.at/conferences/dsc-2003/ Gstat: multivariable geostatistics for S Edzer J. Pebesma Dept. of Physical Geography, Utrecht University, P.O.

More information

Basics of Point-Referenced Data Models

Basics of Point-Referenced Data Models Basics of Point-Referenced Data Models Basic tool is a spatial process, {Y (s), s D}, where D R r Chapter 2: Basics of Point-Referenced Data Models p. 1/45 Basics of Point-Referenced Data Models Basic

More information

Spatial Data Analysis in Archaeology Anthropology 589b. Kriging Artifact Density Surfaces in ArcGIS

Spatial Data Analysis in Archaeology Anthropology 589b. Kriging Artifact Density Surfaces in ArcGIS Spatial Data Analysis in Archaeology Anthropology 589b Fraser D. Neiman University of Virginia 2.19.07 Spring 2007 Kriging Artifact Density Surfaces in ArcGIS 1. The ingredients. -A data file -- in.dbf

More information

Lecture 8. Spatial Estimation

Lecture 8. Spatial Estimation Lecture 8 Spatial Estimation Lecture Outline Spatial Estimation Spatial Interpolation Spatial Prediction Sampling Spatial Interpolation Methods Spatial Prediction Methods Interpolating Raster Surfaces

More information

Basics in Geostatistics 2 Geostatistical interpolation/estimation: Kriging methods. Hans Wackernagel. MINES ParisTech.

Basics in Geostatistics 2 Geostatistical interpolation/estimation: Kriging methods. Hans Wackernagel. MINES ParisTech. Basics in Geostatistics 2 Geostatistical interpolation/estimation: Kriging methods Hans Wackernagel MINES ParisTech NERSC April 2013 http://hans.wackernagel.free.fr Basic concepts Geostatistics Hans Wackernagel

More information

Applied geostatistics Exercise 6 Assessing the quality of spatial predictions Geostatistical simulation

Applied geostatistics Exercise 6 Assessing the quality of spatial predictions Geostatistical simulation Applied geostatistics Exercise 6 Assessing the quality of spatial predictions Geostatistical simulation D G Rossiter University of Twente, Faculty of Geo-Information Science & Earth Observation (ITC) June

More information

A kernel indicator variogram and its application to groundwater pollution

A kernel indicator variogram and its application to groundwater pollution Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session IPS101) p.1514 A kernel indicator variogram and its application to groundwater pollution data Menezes, Raquel University

More information

What is Non-Linear Estimation?

What is Non-Linear Estimation? What is Non-Linear Estimation? You may have heard the terms Linear Estimation and Non-Linear Estimation used in relation to spatial estimation of a resource variable and perhaps wondered exactly what they

More information

11. Kriging. ACE 492 SA - Spatial Analysis Fall 2003

11. Kriging. ACE 492 SA - Spatial Analysis Fall 2003 11. Kriging ACE 492 SA - Spatial Analysis Fall 2003 c 2003 by Luc Anselin, All Rights Reserved 1 Objectives The goal of this lab is to further familiarize yourself with ESRI s Geostatistical Analyst, extending

More information

Beta-Binomial Kriging: An Improved Model for Spatial Rates

Beta-Binomial Kriging: An Improved Model for Spatial Rates Available online at www.sciencedirect.com ScienceDirect Procedia Environmental Sciences 27 (2015 ) 30 37 Spatial Statistics 2015: Emerging Patterns - Part 2 Beta-Binomial Kriging: An Improved Model for

More information

Statistícal Methods for Spatial Data Analysis

Statistícal Methods for Spatial Data Analysis Texts in Statistícal Science Statistícal Methods for Spatial Data Analysis V- Oliver Schabenberger Carol A. Gotway PCT CHAPMAN & K Contents Preface xv 1 Introduction 1 1.1 The Need for Spatial Analysis

More information

Interpolation and 3D Visualization of Geodata

Interpolation and 3D Visualization of Geodata Marek KULCZYCKI and Marcin LIGAS, Poland Key words: interpolation, kriging, real estate market analysis, property price index ABSRAC Data derived from property markets have spatial character, no doubt

More information

What s for today. All about Variogram Nugget effect. Mikyoung Jun (Texas A&M) stat647 lecture 4 September 6, / 17

What s for today. All about Variogram Nugget effect. Mikyoung Jun (Texas A&M) stat647 lecture 4 September 6, / 17 What s for today All about Variogram Nugget effect Mikyoung Jun (Texas A&M) stat647 lecture 4 September 6, 2012 1 / 17 What is the variogram? Let us consider a stationary (or isotropic) random field Z

More information

Types of Spatial Data

Types of Spatial Data Spatial Data Types of Spatial Data Point pattern Point referenced geostatistical Block referenced Raster / lattice / grid Vector / polygon Point Pattern Data Interested in the location of points, not their

More information

ASPECTS REGARDING THE USEFULNESS OF GEOGRAPHICALLY WEIGHTED REGRESSION (GWR) FOR DIGITAL MAPPING OF SOIL PARAMETERS

ASPECTS REGARDING THE USEFULNESS OF GEOGRAPHICALLY WEIGHTED REGRESSION (GWR) FOR DIGITAL MAPPING OF SOIL PARAMETERS Lucrări Ştiinţifice vol. 52, seria Agronomie ASPECTS REGARDING THE USEFULNESS OF GEOGRAPHICALLY WEIGHTED REGRESSION (GWR) FOR DIGITAL MAPPING OF SOIL PARAMETERS C. PATRICHE 1, I. VASILINIUC 2 1 Romanian

More information

Introduction to Geostatistics

Introduction to Geostatistics Introduction to Geostatistics Abhi Datta 1, Sudipto Banerjee 2 and Andrew O. Finley 3 July 31, 2017 1 Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore,

More information

Bayesian Transgaussian Kriging

Bayesian Transgaussian Kriging 1 Bayesian Transgaussian Kriging Hannes Müller Institut für Statistik University of Klagenfurt 9020 Austria Keywords: Kriging, Bayesian statistics AMS: 62H11,60G90 Abstract In geostatistics a widely used

More information

Point patterns. The average number of events (the intensity, (s)) is homogeneous

Point patterns. The average number of events (the intensity, (s)) is homogeneous Point patterns Consider the point process {Z(s) :s 2 D R 2 }.Arealization of this process consists of a pattern (arrangement) of points in D. (D is a random set.) These points are called the events of

More information

Improving Spatial Data Interoperability

Improving Spatial Data Interoperability Improving Spatial Data Interoperability A Framework for Geostatistical Support-To To-Support Interpolation Michael F. Goodchild, Phaedon C. Kyriakidis, Philipp Schneider, Matt Rice, Qingfeng Guan, Jordan

More information

Kriging Luc Anselin, All Rights Reserved

Kriging Luc Anselin, All Rights Reserved Kriging Luc Anselin Spatial Analysis Laboratory Dept. Agricultural and Consumer Economics University of Illinois, Urbana-Champaign http://sal.agecon.uiuc.edu Outline Principles Kriging Models Spatial Interpolation

More information

Roger S. Bivand Edzer J. Pebesma Virgilio Gömez-Rubio. Applied Spatial Data Analysis with R. 4:1 Springer

Roger S. Bivand Edzer J. Pebesma Virgilio Gömez-Rubio. Applied Spatial Data Analysis with R. 4:1 Springer Roger S. Bivand Edzer J. Pebesma Virgilio Gömez-Rubio Applied Spatial Data Analysis with R 4:1 Springer Contents Preface VII 1 Hello World: Introducing Spatial Data 1 1.1 Applied Spatial Data Analysis

More information

Contents 1 Introduction 2 Statistical Tools and Concepts

Contents 1 Introduction 2 Statistical Tools and Concepts 1 Introduction... 1 1.1 Objectives and Approach... 1 1.2 Scope of Resource Modeling... 2 1.3 Critical Aspects... 2 1.3.1 Data Assembly and Data Quality... 2 1.3.2 Geologic Model and Definition of Estimation

More information

Spatial Backfitting of Roller Measurement Values from a Florida Test Bed

Spatial Backfitting of Roller Measurement Values from a Florida Test Bed Spatial Backfitting of Roller Measurement Values from a Florida Test Bed Daniel K. Heersink 1, Reinhard Furrer 1, and Mike A. Mooney 2 1 Institute of Mathematics, University of Zurich, CH-8057 Zurich 2

More information

Entropy of Gaussian Random Functions and Consequences in Geostatistics

Entropy of Gaussian Random Functions and Consequences in Geostatistics Entropy of Gaussian Random Functions and Consequences in Geostatistics Paula Larrondo (larrondo@ualberta.ca) Department of Civil & Environmental Engineering University of Alberta Abstract Sequential Gaussian

More information

On dealing with spatially correlated residuals in remote sensing and GIS

On dealing with spatially correlated residuals in remote sensing and GIS On dealing with spatially correlated residuals in remote sensing and GIS Nicholas A. S. Hamm 1, Peter M. Atkinson and Edward J. Milton 3 School of Geography University of Southampton Southampton SO17 3AT

More information

Substitution Random Fields with Gaussian and Gamma Distributions: Theory and Application to a Pollution Data Set

Substitution Random Fields with Gaussian and Gamma Distributions: Theory and Application to a Pollution Data Set Substitution Random Fields with Gaussian and Gamma Distributions: Theory and Application to a Pollution Data Set Xavier Emery Abstract This paper presents random field models with Gaussian or gamma univariate

More information

Geostatistical Determination of Production Uncertainty: Application to Firebag Project

Geostatistical Determination of Production Uncertainty: Application to Firebag Project Geostatistical Determination of Production Uncertainty: Application to Firebag Project Abstract C. V. Deutsch, University of Alberta (cdeutsch@civil.ualberta.ca) E. Dembicki and K.C. Yeung, Suncor Energy

More information

Spatial Interpolation Comparison Evaluation of spatial prediction methods

Spatial Interpolation Comparison Evaluation of spatial prediction methods Spatial Interpolation Comparison Evaluation of spatial prediction methods Tomislav Hengl ISRIC World Soil Information, Wageningen University Topic Geostatistics = a toolbox to generate maps from point

More information

Statistical Tools and Concepts

Statistical Tools and Concepts Statistical Tools and Concepts Abstract Mineral resource estimation requires extensive use of statistics. In our context, statistics are mathematical methods for collecting, organizing, and interpreting

More information

Chapter 6 Spatial Analysis

Chapter 6 Spatial Analysis 6.1 Introduction Chapter 6 Spatial Analysis Spatial analysis, in a narrow sense, is a set of mathematical (and usually statistical) tools used to find order and patterns in spatial phenomena. Spatial patterns

More information

Toward an automatic real-time mapping system for radiation hazards

Toward an automatic real-time mapping system for radiation hazards Toward an automatic real-time mapping system for radiation hazards Paul H. Hiemstra 1, Edzer J. Pebesma 2, Chris J.W. Twenhöfel 3, Gerard B.M. Heuvelink 4 1 Faculty of Geosciences / University of Utrecht

More information

A Geostatistical Approach to Predict the Average Annual Rainfall of Bangladesh

A Geostatistical Approach to Predict the Average Annual Rainfall of Bangladesh Journal of Data Science 14(2016), 149-166 A Geostatistical Approach to Predict the Average Annual Rainfall of Bangladesh Mohammad Samsul Alam 1 and Syed Shahadat Hossain 1 1 Institute of Statistical Research

More information

Lecture 9: Introduction to Kriging

Lecture 9: Introduction to Kriging Lecture 9: Introduction to Kriging Math 586 Beginning remarks Kriging is a commonly used method of interpolation (prediction) for spatial data. The data are a set of observations of some variable(s) of

More information

Gstat: Multivariable Geostatistics for S

Gstat: Multivariable Geostatistics for S New URL: http://www.r-project.org/conferences/dsc-2003/ Proceedings of the 3rd International Workshop on Distributed Statistical Computing (DSC 2003) March 20 22, Vienna, Austria ISSN 1609-395X Kurt Hornik,

More information