Assessing the covariance function in geostatistics
|
|
- Magdalen Francis
- 5 years ago
- Views:
Transcription
1 Statistics & Probability Letters 52 (2001) Assessing the covariance function in geostatistics Ana F. Militino, M. Dolores Ugarte Departamento de Estadstica e Investigacion Operativa, Universidad Publica de Navarra, Campus Arrosada, Pamplona, Navarra, Spain Received October 1999; received in revised form October 2000 Abstract In geostatistics, one of the crucial problems is the choice of the covariance function. In this paper we show how to improve the cross-validation criterion, traditionally used for evaluating the t of a covariance function, in the case of unequally spaced data. c 2001 Elsevier Science B.V. All rights reserved Keywords: Dirichlet tessellation; Cross-validation; Mean integrated squared error; Spatial linear model 0. Introduction Interpolation based on spatial dependence of samples was rst used by Krige (1951) for estimation of the gold content of ore bodies in the mining industry of South Africa. However, classical interpolation procedures were considered inappropriate. They were biased and non-optimal due to the fact that they did not take into account local spatial dependence during estimation. Krige s practical methods were generalized by Matheron (1962) into the theory of regionalized variables. A regionalized variable Z(x) is a random variable that takes dierent values according to its location x within some region. Nowadays, kriging is a technique of making optimal, unbiased estimates of regionalized variables at unsampled locations using hypothesis of stationarity and structural properties of the covariance and the initial set of data values. The regionalized function, representing the variable under study, is split into a deterministic trend or drift plus a random error function with zero mean. The procedure called universal kriging assumes that the drift can be expressed locally as a linear combination of base functions (generally polynomials) with unknown location parameters. If the spatial variance covariance structure is known, universal kriging generally gives precise predictions, because it chooses optimal weights to be applied to the data. A basic tool in evaluating statistical models is cross-validation. In the geostatistical context, cross-validation is widely used to assess the t of a covariance function. Although the continuity of the parameters and Corresponding author. address: militino@unavarra.es (A.F. Militino) /01/$ - see front matter c 2001 Elsevier Science B.V. All rights reserved PII: S (00)
2 200 A.F. Militino, M.D. Ugarte / Statistics & Probability Letters 52 (2001) the great variety of covariance models make the task of choosing the right covariance model dicult, the cross-validation criterion plays an important role because it helps to choose the best model among a certain number of potential candidates. Besides the procedure is much used in practice by geologists. The purpose of this paper is two-fold. Firstly, we propose an improvement of the cross-validation procedure in the case of unequally spaced data. A weighted cross-validate statistic is presented by using the tile areas of the Dirichlet tessellation. This statistic is optimum in the sense that it gives estimators minimizing the mean integrated squared error. Secondly, we evaluate the behaviour of the new statistic presenting some simulation results based on two well-known examples of the geostatistical literature. 1. The spatial linear model Let {Z(x); x D} denote a covariance-stationary Gaussian process dened on D, x being the location, where an observation is taken. The spatial linear model is dened by Z(x)=S(x)+(x); (1) where S(x) is the underlying spatial surface. Universal kriging assumes that this surface may be expressed as a polynomial surface p S(x)= f k (x) k = f (x) T ; k=1 (2) where f (x)=(f 1 (x);:::;f p (x)) T is a known base function of the spatial coordinates x and T indicates transpose. Let =( 1 ;:::; p ) T be the location parameters, deterministic but unknown drift coecients and (x) a zero-mean random error process with var[(x)] = 2. The procedure requires knowledge of the covariance function K(x i ; x j ) = Cov[Z(x i );Z(x j )] for all x i ; x j D, which is usually unknown. There are several parametric models such as exponential, Gaussian or spherical which are widely used in practice. Much eort has been devoted to estimate its unknown parameters. Some of these methods require considerable more computational eort than others. Generally, however, several estimators are applicable in any given study and it is of practical use by geologists to make approximations, sometimes tted by eye from the theoretical covariance function to the empirical one. Alternatively, we may characterize the second-order moment structure necessary for the estimation procedure through the variogram. The variogram or equivalently the semivariogram is written as (h)= 1 2 Var[Z(x + h) Z(x)] = 1 2 E[Z(x + h) Z(x)]2 ; where Z(x+h) is the value of the random function at point x+h; Z(x) is the value of the random function at the point x and h is the deviation vector between the points in space x + h and x. It can be replaced by ( h ) under isotropy. It is easy to verify that the covariance function and semivariogram of a covariance-stationary random function are related, because (h) = Var[Z(x)] Cov[Z(x + h);z(x)]: This relationship allows for the estimation of the covariance function or the variogram in an equivalent way. 2. Kriging To obtain a kriging predictor let us suppose that a stationary process is observed at each of the n distinct locations x 1 ;:::;x n, yielding the sample vector Z N =(z(x 1 );:::;z(x n )) T. The kriging method follows a
3 A.F. Militino, M.D. Ugarte / Statistics & Probability Letters 52 (2001) three-step procedure: an ordinary least-squares regression (OLS), an estimation of the covariance matrix using OLS residuals, and a generalized least-squares estimator (GLS) using the estimated covariance matrix. Then, the location parameters of model (1) are given by ˆ GLS = arg min MSE [Ẑ(x)]; expressed in the form ˆ GLS =[F T K 1 F] 1 F T K 1 Z N ; (3) where F is the design matrix f 1 (x 1 ) ::: f p (x 1 ) f (x 1 ) T F =. :::. =. f 1 (x n ) ::: f p (x n ) f (x n ) T K is the covariance matrix K(x 1 ; x 1 ) ::: K(x 1 ; x n ) K =. :::. K(x n ; x 1 ) ::: K(x n ; x n ) ; and K 1 is the inverse of K. The covariance matrix of ˆ is given by [F T K 1 F T ] 1. If the covariance function is known, kriging gives an estimate of Z at a new location x as Ẑ(x)=f (x) T ˆ + k(x) T K 1 (Z N F ˆ); (4) where k(x)=(k(x 1 ; x);:::;k(x n ; x)) T is the covariance vector between the new prediction at x and the whole sample. Eq. (4) corresponds to the BLUP or best linear unbiased predictor, with kriging variance or mean squared error k(x)=e[z(x) 2 Ẑ(x)] 2 = 2 k(x) T K 1 k(x) +[f (x) F T K 1 k(x)] T (F T K 1 F) 1 [ f (x) F T K 1 k(x)]: (5) Robust alternatives to kriging have been proposed by Militino and Ugarte (1997). 3. A weighted cross-validate statistic Usually, dierent variogram models can often be tted to an empirical variogram. One of the crucial problems is how to know which is the best. Cross-validation is often used for such aim. The procedure consists of eliminating one of the data points temporarily from the set and then predict its value by kriging using the remaining data points. Repeating the procedure for all the points, we are able to estimate a cross-validate (CV) statistic n ( ) 2 zi ẑ (i) CV = (6) i=1 (i) and choose the parameters of the covariance function that make CV close to n. However, several authors have pointed out that practical problems may arise when using this technique (Ripley, 1981; Amstrong, 1998). The error sum of squares may be dominated by a few data points that are hard to predict, for example, points at the border of the region of interest which are more isolated. In the presence of unequally spaced data this problem could be more relevant. Our proposal consists of improving the CV criterion by using the tile areas
4 202 A.F. Militino, M.D. Ugarte / Statistics & Probability Letters 52 (2001) of the Dirichlet tessellation. With these areas, it is possible to balance the inuence of isolated data points in the leaving-out process, because under spatial dependence when we leave a data point, we also eliminate its area of inuence. In addition, we show below that the statistic that we provide is optimum in the sense that it gives estimators minimizing the mean integrated squared error (MISE). The MISE was introduced in the context of density estimation and has been frequently used to measure the discrepancy of an estimator and the true density. It has two dierent though equivalent interpretations: it is a measure of both the average global error and the accumulated point-wise error. We express it as MISE(Ẑ)= E[Z(x) Ẑ(x)] 2 dx = MSE(Ẑ)dx; (7) D D corresponding to the volume generated by the mean square error (MSE) over D, the region of interest. Now, it is possible to make a partition of the region of interest through the Dirichlet tessellation and assume that inside each polygon D i, the mean squared error is the same. MISE(Ẑ)= k(x)dx 2 = k(x)dx: D D 2 (8) i i D Dirichlet tessellation is based on associating a polygon D i to each point of a sample inside D in such a way that all of the points in this polygon, also called Dirichlet cell or tile, are nearer to the referenced sampled point than do any other. To approximate MISE(Ẑ) we use cross-validation, deleting an observation from the data and retting the model using the reduced data set. Typically, each observation is deleted in turn. Its purpose is to estimate in a realistic way the mean squared error of the prediction. Therefore, the use of kriging variance dened without the ith observation k(i) 2 (x)=e[z i(x) Ẑ (i) (x)] 2 is proposed, and then k(x)dx 2 w i E[Z i (x) Ẑ (i) (x)] 2 ; (9) D i D i D i where w i =area(tile) i for i =1;:::;n are the weights calculated as the tile areas of the Dirichlet tessellation. In order to estimate it, let us introduce the required notation and some quick formulae based on Christensen et al. (1992). The subscript (i) indicates that the observation i has been removed. Let Z (i) =(z 1 ;:::;z i 1 ;z i+1 ;:::;z n ) T be the vector of observations excluding the ith component, and K the covariance matrix of the whole sample, given in block form by ( kii k T K = i k i K (i) ) ; where k i = Cov(z i ; Z (i) ) is the vector of covariances between z i and the rest of the sample, k ii =Cov(z i ;z i ) and K (i) is the covariance matrix of Z (i). Denote by K 1 (i) the inverse of K (i), F (i) the design matrix after removing the ith observation, f (x i )=f (x i ) F(i) T K 1 (i) k i and z i =z i Z(i) T K 1 (i) k i where var( z i )=si 2 and si 2 =k ii ki T K 1 (i) k i. We dene ẽ i = z i ẑ (i) and after some algebra we obtain ẽ i = z i f T i ˆ (i) =( z i f T i where h i = f T i (F T K 1 F) 1 f i and s 2 i ˆ) ; si 2 h i ˆ (i) = ˆ (F T K 1 F) 1 f i ( z i f T i ˆ) : si 2 h i
5 A.F. Militino, M.D. Ugarte / Statistics & Probability Letters 52 (2001) Thus, approximating E[Z i (x) Ẑ (i) (x)] 2 by [z i ẑ (i) ] 2, we conclude n n MISE(Ẑ) w i [z i ẑ (i) ] 2 = w i ẽ 2 i : (10) i=1 i=1 As usual, we proceed standardizing the residuals and then dividing by the square root of Var(ẽ i )= s4 i : si 2 h i Therefore, we call SMISE (standardized MISE) the statistic n w i i=1 z i f T ˆ i : si 2 h i 2 (11) One just needs to minimize (11) with respect to the parameters of the covariance function to obtain the estimates of such parameters. In order to evaluate the behaviour of the proposed SMISE statistic (11) versus CV (6) we conducted a simulation study based on two well-known examples of the literature Example 1 The example corresponds to the 52 topographic data cited in Davis (1986, Table 5:11). They are observations over a small area on the northern side of a hill. Davis was interested in the analysis of maps, and he used the survey to produce contours of the region. We assume a quadratic surface and an exponential variogram given by { (h; a; b; r)= 0 if h =0; otherwise; a + b(1 exp( h r )) where a is the nugget eect, b is the sill, r is the range parameter and h is the Euclidean distance. Here, there is no nugget eect and the sill is a common constant that we can drop it out. Table 1 shows both CV and SMISE estimations for dierent range values of the exponential variogram. The areas of the topographic data set are plotted in Fig. 1. The minimum value of the SMISE statistic is that corresponds to the range value 0.9. This is also the weighted least-squares estimate provided by Cressie (1985). Now, we proceed to simulate 100 data sets using the scenario as encountered in the analysis of the topographic data, which means that we are using the same locations and we xed the range parameter of the covariance function equal to 0.9. We count the number of times where the minimum has been reached at ranges 0.1, 0.5, 0.9 and 1.4 with statistics (11) and (6). The results on the right of Table 1 show that SMISE is more successful than CV identifying the correct range of the covariance function. The identication of the range 0.9 is correct for Table 1 Analysis of the topographic data set and simulations Topo data set Simulations Range CV SMISE CV SMISE :63 a 44:09 a a Values of CV and SMISE for the correct parameter.
6 204 A.F. Militino, M.D. Ugarte / Statistics & Probability Letters 52 (2001) Fig. 1. Areas of the Dirichlet tessellation in the topographic data set. SMISE in all of the simulations, whereas CV fails almost 30% of them. In fact, it reaches its minimum at range 0.9, 72% of the times and the rest identies the range 0.5 as optimum Example 2 The aquifer head data from the Saratoga Valley in Wyoming is a data set of 93 observations which have been published and analyzed by Jones (1989) using stochastic partial dierential equations. This example has also been studied by Christensen et al. (1992), who t a linear trend and a Gaussian covariance function. The Gaussian variogram is given by { 0 if h =0; (h; a; b; r)= a + b(1 exp(( h r )2 )) otherwise: They show the great sensitivity of the covariance parameter estimates to the sampled data. In fact, this data set presents several outliers and give as restricted maximum likelihood estimates: a = 25, b = 1200 and r =8. The tile areas of the Dirichlet tessellation are given in Fig. 2. Table 2 shows the CV and SMISE estimations with nugget = 25 and b = Both statistics reach the minimum at range 4. Then, we simulate 100 data sets with parameters of the covariance function a = 25, b = 1200 and range = 4 and at the same spatial locations. We have checked the simulations in these parameters and allowed for small oscillations in the parameter estimates. This is why the right of Table 2 shows the frequencies of identifying the ranges 0.1, 4, 6, 8, 10 and 12. Range 4 is identied by SMISE in 95 simulations, whereas only 47 are identied using CV. We also observe larger dierences between both statistics due to the fact that data are quite irregularly spaced and the areas are quite big. Here, the use of the tile areas improves considerably the identication of the correct range parameter.
7 A.F. Militino, M.D. Ugarte / Statistics & Probability Letters 52 (2001) Fig. 2. Areas of the Dirichlet tessellation in the aquifer head data set. Table 2 Analysis of the aquifer data set and simulations Aquifer data set Simulations Range Nugget Sill CV SMISE CV SMISE :80 a 1262:14 a a Values of CV and SMISE for the correct parameters. 4. Discussion In geostatistics, one of the main uses of the cross-validation criterion has been to support the selection of a particular semivariogram model. Although this technique should not be interpreted as a conrmatory tool, it helps to choose the right model among a discrete number of potential candidates. Traditional cross-validation is based on minimizing a point-wise measure of error. This means that for example, points at the border of the region of interest that are dicult to predict, contribute heavily to the sum of errors. The weighted criterion that it is presented here, is based on minimizing a global measure of error which means that it will not be highly aected by the presence of points that are far apart. On the other hand, the use of the tile areas of the Dirichlet tessellation helps to cope with the problem of adapting the traditional cross-validation from independent and identically distributed data to the geostatistical context. The areas permit to take into account
8 206 A.F. Militino, M.D. Ugarte / Statistics & Probability Letters 52 (2001) not just a particular data point but its area of inuence. The new statistic does not warranty a complete success in identifying the covariance function parameters, but the simulation results show that it presents a reasonable criterion to choose the appropriate one, improving at the same time the traditional simple cross-validation. References Amstrong, M., Basic Linear Geostatistics. Springer, Berlin. Christensen, R., Johnson, W., Pearson, L.M., Prediction diagnostics for spatial linear models. Biometrika 79, Cressie, N., Fitting variogram models by weighted least squares. Math. Geol. 17, Davis, J.C., Statistics and Data Analysis in Geology, 2nd Edition. Wiley, New York. Jones, R.H., Fitting a stochastic partial dierential equation to aquifer data. Stochastic Hydrol. Hydraul. 3, Krige, D.G., A statistical approach to some basic mine valuation problems on the Witwatersrand. J. Chem. Metall. Min. Soc. South Africa 52, Matheron, G., Traite de Geostatistique Apliquee. Tome I: Memoires du Bureau de Reserches Geologiques et Minieres, Technip, Paris. Militino, A.F., Ugarte, M.D., Bounded inuence estimation in a spatial linear mixed model. In: Gregoire, T.G. et al. (Eds.), Modelling Longitudinal and Spatially Correlated Data, Lecture notes in statistics, vol Springer, New York, pp Ripley, B.D., Spatial Statistics. Wiley, New York.
Highly Robust Variogram Estimation 1. Marc G. Genton 2
Mathematical Geology, Vol. 30, No. 2, 1998 Highly Robust Variogram Estimation 1 Marc G. Genton 2 The classical variogram estimator proposed by Matheron is not robust against outliers in the data, nor is
More informationPRODUCING PROBABILITY MAPS TO ASSESS RISK OF EXCEEDING CRITICAL THRESHOLD VALUE OF SOIL EC USING GEOSTATISTICAL APPROACH
PRODUCING PROBABILITY MAPS TO ASSESS RISK OF EXCEEDING CRITICAL THRESHOLD VALUE OF SOIL EC USING GEOSTATISTICAL APPROACH SURESH TRIPATHI Geostatistical Society of India Assumptions and Geostatistical Variogram
More informationStatistícal Methods for Spatial Data Analysis
Texts in Statistícal Science Statistícal Methods for Spatial Data Analysis V- Oliver Schabenberger Carol A. Gotway PCT CHAPMAN & K Contents Preface xv 1 Introduction 1 1.1 The Need for Spatial Analysis
More informationIntroduction to Spatial Data and Models
Introduction to Spatial Data and Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Department of Forestry & Department of Geography, Michigan State University, Lansing Michigan, U.S.A. 2 Biostatistics,
More informationBasics of Point-Referenced Data Models
Basics of Point-Referenced Data Models Basic tool is a spatial process, {Y (s), s D}, where D R r Chapter 2: Basics of Point-Referenced Data Models p. 1/45 Basics of Point-Referenced Data Models Basic
More informationIntroduction to Spatial Data and Models
Introduction to Spatial Data and Models Sudipto Banerjee 1 and Andrew O. Finley 2 1 Biostatistics, School of Public Health, University of Minnesota, Minneapolis, Minnesota, U.S.A. 2 Department of Forestry
More informationNonparametric variogram and covariogram estimation with Fourier Bessel matrices
Computational Statistics & Data Analysis 41 (2002) 47 57 www.elsevier.com/locate/csda Nonparametric variogram and covariogram estimation with Fourier Bessel matrices Marc G. Genton a;, David J. Gorsich
More informationKriging by Example: Regression of oceanographic data. Paris Perdikaris. Brown University, Division of Applied Mathematics
Kriging by Example: Regression of oceanographic data Paris Perdikaris Brown University, Division of Applied Mathematics! January, 0 Sea Grant College Program Massachusetts Institute of Technology Cambridge,
More informationIndex. Geostatistics for Environmental Scientists, 2nd Edition R. Webster and M. A. Oliver 2007 John Wiley & Sons, Ltd. ISBN:
Index Akaike information criterion (AIC) 105, 290 analysis of variance 35, 44, 127 132 angular transformation 22 anisotropy 59, 99 affine or geometric 59, 100 101 anisotropy ratio 101 exploring and displaying
More informationarxiv: v1 [stat.me] 24 May 2010
The role of the nugget term in the Gaussian process method Andrey Pepelyshev arxiv:1005.4385v1 [stat.me] 24 May 2010 Abstract The maximum likelihood estimate of the correlation parameter of a Gaussian
More informationLimit Kriging. Abstract
Limit Kriging V. Roshan Joseph School of Industrial and Systems Engineering Georgia Institute of Technology Atlanta, GA 30332-0205, USA roshan@isye.gatech.edu Abstract A new kriging predictor is proposed.
More informationBasics in Geostatistics 2 Geostatistical interpolation/estimation: Kriging methods. Hans Wackernagel. MINES ParisTech.
Basics in Geostatistics 2 Geostatistical interpolation/estimation: Kriging methods Hans Wackernagel MINES ParisTech NERSC April 2013 http://hans.wackernagel.free.fr Basic concepts Geostatistics Hans Wackernagel
More informationGeostatistics in Hydrology: Kriging interpolation
Chapter Geostatistics in Hydrology: Kriging interpolation Hydrologic properties, such as rainfall, aquifer characteristics (porosity, hydraulic conductivity, transmissivity, storage coefficient, etc.),
More informationTypes of Spatial Data
Spatial Data Types of Spatial Data Point pattern Point referenced geostatistical Block referenced Raster / lattice / grid Vector / polygon Point Pattern Data Interested in the location of points, not their
More informationInvestigation of Monthly Pan Evaporation in Turkey with Geostatistical Technique
Investigation of Monthly Pan Evaporation in Turkey with Geostatistical Technique Hatice Çitakoğlu 1, Murat Çobaner 1, Tefaruk Haktanir 1, 1 Department of Civil Engineering, Erciyes University, Kayseri,
More informationIntroduction to Geostatistics
Introduction to Geostatistics Abhi Datta 1, Sudipto Banerjee 2 and Andrew O. Finley 3 July 31, 2017 1 Department of Biostatistics, Bloomberg School of Public Health, Johns Hopkins University, Baltimore,
More informationChapter 4 - Fundamentals of spatial processes Lecture notes
TK4150 - Intro 1 Chapter 4 - Fundamentals of spatial processes Lecture notes Odd Kolbjørnsen and Geir Storvik January 30, 2017 STK4150 - Intro 2 Spatial processes Typically correlation between nearby sites
More informationIs there still room for new developments in geostatistics?
Is there still room for new developments in geostatistics? Jean-Paul Chilès MINES ParisTech, Fontainebleau, France, IAMG 34th IGC, Brisbane, 8 August 2012 Matheron: books and monographs 1962-1963: Treatise
More informationInterpolation and 3D Visualization of Geodata
Marek KULCZYCKI and Marcin LIGAS, Poland Key words: interpolation, kriging, real estate market analysis, property price index ABSRAC Data derived from property markets have spatial character, no doubt
More informationIntroduction. Semivariogram Cloud
Introduction Data: set of n attribute measurements {z(s i ), i = 1,, n}, available at n sample locations {s i, i = 1,, n} Objectives: Slide 1 quantify spatial auto-correlation, or attribute dissimilarity
More information7 Geostatistics. Figure 7.1 Focus of geostatistics
7 Geostatistics 7.1 Introduction Geostatistics is the part of statistics that is concerned with geo-referenced data, i.e. data that are linked to spatial coordinates. To describe the spatial variation
More informationA kernel indicator variogram and its application to groundwater pollution
Int. Statistical Inst.: Proc. 58th World Statistical Congress, 2011, Dublin (Session IPS101) p.1514 A kernel indicator variogram and its application to groundwater pollution data Menezes, Raquel University
More informationReduced rank regression in cointegrated models
Journal of Econometrics 06 (2002) 203 26 www.elsevier.com/locate/econbase Reduced rank regression in cointegrated models.w. Anderson Department of Statistics, Stanford University, Stanford, CA 94305-4065,
More informationA STUDY OF MEAN AREAL PRECIPITATION AND SPATIAL STRUCTURE OF RAINFALL DISTRIBUTION IN THE TSEN-WEN RIVER BASIN
Journal of the Chinese Institute of Engineers, Vol. 24, No. 5, pp. 649-658 (2001) 649 A STUDY OF MEAN AREAL PRECIPITATION AND SPATIAL STRUCTURE OF RAINFALL DISTRIBUTION IN THE TSEN-WEN RIVER BASIN Jet-Chau
More informationCopyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display.
Chapter 15. SPATIAL INTERPOLATION 15.1 Elements of Spatial Interpolation 15.1.1 Control Points 15.1.2 Type of Spatial Interpolation 15.2 Global Methods 15.2.1 Trend Surface Models Box 15.1 A Worked Example
More informationMultivariate Geostatistics
Hans Wackernagel Multivariate Geostatistics An Introduction with Applications Third, completely revised edition with 117 Figures and 7 Tables Springer Contents 1 Introduction A From Statistics to Geostatistics
More information2.6 Two-dimensional continuous interpolation 3: Kriging - introduction to geostatistics. References - geostatistics. References geostatistics (cntd.
.6 Two-dimensional continuous interpolation 3: Kriging - introduction to geostatistics Spline interpolation was originally developed or image processing. In GIS, it is mainly used in visualization o spatial
More informationSpatial analysis of annual rainfall using ordinary kriging techniques and lognormal kriging in the Cheliff watershed. Algeria.
15 th International Conference on Environmental Science and Technology Rhodes, Greece, 31 August to 2 September 2017 Spatial analysis of annual rainfall using ordinary kriging techniques and lognormal
More informationSpatial data analysis
Spatial data analysis Global Health Sciences Global Health Group Data Science Africa 2016 Ricardo Andrade Outline The geographic context Geostatistics Non-linear models Discrete processes Time interactions
More informationLinear Regression and Its Applications
Linear Regression and Its Applications Predrag Radivojac October 13, 2014 Given a data set D = {(x i, y i )} n the objective is to learn the relationship between features and the target. We usually start
More informationAn Introduction to Spatial Statistics. Chunfeng Huang Department of Statistics, Indiana University
An Introduction to Spatial Statistics Chunfeng Huang Department of Statistics, Indiana University Microwave Sounding Unit (MSU) Anomalies (Monthly): 1979-2006. Iron Ore (Cressie, 1986) Raw percent data
More informationGeneralized cross-covariances and their estimation
Research Collection Working Paper Generalized cross-covariances and their estimation Author(s): Künsch, Hansruedi; Papritz, Andreas Jürg; Bassi, F. Publication Date: 1996 Permanent Link: https://doi.org/10.3929/ethz-a-001673771
More informationMETHODOLOGY WHICH APPLIES GEOSTATISTICS TECHNIQUES TO THE TOPOGRAPHICAL SURVEY
International Journal of Computer Science and Applications, 2008, Vol. 5, No. 3a, pp 67-79 Technomathematics Research Foundation METHODOLOGY WHICH APPLIES GEOSTATISTICS TECHNIQUES TO THE TOPOGRAPHICAL
More informationTraps for the Unwary Subsurface Geoscientist
Traps for the Unwary Subsurface Geoscientist ashley.francis@sorviodvnvm.co.uk http://www.sorviodvnvm.co.uk Presented at SEG Development & Production Forum, 24-29 th June 2001, Taos, New Mexico, USA 24-29
More informationBayesian Transgaussian Kriging
1 Bayesian Transgaussian Kriging Hannes Müller Institut für Statistik University of Klagenfurt 9020 Austria Keywords: Kriging, Bayesian statistics AMS: 62H11,60G90 Abstract In geostatistics a widely used
More informationRidge analysis of mixture response surfaces
Statistics & Probability Letters 48 (2000) 3 40 Ridge analysis of mixture response surfaces Norman R. Draper a;, Friedrich Pukelsheim b a Department of Statistics, University of Wisconsin, 20 West Dayton
More informationGeostatistics for Gaussian processes
Introduction Geostatistical Model Covariance structure Cokriging Conclusion Geostatistics for Gaussian processes Hans Wackernagel Geostatistics group MINES ParisTech http://hans.wackernagel.free.fr Kernels
More informationPARAMETER ESTIMATION FOR FRACTIONAL BROWNIAN SURFACES
Statistica Sinica 2(2002), 863-883 PARAMETER ESTIMATION FOR FRACTIONAL BROWNIAN SURFACES Zhengyuan Zhu and Michael L. Stein University of Chicago Abstract: We study the use of increments to estimate the
More informationKriging Methodology for Regional Economic Analysis: Estimating the Housing Price in Albacete
Kriging Methodology for Regional Economic Analysis: Estimating the Housing Price in Albacete MATÍAS GÁMEZ MARTÍNEZ, JOSÉ MARÍA MONTERO LORENZO, AND NOELIA GARCÍA RUBIO * Because of the socioeconomic importance
More informationPade approximants and noise: rational functions
Journal of Computational and Applied Mathematics 105 (1999) 285 297 Pade approximants and noise: rational functions Jacek Gilewicz a; a; b;1, Maciej Pindor a Centre de Physique Theorique, Unite Propre
More informationAn Introduction to Spatial Autocorrelation and Kriging
An Introduction to Spatial Autocorrelation and Kriging Matt Robinson and Sebastian Dietrich RenR 690 Spring 2016 Tobler and Spatial Relationships Tobler s 1 st Law of Geography: Everything is related to
More informationSpatial Backfitting of Roller Measurement Values from a Florida Test Bed
Spatial Backfitting of Roller Measurement Values from a Florida Test Bed Daniel K. Heersink 1, Reinhard Furrer 1, and Mike A. Mooney 2 1 Institute of Mathematics, University of Zurich, CH-8057 Zurich 2
More informationWhat s for today. Introduction to Space-time models. c Mikyoung Jun (Texas A&M) Stat647 Lecture 14 October 16, / 19
What s for today Introduction to Space-time models c Mikyoung Jun (Texas A&M) Stat647 Lecture 14 October 16, 2012 1 / 19 Space-time Data So far we looked at the data that vary over space Now we add another
More informationSpatial analysis is the quantitative study of phenomena that are located in space.
c HYON-JUNG KIM, 2016 1 Introduction Spatial analysis is the quantitative study of phenomena that are located in space. Spatial data analysis usually refers to an analysis of the observations in which
More informationInfluence of parameter estimation uncertainty in Kriging: Part 2 Test and case study applications
Hydrology and Earth System Influence Sciences, of 5(), parameter 5 3 estimation (1) uncertainty EGS in Kriging: Part Test and case study applications Influence of parameter estimation uncertainty in Kriging:
More informationOn dealing with spatially correlated residuals in remote sensing and GIS
On dealing with spatially correlated residuals in remote sensing and GIS Nicholas A. S. Hamm 1, Peter M. Atkinson and Edward J. Milton 3 School of Geography University of Southampton Southampton SO17 3AT
More informationGeostatistics: Kriging
Geostatistics: Kriging 8.10.2015 Konetekniikka 1, Otakaari 4, 150 10-12 Rangsima Sunila, D.Sc. Background What is Geostatitics Concepts Variogram: experimental, theoretical Anisotropy, Isotropy Lag, Sill,
More informationCovariance function estimation in Gaussian process regression
Covariance function estimation in Gaussian process regression François Bachoc Department of Statistics and Operations Research, University of Vienna WU Research Seminar - May 2015 François Bachoc Gaussian
More informationSpatial Analysis II. Spatial data analysis Spatial analysis and inference
Spatial Analysis II Spatial data analysis Spatial analysis and inference Roadmap Spatial Analysis I Outline: What is spatial analysis? Spatial Joins Step 1: Analysis of attributes Step 2: Preparing for
More information11/8/2018. Spatial Interpolation & Geostatistics. Kriging Step 1
(Z i Z j ) 2 / 2 (Z i Zj) 2 / 2 Semivariance y 11/8/2018 Spatial Interpolation & Geostatistics Kriging Step 1 Describe spatial variation with Semivariogram Lag Distance between pairs of points Lag Mean
More informationAcceptable Ergodic Fluctuations and Simulation of Skewed Distributions
Acceptable Ergodic Fluctuations and Simulation of Skewed Distributions Oy Leuangthong, Jason McLennan and Clayton V. Deutsch Centre for Computational Geostatistics Department of Civil & Environmental Engineering
More informationIf we want to analyze experimental or simulated data we might encounter the following tasks:
Chapter 1 Introduction If we want to analyze experimental or simulated data we might encounter the following tasks: Characterization of the source of the signal and diagnosis Studying dependencies Prediction
More informationA Geostatistical Approach to Predict the Average Annual Rainfall of Bangladesh
Journal of Data Science 14(2016), 149-166 A Geostatistical Approach to Predict the Average Annual Rainfall of Bangladesh Mohammad Samsul Alam 1 and Syed Shahadat Hossain 1 1 Institute of Statistical Research
More informationSpatial and Environmental Statistics
Spatial and Environmental Statistics Dale Zimmerman Department of Statistics and Actuarial Science University of Iowa January 17, 2019 Dale Zimmerman (UIOWA) Spatial and Environmental Statistics January
More informationRegression revisited (again)
by I. Clark* Synopsis One of the seminal pioneering papers in reserve evaluation was published by Danie Krige in 1951. In that paper he introduced the concept of regression techniques in providing better
More informationSampling of variables with proportional effect
Sampling of variables with proportional effect Samuel Canchaya, R&D Senior Geologist, Cía. de Minas Buenaventura SAA, Lima, Peru, samuel.canchaya@buenaventura.pe ABSTRACT According to the theory of regionalized
More informationOFTEN we need to be able to integrate point attribute information
ALLAN A NIELSEN: GEOSTATISTICS AND ANALYSIS OF SPATIAL DATA 1 Geostatistics and Analysis of Spatial Data Allan A Nielsen Abstract This note deals with geostatistical measures for spatial correlation, namely
More informationREML Estimation and Linear Mixed Models 4. Geostatistics and linear mixed models for spatial data
REML Estimation and Linear Mixed Models 4. Geostatistics and linear mixed models for spatial data Sue Welham Rothamsted Research Harpenden UK AL5 2JQ December 1, 2008 1 We will start by reviewing the principles
More informationA C O M P AR I S O N O F O R D I N AR Y AN D S I M P L E L I M B O F T H E B U S H V E L D C O M P L E X
A C O M P AR I S O N O F O R D I N AR Y AN D S I M P L E K R I G I N G O N A P G E R E S O U R C E I N T H E E AS T E R N L I M B O F T H E B U S H V E L D C O M P L E X Mbalenhle Mpanza A research report
More informationSpatial Interpolation & Geostatistics
(Z i Z j ) 2 / 2 Spatial Interpolation & Geostatistics Lag Lag Mean Distance between pairs of points 1 y Kriging Step 1 Describe spatial variation with Semivariogram (Z i Z j ) 2 / 2 Point cloud Map 3
More information1/sqrt(B) convergence 1/B convergence B
The Error Coding Method and PICTs Gareth James and Trevor Hastie Department of Statistics, Stanford University March 29, 1998 Abstract A new family of plug-in classication techniques has recently been
More informationSpatial Correlation of Acceleration Spectral Ordinates from European Data
Spatial Correlation of Acceleration Spectral Ordinates from European Data S. Esposito & I. Iervolino Università degli Studi di Napoli Federico II, Italy SUMMARY Spatial correlation of spectral acceleration
More informationDiagnostic tools for random eects in the repeated measures growth curve model
Computational Statistics & Data Analysis 33 (2000) 79 100 www.elsevier.com/locate/csda Diagnostic tools for random eects in the repeated measures growth curve model P.J. Lindsey, J.K. Lindsey Biostatistics,
More informationHandbook of Spatial Statistics Chapter 2: Continuous Parameter Stochastic Process Theory by Gneiting and Guttorp
Handbook of Spatial Statistics Chapter 2: Continuous Parameter Stochastic Process Theory by Gneiting and Guttorp Marcela Alfaro Córdoba August 25, 2016 NCSU Department of Statistics Continuous Parameter
More informationTaylor series based nite dierence approximations of higher-degree derivatives
Journal of Computational and Applied Mathematics 54 (3) 5 4 www.elsevier.com/locate/cam Taylor series based nite dierence approximations of higher-degree derivatives Ishtiaq Rasool Khan a;b;, Ryoji Ohba
More informationLecture 24: Weighted and Generalized Least Squares
Lecture 24: Weighted and Generalized Least Squares 1 Weighted Least Squares When we use ordinary least squares to estimate linear regression, we minimize the mean squared error: MSE(b) = 1 n (Y i X i β)
More informationAn Introduction to Pattern Statistics
An Introduction to Pattern Statistics Nearest Neighbors The CSR hypothesis Clark/Evans and modification Cuzick and Edwards and controls All events k function Weighted k function Comparative k functions
More informationI don t have much to say here: data are often sampled this way but we more typically model them in continuous space, or on a graph
Spatial analysis Huge topic! Key references Diggle (point patterns); Cressie (everything); Diggle and Ribeiro (geostatistics); Dormann et al (GLMMs for species presence/abundance); Haining; (Pinheiro and
More informationENVIRONMENTAL MONITORING Vol. II - Geostatistical Analysis of Monitoring Data - Mark Dowdall, John O Dea GEOSTATISTICAL ANALYSIS OF MONITORING DATA
GEOSTATISTICAL ANALYSIS OF MONITORING DATA Mark Dowdall Norwegian Radiation Protection Authority, Environmental Protection Unit, Polar Environmental Centre, Tromso, Norway John O Dea Institute of Technology,
More informationNon-gaussian spatiotemporal modeling
Dec, 2008 1/ 37 Non-gaussian spatiotemporal modeling Thais C O da Fonseca Joint work with Prof Mark F J Steel Department of Statistics University of Warwick Dec, 2008 Dec, 2008 2/ 37 1 Introduction Motivation
More informationPoint-Referenced Data Models
Point-Referenced Data Models Jamie Monogan University of Georgia Spring 2013 Jamie Monogan (UGA) Point-Referenced Data Models Spring 2013 1 / 19 Objectives By the end of these meetings, participants should
More informationBusiness Statistics. Tommaso Proietti. Linear Regression. DEF - Università di Roma 'Tor Vergata'
Business Statistics Tommaso Proietti DEF - Università di Roma 'Tor Vergata' Linear Regression Specication Let Y be a univariate quantitative response variable. We model Y as follows: Y = f(x) + ε where
More informationImproving Spatial Data Interoperability
Improving Spatial Data Interoperability A Framework for Geostatistical Support-To To-Support Interpolation Michael F. Goodchild, Phaedon C. Kyriakidis, Philipp Schneider, Matt Rice, Qingfeng Guan, Jordan
More informationRegression Revisited (again) Isobel Clark Geostokos Limited, Scotland. Abstract
Regression Revisited (again) Isobel Clark Geostokos Limited, Scotland Abstract One of the seminal pioneering papers in reserve evaluation was published by Danie Krige in 1951. In this paper he introduced
More informationExploring the World of Ordinary Kriging. Dennis J. J. Walvoort. Wageningen University & Research Center Wageningen, The Netherlands
Exploring the World of Ordinary Kriging Wageningen University & Research Center Wageningen, The Netherlands July 2004 (version 0.2) What is? What is it about? Potential Users a computer program for exploring
More informationStatistics for analyzing and modeling precipitation isotope ratios in IsoMAP
Statistics for analyzing and modeling precipitation isotope ratios in IsoMAP The IsoMAP uses the multiple linear regression and geostatistical methods to analyze isotope data Suppose the response variable
More informationA robust statistically based approach to estimating the probability of contamination occurring between sampling locations
A robust statistically based approach to estimating the probability of contamination occurring between sampling locations Peter Beck Principal Environmental Scientist Image placeholder Image placeholder
More informationApproximating likelihoods for large spatial data sets
Approximating likelihoods for large spatial data sets By Michael Stein, Zhiyi Chi, and Leah Welty Jessi Cisewski April 28, 2009 1 Introduction Given a set of spatial data, often the desire is to estimate
More informationSimple Regression Model Setup Estimation Inference Prediction. Model Diagnostic. Multiple Regression. Model Setup and Estimation.
Statistical Computation Math 475 Jimin Ding Department of Mathematics Washington University in St. Louis www.math.wustl.edu/ jmding/math475/index.html October 10, 2013 Ridge Part IV October 10, 2013 1
More informationp(z)
Chapter Statistics. Introduction This lecture is a quick review of basic statistical concepts; probabilities, mean, variance, covariance, correlation, linear regression, probability density functions and
More informationDale L. Zimmerman Department of Statistics and Actuarial Science, University of Iowa, USA
SPATIAL STATISTICS Dale L. Zimmerman Department of Statistics and Actuarial Science, University of Iowa, USA Keywords: Geostatistics, Isotropy, Kriging, Lattice Data, Spatial point patterns, Stationarity
More informationENGRG Introduction to GIS
ENGRG 59910 Introduction to GIS Michael Piasecki October 13, 2017 Lecture 06: Spatial Analysis Outline Today Concepts What is spatial interpolation Why is necessary Sample of interpolation (size and pattern)
More informationThe Proportional Effect of Spatial Variables
The Proportional Effect of Spatial Variables J. G. Manchuk, O. Leuangthong and C. V. Deutsch Centre for Computational Geostatistics, Department of Civil and Environmental Engineering University of Alberta
More informationFluvial Variography: Characterizing Spatial Dependence on Stream Networks. Dale Zimmerman University of Iowa (joint work with Jay Ver Hoef, NOAA)
Fluvial Variography: Characterizing Spatial Dependence on Stream Networks Dale Zimmerman University of Iowa (joint work with Jay Ver Hoef, NOAA) March 5, 2015 Stream network data Flow Legend o 4.40-5.80
More informationSpatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields
Spatial statistics, addition to Part I. Parameter estimation and kriging for Gaussian random fields 1 Introduction Jo Eidsvik Department of Mathematical Sciences, NTNU, Norway. (joeid@math.ntnu.no) February
More informationEconometrics I. Ricardo Mora
Econometrics I Department of Economics Universidad Carlos III de Madrid Master in Industrial Economics and Markets Outline Motivation 1 Motivation 2 3 4 Motivation The Analogy Principle The () is a framework
More informationChapter 4 - Fundamentals of spatial processes Lecture notes
Chapter 4 - Fundamentals of spatial processes Lecture notes Geir Storvik January 21, 2013 STK4150 - Intro 2 Spatial processes Typically correlation between nearby sites Mostly positive correlation Negative
More informationMohsen Pourahmadi. 1. A sampling theorem for multivariate stationary processes. J. of Multivariate Analysis, Vol. 13, No. 1 (1983),
Mohsen Pourahmadi PUBLICATIONS Books and Editorial Activities: 1. Foundations of Time Series Analysis and Prediction Theory, John Wiley, 2001. 2. Computing Science and Statistics, 31, 2000, the Proceedings
More informationHamid Mohebzadeh. Department of Water Engineering, Faculty of Agriculture, Bu-Ali Sina University, Hamedan, Iran
American-Eurasian J. Agric. & Environ. Sci., 8 (): 64-76, 08 ISSN 88-6769 IDOSI Publications, 08 DOI: 0.589/idosi.aejaes.08.64.76 Comparison of Methods for Fitting the Theoretical Variogram to the Experimental
More informationEmpirical Bayesian Kriging
Empirical Bayesian Kriging Implemented in ArcGIS Geostatistical Analyst By Konstantin Krivoruchko, Senior Research Associate, Software Development Team, Esri Obtaining reliable environmental measurements
More informationLOGNORMAL ORDINARY KRIGING METAMODEL
LOGNORMAL ORDINARY KRIGING METAMODEL Muzaffer Balaban Turkish Statistical Institute & Department of Industrial Engineering Başkent University Ankara, Turkey balabanmuzaffer@gmail.com Berna Dengiz Department
More informationLecture 9: Introduction to Kriging
Lecture 9: Introduction to Kriging Math 586 Beginning remarks Kriging is a commonly used method of interpolation (prediction) for spatial data. The data are a set of observations of some variable(s) of
More information[km]
SEGMENTATION INDEPENDENT ESTIMATES OF TURBULENCE PARAMETERS G. C. Papanicolaou a, K. Solna b and D. Washburn c adepartment of Mathematics, Stanford University, Stanford, CA 9435 bdepartment of Mathematics,
More informationBAYESIAN MODEL FOR SPATIAL DEPENDANCE AND PREDICTION OF TUBERCULOSIS
BAYESIAN MODEL FOR SPATIAL DEPENDANCE AND PREDICTION OF TUBERCULOSIS Srinivasan R and Venkatesan P Dept. of Statistics, National Institute for Research Tuberculosis, (Indian Council of Medical Research),
More informationA Parametric Spatial Bootstrap
A Parametric Spatial Bootstrap Liansheng Tang a ; William R. Schucany b ; Wayne A. Woodward b ; and Richard F. Gunst b July 17, 2006 a Department of Biostatistics, University of Washington, P.O. Box 357232,
More informationEstimation of the transmissmty of the Santiago aquifer, Chile, using different geostatistical methods
Groundwater Management: Quantity and Quality (Proceedings of the Benidorm Symposium, October 1989). IAHS Publ. no. 188,1989. Estimation of the transmissmty of the Santiago aquifer, Chile, using different
More informationPruscha: Semiparametric Estimation in Regression Models for Point Processes based on One Realization
Pruscha: Semiparametric Estimation in Regression Models for Point Processes based on One Realization Sonderforschungsbereich 386, Paper 66 (1997) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner
More informationA full scale, non stationary approach for the kriging of large spatio(-temporal) datasets
A full scale, non stationary approach for the kriging of large spatio(-temporal) datasets Thomas Romary, Nicolas Desassis & Francky Fouedjio Mines ParisTech Centre de Géosciences, Equipe Géostatistique
More informationManuscript of paper for APCOM 2003.
1 Manuscript of paper for APCOM 2003. AN ANALYSIS OF THE PRACTICAL AND ECONOMIC IMPLICATIONS OF SYSTEMATIC UNDERGROUND DRILLING IN DEEP SOUTH AFRICAN GOLD MINES W. ASSIBEY-BONSU Consultant: Geostatistics
More informationYear
Variance Estimation in Spatial Regression Using a Nonparametric Semivariogram Based on Residuals Hyon-Jung Kim and Dennis D. Boos 1 Abstract The empirical semivariogram of residuals from a regression model
More information