Types of Spatial Data

Similar documents
Lab #3 Background Material Quantifying Point and Gradient Patterns

I don t have much to say here: data are often sampled this way but we more typically model them in continuous space, or on a graph

11/8/2018. Spatial Interpolation & Geostatistics. Kriging Step 1

Spatial Data Mining. Regression and Classification Techniques

Introduction. Semivariogram Cloud

Spatial Interpolation & Geostatistics

Improving Spatial Data Interoperability

Index. Geostatistics for Environmental Scientists, 2nd Edition R. Webster and M. A. Oliver 2007 John Wiley & Sons, Ltd. ISBN:

An Introduction to Pattern Statistics

PRODUCING PROBABILITY MAPS TO ASSESS RISK OF EXCEEDING CRITICAL THRESHOLD VALUE OF SOIL EC USING GEOSTATISTICAL APPROACH

Introduction to Geostatistics

Basics of Point-Referenced Data Models

Areal data. Infant mortality, Auckland NZ districts. Number of plant species in 20cm x 20 cm patches of alpine tundra. Wheat yield

Lecture 5 Geostatistics

Point-Referenced Data Models

Introduction to Spatial Data and Models

Spatial Analysis II. Spatial data analysis Spatial analysis and inference

Introduction to Spatial Data and Models

Soil Moisture Modeling using Geostatistical Techniques at the O Neal Ecological Reserve, Idaho

Models for spatial data (cont d) Types of spatial data. Types of spatial data (cont d) Hierarchical models for spatial data

Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display.

Spatial Backfitting of Roller Measurement Values from a Florida Test Bed

Spatial analysis is the quantitative study of phenomena that are located in space.

Multivariate Geostatistics

Geostatistics: Kriging

7 Geostatistics. Figure 7.1 Focus of geostatistics

GIST 4302/5302: Spatial Analysis and Modeling

11. Kriging. ACE 492 SA - Spatial Analysis Fall 2003

Statistícal Methods for Spatial Data Analysis

Classic Time Series Analysis

Roger S. Bivand Edzer J. Pebesma Virgilio Gömez-Rubio. Applied Spatial Data Analysis with R. 4:1 Springer

Universitat Autònoma de Barcelona Facultat de Filosofia i Lletres Departament de Prehistòria Doctorat en arqueologia prehistòrica

Spatial and Environmental Statistics

ENGRG Introduction to GIS

A GEOSTATISTICAL APPROACH TO PREDICTING A PHYSICAL VARIABLE THROUGH A CONTINUOUS SURFACE

Gridding of precipitation and air temperature observations in Belgium. Michel Journée Royal Meteorological Institute of Belgium (RMI)

Chapter 4 - Fundamentals of spatial processes Lecture notes

Intensity Analysis of Spatial Point Patterns Geog 210C Introduction to Spatial Data Analysis

GEOSTATISTICS. Dr. Spyros Fountas

Assessing the covariance function in geostatistics

Spatial Statistics or Why Spatial is Special?

Spatial Analysis 1. Introduction

Report on Kriging in Interpolation

OFTEN we need to be able to integrate point attribute information

Kriging Luc Anselin, All Rights Reserved

Spatial Data Analysis in Archaeology Anthropology 589b. Kriging Artifact Density Surfaces in ArcGIS

SUT WA Research Night

Geostatistical Interpolation: Kriging and the Fukushima Data. Erik Hoel Colligium Ramazzini October 30, 2011

The Matrix Reloaded: Computations for large spatial data sets

Areal Unit Data Regular or Irregular Grids or Lattices Large Point-referenced Datasets

Concepts and Applications of Kriging. Eric Krause

Lecture 8. Spatial Estimation

Performance Analysis of Some Machine Learning Algorithms for Regression Under Varying Spatial Autocorrelation

Data Break 8: Kriging the Meuse RiverBIOS 737 Spring 2004 p.1/27

A robust statistically based approach to estimating the probability of contamination occurring between sampling locations

Geostatistics for Gaussian processes

Extent of Radiological Contamination in Soil at Four Sites near the Fukushima Daiichi Power Plant, Japan (ArcGIS)

Spatial analysis. 0 move the objects and the results change

Space-time data. Simple space-time analyses. PM10 in space. PM10 in time

Investigation of Monthly Pan Evaporation in Turkey with Geostatistical Technique

Exploring the World of Ordinary Kriging. Dennis J. J. Walvoort. Wageningen University & Research Center Wageningen, The Netherlands

Dale L. Zimmerman Department of Statistics and Actuarial Science, University of Iowa, USA

ENVIRONMENTAL MONITORING Vol. II - Geostatistical Analysis of Monitoring Data - Mark Dowdall, John O Dea GEOSTATISTICAL ANALYSIS OF MONITORING DATA

Spatial-Temporal Modeling of Active Layer Thickness

Concepts and Applications of Kriging. Eric Krause Konstantin Krivoruchko

Statistical Learning

Comparison of rainfall distribution method

On dealing with spatially correlated residuals in remote sensing and GIS

2.6 Two-dimensional continuous interpolation 3: Kriging - introduction to geostatistics. References - geostatistics. References geostatistics (cntd.

The Matrix Reloaded: Computations for large spatial data sets

An Introduction to Spatial Statistics. Chunfeng Huang Department of Statistics, Indiana University

Geog 210C Spring 2011 Lab 6. Geostatistics in ArcMap

REN R 690 Lab Geostatistics Lab

Lesson 6 Maximum Likelihood: Part III. (plus a quick bestiary of models)

Geostatistics in Hydrology: Kriging interpolation

Contents 1 Introduction 2 Statistical Tools and Concepts

University of California, Los Angeles Department of Statistics. Effect of variogram parameters on kriging weights

Fast kriging of large data sets with Gaussian Markov random fields

Luc Anselin Spatial Analysis Laboratory Dept. Agricultural and Consumer Economics University of Illinois, Urbana-Champaign

SPATIAL-TEMPORAL TECHNIQUES FOR PREDICTION AND COMPRESSION OF SOIL FERTILITY DATA

BAYESIAN MODEL FOR SPATIAL DEPENDANCE AND PREDICTION OF TUBERCULOSIS

Beta-Binomial Kriging: An Improved Model for Spatial Rates

REML Estimation and Linear Mixed Models 4. Geostatistics and linear mixed models for spatial data

Spatial Analysis 2. Spatial Autocorrelation

ArcGIS for Geostatistical Analyst: An Introduction. Steve Lynch and Eric Krause Redlands, CA.

Lesson 5 Practice Problems

Kriging of spatial-temporal water vapor data

Uncertainty Quantification and Validation Using RAVEN. A. Alfonsi, C. Rabiti. Risk-Informed Safety Margin Characterization.

University of California, Los Angeles Department of Statistics. Universal kriging

ENVIRONMENTAL DATA ANALYSIS WILLIAM MENKE JOSHUA MENKE WITH MATLAB COPYRIGHT 2011 BY ELSEVIER, INC. ALL RIGHTS RESERVED.

Is there still room for new developments in geostatistics?

Practical 12: Geostatistics

Geostatistical Analyst for Deciding Optimal Interpolation Strategies for Delineating Compact Zones

An Introduction to Spatial Autocorrelation and Kriging

Feb 21 and 25: Local weighted least squares: Quadratic loess smoother

The minimisation gives a set of linear equations for optimal weights w:

Propagation of Errors in Spatial Analysis

Chapter 1. Summer School GEOSTAT 2014, Spatio-Temporal Geostatistics,

GRAD6/8104; INES 8090 Spatial Statistic Spring 2017

Umeå University Sara Sjöstedt-de Luna Time series analysis and spatial statistics

Transcription:

Spatial Data

Types of Spatial Data Point pattern Point referenced geostatistical Block referenced Raster / lattice / grid Vector / polygon

Point Pattern Data Interested in the location of points, not their attributes Degree of aggregation

Ripley's K Calculates counts of points as a function of distance bins for each point Combine points together and normalize by area Positive = more points expected than random at that distance Negative = less than expected Intervals by bootstrap Requires def'n of area L d = A i=1 n n j=1, j 1 n n 1 k i, j

Ripley's K

library(spatial) Ripley's K in R ## load library ppregion(xmin,xmax,ymin,ymax) ## define region rk <- Kfn(x,max.distance) ## calculate Ripley's K plot(rk$x,rk$y-rk$x,type='l',xlab="d",ylab="l(d)") ##Plot as L(d) rather than K(d) ## compute and plot interval estimate Ke <- Kenvl(max.distance, nrep, Psim(n)) lines(ke$x,ke$upper-ke$x,lty=2,col="grey") lines(ke$x,ke$lower-ke$x,lty=2,col="grey")

Applications and Extensions Irregularly shaped areas Choice of points counted in each sum can vary with categorical attribute Tree maps Juvenile aggregated (dispersal) Intermediate random (DD mortality) Adults are over-dispersed (crown competition)

Point Referenced Data Data has a value/attribute plus spatial coordinates but not area Aka geospatial data Origin in mining Usually sampling some underlying continuum Aims: Account for lack of independence in data due to spatial proximity (analogous to time series) Predict the value at some new location (usually a grid / map)

Examples of Point Ref Data Soils Moisture, nutrients, ph, texture, etc. Atmospheric or Ocean measurement Surface meteorology (temperature, precip, etc.) CO2, pollutant concentration, salinity, etc. Plot data were size of plot << size of domain Biomass/abundance, presence/absence, richness Invasive species, disease prevalence, etc.

Geospatial Exploratory Analyses Smoothing & Detrending Autocorrelation Interpolation Linear Inverse distance weighed Geostatistical (Kriging) Many packages in R, will focus on most basic & built in

Smoothing / Detrending Objective: Like with time-series, most statistical methods assume stationarity More complicated in 2D (sparse, irregular) Polynomial (in R, library(spatial) ) Fit surface: surf.ls(degree, x, y, z) Project: trmat(surf.obj, xmin, xmax, ymin, ymax, n) Plot: image(tr.obj)

Degree 0 Degree 1 Degree 2

Spatial autocorrelation correlogram(surf.ls,nbin)

NULL model interval estimate by nonparametric bootstrap

Variogram Traditionally, autocorrelation in geostatistics has been expressed in terms of a variogram or semivariogram N d Units = variance d = 1 N d i, j d Z i Z j 2 Sill = asymptote Range = distance to asymptote Nugget = variance at lag 0

variogram(surf.ls,nbin)

Spatial Covariance If C(d) is the spatial covariance C d =COV [Z x, Z x d ] Autocorrelation : Variogram : d =C d /C 0 d =C 0 C d

Interpolation Objective: predict Z at some new point(s) Often on a grid to make a raster map Linear Simplest if data already on a grid (four corners)

Interpolation Bicubic interpolation: cubic analog to bilinear Nearest-Neighbor: Tesselation Voronoi Diagram Triangular irregular network (TIN)

Inverse-Distance Weighted Previous methods only used nearest points All are special cases of a weighted average For irregular, often want to use n-nearest points or a fixed search radius (variable number of points) Requires a way of WEIGHTING points as a function of distance Inverse-distance weighted: W ij = 1/d ij Z i = Σ W ij Z j / Σ W ij

Spatial Weighted Averages Other alternatives to 1/d (e.g. 1/d 2 ) Major criticisms Choice of weighting function somewhat arbitrary, not connected to properties of the data Does not account for error in interpolation Points further from known points should be more uncertain Interpolation vs smoothing Interpolation always passes exactly though the data points (0 residuals) Smoothing separates trends + residuals

Kriging Interpolation based on autocorrelation fcn Requires fitting an autocorrelation model to the variogram or correlogram Provides weight to points based on observed relationship between distance and correlation Requires choice of parametric function Provides mechanism for estimating interpolation error

Variogram Models

AIC 0.0 20.5 5.2

Step 1: Fit variance model ##correlogram cg <- correlogram(data,nbin) ##fit covariance function expfit <- function(parm){ -sum(dnorm(cg$y, expcov(cg$x,parm[1]), parm[2],log=true)) } efit <- optim(ic,expfit) Built in function for exponential covariance N y f x, 2 log N y f x, 2

Step 2: Krige surface ##detrend accounting for covariance kr <- surf.gls(degree,expcov,data,d=efit$par[1],...) ## matrix prediction (Kriging) pr <- prmat(kr, xmin, xmax, ymin, ymax, n) image(pr) ## matrix error se <- semat(kr, xmin, xmax, ymin, ymax, n) contour(se3,add=true)

Anisotropy In addition to STATIONARITY (spatial covariance is the same at all locations), spatial models also assume ISOTROPY, that the spatial covariance is the same in all DIRECTIONS Calculate/fit variogram separately for different directions (angular bins) to account for anisotropy Increases # of parameters, less data points as bins get smaller

Flavors of Kriging Simple Kriging: mean = 0 Ordinary Kriging: mean = unknown µ Universial Kriging: mean = polynomial trend Cokriging: inclusion of covariates

Limitations of Kriging Assumes the variogram model is known Dropped parameter error Fitting of variogram model: Not done as part of overall model fit Not done on data directly Binned means of all n 2 pairwise differences Detrending and autocorr done separately Sometimes just want non-independence Similar to T.S., OK for EDA but ultimately want to fit whole model at once.