Spatial Data Mining. Regression and Classification Techniques

Similar documents
Lecture 5 Geostatistics

Interpolating Raster Surfaces

Types of spatial data. The Nature of Geographic Data. Types of spatial data. Spatial Autocorrelation. Continuous spatial data: geostatistics

ENGRG Introduction to GIS

An Introduction to Spatial Autocorrelation and Kriging

Nature of Spatial Data. Outline. Spatial Is Special

Lecture 8. Spatial Estimation

I don t have much to say here: data are often sampled this way but we more typically model them in continuous space, or on a graph

Spatial analysis. Spatial descriptive analysis. Spatial inferential analysis:

Copyright The McGraw-Hill Companies, Inc. Permission required for reproduction or display.

Types of Spatial Data

11/8/2018. Spatial Interpolation & Geostatistics. Kriging Step 1

Introduction to Spatial Statistics and Modeling for Regional Analysis

Lecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad

On dealing with spatially correlated residuals in remote sensing and GIS

Luc Anselin Spatial Analysis Laboratory Dept. Agricultural and Consumer Economics University of Illinois, Urbana-Champaign

Lecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad

An Introduction to Pattern Statistics

Spatial Interpolation & Geostatistics

SPACE Workshop NSF NCGIA CSISS UCGIS SDSU. Aldstadt, Getis, Jankowski, Rey, Weeks SDSU F. Goodchild, M. Goodchild, Janelle, Rebich UCSB

Chapter 1. Summer School GEOSTAT 2014, Spatio-Temporal Geostatistics,

Temporal vs. Spatial Data

Spatial Regression. 1. Introduction and Review. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

Introduction to Geostatistics

Spatial Analysis 1. Introduction

Michael Harrigan Office hours: Fridays 2:00-4:00pm Holden Hall

Spatial Autocorrelation

Geostatistics: Kriging

Introduction. Semivariogram Cloud

7 Geostatistics. Figure 7.1 Focus of geostatistics

A GEOSTATISTICAL APPROACH TO PREDICTING A PHYSICAL VARIABLE THROUGH A CONTINUOUS SURFACE

PRODUCING PROBABILITY MAPS TO ASSESS RISK OF EXCEEDING CRITICAL THRESHOLD VALUE OF SOIL EC USING GEOSTATISTICAL APPROACH

Soil Moisture Modeling using Geostatistical Techniques at the O Neal Ecological Reserve, Idaho

Exploring the World of Ordinary Kriging. Dennis J. J. Walvoort. Wageningen University & Research Center Wageningen, The Netherlands

Exploratory Spatial Data Analysis (ESDA)

Kriging Luc Anselin, All Rights Reserved

BAYESIAN MODEL FOR SPATIAL DEPENDANCE AND PREDICTION OF TUBERCULOSIS

Lab #3 Background Material Quantifying Point and Gradient Patterns

KAAF- GE_Notes GIS APPLICATIONS LECTURE 3

Concepts and Applications of Kriging. Eric Krause

Finding Hot Spots in ArcGIS Online: Minimizing the Subjectivity of Visual Analysis. Nicholas M. Giner Esri Parrish S.

Optimal Interpolation

Spatial Data Analysis in Archaeology Anthropology 589b. Kriging Artifact Density Surfaces in ArcGIS

Propagation of Errors in Spatial Analysis

Spatial Analysis 2. Spatial Autocorrelation

Global Spatial Autocorrelation Clustering

Spatial Analysis II. Spatial data analysis Spatial analysis and inference

SPATIAL-TEMPORAL TECHNIQUES FOR PREDICTION AND COMPRESSION OF SOIL FERTILITY DATA

GIST 4302/5302: Spatial Analysis and Modeling Lecture 2: Review of Map Projections and Intro to Spatial Analysis

Spatial Statistics For Real Estate Data 1

The Study of Soil Fertility Spatial Variation Feature Based on GIS and Data Mining *

GIST 4302/5302: Spatial Analysis and Modeling

Transiogram: A spatial relationship measure for categorical data

What s special about spatial data?

Areal data. Infant mortality, Auckland NZ districts. Number of plant species in 20cm x 20 cm patches of alpine tundra. Wheat yield

Outline ESDA. Exploratory Spatial Data Analysis ESDA. Luc Anselin

Performance Analysis of Some Machine Learning Algorithms for Regression Under Varying Spatial Autocorrelation

Influence of parameter estimation uncertainty in Kriging: Part 2 Test and case study applications

Spatial analysis of a designed experiment. Uniformity trials. Blocking

CREATION OF DEM BY KRIGING METHOD AND EVALUATION OF THE RESULTS

Geostatistics in Hydrology: Kriging interpolation

Index. Geostatistics for Environmental Scientists, 2nd Edition R. Webster and M. A. Oliver 2007 John Wiley & Sons, Ltd. ISBN:

Geog 210C Spring 2011 Lab 6. Geostatistics in ArcMap

Modeling Spatial Relationships Using Regression Analysis. Lauren M. Scott, PhD Lauren Rosenshein Bennett, MS

Representation of Geographic Data

Geostatistical Interpolation: Kriging and the Fukushima Data. Erik Hoel Colligium Ramazzini October 30, 2011

Lecture 1: Introduction to Spatial Econometric

Creating and Managing a W Matrix

POPULAR CARTOGRAPHIC AREAL INTERPOLATION METHODS VIEWED FROM A GEOSTATISTICAL PERSPECTIVE

Data Break 8: Kriging the Meuse RiverBIOS 737 Spring 2004 p.1/27

Lecture 9: Introduction to Kriging

2/7/2018. Module 4. Spatial Statistics. Point Patterns: Nearest Neighbor. Spatial Statistics. Point Patterns: Nearest Neighbor

The Nature of Geographic Data

Concepts and Applications of Kriging. Eric Krause Konstantin Krivoruchko

Spatial and Environmental Statistics

Comparison of rainfall distribution method

IRON ORE 2015, July 2015, Perth Australia. Danny Kentwell

Spatial Statistics or Why Spatial is Special?

Gridding of precipitation and air temperature observations in Belgium. Michel Journée Royal Meteorological Institute of Belgium (RMI)

Sampling Scheme for 2003 General Social Survey of China

Performance of W-AMOEBA and W-Contiguity matrices in Spatial Lag Model

Chapter 4 - Fundamentals of spatial processes Lecture notes

Tricks to Creating a Resource Block Model. St John s, Newfoundland and Labrador November 4, 2015

Spatial analysis is the quantitative study of phenomena that are located in space.

Estimation of Parameters

Spatial Effects and Externalities

Investigation of Monthly Pan Evaporation in Turkey with Geostatistical Technique

Spatial Regression Modeling

Contents. Learning Outcomes 2012/2/26. Lecture 6: Area Pattern and Spatial Autocorrelation. Dr. Bo Wu

Overview of Statistical Analysis of Spatial Data

Analysis for Regional Differences and Influence Factor of Rural Income in China

Concepts and Applications of Kriging

The Comparative Analysis of Spatial Structure of Ji Wheat 22 Yield Based on Different Stochastic Samplings

Mapping Precipitation in Switzerland with Ordinary and Indicator Kriging

A Spatial Econometric Approach to Model the Growth of Tourism Flows to China Cities

Space-time data. Simple space-time analyses. PM10 in space. PM10 in time

University of California, Los Angeles Department of Statistics. Effect of variogram parameters on kriging weights

Roger S. Bivand Edzer J. Pebesma Virgilio Gömez-Rubio. Applied Spatial Data Analysis with R. 4:1 Springer

Universitat Autònoma de Barcelona Facultat de Filosofia i Lletres Departament de Prehistòria Doctorat en arqueologia prehistòrica

Spatial Backfitting of Roller Measurement Values from a Florida Test Bed

Transcription:

Spatial Data Mining Regression and Classification Techniques 1

Spatial Regression and Classisfication Discrete class labels (left) vs. continues quantities (right) measured at locations (2D for geographic applications) Build a model for predicting the measured quantity at any location Additional (to spatial) attributes may exist 2

Geostatistics Analysis and inference of continuously-distributed variables Analysis: Describing the spatial variability of the phenomenon under study Inference: Estimating the unknown values Questions on measurements: How are they distributed? How are they related to each other? How can I infer a distribution from one sample? Water Availabilty Index Estimated Surface Estimated Uncertainty 3

Spatial continuity and stationarity Why prediction is possible? Continuity: Spatial close measurements are more similar than distant ones What does it mean? Model the underlying phenomenon with the model f(x,w), x the location vector and w the measurement If not just noise, then continuity creates smoothness of w values that can be modeled by f(x,w) Can all locations be modeled by a single f(x,w)? Stationarity: Measurements generated by a single distribution at all locations 4

Spatial Autocorrelation Continuity produces autocorrelation: correlation of a variable with itself through space First law of geography: everything is related to everything else, but near things are more related than distant things Waldo Tobler 3 possible cases: If nearby or neighboring areas are more alike, this is positive spatial autocorrelation Negative autocorrelation describes patterns in which neighboring areas are unlike Random patterns exhibit no spatial autocorrelation

Why to bother about spatial autocorrelation? Most statistics/data mining methods are based on the assumption that the values of observations in each sample are independent of one another Positive spatial autocorrelation may violate this, if the samples were taken from nearby areas Spatial Autocorrelation is a kind of redundancy: the measurement at a location constrains, or makes more probable, the measurement in a neighboring location Models will be biased, since measurements tend to be concentrated and there are actually fewer number of independent observations than are being assumed 6

Measures of autocorrelation Objectives: Measure the strength of spatial autocorrelation Test the assumption of independence or randomness Measures Moran s I Variograms other (Geary s C, Ripley s K) 7

Moran s I: A measure of spatial autocorrelation Compares the value of the variable at any one location with the value at all other locations Similar to correlation coefficient, it varies between 1.0 and + 1.0 When autocorrelation is high, the coefficient is high A high I value indicates positive autocorrelation 8

Symbols and Contiguity matrix N is the number of cases X i is the variable value at location i X j is the variable value at location j \bar{x} is the mean of the variable W ij is a weight applied to the comparison between location i and location j W ij is a contiguity matrix If location j is adjacent to zone i, the interaction receives a weight of 1 Another option is to make W ij a distance-based weight which is the inverse distance between locations I and j (1/d ij ) 9

Example: Per Capita Income in Monroe County Actual values: Moran s I: 0.66 Random values: Moran s I: 0.01 10

Local Moran s I Following Anselin s (1995) definition, a local Moran s I i may be defined as: I i = z s i 2 j w z i s are the deviations from the mean of y i s ij z j, i j 89 71 52 85 75 63 51 61 64 I 75 55.82 675.32 [ 71+ 85 + 61+ 63 4 55.82] 1. 61 75 = = 11

Global vs. Local Moran s I: example Spatial pattern detection in China s provincial development The variable used: per capita GDP Dynamic patterns global Moran s I Specific local spatial process local Moran s I and the Moran s scatterplot 12

China: per capita GDP in 1978 Western Region Central Region Eastern Region Yuan 175-291 292-430 0 250 500 1,000 Miles 0 500 1,000 2,000 Kilometers 431-680 681-1290 1291-249813

China: per capita GDP in 2000 Western Region Central Region Eastern Region Yuan 869-1913 1914-3162 0 250 500 1,000 Miles 0 500 1,000 2,000 Kilometers 3163-4532 4533-8411 8412-15593 14

Global vs. Local Moran s I: example There is a clustering trend in China s provincial level development (represented by per capita GDP But the global Moran s I can t tell on which side does the clustering trend take place Global Moran's I 0.25 0.2 0.15 0.1 0.05 0 2000 1999 1998 1997 1996 1995 1994 1993 1992 1991 1990 1989 1988 1987 1986 1985 1984 1983 1982 1981 1980 1979 1978 Year 15

Local Moran s I in 1978 Western Region Central Region Eastern Region 0 250 500 1,000 Miles Local Moran's I < - 0.3-0.3-0 0-0.3 0.3-1.0 0 500 1,000 2,000 Kilometers > 1.0 16

Local Moran s I in 2000 Western Region Central Region Eastern Region 0 250 500 1,000 Miles Local Moran's I - 0.3-0 0-0.3 0.3-1.0 0 500 1,000 2,000 Kilometers > 1.0 17

More details to the Chine GDP example First, China s coast-interior divide persisted Interior provinces exhibit great geographical similarity in economic development and spatial contributions to the global Moran s I Second, the municipalities (Beijing, Tianjin, Shanghai) always contribute the most Shanghai s position is worth noting, it development changed the spatial pattern the most Third, Guangdong s contribution to the global index corresponds with its changing spatial behavior depicted in the Moran scatterplot Fourth, while most of the interior provinces have similar patterns, coastal provinces vary greatly Fifth, Shandong fell into the low-low quadrant, and contributed very little to the global index Sixth, Guizhou and Yunnan, two provinces in southwest China, contributed relatively highly to the global index in 2000 The poorest ones tend to form a poor cluster 18

Variograms Analyse the observed variation in data values by distance bands using a spatial autocorrelation-like measure, γ: Semivariance measure is most often used: d = h+δ/2 ij 1 ˆ( γ h) = ( zi zj) 2 Nh ( ) d ij = h Δ/2 2 Bands have width Δ. N(h) is the number of pairs in the band with mid-point distance h After building an experimental variogram, we need to fit a theoretical function in order to model the spatial variation 19

Variograms Smallest observed separation Fitted curve sill Average semivariance for band 4 C 1 =C 0 +C (structural variance) Range, A 0 model C 0 Nugget Lag (distance band) 20

Variograms Model Formula (Theoretical Fit) Notes Nugget effect Linear Exponential Exp() γ( 0) = C 0 γ( h) = C1 ( h) γ( h) = C1 1 kh ( e ) Simple constant. May be added to all models. Models with a nugget will not be exact No sill. Often used in combination with other functions. May be used as a ramp, with a constant sill value set at a range, a k is a constant, often k=1 or k=3. Useful when there is a larger nugget and slow rise to the sill Spherical Sph() γ ( h) γ ( h) = = C C 1 1 3h 2, h 1 1 2 h 3, h < 1 Useful when the nugget effect is important but small. Given as the default model in some packages. 21

Approaches to spatial prediction Value of the variable is predicted from nearby samples Example: concentrations of soil constituents (e.g. salts, pollutants) Example: vegetation density Each interpolator has its own assumptions: Nearest neighbor and variations: Average within a radius Average of n nearest neighbors Distance-weighted average within a radius Distance-weighted average of n nearest neighbours Optimal weighting -> Kriging 22

Nearest Neighbor Methods k-nn Classification: assign the class label of the majority of the k-nn k-nn Regression: assign the mean value of the k-nn 1-NN: Voronoi Diagram A common weighting scheme is to give each neighbor a weight of 1/d, where d is the distance to the neighbor 23

Nearest Neighbor Methods Pros: Simple, no training (lazy) Benchmark: E1-NN <= 2 EB Often as good as more sophisticated methods Per-se considerations of autocorrelation Cons: Slow classification (lazy) Prone to noise High-variance Need to determine k Cross validation Need to determine weights (for variations) 24

Nearest Neighbor Methods When no spatial autocorrelation (random data): CV (LOO) error is maximized for 1-NN: 25 ( ) ( ) ( ) ) ( 2 2 1 1 ) ( ) ( ) ( 2 1,1 2 1 1, Z Var Var N Z Z N E X Z X X f Z N j NN j j N j NN j j = + = = = ε ε ε ε ε

Nearest Neighbor Methods Random data No minimum occurs Spatial autocorrelation Minimum occurs 26

Nearest Neighbor Methods Bias-Variance decomposition: original k=3 Err k 2 2 1 σ x0 ) = σ ε + f ( x0) f ( xn) + k n= 1 k ( ε 2 Bias 2 Variance k=1 (hi var) k=30 (hi bias) 27

Optimal Weighting : Kriging Characteristics of optimality : Prediction is made as a linear combination of known data values (a weighted average) Points closer to the point to be predicted have larger weight Prediction is unbiased and exact at known points Error estimate is based only on the sample configuration, not the data values Prediction error should be as small as possible Why optimal and not optimal? optimal with respect to the chosen model! 28

Overview of Kriging 1. Sample, preferably at different resolutions 2. Calculate the experimental variogram 3. Model the variogram with one or more authorized functions 4. Apply the kriging system, with the variogram model of spatial dependence, at each point to be predicted Predictions are often at each point on a regular grid (e.g. a raster map) 5. Calculate the error of each prediction; this is based only on the sample point locations, no their data values. 29

Ordinary Kriging (OK) In OK, we model the value of variable z at location s i as the sum of a regional mean m and a spatially-correlated random component e(s i ): Z (s i ) = m +e (s i ) The regional mean m is estimated from the sample, but not as the simple average, because there is spatial dependence It is implicit in the OK system 30

Ordinary Kriging: Solution 31

Kriging usage Supported by many GIS http://faculty.washington.edu/mlog/teaching/geos tats/labs/arcwizzard/wizzard_demo.shtml But be aware of polemics between classic statistics vs. geostatistics spatial dependence may be assumed or be verified? Kriging in scandal: Spatial dependence between borehole grades or blasthole grades was assumed at Bre-X's Busang property More details: http://en.wikipedia.org/wiki/kriging#controversy_in _climate_change.2c_mineral_exploration.2c_and_ mining http://www.geostatscam.com/ 32