Points. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

Similar documents
Point Pattern Analysis

Spatial point processes

Spatial Analysis I. Spatial data analysis Spatial analysis and inference

GIST 4302/5302: Spatial Analysis and Modeling Point Pattern Analysis

Spatial point process. Odd Kolbjørnsen 13.March 2017

Intensity Analysis of Spatial Point Patterns Geog 210C Introduction to Spatial Data Analysis

Overview of Spatial analysis in ecology

Lecture 9 Point Processes

Intensity Analysis of Spatial Point Patterns Geog 210C Introduction to Spatial Data Analysis

Lab #3 Background Material Quantifying Point and Gradient Patterns

A Spatio-Temporal Point Process Model for Firemen Demand in Twente

Chapter 6 Spatial Analysis

Spatial Point Pattern Analysis

SPACE Workshop NSF NCGIA CSISS UCGIS SDSU. Aldstadt, Getis, Jankowski, Rey, Weeks SDSU F. Goodchild, M. Goodchild, Janelle, Rebich UCSB

Luc Anselin Spatial Analysis Laboratory Dept. Agricultural and Consumer Economics University of Illinois, Urbana-Champaign

Spatial Analysis 2. Spatial Autocorrelation

Lecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad

Global Spatial Autocorrelation Clustering

Lecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad

Resolution Limit of Image Analysis Algorithms

Rate Maps and Smoothing

Spatial Autocorrelation

Spatial Regression. 1. Introduction and Review. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

An adapted intensity estimator for linear networks with an application to modelling anti-social behaviour in an urban environment

Mapping and Analysis for Spatial Social Science

Test of Complete Spatial Randomness on Networks

CONDUCTING INFERENCE ON RIPLEY S K-FUNCTION OF SPATIAL POINT PROCESSES WITH APPLICATIONS

Lecture 10 Spatio-Temporal Point Processes

Summary STK 4150/9150

Interaction Analysis of Spatial Point Patterns

Spatial Clusters of Rates

This lab exercise will try to answer these questions using spatial statistics in a geographic information system (GIS) context.

Outline. Practical Point Pattern Analysis. David Harvey s Critiques. Peter Gould s Critiques. Global vs. Local. Problems of PPA in Real World

Spatial Point Pattern Analysis

Approaches for Cluster Analysis of Activity Locations along Streets: from Euclidean. Plane to Street Network Space. Yongmei Lu

Spatial Models: A Quick Overview Astrostatistics Summer School, 2017

Introduction to Spatial Statistics and Modeling for Regional Analysis

Detection of Clustering in Spatial Data

Multifractal portrayal of the distribution of the Swiss population

Spatial Analysis 1. Introduction

Ripley s K function. Philip M. Dixon Iowa State University,

Statistícal Methods for Spatial Data Analysis

Spatial Regression. 6. Specification Spatial Heterogeneity. Luc Anselin.

Outline. 15. Descriptive Summary, Design, and Inference. Descriptive summaries. Data mining. The centroid

Spatial Autocorrelation (2) Spatial Weights

An Introduction to Pattern Statistics

Nature of Spatial Data. Outline. Spatial Is Special

Construction Engineering. Research Laboratory. Approaches Towards the Identification of Patterns in Violent Events, Baghdad, Iraq ERDC/CERL CR-09-1

Comparison of spatial methods for measuring road accident hotspots : a case study of London

Spatial Models: A Quick Overview Astrostatistics Summer School, 2009

An introduction to spatial point processes

ENGRG Introduction to GIS

School on Modelling Tools and Capacity Building in Climate and Public Health April Point Event Analysis

Models to carry out inference vs. Models to mimic (spatio-temporal) systems 5/5/15

Cluster Analysis using SaTScan. Patrick DeLuca, M.A. APHEO 2007 Conference, Ottawa October 16 th, 2007

Second-order pseudo-stationary random fields and point processes on graphs and their edges

Estimating the long-term health impact of air pollution using spatial ecological studies. Duncan Lee

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

Guest Lecture: Steering in Games. the. gamedesigninitiative at cornell university

Econometrics I. Lecture 10: Nonparametric Estimation with Kernels. Paul T. Scott NYU Stern. Fall 2018

Using AMOEBA to Create a Spatial Weights Matrix and Identify Spatial Clusters, and a Comparison to Other Clustering Algorithms

Cluster Analysis using SaTScan

Chapter 3 METHODS. The primary goal of this section is to explore different methods in which populations of

An Assessment of Crime Forecasting Models

Monsuru Adepeju 1 and Andy Evans 2. School of Geography, University of Leeds, LS21 1HB 1

Bayesian Hierarchical Models

Lecture 4. Spatial Statistics

I don t have much to say here: data are often sampled this way but we more typically model them in continuous space, or on a graph

Spatial clustering of traffic accidents using distances along the network

Urban GIS for Health Metrics

The Use of Statistical Point Processes in Geoinformation Analysis

Types of spatial data. The Nature of Geographic Data. Types of spatial data. Spatial Autocorrelation. Continuous spatial data: geostatistics

Introduction to GIS. Dr. M.S. Ganesh Prasad

Final Project: An Income and Education Study of Washington D.C.

Michael Harrigan Office hours: Fridays 2:00-4:00pm Holden Hall

Spatial and Temporal Geovisualisation and Data Mining of Road Traffic Accidents in Christchurch, New Zealand

Large-Scale Spatial Distribution Identification of Base Stations in Cellular Networks

Objectives Define spatial statistics Introduce you to some of the core spatial statistics tools available in ArcGIS 9.3 Present a variety of example a

Spatial Statistics A Framework for Analyzing Geographically Referenced Data in Insurance Ratemaking

RESEARCH REPORT. Inhomogeneous spatial point processes with hidden second-order stationarity. Ute Hahn and Eva B.

Second-Order Analysis of Spatial Point Processes

Spatial Analyst. By Sumita Rai

Algorithms for Picture Analysis. Lecture 07: Metrics. Axioms of a Metric

Seminar Alg Basics. Name: Class: Date: ID: A. Multiple Choice Identify the choice that best completes the statement or answers the question.

Spatial Data Mining. Regression and Classification Techniques

OPEN GEODA WORKSHOP / CRASH COURSE FACILITATED BY M. KOLAK

Class 9. Query, Measurement & Transformation; Spatial Buffers; Descriptive Summary, Design & Inference

Chapter 6. Fundamentals of GIS-Based Data Analysis for Decision Support. Table 6.1. Spatial Data Transformations by Geospatial Data Types

The Joys of Geocoding (from a Spatial Statistician s Perspective)

Introduction to Machine Learning Midterm Exam

Types of Spatial Data

Bayesian Analysis of Spatial Point Patterns

22 cities with at least 10 million people See map for cities with red dots

Identification of Regional Subcenters Using Spatial Data Analysis for Estimating Traffic Volume

SPATIAL ANALYSIS. Transformation. Cartogram Central. 14 & 15. Query, Measurement, Transformation, Descriptive Summary, Design, and Inference

Stochastic Modeling of Tropical Cyclone Track Data

covariance function, 174 probability structure of; Yule-Walker equations, 174 Moving average process, fluctuations, 5-6, 175 probability structure of

Application of eigenvector-based spatial filtering approach to. a multinomial logit model for land use data

Detection of Clustering in Spatial Data

arxiv: v2 [math.pr] 22 May 2014

Transcription:

Points Luc Anselin http://spatial.uchicago.edu 1

classic point pattern analysis spatial randomness intensity distance-based statistics points on networks 2

Classic Point Pattern Analysis 3

Classic Examples forestry, plant species, astronomy locations of crimes, accidents locations of persons with a disease facility locations (economic geography) settlement patterns 4

SF car thefts, Aug 2012 5

Events points are the location of an event of interest all points are known = mapped pattern selection bias events are mapped, but non-events are not 6

Research Questions is the pattern random or structured in some fashion clustered: closer than random dispersed/regular: farther than random what is the process that might have generated the pattern 7

Classic Point Pattern Analysis points located on an isotropic plane no directional effect distance as straight line distance 8

Marked Point Pattern both location and value e.g., location and employment of manufacturing plants, trunk size of trees patterns in the location of the points and in the values association with the locations = spatial autocorrelation 9

Classic data set: longleaf pines 10

Multi-Type Pattern multiple categories of events in one pattern research questions: patterning within a single type association between patterns in different types repulsion or attraction between types 11

Chicago multitype point pattern 12

Case-Control Design take into account background heterogeneity non-uniform population at risk pattern for event of interest = case pattern for background population = control 13

Classic case-control data set: Lancashire cancers 14

Spatial Randomness 15

Complete Spatial Randomness standard of reference uniform distribution each location has equal probability for an event locations of events are independent homogeneous planar Poisson process 16

Poisson Point Process distribution for N points in area A, N(A) intensity: λ = N/ A ( A is area of A) therefore N = λ A points randomly scattered in a region with area A Poisson distribution: N(A) ~ Poi(λ A ) 17

CSR (uniform) N=50 CSR (uniform) N=100 Simulated CSR - uniform with N fixed on unit square 18

Contagious Point Distributions two stages distribution for parents distribution for offspring formal models Poisson cluster process or Neyman-Scott process Matern cluster process 19

Neyman Scott Parents Lambda=10 Neyman Scott Children, N=5 per parent Simulated Neyman-Scott process realized N=15 realized N=55 overall λ=10x5 20

Heterogeneous Poisson Process spatially varying intensity λ(s) mean intensity is integral of the location-specific intensities over the region source of variability function for λ(s) = f(z) with covariates doubly stochastic process with λ(s) Λ(s) 21

Log Gaussian Point Process 0.0 0.2 0.4 0.6 0.8 1.0 1.2 0.0 0.2 0.4 0.6 0.8 1.0 0 500 1000 1500 2000 lnz N(4.1,1) E[λ] 100 average λ = 113 22

Intensity 23

Average Intensity first moment of a point pattern distribution number of points per unit area intensity: λ = N/ A area depends on bounding polygon 24

Bounding Polygon classic unit square unrealistic but used in classic example data sets actual regional boundary (GIS) bounding box convex hull 25

Chicago supermarkets - City boundary 26

Quadrat Counts assess the extent to which intensity is constant across space quadrat = polygon count the points in the quadrant visualize counts, intensity map 27

Quadrat counts - alternative configurations 28

Quadrat count intensity graph intensity = count / area 29

Intensity Function spatial heterogeneity intensity λ(s) varies with location s estimating λ(s) non-parametric kernel function 30

Kernel Density Estimation non-parametric approach weighted moving average of the data f(u) = (1/Nb) i K[(u - ui)/b] u is any location K is the kernel function (a function of distance) b is the bandwidth, i.e., how far the moving average is computed with Nb as the number of observations within the bandwidth 31

Chicago supermarket locations Gaussian kernel bw = 14259 32

Chicago supermarket locations Gaussian kernel bw = 6071 33

Distance-Based Statistics 34

Nearest Neighbor Functions 35

Terminology events and points event: observed location of an event point: reference point (e.g., point on a grid) distances event-to-event distance point-to-event distance 36

Nearest Neighbor Statistic principle under CSR the nearest neighbor distance between points has known mathematical properties testing strategy = detect deviations from these properties 37

Nearest Neighbor Statistic (2) implementation event to nearest event point to nearest event characterize this distribution relative to CSR many nearest neighbor statistics G function (event to event) F function (point to event) J function (combination) 38

G Function - Event-to-Event Distribution cumulative distribution of nearest neighbor distances G(r) = n -1 #(ri r) proportion of nearest neighbor distances that are less than r plot estimated G(r) against r implementation: many types of edge corrections 39

G under CSR nearest neighbor at distance r implies that no other points are within a circle with radius r P[y=0] is exp(-λπr 2 ) under Poisson distribution the probability of finding a nearest neighbor is then the complement of this P[ri < r] = 1 - exp(-λπr 2 ) reference function, plot 1 - exp(-λπr 2 ) against r 40

G function with reference curve for CSR 41

Inference analytical results intractable or only under unrealistic assumptions mimic CSR by random simulation random pattern for same n compute G(r) for each random pattern create a simulation envelope 42

G function with randomization envelope using min and max for each r 43

Interpretation clustering G(r) function above randomization envelope inhibition G(r) function below randomization envelope 44

0.00 0.02 0.04 0.06 0.08 0.0 0.2 0.4 0.6 0.8 1.0 distance G(d) G for CSR 45

0.00 0.02 0.04 0.06 0.0 0.2 0.4 0.6 0.8 distance G(d) G for Poisson Clustered Process 46

0.00 0.05 0.10 0.15 0.0 0.2 0.4 0.6 0.8 1.0 distance G(d) G for Matern II Inhibition Process 47

Second Order Statistics 48

Beyond Nearest Neighbor Statistics nearest neighbor distances do not fully capture the complexity of point processes instead, take into account all the pair-wise distances as a density function or as a cumulative density function 49

Second Order Statistics second order statistics exploit the notion of covariance based on the number of other points within a given radius of a point pair correlation function, or g-function Ripley s K and Besag s L function 50

Ripley s K Function best known second order statistic so-called reduced second order moment λk(r) = E[N0(r)] E[N0(r)] is the expected number of events within a distance r from an arbitrary event K(r) = λ -1 E[N0(r)] is the K function 51

Estimating the K Function expected events within distance r E[N0(r)] = n -1 i j i Ih(rij < r) for each event, sum over all other events within the given distance band, for increasing distances cumulative function edge corrections 52

Inference and Interpretation for CSR, K(r) = πr 2 K(r) > πr 2 implies clustering K(r) < πr 2 implies inhibition (regular process) use randomization envelope for inference 53

K function with reference line for CSR 54

K function with randomization envelope using min and max for each r 55

K for CSR 0.00 0.05 0.10 0.15 0.20 0.25 0.00 0.05 0.10 0.15 0.20 distance K(d) 56

K for Poisson Cluster Process 0.00 0.05 0.10 0.15 0.20 0.25 0.0 0.1 0.2 0.3 distance K(d) 57

K for Matern II Inhibition Process 0.00 0.05 0.10 0.15 0.20 0.25 0.00 0.05 0.10 0.15 0.20 0.25 distance K(d) 58

Points on Networks 59

Points on a Network realistic locations events located on actual network, not floating in space network distance replaces straight line distance shortest path on the network 60

Los Angeles riot locations 61

Baghdad IED locations 62

network heat maps (kernel density) Source: Rosser et al (2017) 63

from events to points on network segments Source: Rosser et al (2017) 64

kernel function on a network Source: Rosser et al (2017) 65

SANET functionality Source: Okabe et al (2016) 66

Network Segments aggregate data by street segment e.g., accidents per traffic intensity street segments spatial weights define contiguity use shortest path distance network LISA 67

68