Spatial Regression. 1. Introduction and Review. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

Spatial Regression 1. Introduction and Review Luc Anselin http://spatial.uchicago.edu

Outline: matrix algebra basics; spatial econometrics - definitions; pitfalls of spatial analysis; spatial autocorrelation; spatial weights

Matrix Algebra Basics

Review (on your own): matrix multiplication, quadratic forms; matrix inverse; determinant; Cholesky decomposition; eigenvalues, eigenvalue decomposition; optional: matrix partial derivatives

Spatial Econometrics Definitions

When is Analysis Spatial? Geo-spatial data: location + value (attribute). Non-spatial analysis: location does NOT matter (locational invariance). Spatial analysis: when the location changes, the information content of the data changes.

Spatial Distribution

A-Spatial Distribution Histogram

Spatial Analysis Global Spatial Autocorrelation Moran Scatter Plot

Definition of Spatial Econometrics (e.g., Anselin 1988, 2006): a subset of econometric methods concerned with spatial aspects present in cross-sectional and space-time observations; explicit treatment of location, distance and arrangement in model specification, estimation, diagnostics and prediction; spatial effects: dependence, heterogeneity

spatial econometrics

Spatial Effects. Spatial heterogeneity: structural change, varying coefficients; standard methods apply. Spatial dependence: two-dimensional and multidirectional cross-sectional dependence; not a direct extension of time series methods.

Four dimensions of spatial econometrics: specifying the structure of spatial dependence/heterogeneity; testing for the presence of spatial effects; estimating models with spatial effects; spatial prediction

Pitfalls of Spatial Analysis

Ecological Fallacy: individual behavior cannot be explained at the aggregate level; an issue of interpretation, e.g., county homicide rates do not explain individual criminal behavior; model aggregate dependent variables with aggregate explanatory variables; alternative: multilevel modeling

Modifiable Areal Unit Problem (MAUP): What is the proper spatial scale of analysis? A million spatial autocorrelation coefficients (Openshaw). Spatial heterogeneity - different processes at different locations/scales. Both size and spatial arrangement of spatial units matter.

Source: Scholl and Brenner (2012)

Change of Support Problem (COSP): variables measured at different spatial scales (nested, hierarchical structures; non-nested, overlapping). Solutions: aggregate up to a common scale; interpolate/impute - Bayesian approach.

Spatial Autocorrelation

The Null Hypothesis: spatial randomness is the absence of any pattern; spatial randomness is not very interesting in itself; if it is rejected, then there is evidence of spatial structure

Tobler's First Law of Geography: everything depends on everything else, but closer things more so; structures spatial dependence; importance of distance decay

Positive Spatial Autocorrelation: impression of clustering; clumps of like values; like values can be either high (hot spots) or low (cold spots); difficult to rely on human perception

[Figure: positive spatial autocorrelation vs. spatial randomness]

Negative Spatial Autocorrelation: checkerboard pattern; hard to distinguish from spatial randomness

[Figure: negative spatial autocorrelation vs. spatial randomness]

Spatial Autocorrelation Statistic: how to construct an index from the data that captures both attribute similarity and locational similarity (i.e., neighbors are alike)

Attribute Similarity: summary of the similarity (or dissimilarity) of observations for a variable at different locations; variable $y$, locations $i$, $j$; how to construct $f(y_i, y_j)$: cross product, $y_i y_j$; squared difference, $(y_i - y_j)^2$; absolute difference, $|y_i - y_j|$

Locational Similarity: formalizing the notion of neighbors = spatial weights ($w_{ij}$). When are two spatial units $i$ and $j$ a priori likely to interact? Not necessarily a geographical notion, can be based on social network concepts or general distance concepts (distance in multivariate space).

General Spatial Autocorrelation Statistic: general form is a sum over all observations of an attribute similarity measure with the neighbors; $f(x_i, x_j)$ is the attribute similarity between $i$ and $j$ for $x$; $w_{ij}$ is a spatial weight between $i$ and $j$; statistic $= \sum_{i} \sum_{j} f(x_i, x_j) \, w_{ij}$
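The general form can be sketched in a few lines of NumPy. This is only an illustration: the function name `general_stat`, the three similarity helpers, and the three-unit example data are hypothetical and not taken from the slides.

```python
import numpy as np

def general_stat(x, w, f):
    """Sum over all pairs (i, j) of f(x_i, x_j) * w_ij."""
    x = np.asarray(x, dtype=float)
    w = np.asarray(w, dtype=float)
    sim = f(x[:, None], x[None, :])  # n x n matrix of f(x_i, x_j)
    return float(np.sum(sim * w))

# The three attribute similarity measures listed on the slide
cross_product = lambda a, b: a * b
squared_diff = lambda a, b: (a - b) ** 2
abs_diff = lambda a, b: np.abs(a - b)

# Hypothetical data: three units on a line; 1-2 and 2-3 are neighbors
x = [1.0, 2.0, 4.0]
w = np.array([[0, 1, 0],
              [1, 0, 1],
              [0, 1, 0]])

for name, f in [("cross product", cross_product),
                ("squared difference", squared_diff),
                ("absolute difference", abs_diff)]:
    print(name, general_stat(x, w, f))
```

Because the weights are zero for non-neighbors, only neighboring pairs contribute to the sum, whatever similarity measure is chosen.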

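Moran's I: the most commonly used of many spatial autocorrelation statistics; $I = \left[ \sum_i \sum_j w_{ij} z_i z_j / S_0 \right] / \left[ \sum_i z_i^2 / N \right]$, with $z_i = y_i - m_x$ the deviations from the mean $m_x$ and $S_0 = \sum_i \sum_j w_{ij}$ the sum of all weights; cross product statistic ($z_i z_j$); similar to a correlation coefficient; value depends on the weights ($w_{ij}$)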
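A minimal NumPy sketch of Moran's I follows the formula term by term. The function name `morans_i` and the four-unit chain used as example data are assumptions made for illustration.

```python
import numpy as np

def morans_i(y, w):
    """Moran's I = [sum_ij w_ij z_i z_j / S0] / [sum_i z_i^2 / N]."""
    y = np.asarray(y, dtype=float)
    w = np.asarray(w, dtype=float)
    n = y.size
    z = y - y.mean()                      # deviations from the mean
    s0 = w.sum()                          # S0: sum of all weights
    numerator = (w * np.outer(z, z)).sum() / s0
    denominator = (z ** 2).sum() / n
    return numerator / denominator

# Hypothetical layout: four units on a line, adjacent units are neighbors
w = np.array([[0, 1, 0, 0],
              [1, 0, 1, 0],
              [0, 1, 0, 1],
              [0, 0, 1, 0]])

trend = morans_i([1, 2, 3, 4], w)         # like values next to each other
alternating = morans_i([1, 0, 1, 0], w)   # checkerboard-like pattern
print(trend, alternating)
```

The smooth trend gives a positive value and the alternating pattern a negative one, matching the intuition of positive and negative spatial autocorrelation. Changing `w` changes the result, since the value depends on the weights.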

Spatial Weights

Basic Concepts

Why Spatial Weights: formal expression of locational similarity; spatial autocorrelation is about interaction; there are $n(n-1)/2$ pairwise interactions but only $n$ observations in a cross-section; insufficient information to extract the pattern of interaction from a cross-section; example: North Carolina has 100 counties, hence about 5,000 pairwise interactions ($100 \times 99 / 2 = 4950$) but only 100 observations

Solution: impose structure; limit the number of parameters to be estimated; incidental parameter problem = number of parameters grows with sample size; for spatial interaction, number of parameters grows with $n^2$

Spatial Weights: exclude some interactions; constrain the number of neighbors, e.g., only those locations that share a border; single parameter = spatial autocorrelation coefficient; strength of interaction = combined effect of coefficient and weights (a small coefficient with large weights, or a large coefficient with small weights)

six polygons - neighbors share common border

link node neighbor structure as a graph

Spatial Weights Matrix Definition: $N$ by $N$ positive matrix $W$ with elements $w_{ij}$; $w_{ij}$ non-zero for neighbors; $w_{ij} = 0$ when $i$ and $j$ are not neighbors; $w_{ii} = 0$, no self-similarity

Geography-Based Spatial Weights

Binary Contiguity Weights: contiguity = common border; if $i$ and $j$ share a border, then $w_{ij} = 1$; if $i$ and $j$ are not neighbors, then $w_{ij} = 0$; weights are 0 or 1, hence binary

binary contiguity weights matrix for six-region example

Contiguity on a regular grid - different definitions (3 x 3 grid, cells numbered 1 to 9 row by row, cell 5 in the center)

rook contiguity - edges only: 2, 4, 6, 8 are neighbors of 5

bishop contiguity - corners only: 1, 3, 7, 9 are neighbors of 5

queen contiguity - edges and corners: 5 has eight neighbors
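The three grid contiguity definitions can be checked with a short sketch. It assumes the cells are numbered row by row, so that cell 5 is the center of the 3 x 3 grid. The helper names `grid_weights` and `neighbors_of` are hypothetical.

```python
import numpy as np

def grid_weights(nrows, ncols, criterion="rook"):
    """Binary contiguity matrix for a regular grid, cells numbered row by row."""
    offsets = {
        "rook": [(-1, 0), (1, 0), (0, -1), (0, 1)],       # shared edges
        "bishop": [(-1, -1), (-1, 1), (1, -1), (1, 1)],   # shared corners
    }
    offsets["queen"] = offsets["rook"] + offsets["bishop"]
    n = nrows * ncols
    w = np.zeros((n, n), dtype=int)
    for r in range(nrows):
        for c in range(ncols):
            i = r * ncols + c
            for dr, dc in offsets[criterion]:
                rr, cc = r + dr, c + dc
                if 0 <= rr < nrows and 0 <= cc < ncols:
                    w[i, rr * ncols + cc] = 1
    return w

def neighbors_of(w, cell):
    """Neighbors of a cell, using 1-based cell labels as on the slides."""
    return [int(j) + 1 for j in np.flatnonzero(w[cell - 1])]

rook = neighbors_of(grid_weights(3, 3, "rook"), 5)
bishop = neighbors_of(grid_weights(3, 3, "bishop"), 5)
queen = neighbors_of(grid_weights(3, 3, "queen"), 5)
print(rook, bishop, len(queen))
```

Queen contiguity is simply the union of the rook and bishop neighbor sets, which is why the center cell ends up with eight neighbors.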

[Figure: rook vs. queen contiguity, Solano county, CA]

Distance-Based Weights: distance between points; distance between polygon centroids or central points; in general, can be any function of distance that implies distance decay, e.g., inverse distance; in practice, mostly based on a notion of contiguity defined by distance

Distance-Band Weights: $w_{ij}$ nonzero for $d_{ij} < d$, i.e., less than a critical distance $d$; potential problem: isolates = no neighbors; make sure the critical distance is the max-min distance, i.e., the largest of the nearest neighbor distances over all observations
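The max-min rule can be illustrated with a small sketch. The function names and the four example points are hypothetical. One assumption differs slightly from the slide: the band below uses `<=` rather than a strict `<`, so that the observation whose nearest neighbor lies exactly at the max-min distance still gets a neighbor.

```python
import numpy as np

def pairwise_distances(coords):
    """Euclidean distance matrix between all points."""
    coords = np.asarray(coords, dtype=float)
    diff = coords[:, None, :] - coords[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=2))

def max_min_distance(d):
    """Largest of the nearest neighbor distances over all observations."""
    d_off = d.copy()
    np.fill_diagonal(d_off, np.inf)       # ignore distance to self
    return d_off.min(axis=1).max()

def distance_band_weights(d, cutoff):
    """Binary weights: 1 when d_ij <= cutoff (assumption: inclusive band)."""
    w = (d <= cutoff).astype(int)
    np.fill_diagonal(w, 0)                # w_ii = 0
    return w

# Hypothetical points: three close together, one far away
coords = [(0, 0), (1, 0), (0, 1), (5, 0)]
d = pairwise_distances(coords)
cutoff = max_min_distance(d)

w_ok = distance_band_weights(d, cutoff)   # no isolates at the max-min cutoff
w_small = distance_band_weights(d, 1.5)   # the far point becomes an isolate
print(cutoff, w_ok.sum(axis=1), w_small.sum(axis=1))
```

With the smaller cutoff the remote point has no neighbors, which is the isolates problem. With the max-min cutoff every observation has at least one neighbor, at the price of many more neighbors for the clustered points.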

k-nearest Neighbor Weights: the k nearest observations, irrespective of distance; fixes the isolates problem of distance bands; same number of neighbors for all observations; in practice, potential problem with ties, which needs a tie-breaking rule (random, include all)
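A minimal sketch of k-nearest neighbor weights is given below. The function name `knn_weights` and the example points are hypothetical. Ties are broken here by a stable sort, so the lower-indexed observation wins; this is a simple deterministic stand-in, not the random or include-all rules mentioned on the slide.

```python
import numpy as np

def knn_weights(coords, k):
    """Binary weights linking each observation to its k nearest neighbors."""
    coords = np.asarray(coords, dtype=float)
    n = coords.shape[0]
    diff = coords[:, None, :] - coords[None, :, :]
    d = np.sqrt((diff ** 2).sum(axis=2))
    np.fill_diagonal(d, np.inf)           # an observation is not its own neighbor
    # Stable sort: ties go to the lower index (assumed tie-breaking rule)
    nearest = np.argsort(d, axis=1, kind="stable")[:, :k]
    w = np.zeros((n, n), dtype=int)
    w[np.arange(n)[:, None], nearest] = 1
    return w

# Hypothetical points: three close together, one far away
coords = [(0, 0), (1, 0), (0, 1), (5, 0)]
w = knn_weights(coords, k=2)
print(w.sum(axis=1))    # every observation has exactly k neighbors
print(w[3, 0], w[0, 3]) # the relation need not be symmetric
```

Every row has exactly k non-zero entries, so there are no isolates. The resulting matrix is in general not symmetric: the remote point counts the first point among its neighbors, but not the other way around.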

Spatial Weights Transformations

Row-Standardized Weights: rescale weights such that each row sums to one, $\sum_j w_{ij}^* = 1$, with $w_{ij}^* = w_{ij} / \sum_j w_{ij}$; constrains parameter space; makes analyses comparable; spatial lag = average of the neighbors
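Row-standardization and the resulting spatial lag can be sketched as follows. The function name `row_standardize` and the three-unit example are hypothetical. Leaving the rows of isolates at zero is an assumption made here to avoid dividing by zero.

```python
import numpy as np

def row_standardize(w):
    """Divide each row by its sum so that every non-empty row sums to one."""
    w = np.asarray(w, dtype=float)
    row_sums = w.sum(axis=1, keepdims=True)
    # Assumption: rows of isolates (row sum 0) are left as all zeros
    return np.divide(w, row_sums, out=np.zeros_like(w), where=row_sums > 0)

# Hypothetical layout: three units on a line; 1-2 and 2-3 are neighbors
w = np.array([[0, 1, 0],
              [1, 0, 1],
              [0, 1, 0]])
y = np.array([1.0, 2.0, 4.0])

w_std = row_standardize(w)
lag = w_std @ y   # spatial lag: average of the neighbors' values
print(w_std.sum(axis=1), lag)
```

For the middle unit the lag is the mean of its two neighbors, (1 + 4) / 2 = 2.5, which is what "spatial lag = average of the neighbors" means in practice.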

row-standardized weights matrix

Higher Order Weights

second order contiguity: neighbor of neighbor

redundancy in higher order contiguity: paths of length 2 between 1 and other cells

Higher Order Weights: recursive definition, a k-th order neighbor is a first order neighbor of a (k-1)th order neighbor; avoid duplication, only unique neighbors of a given order (not both first and second order); pure contiguity or cumulative contiguity, i.e., lower order neighbors included in weights
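Second order contiguity, with the redundancy removed, can be sketched from the matrix product of a binary first order matrix with itself. The function name `second_order` and the four-unit example are hypothetical.

```python
import numpy as np

def second_order(w, cumulative=False):
    """Second order contiguity from a binary first order matrix.

    Pure: reachable in two steps, but neither a first order neighbor nor self.
    Cumulative: first order neighbors are included as well.
    """
    first = np.asarray(w) > 0
    two_step = (first.astype(int) @ first.astype(int)) > 0  # paths of length 2
    np.fill_diagonal(two_step, False)     # drop i -> j -> i round trips
    pure = two_step & ~first              # drop pairs already first order
    if cumulative:
        return (pure | first).astype(int)
    return pure.astype(int)

# Hypothetical layout: units 1, 2, 3 are mutual neighbors; unit 4 touches only 3
w = np.array([[0, 1, 1, 0],
              [1, 0, 1, 0],
              [1, 1, 0, 1],
              [0, 0, 1, 0]])

pure = second_order(w)
cumulative = second_order(w, cumulative=True)
print(pure)
print(cumulative)
```

Units 1 and 2 are connected by a path of length 2 through unit 3, but they are already first order neighbors, so the pure second order matrix excludes that pair and keeps only 1-4 and 2-4.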

[Figure: Solano county, CA, second order contiguity; exclusive of first order vs. inclusive of first order]

Properties of Weights

Connectivity Histogram: histogram of the number of neighbors (neighbor cardinality); diagnostic for isolates or neighborless units; assess characteristics of the distribution
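The connectivity histogram amounts to counting the non-zero entries in each row of the weights matrix. The sketch below uses hypothetical function names and a made-up five-unit example with one neighborless unit.

```python
import numpy as np
from collections import Counter

def neighbor_cardinality(w):
    """Number of neighbors of each observation (non-zero entries per row)."""
    return (np.asarray(w) != 0).sum(axis=1)

def connectivity_histogram(w):
    """Map from number of neighbors to how many observations have that many."""
    counts = Counter(int(c) for c in neighbor_cardinality(w))
    return dict(sorted(counts.items()))

def isolates(w):
    """Indices (0-based) of observations without any neighbor."""
    return [int(i) for i in np.flatnonzero(neighbor_cardinality(w) == 0)]

# Hypothetical layout: units 0-1-2-3 form a chain, unit 4 has no neighbors
w = np.array([[0, 1, 0, 0, 0],
              [1, 0, 1, 0, 0],
              [0, 1, 0, 1, 0],
              [0, 0, 1, 0, 0],
              [0, 0, 0, 0, 0]])

hist = connectivity_histogram(w)
print(hist)          # number of neighbors -> count of observations
print(isolates(w))   # neighborless units
```

A bar at zero neighbors flags isolates, while a very long right tail or two separate peaks would point to the other issues to watch for in the distribution.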

Things to Watch for: isolates (need to be removed for proper spatial analysis; do not need to be removed for standard analysis); very large number of neighbors; bimodal distribution