Mapping and Analysis for Spatial Social Science

Similar documents
Luc Anselin Spatial Analysis Laboratory Dept. Agricultural and Consumer Economics University of Illinois, Urbana-Champaign

Outline ESDA. Exploratory Spatial Data Analysis ESDA. Luc Anselin

Geovisualization. Luc Anselin. Copyright 2016 by Luc Anselin, All Rights Reserved

Rate Maps and Smoothing

Lecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad

Lecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad

Exploratory Spatial Data Analysis Using GeoDA: : An Introduction

Exploratory Spatial Data Analysis and GeoDa

Exploratory Spatial Data Analysis (And Navigating GeoDa)

OPEN GEODA WORKSHOP / CRASH COURSE FACILITATED BY M. KOLAK

KAAF- GE_Notes GIS APPLICATIONS LECTURE 3

Soc/Anth 597 Spatial Demography March 14, GeoDa 0.95i Exercise A. Stephen A. Matthews. Outline. 1. Background

Local Spatial Autocorrelation Clusters

Kriging Luc Anselin, All Rights Reserved

Spatial Autocorrelation (2) Spatial Weights

Outline. Introduction to SpaceStat and ESTDA. ESTDA & SpaceStat. Learning Objectives. Space-Time Intelligence System. Space-Time Intelligence System

Spatial Tools for Econometric and Exploratory Analysis

Spatial Regression. 1. Introduction and Review. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

Introduction to Spatial Statistics and Modeling for Regional Analysis

EXPLORATORY SPATIAL DATA ANALYSIS OF BUILDING ENERGY IN URBAN ENVIRONMENTS. Food Machinery and Equipment, Tianjin , China

Spatial Data, Spatial Analysis and Spatial Data Science

Exploratory Spatial Data Analysis (ESDA)

Spatial Clusters of Rates

Introduction GeoXp : an R package for interactive exploratory spatial data analysis. Illustration with a data set of schools in Midi-Pyrénées.

SASI Spatial Analysis SSC Meeting Aug 2010 Habitat Document 5

Measures of Spatial Dependence

Finding Hot Spots in ArcGIS Online: Minimizing the Subjectivity of Visual Analysis. Nicholas M. Giner Esri Parrish S.

Attribute Data. ArcGIS reads DBF extensions. Data in any statistical software format can be

Spatial Analysis I. Spatial data analysis Spatial analysis and inference

Geographical Information Systems Institute. Center for Geographic Analysis, Harvard University. GeoDa: Spatial Autocorrelation

GIS Test Drive What a Geographic Information System Is and What it Can Do. Alison Davis-Holland

Geometric Algorithms in GIS

Spatial Autocorrelation

SPACE Workshop NSF NCGIA CSISS UCGIS SDSU. Aldstadt, Getis, Jankowski, Rey, Weeks SDSU F. Goodchild, M. Goodchild, Janelle, Rebich UCSB

Lecture 4. Spatial Statistics

Global Spatial Autocorrelation Clustering

Intro to GIS Summer 2012 Data Visualization

The GeoDa Book. Exploring Spatial Data. Luc Anselin

Comparing Color and Leader Line Approaches for Highlighting in Geovisualization

The Nature of Geographic Data

Concepts and Applications of Kriging. Eric Krause Konstantin Krivoruchko

Web-Based Analytical Tools for the Exploration of Spatial Data

GIS and Spatial Statistics: One World View or Two? Michael F. Goodchild University of California Santa Barbara

VISUALIZING MULTIVARIATE SPATIAL CORRELATION WITH DYNAMICALLY LINKED WINDOWS

Introduction to Statistics

Spatial Analysis 1. Introduction

A Local Indicator of Multivariate Spatial Association: Extending Geary s c.

Concepts and Applications of Kriging. Eric Krause

Exploratory Spatial Data Analysis of Regional Economic Disparities in Beijing during the Preparation Period of the 2008 Olympic Games

GEOGRAPHY 350/550 Final Exam Fall 2005 NAME:

Geographical Information Systems Institute. Center for Geographic Analysis, Harvard University. GeoDa: Exploratory Spatial Data Analysis

Spatial correlation and demography.

Lecture 5 Geostatistics

Tracey Farrigan Research Geographer USDA-Economic Research Service

Michael Harrigan Office hours: Fridays 2:00-4:00pm Holden Hall

CSISS Tools and Spatial Analysis Software

GIS Spatial Statistics for Public Opinion Survey Response Rates

CS : Spatial Data Modeling and Analysis. Geovisualization

Spatial Regression. 10. Specification Tests (2) Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

The role of topological outliers in the spatial analysis of georeferenced social media data

Geographic Data Science - Lecture IV

Construction Engineering. Research Laboratory. Approaches Towards the Identification of Patterns in Violent Events, Baghdad, Iraq ERDC/CERL CR-09-1

Finding Hot Spots in ArcGIS Online: Minimizing the Subjectivity of Visual Analysis. Nicholas M. Giner Esri Parrish S.

Final Project: An Income and Education Study of Washington D.C.

Concepts and Applications of Kriging

Models to carry out inference vs. Models to mimic (spatio-temporal) systems 5/5/15

Geog183: Cartographic Design and Geovisualization Winter Quarter 2017 Lecture 6: Map types and Data types

The Case for Space in the Social Sciences

GIST 4302/5302: Spatial Analysis and Modeling

The Use of Spatial Weights Matrices and the Effect of Geometry and Geographical Scale

The Implementation of Autocorrelation-Based Regioclassification in ArcMap Using ArcObjects

AIM HIGH SCHOOL. Curriculum Map W. 12 Mile Road Farmington Hills, MI (248)

Points. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

Where Do Overweight Women In Ghana Live? Answers From Exploratory Spatial Data Analysis

Concepts and Applications of Kriging

Spatial Trends of unpaid caregiving in Ireland

Overview of Statistical Analysis of Spatial Data

STARS: Space-Time Analysis of Regional Systems

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

GIST 4302/5302: Spatial Analysis and Modeling Lecture 2: Review of Map Projections and Intro to Spatial Analysis

Identification of Regional Subcenters Using Spatial Data Analysis for Estimating Traffic Volume

Sociology 6Z03 Review I

Spatial autocorrelation: robustness of measures and tests

Development of Integrated Spatial Analysis System Using Open Sources. Hisaji Ono. Yuji Murayama

Spatial Pattern Analysis: Mapping Trends and Clusters. Lauren M. Scott, PhD Lauren Rosenshein Bennett, MS

EXPLORATORY SPATIAL DATA ANALYSIS IN A GEOCOMPUTATIONAL ENVIRONMENT

Aggregated cancer incidence data: spatial models

Intensity Analysis of Spatial Point Patterns Geog 210C Introduction to Spatial Data Analysis

Spatial Analysis II. Spatial data analysis Spatial analysis and inference

What s special about spatial data?

Spatial Analysis 2. Spatial Autocorrelation

Contents. 9. Fractional and Quadratic Equations 2 Example Example Example

Why Is It There? Attribute Data Describe with statistics Analyze with hypothesis testing Spatial Data Describe with maps Analyze with spatial analysis

Quantitative Methods Chapter 0: Review of Basic Concepts 0.1 Business Applications (II) 0.2 Business Applications (III)

Spatial Data Mining. Regression and Classification Techniques

I L L I N O I S UNIVERSITY OF ILLINOIS AT URBANA-CHAMPAIGN

DIFFERENT INFLUENCES OF SOCIOECONOMIC FACTORS ON THE HUNTING AND FISHING LICENSE SALES IN COOK COUNTY, IL

This lab exercise will try to answer these questions using spatial statistics in a geographic information system (GIS) context.

Overview key concepts and terms (based on the textbook Chang 2006 and the practical manual)

Where to Invest Affordable Housing Dollars in Polk County?: A Spatial Analysis of Opportunity Areas

Transcription:

Mapping and Analysis for Spatial Social Science Luc Anselin Spatial Analysis Laboratory Dept. Agricultural and Consumer Economics University of Illinois, Urbana-Champaign http://sal.agecon.uiuc.edu

Outline Introduction Geovisualization Statistical Maps Map Smoothing Linking and Brushing Visualizing Spatial Autocorrelation Space-Time Correlation

Introduction

Spatial Models Growing Interest in Space, Spatiality and Spatial Interaction among Theoretical Social Sciences Overarching Concepts social interaction context, neighborhood effects interacting agents, strategic interaction spatial externalities, agglomeration geography as a proxy

Spatial Data Growing Interest in the use of Spatial Data in Empirical Social Science georeferenced data addresses, lat-lon (GPS survey) distance and accessibility measures access to infrastructure, spatial mismatch Role of GIS affordable and transparent spatial data manipulation

Geographic Information Systems GIS as a Set of Tools Burrough: set of tools for collecting, storing, retrieving at will, transforming and displaying spatial data from the real world for a particular set of purposes a GIS, GISes (= systems) GIS as Science (the new geography) Goodchild: Geographic Information Science generic scientific questions pertaining to geographic data central role of spatial analysis GIScience

What is Spatial Analysis From Data to Information beyond mapping: added value transformations, manipulations and application of analytical methods to spatial (geographic) data Lack of Locational Invariance analyses where the outcome changes when the location of the objects under study changes median center, clusters, spatial autocorrelation

spatial analysis avant la lettre Dr. Snow s map of cholera deaths in London

Components of Spatial Analysis Visualization Showing interesting patterns Exploratory Spatial Data Analysis Finding interesting patterns Spatial Modeling, Regression Explaining interesting patterns

Implementation of Spatial Analysis Beyond GIS Analytical functionality not part of typical commercial GIS Analytical extensions, DynESDA2 Exploration requires interactive approach Spatial modeling requires specialized statistical methods Explicit treatment of spatial autocorrelation Space-time is not space + time Methods of Geovisualization, ESDA and Spatial Regression Analysis

(Limited) Illustration of Techniques Geovisualization specialized maps ESDA dynamically linked windows Spatial Correlation Analysis global and local spatial autocorrelation space-time correlation

Geovisualization

Beyond Mapping Map a collection of spatially defined objects (Monmonier) Geovisualization combination of map and scientific visualization methods (computer science) exploit human s pattern recognition abilities How to lie with maps many design issues human perception can be tricked

Choropleth Maps Map Counterpart of Histogram values/attributes for discrete spatial units choro from choros (colors) NOT chloro Practical Issues choice of intervals equal interval, equal share (quantiles), standard deviational, choice of colors important for perception of patterns misleading role of area larger areas seem more important

Color Brewer www.colorbrewer.org

Cartograms Misleading Effect of Area large areal units draw attention Symbol Maps symbols (bars, circles) superimposed on actual areas Cartogram change the layout to reflect size other than area population size, variable magnitude respect topology (spatial arrangement)

Choropleth Map (Juvenile Crime VA)

Point Map

Thiessen Polygons

Contiguous Cartogram Nebraska county population - http://www.bbr.unl.edu/cartograms/pop.html

Cartogram (GeoTools) Infant Mortality England and Wales, 1851

Isopleth Maps Map Counterpart of Density Plot values/attributes for continuous fields lines of equal value, contour lines 3-d surface plots Practical Issues choice of intervals spatial interpolation construct observations for locations that are not observed statistical problem = spatial prediction

Residential Sales Price, Baltimore MD (1980) sample points (darker is higher) and contours

Statistical Maps

Visualizing Spatial Distributions Spatialized EDA icons and glyphs matching locations special case of symbol maps Box Map outlier map visual popout, both magnitude and location Regional Box Plots spatial heterogeneity different distributions in spatial subsets

spatial lag bar chart blue = crime at i, red = spatial lag, average crime for neighbors

Spatialized EDA Spatial Chernoff Faces the burglar s view of crime clusters in Columbus

Box Map quartile map with outliers highlighted suicide rates in France (Durkheim 1897)

Linked Box Map in DynESDA2

Regional Histogram

Regional Box Plot Columbus crime: core vs periphery

Map Plots and Plot Maps Linked Micromap Plots - LM plots a micromap for each quantile micromaps linked to other statistical graphs Conditioned Choropleth Maps - cc maps choropleth maps on dependent variable micromap matrix conditioning along two dimensions

Linked Micromap Plots (Carr)

Conditioned Choropleth Map (Carr)

Map Smoothing

Mapping Events Events as Locations individual points point pattern analysis Events as Rates areal aggregates counts of events rate = # events / # population at risk raw rate is ML estimate of risk

Problems with Rate Maps Intrinsic Heterogeneity variance depends on mean variance depends on base Variance Instability spurious outliers Excess Risk is Non-Spatial does not account for spatial autocorrelation

Map Smoothing Empirical Bayes shrink rates to reference national average regional average = subset average Spatial Rate Smoother spatial moving average spatial range defined by spatial weights

Event and Base

Raw Rate Map

EB Smoothed Map

Spatial Rate Smoother

Regional EB Smoothing

Linking and Brushing

Linking Views different views of data statistical graphs: histogram, box plot, scatterplot map Table (list) Dynamic Linking views dynamically linked click on one view and corresponding observations (points, areas) on other views are highlighted

Linking Point and Polygon Maps

Dynamic Linking and Multimedia - panoramap

Brushing Brushing moving brush over map or graph highlights matching observations in other statistical graphs and vice versa Brushing Scatterplots recalculates slope of regression line Geographic Brushing simultaneous selecting on multiple maps

Selection in Scatterplot

Map Brushing in DynESDA2

Visualizing Spatial Autocorrelation

Random or Clustered?

Random or Clustered?

Moran s I Moran s I Spatial Autocorrelation Statistic cross-product statistic I = (N/S 0 ) Σ i Σ j w ij. z i.z j / Σ i z i 2 with z i = x i - µ and S 0 = Σ i Σ j w ij Inference normal distribution randomization permutation

Observed (left) and randomized (right) distribution for Columbus Crime Moran s I = 0.486 Moran s I = -0.003

Moran Scatterplot Linear Spatial Autocorrelation linear association between value at i and weighted average of neighbors: Σ j w ij y j vs. y i, or Wy vs y four quadrants high-high, low-low = spatial clusters high-low, low-high = spatial outliers Moran s I slope of linear scatterplot smoother I = z Wz / z z

Significance Envelope

Reference Distribution (CRIME)

Use of Moran Scatterplot Classification of Spatial Autocorrelation Local Nonstationarity outliers high leverage points sensitivity to boundary values Regimes different slopes in subsets of the data

Moran Scatterplot Map for Columbus crime four quadrants of the scatterplot (not significant )

Moran Scatterplot - Regimes

Local Moran Local Moran Statistic I i = (z i / m 2 )Σ j w ij.z j Σ i I i = N.I Inference randomization assumption conditional permutation local dependence or heterogeneity? Visualization LISA map and Moran Significance Map

LISA MAPS

Space-Time Correlation

Space-Time Moran Scatterplot Generalized Moran Scatterplot Regression slope of Wz t on z t-1 both variables standardized = visualization of Wartenberg multivariate Moran statistic Significance testing permutation permutation envelope (2.5% and 97.5% from permutation reference distribution) Four Types of Association High-high, Low-low; High-low, Low-high

Space-Time Moran Scatterplot p = 0.001 p = 0.002

Moran Scatterplot Matrix

Generalized LISA Generalization of Local Moran z 1i x Σ j w ij z 2j z 1 and z 2 different variables same variable at different times Inference Null hypothesis random assignment between value of z 1 at i, t and neighboring values of z 2

Space-Time Patterns Space-Time Cluster = Contagion High (above avg) values at a location surrounded by High values at different time compare to high-high same time Similar for Low-Low Space-Time Outlier = Change High (above avg) surrounded by Low (below avg) at different time Similar for Low-High Significance based on permutation

Space-Time LISA Maps

Interpretation and Limitations Most Important assessing lack of spatial randomness suggests significant spatial structure Multivariate Association univariate spatial autocorrelation may result from multivariate association scale mismatch need to control for other variables = spatial regression LISA Clusters and Hot Spots suggest interesting locations do not explain