Spatial Regression Modeling

Size: px
Start display at page:

Download "Spatial Regression Modeling"

Transcription

1 Spatial Regression Modeling Paul Voss & Katherine Curtis The Center for Spatially Integrated Social Science Santa Barbara, CA July 12-17, 2009 Day 1

2 Objective Provide a solid introduction and overview of the concepts and techniques of spatial regression analysis with plenty of hands-on experience in the afternoon lab sessions

3 What s the point? Data that are referenced to location bring an extremely important additional amount of valuable information to a data analysis But it also brings some (possibly unfamiliar) pitfalls that require a new awareness as that analysis proceeds

4 Plan for the week (1) Today: Understanding Spatial Data Broad overview of spatial data, spatial data analysis and core spatial concepts. Why spatial is special Classical linear regression model assumptions underlying OLS consequences of violations of assumptions why spatial processes violate OLS assumptions Introduction to EDA & ESDA Lab: Shapefiles & introduction to GeoDa & EDA using R Tuesday: Diagnosing spatial autocorrelation Understanding & measuring global spatial autocorrelation Weights matrices Understanding & measuring local spatial autocorrelation Moran scatterplot LISA statistics Lab: Global & local measures of spatial autocorrelation

5 Plan for the week (2) Wednesday: Common modeling strategies for spatial processes Understanding spatial processes spatial heterogeneity spatial dependence Spatial regression models OLS in GeoDa understanding GeoDa regression diagnostics spatial lag model; spatial error model No Lab. Free time in Santa Barbara! Thursday: Focus on spatial heterogeneity Spatial heterogeneity in relationships Introduction to GWR GWR theory and approach GWR software Lab: GWR hands-on using R

6 Plan for the week (3) Friday: Alternative spatial processes Reminder that there are some spatial data analysis approaches that have not been covered in this class point pattern analysis geostatistical methods; kriging Exciting things on the horizon Resources for pursuing these topics intensively Lab: Research presentations Open forum

7 Plan for today A good motivational example Why spatial is special characteristics of spatial data problems caused by spatial data Spatial analysis vs. spatial data analysis Broad, gentle overview of spatial data and spatial data analysis Classes of problems in spatial data analysis Review OLS assumptions & violations Exploratory Data Analysis Introduction to afternoon lab

8 Any questions as we get started?

9 Some beginning facts Regression is the workhorse of quantitative social science Much social science data is spatially referenced Spatially referenced data bring special problems to an analysis heterogeneity of observational units heteroskedasticity spatial autocorrelation residual dependence A consequence of these special problems is that the assumption of iid errors in a standard OLS regression specification is violated, and statistical inference from such a model is not valid

10 Motivation Omer R. Galle, Walter R. Gove, & J. Miller McPherson Population Density and Pathology: What Are the Relations for Man Science 176(4030):23-30 data: 75 community areas in Chicago for measures of social pathology as function of crowding, controlling for social class & ethnicity the greater the density, the greater the fertility (p. 176) Colin Loftin & Sally K. Ward A Spatial Autocorrelation Model of the Effects of Population Density on Fertility American Sociological Review 48(1): the GGM findings with regard to fertility are an artifact of the failure to recognize the presence of disturbance variables which are spatially autocorrelated (p. 127) Moral: When analyzing spatially referenced data, it s highly useful to know something about the rudiments of spatial data analysis (i.e., some understanding of why spatial is special )

11 Okay So why is spatial special? Scale dependency Robinson (ASR, 1950) Ecological Fallacy The relationship between ecological and individual correlations which is discussed in this paper provides a definite answer as to whether ecological correlations can validly be used as substitutes for individual correlations. They cannot. (p. 357) MAUP (Modifiable Areal Unit Problem) Habitual users of ecological correlations know that the size of the coefficient depends to a marked degree upon the number of sub-areas. [T]he size of the ecological correlation [will increase numerically as consolidation of smaller areas into larger areas takes place]. (Robinson, pp )

12 Why is spatial special? (2) Observational areas are generally of different size Spatial heterogeneity error heteroskedasticity

13 Counties in U.S. South: 2000 Census Brewster Co. TX: Area: 6,193 mi 2 Fairfax City, VA: Area: 6.3 mi 2 n = 1,387

14 Counties in U.S. South: 2000 Census Loving Co. TX: Pop: 67 Harris Co. TX: Pop: 3,400,578 n = 1,387

15 Why is spatial special? (3) Observational areas are generally of different size (geographic size; population size) Heteroskedasticity in regression errors Neighboring areas are similar Tobler s 1 st law of Geography: Everything is related to everything else, but near things are more related than distant things. (1970:236) (positive) spatial autocorrelation

16 Proportion of Children (under age 18) in Poverty: 2000 Census Loudoun Co. VA: Source: SF3 Table P87 Starr Co. TX: 0.595

17 Why is spatial special? (4) Observational areas are generally of different size Heteroskedasticity in regression errors Neighboring areas are similar Tobler s 1 st law of Geography: Everything is related to everything else, but near things are more related than distant things. (1970:236) (positive) spatial autocorrelation Probable stumbling blocks when modeling the data Again the assumption of iid errors in a standard OLS regression specification is violated and statistical inference is not valid

18 Spatial versus Non-Spatial Data Analysis

19 Take these two maps, for example Any traditional data analysis that does not utilize the location & spatial arrangement (topological information) of the data will lead to identical results for the two maps

20 And why is this important? What makes the methods of modern [spatial data analysis] different from many of their predecessors is that they have been developed with the recognition that spatial data have unique properties and that these properties make the use of methods borrowed from aspatial disciplines highly questionable Fotheringham, Brunsdon & Charlton, Quantitative Geography: Perspectives on Spatial Data Analysis Sage, 2000 p. xii

21 Because of these unique properties, if we blithely carry out an OLS regression using aggregated geographic data surely some large subset of the following undesirable horrors almost certainly awaits us (the curse of Tobler s 1 st Law) Our estimated regression coefficients are biased and inconsistent, or Our estimated regression coefficients are inefficient Our R 2 statistic is exaggerated We ve made incorrect inferences We ll never get it published or shouldn t!

22 Given these problems, why would anyone bother to analyze spatial data? There s lots of it! Occasional need for non-disclosure of individual-level data Space is important Space as a means of organizing human activities Location as a means of integrating interesting data Some interesting questions can (only) be examined with spatial data

23 Some interesting questions that might be addressed using modern spatial analysis: Local tax rates ( spillover in y?) Expenditures for police ( spillover in x?) Demographic analysis: Are Quitman & Tallahatchie counties (two contiguous counties in the Mississippi Delta) really two separate observations? ( spillover in ε?)

24 Now, Let s Define Some Terms

25 What exactly are spatial data? data where, in addition to attribute values relating to the primary phenomena of interest, the relative spatial locations of observations are also recorded

26 And what is spatial regression analysis? Regression using spatial data where explicit attention is given to location and arrangement of geographic units Even if we don t really care about spatial processes

27 Spatial Analysis versus Spatial Data Analysis

28 GIS Spatial Analysis P-median problems Maximal covering problem Location set covering problem Traveling salesman problem (TSP)

29 GIS Spatial Analysis Spatial Data Analysis Spatial Statistics

30 Spatial Data Analysis

31 GIS Spatial Analysis Spatial Data Analysis Spatial Statistics Event Data Lattice Data Spatial Econometrics Spatial Regression Analysis Geostatistical Data

32 Types of Spatial Data Event data (point data) Spatially continuous data (geostatistical data) Lattice data (regionalized data) Spatial interaction data (flow data)

33 Our focus of attention in this workshop will primarily be on Lattice Data On Friday we ll briefly touch on the other kinds of spatial data and other spatial data analytic approaches

34 Lattice Data Have one or more variables whose values are measured over a set of areas Interest focuses on the attribute values, not on the locations which are known and unchanging Objectives: Detecting patterns in the spatial arrangement of attribute values examining the relationships among the set of variables taking into account any spatial effects present Approach: Exploratory spatial data analysis Confirmatory spatial data analysis (spatial regression)

35 Some early cautions: Our goal is to correctly model and draw proper inferences about an unobserved, random DGP (random field) spatial process? Spatial heterogeneity? spatial dependence? time? sampling perspective? Spatial autocorrelation Scale issues; scale dependency; aggregation bias; boundary issues The tools are pretty good, but along the way many subjective decisions are made defining neighborhood choosing a weights matrix

36 Before taking a closer look at the meanings of Spatial Autocorrelation, Spatial Heterogeneity and Spatial Dependence, let s take a closer look at the traditional OLS regression model

37 Standard OLS Regression y i = β + β x + β x β x i 2 2i k ki ε i In matrix notation: y = X β + ε (n x 1) (n x k+1) (k+1 x 1) (n x 1) and where: βˆ = ( X T 1 X ) X T y σˆ 2 = 1 ( n k e 1) T e

38 For those of you who glaze over at the sight of that, the next 3 slides are for you

39 How do we get from this y i = β + β x + β x β x i 2 2i k ki ε i to this y = Xβ + ε???

40 ε β β β ε β β β ε β β β ε β β β ε β β β ε β β β = = = = = = x x y x x y x x y x x y x x y x x y Assume n = 6 and we have two independent variables x 1 and x 2 ε Xβ y + = is shorthand for the above

41 = y y y y y y y = x x x x x x x x x x x x X = β β β β = ε ε ε ε ε ε ε We get to the shorthand matrix algebra expression by realizing the following: ε Xβ y + = and

42 Okay but what about the assumptions underlying the OLS regression model? We must place some conditions both on the population and on the data to establish unbiasedness, consistency and efficiency These conditions are embodied in the Gauss-Markov Theorem

43 The Gauss-Markov Theorem asserts that is a Best Linear Unbiased Estimator (BLUE) of, provided the following assumptions are met: Linearity Mean independence E[ε i x i ] = 0 (implies E[ε] = 0) Homoskedasticity and uncorrelated disturbances Cov[ε] = E[ε ε ] = σ 2 I X is of rank k+1 (k = no. of independent vars.) X is non-stochastic (or stochastic with finite second moments, and E[X ε] = 0 for unbiasedness) Normal disturbance β $

44 To discuss these assumptions we need a few words to describe the performance of estimates Bias. An estimation method is unbiased if it produces estimates that have a statistical expectation equal to the true (population) value Consistency. Estimates converge toward the quantity being estimated as the sample size increases Efficiency. Efficient estimates are those that have smaller standard errors than estimates produced by some competing estimator

45 Regarding the OLS assumptions Linearity and mean independence support unbiased estimates Homoskedasticity and uncorrelated disturbances support efficiency Normality of disturbances means we can do statistical inference using t tables Normality also means we can estimate the model by MLE

46 It is partly with these OLS assumptions in mind that we stress the need to know our data EDA / ESDA We engage in EDA (ESDA) to determine how our data might lead to violations of one or more of the assumptions underlying our model regarding the unknown DGP

47 For example Are we starting out by maximizing the probability of obtaining a normal error structure? Why do we care about this? How can we check for this? What can we do about it? Do we have good linear relationships between our dependent variable and independent variables? Why do we care about this? How can we check for this? What can we do about it? Should any of our variables be transformed? Do you know how to proceed? Do we have any outliers? What kind of outliers? What options are available to us? Fortunately almost everything we ll want to do can be done within GeoDa TM And what we do in GeoDa, we ll try to replicate with R

48 GeoDa TM is a trademark of Luc Anselin & the University of Illinois

49 Exploring Spatial Data Goal is to seek good understanding and description of the data, thus suggesting hypotheses to explore Not much emphasis here on p-values, which are so ubiquitous in most of our training & our statistical instincts Look especially for clues to spatial heterogeneity or spatial dependence Few a priori assumptions about the data Robust ( resistant ) methods Analysis may sometimes end here EDA/ESDA: Despite good tools, at heart EDA is a philosophy; an attitude; best practice way of thinking about your data; a way of staying out of trouble

50 EDA / SDA Tools Maps Descriptive statistics Plots and graphics Classification and clustering methods Software with dynamically linked objects Global and local spatial autocorrelation Look for evidence of Tobler s First Law

51 Maps? Sure, but what kind? So some answers are: What are you trying to discover with the map? What are you wishing to show with the map? PPOV Std. PPOV Dev. Map BoxMap (1.5) PPOV Percentile PPOV Map Quantile Map (9)

52 Descriptive Statistics

53 Plots and Graphs

54 Plots and Graphs (cont.) Checks for normality and symmetry

55 Plots and Graphs (cont.) Checks for linearity & outliers

56 Tomorrow we ll take a close look at the concept of spatial autocorrelation But, in the context of assumptions underlying the OLS model, this raises some serious problems

57 Correlated Disturbances Cov[ε] = E[ε ε ] = σ 2 Σ σ 2 I This can arise from a number of violations of the OLS assumptions Often occurs when there is an important independent variable missing from the specification It causes OLS parameter estimates or their associated standard errors to be unreliable It inflates the value of the R 2 statistic

58 If non-zero covariance between X and ε E[X ε] 0 E[b] = E[(X X) -1 X y] = E[(X X) -1 X (Xβ + ε)] = β + (X X) -1 E[X ε] β OLS parameter estimates are biased plim[b] = β + plim[(x X/n) -1 ] x plim[x ε/n] OLS parameter estimates are inconsistent

59 Readings for today Anselin, Luc What is Special About Spatial Data? Alternative Perspectives on Spatial Data Analysis. NCGIA Technical Paper Ward, Michael D., and Kristian Skrede Gleditsch Spatial Regression Models. Quantitative Applications in the Social Sciences, No Thousand Oaks, CA: Sage. Chapter 1. [Book is available on line at: ] Goodchild, Michael F., Luc Anselin, Richard P. Applebaum, and Barbara Herr Harthorn Toward Spatially Integrated Social Science. International Regional Science Review 23: Loftin, Colin and Sally K. Ward A Spatial Autocorrelation Model of the Effects of Population Density on Fertility. American Sociological Review, 48(1): Galle, Omer R., Walter R. Gove, and J. Miller McPherson Population Density and Pathology: What Are the Relations for Man? Science (new series) 176:23-30.

60 Afternoon Lab Shapefiles and introduction to GeoDa TM and EDA using GeoDa & R

61 Questions?

Introduction to Spatial Statistics and Modeling for Regional Analysis

Introduction to Spatial Statistics and Modeling for Regional Analysis Introduction to Spatial Statistics and Modeling for Regional Analysis Dr. Xinyue Ye, Assistant Professor Center for Regional Development (Department of Commerce EDA University Center) & School of Earth,

More information

Community & Environmental Sociology/Sociology 977 Spatial Data Analysis

Community & Environmental Sociology/Sociology 977 Spatial Data Analysis Community & Environmental Sociology/Sociology 977 Spatial Data Analysis Spring 2012 Katherine Curtis Class Meeting: 301 Ag Hall, Labs 3218 SS 316B Ag Hall/4424 Social Sciences Class Hours: Thursdays, 1:20-3:15P

More information

Luc Anselin Spatial Analysis Laboratory Dept. Agricultural and Consumer Economics University of Illinois, Urbana-Champaign

Luc Anselin Spatial Analysis Laboratory Dept. Agricultural and Consumer Economics University of Illinois, Urbana-Champaign GIS and Spatial Analysis Luc Anselin Spatial Analysis Laboratory Dept. Agricultural and Consumer Economics University of Illinois, Urbana-Champaign http://sal.agecon.uiuc.edu Outline GIS and Spatial Analysis

More information

Spatial Analysis 1. Introduction

Spatial Analysis 1. Introduction Spatial Analysis 1 Introduction Geo-referenced Data (not any data) x, y coordinates (e.g., lat., long.) ------------------------------------------------------ - Table of Data: Obs. # x y Variables -------------------------------------

More information

Spatial Regression Modeling

Spatial Regression Modeling Spatial Regression Modeling Paul Voss & Katherine Curtis The Center for Spatially Integrated Social Science Santa Barbara, CA July 12-17, 2009 Day 4 Plan for today Focus on spatial heterogeneity A bit

More information

Exploratory Spatial Data Analysis (ESDA)

Exploratory Spatial Data Analysis (ESDA) Exploratory Spatial Data Analysis (ESDA) VANGHR s method of ESDA follows a typical geospatial framework of selecting variables, exploring spatial patterns, and regression analysis. The primary software

More information

GeoDa and Spatial Regression Modeling

GeoDa and Spatial Regression Modeling GeoDa and Spatial Regression Modeling June 9, 2006 Stephen A. Matthews Associate Professor of Sociology & Anthropology, Geography and Demography Director of the Geographic Information Analysis Core Population

More information

Lecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad

Lecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad Lecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad Key message Spatial dependence First Law of Geography (Waldo Tobler): Everything is related to everything else, but near things

More information

Lecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad

Lecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad Lecture 3: Exploratory Spatial Data Analysis (ESDA) Prof. Eduardo A. Haddad Key message Spatial dependence First Law of Geography (Waldo Tobler): Everything is related to everything else, but near things

More information

SPACE Workshop NSF NCGIA CSISS UCGIS SDSU. Aldstadt, Getis, Jankowski, Rey, Weeks SDSU F. Goodchild, M. Goodchild, Janelle, Rebich UCSB

SPACE Workshop NSF NCGIA CSISS UCGIS SDSU. Aldstadt, Getis, Jankowski, Rey, Weeks SDSU F. Goodchild, M. Goodchild, Janelle, Rebich UCSB SPACE Workshop NSF NCGIA CSISS UCGIS SDSU Aldstadt, Getis, Jankowski, Rey, Weeks SDSU F. Goodchild, M. Goodchild, Janelle, Rebich UCSB August 2-8, 2004 San Diego State University Some Examples of Spatial

More information

KAAF- GE_Notes GIS APPLICATIONS LECTURE 3

KAAF- GE_Notes GIS APPLICATIONS LECTURE 3 GIS APPLICATIONS LECTURE 3 SPATIAL AUTOCORRELATION. First law of geography: everything is related to everything else, but near things are more related than distant things Waldo Tobler Check who is sitting

More information

Spatial Regression. 3. Review - OLS and 2SLS. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

Spatial Regression. 3. Review - OLS and 2SLS. Luc Anselin.   Copyright 2017 by Luc Anselin, All Rights Reserved Spatial Regression 3. Review - OLS and 2SLS Luc Anselin http://spatial.uchicago.edu OLS estimation (recap) non-spatial regression diagnostics endogeneity - IV and 2SLS OLS Estimation (recap) Linear Regression

More information

Types of spatial data. The Nature of Geographic Data. Types of spatial data. Spatial Autocorrelation. Continuous spatial data: geostatistics

Types of spatial data. The Nature of Geographic Data. Types of spatial data. Spatial Autocorrelation. Continuous spatial data: geostatistics The Nature of Geographic Data Types of spatial data Continuous spatial data: geostatistics Samples may be taken at intervals, but the spatial process is continuous e.g. soil quality Discrete data Irregular:

More information

Regression III Lecture 1: Preliminary

Regression III Lecture 1: Preliminary Regression III Lecture 1: Preliminary Dave Armstrong University of Western Ontario Department of Political Science Department of Statistics and Actuarial Science (by courtesy) e: dave.armstrong@uwo.ca

More information

Spatial Regression. 1. Introduction and Review. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

Spatial Regression. 1. Introduction and Review. Luc Anselin.  Copyright 2017 by Luc Anselin, All Rights Reserved Spatial Regression 1. Introduction and Review Luc Anselin http://spatial.uchicago.edu matrix algebra basics spatial econometrics - definitions pitfalls of spatial analysis spatial autocorrelation spatial

More information

E 31501/4150 Properties of OLS estimators (Monte Carlo Analysis)

E 31501/4150 Properties of OLS estimators (Monte Carlo Analysis) E 31501/4150 Properties of OLS estimators (Monte Carlo Analysis) Ragnar Nymoen 10 February 2011 Repeated sampling Section 2.4.3 of the HGL book is called Repeated sampling The point is that by drawing

More information

Spatial Data Mining. Regression and Classification Techniques

Spatial Data Mining. Regression and Classification Techniques Spatial Data Mining Regression and Classification Techniques 1 Spatial Regression and Classisfication Discrete class labels (left) vs. continues quantities (right) measured at locations (2D for geographic

More information

Outline ESDA. Exploratory Spatial Data Analysis ESDA. Luc Anselin

Outline ESDA. Exploratory Spatial Data Analysis ESDA. Luc Anselin Exploratory Spatial Data Analysis ESDA Luc Anselin University of Illinois, Urbana-Champaign http://www.spacestat.com Outline ESDA Exploring Spatial Patterns Global Spatial Autocorrelation Local Spatial

More information

Introduction to Spatial Regression Analysis ICPSR 2014

Introduction to Spatial Regression Analysis ICPSR 2014 Introduction to Spatial Regression Analysis ICPSR 2014 Elisabeth Dowling Root (roote@colorado.edu) University of Colorado, Boulder Department of Geography and Institute of Behavioral Science 1440 15 th

More information

Outline. Overview of Issues. Spatial Regression. Luc Anselin

Outline. Overview of Issues. Spatial Regression. Luc Anselin Spatial Regression Luc Anselin University of Illinois, Urbana-Champaign http://www.spacestat.com Outline Overview of Issues Spatial Regression Specifications Space-Time Models Spatial Latent Variable Models

More information

Exploratory Spatial Data Analysis (And Navigating GeoDa)

Exploratory Spatial Data Analysis (And Navigating GeoDa) Exploratory Spatial Data Analysis (And Navigating GeoDa) June 9, 2006 Stephen A. Matthews Associate Professor of Sociology & Anthropology, Geography and Demography Director of the Geographic Information

More information

Mapping and Analysis for Spatial Social Science

Mapping and Analysis for Spatial Social Science Mapping and Analysis for Spatial Social Science Luc Anselin Spatial Analysis Laboratory Dept. Agricultural and Consumer Economics University of Illinois, Urbana-Champaign http://sal.agecon.uiuc.edu Outline

More information

Spatial Modeling, Regional Science, Arthur Getis Emeritus, San Diego State University March 1, 2016

Spatial Modeling, Regional Science, Arthur Getis Emeritus, San Diego State University March 1, 2016 Spatial Modeling, Regional Science, and UCSB Arthur Getis Emeritus, San Diego State University March 1, 2016 My Link to UCSB The 1980s at UCSB (summers and sabbatical) Problems within Geography: The Quantitative

More information

Michael Harrigan Office hours: Fridays 2:00-4:00pm Holden Hall

Michael Harrigan Office hours: Fridays 2:00-4:00pm Holden Hall Announcement New Teaching Assistant Michael Harrigan Office hours: Fridays 2:00-4:00pm Holden Hall 209 Email: michael.harrigan@ttu.edu Guofeng Cao, Texas Tech GIST4302/5302, Lecture 2: Review of Map Projection

More information

Nature of Spatial Data. Outline. Spatial Is Special

Nature of Spatial Data. Outline. Spatial Is Special Nature of Spatial Data Outline Spatial is special Bad news: the pitfalls of spatial data Good news: the potentials of spatial data Spatial Is Special Are spatial data special? Why spatial data require

More information

Spatial Tools for Econometric and Exploratory Analysis

Spatial Tools for Econometric and Exploratory Analysis Spatial Tools for Econometric and Exploratory Analysis Michael F. Goodchild University of California, Santa Barbara Luc Anselin University of Illinois at Urbana-Champaign http://csiss.org Outline A Quick

More information

Lecture 5 Geostatistics

Lecture 5 Geostatistics Lecture 5 Geostatistics Lecture Outline Spatial Estimation Spatial Interpolation Spatial Prediction Sampling Spatial Interpolation Methods Spatial Prediction Methods Interpolating Raster Surfaces with

More information

CSISS Tools and Spatial Analysis Software

CSISS Tools and Spatial Analysis Software CSISS Tools and Spatial Analysis Software June 5, 2006 Stephen A. Matthews Associate Professor of Sociology & Anthropology, Geography and Demography Director of the Geographic Information Analysis Core

More information

LECTURE 11. Introduction to Econometrics. Autocorrelation

LECTURE 11. Introduction to Econometrics. Autocorrelation LECTURE 11 Introduction to Econometrics Autocorrelation November 29, 2016 1 / 24 ON PREVIOUS LECTURES We discussed the specification of a regression equation Specification consists of choosing: 1. correct

More information

Spatial Data, Spatial Analysis and Spatial Data Science

Spatial Data, Spatial Analysis and Spatial Data Science Spatial Data, Spatial Analysis and Spatial Data Science Luc Anselin http://spatial.uchicago.edu 1 spatial thinking in the social sciences spatial analysis spatial data science spatial data types and research

More information

Introduction. Part I: Quick run through of ESDA checklist on our data

Introduction. Part I: Quick run through of ESDA checklist on our data CSDE GIS Workshop Series Spatial Regression Chris Fowler csfowler@uw.edu Introduction The goal of these exercises is to give you a chance to put the concepts we have just discussed into practice. Keep

More information

Internal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9.

Internal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9. Section 7 Model Assessment This section is based on Stock and Watson s Chapter 9. Internal vs. external validity Internal validity refers to whether the analysis is valid for the population and sample

More information

Introduction to Spatial Regression Analysis ICPSR Summer Program University of North Carolina at Chapel Hill. University of Wisconsin-Madison

Introduction to Spatial Regression Analysis ICPSR Summer Program University of North Carolina at Chapel Hill. University of Wisconsin-Madison Introduction to Spatial Regression Analysis ICPSR Summer Program 2012 Paul R. Voss 1 and Katherine J. Curtis 2 1 University of North Carolina at Chapel Hill 2 University of Wisconsin-Madison 1 Odum Institute

More information

Concepts and Applications of Kriging. Eric Krause

Concepts and Applications of Kriging. Eric Krause Concepts and Applications of Kriging Eric Krause Sessions of note Tuesday ArcGIS Geostatistical Analyst - An Introduction 8:30-9:45 Room 14 A Concepts and Applications of Kriging 10:15-11:30 Room 15 A

More information

Steps in Regression Analysis

Steps in Regression Analysis MGMG 522 : Session #2 Learning to Use Regression Analysis & The Classical Model (Ch. 3 & 4) 2-1 Steps in Regression Analysis 1. Review the literature and develop the theoretical model 2. Specify the model:

More information

A spatial literacy initiative for undergraduate education at UCSB

A spatial literacy initiative for undergraduate education at UCSB A spatial literacy initiative for undergraduate education at UCSB Mike Goodchild & Don Janelle Department of Geography / spatial@ucsb University of California, Santa Barbara ThinkSpatial Brown bag forum

More information

Econometrics Summary Algebraic and Statistical Preliminaries

Econometrics Summary Algebraic and Statistical Preliminaries Econometrics Summary Algebraic and Statistical Preliminaries Elasticity: The point elasticity of Y with respect to L is given by α = ( Y/ L)/(Y/L). The arc elasticity is given by ( Y/ L)/(Y/L), when L

More information

Concepts and Applications of Kriging

Concepts and Applications of Kriging 2013 Esri International User Conference July 8 12, 2013 San Diego, California Technical Workshop Concepts and Applications of Kriging Eric Krause Konstantin Krivoruchko Outline Intro to interpolation Exploratory

More information

1 Motivation for Instrumental Variable (IV) Regression

1 Motivation for Instrumental Variable (IV) Regression ECON 370: IV & 2SLS 1 Instrumental Variables Estimation and Two Stage Least Squares Econometric Methods, ECON 370 Let s get back to the thiking in terms of cross sectional (or pooled cross sectional) data

More information

Lecture 8. Spatial Estimation

Lecture 8. Spatial Estimation Lecture 8 Spatial Estimation Lecture Outline Spatial Estimation Spatial Interpolation Spatial Prediction Sampling Spatial Interpolation Methods Spatial Prediction Methods Interpolating Raster Surfaces

More information

Modeling the Ecology of Urban Inequality in Space and Time

Modeling the Ecology of Urban Inequality in Space and Time Modeling the Ecology of Urban Inequality in Space and Time Corina Graif PhD Candidate, Department Of Sociology Harvard University Presentation for the Workshop on Spatial and Temporal Modeling, Center

More information

Concepts and Applications of Kriging. Eric Krause Konstantin Krivoruchko

Concepts and Applications of Kriging. Eric Krause Konstantin Krivoruchko Concepts and Applications of Kriging Eric Krause Konstantin Krivoruchko Outline Introduction to interpolation Exploratory spatial data analysis (ESDA) Using the Geostatistical Wizard Validating interpolation

More information

What s special about spatial data?

What s special about spatial data? What s special about spatial data? Road map Geographic Information analysis The need to develop spatial thinking Some fundamental geographic concepts (PBCS) What are the effects of space? Spatial autocorrelation

More information

The Use of Spatial Weights Matrices and the Effect of Geometry and Geographical Scale

The Use of Spatial Weights Matrices and the Effect of Geometry and Geographical Scale The Use of Spatial Weights Matrices and the Effect of Geometry and Geographical Scale António Manuel RODRIGUES 1, José António TENEDÓRIO 2 1 Research fellow, e-geo Centre for Geography and Regional Planning,

More information

Everything is related to everything else, but near things are more related than distant things.

Everything is related to everything else, but near things are more related than distant things. SPATIAL ANALYSIS DR. TRIS ERYANDO, MA Everything is related to everything else, but near things are more related than distant things. (attributed to Tobler) WHAT IS SPATIAL DATA? 4 main types event data,

More information

Exploratory Spatial Data Analysis Using GeoDA: : An Introduction

Exploratory Spatial Data Analysis Using GeoDA: : An Introduction Exploratory Spatial Data Analysis Using GeoDA: : An Introduction Prepared by Professor Ravi K. Sharma, University of Pittsburgh Modified for NBDPN 2007 Conference Presentation by Professor Russell S. Kirby,

More information

Concepts and Applications of Kriging

Concepts and Applications of Kriging Esri International User Conference San Diego, California Technical Workshops July 24, 2012 Concepts and Applications of Kriging Konstantin Krivoruchko Eric Krause Outline Intro to interpolation Exploratory

More information

Econometric Modelling Prof. Rudra P. Pradhan Department of Management Indian Institute of Technology, Kharagpur

Econometric Modelling Prof. Rudra P. Pradhan Department of Management Indian Institute of Technology, Kharagpur Econometric Modelling Prof. Rudra P. Pradhan Department of Management Indian Institute of Technology, Kharagpur Module No. # 01 Lecture No. # 28 LOGIT and PROBIT Model Good afternoon, this is doctor Pradhan

More information

Running head: GEOGRAPHICALLY WEIGHTED REGRESSION 1. Geographically Weighted Regression. Chelsey-Ann Cu GEOB 479 L2A. University of British Columbia

Running head: GEOGRAPHICALLY WEIGHTED REGRESSION 1. Geographically Weighted Regression. Chelsey-Ann Cu GEOB 479 L2A. University of British Columbia Running head: GEOGRAPHICALLY WEIGHTED REGRESSION 1 Geographically Weighted Regression Chelsey-Ann Cu 32482135 GEOB 479 L2A University of British Columbia Dr. Brian Klinkenberg 9 February 2018 GEOGRAPHICALLY

More information

Spatial Effects and Externalities

Spatial Effects and Externalities Spatial Effects and Externalities Philip A. Viton November 5, Philip A. Viton CRP 66 Spatial () Externalities November 5, / 5 Introduction If you do certain things to your property for example, paint your

More information

Ref.: Spring SOS3003 Applied data analysis for social science Lecture note

Ref.:   Spring SOS3003 Applied data analysis for social science Lecture note SOS3003 Applied data analysis for social science Lecture note 05-2010 Erling Berge Department of sociology and political science NTNU Spring 2010 Erling Berge 2010 1 Literature Regression criticism I Hamilton

More information

Ordinary Least Squares Regression

Ordinary Least Squares Regression Ordinary Least Squares Regression Goals for this unit More on notation and terminology OLS scalar versus matrix derivation Some Preliminaries In this class we will be learning to analyze Cross Section

More information

1. The OLS Estimator. 1.1 Population model and notation

1. The OLS Estimator. 1.1 Population model and notation 1. The OLS Estimator OLS stands for Ordinary Least Squares. There are 6 assumptions ordinarily made, and the method of fitting a line through data is by least-squares. OLS is a common estimation methodology

More information

Spatial Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University

Spatial Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University Spatial Analysis and Modeling (GIST 4302/5302) Guofeng Cao Department of Geosciences Texas Tech University TTU Graduate Certificate Geographic Information Science and Technology (GIST) 3 Core Courses and

More information

OPEN GEODA WORKSHOP / CRASH COURSE FACILITATED BY M. KOLAK

OPEN GEODA WORKSHOP / CRASH COURSE FACILITATED BY M. KOLAK OPEN GEODA WORKSHOP / CRASH COURSE FACILITATED BY M. KOLAK WHAT IS GEODA? Software program that serves as an introduction to spatial data analysis Free Open Source Source code is available under GNU license

More information

PBAF 528 Week 8. B. Regression Residuals These properties have implications for the residuals of the regression.

PBAF 528 Week 8. B. Regression Residuals These properties have implications for the residuals of the regression. PBAF 528 Week 8 What are some problems with our model? Regression models are used to represent relationships between a dependent variable and one or more predictors. In order to make inference from the

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 6: Bias and variance (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 49 Our plan today We saw in last lecture that model scoring methods seem to be trading off two different

More information

Spatial analysis. Spatial descriptive analysis. Spatial inferential analysis:

Spatial analysis. Spatial descriptive analysis. Spatial inferential analysis: Spatial analysis Spatial descriptive analysis Point pattern analysis (minimum bounding box, mean center, weighted mean center, standard distance, nearest neighbor analysis) Spatial clustering analysis

More information

GIS and Spatial Statistics: One World View or Two? Michael F. Goodchild University of California Santa Barbara

GIS and Spatial Statistics: One World View or Two? Michael F. Goodchild University of California Santa Barbara GIS and Spatial Statistics: One World View or Two? Michael F. Goodchild University of California Santa Barbara Location as attribute The data table Census summary table What value is location as an explanatory

More information

review session gov 2000 gov 2000 () review session 1 / 38

review session gov 2000 gov 2000 () review session 1 / 38 review session gov 2000 gov 2000 () review session 1 / 38 Overview Random Variables and Probability Univariate Statistics Bivariate Statistics Multivariate Statistics Causal Inference gov 2000 () review

More information

Introduction to Econometrics. Heteroskedasticity

Introduction to Econometrics. Heteroskedasticity Introduction to Econometrics Introduction Heteroskedasticity When the variance of the errors changes across segments of the population, where the segments are determined by different values for the explanatory

More information

LECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity

LECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity LECTURE 10 Introduction to Econometrics Multicollinearity & Heteroskedasticity November 22, 2016 1 / 23 ON PREVIOUS LECTURES We discussed the specification of a regression equation Specification consists

More information

A GEOSTATISTICAL APPROACH TO PREDICTING A PHYSICAL VARIABLE THROUGH A CONTINUOUS SURFACE

A GEOSTATISTICAL APPROACH TO PREDICTING A PHYSICAL VARIABLE THROUGH A CONTINUOUS SURFACE Katherine E. Williams University of Denver GEOG3010 Geogrpahic Information Analysis April 28, 2011 A GEOSTATISTICAL APPROACH TO PREDICTING A PHYSICAL VARIABLE THROUGH A CONTINUOUS SURFACE Overview Data

More information

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i, A Course in Applied Econometrics Lecture 18: Missing Data Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. When Can Missing Data be Ignored? 2. Inverse Probability Weighting 3. Imputation 4. Heckman-Type

More information

GIST 4302/5302: Spatial Analysis and Modeling

GIST 4302/5302: Spatial Analysis and Modeling GIST 4302/5302: Spatial Analysis and Modeling Basics of Statistics Guofeng Cao www.myweb.ttu.edu/gucao Department of Geosciences Texas Tech University guofeng.cao@ttu.edu Spring 2015 Outline of This Week

More information

1 The Multiple Regression Model: Freeing Up the Classical Assumptions

1 The Multiple Regression Model: Freeing Up the Classical Assumptions 1 The Multiple Regression Model: Freeing Up the Classical Assumptions Some or all of classical assumptions were crucial for many of the derivations of the previous chapters. Derivation of the OLS estimator

More information

GIST 4302/5302: Spatial Analysis and Modeling

GIST 4302/5302: Spatial Analysis and Modeling GIST 4302/5302: Spatial Analysis and Modeling Fall 2015 Lectures: Tuesdays & Thursdays 2:00pm-2:50pm, Science 234 Lab sessions: Tuesdays or Thursdays 3:00pm-4:50pm or Friday 9:00am-10:50am, Holden 204

More information

SPACE Workshop Santa Barbara, California July 2007

SPACE Workshop Santa Barbara, California July 2007 SPACE Workshop Santa Barbara, California 15 20 July 2007 Modeling a Center for Spatially Integrated Social Science Critical Themes in Social Science + Tools and Concepts for Spatial Thinking + Infrastructure

More information

GIST 4302/5302: Spatial Analysis and Modeling

GIST 4302/5302: Spatial Analysis and Modeling GIST 4302/5302: Spatial Analysis and Modeling Spring 2016 Lectures: Tuesdays & Thursdays 12:30pm-1:20pm, Science 234 Labs: GIST 4302: Monday 1:00-2:50pm or Tuesday 2:00-3:50pm GIST 5302: Wednesday 2:00-3:50pm

More information

Non-Spherical Errors

Non-Spherical Errors Non-Spherical Errors Krishna Pendakur February 15, 2016 1 Efficient OLS 1. Consider the model Y = Xβ + ε E [X ε = 0 K E [εε = Ω = σ 2 I N. 2. Consider the estimated OLS parameter vector ˆβ OLS = (X X)

More information

Week 11 Heteroskedasticity and Autocorrelation

Week 11 Heteroskedasticity and Autocorrelation Week 11 Heteroskedasticity and Autocorrelation İnsan TUNALI Econ 511 Econometrics I Koç University 27 November 2018 Lecture outline 1. OLS and assumptions on V(ε) 2. Violations of V(ε) σ 2 I: 1. Heteroskedasticity

More information

Departamento de Economía Universidad de Chile

Departamento de Economía Universidad de Chile Departamento de Economía Universidad de Chile GRADUATE COURSE SPATIAL ECONOMETRICS November 14, 16, 17, 20 and 21, 2017 Prof. Henk Folmer University of Groningen Objectives The main objective of the course

More information

Representation of Geographic Data

Representation of Geographic Data GIS 5210 Week 2 The Nature of Spatial Variation Three principles of the nature of spatial variation: proximity effects are key to understanding spatial variation issues of geographic scale and level of

More information

Lectures 5 & 6: Hypothesis Testing

Lectures 5 & 6: Hypothesis Testing Lectures 5 & 6: Hypothesis Testing in which you learn to apply the concept of statistical significance to OLS estimates, learn the concept of t values, how to use them in regression work and come across

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple

More information

GIST 4302/5302: Spatial Analysis and Modeling

GIST 4302/5302: Spatial Analysis and Modeling GIST 4302/5302: Spatial Analysis and Modeling Spring 2014 Lectures: Tuesdays & Thursdays 2:00pm-2:50pm, Holden Hall 00038 Lab sessions: Tuesdays or Thursdays 3:00pm-4:50pm or Wednesday 1:00pm-2:50pm, Holden

More information

COS513: FOUNDATIONS OF PROBABILISTIC MODELS LECTURE 9: LINEAR REGRESSION

COS513: FOUNDATIONS OF PROBABILISTIC MODELS LECTURE 9: LINEAR REGRESSION COS513: FOUNDATIONS OF PROBABILISTIC MODELS LECTURE 9: LINEAR REGRESSION SEAN GERRISH AND CHONG WANG 1. WAYS OF ORGANIZING MODELS In probabilistic modeling, there are several ways of organizing models:

More information

The Case for Space in the Social Sciences

The Case for Space in the Social Sciences The Case for Space in the Social Sciences Don Janelle Center for Spatially Integrated Social Science University of California, Santa Barbara Roundtable on Geographical Voices and Geographical Analysis

More information

Homoskedasticity. Var (u X) = σ 2. (23)

Homoskedasticity. Var (u X) = σ 2. (23) Homoskedasticity How big is the difference between the OLS estimator and the true parameter? To answer this question, we make an additional assumption called homoskedasticity: Var (u X) = σ 2. (23) This

More information

Business Economics BUSINESS ECONOMICS. PAPER No. : 8, FUNDAMENTALS OF ECONOMETRICS MODULE No. : 3, GAUSS MARKOV THEOREM

Business Economics BUSINESS ECONOMICS. PAPER No. : 8, FUNDAMENTALS OF ECONOMETRICS MODULE No. : 3, GAUSS MARKOV THEOREM Subject Business Economics Paper No and Title Module No and Title Module Tag 8, Fundamentals of Econometrics 3, The gauss Markov theorem BSE_P8_M3 1 TABLE OF CONTENTS 1. INTRODUCTION 2. ASSUMPTIONS OF

More information

Statistics: A review. Why statistics?

Statistics: A review. Why statistics? Statistics: A review Why statistics? What statistical concepts should we know? Why statistics? To summarize, to explore, to look for relations, to predict What kinds of data exist? Nominal, Ordinal, Interval

More information

Overview of Statistical Analysis of Spatial Data

Overview of Statistical Analysis of Spatial Data Overview of Statistical Analysis of Spatial Data Geog 2C Introduction to Spatial Data Analysis Phaedon C. Kyriakidis www.geog.ucsb.edu/ phaedon Department of Geography University of California Santa Barbara

More information

Introduction to Econometrics Midterm Examination Fall 2005 Answer Key

Introduction to Econometrics Midterm Examination Fall 2005 Answer Key Introduction to Econometrics Midterm Examination Fall 2005 Answer Key Please answer all of the questions and show your work Clearly indicate your final answer to each question If you think a question is

More information

A Space-Time Model of Fertility and Development in China, Katherine King University of Michigan

A Space-Time Model of Fertility and Development in China, Katherine King University of Michigan A Space-Time Model of Fertility and Development in China, 1982-2000 Katherine King University of Michigan Abstract Although China is extremely regionally diverse and fertility policy is implemented at

More information

GEOG 3340: Introduction to Human Geography Research

GEOG 3340: Introduction to Human Geography Research GEOG 3340: Introduction to Human Geography Research Lecture 1: Course Overview Guofeng Cao www.myweb.ttu.edu/gucao Department of Geosciences Texas Tech University guofeng.cao@ttu.edu Fall 2015 Course Description

More information

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018 Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate

More information

2 Prediction and Analysis of Variance

2 Prediction and Analysis of Variance 2 Prediction and Analysis of Variance Reading: Chapters and 2 of Kennedy A Guide to Econometrics Achen, Christopher H. Interpreting and Using Regression (London: Sage, 982). Chapter 4 of Andy Field, Discovering

More information

ECNS 561 Multiple Regression Analysis

ECNS 561 Multiple Regression Analysis ECNS 561 Multiple Regression Analysis Model with Two Independent Variables Consider the following model Crime i = β 0 + β 1 Educ i + β 2 [what else would we like to control for?] + ε i Here, we are taking

More information

Exploratory Spatial Data Analysis and GeoDa

Exploratory Spatial Data Analysis and GeoDa Exploratory Spatial Data Analysis and GeoDa Luc Anselin Spatial Analysis Laboratory Dept. Agricultural and Consumer Economics University of Illinois, Urbana-Champaign http://sal.agecon.uiuc.edu Outline

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)

More information

Linear Models in Econometrics

Linear Models in Econometrics Linear Models in Econometrics Nicky Grant At the most fundamental level econometrics is the development of statistical techniques suited primarily to answering economic questions and testing economic theories.

More information

Outline. Introduction to SpaceStat and ESTDA. ESTDA & SpaceStat. Learning Objectives. Space-Time Intelligence System. Space-Time Intelligence System

Outline. Introduction to SpaceStat and ESTDA. ESTDA & SpaceStat. Learning Objectives. Space-Time Intelligence System. Space-Time Intelligence System Outline I Data Preparation Introduction to SpaceStat and ESTDA II Introduction to ESTDA and SpaceStat III Introduction to time-dynamic regression ESTDA ESTDA & SpaceStat Learning Objectives Activities

More information

An overview of applied econometrics

An overview of applied econometrics An overview of applied econometrics Jo Thori Lind September 4, 2011 1 Introduction This note is intended as a brief overview of what is necessary to read and understand journal articles with empirical

More information

Neighborhood social characteristics and chronic disease outcomes: does the geographic scale of neighborhood matter? Malia Jones

Neighborhood social characteristics and chronic disease outcomes: does the geographic scale of neighborhood matter? Malia Jones Neighborhood social characteristics and chronic disease outcomes: does the geographic scale of neighborhood matter? Malia Jones Prepared for consideration for PAA 2013 Short Abstract Empirical research

More information

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Econometrics Week 8 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 25 Recommended Reading For the today Instrumental Variables Estimation and Two Stage

More information

Heteroskedasticity in Panel Data

Heteroskedasticity in Panel Data Essex Summer School in Social Science Data Analysis Panel Data Analysis for Comparative Research Heteroskedasticity in Panel Data Christopher Adolph Department of Political Science and Center for Statistics

More information

Reliability of inference (1 of 2 lectures)

Reliability of inference (1 of 2 lectures) Reliability of inference (1 of 2 lectures) Ragnar Nymoen University of Oslo 5 March 2013 1 / 19 This lecture (#13 and 14): I The optimality of the OLS estimators and tests depend on the assumptions of

More information

GIST 4302/5302: Spatial Analysis and Modeling Lecture 2: Review of Map Projections and Intro to Spatial Analysis

GIST 4302/5302: Spatial Analysis and Modeling Lecture 2: Review of Map Projections and Intro to Spatial Analysis GIST 4302/5302: Spatial Analysis and Modeling Lecture 2: Review of Map Projections and Intro to Spatial Analysis Guofeng Cao http://www.spatial.ttu.edu Department of Geosciences Texas Tech University guofeng.cao@ttu.edu

More information

Topic 7: HETEROSKEDASTICITY

Topic 7: HETEROSKEDASTICITY Universidad Carlos III de Madrid César Alonso ECONOMETRICS Topic 7: HETEROSKEDASTICITY Contents 1 Introduction 1 1.1 Examples............................. 1 2 The linear regression model with heteroskedasticity

More information

Heteroskedasticity in Panel Data

Heteroskedasticity in Panel Data Essex Summer School in Social Science Data Analysis Panel Data Analysis for Comparative Research Heteroskedasticity in Panel Data Christopher Adolph Department of Political Science and Center for Statistics

More information