Geographically Weighted Regression, Lecture 2: Introduction to GWR II


Stewart.Fotheringham@nuim.ie
http://ncg.nuim.ie/gwr

A Simulation Experiment

$Y_i = \alpha_i + \beta_{1i} X_{1i} + \beta_{2i} X_{2i}$

Data on $X_1$ and $X_2$ are drawn randomly for 2,500 locations on a 50 x 50 matrix such that $r(X_1, X_2)$ is controlled. Results are shown to be independent of $r(X_1, X_2)$.

Experiment 1 (parameters spatially invariant):
- $\alpha_i = 10$ for all $i$
- $\beta_{1i} = 3$ for all $i$
- $\beta_{2i} = -5$ for all $i$

$Y_i$ is obtained from the equation above. The data are used to calibrate the model by global regression and by GWR.
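The set-up of Experiment 1 can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the slides do not say which distribution $X_1$ and $X_2$ are drawn from (a standard normal is assumed here), the two covariates are drawn independently rather than with a controlled correlation, and no error term is added to $Y_i$.

```python
import numpy as np

rng = np.random.default_rng(0)  # seed chosen only for reproducibility

# 2,500 locations on a 50 x 50 grid
n_side = 50
n = n_side * n_side

# Covariates drawn at random (standard normal and independence are assumptions)
x1 = rng.standard_normal(n)
x2 = rng.standard_normal(n)

# Experiment 1: spatially invariant parameters
alpha, beta1, beta2 = 10.0, 3.0, -5.0
y = alpha + beta1 * x1 + beta2 * x2  # no error term is added (an assumption)

# Global regression by ordinary least squares
X = np.column_stack([np.ones(n), x1, x2])
coef, *_ = np.linalg.lstsq(X, y, rcond=None)
print(np.round(coef, 6))
```

Because the data are generated without noise, the global regression recovers the three parameters up to floating-point error.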

Results

Global: Adj. $R^2$ = 1.0; AIC = -59,390; K = 3
- $\alpha$ (est.) = 10; $\beta_1$ (est.) = 3; $\beta_2$ (est.) = -5

GWR: Adj. $R^2$ = 1.0; AIC = -59,386; K = 6.5; N = 2,434
- $\alpha_i$ (est.) = 10 for all $i$
- $\beta_{1i}$ (est.) = 3 for all $i$
- $\beta_{2i}$ (est.) = -5 for all $i$

Conclusion: GWR does NOT appear to suggest any spurious nonstationarity when relationships are constant.

Experiment 2 (parameters spatially variant), with $0 \le i \le 50$ and $0 \le j \le 50$:
- $\alpha_i = 0 + 0.2i + 0.2j$ (range 0 to 20)
- $\beta_{1i} = -5 + 0.1i + 0.1j$ (range -5 to 5)
- $\beta_{2i} = -5 + 0.2i + 0.2j$ (range -5 to 15)

$Y_i$ is obtained in the same way. The data are used to calibrate the model by global regression and by GWR.

Results

Global: Adj. $R^2$ = 0.04; AIC = 17,046; K = 3
- $\alpha$ (est.) = 10.26; $\beta_1$ (est.) = -0.1; $\beta_2$ (est.) = 5.28
- These are close to the averages of the local estimates (10, 0, 5)

GWR: Adj. $R^2$ = 0.997; AIC = 2,218; K = 167; N = 129
- $\alpha_i$ (est.) range = 2 to 18.6
- $\beta_{1i}$ (est.) range = -4.3 to 4.7
- $\beta_{2i}$ (est.) range = -3.9 to 13.6

Conclusion: GWR identifies spatial nonstationarity in relationships; the global model fails completely.
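The observation that the global estimates sit close to the averages of the local parameters can be illustrated with a small simulation. This is a sketch under assumptions, not a reproduction of the experiment: the grid spacing (50 evenly spaced coordinates from 0 to 50) and the standard normal covariates are guesses, so the fit statistics will not match the figures reported on the slide.

```python
import numpy as np

rng = np.random.default_rng(1)  # seed chosen only for reproducibility

n_side = 50
# Grid coordinates spanning 0..50 in each direction (assumed spacing)
axis = np.linspace(0.0, 50.0, n_side)
i, j = np.meshgrid(axis, axis, indexing="ij")
i, j = i.ravel(), j.ravel()

# Experiment 2 parameter surfaces
alpha = 0.0 + 0.2 * i + 0.2 * j
beta1 = -5.0 + 0.1 * i + 0.1 * j
beta2 = -5.0 + 0.2 * i + 0.2 * j

# Covariates (standard normal is an assumption) and the response
x1 = rng.standard_normal(i.size)
x2 = rng.standard_normal(i.size)
y = alpha + beta1 * x1 + beta2 * x2

# Global regression ignores the spatial drift in the parameters
X = np.column_stack([np.ones(i.size), x1, x2])
global_coef, *_ = np.linalg.lstsq(X, y, rcond=None)
resid = y - X @ global_coef
r2 = 1.0 - resid.var() / y.var()

local_means = np.array([alpha.mean(), beta1.mean(), beta2.mean()])
print("global estimates:        ", np.round(global_coef, 2))
print("mean of local parameters:", np.round(local_means, 2))
print("global R^2:", round(r2, 3))
```

The global coefficients land near the surface means (10, 0, 5), while a large share of the variation in $Y_i$ is left unexplained because a single set of coefficients cannot follow the drift.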

[Figure: parameter surfaces for $\alpha_i$ (0 to 20), $\beta_{1i}$ (-5 to 5) and $\beta_{2i}$ (-5 to 15)]

An Empirical Example: House Prices in London
- 1990 sales price data for 12,493 houses in London (excluding houses sold below market value), along with various attributes of each property and a postcode
- Locations accurate to 100 m can be obtained via the Central Postcode Directory
- Neighbourhood data obtained for enumeration districts (EDs) via a postcode-to-ED look-up table (LUT)

Locations of house sales in data set

To what extent are differences in average house prices a function of differences in the intrinsic value associated with different areas and to what extent are they due to different mixes of properties? To answer this, we need regression techniques to account for variations in housing attributes so that we can derive a comparable value per sq.m.

Hedonic Price Modelling (very popular technique)

Basic premise: $P_i = f[S(i), N(i)]$, following Lancaster (1966) J. Political Economy

Overviews:
- Meen and Andrew (1998) Modelling Regional House Prices: A Review of the Literature. DETR
- Orford (1999) Valuing the Built Environment: GIS and House Price Analysis. Ashgate: Aldershot

Issues: Almost all applications are global, implying no coefficient variation over space, whereas several authors have argued that the assumption of uniform price coefficients is unrealistic even within a single metropolitan area.

Global Regression Parameter Estimates

Variable       Parameter estimate    t value
Intercept                  58,900       23.3
FLRAREA                       697       49.3
FLRDETACH*                    205        7.5
FLRFLAT*                     -123       -5.6
FLRBNGLW*                     -87       -1.4
FLRTRRCD*                    -119       -6.2
BLDPWW1**                  -2,340       -3.9
BLDPOSTW**                 -2,786       -3.1
BLD60S**                   -5,177       -5.0
BLD70S**                   -2,421       -2.1
BLD80S**                    6,315        6.9
GARAGE                      5,956       10.6
CENHEAT                     7,777       12.4
BATH2+                     22,297       19.1
PROF                           72        3.0
UNEMPLOY                     -211       -5.5
ln(distcl)                -18,137      -30.1

$R^2$ = 0.60
* Excluded house type is semi-detached
** Excluded age is inter-war (1914-1939)

Price per Square Metre of Various House Types, Estimated from the Global Regression Results

House type       Price per sq. m (£)
Detached                         902
Semi-detached                    697
Bungalow                         610
Terraced                         578
Flat                             574
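The prices per square metre follow directly from the global regression coefficients: the FLRAREA coefficient gives the value for the excluded type (semi-detached), and each house-type coefficient is added to it. A minimal sketch of that arithmetic, assuming the FLR* variables are floor-area interactions with house type as the table suggests:

```python
# Coefficients taken from the global regression table
FLRAREA = 697  # baseline: semi-detached, the excluded house type

# Offsets relative to the baseline (assumed to be floor-area interactions)
type_offsets = {
    "Detached": 205,       # FLRDETACH
    "Semi-detached": 0,    # excluded category
    "Bungalow": -87,       # FLRBNGLW
    "Terraced": -119,      # FLRTRRCD
    "Flat": -123,          # FLRFLAT
}

price_per_sqm = {house: FLRAREA + off for house, off in type_offsets.items()}
for house, price in price_per_sqm.items():
    print(f"{house:<14} {price}")
```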

Price Comparisons of Equivalent Houses by Period Built

Each entry is the estimated price difference between a house built in the row period and an equivalent house built in the column period.

Period        Pre-1914  1914-1939  1940-1959  1960-1969  1970-1979  1980-1989
Pre-1914             -     -2,340        446      2,837         81     -8,655
1914-1939        2,340          -      2,786      5,177      2,421     -6,315
1940-1959         -446     -2,786          -      2,391       -365     -9,101
1960-1969       -2,837     -5,177     -2,391          -     -2,756    -11,492
1970-1979          -81     -2,421        365      2,756          -     -8,736
1980-1989        8,655      6,315      9,101     11,492      8,736          -
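The comparison matrix can be rebuilt from the age coefficients of the global regression, since each entry is the row coefficient minus the column coefficient. The sketch below assumes BLDPOSTW corresponds to the 1940-1959 period, which is consistent with the differences in the table.

```python
import numpy as np

periods = ["Pre-1914", "1914-1939", "1940-1959",
           "1960-1969", "1970-1979", "1980-1989"]

# Age coefficients from the global regression; inter-war is the excluded
# category, so its coefficient is 0
coef = np.array([-2340, 0, -2786, -5177, -2421, 6315])

# diff[r, c] = price of a house built in period r minus the price of an
# equivalent house built in period c
diff = coef[:, None] - coef[None, :]

print(" " * 10 + "".join(f"{p:>11}" for p in periods))
for name, row in zip(periods, diff):
    print(f"{name:<10}" + "".join(f"{int(v):>11,}" for v in row))
```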

However, these are all global results, i.e. averages over the whole of London. Might there be differences across London in some of these relationships?

Using GWR
- In this case an adaptive kernel is used: a bisquare function
- Calibration yielded an optimal number of nearest neighbours = 931
- Results are presented in a series of parameter surfaces; those shown all have significant spatial variation
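A rough sketch of how an adaptive bisquare kernel feeds into the local regressions is given below. It is not the GWR 3.0 implementation. It assumes the common bisquare form $w = (1 - (d/h)^2)^2$ for $d < h$ and 0 otherwise, with $h$ set to the distance from the regression point to its N-th nearest observation (counting the point itself), and it uses synthetic data rather than the London data set. The function names are hypothetical.

```python
import numpy as np

def adaptive_bisquare_weights(coords, point, n_neighbours):
    """Bisquare weights; the bandwidth is the distance to the n-th nearest observation."""
    d = np.linalg.norm(coords - point, axis=1)
    h = np.sort(d)[n_neighbours - 1]  # local bandwidth (assumes h > 0)
    w = np.zeros_like(d)
    inside = d < h
    w[inside] = (1.0 - (d[inside] / h) ** 2) ** 2
    return w

def gwr_fit(coords, X, y, n_neighbours):
    """Weighted least-squares estimates at every observation location."""
    betas = np.empty((len(y), X.shape[1]))
    for k, point in enumerate(coords):
        w = adaptive_bisquare_weights(coords, point, n_neighbours)
        sw = np.sqrt(w)  # scaling rows by sqrt(w) turns WLS into OLS
        betas[k], *_ = np.linalg.lstsq(X * sw[:, None], y * sw, rcond=None)
    return betas

# Synthetic check with spatially constant parameters (illustrative only)
rng = np.random.default_rng(0)
coords = rng.uniform(0, 50, size=(200, 2))
X = np.column_stack([np.ones(200), rng.standard_normal(200), rng.standard_normal(200)])
y = X @ np.array([10.0, 3.0, -5.0])
betas = gwr_fit(coords, X, y, n_neighbours=40)
print(betas.min(axis=0), betas.max(axis=0))
```

With constant parameters and no noise, every local estimate equals the global one. Choosing the number of neighbours by calibration, as in the slide, would wrap `gwr_fit` in a search over `n_neighbours` using a criterion such as AIC; that step is omitted here.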

Value of terraced property in £/m$^2$ (global estimate = 578)

Pre-1914 housing compared to inter-war (global estimate = -2,340)

1960s housing compared to inter-war (global estimate = -5,177)

10 Reasons Why You Might want to use GWR in Your Research

1. Conforms to different philosophical approaches
- A post-modernist view: Relationships are intrinsically different across space, e.g. differences in attitudes or preferences, or different administrative, political or other contextual effects, produce different responses to the same stimuli
- A positivist view: Global statements can be made, but models are not properly specified to allow us to make them. GWR is a good indicator of when and in what way a global model is misspecified.
- Can all contextual effects ever be modelled?

2. GWR is part of a growing trend towards local analysis (as opposed to traditional global types of analysis)

Local statistics are spatial disaggregations of global statistics.

Global                       Local
similarities across space    differences across space
single-valued statistics     multi-valued statistics
non-mappable                 mappable
GIS unfriendly               GIS friendly
search for regularities      search for exceptions
aspatial                     spatial

3. Provides a useful link to GIS
- GIS are very useful for the storage, manipulation and display of spatial data
- They are less useful for the analysis of spatial data
- There have been repeated calls for this to change
- In some cases the link between GIS and spatial analysis has been a step backwards
- One important way the situation can be improved is to develop better spatial analytical tools that can take advantage of the features of GIS

- An important catalyst for the better integration of GIS and spatial analysis has been the development of local spatial statistical techniques
- Chief among these has been the development of Geographically Weighted Regression (GWR)

4. GWR is widely applicable to almost any form of spatial data
- Link between health and wealth
- Modelling presence/absence of a disease
- Examining spatial patterns of a disease (e.g. GW log odds ratio)
- Educational attainment levels
- Determinants of house prices
- Determinants of critical load variations in lakes
- Urban temperature variations
- Economic performance indicators

5. GWR is a truly spatial technique
- It uses locational information as well as attribute information
- It employs a spatial weighting function with the assumption that near places are more similar than distant ones
- The outputs are location-specific and geocoded, so they can easily be mapped and subjected to further spatial analysis

6. Residuals from GWR are generally much lower and are not spatially autocorrelated
- GWR models give much better fits to data, even accounting for increases in the number of parameters
- GWR residuals are generally not spatially autocorrelated, so reducing or removing the need for spatial regression models
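One common way to check whether residuals are spatially autocorrelated is Moran's I. The slides do not name a specific statistic, so this is offered only as an illustrative diagnostic. The sketch uses the standard form $I = \frac{n}{S_0} \frac{z^\top W z}{z^\top z}$, where $z$ holds the mean-centred residuals, $W$ is a spatial weights matrix and $S_0$ is the sum of its entries. The toy weights matrix (neighbours along a line) and the helper names are assumptions made for the example.

```python
import numpy as np

def morans_i(values, W):
    """Moran's I for a 1-D array of values and a weights matrix W with zero diagonal."""
    z = values - values.mean()
    return (len(z) / W.sum()) * (z @ W @ z) / (z @ z)

def line_contiguity(n):
    """Toy weights matrix: each location neighbours the ones before and after it."""
    W = np.zeros((n, n))
    idx = np.arange(n - 1)
    W[idx, idx + 1] = 1.0
    W[idx + 1, idx] = 1.0
    return W

n = 10
W = line_contiguity(n)
clustered = np.arange(n, dtype=float)            # smooth trend: neighbours alike
alternating = np.array([1.0, -1.0] * (n // 2))   # neighbours always opposite

print("clustered:  ", round(morans_i(clustered, W), 3))
print("alternating:", round(morans_i(alternating, W), 3))
```

Values well above zero indicate that similar residuals cluster together, which is the pattern a global model tends to leave behind when relationships vary over space; values near zero suggest little spatial structure remains.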


Residuals from Global Model

Residuals from GWR Model

7. User-friendly software for GWR (GWR 3.0) makes it simple
- Currently about 7,000 lines of FORTRAN code
- VB front-end to create a control file and run the program in Windows
- Can, if you want, run the code directly under Unix with a control file

8. The concept of geographical weighting can be extended to many other statistics
- In GWR, the weight around a given point is based on a kernel
- However, regression is not the only technique in which weighting can be applied

Most descriptive statistics can be geographically weighted:
- Continuous, univariate: mean, standard deviation, skewness, median, interquartile range
- Continuous, bi/multivariate: correlation coefficient, regression coefficients
- Discrete: proportions, odds ratios
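As an illustration of the idea, a geographically weighted mean and standard deviation can be computed by applying kernel weights centred on a chosen location. The sketch below assumes a fixed-bandwidth bisquare kernel and normalised weights; the function name and the toy data are hypothetical, and it assumes at least one observation lies within the bandwidth.

```python
import numpy as np

def gw_mean_sd(coords, values, point, bandwidth):
    """Geographically weighted mean and standard deviation at one location."""
    d = np.linalg.norm(coords - point, axis=1)
    w = np.where(d < bandwidth, (1.0 - (d / bandwidth) ** 2) ** 2, 0.0)
    w = w / w.sum()  # normalise so the weights sum to 1
    mean = np.sum(w * values)
    sd = np.sqrt(np.sum(w * (values - mean) ** 2))
    return mean, sd

# Toy data: 21 points along a line whose value rises from west to east
coords = np.column_stack([np.linspace(0, 10, 21), np.zeros(21)])
values = coords[:, 0].copy()

left = gw_mean_sd(coords, values, np.array([2.0, 0.0]), bandwidth=3.0)
right = gw_mean_sd(coords, values, np.array([8.0, 0.0]), bandwidth=3.0)
print("local mean, sd near x=2:", np.round(left, 2))
print("local mean, sd near x=8:", np.round(right, 2))
```

Evaluating the function over a grid of locations gives a mappable surface of the local statistic, in the same spirit as the parameter surfaces produced by GWR.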

9. Extensions of geographical weighting can be applied to other modelling techniques
- GW Poisson regression
- GW logistic models
- GW kernel density estimation
- GW principal components analysis

10. Finally, GWR can be used as a Spatial Microscope
- Instead of determining an optimal bandwidth during the calibration of a GWR model, a bandwidth can be input a priori
- A series of bandwidths can be selected and the resulting parameter surfaces examined at different levels of smoothing
- For example, consider a very simple model of house prices regressed on floor area for 570 houses in Tyne & Wear, North East England. Surfaces of the local floorspace parameter are derived for bandwidths corresponding to 400, 350, 300, 250, 200, 150, 100 and 50 nearest neighbours (NN)
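The "spatial microscope" idea amounts to looping over a set of fixed bandwidths and inspecting the local estimates at each one. The sketch below is illustrative only: it uses synthetic data as a stand-in for the Tyne & Wear houses (the coordinates, prices and the west-to-east drift in the floor-area coefficient are all invented), an adaptive bisquare kernel is assumed, and `local_slopes` is a hypothetical helper.

```python
import numpy as np

def local_slopes(coords, x, y, n_neighbours):
    """Local slope of y on x at each location, using an adaptive bisquare kernel."""
    X = np.column_stack([np.ones(len(x)), x])
    slopes = np.empty(len(x))
    for k, point in enumerate(coords):
        d = np.linalg.norm(coords - point, axis=1)
        h = np.sort(d)[n_neighbours - 1]  # distance to the n-th nearest point
        w = np.where(d < h, (1.0 - (d / h) ** 2) ** 2, 0.0)
        sw = np.sqrt(w)
        beta, *_ = np.linalg.lstsq(X * sw[:, None], y * sw, rcond=None)
        slopes[k] = beta[1]
    return slopes

# Synthetic stand-in for the house price data (NOT the Tyne & Wear data set)
rng = np.random.default_rng(0)
n = 570
coords = rng.uniform(0, 10, size=(n, 2))
floor_area = rng.uniform(50, 200, size=n)
true_slope = 500 + 30 * coords[:, 0]  # assumed drift from west to east
price = 20000 + true_slope * floor_area + rng.normal(0, 2000, size=n)

# Fix the bandwidth a priori and watch how the local surface changes
spread = {}
for nn in (400, 350, 300, 250, 200, 150, 100, 50):
    s = local_slopes(coords, floor_area, price, nn)
    spread[nn] = s.max() - s.min()
    print(f"NN={nn:>3}  local slope range: {s.min():7.1f} to {s.max():7.1f}")
```

Large bandwidths smooth the local floor-area parameter towards a single value, while small bandwidths reveal more local variation, at the cost of noisier estimates.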

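[Figures: surfaces of the local floorspace parameter for bandwidths of 400, 350, 300, 250, 200, 150, 100 and 50 nearest neighbours]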

Summary
- GWR appears to be a useful method for investigating spatial non-stationarity; simply assuming relationships are stationary over space is no longer tenable
- GWR can be likened to a spatial microscope: it allows us to see variations in relationships that were previously unobservable
- GWR can be used as a model diagnostic or to identify interesting locations for investigation
- Windows-based software makes it easy to apply to any spatial data set

End of presentation