Application of eigenvector-based spatial filtering approach to. a multinomial logit model for land use data

Similar documents
Modeling Land Use Change Using an Eigenvector Spatial Filtering Model Specification for Discrete Response

Geographically weighted regression approach for origin-destination flows

Using AMOEBA to Create a Spatial Weights Matrix and Identify Spatial Clusters, and a Comparison to Other Clustering Algorithms

Departamento de Economía Universidad de Chile

SPACE Workshop NSF NCGIA CSISS UCGIS SDSU. Aldstadt, Getis, Jankowski, Rey, Weeks SDSU F. Goodchild, M. Goodchild, Janelle, Rebich UCSB

Multilevel Statistical Models: 3 rd edition, 2003 Contents

Urban form, resource intensity & renewable energy potential of cities

Course in Data Science

Kazuhiko Kakamu Department of Economics Finance, Institute for Advanced Studies. Abstract

Mitsuhiko Kawakami and Zhenjiang Shen Department of Civil Engineering Faculty of Engineering Kanazawa University Japan ABSTRACT

Representation of Geographic Data

Lecture 5: Spatial probit models. James P. LeSage University of Toledo Department of Economics Toledo, OH

Spatial Analysis 1. Introduction

ECLT 5810 Linear Regression and Logistic Regression for Classification. Prof. Wai Lam

A Post-Aggregation Error Record Extraction Based on Naive Bayes for Statistical Survey Enumeration

Cross-sectional space-time modeling using ARNN(p, n) processes

CS6220: DATA MINING TECHNIQUES

The Built Environment, Car Ownership, and Travel Behavior in Seoul

Evaluating sustainable transportation offers through housing price: a comparative analysis of Nantes urban and periurban/rural areas (France)

New York University Department of Economics. Applied Statistics and Econometrics G Spring 2013

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Spatial Analysis of Tokyo Apartment Market

Spatial Modeling, Regional Science, Arthur Getis Emeritus, San Diego State University March 1, 2016

Spatial Discrete Choice Models

Identifying Megaregions in the US: Implications for Infrastructure Investment

A SPATIAL ANALYSIS OF A RURAL LAND MARKET USING ALTERNATIVE SPATIAL WEIGHT MATRICES

ENGRG Introduction to GIS

Models for Count and Binary Data. Poisson and Logistic GWR Models. 24/07/2008 GWR Workshop 1

iafor The International Academic Forum

Chapter 6. Fundamentals of GIS-Based Data Analysis for Decision Support. Table 6.1. Spatial Data Transformations by Geospatial Data Types

EXPLORATORY SPATIAL DATA ANALYSIS OF BUILDING ENERGY IN URBAN ENVIRONMENTS. Food Machinery and Equipment, Tianjin , China

USE OF GLOBAL MAP DATABASE FOR MODELING SHRIMP FARMING IN SOUTHERN THAILAND

Kneib, Fahrmeir: Supplement to "Structured additive regression for categorical space-time data: A mixed model approach"

Multilevel modeling and panel data analysis in educational research (Case study: National examination data senior high school in West Java)

Paddy Availability Modeling in Indonesia Using Spatial Regression

Methodological issues in the development of accessibility measures to services: challenges and possible solutions in the Canadian context

Small-Area Population Forecasting Using a Spatial Regression Approach

NELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation

The Scope and Growth of Spatial Analysis in the Social Sciences

Modeling Spatial Dependence and Spatial Heterogeneity in. County Yield Forecasting Models

Combining Regressive and Auto-Regressive Models for Spatial-Temporal Prediction

INTEGRATION OF GIS AND MULTICRITORIAL HIERARCHICAL ANALYSIS FOR AID IN URBAN PLANNING: CASE STUDY OF KHEMISSET PROVINCE, MOROCCO

Land Use/Cover Changes & Modeling Urban Expansion of Nairobi City

The Role of "Leads" in the Dynamic Title of Cointegrating Regression Models. Author(s) Hayakawa, Kazuhiko; Kurozumi, Eiji

Spatial Relationships in Rural Land Markets with Emphasis on a Flexible. Weights Matrix

Exploratory Spatial Data Analysis (ESDA)

ESTIMATION OF LANDFORM CLASSIFICATION BASED ON LAND USE AND ITS CHANGE - Use of Object-based Classification and Altitude Data -

Spatial Analysis of Public Services (schools) in Nablus City Using the Tool of Geographic Information System (GIS)

Are Travel Demand Forecasting Models Biased because of Uncorrected Spatial Autocorrelation? Frank Goetzke RESEARCH PAPER

Modelling Network Interdependencies of Regional Economies using Spatial Econometric Techniques 1

Identification of Driving Factors Causing Land Cover change In Bandung Region Using Binary Logistic Regression Based on Geospatial Data

Econometrics. 5) Dummy variables

Testing an Autoregressive Structure in Binary Time Series Models

Pattern Recognition and Machine Learning

STA 4273H: Statistical Machine Learning

Trip Generation Model Development for Albany

CS6220: DATA MINING TECHNIQUES

Community Health Needs Assessment through Spatial Regression Modeling

Spatial Analysis 2. Spatial Autocorrelation

Christopher Dougherty London School of Economics and Political Science

A Constrained CA Model to Simulate Urban Growth of the Tokyo Metropolitan Area

Compact guides GISCO. Geographic information system of the Commission

Variation of geospatial thinking in answering geography questions based on topographic maps

ASIA GEOSPATIAL FORUM 2011 Eco-Friendly Walk Score Calculator Choosing a Place to Live with GIS

Spatial filtering and eigenvector stability : space-time model for German unemployment data

INSTITUTE OF POLICY AND PLANNING SCIENCES. Discussion Paper Series

Using Spatial Statistics Social Service Applications Public Safety and Public Health

APPLICATION OF THE COUNTRY PRODUCT DUMMY METHOD TO CONSTRUCT SPATIAL AND TEMPORAL PRICE INDICES FOR SRI LANKA

STA 4273H: Sta-s-cal Machine Learning

Introduction to Spatial Statistics and Modeling for Regional Analysis

Field Course Descriptions

Lecture 7: Spatial Econometric Modeling of Origin-Destination flows

Measuring The Benefits of Air Quality Improvement: A Spatial Hedonic Approach. Chong Won Kim, Tim Phipps, and Luc Anselin

On Autoregressive Order Selection Criteria

Diffusion of GIS in Public Policy Doctoral Program

Finite Sample Properties of Moran s I Test for Spatial Autocorrelation in Probit and Tobit Models - Empirical Evidence

Syllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools

Urban Growth Analysis: Calculating Metrics to Quantify Urban Sprawl

ECON 594: Lecture #6

Progress and Land-Use Characteristics of Urban Sprawl in Busan Metropolitan City using Remote sensing and GIS

Tracey Farrigan Research Geographer USDA-Economic Research Service

Review of Panel Data Model Types Next Steps. Panel GLMs. Department of Political Science and Government Aarhus University.

Unobserved Heterogeneity and the Statistical Analysis of Highway Accident Data. Fred Mannering University of South Florida

Where Do Overweight Women In Ghana Live? Answers From Exploratory Spatial Data Analysis

Urban GIS for Health Metrics

Spatial fluctuation on speed-density relationship of pedestrian dynamics

Bayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence

Outline. Overview of Issues. Spatial Regression. Luc Anselin

Robustness of Logit Analysis: Unobserved Heterogeneity and Misspecified Disturbances

Computer Vision Group Prof. Daniel Cremers. 10a. Markov Chain Monte Carlo

Developing Confidence in Your Net-to-Gross Ratio Estimates

26:010:557 / 26:620:557 Social Science Research Methods

Cluster investigations using Disease mapping methods International workshop on Risk Factors for Childhood Leukemia Berlin May

Appropriate Selection of Cartographic Symbols in a GIS Environment

Knowledge Spillovers, Spatial Dependence, and Regional Economic Growth in U.S. Metropolitan Areas. Up Lim, B.A., M.C.P.

Spatial Regression. 1. Introduction and Review. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

The impact of residential density on vehicle usage and fuel consumption*

What is a compact city? How could it be measured?

Econometrics of Panel Data

Transcription:

Presented at the Seventh World Conference of the Spatial Econometrics Association, the Key Bridge Marriott Hotel, Washington, D.C., USA, July 10 12, 2013. Application of eigenvector-based spatial filtering approach to a multinomial logit model for land use data Takahiro YOSHIDA 1 and Morito TSUTSUMI 2 1 Graduate School of Systems and Information Engineering, University of Tsukuba, Tsukuba, Ibaraki 305-8573, Japan (E-mail: yoshida.takahiro@sk.tsukuba.ac.p) 2 Faculty of Engineering, Information and systems, University of Tsukuba, Tsukuba, Ibaraki 305-8573, Japan (E-mail: tsutsumi@sk.tsukuba.ac.p) Abstract: The present study builds a spatial statistical model for estimating land use maps. Many approaches for modeling land use maps exist in the literatures. A representative method includes a multinomial logit model (Miller and Plantinga, 1999), in which the likelihood of placing each land use category, such as land for building and forest, into each zone is explained by selected attributes such as population and elevation. Because neighboring zones tend to be categorized into the same land use class, considering spatial dependence among zones is important when applying a multinomial logit model for modeling land use maps. Although previous studies involving spatial econometric techniques attempted to consider spatial dependence by using a spatial weight matrix, such methods require a computationally burdensome iterative calculation for parameter estimation (e.g., the expectation-maximization algorithm or the Markov chain Monte Carlo method). On the contrary, the present study employs eigenvector-based spatial filtering based on a spatial statistical approach, in which parameters are estimated using the standard maximum likelihood method for modeling spatial dependence. This is easy to implement using standard statistical software packages. Moreover, while analyzing land use, it is essential that the filtering method enable visualization of secular changes in spatial patterns influencing the choice of each land use category at each instance. The results suggest that compared to conventional nonspatial multinomial logit models, the predictive power in terms of the Akaike information criterion (AIC) is substantially improved with spatial filtering. Keywords: eigenvector-based spatial filtering, land use, multinomial logit model, spatial dependence JEL: C21, C25, R14

1. Introduction It is vital to deal appropriately with spatial dependence and spatial heterogeneity when using regression models dealing with spatial data (Anselin, 1988). In recent years, studies on spatial statistics and spatial econometrics have made a number of suggestions on how to consider spatial effects on qualitative data (e.g., McMillen, 1992; Congdon, 2005; LeSage, 2000; Chakir and Parent, 2009; Higgs and Hoeting; 2010; Wang et al., 2012) 1. However, the suggested methods typically require a computationally burdensome iterative calculation for parameter estimation such as expectation-maximization algorithm or the Markov chain Monte Carlo (MCMC) method 2. Also, the models for multinomial discrete choice models are still ongoing (Smirnov, 2010), and methods are not well established. Hence the present paper takes another way, namely eigenvector-based spatial filtering (ESF) developed by the work of e.g., Griffith (2003), in which spatial dependence can be considered only by introducing eigenvectors of a modified spatial weight matrix as the explanatory variables. The advantage of this approach is its ease in practical implementation because the method does not requires a special parameter estimation technique and can be used with standard statistical software packages after selecting eigenvectors (e.g., Griffith and Paelinck, 2011). The modified spatial weight matrix has direct relationship to the well-known Moran s I statistics, and therefore significant merits in interpretation. Linear combinations of the selected eigenvectors and their estimated coefficients can be interpreted as potential explanatory variables substituting for missing variables as well as map patterns. Hence in this study, we apply ESF to a multinomial logit model, and compare the predictive accuracies of land use categories in terms of Akaike Information Criterion (AIC) and hit ratio to the standard non spatial model. In addition, linear sum of the eigenvectors and its coefficients are calculated for each category, and the interpretation of the result is discussed in detail. This paper constitutes as follows. Section 2 describes ESF. Section 3 applies ESF to a multinomial logit model, and conducts the empirical analysis with the land use data in Ibaraki prefecture, Japan. Finally, Section 4 discusses the potential and challenges of the proposed method and concludes the study. 2. Estimation of Land Use with the Eigenvector-based Spatial Filtering Thus far numerous models for modeling land use have been proposed (Koomen et al., 2007; Irwin, 2010), and one of the representative approaches are using multinomial logit model (e.g., 1 Smirnov (2010) gives excellent reviews for such methods. 2 Smirnov (2010), Kiler and McMillen (2008) proposed a spatial multinomial logit model not based on iterative algorithm, but these methods are based on weak positive spatial autocorrelation.

Miller and Plantinga, 1999). Although critics argue that the logistic transformation is ad hoc, the practical advantages seem to outweigh the conceptual shortcomings (Miller and Plantinga, 1999). Because neighboring zones tend to be categorized into the same land use class, considering spatial dependence among zones is important when applying a multinomial logit model for modeling land use (e.g., Wang et al., 2012). In this study, we apply ESF to a multinomial logit model in order to consider spatial dependence 3. As mentioned earlier, Moran s I statistics is often used as a representative of spatial dependence. Moran s I statistics is defined as I n y'( I 11' / n) C( I 11' / n) y. 1 ' C1 y'( I 11' / n) y (1) where, n is the number of samples, 1 is the n 1 vector of 1s, I is the n n identity matrix, y is the n 1 vector of observations, C is the n n proximity matrix, and (I - 11 /n) is a proection matrix. For simplicity, C is treated as adacency matrix. Griffith s (e.g., 2000, 2003) methodology exploits eigenvector decomposition techniques, which extract orthogonal and uncorrelated numerical components from an (I - 11 /n) C (I - 11 /n). The basic model which we build for ESF in a multinomial logit model is given by U Xβ Eγ ε. (2) Here, U is the n 1 vector of the utility for option (land use category), X is an n k explanatory variable matrix, and E is an n r eigenvector matrix whose elements are calculated from the proximity matrix, β is a k 1 parameter vector, γ is a r 1 parameter vector. ε is the n 1 error vector, and is independent and identically Gumbel distributed. The selection probability P i of each option in each mesh i is given by P i exp( Xβ Eγ ), 1 exp( Xβ Eγ ) s 0 0,1,, s 1. (3) The option which has the highest utility is selected in each zone. The parameters β and γ can be estimated by the maximum likelihood method (Train, 2003). 3 We noticed that Wang et al. (2013) employed ESF for land use data, but their focus is no binary data.

3. Application of Eigenvector-based Spatial Filtering to a multinomial logit model In section 2, we built the model for ESF in a multinomial logit model. Therefore, in section 3, we conduct a case study which we apply the model for a land use data in. Ibaraki prefecture (Figure 1), which is located in the northeast of Tokyo Metropolitan area, Japan, was selected for our case study. Some maor cities in Ibaraki prefecture include Mito city, which is the seat of prefectural government, and Tsukuba city, which is central city of the southern region. The Distances from Tokyo station to Mito and Tsukuba stations is approximately 100km and 60 km, respectively. Figure 1: Location of Ibaraki prefecture We compare the predictive accuracy of the multinomial logit model with ESF (Spatial) with the conventional non-spatial multinomial logit model (Non-spatial). Our target period is 2006 year. The land use data used in this study is available through the National Land Numerical Information Services (URL: http://nlftp.mlit.go.p/ks-e/index.html) (Figure 2). The area of each mesh block is approximately 1 km 2 (1 km x 1 km). For the modeling, we aggregated the land use categories as shown in Figure 3. Some categories (e.g., rivers and lakes) are excluded from this study; if the occupied area of the land use classification is equal, we exclude the same from this study. Some other categories (e.g., wasteland, golf course) have been aggregated into other Land, Paddy Field, Other Agricultural Land, Forest, and Land for Building are considered as is. In order to classify each mesh into an appropriate category, we classify it depending on the largest land

use classification for that particular mesh. Further, since the estimation accuracy is improved by considering stable points in the field during estimation and prediction (Osaragi, 1996). The explanatory variables include elevation, slope and a water zone dummy as the geographical conditions, and population, distance to the nearest station, distance to Tokyo Station as the social conditions. The data for elevation, slope, straight line distance to Tokyo Station, and the straight line distance to the nearest station were sourced/derived from the National Land Numerical Information Services and only total population is sourced/derived from the 2005 census, which is available online from the Ministry of Internal Affairs and Communications Statistics Bureau (URL: http://www.e-stat.go.p/sg1/estat/estattopportale.do, accessed June 20, 2013.). We measure the straight line distance to the nearest station and Tokyo station from the center of gravity of the mesh (in meters). We used the only station that was operational in fiscal year 2006 after consulting the railway time-series data from the digital national land information data for every station. In addition, in the present study, we use selected eigenvectors as the explanatory variables in order to consider spatial autocorrelation. We limit the number of eigenvectors to 150 in order to reduce computation time. Legend Paddy Field Other Agricultural Land Forest Land for Buiding Other Land Figure 2: Land use of Ibaraki prefecture depicted in the 1km 2 mesh (2006)

Excluded Figure 3: Aggregated land use classification for this study Table 1 shows the parameter estimation results of each explanatory variable. The variables are significant and display the expected signs. However, certain variables are significant nonspatially, but not significant spatially. This could be attributed to a bias introduced by due to excluding the spatial dependence. In order to check which model is better, we calculated the AIC. In addition, we employ the hitting ratio as another indicator, to assess which model is better (Table 2). Table 2 shows that the AIC is smaller for the spatial model, in spite of our adding 150 eigenvectors to the explanatory variable. Figure 4 and 5 illustrates the spatial distribution for both models. The nonspatial model cannot capture a spatially heterogeneity, such as the southeast area which is mixed in various land use classification. The spatial distribution of each choice extracted from the ESF may be represented as linear combinations of parameters and eigenvectors. Figure 6 shows the spatial distributions as described by the combination in sections 1 to 10. Because Moran s I statistic quantity is relatively big among the eigenvectors ranked sections 1 to 10, the eigenvectors depict the spatial patterns global basis spatial distribution of Figure 6. For example, looking at the spatial distribution of land for building, it can be seen that the map captures the character of prosperous area which is the north or south region which indicates the value high. And also, although analysts have to pay attention to the possibility that a choice of based alternative which we use in estimation process changes spatial pattern, a comparison between each spatial pattern can be possible.

Table 1: Parameter estimation results Explanatory Variable Choice Coefficient Intercept Other Agricultural Land 1.19 Forest 3.59 Land for Building 4.12 Other Land 3.66 Nonspatial 4.05 4.84 Spatial Population Other Agricultural Land 2.54 10 4 4 7.17 10 7.66 4.95 Forest 1.32 10 3 4 8.04 10 Land for Building 2.66 10 3 3 3.32 10 Other Land 6.87 10 4 4 4.02 10 Average Elevation Other Agricultural Land 4.10 10 2 1 1.20 10 Forest 4.47 10 2 1 1.37 10 Land for Building 2.85 10 2 2 7.70 10 Other Land 4.40 10 2 1 1.28 10 Average Slope Other Agricultural Land 5.60 10 1 1 8.03 10 Forest 3.99 10 1 1 1.96 10 Land for Building 2.33 10 1 1 5.10 10 Other Land 5.67 10 1 1 2.46 10 Water Zone Dummy Other Agricultural Land 1.09 Forest 1.20 1 5.95 10 1 7.02 10 Land for Building 2.15 10 1 3 7.35 10 Other Land 5.46 10 1 1 1.28 10 Distance to Tokyo Station Other Agricultural Land 5.70 10 6 2.72 10 6 * Forest 1.79 10 5 2.03 10 6 Land for Building 2.31 10 7 5 2.01 10 Other Land 6.03 10 6 5 1.46 10 Distance to the Nearest Station Other Agricultural Land 9.70 10 5 4 1.57 10 Forest 9.89 10 5 5 6.13 10 Land for Building 1.46 10 5 2.37 10 5 Other Land 1.50 10 4 4 2.19 10, **, and * indicate significance at the 1%, 5%, and 10% level respectively. Table 2: Comparison of predictive values Nonspatial Spatial AIC 8989.3 7614.6 Hitting ratio [%] 71.7 79.1

Legend Result of Nonspatial Paddy Field Other Agricultural Land Forest Land for Buiding Other Land Legend Result of Spatial Paddy Field Other Agricultural Land Forest Land for Buiding Other Land Figure 4: Estimated land use choice result of Nonspatial (left)/spatial (right) Legend Result of Nonspatial Miss Hit Legend Result of Spatial Miss Hit Figure 5: Comparison of map pattern of Hit/Miss of estimated choice of Nonspatial (left)/spatial (right)

High High Low Low Other Agricultural Land Forest High High Low Low Land for Building Other Land Figure 6: Spatial pattern for each choice calculated by Eγ 10 1

4. Conclusion This study built a model of eigenvector-based spatial filtering in a multinomial logit model and applied the model to land use data. The estimation results suggest that compared to conventional nonspatial multinomial logit models, the predictive power in terms of the Akaike information criterion and hitting ratio is substantially improved with spatial filtering. Moreover, although analysts have to pay attention to the possibility that a choice of based alternative which we use in estimation process changes spatial pattern, a linear combination of eigenvectors and their parameters enable visualization of spatial patterns influencing the choice of each land use category. The comparison of technique of spatial econometrics and spatial statistical in order to consider spatial effect in analyzing land use is needed for future work. Although The basic model of eigenvector-based spatial filtering is decomposition of the spatial lag model which is the basic model of spatial econometric method (Tiefelsdolf and Griffith, 2007), this study dealt with only spatial statistical approach which eigenvector-based spatial filtering method is classified as (Griffith and Paelinck, 2011). It would be of our future work to examine whether model would be good in terms of computation burdensome and accuracy of estimation. References Anselin, L.: Spatial Econometrics: Methods and Models, Kluwer Academic Publishers, 1988. Chakir, R. and Parent, O.: Determinants of land use changes: A spatial multinomial probit approach, Papers in Regional Science, Vol. 88, No. 2, pp. 327 344, 2009. Congdon, P.: Bayesian Models for Categorical Data, Wiley, New York, 2005. Griffith D. A.: A linear regression solution to the spatial autocorrelation problem, Journal of Geographical Systems, Vol. 2, No. 2, pp. 141 156, 2000. Griffith, D. A.: Spatial Autocorrelation and Spatial Filtering: Gaining Understanding Through Theory and Scientific Visualization, Springer, 2003. Griffith, D. A. and Paelinck, J. H.: Non-standard Spatial Statistics and Spatial Econometrics, Springer, 2011. Higgs, M. D. and Hoeting, J. A.: A clipped latent variable model for spatially correlated ordered categorical data, Computational Statistics & Data Analysis, Vol. 54, No. 8, pp. 1999 2011, 2010. Irwin, E. G.: New directions for urban economic models of land use change: Incorporating spatial dynamics and heterogeneity, Journal of Regional Science, Vol. 50, No. 1, pp. 65 91, 2010. Kiler, T. and McMillen, D. P.: Clustering of Auto Supplier Plants in the United States, Journal of Business & Economic Statistics, Vol. 26, No. 4, pp. 460 471, 2008. Koomen, E., Stillwell, J., Bakema, A., and Scholten, H. J. (Eds.): Modeling Land-Use Change, Springer, Dordrecht, 2007. LeSage, J. P.: Bayesian estimation of limited dependent variable spatial autoregressive models, Geographical Analysis, Vol. 32, No. 1, pp. 19 35, 2000. McMillen, D. P.: Probit with spatial autocorrelation, Journal of Regional Science, Vol. 32, No. 3, pp. 335 348, 1992. Miller, D. J. and Plantinga, A. J.: Modeling land use decisions with aggregate data, American Journal of Agricultural Economics, Vol. 81, No. 1, pp. 180 194, 1999. Osaragi, T. and Baita, K.: Land-use forecasts considering the existence of steady-state places, Theory and

Applications of GIS, Vol. 4, No. 1, pp. 1 6, 1996 (in Japanese). Smirnov, O. A.: Modeling spatial discrete choice, Regional Science and Urban Economics, Vol. 40, No. 5, pp. 292 298, 2010. Tiefelsdolf, M. and Griffith, D. A.: Semiparametric filtering of spatial autocorrelation: the eigenvector approach, Environment and Planning A, Vol. 39, No. 5, pp. 1193 1221, 2007. Train, K.: Discrete Choice Methods with Simulation, Cambridge University Press, Cambridge, 2003. Wang, X., Kockelman, K. and Lemp, J.: The dynamic spatial multinomial probit model: Analysis of land use change using parcel-level data, Journal of Transport Geography, Vol. 24, pp. 77 88, 2012. Wang, Y., Kockelman, K. M. and Wang, X.: Understanding spatial filtering for analysis of land use data sets, Journal of Transport Geography, in print, 2013.