MOVING WINDOW REGRESSION (MWR) IN MASS APPRAISAL FOR PROPERTY RATING. Universiti Putra Malaysia UPM Serdang, Malaysia

Similar documents
A Review of Property Mass Valuation Models

A Space-Time Model for Computer Assisted Mass Appraisal

MODELLING THE VALUE OF LOCATION IN THE PREDICTION OF RESIDENTIAL PROPERTY VALUE

Spatial Statistics For Real Estate Data 1

Using Ridge Least Median Squares to Estimate the Parameter by Solving Multicollinearity and Outliers Problems

Geographically weighted regression approach for origin-destination flows

Transaction Statistics. Pulau Pinang

Commuting in Northern Ireland: Exploring Spatial Variations through Spatial Interaction Modelling

Exploratory Spatial Data Analysis (ESDA)

Spatio-Temporal Methods for Mass Appraisal

Land Values Analysis A Pragmatic Approach for Mass Appraisal Of Residential Properties In Los Angeles County

How does Transport Infrastructure Affect Dwelling Prices in Athens?

1Department of Demography and Organization Studies, University of Texas at San Antonio, One UTSA Circle, San Antonio, TX

Running head: GEOGRAPHICALLY WEIGHTED REGRESSION 1. Geographically Weighted Regression. Chelsey-Ann Cu GEOB 479 L2A. University of British Columbia

Online Robustness Appendix to Endogenous Gentrification and Housing Price Dynamics

Regression Analysis. A statistical procedure used to find relations among a set of variables.

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.

Combining AHP with GIS in the evaluation of locational characteristics quality for purpose-built offices in Malaysia

Geographically Weighted Regression and Kriging: Alternative Approaches to Interpolation A Stewart Fotheringham

Chapter 13. Multiple Regression and Model Building

Geographically and temporally weighted regression for modeling spatio-temporal variation in house prices

Simultaneous Coefficient Penalization and Model Selection in Geographically Weighted Regression: The Geographically Weighted Lasso

Songklanakarin Journal of Science and Technology SJST R1 Sifriyani

Measuring The Benefits of Air Quality Improvement: A Spatial Hedonic Approach. Chong Won Kim, Tim Phipps, and Luc Anselin

A SPATIAL ANALYSIS OF A RURAL LAND MARKET USING ALTERNATIVE SPATIAL WEIGHT MATRICES

IDENTIFYING MULTIPLE OUTLIERS IN LINEAR REGRESSION : ROBUST FIT AND CLUSTERING APPROACH

Explorative Spatial Analysis of Coastal Community Incomes in Setiu Wetlands: Geographically Weighted Regression

2. Linear regression with multiple regressors

Report. Reference. Spatial Dependence, Housing Submarkets and House Price Prediction. BOURASSA, Steven C., CANTONI, Eva, HOESLI, Martin E.

The Empirical Study of Rail Transit Impacts on Land Value in Developing Countries: A Case Study in Bangkok, Thailand

Unit 10: Simple Linear Regression and Correlation

2008 ESRI Business GIS Summit Spatial Analysis for Business 2008 Program

Subject: Note on spatial issues in Urban South Africa From: Alain Bertaud Date: Oct 7, A. Spatial issues

Using Spatial Statistics Social Service Applications Public Safety and Public Health

The Building Blocks of the City: Points, Lines and Polygons

Spatial Relationships in Rural Land Markets with Emphasis on a Flexible. Weights Matrix

Geographically Weighted Regression LECTURE 2 : Introduction to GWR II

Predicting House Prices with Spatial Dependence: A Comparison of Alternative Methods

Incorporating GIS into Hedonic Pricing Models

Evaluating sustainable transportation offers through housing price: a comparative analysis of Nantes urban and periurban/rural areas (France)

Time: the late arrival at the Geocomputation party and the need for considered approaches to spatio- temporal analyses

Geographically weighted methods for examining the spatial variation in land cover accuracy

ECON 450 Development Economics

Geographical General Regression Neural Network (GGRNN) Tool For Geographically Weighted Regression Analysis

Assessment of Three Spatial Interpolation Models to Obtain the Best One for Cumulative Rainfall Estimation (Case study: Ramsar District)

Fitting a regression model

The Cost of Transportation : Spatial Analysis of US Fuel Prices

Spatial Autocorrelation and Real Estate Studies: A Literature Review

Estimation, Interpretation, and Hypothesis Testing for Nonparametric Hedonic House Price Functions

SPATIO-TEMPORAL ANALYSIS OF PRECIPITATION AND TEMPERATURE DISTRIBUTION OVER TURKEY

A New Approach to Estimating the Production Function for Housing: Appendices

Interpolation and 3D Visualization of Geodata

Predicting House Prices with Spatial Dependence: A Comparison of Alternative Methods

ABDUL HAMID Universiti Teknologi Malaysia

Spatial Variation in Infant Mortality with Geographically Weighted Poisson Regression (GWPR) Approach

GRAD6/8104; INES 8090 Spatial Statistic Spring 2017

Modeling Spatial Variation in Stand Volume of Acacia mangium Plantations Using Geographically Weighted Regression

Context-dependent spatial analysis: A role for GIS?

Prepared by: Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies Universiti

Multiple Dependent Hypothesis Tests in Geographically Weighted Regression

ECON 4230 Intermediate Econometric Theory Exam

A GEOSTATISTICAL APPROACH TO PREDICTING A PHYSICAL VARIABLE THROUGH A CONTINUOUS SURFACE

Applied Econometrics. Professor Bernard Fingleton

Urban Residential Land Value Analysis: The Case of Potenza

Evaluating the Impact of the Fukushima Daiichi Nuclear Power Plant Accident

Regression Analysis. BUS 735: Business Decision Making and Research

Statistics: A review. Why statistics?

Spatio-Temporal Analysis for Monitoring Water Quality of Skudai River, Malaysia

The prediction of house price

Remedial Measures, Brown-Forsythe test, F test

Wooldridge, Introductory Econometrics, 4th ed. Chapter 6: Multiple regression analysis: Further issues

MS&E 226: Small Data


Statistics II Exercises Chapter 5

GIS Analysis: Spatial Statistics for Public Health: Lauren M. Scott, PhD; Mark V. Janikas, PhD

ASPECTS REGARDING THE USEFULNESS OF GEOGRAPHICALLY WEIGHTED REGRESSION (GWR) FOR DIGITAL MAPPING OF SOIL PARAMETERS

Marking Scheme Field Work. 6 International Geography Olympiad. Brisbane

Geographically Weighted Regression as a Statistical Model

Prospect. February 8, Geographically Weighted Analysis - Review and. Prospect. Chris Brunsdon. The Basics GWPCA. Conclusion

Geographically Weighted Regression Using a Non-Euclidean Distance Metric with a Study on London House Price Data

Housing Sub-markets and Hedonic Price Analysis: A Bayesian Approach

Incorporating spatial variation in housing attribute prices: a comparison of geographically weighted regression and the spatial expansion method

This lab exercise will try to answer these questions using spatial statistics in a geographic information system (GIS) context.

Chapter 1. An Overview of Regression Analysis. Econometrics and Quantitative Analysis. What is Econometrics? (cont.) What is Econometrics?

SOME NOTES ON PARAMETRIC SIGNIFICANCE TESTS FOR GEOGRAPHICALLY WEIGHTED REGRESSION

PREDICTING HOUSE PRICES FOR SINGAPORE CONDOMINIUM RESALE MARKET: A COMPARISON OF TWO MODELS

House Price Distribution and Price Indexes in Tokyo

Title: Effect on surface deformation resulting from mining activity on value of property valuation : a case study from Jastrzębie-Zdrój region

Introduction. Introduction (Contd.) Market Equilibrium and Spatial Variability in the Value of Housing Attributes. Urban location theory.

Satellite and gauge rainfall merging using geographically weighted regression

URBAN RAIL TRANSIT DEVELOPMENT IMPACTS IN DEVELOPING COUNTRIES: A CASE STUDY OF LAND PRICE IN BANGKOK, THAILAND

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables.

REO 100% Leased Four (4) Unit Mixed-Use Property Along Ashland

Selection and Agglomeration Impact on Firm Productivity: A Study of Taiwan's Manufacturing Sector NARSC ANNUAL MEETING 2013

Modeling the Ecology of Urban Inequality in Space and Time

Urban land value, Accessibility and Night Light Data: A Case Study in China

STATISTICS 110/201 PRACTICE FINAL EXAM

Analysing the local geography of the relationship. between residential property prices and its determinants.

Outliers Detection in Multiple Circular Regression Model via DFBETAc Statistic

Modeling Spatial Relationships Using Regression Analysis. Lauren M. Scott, PhD Lauren Rosenshein Bennett, MS

Transcription:

MOVING WINDOW REGRESSION (MWR IN MASS APPRAISAL FOR PROPERTY RATING 1 Taher Buyong, Suriatini Ismail, Ibrahim Sipan, Mohamad Ghazali Hashim and 1 Mohammad Firdaus Azhar 1 Institute of Advanced Technology Universiti Putra Malaysia 43400 UPM Serdang, Malaysia Email: rehat_55@yahoo.com Faculty of Geoinformation Science and Engineering Universiti Teknologi Malaysia 81310 UTM Skudai, Malaysia suriatini@fksg.utm.my, ibrahim@fksg.utm.my, kp_kuat@hotmail.com ABSTRACT Mass appraisal can be defined as an art and science of valuing properties in a defined geographic area using maps, market data, comparative information and valid statistical procedure. Mass appraisal allows valuing a large number of properties in a reasonably short period of time and for a relatively small cost per property. Several statistical methods are available for mass appraisal of properties and this paper focuses on the use of Moving Window Regression (MWR. MWR is capable of handling spatial effects peculiar to property data where this characteristic is not available in the conventional Multiple Regression Analysis (MRA. This paper presents preliminary analysis of the MWR method using 196 housing sales observation occurring between 004 and 006 within Majlis Perbandaran Kulai. The results are compared with the MRA method to assess the predictive performance of the method. Keywords: Geographically Weighted Regression (GWR, spatial heterogeneity, spatial regression, property tax 1

INTRODUCTION Multiple Regression Analysis (MRA is a conventional statistical technique that is commonly used in mass appraisal of properties. It is used to predict the value of a dependent variable (price based on known values of independent variables (land area, floor area, number of bedrooms, etc.. The method starts by establishing a mathematical equation (a model that describes how the variables are related. The specified model is calibrated against a set of houses where their prices and related independent variables are known. The outcome of the model calibration, which is a set of estimated coefficients, is used in the prediction of prices of other houses in the same area where the values of the independent variables of the houses are known. The main problem with the MRA method is the difficulty of integrating location in the model. The importance of location in property valuation has long been established and thus cannot be ignored. It is accepted that properties are spatially unique and location is an important component that determines their values. Incorrect specifications of location effects in the model resulted in residuals that are spatially correlated and thus, violated one of the basic requirements of the MRA formulation. Several advances of multiple regression methods that take into account of location in their models have recently emerged. The methods include spatial hedonic modeling (SHM, geographically weighted regression (GWR, moving window regression (MWR and kriging. SHM models location explicitly through its spatial variables while GWR and MWR model location implicitly in the model formulation. The kriging method models location explicitly using local spatial autocorrelation. A research is being conducted to assess the predictive performance of the MWR method in mass appraisal of residential properties in Malaysia. The aim of this paper is to present preliminary findings of this research. The next section presents discussions on the MWR model. This will be followed by discussions on data used in the analysis. Results and discussions of the results are presented in the subsequent section. The paper concludes by highlighting important points. MWR MODEL The general linear regression model of MRA is given as y( i = b ( i + b1 ( i x1 + b ( i x +... + b ( i x e( 0 m m + i where y is the dependent variable; x1, x, xm are the independent variables; b0, b1, th b, b m are the coefficients that will be estimated; e is the error term and (i is the i observation.

The coefficients for the model can be estimated using ordinary least squares (OLS and the equation, in matrix form, is ˆ β = ( X X T 1 where βˆ is the estimated vector of coefficients, X is the matrix of independent variables with the elements of the first column set to 1, and Y is a vector of observations of the dependent variable. The fitness of the model an be measured using residuals sum of squares, RSS, standard error, s e, coefficient of determination, R², and adjusted coefficient of determination, R.The equations of these measures are Residual sum of squares (RSS X T Y RSS = n i 1 ( ˆ y i y i Standard error (s e s e ( y yˆ = i i ( n m 1 Coefficient of determination (R² Adjusted coefficient of determination ( R R ( yi yi ( yi yi = 1 ˆ R ( 1 ( n 1 = 1 R ( n m 1 where y i =observed value ŷ i =predicted value y i =mean yi n =number of data points m =number of b coefficient 3

Simple linear regression in a moving window with equal weights assigned to all observations within the moving window (Chris Lloyd, Ian Shuttleworth 005.The MWR method uses the same general linear regression model as the MRA. The only difference is that the regression is being applied to a smaller area (a window instead of the whole study region. Therefore, several regressions are performed in order to cover the study region. By doing so, the MWR method is able to capture spatial variations in the study region in which the MRA method does not have the capability to do so. For any particular window, the regression is based on the modeled relationship between the regression point and n observation points in the window. Only n neighbors of the regression point are taken into account in a regression. Observation data outside the window is not considered at all. The n observations within the window are weighted equally. It is quite common to refer the concept as all data within the window receive a weight of one, whereas all data outside of the window have a weight of zero. MWR weighting scheme can be defined as follows: w ij = 1 if j is one of the n th nearest neighbors of i; = 0 otherwise Window size might not be the same when different areas are considered because the size of the window will be dependent on the density of observations around the regression point. Window sizes have to be explored to determine the optimum size (or number of nearest neighbors that will give the best result. In order to determine the best window size a, cross validation is performed. The equation for cross validation is given as n CV = ( y i yˆ i i= 1 where n is the number of data points in a widow, is the observe value for the i th data point and ŷ is the estimated value of the i th i point obtained when i itself is omitted from the computation. y i 4

DATA For the purpose of this paper, the test dataset comprises 196 single storey terrace houses transacted between 00 and 006 within Majlis Perbandaran Kulai (MPKu area in Johor. The dependent variable is the transacted price while the independent variables are: Land area Main floor area Ancillary floor area Additional floor area Position (intermediate, end, corner Floor finishing (tiles, cement, tiles and cement Title (others, HS(Dfreehold, HS(Mfreehold and grant Originally, there were about 1,300 transacted houses as obtained from Jabatan Perkhidmatan Harta (JPPH, in Johor Bahru. After the first round of data screening, about 500 of the data were discarded primarily due to non-existence of lot number or grant number, the identifier to link the attribute component of the houses to their spatial component in order to utilize GIS functionalities. The second round of data screening focused on identifying outliers. We looked for abnormalities in the transacted prices (such as a corner lot house being transacted a much lower price, in the same year, than its neighboring same row intermediate houses and transacted price is too high/low in comparison to its neighboring houses with the same land and floor area, land area (such as an intermediate houses having land area bigger than its corner lot neighbor of the same row, and floor area (such as a house with main floor area greater than its land area. Finally, only 196 houses were available to be analyzed. RESULTS AND DISCUSSIONS Assessment of the prediction accuracy of the MRA and MWR models is done via insample data, i.e., the same dataset is used for calibration and prediction; The calibration of the models were carried out and the estimated coefficients obtained from the calibration were then used to predict the price of the houses where the observed or the actual prices are already known. Predictive accuracy of the models is determined by comparing the observed and predicted prices. Table 1 summarized the calibration output of the two models. 5

Table 1: Calibration Output MRA MWR Standard error 9639.1439 5979.0065 Residual Sum of Square (RSS 17,467,66,833.31 7,189,446,651. Coefficient of Determination (R 0.65 0.8558 Adjusted R-Square ( R 0.611 0.8508 In assessing the fitness of the models, the MWR model is better than the global MRA model. The adjusted R² of the MWR and MRA models are 0.8508 and 0.611, respectively, while the R² of the MWR and MRA models are 0.8558 and 0.65, respectively. The MWR model Residual Sum of Square (RSS and standard error are also smaller when compared to the MRA model. These results indicate that the fitness of the MWR model is better than the MRA model. Table shows the coefficients and the values of t-statistics for the independent variables of the MRA model. The t-statistics show that all determinants for the global MRA model are statistically significant, at the least at the 90% level of confidence. Land Area, MFA, AFA, Add_FA, Position and floor positively contribute to property price; an increase in the value of one of these variables increases the property price. Meanwhile, Title variable negatively contribute to property price. The analysis of t-statistics indicates that the global model specifications are appropriate. Table : Coefficients and t-statistic for the independent variables of the MRA model Independent Variables (Constant Land Area MFA Coefficient (t-statistics 947.563 (11.5367*** 75.6146 (.6141*** 158.075 (.0185** 6

AFA Add_FA Position Floor Title 16.9817 (1.970* 547.909 (8.3874*** 5456.546 (1.6414* 3458.3599 (3.5634*** -1676.833 (-.003*** Dependent Variable: Property Price *** Significant at 99%, **Significant at 95%, *Significant at 90% The predictive accuracy is measured by the value of the Coefficient of Dispersion (COD. COD is the average percentage of absolute dispersion around the median predicted ratio. The lower the COD, the more uniform or less disperse the prediction within the study region. The International Association of Assessing Officers (IAOO views COD 15.0% to indicate a high quality or commendable prediction (Strauss et. al, 003. The equation for calculating the COD is given as COD = 100 Median P / O * ( P / O n = Median i 1 i i P / O n where: P = predicted value of the th i i property O = observed price of the th i i property Median P/O = median of the distribution of Pi / Oi Values of the COD in Table 3 indicate that the MWR model predict better than the MRA model. The MWR model has 3.3% COD while the MRA model has 5.1% COD. The MWR model prediction is less disperses than the MRA model. Maximum and minimum deviations of predicted prices relative to the observed prices are other indicators of how well a model predict. The MWR model has smaller values than the global MRA model for both measures which indicate that the predicted prices of the MWR model are closer to observed values than the predicted prices of the MRA model. 7

Table 3: Summary of Prediction Accuracy MRA MWR Max deviation RM50,497 RM 6,808 Min deviation RM14 RM0 Coefficient Of Dispersion (COD 5.1% 3.3% CONCLUSIONS The main objective of this paper is to preliminary asses the performance of the MWR model in mass appraisal of residential properties in Malaysia. The focus of the assessment is on the prediction accuracy in order for the model to be useful for mass appraisal, without compromising other aspects. This was done by comparing the performance of the MWR model and the conventional MRA global model. It was found that, despite limitations of the test dataset (small size, uses structural variables only, ignoring temporal effects, and in-sample prediction the MWR model can predict the property price better than the global model. This research will proceed with testing of datasets without the limitations indicated above. Larger and more representative test datasets with structural, accessibility, and neighborhood variables will be built and out-sample prediction will be carried out. The model will also incorporate temporal aspects of the data. It is hoped that the suitability of the MWR model to be used for mass appraisal of residential properties for rating in Malaysia will be ascertained. ACKNOWLEDGEMENT Financial support of this research under Agreement No: NAPREC (R&D 1/07 is gratefully acknowledged. 8

REFERENCES Chris Lloyd, Ian Shuttleworth (003. Analysing commuting using local regression techniques: scale, sensitivity, and geographical patterning Environment and Planning A 005, volume 37, pages 81 103 F.Long, A.Paez and S.Faber (007 Spatial effect in hedonic price estimation: A case study in Toronto Centre for Spatial Analysis, McMaster University Fotheringham A.S., Brunsdon C. and Charlton M.E. (00 Geographically Weighed Regression: the Analysis of Spatially Varying Relationships. West Sussex: John Wiley & Sons Ltd Robert P. Strauss and David A. Strauss (003 Residential Real Estate Assessment Fairness in Four Urban Areas 96 TH Annual Conference on Taxation, Carnegie Mellon University Steven Farber (004. A Comparison of Localized Regression Models in a Hedonic House Price Context. Ryerson University, Toronto, Ontario, Canada 9