SPATIO-TEMPORAL REGRESSION MODELS FOR DEFORESTATION IN THE BRAZILIAN AMAZON

Similar documents
Copulas, a novel approach to model spatial and spatio-temporal dependence

*Department Statistics and Operations Research (UPC) ** Department of Economics and Economic History (UAB)

Time: the late arrival at the Geocomputation party and the need for considered approaches to spatio- temporal analyses

Roger S. Bivand Edzer J. Pebesma Virgilio Gömez-Rubio. Applied Spatial Data Analysis with R. 4:1 Springer

Franck Nadaud-CIRED Mário Jorge Mendonca- IPEA

International Journal of Remote Sensing, in press, 2006.

Spatio-temporal analysis and interpolation of PM10 measurements in Europe for 2009

Drew Behnke Food and Agriculture Organization of the United Nations UC Berkeley and UC Santa Barbara

Combining Regressive and Auto-Regressive Models for Spatial-Temporal Prediction

Statistics: A review. Why statistics?

The Tropical Rainforest Ecosystem

Land Accounts - The Canadian Experience

An Introduction to Spatial Statistics. Chunfeng Huang Department of Statistics, Indiana University

MODELLING AND UNDERSTANDING MULTI-TEMPORAL LAND USE CHANGES

NRCSE. Statistics, data, and deterministic models

Spatial bias modeling with application to assessing remotely-sensed aerosol as a proxy for particulate matter

Watershed Classification with GIS as an Instrument of Conflict Management in Tropical Highlands of the Lower Mekong Basin

Helping the Red Cross predict flooding in Togo

Stochastic Hydrology. a) Data Mining for Evolution of Association Rules for Droughts and Floods in India using Climate Inputs

EXPLORATORY SPATIAL DATA ANALYSIS OF BUILDING ENERGY IN URBAN ENVIRONMENTS. Food Machinery and Equipment, Tianjin , China

Site-specific Prediction of Mosquito Abundance using Spatio-Temporal Geostatistics

UNITED STATES AND SOUTH AMERICA OUTLOOK (FULL REPORT) Thursday, December 28, 2017

Creating and Managing a W Matrix

Temporal vs. Spatial Data

Brent Coull, Petros Koutrakis, Joel Schwartz, Itai Kloog, Antonella Zanobetti, Joseph Antonelli, Ander Wilson, Jeremiah Zhu.

Estimating the long-term health impact of air pollution using spatial ecological studies. Duncan Lee

UNITED STATES AND SOUTH AMERICA SNAPSHOT REPORT Wednesday, December 20, 2017

The Cost of Transportation : Spatial Analysis of US Fuel Prices

UNITED STATES AND SOUTH AMERICA SNAPSHOT REPORT Thursday, December 21, 2017

USE OF SATELLITE IMAGES FOR AGRICULTURAL STATISTICS

Introduction to Spatial Statistics and Modeling for Regional Analysis

Predictive spatio-temporal models for spatially sparse environmental data. Umeå University

UNITED STATES AND SOUTH AMERICA WEATHER ROUNDUP Sunday, December 17, 2017

Urban remote sensing: from local to global and back

Application of eigenvector-based spatial filtering approach to. a multinomial logit model for land use data

Parameter selection for region-growing image segmentation algorithms using spatial autocorrelation

Spatio-temporal statistical models for river monitoring networks

Gridded monthly temperature fields for Croatia for the period

On dealing with spatially correlated residuals in remote sensing and GIS

Spatio-temporal dynamics of Marbled Murrelet hotspots during nesting in nearshore waters along the Washington to California coast

Introduction to Econometrics

Effects of interannual climate variability on tropical tree cover

ENGRG Introduction to GIS

Hierarchical Bayesian Modeling of Multisite Daily Rainfall Occurrence

Heteroskedasticity. y i = β 0 + β 1 x 1i + β 2 x 2i β k x ki + e i. where E(e i. ) σ 2, non-constant variance.

Spatial Relationships in Rural Land Markets with Emphasis on a Flexible. Weights Matrix

Model Selection for Geostatistical Models

SPATIAL-TEMPORAL TECHNIQUES FOR PREDICTION AND COMPRESSION OF SOIL FERTILITY DATA

Supporting Information for: Effects of payments for ecosystem services on wildlife habitat recovery

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Geospatial technology for land cover analysis

Spatial modeling of the Amazon deforestation

Secondary Towns, Population and Welfare in Mexico

Outline. Overview of Issues. Spatial Regression. Luc Anselin

Preface. Geostatistical data

ESTIMATING THE MEAN LEVEL OF FINE PARTICULATE MATTER: AN APPLICATION OF SPATIAL STATISTICS

P -spline ANOVA-type interaction models for spatio-temporal smoothing

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Brazil Paper for the. Second Preparatory Meeting of the Proposed United Nations Committee of Experts on Global Geographic Information Management

Modelling Dependence in Space and Time with Vine Copulas

DATA BASE AND METHODOLOGY

Migration Clusters in Brazil: an Analysis of Areas of Origin and Destination Ernesto Friedrich Amaral

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

FAPESP RESEARCH PROGRAMME ON GLOBAL CLIMATE CHANGE

Spatio-temporal precipitation modeling based on time-varying regressions

GEOGRAPHY Practice Answer Paper - 2

Predicting daily total phosphorus and suspended solids across Wisconsin stream reaches for impairment assessment

Cambridge International Examinations Cambridge International General Certificate of Secondary Education

Regression Analysis. A statistical procedure used to find relations among a set of variables.

ENV208/ENV508 Applied GIS. Week 1: What is GIS?

Human Activity in the Amazon

Spatial Analysis 1. Introduction

Global Land Project: major scientific questions for coupled modelling

IDENTIFICATION OF TRENDS IN LAND USE/LAND COVER CHANGES IN THE MOUNT CAMEROON FOREST REGION

Spatio-temporal modeling of weekly malaria incidence in children under 5 for early epidemic detection in Mozambique

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Automatic Autocorrelation and Spectral Analysis

Edzer Pebesma (ifgi) Dan Cornford (AST) Gregoire Dubois (EC DG JRC) ifgi. Institute for Geoinformatics University of Münster

A time series is called strictly stationary if the joint distribution of every collection (Y t

MEADOWS PRIMARY SCHOOL and NURSERY GEOGRAPHY POLICY

Topic 10: Panel Data Analysis

I used 45 environmental variables to explain differences in species richness among grid cells

SPACE Workshop NSF NCGIA CSISS UCGIS SDSU. Aldstadt, Getis, Jankowski, Rey, Weeks SDSU F. Goodchild, M. Goodchild, Janelle, Rebich UCSB

Landforms. Lesson 1: Physical Geography of Canada Mexico. Canada s mountainous eastern and western edges create a central region of fertile plains.

Remote sensing technique to monitoring the risk of soil degradation using NDVI

Price Formation on Land Market Auctions in East Germany An Empirical Analysis

Urban land value, Accessibility and Night Light Data: A Case Study in China

Spatial analysis of residential development processes uncovering the Dutch densification potential

A Guide to Modern Econometric:

Bayesian dynamic modeling for large space-time weather datasets using Gaussian predictive processes

Lecture 7 Autoregressive Processes in Space

Proposal Framework to Spatial Data Mining

Name Class Date. For each pair of terms, explain how the meanings of the terms differ.

Application of Remote Sensing Techniques for Change Detection in Land Use/ Land Cover of Ratnagiri District, Maharashtra

Abstract: About the Author:

Econometric Forecasting Overview

FORECASTING SUGARCANE PRODUCTION IN INDIA WITH ARIMA MODEL

GMM estimation of spatial panels

Spatio-temporal prediction of site index based on forest inventories and climate change scenarios

ATMOSPHERIC MODEL. Iracema Fonseca Albuquerque Cavalcanti

Transcription:

SPATIO-TEMPORAL REGRESSION MODELS FOR DEFORESTATION IN THE BRAZILIAN AMAZON Giovana M. de Espindola a, Edzer Pebesma b,c1, Gilberto Câmara a a National institute for space research (INPE), Brazil b Institute for Geoinformatics, University of Münster, Germany c 52 North GmbH, Muenster KEY WORDS: land use change, spatial simultaneous autoregression ABSTRACT: Deforestation in the Brazilian Amazon has sharply decreased over the past five years. In this study we try to explain the spatio-temporal pattern of deforestation in a selected area by relating data from 2002-2008 to a number of explanatory variables, part of which are related to control actions conducted by the government. We do so by considering the yearly fraction of deforestation for 25 km x 25 km cells, and spatial multiple regression models that incorporate autoregressive components in space and in time, as well as spatial, temporal and spatio-temporal physical and human-induced predictors. The ultimate goal is to evaluate the effect of control actions, and to obtain process knowledge needed for land change models needed to evaluate future actions. 1. INTRODUCTION As one of the largest tropical forests in the world, the Brazilian Amazon is an area where deforestation affects environmental themes such as biodiversity and greenhouse gas emission with global proportions. After a long period of increase, deforestation in the Brazilian Amazon has sharply decreased over the past five years. Following Aguiar et al. (2007), in this study we try to explain the spatio-temporal changes of deforestation in the Brazilian Amazon by relating yearly data from 2002-2008 to a number of explanatory variables. We do so by considering the yearly fraction of deforestation for 25 km 25 km cells, and by using spatial multiple regression models that incorporate autoregressive components in space and in time, and predictors that vary over space, over time and over space-time. The goal is to understand the changes in deforestation, and ultimately to understand the effect of control actions and to obtain process knowledge needed for land change models that are developed to evaluate future actions. 2. DATA The dependent variable is yearly deforestation for 25 km 25 km grid cells, shown in figure 1. The variability of deforestation is explained by (i) an autocorrelation 1 corresponding author, edzer.pebesma@uni-muenster.de effect (in space, time, or space-time) and (ii) by external variables. A large number of external predictor variables. Of these predictor variables, some varied only over time (e.g. world market prices), some varied only over space (e.g. distance to river), and some varied over space and time. For the variables that changed over space and time, both the initial (2002) value was offered as a prediction, and the temporal change of the time before (e.g., for time step 2003, the change 2003-2002) was offered as predictor. Several of the socio-economic spatio-temporally varying variables were only available at the spatial level of administrative units. In these cases, they were assigned to the grid cell for which the unit was dominant. Temporal predictors included: price of soy bean and price of meat. Spatial precitors included: distance to nearest municipality, distance to nearest capital in Legal Amazon Euclidian distance to Sao Paulo, Euclidian distance to nearest port, Euclidian distance to nearest river, Euclidian distance to nearest mineral deposit, Euclidian distance to nearest road, Euclidian distance to nearest timber industry, Percentage of high fertility soils, Percentage of low fertility soils, Percentage of very low fertility soils, Strength of connection to ports through road networks, Strength of connection to Sao Paulo through road networks, Strength of connection 1

2003 2004 2005 0.20 0.15 2006 2007 2008 0.10 0.05 0.00 Figure 1: Yearly deforestation rates in the Brazilian Amazon, per year, as fractions of 25 km 25 km cells, over the period 2003-2008 to Sao Paulo and Recive through road networks, Average temperature for the three driest months, Average precipitation for the three driest months, Seasonal index, Humidity index, Percentage of conservation units in 2002, Total area of soybean in 2002, Total area of sugarcane in 2002, control actions in 2002, population in 2002, and total of exports in 2003. Spatio-temporal predictors included: change in percentage deforestation (as autoregressive predictor for model 1 only), change in cell percentage of conservation units, change in area of soybeans, change in area of sugarcane, change in control actions, change in population, and change in total exports. 3. METHODS Regression modelling approximates a dependent variable with n observations y = (y 1,..., y n ) to a set of p independent variables x j = (x 1j,..., x nj ) by a linear function, y = p β j x j + e = Xβ + e j=1 where X is the design matrix that has x ij on row i and column j. The regression coefficient vector β is typically estimated by minimizing the residual sum of squares, e e. Simultaneous autoregression (SAR) models (Cressie and Wikle, 2011) define the residual process y Xβ to follow an autoregressive process, i.e. which can be rewritten as Y Xβ = B(Y Xβ) + v Y = Xβ + (I B) 1 v (1) where v follows a zero-mean normal distribution with covariance matrix σ 2 I (i.e., is independent), and B defines which residuals are correlated, and to which degree. Typically, B is sparse, and B ii = 0. Non-zero values B ij occur only when Y i and Y j are neighbours. Additionally, we assume that the non-zero values of B have a single value, which is the parameter that describes the degree of autocorrelation. This value will be called λ: for any non-zero B ij, cells i and j are neighbours and B ij = λ. 2

To define spatial neighbours, in this study we used the queen neighbours, meaning the 8 cells adjacent to each grid cell, or less in case of boundary cells or missing valued (or masked) pixels in the neighbourhood. For a spatio-temporal regression model, where we will denote y [t] = (y 1,t,..., y n,t ) as the observation in grid cell i and time step t {1,..., m}. As a first step from purely spatial SAR models towards spatio-temporal SAR models, in addition to the spatial autoregressive effect of the residuals we can take a temporally lagged observation y [t 1] into the regression, as in y [t] = Xβ + γy [t 1] + (I B) 1 v, t = 2,..., m (2) where B addresses spatial neighbours only. We will call this model 1. In a second approach, the SAR model (1) is specified for all time steps, but the B matrix not only addresses spatial neighbours y i,t and y j,t with i j, but also the two temporal neighbours of y i,t, y i,t 1 and y i,t+1. A simplifying assumption here is that a single autocorrelation coeficient describes the correlation both in space and time. We will call this model model 2. The third model, model 3 extends model 2 with spatiotemporal neighbours, i.e. residuals y i,t and y j,t+1 are correlated when grid cells i and j are neighbours. Again, a single correlation coefficient is fitted to describe correlations between all (spatial, temporal, and spatio-temporal) neighbours. Figure 2 shows the different neighbours defined in models 1, 2 and 3. Regressions were carried out with the R function spautolm in R package spdep (Bivand et al., 2008). This function provides maximum likelihood estimation of β and λ, but does not simultaneously estimate λ, γ and β using maximum likelihood. One solution to this would be to define neighbours in space and time, combined with a weighting factor that defines how neighbouring in space compares to neighbouring in time, in terms of weights, would be a minimum requirement for this to make sense. The solution chosen here was to add the temporal factor to the fixed effects Xβ, effectively leading to an more least squares oriented solution. 4. RESULTS Maps of yearly deforestation for the period 2002-2007 are shown in figure 1. The explanatory variables addressed are, for each grid cell and time step t defined in section 2. Each of the regression models 1, 2 and 3 (section 3) were computed for the full set of prectors. Table 1 lists the regression coefficients for those variables that were found significant for at least one of the three models at the α = 0.1 level. From these results it can be seen that a fair number of predictors is significant, and has similar standardized regression coefficient values, for each of the three models. Interestingly, the two purely temporal (time series) variables are highly significant. This is relatively easy, meat prices for instance gradually increase over the time period considered and can account for the gradual decreas in deforestation rate. It is also clear from the λ values and the autoregression coefficient for change in deforestation t 1 that autocorrelation in space and time is different. This was ignored for models 2 and 3, where a single λ value was fitted. 5. DISCUSSION AND CONCLUSIONS Building on the work of Aguiar et al. (2007) who looked at spatial regressions, and Espindola et al. (in press) who compared regression models for two moments in time, This paper gives a first step into the direction of directly modelling and explaining temporal changes in deforestation for 25 km 25 km grid cells covering the Brazilian Amazon. We did this by including predictors related to changes in protected areas, changes in amount of cattle, changes in soy bean and sugar cane plantation coverage. The regression model evaluated here considers yearly deforestation as it depends on the very limited set for which spatially distributed time series were available. In addition, it was only evaluated to which extent the change in deforestation depended on the changes in each of these independent variables, i.e. the variables as such were not included directly as predictor. As a consequence, a number of effects found significant may result from confounding effects. No pure time series (e.g. market prices) or purely spatial factors (e.g. climate) were included. Improved understanding of the governing processes may be obtained by evaluating a wider range of regression models. The regression model entertained here (1) was held deliberately simple, and these first results should be interpreted with caution and some reservations. Improvement of these first results might be obtained when (i) transformation of the dependent and/or independent variables improve the linearity of the relationships, (ii) other grid cell 3

Model 1 Model 2 Model 3 t+1 t t 1 spatial correlation autoregression temporal correlation spatio temporal correlation Figure 2: Neighbours addressed for models 1, 2 and 3. sizes are used than the current 25 km 25 km cells used here, (iii) more than one time lagged autoregressive terms are used (iv) an estimation procedure is used that can model autocorrelation in space and time separately. Not only variables may be omitted, leading to bias and confounding of those present, it is also possible that some variables need be omitted as they may explain variability for the wrong reason (e.g. the increase in meat price may not be the reason for decrease in deforestation rates). Previous results have shown that protected areas are significant in preventing deforestation in high-pressure areas, and the creation of those areas have been increased as a control policy applied by the Brazilian government. On the other hand, a debate is growing about the extent of the deforestation as a result of the expansion of cattle (pasture) and soy industry. Most recent analyses suggest that deforestation is driven by the expansion of cattle ranching, rather than soy bean. Soy bean and sugar cane seem to be replacing deforested areas previously under pasture. References Aguiar, A. P. D., Camara, G., and Escada, M. I. S. (2007). Spatial statistical analysis of land-use determinants in the Brazilian Amazonia: exploring intra-regional heterogeneity. Ecological Modelling, Volume 209 (Issues 2-4), Pages 169-188 Bivand, R. S., Pebesma, E. J., and Gomez-Rubio, V. (2008). Applied Spatial Data Analysis with R. Springer, NY. Cressie, N. and C.K. Wikle, 2011. Statistics for spatiotemporal data. Wiley, NY. De Espindola, G., A.P.D. de Aguiar, E. Pebesma, G. Câmara, L. Fonseca, accepted. Agricultural land use dynamics in the Brazilian Amazon based on remote sensing and census data, Applied Geography, http://dx.doi.org/10.1016/j. apgeog.2011.04.003. 4

predictor model 1 model 2 model 3 change in conservation units ST -0.0133 * change in percentage soybean ST 0.0112. 0.0145 * change in control actions ST 0.0078 * change in deforestation t 1 ST 0.4450 *** price soy bean T 0.0593 *** 0.0845 *** 0.0700 *** price meat T -0.0736 *** -0.0496 ** -0.0217 ** distance to nearest municipality S -0.0177 * distance to Sao Paulo S -0.0474. -0.1487 * -0.1859 ** distance to roads S -0.0437 * -0.0721 ** -0.0543. Strength of connection to Sao Paulo S 0.0824 *** 0.1081 * 0.1100 * through road networks Seasonal index S -0.2126. -0.3278. Humidity index S 0.2102. 0.3207. Percentage of conservation units in S -0.0135 * -0.0323 *** -0.0316 *** 2002 Total area of soybean in 2002 S -0.0137 * -0.0330 *** -0.0405 *** control actions in 2002 S 0.0174 *** population in 2002 S -0.0145 * -0.0178. -0.0153. Percentage of very low fertility S 0.0800 * 0.0753 * soils Percentage of low fertility soils S 0.0411 * 0.0390. Percentage of high fertility soils S 0.0761 * 0.0726 * Average precipitation for the three S -0.0808. driest months λ 0.667 0.874 0.895 σ 2 0.371 0.382 0.460 Nagelkerke R 2 0.595 0.565 0.508 Table 1: Standardized regresion model coefficients, and their significance (codes: 0 < *** 0.001 < ** 0.01 < * 0.05 <. 0.1). S indicates purely spatial predictors, T purely temporal predictors, and ST spatio-temporally varying predictors. The predictor change in deforestation t 1 was only offered to model 1, as the other models dealt with autocorrelation in a different way (see figure 2). λ is the estimated autocorrelation coefficient, σ 2 the residual variance. 5