TRAFFIC FLOW MODELING AND FORECASTING THROUGH VECTOR AUTOREGRESSIVE AND DYNAMIC SPACE TIME MODELS

Similar documents
FORECASTING TRAFFIC FLOW CONDITIONS IN AN URBAN NETWORK: COMPARISON OF MULTIVARIATE AND UNIVARIATE APPROACHES

IBM Research Report. Road Traffic Prediction with Spatio-Temporal Correlations

Space time modeling of traffic flow

A multivariate state space approach for urban traffic flow modeling and prediction

TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA

Open Research Online The Open University s repository of research publications and other research outputs

SOME COMMENTS ON THE THEOREM PROVIDING STATIONARITY CONDITION FOR GSTAR MODELS IN THE PAPER BY BOROVKOVA et al.

FORECASTING SUGARCANE PRODUCTION IN INDIA WITH ARIMA MODEL

Traffic Flow Forecasting for Urban Work Zones

Univariate Short-Term Prediction of Road Travel Times

FREEWAY SHORT-TERM TRAFFIC FLOW FORECASTING BY CONSIDERING TRAFFIC VOLATILITY DYNAMICS AND MISSING DATA SITUATIONS. A Thesis YANRU ZHANG

Multivariate Hierarchical Bayesian space-time models in economics

Dynamic Time Series Regression: A Panacea for Spurious Correlations

Multivariate Time Series Analysis and Its Applications [Tsay (2005), chapter 8]

Elements of Multivariate Time Series Analysis

Graphical LASSO for local spatio-temporal neighbourhood selection

Improving the travel time prediction by using the real-time floating car data

STARIMA-based Traffic Prediction with Time-varying Lags

Time Series Analysis. James D. Hamilton PRINCETON UNIVERSITY PRESS PRINCETON, NEW JERSEY

The PPP Hypothesis Revisited

Time Series Analysis. James D. Hamilton PRINCETON UNIVERSITY PRESS PRINCETON, NEW JERSEY

Improving forecasting under missing data on sparse spatial networks

Design of Time Series Model for Road Accident Fatal Death in Tamilnadu

FORECASTING. Methods and Applications. Third Edition. Spyros Makridakis. European Institute of Business Administration (INSEAD) Steven C Wheelwright

Predicting freeway traffic in the Bay Area

CHAPTER 5 FORECASTING TRAVEL TIME WITH NEURAL NETWORKS

Short-term traffic volume prediction using neural networks

A TIME SERIES PARADOX: UNIT ROOT TESTS PERFORM POORLY WHEN DATA ARE COINTEGRATED

TIME SERIES ANALYSIS. Forecasting and Control. Wiley. Fifth Edition GWILYM M. JENKINS GEORGE E. P. BOX GREGORY C. REINSEL GRETA M.

Comparison between VAR, GSTAR, FFNN-VAR and FFNN-GSTAR Models for Forecasting Oil Production

Predictive spatio-temporal models for spatially sparse environmental data. Umeå University

Evaluation of Some Techniques for Forecasting of Electricity Demand in Sri Lanka

Estimating the vehicle accumulation: Data-fusion of loop-detector flow and floating car speed data

Fast Boundary Flow Prediction for Traffic Flow Models using Optimal Autoregressive Moving Average with Exogenous Inputs (ARMAX) Based Predictors

Part I State space models

A Brief Overview of Machine Learning Methods for Short-term Traffic Forecasting and Future Directions

ecompass eco-friendly urban Multi-modal route PlAnning Services for mobile users ecompass TR 045 Checking the Stationarity of the Datasets

MCMC analysis of classical time series algorithms.

Time Series I Time Domain Methods

at least 50 and preferably 100 observations should be available to build a proper model

Box-Jenkins ARIMA Advanced Time Series

Basics: Definitions and Notation. Stationarity. A More Formal Definition

A SEASONAL TIME SERIES MODEL FOR NIGERIAN MONTHLY AIR TRAFFIC DATA

FORECASTING METHODS AND APPLICATIONS SPYROS MAKRIDAKIS STEVEN С WHEELWRIGHT. European Institute of Business Administration. Harvard Business School

A Course in Time Series Analysis

ARIMA modeling to forecast area and production of rice in West Bengal

1 Spatio-temporal Variable Selection Based Support

Do we need Experts for Time Series Forecasting?

TAKEHOME FINAL EXAM e iω e 2iω e iω e 2iω

Development of Space Time Model with. Exogenous Variable by Using Transfer Function. Model Approach on the Rice Price Data

Travel Time Estimation with Correlation Analysis of Single Loop Detector Data

FORECASTING THE INVENTORY LEVEL OF MAGNETIC CARDS IN TOLLING SYSTEM

{ } Stochastic processes. Models for time series. Specification of a process. Specification of a process. , X t3. ,...X tn }

Time Series Analysis -- An Introduction -- AMS 586

A Comparison of the Forecast Performance of. Double Seasonal ARIMA and Double Seasonal. ARFIMA Models of Electricity Load Demand

Gaussian Processes for Short-Term Traffic Volume Forecasting

Multivariate forecasting with VAR models

Forecasting: Methods and Applications

Multivariate ARMA Processes

Financial Econometrics

Forecasting Egyptian GDP Using ARIMA Models

APPLIED TIME SERIES ECONOMETRICS

Modelling Monthly Rainfall Data of Port Harcourt, Nigeria by Seasonal Box-Jenkins Methods

Framework for Traffic Pattern Identification: Required Step for Short-term Forecasting

Forecasting Data Streams: Next Generation Flow Field Forecasting

ARIMA Modelling and Forecasting

New Introduction to Multiple Time Series Analysis

MGR-815. Notes for the MGR-815 course. 12 June School of Superior Technology. Professor Zbigniew Dziong

SCIENCE & TECHNOLOGY

Development and Application of the Network Weight Matrix to Predict Traffic Flow for Congested and Uncongested Conditions

Asian Economic and Financial Review. SEASONAL ARIMA MODELLING OF NIGERIAN MONTHLY CRUDE OIL PRICES Ette Harrison Etuk

Empirical Study of Traffic Velocity Distribution and its Effect on VANETs Connectivity

Classification of Forecasting Methods Based On Application

A Non-Parametric Approach of Heteroskedasticity Robust Estimation of Vector-Autoregressive (VAR) Models

Forecasting. Simon Shaw 2005/06 Semester II

Multivariate Short-term Traffic Flow Forecasting using Bayesian Vector Autoregressive Moving Average Model

Spatio-Temporal Analysis of Air Pollution Data in Malta

Available online at ScienceDirect. Procedia Computer Science 72 (2015 )

Analysis. Components of a Time Series

Lecture 2: Univariate Time Series

Time Series: Theory and Methods

Empirical Market Microstructure Analysis (EMMA)

ARIMA Models. Richard G. Pierse

TIME SERIES DATA PREDICTION OF NATURAL GAS CONSUMPTION USING ARIMA MODEL

Measuring Wave Velocities on Highways during Congestion using Cross Spectral Analysis

UNIVARIATE TIME SERIES ANALYSIS BRIEFING 1970

Frequency Forecasting using Time Series ARIMA model

Cross-sectional space-time modeling using ARNN(p, n) processes

REVIEW OF SHORT-TERM TRAFFIC FLOW PREDICTION TECHNIQUES

Automatic Forecasting

Advanced Econometrics

A SARIMAX coupled modelling applied to individual load curves intraday forecasting

Econ 423 Lecture Notes: Additional Topics in Time Series 1

Estimation and application of best ARIMA model for forecasting the uranium price.

Cointegrated VARIMA models: specification and. simulation

Residuals in Time Series Models

Departamento de Estadfstica y Econometrfa Statistics and Econometrics Series 27. Universidad Carlos III de Madrid December 1993 Calle Madrid, 126

Econometric Forecasting

Forecasting using R. Rob J Hyndman. 2.4 Non-seasonal ARIMA models. Forecasting using R 1

Seasonal Autoregressive Integrated Moving Average Model for Precipitation Time Series

Transcription:

TRAFFIC FLOW MODELING AND FORECASTING THROUGH VECTOR AUTOREGRESSIVE AND DYNAMIC SPACE TIME MODELS Kamarianakis Ioannis*, Prastacos Poulicos Foundation for Research and Technology, Institute of Applied & Computational Mathematics, Regional Analysis Division, Heraklion, Crete Tel: +30810 391771, Fax: +30810 391761, e-mail: kamarian@iacm.forth.gr ABSTRACT Research on purely inductive models for traffic flow description and forecasting has started at the beginning of the eighties. Till recently, this kind of methods were univariate in nature, that is for the prediction of the future state of traffic at a specific site only information from this site was taken into account and were used only for short term forecasting purposes. In recent studies, multivariate models that combine information from neighboring sites have been used and proved to forecast better from univariate models. Furthermore, multivariate models describe the spatiotemporal evolution of traffic flow whereas univariate models describe only the temporal evolution and may be used for scenario analysis (i.e. quantification of the effects of a change in traffic flow at a specific place to downstream locations) that is they may serve for control purposes. In this short paper, we present how the most significant aspects of a road network topology may be incorporated into a vector autoregressive model to result a space-time model that may be used for the description of the traffic flow of a road network as a whole. In the full paper, we use 7,5 minute volume and occupancy measurements from urban arterial streets near downtown Athens, to develop models that feed on data from upstream detectors to describe the flow at downstream locations. Model stability for different time periods of the day and different months of the year is checked and a comparative study of various inductive methods concerning short term forecasting performance also takes place. KEYWORDS: Traffic flow prediction, spatial time series, space time models 1 INTRODUCTION Urban traffic congestion necessitates the existence of systems for traffic flow monitoring, prediction and control. In numerous big cities, loop detectors, video cameras, radar systems and probe vehicles monitor the flow. Various inductive techniques have been applied to data coming from the aforementioned sources the last twenty years. Specifications have ranged from non-parametric regression (Davis & Nihan 1991), regression with time varying coefficients (Rice & Van Swet 2001), Kalman filtering (Whittaker 1997) to neural networks (van Lint & Hoogerdoorn 2002).

Application of time series techniques was a natural consequence since in econometrics, where these techniques are widely applied, there are high prediction accuracy needs and worse data from both qualitative and quantitative perspectives comparing to traffic flow problems. Initially, univariate ARIMA models were used to predict the future behavior of a given location from its historical data (Williams Durvasula & Brown, 1997, Lee & Fambro, 1999) and more recently multivariate models were applied to predict the future state of a site taking past information from itself and its neighbors (Williams, 2001, Stathopoulos & Karlaftis, 2002) and proved to be superior to univariate methods as far as short term forecasting is concerned. A significant argument in favor of multivariate methods is that they can be used for control purposes since they give (under some simplifying assumptions) one the opportunity to assess the effects of changes in traffic flows at specific locations to their neighbors. Another advantage is the ease in implementation for most methods through commercial software as SAS or MATLAB and the feasibility of dynamic representation of continuously refreshed predictions for short time intervals. The present paper concentrates on developing flexible and explicitly multivariate time series models that account for the characteristics of the network topology using core urban area loop detector data. The remainder of this paper is organized as follows. The next section is a brief overview of the Vector Autoregressive time series models that allow for jointly considering data from different detectors but cannot incorporate much of the network topology in their regular formulation. In the sequel, we concentrate on space-time models that can be formulated as restricted vector autoregressive models and focus on their implementation to the spatial time series that arise from multiple loop detector data. In the full paper we will present an application of the aforementioned techniques to multiple loop detector data taken from urban arterial streets near downtown Athens. Finally, a comparative study among various univariate and multivariate techniques concerning short term forecasting will take place. 2 METHODOLOGY 2.1 VECTOR AUOREGRESSIVE MOVING AVERAGE MODELS The Vector Autoregressive Moving Average (VARMA) models aim to determine the dynamic interactions among multiple time series. The determination refers to the detection, quantification and prediction of these dynamic interactions. VARMA models can be viewed as a subclass of the State-Space models that are widely used in physics and engineering. The State-Space approach models multiple input-multiple output systems and has recently been applied to observations from successive loop detectors placed in a single road in an effort to optimize short-term traffic flow forecasts (Stathopoulos & Karlaftis, 2002). VARMA

models can be applied to traffic flow data from multiple detectors of a road network in an unrestricted form if the aim is the identification of similar traffic flow patterns among various roads at different time lags in the past or in a restricted form that indicates for each detector the upstream neighbors that serve as inputs (apart from its own past) if the aim is the quantification of the output s dependence to each input at different time lags in the past. Weight matrices that mirror the placement of the loop detectors in the network under study can be incorporated in a VARMA model to result a Space-Time Autoregressive Moving Average (STARMA) model. STARMA models aim to describe and predict multiple spatially referenced time series and will be extensively discussed in the next subsection since their formulation fits very well to the problem under study. VARMA processes are distinguished by their autoregressive and moving average orders. The p-th autoregressive order, q-th moving average order process is written as y t = δ +Φ 1 y t 1 +... +Φ p y t p + ε t Θ 1 ε t 1... Θ q ε t q (2.1) where y t = ( y 1t,..., y kt ),t = 0,1,... denotes a k-dimensional time series vector which in our case contains the observations of the k loop detectors under study at time t, ε t is a vector white noise process with ε t = ( ε 1t,...,ε kt ) such that E ε t ( )= 0, E( ε t )= 0 for t s and Φ 1,...,Φ t p, Θ 1,...,Θ t q represent matrices containing unknown coefficients that have to be estimated. Thus, a VARMA model regresses present observations to their past and to past prediction errors. In practice one first estimates the appropriate order of the model through a penalized likelihood information criterion like Akaike s information criterion or Schwarz Bayesian criterion and afterwards proceeds to coefficient estimation through least squares or maximum likelihood. Finally the residuals have to be checked on the satisfaction of the model hypotheses. Restrictions can be placed to the coefficient matrices to reflect our beliefs on dependencies among detectors. Thus, if we want to quantify the dependence of detector b to some upstream ones that give information that serve as input we can restrict the b-th row of the matrices Φ 1,...,Φ t p, Θ 1,...,Θ t q to contain zero elements at columns that do not correspond to input detectors. VARMA models can easily be implemented the last couple of years through commercial software as SAS or MATLAB. For specialized text on this concept the reader should consult Lutkerpohl (1987,1993) and Ooms (1994). ε s 2.2 SPACE-TIME MODELS A multivariate time series model can be further refined if the process under study exhibits systematic dependence between the observations at each location and the observations at neighbouring locations. This phenomenon is called spatial correlation. Models that explicitly attempt to explain these dependencies across space are referred to as space-time models. Various formulations have appeared in the literature depending on the form of data

and the aim of the application. In this study, we follow the Space Time Autoregressive Moving Average (STARMA) approach firstly illustrated by Pfeifer and Deutsch (1980 a, b) that can be easily implemented through relatively simple modifications of vector autoregressive models (see Giacomini & Granger 2001). STARMA models are characterized by linear lagged spatiotemporal dependence. Suppose the observations z i (t) of the random variable Z i (t), which in our case reflects traffic conditions, are available for N spatial locations (i=1,..,n) and T time periods. The autoregressive structure of the STARMA model expresses the observations at time t taken from the i-th spatial location as a linear combination of past observations taken from location i and its neighboring measurement locations. If the same relationship holds for every site in the system, the process is said to exhibit spatial stationarity and is thus amenable to these forms of space-time models. Let L (l ), the spatial lag operator of spatial order l, be such that where L (0) z i (t) = z i (t) L (l ) z i (t) = N j =1 w ij (l ) z j (t) w ij are a set of weights with for all i and N j =1 w ij (l ) =1 w ij nonzero only if sites i and j are l th order neighbors. The matrix representation of the set of weights w ij is W, an N N square matrix with each row summing to one. For the N 1 column vector z (t) that contains the observations z i ( t), i = 1,..., N, the following relationships hold and L (0) z(t) = W (0) z(t) =Ι N z(t) L (l ) z(t) = W (l ) z(t) for l>0. The specification of the form of weights w ij network. Through w ij should reflect the configuration of the road one may incorporate the distances of the measurement locations i and j, natural barriers, or even the ease in accessibility of site i to site j. First order neighbors are those closest to the site of interest. Second order neighbors should be farther away than first order neighbors, but closer than third order neighbors (see figure 1).

2 2 1 2 2 1 0 1 2 2 1 2 2 Figure 1. First and second order neighbors for site 0. As in univariate time series, z i (t) is expressed as a linear combination of past observations and errors. Here, however, instead of allowing dependence of z i (t) only with past observations and errors at site i, dependence is allowed with neighboring sites of various spatial order. In particular p λ k q m k z i (t) = φ kl L (l ) z i (t k) θ kl L (l ) ε i ( t k)+ ε i (t) (2.1) k=1 l =0 k =1 l =0 where p is the autoregressive order, q is the moving average order, λk is the spatial order of the k th autoregressive term, m k is the spatial order of the k th moving average term, φ kl θ kl are parameters, and the ε (t) are random normal errors with i E[ ε i (t)]= 0 E[ ε i (t)ε j (t + s) ]= σ 2 for i=j, s=0 and 0 otherwise. The same model in vector form is p λ k q m k z(t) = φ kl W (l ) z i (t k) θ kl W ε( t k)+ ε(t) (2.2) k=1 l = 0 k =1 l =0 where ε (t) are normal with mean zero and E[ ε(t)ε(t + s) ]= σ 2 Ι N when s=0 και 0 otherwise. In order for the STARMA model to represent a stationary process, one in which the covariance structure does not change with time, certain conditions must be met. These conditions are called stationarity conditions and require that every x u that solves det x p u Ι p λ k k=1 l =0 φ kl W p k x u = 0

lies inside the unit circle ( x u <1). Effectively this requirement serves to determine a region of possible φ kl values that will result in a stationary process. For the spatial and temporal order identification, analogously to univariate time series, the sample properties of two functions, the spatial temporal autocorrelation function and the spatial temporal partial autocorrelation function, are examined. For a thorough discussion the reader should consult Pfeifer and Deutsch (1980 a, b). REFERENCES Davis, G.A. and Nihan, N.L. (1991). Nonparametric regression and short-term freeway traffic forecasting. ASCE Journal of Transportation Engineering 117(2), pp.168-178. Giacomini, R. and Granger, C.W.J. (2001). Aggregation of space-time processes. Manuscript, Department of Economics, University of California, San Diego. Lee, S. and Fambro, D.B. (1999). Application of subset autoregressive integrated moving average model for short-term freeway traffic volume forecasting. Transportation Research Record 1678, pp. 179-188. Lutkerpohl, H. (1987), Forecasting Vector ARMA Processes. Berlin: Springer-Verlag. Lutkerpohl, H. (1993), Introduction to Multiple Time Series Analysis. Berlin: Springer-Verlag. Ooms, M. (1994), Empirical Vector Autoregressive Modeling. Berlin: Springer-Verlag. Pfeifer, P.E. and Deutsch, S.J., (1980a). A three-stage iterative procedure for space-time modeling. Technometrics 22, 35-47. Pfeifer, P.E. and Deutsch, S.J., (1980b). Identification and interpretation of first-order space-time ARMA models. Technometrics 22, 397-408. Rice, J. and van Zwet E. (2001). A simple and effective method for predicting travel times on freeways. IEEE Intelligent Transportation Systems Proceedings. Stathopoulos, A. and Karlaftis, G.M. (2002). A multivariate state-space approach for urban traffic flow modeling and prediction. 81th Annual Transportation Research Board Meeting. Van Lint, J.W.C. and Hoogendoorn, S.P. (2002). Freeway travel time prediction with state-space neural networks. 81th Annual Transportation Research Board Meeting. Whittaker, J., Garside, S., Lindveld, K. (1997). Tracking and predicting a network traffic process. International Journal of Forecasting 13, 51-61. Williams, B.M., Durvasula, P.K. and Brown, D.E. (1997). Urban freeway travel prediction: application of seasonal ARIMA and Exponential Smoothing Models. 77th Annual Transportation Research Board Meeting. Williams, B.M. (2001). Multivariate vehicular traffic flow prediction: an evaluation of ARIMAX modeling. 80th Annual Transportation Research Board Meeting.