Journal of Asian Scientific Research PREDICTION OF FATAL ROAD TRAFFIC CRASHES IN IRAN USING THE BOX-JENKINS TIME SERIES MODEL. Ayad Bahadori Monfared

Similar documents
TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA

Suan Sunandha Rajabhat University

Forecasting Bangladesh's Inflation through Econometric Models

Empirical Approach to Modelling and Forecasting Inflation in Ghana

Spatiotemporal Analysis of Urban Traffic Accidents: A Case Study of Tehran City, Iran

Asitha Kodippili. Deepthika Senaratne. Department of Mathematics and Computer Science,Fayetteville State University, USA.

MODELING MAXIMUM MONTHLY TEMPERATURE IN KATUNAYAKE REGION, SRI LANKA: A SARIMA APPROACH

Design of Time Series Model for Road Accident Fatal Death in Tamilnadu

FORECASTING THE INVENTORY LEVEL OF MAGNETIC CARDS IN TOLLING SYSTEM

FORECASTING OF COTTON PRODUCTION IN INDIA USING ARIMA MODEL

Responsive Traffic Management Through Short-Term Weather and Collision Prediction

Discussion Papers In Economics And Business

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

AN ARTIFICIAL NEURAL NETWORK MODEL FOR ROAD ACCIDENT PREDICTION: A CASE STUDY OF KHULNA METROPOLITAN CITY

Autoregressive Integrated Moving Average Model to Predict Graduate Unemployment in Indonesia

ARIMA model to forecast international tourist visit in Bumthang, Bhutan

Contingency Tables. Safety equipment in use Fatal Non-fatal Total. None 1, , ,128 Seat belt , ,878

Lab: Box-Jenkins Methodology - US Wholesale Price Indicator

Time Series Analysis of Currency in Circulation in Nigeria

Spatial Variation in Local Road Pedestrian and Bicycle Crashes

Texas A&M University

Box-Jenkins ARIMA Advanced Time Series

ANALYZING THE IMPACT OF HISTORICAL DATA LENGTH IN NON SEASONAL ARIMA MODELS FORECASTING

Contingency Tables. Contingency tables are used when we want to looking at two (or more) factors. Each factor might have two more or levels.

Forecasting Area, Production and Yield of Cotton in India using ARIMA Model

Time Series Forecasting: A Tool for Out - Sample Model Selection and Evaluation

5 Autoregressive-Moving-Average Modeling

Univariate ARIMA Models

REVIEW OF SHORT-TERM TRAFFIC FLOW PREDICTION TECHNIQUES

Application of ARIMA Models in Forecasting Monthly Total Rainfall of Rangamati, Bangladesh

FORECASTING COARSE RICE PRICES IN BANGLADESH

Available online at ScienceDirect. Procedia Computer Science 72 (2015 )

Forecasting Foreign Direct Investment Inflows into India Using ARIMA Model

A Comparison of the Forecast Performance of. Double Seasonal ARIMA and Double Seasonal. ARFIMA Models of Electricity Load Demand

Trend and Variability Analysis and Forecasting of Wind-Speed in Bangladesh

STAC51: Categorical data Analysis

Chapter 3. Regression-Based Models for Developing Commercial Demand Characteristics Investigation

Automatic Forecasting

Influence Regularity of Fog on Expressway in China

The ARIMA Procedure: The ARIMA Procedure

Asian Economic and Financial Review. SEASONAL ARIMA MODELLING OF NIGERIAN MONTHLY CRUDE OIL PRICES Ette Harrison Etuk

A Comparison of Time Series Models for Forecasting Outbound Air Travel Demand *

THE APPLICATION OF GREY SYSTEM THEORY TO EXCHANGE RATE PREDICTION IN THE POST-CRISIS ERA

LATVIAN GDP: TIME SERIES FORECASTING USING VECTOR AUTO REGRESSION

Available online at ScienceDirect. Procedia Computer Science 64 (2015 )

Using Analysis of Time Series to Forecast numbers of The Patients with Malignant Tumors in Anbar Provinc

Forecasting the Prices of Indian Natural Rubber using ARIMA Model

Chapter 10. Chapter 10. Multinomial Experiments and. Multinomial Experiments and Contingency Tables. Contingency Tables.

Chapter 3: Regression Methods for Trends

Time Series Analysis of Monthly Rainfall data for the Gadaref rainfall station, Sudan, by Sarima Methods

United Kingdom (UK) Road Safety Development. Fatalities

Detection and Forecasting of Islamic Calendar Effects in Time Series Data

11-2 Multinomial Experiment

STATE BANK OF PAKISTAN WORKING PAPERS. Detection and Forecasting of Islamic Calendar Effects in. Time Series Data

# % # & ())! +,,, # ) (. / 0 1 ) / 2 3 ) )/)

Chapter 2: Describing Contingency Tables - I

Minitab Project Report - Assignment 6

Automatic seasonal auto regressive moving average models and unit root test detection

Application of Time Sequence Model Based on Excluded Seasonality in Daily Runoff Prediction

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Time Series Forecasting Model for Chinese Future Marketing Price of Copper and Aluminum

Safety effects of lower speed limits during winter months

Multiplicative Sarima Modelling Of Nigerian Monthly Crude Oil Domestic Production

LISA Short Course Series Generalized Linear Models (GLMs) & Categorical Data Analysis (CDA) in R. Liang (Sally) Shan Nov. 4, 2014

Prediction of Annual National Coconut Production - A Stochastic Approach. T.S.G. PEIRIS Coconut Research Institute, Lunuwila, Sri Lanka

Department of Computer Science, University of Pittsburgh. Brigham and Women's Hospital and Harvard Medical School

Data Analysis and Statistical Methods Statistics 651

Jun Tu. Department of Geography and Anthropology Kennesaw State University

Lecture 14: Introduction to Poisson Regression

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview

Making sense of Econometrics: Basics

TIME SERIES DATA PREDICTION OF NATURAL GAS CONSUMPTION USING ARIMA MODEL

Linear regression. Linear regression is a simple approach to supervised learning. It assumes that the dependence of Y on X 1,X 2,...X p is linear.

Comparing the Relationships Between Heat Stress Indices and Mortality in North Carolina

Data Analysis and Statistical Methods Statistics 651

Analysis. Components of a Time Series

A Hybrid ARIMA and Neural Network Model to Forecast Particulate. Matter Concentration in Changsha, China

Chapter 12: An introduction to Time Series Analysis. Chapter 12: An introduction to Time Series Analysis

Project Name: Implementation of Drought Early-Warning System over IRAN (DESIR)

Estimation and application of best ARIMA model for forecasting the uranium price.

ISSN Original Article Statistical Models for Forecasting Road Accident Injuries in Ghana.

DEVELOPMENT OF CRASH PREDICTION MODEL USING MULTIPLE REGRESSION ANALYSIS Harshit Gupta 1, Dr. Siddhartha Rokade 2 1

TRANSFER FUNCTION MODEL FOR GLOSS PREDICTION OF COATED ALUMINUM USING THE ARIMA PROCEDURE

Cohort Effect Structure in the Lee-Carter Residual Term. Naoki Sunamoto, FIAJ. Fukoku Mutual Life Insurance Company

Journal of Chemical and Pharmaceutical Research, 2014, 6(5): Research Article

INTERNATIONAL JOURNAL OF ENVIRONMENTAL SCIENCES Volume 6, No 6, Copyright by the authors - Licensee IPA- Under Creative Commons license 3.

AN ANALYSIS ON THE TRAFFIC ACCIDENTS TOURIST AT CASE STUDY: NANTOU COUNTY

Interrupted Time Series Analysis for Single Series and Comparative Designs: Using Administrative Data for Healthcare Impact Assessment

Time series count data models: an empirical application to traffic accidents

Hypothesis testing. Data to decisions

ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables

Analysis of Violent Crime in Los Angeles County

Modeling and forecasting global mean temperature time series

ENHANCING ROAD SAFETY MANAGEMENT WITH GIS MAPPING AND GEOSPATIAL DATABASE

The simple linear regression model discussed in Chapter 13 was written as

Tourist Arrivals in Sri Lanka: A Comparative Study of Holt- Winter s versus Box- Jenkin s Modeling Methods

Clinical Trials. Olli Saarela. September 18, Dalla Lana School of Public Health University of Toronto.

5.1 Introduction. 5.2 Data Collection

Lecture-1: Introduction to Econometrics

An Introduction to State Space Time Series Analysis. Jacques J. F. Commandeur Siem Jan Koopman

Transcription:

Journal of Asian Scientific Research journal homepage: http://aessweb.com/journal-detail.php?id=5003 PREDICTION OF FATAL ROAD TRAFFIC CRASHES IN IRAN USING THE BOX-JENKINS TIME SERIES MODEL Ayad Bahadori Monfared Department of Epidemiology, School of Health, Shahid Beheshti University of Medical Sciences, Tehran, Iran Hamid Soori Department of Epidemiology, School of Health,& Safety Promotion & Injury Prevention Research Centre, Shahid Beheshti University of Medical Sciences, Tehran, Iran Yadollah Mehrabi Department of Epidemiology, School of Health,Shahid Beheshti University of Medical Sciences, Tehran, Iran Hossein Hatami Department of Public Health, School of Health, Shahid Beheshti University of Medical Sciences, Tehran, Iran Ali Delpisheh Department of Clinical Epidemiology& Prevention of Psychosocial Injuries, Research Centre Ilam University of Medical Sciences, Ilam-Iran ABSTRACT Introduction: Frequency of traffic related mortalities is increasing worldwide. The present study aimed to predict deaths from road traffic accidents for the first time in Iran. Methods: All death statistics from traffic accidents in Iran between 2004 and 2011 were available for analysis. The Box-Jenkins time series model was used for trends purposes. Death from traffic accidents were predicted from 2011 to 2013 and then compared with the actual frequencies. Results: Overall, 21548 deaths (95% CI: 15426-27669) due to road traffic accidents were predicted for 2011 compared to 2010, with a negative growth of 7.32%. The corresponding frequency was 20404 deaths from road traffic accidents (95% CI: 9914-30893) for 2013 compared to 2011 with a negative growth of 5.31%. An accuracy rate of 93% was found for prediction of fatal traffic accidents compared to the formal reports by the government. Conclusions: The Box-Jenkins time series model is an acceptable method for prediction of road traffic accidents. Keywords: Accidents, Death, Iran, Traffic, Prediction 425

INTRODUCTION Frequency of traffic related mortalities is increasing worldwide. It is estimated that by continuing the current trends for road traffic accidents to 2020, proportion of deaths will be increased up to 83% in middle and low income countries and up to 27% in high-income communities (Peden et al., 2004; Mohan, 2008). No accurate and reliable death prediction for road traffic accidents has been done Iran so far. According to the Iranian Forth Development Plan, the government has been obliged to decrease the trends of road traffic accidents by 10% annually. To achieve this goal, it is necessary to predict the rate of fatal accidents and to suggest controlling models. Time series is a sequence of numerical data that usually occurs in uniform intervals over a period of time. The main application of time series analysis is forecasting (Azar and Momeni, 2001; Jandaghi, 2005). There are several techniques to predict behavior of the time series forecasting model which can be categorized to quantitative and qualitative methods. The Box-Jenkins predicting model known as ARIMA 1 is one of the qualitative models in which the seasonal and random changes are considered in addition to the trends analysis. This model is only used for "stationary" time series. Therefore, the "differential methods" should be turned into a "stationary series" in advance. Using the ARIMA method, the model should be selected first, and then the model parameters as well as test series prediction (phase III) should be carried out. METHODS In the present cohort study, all traffic accidents from 2004 to 2011 reported by the Center of Information and Communication Technology (ICI, equivalent to FAWA abbreviation in Persian language) affiliated to the Iranian traffic police were analyzed using SAS software. To determine deaths trends in the country, the time series method was used. Monthly trends rather than annual were used to improve the predictions. Box and Jenkins time series models were employed. Predictive multiple regression was used to determine the relationship between a dependent variable and a series of independent variables. When the outcomes was more than two levels, or determine which variables affect the probability of a particular outcome, logistic regression was selected. Following model fitting, the importance of each variable that entered the model was confirmed. 1 Auto Regressive Integrated Moving Average 426

In addition to regression, the residuals were also checked to access the model. The residual in the logistic regression, is the difference between the observed and predicted probabilities. Time series steps were: remove the non-stationary component, identify specific ARIMA models, estimating parameters, residue analysis and finally, the evaluation model predict. Finally, the predicted rate of death from traffic accidents in the first 10 months of 2011 was compared with the actual numbers. The proposal of the present study was confirmed by the Shahid Beheshti University of Medical Sciences, Tehran-Iran Ethical Committee. RESULTS The death trends of road traffic accidents either monthly or annually showed a seasonal and a nonstationary pattern. Although, the available data were collected on a monthly breakdown, it was observed that the pattern changes were repeated once every 12 times. Increased traffic injuries leading to death was observed in certain months of the year (summer holidays) when the frequency of trips and journeys were rapidly increased. There are signs of non-stationary in the time series and for being stationary, it must be done by seasonal and then non-seasonal differencing (Figure1). Table 1 shows the index values of AIC2& SBC3 for estimating ARIMA models with the first difference, by the first order difference and then seasonal difference (1, period 12) of the time series. The seasonal or non-seasonal differencing with more than first order was not necessary. The AIC can increase by differences of seasonal and non-seasonal rank2. The higher differences of the ranks also elevated the differences seriously. The lowest AIC (790) for the model was 4SARIMA (0, 1, 1) (0, 1, 1)12 which is suitable for fitting to the series (Table 1). The next step was to investigate auto correlated of the residuals after fitting model (Residual =Predicted value - True value). The statistical test did not reject the null hypothesis of uncorrelated residual in any of the time intervals (Table 2). The correlation between the residuals and figures showed that the null hypothesis of uncorrelated residuals cannot be rejected. After determining the best-fit model for the series and estimating related parameters, the third phase of Box-Jenkins fitting model was evaluated for series prediction. Using the SARIMA (0,1,1) (0,1,1)12 model, it was predicted that in 2011 compared to 2010, approximately 21548 deaths (95% CI : 15426-27669) due to road traffic accidents will occurs, with a negative rate of 7.32%. 2AKAIKE Information Criterion 3SCHWARTZ Bayesian Criterion 4Seasonal Auto Regressive Integrated Moving Average 427

Also in 2013 compared to 2011, about 20404 deaths (95% CI: 9914-30893) with negative rate of 31.5% will be occurred (Figure2). Meanwhile, ARIMA model for time series of deaths due to traffic accident for males and females was extracted. This model predicts that in 2011 compared to 2010, there will be approximately 17088 deaths (95% CI: 12281-21895) by traffic accidents for males with a negative rate of 7.06%. Also in 2013 compared to 2011, overall, 16448 deaths (95% CI: 8531-24364) with negative rate of 3.75% in males. The corresponding rates for females were 4540 deaths (95% CI: 3008-6073) with negative growth of 6.64% in 2011 and 4169 deaths (95% CI: 1676-6661) with negative growth of 8.18% in 2013. Majority of deaths in both years and both genders, were occurred in August and September (summer holidays). DISCUSSION The present study is representative in terms of quantity (sample size) and quality of data which approved by the Iranian government. Also this study is unique as no prediction has been estimated so far for about car crashes leading to death in Iran. A significant reduction was observed in fatal traffic accidents in Iran during 2011 and 2012 compared to the previous years in general and in female drivers in particular. However, the magnitude of these reductions was not identical. Mortality rate from road accidents was significantly higher in summers compared to the other seasons, probably due to increased trips and journeys during summer holidays. A specific increase in fatal traffic accidents was observed in which is the first month of New Year in Iranian calendar. In a British study using the same methodology as the present study, it was estimated that by continuing the current deaths trend in this way, the rate of fatal traffic accident will be reduced to 33% (Diguiseppi et al., 1997). A Chinese study in Shanghai using the Gary dynamic statistical model GM (1, 1), found that risk of fatal traffic accidents in recent years has increased from 7.78 to 14.18 per 100,000 people and will be increasing for the coming years (Yan-hong et al., 2006). Also a similar Chinese study that investigated road traffic injuries from 1951 to 2003 using Box- Jenkins techniques and the ARIMA model showed a good agreement between the predictions made for fatal traffic accidents and real frequencies in 2003 (Li Yan-Hong et al., 2006). The WHO has predicted that deaths related to traffic accidents in the whole world and most countries (except for high- income countries) will be increased by 2020 (Peden et al., 2004). Therefore, it seems the slight reduction in fatal traffic accidents in Iran during the recent years is consisted with the pattern of high- income countries. 428

Number death Journal of Asian Scientific Research, 2013, 3(4):425-430 According to the actual results and data obtained from the Iranian government in 2011 by the end of January, overall, 17,643 deaths due to traffic accidents have been recorded. This figure is consisted and comparable with prediction of 18,964 deaths in traffic accidents made for 2011 by the end of January using the model fitted to the time series, which represents an accuracy rate of 93%. As a conclusion, the Box-Jenkins time series model is an acceptable method for prediction of road traffic accidents. It helps police and policy makers, to modify regulations and lows and establishing preventive and protective approaches and strategies. ACKNOWLEDGMENT The cooperation of the Iranian Legal Medical Organization, the Ministry of Health and Traffic Police is gratefully appreciated. This article is extracted from the PhD thesis of Ayad Bahadori Monfared and financially supported by the Shahid Beheshti University of Medical Sciences. Figur-1. Time series of traffic accidents deaths by month and year 3500 Time Series Plot of death_month 3000 2500 2000 1500 1000 500 Month Year 0 2005 2006 2007 2008 2009 2010 Table-1. AIC & SBC values for estimating ARIMA models and also by the first order differencing and then seasonal differencing (1, period 12) of the time series Row Difference (1) Seasonal difference (1, period 12) Model AIC SBC AIC SBC 1 ARIMA (0,1) 1009 1014 799 803 2 ARIMA (0,2) 1009 1014 812 816 3 ARIMA (0,12) 984 988 803 808 4 ARIMA (1,12) 986 992 790 797 5 ARIMA (2,12) 985 992 805 811 429

Number forecast deaths Journal of Asian Scientific Research, 2013, 3(4):425-430 Table-2. Chi square statistic of the autocorrelation coefficients of the time series residuals of deaths due to traffic accidents Intervals X 2 Degrees of freedom -P (value) 6 1.42 4 0.8405 12 1.85 10 0.9974 18 6.76 16 0.9776 24 15.08 22 0.8589 Figure-2. Deaths prediction from traffic accidents (with approximate forecasts) Forecast Iran road accident deaths by 95% confidence limits 3000 2500 Variable Forecast_death lower_95%ci upper_95%ci 2000 1500 1000 500 0 Month Year Mar 2011 Jul Nov Mar 2012 Jul Nov Mar 2013 REFERENCES Azar, A. and M. Momeni, 2001. Application of statistic and management. Tehran: Samt. Diguiseppi, C., I. Roberts and L. Li, 1997. Influence of changing travel patterns on child death rates from injury: Trend analysis. BMJ, 314(710). Jandaghi, G., 2005. Statistic with applictions in management and ecoromics. Qom: Elm Gostar Karimeh. Li Yan-Hong, R. Yousif, W. Lu, G.-X. Song, Y. Yan, D.Z. De, S.-N. Zhang, S.-F. Zhou, S.-M. Chen and B.-J. Yang, 2006. Field data: A study on trend and prediction of fatal traffic injuries prevalence in shanghai. Traffic Injury Prevention, 7: 403-407. Mohan, D., et al, 2008. Practical training manual prevent traffic accidents. Safety promotion and injury prevention research center. Tehran: Naja Traffic Police Office of Applied Research Peden, M., R. Scurfield, D. Sleet, D. Mohan, A. Hyder and E. Jarawan, 2004. World report on road traffic injury prevention. Geneva: World Health Organization. Yan-hong, L., Y. Rahim, L. Wei, S. Gui-xiang, Y. Yu and D. Zhou, 2006. Field data: A study on trend and prediction of fatal traffic injuries prevalence in shanghai. Traffic Injury Prevention, 7:403-7. 430