MODELING MAXIMUM MONTHLY TEMPERATURE IN KATUNAYAKE REGION, SRI LANKA: A SARIMA APPROACH

Similar documents
TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA

Suan Sunandha Rajabhat University

Empirical Approach to Modelling and Forecasting Inflation in Ghana

Application of ARIMA Models in Forecasting Monthly Total Rainfall of Rangamati, Bangladesh

Development of a Monthly Rainfall Prediction Model Using Arima Techniques in Krishnanagar Sub-Division, Nadia District, West Bengal

Time Series Analysis of United States of America Crude Oil and Petroleum Products Importations from Saudi Arabia

Asitha Kodippili. Deepthika Senaratne. Department of Mathematics and Computer Science,Fayetteville State University, USA.

ECONOMETRIA II. CURSO 2009/2010 LAB # 3

Time Series Analysis of Currency in Circulation in Nigeria

Minitab Project Report - Assignment 6

MODELING INFLATION RATES IN NIGERIA: BOX-JENKINS APPROACH. I. U. Moffat and A. E. David Department of Mathematics & Statistics, University of Uyo, Uyo

Comparing the Univariate Modeling Techniques, Box-Jenkins and Artificial Neural Network (ANN) for Measuring of Climate Index

Modeling Monthly Average Temperature of Dhahran City of Saudi-Arabia Using Arima Models

Forecasting the Prices of Indian Natural Rubber using ARIMA Model

Estimation and application of best ARIMA model for forecasting the uranium price.

Modeling climate variables using time series analysis in arid and semi arid regions

Lab: Box-Jenkins Methodology - US Wholesale Price Indicator

Modeling and Forecasting Currency in Circulation in Sri Lanka

Trend and Variability Analysis and Forecasting of Wind-Speed in Bangladesh

Technical note on seasonal adjustment for Capital goods imports

Short-run electricity demand forecasts in Maharashtra

Autoregressive Integrated Moving Average Model to Predict Graduate Unemployment in Indonesia

Sugarcane Productivity in Bihar- A Forecast through ARIMA Model

Technical note on seasonal adjustment for M0

STOCHASTIC MODELING OF MONTHLY RAINFALL AT KOTA REGION

Romanian Economic and Business Review Vol. 3, No. 3 THE EVOLUTION OF SNP PETROM STOCK LIST - STUDY THROUGH AUTOREGRESSIVE MODELS

Seasonal time-series modeling and forecasting of monthly mean temperature for decision making in the Kurdistan Region of Iraq

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

Univariate ARIMA Models

FORECASTING SUGARCANE PRODUCTION IN INDIA WITH ARIMA MODEL

Time Series Analysis Model for Rainfall Data in Jordan: Case Study for Using Time Series Analysis

Short-Term Load Forecasting Using ARIMA Model For Karnataka State Electrical Load

THE ROLE OF OCEAN STATE INDICES IN SEASONAL AND INTER-ANNUAL CLIMATE VARIABILITY OF THAILAND

Forecasting Foreign Direct Investment Inflows into India Using ARIMA Model

Scenario 5: Internet Usage Solution. θ j

Tourist Arrivals in Sri Lanka: A Comparative Study of Holt- Winter s versus Box- Jenkin s Modeling Methods

A Model for Daily Exchange Rates of the Naira and the XOF by Seasonal ARIMA Methods

Modelling Monthly Precipitation in Faridpur Region of Bangladesh Using ARIMA

Austrian Inflation Rate

Forecasting Temperature in the Coastal Area of Bay of Bengal-An Application of Box-Jenkins Seasonal ARIMA Model

Rainfall Drought Simulating Using Stochastic SARIMA Models for Gadaref Region, Sudan

Chapter 12: An introduction to Time Series Analysis. Chapter 12: An introduction to Time Series Analysis

A Univariate Time Series Autoregressive Integrated Moving Average Model for the Exchange Rate Between Nigerian Naira and US Dollar

Lesson 13: Box-Jenkins Modeling Strategy for building ARMA models

Analysis. Components of a Time Series

Prediction of Annual National Coconut Production - A Stochastic Approach. T.S.G. PEIRIS Coconut Research Institute, Lunuwila, Sri Lanka

A STUDY OF ARIMA AND GARCH MODELS TO FORECAST CRUDE PALM OIL (CPO) EXPORT IN INDONESIA

Firstly, the dataset is cleaned and the years and months are separated to provide better distinction (sample below).

Forecasting Bangladesh's Inflation through Econometric Models

Frequency Forecasting using Time Series ARIMA model

at least 50 and preferably 100 observations should be available to build a proper model

FORECASTING THE INVENTORY LEVEL OF MAGNETIC CARDS IN TOLLING SYSTEM

FORECASTING COARSE RICE PRICES IN BANGLADESH

Time Series Analysis of Monthly Rainfall data for the Gadaref rainfall station, Sudan, by Sarima Methods

Lecture 19 Box-Jenkins Seasonal Models

Univariate, Nonstationary Processes

TIME SERIES ANALYSIS AND FORECASTING OF MONTHLY AIR TEMPERATURE CHANGES IN NAIROBI KENYA

Oil price volatility in the Philippines using generalized autoregressive conditional heteroscedasticity

Global Climates. Name Date

Figure 1. Time Series Plot of arrivals from Western Europe

Classical Decomposition Model Revisited: I

Chapter 3. Regression-Based Models for Developing Commercial Demand Characteristics Investigation

Advanced Econometrics

ARIMA model to forecast international tourist visit in Bumthang, Bhutan

GAMINGRE 8/1/ of 7

Empirical Market Microstructure Analysis (EMMA)

WHEN IS IT EVER GOING TO RAIN? Table of Average Annual Rainfall and Rainfall For Selected Arizona Cities

A SEASONAL TIME SERIES MODEL FOR NIGERIAN MONTHLY AIR TRAFFIC DATA

Box-Jenkins ARIMA Advanced Time Series

Variability and trends in daily minimum and maximum temperatures and in diurnal temperature range in Lithuania, Latvia and Estonia

Forecasting of meteorological drought using ARIMA model

Time Series Forecasting: A Tool for Out - Sample Model Selection and Evaluation

Forecasting Area, Production and Yield of Cotton in India using ARIMA Model

Seasonal Autoregressive Integrated Moving Average Model for Precipitation Time Series

Forecasting using R. Rob J Hyndman. 2.4 Non-seasonal ARIMA models. Forecasting using R 1

INTERNATIONAL JOURNAL OF ENVIRONMENTAL SCIENCES Volume 6, No 6, Copyright by the authors - Licensee IPA- Under Creative Commons license 3.

FORECASTING OF COTTON PRODUCTION IN INDIA USING ARIMA MODEL

Circle a single answer for each multiple choice question. Your choice should be made clearly.

Seasonal Climate Outlook for South Asia (June to September) Issued in May 2014

A New Dengue Outbreak Statistical Model using the Time Series Analysis

ARIMA modeling to forecast area and production of rice in West Bengal

Daily Rainfall Disaggregation Using HYETOS Model for Peninsular Malaysia

2016 Meteorology Summary

Monthly Long Range Weather Commentary Issued: APRIL 18, 2017 Steven A. Root, CCM, Chief Analytics Officer, Sr. VP,

Dynamic Time Series Regression: A Panacea for Spurious Correlations

Forecasting the Price of Field Latex in the Area of Southeast Coast of Thailand Using the ARIMA Model

Atmospheric circulation analysis for seasonal forecasting

Forecasting of Soybean Yield in India through ARIMA Model

Review Session: Econometrics - CLEFIN (20192)

Average temperature ( F) World Climate Zones. very cold all year with permanent ice and snow. very cold winters, cold summers, and little rain or snow

THE CLIMATE OVER SRI LANKA YALA SEASON 2017

Stochastic Analysis and Forecasts of the Patterns of Speed, Acceleration, and Levels of Material Stock Accumulation in Society

Multiple Regression Analysis

Detection and Forecasting of Islamic Calendar Effects in Time Series Data

INTRODUCTION TO TIME SERIES ANALYSIS. The Simple Moving Average Model

Forecasting Precipitation Using SARIMA Model: A Case Study of. Mt. Kenya Region

A Comparison of the Forecast Performance of. Double Seasonal ARIMA and Double Seasonal. ARFIMA Models of Electricity Load Demand

Forecasting of the Austrian Inflation Rate

Climatography of the United States No

PRELIMINARY ASSESSMENT OF SURFACE WATER RESOURCES - A STUDY FROM DEDURU OYA BASIN OF SRI LANKA

Transcription:

MODELING MAXIMUM MONTHLY TEMPERATURE IN KATUNAYAKE REGION, SRI LANKA: A SARIMA APPROACH M.C.Alibuhtto 1 &P.A.H.R.Ariyarathna 2 1 Department of Mathematical Sciences, Faculty of Applied Sciences, South Eastern University of Sri Lanka 2 Faculty of Applied Sciences, South Eastern University of Sri Lanka mcabuhtto@seu.ac.lk, himashrashmika@gmail.com ABSTRACT: Time series analysis plays a major role in predicting and analyzing climatological data. Temperature is one of the most vital elements of the climate system and modeling of the temperature helps the interested party those who are depending on it directly or indirectly to prepare in advance. The aim of this study is to develop a time series model which can help in improving the predictions of monthly temperature of Katunayake. This paper describes the Box-Jenkins time series seasonal ARIMA model for prediction of monthly maximum temperature in Katunayake region, Sri Lanka. In this study, 181 monthly temperature data were considered during the period 2001-2016. For the model selection, 169 observations were used while the rest of 12 observations were used to validate the model. The results indicate that the SARIMA(3,0,2)(2,0,2) 12 model was the best model to predict monthly maximum temperature in Katunayake region. Keywords: Box-Jenkins, Climatology, Katunayake, SARIMA, Temperature 1. INTRODUCTION The weather of Sri Lanka can be considered as a tropical. Sri Lanka can be divided into two climatological zones such as dry zone and wet zone. The Northern, North Central and Eastern regions of the Sri Lanka are in Meteorological he dry zone and the South Western region is in the wet zone. The warmest average maximum temperature is 31 o C in February, March, April and May. The coolest average minimum temperature is 22 o C in uary and February. For the study, maximum temperature data were selected from Katunayake area, where it is located in Gampaha district, Western province, Sri Lanka. The Katunayake area is bounded to the Negombo lagoon from one side. The Meteorological Department in Sri Lanka is the authorized agency to measure, collect and forecast all climatological factors. The head department is located in BauddhalokaMawatha, Colombo 7. There are many branches located all over the country which are taking the climatological observations such as rainfall, temperature and wind speed. The main objectives of thisresearch were to explore the behaviour of maximum temperature in Katunayake area, develop a time series model for maximum temperature in Katunayake area and validate the developed model for an independent set of data. 396

2. MATERIALS AND METHODS 2.1 Data Collection The data were collected from the Metrological Department, Colombo. The selected data set was a complete record of monthly maximum temperature from 2001 to 2016 (181 observations), which was compiled for Katunayake area. The temperature data were measured in Celsius ( o C). The Box-Jenkins modelling was done with 169 data set while keeping the rest 12 observations for the validation of the model. 2.2 ACF and PACF Autocorrelation function and partial autocorrelation function are a type of graphs which containing correlations of different time lags. ACF and PACF can be used to identify the behaviour of the series whether stationary or not and to identify the number of components in ARMA model. The exponentially decaying spikes in ACF and PACF indicate the stationary series. The number of significant spikes in ACF indicates the number of MA terms in the model and the number of significant spikes in PACF indicates the number of AR terms in the model. (1) 2.3 Autoregressive Moving Average Model If current x t value can be expressed as a linear combination of both past p number of x t values and past q number of e t values, then it is an auto regressive moving average time series model of p and q. The autoregressive moving average model can be identified from the number of significant spikes of ACF and PACF graph. The ARMA(p,q) model can be written as; (2) 2.4 Unit Root Test The Augmented Dickey Fuller unit root test used to find the stationarity of a series. It uses hypothesis as, 2.5Autoregressive Integrated Moving Average Model If the autoregressive moving average model uses a 1 st or 2 nd order differenced series due to non-stationarity, then that model is an ARIMA model. The number of differences denoted by d. The ARIMA(p,d,q) model can be written as; (3) 397

2.6Seasonal Autoregressive Integrated Moving Average Model If the data series has differentiated into two parts such as seasonally adjusted component and seasonal component due to the existence of seasonality, then the ARIMA model used in both components can be identified as SARIMA model. The ARIMA analysis of seasonally adjusted component is non-seasonal ARIMA and the ARIMA analysis of the seasonal component is seasonal ARIMA component. The SARIMA model denoted by SARIMA(p,d,q)(P,D,Q)s, where first bracket and second bracket denote the seasonally adjusted and seasonal factor series respectively. The SARIMA model can be written as; (4) Where 2.7 Model Selection The Box-Jenkins methodology uses the Akaike Information Criterion (AIC) and Schwarz Information Criterion (SIC) to select the best model among tentative models. 3. Results and Discussion 3.1 Behaviours of temperature data The average of maximum temperature for the Katunayakeregion from 2001 uary to 2016 uary was 31.28 o C. The minimum and maximum temperature in this time period were29. 32 o C and 33. o C respectively. To identify the behaviour of the data set, the line chart (Figure 1) and the seasonal graph (Figure 2) of the data were obtained. The identifiable seasonality of the data set can be seen from the seasonal graph. 34 Temperature 33 32 31 29 01 02 03 04 05 06 07 08 09 10 11 12 13 14 15 Figure 1: Time series plot of the temperature data 398

Temperature by Season 34 33 32 31 29 Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec Means by Season Figure 2: Seasonal plot of the temperature data The Seasonal ARIMA modelling was done with 169 data which was 2001 uary to 2015 uary while therest of 12 observations were kept for measuring the accuracy of the forecasted model. According to the figure 2, it can be seen a seasonality of the data exist. Table 1. Results of Kruskal-Wallis Test. Kruskal-Wallis statistics Degrees of freedom Probability level 1.0749 11 Table 1 shows that the existence of a significant seasonality (P=0.000). Therefore the seasonality of the data has been extracted by using Census X-12 seasonal adjustment method. The line chart of the seasonally adjusted series (Figure 3) and seasonal factor series (Figure 4) was obtained to identify the behaviour of the both series. 33.0 Seasonally Adjusted Series 32.5 32.0 31.5 31.0.5.0 29.5 01 02 03 04 05 06 07 08 09 10 11 12 13 14 Figure 3: Seasonally adjusted series 399

Seasonal Factor Series 1.5 1.0 0.5 0.0-0.5-1.0 01 02 03 04 05 06 07 08 09 10 11 12 13 14 Figure 4: Seasonal factor series 3.2 Model identification and parameter estimates The stationarity of the both series has checked by using the ADF test. The unit root test for the seasonally adjusted series (P=) and seasonal factor series (P=) has confirmed the both series are stationary in 95% confidence level.the ACF and PACF were obtained to identify the number of AR and MA terms in both series. The model selection criterion was used to identify the best model. Table 2 and Table 3 used to identify the number of components in the seasonally adjusted series and seasonal factor series respectively. AIC, SIC and Durbin Watson (DW) values used for the model selection. The best model can be considered as SARIMA (3,0,2)(2,0,2) 12 because this model has the lowest AIC and SIC values. Table 2. Parameter estimates of seasonally adjusted series. Parameters Significance Overall sig. AIC SIC DW MA(3) 0.2477 0.8428 0.5056 0.9852 0.0036 0.8894 0.2498 0.5657 0.9785 1.1582 1.2693 1.9626 1.1358 1.2469 1.9528 1.1657 1.2953 1.9652 400

MA(3) 0.9964 AR(3) AR(3) MA(3) 0.0001 0.9539 0.9769 0.1577 0.6229 0.2732 0.4446 0.5513 0.5216 1.0993 1.2290 1.9760 1.1550 1.32 1.9433 Table 3. Parameter estimates of seasonal factor series. Parameters Significance Overall sig. AIC SIC DW MA(3) MA(3) AR(3) 0.9985 0.9989 0.9994 0.2537 0.5676 0.2875 0.8199 0.5274 0.3645 0.84459 0.9557 1.8804 0.0034 0.1145 1.7048-0.0654 0.0642 2.1587 0.4224 0.5335 2.0529 401

Residual 0.9983 Table 4. Parameter estimates of SARIMA(3,0,2)(2,0,2) 12 model. Variable Coefficient Standard error T-statistics Probability C 31.2600 0.0703 44.5511 2.1367 0.0016 11.180-1.6982 0.0005-3109.156 AR(3) 0.4027 0.0007 560.8814 S 1.0022 0.0196 51.0985 S -0.9994 0.0052-189.4518-1.0240 1.0192-1.0046 0.3166 0.9995 1.9386 0.5155 0.6069 S -1.7204 1.6347-1.0524 0.2942 S 0.9995 1.8974 0.5267 0.5991 Table 4 shows the estimated coefficients of Fitted SARIMA model. 3.3 Residual analysis The residual plot (Figure 5) was obtained from the developed model. The normal probability plot (Figure 6) of residuals and histogram(figure 7) of residuals also were obtained. According to the Figure 6 and Figure 7 residuals follow a normal distribution at 95% confidence level (P=0.523). The model suitability can be observed from the plot of observed and fitted data graph (Figure 8). Versus Order (response is x) 1.0 0.5 0.0-0.5-1.0-1.5 1 20 40 60 80 100 Observation Order 120 140 160 Figure 5: Residual plot 402

Data Frequency Percent Probability Plot of RESI1 Normal 99.9 99 95 90 80 70 60 50 40 20 10 5 Mean -0.006785 StDev 0.4556 N 169 AD 0.323 P-Value 0.523 1 0.1-1.5-1.0-0.5 0.0 RESI1 0.5 1.0 1.5 Figure 6: Normal probability plot of residuals 50 Histogram (response is x) 40 20 10 0-1.5-1.0-0.5 0.0 Residual 0.5 1.0 Figure 7: Histogram of residuals Time Series Plot of Observed, Fitted 33 Variable Observed Fitted 32 31 29 Month Year 2001 2003 2005 2007 2009 2011 2013 2015 Figure 8: Plot of observed & fitted data The forecasted values were obtained from the periods of 2015 February to 2016 uary using the fitted model and check the prediction accuracy of the fitted model. The results are given in table 5. 403

Data Table 5: The observed and forecasted maximum temperature. Observed temperature Predicted temperature Prediction error 32.6286 32.1597 0.4689 32.0290 32.4736-0.4446 31.7633 32.1770-0.4137 31.8129 31.8244-0.0115 31.2433.9343 0.9 31.3419.7551 0.5868 31.6355.9338 0.7017 31.0033.8277 0.1756.7194.9144-0.195.6100.9490-0.339 31.3258.8012 0.5246 33.2323 31.7577 1.4746 The error of the prediction was plotted to observe the behaviour (Figure 9). The residuals were distributed normally (P=0.540) which can be observed from the normal probability plot of residuals (Figure 10). Time Series Plot of Observed, Forecasted, Forecast Error 35 25 Variable Observed Forecasted Forecast Error 20 15 Month Year 10 5 0 Feb 2015 Mar Apr May Jun Jul Aug Sep Oct Nov Dec 2016 Figure 9: Prediction error plot 404

Percent Probability Plot of Forecated Error Normal 99 95 90 80 70 60 50 40 20 Mean 0.2364 StDev 0.5627 N 12 AD 0.294 P-Value 0.540 10 5 1-1.0-0.5 0.0 0.5 Forecated Error 1.0 1.5 Figure 10: Histogram of prediction error Table 6: The results of t-test for mean differences Test of mean difference = 0 (vs. not = 0) t-value = 1.46 p-value = 0.173 According to the table 6, there is no significant difference between predicted temperature and observed temperatureat 95% confidence level (P=0.173). Thus, the developed model is adequate to predict temperature. 4. CONCLUSION The aim of this paper is to develop a time series model using recorded monthly temperature data from 2001-2016, of the Katunayake region.this paper has considered the seasonal ARIMAmodelling of Katunayake monthly maximum temperature.numerous seasonal models were developed using Box-Jenkins methodology and thesarima (3,0,2)(2,0,2) 12 model was selected as the best model by using model selection criteria. Furthermore, the fitted model is confirmed and validated by the residual tests. The fitted SARIMA model is very useful to forecast future values of temperature in the Katunayakeregion. REFERENCES Kaushik, I. and Singh, S.M., (2008). Seasonal ARIMA model for forecasting of monthly rainfall and temperature. Journal of Environmental Research and Development Vol, 3(2). Khajavi, E., Behzadi, J., Nezami, M.T., Ghodrati, A. and Dadashi, M.A., (2012). Modeling ARIMA of air temperature of the southern Caspian Sea coasts. Intl Res J Appl Basic Sci, 3(6), pp.1279-1287. 405

Nury, A.H., Koch, M., Alam, M.J.B. (2013). Time Series Analysis and Forecasting of Temperatures in the Sylhet Division of Bangladesh. In 4 th international conference on environmental aspects of Bangladesh (ICEAB), pp. 24-26. Sarraf, A., Vahdat, S.F. and Behbahaninia, A., (2011). Relative Humidity and Mean Monthly Temperature Forecasts in Ahwaz Station with ARIMA Model in Time Series Analysis. In Proceedings of International Conference on Environment and In-dustrial Innovation (Vol. 6). Shamsnia, S.A., Shahidi, N., Liaghat, A., Sarraf, A. and Vahdat, S.F., (2011). Modeling of weather parameters using stochastic methods (ARIMA model)(case study: Abadeh Region, Iran). In International Conference on Environment and Industrial Innovation. IPCBEE (Vol. 12). Stein, M. and Lloret, J., (2001). Forecasting of air and water temperatures for fishery purposes with selected examples from Northwest Atlantic. Journal of Northwest Atlantic Fishery Science, 29, pp.23-. 406