Available online at ScienceDirect. Procedia Computer Science 72 (2015 )

Similar documents
FORECASTING THE INVENTORY LEVEL OF MAGNETIC CARDS IN TOLLING SYSTEM

TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA

Design of Time Series Model for Road Accident Fatal Death in Tamilnadu

Suan Sunandha Rajabhat University

A Comparison of the Forecast Performance of. Double Seasonal ARIMA and Double Seasonal. ARFIMA Models of Electricity Load Demand

Time Series Analysis of United States of America Crude Oil and Petroleum Products Importations from Saudi Arabia

Forecasting using R. Rob J Hyndman. 2.4 Non-seasonal ARIMA models. Forecasting using R 1

Univariate ARIMA Models

Univariate and Multivariate Time Series Models to Forecast Train Passengers in Indonesia

Estimation and application of best ARIMA model for forecasting the uranium price.

FORECASTING SUGARCANE PRODUCTION IN INDIA WITH ARIMA MODEL

Using Analysis of Time Series to Forecast numbers of The Patients with Malignant Tumors in Anbar Provinc

Analysis. Components of a Time Series

Comparing the Univariate Modeling Techniques, Box-Jenkins and Artificial Neural Network (ANN) for Measuring of Climate Index

Time Series and Forecasting

MODELING INFLATION RATES IN NIGERIA: BOX-JENKINS APPROACH. I. U. Moffat and A. E. David Department of Mathematics & Statistics, University of Uyo, Uyo

Development of Demand Forecasting Models for Improved Customer Service in Nigeria Soft Drink Industry_ Case of Coca-Cola Company Enugu

arxiv: v1 [stat.me] 5 Nov 2008

ISSN Original Article Statistical Models for Forecasting Road Accident Injuries in Ghana.

Forecasting the Price of Field Latex in the Area of Southeast Coast of Thailand Using the ARIMA Model

Time Series and Forecasting

Dynamic Time Series Regression: A Panacea for Spurious Correlations

Technical note on seasonal adjustment for Capital goods imports

Autoregressive Integrated Moving Average Model to Predict Graduate Unemployment in Indonesia

ARIMA model to forecast international tourist visit in Bumthang, Bhutan

ANALYZING THE IMPACT OF HISTORICAL DATA LENGTH IN NON SEASONAL ARIMA MODELS FORECASTING

Seasonal Autoregressive Integrated Moving Average Model for Precipitation Time Series

Forecasting Network Activities Using ARIMA Method

Lecture 19 Box-Jenkins Seasonal Models

5 Autoregressive-Moving-Average Modeling

Forecasting Bangladesh's Inflation through Econometric Models

Forecasting Area, Production and Yield of Cotton in India using ARIMA Model

Frequency Forecasting using Time Series ARIMA model

A Procedure for Identification of Appropriate State Space and ARIMA Models Based on Time-Series Cross-Validation

Empirical Approach to Modelling and Forecasting Inflation in Ghana

Modelling Multi Input Transfer Function for Rainfall Forecasting in Batu City

A STUDY OF ARIMA AND GARCH MODELS TO FORECAST CRUDE PALM OIL (CPO) EXPORT IN INDONESIA

Firstly, the dataset is cleaned and the years and months are separated to provide better distinction (sample below).

Transformations for variance stabilization

Minitab Project Report - Assignment 6

Lab: Box-Jenkins Methodology - US Wholesale Price Indicator

{ } Stochastic processes. Models for time series. Specification of a process. Specification of a process. , X t3. ,...X tn }

FORECASTING YIELD PER HECTARE OF RICE IN ANDHRA PRADESH

BUSI 460 Suggested Answers to Selected Review and Discussion Questions Lesson 7

Technical note on seasonal adjustment for M0

Modeling and Forecasting Currency in Circulation in Sri Lanka

MODELING MAXIMUM MONTHLY TEMPERATURE IN KATUNAYAKE REGION, SRI LANKA: A SARIMA APPROACH

SOME COMMENTS ON THE THEOREM PROVIDING STATIONARITY CONDITION FOR GSTAR MODELS IN THE PAPER BY BOROVKOVA et al.

Univariate linear models

Forecasting the Prices of Indian Natural Rubber using ARIMA Model

Time Series I Time Domain Methods

Detection and Forecasting of Islamic Calendar Effects in Time Series Data

Box-Jenkins ARIMA Advanced Time Series

Short-run electricity demand forecasts in Maharashtra

Forecasting. Simon Shaw 2005/06 Semester II

at least 50 and preferably 100 observations should be available to build a proper model

ARIMA modeling to forecast area and production of rice in West Bengal

Short-Term Load Forecasting Using ARIMA Model For Karnataka State Electrical Load

Automatic seasonal auto regressive moving average models and unit root test detection

Application of Time Sequence Model Based on Excluded Seasonality in Daily Runoff Prediction

INTRODUCTION TO TIME SERIES ANALYSIS. The Simple Moving Average Model

STAT 436 / Lecture 16: Key

Automatic Forecasting

Forecasting Egyptian GDP Using ARIMA Models

Chapter 3. Regression-Based Models for Developing Commercial Demand Characteristics Investigation

Time Series Analysis

Comparison between VAR, GSTAR, FFNN-VAR and FFNN-GSTAR Models for Forecasting Oil Production

STATE BANK OF PAKISTAN WORKING PAPERS. Detection and Forecasting of Islamic Calendar Effects in. Time Series Data

Intervention Model of Sinabung Eruption to Occupancy of the Hotel in Brastagi, Indonesia

FORECASTING. Methods and Applications. Third Edition. Spyros Makridakis. European Institute of Business Administration (INSEAD) Steven C Wheelwright

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

S-GSTAR-SUR Model for Seasonal Spatio Temporal Data Forecasting ABSTRACT

A stochastic modeling for paddy production in Tamilnadu

SCIENCE & TECHNOLOGY

Paper SA-08. Are Sales Figures in Line With Expectations? Using PROC ARIMA in SAS to Forecast Company Revenue

ECONOMETRIA II. CURSO 2009/2010 LAB # 3

Study on Modeling and Forecasting of the GDP of Manufacturing Industries in Bangladesh

FORECASTING COARSE RICE PRICES IN BANGLADESH

Journal of Chemical and Pharmaceutical Research, 2014, 6(5): Research Article

Available online at ScienceDirect. Procedia Engineering 119 (2015 ) 13 18

On Monitoring Shift in the Mean Processes with. Vector Autoregressive Residual Control Charts of. Individual Observation

REVIEW OF SHORT-TERM TRAFFIC FLOW PREDICTION TECHNIQUES

Forecasting using R. Rob J Hyndman. 3.2 Dynamic regression. Forecasting using R 1

Unit root problem, solution of difference equations Simple deterministic model, question of unit root

Time Series Analysis of Currency in Circulation in Nigeria

Asitha Kodippili. Deepthika Senaratne. Department of Mathematics and Computer Science,Fayetteville State University, USA.

Application of Quantitative Forecasting Models in a Manufacturing Industry

Volume 11 Issue 6 Version 1.0 November 2011 Type: Double Blind Peer Reviewed International Research Journal Publisher: Global Journals Inc.

Author: Yesuf M. Awel 1c. Affiliation: 1 PhD, Economist-Consultant; P.O Box , Addis Ababa, Ethiopia. c.

FORECASTING OF COTTON PRODUCTION IN INDIA USING ARIMA MODEL

Time Series Analysis -- An Introduction -- AMS 586

The log transformation produces a time series whose variance can be treated as constant over time.

S 9 Forecasting Consumer Price Index of Education, Recreation, and Sport, using Feedforward Neural Network Model

Improved the Forecasting of ANN-ARIMA Model Performance: A Case Study of Water Quality at the Offshore Kuala Terengganu, Terengganu, Malaysia

TIME SERIES DATA PREDICTION OF NATURAL GAS CONSUMPTION USING ARIMA MODEL

Chapter 12: An introduction to Time Series Analysis. Chapter 12: An introduction to Time Series Analysis

Ross Bettinger, Analytical Consultant, Seattle, WA

Solar irradiance forecasting for Chulalongkorn University location using time series models

Chapter 3, Part V: More on Model Identification; Examples

Available online at ScienceDirect. Procedia Computer Science 55 (2015 )

Transcription:

Available online at www.sciencedirect.com ScienceDirect Procedia Computer Science 72 (2015 ) 630 637 The Third Information Systems International Conference Performance Comparisons Between Arima and Arimax Method in Moslem Kids Clothes Demand Forecasting: Case Study Wiwik Anggraeni, Retno Aulia Vinarti, Yuni Dwi Kurniawati Institut Teknologi Sepuluh Nopember, Faculty of Information Technology, Department of Information System, Jl. Arief Rahman Hakim, Surabaya 60111, Indonesia Abstract The Moslem kids clothes demand in Habibah Busana are likely increase at certain times, especially near the Eidholidays. that total demands in each year is varies and always increase when near the Eid holidays. The demand always increases when near Eid-holidays but decrease in next month. It s repeat every year but unfortunately it occur in different time every year. it means that Eid holidays occur in different time every year. It make the company in uncertainty condition. This research compare Arima method which make forecast in univariate data and Arimax as multivariate method which include independent variables such as different time of Eid holidays every year. The result show that Arimax method is better than Arima method in accuracy level of training, testing, and next time forecasting processes. There are minimum fourteen variables have to include in Arimax model in order to make accuracy level is not decrease. 2015 The Published Authors. Published by Elsevier by Elsevier Ltd. Selection B.V. This is and/or open access peer-review article under under the responsibility CC BY-NC-ND license of the scientific (http://creativecommons.org/licenses/by-nc-nd/4.0/). committee Peer-review under of The responsibility Third Information of organizing Systems committee International of Information Systems Conference International (ISICO Conference 2015) (ISICO2015) Keywords: Arima, Arimax, Eid Holidays, Demand, Forecasting, Multivariate 1. Introduction The Moslem kids clothes demand in Habibah Busana are likely increase at certain times, especially near the Eid-holidays. This makes Habibah Busana requires appropriate planning about the number of demand products that will occur in the next years based on demand data in the previous years. This planning is needed by Habibah Busana in order to prepare raw materials and workers if there is high demand in certain time and also to determine the highest sales forecast in the next year. So, the company will be able anticipate the next year sales. Based on product demand data in Habibah Busana know that total demands in each year is varies and always increase when near the Eid holidays. The demand always increases when near Eid-holidays but decrease in next month. It s repeat every year but unfortunately it occur in different time every year. It follow the Hijri calendar system which is used in Indonesia. Hijri calendar system have Eid holidays 1877-0509 2015 The Authors. Published by Elsevier B.V. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). Peer-review under responsibility of organizing committee of Information Systems International Conference (ISICO2015) doi:10.1016/j.procs.2015.12.172

Wiwik Anggraeni et al. / Procedia Computer Science 72 ( 2015 ) 630 637 631 which always occurred in different time, it means that Eid holidays occur in different time every year. It make the company in uncertainty condition. They must prepare some raw materials more and additional workforce to meet consumer needs when near Eid holidays but this time always different, so their planning don t meet the time. This research predict The moslem kids clothes demand with appropriate methods in order to make planning which match with the real condition. It compare two methods, these are ARIMA and ARIMAX. Lee and Liu had applied Arimax method based on calendar variation model for forecasting sales data with Ramadhan effect, but these research include independent variable before Eid holidays only. Actually, based on history data analysis, independent variable which denoted what happen after Eid holidays influence this demand. This research make comparison between Arima and Arimax methods to proof that there are many independent variables influence this demand. This research had indicated what independent variables have no more influence. These can make production planning in Habibah Busana more effective and efficient [3]. This models were implemented using Java programming language. Java use to implement models and JMSL Numerical java library to process the calculation of data in Arima and Arimax method. Combination of LinearRegression class and autoarima class are used to obtain the most suitable Arimax models to estimate the demand of moslem kids clothes. 2. ARIMA and ARIMAX Frank (1993) devide Arima models were into 4 groups: autoregressive model (AR), moving average (MA), autoregressive moving average (ARMA), and autoregressive integrated moving average (ARIMA). 1. Autoregressive (AR) Autoregressive models predict the future value based on a linear combination of the previous value. An autoregressive process with the order -p or AR (p) can be expressed as follows: (1) Z t is predicted value at t time, Z t-p adalah history data at (t-p) time and is estimated parameter. 2. Moving Average (MA) Moving Average models provide forecasts based on previous forecasting errors. Process Moving Average with orders q or MA (q) can be expressed as follows: where Z t is the predicted value at t time, Z t-q is the approximate error at (t-q) time, and β is an estimated parameter. 3. Autoregressive Moving Average (ARMA) These two components together form autoregressive moving average (ARMA) models. ARMA model assumes data set is stationer. Moving Average Autoregresive process for AR (1) and MA (1) can dianyatakan as follows: (2) (3)

632 Wiwik Anggraeni et al. / Procedia Computer Science 72 ( 2015 ) 630 637 Z t is the predicted value at t time, is the approximate error at (t-1) time, and an estimated parameter. If non stationarity factor added to ARMA process, then the model is become ARIMA (p,d,q). 4. Autoregressive Integrated Moving Average (ARIMA) Generally, Data time series modeling consist of several types, namely Autoregressive (AR), Moving Average (MA), a combination of both, or with special cases. Arima model denoted by Arima (p, d, q) is a a combination model of AR and MA who have undergone a differencing process. Commonly, Arima (p, d, q) with t times differencing denoted as follows: Wei (2006) and Liu (1986) mentions that a lot of business and economic data is monthly data time series and yearly likely be the subject of two types effects of calendar variations. The first calendar effect is the change of economic activity and businesses that depend on the number of active days in a week. The calendar effect is called trading day effects. The second effect is day off effect due to religious holidays and traditional festivals which called holidays effects. Time series modeling which done by adding some variables are considered have a significant impact on the data. It is done to increase the accuracy of forecasting. Arimax models is a modification of seasonal Arima. This modification is done by add predictor variables. Variations calendar effects is one of the predictor variables that are often used in the modeling. Generally, Arimax model with variations calender effects is denoted (4) (5) 3. Forecasting Model This research devide data into two groups, they are training data and testing data. The training data is a set of data that will be used to perform analysis and determine the model. Training data is data about product demand between 1 st period and (n-12) th period where n is total period of data. The testing data is a set of data that will be used to test the accuracy of the forecast results. Testing data is about product demand between (n-11) th period and n th period where n is the total data. 3.1. The Arima Model Arima modeling process begin with Estimation ARIMA model. This estimation include degree of autoregressive (p), differencing (d), and moving average (q) and seasonality factors. Data does not stationary so need differencing process to make stationary. The autocorrelations function of differencing process result in each lag is is shown in Figure 1. This data is stationary because this autocorellations approach zero exponentially after the second or third time lag. So degree of d for Arima models is 1. After we find degree of d then we have to find degree of p and q to make ARIMA model. Prediction degree of p and q can be seen from the correlogram plot of autocorellation function (ACF) and partial autocorellation function (PACF).

Wiwik Anggraeni et al. / Procedia Computer Science 72 ( 2015 ) 630 637 633 Fig. 1. autocorrelations function correlogram of training data after first differencing process Fig. 2. partial autocorrelations function Correlogram of training data Estimation of Arima models is (2, 1, 0) (0,1,0) 12. Due to this estimation process there is only one model is found, then as a comparison to prove that this model is better then used the model (1, 1, 0) (0,1,0) 12. After we find the fit model then the next step is testing and parameter estimation. This testing use hypotesis and if all parameters had p-value <α (0.05) so H 0 is rejected [7,8]. Table 1. Arima model parameters (1, 1, 0) (0,1,0) 12 and Arima (2, 1, 0) (0,1,0) 12 Model Parameters Estimate SE T Sig. Constant 0.755 7.37 0.102 0.919 (1, 1, 0) AR Lag 1-0.334 0.113-2.953 0.004 (0,1,0) 12 Difference 1 Seasonal Difference 1 (2, 1, 0) (0,1,0) 12 Constant 0.755 7.37 0.102 0.919 AR Lag 1-0.334 0.113-2.953 0.004 Lag 2-0.495 0.104-4.737 0 Difference 1 Seasonal Difference 1 Table 1 show that all of model parameters significant because the value of Sig in Lag 1 and Lag 2 are less than 0.05 (level of significancy is 95%). We have found 2 models that fit to forecast. The last step to

634 Wiwik Anggraeni et al. / Procedia Computer Science 72 ( 2015 ) 630 637 make sure that the models are best models is diagnostic test [7,8,9]. This test use Ljung Box test. Diagnostic tests use to determine Arima models which are fit for forecasting. In this step, Arima models should be White Noise. Diagnostic test show that Arima (2,1,0)(0,1,0) 12 only is white noise. 2.2 The Arimax Model Arimax model require independent variable that acts as an additional variable. Independent variables that used in this research are dummy variables that serves to incorporate the calendar variations effects, namely dummy variable month in year (D 1, D 2,..., D 12 ),dummy variable one month before the Eidholidays (D SIF ), dummy variables month of Eid-holidays (D IF ), dummy variables one month after the month of Eid holidays (D IFS ), dummy variables one month before the Eid-holidays, month of Eidholidays, and one month after the month of Eid holidays (D SIFIFIFS ), dummy variables month of Eidholidays, and one month after the month of Eid holidays (D IFIFS ). Fig. 3. autocorrelations function Correlogram of regression process residual after 1 st differencing process Fig. 4. partial autocorrelations function Correlogram of regression process residual These dummy variables used to make regression analysis. And the regression equation is

Wiwik Anggraeni et al. / Procedia Computer Science 72 ( 2015 ) 630 637 635 number = 234 jan + 160 feb + 127 mar + 88.4 apr + 77.6 mei + 94.1 jun + 136 jul + 249 ags + 205 sept + 224 okt + 218 nov + 266 des + 206 SIF + 97.8 IF - 54.6 IFS (6) Autocorrellation function of regression process residual show that data is not stationary yet, so we need differencing process to make it stationary. Figure 3 and Figure 4 are the result of autocorrelation function of residual after 1 st differencing and the result of partial autocorrelation function of residual. Table 2. Model parameters Arima (3,1,0)(0,1,0) 12 Model Parameters Estimate SE T Sig. Constant 0.767 2.539 0.302 0.764 Lag 1-0.509 0.115-4.433 0 (3, 1, 0) AR Lag 2-0.369 0.123-3.007 0.004 (0,1,0) 12 Lag 3-0.331 0.116-2.861 0.006 Difference 1 Seasonal Difference 1 Tabel 2 show that all of parameter in Arima (3,1,0)(0,1,0) 12 are significant. The next step to make sure that this is a fit model is LJung Box test for regression residual and the result is shown that Arima (3,1,0)(0,1,0) 12 is white-noise. 4. RESULTS AND DISCUSSION 5.1 Comparison between Arima and Arimax model Based on the experiments results can be analyzed which best method between Arima and Arimax that can be used to forecast the number of moslem kids demand. The selection of the best method can be done with some comparisons, such as : 5.1.1 Level of Accuracy. The training and testing process result from Arimax method have better accuracy level than Arima method. Comparison of forecasting accuracy both of methods are shown in Table 3. This accuracy is viewed by Akaike Information Criterion (AIC), Mean Absolute Percentage Error (MAPE), and Root Mean Square Error (RMSE). Table 3. Comparison of Arima and Arimax accuracy. Method Arima Arimax Data Degree of Optimum Model AIC MAPE RMSE p d q s Training 12.48% 23.02 2 1 0 12 929.693 Testing 17.29% 73.56 Training 11.64% 18.08 3 1 0 12 891.144 Testing 14.80% 72.09 It can be seen that the accuracy of some value criteria such as AIC, MAPE, RMSE are produced by Arimax has smaller value than Arima method. This smaller value indicate that the forecasting results more accurate. 5.1.2 Results Plot of Training and Testing Process. The results plot comparison of training proces using Arima and Arimax shown in Figure 5(a). Meanwhile, The results plot comparison of testing process using Arimax Arima shown in Figure 5(b). Based on Figure 5(a) and 5(b) can be seen that Arimax have predictive value approaches the actual data.

636 Wiwik Anggraeni et al. / Procedia Computer Science 72 ( 2015 ) 630 637 This is know that the plot of Arimax forecast (green line) coincide with the actual data. Meanwhile, the plot of Arima (red line) does not coincide. Fig. 5. (a) results plot comparison of training process; (b) results plot comparison of testing process 5.1.3 Next Periodes Forecast Results Comparison of forecasting result can be seen in Figure 6. Fig. 1. result of Arima model and Arimax model Based on data analysis known that the highest number of demand occur in the month before the month of Eid holidays for every year. Thus, if Eid holidays occurred in August then the highest forecast value should occur in July, not in November as produced by Arima. Arimax produce the highest forecast in July. This indicates that Arimax be able to identify the increase or decrease number of moslem kids demand in certain times. In addition, based on the analysis of data, in month of Eid holidays, demand of moslem kids always decline but will decrease again normally in next month. This caused customers make a lot of orders in the month before Eid holidays and reducing orders after the feast is finished. This is also shown by the results of Arimax method which decreased up to 83% and decrease in next month up to 59%. This is contrast with result of Arima which decrease up to 8% and 25% successively. So, Arimax method can provide better results than Arima.

Wiwik Anggraeni et al. / Procedia Computer Science 72 ( 2015 ) 630 637 637 5.2 Independent Variable Influence on Methods Arimax In the Arimax forecasting process, we included 18 independent variables in the form of a dummy variable for calendar variations. However, the regression process only use 15 variables. SIFIFIFS, IFIFS, and months removed from the equation because they have a very high correlation with other predictor variables. This is done automatically by the application. We do the tests to prove how much the influence of omitted variables. The following table shows the results of the test with some number of dummy variables are eliminated. The effect of this variable elimination is shown in Table 4. Table 4. Effect of Dummy variable elimination No Number of MAPE RMSE Variable Training Testing Training Testing 1 18 11.64% 14.80% 18.08 72.09 2 17 11.64% 14.80% 18.08 72.09 3 16 11.64% 14.80% 18.08 72.09 4 15 11.64% 14.80% 18.08 72.09 5 14 12.96% 17.21% 22.64 79.69 Based on this Table, it can be seen that remove 3 variables (SIFIF IFS, IFS and variable month) will not affect the forecasting results, but we have a very different result if we remove the variable other than these variable. It happen if we remove IFS dummy variable. If the dummy variable 1 month after the Eid (DIFS) is not included in the Arimax forecasting process then the AIC value, MAPE and RMSE increases. So, the level of accuracy is reduced. 5. CONCLUSION Arimax model have better performance than ARIMA method. It is seen from the comparison of AIC, MAPE and RMSE from Arimax smaller than ARIMA. Ability Arimax method to include the variation of calendar effect in the form of dummy variables, make Arimax method can generate the forecasting result with more accuracy level than Arima method. There are independent variables that can be used to generate forecasting with greater accuracy, namely month of the year dummy variables, dummy variable 1 month prior to the Eid, dummy variable in the time of the Eid, and dummy variable 1 month after the Eid. 6. References [1] Lee, M. H., Suhartono, dan Hamzah, N.A. (2010). Calendar variation model based on ARIMAX for forecasting sales data with Ramadhan effect.proc. RCSS 10, Malaysia, 349-361. [2] Chang, P. C., Wang, Y. W., & Liu, C. H. (2007). The development of a weighted evolving fuzzy neural. Expert Systems with Applications, 86 96. [3] Chan, W. W., dan Chan, L. C. (2008). Revenue Management Strategies Under the Lunar Solar Calendar: Evidence of Chinese Restaurant Operations. International Journal ofhospitality Management, 27, 381-390 [4] Liu, L.M. 1986. Identification of Time Series Models in the Presence of Calendar Variation. International Journal of Forecasting, 2, 357-372 [5] Cryer, J.D., & Chan, K.S. (2008).Time Series Analysis: with Application in R. 2nd edition, New York: Springer-Verlag [6] Bowerman, B.L. and O Connell, R.T., 1993. Forecasting and Time series: An Applied Approach, 3rd edition. Belmont, California : Duxbury Press. [7] Makridakis, S., Wheelwright, S. C., & Hyndman, R. J. (1998). Forecasting : Methods and Application. Hoboken: John Willey & Sons, Inc. [8] Shumway, R.H., & Stoffer, D.S. (2006).Time Series Analysis and Its Applications with R Examples.2nd edition, Berlin: Springer. [9] Wei, William, W. S., (2006), Time Series Analysis Univariate and Multivariate Methods. Addison Wesley Publishing Company, Inc, America.