Performance of Autoregressive Order Selection Criteria: A Simulation Study

Similar documents
On Autoregressive Order Selection Criteria

Terence Tai-Leung Chong. Abstract

Performance of lag length selection criteria in three different situations

Time Series Forecasting: A Tool for Out - Sample Model Selection and Evaluation

Model selection criteria Λ

LATVIAN GDP: TIME SERIES FORECASTING USING VECTOR AUTO REGRESSION

Minimum Message Length Autoregressive Model Order Selection

Comparison of New Approach Criteria for Estimating the Order of Autoregressive Process

Outlier detection in ARIMA and seasonal ARIMA models by. Bayesian Information Type Criteria

Information Criteria and Model Selection

SCIENCE & TECHNOLOGY

Additive Outlier Detection in Seasonal ARIMA Models by a Modified Bayesian Information Criterion

1 Introduction. 2 AIC versus SBIC. Erik Swanson Cori Saviano Li Zha Final Project

FORECASTING SUGARCANE PRODUCTION IN INDIA WITH ARIMA MODEL

Topic 4 Unit Roots. Gerald P. Dwyer. February Clemson University

10. Time series regression and forecasting

Time Series Analysis. James D. Hamilton PRINCETON UNIVERSITY PRESS PRINCETON, NEW JERSEY

Brief Sketch of Solutions: Tutorial 3. 3) unit root tests

Measures of Fit from AR(p)

Time Series Analysis. James D. Hamilton PRINCETON UNIVERSITY PRESS PRINCETON, NEW JERSEY

A Non-Parametric Approach of Heteroskedasticity Robust Estimation of Vector-Autoregressive (VAR) Models

Elements of Multivariate Time Series Analysis

FEA Working Paper No

Combining time series models for forecasting

Automatic Forecasting

A TIME SERIES PARADOX: UNIT ROOT TESTS PERFORM POORLY WHEN DATA ARE COINTEGRATED

ARIMA Modelling and Forecasting

The causal relationship between energy consumption and GDP in Turkey

On Consistency of Tests for Stationarity in Autoregressive and Moving Average Models of Different Orders

Model selection using penalty function criteria

DEPARTMENT OF ECONOMETRICS AND BUSINESS STATISTICS

Review Session: Econometrics - CLEFIN (20192)

Volume 30, Issue 1. The relationship between the F-test and the Schwarz criterion: Implications for Granger-causality tests

Alfredo A. Romero * College of William and Mary

A Course in Time Series Analysis

Univariate linear models

A NOTE ON THE SELECTION OF TIME SERIES MODELS. June 2001

International Symposium on Mathematical Sciences & Computing Research (ismsc) 2015 (ismsc 15)

Autoregressive Moving Average (ARMA) Models and their Practical Applications

1. Introduction Over the last three decades a number of model selection criteria have been proposed, including AIC (Akaike, 1973), AICC (Hurvich & Tsa

First-order random coefficient autoregressive (RCA(1)) model: Joint Whittle estimation and information

Forecasting exchange rate volatility using conditional variance models selected by information criteria

An Evolutive Interval Type-2 TSK Fuzzy Logic System for Volatile Time Series Identification.

Variable Selection in Restricted Linear Regression Models. Y. Tuaç 1 and O. Arslan 1

Time series modelling and forecasting of Sarawak black pepper price

Lesson 13: Box-Jenkins Modeling Strategy for building ARMA models

Improved model selection criteria for SETAR time series models

Econometría 2: Análisis de series de Tiempo

ARDL Cointegration Tests for Beginner

The Influence of Additive Outliers on the Performance of Information Criteria to Detect Nonlinearity

Regression and Time Series Model Selection in Small Samples. Clifford M. Hurvich; Chih-Ling Tsai

Selection Criteria Based on Monte Carlo Simulation and Cross Validation in Mixed Models

Lecture 7: Model Building Bus 41910, Time Series Analysis, Mr. R. Tsay

FORECASTING COARSE RICE PRICES IN BANGLADESH

Section 2 NABE ASTEF 65

AN ILLUSTRATION OF GENERALISED ARMA (GARMA) TIME SERIES MODELING OF FOREST AREA IN MALAYSIA

Multivariate Time Series Analysis and Its Applications [Tsay (2005), chapter 8]

Forecasting Egyptian GDP Using ARIMA Models

Time Series Models and Inference. James L. Powell Department of Economics University of California, Berkeley

Postprint / Postprint Zeitschriftenartikel / journal article

TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA

AR-order estimation by testing sets using the Modified Information Criterion

Arma-Arch Modeling Of The Returns Of First Bank Of Nigeria

Lecture on ARMA model

Quantitative Finance I

ISyE 691 Data mining and analytics

End-Semester Examination MA 373 : Statistical Analysis on Financial Data

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Bristol Business School

Saskia Rinke and Philipp Sibbertsen* Information criteria for nonlinear time series models

Forecasting Foreign Direct Investment Inflows into India Using ARIMA Model

Applied time-series analysis

SEMIPARAMETRIC COINTEGRATING RANK SELECTION. XU CHENG and PETER C. B. PHILLIPS COWLES FOUNDATION PAPER NO. 1272

Using the Divergence Information Criterion for the Determination of the Order of an Autoregressive Process

ARMA MODELS Herman J. Bierens Pennsylvania State University February 23, 2009

AE International Journal of Multi Disciplinary Research - Vol 2 - Issue -1 - January 2014

FORECASTING THE INVENTORY LEVEL OF MAGNETIC CARDS IN TOLLING SYSTEM

Revisiting linear and non-linear methodologies for time series prediction - application to ESTSP 08 competition data

Time Series. Chapter Time Series Data

Inflation Revisited: New Evidence from Modified Unit Root Tests

STAT Financial Time Series

Introduction to Econometrics

Dynamic Time Series Regression: A Panacea for Spurious Correlations

AR, MA and ARMA models

CORRELATION, ASSOCIATION, CAUSATION, AND GRANGER CAUSATION IN ACCOUNTING RESEARCH

The Role of "Leads" in the Dynamic Title of Cointegrating Regression Models. Author(s) Hayakawa, Kazuhiko; Kurozumi, Eiji

Time Series: Theory and Methods

A New Solution to Spurious Regressions *

Box-Jenkins ARIMA Advanced Time Series

Goodness-of-Fit Tests for Time Series Models: A Score-Marked Empirical Process Approach

The GARCH Analysis of YU EBAO Annual Yields Weiwei Guo1,a

Univariate ARIMA Models

Selection of the Appropriate Lag Structure of Foreign Exchange Rates Forecasting Based on Autocorrelation Coefficient

Financial Econometrics

Market efficiency of the bitcoin exchange rate: applications to. U.S. dollar and Euro

DYNAMIC ECONOMETRIC MODELS Vol. 9 Nicolaus Copernicus University Toruń Mariola Piłatowska Nicolaus Copernicus University in Toruń

ARMA models with time-varying coefficients. Periodic case.

Econometric Forecasting

Bristol Business School

FORECASTING OF COTTON PRODUCTION IN INDIA USING ARIMA MODEL

Transcription:

Pertanika J. Sci. & Technol. 6 (2): 7-76 (2008) ISSN: 028-7680 Universiti Putra Malaysia Press Performance of Autoregressive Order Selection Criteria: A Simulation Study Venus Khim-Sen Liew, Mahendran Shitan 2*, Choong Chee Keong 3 and Hooy Chee Wooi 3 Labuan School of International Business and Finance, Universiti Malaysia Sabah, Jalan Sungai Pagar, 87000 Labuan, Malaysia 2 Department of Mathematics/ Mathematics Research Institute, Universiti Putra Malaysia, 43400 UPM, Serdang, Selangor, Malaysia 3 Faculty of Accountancy and Management, Universiti Tunku Abdul Rahman, Jalan Genting Kelang, 53000 Kuala Lumpur, Malaysia ABSTRACT Proper selection of autoregressive order plays a crucial role in econometrics modeling cycles and testing procedures. This paper compares the performance of various autoregressive order selection criteria in selecting the true order. This simulation study shows that Schwarz information criterion (SIC), final prediction error (FPE), Hannan-Qiunn criterion (HQC) and Bayesian information criterion (BIC) have considerable high performance in selecting the true autoregressive order, even if the sample size is small, whereas Akaike s information criterion (AIC) over-estimated the true order with a probability of more than two-thirds. Further, this simulation study also shows that the probability of these criteria (except AIC) in correctly estimating the true order approaches one as sample size grows. Generally, these findings show that the most commonly used AIC might yield misleading policy conclusions due to its unsatisfactory performance. We note here that out of a class of commonly used criteria, BIC performs the best for a small sample size of 25 observations. Keywords: Autoregressive, order selection criteria, simulation INTRODUCTION Most econometric models are formulated based on the Autoregressive (AR) process. In particular, AR process forms the main building block of the celebrated Integrated Autoregressive Moving Average (ARIMA) model, Vector Autoregressive (VAR) model, Vector Error Correction (VEC) model, Autoregressive Distributive Lag (ARDL) model, Generalized Autoregressive Conditional Heteroscedasticity (GARCH) models, Threshold Autoregressive (TAR) model and the Smooth Transition Autoregressive (STAR) model, to name a few. A major problem with these econometric models is that the true order (p) of the AR (p) process is always unknown and the optimal order has to be determined by certain order selection criteria. Moreover, several econometric test procedures in empirical research such as the unit root tests, causality tests and cointegration tests are sensitive to the choice of autoregressive order. Thus, proper selection of autoregressive order is an important task in econometric modeling cycles and testing procedures. The final prediction error (FPE) criterion, Schwarz information criterion (SIC), Hannan-Qiunn criterion (HQC) and Akaike s information criterion (AIC), among Received : 4 July 2007 Accepted : 3 December 2007 * Corresponding Author 08. jst48/2008 7 /2/09, 7:00

Venus Khim-Sen Liew, Mahendran Shitan, Choong Chee Keong and Hooy Chee Wooi others, are commonly employed by empirical models researchers to determine the optimal autoregressive order. One interesting question that concerns many researchers is which criterion has the capability to provide the most reliable optimal order. However, there is no universally accepted solution to this question. In this study, firstly we attempt to compare via a simulation study, the performance of these criteria and second, to examine the effects of various sample sizes on the performance of these criteria in the selection of order. We find that all selection criteria under study (except AIC) performed quite well in estimating the true order even if the sample size is small. We note that this probability is less than 0.25 for AIC even when the sample size grows to the very large value of 00,000. AUTOREGRESSIVE ORDER SELECTION CRITERIA There has been considerable literature published on AR order selection criteria. A brief discussion of these criteria is available in Beveridge and Oickle (994), whereas Brockwell and Davis (996) presented greater theoretical and practical detail and additional references for many of these criteria. Suppose the data set {X,, X T } are in fact observations of an AR process of order p, denoted by AR(p) and defined as X = "# X + $ t p i= i t! i t () where # i, i =,, p are autoregressive parameters to be estimated and $ t, t =,, T is an independent and identically distributed (iid) disturbance or error term of a zero mean and a finite variance % 2, which can be easily calculated by ˆ % 2 = ( T! p! )! ˆ$ 2, where t t= symbol ^ stands for estimated value. The final prediction error (FPE) criterion, which was originally proposed by Akaike (969) for AR(p) order determination, is given by FPE = ˆ% 2! ( T! p) ( T + p). Having found that FPE is asymptotically inconsistent, Akaike (973) came up with the so-called Akaike information criterion (AIC), defined as AIC=ln ( ˆ% 2 ) +2p/T. Disregarding the fact that AIC is only asymptotically unbiased (Jones, 975) and has the tendency to pick models which are over-parameterized (Shibata, 976), AIC is a very commonly used criterion for order selection (Basci and Zaman, 998). Another statistics due to Akaike (979) is the Bayesian information criterion, which is defined as BIC = (T p)in[(t p) T ˆ% 2 ]+T[+In 2&)] T + p In[p 2 2 (" Xt! Tˆ% ). There is evidence to suggest that the BIC is better than the AIC as t= order selection criterion (Hannan, 980). Schwarz (978) derived an information criterion, known as Schwarz information criterion (SIC) defined as SIC = ln( ˆ% 2 )+ T - p ln(t). SIC is also called Schwarz-Rissanen criterion as Rissanen (978) also arrived at the same criterion independently, using different methodology. However, SIC has the advantage of being consistent over AIC (Basci and Zaman, 998). T " 72 08. jst48/2008 72 /2/09, 7:0

Performance of Autoregressive Order Selection Criteria: A Simulation Study Hannan and Quinn (979) and Hannan (980) constructed the Hannan-Quinn criterion (HQC) from the law of the iterated logarithm. It provides a penalty function, which decreases as fast as possible for a strongly consistent estimator, as sample size increases. Hannan-Quinn criterion is given by HQC = ln ˆ% 2 + 2T p ln(lnt). Hannan and Rissanen (982) replaced the term ln(lnt) by lnt to speed up the convergence of HQC. This revised version of HQ, however, was found to overestimate the model orders (Kavalieris, 99). SIMULATION RESULTS To achieve our objective, we simulated the AR process with normally distributed random errors of zero mean and a finite variance % 2. We arbitrarily set the true lag length p = 4 and generate # i, i =, 2, 3, 4 from a uniform distribution in the region ( 0.25, 0.25). The choice of this region allows us to avoid undesired nonstationary processes. Meanwhile c may take any value and is generated from an arbitrarily chosen uniform distribution in the region ( 00,00). We simulate data sets for various sample sizes: 25, 50, 00, 250, 500, 000, 0000, 00000. For each sample size, we simulated 000 independent series for the purpose of order estimation. The estimated order ˆp can be any integer from to 20. The probability of estimating the true order, denoted by P( ˆp = p) is then determined. In addition, this study also investigated the probabilities of under- and over-estimating the true order [denoted by P( ˆp > p) and P( ˆp < p) respectively], in which the estimated order based on the selection criteria is less than and more than the true order, respectively. The probabilities (in percentages) of various criteria incorrectly, under- and over- estimating the true order are presented in Figs. to 3. The most important finding is that all selection criteria under study (except AIC) perform quite well in estimating the true order even if the sample size is small. It is noteworthy that the performance of all criteria in correctly estimating the true order increases as the sample size increases. In particular, P( ˆp = p) almost converges to with a sample size of 00,000 observations for all criteria except in the case of AIC, where P( ˆp = p) is less than 0.25 even for this extraordinary large sample size (Fig. ). Taken together, these findings warrant us that the most commonly used AIC might yield misleading policy conclusions due to its unsatisfactory performance. We note here that out of a class of commonly used criteria, BIC performs the best, followed by HQC, FPE and SIC, whereas AIC shows the worst performance. Note that AR-type models (i.e. dynamic models with lagged dependent variables) are quite problematic in small samples (of less than 50 observations for instance) since both Ordinary Least Squares and Maximum Likelihood have only large small and the usual inference machinery like t-statistics are also only asymptotically justified in this case. However, in circumstance (for example, many economic time series hardly have more than 50 annual observations) that leaves us no choice, knowing the small sample performance of these criteria in identifying the order p become useful. 73 08. jst48/2008 73 /2/09, 7:0

Venus Khim-Sen Liew, Mahendran Shitan, Choong Chee Keong and Hooy Chee Wooi Fig. : Probability of correctly estimating the true order Fig. 2: Probability of under-estimating the true order Another important finding of this study is that whenever FPE, BIC, HQC and SIC fail to estimate the true order, it is due to under-estimation rather than over-estimation. The reverse is true for AIC. For illustration, P( ˆp < p) for these criteria are in the order, 22.7, 29.0, 32.7, 4.9 and 4.5 (Fig. 2), whereas P( ˆp > p), in the same order is.9, 5.3, 2.0, 0.4 and 76.7 (Fig. 3) for a sample size of 25 observations. It is noted that in general (excluding the case of AIC), the probabilities of both under- and over-estimation reduces as sample size increases, thereby providing room for improvement in the performance of these criteria. 74 08. jst48/2008 74 /2/09, 7:0

Performance of Autoregressive Order Selection Criteria: A Simulation Study HQ Fig. 3: Probability of over-estimating the true order CONCLUSIONS Econometric modeling cycles and testing procedures mostly involves the predetermination of the autoregressive order (p). In reality, the true order (p) of the autoregressive (AR) process is unknown and the best a researcher can do is to estimate it using certain commonly accepted order selection criteria. The resulting estimate is known as the optimal order. Among others, the final prediction error (FPE) criterion, Schwarz information criterion (SIC), Hannan-Qiunn criterion (HQC), Akaike s information criterion (AIC) are commonly employed in empirical study. Nonetheless, there is no consensus on which of these criteria has the capability to provide the most reliable optimal order. As these criteria have important implications on the reliability of the estimated autoregressive order and thereby the subsequent findings based on this estimation, it is important to understand their performances. In this regard, this study attempts to compare via a simulation study, the performance of these criteria. Further, the effects of various sample sizes on the performance of these criteria in the selection of order were also analyzed. This simulation study showed that SIC, FPE, HQC and BIC display considerably good performance in picking the true autoregressive order, even if the sample size is small, whereas AIC over-estimated the true order with a probability of more than two-third. Further, this simulation study also showed that the probability of these criteria (with the exception of AIC) correctly estimating the true order approaches one as sample size grows. In general, these findings show that the most commonly used AIC might yield misleading policy conclusions due to its unsatisfactory low performance. We note here that out of a class of commonly used criteria, BIC performs the best for a small sample size of 25 observations. 75 08. jst48/2008 75 /2/09, 7:0

Venus Khim-Sen Liew, Mahendran Shitan, Choong Chee Keong and Hooy Chee Wooi ACKNOWLEDGEMENTS The authors would like to thank the anonymous referees who have provided useful comments and suggestions on the previous version of this paper. REFERENCES AKAIKE, H. (969). Fitting autoregressive models for prediction. Annals of the Institute of Statistical Mathematics, 2, 243 247. AKAIKE, H. (973). Information theory and an extension of the maximum likelihood principle. In B. N. Perron and F. Csaki (Eds.), 2 nd International Symposium in Information Theory (pp. 267 28). Budapest: Akademial Kiado. AKAIKE, H. (979). A Bayesian extension of the minimum AIC procedure of autoregressive model fitting. Biometrika, 66, 237 242. BASCI, S. and ZAMAN, A. (998). Effects of skewness and kurtosis on model selection criteria. Economic Letters, 59, 7 22. BEVERIDGE, S. and OICKLE, C. (994). A comparison of Box-Jenkins and objective methods for determining the order of a non-seasonal ARMA model. Journal of Forecasting, 3, 49 434. BROCKWELL, P. J. and DAVIS, R. A. (996). Introduction to Time Series and Forecasting. New York: Springer. HANNAN, E. J. (980). The estimation of the order of an ARMA process. Annals of Statistics, 8, 07 08. HANNAN, E. J. and QUINN, B. G. (979). The determination of the order of an autoregression. Journal of Royal Statistical Society, 4, 90 95. HANNAN, E. J. and RISSANEN, J. (982). Recursive estimation of mixed autoregressive-moving average order. Biometrika, 69, 8 94. HURVICH, C. M. and TSAI, C. L. (989). Regression and time series model selection in small samples. Biometrika, 76, 297 307. JONES, R. H. (975). Fitting autoregressions. Journal of American Statistics Association, 70, 590 592. KAVALIERIS, L. (99). A note on estimating autoregressive-moving average order. Biometrika, 78, 920 922. SCHWARZ, G. (978). Estimating the dimension of a model. The Annals of Statistics, 6, 46 464. SHIBATA, R. (976). Selection of the order of an autoregressive model by Akaike s information criterion. Biometrika, 63, 7 26. RISSANEN, J. (978). Modelling by shortest data description. Automatica, 4, 467 47. 76 08. jst48/2008 76 /2/09, 7:0