Gaussian Copula Regression Application

Similar documents
FORECASTING SUGARCANE PRODUCTION IN INDIA WITH ARIMA MODEL

FE570 Financial Markets and Trading. Stevens Institute of Technology

Forecasting Egyptian GDP Using ARIMA Models

at least 50 and preferably 100 observations should be available to build a proper model

Forecasting Bangladesh's Inflation through Econometric Models

Time Series Analysis. James D. Hamilton PRINCETON UNIVERSITY PRESS PRINCETON, NEW JERSEY

Time Series Analysis. James D. Hamilton PRINCETON UNIVERSITY PRESS PRINCETON, NEW JERSEY

Empirical Approach to Modelling and Forecasting Inflation in Ghana

Lecture 2: Univariate Time Series

Time Series I Time Domain Methods

EASTERN MEDITERRANEAN UNIVERSITY ECON 604, FALL 2007 DEPARTMENT OF ECONOMICS MEHMET BALCILAR ARIMA MODELS: IDENTIFICATION

Time Series Analysis -- An Introduction -- AMS 586

Applied time-series analysis

Automatic seasonal auto regressive moving average models and unit root test detection

Chapter 8: Model Diagnostics

Romanian Economic and Business Review Vol. 3, No. 3 THE EVOLUTION OF SNP PETROM STOCK LIST - STUDY THROUGH AUTOREGRESSIVE MODELS

TAKEHOME FINAL EXAM e iω e 2iω e iω e 2iω

Review Session: Econometrics - CLEFIN (20192)

Empirical Market Microstructure Analysis (EMMA)

Application of Time Sequence Model Based on Excluded Seasonality in Daily Runoff Prediction

Double Gamma Principal Components Analysis

Module 3. Descriptive Time Series Statistics and Introduction to Time Series Models

LATVIAN GDP: TIME SERIES FORECASTING USING VECTOR AUTO REGRESSION

Arma-Arch Modeling Of The Returns Of First Bank Of Nigeria

STAT Financial Time Series

TIME SERIES ANALYSIS AND FORECASTING USING THE STATISTICAL MODEL ARIMA

Econ 423 Lecture Notes: Additional Topics in Time Series 1

Estimation and application of best ARIMA model for forecasting the uranium price.

Suan Sunandha Rajabhat University

Univariate, Nonstationary Processes

Problem Set 2: Box-Jenkins methodology

Chapter 6: Model Specification for Time Series

Prof. Dr. Roland Füss Lecture Series in Applied Econometrics Summer Term Introduction to Time Series Analysis

APPLIED ECONOMETRIC TIME SERIES 4TH EDITION

Multivariate Time Series: VAR(p) Processes and Models

MODELING INFLATION RATES IN NIGERIA: BOX-JENKINS APPROACH. I. U. Moffat and A. E. David Department of Mathematics & Statistics, University of Uyo, Uyo

Author: Yesuf M. Awel 1c. Affiliation: 1 PhD, Economist-Consultant; P.O Box , Addis Ababa, Ethiopia. c.

Dynamic Time Series Regression: A Panacea for Spurious Correlations

Chapter 12: An introduction to Time Series Analysis. Chapter 12: An introduction to Time Series Analysis

Autoregressive Moving Average (ARMA) Models and their Practical Applications

5 Autoregressive-Moving-Average Modeling

Comparing the Univariate Modeling Techniques, Box-Jenkins and Artificial Neural Network (ANN) for Measuring of Climate Index

Convolution Based Unit Root Processes: a Simulation Approach

A time series is called strictly stationary if the joint distribution of every collection (Y t

Ch 8. MODEL DIAGNOSTICS. Time Series Analysis

Ch 6. Model Specification. Time Series Analysis

Advanced Econometrics

Univariate ARIMA Models

A TIME SERIES PARADOX: UNIT ROOT TESTS PERFORM POORLY WHEN DATA ARE COINTEGRATED

Modelling using ARMA processes

Lecture 3: Autoregressive Moving Average (ARMA) Models and their Practical Applications

Forecasting using R. Rob J Hyndman. 2.4 Non-seasonal ARIMA models. Forecasting using R 1

Oil price volatility in the Philippines using generalized autoregressive conditional heteroscedasticity

A Data-Driven Model for Software Reliability Prediction

Lecture 3: Autoregressive Moving Average (ARMA) Models and their Practical Applications

Economics 618B: Time Series Analysis Department of Economics State University of New York at Binghamton

Univariate Time Series Analysis; ARIMA Models

7. Integrated Processes

SOME BASICS OF TIME-SERIES ANALYSIS

Time Series Analysis of United States of America Crude Oil and Petroleum Products Importations from Saudi Arabia

Lab: Box-Jenkins Methodology - US Wholesale Price Indicator

On Consistency of Tests for Stationarity in Autoregressive and Moving Average Models of Different Orders

7. Integrated Processes

MCMC analysis of classical time series algorithms.

Forecasting. Simon Shaw 2005/06 Semester II

Econometric Forecasting

Circle the single best answer for each multiple choice question. Your choice should be made clearly.

DEPARTMENT OF ECONOMICS

Classic Time Series Analysis

ECONOMICS 7200 MODERN TIME SERIES ANALYSIS Econometric Theory and Applications

University of Oxford. Statistical Methods Autocorrelation. Identification and Estimation

Multivariate Time Series Analysis and Its Applications [Tsay (2005), chapter 8]

Lesson 13: Box-Jenkins Modeling Strategy for building ARMA models

G. S. Maddala Kajal Lahiri. WILEY A John Wiley and Sons, Ltd., Publication

Basics: Definitions and Notation. Stationarity. A More Formal Definition

Box-Jenkins ARIMA Advanced Time Series

ARIMA Models. Jamie Monogan. January 25, University of Georgia. Jamie Monogan (UGA) ARIMA Models January 25, / 38

Unit root problem, solution of difference equations Simple deterministic model, question of unit root

Time Series Analysis

AR, MA and ARMA models

Circle a single answer for each multiple choice question. Your choice should be made clearly.

STAT 436 / Lecture 16: Key

Final Examination 7/6/2011

On Monitoring Shift in the Mean Processes with. Vector Autoregressive Residual Control Charts of. Individual Observation

MODELLING TIME SERIES WITH CONDITIONAL HETEROSCEDASTICITY

Topic 4 Unit Roots. Gerald P. Dwyer. February Clemson University

Long memory in the R$/US$ exchange rate: A robust analysis

Volatility. Gerald P. Dwyer. February Clemson University

Econometrics I: Univariate Time Series Econometrics (1)

Minitab Project Report - Assignment 6

Estimating AR/MA models

Research Article The Laplace Likelihood Ratio Test for Heteroscedasticity

Stochastic Processes

Univariate linear models

Part 1. Multiple Choice (50 questions, 1 point each) Part 2. Problems/Short Answer (10 questions, 5 points each)

ECONOMETRIA II. CURSO 2009/2010 LAB # 3

3 Theory of stationary random processes

Decision 411: Class 9. HW#3 issues

A SEASONAL TIME SERIES MODEL FOR NIGERIAN MONTHLY AIR TRAFFIC DATA

STAT 443 Final Exam Review. 1 Basic Definitions. 2 Statistical Tests. L A TEXer: W. Kong

Transcription:

International Mathematical Forum, Vol. 11, 2016, no. 22, 1053-1065 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/imf.2016.68118 Gaussian Copula Regression Application Samia A. Adham Department of Statistics, Faculty of Science King Abdulaziz University Jeddah, Kingdom of Saudi Arabia Mervat K. Abd Elaal Department of Statistics, Faculty of Science King Abdulaziz University Jeddah, Kingdom of Saudi Arabia & Department of Statistics, Facultu of Commerce Al-Azhar University, Girls Branch Ciro, Egypt Heba M. Malaka Department of Statistics, Faculty of Science King Abdulaziz University Jeddah, Kingdom of Saudi Arabia Copyright 2016 Samia A. Adham et al. This article is distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Abstract Gaussian copula models are frequently used to extend univariate regression models to the multivariate case. The main benefit of the topic is that the specification of the regression model is conveniently separated from the dependence structure described in the familiar form of the correlation matrix of a multivariate Gaussian distribution [1]. This form of flexibility has been successfully employed in several complex applications including longitudinal data analysis, spatial statistics, genetics and time series. In this paper the Gaussian marginal copula regression applied to currency exchange rate data set by using log-return transformation and difference transformation according to AIC which are computed by applying the Gaussian marginal copula regression function in GCMR R package.

1054 Samia A. Adham et al. Keywords: Copula regression, Gaussian copula, Time series 1 Introduction Various authors discussed likelihood inference for Gaussian copula models. [4] developed a Markov Chain Monte Carlo algorithm for Bayesian inference, [2] adopted a sequential importance sampling algorithm. Well-known limits of the Gaussian copula approach are the impossibility to deal with asymmetric dependence and the lack of tail dependence. These limits may impact the use of Gaussian copulas to model forms of dependence arising, for example, in extreme environmental events or in financial data. Conversely, [5] focuses on working Gaussian copulas used to conveniently handle dependence in regression analysis as described in [3] and [2]. In other terms, the parameters of interest are the regression coefficients, while the dependence structure identified by the Gaussian copula is a nuisance component. In [6] solve the problem of modeling extreme data with the Gaussian copula marginal regression. The model was applied to study the rise harvested area production centers in East Java and other areas. Some useful references can be found in [2] and [3]. Gaussian copula marginal regression models The known regression model is as it was introduced in [2]: Y i = g(x i, ε i ; λ), i = 1,, n, (1) where g( ) is a corresponding function of the regressors x i and the error term ε i, and λ is a vector of parameters. Among the many possible specifications for the function g( ), the selection of the model is as follows: Y i = F i 1 { Φ(ε i ); λ }, (2) where F i ( ; ) = F( x i ; ) the cumulative distribution functions of Y i given x i and Φ( ) is standard normal variate. Equation (2) includes all possible parametric regression models for continuous and non-continuous responses. The Gaussian linear regression model Y i = x i T β + σε i corresponds to set F i (Y i ; λ) = Φ{(Y i x i T β)/σ} in Eq. (2) with = (β T, σ) T. Model specification is then completed by assuming that the vector of errors ε = (ε 1,, ε n ) T is multivariate normal, ε N (0, Ω), (3) where Ω is a correlation matrix. In Gaussian copula regression the dependence between the variables is modelled with the Gaussian copula so that the joint data cumulative distribution function is given by P r (Y 1 y 1,, Y n y n ) = Φ n (ε 1,, ε n ; P), where ε i = Φ 1 {F(y i x i )}, with Φn( ;P) the n-dimensional multivariate standard

Gaussian copula regression application 1055 normal cumulative distribution function with correlation matrix P. This correlation matrix allows to deal with time series, clustered and longitudinal data. In this paper, the Gaussian copula marginal regression (GCMR) in R package will be used. This package gives implementation of Gaussian copula correlation matrices and it makes the forms of dependence arising handled easily, for example, time series and geometrics. The models can be fitting with the method of maximum likelihood inference based on a variate of the Geweke- Hajivassiliou-Keane (GHK) estimator in the continuous case. It also give a select models using information criteria like AIC. In addition, the log likelihood for inference can be computed and the implementation of residuals analysis to evaluate departures from the model assumptions (see [5]). Autoregressive Function The first step in time series analysis is to determine the form of the model which describes the time series behaviors. This step called identification. The basic tool to determine the model is Autoregressive Function (ACF). Autoregressive Function is a functional relationship between the autocorrelation ρ k between the values of the time series and the time lags k. It has been defined in [7] as ρ k = γ k cor(y t, y t k ) = γ 0 var (y t ) var (y t k ). Partial Autocorrelation Function The Partial Autocorrelation Function (PACF) is the correlation between y t and y t k after omitting the effect of y t,, y t k on this correlation. kk = corr (y t,, y t k y t k+1,, y t 1 ). PACF is a functional relationship between kk and the lags k. Autoregressive Moving Average Autoregressive Moving Average (ARMA) models are mathematical models of the persistence, or autocorrelation, in a time series. ARMA models are widely used in hydrology, dendrochronology, econometrics, and other fields. Modeling can contribute to understanding the physical system by revealing something about the physical process that builds persistence into the series. In [7] introduced the general ARMA(p, q) process as following: {X t } is an ARMA (p,q) process if {X t } is stationary and if for every t, X t 1 X t 1 p X p 1 = Z t + θ 1 Z t 1 + + θ q Z t q, where Z t referred to a white noise (with mean 0 and variance σ 2 ), {Z t }~N(0, σ 2 ), and the polynomials (1 1 z p z p ) and (1 θ 1 z + + θ q z q ) have no common factors; p is the number of the Autoregressive model

1056 Samia A. Adham et al. (AR) parameters and q is the number of the parameters. Moving Average model (MA) 2 Gaussian copula regression application on currency exchange rate This section introduces the application for copula by using the Gaussian copula regression with currency exchange rate of Japanese Yan against American dollar denoted by (JPY/USD) and Korean Won against American dollar denoted by (KRW/USD) in the period 1/1/2010 to 31/8/2015, which was taken from the University of British Columbia, Sauder school of business pacific exchange rate service. First of all we plot the time series for both JPY/USD and KRW/USD separately to detect seasonality, trend and stationarity for both of the time series. It is important to multiply JPY/USD by 10 in order to compare it with KRW/USD. Figure 1 Monthly exchange rate of JPY/USD and KWR/USD One can see from Figure 1 that the time series plot could have a seasonality effect, so it cannot be described by using one of the additive model for time series Autoregressive model (AR), Moving Average model (MA), Autoregressive Moving Average model (ARMA) and Autoregressive integrated Moving Average model (ARIMA) until the seasonality is removed. Now, to study the seasonality which is one of the important component of any time series it should decompose the time series to separate it to its three component which are trend, seasonal and an irregular.

Gaussian copula regression application 1057 Figure 2a The time series decompose for the JYP/USD Figure 2b The time series decompose for the KWR/USD Figure 2a and Figure 2b above show the original time series on the top, the estimated trend component on second from top, the estimated seasonal component on third from top and the estimated irregular component at the bottom. It can be seen that there is a seasonality and the estimated trend component for JYP/USD shows a decrease in 2012 followed by increase from then on to 2015. While in KWR/USD shows a decrease in 2012 and in 2014. Because the JYP/USD and KWR/USD have a seasonality it must be adjust by estimating the seasonal component and subtract the estimated seasonal component from the original time series. By using decompose () function in R package. After the calculation it shows that there is no difference at the time series. After that, the stationarity should be tested. One way of doing this is by applying the Dickey-Fuller test. The null hypothesis is that there is a unit root, δ = 0, which means the differences, y t+1 y t, are stationary. From Table (1) one can see that neither of these time series is significant at the 5% level (p > 0.05). So, the data non-stationarity. Table 1 Calculated p-value of Dickey-Fuller test The time series Test statistic p-value JPY/USD -1.7023 0.6961 KRW/USD -1.3619 0.8343

1058 Samia A. Adham et al. Therefore, it is desirable to have a transformation in order to get a stationarity times. Time series transformation give the benefit to be described using additive model. Log-return, natural log and the difference are the most famous time series transformations. Log return transformation Let x t and x t 1 are two consecutive observations in an exchange rate at time t and t-1, and y t is the log return (the continuously compounded return) is defined as: y t = log(x t ) log(x t 1 ). (4) The benefit of using log returns of series epically financial series is that one can see the changes in the variable. Figure 3 Time series of JPY/USD and KWR/USD after the logreturn transformation From Figure 3 it appears that the time series is stationarity in mean and variance. After satisfying the stationarity model it is possible to use ARMA model as additive model. Since ARMA model include AR and MA parts, it has exactly the same condition of AR(p) and MA(q). To select the ARMA model, which means finding the values of the most suitable values of p and q. It must be explore the Autocorrelation Function (ACF) and Partial Autocorrelation Function (PACF).

Gaussian copula regression application 1059 Figure 4a ACF and PACF for JPY/USD with log return transformation Figure 4b ACF and PACF for KWR/UAD with log return transformation From Figure 4a and Figure 4b it is visible from the ACF that the autocorrelation at lag 2 exceeds the significance bounds unlike the others do not pass the significance bound. And the PACF shows that the partial autocorrelation at lag 1 pass the significance bounds but the other autocorrelations do not overreach the significance bound. Depending on the ACF and PACF lags the possible ARMA models for the time series will be one of the following: ARMA(0,1), ARMA(1,0), ARMA(2,0), ARMA(2,1), ARMA (0,2) and ARMA (1,2). In Table 2 the different ARMA model are calculated to decide which mode is the best based on AIC in AIC = 2loglik(θ ) + 2p (5)

1060 Samia A. Adham et al. with Gaussian copula marginal regression function gcmr() in R package to study the relationship between JPY/USD and KWR/USD. Table 2 ARMA models with the currency exchange rate after log return transformation ARMA AR(1) AR(2) MA(1) MA(2) AIC GCMR ARMA(0,1) - - 0.5075 - -348.15-0.0676 ARMA(1,0) 0.3081 - - - -343.47-0.0152 ARMA(2,0) 0.3792-0.2458-0.1089 0.5079-345.4-0.0120 ARMA(2,1) -0.0669-0.1089 0.5079 - -345.16-0.0595 ARMA(0,2) - - 0.4355-0.1037-346.9-0.0644 ARMA(1,2) -0.1566 0.6098-346.74-0.0677 From Table 2 it is obvious based on AIC the model ARMA(0,1) is the best mode. The Gaussian copula marginal regression made the calculation easier than using the ARMA() function. After that, it is necessary to study the residuals of the GCMR model because it is very important to find autocorrelation in the residuals and if there is some information not accounted by the model. Figure 5 The quantile residual for the ARMA(0,1) model It is clear from Figure 5 the residuals are from a normal distribution and the points tend to fall along the reference line. The ACF plot of the calculated residual shows significant projection at lag 2 in all the different ARMA models. In order to test the autocorrelation, heteroscedasticity and normality three tests are applied. These tests are Box-Pierce test, Ljung-Box test and jarque bera test. The p- value of each test is computed.

Gaussian copula regression application 1061 Table 3 Residual study for the Gaussian copula regression model with log-return transformation Test p-value Box-Pierce 0.6494 Ljung-Box 0.6421 jarque bera LM p-value: 0.513 Table 3 reveals that all the three tests are significant, p-value at lag 1> 0.05. So that, there is a no serial correlation, no heteroscedasticity and the residual is normal. So, the additive model ARMA are a good model for the data. Difference transformation Difference transformation attempt to eliminate the trend term by differencing. One can define the first difference operator by: X t = X t X t 1 = (1 B) X t, where B is called the backward shift operator, B X t = X t 1. This process should be applied repeatedly until the stationarity is obtained. Figure 6 shows the time series after the = 1 and it is clear that the stationarity is satisfied. Figure 6 Monthly exchange rate JPY/USD and KRW/USD using the difference transformation From Figure 6 it is clear that the times series are stationary, and one of the following additive model ARMA and ARIMA can be used.

1062 Samia A. Adham et al. Figure 7a ACF and PACF for JPY/USD with difference transformation Figure 7b ACF and PACF for KWR/UAD with difference transformation From Figure 7a it is visible that ACF at lag 1 is positive and exceeds the significant bound, while the rest of the ACF will be zero after lag 1 but the PACF in Fig. 6a shows all the lags are significant. In Figure 7b it is clear that ACF at lag 2 is positive and exceeds the significant bound, while the rest of the ACF will be zero after lag 2; but in PACF plot shows lag 1 exceed the significant bound and the rest of the PACF will be zero after lag 1. The possible models will be ARMA(1,0), ARMA(0,1), ARMA(0,2), ARMA(2,0), ARMA(2,1) and ARMA(1,2). After using gcmr() function on the data after the difference of JPY/USD versus KRW/USD.

Gaussian copula regression application 1063 Table 4 ARMA models with the currency exchange rate after using difference transformation ARMA AR(1) AR(2) MA(1) MA(2) AIC GCMR ARMA(0,1) - - 0.1768 604.2 0.0436 ARMA(1,0) 0.2341 - - - 603.25 0.0256 ARMA(2,0) 0.1950 0.1683 - - 603.3 0.0202 ARMA(2,1) 0.4309 0.1202-0.2475 604.96 0.0223 ARMA(0,2) - - 0.1602 0.1483 604.56 0.0341 ARMA(1,2) 0.6728 - -0.4561-603.41 0.0234 From Table 4 depending on AIC value, one can tell that the ARMA(1,0) model is the best fit. Because the data at hand is time series, it is necessary to study the ACF and PACF plot of the residuals. It can reveal if there is any autocorrelation in the residuals. Figure 8 The quantile residual for the ARMA(1,0) model for the currency data afte differene transfomation It can be seen from Figure 8 that there is no departure from reference line so it is easy to say that, the residuals are from a normal distribution and ACF plot explain the remaining residuals show that the model has captured the patterns in the currency exchange rate data after the difference transformation very well, although there is a small amount of autocorrelation left in the residuals the significant spike at lag 2. This suggests that the model can be slightly improved by using any other finical transformation models for time series like autoregressive conditional heteroscedasticity (ARCH) or Generalized autoregressive conditional heteroscedasticity (GARCH) model which is famous in financial time series.

1064 Samia A. Adham et al. Table 5 Residual study for the Gaussian copula regression model with log-return transformation Test p-value Box-Pierce 0.754 Ljung-Box 0.7487 jarque bera LM p-value: 0.700 Table 5 revels that the tests for serial correlation, heteroscedasticity are significant. (p-value at lag 1> 0.05). So, there is a no serial correlation, no heteroscedasticity and the residuals are not normal, it is possible to refer this conclusion to the additive model selection. As a financial data it can be use other models. 3. Conclusions 1. Gaussian marginal copula regression applied to currency exchange rate data set it is found that the log-return transformation is better than difference transformation according to AIC which is computed by applying the Gaussian marginal copula regression function in GCMR R package and according to the analysis of the residual. 2. It was found that the GCMR simplifies the application of the time series. References [1] P. Song, Multivariate dispersion models generated from Gaussian copula, Scand. J. Stat., 27 (2000), no. 2, 305 320. https://doi.org/10.1111/1467-9469.00191 [2] G. Masarotto and C. Varin, Gaussian copula marginal regression, Electron. J. Stat., 6 (2012), 1517 1549. https://doi.org/10.1214/12-ejs721 [3] P. X.-K. Song, M. Li, and P. Zhang, Vector Generalized Linear Models: A Gaussian Copula Approach, Chapter in Copulae in Mathematical and Quantitative Finance, Springer Berlin Heidelberg, 2013, 251 276. https://doi.org/10.1007/978-3-642-35407-6_12 [4] B. M. Pitt and R. Kohn, Efficient Bayesian inference for Gaussian copula regression models, Biometrika, 93 (2006), no. 3, 537 554. https://doi.org/10.1093/biomet/93.3.537

Gaussian copula regression application 1065 [5] G. Masarotto and C. Varin, Gaussian copula regression using R, 2015. [6] Sutikno, Gaussian Copula Marginal Regression for Modeling Extreme Data with Application, J. Math. Stat., 10 (2014), no. 2, 192 200. https://doi.org/10.3844/jmssp.2014.192.200 [7] P. J. Brockwell and R. A. Davis, Introduction to Time Series and Forecasting, Springer New York, 2006. https://doi.org/10.1007/b97391 Received: September 6, 2016; Published: October 17, 2016