CHICAGO: A Fast and Accurate Method for Portfolio Risk Calculation

Similar documents
CHICAGO: A Fast and Accurate Method for Portfolio Risk Calculation

Intro VEC and BEKK Example Factor Models Cond Var and Cor Application Ref 4. MGARCH

Lecture 8: Multivariate GARCH and Conditional Correlation Models

Multivariate Asset Return Prediction with Mixture Models

Financial Times Series. Lecture 12

Time Series Models for Measuring Market Risk

Multivariate Distributions

Multivariate GARCH models.

Time Series Modeling of Financial Data. Prof. Daniel P. Palomar

Efficient estimation of a semiparametric dynamic copula model

GARCH Models. Eduardo Rossi University of Pavia. December Rossi GARCH Financial Econometrics / 50

New Statistical Model for the Enhancement of Noisy Speech

Portfolio Value at Risk Based On Independent Components Analysis

Multivariate Statistics

The Instability of Correlations: Measurement and the Implications for Market Risk

Independent Component Analysis and Its Applications. By Qing Xue, 10/15/2004

Regression: Ordinary Least Squares

SYSM 6303: Quantitative Introduction to Risk and Uncertainty in Business Lecture 4: Fitting Data to Distributions

Robust Backtesting Tests for Value-at-Risk Models

DYNAMIC CONDITIONAL CORRELATIONS FOR ASYMMETRIC PROCESSES

GARCH Models Estimation and Inference

MULTICHANNEL SIGNAL PROCESSING USING SPATIAL RANK COVARIANCE MATRICES

Introduction to ARMA and GARCH processes

Volatility. Gerald P. Dwyer. February Clemson University

Multivariate Volatility, Dependence and Copulas

Empirical properties of large covariance matrices in finance

Gaussian kernel GARCH models

1 Description of variables

Risk management with the multivariate generalized hyperbolic distribution, calibrated by the multi-cycle EM algorithm

Heteroskedasticity in Time Series

Vector autoregressions, VAR

Bayesian Semiparametric GARCH Models

Multivariate Normal-Laplace Distribution and Processes

Bayesian Semiparametric GARCH Models

Econ 423 Lecture Notes: Additional Topics in Time Series 1

1 Phelix spot and futures returns: descriptive statistics

MFE Financial Econometrics 2018 Final Exam Model Solutions

Time Series Copulas for Heteroskedastic Data

Factor Model Risk Analysis

TAKEHOME FINAL EXAM e iω e 2iω e iω e 2iω

Bayesian inference for the mixed conditional heteroskedasticity model

Estimation of Vector Error Correction Model with GARCH Errors. Koichi Maekawa Hiroshima University of Economics

Independent Component (IC) Models: New Extensions of the Multinormal Model

The Slow Convergence of OLS Estimators of α, β and Portfolio. β and Portfolio Weights under Long Memory Stochastic Volatility

Lecture 2: Univariate Time Series

Robust Performance Hypothesis Testing with the Sharpe Ratio

Testing the Constancy of Conditional Correlations in Multivariate GARCH-type Models (Extended Version with Appendix)

Partially Censored Posterior for Robust and Efficient Risk Evaluation.

Normal Probability Plot Probability Probability

Backtesting Marginal Expected Shortfall and Related Systemic Risk Measures

Assessing the VaR of a portfolio using D-vine copula based multivariate GARCH models

Dependence. Practitioner Course: Portfolio Optimization. John Dodson. September 10, Dependence. John Dodson. Outline.

Financial Econometrics and Quantitative Risk Managenent Return Properties

Independent component analysis for functional data

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

CIFAR Lectures: Non-Gaussian statistics and natural images

Lecture 11: Regression Methods I (Linear Regression)

Statistics 910, #5 1. Regression Methods

Review (Probability & Linear Algebra)

July 31, 2009 / Ben Kedem Symposium

12 - Nonparametric Density Estimation

Generalized Autoregressive Score Models

Dependence. MFM Practitioner Module: Risk & Asset Allocation. John Dodson. September 11, Dependence. John Dodson. Outline.

Lecture 6: Univariate Volatility Modelling: ARCH and GARCH Models

Marginal Specifications and a Gaussian Copula Estimation

UNIVERSITÀ DEGLI STUDI DI PADOVA. Dipartimento di Scienze Economiche Marco Fanno

Multivariate Rotated ARCH Models

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang

A Complete Spatial Downscaler

Factor Models for Asset Returns. Prof. Daniel P. Palomar

A Probability Review

Quantitative Methods in High-Frequency Financial Econometrics:Modeling Univariate and Multivariate Time Series

Foundations of Nonparametric Bayesian Methods

Empirical likelihood and self-weighting approach for hypothesis testing of infinite variance processes and its applications

Stock index returns density prediction using GARCH models: Frequentist or Bayesian estimation?

Flexible Covariance-Targeting Volatility Models Using Rotated Returns

REGRESSION WITH SPATIALLY MISALIGNED DATA. Lisa Madsen Oregon State University David Ruppert Cornell University

Introduction to Algorithmic Trading Strategies Lecture 10

Multiple Random Variables

Lecture 11: Regression Methods I (Linear Regression)

Diagnostic Test for GARCH Models Based on Absolute Residual Autocorrelations

Assessing financial model risk

End-Semester Examination MA 373 : Statistical Analysis on Financial Data

Solutions of the Financial Risk Management Examination

Economics 536 Lecture 7. Introduction to Specification Testing in Dynamic Econometric Models

A Goodness-of-fit Test for Copulas

Determining and Forecasting High-Frequency Value-at-Risk by Using Lévy Processes

Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics. Jiti Gao

Asymptotic distribution of the sample average value-at-risk

Time Varying Hierarchical Archimedean Copulae (HALOC)

13. Estimation and Extensions in the ARCH model. MA6622, Ernesto Mordecki, CityU, HK, References for this Lecture:

Approximating the Covariance Matrix with Low-rank Perturbations

ABC methods for phase-type distributions with applications in insurance risk problems

Gatsby Theoretical Neuroscience Lectures: Non-Gaussian statistics and natural images Parts I-II

An Introduction to Independent Components Analysis (ICA)

NBER WORKING PAPER SERIES THEORETICAL AND EMPIRICAL PROPERTIES OF DYNAMIC CONDITIONAL CORRELATION MULTIVARIATE GARCH. Robert F. Engle Kevin Sheppard

Bias-Correction in Vector Autoregressive Models: A Simulation Study

MEASURING THE CORRELATION OF SHOCKS BETWEEN THE UK AND THE CORE OF EUROPE

Estimation of large dimensional sparse covariance matrices

The Linear Regression Model

Transcription:

CHICAGO: A Fast and Accurate Method for Portfolio Risk Calculation University of Zürich April 28

Motivation Aim: Forecast the Value at Risk of a portfolio of d assets, i.e., the quantiles of R t = b r t, r t = (r i,1,..., r d,t ). Problem: Assets not independent, GARCH effects, heavy tails. Naive approach: Fit a univariate GARCH-type model to the time series of R t. Problem: need to re-estimate the model for each b. Solution: Multivariate (M-) GARCH models. But: Many parameters ( curse of dimensionality ), computationally demanding. We show how Conditional Heteroscedasticity-based Independent Component Analysis can be used to estimate a GO-GARCH model. Using Generalized Hyperbolic innovations and a saddlepoint approximation, we obtain fast and accurate portfolio VaR forecasts.

Outline 1 The GO-GARCH Model 2 3 4 5 6

Outline 1 The GO-GARCH Model 2 3 4 5 6

MGARCH-Models General Setup: r t = µ t + u t u t = Σ 1/2 t ɛ t ɛ t i.i.d.(, I) Most general rendition: VEC(p,q) (Bollerslev, Engle and Wooldridge, 1988): vech Σ t = c + O(d 4 ) parameters (!) p B vech Σ t p + i=1 q A vech u t i u t i i=1

The GO-Garch Model Need to imply restrictions to ensure Σ t >, and reduce number of parameters BEKK (Engel and Kroner, 1995), CCC (Bollerslev, 199), DCC (Engle, 21),... The GO-GARCH model (van der Weide, 22): innovations vector modelled as linear combination of d unobserved factors f t : u t = Af t, for some constant, invertible mixing matrix A. The unobserved factors are assumed to be independent of each other, and to have unit variance (identifying restriction).

The GO-Garch Model (2) A polar decomposition uniquely factorizes the mixing matrix A into a symmetric positive definite matrix Σ 1/2 and a rotation matrix U: where A = Σ 1/2 U, AA = Σ 1/2 UU Σ 1/2 = Σ is the unconditional covariance matrix. The d(d + 1)/2 free parameters in Σ 1/2 can be consistently estimated from the (unconditional) sample covariance. Estimating the d(d 1)/2 parameters in U requires conditional information, because any orthogonal matrix U gives rise to the same unconditional covariance matrix Σ (because U U = I).

The GO-Garch Model (3) As ( U is a rotation matrix, it can be decomposed as the product of d ) 2 basic rotation matrices Ri (θ i ), where each R i is a rotation of angle θ i in the plane spanned by one pair of axes in R d. For example, in the d = 2 case, [ cos θ U = sin θ sin θ cos θ Thus, U can be fully parameterized in terms of the Euler angles θ i. ].

Related Literature Hyvärinen, Karhunen and Oja (21): monograph on ICA Fan, Wang and Yao (25): Conditionally Uncorrelated Components Chen, Härdle and Spokoiny (26): ICA in a nonparametric framework Boswijk and van der Weide (26): 2-Step estimation of GO-GARCH model

Outline 1 The GO-GARCH Model 2 3 4 5 6

Estimation Estimation by ML is difficult, especially under non-normality. We would like to have a two-step approach (as in, e.g., the CCC and DCC models): Estimate A = Σ 1/2 U first, then fit d univariate GARCH models. Σ 1/2 can be estimated from unconditional information. Thus, consider the demeaned and whitened data z: z t = ˆΣ 1/2 û t, where û t (r t ˆµ t ) and ˆΣ (û t û t)/t. The goal is then to estimate U. We propose to use Independent Component Analysis.

Independent Component Analysis ICA is a relatively young technique used in signal processing. Assume we observe a d dimensional random vector u t. The u it are linear combinations of d independent random variables f it : u t = Af t. Only u t is observed. Aim: estimate A, i.e., find a matrix A such that the linear combinations y t = A 1 u t are independent. Naive approach of taking A = E [ (u t E[u t ])(u t E[u t ]) ] 1/2 yields uncorrelated components, but yields independent components only up to an orthogonal transformation: With U orthogonal, E[U y t y tu ] = U U = I, i.e., U ast y is also uncorrelated have to use more information Whitening is still useful, need to consider only orthogonal matrices.

Using Additional Information Which additional information to use depends on the problem at hand. Example: If the data are non-gaussian, that information can be exploited. Central Limit Theorem: Sums of independent r.v.s are more gaussian. The ICs are those linear combinations of the data that are least gaussian, as measured by, e.g., excess kurtosis. For financial data, it suggests itself to use the GARCH-effects as extra information: the sum of two (or more) garchy series will typically be less garchy. Thus, the independent components are those linear combinations of the data that have the most pronounced GARCH-effects, as measured by, e.g., autocorrelation of the squared series.

ICA using Variance Nonstationarity The algorithm of Hyvärinen, Karhunen and Oja (21, p.349) achieves this with cubic convergence. Given prewhitened data z t, the algorithm starts with U n = I and iterates U temp = z[z U n z U n z U n ]/T + z [z U n z U n z U n ]/T 2U n 4 CU n D n U n+1 = (U temp U temp) 1/2 U temp where z = [z 2,..., z T ], z = [z 1,..., z T 1 ], C = (zz + z z )/(2T ), D n = diag ( vecd ( U n CU n )), and denotes the Hadamard product. The iteration stops when 1 c < ɛ, where c is the minimum over the absolute values of the diagonal elements of U n+1 U n, and ɛ is a suitable convergence threshold (we use 1 12 ). In the rare cases that the algorithm fails to converge, one may fall back to, e.g., the kurtosis-based FastICA algorithm.

ICA Example 1 2 1 1 4 3 2 1 4 2 2 5 1 5 1 2 5 1 2 1 1 4 2 4 2 2 5 1 2 5 1 2 5 1 2 1 1 4 2 4 2 2 5 1 2 5 1 2 5 1

ICA Example 2 1 5 5 5 5 2 1 1 5 1 1 5 1 1 5 1 2 1 1 2 1 2 1 1 2 5 1 1 5 1 2 5 1 1 5 5 2 1 5 5 1 5 1 1 5 1 1 5 1

Alternative Method Boswijk and van der Weide (26) derive an estimator Û BW as the eigenvector matrix of ˆB, where 1 ˆB = argmin B:B=B T T t=1 which has to be solved numerically. ( [zt tr z t I d B ( z t 1 z ) ] ) 2 t 1 I d B, We compare the performance of the two methods with a small simulation study. We use d = 2, so only one parameter to estimate (rotation angle θ). T = 8, data are Normal-GARCH(1,1), with a 1 =.9, b 1 =.9, a 2 =.4, b 2 =.95,Σ = I.

Comparison of Estimators.45.4.35 RMSE ICA BW MLE.15.1 BIAS ICA BW MLE.3.5 RMSE.25.2 BIAS.15.5.1.5.1.2.3.4.5.6.7 θ.2.3.4.5.6.7 θ Figure: Performance Comparison of MLE, ICA, and Boswijk and van der Weide (BW). Average computation time for each of the 1 replications: 8.87 sec for MLE, 1.68 for BW, and.3 sec for ICA, 297 and 56 times faster, respectively.

Outline 1 The GO-GARCH Model 2 3 4 5 6

The Model Our aim is to obtain VaR forecasts for R t+1 = b Af t+1. Full specification of the model requires us to specify 1 the univariate factor dynamics and 2 the innovations distribution. Our two-step procedure allows us to keep these separate from the covariance specification.

Specification of IC Dynamics We model the ith IC as f i,t = σ it Z it. To capture the evolution of the scale parameters σ i,t, we use the A-PARCH model proposed by Ding, Granger and Engle (1993), given by σ δ i it = c i + r c ij ( f i,t j γ ij f i,t j ) δ i + j=1 s j=1 d ij σ δ i i,t j where z i,t = ɛ i,t σ i,t, c i,j >, d i,j, δ i >, and γ i,j < 1. We set r i = s i = 1 and δ i = 1, which was shown in Mittnik and Paolella (2) to produce very accurate VaR forecasts for fx data.

The Innovations Distribution We model the innovations as Generalized Hyperbolic: Z i,t GHyp(λ i, ω i, ρ i, σ i, µ i ), i.e., with λ R, ω >, ρ < 1, µ R and δ >, their density is f X (x; λ, ω, ρ, σ, µ) = ω λ ȳ λ 1 2 2πᾱ λ 1 2 σk λ (ω) K λ 1 2 (ᾱȳ) eρᾱz, where z x µ σ, ᾱ ω(1 ρ2 ) 1/2, ȳ 1 + z 2, and K ν (x) is the the modified Bessel function of the third kind with index ν, defined as K ν (x) = 1 2 t ν 1 e 1 2 x(t+t 1) dt. We standardize the generalized hyperbolic to have zero mean and unit variance and denote the standardized distribution as SGH:

Remarks GHyp is an extremely flexible distribution: It nests, among others, the Laplace, Student s t, Normal,... It possesses semi-heavy tails, i.e., its log-density decays roughly linearly, a common feature in financial data. δ and µ are location and scale parameters, respectively. ρ is a skewness parameter. ω dictates the tail-heaviness. The shape parameter λ is notoriously difficult to estimate. Solution: consider special cases 1 λ = 1 (Normal Inverse Gaussian (NIG)) 2 2 λ = 1 (Hyperbolic)

Outline 1 The GO-GARCH Model 2 3 4 5 6

CDF of GHyp Convolutions In order to compute VaR, we require the (conditional) inverse cdf of R t+1 = b Af t+1, i.e., a weighted sum of independent GHyp random variables. Simulation is one straightforward possibility, but because tail values are required, an extremely large number of replications will be required to obtain reasonable and reliable accuracy. Numerical inversion of the cf is another option, but the requisite integrand is oscillatory, rendering numerical quadrature difficult. We therefore propose use of a Saddlepoint Approximation.

Saddlepoint Approximations: Introduction The SPA can be thought of as 1 approximate inversion of the characteristic function, but without requiring integration 2 an Edgeworth expansion, but vastly more accurate and without the problems associated with the latter, such as negative values of the density and poor accuracy in the tails. Its accuracy for d > 1 will be similar, if not higher, than for the d = 1 case, because, as assets are summed, a central limit effect takes place, drawing the distribution of S closer to normality for which the SPA is exact.

PDF Saddlepoint Approximation The saddlepoint approximation to the density is given by 1 ˆf X (x) = 2π K X (ˆt ) exp { K X (ˆt ) xˆt }, x = K X(ˆt ), where ˆt = ˆt (x) is the solution to the saddlepoint equation and is referred to as the saddlepoint at x. Remarks 1 The saddlepoint equation must be solved (numerically) anew for each value of x. 2 The approximate pdf will not, in general, integrate to one, but can be renormalized. 3 SPA requires derivatives of the cgf see above.

CDF Saddlepoint Approximation The approximate cdf of X could be obtained by numerically integrating ˆf. In a celebrated paper, Lugannani and Rice (198) derived a simple expression for the SPA to the cdf, given for continuous r.v.s by ˆF X (x) = Pr (X < x) = Φ (ŵ) + φ (ŵ) { 1 ŵ 1 û }, x E [X ], where Φ and φ are the cdf and pdf of the standard normal distribution, respectively, ŵ = sgn (ˆt ) 2ˆtx 2K X (ˆt ) and û = ˆt K X(ˆt ). Application to sums of Ghyp is straightforward; requires only derivatives of the cgf, which is known. In the special case of the NIG distribution, the SPA is in closed form (!)

Illustration of Accuracy Density Function.45.4.35.3.25.2.15.1.5 Exact SPA RPE.5.5 1 5 4 3 2 1 1 2 x 5 4 3 2 1 1 2 x Figure: Left: The exact pdf (solid) and renormalized spa (dashed) for a GHyp with λ = 3, ω = 8, ρ = 1/3, σ = 1 and µ =. Right: The relative percentage error (RPE) of the cdf saddlepoint approximation

Illustration of Accuracy (2) 4 Quantiles of GHyp 3 QQ Plot 3 2 2 x q 1 1 True Quantiles 1 1 2 3 2 4.2.4.6.8 1 q 3 3 2 1 1 2 3 SPA Quantiles Figure: Quantiles of X 1 + X 2, X i GHyp(λ, ω i, ρ i, δ i, µ i ), λ =.5, ω 1 = 1.93, ω 2 =.9, ρ 1 =.2, ρ 2 =.6, δ 1 = 1.22, δ 1 =.81, µ 1 =.6, µ 2 =.3. Exact numbers (solid) took 8s to compute, SPA (dotted) 1s.

Outline 1 The GO-GARCH Model 2 3 4 5 6

Application to VaR Forecasting (1) d = 1 dimensional time series of Dow Jones stock returns (3M, Alcoa, Altria, American Express, American International Group, AT&T, Boeing, Caterpillar, Citigroup, Coca-Cola) Daily returns (9/23/92 to 3/23/7), T = 3, 29. Equally weighted portfolio: b i = 1/1. Using a moving estimation window of 1, observations, we compute 1-day ahead VaR forecasts.

Application to VaR Forecasting (2) 8 NIG Innovations 6 4 2 R t 2 4 6 8 1 15 2 25 3 t Figure: Returns, 1-day-ahead 1% and 5% VaR forecasts, and VaR violations, using NIG innovations. Empirical VaR:.95% and 3.85%

Application to VaR Forecasting (3).5 NIG.5 NIG.45.45.4.4.35.35.3.3 f(f t ).25 f(f t ).25.2.2.15.15.1.1.5.5 6 4 2 2 4 6 6 4 2 2 4 6 f t f t Figure: Kernel density (solid) of filtered residuals and fitted NIG densities (dashes) for two ICs.

Application to VaR Forecasting (4) 8 Hyperbolic Innovations 6 4 2 R t 2 4 6 8 1 15 2 25 3 t Figure: Returns, 1-day-ahead 1% and 5% VaR forecasts, and VaR violations, using hyperbolic innovations. Empirical VaR:.91% and 3.4%

Application to VaR Forecasting (5).5 Hyperbolic.5 Hyperbolic.45.45.4.4.35.35.3.3 f(f t ).25 f(f t ).25.2.2.15.15.1.1.5.5 6 4 2 2 4 6 6 4 2 2 4 6 f t f t Figure: Kernel density (solid) of filtered residuals and fitted hyperbolic densities (dashes) for two ICs.

Outline 1 The GO-GARCH Model 2 3 4 5 6

Conclusions CHICAGO method is capable of producing fast and accurate portfolio VaR forecasts. Modular approach: Estimation of factor loadings independent of specification of IC dynamics and distribution of innovations. After parameter estimates have been computed, SPA allows VaR forecasts to be evaluated extremely quickly for different portfolio weights, while maintaining outstanding accuracy, and thus allows the procedure to be used in, e.g., real-time portfolio optimization.

Future Research Use of Shrinkage / Bayesian estimators for µ t and Σ Use of weighted ICA to account for parameter changes / misspecification Use of marginal models with better predictive power (mixture models with pseudo long memory, better leverage effect modelling, and time-varying skewness and kurtosis) Use of several variations of CHICAGO and other easily estimated models for use with optimal combinations of density forecasts