Empirical properties of large covariance matrices in finance

Similar documents
arxiv: v1 [q-fin.st] 9 Mar 2009

Eigenvector stability: Random Matrix Theory and Financial Applications

Random Correlation Matrices, Top Eigenvalue with Heavy Tails and Financial Applications

Eigenvalue spectra of time-lagged covariance matrices: Possibilities for arbitrage?

Multivariate GARCH models.

Intro VEC and BEKK Example Factor Models Cond Var and Cor Application Ref 4. MGARCH

Stochastic Processes

Vast Volatility Matrix Estimation for High Frequency Data

Principal Component Analysis (PCA) Our starting point consists of T observations from N variables, which will be arranged in an T N matrix R,

Nonlinear Bivariate Comovements of Asset Prices: Theory and Tests

Applications of Random Matrix Theory to Economics, Finance and Political Science

Volatility. Gerald P. Dwyer. February Clemson University

Prior Information: Shrinkage and Black-Litterman. Prof. Daniel P. Palomar

ARANDOM-MATRIX-THEORY-BASEDANALYSISOF STOCKS OF MARKETS FROM DIFFERENT COUNTRIES

Probabilities & Statistics Revision

6. The econometrics of Financial Markets: Empirical Analysis of Financial Time Series. MA6622, Ernesto Mordecki, CityU, HK, 2006.

Regression: Ordinary Least Squares

VAR Model. (k-variate) VAR(p) model (in the Reduced Form): Y t-2. Y t-1 = A + B 1. Y t + B 2. Y t-p. + ε t. + + B p. where:

GARCH Models. Eduardo Rossi University of Pavia. December Rossi GARCH Financial Econometrics / 50

1 Linear Difference Equations

Stat260: Bayesian Modeling and Inference Lecture Date: February 10th, Jeffreys priors. exp 1 ) p 2

Lecture 6: Univariate Volatility Modelling: ARCH and GARCH Models

The Instability of Correlations: Measurement and the Implications for Market Risk

Non white sample covariance matrices.

Thomas J. Fisher. Research Statement. Preliminary Results

Financial Econometrics

Estimation of the Global Minimum Variance Portfolio in High Dimensions

Heteroskedasticity in Time Series

A Non-Parametric Approach of Heteroskedasticity Robust Estimation of Vector-Autoregressive (VAR) Models

Inference in VARs with Conditional Heteroskedasticity of Unknown Form

arxiv:physics/ v3 [physics.data-an] 29 Nov 2006

Introduction to Algorithmic Trading Strategies Lecture 10

Network Connectivity and Systematic Risk

Assessing the dependence of high-dimensional time series via sample autocovariances and correlations

Econophysics III: Financial Correlations and Portfolio Optimization

MFE Financial Econometrics 2018 Final Exam Model Solutions

Time Series Models for Measuring Market Risk

A simple graphical method to explore tail-dependence in stock-return pairs

Speculation and the Bond Market: An Empirical No-arbitrage Framework

Structure in Data. A major objective in data analysis is to identify interesting features or structure in the data.

CHICAGO: A Fast and Accurate Method for Portfolio Risk Calculation

arxiv:physics/ v2 [physics.soc-ph] 8 Dec 2006

INDIAN INSTITUTE OF SCIENCE STOCHASTIC HYDROLOGY. Lecture -33 Course Instructor : Prof. P. P. MUJUMDAR Department of Civil Engg., IISc.

arxiv:cond-mat/ v2 [cond-mat.stat-mech] 3 Feb 2004

EIGENVECTOR OVERLAPS (AN RMT TALK) J.-Ph. Bouchaud (CFM/Imperial/ENS) (Joint work with Romain Allez, Joel Bun & Marc Potters)

1 Data Arrays and Decompositions

9. Multivariate Linear Time Series (II). MA6622, Ernesto Mordecki, CityU, HK, 2006.

Generalized Autoregressive Score Models

R = µ + Bf Arbitrage Pricing Model, APM

Multivariate Time Series Analysis and Its Applications [Tsay (2005), chapter 8]

Estimation of large dimensional sparse covariance matrices

Subsampling Cumulative Covariance Estimator

GARCH Models Estimation and Inference

Multivariate Time Series: VAR(p) Processes and Models

Introduction to Algorithmic Trading Strategies Lecture 3

The Chaotic Marriage of Physics and Finance

Estimating Covariance Using Factorial Hidden Markov Models

Lecture 8: Multivariate GARCH and Conditional Correlation Models

Extracting information from noisy time series data

Statistics 910, #5 1. Regression Methods

Errata for Campbell, Financial Decisions and Markets, 01/02/2019.

Time Series Modeling of Financial Data. Prof. Daniel P. Palomar

Backtesting Marginal Expected Shortfall and Related Systemic Risk Measures

9) Time series econometrics

13. Estimation and Extensions in the ARCH model. MA6622, Ernesto Mordecki, CityU, HK, References for this Lecture:

Analytical derivates of the APARCH model

arxiv: v2 [cond-mat.stat-mech] 16 Jul 2012

Factor Models for Asset Returns. Prof. Daniel P. Palomar

Multivariate Volatility, Dependence and Copulas

ECONOMICS 7200 MODERN TIME SERIES ANALYSIS Econometric Theory and Applications

Research Article Optimal Portfolio Estimation for Dependent Financial Returns with Generalized Empirical Likelihood

Ross (1976) introduced the Arbitrage Pricing Theory (APT) as an alternative to the CAPM.

Linear Algebra and Dirac Notation, Pt. 2

arxiv:physics/ v2 [physics.soc-ph] 18 Jan 2010

Sample Exam Questions for Econometrics

Principal Component Models for Generating Large GARCH Covariance Matrices

235 Final exam review questions

Exponential tail inequalities for eigenvalues of random matrices

VARIANCE COMPUTATION OF MODAL PARAMETER ES- TIMATES FROM UPC SUBSPACE IDENTIFICATION

Prof. Dr. Roland Füss Lecture Series in Applied Econometrics Summer Term Introduction to Time Series Analysis

TESTING FOR CO-INTEGRATION

arxiv: v1 [q-fin.st] 18 Sep 2017

Econometría 2: Análisis de series de Tiempo

Mathematical foundations - linear algebra

Bootstrap tests of multiple inequality restrictions on variance ratios

Identifying Aggregate Liquidity Shocks with Monetary Policy Shocks: An Application using UK Data

Determining and Forecasting High-Frequency Value-at-Risk by Using Lévy Processes

Vector autoregressions, VAR

Factor Models for Multiple Time Series

Motivation Non-linear Rational Expectations The Permanent Income Hypothesis The Log of Gravity Non-linear IV Estimation Summary.

Econ 423 Lecture Notes: Additional Topics in Time Series 1

Cointegrated VAR s. Eduardo Rossi University of Pavia. November Rossi Cointegrated VAR s Financial Econometrics / 56

Gaussian Models (9/9/13)

Cleaning correlation matrices, Random Matrix Theory & HCIZ integrals

New Statistical Model for the Enhancement of Noisy Speech

Multivariate Distributions

Cross-Asset Speculation in Stock Markets

On the Estimation and Application of Max-Stable Processes

Introduction Eigen Values and Eigen Vectors An Application Matrix Calculus Optimal Portfolio. Portfolios. Christopher Ting.

Transcription:

Empirical properties of large covariance matrices in finance Ex: RiskMetrics Group, Geneva Since 2010: Swissquote, Gland December 2009

Covariance and large random matrices Many problems in finance require to estimate the covariance matrix Σ in a given universe of assets. Σ measures the volatilities and co-dependencies between time series. Today portfolios are very large, ranging from N = 100 to 100 000 positions. At first order, prices are random walks. With such large sizes, an approach with random matrices seems appropriate. Is such an approach valid, when information should be present in a financial universe?

Extracting information from prices From a given asset α, the prices p α (t) are the known informations from the trades. The returns (profit and loss), at the scale δt (1 day), are r α (t) = ln(p α (t)/p α (t δt)) The returns are dimentionless, but their magnitudes depend on δt.

The key specificity in finance: heteroskedasticity 100 80 60 40 20 0 20 40 60 80 100 1992 1994 1996 1998 2000 2002 2004 2006 Time series for the annualized returns r[δt] for the FTSE100. The dominant features of the data are heteroscedasticity and fat tails.

What is the correct measure of volatility? Goal: measure the volatility σ(t). Problem: many definitions can be given, with different properties. A process that reproduces the empirical features has to contain a dynamical volatility σ(t). This σ(t) is the volatility in the forthcoming time step. The conditional expectations of quadratic GARCH processes can be computed forecasts for any horizons. The forecasts take the form: σ 2 (t,t + n δt) = λ(i,n) r 2 (t i δt) i=0 λ(i, n) = 1 i The volatility forecast is such that the recent past carry more information than the distant past.

Definition of the covariance The covariance matrix Σ(t) is a cross product of the past return vectors Σ α,β (t,t + n δt) = λ(i,n) r α (t i δt) r β (t i δt) i=0 λ(i, n) = 1 i The mean return can also be subtracted. More general quadratic extensions can be used (see later). The dependency on n is weak n = 1. The covariance Σ(t) has a dynamical structure.

The covariance s weights Common choices for λ(i): Equal weights in a time window: λ(i) = 1/T A convenient definition for rigorous results. Exponential: λ(i) µ i = exp( i δt/τ) A convenient practical definition. Long memory: λ(i) 1 log(i δt)/log(τ 0 ) An optimal definition for forecasts. for i < T

The correlation matrix The correlation matrix ρ(t) is defined by ρ α,β (t) = Σ α,β (t) Σα,α (t) Σ β,β (t) The correlation matrix has simpler properties than the covariance, because it is not depending on the distribution of the volatilities Σ α,α.

Why covariances are so important in finance? The covariance is crucial in three areas in finance: Portfolio allocation: the optimal allocation of assets in a portfolio is depending on the volatilities (diagonal part of Σ) and diversification (off diagonal part of Σ). Risk: market risks are driven by the possible large losses in a portfolio. The profits and losses are driven by the covariance (more precisely: a forecast for the covariance matrix up to the desired risk horizon). Model building and inferences: a dynamical description of the market is done by random processes. For example, a multivariate Bachelier random walk is r α (t) = Σ 1/2 α,β ε β(t) where the innovations ε β (t) are independent random variables.

The problems with large sizes When N > T, the covariance contains zero eigenvalues. The covariance cannot be inverted! Is there actually N 2 (or NT ) information in Σ? Randomness should be a large part of the covariance. Can the information be separated from the noise? Can the dominant structure of Σ be understood in term of a few dominant factors (e.g. stock indexes, industrial sectors,...)? How many factors are needed?

A purely random model for Σ In 1929, Wishart proposed a purely random model for the covariance Σ α,β = 1 T r α (t ) r β (t ) 1 t T with r α (t) independent random variables with unit variance. In finance, the natural structure is a product of random vectors. Marcenko and Pastur (1967) derived the spectral density in the limit N and T, for a fixed ratio q = N/T : 4λq (λ + q 1) 2 ρ(λ) = λ [ (1 q) 2,(1 + q) 2] 2πλq

The data sets for the empirical study ICM dataset: 340 time series covering majors asset classes and world geographical areas (19 commodities, 78 foreign exchange rates, 52 equity indices, 127 interest rates, 54 individual stocks) G10 dataset: 55 time series covering the largest economies (European, Japan, and USA). USA dataset: 54 times series from USA, mostly large stocks with IR and 2 equity indices. For constant weights in a window, T = 260 days. Other memory kernels are evaluated with T = 260 days.

Spectrum For symmetric matrices, the eigen-decomposition is Σ = N α=1 e α v α v α and the eigenvectors v α are orthogonal. Similarly for the correlation matrix. All quantities are time dependent: Σ(t),ρ(t),e α (t),v α (t).

Time evolution of the spectrum (ICM data set) 10 5 Eigenvalues 10 0 10 5 10 3 2000 2001 2002 2003 2004 2005 2006 2007 2008 10 2 Eigenvalues 10 1 10 0 2000 2001 2002 2003 2004 2005 2006 2007 2008

Time evolution of the spectrum The bulk of the spectrum spectrum is very rigid. Only the first few eigenvalues have an interesting dynamics. This goes in the direction of a few meaningful eigenvalues + random spectrum.

Spectral density The spectral density in the interval [λ δλ,λ + δλ] counts the number of eigenvalues in the interval ρ(λ) = 1 2 δλ 1 T T t=1 1 N α χ(λ δλ < e α (t) < λ + δλ). The spectral density ρ(λ) measures the time average density of the eigenvalues around λ. The normalization is defined such that R ρ(λ) dλ = 1.

Spectral density for the correlation (constant weights) 10 2 10 0 spectral density 10 2 10 4 10 6 ICM M P q=1.269 G10 M P q=0.212 USA M P q=0.208 10 3 10 2 10 1 10 0 10 1 10 2 eigenvalue Mean spectral density of the correlation matrix ρ.

Spectral density for the correlation (long memory) 10 2 10 0 spectral density 10 2 10 4 10 6 ICM G10 USA 10 3 10 2 10 1 10 0 10 1 10 2 eigenvalue Mean spectral density of the correlation matrix ρ.

Spectral density for the covariance 10 4 10 2 spectral density 10 0 10 2 10 4 10 6 ICM G10 USA 0.1/eigenvalue 10 4 10 2 10 0 10 2 eigenvalue Mean spectral density of the covariance matrix Σ. The green curve is a simple Ansatz corresponding to e α e aα/n

Remarks on the spectral density The eigenvalues decrease exponentially fast toward zero. Problems arise even when N < T. The empirical density (for the correlation) is not close to the Marcenko-Pastur density. Marcenko-Pastur: N, T, q = N/T fixed. Empirical: N, T fixed So far, the focus is on the spectrum. What about the eigenvectors? The relevant objects to study are projectors.

Projector The projector P k (t) on the leading subspace of rank k is P k = k v α v α α=1 The mean projector is defined by the time average P k = 1 T T t=1 P k (t). The trace is preserved by the average tr P k = k. P k is not a projector as its eigenvalues are between 0 and 1.

Empirical spectrum of the mean projector (ICM data set) Eigenvalues 1 0.8 0.6 0.4 0.2 0 10 0 10 1 10 2 10 3 1 2 3 4 6 8 11 16 23 32 45 64 91 128 181 215 250 260 No clear invariant subspace!

Projector dynamics and fluctuation index The fluctuation index γ essentially measures the difference between P 2 (t) and P(t) 2, with a convenient normalization γ k = tr P 2 k tr ( P k 2) tr( P k ) = 1 1 k tr ( P 2). If P P(t), then P 2 P and γ 0. If the projector dynamics explores fully the available space P k k/n I, then γ k = γ k,max = 1 k/n. This is a dis-order parameter.

Empirical fluctuation index (ICM data set) 1 relative fluctuation index 0.8 0.6 0.4 0.2 0 10 0 10 1 10 2 Relative fluctuation index γ k /γ k,max as function of the projector rank k. Projectors of covariance (red) and correlation (blue).

Lagged correlation For a matrix X(t), the scalar lagged correlation is ρ(τ) = tr (X(t) X )(X(t + τ) X ) tr (X X ) 2. The matrices X(t) can be the covariance or projector of a given rank, or the correlation or projector of a given rank.

Empirical lagged correlation (USA data set, T = 21 days) lagged correlation 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 0 20 40 60 80 100 120 lag [day] lagged correlation 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 0 20 40 60 80 100 120 lag [day] Lagged correlations for the covariance (left) and correlation (right), for full matrix (blue) and the projectors of increasing size (red). This is the signature of heteroskedasticity and its long memory.

Summarizing the projector dynamics Regardless of the rank k, projectors tend to explore a large fraction of the available space. Covariance and correlation matrices have a similar dynamics. An approximation by constant correlations is inappropriate.

Multivariate ARCH processes x(t + δt) = x(t) + r(t + δt) r(t + δt) = Σ(γ,ξ;t) 1/2 ε(t + δt). ) Σ(γ,ξ) = (1 ξ) ((1 γ)σ + γ Σ diag + ξ tr(σ) I N The new terms are quadratic and keep the trace (total variance). For ξ > 0, all eigenvalues are positive. The innovations are iid: E [ ε α (t) ε β (t ) ] = δ α,β δ t, t To estimate the process on historical data requires to compute the innovations: ε(t + δt) = Σ 1/2 (γ,ξ;t) r(t + δt).

Statistical properties of the innovations εα ε β = 0 ε 2 α = 1 whitening quality 14 12 10 8 6 4 2 <stddev> 4 3.5 3 2.5 2 1.5 1 0.5 0 10 5 10 4 10 3 10 2 10 1 10 0 Regularization parameter 0 10 5 10 4 10 3 10 2 10 1 10 0 Regularization parameter Shrinkage parameter: γ = 0.0 (black), 0.05, 0.1, 0.2, 0.4 (blue). Horizontal black line: return correlation. Horizontal red lines: uncorrelated Student innovations.

Understanding the causes of the problem The inability to get white noise innovations is due to the followings. Small eigenvalues corresponds to directions (portfolios) without fluctuations (riskfree) Fluctuations along these directions do occur out-of-sample Large innovations are needed to compensate for the small eigenvalues Regularizing the covariance reduces the innovations...... but washes out the correlation structure Implication for volatility forecasts: for increasing forecast horizon, the historical correlation decreases for a structureless prior.

Conclusions The eigenvalues of the covariance are mostly uninformative, except the first three to ten eigenvalues. The eigenvalues of the correlation are even less informative, but the first fews are not constant. The transition from significant to noisy eigenvalues is gradual (no gap, no cusp) Eigenvectors have informative dynamics deep in the spectrum. No clear invariant subspace corresponding to the main market modes, but dynamic subspaces with regularly decreasing importance across the whole spectrum (no universal factors). Random matrix theory seems to lack results that can be used in this context (N with T fixed, properties of the projectors). The inverse covariance is ill defined, even when N T. A proper regularization should be used to estimate Σ 1. No good multivariate processes.

References The corresponding papers can be downloaded from www.ssrn.com (Social Science Research Network)