Random Correlation Matrices, Top Eigenvalue with Heavy Tails and Financial Applications

Similar documents
Eigenvector stability: Random Matrix Theory and Financial Applications

Cleaning correlation matrices, Random Matrix Theory & HCIZ integrals

arxiv: v1 [q-fin.st] 3 Oct 2007

Eigenvalue spectra of time-lagged covariance matrices: Possibilities for arbitrage?

Empirical properties of large covariance matrices in finance

Random Matrix Theory Lecture 1 Introduction, Ensembles and Basic Laws. Symeon Chatzinotas February 11, 2013 Luxembourg

EIGENVECTOR OVERLAPS (AN RMT TALK) J.-Ph. Bouchaud (CFM/Imperial/ENS) (Joint work with Romain Allez, Joel Bun & Marc Potters)

Large dimension forecasting models and random singular value spectra

Estimation of the Global Minimum Variance Portfolio in High Dimensions

Non white sample covariance matrices.

Random matrices: Distribution of the least singular value (via Property Testing)

Exponential tail inequalities for eigenvalues of random matrices

Large sample covariance matrices and the T 2 statistic

On the distinguishability of random quantum states

Random Matrix: From Wigner to Quantum Chaos

Concentration Inequalities for Random Matrices

Applications of Random Matrix Theory to Economics, Finance and Political Science

1 Tridiagonal matrices

arxiv:cond-mat/ v2 [cond-mat.stat-mech] 3 Feb 2004

arxiv: v2 [cond-mat.stat-mech] 16 Jul 2012

Large deviations of the top eigenvalue of random matrices and applications in statistical physics

1 Intro to RMT (Gene)

Prior Information: Shrinkage and Black-Litterman. Prof. Daniel P. Palomar

Quantum chaos in composite systems

THE EIGENVALUES AND EIGENVECTORS OF FINITE, LOW RANK PERTURBATIONS OF LARGE RANDOM MATRICES

Fluctuations from the Semicircle Law Lecture 4

Markov operators, classical orthogonal polynomial ensembles, and random matrices

Assessing the dependence of high-dimensional time series via sample autocovariances and correlations

Comparison Method in Random Matrix Theory

Painlevé Representations for Distribution Functions for Next-Largest, Next-Next-Largest, etc., Eigenvalues of GOE, GUE and GSE

Random Matrices and Multivariate Statistical Analysis

Optimality and Sub-optimality of Principal Component Analysis for Spiked Random Matrices

Bulk scaling limits, open questions

Fluctuations of the free energy of spherical spin glass

Potentially useful reading Sakurai and Napolitano, sections 1.5 (Rotation), Schumacher & Westmoreland chapter 2

Multivariate Distributions

Estimation of large dimensional sparse covariance matrices

The largest eigenvalues of the sample covariance matrix. in the heavy-tail case

Insights into Large Complex Systems via Random Matrix Theory

MIMO Capacities : Eigenvalue Computation through Representation Theory

Singular value decomposition (SVD) of large random matrices. India, 2010

OPTIMAL DETECTION OF SPARSE PRINCIPAL COMPONENTS. Philippe Rigollet (joint with Quentin Berthet)

Lecture 2: Linear Algebra Review

Random Matrix Theory and its Applications to Econometrics

REPRESENTATION THEORY WEEK 7

arxiv: v1 [math.pr] 30 Oct 2017

TOP EIGENVALUE OF CAUCHY RANDOM MATRICES

arxiv:physics/ v2 [physics.soc-ph] 8 Dec 2006

Principal Component Analysis (PCA) Our starting point consists of T observations from N variables, which will be arranged in an T N matrix R,

Preface to the Second Edition...vii Preface to the First Edition... ix

Linear Regression. In this problem sheet, we consider the problem of linear regression with p predictors and one intercept,

Correlation Matrices and the Perron-Frobenius Theorem

Stochastic Volatility and Correction to the Heat Equation

A complete criterion for convex-gaussian states detection

Background Mathematics (2/2) 1. David Barber

The Matrix Dyson Equation in random matrix theory

On corrections of classical multivariate tests for high-dimensional data. Jian-feng. Yao Université de Rennes 1, IRMAR

Numerical Solutions to the General Marcenko Pastur Equation

The Slow Convergence of OLS Estimators of α, β and Portfolio. β and Portfolio Weights under Long Memory Stochastic Volatility

Universality of distribution functions in random matrix theory Arno Kuijlaars Katholieke Universiteit Leuven, Belgium

1. Matrix multiplication and Pauli Matrices: Pauli matrices are the 2 2 matrices. 1 0 i 0. 0 i

arxiv: v5 [math.na] 16 Nov 2017

Solutions of the Financial Risk Management Examination

Affine Processes. Econometric specifications. Eduardo Rossi. University of Pavia. March 17, 2009

Free Probability and Random Matrices: from isomorphisms to universality

The circular law. Lewis Memorial Lecture / DIMACS minicourse March 19, Terence Tao (UCLA)

Universal phenomena in random systems

Universality in Numerical Computations with Random Data. Case Studies.

forms Christopher Engström November 14, 2014 MAA704: Matrix factorization and canonical forms Matrix properties Matrix factorization Canonical forms

Methods for sparse analysis of high-dimensional data, II

Concentration inequalities: basics and some new challenges

arxiv: v2 [math.pr] 16 Aug 2014

SPECTRAL THEOREM FOR COMPACT SELF-ADJOINT OPERATORS

Invertibility of random matrices

arxiv:cond-mat/ v1 [cond-mat.stat-mech] 1 Aug 2001

On corrections of classical multivariate tests for high-dimensional data

Advances in Random Matrix Theory: Let there be tools

On the principal components of sample covariance matrices

APPENDIX A. Background Mathematics. A.1 Linear Algebra. Vector algebra. Let x denote the n-dimensional column vector with components x 1 x 2.

Probabilistic & Unsupervised Learning

Maximal height of non-intersecting Brownian motions

Determinantal point processes and random matrix theory in a nutshell

Page 712. Lecture 42: Rotations and Orbital Angular Momentum in Two Dimensions Date Revised: 2009/02/04 Date Given: 2009/02/04

Random matrices and free probability. Habilitation Thesis (english version) Florent Benaych-Georges

PCA with random noise. Van Ha Vu. Department of Mathematics Yale University

Wigner s semicircle law

Introduction to Algorithmic Trading Strategies Lecture 10

On the concentration of eigenvalues of random symmetric matrices

Fundamentals of Matrices

Eigenvalue variance bounds for Wigner and covariance random matrices

Random Toeplitz Matrices

Signal and noise in financial correlation matrices

Econophysics III: Financial Correlations and Portfolio Optimization

L3: Review of linear algebra and MATLAB

The eigenvalues and eigenvectors of finite, low rank perturbations of large random matrices

Universality of local spectral statistics of random matrices

Methods for sparse analysis of high-dimensional data, II

Fluctuation results in some positive temperature directed polymer models

Singular Value Decomposition and Principal Component Analysis (PCA) I

LECTURE NOTE #11 PROF. ALAN YUILLE

Transcription:

Random Correlation Matrices, Top Eigenvalue with Heavy Tails and Financial Applications J.P Bouchaud with: M. Potters, G. Biroli, L. Laloux, M. A. Miceli http://www.cfm.fr

Portfolio theory: Basics Portfolio weights w i Risk: variance of the portfolio returns R 2 = ij w i σ i C ij σ j w j where σi 2 matrix. is the variance of asset i and C ij is the correlation If predicted gains are g i then the expected gain of the portfolio is G = w i g i.

Empirical Correlation Matrix Large set of Assets (N) and (comparable) set of data points (T ) Empirical Variance σi 2 = 1 T relative square-error is (2 + κ)/t ( ) X t 2 i t Empirical Equal-Time Correlation Matrix E ij = 1 T t X t i Xt j σ i σ j order N 2 quantities estimated with NT datapoints. If T < N E has rank T < N, not even invertible.

Markowitz Optimization Find the portfolio with maximum expected return for a given risk or equivalently, minimum risk for a given return (G) In matrix notation: w C = G C 1 g g T C 1 g Where all returns are measured with respect to the risk-free rate and σ i = 1 (absorbed in g i ). Non-linear problem: i w i A a spin-glass problem!

Risk of Optimized Portfolios Let E be an noisy estimator of C such that E = C In-sample risk R 2 in = wt E Ew E = G2 g T E 1 g True minimal risk R 2 true = wt C Cw C = G2 g T C 1 g Out-of-sample risk R 2 out = wt E Cw E = G2 g T E 1 CE 1 g (g T E 1 g) 2

Risk of Optimized Portfolios Using convexity arguments, and for large matrices: R 2 in R2 true R2 out Importance of eigenvalue cleaning: w i kj λ 1 k V i k V j k g j = g i + kj (λ 1 k 1)V k i V k j g j Eigenvectors with λ > 1 are suppressed, Eigenvectors with λ < 1 are enhanced. Potentially very large weight on small eigenvalues. Must determine which eigenvalues to keep and which one to correct

Spectrum of Wishart Ensemble Consider an Empirical Correlation Matrix of N assets using T data points both very large with n = N/T finite. E ij = 1 T T k=1 X k i Xk j where X k i Xl j = C ijδ kl We need to find the trace of the resolvent or Stieljes transform: G(z) = 1 N Tr [ (zi E) 1] ρ(λ) = lim ɛ 0 1 I (G(λ iɛ)). π

Null hypothesis C = I E ij is a sum of (rotationally invariant) matrices E k ij = (Xk i Xk j )/T Free random matrix theory: Find the additive R-transform R(x) = B(x) 1/x; B(G(z)) = z) G k (z) = 1 ( 1 N z n + N 1 ) z defining n = N/T, inverting G k (z) to first order in 1/N, R k (x) = 1 T (1 nx) by additivity R E (x) = G E (z) = (z + n 1) (z + n 1) 2 4zn 2zn 1 (1 nx)

Null hypothesis C = I ρ(λ) = 4λn (λ + n 1) 2 2πλn λ [(1 n) 2, (1 + n) 2 ] Marcenko-Pastur (1967) (and many rediscoveries) Any eigenvalue beyond the Marcenko-Pastur band can be deemed to contain some information (but see below)

General C Case The general case for C cannot be directly written as a sum of Blue functions. Solution using different techniques (replicas, diagrams, S- transform: zg E (z) = ZG C (Z) where Z = z 1 + n(zg E (z) 1) For stocks, one large eigenvalue the market and several sectors

Empirical Correlation Matrix 1.0 Empirical data Marcenko Pastur Dressed Power Law "True" eigenvalues ρ(λ) 0.5 0.0 0 1 2 3 4 λ

Matrix Cleaning 150 Return 100 50 Raw in-sample Cleaned in-sample Cleaned out-of-sample Raw out-of-sample 0 0 10 20 30 Risk

General Correlation matrices Non equal time correlation matrices E τ ij = 1 T t X t i Xt+τ j σ i σ j N N but not symmetrical: leader-lagger relations General rectangular correlation matrices G αi = 1 T T Yα t Xt i t=1 N input factors X; M output factors Y Example: Yα t = Xj t+τ, N = M

Singular values and relevant correlations Singular values: Square root of the non zero eigenvalues of GG T or G T G, with associated eigenvectors u k α and vk i 1 s 1 > s 2 >...s (M,N) 0 Interpretation: k = 1: best linear combination of input variables with weights vi 1, to optimally predict the linear combination of output variables with weights u 1 α, with a crosscorrelation = s 1. s 1 : measure of the predictive power of the set of Xs with respect to Y s Other singular values: orthogonal, less predictive, linear combinations

Benchmark: no cross-correlations Null hypothesis: No correlations between Xs and Y s G = 0 But arbitrary correlations among Xs, C X, and Y s, C Y, are possible Consider exact normalized principal components for the sample variables Xs and Y s: and define Ĝ = Ŷ ˆX T. ˆX t i = 1 λi j U ij X t j ; Ŷ t α =...

Benchmark: no cross-correlations Tricks: Non zero eigenvalues of ĜĜ T are the same as those of ˆX T ˆXŶ T Ŷ A = ˆX T ˆX and B = Ŷ T Ŷ are mutually free, with n (m) eigenvalues equal to 1 and 1 n (1 m) equal to 0 S-transforms are multiplicative

Technicalities η A (y) = 1 n + η A (y) 1 T Tr 1 1 + ya. Σ A (x) 1 + x x η 1 A (1 + x). n 1 + y, η B(y) = 1 m + m 1 + y. Σ GG (x) = Σ A (x)σ B (x) = (1 + x) 2 (x + n)(x + m).

Benchmark: Random SVD Final result:([ll,mam,mp,jpb]) ρ(s) = (1 n, 1 m) + δ(s)+(m+n 1) + δ(s 1)+ with γ ± = n + m 2mn ± 2 (s 2 γ )(γ + s 2 ) πs(1 s 2 ) mn(1 n)(1 m), 0 γ ± 1 Analogue of the Marcenko-Pastur result for rectangular correlation matrices Many applications; finance, econometrics ( large models), genomics, etc.

Benchmark: Random SVD Simple cases: n = m, s [0, 2 n(1 n)] n, m 0, s [ m n, m + n] m = 1, s 1 n m 0, s n

RSVD: Numerical illustration 0.8 0.6 m=0.4, n=0.2 m=n=0.3 m=0.7, n=1 m=0.3 m=n=0.8 ρ(s) 0.4 0.2 0 0 0.5 1 s

RSVD: Numerical illustration 0.3 Theory (MP 2 ) Theory (Standardized) Simulation (Random SVD) ρ(s) 0.2 0.1 0 0 0.2 0.4 0.6 0.8 1 s

Inflation vs. Economic indicators 1 N = 50, M = 16, 0.8 P(s <s) 0.6 0.4 Data Benchmark 0.2 0 0.2 0.4 0.6 0.8 1 s T = 265.

Statistics of the Top Eigenvalue All previous results are true when N, M, T with fixed n, m How far is the top eigenvalue expected to leak out at finite N? Precise answer when matrix elements are iid Gaussian: Tracy- Widom statistics Width of the smoothed edge: N 2/3 Relation with the directed polymer problem + many others

Statistics of the Top Eigenvalue Exceptions Strong Rank One Perturbation emergence of an isolated eigenvalue with Gaussian, N 1/2 fluctuations (Baik, Ben-Arous, Péché) E.g.: E ij E ij +ρ(1 δ ij ) leads to a market mode λ max Nρ Fat tailed distribution of matrix elements

Fat tails and Top Eigenvalue: Wigner Case Eigenvalue statistics of large real symmetric matrices with iid elements X ij, P (x) X 1 µ Eigenvalue density: µ > 2 Wigner semi-circle in [ 2, 2] µ < 2 unbounded density with tails ρ(λ) λ 1 µ Note: µ < 2 non trivial statistics of eigenvectors (localized/delocalized) (Cizeau,JPB)

Fat tails and Top Eigenvalue: Wigner Case Largest Eigenvalue statistics ([GB,MP,JPB]) µ > 4: λ max 2 N 2/3 with a Tracy-Widom distribution (max of strongly correlated variables) 2 < µ < 4: λ max N 2 µ 1 2 with a Fréchet distribution (although the density goes to zero when λ > 2!!) µ = 4: λ max 2 but remains O(1), with a new distribution: P > (λ max ) = wθ(λ max 2) + (1 w)f (s) λ max = s + 1 s Note: The case µ > 4 still has a power-law tail for finite N, of amplitude N 2 µ/2

Fat tails and Correlation Matrices E ij = 1 T Xi t Xt j t µ > 4: λ max (1 + n) 2 N 2/3 (but with a power-law tail as above) µ < 4: λ max N 4 µ 1 n 1 2/µ Fat tails induce fictitious strong correlations important for applications in finance where µ 3 5.

EWMA Empirical Correlation Matrices Consider the case where the Empirical matrix is often computed using an exponentially weighted moving average (EWMA) with ɛ = 1/T E ij = ɛ k=0 (1 ɛ) k X k i Xk j where X k i Xl j = δ ijδ kl Above trick based R-functions still works: ρ(λ) = 1 IG(λ) where G(λ) solves λng = n log(1 ng) π

EWMA Empirical Correlation Matrices 1.2 1 exp Q=2 std Q=3.45 0.8 ρ(λ) 0.6 0.4 0.2 0 0 0.5 1 1.5 2 2.5 3 λ Spectrum of the exponentially weighted random matrix with n = 1/2 and the spectrum of the standard random matrix with n N/T = 1/3.45.

Dynamics of the top eigenvector Specific dynamics of large top eigenvalue and eigenvector: Ornstein-Uhlenbeck processes (on the unit sphere for V 1 ) The angle obeys the following SDE: dθ ɛ 2 sin 2θdt + ζ t dw t with [ ζt 2 1 ɛ2 2 sin2 2θ t + Λ ] 1 cos 2 2θ t Λ 0 Eigenvector dynamics: ψ0t+τ ψ 0t E(cos(θ t θ t+τ )) 1 ɛ Λ 1 Λ 0 (1 exp( ɛτ))

The variogram of the top eigenvector 0.2 0.1 Ornstein Uhlenbeck Log(Variogram cos(θ)) Variogram λ 0 0 0 100 200 300 400 τ