Rearrangement Algorithm and Maximum Entropy

Similar documents
Risk Aggregation with Dependence Uncertainty

Introduction to Dependence Modelling

Sharp bounds on the VaR for sums of dependent risks

VaR bounds in models with partial dependence information on subgroups

Risk Aggregation with Dependence Uncertainty

Sum of Two Standard Uniform Random Variables

Simulating Exchangeable Multivariate Archimedean Copulas and its Applications. Authors: Florence Wu Emiliano A. Valdez Michael Sherris

Review (Probability & Linear Algebra)

Dependence uncertainty for aggregate risk: examples and simple bounds

Risk bounds with additional structural and dependence information

arxiv: v1 [q-fin.rm] 11 Mar 2015

Submodularity in Machine Learning

Introduction to Computational Finance and Financial Econometrics Probability Theory Review: Part 2

Catastrophe Aversion and Risk Equity in an Interdependent World

an information-theoretic model for adaptive side-channel attacks

Introduction to Algorithmic Trading Strategies Lecture 3

Supermodular ordering of Poisson arrays

Generalized Autoregressive Score Models

Explicit Bounds for the Distribution Function of the Sum of Dependent Normally Distributed Random Variables

Nonlinear Bivariate Comovements of Asset Prices: Theory and Tests

Solutions of the Financial Risk Management Examination

Lecture Quantitative Finance Spring Term 2015

arxiv: v2 [math.pr] 23 Jun 2014

Backtesting Marginal Expected Shortfall and Related Systemic Risk Measures

1: PROBABILITY REVIEW

Robust Testing and Variable Selection for High-Dimensional Time Series

Intro VEC and BEKK Example Factor Models Cond Var and Cor Application Ref 4. MGARCH

Generalized quantiles as risk measures

Confidence Intervals, Testing and ANOVA Summary

Introduction to Probability and Statistics (Continued)

Statistics for Financial Engineering Session 2: Basic Set Theory March 19 th, 2006

Risk Aggregation. Paul Embrechts. Department of Mathematics, ETH Zurich Senior SFI Professor.

The distribution of a sum of dependent risks: a geometric-combinatorial approach

Lecture 6 & 7. Shuanglin Shao. September 16th and 18th, 2013

Mon Feb Matrix algebra and matrix inverses. Announcements: Warm-up Exercise:

Corrections to Theory of Asset Pricing (2008), Pearson, Boston, MA

Worst-Case Expected Shortfall with Univariate and Bivariate. Marginals

Week 1 Quantitative Analysis of Financial Markets Distributions A

Lifetime Dependence Modelling using a Generalized Multivariate Pareto Distribution

On the Estimation and Application of Max-Stable Processes

Mathematical Foundations -1- Constrained Optimization. Constrained Optimization. An intuitive approach 2. First Order Conditions (FOC) 7

Joint Mixability. Bin Wang and Ruodu Wang. 23 July Abstract

Elementary Matrices. MATH 322, Linear Algebra I. J. Robert Buchanan. Spring Department of Mathematics

MTH 464: Computational Linear Algebra

CS 630 Basic Probability and Information Theory. Tim Campbell

The Instability of Correlations: Measurement and the Implications for Market Risk

Notes on Recursive Utility. Consider the setting of consumption in infinite time under uncertainty as in

CONTAGION VERSUS FLIGHT TO QUALITY IN FINANCIAL MARKETS

x n -2.5 Definition A list is a list of objects, where multiplicity is allowed, and order matters. For example, as lists

Overview of Extreme Value Theory. Dr. Sawsan Hilal space

Some Basic Concepts of Probability and Information Theory: Pt. 2

Asymmetric Dependence, Tail Dependence, and the. Time Interval over which the Variables Are Measured

(II.B) Basis and dimension

Measures of center. The mean The mean of a distribution is the arithmetic average of the observations:

Topological Data Analysis - Spring 2018

Multidimensional scaling (MDS)

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems

Robust Principal Component Analysis

Numerical Linear Algebra

Modelling Dependent Credit Risks

STA 4273H: Statistical Machine Learning

MATH 2331 Linear Algebra. Section 2.1 Matrix Operations. Definition: A : m n, B : n p. Example: Compute AB, if possible.

Risk Aggregation and Model Uncertainty

Multivariate Statistics

Exercises with solutions (Set D)

Sparse Covariance Selection using Semidefinite Programming

Dependence. Practitioner Course: Portfolio Optimization. John Dodson. September 10, Dependence. John Dodson. Outline.

Efficient estimation of a semiparametric dynamic copula model

A copula goodness-of-t approach. conditional probability integral transform. Daniel Berg 1 Henrik Bakken 2

BAYESIAN DECISION THEORY

Worst-Case Violation of Sampled Convex Programs for Optimization with Uncertainty

A Measure of Monotonicity of Two Random Variables

Labor-Supply Shifts and Economic Fluctuations. Technical Appendix

Summary of basic probability theory Math 218, Mathematical Statistics D Joyce, Spring 2016

An Academic Response to Basel 3.5

Vector autoregressions, VAR

Modeling Dependence of Daily Stock Prices and Making Predictions of Future Movements

STAT 200C: High-dimensional Statistics

is injective because f(a) = f(b) only when a = b. On the other hand, 1

Accounting for Missing Values in Score- Driven Time-Varying Parameter Models

Correlation & Dependency Structures

Modelling Dependence with Copulas and Applications to Risk Management. Filip Lindskog, RiskLab, ETH Zürich

1 Description of variables

Math 180B Problem Set 3

Matrices and systems of linear equations

Stein s method and zero bias transformation: Application to CDO pricing

Introduction to Algorithmic Trading Strategies Lecture 10

Modelling Dropouts by Conditional Distribution, a Copula-Based Approach

3. Vector spaces 3.1 Linear dependence and independence 3.2 Basis and dimension. 5. Extreme points and basic feasible solutions

Generalized Hypothesis Testing and Maximizing the Success Probability in Financial Markets

Order book resilience, price manipulation, and the positive portfolio problem

1 Joint and marginal distributions

Rational Inattention

Switching Regime Estimation

Solution of the Financial Risk Management Examination

A Goodness-of-fit Test for Copulas

Graduate Econometrics I: What is econometrics?

Asymptotic Bounds for the Distribution of the Sum of Dependent Random Variables

Convolution Based Unit Root Processes: a Simulation Approach

Recitation 2: Probability

Transcription:

University of Illinois at Chicago Joint with Carole Bernard Vrije Universiteit Brussel and Steven Vanduffel Vrije Universiteit Brussel R/Finance, May 19-20, 2017

Introduction An important problem in Finance: Given marginal distributions of random variables X 1,..., X d and of their weighted sum I = ω 1 X 1 +... + ω d X d, can we infer dependence among the variables? For example, if X j s are stock prices and I is the stock index, we can estimate their risk-neutral densities from traded options using Breeden and Litzenberger (1978) result: f (K) = 2 C(K) K 2. Using forward-looking risk-neutral densities implied by option prices, can we infer the dependence?

Introduction In this paper, we study properties of the Block Rearrangement Algorithm (BRA) in the context of inferring dependence among variables given their marginal distributions and the distribution of the sum. Although there are typically infinitely many theoretical solutions, we show that BRA yields solutions that are close to each other and exhibit maximum entropy. Thus, BRA is a stable algorithm for inferring dependence and its solution is economically meaningful.

Setup Inputs: d random variables X 1 F 1,..., X d F d. Goal: look for a dependence such that the variance of sum S = X 1 +... + X d is minimized. Assume that each X j is sampled into n equiprobable values, i.e., we consider the realizations x ij := F 1 j ( i 0.5 n ) and arrange them in an n d matrix: x 11 x 12... x 1d x 21 x 22... x 2d X = [X 1,..., X d ] =.... x n1 x n2... x nd Want to rearrange elements x ij (by columns) so that the variance of row sums is minimized. This is an NP complete problem. Brute force search requires checking (n!) (d 1) rearrangements!

Rearrangement Algorithm (RA) Greedy algorithm developed in Puccetti and Rüschendorf (2012) and Embrechts, Puccetti, and Rüschendorf (2013): 1 For j = 1,..., d, make the j th column anti-monotonic with the sum of the other columns. ( ) 2 If there is no improvement in var d k=1 X k, output the current matrix X, otherwise return to step 1. Step 1 ensures that the columns before rearranging (X k ) and after rearranging ( X k ) satisfy ( d ) ) var X k var X k. k=1 ( d k=1

Block Rearrangement Algorithm (BRA) When d > 3, the standard RA can be improved by considering blocks, Bernard and McLeish (2014), Bernard, Rüschendorf, and Vanduffel (2014): 1 Select a random sample of n sim possible partitions of the columns {1, 2,..., d} into two non-empty subsets {I 1, I 2 }. 2 For each of the n sim partitions, create block matrices X 1 and X 2 with corresponding row sums S 1 and S 2 and rearrange rows of X 2 so that S 2 is anti-monotonic to S 1. ( ) 3 If there is no improvement in var d k=1 X k, output the current matrix X, otherwise, return to step 1. When d is reasonably small, we can take n sim = 2 d 1 1 (all non-trivial partitions are considered). Otherwise, randomize.

Inferring Dependence Inputs: d random variables X 1 F 1,..., X d F d and their sum S F S. Assume that each X j and S are sampled into n equiprobable values, arranged in an n (d + 1) matrix: x 11 x 12... x 1d s 1 x 21 x 22... x 2d s 2 M = [X 1,..., X d, S] =........ x n1 x n2... x nd s n Apply BRA on the augmented matrix M. When row sums of the rearranged matrix are close to zero, a compatible dependence has been found.

Simulation Exercise: Gaussian Case Gaussian margins X j N[0, σ 2 j ] and Gaussian sum S N[0, σ2 S ]. Number of components d ranges from 3 to 10. Standard deviations σ i are linearly decreasing from 1 to 1/d. Set σ S such that ρ imp = 0.8. Discretization level n from 1,000 to 10,000. Run BRA K = 500 times. Find that for each k the inferred dependence is close to the one with the maximum entropy, which has a Gaussian copula, maximizes the determinant of the correlation matrix for X j.

Maximum Determinant and Maximum Entropy Entropy refers to disorder of a system, Shannon (1948). Let f be the density of a multivariate distribution of (X 1,..., X d ), then the entropy is H(X 1,..., X d ) = E(log( f (X 1,..., X d ))). Proposition: Maximum entropy The entropy of the multivariate distribution of (X 1,..., X d ) with Gaussian margins and invertible correlation matrix R satisfies H(X 1,.., X d ) d 2 (1 + ln(2π)) + 1 2 d ( ) ln σ 2 i + 1 ln (det(r)) 2 i=1 where the equality holds iff (X 1,..., X d ) is multivariate Gaussian.

Stability of BRA Figure: K = 500 blue dots correspond to different runs of BRA. Shaded gray area is set of feasible solutions; red star is maximal correlation matrix R M (=maximum entropy). Left panel: realized correlations ρ 12, ρ 13, and ρ 23. Right panel: relation of determinant versus ρ 12.

Stability of BRA Figure: K = 500 blue dots correspond to different runs of BRA. Shaded gray area is set of feasible solutions; red star is maximal correlation matrix R M (=maximum entropy). Left panel: realized correlations ρ 12, ρ 13, and ρ 23. Right panel: relation of determinant versus ρ 12.

Recovering Pairwise Correlations Normal Distribution: d = 3 and n = 1,000 1 0.5 0-0.5 0-0.2-0.4-0.6-0.8-1 PE1( d ) (%) -0.4-0.3-0.2-0.1 0 PE(ˆρ) (%) PE2( d ) (%) -0.4-0.3-0.2-0.1 0 PE(ˆρ) (%) 0.18 0.16 0.14 0.12 0.1 0.08 0.06 0.04 1 0.8 0.6 0.4 0.2 0 V -0.4-0.3-0.2-0.1 0 PE(ˆρ) (%) Correlations: 5 th, 50 th, and 90 th Percentiles ρ 12 ρ 13 ρ 23

Recovering Pairwise Correlations Normal Distribution: d = 10 and n = 1,000 PE1( d ) (%) 10-3 2 V 0-0.2-0.4-0.6-0.8-1 -1.4-1.2-1 -0.8-0.6-0.4-0.2 0 PE(ˆρ) (%) 10-3 PE2( d ) (%) 0-0.2-0.4-0.6-0.8-1 -1.4-1.2-1 -0.8-0.6-0.4-0.2 0 PE(ˆρ) (%) 10-3 1.5 1 0.5-1.4-1.2-1 -0.8-0.6-0.4-0.2 0 PE(ˆρ) (%) 10-3 Correlations: 5 th, 50 th, and 90 th Percentiles 1 0.8 0.6 0.4 0.2 0 ρ 12 ρ 23 ρ 34 ρ 45 ρ 56 ρ 67 ρ 78 ρ 89

Robustness to Initial Conditions Start from a particular candidate solution. Introduce small noise, by randomly swapping 0.2% of rows: 2 rows out of 1,000, 6 rows out of 3,000, 20 rows out of 10,000. Check where K = 500 runs of BRA converge.

Robustness to Initial Conditions Figure: K = 500 blue dots correspond to different runs of BRA. Each run starts at a particular solution (green star), but with 2 random rows swapped. Shaded gray area is set of feasible solutions; red star is maximal correlation matrix R M (=maximum entropy).

Robustness to Initial Conditions Figure: K = 500 blue dots correspond to different runs of BRA. Each run starts at a particular solution (green star), but with 6 random rows swapped. Shaded gray area is set of feasible solutions; red star is maximal correlation matrix R M (=maximum entropy).

Robustness to Initial Conditions Figure: K = 500 blue dots correspond to different runs of BRA. Each run starts at a particular solution (green star), but with 20 random rows swapped. Shaded gray area is set of feasible solutions; red star is maximal correlation matrix R M (=maximum entropy).

Conclusions Robust to non-gaussian distributions (e.g., Multivariate Skewed-t). Does not hold for the standard RA. Applications: Pricing multivariate options (basket, exchange, spread, etc.), Forward looking indicators of implied dependence, measures of tail risk, Down and Up implied correlation, Optimal portfolios. Paper is available on SSRN.