Accounting for extreme-value dependence in multivariate data

Similar documents
Nonparametric Estimation of the Dependence Function for a Multivariate Extreme Value Distribution

RANK-BASED INFERENCE FOR BIVARIATE EXTREME-VALUE COPULAS

Copula modeling for discrete data

A measure of radial asymmetry for bivariate copulas based on Sobolev norm

Semiparametric Gaussian Copula Models: Progress and Problems

A Brief Introduction to Copulas

ESTIMATING BIVARIATE TAIL

Semi-parametric estimation of non-stationary Pickands functions

Financial Econometrics and Volatility Models Copulas

Overview of Extreme Value Theory. Dr. Sawsan Hilal space

On the Estimation and Application of Max-Stable Processes

Correlation: Copulas and Conditioning

Semiparametric Gaussian Copula Models: Progress and Problems

Journal de la Société Française de Statistique Vol. 154 No. 1 (2013) Copula parameter estimation using Blomqvist s beta

Robustness of a semiparametric estimator of a copula

Marginal Specifications and a Gaussian Copula Estimation

A simple graphical method to explore tail-dependence in stock-return pairs

Goodness-of-Fit Testing with Empirical Copulas

Multivariate Survival Data With Censoring.

ESTIMATION OF THE PICKANDS DEPENDENCE FUNCTION FOR BIVARIATE ARCHIMAX COPULAS

Simulation of Tail Dependence in Cot-copula

Modelling and Estimation of Stochastic Dependence

MULTIDIMENSIONAL POVERTY MEASUREMENT: DEPENDENCE BETWEEN WELL-BEING DIMENSIONS USING COPULA FUNCTION

Does k-th Moment Exist?

EVANESCE Implementation in S-PLUS FinMetrics Module. July 2, Insightful Corp

Probability Distributions and Estimation of Ali-Mikhail-Haq Copula

THE MODELLING OF HYDROLOGICAL JOINT EVENTS ON THE MORAVA RIVER USING AGGREGATION OPERATORS

Correlation & Dependency Structures

Tail Approximation of Value-at-Risk under Multivariate Regular Variation

COPULA-BASED DENSITY WEIGHTING FUNCTIONS

Lecture Quantitative Finance Spring Term 2015

Modelling Dependence with Copulas and Applications to Risk Management. Filip Lindskog, RiskLab, ETH Zürich

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS

Forecasting time series with multivariate copulas

Stat 5101 Lecture Notes

CHANGE DETECTION IN TIME SERIES

La dépendance de queue en analyse des événements hydrologiques bivariés. Rapport de recherche No R-1426 Décembre 2012

Extreme Value Analysis and Spatial Extremes

Identification of marginal and joint CDFs using Bayesian method for RBDO

The extremal elliptical model: Theoretical properties and statistical inference

A copula goodness-of-t approach. conditional probability integral transform. Daniel Berg 1 Henrik Bakken 2

T E C H N I C A L R E P O R T PROJECTION ESTIMATORS OF PICKANDS DEPENDENCE FUNCTIONS. FILS-VILLETARD, A., GUILLOU, A. and J.

Dependence. Practitioner Course: Portfolio Optimization. John Dodson. September 10, Dependence. John Dodson. Outline.

Non parametric estimation of Archimedean copulas and tail dependence. Paris, february 19, 2015.

Bayesian Inference for Clustered Extremes

Copulas. MOU Lili. December, 2014

A New Generalized Gumbel Copula for Multivariate Distributions

Fitting Archimedean copulas to bivariate geodetic data

Efficient estimation of a semiparametric dynamic copula model

Copulas. Mathematisches Seminar (Prof. Dr. D. Filipovic) Di Uhr in E

1 Introduction. Amir T. Payandeh Najafabadi 1, Mohammad R. Farid-Rohani 1, Marjan Qazvini 2

arxiv: v1 [math.st] 5 Oct 2009

Multivariate Pareto distributions: properties and examples

Frailty Models and Copulas: Similarities and Differences

STT 843 Key to Homework 1 Spring 2018

I N S T I T U T D E S T A T I S T I Q U E B I O S T A T I S T I Q U E E T S C I E N C E S A C T U A R I E L L E S (I S B A)

ON SMALL SAMPLE PROPERTIES OF PERMUTATION TESTS: INDEPENDENCE BETWEEN TWO SAMPLES

Multivariate Statistics

arxiv: v1 [math.st] 31 Oct 2014

Construction of asymmetric multivariate copulas

Chapter 1. Bayesian Inference for D-vines: Estimation and Model Selection

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables THE UNIVERSITY OF MANCHESTER. 21 June :45 11:45

Reducing Model Risk With Goodness-of-fit Victory Idowu London School of Economics

Modelling Dropouts by Conditional Distribution, a Copula-Based Approach

Estimating Bivariate Tail: a copula based approach

Estimating the tail-dependence coefficient: Properties and pitfalls

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Simple Linear Regression

Modelling Dependent Credit Risks

Review of probability and statistics 1 / 31

Lecture 2 One too many inequalities

Simulating Exchangeable Multivariate Archimedean Copulas and its Applications. Authors: Florence Wu Emiliano A. Valdez Michael Sherris

Course on Inverse Problems

Copulas and dependence measurement

Multivariate extremes. Anne-Laure Fougeres. Laboratoire de Statistique et Probabilites. INSA de Toulouse - Universite Paul Sabatier 1

Bivariate generalized Pareto distribution

A semiparametrie estimation procedure of dependence parameters in multivariate families of distributions

Copulas and Measures of Dependence

Multiple Random Variables

Local sensitivity analyses of goodness-of-fit tests for copulas

Random fraction of a biased sample

When is a copula constant? A test for changing relationships

Bayesian Inference for Conditional Copula models

MAXIMUM ENTROPIES COPULAS

Empirical likelihood and self-weighting approach for hypothesis testing of infinite variance processes and its applications

First steps of multivariate data analysis

Dependence Patterns across Financial Markets: a Mixed Copula Approach

UNIVERSITÄT POTSDAM Institut für Mathematik

Bootstrapping high dimensional vector: interplay between dependence and dimensionality

Multivariate Non-Normally Distributed Random Variables

Systemic Weather Risk and Crop Insurance: The Case of China

Contents 1. Coping with Copulas. Thorsten Schmidt 1. Department of Mathematics, University of Leipzig Dec 2006

arxiv: v1 [math.pr] 24 Aug 2018

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random

A PRACTICAL WAY FOR ESTIMATING TAIL DEPENDENCE FUNCTIONS

If we want to analyze experimental or simulated data we might encounter the following tasks:

Long-Run Covariability

On the Choice of Parametric Families of Copulas

Songklanakarin Journal of Science and Technology SJST R1 Sukparungsee

CONTAGION VERSUS FLIGHT TO QUALITY IN FINANCIAL MARKETS

Transcription:

Accounting for extreme-value dependence in multivariate data 38th ASTIN Colloquium Manchester, July 15, 2008

Outline 1. Dependence modeling through copulas 2. Rank-based inference 3. Extreme-value dependence structures 4. Testing for extremeness 5. Nonparametric estimation with known margins 6. Proposed estimators when margins are unknown 7. Efficiency comparisons 8. Final remarks

1. Dependence modeling through copulas Consider a bivariate random vector (X, Y ) with continuous margins F(x) = Pr(X x), G(y) = Pr(Y y). Following Sklar (1959), one may write H(x,y) = Pr(X x,y y) = C(F(x),G(y)), for a unique copula C, viz. C(u,v) = Pr(U u,v v) = Pr{F(X) u,g(y ) v}.

Copula models A copula model for (X,Y ) obtains when H(x,y) = Pr(X x,y y) = C(F(x),G(y)) is assumed to hold for parametric families F (F α ), G (G β ), C (C θ ). Such models provide flexibility in the choice of X and Y margins. The copula induces the dependence between them, e.g., X Y C(u,v) uv.

2. Rank-based inference Copula models are not always appropriate, but the dependence in a continuous pair (X,Y ) is always characterized by its copula. Because copulas are invariant by increasing transformations of the margins, viz. (X,Y ) (φ(x),ψ(y )) the dependence in a random sample (X 1,Y 1 ),...,(X n,y n ) is best represented by the ranks, viz. R i = rank(x i ) = rank(φ(x i )), S i = rank(y i ) = rank(ψ(y i )).

To study the dependence, get rid of the marginals! The pairs (R i /n,s i /n) are pseudo-observations from the underlying copula that characterizes the dependence structure. 3 2 1 0 1 2 3 0 2 4 6 8 y y v 3 2 1 0 1 2 3 4 x 0 2 4 6 8 x u Samples of size 2000 from two distributions with the same Gumbel copula (τ = 1/2)

Example 1: LOSS/ALAE data 4 6 8 10 12 y v 2 4 6 8 10 12 14 x u Original data (left) and pairs of normalized ranks (ranks) Source: Frees & Valdez (1998)

Example 2: Nasdaq versus S&P500 innovations 5 S&P500 1 0.9 0 0.8 0.7 5 5 Q4 01 Q1 02 Q2 02 Q3 02 Nasdaq Nasdaq innovations 0.6 0.5 0.4 0.3 0 0.2 0.1 5 Q4 01 Q1 02 Q2 02 Q3 02 0 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 S&P500 innovations Original data (left) and pairs of normalized ranks (ranks) Source: Van den Goorbergh et al. (2005)

Example 3: Prices of oil and gas 0.10 0.05 0.00 0.05 0.10 0.15 0.20 Date Ret ur n 2004 2005 2006 0.05 0.00 0.05 Date Ret ur n 2004 2005 2006 Normalized Ranks (Crude Oil Returns) Normalized Ranks (Natural Gas Residuals) Original data (left) and pairs of normalized ranks (ranks) Source: Genest et al. (2008)

3. Extreme-value dependence structures Clusters of points near (0,0) and (1,1) suggest that the dependence structure may be an extreme-value copula, viz. [ { }] log(v) C(u,v) = exp log(uv)a. log(uv) where A : [0,1] [0,1] is convex and max(t,1 t) A(t) 1, t [0,1]. This happens in many other actuarial/financial applications, although (X,Y) does not have an extreme-value distribution.

Pickands dependence function Pickands dependence function A(t) t Tail dependence coefficient: λ = 2{1 A(1/2)} [0, 1]

Illustration v v u u Samples of size 2000 from two extreme-value copulas Left: Marshall Olkin (0.5, 0.4) Right: Student extreme-value copula with 2 d.f. and ρ = 0.5

Issues of interest A. How can one check that C is an extreme-value copula? B. If C is an extreme-value copula, how can it be estimated? N.B.: A rank-based estimate is useless for prediction purposes, but it can help select an appropriate family of extreme-value copulas; it can serve as a golden standard against which to test goodness-of-fit.

4. Testing for extremeness Ghoudi et al. (1998) show how to test the hypothesis H 0 : C is an extreme-value copula. Their test is based on the probability integral transformation (X,Y ) W = H(X,Y ) = C(U,V). The latter has been extensively studied by Genest & Rivest (1993, 2001), as well as Barbe et al. (1996).

Key observation Under H 0, the distribution of W = H(X,Y ) is given by K(w) = Pr(W w) = w (1 τ)w log(w), w (0,1] with τ = 4E(W ) 1 = 1 0 t(1 t) A(t) da (t). Consequently, µ i = E(W i ) = iτ + 1 (i + 1) 2, i {1,2,...}.

Test statistic If H 0 is true, one must have 1 + 8E(W ) 9E(W 2 ) = 0. This can be tested using the pseudo-observations W 1 = H n (X 1,Y 1 ),...,W n = H n (X n,y n ). The test statistic proposed by Ghoudi et al. (1998) amounts to S n = 1 + 8 n n W i 9 n i=1 n i=1 W 2 i.

Procedure Estimate the variance of S n by the jackknife, viz. ˆσ 2 n = n 1 n n i=1 (S ( i) n S n ) 2. Because S n is a U-statistic, an approximate P-value is given by ( Pr Z > S ) n. σ n See Lee (1990) or Ghoudi et al. (1998) for details.

Alternative solution of Ben Ghorbal et al. (submitted) To reduce the bias associated with the jackknife, estimate the finite- or large-sample variance of S n. 0.1 0.2 0.3 0.4 0.5 0.6 1 2 3 Illustration in the case of independence

Example For the LOSS/ALAE data of Frees & Valdez (1998), S n 0.059 and P value 95.26%. 4 6 8 10 12 y v 2 4 6 8 10 12 14 x u The Gumbel Hougaard extreme-value copula fits very very well!

Limitations of the test 1.- It tends to be too liberal when the sample size is small (e.g., it isn t conclusive for oil/gas prices). 2.- The power depends on the true family of extreme-value copulas under the null hypothesis. 1,0 1,0 0,8 0,8 0,6 0,6 Power Power 0,4 0,4 0,2 0,2 0,0 0 10 20 30 40 50 d 0,0 0 10 20 30 40 50 d

5. Nonparametric estimation with known margins When the margins F and G are known with certainty, one can assume WLOG that the sample (X 1,Y 1 ),...,(X n,y n ) arises from some copula C (i.e., margins are uniform). Under the hypothesis that C is an extreme-value copula, there are several possible estimates for the Pickands dependence function, A.

Choice of estimators Pickands (Bull. Inter. Statist. Inst., 1981) Deheuvels (Statist. Probab. Lett., 1991) Capéraà et al. (Biometrika, 1997) Hall & Tajvidi (Bernoulli, 2000) Jiménez et al. (JMVA, 2001) Segers (Nova Science Publ., 2007) Zhang et al. (JMVA, 2007)

Basic fact leading to all these estimators Recall (U,V) = (F(X),G(Y )) C. Let ξ(t) = log U 1 t log V t, t (0,1). Then ξ(t) exp{a(t)}, and hence E{ξ(t)} = 1/A(t), E{log ξ(t)} = log A(t) γ, where γ = 0.5772... is Euler s constant.

Pickands and Capéraà Fougères Genest (CFG) estimators Segers (2007) shows that they can be expressed in the form 1/A P n (t) = 1 n n ξ i,n (t) i=1 and as a function of log A CFG n (t) = 1 n n log ξ i,n (t) γ i=1 ξ i,n (t) = log U i 1 t log V i t, i {1,...,n}.

Endpoint corrections to ensure that Â(0) = Â(1) = 1 Correction to Pickands estimator proposed by Deheuvels (1991): 1/A D n (t) = 1/A P n (t) (1 t){1/a P n (0) 1} t{1/a P n (1) 1}. Simple endpoint corrections to the CFG estimator take the form log A CFG n (t) p(t)log A CFG n (0) {1 p(t)}log A CFG n (1), where p(t) = 1 t in Capéraà et al. (1997). The optimal choice of p(t) is identified by Segers (2007).

6. Proposed estimators when margins are unknown For i {1,...,n}, let U i,n = R i /n, V i,n = S i /n, and ξ i,n (t) = log U i,n 1 t log V i,n, t (0,1). t Rank-based versions of the Pickands and CFG estimators are: 1/A P n (t) = 1 n log A CFG n (t) = 1 n n ξ i,n (t), i=1 n log ξ i,n (t) γ. i=1

Reformulation in terms of Deheuvels empirical copula The empirical copula (Deheuvels, 1979) is defined by C n (u,v) = 1 n n 1(U i,n u,v i,n v), u,v [0,1]. i=1 The limiting behavior of A P n and A CFG n depends on C n, because 1 1/A P n (t) = C n (u 1 t,u t ) du u, log A CFG n (t) = 0 1 0 {C n (u 1 t,u t ) 1(u > e 1 )} du u log u γ.

Result from Stute (1984) and Tsukahara (2005) Let α be a C Brownian bridge, i.e., a centered Gaussian random field on [0,1] 2 with covariance function cov{α(u,v),α(u,v )} = C(u u,v v ) C(u,v)C(u,v ). If C has continuous partial derivatives of second order, then n (Cn C) C, where the limit is a centered Gaussian process on [0,1] 2, viz. C(u,v) = α(u,v) C(u,v) u α(u,1) C(u,v) v α(1,v).

Consequences If A is twice continuously differentiable, then as n, one finds A CFG n A P n = n (A P n A) AP, = n (A CFG n A) A CFG, in the space C([0,1]), where for all t [0,1], A P (t) = A 2 (t) A CFG (t) = A(t) 1 0 1 0 C(u 1 t,u t ) du u, C(u 1 t,u t ) du u log u.

Observations 1. The proof of this result: - is not a simple corollary of the Continuous Mapping Theorem; - uses strong approximations given by Tsukahara (2000); - requires a new result on weighted bivariate empirical processes. 2. Endpoint corrections make no difference asymptotically. 3. Limiting covariance functions depend on A and can be estimated consistently from the data. 4. Construction of asymptotic confidence intervals and bands is thus possible.

7. Efficiency comparisons X and Y are independent if and only if A 1. In that case, where A P n (t) N(0,σP), 2 A CFG n (t) N(0,σCFG), 2 σp 2 = 3t(1 t) (2 t)(1 + t), σ 2 CFG = 2 1 0 v 0 (1 v 1 t )(1 v t ) du dv log u v log v.

Comparison between Pickands Deheuvels and CFG Asymptotic variance var 0.0 0.1 0.2 0.3 0.4 0.5 Pickands Deheuvels CFG t

Comparison with the case of known marginals Asymptotic variance var 0.0 0.1 0.2 0.3 0.4 0.5 Pickands Deheuvels CFG Margins known Margins unknown t

Other dependence structures Extensive simulations imply that in terms of efficiency: CFG Pickands Deheuvels Hall-Tajvidi; Rank-based Known margins. Henmi (2005) documents the paradoxical effect of nuisance parameters on the efficiency of estimators.

Gumbel copula with λ 0.25 Gumbel copula; margins known Gumbel copula; margins unknown n * MSE 0.0 0.1 0.2 0.3 0.4 Deheuvels Hall Tajvidi CFG n * MSE 0.0 0.1 0.2 0.3 0.4 Deheuvels Hall Tajvidi CFG t t

Asymmetric Gumbel copula with λ 0.25 Asymmetric Gumbel copula; margins known Asymmetric Gumbel copula; margins unknown n * MSE 0.0 0.2 0.4 0.6 0.8 Deheuvels Hall Tajvidi CFG n * MSE 0.0 0.2 0.4 0.6 0.8 Deheuvels Hall Tajvidi CFG t t

MISE for the Cauchy extreme-value copula t EV copula margins known t EV copula margins unknown n * MISE 0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35 Hall Tajvidi CFG n * MISE 0.00 0.05 0.10 0.15 0.20 0.25 0.30 0.35 Hall Tajvidi CFG 2{1 A(0.5)} 2{1 A(0.5)} Graph of 100 MISE as a function of λ = 2{1 A(1/2)}

8. Final remarks Rank-based versions of nonparametric estimators have been proposed for the Pickands dependence function of a bivariate extreme-value copula. The new estimators are asymptotically normal and unbiased. The CFG estimator is superior to the Pickands estimator and variants thereof. Even when margins are known, more efficient procedures arise if information about margins is ignored and ranks are used instead. Goodness-of-fit tests for extreme-value copulas are under development.

Acknowledgements This talk is based in part on joint work with Johan Segers, Johanna Nešlehová, and Noomen Ben Ghorbal. Funding in support of this work was provided by: the Natural Sciences and Engineering Research Council of Canada; the Fonds québécois de la recherche sur la nature et les technologies; the Institut de finance mathématique de Montréal.