Estimating Bivariate Tail: a copula based approach

Similar documents
ESTIMATING BIVARIATE TAIL

Estimating Bivariate Tail: a copula based approach

Overview of Extreme Value Theory. Dr. Sawsan Hilal space

A proposal of a bivariate Conditional Tail Expectation

Extreme Value Analysis and Spatial Extremes

Extreme Value Theory and Applications

A Conditional Approach to Modeling Multivariate Extremes

Math 576: Quantitative Risk Management

MFM Practitioner Module: Quantitiative Risk Management. John Dodson. October 14, 2015

Nonparametric Estimation of the Dependence Function for a Multivariate Extreme Value Distribution

Non parametric estimation of Archimedean copulas and tail dependence. Paris, february 19, 2015.

Extreme value theory and high quantile convergence

On tail dependence coecients of transformed multivariate Archimedean copulas

Some multivariate risk indicators; minimization by using stochastic algorithms

Bivariate extension of the Pickands Balkema de Haan theorem

A PRACTICAL WAY FOR ESTIMATING TAIL DEPENDENCE FUNCTIONS

Bayesian Modelling of Extreme Rainfall Data

Bivariate generalized Pareto distribution

On the Estimation and Application of Max-Stable Processes

CONTAGION VERSUS FLIGHT TO QUALITY IN FINANCIAL MARKETS

arxiv: v4 [math.pr] 7 Feb 2012

A measure of radial asymmetry for bivariate copulas based on Sobolev norm

ASYMPTOTIC MULTIVARIATE EXPECTILES

Quantitative Modeling of Operational Risk: Between g-and-h and EVT

Models and estimation.

HIERARCHICAL MODELS IN EXTREME VALUE THEORY

Multivariate Pareto distributions: properties and examples

Analysis methods of heavy-tailed data

A Brief Introduction to Copulas

Lecture Quantitative Finance Spring Term 2015

Financial Econometrics and Volatility Models Extreme Value Theory

Estimation of multivariate critical layers: Applications to rainfall data

Quantile prediction of a random eld extending the gaussian setting

Modelling extreme-value dependence in high dimensions using threshold exceedances

MULTIVARIATE EXTREMES AND RISK

New Classes of Multivariate Survival Functions

Multivariate generalized Pareto distributions

Financial Econometrics and Volatility Models Copulas

Max-stable Processes for Threshold Exceedances in Spatial Extremes

On the Conditional Value at Risk (CoVaR) from the copula perspective

PREPRINT 2005:38. Multivariate Generalized Pareto Distributions HOLGER ROOTZÉN NADER TAJVIDI

The high order moments method in endpoint estimation: an overview

Correlation: Copulas and Conditioning

VaR vs. Expected Shortfall

Bayesian Point Process Modeling for Extreme Value Analysis, with an Application to Systemic Risk Assessment in Correlated Financial Markets

Tail negative dependence and its applications for aggregate loss modeling

STA 2201/442 Assignment 2

Multivariate extremes. Anne-Laure Fougeres. Laboratoire de Statistique et Probabilites. INSA de Toulouse - Universite Paul Sabatier 1

Study Guide on Dependency Modeling for the Casualty Actuarial Society (CAS) Exam 7 (Based on Sholom Feldblum's Paper, Dependency Modeling)

Tail Approximation of Value-at-Risk under Multivariate Regular Variation

Multivariate generalized Pareto distributions

Probability Distribution And Density For Functional Random Variables

Goodness-of-Fit Testing with Empirical Copulas

Bayesian nonparametrics for multivariate extremes including censored data. EVT 2013, Vimeiro. Anne Sabourin. September 10, 2013

STAT 461/561- Assignments, Year 2015

Introduction to Algorithmic Trading Strategies Lecture 10

Semi-parametric predictive inference for bivariate data using copulas

arxiv: v1 [math.pr] 24 Aug 2018

Explicit Bounds for the Distribution Function of the Sum of Dependent Normally Distributed Random Variables

A Note on Tail Behaviour of Distributions. the max domain of attraction of the Frechét / Weibull law under power normalization

Stochastic orders: a brief introduction and Bruno s contributions. Franco Pellerey

Estimation of spatial max-stable models using threshold exceedances

Modelling Dependence with Copulas and Applications to Risk Management. Filip Lindskog, RiskLab, ETH Zürich

Change Point Analysis of Extreme Values

Three hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER.

DEPENDENCE STRUCTURES BEYOND COPULAS: A NEW MODEL OF A MULTIVARIATE REGULAR VARYING DISTRIBUTION BASED ON A FINITE VON MISES-FISHER MIXTURE MODEL

Dependence Comparison of Multivariate Extremes via Stochastic Tail Orders

Richard L. Smith Department of Statistics and Operations Research University of North Carolina Chapel Hill, NC

Convergence of Multivariate Quantile Surfaces

Classical Extreme Value Theory - An Introduction

Multivariate Operational Risk: Dependence Modelling with Lévy Copulas

Tail dependence in bivariate skew-normal and skew-t distributions

7 Influence Functions

Introduction A basic result from classical univariate extreme value theory is expressed by the Fisher-Tippett theorem. It states that the limit distri

Tail comonotonicity: properties, constructions, and asymptotic additivity of risk measures

MAS223 Statistical Inference and Modelling Exercises

First steps of multivariate data analysis

Technische Universität München Fakultät für Mathematik. Properties of extreme-value copulas

Spring 2012 Math 541B Exam 1

Statistical Methods for Clusters of Extreme Values

Emma Simpson. 6 September 2013

2 Functions of random variables

Statistics of Extremes

On Conditional Value at Risk (CoVaR) for tail-dependent copulas

Thomas Mikosch: How to model multivariate extremes if one must?

EXTREMAL DEPENDENCE OF MULTIVARIATE DISTRIBUTIONS AND ITS APPLICATIONS YANNAN SUN

The Generalized Pareto process; with application

Extreme value statistics: from one dimension to many. Lecture 1: one dimension Lecture 2: many dimensions

Physically-Based Statistical Models of Extremes arising from Extratropical Cyclones

Some conditional extremes of a Markov chain

Regression and Statistical Inference

A conditional approach for multivariate extreme values

Lehrstuhl für Statistik und Ökonometrie. Diskussionspapier 87 / Some critical remarks on Zhang s gamma test for independence

Multivariate extensions of expectiles risk measures

GENERAL MULTIVARIATE DEPENDENCE USING ASSOCIATED COPULAS

Contents 1. Coping with Copulas. Thorsten Schmidt 1. Department of Mathematics, University of Leipzig Dec 2006

Estimation de mesures de risques à partir des L p -quantiles

Simulation of Max Stable Processes

Tail Dependence of Multivariate Pareto Distributions

copulas Lei Hua March 11, 2011 Abstract In order to study copula families that have different tail patterns and tail asymmetry than multivariate

Transcription:

Estimating Bivariate Tail: a copula based approach Elena Di Bernardino, Université Lyon 1 - ISFA, Institut de Science Financiere et d'assurances - AST&Risk (ANR Project) Joint work with Véronique Maume-Deschamps and Clémentine Prieur 11ème Forum des Jeunes Mathématicien-ne-s - Université Toulouse 3 21-23 Novembre 2011 1 / 29

Plan Introduction 1 Introduction 2 3 Estimating the tail dependence structure Estimating the tail 4 2 / 29

Univariate POT and Univariate Pickands-Balkema-de Haan Theorem The univariate POT (Peaks-Over-Threshold) method is common for estimating extreme quantiles or tail distributions (see e.g. McNeil 1997, 1999 and references therein). A key idea of this method is that a distribution is in the domain of attraction of an extreme value distribution if and only if the distribution of excesses over high thresholds is asymptotically generalized Pareto (GPD) (e.g. Balkema and de Haan, 1974; Pickands, 1975): V ξ,σ (x) := { 1 ( ) 1 ξx 1 ξ σ, if ξ 0, σ > 0, 1 e x σ, if ξ = 0, σ > 0, and x 0 for ξ 0 or 0 x < σ ξ for ξ > 0. 3 / 29

Univariate POT and Univariate Pickands-Balkema-de Haan Theorem From the Univariate Pickands-Balkema-de Haan Theorem: β( ) such that lim sup Fu (x) V u x k,β(u)(x) = 0, F 0 x<x F u where F u (x) = P[X u x X > u] and x F := sup{x R F (x) < 1} (end point of function F ). POT estimator for univariate tail distribution: ( ) 1 1 F N (y u) ξ 1 ξ (y) = n σ, if ξ 0, ) (e (y u) σ, if ξ = 0, N n with u < y < (if ξ 0) or u < y < σ ξ number of excesses above the threshold. (if ξ > 0) and N the random 4 / 29

Framework Goal : estimating the tail of a bivariate distribution function. Idea : a general extension of the Peaks-Over-Threshold method. Tools : a two-dimensional version of the Pickands-Balkema-de Haan Theorem, Juri & Wüthrich's and Charpentier & Juri's approach of the tail dependence. dependence modeled by copulas. Asymptotic dependence as well as asymptotic independence are considered. 5 / 29

Conditionally versus unconditionally on N Summary of results Construction of a two-dimensional tail estimator, study of its asymptotic properties. A parameter that describes the nature of the tail dependence is introduced and estimated. X 1,..., X n i.i.d. F. We x some high threshold u, let N denote the number of excesses above u. In the following, two approaches will be considered: Conditionally on N. If n is the sample size and u n the associated threshold, the number of excesses is m n, with lim n m n = and lim n m n /n = 0. Unconditionally on N, i.e., N n as a binomial random variable N n Bi (n, 1 F (u n )) with lim n 1 F (u n ) = 0 and lim n n(1 F (u n )) =. 6 / 29

Upper-tail dependence copula C(u, v) is a 2-dimensional copula and C (u, v) is the associated survival copula. Let X and Y be uniformly distributed on [0, 1]. Let u be a threshold in [0, 1) such that C (1 u, 1 u) > 0. The Upper-tail dependence copula at level u [0, 1) is C up u (x, y) := P[ X F 1 X, u(x), Y F 1 Y, u(y) X > u, Y > u ], (x, y) [0, 1] 2, where F X, u, F Y, u conditioned on {X > u, Y > u}: are the distribution of X and Y F X, u(x) := P[ X x X > u, Y > u ] = 1 C (1 x u, 1 u) C. (1 u, 1 u) 7 / 29

Limit of the upper-tail dependence copula Proposition Assume that C (1 u, 1 v)/ u < 0 and C (1 u, 1 v)/ v < 0 for all u, v [0, 1). Furthermore, assume that there is a positive function G such that lim u 1 C (x (1 u), y (1 u)) C (1 u, 1 u) Then for all (x, y) [0, 1] 2 lim u 1 C up u (x, y) = x + y 1 + G(g 1 X = G(x, y), for all x, y > 0. 1 (1 x), g (1 y)) := C G (x, y), where g X (x) := G(x, 1), g Y (y) := G(1, y). Moreover there is a constant θ > 0 such that, for x > 0 { x θ g Y ( y x G(x, y) = ) for y [0, 1], x y θ g X ( x ) for y (1, ). y x 8 / 29 Y

Standing assumptions X and Y are two continuous real valued random variables, with marginal distributions, F X, F Y, and copula C. F X MDA(H ξ1 ), F Y MDA(H ξ2 ) Let X 1, X 2,..., X n be a sequence i.i.d. F. If (a n ) n>0 (sequence of positive values), and (b n ) n>0 (sequence of real values) such that [ ] lim P max{x1, X 2,..., X n } b n x = H k (x), x R, n a n then H k (x) is a Generalized Extreme Value Distribution: { e H k (x) = (1 k x) 1 k, if k 0, e e x, if k = 0, with 1 k x > 0, k R (k < 0 Fréchet family, k = 0 Gumbel, k > 0 Weibull).We say F MDA(H k ). C as in the above Proposition. V ξ1,a1( ) (resp. V ξ2,a2( )) is the univariate GPD distribution with parameters ξ 1 (resp. ξ 2 ) and a 1 ( ) (resp. a 2 ( )) of X (resp. Y ). 9 / 29

A two dimensional Pickands- Balkema-de Haan Theorem Theorem Under the standing assumptions, sup P[ X u x, Y F 1 (F Y X (u)) y X > u, Y > F 1 (F Y X (u)) ] A C G ( 1 g X (1 V ξ1,a1(u)(x)), 1 g Y (1 V ξ2,a2(f 1 Y (F X (u))) (y))) A := {(x, y) : 0 < x x FX u, 0 < y x FY F 1 (F Y X (u))}, with x FX := sup{x R F X (x) < 1}, x FY := sup{y R F Y (y) < 1}. u x FX 0, Proof: We generalize the proof by Jury and Wüthrich (2004) in the case of a symmetric copula (and same marginal distributions). 10 / 29

Stable tail dependence function Estimating the tail dependence structure Estimating the tail Assume that the bivariate distribution function F has stable tail dependence function l: lim t 0 or equivalently lim t 0 1 t P[1 F X (X ) t x or 1 F Y (Y ) t y] := l(x, y) (1) 1 t P[1 F X (X ) t x, 1 F Y (Y ) t y] := R(x, y) = x + y l(x, y), see e.g. Huang (1992). If F X, F Y are in the maximum domain of attraction of two extreme value distributions H X, H Y and if (1) holds then F is in the domain of attraction of an extreme value distribution H with marginals H X, H Y and with copula determined by l. 11 / 29

Estimating the tail dependence structure Estimating the tail Relation with the upper tail dependence coecient Recall the upper tail dependence coecient: λ := lim t 0 P[F 1 X 1 (X ) > 1 t F (Y ) > 1 t]. Y Asymptotic dependence: λ > 0. Asymptotic independence: λ = 0. λ = R(1, 1). Asymptotic independence i.e., when the data exhibit positive or negative association that only gradually disappears at more and more extreme levels. 12 / 29

Estimators Estimating the tail dependence structure Estimating the tail We introduce the non parametric estimators for R (see Einmahl et al., 2006): R(x, y) = 1 n 1 {R(X k i )>n k n x+1, R(Y i )>n k n y+1}, n i=1 where k n, k n /n 0 and R(X i ) is the rank of X i among (X 1,..., X n ), R(Y i ) is the rank of Y i among (Y 1,..., Y n ), for i = 1,..., n. The functions g X, g Y, G are estimated by ĝ X (x) = R(x, 1) R(1, 1), ĝ Y (x) = R(1, y) R(1, 1), and Ĝ(x, y) = R(x, y) R(1, 1), 13 / 29

Estimators Estimating the tail dependence structure Estimating the tail The functions g X, g Y, G are estimated by ĝ X (x) = R(x, 1) R(1, 1), ĝ Y (x) = R(1, y) R(1, 1), and Ĝ(x, y) = R(x, y) R(1, 1), The coecient θ is estimated by θ y x = log(ĝ(x,y)) log(ĝ Y ( y x )) log(x) if y x log(ĝ(x,y)) log(ĝ X ( x y )) log(y) if y x [0, 1], (1, ). Symmetric copula case, g X (x) = g Y (x) = g(x) = x θ g(1/x) for x > 0, and θ x = log(ĝ(x)) log(ĝ( 1 )) x. log(x) 14 / 29

Estimating the tail dependence structure Estimating the tail Convergence results (asymptotically dependent case) Theorem 2.2 in Einmahl et al. (2006) leads to the following consistency result. Proposition Under our standing assumptions, for v n such that v n / k n 0, for n, and λ > 0, v n sup 0<x 1 v n sup 0<x,y 1 Ĝ(x, y) G(x, y) P 0, n ĝx (x) g X (x) P 0, v n sup n 0<y 1 with k n, k n /n 0 and k n = o ( n 2α 1+2α ). ĝy (y) g Y (y) P n 0. 15 / 29

Estimating the tail dependence structure Estimating the tail Convergence results (asymptotically independent case) The case λ = R(1, 1) = 0 requires second order conditions. As in Draisma et al. (2004), we assume that: lim t 0 C (tx, ty) C (t, t) G(x, y) q 1 (t) := Q(x, y), for all x, y 0, x + y > 0, with q 1 is some positive function and Q is neither a constant nor a multiple of G. The above convergence is uniform on {x 2 + y 2 = 1}. Denote q(t) := P[1 F X (X ) < t, 1 F Y (Y ) < t]. 16 / 29

Estimating the tail dependence structure Estimating the tail Convergence results (asymptotically independent case) Proposition Under our standing assumptions and second order conditions, for a sequence k n such that a n := n q(k n /n) ψ n sup 0<x 1 ψ n sup 0<x,y 1 Ĝ(x, y) G(x, y) P 0, n ĝx (x) g X (x) P 0, ψ n sup n with ψ n = o( a n ). 0<y 1 ĝy (y) g Y (y) P n 0, 17 / 29

A new tail estimator Estimating the tail dependence structure Estimating the tail Main ingredients of our estimator. A high threshold u; 1 F Y Dene û Y = function and F 1 Y ( F X (u)), with F X (u) the empirical distribution the empirical quantile function of Y. k X, σ X (resp. k Y, σ Y ) the MLE based on the excesses of X (resp. Y). F X (x) (resp. FY (y)) the univariate tail estimator (see McNeil (1999)): F X (x) = (1 FX (u))v k, σ (x u) + F X (u), for x > u. F 1 (u, y) = exp{ l n ( log( F X (u)), log( F Y (y)))}, and F 2 (x, û Y ) = exp{ l n ( log( F X (x)), log( FY (û Y )))}. 18 / 29

A new tail estimator Estimating the tail dependence structure Estimating the tail We estimate F (x, y), for x > u, y > u Y = F 1 (F Y X (u)) and u large, by: F (x, y) = ( 1 n n 1 {Xi >u, Y i >u Y } ) (1 ĝx (1 (x u)) V ξx, σx i=1 ĝ Y (1 V ξy, σ Y (y u Y )) + Ĝ( 1 V ξx, σ X (x u), 1 V ξy, σ Y (y u Y ) )) + F 1 (u, y) + F 2 (x, u Y ) 1 n n 1 {Xi u, Y i u Y }, i=1 where ξ X, σ X Y ). (resp. ξ Y, σ Y ) are MLE based on the excesses of X (resp. 19 / 29

A new tail estimator Estimating the tail dependence structure Estimating the tail But u Y = F 1 (F Y X (u)) is unknown. Then, in order to compute in practice F 1 (x, y), we propose to estimate u Y by û Y = F ( F Y X (u)), with F X (u) = 1 n 1 1 n i=1 {X i u} and F the empirical quantile function of Y. Y We estimate F (x, y), for x > u and y > û Y, by: F (x, y) = ( 1 n n 1 {Xi >u, Y i >û Y } ) (1 ĝx (1 (x u)) V ξx, σx i=1 ĝ Y (1 V ξy, σ Y (y û Y )) + Ĝ( 1 V ξx, σ X (x u), 1 V ξy, σ Y (y û Y ) )) + F 1 (u, y) + F 2 (x, û Y ) 1 n n 1 {Xi u, Y i û Y }, i=1 20 / 29

Estimating the tail dependence structure Estimating the tail Convergence result for our bivariate POT estimator (asymptotically dependent case) λ > 0, standing assumptions, rst and second order conditions on the marginal laws and Smith's hypothesis on the thresholds above. Theorem k n (F (x n, y n ) F (x n, y n )) P 0, n with x n = f 1 (n)f 1 (n), y n = f 2 (n)f 2 (n). Moreover if f 2 (n) satises the thershlod conditions in probability then k n (F (x n, ŷ n ) F (x n, ŷ n )) P 0, n with ŷ n = f 2 (n)f 2 (n). We have k n, k n /n 0, k n = o(n 2α 1+2α ), α > 0. 21 / 29

Estimating the tail dependence structure Estimating the tail Convergence result for our bivariate POT estimator (asymptotically independent case) λ = 0, standing assumptions, rst and second order conditions on the marginal laws, second order condition on the joint distribution and Smith's hypothesis on the thresholds above. Theorem a n (F (x n, y n ) F (x n, y n )) P 0, n where x n = f 1 (n)f 1 (n), y n = f 2 (n)f 2 (n). Moreover if f 2 (n) satises the threshold conditions in probability then a n (F (x n, ŷ n ) F (x n, ŷ n )) P 0, n with ŷ n = f 2 (n)f 2 (n) and a n = n q(k n /n). k n /n 0, a n q 1 (q (a n /n)) 0 and k n = o(n 2α 1+2α ), for some α > 0. 22 / 29

s of the dependence structure : simulations (sensitivity of θ x to the sequence k n ) Clayton copula. Mean curve on 100 samples of size n = 1000 (full line) and the empirical standard deviation (dashed lines). Mean squared error for θ x is calculated on 100 samples of size n = 1000. Asymptotic independent case. Figure: Clayton copula with parameter 0.05: (left) estimator for θ, (k, θ x) with x = 0.7; (right) mean squared error for θ x with x = 0.7. 23 / 29

Real data Figure: Logarithmic scale (left) ALAE versus Loss; (right) Wave heights versus Water level. 24 / 29

Real data Stability of our estimation compared to the one of F, as well as the 1 estimation of parameter θ of these real cases. F 1 (y 1, y 2 ) = exp{ l( log( F Y1 (y 1)), log( F Y2 (y 2)))}, is known to produce a signicant bias for asymptotically independent data. 25 / 29

Loss / ALEA data: asymptotic dependent case Loss-ALAE data: Each claim consists of an indemnity payment (the loss, X ) and an allocated loss adjustment expense (ALAE, Y ). We estimate F (2.10 5, 10 5 ). Figure: (left) θ 0.04; (right) F (2.10 5, 10 5 ) (full line), F 1 (2.10 5, 10 5 ) (dashed line), with the empirical probability indicated with a horizontal line. 26 / 29

Wave height vs Water level: asymptotic independent case Wave height versus Water level data: recorded during 828 storm events spread over 13 years in front of the Dutch coast near the town of Petten. Figure: (left) θ 0.1 0.11 = θ 0.91; (right) F (5.93, 1.87) (full line), F 1 (5.93, 1.87) (dashed line), with the empirical probability indicated with a horizontal line. 27 / 29

Ideas for future developments Get the optimal rate (choice of k n ), a central limit theorem for the dependence parameter θ, the dependence function g X, g Y,G and the bivariate POT tail estimator F? Optimal choice of k n and deep comparison with estimators in literature (Ledford and Tawn; 1997,1998). Use the bivariate tail estimator F (x, y) to obtain estimation of bivariate upper-quantile curves, for high levels α (see article Plug-in estimation of level sets in a non-compact setting with applications in multivariate risk theory, written with Thomas Laloë, Véronique Maume-Deschamps and Clémentine Prieur, accepted for publication in ESAIM: Probability and Statistics journal. 28 / 29

Thanks for your attention 29 / 29