Lifetime Dependence Modelling using a Generalized Multivariate Pareto Distribution

Similar documents
Modelling and Estimation of Stochastic Dependence

Modelling Dependence with Copulas and Applications to Risk Management. Filip Lindskog, RiskLab, ETH Zürich

Lecture Quantitative Finance Spring Term 2015

Simulating Exchangeable Multivariate Archimedean Copulas and its Applications. Authors: Florence Wu Emiliano A. Valdez Michael Sherris

Overview of Extreme Value Theory. Dr. Sawsan Hilal space

Modelling Dependent Credit Risks

Construction and estimation of high dimensional copulas

Introduction to Algorithmic Trading Strategies Lecture 10

Financial Econometrics and Volatility Models Copulas

Tail negative dependence and its applications for aggregate loss modeling

Department of Statistical Science FIRST YEAR EXAM - SPRING 2017

Extreme Value Analysis and Spatial Extremes

Dependence. Practitioner Course: Portfolio Optimization. John Dodson. September 10, Dependence. John Dodson. Outline.

Correlation: Copulas and Conditioning

CONTAGION VERSUS FLIGHT TO QUALITY IN FINANCIAL MARKETS

Clearly, if F is strictly increasing it has a single quasi-inverse, which equals the (ordinary) inverse function F 1 (or, sometimes, F 1 ).

Tail Dependence of Multivariate Pareto Distributions

A simple graphical method to explore tail-dependence in stock-return pairs

Probability Distributions and Estimation of Ali-Mikhail-Haq Copula

Tail Approximation of Value-at-Risk under Multivariate Regular Variation

ESTIMATING BIVARIATE TAIL

Switching Regime Estimation

Politecnico di Torino. Porto Institutional Repository

MFM Practitioner Module: Quantitiative Risk Management. John Dodson. October 14, 2015

Recovering Copulae from Conditional Quantiles

A Brief Introduction to Copulas

INSTITUTE AND FACULTY OF ACTUARIES. Curriculum 2019 SPECIMEN SOLUTIONS

Multivariate Distribution Models

Robustness of a semiparametric estimator of a copula

Aggregation and the use of copulas Tony Jeffery

Convexity of chance constraints with dependent random variables: the use of copulae

Multivariate Non-Normally Distributed Random Variables

Statistics Ph.D. Qualifying Exam: Part I October 18, 2003

A Conditional Approach to Modeling Multivariate Extremes

Statistics 3858 : Maximum Likelihood Estimators

Explicit Bounds for the Distribution Function of the Sum of Dependent Normally Distributed Random Variables

Part IA Probability. Theorems. Based on lectures by R. Weber Notes taken by Dexter Chua. Lent 2015

VaR vs. Expected Shortfall

Shape of the return probability density function and extreme value statistics

On Parameter-Mixing of Dependence Parameters

MATH c UNIVERSITY OF LEEDS Examination for the Module MATH2715 (January 2015) STATISTICAL METHODS. Time allowed: 2 hours

Efficient rare-event simulation for sums of dependent random varia

Math 576: Quantitative Risk Management

Exercises. (a) Prove that m(t) =

Probability Distribution And Density For Functional Random Variables

Multivariate Operational Risk: Dependence Modelling with Lévy Copulas

Parameter Estimation

Multivariate Distributions

Copulas and Measures of Dependence

A New Generalized Gumbel Copula for Multivariate Distributions

Parameter Estimation

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

A measure of radial asymmetry for bivariate copulas based on Sobolev norm

Introduction to Dependence Modelling

Multivariate survival modelling: a unified approach with copulas

Multivariate Normal-Laplace Distribution and Processes

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

EXTREMAL DEPENDENCE OF MULTIVARIATE DISTRIBUTIONS AND ITS APPLICATIONS YANNAN SUN

Multivariate Statistics

Dependence. MFM Practitioner Module: Risk & Asset Allocation. John Dodson. September 11, Dependence. John Dodson. Outline.

Experience Rating in General Insurance by Credibility Estimation

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER.

Statistical Inference and Methods

Contents 1. Coping with Copulas. Thorsten Schmidt 1. Department of Mathematics, University of Leipzig Dec 2006

The largest eigenvalues of the sample covariance matrix. in the heavy-tail case

STA 2201/442 Assignment 2

General approach to the optimal portfolio risk management

Estimation of risk measures for extreme pluviometrical measurements

Statistics & Data Sciences: First Year Prelim Exam May 2018

Distributions of Functions of Random Variables. 5.1 Functions of One Random Variable

Frailty Models and Copulas: Similarities and Differences

Frontier estimation based on extreme risk measures

Financial Econometrics and Volatility Models Extreme Value Theory

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data

13. Parameter Estimation. ECE 830, Spring 2014

Nonparametric Estimation of the Dependence Function for a Multivariate Extreme Value Distribution

Nonparametric estimation of extreme risks from heavy-tailed distributions

Introduction to Rare Event Simulation

Correlation & Dependency Structures

Elements of Financial Engineering Course

Qualifying Exam CS 661: System Simulation Summer 2013 Prof. Marvin K. Nakayama

Elements of Financial Engineering Course

Stochastic Comparisons of Weighted Sums of Arrangement Increasing Random Variables

Bayesian Point Process Modeling for Extreme Value Analysis, with an Application to Systemic Risk Assessment in Correlated Financial Markets

Copula Regression RAHUL A. PARSA DRAKE UNIVERSITY & STUART A. KLUGMAN SOCIETY OF ACTUARIES CASUALTY ACTUARIAL SOCIETY MAY 18,2011

Multiple Random Variables

SOLUTION FOR HOMEWORK 8, STAT 4372

A Measure of Association for Bivariate Frailty Distributions

Non parametric estimation of Archimedean copulas and tail dependence. Paris, february 19, 2015.

Information geometry for bivariate distribution control

Review Quiz. 1. Prove that in a one-dimensional canonical exponential family, the complete and sufficient statistic achieves the

Definition 1.1 (Parametric family of distributions) A parametric distribution is a set of distribution functions, each of which is determined by speci

APPROXIMATING THE GENERALIZED BURR-GAMMA WITH A GENERALIZED PARETO-TYPE OF DISTRIBUTION A. VERSTER AND D.J. DE WAAL ABSTRACT

Mathematical statistics

Multivariate Survival Data With Censoring.

On consistency of Kendall s tau under censoring

Step-Stress Models and Associated Inference

Bivariate Degradation Modeling Based on Gamma Process

Ordering Extremes of Interdependent Random Variables

Statistical Inference with Monotone Incomplete Multivariate Normal Data

Transcription:

Lifetime Dependence Modelling using a Generalized Multivariate Pareto Distribution Daniel Alai Zinoviy Landsman Centre of Excellence in Population Ageing Research (CEPAR) School of Mathematics, Statistics & Actuarial Science University of Kent Department of Statistics University of Haifa 18 July 2017 D. H. Alai (CEPAR, Kent) Lifetime Dependence Modelling 18 July 2017 1 / 23

Plan Introduction Multivariate Generalized Pareto Distribution Parameter Estimation Optimal Quantile Selection Bulk Annuity Pricing Conclusion D. H. Alai (CEPAR, Kent) Lifetime Dependence Modelling 18 July 2017 2 / 23

Introduction Motivation: Provide the means to assess the impact of dependent lifetimes on annuity valuation and risk management. Basis: systematic mortality improvements induce dependence. Could reframe as cohort, or pool of similar-risks, analysis. Investigate a multivariate generalized Pareto distribution because: Interesting family with potential for more flexible dependence. More suitable for older-age dependence due to presence of extremes. Resolve estimation in the presence of truncation (in a variety of ways). Moment-based estimation (applied to the minimum observation). Quantile-based estimation (with optimal levels). Assess the impact of dependence on the risk of a bulk annuity. Dependence increases the risk. D. H. Alai (CEPAR, Kent) Lifetime Dependence Modelling 18 July 2017 3 / 23

Modelling Dependent Lifetimes Assume m pools of n lives. Suppose the lives within a pool are dependent. Let X i,j be the lifetime of individual i in pool j. We apply the following model for lifetimes: X j h(θ, λ S ), where λ S = n i=1 λ i. This means pools are independent. Each pool is one draw from the multivariate distribution. The magnitudes of m and n determine the application. n = 2 joint-life products. j, n m Small m or n might pose difficulties! D. H. Alai (CEPAR, Kent) Lifetime Dependence Modelling 18 July 2017 4 / 23

Multiply Monotone Generated Distributions Let X = (X 1,..., X n ) be a multivariate random vector with strictly positive components X i > 0 such that its joint survival function is given by ( n P(X 1 > x 1,..., X n > x n ) = h λ i x i ), x i 0, for λ i > 0, i, where h is d-times monotone, d n. That is, for k {1,..., d}, ( 1) k h (k) (x) 0, x > 0. Two well-known examples include the Pareto and Weibull distributions. {Pareto} h(x) = (1 + x) 1 θ, x 0, θ R +, {Weibull} h(x) = exp( x 1 θ ), x 0, θ [1, ). The Pareto generator resembles the Clayton copula generator (1 + θx) 1/θ. The Weibull generator is just the Gumbel copula generator. i=1 D. H. Alai (CEPAR, Kent) Lifetime Dependence Modelling 18 July 2017 5 / 23

Joint Densities of Subsets of X The multiply monotone condition on h ensures we have admissible densities for all possible subsets of X! For example, the densities of X and X i are given by, f X (x 1,..., x n ) = ( 1) n λ 1 λ n h (n) ( n i=1 λ i x i ) 0, x i > 0, f i (x i ) = ( 1)λ i h (1) (λ i x i ) 0, x i > 0. Survival functions are always given by h: P(X i > x i, X j > x j ) = h(λ i x i + λ j x j ), x i, x j 0, i j, P(X i > x i ) = h(λ i x i ), x i 0. As such, we require that h(0) = 1 and lim x h(x) = 0. There is a clear link to Archimedean survival copulas. D. H. Alai (CEPAR, Kent) Lifetime Dependence Modelling 18 July 2017 6 / 23

Examples of h { } { } Pareto, θ 1 4, 1 2, 1, 2. Clayton, θ 1 4, 1 2, 1, 2. Gumbel, θ {1, 2, 4, 8}. Frank, θ { 4, 1, 1, 4}. { } AMH, θ 19 20, 1 4, 1 4, 3. Expo-Pareto, θ {1, 2, 4, 8}. 4 D. H. Alai (CEPAR, Kent) Lifetime Dependence Modelling 18 July 2017 7 / 23

Bivariate Marginal Correlations As well as exhibiting either light or heavy tails, each h produces a different correlation structure between marginals. Not surprisingly, heavy tailed examples permit only positive correlation, whereas light tailed distributions allow for negative correlation. For the Pareto and Clayton, Corr(X i, X j ) = θ, for i j. For the remaining examples, the bivariate correlation involves either the incomplete gamma, dilogarithm or trilogarithm function. Γ(s, x) = Li 2 (z) = x 0 Li 3 (z) = z 0 t s 1 e t dt, ln(1 t) dt, t z Li 2 (t) dt. t More on correlation later, after we ve addressed truncation! D. H. Alai (CEPAR, Kent) Lifetime Dependence Modelling 18 July 2017 8 / 23

Parameter Estimation We wish to make use of pool statistics to estimate model parameters. Mean and Variance; Minimum and Maximum; Quantiles! Within-pool dependence is a clear obstacle, but not the only one! We anticipate truncated observations. We require some theoretical results before we can proceed. D. H. Alai (CEPAR, Kent) Lifetime Dependence Modelling 18 July 2017 9 / 23

Mixed Truncated Moments Theorem (Mixed Moments) Consider X = (X 1,..., X n ) with distribution generated by d-times monotone h, d n. Let τ X i = {X i X > τ }. If finite, [ n E τ X k i i i=1 ] k 1 k n = h(λ S τ) 1 j 1 =0 j n=0 h ( n l=1 j l) (λ S τ) n i=1 ( 1) j i τ k i j i (k i j i )! where λ S = n i=1 λ i, k = n i=1 k i, k {1, 2,..., d}, and k i {0} Z + ; furthermore, where h ( k) (x) = x h ( (k 1)) (y)dy and h (0) (x) = h(x). k i!, Mean, variance and covariance results are especially relevant. This result can be used to find the moments of the minimum (and maximum). λ j i i Let s take a look at the bivariate correlation plots. They depend on τ! D. H. Alai (CEPAR, Kent) Lifetime Dependence Modelling 18 July 2017 10 / 23

Correlation Plots for τ {0, 1, 2, 5} Pareto Clayton Gumbel Frank Ali-Mikhail-Haq Exponential-Pareto D. H. Alai (CEPAR, Kent) Lifetime Dependence Modelling 18 July 2017 11 / 23

Comments on Mean-Variance Matching Mean, variance and covariance results enable us to determine the expectation of the sample (pool) mean and variance. Averaging these, respectively, across pools yields θ and λs. Consider the Pareto distribution with λ i = λ, i; we have E[a 1 ( τ X j )] = λ 1 + τ(n + θ 1 1) θ 1, 1 (λ 1 + τn) 2 E[ m 2 ( τ X j )] = (θ 1 1)(θ 1 2), where a 1 and m 2 denote the unbiased sample (pool) mean and variance. Note the relationship with pool size n. Inseparable from the truncation point τ. No indication that large n will produce more accurate estimation. Perhaps ideal for a portfolio of many joint-life annuities. D. H. Alai (CEPAR, Kent) Lifetime Dependence Modelling 18 July 2017 12 / 23

Comments on Minimum-Maximum Matching Sample moments of minima (or maxima) yield estimates θ and λ. Focus on minimum, since it looks much more promising. Consider the Pareto distribution with λ i = λ, i; we have E[a 1 ( τ X (1) )] = λ 1 /n + τθ 1 θ 1, 1 E[ m 2 ( τ X (1) )] = θ 1 (λ 1 /n + τ) 2 (θ 1 1) 2 (θ 1 2). Contrast the relationship with pool size n to the mean-variance matching. This time distinct from τ and indicative of more accuracy as n. Perhaps ideal for a portfolios of employer-based pension schemes. D. H. Alai (CEPAR, Kent) Lifetime Dependence Modelling 18 July 2017 13 / 23

Quantile Matching The previous two estimation procedures require sufficiently light tails! For the Pareto, 0 < θ < 1/2. Quantile-based estimation procedures do not impose this restriction! We apply quantile matching to the sample of pool minima! q τ X (1) (p) = h 1 ((1 p)h(λ S τ)) λ S. Our estimation procedure requires three {optimal} levels p1, p 2, and p 3. D. H. Alai (CEPAR, Kent) Lifetime Dependence Modelling 18 July 2017 14 / 23

Fisher Information: Establishing the Objective Function Consider a sample of iid X 1,..., X n with density f (x, ϑ), ϑ Θ R, differentiable with respect to ϑ for almost all x R. The Fisher information about ϑ contained in statistic T n (X 1,..., X n ) is ( ) ln ftn (x, ϑ) 2 I Tn (ϑ) = f Tn(x, ϑ)dx. ϑ R A higher Fisher information is indicative of more precise estimation. The Fisher information contained in the sample quantiles, I q(p1 ),..., q(p k )(ϑ), 0 = p 0 < p 1 <... < p k+1 = 1, is asymptotically equal to ni k (p 1,..., p k ); I k (p 1,..., p k ) = k i=0 (β i+1 β i ) 2 p i+1 p i, where β i = f (q(p i ), ϑ) q(p i )/ ϑ, i and β 0 = β k+1 = 0. Find optimal levels p 1,..., p k, such that I k is maximized! D. H. Alai (CEPAR, Kent) Lifetime Dependence Modelling 18 July 2017 15 / 23

The Pareto Distribution The optimal quantile selection procedure depends heavily on h. Let us focus on the Pareto distribution. We want to estimate θ (with λ S unknown) using two quantiles ( p 1 < p 2 ). I 2 (p 1, p 2 ) = β2 1 p 1 + (β 2 β 1 ) 2 p 2 p 1 + β2 2 1 p 2. For the Pareto distribution, and letting ˇp i = 1 p i, we obtain β i = θ ˇp i ln ˇp i. The objective function may be rewritten as follows ( ) I 2 (p 1, p 2 ) = θ 2 ˇp 2 1 ln2 ˇp 1 + (ˇp 2 ln ˇp 2 ˇp 1 ln ˇp 1 ) 2 + ˇp 2 ln 2 ˇp 2. p 1 p 2 p 1 Maximizing this does not require knowledge of θ and λs! Furthermore, it does not even depend on τ! D. H. Alai (CEPAR, Kent) Lifetime Dependence Modelling 18 July 2017 16 / 23

Finding p 1 and p 2 for the Pareto Distribution p 2 = 0.25. p 2 = 0.50. p 2 = 0.75. p 2 = 0.90. p 2 = 0.95. p 2 = 0.99. The optimal levels are p 1 = 0.6385 and p 2 = 0.9265. D. H. Alai (CEPAR, Kent) Lifetime Dependence Modelling 18 July 2017 17 / 23

Finding p 3 Armed with θ, we consider the optimal quantile level p 3 used to estimate λ S. Following the same method, optimal p 3 is found by maximizing ( ) ˇp 3 1 ˇp θ 2 3. p 3 This depends on θ, for which we luckily have an estimate! (a) θ = 1 4. (b) θ = 1. (c) θ = 1. (d) θ = 2. 2 The lighter the tail, the higher the optimal quantile level. D. H. Alai (CEPAR, Kent) Lifetime Dependence Modelling 18 July 2017 18 / 23

Optimal Quantiles in General The Pareto distribution is quite unique! The truncation point does not affect the optimal quantile levels. θ can be estimated optimally without knowledge of λs. In general, the truncation point complicates matters significantly. But even τ = 0 does not imply optimal quantile-levels can always be found. We can find optimal quantile levels p 1 and p 2 if we can write β (θ) = f (θ, λ S ) g(p) for some functions f and g. Achievable for the Pareto, Weibull and exponential-pareto distributions. β (θ) ˇp ln ˇp, for the Pareto and exponential-pareto, β (θ) ˇp ln ˇp ln( ln ˇp), for the Weibull. D. H. Alai (CEPAR, Kent) Lifetime Dependence Modelling 18 July 2017 19 / 23

The Bulk Annuity Consider a pool of lives τ X = ( τ X 1,..., τ X n ). A bulk annuity pays 1 to each survivor of the pool at the end of each year. Let τ A denote its value at inception (t = τ) and let τ S t denote the number of survivors in the pool at time t τ. In order to find the mean and variance of τ A, we need to find the distribution of τ S t and the joint distribution of ( τ S t, τ S s ), s > t. If the lives are independent, these can readily be found. What if the lives are dependent? D. H. Alai (CEPAR, Kent) Lifetime Dependence Modelling 18 July 2017 20 / 23

The Impact of Dependence (δ = 0.02, µ = 60, τ = 5) Marginal Moments Independent Pareto Multivariate Pareto n E[ τ X 1 ] Var( τ X 1 ) 1 2 E[ τ A] Var( τ A) 1 2 E[ τ A] Var( τ A) 1 2 2 75.00 17.32 14.38 11.50 14.38 13.11 20 75.00 10.95 154.70 32.79 154.70 52.07 Truncation affects the marginal distributions! Given n, we apply appropriate parameters for a fair comparison. D. H. Alai (CEPAR, Kent) Lifetime Dependence Modelling 18 July 2017 21 / 23

Conclusion Motivation: Provide the means to assess the impact of dependent lifetimes on annuity valuation and risk management. Basis: systematic mortality improvements induce dependence. Could reframe as cohort, or pool of similar-risks, analysis. Investigate a multivariate generalized Pareto distribution because: Interesting family with potential for more flexible dependence. More suitable for older-age dependence due to presence of extremes. Resolve estimation in the presence of truncation (in a variety of ways). Moment-based estimation (applied to the minimum observation). Quantile-based estimation (with optimal levels). Assess the impact of dependence on the risk of a bulk annuity. Dependence increases the risk. D. H. Alai (CEPAR, Kent) Lifetime Dependence Modelling 18 July 2017 22 / 23

Thank you! D. H. Alai (CEPAR, Kent) Lifetime Dependence Modelling 18 July 2017 23 / 23