Financial Econometrics and Volatility Models Extreme Value Theory

Similar documents
MFM Practitioner Module: Quantitiative Risk Management. John Dodson. October 14, 2015

Introduction to Algorithmic Trading Strategies Lecture 10

Extreme Value Theory and Applications

Zwiers FW and Kharin VV Changes in the extremes of the climate simulated by CCC GCM2 under CO 2 doubling. J. Climate 11:

HIERARCHICAL MODELS IN EXTREME VALUE THEORY

Overview of Extreme Value Theory. Dr. Sawsan Hilal space

Bayesian Modelling of Extreme Rainfall Data

Assessing Dependence in Extreme Values

Extreme Value Analysis and Spatial Extremes

Extreme Event Modelling

Fin285a:Computer Simulations and Risk Assessment Section 6.2 Extreme Value Theory Daníelson, 9 (skim), skip 9.5

SYSM 6303: Quantitative Introduction to Risk and Uncertainty in Business Lecture 4: Fitting Data to Distributions

Financial Econometrics and Quantitative Risk Managenent Return Properties

Investigation of an Automated Approach to Threshold Selection for Generalized Pareto

Richard L. Smith Department of Statistics and Operations Research University of North Carolina Chapel Hill, NC

Math 576: Quantitative Risk Management

Bayesian Point Process Modeling for Extreme Value Analysis, with an Application to Systemic Risk Assessment in Correlated Financial Markets

Efficient Estimation of Distributional Tail Shape and the Extremal Index with Applications to Risk Management

Financial Econometrics and Volatility Models Copulas

Extreme Value Theory.

Three hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER.

Extreme value theory and high quantile convergence

EVA Tutorial #2 PEAKS OVER THRESHOLD APPROACH. Rick Katz

EXTREMAL MODELS AND ENVIRONMENTAL APPLICATIONS. Rick Katz

Emma Simpson. 6 September 2013

Analysis methods of heavy-tailed data

Overview of Extreme Value Analysis (EVA)

Bivariate generalized Pareto distribution

Shape of the return probability density function and extreme value statistics

Lecture 2 APPLICATION OF EXREME VALUE THEORY TO CLIMATE CHANGE. Rick Katz

Modelação de valores extremos e sua importância na

RISK AND EXTREMES: ASSESSING THE PROBABILITIES OF VERY RARE EVENTS

Quantile-quantile plots and the method of peaksover-threshold

SOLUTIONS TO MATH68181 EXTREME VALUES AND FINANCIAL RISK EXAM

APPROXIMATING THE GENERALIZED BURR-GAMMA WITH A GENERALIZED PARETO-TYPE OF DISTRIBUTION A. VERSTER AND D.J. DE WAAL ABSTRACT

Models and estimation.

High-frequency data modelling using Hawkes processes

PENULTIMATE APPROXIMATIONS FOR WEATHER AND CLIMATE EXTREMES. Rick Katz

High-frequency data modelling using Hawkes processes

Estimation of Quantiles

SOLUTIONS TO MATH68181 EXTREME VALUES AND FINANCIAL RISK EXAM

1 Degree distributions and data

STATISTICAL METHODS FOR RELATING TEMPERATURE EXTREMES TO LARGE-SCALE METEOROLOGICAL PATTERNS. Rick Katz

Peaks-Over-Threshold Modelling of Environmental Data

MULTIVARIATE EXTREMES AND RISK

Generalized additive modelling of hydrological sample extremes

A Conditional Approach to Modeling Multivariate Extremes

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables and Statistical Tables THE UNIVERSITY OF MANCHESTER.

The Fundamentals of Heavy Tails Properties, Emergence, & Identification. Jayakrishnan Nair, Adam Wierman, Bert Zwart

Quantitative Modeling of Operational Risk: Between g-and-h and EVT

Challenges in implementing worst-case analysis

Extreme Value Theory as a Theoretical Background for Power Law Behavior

Wei-han Liu Department of Banking and Finance Tamkang University. R/Finance 2009 Conference 1

Extreme value modelling of rainfalls and

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

UNIVERSITY OF CALGARY. Inference for Dependent Generalized Extreme Values. Jialin He A THESIS SUBMITTED TO THE FACULTY OF GRADUATE STUDIES

Sharp statistical tools Statistics for extremes

On the Application of the Generalized Pareto Distribution for Statistical Extrapolation in the Assessment of Dynamic Stability in Irregular Waves

FORECAST VERIFICATION OF EXTREMES: USE OF EXTREME VALUE THEORY

Statistical distributions: Synopsis

L-momenty s rušivou regresí

Extreme Values and Their Applications in Finance

Time Series Models for Measuring Market Risk

Does k-th Moment Exist?

Classical Extreme Value Theory - An Introduction

Estimation of risk measures for extreme pluviometrical measurements

RISK ANALYSIS AND EXTREMES

Change Point Analysis of Extreme Values

Parameter Estimation

Bayesian nonparametrics for multivariate extremes including censored data. EVT 2013, Vimeiro. Anne Sabourin. September 10, 2013

simple if it completely specifies the density of x

Estimating Bivariate Tail: a copula based approach

R&D Research Project: Scaling analysis of hydrometeorological time series data

Estimation for Mean and Standard Deviation of Normal Distribution under Type II Censoring

Factor Model Risk Analysis

Spatial and temporal extremes of wildfire sizes in Portugal ( )

A Closer Look at the Hill Estimator: Edgeworth Expansions and Confidence Intervals

Financial Econometrics and Volatility Models Estimation of Stochastic Volatility Models

Extreme Precipitation: An Application Modeling N-Year Return Levels at the Station Level

Generalized Logistic Distribution in Extreme Value Modeling

STAT 6350 Analysis of Lifetime Data. Probability Plotting

Frequency Estimation of Rare Events by Adaptive Thresholding

Journal of Environmental Statistics

Statistics - Lecture One. Outline. Charlotte Wickham 1. Basic ideas about estimation

Using R in Undergraduate Probability and Mathematical Statistics Courses. Amy G. Froelich Department of Statistics Iowa State University

Variable inspection plans for continuous populations with unknown short tail distributions

Extreme value statistics: from one dimension to many. Lecture 1: one dimension Lecture 2: many dimensions

Fitting the generalized Pareto distribution to data using maximum goodness-of-fit estimators

Solutions of the Financial Risk Management Examination

Step-Stress Models and Associated Inference

arxiv: v1 [math.st] 4 Aug 2017

Estimation de mesures de risques à partir des L p -quantiles

Hybrid Censoring; An Introduction 2

Stat 5101 Lecture Notes

MATH68181: EXTREME VALUES FIRST SEMESTER ANSWERS TO IN CLASS TEST

The Goodness-of-fit Test for Gumbel Distribution: A Comparative Study

GARCH Models Estimation and Inference

ACTUARIAL MODELLING OF EXTREMAL EVENTS USING TRANSFORMED GENERALIZED EXTREME VALUE DISTRIBUTIONS AND GENERALIZED PARETO DISTRIBUTIONS

Modelling Multivariate Peaks-over-Thresholds using Generalized Pareto Distributions

Statistics for extreme & sparse data

Transcription:

Financial Econometrics and Volatility Models Extreme Value Theory Eric Zivot May 3, 2010

1 Lecture Outline Modeling Maxima and Worst Cases The Generalized Extreme Value Distribution Modeling Extremes Over a High Threshold Generalized Pareto Distribution Traditional Risk Measures

2 Reading MFTS, chapter 5 FMUND, chapter 7

3 Modeling Maxima and Worst Cases Example: Analysis of daily negative returns on S&P 500 index What is the probability that next year s annual maximum negative return exceeds all previous negative returns? In other words, what is the probability that next year s maximum negative return is a new record? What is the 20-year return level of the negative returns? That is, what is the negative return which, on average, should only be exceeded in one year every twenty years? To answer these questions, the distribution of extreme negative returns on the S&P 500 index is required.

4 The Generalized Extreme Value Distribution Let X 1,X 2,... be iid random variables representing risks or losses with an unknown CDF F (x) =Pr{X i x}. Define M n =max(x 1,...,X n ) as the worst-case loss in a sample of n losses. From the iid assumption, the CDF of M n is Pr{M n x} =Pr{X 1 x,...,x n x} = ny i=1 F (x) =F n (x)

Comments: The empirical distribution function is often a very poor estimator of F n (x) F n (x) 0 or 1 as n

4.1 Fisher-Tippet Theorem Define the standardized maximum value Z n = M n μ n σ n σ n > 0= scale parameter μ n = location parameter Fisher-Tippet Theorem: If the standardized maximum converges to some nondegenerate distribution function, it must be a generalized extreme value (GEV) distribution of the form H ξ (z) = ( exp n (1 + ξz) 1/ξ o exp { exp( z)} ξ 6= 0, 1+ξz > 0 ξ =0, z

Comments The CDF F of the underlying data is in the domain of attraction of H ξ. The Fisher-Tippet Theorem is the analog of the Central Limit Theorem for extreme values. ξ is a shape parameter and determines the tail behavior of H ξ. The parameter α =1/ξ is called the tail index if ξ>0.

4.1.1 GEV Types Result: The tail behavior of the distribution F of the underlying data determines the shape parameter ξ of the GEV distribution If the tail of F declines exponentially, then H ξ is of the Gumbel type and ξ =0. Distributions in the domain of attraction of the Gumbel type are thin tailed distributions such as the normal, log-normal, exponential, and gamma. For these distributions, all moments usually exist

If the tail of F declines by a power function, i.e. 1 F (x) =c x 1/ξ = x α for some constant c, then H ξ is of the Fréchet type and ξ>0. Distributions in the domain of attraction of the Fréchet type include fat tailed distributions like the Pareto, Cauchy, Student-t, alpha-stable with characteristic exponent in (0, 2), as well as various mixture models. Not all moments are finite for these distributions: E[X k ]= for k α =1/ξ. This type is most relevant for financial data.

if the tail of F is finite then H ξ is of the Weibull type and ξ<0. Distributions in the domain of attraction of the Weibull type include distributions with bounded support such as the uniform and beta distributions. All moments exist for these distributions.

4.1.2 Unstandardized Distributions For location and scale parameters μ and σ>0 we have µ x μ H ξ (z) =H ξ = H ξ,μ,σ (x) σ For large enough n ½ ¾ Mn μ Pr {Z n <z} =Pr n <z H ξ (z) σ n Letting x = σ n z + μ n then µ x μn Pr{M n <x} H ξ = H ξ,μn,σ σ n (x) n This result is used in practice to make inferences about the maximum loss M n.

4.2 Maximum Likelihood Estimation Let X 1,...,X T be iid losses with unknown CDF F and let M T denote the sample maximum. Divide the sample into m non-overlapping blocks of essentially equal size n = T/m [X 1,...,X n X n+1,...,x 2n... X (m 1)n+1,...,X mn ] M n (j) = maximum value of X i in block j =1,...,m Construct likelihood for ξ, σ n and μ n from {M n (1),...,M n (m) } Assumed that the block size n is sufficiently large so that the Fisher-Tippet Theorem holds.

The log-likelihood function for ξ 6= 0is l(μ, σ, ξ) = m ln(σ) (1 + 1/ξ) mx i=1 1+ξ mx and is maximized imposing the constraint 1+ξ M(i) n ln 1+ξ i=1 M(i) n σ μ μ > 0 σ M(i) n σ 1/ξ μ

Remarks: For ξ> 0.5 the mles for μ, σ and ξ are consistent and asymptotically normally distributed with asymptotic variance given by the inverse of the observed information matrix The bias of the mle is reduced by increasing the block size n, andthe variance of the mle is reduced by increasing the number of blocks m.

4.3 Example: S&P 500 negative returns The maximum likelihood estimates of ξ, μ and σ based on annual block maxima are ˆξ = 0.334, SE(ˆξ) d =0.208 ˆμ = 1.975, SE(ˆμ) d =0.151 ˆσ = 0.672, SE(ˆσ) d =0.131 What is the probability that next year s annual maximum negative return exceeds all previous negative returns? µ Pr M (29) µ 260 > max M (1) 260,...,M(28) 260 =Pr(M (29) 260 > 6.68) = 1 (6.68) = 0.0267 Hˆξ,ˆμ,ˆσ

4.4 Return Level The kn-block return level, R n,k, is that level which is exceeded in one out of every k blocks of size n. Thatis,R n,k is the loss value such that Pr{M n >R n,k } =1/k R n,k is simply the 1 1/k quantile of limiting GEV distribution: R n,k H 1 ξ,μ,σ (1 1/k) =μ σ ξ ³ 1 ( log(1 1/k)) ξ

5 Modeling Extremes Over High Thresholds Idea: Modeling only block maxima data is inefficient if other data on extreme values are available. A more efficient alternative approach that utilizes more data is to model the behavior of extreme values above some high threshold. This method is often called peaks over thresholds (POT). An advantage of the POT approach is that common risk measures like Value-at-Risk (VaR) and expected shortfall (ES) may easily be computed

5.1 Risk Measures Value-at-Risk (VaR). For 0.95 q<1, say, VaR q is the qth quantile of the loss distribution F where F 1 is the inverse of F. VaR q = F 1 (q) Expected Shortfall (ES). exceeded: ES q is the expected loss size, given that VaR q is Note: ES q is related to VaR q via ES q = E[X X >VaR q ] ES q = VaR q + E[X VaR q X>VaR q ]

5.2 The Generalized Pareto Distribution Let X 1,X 2,... be a sequence of iid random losses with an unknown CDF F and let M n =max{x 1,...,X n }. A natural measure of extreme events are values of the X i that exceed a high threshold u. Define the excedences above the high threhold u: Y = X u, X > u Then the excess distribution (aka tail distribution) above the threshold u is the conditional probability F u (y) =Pr(Y y) =Pr{X u y X >u} = for y>0. F (y + u) F (u), 1 F (u)

Result: For the class of distributions F such that the CDF of the standardized value of M n converges to a GEV distribution, for large enough u there exists a positive function β(u) such that F u (y) is well approximated by the generalized Pareto distribution (GPD) ( 1 (1+ξy/β(u)) for ξ 6= 0 G ξ,β(u) (y) = 1 exp( y/β(u)) for ξ =0, β(u) > 0 defined for y 0 when ξ 0 and 0 y β(u)/ξ when ξ<0 Remarks: Forasufficiently high threshold u, F u (y) G ξ,β(u) (y) for a wide class of loss distributions F. To implement this result, the threshold value u must be specified and estimates of the unknown parameters ξ and β(u) must be obtained.

There is a close connection between the limiting GEV distribution for block maxima and the limiting GPD for threshold excesses. The shape parameter ξ of the GEV distribution is the same shape parameter ξ in the GPD and is independent of the threshold value u. Consider a limiting GPD with shape parameter ξ and scale parameter β(u 0 ) for an excess distribution F u0 with threshold u 0. For an arbitrary threshold u > u 0, the excess distribution F u has a limiting GPD distribution with shape parameter ξ and scale parameter β(u) =β(u 0 )+ ξ(u u 0 ). Alternatively, for any y>0 the excess distribution F u0 +y has a limiting GPD distribution with shape parameter ξ and scale parameter β(u 0 )+ξy

5.3 Mean Excess Function Suppose the threshold excess X u 0 follows a GPD with parameters ξ<1 and β(u 0 ). Then the mean excess over the threshold u 0 is E[X u 0 X>u 0 ]= β(u 0) 1 ξ For any u>u 0,define the mean excess function e(u) as e(u) =E[X u X >u]= β(u 0)+ξ(u u 0 ) 1 ξ Alternatively, for any y>0 e(u 0 + y) =E[X (u 0 + y) X >u 0 + y] = β(u 0)+ξy 1 ξ The mean excess function is a linear function of y = u u 0.

5.3.1 Graphical Diagnostic for determining u 0 Define the empirical mean excess function X e n (u) = 1 n u n u i=1 (x (i) u) where x (i) (i =1,...,n u ) are the values of x i such that x i >u. The mean excess plot is a plot of e n (u) against u and should be linear in u for u>u 0

5.4 Maximum Likelihood Estimation Let x 1,...,x n be iid sample of losses with unknown CDF F. Foragivenu, extreme values are those x i values for which x i u>0. Denote these values x (1),...,x (k) Define the threshold excesses as y i = x (i) u for i =1,...,k. If u is large enough, then {y 1,...,y k } may be thought of as a random sample from a GPD with unknown parameters ξ and β(u). For ξ 6= 0, the log-likelihood function based on the GPD is l(ξ, β(u)) = k ln(β(u)) (1 + 1/ξ) kx i=1 ln(1 + ξy i /β(u))

5.5 Estimating the Tails of the Loss Distribution Forasufficiently high threshold u, F u (y) G ξ,β(u) (y). Seting x = u + y, an approximation to the tails of F (x) for x>uis given by F (x) =(1 F (u))g ξ,β(u) (y)+f (u) Estimate F (u) using the empirical CDF (n k) ˆF (u) = n k = number of exceedences over u

Combine ˆF (u) with (y) to give Gˆξ,ˆβ(u) ˆF (x) =1 k n à 1+ˆξ x u ˆβ(u)! This approximation is used for VaR computations based on the fitted GPD

5.6 Risk Measures Again VaR for GPD. Compute ˆF 1 (q) using VaR d q = u + ˆβ(u) ˆξ µ n k (1 q) ˆξ 1 ES for GPD des q = VaR d q 1 ˆξ + ˆβ(u) ˆξu 1 ˆξ

5.7 Non-Parametric Tail Estimation The shape parameter ξ, or equivalently, the tail index a =1/ξ, ofthegev and GPD distributions may be estimated non-parametrically in a number of ways. A popular method due to Hill (1975) applies to the case where ξ > 0 (α >0) so that the data is generated by some fat-tailed distribution in the domain of attraction of a Fréchet type GEV. Consider a sample of losses X 1,...,X T and define the order statistics as X (1) X (2) X (T )

For a positive integer k, thehill estimator of ξ is defined as ˆξ Hill (k) = 1 k and the Hill estimator of α is kx j=1 ³ log X(j) log X (k) ˆα Hill (k) =1/ˆξ Hill (k)

The Hill estimators of ξ and α depend on the integer k, which plays the same role as k in the analysis of the GPD. It can be shown that if F is in the domain of attraction of a GEV distribution, then ˆξ Hill (k) converges in probability to ξ as k and k n 0, and that ˆξ Hill (k) is asymptotically normally distributed with asymptotic variance avar(ˆξ Hill (k)) = ξ2 k By the delta method, ˆα Hill (k) is asymptotically normally distributed with asymptotic variance avar(ˆα Hill (k)) = α2 k

In practice, the Hill estimators ˆξ Hill (k) or ˆα Hill (k) are often plotted against k to find the value of k such that the estimator appears stable.

5.7.1 Hill Tail and Quantile Estimation Suppose that the loss distribution F is such that 1 F (x) =x α L(x) with α =1/ξ > 0, wherel(x) is a slowly varying function. Let x>x (k+1) where X (k+1) is a high order statistic. The Hill estimator of F (x) is given by ˆF Hill (x) =1 k n x X (k+1) ) ˆα Hill (k), x > X (k+1) Inverting the Hill tail estimator gives the Hill quantile estimator µ n ˆξ Hill k (1 q) (k) 1 ˆx Hill q,k where q>1 k/n. = X (k+1) X (k+1)