Challenges in implementing worst-case analysis

Similar documents
Tail Index Estimation: Quantile Driven Threshold Selection. Working Paper

Why risk is so hard to measure. Jon Danielsson Chen Zhou

Comparing downside risk measures for heavy tailed distributions

Extreme Value Theory as a Theoretical Background for Power Law Behavior

A NOTE ON SECOND ORDER CONDITIONS IN EXTREME VALUE THEORY: LINKING GENERAL AND HEAVY TAIL CONDITIONS

Extreme Value Analysis and Spatial Extremes

Does k-th Moment Exist?

Financial Econometrics and Volatility Models Extreme Value Theory

Efficient Estimation of Distributional Tail Shape and the Extremal Index with Applications to Risk Management

EXTREMAL QUANTILES OF MAXIMUMS FOR STATIONARY SEQUENCES WITH PSEUDO-STATIONARY TREND WITH APPLICATIONS IN ELECTRICITY CONSUMPTION ALEXANDR V.

Introduction to Algorithmic Trading Strategies Lecture 10

Overview of Extreme Value Theory. Dr. Sawsan Hilal space

Using a Bootstrap Method to Choose the Sample Fraction in Tail Index Estimation 1

Discussion on Human life is unlimited but short by Holger Rootzén and Dmitrii Zholud

Zwiers FW and Kharin VV Changes in the extremes of the climate simulated by CCC GCM2 under CO 2 doubling. J. Climate 11:

Bayesian Point Process Modeling for Extreme Value Analysis, with an Application to Systemic Risk Assessment in Correlated Financial Markets

arxiv: v1 [q-fin.cp] 23 Jan 2012

Estimation de mesures de risques à partir des L p -quantiles

On Backtesting Risk Measurement Models

Extreme Values and Their Applications in Finance

MFM Practitioner Module: Quantitiative Risk Management. John Dodson. October 14, 2015

A MODIFICATION OF HILL S TAIL INDEX ESTIMATOR

On the Estimation and Application of Max-Stable Processes

Estimation of risk measures for extreme pluviometrical measurements

CONTAGION VERSUS FLIGHT TO QUALITY IN FINANCIAL MARKETS

Classical Extreme Value Theory - An Introduction

The Behavior of Multivariate Maxima of Moving Maxima Processes

Shape of the return probability density function and extreme value statistics

Financial Econometrics Return Predictability

On the Application of the Generalized Pareto Distribution for Statistical Extrapolation in the Assessment of Dynamic Stability in Irregular Waves

Extreme Value Theory.

Applying the proportional hazard premium calculation principle

Refining the Central Limit Theorem Approximation via Extreme Value Theory

Quantile-quantile plots and the method of peaksover-threshold

SFB 823. A simple and focused backtest of value at risk. Discussion Paper. Walter Krämer, Dominik Wied

Do Markov-Switching Models Capture Nonlinearities in the Data? Tests using Nonparametric Methods

A class of probability distributions for application to non-negative annual maxima

Statistics - Lecture One. Outline. Charlotte Wickham 1. Basic ideas about estimation

Contributions for the study of high levels that persist over a xed period of time

Bias-corrected goodness-of-fit tests for Pareto-type behavior

ESTIMATING BIVARIATE TAIL

Models and estimation.

Research Article Strong Convergence Bound of the Pareto Index Estimator under Right Censoring

A Closer Look at the Hill Estimator: Edgeworth Expansions and Confidence Intervals

Asymmetric Dependence, Tail Dependence, and the. Time Interval over which the Variables Are Measured

Stat 5101 Lecture Notes

Expected Shortfall is not elicitable so what?

Estimation of Operational Risk Capital Charge under Parameter Uncertainty

Pareto approximation of the tail by local exponential modeling

Elicitability and backtesting

Extreme L p quantiles as risk measures

Long memory and changing persistence

Richard L. Smith Department of Statistics and Operations Research University of North Carolina Chapel Hill, NC

Market Risk. MFM Practitioner Module: Quantitiative Risk Management. John Dodson. February 8, Market Risk. John Dodson.

Bayesian Modelling of Extreme Rainfall Data

Statistics for Managers Using Microsoft Excel Chapter 9 Two Sample Tests With Numerical Data

Weak convergence to the t-distribution

WEIBULL RENEWAL PROCESSES

Rethinking Inequality Decomposition: Comment. *London School of Economics. University of Milan and Econpubblica

Bivariate generalized Pareto distribution

The extremal elliptical model: Theoretical properties and statistical inference

The Modelling of Rare Events: from methodology to practice and back. Paul Embrechts

Robust Backtesting Tests for Value-at-Risk Models

The Law of Large Numbers in the Real World. Nassim Nicholas Taleb Extreme Risk Ini=a=ve, NYU School of Engineering

Risk Aggregation with Dependence Uncertainty

Change Point Analysis of Extreme Values

Information Content Change under SFAS No. 131 s Interim Segment Reporting Requirements

Expected Shortfall is not elicitable so what?

Backtesting Marginal Expected Shortfall and Related Systemic Risk Measures

QED. Queen s Economics Department Working Paper No. 1244

1 Degree distributions and data

Quantitative Modeling of Operational Risk: Between g-and-h and EVT

Probabilities & Statistics Revision

arxiv:physics/ v1 [physics.soc-ph] 18 Aug 2006

HIERARCHICAL MODELS IN EXTREME VALUE THEORY

Asymptotic distribution of the sample average value-at-risk

Extreme Value Theory and Applications

Pitfalls in Using Weibull Tailed Distributions

University of Pretoria Department of Economics Working Paper Series

APPROXIMATING THE GENERALIZED BURR-GAMMA WITH A GENERALIZED PARETO-TYPE OF DISTRIBUTION A. VERSTER AND D.J. DE WAAL ABSTRACT

On the estimation of the second order parameter in extreme-value theory

Introduction & Random Variables. John Dodson. September 3, 2008

Financial Econometrics and Volatility Models Copulas

Asymptotics of Efficiency Loss in Competitive Market Mechanisms

Analysis methods of heavy-tailed data

Birkbeck Working Papers in Economics & Finance

A Partially Reduced-Bias Class of Value-at-Risk Estimators

Published: 26 April 2016

Abstract: In this short note, I comment on the research of Pisarenko et al. (2014) regarding the

Volatility. Gerald P. Dwyer. February Clemson University

Nonparametric estimation of extreme risks from heavy-tailed distributions

On the occurrence times of componentwise maxima and bias in likelihood inference for multivariate max-stable distributions

Estimating Expected Shortfall Using a Conditional Autoregressive Model: CARES

The Instability of Correlations: Measurement and the Implications for Market Risk

SPECIFICATION TESTS IN PARAMETRIC VALUE-AT-RISK MODELS

Modelling Operational Risk Using Bayesian Inference

AN ASYMPTOTICALLY UNBIASED MOMENT ESTIMATOR OF A NEGATIVE EXTREME VALUE INDEX. Departamento de Matemática. Abstract

Solutions of the Financial Risk Management Examination

A world-wide investigation of the probability distribution of daily rainfall

MEAN-OF-ORDER-p LOCATION-INVARIANT EX- TREME VALUE INDEX ESTIMATION

Transcription:

Challenges in implementing worst-case analysis Jon Danielsson Systemic Risk Centre, lse,houghton Street, London WC2A 2AE, UK Lerby M. Ergun Systemic Risk Centre, lse,houghton Street, London WC2A 2AE, UK Casper G. de Vries School of Economics, Erasmus University Rotterdam, 3000 DR Rotterdam, The Netherlands March 1, 2017 Abstract Worst-case analysis has increased in popularity among financial regulators in the wake of the recent financial crisis. In this paper we provide insight into this measure and provide some guidance on how to estimate it. We derive the bias for the non-parametric heavy tailed order-statistics and contrast it with the semi-parametric EVT approach. We find that if the return distribution has a heavy tail, the non-parametric worst-case analysis, i.e the minimum of the sample, is always downwards biased. Relying on semi-parametric EVT reduces the bias considerably in the case of relatively heavy tails. But for the less heavy tails this relationship is reversed. Estimates for a large sample of US stock returns indicates that this pattern in the bias is also present in financial data. With respect to risk management, this induces an overly conservative capital allocation Keywords: Worst-case analysis, EVT, quantile estimator, risk management JEL codes: C01, C14, C58 Corresponding author: 2AE, UK m.l.ergun@lse.ac.uk, Houghton Street, London WC2A 1

1 Introduction Worst-case analysis studies the worst expected outcome over a predetermined time length, with a typical question what is the worst daily market outcome in 10 years or 2,500 days. Worst-case analysis is increasingly common due to the recent financial crises. For instance, European Insurance and Occupational Pensions Authority (EIOPA) has embraced the worst-case analysis as an additional measure of risk for the insurance and pension industry. In spite of its increasing importance, little is known about worst-case analysis and what the appropriate way is to measure it. In this paper we provide insight in the bias of worst-case estimates and measure their effect for US stock returns. There are generally three main approaches. The simplest, and the most obvious, is to directly read the object of interest from the empirical distribution, in our case the historical minima, the non-parametric approach (EA). One can also assume a model only for the tail of the distribution and not model the center of the distribution, the semi-parametric approach (SP). The third approach is based on specifying a parametric distribution for all outcomes and estimate its parameters. Of these three alternatives, the last is the only one that cannot be recommended. The reason is that the estimates will be dominated by the center of the distribution, so that the fit is optimal for a typical observation, not the lowest, the focus of our interest. Therefore, such an approach would in most cases deliver less precise and more uncertain worst-case estimators than either the EA or the SP. We therefore focus on EA and the SP estimators and provide guidance on the appropriate use of either. 1.1 Statistical background To derive the bias of the worst observed observation as a worst-case estimator, we start with a relatively general approach. Begin by deriving the distribution of observations in an ordered sample. Suppose we observe some i.i.d heavy tailed random variable Y 1,..., Y n with distribution F, where 1 F (tx) lim t 1 F (t) = xα, α > 0. (1) The sorted sample, i.e order-statistics, can be represented as, max (Y 1,..., Y n ) = X (1,n) X (2,n) X (n,n) = min (Y 1,..., Y n ) 2

The distribution of the order-statistics can be studied through the number of exceedances. These follow a binomial distribution, k 1 ( ) n G (k,n) (x) = [1 F (x)] r [F (x)] n r. (2) r r=0 Suppose one is interested in the distribution of the maximum realization: Pr (max (Y 1,..., Y n ) < x) = G (1,n) (x) = [F (x)] n. (3) Similarly to the standard central limit theorem for the asymptotic distribution of the arithmetic mean, Fisher and Tippett (1928) and Gnedenko (1943) provide a limit theorem for the asymptotic distribution of the maximum, i.e extreme value theory (EVT). EVT gives the conditions under which there exist sequences b n and a n such that lim n [F (a nx + b n )] n G (1,n) (x), where G (1,n) (x) is the Fréchet distribution. Theorem 2.2.2 in Leadbetter et al. (1983) extends the EVT for the maximum to lower order-statistics by means of the Poisson property of the lower order-statistics. In particular, the asymptotic distribution of the k th largest order-statistic is: ( [ ]) s k 1 log G (1,n) (x) G (k,n) (x) G (1,n) (x). (4) s! s=0 From (4) we determine the expectation of the order-statistics E [ X (k,n)] = a [ n [k 1]! Γ k 1 ]. (5) α To determine this expectation for a specific heavy tailed distribution a n needs to be chosen appropriately. 1 To find a good approximation, we use the first order term of the Hall expansion (Hall and Welsh, 1985): Pr (Y y) = F( y) = Ay α [1 + By β + o(y β )]. (6) 1 For the heavy tailed distributions b n = 0. 3

For the Pareto distribution, F ( y) = Ay α, we observe that the Hall expansion perfectly fits the first order term. 2 For the Pareto distribution the scaling constant a n is (An) 1 α, where A is the scale parameter. Therefore, we can, through the Hall expansion, extract a good approximation of the expectation of the order-statistics of heavy tailed distributions. 1.2 The semi parametric approach To contrast the expectation of the maximum observation, we compare it to a semi-parametric estimator of the worst-case. By inverting the first order t/n expansion in (6), using the empirical counter part of A = measured X (t,n) α t at some threshold t and F ( y) = k /n, one obtains the semi-parametric tail quantile estimator ( ) 1 ˆx ( n k/n) t SP (t) = X (t,n) α t. k Goldie and Smith (1987) derive the distribution of the semi-parametric quantile estimator ( ) t ˆx ( n k/n) ( SP (t) sign (B) 1 N, 1 ), (7) log (t/np) x (p) 2βα α 2 where B and β are the second order scale and shape parameters from (6). 1.3 Comparing the empirical- and semi-parametric quantile estimator The two approaches, the EA and the SP, have each their own pros and cons. While the EA is much simpler to implement, the SP might be more accurate because it uses more observations in the estimation, and therefore might result in an estimate with a lower bias and uncertainty. However, the SP is dependent on correctly specifying the semi-parametric distribution and identifying the threshold X (t,n). To shed more light on the use of these two estimators, we compare their bias 2 All of the standard heavy tailed distributions satisfy the Hall expansion. It even applies to the GARCH(1,1) unconditional distribution. 4

at p = 1 /n. From (5) and (7) the bias of the two approaches are: ( ) ˆx ( n k/n) SP (t) sign (B) log (k) 1. SP (8) x (p) 2βα k ( ) ˆx ( n k/n) EA 1 1 [ x (p) [k 1]! Γ k 1 ] 1. EA (9) α (8) and (9) indicate that neither approach is dominating in all circumstances. For the EA the asymptotic bias approaches infinity as α approaches 1. However, as α increases the Γ function decreases rapidly. 3 For the SP estimator the bias is relatively small for moderate values of β and t. This leads to a crossing point in the bias of the two estimators with respect to α. Given values of t and β, we define switching point α. For α < α the absolute bias of the SP estimator is smaller than the EA estimator. When α > α, the relationship is reversed. For a fixed t, this relationship is depicted in Figure 1. This figure portraits at which combination of α and β the bias of the nonparametric worst-case estimator becomes smaller than the semi-parametric approach. For example, in the case of the family of Student-t distributions α [1, ) and β = 2. From Figure 1, we read that for t = e 2 the switching of the biases occurs around α 5. For higher and lower values of t, the α increases. For the family of symmetric stable distributions, the bias is always smaller for the SP, as β = α and α < 2. 2 Application In this section we investigate the bias in the EA and SP estimators for US stocks, using the CRSP security database. For this empirical application we use the Hill estimator to estimate the tail exponent α. This estimator depends on a selection of a high order-statistic as a threshold, i.e X (t,n). This nuisance statistic is obtained by the KS-distance metric. 4 Given the estimate of the tail index, the quantile can be estimated semi-parametrically. We compare the difference between the previously introduced worst-case estimators 3 The bias of EA is invariant to dependence in the time-series as measured by the extremal index. See Leadbetter et al. (1983) page 416. 4 The KS-distance metric chooses the threshold which minimizes the maximum quantile distance between the empirical and Pareto distribution.this approach is further explained in Danielsson et al. (2015). 5

Figure 1: Bias comparison This Figure depicts the area where the absolute bias of the semi-parametric estimator becomes larger than the bias of the order statistics (colored region). The biases of the estimators are at p = 1 /n as in (8) and (9). For this figure we fix t at e 2. To the right of the lines the combination of α and β produces a larger bias for the semi-parametric approach. The dotted line shows where the boundary shifts to when the threshold t is doubled to e 2. for each stock at the 1 /n quantile. These differences are collected in different buckets sorted by the ˆα of each stock. This way we are able to determine a switching point in the size of the biases between the two estimators. The theoretical results stipulate that the relative size of the bias changes as a function of ˆα. Table 1 portraits this pattern for the securities in the CRSP database. For these stocks the switch point is around ˆα = 3. It is difficult to determine the exact switch point for real data. This is because β, in the bias of the semi-parametric quantile estimator, is not estimated. In addition, the Hill estimator is estimated with a bias. This makes it difficult to determine the exact switch point. It is encouraging that we see a monotonic decline in the average difference as ˆα increases. This is supportive of the result that 6

Table 1: CRSP data All ˆα < 2 2 < ˆα < 3 3 < ˆα < 4 4 < ˆα < 5 ˆα > 5 r = 2, 230 r = 13 r = 712 r = 1, 001 r = 442 r = 62 Mean -0.003 0.074 0.010-0.005-0.017-0.026 Median -0.011 0.074 0.005-0.010-0.017-0.026 Ste. dev. 0.031 0.072 0.039 0.022 0.019 0.018 1% quantile -0.059-0.038-0.063-0.051-0.063-0.085 99% quantile 0.109 0.194 0.138 0.064 0.045 0.031 Rank sum test 0.070 0.305 0.397 0.162 0.003 0.047 This table reports the difference between the largest order statistic and semi-parametric quantile estimator for US stocks. These are stocks selected from the CRSP database. The securities need to be traded on NYSE, AMEX, NASDAQ, and NYSE Arca exchanges over the period from 01-01-1995 till 01-01-2011. The table reports various statistics on the distribution of the difference between the two estimators. Here r is the number of different stocks in the different buckets. To determine the number of order statistics for the Hill estimator we use the KS distance metric described in Danielsson et al. (2016). the bias of the EVT based quantile estimator overtakes the bias of the nonparametric quantile estimator. The results for the median convey the same story. The 1% and 99% quantiles of the buckets show that although the mean and median showcase a switch between the severity of the bias of the quantile estimators, this might be statistically insignificant. Therefore, we employ the rank-sum test to test for the difference in size of the observations of the empirical distribution from the SP and EA estimator. We find that for the lighter heavy tailed stock returns the estimates from the estimators are significantly different from one another. The empirical distribution of the semi-parametric quantile estimator tends to have larger values than the distribution of the non-parametric quantile estimator. For low values of ˆα the difference is in the expected direction, but insignificant. 3 Conclusion With worst-case analysis becoming increasingly common in both policymaking and practice, it is of interest to evaluate the qualities of common methods for such applications. The simplest, and perhaps the most common way is to estimate the worst-case by taking the most negative outcome in the historical sample. Alternatively, one could estimate the lower tail of the distribution by semi-parametric methods and use that to calculate the worst-case. In our main conclusion, either method is best, depending on how heavy 7

the tails are and their specific shape. Generally, for the heaviest, the semiparametric approach is best, and as it thins, the historical minima eventually becomes better. 4 Acknowledgements This work was supported by the Economic and Social Research Council (ESRC) in funding the Systemic Risk Centre [grant number ES/K002309/1] and the Netherlands Organisation for Scientific Research Mozaiek grant [grant number: 017.005.108] References Danielsson, J., L. M. Ergun, L. De Haan, and C. De Vries (2016). Tail index estimation: Quantile driven threshold selection. SRC Working paper. Fisher, R. A. and L. Tippett (1928). Limiting forms of the frequency distribution of the largest of smallest member of a sample. In Proceedings of the Cambridge Philosophical Society, Volume 24, pp. 180 190. Gnedenko, B. (1943). Sur la distribution limite du terme maximum d une serie aleatoire. Annals of Mathematics 44 (1), 423 453. Goldie, C. M. and R. L. Smith (1987). Slow variation with remainder: Theory and applications. The Quarterly Journal of Mathematics 38 (1), 45 71. Hall, P. and A. Welsh (1985). Adaptive estimates of parameters of regular variation. The Annals of Statistics 13 (1), 331 341. Leadbetter, M. R., G. Lindgren, and H. Rootzén (1983). Extremes and related properties of random sequences and processes. Springer Series in Statistics. 8