Change Point Analysis of Extreme Values

Similar documents
Change Point Analysis of Extreme Values

Change Point Analysis of Extreme Values

A Closer Look at the Hill Estimator: Edgeworth Expansions and Confidence Intervals

Spatial and temporal extremes of wildfire sizes in Portugal ( )

Sharp statistical tools Statistics for extremes

Variable inspection plans for continuous populations with unknown short tail distributions

Extreme Value Theory and Applications

NONPARAMETRIC ESTIMATION OF THE CONDITIONAL TAIL INDEX

Introduction to Algorithmic Trading Strategies Lecture 10

Analysis methods of heavy-tailed data

Financial Econometrics and Volatility Models Extreme Value Theory

APPROXIMATING THE GENERALIZED BURR-GAMMA WITH A GENERALIZED PARETO-TYPE OF DISTRIBUTION A. VERSTER AND D.J. DE WAAL ABSTRACT

Modelling Risk on Losses due to Water Spillage for Hydro Power Generation. A Verster and DJ de Waal

Goodness-of-fit testing and Pareto-tail estimation

Estimation de mesures de risques à partir des L p -quantiles

Modelação de valores extremos e sua importância na

Quantile-quantile plots and the method of peaksover-threshold

Department of Econometrics and Business Statistics

Overview of Extreme Value Theory. Dr. Sawsan Hilal space

Pitfalls in Using Weibull Tailed Distributions

On the estimation of the second order parameter in extreme-value theory

Extreme Value Theory as a Theoretical Background for Power Law Behavior

Extreme L p quantiles as risk measures

A Robust Estimator for the Tail Index of Pareto-type Distributions

CHANGE DETECTION IN TIME SERIES

Emma Simpson. 6 September 2013

ESTIMATING BIVARIATE TAIL

Modelling Multivariate Peaks-over-Thresholds using Generalized Pareto Distributions

Assessing Dependence in Extreme Values

STAT 461/561- Assignments, Year 2015

Practical conditions on Markov chains for weak convergence of tail empirical processes

The Convergence Rate for the Normal Approximation of Extreme Sums

Estimation of the second order parameter for heavy-tailed distributions

Math 576: Quantitative Risk Management

CONTAGION VERSUS FLIGHT TO QUALITY IN FINANCIAL MARKETS

Extreme Value Analysis and Spatial Extremes

MFM Practitioner Module: Quantitiative Risk Management. John Dodson. October 14, 2015

Estimation of the extreme value index and high quantiles under random censoring

Does k-th Moment Exist?

Bayesian Point Process Modeling for Extreme Value Analysis, with an Application to Systemic Risk Assessment in Correlated Financial Markets

Estimation of risk measures for extreme pluviometrical measurements

A THRESHOLD APPROACH FOR PEAKS-OVER-THRESHOLD MODELING USING MAXIMUM PRODUCT OF SPACINGS

Shape of the return probability density function and extreme value statistics

Bias-corrected goodness-of-fit tests for Pareto-type behavior

arxiv: v1 [stat.me] 18 May 2017

EXAMINERS REPORT & SOLUTIONS STATISTICS 1 (MATH 11400) May-June 2009

SIMULTANEOUS TAIL INDEX ESTIMATION

Bias Reduction in the Estimation of a Shape Second-order Parameter of a Heavy Right Tail Model

Wei-han Liu Department of Banking and Finance Tamkang University. R/Finance 2009 Conference 1

Estimating Bivariate Tail: a copula based approach

Overview of Extreme Value Analysis (EVA)

Investigation of an Automated Approach to Threshold Selection for Generalized Pareto

Non-parametric Inference and Resampling

Richard L. Smith Department of Statistics and Operations Research University of North Carolina Chapel Hill, NC

Problem 1 (20) Log-normal. f(x) Cauchy

APPLICATION OF EXTREMAL THEORY TO THE PRECIPITATION SERIES IN NORTHERN MORAVIA

A Conditional Approach to Modeling Multivariate Extremes

Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics. Jiti Gao

A NOTE ON SECOND ORDER CONDITIONS IN EXTREME VALUE THEORY: LINKING GENERAL AND HEAVY TAIL CONDITIONS

Lecture 2: Basic Concepts and Simple Comparative Experiments Montgomery: Chapter 2

STAT 6350 Analysis of Lifetime Data. Probability Plotting

Contributions for the study of high levels that persist over a xed period of time

PARAMETER ESTIMATION FOR THE LOG-LOGISTIC DISTRIBUTION BASED ON ORDER STATISTICS

Model Selection Tutorial 2: Problems With Using AIC to Select a Subset of Exposures in a Regression Model

Nonlinear Time Series Modeling

Pareto approximation of the tail by local exponential modeling

Nonparametric Estimation of Extreme Conditional Quantiles

Bayesian Modelling of Extreme Rainfall Data

Empirical Tail Index and VaR Analysis

Estimation of Quantiles

AN ASYMPTOTICALLY UNBIASED MOMENT ESTIMATOR OF A NEGATIVE EXTREME VALUE INDEX. Departamento de Matemática. Abstract

On the estimation of the heavy tail exponent in time series using the max spectrum. Stilian A. Stoev

Mathematics Qualifying Examination January 2015 STAT Mathematical Statistics

STT 843 Key to Homework 1 Spring 2018

Quantitative Modeling of Operational Risk: Between g-and-h and EVT

ON THE ESTIMATION OF EXTREME TAIL PROBABILITIES. By Peter Hall and Ishay Weissman Australian National University and Technion

If we want to analyze experimental or simulated data we might encounter the following tasks:

Stable Limit Laws for Marginal Probabilities from MCMC Streams: Acceleration of Convergence

A TEST OF FIT FOR THE GENERALIZED PARETO DISTRIBUTION BASED ON TRANSFORMS

Estimation of the functional Weibull-tail coefficient

Sequential Procedure for Testing Hypothesis about Mean of Latent Gaussian Process

Tail dependence coefficient of generalized hyperbolic distribution

Estimation of spatial max-stable models using threshold exceedances

The Behavior of Multivariate Maxima of Moving Maxima Processes

Complex Systems Methods 11. Power laws an indicator of complexity?

Semi-parametric tail inference through Probability-Weighted Moments

NCoVaR Granger Causality

SYSM 6303: Quantitative Introduction to Risk and Uncertainty in Business Lecture 4: Fitting Data to Distributions

Empirical likelihood and self-weighting approach for hypothesis testing of infinite variance processes and its applications

Return periods of losses associated with European windstorm series in a changing climate

EVA Tutorial #2 PEAKS OVER THRESHOLD APPROACH. Rick Katz

arxiv: v1 [math.st] 4 Aug 2017

Tail Index Estimation of Heavy-tailed Distributions

The Bootstrap: Theory and Applications. Biing-Shen Kuo National Chengchi University

Is there evidence of log-periodicities in the tail of the distribution of seismic moments?

Extreme Event Modelling

Challenges in implementing worst-case analysis

1 Degree distributions and data

Math 494: Mathematical Statistics

Chapter 6. Order Statistics and Quantiles. 6.1 Extreme Order Statistics

Transcription:

Change Point Analysis of Extreme Values Lisboa 2013 p. 1/3 Change Point Analysis of Extreme Values Goedele Dierckx Economische Hogeschool Sint Aloysius, Brussels, Belgium Jef L. Teugels Katholieke Universiteit Leuven, Belgium

Change Point Analysis of Extreme Values Lisboa 2013 p. 2/3 2.2. CATASTROPHIC LOSSES (2012) EVENT T DATE LOSS EVENT T DATE LOSS Katrina H 35,65 74.686 Japan TS 41,19 35.000 Andrew H 22,64 25.641 WTC-attack M 31,70 23.848 Northridge E 24,05 21.239 Ike H 38,68 21.141 Ivan H 34,67 15350 Wilma H 35,80 14.468 Thailand F 41,57 12.000 N.Zealand E 41,14 12.000 Rita H 35,72 11.625 Charley H 34,61 9.583 Mireille T 21,74 8.899 Hugo H 19,71 8.292 Chile TS 40,16 8.248 Daria ES 20,07 8.036 Lothar ES 29,98 7.830 Alabama T 41,31 7.300 Missouri T 41,41 7.050 Kyrill ES 37,05 6.609 X 100 ES 17,79 6.135 Frances H 36,65 6.127 Vivian ES 35,15 5.491 Bart T 29,73 5.454 Irene H 41,64 5.300 N.Zealand E 40,68 5.155 Georges H 28,72 4.870 Allison S 31,43 4.577 Jeanne H 34,70 4.527 Songda T 34,68 4.268

Change Point Analysis of Extreme Values Lisboa 2013 p. 3/3 X 106 US 33,33 3.918 Floyd H 29,69 3.810 Opal H 25,75 3.697 Kobe, Japan E 25,51 3.648 Klaus ES 39,07 3.418 Martin S 29,99 3.240 X 105 US 23,19 3.055 X 101 ES 32,60 2.866 Xynthia S 40,16 2.840 US F 21,05 2.805 X 102 US 31,26 2.792 X 103 ES 37,48 2.696 Isabel H 33,71 2.659 Fran H 26,68 2.605 Anatol ES 29,92 2.569 Iniki H 22,70 2.563 Frederic H 9,66 2.472 X 104 ES 35,35 2.450 Petro US M 19,81 2.404 Tsunami E 34,99 2.380 Fifi S 4,71 2.279 X 105 ES 27,51 2.240 Luis H 25,67 2.212 Erwin ES 35,02 2.168 X 107 US 32,32 2.093 Gilbert H 18,69 2.077 X 108 US 29,34 2.004 X 109 US 13,96 1.984 X 110 US 33,26 1.968 X 1 US 4,25 1.961 Mississippi F 3,31 1.871 X 111 US 28,37 1.813 Loma Pieta E 19,79 1.813 Celia H 0,59 1.795 Vicki T 28,72 1.761 Fertilizer M 31,73 1.723 X 112 US 28,01 1.697 X 113 US 25,34 1.674 Grace H 21,80 1.650

Change Point Analysis of Extreme Values Lisboa 2013 p. 4/3 Estimates of extreme value index AMSE estimates of extreme value index gamma 0.0 0.2 0.4 0.6 0.8 1.0 AMSE gamma 0.0 0.05 0.10 0.15 0.20 0.25 0 10 20 30 40 k 0 10 20 30 40 k On the left four different estimates for γ. We choose the Hill-estimate. The choice k = 16 minimizes the empirical mean square error. The estimated value of γ is ˆγ = 0.69 with a 95%-confidence interval [0.50 1.02]. This corresponds to a distribution with finite mean but with infinite variance.

Change Point Analysis of Extreme Values Lisboa 2013 p. 5/3 Overview Motivating example 1. Introduction 2. Test statistic (a) Construction (b) Extreme value situation (c) Asymptotics (d) Practical procedure 3. Examples (a) Simulation (b) Malaysian Stock Index. Classical Approach (c) Malaysian Stock Index. Improved Approach (d) Nile Data (e) Swiss-Re Catastrophic Data (f) Further Examples 4. Conclusions 5. References

Change Point Analysis of Extreme Values Lisboa 2013 p. 6/3 1. INTRODUCTION We start with an example where a change point has occurred. 987 measurements of the Daily Stock Market Returns of the Malaysian Stock Index. Jan. 1995 Dec. 1998, covering the Asian financial crisis, July 1997. -0.1 0.0 0.1 0.2 Changes in distribution? 1 250 500 750 987 1/1/95 10/1/96 14/1/97 15/1/98 31/12/98 in parameters of a distribution? central behavior? tail behavior?

Change Point Analysis of Extreme Values Lisboa 2013 p. 7/3 2. TEST STATISTIC 2.a. Construction of Test Statistic Start with a sample X 1,..., X m, X m +1,... X n, from a density function f(x; θ i, η). Csörgő and Horváth (1997) test whether θ i changes at some point m H 0 : θ 1 = θ 2 =... = θ n versus H 1 : θ 1 =... = θ m θ m +1 =... = θ n for some m. using the test statistic Z n = max 1 m<n ( 2log Λ m), where Λ m = sup θ,η n i=1 f(x i; θ, η) sup θ,τ,η m i=1 f(x i; θ, η) n i=m+1 f(x i; τ, η).

Change Point Analysis of Extreme Values Lisboa 2013 p. 8/3 Example For the exponential distribution where X i has mean θ i 2log Λ m = 2 m log 1 m m X i (n m)log i=1 1 n m n i=m+1 X i + n log 1 n n i=1 X i For large n, m and n m one can expect normal behaviour expressed in terms of Brownian motions.

Change Point Analysis of Extreme Values Lisboa 2013 p. 9/3 2.b. Extreme Value Situation Put the data in increasing order: X 1,n X 2,n... X n,n. Hence X n,n is the maximum in a sample of independent random variables with a common distribution. The maximum domain of attraction condition is expressed as lim P n ( Xn,n b n a n where γ is a real-valued extreme value index and ) x = G γ (x). G γ (x) = exp {1 + γx} 1/γ + an extremal law. When γ > 0 we end up with heavy right-tailed distributions, the Pareto-Fréchet Case.

Change Point Analysis of Extreme Values Lisboa 2013 p. 10/3 For large values, log of Pareto-type with extreme value index γ is close to an exponential with mean γ. The most classical approach for the estimation of the extreme value index γ > 0 is to use the Hill estimator: H k,n = 1 k k log X n i+1,n log X n k,n. i=1 Hence, only a segment of the available data is used. The determination of the quantity k is important. The Hill estimator has small bias but large variance for small k large bias but small variance for large k. As a compromise we select k such that the empirical mean squared error is minimal.

Change Point Analysis of Extreme Values Lisboa 2013 p. 11/3 We concentrate on two specific examples that regularly appear in extreme value analysis. X has a Pareto-type distribution with parameter θ = γ, when the relative excesses of X over a high threshold u, given that X exceeds u, satisfy the condition P ( ) X u > x X > u x 1 γ, u, X follows a Generalized Pareto distribution (GPD) with parameter θ = (γ, σ) if the behavior of the absolute excesses over a high threshold u, given that X exceeds u, satisfies the condition P (X u > x X > u) ( 1 + γx ) 1 γ, u. σ

Change Point Analysis of Extreme Values Lisboa 2013 p. 12/3 1. Pareto-type density Suppose X 1,..., X m, X m+1,... X n are independent and Pareto-type distributed. We denote the extreme value index for X i by γ i. In order to determine whether the index γ changes or not, we perform the following test H 0 : γ 1 = γ 2 =... = γ n = γ versus H 1 : γ 1 = γ 2 =... = γ m γ m +1 =... = γ n 1 = γ n for some m. Hence Z n = where in turn max 1 m<n ( 2log Λ m) log Λ m = [ k 1 log H k1,m + (k k 1 ) log H k k1,n m k log H k,n ] [ ] 1 ( ) + k1 H k1,m + (k k 1 )H k k1,n m kh k,n H k,n.

Change Point Analysis of Extreme Values Lisboa 2013 p. 13/3 2. GPD. Suppose now that X i is GPD with parameters θ i = (γ i, σ i ).To perform the test H 0 : θ 1 = θ 2 =... = θ n versus H 1 : θ 1 =... = θ m θ m +1 =... = θ n for some m we use as test statistic Z n = max 1 m<n ( 2 log Λ m), where 2 log Λ m = 2 [ ] L k1 (ˆθ k1 ) + L + k (ˆθ + 1 k ) L 1 k (ˆθ k ) ( ) 1 m ( ) x L m (ˆθ m ) = m log ˆσ m + 1 log 1 + ˆγ m ˆγ m ˆσ i=1 m ( ) 1 L + m (ˆθ m + ) = (n m)log ˆσ+ m n ( + 1 log 1 + ˆγ m + ˆγ + m i=m+1 x ˆσ + m ) and likelihood estimators (ˆγ m, ˆσ m ) resp. (ˆγ + m, ˆσ + m) based on X 1, X 2,..., X m and X m+1,... X n are obtained by numerical procedures.

Change Point Analysis of Extreme Values Lisboa 2013 p. 14/3 2.c. Asymptotics Using the procedure suggested by Csörgő and Horváth we proved the following result. Theorem Suppose X 1,..., X m, X m+1,... X n are independent and identically distributed. Define Z n = max c n m<n d n ( 2 log Λ m ), with 2log Λ m as before. Let n, k such that k/n 0. Let further c n and d n be intermediate sequences for which c n /n 0 and d n /n 0. Then, under H 0 of our test, Z n d sup 0 t<1 sup 0 t<1 B 2 (t) t(1 t) if Pareto-type, B 2 2 (t) t(1 t) if GPD. B(t) is a Brownian bridge, B 2 (t) is a sum of two independent Brownian bridges.

Change Point Analysis of Extreme Values Lisboa 2013 p. 15/3 2.d. Practical Procedure Consecutive steps 1. Check on Pareto-type behavior of the data by Q Q plots. 2. Select a threshold u or the value of k = k opt,n that minimizes the asymptotic mean square error of the Hill estimator. We choose the optimal threshold u = X n kopt,n. 3. (a) Define c n as the smallest number such that at least k min = (log k opt,n ) 3/2 of the data points X 1,, X cn are larger than u. (b) Define d n as the smallest number such that at least k min of the data points X n dn +1,..., X n are larger than u. 4. Repeat the next step for all m from c n up to n d n. (a) Split the data up in two groups X 1, X 2,..., X m and X m+1,,..., X n. (b) Calculate 2log Λ m. 5. Calculate Z n = values for sample size k. max c n m<n d n ( 2 log Λ m ) and compare Z n with the critical

Change Point Analysis of Extreme Values Lisboa 2013 p. 16/3 3. EXAMPLES 3.a. Simulation We simulate 1000 data sets of size n (with n = 100, n = 500) from the Burr distribution Burr(β, τ, λ) with parameters as given by P(X > x) = ( β β + x τ ) λ, an example of a GPD with γ = (λτ) 1. The rejection probabilities are given below. H 0 true H 0 false n m γ = 1 γ 1 = 1 γ 1 = 2 γ 1 = 1 γ 1 =.5 γ 2 = 2 γ 2 = 1 γ 2 =.5 γ 2 = 2 100 20.096.191.460.486.182 50.075.517.512.519.559 500 50.029.181.782.799.144 100.044.378.955.951.645 250.019.894.951.966.909

Change Point Analysis of Extreme Values Lisboa 2013 p. 17/3 3. EXAMPLES The corresponding median of ˆm is given in the table below. H 0 false n m γ 1 = 1 γ 1 = 2 γ 1 = 1 γ 1 =.5 γ 2 = 2 γ 2 = 1 γ 2 =.5 γ 2 = 2 100 20 48 21 45 21 50 55 44 56 45 500 50 175 47 92 48 100 139 97 107 97 250 252 247 252 248

Change Point Analysis of Extreme Values Lisboa 2013 p. 18/3 3. EXAMPLES 400 -- 300 -- 200 -- 100 -- 1 2 3 4 5 Figure shows Boxplot of ˆm for the Burr cases for n = 500 and m = 100.

Change Point Analysis of Extreme Values Lisboa 2013 p. 19/3 3.b. Malaysian Stock Index: Classical approach The figure below indicates that the data are Pareto-type distributed. The mean squared error of the Hill estimator based on the whole data set attains a local minimum at the value k = k opt = 224 which results in threshold u = X 987 224,987 = 0.0099. 0.0 0.2 0.4 0.6 0.8 1.0 0 100 200 300 400 500

Change Point Analysis of Extreme Values Lisboa 2013 p. 20/3 1. Pareto-type distribution First 2log Λ m,1 m n 1 is plotted below. 1 250 500 750 987 1/1/95 10/1/96 14/1/97 15/1/98 31/12/98 Graph of (m, 2 log Λ m ) with critical value indicated with a horizontal line. We see that Z n = max( 2log Λ m ) = 5.8 falls above the critical value 3.14 and we reject H 0. The maximum is attained at m = 635, which corresponds to 1/08/1997, shortly after the beginning of the Asian crisis.

Change Point Analysis of Extreme Values Lisboa 2013 p. 21/3 2. GPD Now 2log Λ m, 1 m n 1 is plotted below. 1 250 500 750 987 1/1/95 10/1/96 14/1/97 15/1/98 31/12/98 Since Z n = max( 2log Λ m ) = 5.93 is above the critical value 3.18 we again reject H 0. The instant of change is for ˆm = 636 and yields 2/08/1997, almost as before.

Change Point Analysis of Extreme Values Lisboa 2013 p. 22/3 3.c. Malaysian Stock Index: Improved approach In the above analysis, we assumed that the data were independent. But market data are hardly ever independent. However, it is known that the Hill estimator withstands many forms of dependence. Alternatively, one can proceed as follows. Below we repeat the time series and an estimate of the extremal index that should be close to 1 if the data are independent. -0.1 0.0 0.1 0.2 1 250 500 750 987 1/1/95 10/1/96 14/1/97 15/1/98 31/12/98 0.0 0.2 0.4 0.6 0.8 1.0 0.90 0.92 0.94 0.96 0.98 A declusturing scheme cuts the data into clusters that can safely be taken as independent. Apply the previous procedure to the 76 cluster maxima.

Change Point Analysis of Extreme Values Lisboa 2013 p. 23/3 1. Pareto-type distribution 1 20 40 60 1/1/95 14/1/97 15/1/98 31/12/98 There is a local maximum for cluster maximum 48 which corresponds to m = 631 on 28/07/1997. However this local maximum is not larger than the critical. The actual maximum Z n is attained for cluster maximum 66 which corresponds to m = 854 on 22/6/98). We cannot reject the hypothesis.

Change Point Analysis of Extreme Values Lisboa 2013 p. 24/3 2. GPD 1 20 40 60 1/1/95 14/1/97 15/1/98 31/12/98 Now 2log Λ m, 1 m n 1 is plotted in the figure. The maximum Z n is attained for cluster maximum 48 which corresponds to m = 631 on 28/07/1997. The critical value 2.95 for the test is indicated with a horizontal line. On the basis of this test, we reject the hypothesis of no change.

Change Point Analysis of Extreme Values Lisboa 2013 p. 25/3 3.d. Nile Data Annual flow volume of the Nile River at Aswan from 1871 to 1970. 1120 1160 963 1210 1160 1160 813 1230 1370 1140 995 935 1110 994 1020 960 1180 799 958 1140 1100 1210 1150 1250 1260 1220 1030 1100 774 840 874 694 940 833 701 916 692 1020 1050 969 831 726 456 824 702 1120 1100 832 764 821 768 845 864 862 698 845 744 796 1040 759 781 865 845 944 984 897 822 1010 771 676 649 846 812 742 801 1040 860 874 848 890 744 749 838 1050 918 986 797 923 975 815 1020 906 901 1170 912 746 919 718 714 740

Change Point Analysis of Extreme Values Lisboa 2013 p. 26/3 The graph of the series is given below. nile 600 800 1000 1200 1400 0 20 40 60 80 100 Time

Change Point Analysis of Extreme Values Lisboa 2013 p. 27/3 To illustrate the extremal behaviour of both segments, take as group 1 the first 28 points and as group 2 the remaining 71 points. The two Pareto QQ plots are given below. They are based on the optimal values k = 17, resp. k = 13 leading to the estimates 0.07 and 0.13. Pareto quantile plot Pareto quantile plot log(x) 6.7 6.8 6.9 7.0 7.1 7.2 log(x) 6.2 6.4 6.6 6.8 7.0 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 Quantiles of Standard Exponential 0 1 2 3 Quantiles of Standard Exponential

Change Point Analysis of Extreme Values Lisboa 2013 p. 28/3 The change point detection based on the Pareto and the GPD model are given in the figure, both leading to a significant change point at ˆm = 28 at the beginning of construction of the Aswan dam. sqrt(hh$ll) 0 1 2 3 4 sqrt(hh$ll) 0 1 2 3 4 5 0 20 40 60 80 100 Time 0 20 40 60 80 100 Time

Change Point Analysis of Extreme Values Lisboa 2013 p. 29/3 3.e. CATASTROPHIC LOSSES (2012) EVENT T DATE LOSS EVENT T DATE LOSS Katrina H 35,65 74.686 Japan TS 41,19 35.000 Andrew H 22,64 25.641 WTC-attack M 31,70 23.848 Northridge E 24,05 21.239 Ike H 38,68 21.141 Ivan H 34,67 15350 Wilma H 35,80 14.468 Thailand F 41,57 12.000 N.Zealand E 41,14 12.000 Rita H 35,72 11.625 Charley H 34,61 9.583 Mireille T 21,74 8.899 Hugo H 19,71 8.292 Chile TS 40,16 8.248 Daria ES 20,07 8.036 Lothar ES 29,98 7.830 Alabama T 41,31 7.300 Missouri T 41,41 7.050 Kyrill ES 37,05 6.609 X 100 ES 17,79 6.135 Frances H 36,65 6.127 Vivian ES 35,15 5.491 Bart T 29,73 5.454 Irene H 41,64 5.300 N.Zealand E 40,68 5.155 Georges H 28,72 4.870 Allison S 31,43 4.577 Jeanne H 34,70 4.527 Songda T 34,68 4.268

Change Point Analysis of Extreme Values Lisboa 2013 p. 30/3 3.e.1. Pareto-type? We plot the pairs (x i, Y i ), i = 1,..., 69. The variable Y consists of the losses of the catastrophes in million US-dollars, corrected for inflation. To avoid empty places and erratic behavior, we re-scale the time axis so that one unit represents 5 years. The extreme value index γ is clearly positive and its estimation is done with the Hill estimator and the Peak Over Threshold estimator. The value k = 16 minimizes the empirical mean square error. The corresponding estimate for ˆγ = 0.69, hence the underlying distribution has finite mean but infinite variance. In 2010 we had ˆγ = 0.87.

Change Point Analysis of Extreme Values Lisboa 2013 p. 31/3 0.0 0.2 0.4 0.6 0.8 1.0 0 10 20 30 40 50 60 8 9 10 11 0 1 2 3 4 The Pareto QQ plot of the data is very close to linearity, suggesting that almost all the losses can be used in the estimation.

Change Point Analysis of Extreme Values Lisboa 2013 p. 32/3 3.e.2. Change point? In the figure below, the test statistic is well over the critical value 2.96 for all points, suggesting that there is not one single change point, but that the extreme value index keeps changing over time. The maximum hints at a value of 1982. 0 5 10 15 20 0 2 4 6 8

Change Point Analysis of Extreme Values Lisboa 2013 p. 33/3 3.e.3. Trend Analysis What about a trend model? We assume now that the relative excesses over some threshold u(x) follow a Pareto-type distribution, this means P(Y/u(x) > y Y > u(x)) y 1/γ(x) when u(x) where we further assume that the extreme value index follows a linear trend γ(x) = α 1 + α 2 x with α i, i = 1,2 constants. The latter two parameters are estimated by maximum likelihood. This is illustrated in the figures below and is quite insensitive to the choice of k. For k = 45 ˆα 1 = 0.001, whereas ˆα 2 = 0.159.

Change Point Analysis of Extreme Values Lisboa 2013 p. 34/3 0.00 0.02 0.04 0.06 0.08 0.10 0 10 20 30 40 50 60 0.00 0.05 0.10 0.15 0.20 0.25 0.30 0 10 20 30 40 50 60 On the left the estimation of α 1 is given, on the right that of α 2 with corresponding 95%-confidence bounds. Hence γ can be estimated as ˆγ(x) = 0.001 + 0.159x, where x denotes the number of 5-years since January 1, 1970. The estimated value for α 2 is significantly different from 0, as follows from the large sample behavior of maximum likelihood estimators.

Change Point Analysis of Extreme Values Lisboa 2013 p. 35/3 0.0 0.5 1.0 1.5 0.0 0.5 1.0 1.5 2.0 2.5 3.0 3.5 0 2 4 6 8 0 1 2 3 The left figure graphs the fit of the data with the estimation of γ(x). There is an alternative verification. When Y x follows a Pareto type distribution with extreme value index γ(x), then the random variable log Y/γ(x) follows approximately a standard exponential distribution, at least for the largest data points. On the right, we drafted the exponential QQ-plot with the first bisector corresponding to the expected standard exponential.

Change Point Analysis of Extreme Values Lisboa 2013 p. 36/3 3.f. Further Examples Other examples from 1. Daily rain data from Quebec City. Period 1915 to 2007, giving 34596 data points. We found a number of change points starting with the most important one in 1943. This was caused by a move of the measuring equipment from the city center to the airport. Same with daily minimal temperature data. 2. Pole vault data at Olympic Games.

Change Point Analysis of Extreme Values Lisboa 2013 p. 37/3 4. CONCLUSIONS What has been shown are just first attempts Trend analysis seems to apply more often There is a need for sufficiently large data sets Need for studies under specific dependence structures Multivariate extensions should be possible

Change Point Analysis of Extreme Values Lisboa 2013 p. 38/3 5. REFERENCES Beirlant, J., Goegebeur Y., Segers, J. and Teugels, J.L. (2004). Statistics of Extremes, Theory and Applications, Wiley, Chichester. Csörgő, M., Horváth, L. (1997). Limit Theorems in Change Point Analysis. Wiley, Chichester. Dierckx, G. and Teugels, J.L. (2010) Change point analysis of extreme values Environmetrics, 21, 661-686, 2010. Dierckx, G. and Teugels, J.L. (2011) Trend analysis of extreme values Probability Approximations and Beyond: A Conference in honour of Louis Chen on his 70th Birthday, A.D. Barbour, H.P. Chan & D. Siegmund, (eds.), Lecture Notes in Statistics, 205, 101 108, Springer-Verlag, Berlin, 2012.