Inference for clusters of extreme values

Size: px
Start display at page:

Download "Inference for clusters of extreme values"

Transcription

1 J. R. Statist. Soc. B (2003) 65, Part 2, pp Inference for clusters of extreme values Christopher A. T. Ferro University of Reading, UK and Johan Segers EURANDOM, Eindhoven, the Netherlands [Received May Final revision October 2002] Summary. Inference for clusters of extreme values of a time series typically requires the identification of independent clusters of exceedances over a high threshold. The choice of declustering scheme often has a significant effect on estimates of cluster characteristics. We propose an automatic declustering scheme that is justified by an asymptotic result for the times between threshold exceedances. The scheme relies on the extremal index, which we show may be estimated before declustering, and supports a bootstrap procedure for assessing the variability of estimates. Keywords: Bootstrap; Declustering; Extremal index; Extreme values; Interexceedance times 1. Introduction Let {ξ n } n 1 be a strictly stationary sequence of random variables with marginal distribution function F, finite or infinite right end point ω = sup{x : F.x/ < 1} and tail function F = 1 F. For integers 0 k<land n 1, put M k,l = max{ξ i : i = k + 1,...,l} and M n = M 0,n.The process {ξ n } n 1 is said to have extremal index θ [0, 1] if for each τ > 0 there is a sequence {u n } n 1 such that, as n, (a) nf.u n / τ and (b) P.M n u n / exp. θτ/; see Leadbetter et al. (1983). If θ = 1 then exceedances of an increasing threshold occur singly in the limit; if θ < 1 then exceedances tend to cluster in the limit. We consider the problem of making inferences about characteristics of such clusters. The extremal index is one such characteristic, which has an interpretation due to Leadbetter (1983) as the reciprocal of the mean cluster size. Theorem 4.5 of Hsing (1987) shows that clusters of exceedances may be considered independent in the limit. Consequently, a common approach to inference is to identify independent clusters of exceedances above a high threshold, to evaluate for each cluster the characteristic of interest and to form estimates from these values. The methods that are used to identify clusters define different estimators, the two most common being blocks and runs declustering (Leadbetter et al., 1989). Runs declustering, for example, assumes that exceedances belong to the same cluster if they are separated by fewer than a certain number, the run length, of values Address for correspondence: Christopher A. T. Ferro, Department of Meteorology, University of Reading, Earley Gate, PO Box 243, Reading, RG6 6BB, UK. c.a.t.ferro@reading.ac.uk 2003 Royal Statistical Society /03/65545

2 546 C. A. T. Ferro and J. Segers below the threshold. Hsing (1991) pointed out that a problem with these estimators is the selection of the declustering parameters, which is largely arbitrary: the choice of run length usually has a significant influence on the estimate of the cluster characteristic. Estimation of the extremal index has been developed by Smith and Weissman (1994) and Weissman and Novak (1998) among others; for an application in finance see Longin (2000). Examples of other cluster characteristics are the cluster maximum, which is the focus of peaks-over-threshold modelling (Davison and Smith, 1990), and the excess height statistic used by Leadbetter (1995) to monitor ozone levels. We investigate the point process of exceedance times (Hsing et al., 1988) and find that the asymptotic distribution of the interexceedance times belongs to a one-dimensional parametric family of distributions indexed by the extremal index. This result provides a limiting argument for one particular declustering scheme, characterized by the extremal index. We can estimate the extremal index without declustering by equating theoretical moments of the limiting distribution to their empirical counterparts. In this way we define an automatic declustering scheme that does not require a subjective choice of auxiliary parameter. Furthermore, the declustering scheme supports a bootstrap procedure for obtaining confidence intervals on estimates of cluster characteristics that accounts for the uncertainty in the scheme s estimation. We derive the asymptotic distribution of times between threshold exceedances in Section 2 and present our estimator for the extremal index in Section 3. In Section 4 we define the declustering scheme and bootstrap procedure. We investigate the performance of our extremal index estimator with a simulation study in Section 5, and we conclude in Section 6 with an application to a series of daily minimum temperatures recorded at Wooster, Ohio. 2. Interexceedance times The statistical development in this paper rests on the limiting distribution of the times between exceedances of a threshold u by the process {ξ n } n 1. Let T.u/ be a random variable equal in distribution to i.e. or, alternatively, min{n 1:ξ n+1 >u} given ξ 1 >u, P{T.u/ = n} = P.M 1,n u, ξ n+1 >u ξ 1 >u/ for n 1, P{T.u/ > n} = P.M 1,n+1 u ξ 1 >u/ for n 1: We compute the asymptotic distribution of T.u/. The case of independent random variables {ξ n } n 1 is straightforward. Clearly so that, for x>0, P{T.u/ > n} = F.u/ n for n 1, P{ F.u/ T.u/ > x} = P{T.u/ > x= F.u/ } = exp[ x= F.u/ log{f.u/}], where x denotes the integer part of x.iff has no atom at its end point ω then, since log.1 + "/ " as " 0, we have lim [P{ F.u/ T.u/ > x}] = exp. x/ for x>0: u ω

3 Clusters of Extreme Values 547 So F.u/ T.u/ is asymptotically standard exponentially distributed. This agrees with the result (Hsing et al., 1988) that the point process of exceedance times has a Poisson process limit. Now consider the general case with extremal index θ [0, 1]. The corresponding point process limit for the exceedance times is compound Poisson (Hsing et al., 1988). This leads us to expect that the limit distribution of the interexceedance times will be a mixture of an exponential distribution and a point mass on zero. This is indeed the case, as described in theorem 1 below. For real u and integers 1 k l, let F k,l.u/ be the σ-field generated by the events {ξ i >u}, k i l. Define the mixing coefficients α n,q.u/ = max sup P.B A/ P.B/, 1 k n q where the supremum is over all A F 1,k.u/ with P.A/ > 0 and all B F k+q,n.u/. Theorem 1. Let the positive integers r n and the thresholds u n,forn 1, be such that and r n, r n F.u n / τ P.M rn u n / exp. θτ/, for some τ.0, / and θ [0, 1]. If there are positive integers q n = o.r n / such that α crn,q n.u n / = o.1/ for all c>0, then P{ F.u n /T.u n />t} θ exp. θt/ for t>0: The proof is contained in Appendix A. Note that the mixing condition that is used here is similar to that of Weissman and Novak (1998). If we denote convergence in distribution by d then theorem 1 says that F.u/ T.u/ d T θ as u ω,.1/ where T θ is a random variable distributed according to the mixture distribution.1 θ/" 0 + θµ θ,.2/ " 0 is the degenerate probability distribution at 0 and µ θ is the exponential distribution with mean θ 1. Note the double role of the extremal index: θ is both the proportion of non-zero interexceedance times and the reciprocal of the mean of the non-zero interexceedance times. These features are evident in Fig. 1, in which normalized interexceedance times are plotted against standard exponential quantiles for two sets of data: one simulated and one observed. The simulated data are the largest 1000 values in a sequence of length generated from a max-autoregressive process with extremal index θ = 0:5 (see expression (6) in Section 5); the observed data are the largest 85 values in the temperature series that is analysed in Section Estimation of the extremal index In this section we describe an estimator for the extremal index that is based on the limit result (1). Suppose that we have a sample ξ 1,...,ξ n and a high threshold u. Let N = N n.u/ = n I.ξ i >u/

4 548 C. A. T. Ferro and J. Segers Interexceedance Times Standard Exponential Quantiles (a) Interexceedance Times Standard Exponential Quantiles Fig. 1. Quantile quantile plots of normalized interexceedance times against standard exponential quantiles for (a) simulated data and (b) observed data (j,.1 θ/-quantile; the sloping line has gradient θ 1 ): the true value, θ D 0:5, is used for the simulated data; an estimate, θ D 0:68, from the estimator developed in Section 3.1 is used for the observed data (b) be the number of observations exceeding u, and let 1 S 1 <... <S N n be the exceedance times. The observed interexceedance times are T i = S i+1 S i for i = 1,...,N Intervals estimator The second moment of T θ is E.Tθ 2 / = 2=θ, which we estimate to obtain a first estimator for θ. Let F n.u/ be an estimator for F.u/. Then For example, if F n.u/ = N=n, then we have 2.N 1/ θ n.u/ = F n.u/ 2 N 1 T 2 i θ n.u/ = 2n2.N 1/ : N 2 N 1 We can improve on this, however. The first moment of T θ is1soθ is related to the coefficient of variation, ν, of the interexceedance times by T 2 i 1 + ν 2 = E.T 2 θ /=E.T θ/ 2 = 2θ 1 : In particular, the interexceedance times are overdispersed, with clustering in the limit, if and only if θ < 1. This relationship motivates another estimator for θ, ˆθ n.u/ = 2 ( N 1 T i ) 2.N 1/ N 1 where we do not need to estimate F.u/. A further improvement is possible if we consider a penultimate approximation to the limit- :, Ti 2

5 Clusters of Extreme Values 549 ing mixture distribution (2). If we set r n = n in the proof of theorem 1, then we see that the distribution of the interexceedance times satisfies P{T.u n />n} = θ F.u n / nθ + o.1/: We can derive an estimator for θ based on this relationship. Let T denote a random variable on the positive integers whose distribution is given by P.T>n/= θp nθ for n 1,.3/ where θ.0, 1] and p.0, 1/ may be thought of as F.u n /. Note that 2 E.T/ 2 E.T 2 / = 2{1.1 θ/p θ } 2 2θp θ + θp θ.1 p θ / +.1 p θ / 2 ( = θ + θ 2 3θ ).1 p/ + O{.1 p/ 2 } as p 1, 2 which follows from E.T 1/ = P.T>n/= θp θ.1 p θ / 1, n=1 { } T.T 1/ E = np.t>n/= θp θ.1 p θ / 2 : 2 n=1 Therefore, the first-order bias of ˆθ n.u/ at a threshold u is θ.2 3θ=2/ F.u/. In contrast, the relationship 2{E.T 1/} 2 E{.T 1/.T 2/} = θ.4/ motivates the estimator { } 2 N 1 2.T i 1/ ˆθ Å n.u/ =,.N 1/ N 1.T i 1/.T i 2/ whose first-order bias is 0. Compared with ˆθ n.u/, the estimators for the first and second moments from which ˆθ Å n.u/ is constructed have been shrunk towards 0. This is intuitively appealing. The smallest observed interexceedance times are positive, whereas the limiting distribution (2) models them as 0. Estimator ˆθ Å n.u/ ensures that the contributions from the smallest interexceedance times are indeed 0; the larger interexceedance times are relatively unaffected. Finally, we note that ˆθ Å n.u/ can take values that are greater than 1 and is undefined if the largest interexceedance time is no greater than 2. To avoid these cases, we define { 1 ˆθ θ n.u/ = n.u/ if max{t i : 1 i N 1} 2, 1 ˆθ Å n.u/ if max{t.5/ i : 1 i N 1} > 2, which we call the intervals estimator for the extremal index. Consistency of the intervals estimator for m-dependent processes is stated in the following theorem, the proof of which may be found in Ferro and Segers (2002). Recall that, for a positive integer m, the sequence {ξ n } n 1 is m dependent if, for all positive integers k, the σ-fields σ.ξ i :1 i k/ and σ.ξ i : i k + m/ are independent.

6 550 C. A. T. Ferro and J. Segers Theorem 2. Let the positive integers r n and the thresholds u n,forn 1, be such that and r n = o.n/, r n, r n F.u n / τ P.M rn u n / exp. θτ/, for some τ.0, / and θ.0, 1]. If {ξ n } n 1 is m dependent, then θ n.u n / p θ: 3.2. Maximum likelihood estimators We have described two models for the interexceedance times: the limiting form (2) and the penultimate approximation (3). In this section we examine the possibility of using these models to construct maximum likelihood estimators for the extremal index. This contrasts with the moment-based intervals estimator (5). We shall construct the likelihoods under the assumption that the interexceedance times are independent since we have models for their marginal distribution only. If we write t i = NT i =n then the log-likelihood from the limiting model (2) is N 1 log[.1 θ/ I.ti=0/ {θ 2 exp. θt i /} I.ti>0/ ] = 2.N 1/ log.θ/ θ N 1 t i, 0 θ 1, since the observed interexceedance times are always strictly positive. The maximum likelihood estimator for θ is min.1, 2= t/, where t is the mean of the t i. This estimator tends in distribution to1asn increases: a failure arising from the model assigning all the interexceedance times to the exponential component of the mixture distribution. It is possible to circumvent this problem by grouping a certain number of the smallest interexceedance times and treating them as though they were equal to 0. This solution requires the choice of an auxiliary parameter, however. The choice is arbitrary and can have a significant effect on estimates so we are no better off than if we had used runs or blocks declustering. The log-likelihood from the penultimate model (3) is m 1 log.1 θp θ / + {log.θ/ + log.1 p θ /}.N 1 m 1 / + θ log.p/ N 1.T i 1/, where m 1 is the number of interexceedance times that are equal to 1. Maximum likelihood estimates may be found by numerical optimization; unfortunately, a combination of two features can cause them to perform poorly. First, the distribution (3), from which the likelihood is constructed, is in general not a good model for the smallest interexceedance times. In particular, P.T > 1/ = θp θ converges to θ as p 1, where θ arises as the limit of P.M 1,m u ξ 1 >u/as u ω and m at an appropriate rate (O Brien, 1987). In contrast, P{T.u/ > 1} = P.ξ 2 u ξ 1 >u/is larger than P.M 1,m u ξ 1 >u/and need not converge to θ. For example, the doubly stochastic process of Smith and Weissman (1994) with parameters 0 < ψ < 1 and 0 < η < 1 has lim u ω [P{T.u/ > 1}] = 1 ηψ.1 ψ/=.1 ψ + ψη/ = θ. The second, troublesome, feature is the sensitivity of the log-likelihood to m 1, which is clearer after not-

7 Clusters of Extreme Values 551 ing that Σ N 1.T i 1/ = S N S 1.N 1/ is asymptotically equivalent to n as N and N=n 0. Our previous comments warn that the observed value of m 1 may be far from the value that is expected under the model, in which case poor estimates result. Grouping small interexceedance times and modelling only the larger times is again a possible but unattractive solution. In contrast with the problems that are encountered with likelihood estimation, the intervals estimator (5) is quite robust to deviations of the smallest interexceedance times from the model postulated. This is due to relationship (4) in which the numerator and denominator both tend to as p 1, so the smallest interexceedance times play no role asymptotically and the estimator depends most heavily on the well-modelled large interexceedance times. 4. Declustering and bootstrapping Although we have shown how to estimate the extremal index without recourse to declustering, estimating other cluster characteristics still may require clusters to be identified. All declustering schemes proposed in the literature require an auxiliary parameter, the choice of which is largely arbitrary. In this section we explain how the limiting distribution (2) may be used to identify clusters without making an arbitrary choice. As we shall see, the limit theory also supports a bootstrap procedure for assessing estimation uncertainty; we describe a procedure to compute confidence limits for estimates of general cluster characteristics. Importantly, this procedure accounts for the uncertainty in the choice of declustering scheme. As the limiting process of exceedance times is a compound Poisson process, we may categorize interexceedance times into two types: independent intercluster times (between clusters) and independent sets of intracluster times (within clusters). As mentioned at the end of Section 2, the extremal index is the proportion of interexceedance times that may be regarded as intercluster times. Suppose that we observe N exceedance times, S 1 <... <S N, with interexceedance times T i = S i+1 S i for i = 1,...,N 1. Then we can assume that the largest C 1 = θn inter- exceedance times are approximately independent intercluster times that divide the remainder into approximately independent sets of intracluster times. To be precise, if T.C/ is the Cth largest interexceedance time and T ij is the jth interexceedance time to exceed T.C/, then {T ij } C 1 j=1 is a set of approximately independent intercluster times. In the case of ties, decrease C until T.C 1/ is strictly greater than T.C/. Let T j = {T ij 1 +1,...,T ij 1}, where i 0 = 0, i C = N and T j = if i j = i j Then {T j } C j=1 is a collection of approximately independent sets of intracluster times. Furthermore, each set T j has associated with it a set of threshold exceedances C j = {ξ k : k S j }, where S j = {S ij 1 +1,...,S ij }. This interpretation justifies a decomposition of the observed process into C clusters, where the jth cluster comprises the exceedances C j. This is equivalent to runs declustering with run length T.C/. In practice, C is defined by replacing θ with an estimate ˆθ. Any estimator for θ may be used, but if we employ the intervals estimator (5) then we have an entirely automatic declustering procedure, justified by asymptotic theory. Suppose now that we are interested in making inferences about a cluster functional H. This could be the excess height statistic of Leadbetter (1995) for example. We can evaluate the functional for each cluster to obtain values {H j } C j=1 that may be used to find estimates H of properties of H. For example, we might estimate the expectation of H by H = C 1.H H C /. We can use the bootstrap to obtain confidence limits on H and ˆθ. Given the decomposition of the process of exceedances described above, we recommend the following procedure. (a) Resample with replacement C 1 intercluster times from {T ij } C 1 j=1.

8 552 C. A. T. Ferro and J. Segers (b) Resample with replacement C sets of intracluster times, some of which may be empty, and associated exceedances from {.T j, C j /} C j=1. (c) Intercalate these interexceedance times and clusters to form a bootstrap replication of the process. (d) Compute N for the bootstrap replication, estimate θ and decluster accordingly. (e) Compute H from the declustered bootstrap replication. Forming B such bootstrap processes yields collections of estimates, { ˆθ.1/,..., ˆθ.B/ } and { H.1/,..., H.B/ }, that may be used to approximate the distributions of the original point estimates. In particular, the empirical α- and.1 α/-quantiles of each sample define.1 2α/- confidence intervals. We demonstrate this procedure for the data set in Section 6 and make use of it in the simulation study of the following section. 5. Simulation study In this section we investigate the performance of the intervals estimator (5) for the extremal index and compare it with the performance of the runs estimator, which may be written in terms of the interexceedance times as { } ˆθ n.u; r/ = N N 1 I.T i >r/ when the run length is r. We simulate data from two stationary processes: a max-autoregressive process and a first-order Markov chain with extreme value transition distribution. Choose θ.0, 1]. Let W n,forn 1, be independent unit Fréchet random variables and put ξ 1 = W 1 =θ,.6/ ξ n = max{.1 θ/ξ n 1, W n } for n 2: Then {ξ n } n 1 is a max-autoregressive process with extremal index θ. For the Markov chain, let β.0, 1] and P.ξ 1 x 1, ξ 2 x 2 / = exp{.x 1=β 1 + x 1=β 2 / β }: This is a bivariate extreme value distribution with symmetric logistic dependence structure; the extremal index of {ξ n } n 1 for specific values of β may be found by simulation (Smith et al., 1997). We simulate 1000 sequences of length 5000 from both of these processes for each of three extremal indices, 0.25, 0.5 and 0.75, corresponding to β = 0:43, 0:64, 0.82 for the Markov chain. For each sequence, we compute the intervals estimator and three runs estimators, with run lengths 1, 5 and 9, at a range of thresholds chosen so that there are N exceedances, with N ranging from 10 to We report the root-mean-square error (RMSE) of the point estimates and the coverage probability of bootstrapped confidence intervals, computed with B = 1000, in each case. Fig. 2 shows the empirical RMSEs and coverage probabilities for the four estimators applied to the three max-autoregressive processes. The sensitivity of the runs estimator to run length, threshold and the true extremal index is clear; the intervals estimator is robust to these. This robustness is particularly evident at lower thresholds, where the coverage probability for the intervals estimator remains close to the nominal value. The cause of the poor coverage for the runs estimator may be traced to step (d) of the bootstrap algorithm: the run length is fixed so that any uncertainty in the declustering scheme is unaccounted for, resulting in confidence intervals

9 Clusters of Extreme Values 553 that are too narrow. The apparently uniform decrease of RMSE with threshold for the intervals estimator is only indicative of the fact that the optimal threshold for the intervals estimator is typically lower than that for the runs estimator. The reason for this is the greater sensitivity of the runs estimator to disproportionately large numbers of small interexceedance times, which enhances the bias at low thresholds. As we commented in Section 3.2, the intervals estimator is robust to such discrepancies. However, the runs estimators appear to outperform the intervals estimator for high thresholds. Similar results, not shown, were obtained for the Markov chains, except that not every choice of run length for the runs estimator led to a performance that was superior at high thresholds to that of the intervals estimator. 6. Data example We conclude with an application of the intervals estimator and intervals declustering to a time series of negated daily minimum temperatures, recorded to the nearest degree Fahrenheit, at Wooster, Ohio. See Smith et al. (1997) for a description of the data and additional analy- Fig. 2. (a) RMSEs and (b) coverage probabilities for the intervals estimator ( ) and runs estimators with run lengths 1 ( ), 5 (... )and9(- - - ) applied to max-autoregressive sequences of length 5000 with extremal indices 0.25 (top), 0.5 (middle) and 0.75 (bottom) (the nominal coverage probability (0.95) is indicated by a horizontal line)

10 554 C. A. T. Ferro and J. Segers sis. We shall estimate the extremal index and mean cluster excess of the series at thresholds u = 0:5, 1:5,...,14:5, where the excess of a cluster {ξ k : k S} of exceedances over a threshold u is defined by Σ k S.ξ k u/ and is an indicator for the severity of a cold period. The intervals estimates of the extremal index are shown in Fig. 3(a) with bootstrap 95% confidence limits. For comparison, runs estimates with run length 5 days are shown in Fig. 3(b). As expected, the variance of the intervals estimator is greater than that of the runs estimator, which does not account for uncertainty in the run length. Both plots support an extremal index of about 0.6, which is similar to the values found by Smith et al. (1997). The estimates of the mean cluster excess are presented in Fig. 3(c). The results are not clear cut although the graph appears to level out at a threshold of 10 F and a mean cluster excess of about 6 F. Finally, returning to the bootstrap procedure of Section 4, notice that each bootstrapped process has an associated run length r.b/ = T.C.b/ /, where C.b/ = ˆθ.b/ N.b/ and N.b/ is the number of exceedances in the process. At each threshold, the collection {r.b/ } B b=1 describes the uncertainty in the declustering scheme and indicates the extent of the uncertainty ignored by using runs declustering with an arbitrary run length. Fig. 3(d) summarizes the run length distribution. A run length of about 1 week is suggested with the interpretation that cold periods separated by more than 1 week can be considered independent. Acknowledgements We thank Jonathan Tawn for helpful comments and acknowledge the financial support of the Fig. 3. Estimates (ı) of cluster characteristics for the Wooster temperature series: (a) intervals estimator and (b) runs estimator with run length 5 for the extremal index and (c) mean cluster excess and (d) run length ( , bootstrapped 95% confidence limits; the number of exceedances at each threshold is indicated on the upper axes)

11 Clusters of Extreme Values 555 Engineering and Physical Sciences Research Council (CATF), the Fund for Scientific Research, Flanders (JS), the Department of Mathematics and Statistics at Lancaster University and the Bernoulli Society for subsidizing our attendance at the Gothenburg meeting (December 10th 16th, 2001) of the Séminaires Européens de Statistiques. Appendix A: Proof of theorem 1 Since both q n = o.r n / and α n = α rn,qn.u n / = o.1/, we can find positive integers p n such that p n = o.r n /, r n α n = o.p n / and q n = o.p n /: (Take for instance p n = {r n max.q n, r n α n /} 1=2.) Since F.u n / rn exp. τ/, all conditions of corollary 2.3 of O Brien (1987) are fulfilled. We derive P.M rn u n / F.u n / rnp.m 1, pn un ξ 1>u n/ 0: But P.M rn u n / exp. θτ/ by assumption, so necessarily P.M 1,pn u n ξ 1 >u n / θ: Now we can proceed with the proof of the stated limit relationship itself. Put k n = t= F.u n /. Wehave P{ F.u n /T.u n />t} = P{T.u n />k n } = P.M 1,1+kn u n ξ 1 >u n /: For n sufficiently large that q n <p n,wehave so P.M pn,pn+q n >u n ξ 1 >u n / P.M pn,pn+q n >u n / + α n q n F.u n / + α n 0, P.M 1,1+kn u n ξ 1 >u n / = P.M 1,pn u n, M pn+q n,k n u n ξ 1 >u n / + o.1/: But k n tτ 1 r n = O.r n /, so by assumption P.M 1,pn u n, M pn+q n,k n u n ξ 1 >u n / = P.M pn+q n,k n u n M 1,pn u n, ξ 1 >u n /P.M 1,pn u n ξ 1 >u n / = {P.M pn+q n,k n u n / + o.1/}{θ + o.1/}: Now also s n = k n.p n + q n / tτ 1 r n = O.r n /. Apply corollary 2.3 of O Brien (1987) again to deduce that P.M pn+q n,k n u n / = P.M sn u n / = F.u n / snp.m 1, pn un ξ 1>u n/ + o.1/ exp. θt/: References Davison, A. C. and Smith, R. L. (1990) Models for exceedances over high thresholds (with discussion). J. R. Statist. Soc. B, 52, Ferro, C. A. T. and Segers, J. (2002) Automatic declustering of extreme values via an estimator for the extremal index. Technical Report EURANDOM, Eindhoven. (Available from Hsing, T. (1987) On the characterization of certain point processes. Stochast. Process. Applic., 26, Hsing, T. (1991) Estimating the parameters of rare events. Stochast. Process. Applic., 37, Hsing, T., Hüsler, J. and Leadbetter, M. R. (1988) On the exceedance point process for a stationary sequence. Probab. Theory Reltd Flds, 78, Leadbetter, M. R. (1983) Extremes and local dependence in stationary sequences. Z. Wahrsch. Ver. Geb., 65, Leadbetter, M. R. (1995) On high-level exceedance modeling and tail-inference. J. Statist. Planng Inf., 45, Leadbetter, M. R., Lindgren, G. and Rootzén, H. (1983) Extremes and Related Properties of Random Sequences and Processes, ch. 3. New York: Springer. Leadbetter, M. R., Weissman, I., de Haan, L. and Rootzén, H. (1989) On clustering of high values in statistically stationary series. Technical Report 253. Center for Stochastic Processes, University of North Carolina, Chapel Hill.

12 556 C. A. T. Ferro and J. Segers Longin, F. M. (2000) From value at risk to stress testing: the extreme value approach. J. Bank. Finan., 24, O Brien, G. L. (1987) Extreme values for stationary and Markov sequences. Ann. Probab., 15, Smith, R. L., Tawn, J. A. and Coles, S. G. (1997) Markov chain models for threshold exceedances. Biometrika, 84, Smith, R. L. and Weissman, I. (1994) Estimating the extremal index. J. R. Statist. Soc. B, 56, Weissman, I. and Novak, S. Yu. (1998) On blocks and runs estimators of the extremal index. J. Statist. Planng Inf., 66,

Bayesian Inference for Clustered Extremes

Bayesian Inference for Clustered Extremes Newcastle University, Newcastle-upon-Tyne, U.K. lee.fawcett@ncl.ac.uk 20th TIES Conference: Bologna, Italy, July 2009 Structure of this talk 1. Motivation and background 2. Review of existing methods Limitations/difficulties

More information

Investigation of an Automated Approach to Threshold Selection for Generalized Pareto

Investigation of an Automated Approach to Threshold Selection for Generalized Pareto Investigation of an Automated Approach to Threshold Selection for Generalized Pareto Kate R. Saunders Supervisors: Peter Taylor & David Karoly University of Melbourne April 8, 2015 Outline 1 Extreme Value

More information

Statistical Methods for Clusters of Extreme Values

Statistical Methods for Clusters of Extreme Values Statistical Methods for Clusters of Extreme Values Christopher A. T. Ferro, B.Sc. M.Sc. Submitted for the degree of Doctor of Philosophy at Lancaster University, September 2003. I declare that the work

More information

Extreme Value Analysis and Spatial Extremes

Extreme Value Analysis and Spatial Extremes Extreme Value Analysis and Department of Statistics Purdue University 11/07/2013 Outline Motivation 1 Motivation 2 Extreme Value Theorem and 3 Bayesian Hierarchical Models Copula Models Max-stable Models

More information

A Conditional Approach to Modeling Multivariate Extremes

A Conditional Approach to Modeling Multivariate Extremes A Approach to ing Multivariate Extremes By Heffernan & Tawn Department of Statistics Purdue University s April 30, 2014 Outline s s Multivariate Extremes s A central aim of multivariate extremes is trying

More information

Bayesian Modelling of Extreme Rainfall Data

Bayesian Modelling of Extreme Rainfall Data Bayesian Modelling of Extreme Rainfall Data Elizabeth Smith A thesis submitted for the degree of Doctor of Philosophy at the University of Newcastle upon Tyne September 2005 UNIVERSITY OF NEWCASTLE Bayesian

More information

EXTREMAL QUANTILES OF MAXIMUMS FOR STATIONARY SEQUENCES WITH PSEUDO-STATIONARY TREND WITH APPLICATIONS IN ELECTRICITY CONSUMPTION ALEXANDR V.

EXTREMAL QUANTILES OF MAXIMUMS FOR STATIONARY SEQUENCES WITH PSEUDO-STATIONARY TREND WITH APPLICATIONS IN ELECTRICITY CONSUMPTION ALEXANDR V. MONTENEGRIN STATIONARY JOURNAL TREND WITH OF ECONOMICS, APPLICATIONS Vol. IN 9, ELECTRICITY No. 4 (December CONSUMPTION 2013), 53-63 53 EXTREMAL QUANTILES OF MAXIMUMS FOR STATIONARY SEQUENCES WITH PSEUDO-STATIONARY

More information

The Spatial Variation of the Maximum Possible Pollutant Concentration from Steady Sources

The Spatial Variation of the Maximum Possible Pollutant Concentration from Steady Sources International Environmental Modelling and Software Society (iemss) 2010 International Congress on Environmental Modelling and Software Modelling for Environment s Sake, Fifth Biennial Meeting, Ottawa,

More information

Overview of Extreme Value Theory. Dr. Sawsan Hilal space

Overview of Extreme Value Theory. Dr. Sawsan Hilal space Overview of Extreme Value Theory Dr. Sawsan Hilal space Maths Department - University of Bahrain space November 2010 Outline Part-1: Univariate Extremes Motivation Threshold Exceedances Part-2: Bivariate

More information

Modelling Multivariate Peaks-over-Thresholds using Generalized Pareto Distributions

Modelling Multivariate Peaks-over-Thresholds using Generalized Pareto Distributions Modelling Multivariate Peaks-over-Thresholds using Generalized Pareto Distributions Anna Kiriliouk 1 Holger Rootzén 2 Johan Segers 1 Jennifer L. Wadsworth 3 1 Université catholique de Louvain (BE) 2 Chalmers

More information

PREPRINT 2005:38. Multivariate Generalized Pareto Distributions HOLGER ROOTZÉN NADER TAJVIDI

PREPRINT 2005:38. Multivariate Generalized Pareto Distributions HOLGER ROOTZÉN NADER TAJVIDI PREPRINT 2005:38 Multivariate Generalized Pareto Distributions HOLGER ROOTZÉN NADER TAJVIDI Department of Mathematical Sciences Division of Mathematical Statistics CHALMERS UNIVERSITY OF TECHNOLOGY GÖTEBORG

More information

Practical conditions on Markov chains for weak convergence of tail empirical processes

Practical conditions on Markov chains for weak convergence of tail empirical processes Practical conditions on Markov chains for weak convergence of tail empirical processes Olivier Wintenberger University of Copenhagen and Paris VI Joint work with Rafa l Kulik and Philippe Soulier Toronto,

More information

Multivariate generalized Pareto distributions

Multivariate generalized Pareto distributions Multivariate generalized Pareto distributions Holger Rootzén and Nader Tajvidi Abstract Statistical inference for extremes has been a subject of intensive research during the past couple of decades. One

More information

Change Point Analysis of Extreme Values

Change Point Analysis of Extreme Values Change Point Analysis of Extreme Values TIES 2008 p. 1/? Change Point Analysis of Extreme Values Goedele Dierckx Economische Hogeschool Sint Aloysius, Brussels, Belgium e-mail: goedele.dierckx@hubrussel.be

More information

Some conditional extremes of a Markov chain

Some conditional extremes of a Markov chain Some conditional extremes of a Markov chain Seminar at Edinburgh University, November 2005 Adam Butler, Biomathematics & Statistics Scotland Jonathan Tawn, Lancaster University Acknowledgements: Janet

More information

MFM Practitioner Module: Quantitiative Risk Management. John Dodson. October 14, 2015

MFM Practitioner Module: Quantitiative Risk Management. John Dodson. October 14, 2015 MFM Practitioner Module: Quantitiative Risk Management October 14, 2015 The n-block maxima 1 is a random variable defined as M n max (X 1,..., X n ) for i.i.d. random variables X i with distribution function

More information

A PRACTICAL WAY FOR ESTIMATING TAIL DEPENDENCE FUNCTIONS

A PRACTICAL WAY FOR ESTIMATING TAIL DEPENDENCE FUNCTIONS Statistica Sinica 20 2010, 365-378 A PRACTICAL WAY FOR ESTIMATING TAIL DEPENDENCE FUNCTIONS Liang Peng Georgia Institute of Technology Abstract: Estimating tail dependence functions is important for applications

More information

Estimation de mesures de risques à partir des L p -quantiles

Estimation de mesures de risques à partir des L p -quantiles 1/ 42 Estimation de mesures de risques à partir des L p -quantiles extrêmes Stéphane GIRARD (Inria Grenoble Rhône-Alpes) collaboration avec Abdelaati DAOUIA (Toulouse School of Economics), & Gilles STUPFLER

More information

Chapter 6. Order Statistics and Quantiles. 6.1 Extreme Order Statistics

Chapter 6. Order Statistics and Quantiles. 6.1 Extreme Order Statistics Chapter 6 Order Statistics and Quantiles 61 Extreme Order Statistics Suppose we have a finite sample X 1,, X n Conditional on this sample, we define the values X 1),, X n) to be a permutation of X 1,,

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

The Bootstrap: Theory and Applications. Biing-Shen Kuo National Chengchi University

The Bootstrap: Theory and Applications. Biing-Shen Kuo National Chengchi University The Bootstrap: Theory and Applications Biing-Shen Kuo National Chengchi University Motivation: Poor Asymptotic Approximation Most of statistical inference relies on asymptotic theory. Motivation: Poor

More information

large number of i.i.d. observations from P. For concreteness, suppose

large number of i.i.d. observations from P. For concreteness, suppose 1 Subsampling Suppose X i, i = 1,..., n is an i.i.d. sequence of random variables with distribution P. Let θ(p ) be some real-valued parameter of interest, and let ˆθ n = ˆθ n (X 1,..., X n ) be some estimate

More information

Additive functionals of infinite-variance moving averages. Wei Biao Wu The University of Chicago TECHNICAL REPORT NO. 535

Additive functionals of infinite-variance moving averages. Wei Biao Wu The University of Chicago TECHNICAL REPORT NO. 535 Additive functionals of infinite-variance moving averages Wei Biao Wu The University of Chicago TECHNICAL REPORT NO. 535 Departments of Statistics The University of Chicago Chicago, Illinois 60637 June

More information

The Behavior of Multivariate Maxima of Moving Maxima Processes

The Behavior of Multivariate Maxima of Moving Maxima Processes The Behavior of Multivariate Maxima of Moving Maxima Processes Zhengjun Zhang Department of Mathematics Washington University Saint Louis, MO 6313-4899 USA Richard L. Smith Department of Statistics University

More information

A NOTE ON SECOND ORDER CONDITIONS IN EXTREME VALUE THEORY: LINKING GENERAL AND HEAVY TAIL CONDITIONS

A NOTE ON SECOND ORDER CONDITIONS IN EXTREME VALUE THEORY: LINKING GENERAL AND HEAVY TAIL CONDITIONS REVSTAT Statistical Journal Volume 5, Number 3, November 2007, 285 304 A NOTE ON SECOND ORDER CONDITIONS IN EXTREME VALUE THEORY: LINKING GENERAL AND HEAVY TAIL CONDITIONS Authors: M. Isabel Fraga Alves

More information

Estimators for the binomial distribution that dominate the MLE in terms of Kullback Leibler risk

Estimators for the binomial distribution that dominate the MLE in terms of Kullback Leibler risk Ann Inst Stat Math (0) 64:359 37 DOI 0.007/s0463-00-036-3 Estimators for the binomial distribution that dominate the MLE in terms of Kullback Leibler risk Paul Vos Qiang Wu Received: 3 June 009 / Revised:

More information

Frequency Estimation of Rare Events by Adaptive Thresholding

Frequency Estimation of Rare Events by Adaptive Thresholding Frequency Estimation of Rare Events by Adaptive Thresholding J. R. M. Hosking IBM Research Division 2009 IBM Corporation Motivation IBM Research When managing IT systems, there is a need to identify transactions

More information

Emma Simpson. 6 September 2013

Emma Simpson. 6 September 2013 6 September 2013 Test What is? Beijing during periods of low and high air pollution Air pollution is composed of sulphur oxides, nitrogen oxides, carbon monoxide and particulates. Particulates are small

More information

What Can We Infer From Beyond The Data? The Statistics Behind The Analysis Of Risk Events In The Context Of Environmental Studies

What Can We Infer From Beyond The Data? The Statistics Behind The Analysis Of Risk Events In The Context Of Environmental Studies What Can We Infer From Beyond The Data? The Statistics Behind The Analysis Of Risk Events In The Context Of Environmental Studies Sibusisiwe Khuluse, Sonali Das, Pravesh Debba, Chris Elphinstone Logistics

More information

Multivariate generalized Pareto distributions

Multivariate generalized Pareto distributions Bernoulli 12(5), 2006, 917 930 Multivariate generalized Pareto distributions HOLGER ROOTZÉN 1 and NADER TAJVIDI 2 1 Chalmers University of Technology, S-412 96 Göteborg, Sweden. E-mail rootzen@math.chalmers.se

More information

Estimation of spatial max-stable models using threshold exceedances

Estimation of spatial max-stable models using threshold exceedances Estimation of spatial max-stable models using threshold exceedances arxiv:1205.1107v1 [stat.ap] 5 May 2012 Jean-Noel Bacro I3M, Université Montpellier II and Carlo Gaetan DAIS, Università Ca Foscari -

More information

Robust Backtesting Tests for Value-at-Risk Models

Robust Backtesting Tests for Value-at-Risk Models Robust Backtesting Tests for Value-at-Risk Models Jose Olmo City University London (joint work with Juan Carlos Escanciano, Indiana University) Far East and South Asia Meeting of the Econometric Society

More information

Extremal behaviour of chaotic dynamics

Extremal behaviour of chaotic dynamics Extremal behaviour of chaotic dynamics Ana Cristina Moreira Freitas CMUP & FEP, Universidade do Porto joint work with Jorge Freitas and Mike Todd (CMUP & FEP) 1 / 24 Extreme Value Theory Consider a stationary

More information

Spatial and temporal extremes of wildfire sizes in Portugal ( )

Spatial and temporal extremes of wildfire sizes in Portugal ( ) International Journal of Wildland Fire 2009, 18, 983 991. doi:10.1071/wf07044_ac Accessory publication Spatial and temporal extremes of wildfire sizes in Portugal (1984 2004) P. de Zea Bermudez A, J. Mendes

More information

LARGE DEVIATION PROBABILITIES FOR SUMS OF HEAVY-TAILED DEPENDENT RANDOM VECTORS*

LARGE DEVIATION PROBABILITIES FOR SUMS OF HEAVY-TAILED DEPENDENT RANDOM VECTORS* LARGE EVIATION PROBABILITIES FOR SUMS OF HEAVY-TAILE EPENENT RANOM VECTORS* Adam Jakubowski Alexander V. Nagaev Alexander Zaigraev Nicholas Copernicus University Faculty of Mathematics and Computer Science

More information

ON THE ESTIMATION OF EXTREME TAIL PROBABILITIES. By Peter Hall and Ishay Weissman Australian National University and Technion

ON THE ESTIMATION OF EXTREME TAIL PROBABILITIES. By Peter Hall and Ishay Weissman Australian National University and Technion The Annals of Statistics 1997, Vol. 25, No. 3, 1311 1326 ON THE ESTIMATION OF EXTREME TAIL PROBABILITIES By Peter Hall and Ishay Weissman Australian National University and Technion Applications of extreme

More information

Selected Exercises on Expectations and Some Probability Inequalities

Selected Exercises on Expectations and Some Probability Inequalities Selected Exercises on Expectations and Some Probability Inequalities # If E(X 2 ) = and E X a > 0, then P( X λa) ( λ) 2 a 2 for 0 < λ

More information

ESTIMATING BIVARIATE TAIL

ESTIMATING BIVARIATE TAIL Elena DI BERNARDINO b joint work with Clémentine PRIEUR a and Véronique MAUME-DESCHAMPS b a LJK, Université Joseph Fourier, Grenoble 1 b Laboratoire SAF, ISFA, Université Lyon 1 Framework Goal: estimating

More information

Bayesian Point Process Modeling for Extreme Value Analysis, with an Application to Systemic Risk Assessment in Correlated Financial Markets

Bayesian Point Process Modeling for Extreme Value Analysis, with an Application to Systemic Risk Assessment in Correlated Financial Markets Bayesian Point Process Modeling for Extreme Value Analysis, with an Application to Systemic Risk Assessment in Correlated Financial Markets Athanasios Kottas Department of Applied Mathematics and Statistics,

More information

Approximate Bayesian computation for spatial extremes via open-faced sandwich adjustment

Approximate Bayesian computation for spatial extremes via open-faced sandwich adjustment Approximate Bayesian computation for spatial extremes via open-faced sandwich adjustment Ben Shaby SAMSI August 3, 2010 Ben Shaby (SAMSI) OFS adjustment August 3, 2010 1 / 29 Outline 1 Introduction 2 Spatial

More information

Extreme Value Theory and Applications

Extreme Value Theory and Applications Extreme Value Theory and Deauville - 04/10/2013 Extreme Value Theory and Introduction Asymptotic behavior of the Sum Extreme (from Latin exter, exterus, being on the outside) : Exceeding the ordinary,

More information

Bahadur representations for bootstrap quantiles 1

Bahadur representations for bootstrap quantiles 1 Bahadur representations for bootstrap quantiles 1 Yijun Zuo Department of Statistics and Probability, Michigan State University East Lansing, MI 48824, USA zuo@msu.edu 1 Research partially supported by

More information

On robust and efficient estimation of the center of. Symmetry.

On robust and efficient estimation of the center of. Symmetry. On robust and efficient estimation of the center of symmetry Howard D. Bondell Department of Statistics, North Carolina State University Raleigh, NC 27695-8203, U.S.A (email: bondell@stat.ncsu.edu) Abstract

More information

Generalized additive modelling of hydrological sample extremes

Generalized additive modelling of hydrological sample extremes Generalized additive modelling of hydrological sample extremes Valérie Chavez-Demoulin 1 Joint work with A.C. Davison (EPFL) and Marius Hofert (ETHZ) 1 Faculty of Business and Economics, University of

More information

A New Class of Tail-dependent Time Series Models and Its Applications in Financial Time Series

A New Class of Tail-dependent Time Series Models and Its Applications in Financial Time Series A New Class of Tail-dependent Time Series Models and Its Applications in Financial Time Series Zhengjun Zhang Department of Mathematics, Washington University, Saint Louis, MO 63130-4899, USA Abstract

More information

Published: 26 April 2016

Published: 26 April 2016 Electronic Journal of Applied Statistical Analysis EJASA, Electron. J. App. Stat. Anal. http://siba-ese.unisalento.it/index.php/ejasa/index e-issn: 2070-5948 DOI: 10.1285/i20705948v9n1p68 The Lawrence-Lewis

More information

APPLICATION OF EXTREMAL THEORY TO THE PRECIPITATION SERIES IN NORTHERN MORAVIA

APPLICATION OF EXTREMAL THEORY TO THE PRECIPITATION SERIES IN NORTHERN MORAVIA APPLICATION OF EXTREMAL THEORY TO THE PRECIPITATION SERIES IN NORTHERN MORAVIA DANIELA JARUŠKOVÁ Department of Mathematics, Czech Technical University, Prague; jarus@mat.fsv.cvut.cz 1. Introduction The

More information

Models and estimation.

Models and estimation. Bivariate generalized Pareto distribution practice: Models and estimation. Eötvös Loránd University, Budapest, Hungary 7 June 2011, ASMDA Conference, Rome, Italy Problem How can we properly estimate the

More information

Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes

Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Threshold estimation in marginal modelling of spatially-dependent non-stationary extremes Philip Jonathan Shell Technology Centre Thornton, Chester philip.jonathan@shell.com Paul Northrop University College

More information

Introduction to Algorithmic Trading Strategies Lecture 10

Introduction to Algorithmic Trading Strategies Lecture 10 Introduction to Algorithmic Trading Strategies Lecture 10 Risk Management Haksun Li haksun.li@numericalmethod.com www.numericalmethod.com Outline Value at Risk (VaR) Extreme Value Theory (EVT) References

More information

Shape of the return probability density function and extreme value statistics

Shape of the return probability density function and extreme value statistics Shape of the return probability density function and extreme value statistics 13/09/03 Int. Workshop on Risk and Regulation, Budapest Overview I aim to elucidate a relation between one field of research

More information

Models for the extremes of Markov chains

Models for the extremes of Markov chains Biometrika (1998), 85, 4, pp. 851-867 Printed in Great Britain Models for the extremes of Markov chains BY PAOLA BORTOT Department of Statistical Sciences, University of Padova, 35121 Padova, Italy bortot@hal.stat.unipd.it

More information

HIERARCHICAL MODELS IN EXTREME VALUE THEORY

HIERARCHICAL MODELS IN EXTREME VALUE THEORY HIERARCHICAL MODELS IN EXTREME VALUE THEORY Richard L. Smith Department of Statistics and Operations Research, University of North Carolina, Chapel Hill and Statistical and Applied Mathematical Sciences

More information

Challenges in implementing worst-case analysis

Challenges in implementing worst-case analysis Challenges in implementing worst-case analysis Jon Danielsson Systemic Risk Centre, lse,houghton Street, London WC2A 2AE, UK Lerby M. Ergun Systemic Risk Centre, lse,houghton Street, London WC2A 2AE, UK

More information

Some Approximations of the Logistic Distribution with Application to the Covariance Matrix of Logistic Regression

Some Approximations of the Logistic Distribution with Application to the Covariance Matrix of Logistic Regression Working Paper 2013:9 Department of Statistics Some Approximations of the Logistic Distribution with Application to the Covariance Matrix of Logistic Regression Ronnie Pingel Working Paper 2013:9 June

More information

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3

Hypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3 Hypothesis Testing CB: chapter 8; section 0.3 Hypothesis: statement about an unknown population parameter Examples: The average age of males in Sweden is 7. (statement about population mean) The lowest

More information

Extreme Precipitation: An Application Modeling N-Year Return Levels at the Station Level

Extreme Precipitation: An Application Modeling N-Year Return Levels at the Station Level Extreme Precipitation: An Application Modeling N-Year Return Levels at the Station Level Presented by: Elizabeth Shamseldin Joint work with: Richard Smith, Doug Nychka, Steve Sain, Dan Cooley Statistics

More information

ORDER STATISTICS, QUANTILES, AND SAMPLE QUANTILES

ORDER STATISTICS, QUANTILES, AND SAMPLE QUANTILES ORDER STATISTICS, QUANTILES, AND SAMPLE QUANTILES 1. Order statistics Let X 1,...,X n be n real-valued observations. One can always arrangetheminordertogettheorder statisticsx (1) X (2) X (n). SinceX (k)

More information

Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach

Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach Jae-Kwang Kim Department of Statistics, Iowa State University Outline 1 Introduction 2 Observed likelihood 3 Mean Score

More information

Math 576: Quantitative Risk Management

Math 576: Quantitative Risk Management Math 576: Quantitative Risk Management Haijun Li lih@math.wsu.edu Department of Mathematics Washington State University Week 11 Haijun Li Math 576: Quantitative Risk Management Week 11 1 / 21 Outline 1

More information

STAT 512 sp 2018 Summary Sheet

STAT 512 sp 2018 Summary Sheet STAT 5 sp 08 Summary Sheet Karl B. Gregory Spring 08. Transformations of a random variable Let X be a rv with support X and let g be a function mapping X to Y with inverse mapping g (A = {x X : g(x A}

More information

Bayesian nonparametric estimation of finite population quantities in absence of design information on nonsampled units

Bayesian nonparametric estimation of finite population quantities in absence of design information on nonsampled units Bayesian nonparametric estimation of finite population quantities in absence of design information on nonsampled units Sahar Z Zangeneh Robert W. Keener Roderick J.A. Little Abstract In Probability proportional

More information

The Nonparametric Bootstrap

The Nonparametric Bootstrap The Nonparametric Bootstrap The nonparametric bootstrap may involve inferences about a parameter, but we use a nonparametric procedure in approximating the parametric distribution using the ECDF. We use

More information

Financial Econometrics and Volatility Models Extreme Value Theory

Financial Econometrics and Volatility Models Extreme Value Theory Financial Econometrics and Volatility Models Extreme Value Theory Eric Zivot May 3, 2010 1 Lecture Outline Modeling Maxima and Worst Cases The Generalized Extreme Value Distribution Modeling Extremes Over

More information

Analysis methods of heavy-tailed data

Analysis methods of heavy-tailed data Institute of Control Sciences Russian Academy of Sciences, Moscow, Russia February, 13-18, 2006, Bamberg, Germany June, 19-23, 2006, Brest, France May, 14-19, 2007, Trondheim, Norway PhD course Chapter

More information

Nonparametric Tests for Multi-parameter M-estimators

Nonparametric Tests for Multi-parameter M-estimators Nonparametric Tests for Multi-parameter M-estimators John Robinson School of Mathematics and Statistics University of Sydney The talk is based on joint work with John Kolassa. It follows from work over

More information

Refining the Central Limit Theorem Approximation via Extreme Value Theory

Refining the Central Limit Theorem Approximation via Extreme Value Theory Refining the Central Limit Theorem Approximation via Extreme Value Theory Ulrich K. Müller Economics Department Princeton University February 2018 Abstract We suggest approximating the distribution of

More information

Multivariate Pareto distributions: properties and examples

Multivariate Pareto distributions: properties and examples Multivariate Pareto distributions: properties and examples Ana Ferreira 1, Laurens de Haan 2 1 ISA UTL and CEAUL, Portugal 2 Erasmus Univ Rotterdam and CEAUL EVT2013 Vimeiro, September 8 11 Univariate

More information

Other properties of M M 1

Other properties of M M 1 Other properties of M M 1 Přemysl Bejda premyslbejda@gmail.com 2012 Contents 1 Reflected Lévy Process 2 Time dependent properties of M M 1 3 Waiting times and queue disciplines in M M 1 Contents 1 Reflected

More information

New Classes of Multivariate Survival Functions

New Classes of Multivariate Survival Functions Xiao Qin 2 Richard L. Smith 2 Ruoen Ren School of Economics and Management Beihang University Beijing, China 2 Department of Statistics and Operations Research University of North Carolina Chapel Hill,

More information

On estimating extreme tail. probabilities of the integral of. a stochastic process

On estimating extreme tail. probabilities of the integral of. a stochastic process On estimating extreme tail probabilities of the integral of a stochastic process Ana Ferreira Instituto uperior de Agronomia, UTL and CEAUL Laurens de Haan University of Tilburg, Erasmus University Rotterdam

More information

Fitting the generalized Pareto distribution to data using maximum goodness-of-fit estimators

Fitting the generalized Pareto distribution to data using maximum goodness-of-fit estimators Computational Statistics & Data Analysis 51 (26) 94 917 www.elsevier.com/locate/csda Fitting the generalized Pareto distribution to data using maximum goodness-of-fit estimators Alberto Luceño E.T.S. de

More information

On the Estimation and Application of Max-Stable Processes

On the Estimation and Application of Max-Stable Processes On the Estimation and Application of Max-Stable Processes Zhengjun Zhang Department of Statistics University of Wisconsin Madison, WI 53706, USA Co-author: Richard Smith EVA 2009, Fort Collins, CO Z. Zhang

More information

Economics Division University of Southampton Southampton SO17 1BJ, UK. Title Overlapping Sub-sampling and invariance to initial conditions

Economics Division University of Southampton Southampton SO17 1BJ, UK. Title Overlapping Sub-sampling and invariance to initial conditions Economics Division University of Southampton Southampton SO17 1BJ, UK Discussion Papers in Economics and Econometrics Title Overlapping Sub-sampling and invariance to initial conditions By Maria Kyriacou

More information

Bivariate generalized Pareto distribution

Bivariate generalized Pareto distribution Bivariate generalized Pareto distribution in practice Eötvös Loránd University, Budapest, Hungary Minisymposium on Uncertainty Modelling 27 September 2011, CSASC 2011, Krems, Austria Outline Short summary

More information

Long-Run Covariability

Long-Run Covariability Long-Run Covariability Ulrich K. Müller and Mark W. Watson Princeton University October 2016 Motivation Study the long-run covariability/relationship between economic variables great ratios, long-run Phillips

More information

Invariant HPD credible sets and MAP estimators

Invariant HPD credible sets and MAP estimators Bayesian Analysis (007), Number 4, pp. 681 69 Invariant HPD credible sets and MAP estimators Pierre Druilhet and Jean-Michel Marin Abstract. MAP estimators and HPD credible sets are often criticized in

More information

Pitfalls in Using Weibull Tailed Distributions

Pitfalls in Using Weibull Tailed Distributions Pitfalls in Using Weibull Tailed Distributions Alexandru V. Asimit, Deyuan Li & Liang Peng First version: 18 December 2009 Research Report No. 27, 2009, Probability and Statistics Group School of Mathematics,

More information

Estimation of Quantiles

Estimation of Quantiles 9 Estimation of Quantiles The notion of quantiles was introduced in Section 3.2: recall that a quantile x α for an r.v. X is a constant such that P(X x α )=1 α. (9.1) In this chapter we examine quantiles

More information

Asymptotic statistics using the Functional Delta Method

Asymptotic statistics using the Functional Delta Method Quantiles, Order Statistics and L-Statsitics TU Kaiserslautern 15. Februar 2015 Motivation Functional The delta method introduced in chapter 3 is an useful technique to turn the weak convergence of random

More information

Contributions for the study of high levels that persist over a xed period of time

Contributions for the study of high levels that persist over a xed period of time Contributions for the study of high levels that persist over a xed period of time Marta Ferreira Universidade do Minho, Dep. Matemática, CMAT Luísa Canto e Castro Faculdade de Ciências da Universidade

More information

Accounting for extreme-value dependence in multivariate data

Accounting for extreme-value dependence in multivariate data Accounting for extreme-value dependence in multivariate data 38th ASTIN Colloquium Manchester, July 15, 2008 Outline 1. Dependence modeling through copulas 2. Rank-based inference 3. Extreme-value dependence

More information

High-frequency data modelling using Hawkes processes

High-frequency data modelling using Hawkes processes Valérie Chavez-Demoulin joint work with High-frequency A.C. Davison data modelling and using A.J. Hawkes McNeil processes(2005), J.A EVT2013 McGill 1 /(201 High-frequency data modelling using Hawkes processes

More information

SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES

SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES Statistica Sinica 19 (2009), 71-81 SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES Song Xi Chen 1,2 and Chiu Min Wong 3 1 Iowa State University, 2 Peking University and

More information

arxiv: v2 [math.pr] 4 Sep 2017

arxiv: v2 [math.pr] 4 Sep 2017 arxiv:1708.08576v2 [math.pr] 4 Sep 2017 On the Speed of an Excited Asymmetric Random Walk Mike Cinkoske, Joe Jackson, Claire Plunkett September 5, 2017 Abstract An excited random walk is a non-markovian

More information

Brownian Motion. 1 Definition Brownian Motion Wiener measure... 3

Brownian Motion. 1 Definition Brownian Motion Wiener measure... 3 Brownian Motion Contents 1 Definition 2 1.1 Brownian Motion................................. 2 1.2 Wiener measure.................................. 3 2 Construction 4 2.1 Gaussian process.................................

More information

CS Lecture 19. Exponential Families & Expectation Propagation

CS Lecture 19. Exponential Families & Expectation Propagation CS 6347 Lecture 19 Exponential Families & Expectation Propagation Discrete State Spaces We have been focusing on the case of MRFs over discrete state spaces Probability distributions over discrete spaces

More information

UPPER DEVIATIONS FOR SPLIT TIMES OF BRANCHING PROCESSES

UPPER DEVIATIONS FOR SPLIT TIMES OF BRANCHING PROCESSES Applied Probability Trust 7 May 22 UPPER DEVIATIONS FOR SPLIT TIMES OF BRANCHING PROCESSES HAMED AMINI, AND MARC LELARGE, ENS-INRIA Abstract Upper deviation results are obtained for the split time of a

More information

High-frequency data modelling using Hawkes processes

High-frequency data modelling using Hawkes processes High-frequency data modelling using Hawkes processes Valérie Chavez-Demoulin 1 joint work J.A McGill 1 Faculty of Business and Economics, University of Lausanne, Switzerland Boulder, April 2016 Boulder,

More information

The formal relationship between analytic and bootstrap approaches to parametric inference

The formal relationship between analytic and bootstrap approaches to parametric inference The formal relationship between analytic and bootstrap approaches to parametric inference T.J. DiCiccio Cornell University, Ithaca, NY 14853, U.S.A. T.A. Kuffner Washington University in St. Louis, St.

More information

Sharp statistical tools Statistics for extremes

Sharp statistical tools Statistics for extremes Sharp statistical tools Statistics for extremes Georg Lindgren Lund University October 18, 2012 SARMA Background Motivation We want to predict outside the range of observations Sums, averages and proportions

More information

Week 1 Quantitative Analysis of Financial Markets Distributions A

Week 1 Quantitative Analysis of Financial Markets Distributions A Week 1 Quantitative Analysis of Financial Markets Distributions A Christopher Ting http://www.mysmu.edu/faculty/christophert/ Christopher Ting : christopherting@smu.edu.sg : 6828 0364 : LKCSB 5036 October

More information

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA

Gauge Plots. Gauge Plots JAPANESE BEETLE DATA MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA JAPANESE BEETLE DATA JAPANESE BEETLE DATA 6 MAXIMUM LIKELIHOOD FOR SPATIALLY CORRELATED DISCRETE DATA Gauge Plots TuscaroraLisa Central Madsen Fairways, 996 January 9, 7 Grubs Adult Activity Grub Counts 6 8 Organic Matter

More information

Recall that in order to prove Theorem 8.8, we argued that under certain regularity conditions, the following facts are true under H 0 : 1 n

Recall that in order to prove Theorem 8.8, we argued that under certain regularity conditions, the following facts are true under H 0 : 1 n Chapter 9 Hypothesis Testing 9.1 Wald, Rao, and Likelihood Ratio Tests Suppose we wish to test H 0 : θ = θ 0 against H 1 : θ θ 0. The likelihood-based results of Chapter 8 give rise to several possible

More information

arxiv: v1 [stat.co] 26 May 2009

arxiv: v1 [stat.co] 26 May 2009 MAXIMUM LIKELIHOOD ESTIMATION FOR MARKOV CHAINS arxiv:0905.4131v1 [stat.co] 6 May 009 IULIANA TEODORESCU Abstract. A new approach for optimal estimation of Markov chains with sparse transition matrices

More information

On the Estimation and Application of Max-Stable Processes

On the Estimation and Application of Max-Stable Processes On the Estimation and Application of Max-Stable Processes Zhengjun Zhang Department of Statistics University of Wisconsin Madison, WI 53706 USA Richard L Smith Department of Statistics University of North

More information

Max stable Processes & Random Fields: Representations, Models, and Prediction

Max stable Processes & Random Fields: Representations, Models, and Prediction Max stable Processes & Random Fields: Representations, Models, and Prediction Stilian Stoev University of Michigan, Ann Arbor March 2, 2011 Based on joint works with Yizao Wang and Murad S. Taqqu. 1 Preliminaries

More information

Estimation of AUC from 0 to Infinity in Serial Sacrifice Designs

Estimation of AUC from 0 to Infinity in Serial Sacrifice Designs Estimation of AUC from 0 to Infinity in Serial Sacrifice Designs Martin J. Wolfsegger Department of Biostatistics, Baxter AG, Vienna, Austria Thomas Jaki Department of Statistics, University of South Carolina,

More information

Generalized Neyman Pearson optimality of empirical likelihood for testing parameter hypotheses

Generalized Neyman Pearson optimality of empirical likelihood for testing parameter hypotheses Ann Inst Stat Math (2009) 61:773 787 DOI 10.1007/s10463-008-0172-6 Generalized Neyman Pearson optimality of empirical likelihood for testing parameter hypotheses Taisuke Otsu Received: 1 June 2007 / Revised:

More information

Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood

Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood Jonathan Gruhl March 18, 2010 1 Introduction Researchers commonly apply item response theory (IRT) models to binary and ordinal

More information