Tail Index Estimation of Heavy-tailed Distributions

Size: px
Start display at page:

Download "Tail Index Estimation of Heavy-tailed Distributions"

Transcription

1 CHAPTER 2 Tail Index Estimation of Heavy-tailed Distributions 2.1 Introduction In many diverse fields such as meteriology, finance, hydrology, climatology, environmental sciences, telecommunication, insurance and genetics, heavy-tailed distributions are recommended to model the data, see Embrechts et al. (1997) and hence the problem of estimation of the tail index of a heavy-tailed distribution has been paid much attention in recent years. The tail shape of heavy-tailed distributions resembles to a first approximation the hyperbolic shape of the Pareto distribution characterized by the so-called tail index. Besides, this parameter plays a key role in connection with determination of extreme quantiles, upper tail probabilities, mean excess functions and excess of loss and stop reinsurance premium etc. Small relative errors in the estimation of tail index can produce large relative errors in estimation of such quantities. Moreover, in the context of semi-parametric Some results included in this chapter have appeared in the paper Dais George and Sebastian George (2009). 23

2 modeling, one assumes merely a Pareto type distribution, as an approximate model for the upper observations of a sample, to situations, where the upper tail of the model is not parametrically related to the central parts and thus is estimated separately, using only the uppermost observations. Extreme value models focus attention on the tail of a statistical distribution of events rather than imposing a single functional form to hold for the entire distribution. To estimate the tail index, numerous tail index estimators have been proposed in the literature including earlier contributions by Hill (1975) and Pickands (1975) in which the Hill estimator has become somewhat of a benchmark to which later proposed estimators are compared. Again many of the later proposed estimators of tail index are the alternatives of the Hill estimator. They are the smoohill estimator, Hill estimator in alt scale, smoohill estimator in alt scale, for details, see Resnick (1997) and weighted Hill estimator [Gomes et al. (2008)]. The other popular tail index estimators are the qq-estimator [Kratz and Resnick (1996)], harmonic moment estimator [Henry (2009)] and more recently the tail index estimator for dependant hetrogeneous data introduced by Hill (2010). The Hill estimator, its alternatives and the qq-estimator are methods of estimation by plotting techniques. The traditional Hill plot offers little scope of correctly discerning the value of α, but the alternatives such as smoohill plot, Hill plot in altscale, smoohill in alt scale and qq-estimator seems to reveal the value of α fairly clearly. Though the asymptotic variance of the qq-estimator is larger than that of the Hill estimator, the volatility of the plot always seems to be less than that of the Hill plots. Being aware of the limitations of these estimators, we introduce another estimator in this chapter namely, the smooweighted Hill estimator, for estimating the tail index, which is a better one compared to most of the estimators proposed in the literature [Dais and Sebastian (2009)]. It is an improved version of the weighted Hill estimator proposed by Gomes et al. (2008). This chapter is organized as follows. Section 2 describes the heavytailed distributions. In Section 3, the plotting techniques for estimating the tail index of a heavy-tail distribution ((Hill estimator, its alternatives, weighted Hill estimator and the qq-estimator) are reviewed. A new tail index estimator namely, the smooweighted Hill estimator is introduced in Section 4. In Section 5, we establish the performance of the the smooweighted Hill estimator through a comparative study. Bootstrap confidence intervals and coverage probabilities are computed for the Hill and weighted Hill estimators in Section 24

3 6. In Section 7, we concludes the chapter. 2.2 Heavy-tailed Distributions Heavy-tailed distributions (also known as power-law distributions) have been observed in many natural phenomena including both physical and sociological phenomena. Definition and examples of heavy-tailed distributions are given in Section A useful and tractable model with relatively high probability in the upper tail is the Pareto distribution which is hyperbolic over its entire range and has the probability density function, f(x) = αc α x α 1, α, c > 0, x c. (2.2.1) The cumulative distribution function is given by F (x) = P [X x] = 1 (c/x) α, x c where α is the shape parameter which is a measure of tail heaviness of the distribution and c represents the smallest value, that the random variable can take. As α decreases, an arbitrarily large portion of the probability mass may be present in the tail of the distribution. This means that a random variable that follows a heavy tailed distribution can give raise to extremely large values with non-negligible probability. As c increases only the tail of the distribution is modeled. The Pareto distribution belongs to the one parameter exponential family of distributions for known value of c, as the density function can be written as f(x) = C(α) exp[q(α)t(x)]h(x) where C(α) = αc α, Q(α) = (α + 1), t(x) = lnx and h(x)=1 (lnx denotes logarithm to the base e). For general overviews of the role of Pareto tailed distributions in actuarial science, econometrics, telecommunications and other fields we refer to Arnold (1983), Johnson et al. (1994) and Resnick (2007). New application contexts continue to arise: for example, the cost distribution of combinatorial research algorithms, has recently been shown to exhibit Pareto type tail behavior, see Gomes et al. (1997). 25

4 2.3 Estimating the Tail Index α In this section, we discuss some of the plotting techniques to estimate the parameter α. We also consider various properties of these estimators and possible alternatives The Hill Estimator Hill estimator is one of the most popular tools for detecting the presence of heavy tails of the marginal distribution of stationary sequences of random variables. Suppose X 1, X 2,..., X n are iid random variables from a distribution F. Let X (1) > X (2) >... > X (n) be the order statistics. If X has an exact Pareto distribution with 1 F (x) = x α, x > 1, then lnx follows an exponential distribution with parameter α. Thus lnx 1,..., lnx n is a sample from an exponential density with parameter α. Since the mean of the exponential distribution with parameter α is α 1, the maximum likelihood estimator (MLE) of α 1 is the sample mean and thus n H n = 1 n I=1 lnx (i) is the MLE of α 1. Instead of assuming a Pareto Distribution, we assume that 1 F (x) = x α L(x), x, where L(x) is a slowly varying function, satisfying (1.3.2). For k < n, the Hill estimator [Hill (1975)] of α is defined as H k,n = 1 k ( ) X(i) ln, (2.3.1) X (k+1) where k is the number of upper order statistics used in the estimation. A rough idea behind using only k upper order statistics is that we should only sample from that part of the distribution which looks most Pareto like. 26

5 Properties of Hill Estimator The Hill estimator of the tail index α is a pseudo-maximum likelihood estimator based on the exponential approximation of the normalized log-spacings Y j = jln 2,..., k. The following are some of the important properties of the Hill estimator. ( X(j) X (j+1) ) for j = 1, 1. When X is are iid, Hill estimator is a consistent estimator of α 1, a fact which is equivalent to regular variation of 1-F in a way made precise by Mason (1982). The condition for consistency requires that k = k(n), a function of n, k and (k/n) 0 as n. 2. The Hill estimator can be surprisingly sensitive to changes in location. A shift in location does not theoretically affect the tail index, but may throw the hill estimate way off. 3. The Hill estimator is asymptotically normal [Resnick (2007)]. That is (k)[h (k,n) ˆα 1 d ] N(0, α 2 ). In practice, the Hill estimator is used as follows. We graph {(k, H 1 k,n ), 1 k n} and hope the graph looks stable so that we can pick out a value of α = H 1 k,n from the stable regime of this plot. We have conducted a simulation study to estimate the tail index α based on simulated observations from a Pareto distribution with α = 0.5. Figure 2.1 shows the Hill plot for the simulated Pareto data and the Hill estimate seems to lie between.48 and 0.5. Later it has shown that the Hill estimator could successfully be applied to a variety of dependent sequences of random variables, see Hsing (1991) and Resnick and Starica (1995). Unfortunately, the practical use of this estimator is hampered by the fact that the Hill estimator has optimality properties only when the underlying distribution is close to Pareto. If the distribution is far from Pareto, there may be outrageous bias even for sample sizes such as That is why we sample from that part of the distribution which is most Pareto like. The Hill plot is not always so revealing. One of the difficulty that we are facing using the Hill estimator is that the graph may exhibit considerable volatility and/or the true answer may be hidden in the graph. For the bias problem, though there is no completely 27

6 Figure 2.1: Hill Plot for Pareto, α = 0.5. satisfied resolution, one possibility is the bootstrap method which we consider in the last section. Again the estimate is often highly sensitive to the choice of k, the number of upper order statistics used to calculate H k,n. Resnick (1997) suggested some alternative graphical tools which can minimize these difficulties. They are the smoohill estimator and its alternatives SmooHill Estimator This is a technique for smoothing the Hill estimator [Resnick and Starica (1997)], though it is powerless to correct bias, reduces the volatility of the plot and the uncertainty about how to pick out an estimate of α. Thus, it make use of the Hill estimator more reliable through an averaging technique which reduces the asymptotic variance. As a direct result, the range in which the smoothed estimator varies as a function of k decreases and the successful use of the estimator is made less dependent on the choice of k. The smoohill estimator is defined as SmooH k,n = 1 (u 1)k uk j=k+1 where u > 1 is an integer (usually we pick u between n 0.1 and n 0.2, that is 2 or 3) and uk n where n is the sample size. The asymptotic variance of the Hill estimator is 1/α 2 H j,n 28

7 Figure 2.2: SmooHill Plot for Pareto α = 0.5. where as the asymptotic variance of smoohill estimator is ( ) ( ) ( ( )) 1 2 lnu 1 α 2 u u which is less than 1/α 2. The bigger the u, the more the asymptotic variance is reduced. But for large u fewer points are plotted in smoohill. Here u=3. So we stop averaging, when k reached Figure 2.2 shows the result of smoothing in the Hill estimate which gives smoohill estimate of α as Alt Plotting This is an alternative to the Hill plot using a change of scale. Here we plot {(θ, H 1 [n θ ],n), 0 θ 1}, where we write [n θ ] for the integer part of n θ and θ = ln(k)/ln(n) so that we are getting an alternative to the Hill plot abbreviated as althill. This method was suggested by Resnick and Starica (1995). As in the case of traditional Hill plot, here also we read out the estimated value of α from the stable portion of the graph. Figure 2.3 and Figure 2.4 give views of Hill plot and smoohill plot of Pareto data in alt scale. Here α = 0.49 in both cases, more likely choices. 29

8 Figure 2.3: AltHill Plot for Pareto α = 0.5. Figure 2.4: AltsmooHill Plot for Pareto α =

9 This alternative display is more helpful since the significant part of the graph (the part corresponding to a relatively small number of order statistics) gets to be shown more clearly, covering a bigger portion of the displayed space. That is, the part of the graph corresponding to a high number of order statistics, which covers a disproportionately large part in the traditional Hill plot gets rescaled. Thus, the interpretation of the graph is easier and more accurate. Picking k mid way between 1 and n or picking θ so close to 1 seems unwise Weighted Hill Estimator Gomes et al. (2008) proposed a bias reduced Hill estimator called weighted Hill estimator which gives less weight to the largest than to the intermediate order statistics. Doing this in an adequate way, the dominant component of the asymptotic bias of the Hill estimator can be reduced without increasing the asymptotic variance. Moreover, in statistics of extremes, although all top observations are relevant, by giving less weight to the extreme and more weight to the intermediate order statistics, the performance of the estimator can be increased. Here heavy-tailed models, with a specific regularly varying tail function in Hall s class, that is with a tail ( x 1 F (x) = C where C > 0, ρ < 0 and β 0 are considered. ) 1/γ ( β ( x ) ρ/γ o(x ρ/γ ), as x ρ C Then we are in the domain of attraction for maxima of an extreme value distribution function EV γ (x) = exp( (1 + γx) 1/γ ), 1 + γx 0, γ > 0, and we write F D M (EV γ>0 ). The parameter γ is the extreme value index, one of the primary parameters of extreme or even rare events. In the context of heavy tails, with the notation U(t) = F (1 1/t), t 1 where F (y) = inf{x : F (x) y} and for the class of regularly varying functions with index of regular variation α, usually denoted by RV α F D M (EV γ>0 ) iff F = 1 F RV 1/γ iff U RV γ. 31

10 The second order parameter ρ is the non-positive value that appears in the limiting relation lnu(tx) lnu(t) γlnx lim t A(t) = xρ 1, ρ is assumed to hold for every x > 0, in order to derive the non-degenerate behavior of the estimators. Then, A(t) is of regular variation with index ρ. Here ρ is restricted as ρ < 0 and the parametrization, A(t) = γβt ρ is used [for details, see Gomes et al. (2008)]. For intermediate k (a sequence of integers k = k n between 1 and n), k = k n and k n = o(n), as n, the log-excesses are given by V ik = lnx n i+1:n lnx n k:n, 1 i k < n, where X i:n denotes the i th ascending order statistics, 1 i n, associated to a random sample (X 1, X 2,..., X n ). Under the first order frame work, V ik d = lnu(y n i+1:n ) lnu(y n k:n ) d = lnu(y n k:n Y k i+1:k ) lnu(y n k:n ) lny k i+1:k d = α 1 E k i+1:k. That is, the V ik s, 1 i k are approximately k order statistics from an exponential random sample with mean value α 1. This argument justifies the Hill estimator given by (2.3.1). That is H k,n 1 k ( represents equivalent to ). Under the second order framework, for immediate k (1 i k), V ik V ik = d α 1 lny k i+1:k + Y ρ k i+1:k 1 A(n/k)(1 + o p (1)). ρ 32

11 That is, V ik can be written as This follows that ( V ik = α 1 lny k i+1:k 1 + A(n/k) [ A(n/k) = α 1 exp [ A(n/k) V ik α 1 exp Also for 1 i k, α 1 α 1 Y ρ Y ρ k i+1:k 1 ρlny k i+1:k Y ρ k i+1:k 1 (i/k) ρ 1 ρlny k i+1:k ρln(i/k) α 1 k i+1:k 1 ρlny k i+1:k Y ρ k i+1:k 1 (1 + o p (1)) ρlny k i+1:k ) ] E k i+1:k + o p (A(n/k)). ] E k i+1:k = o p ( Vik α 1 E k i+1:k ). = ψ ik ψ(i/k) ψ ik (ρ) [ψ kk 1], with ψ a limiting function and represents approximately equal to. If instead of assuming that V ik come from an exponential family with mean value equal to α 1 as in the case of Hill estimator, assume that V ik come from an exponential family with mean value not equal to α 1, but depend on i (and k). Thus, α 1 ik = α 1 exp [ A(n/k)ψ ik /α 1], 1 i k, A(t) = α 1 βt ρ. The previous result will leads us to get a less biased estimator of α. Since V ik can be approximately written as V ik α 1 exp [β(n/k) ρ ψ ik ] E k i+1:k, ψ ik = (i/k)ρ 1 ρln(i/k), 1 i k, the likelihood associated to V ik, 1 i k, is [ ( n ) ρ L (α, β, ρ) = exp klnα 1 β ψ ik 1 k α 1 ] ( n ) ρ V ik exp β ψik. k Knowing β and ρ, the maximization of L will lead to the weighted Hill estimator of α 1 33

12 denoted by WH given by ˆ α 1 = W H β,ρ (k) = 1/k exp [ β(n/k) ρ ψ ik ] V ik. That is, to obtain the weighted Hill estimator, the Hill estimator is replaced by a weighted combination of the log-excesses so that W H ˆβ,ˆρ (k) = 1/k ] exp [ ˆβ(n/k)ˆρ ˆψik ln ( Xn i+1:n X n k:n ) where ˆβ and ˆρ are any adequate, consistent estimators of the second order parameters β and ρ respectively such that both { ˆβ β} and (ˆρ ρ)ln(n/k) are o p (1/( ka(n/k))), for the k- values on which the estimation of α is based. These conditions for ˆρ and ˆβ hold true, if we consider levels k such that ka 2 (n/k) λ 0, finite. The key success of the WH-estimator lies in the estimation of β and ρ at a level k 1 such that k = o(k 1 ), where k is the number of top order statistics used for the estimation of α. For more details, see Gomes et al. (2008). Estimation of ρ For estimating ρ, the class of ρ estimators in Alves et al. (2003) is used. Such a class of estimators is parameterized using a tuning parameter τ. In the simulations we consider only the statistics associated to τ = 0 and to τ = 1, which are usually preferable when ρ 1 and ρ > 1 respectively. The ρ estimators are, ˆρ i = 3[T (i) n (k 1 ) 1] [T (i) n (k 1 ) 3], i = 0, 1, where ( [ ]) 2n k 1 = min n 1,. lnlnn 34

13 Here T (τ) n (k) = ( ln ( 1 2 ln M (1) n ( (k) ) 1 2 ) ln ( 1 3 ln ) M n (2) (k)/2 M n (2) (k)/2 M n (3) (k)/6 ( ) M n (1) τ ( ) (k) M n (2) τ/2 (k)/2 ( ) τ/2 ( M n (2) (k)/2 M n (3) (k)/6 ) if τ = 0 ) τ/3 if τ > 0, where M (j) n = 1 k [ ln X ] j n i+1:n, j 1, [M n (1) = H k, n ]. X n k:n These statistics converge towards 3(1 ρ) for every τ 0 whenever the second order (3 ρ) condition holds (k is intermediate and as n, ka(n/k) ). Estimation of β For the estimation of β we consider the estimator of β obtained in Gomes and Martins (2002). It is based on the scaled log-spacings U i = i (lnx n i+1:n lnx n i:n ), 1 i k. The estimator is given by ˆβ U (k; ˆρ) = (k/n)ˆρ ( ( (1/k) (1/k) ) (i/k) ((1/k) ˆρ ) (i/k) ((1/k) ˆρ ) ( U i (1/k) ) (i/k) ˆρ U i ) ( (i/k) ˆρ U i (1/k) ). (i/k) 2ˆρ U i For the simulation study, we considered the simulated data from the Pareto distribution. On the basis of the stability criterion for large k, the estimated value of ρ is -0.9, associated to τ = 0 so that we get ˆβ = 0.5. We graph {(k, W H 1 (k)), 1 k n} and pick out that value of α for which the graph ˆρ, ˆβ looks stable. Figure 2.5 is the weighted Hill plot for Pareto data. The weighted Hill plot for Pareto seems to nail at α = correctly. The weighted Hill estimator is asymptotically normal, with asymptotic variance ˆα 2. That is as n (k) ( W H ˆβ,ˆρ (k) ˆα 1 ) d N ( 0, ˆα 2). 35

14 Figure 2.5: Weighted Hill Plot for Pareto, α = Alternative Estimators qq-plotting This graphical technique is a commonly used method of visually assessing goodness of fit and of estimating location and scale parameters, see Rice (1988) and Castilo (1988). It can also be adapted to the problem of detecting heavy tails and for estimating α [Rensnick (1997)]. It rests on the simple observation that for a sample of size n uniformly distributed over (0,1), the spacings are identically distributed, plotting i/(n + 1) against the i th largest in the sample should yield approximately a straight line of slope 1. The procedure is as follows: Suppose {X 1, X 2,..., X n } are iid with distribution F. Pick k upper order statistics X (1) > X (2) >... > X (k) and neglect the rest. Plot {( ( ) ) } j ln 1, lnx (j), 1 j k. (2.3.2) (k + 1) The data is approximately Pareto or even if 1 F is only regularly varying, this should be approximately a straight line with slope α 1. The slope of the least squares line through the points is an estimator called the qq-estimator [Kratz and Resnick (1996)]. Thus, the 36

15 Figure 2.6: Static qq-plot for Pareto, α = 0.5. qq-estimator is given by ˆα 1 k,n = [ 1 k 1 k ( ( )) ( ) i X(i) ln ln 1 (k + 1) X (k+1) k ( ( )) ] i ln H k,n k + 1 ( ( )) ( 2 i ln 1 ( ) ) 2 i ln 1 (k + 1) k (k + 1) Based on the qq-estimator, two different plots can be drawn. They are [( ) ] 1 1. Dynamic qq-plot obtained from plotting k, α 1 k,n, 1 k n which is similar to Hill plot. 2. The static qq-plot obtained by choosing and fixing k, plotting the points in (2.3.2) and putting the least square line through the points while computing the slope as the estimate of α 1. Figure 2.6 shows the static qq-plots for Pareto data with k = 8000 which is approximately a straight line with slope α 1. Here ˆα = Again Figure 2.7 gives a view of the dynamic qq-plot. Here the graph is stable and the value of ˆα = If k and (k/n) 0, the qq-estimator is consistent and under a second order. 37

16 Figure 2.7: qq-plot for Pareto, α = 0.5. regular variation condition and further restriction on k(n), it is asymptotically normal with asymptotic variance 2α 2 which is larger than the asymptotic variance of the Hill estimator. But the volatility of the qq-plot is less than that of the Hill plot, see Resnick (2007). Based on the above studies, we introduce a better tail index estimator, namely smooweighted Hill estimator and its performance is compared with the above mentioned estimators are considered in the following sections. 2.4 Smooweighted Hill Estimator Our method is based on the smoothing procedure suggested by Resnick and Starica (1997). It is a simple averaging technique which reduces the volatility of the plot. This smoothing procedure consists of averaging the weighted Hill estimator values corresponding to different numbers of order statistics. The smoothed weighted Hill estimator denoted as smooweighted Hill estimator is given by smooweightedhˆρ, ˆβ(k) = 1 (u 1)k uk j=k+1 W Hˆρ, ˆβ(j), where u > 1 is an integer (usually 2 or 3) as in the case of smoohill estimator. Here also, u = 3 so that we stop averaging when k reaches at 3300 (uk < 10000). Since in the case of weighted Hill estimator, we give less weight to largest than to 38

17 Figure 2.8: Smooweighted Hill Plot for Pareto, α = 0.5. intermediate observations in an adequate way, it won t affect the asymptotic variance. So the asymptotic variance of the weighted Hill estimator is the same as that of the Hill estimator and the asymptotic variance of the smooweighted Hill estimator is the same as that of the smoohill estimator. Figure 2.8 shows the smooweighted Hill plot for Pareto data. Here the smooweighted Hill estimate is 0.5 exactly. The smoothing procedure consists of averaging the weighted Hill estimator values W H ˆβ,ˆρ (k) over a broad range of k, the number of order statistics. The order of the number of terms involved in the averaging is k. Therefore, when n, k we will be averaging larger and larger number of weighted Hill estimator values with a consequent reduction in asymptotic variance. The asymptotics of the smooweighted Hill estimator can be proved as in the case of the smoohill estimator. For details, see Resnick and Starica (1997). 2.5 Comparison of Various Tail Index Estimators The performance of the parameter estimation method is compared using the bias and efficiency of the parameter in terms of standard deviation (SD) and root mean squared error (RMSE). Here we compare the various tail index estimators of simulated Pareto data in terms of bias and RMSE. As we can see in Table 2.1, the estimate of α consistently out perform (with regard to bias, SD and RMSE) for smooweighted Hill estimator. We extend our study to find out the bootstrap confidence interval for the Hill estimator 39

18 Table 2.1: Comparison of Tail Index Estimators. Estimate of α k = 100 k = 200 k = 300 Bias Hill SmooHill qq- estimator Weighted Hill SmooWeighted Hill SD Hill SmooHill qq-estimator Weighted Hill SmooWeighted Hill RMSE Hill SmooHill qq-estimator Weighted Hill SmooWeighted Hill and weighted Hill estimator in the following section, based on their asymptotic normality. 2.6 The Bootstrap Confidence Interval for the Hill Estimator of α Bootstrapping is a method of resembling similar to a Monte Carlo experiment. In a Monte Carlo study we generate the random variables from a given distribution such as the normal. The bootstrap takes a different approach, the random variables are drawn from their observed distribution. In essence, the bootstrap uses the plug-in principle-observed distribution of their actual distribution of the random variables is the best estimate of their actual distribution. The idea of bootstrap was developed in Efron (1979). The key point made by Efron is that the observed data set is a random sample of size T drawn from the actual probability distribution generating the data, in a sense, the empirical distribution of the data is the best estimate of the actual distribution of the data. The confidence interval for the means of a heavy-tailed distribution was proposed by Hall and Lepage (1996). Conditional on X 1, X 2,..., X n, let X1, X2,..., Xn denote independently and identically distributed random variables drawn randomly with replacement from X 1, X 2,..., X n. Generate n j, j=1,2,...,n such samples. 40

19 Let α n j = 1 n j α nj = [ 1 k ( ) ] 1 X ln i, Xk+1 n j [ n j ] 1/2 α nj and Sn 1 j = (α nj αn j ) 2. n j j=1 Let, where T n = k(αnj α n ) S n n Denote α n = 1 n αn n j and Sn = j=1 j=1 n j S n j. n n j j=1 x 1 β = sup{x : P ( T n x X 1 X 2 X n ) 1 β}. Then a nominal 1 β level confidence interval for α is I 1 β = α n x 1 β S n k, α n + x 1 β S n k, which has asymptotic coverage probability 1 β under very mild regularity conditions including n j /n 0 as n. In a similar way we can found out the a nominal 1 β level confidence interval for α using weighted Hill estimator by replacing the Hill estimator of α by the weighted Hill estimator of α in the above procedure. We generate 200 pseudo random samples of size n = 1000 from the Pareto distribution with α =.5, for k = 100, 200 and 300. The confidence intervals based on this method with confidence level.95 are computed for both Hill estimator and weighted Hill estimator, given by confidence interval using Hill estimator = ( , ) and confidence interval using weighted Hill estimator = (.49981, ). Confidence interval for various sub samples 41

20 Figure 2.9: Hill Plot for Pareto from the Bootstrap Sub Sample. are calculated and coverage probability is also calculated. It is obtained as.954 and.946 respectively, which is near to the nominal value. For the sub sample bootstrap method we drew 200 sub samples each time. A Hill plot from the bootstrap sub sample is given in Figure Conclusion In this chapter, we suggest a better estimate namely smooweighted Hill estimator for estimating the tail index of heavy-tailed distributions and its performance is compared with several tail index estimators using simulated Pareto data. Also we construct bootstrap confidence intervals for both the Hill estimator and the weighted Hill estimator. Hill plot and static qq-plot made a good agreement when estimating the tail index from the graph. The smoohill plot and Hill plot in alt scale also gives a good agreement. The traditional Hill plot offers little scope of correctly discerning the value of α, but the alternatives smoohill plot, Hill plot in altscale, smoohill in altscale and qq-estimator seems to reveal the value of α fairly clearly. Though the asymptotic variance of the qq-estimator is larger than that of the Hill estimator, the volatility of the plot always seems to less than that of the Hill plots. In the case of smoohill, althill, smoohill in altscale, weighted Hill and smooweighted Hill estimators, the asymptotic variance is less than that of the ordinary Hill estimator and at the same time volatility of the plot is also smaller than the volatility of the Hill plots. Among 42

21 all the above mentioned estimators smooweighted Hill estimator is the best estimator with least bias, least asymptotic variance and least RMSE. Coverage probabilities of the confidence intervals through bootstrap method are nearer to the nominal value for both Hill and weighted Hill estimators. References Arnold, B. C. and Press, S. J. (1983). Bayesian inference for Pareto population, Journal of Econometrics, 21, Castillo, E. (1988). Extreme Value Theory in Engineering, Academic Press, San Diego. Dais George and Sebastian George, (2009). Analyzing the tail heavyness of Web server data, International Journal of Web Applications, 1(4), Efron, (1979). Bootstrap methods, another look at the Jacknife, Annals of Statistics, 7, 1-6. Embrechts, P., Klppelberg, C., Mikosch and Thomas, (1997). Modelling Extremal Events for Insurance and Finance, Berlin: Springer. Fraga Alves, M. I., Gomes, M. I. and de Hann, L. (2003). A new class of semi-parametric estimators of the second order parameter, Portugaliae Mathematica, 60:1, Gomes, C. P., Selman, B., and Crato, N. (1997). Heavy-tailed distributions in combinatorial search, In Smolka, G. (ed.), Principles and Practice of Constraint Programming CP-97, Lecture Notes in Computer Science Springer, New York. Gomes, M. I, Laurens de Hann and Ligia Rodrigues, (2008). Tail index estimation for heavy tailed model: accomodation of bias in weighted log excesses, Journal of Royal Statistical Society:Series B (Statistical Methodology), 70, Gomes, M. I. and Martins, M. J. (2002). Asymptotically unbiased estimators of the extreme value index based on external estimation of the second order parameter, Extremes, 5:1, Hall, P. and Lepage, R. (1996). On bootstrap estimation of the distribution of the standardized mean, Ann Inst Statist Math, 48, Henry 111, J. B. (2009). A harmonic moment tail index estimator, Journal of Statistical Theory and Applications, 8,

22 Hill, B. M. (1975). A simple general approach to interface about the tail of a distribution, Ann Statistics, 3, Hill, J. (2010). On tail index estimation for dependent, heterogeneous data, Econometric Theory, 26. Hsing,T. (1991). On tail estimation using dependent data, Ann. Statist.,19, Johnson, N. I., and Kotz, S. and Balakrishnan, N. (1994). Continuous Univariate Distributions 1 (2nd ed), Wiley, New York. Kratz, M. and Resnick, S. (1996). The qq-estimator and heavy tails, Stochastic Models, 12, Mason, D. (1982). Laws of large numbers for sums of extreme values, Ann. Probability,10, Pickands, J. (1975). Statistical inference using extreme order statistics, The Annals of Statistics, 25, Resnick, S. I. (1997). Heavy-tail modeling and teletraffic data, The Annals of Statistics, 25, Resnick, S. I. (2007). Heavy-Tail Phenomena: Probabilistic and Stochastic Modeling, Springer Science+ Business Media LLC, New York, U.S.A. Resnick, S. I. and Starica, C. (1995). Consistency of Hill estimator for dependent data, Journal of Appl.Probab., 32, Resnick, S. I. and Starica, C. (1997). Smoothing the Hill estimator, J. Appl. Probab., 29, Rice, J. (1988). Mathematical Statistics and Data Analysis, Brookes/Cole, Pacific Grove, CA. 44

Does k-th Moment Exist?

Does k-th Moment Exist? Does k-th Moment Exist? Hitomi, K. 1 and Y. Nishiyama 2 1 Kyoto Institute of Technology, Japan 2 Institute of Economic Research, Kyoto University, Japan Email: hitomi@kit.ac.jp Keywords: Existence of moments,

More information

A Partially Reduced-Bias Class of Value-at-Risk Estimators

A Partially Reduced-Bias Class of Value-at-Risk Estimators A Partially Reduced-Bias Class of Value-at-Risk Estimators M. Ivette Gomes CEAUL and DEIO, FCUL, Universidade de Lisboa, Portugal, e-mail: ivette.gomes@fc.ul.pt Frederico Caeiro CMA and DM, Universidade

More information

AN ASYMPTOTICALLY UNBIASED MOMENT ESTIMATOR OF A NEGATIVE EXTREME VALUE INDEX. Departamento de Matemática. Abstract

AN ASYMPTOTICALLY UNBIASED MOMENT ESTIMATOR OF A NEGATIVE EXTREME VALUE INDEX. Departamento de Matemática. Abstract AN ASYMPTOTICALLY UNBIASED ENT ESTIMATOR OF A NEGATIVE EXTREME VALUE INDEX Frederico Caeiro Departamento de Matemática Faculdade de Ciências e Tecnologia Universidade Nova de Lisboa 2829 516 Caparica,

More information

A NOTE ON SECOND ORDER CONDITIONS IN EXTREME VALUE THEORY: LINKING GENERAL AND HEAVY TAIL CONDITIONS

A NOTE ON SECOND ORDER CONDITIONS IN EXTREME VALUE THEORY: LINKING GENERAL AND HEAVY TAIL CONDITIONS REVSTAT Statistical Journal Volume 5, Number 3, November 2007, 285 304 A NOTE ON SECOND ORDER CONDITIONS IN EXTREME VALUE THEORY: LINKING GENERAL AND HEAVY TAIL CONDITIONS Authors: M. Isabel Fraga Alves

More information

Research Article Strong Convergence Bound of the Pareto Index Estimator under Right Censoring

Research Article Strong Convergence Bound of the Pareto Index Estimator under Right Censoring Hindawi Publishing Corporation Journal of Inequalities and Applications Volume 200, Article ID 20956, 8 pages doi:0.55/200/20956 Research Article Strong Convergence Bound of the Pareto Index Estimator

More information

A MODIFICATION OF HILL S TAIL INDEX ESTIMATOR

A MODIFICATION OF HILL S TAIL INDEX ESTIMATOR L. GLAVAŠ 1 J. JOCKOVIĆ 2 A MODIFICATION OF HILL S TAIL INDEX ESTIMATOR P. MLADENOVIĆ 3 1, 2, 3 University of Belgrade, Faculty of Mathematics, Belgrade, Serbia Abstract: In this paper, we study a class

More information

Bias Reduction in the Estimation of a Shape Second-order Parameter of a Heavy Right Tail Model

Bias Reduction in the Estimation of a Shape Second-order Parameter of a Heavy Right Tail Model Bias Reduction in the Estimation of a Shape Second-order Parameter of a Heavy Right Tail Model Frederico Caeiro Universidade Nova de Lisboa, FCT and CMA M. Ivette Gomes Universidade de Lisboa, DEIO, CEAUL

More information

Empirical Tail Index and VaR Analysis

Empirical Tail Index and VaR Analysis International Conference on Mathematical and Statistical Modeling in Honor of Enrique Castillo. June 28-30, 2006 Empirical Tail Index and VaR Analysis M. Ivette Gomes Universidade de Lisboa, DEIO, CEAUL

More information

New reduced-bias estimators of a positive extreme value index

New reduced-bias estimators of a positive extreme value index New reduced-bias estimators of a positive extreme value index M. Ivette Gomes CEAUL and DEIO, FCUL, Universidade de Lisboa, e-mail: ivette.gomes@fc.ul.pt M. Fátima Brilhante CEAUL and DM, Universidade

More information

A Closer Look at the Hill Estimator: Edgeworth Expansions and Confidence Intervals

A Closer Look at the Hill Estimator: Edgeworth Expansions and Confidence Intervals A Closer Look at the Hill Estimator: Edgeworth Expansions and Confidence Intervals Erich HAEUSLER University of Giessen http://www.uni-giessen.de Johan SEGERS Tilburg University http://www.center.nl EVA

More information

Reduced-bias tail index estimators under a third order framework

Reduced-bias tail index estimators under a third order framework Reduced-bias tail index estimators under a third order framewor Frederico Caeiro Universidade Nova de Lisboa and C.M.A. M. Ivette Gomes Universidade de Lisboa, D.E.I.O. and C.E.A.U.L. Lígia Henriques Rodrigues

More information

The Convergence Rate for the Normal Approximation of Extreme Sums

The Convergence Rate for the Normal Approximation of Extreme Sums The Convergence Rate for the Normal Approximation of Extreme Sums Yongcheng Qi University of Minnesota Duluth WCNA 2008, Orlando, July 2-9, 2008 This talk is based on a joint work with Professor Shihong

More information

1 Degree distributions and data

1 Degree distributions and data 1 Degree distributions and data A great deal of effort is often spent trying to identify what functional form best describes the degree distribution of a network, particularly the upper tail of that distribution.

More information

Analysis methods of heavy-tailed data

Analysis methods of heavy-tailed data Institute of Control Sciences Russian Academy of Sciences, Moscow, Russia February, 13-18, 2006, Bamberg, Germany June, 19-23, 2006, Brest, France May, 14-19, 2007, Trondheim, Norway PhD course Chapter

More information

NONPARAMETRIC ESTIMATION OF THE CONDITIONAL TAIL INDEX

NONPARAMETRIC ESTIMATION OF THE CONDITIONAL TAIL INDEX NONPARAMETRIC ESTIMATION OF THE CONDITIONAL TAIL INDE Laurent Gardes and Stéphane Girard INRIA Rhône-Alpes, Team Mistis, 655 avenue de l Europe, Montbonnot, 38334 Saint-Ismier Cedex, France. Stephane.Girard@inrialpes.fr

More information

Overview of Extreme Value Theory. Dr. Sawsan Hilal space

Overview of Extreme Value Theory. Dr. Sawsan Hilal space Overview of Extreme Value Theory Dr. Sawsan Hilal space Maths Department - University of Bahrain space November 2010 Outline Part-1: Univariate Extremes Motivation Threshold Exceedances Part-2: Bivariate

More information

Department of Econometrics and Business Statistics

Department of Econometrics and Business Statistics Australia Department of Econometrics and Business Statistics http://www.buseco.monash.edu.au/depts/ebs/pubs/wpapers/ Minimum Variance Unbiased Maximum Lielihood Estimation of the Extreme Value Index Roger

More information

Applying the proportional hazard premium calculation principle

Applying the proportional hazard premium calculation principle Applying the proportional hazard premium calculation principle Maria de Lourdes Centeno and João Andrade e Silva CEMAPRE, ISEG, Technical University of Lisbon, Rua do Quelhas, 2, 12 781 Lisbon, Portugal

More information

Statistics - Lecture One. Outline. Charlotte Wickham 1. Basic ideas about estimation

Statistics - Lecture One. Outline. Charlotte Wickham  1. Basic ideas about estimation Statistics - Lecture One Charlotte Wickham wickham@stat.berkeley.edu http://www.stat.berkeley.edu/~wickham/ Outline 1. Basic ideas about estimation 2. Method of Moments 3. Maximum Likelihood 4. Confidence

More information

Extreme Value Theory and Applications

Extreme Value Theory and Applications Extreme Value Theory and Deauville - 04/10/2013 Extreme Value Theory and Introduction Asymptotic behavior of the Sum Extreme (from Latin exter, exterus, being on the outside) : Exceeding the ordinary,

More information

Extreme Value Analysis and Spatial Extremes

Extreme Value Analysis and Spatial Extremes Extreme Value Analysis and Department of Statistics Purdue University 11/07/2013 Outline Motivation 1 Motivation 2 Extreme Value Theorem and 3 Bayesian Hierarchical Models Copula Models Max-stable Models

More information

PARAMETER ESTIMATION FOR THE LOG-LOGISTIC DISTRIBUTION BASED ON ORDER STATISTICS

PARAMETER ESTIMATION FOR THE LOG-LOGISTIC DISTRIBUTION BASED ON ORDER STATISTICS PARAMETER ESTIMATION FOR THE LOG-LOGISTIC DISTRIBUTION BASED ON ORDER STATISTICS Authors: Mohammad Ahsanullah Department of Management Sciences, Rider University, New Jersey, USA ahsan@rider.edu) Ayman

More information

Resampling Methodologies and Reliable Tail Estimation

Resampling Methodologies and Reliable Tail Estimation Resampling Methodologies and Reliable Tail Estimation M. Ivette Gomes Universidade de Lisboa, Portugal Research partially supported by National Funds through FCT Fundação para a Ciência e a Tecnologia,

More information

MFM Practitioner Module: Quantitiative Risk Management. John Dodson. October 14, 2015

MFM Practitioner Module: Quantitiative Risk Management. John Dodson. October 14, 2015 MFM Practitioner Module: Quantitiative Risk Management October 14, 2015 The n-block maxima 1 is a random variable defined as M n max (X 1,..., X n ) for i.i.d. random variables X i with distribution function

More information

Adaptive Reduced-Bias Tail Index and VaR Estimation via the Bootstrap Methodology

Adaptive Reduced-Bias Tail Index and VaR Estimation via the Bootstrap Methodology This article was downloaded by: [b-on: Biblioteca do conhecimento online UL] On: 13 October 2011, At: 00:12 Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 1072954

More information

Estimation of the second order parameter for heavy-tailed distributions

Estimation of the second order parameter for heavy-tailed distributions Estimation of the second order parameter for heavy-tailed distributions Stéphane Girard Inria Grenoble Rhône-Alpes, France joint work with El Hadji Deme (Université Gaston-Berger, Sénégal) and Laurent

More information

Financial Econometrics and Volatility Models Extreme Value Theory

Financial Econometrics and Volatility Models Extreme Value Theory Financial Econometrics and Volatility Models Extreme Value Theory Eric Zivot May 3, 2010 1 Lecture Outline Modeling Maxima and Worst Cases The Generalized Extreme Value Distribution Modeling Extremes Over

More information

A PRACTICAL WAY FOR ESTIMATING TAIL DEPENDENCE FUNCTIONS

A PRACTICAL WAY FOR ESTIMATING TAIL DEPENDENCE FUNCTIONS Statistica Sinica 20 2010, 365-378 A PRACTICAL WAY FOR ESTIMATING TAIL DEPENDENCE FUNCTIONS Liang Peng Georgia Institute of Technology Abstract: Estimating tail dependence functions is important for applications

More information

Nonlinear Time Series Modeling

Nonlinear Time Series Modeling Nonlinear Time Series Modeling Part II: Time Series Models in Finance Richard A. Davis Colorado State University (http://www.stat.colostate.edu/~rdavis/lectures) MaPhySto Workshop Copenhagen September

More information

Introduction to Algorithmic Trading Strategies Lecture 10

Introduction to Algorithmic Trading Strategies Lecture 10 Introduction to Algorithmic Trading Strategies Lecture 10 Risk Management Haksun Li haksun.li@numericalmethod.com www.numericalmethod.com Outline Value at Risk (VaR) Extreme Value Theory (EVT) References

More information

MEAN-OF-ORDER-p LOCATION-INVARIANT EX- TREME VALUE INDEX ESTIMATION

MEAN-OF-ORDER-p LOCATION-INVARIANT EX- TREME VALUE INDEX ESTIMATION MEAN-OF-ORDER-p LOCATION-INVARIANT EX- TREME VALUE INDEX ESTIMATION Authors: M. Ivette Gomes CEAUL and DEIO, FCUL, Universidade de Lisboa, Portugal ivette.gomes@fc.ul.pt) Lígia Henriques-Rodrigues Universidade

More information

Some Theoretical Properties and Parameter Estimation for the Two-Sided Length Biased Inverse Gaussian Distribution

Some Theoretical Properties and Parameter Estimation for the Two-Sided Length Biased Inverse Gaussian Distribution Journal of Probability and Statistical Science 14(), 11-4, Aug 016 Some Theoretical Properties and Parameter Estimation for the Two-Sided Length Biased Inverse Gaussian Distribution Teerawat Simmachan

More information

On the estimation of the heavy tail exponent in time series using the max spectrum. Stilian A. Stoev

On the estimation of the heavy tail exponent in time series using the max spectrum. Stilian A. Stoev On the estimation of the heavy tail exponent in time series using the max spectrum Stilian A. Stoev (sstoev@umich.edu) University of Michigan, Ann Arbor, U.S.A. JSM, Salt Lake City, 007 joint work with:

More information

Asymptotic Tail Probabilities of Sums of Dependent Subexponential Random Variables

Asymptotic Tail Probabilities of Sums of Dependent Subexponential Random Variables Asymptotic Tail Probabilities of Sums of Dependent Subexponential Random Variables Jaap Geluk 1 and Qihe Tang 2 1 Department of Mathematics The Petroleum Institute P.O. Box 2533, Abu Dhabi, United Arab

More information

Information Matrix for Pareto(IV), Burr, and Related Distributions

Information Matrix for Pareto(IV), Burr, and Related Distributions MARCEL DEKKER INC. 7 MADISON AVENUE NEW YORK NY 6 3 Marcel Dekker Inc. All rights reserved. This material may not be used or reproduced in any form without the express written permission of Marcel Dekker

More information

Estimation de mesures de risques à partir des L p -quantiles

Estimation de mesures de risques à partir des L p -quantiles 1/ 42 Estimation de mesures de risques à partir des L p -quantiles extrêmes Stéphane GIRARD (Inria Grenoble Rhône-Alpes) collaboration avec Abdelaati DAOUIA (Toulouse School of Economics), & Gilles STUPFLER

More information

On the estimation of the second order parameter in extreme-value theory

On the estimation of the second order parameter in extreme-value theory On the estimation of the second order parameter in extreme-value theory by El Hadji DEME (1,3), Laurent Gardes (2) and Stéphane Girard (3) (1) LERSTAD, Université Gaston Berger, Saint-Louis, Sénégal. (2)

More information

A New Estimator for a Tail Index

A New Estimator for a Tail Index Acta Applicandae Mathematicae 00: 3, 2003. 2003 Kluwer Academic Publishers. Printed in the Netherlands. A New Estimator for a Tail Index V. PAULAUSKAS Department of Mathematics and Informatics, Vilnius

More information

Computational statistics

Computational statistics Computational statistics Markov Chain Monte Carlo methods Thierry Denœux March 2017 Thierry Denœux Computational statistics March 2017 1 / 71 Contents of this chapter When a target density f can be evaluated

More information

Semi-parametric tail inference through Probability-Weighted Moments

Semi-parametric tail inference through Probability-Weighted Moments Semi-parametric tail inference through Probability-Weighted Moments Frederico Caeiro New University of Lisbon and CMA fac@fct.unl.pt and M. Ivette Gomes University of Lisbon, DEIO, CEAUL and FCUL ivette.gomes@fc.ul.pt

More information

Confidence Intervals for the Autocorrelations of the Squares of GARCH Sequences

Confidence Intervals for the Autocorrelations of the Squares of GARCH Sequences Confidence Intervals for the Autocorrelations of the Squares of GARCH Sequences Piotr Kokoszka 1, Gilles Teyssière 2, and Aonan Zhang 3 1 Mathematics and Statistics, Utah State University, 3900 Old Main

More information

Using a Bootstrap Method to Choose the Sample Fraction in Tail Index Estimation 1

Using a Bootstrap Method to Choose the Sample Fraction in Tail Index Estimation 1 Journal of Multivariate Analysis 76, 226248 (2001) doi:10.1006jmva.2000.1903, available online at http:www.idealibrary.com on Using a Bootstrap Method to Choose the Sample Fraction in Tail Index Estimation

More information

ABC methods for phase-type distributions with applications in insurance risk problems

ABC methods for phase-type distributions with applications in insurance risk problems ABC methods for phase-type with applications problems Concepcion Ausin, Department of Statistics, Universidad Carlos III de Madrid Joint work with: Pedro Galeano, Universidad Carlos III de Madrid Simon

More information

Extreme Value Theory as a Theoretical Background for Power Law Behavior

Extreme Value Theory as a Theoretical Background for Power Law Behavior Extreme Value Theory as a Theoretical Background for Power Law Behavior Simone Alfarano 1 and Thomas Lux 2 1 Department of Economics, University of Kiel, alfarano@bwl.uni-kiel.de 2 Department of Economics,

More information

A Simple, Graphical Procedure for Comparing Multiple Treatment Effects

A Simple, Graphical Procedure for Comparing Multiple Treatment Effects A Simple, Graphical Procedure for Comparing Multiple Treatment Effects Brennan S. Thompson and Matthew D. Webb May 15, 2015 > Abstract In this paper, we utilize a new graphical

More information

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review STATS 200: Introduction to Statistical Inference Lecture 29: Course review Course review We started in Lecture 1 with a fundamental assumption: Data is a realization of a random process. The goal throughout

More information

Estimation of the long Memory parameter using an Infinite Source Poisson model applied to transmission rate measurements

Estimation of the long Memory parameter using an Infinite Source Poisson model applied to transmission rate measurements of the long Memory parameter using an Infinite Source Poisson model applied to transmission rate measurements François Roueff Ecole Nat. Sup. des Télécommunications 46 rue Barrault, 75634 Paris cedex 13,

More information

Asymptotics and Simulation of Heavy-Tailed Processes

Asymptotics and Simulation of Heavy-Tailed Processes Asymptotics and Simulation of Heavy-Tailed Processes Department of Mathematics Stockholm, Sweden Workshop on Heavy-tailed Distributions and Extreme Value Theory ISI Kolkata January 14-17, 2013 Outline

More information

AN EMPIRICAL TAIL INDEX and VaR ANALYSIS 9

AN EMPIRICAL TAIL INDEX and VaR ANALYSIS 9 AN EMPIRICAL TAIL INDEX and VaR ANALYSIS 9 M. Ivette Gomes Clara Viseu CEAUL and DEIO, FCUL ISCA, Universidade de Coimbra Universidade de Lisboa, Portugal CEAUL, Universidade de Lisboa, Portugal Abstract:

More information

Bias-corrected goodness-of-fit tests for Pareto-type behavior

Bias-corrected goodness-of-fit tests for Pareto-type behavior Bias-corrected goodness-of-fit tests for Pareto-type behavior Yuri Goegebeur University of Southern Denmark, Department of Statistics JB Winsløws Ve 9B DK5000 Odense C, Denmark E-mail: yurigoegebeur@statsdudk

More information

The Nonparametric Bootstrap

The Nonparametric Bootstrap The Nonparametric Bootstrap The nonparametric bootstrap may involve inferences about a parameter, but we use a nonparametric procedure in approximating the parametric distribution using the ECDF. We use

More information

Parametric Techniques

Parametric Techniques Parametric Techniques Jason J. Corso SUNY at Buffalo J. Corso (SUNY at Buffalo) Parametric Techniques 1 / 39 Introduction When covering Bayesian Decision Theory, we assumed the full probabilistic structure

More information

A TEST OF FIT FOR THE GENERALIZED PARETO DISTRIBUTION BASED ON TRANSFORMS

A TEST OF FIT FOR THE GENERALIZED PARETO DISTRIBUTION BASED ON TRANSFORMS A TEST OF FIT FOR THE GENERALIZED PARETO DISTRIBUTION BASED ON TRANSFORMS Dimitrios Konstantinides, Simos G. Meintanis Department of Statistics and Acturial Science, University of the Aegean, Karlovassi,

More information

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems Principles of Statistical Inference Recap of statistical models Statistical inference (frequentist) Parametric vs. semiparametric

More information

Multivariate Normal-Laplace Distribution and Processes

Multivariate Normal-Laplace Distribution and Processes CHAPTER 4 Multivariate Normal-Laplace Distribution and Processes The normal-laplace distribution, which results from the convolution of independent normal and Laplace random variables is introduced by

More information

Bayesian Point Process Modeling for Extreme Value Analysis, with an Application to Systemic Risk Assessment in Correlated Financial Markets

Bayesian Point Process Modeling for Extreme Value Analysis, with an Application to Systemic Risk Assessment in Correlated Financial Markets Bayesian Point Process Modeling for Extreme Value Analysis, with an Application to Systemic Risk Assessment in Correlated Financial Markets Athanasios Kottas Department of Applied Mathematics and Statistics,

More information

Introduction to Rare Event Simulation

Introduction to Rare Event Simulation Introduction to Rare Event Simulation Brown University: Summer School on Rare Event Simulation Jose Blanchet Columbia University. Department of Statistics, Department of IEOR. Blanchet (Columbia) 1 / 31

More information

Chapter 9. Non-Parametric Density Function Estimation

Chapter 9. Non-Parametric Density Function Estimation 9-1 Density Estimation Version 1.1 Chapter 9 Non-Parametric Density Function Estimation 9.1. Introduction We have discussed several estimation techniques: method of moments, maximum likelihood, and least

More information

Quantile-quantile plots and the method of peaksover-threshold

Quantile-quantile plots and the method of peaksover-threshold Problems in SF2980 2009-11-09 12 6 4 2 0 2 4 6 0.15 0.10 0.05 0.00 0.05 0.10 0.15 Figure 2: qqplot of log-returns (x-axis) against quantiles of a standard t-distribution with 4 degrees of freedom (y-axis).

More information

Chapter 9. Non-Parametric Density Function Estimation

Chapter 9. Non-Parametric Density Function Estimation 9-1 Density Estimation Version 1.2 Chapter 9 Non-Parametric Density Function Estimation 9.1. Introduction We have discussed several estimation techniques: method of moments, maximum likelihood, and least

More information

Inference For High Dimensional M-estimates. Fixed Design Results

Inference For High Dimensional M-estimates. Fixed Design Results : Fixed Design Results Lihua Lei Advisors: Peter J. Bickel, Michael I. Jordan joint work with Peter J. Bickel and Noureddine El Karoui Dec. 8, 2016 1/57 Table of Contents 1 Background 2 Main Results and

More information

Spatial and temporal extremes of wildfire sizes in Portugal ( )

Spatial and temporal extremes of wildfire sizes in Portugal ( ) International Journal of Wildland Fire 2009, 18, 983 991. doi:10.1071/wf07044_ac Accessory publication Spatial and temporal extremes of wildfire sizes in Portugal (1984 2004) P. de Zea Bermudez A, J. Mendes

More information

UNIVERSITÄT POTSDAM Institut für Mathematik

UNIVERSITÄT POTSDAM Institut für Mathematik UNIVERSITÄT POTSDAM Institut für Mathematik Testing the Acceleration Function in Life Time Models Hannelore Liero Matthias Liero Mathematische Statistik und Wahrscheinlichkeitstheorie Universität Potsdam

More information

STAT 512 sp 2018 Summary Sheet

STAT 512 sp 2018 Summary Sheet STAT 5 sp 08 Summary Sheet Karl B. Gregory Spring 08. Transformations of a random variable Let X be a rv with support X and let g be a function mapping X to Y with inverse mapping g (A = {x X : g(x A}

More information

Parametric Techniques Lecture 3

Parametric Techniques Lecture 3 Parametric Techniques Lecture 3 Jason Corso SUNY at Buffalo 22 January 2009 J. Corso (SUNY at Buffalo) Parametric Techniques Lecture 3 22 January 2009 1 / 39 Introduction In Lecture 2, we learned how to

More information

arxiv: v1 [math.st] 4 Aug 2017

arxiv: v1 [math.st] 4 Aug 2017 Exponentiated Generalized Pareto Distribution: Properties and applications towards Extreme Value Theory Se Yoon Lee Joseph H. T. Kim arxiv:78.686v [math.st] 4 Aug 27 Abstract The Generalized Pareto Distribution

More information

A THRESHOLD APPROACH FOR PEAKS-OVER-THRESHOLD MODELING USING MAXIMUM PRODUCT OF SPACINGS

A THRESHOLD APPROACH FOR PEAKS-OVER-THRESHOLD MODELING USING MAXIMUM PRODUCT OF SPACINGS Statistica Sinica 20 2010, 1257-1272 A THRESHOLD APPROACH FOR PEAKS-OVER-THRESHOLD MODELING USING MAXIMUM PRODUCT OF SPACINGS Tony Siu Tung Wong and Wai Keung Li The University of Hong Kong Abstract: We

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

Fitting the generalized Pareto distribution to data using maximum goodness-of-fit estimators

Fitting the generalized Pareto distribution to data using maximum goodness-of-fit estimators Computational Statistics & Data Analysis 51 (26) 94 917 www.elsevier.com/locate/csda Fitting the generalized Pareto distribution to data using maximum goodness-of-fit estimators Alberto Luceño E.T.S. de

More information

Statistics. Lecture 2 August 7, 2000 Frank Porter Caltech. The Fundamentals; Point Estimation. Maximum Likelihood, Least Squares and All That

Statistics. Lecture 2 August 7, 2000 Frank Porter Caltech. The Fundamentals; Point Estimation. Maximum Likelihood, Least Squares and All That Statistics Lecture 2 August 7, 2000 Frank Porter Caltech The plan for these lectures: The Fundamentals; Point Estimation Maximum Likelihood, Least Squares and All That What is a Confidence Interval? Interval

More information

HIERARCHICAL MODELS IN EXTREME VALUE THEORY

HIERARCHICAL MODELS IN EXTREME VALUE THEORY HIERARCHICAL MODELS IN EXTREME VALUE THEORY Richard L. Smith Department of Statistics and Operations Research, University of North Carolina, Chapel Hill and Statistical and Applied Mathematical Sciences

More information

University of California San Diego and Stanford University and

University of California San Diego and Stanford University and First International Workshop on Functional and Operatorial Statistics. Toulouse, June 19-21, 2008 K-sample Subsampling Dimitris N. olitis andjoseph.romano University of California San Diego and Stanford

More information

Asymptotic behavior for sums of non-identically distributed random variables

Asymptotic behavior for sums of non-identically distributed random variables Appl. Math. J. Chinese Univ. 2019, 34(1: 45-54 Asymptotic behavior for sums of non-identically distributed random variables YU Chang-jun 1 CHENG Dong-ya 2,3 Abstract. For any given positive integer m,

More information

The Goodness-of-fit Test for Gumbel Distribution: A Comparative Study

The Goodness-of-fit Test for Gumbel Distribution: A Comparative Study MATEMATIKA, 2012, Volume 28, Number 1, 35 48 c Department of Mathematics, UTM. The Goodness-of-fit Test for Gumbel Distribution: A Comparative Study 1 Nahdiya Zainal Abidin, 2 Mohd Bakri Adam and 3 Habshah

More information

Pitfalls in Using Weibull Tailed Distributions

Pitfalls in Using Weibull Tailed Distributions Pitfalls in Using Weibull Tailed Distributions Alexandru V. Asimit, Deyuan Li & Liang Peng First version: 18 December 2009 Research Report No. 27, 2009, Probability and Statistics Group School of Mathematics,

More information

ON THE TAIL INDEX ESTIMATION OF AN AUTOREGRESSIVE PARETO PROCESS

ON THE TAIL INDEX ESTIMATION OF AN AUTOREGRESSIVE PARETO PROCESS Discussiones Mathematicae Probability and Statistics 33 (2013) 65 77 doi:10.7151/dmps.1149 ON THE TAIL INDEX ESTIMATION OF AN AUTOREGRESSIVE PARETO PROCESS Marta Ferreira Center of Mathematics of Minho

More information

Modeling and Interpolation of Non-Gaussian Spatial Data: A Comparative Study

Modeling and Interpolation of Non-Gaussian Spatial Data: A Comparative Study Modeling and Interpolation of Non-Gaussian Spatial Data: A Comparative Study Gunter Spöck, Hannes Kazianka, Jürgen Pilz Department of Statistics, University of Klagenfurt, Austria hannes.kazianka@uni-klu.ac.at

More information

On probabilities of large and moderate deviations for L-statistics: a survey of some recent developments

On probabilities of large and moderate deviations for L-statistics: a survey of some recent developments UDC 519.2 On probabilities of large and moderate deviations for L-statistics: a survey of some recent developments N. V. Gribkova Department of Probability Theory and Mathematical Statistics, St.-Petersburg

More information

A Note on Tail Behaviour of Distributions. the max domain of attraction of the Frechét / Weibull law under power normalization

A Note on Tail Behaviour of Distributions. the max domain of attraction of the Frechét / Weibull law under power normalization ProbStat Forum, Volume 03, January 2010, Pages 01-10 ISSN 0974-3235 A Note on Tail Behaviour of Distributions in the Max Domain of Attraction of the Frechét/ Weibull Law under Power Normalization S.Ravi

More information

Finite-time Ruin Probability of Renewal Model with Risky Investment and Subexponential Claims

Finite-time Ruin Probability of Renewal Model with Risky Investment and Subexponential Claims Proceedings of the World Congress on Engineering 29 Vol II WCE 29, July 1-3, 29, London, U.K. Finite-time Ruin Probability of Renewal Model with Risky Investment and Subexponential Claims Tao Jiang Abstract

More information

Shape of the return probability density function and extreme value statistics

Shape of the return probability density function and extreme value statistics Shape of the return probability density function and extreme value statistics 13/09/03 Int. Workshop on Risk and Regulation, Budapest Overview I aim to elucidate a relation between one field of research

More information

On the Application of the Generalized Pareto Distribution for Statistical Extrapolation in the Assessment of Dynamic Stability in Irregular Waves

On the Application of the Generalized Pareto Distribution for Statistical Extrapolation in the Assessment of Dynamic Stability in Irregular Waves On the Application of the Generalized Pareto Distribution for Statistical Extrapolation in the Assessment of Dynamic Stability in Irregular Waves Bradley Campbell 1, Vadim Belenky 1, Vladas Pipiras 2 1.

More information

Estimation of risk measures for extreme pluviometrical measurements

Estimation of risk measures for extreme pluviometrical measurements Estimation of risk measures for extreme pluviometrical measurements by Jonathan EL METHNI in collaboration with Laurent GARDES & Stéphane GIRARD 26th Annual Conference of The International Environmetrics

More information

A Class of Weighted Weibull Distributions and Its Properties. Mervat Mahdy Ramadan [a],*

A Class of Weighted Weibull Distributions and Its Properties. Mervat Mahdy Ramadan [a],* Studies in Mathematical Sciences Vol. 6, No. 1, 2013, pp. [35 45] DOI: 10.3968/j.sms.1923845220130601.1065 ISSN 1923-8444 [Print] ISSN 1923-8452 [Online] www.cscanada.net www.cscanada.org A Class of Weighted

More information

Spring 2017 Econ 574 Roger Koenker. Lecture 14 GEE-GMM

Spring 2017 Econ 574 Roger Koenker. Lecture 14 GEE-GMM University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 14 GEE-GMM Throughout the course we have emphasized methods of estimation and inference based on the principle

More information

Journal of Statistical Research 2007, Vol. 41, No. 1, pp Bangladesh

Journal of Statistical Research 2007, Vol. 41, No. 1, pp Bangladesh Journal of Statistical Research 007, Vol. 4, No., pp. 5 Bangladesh ISSN 056-4 X ESTIMATION OF AUTOREGRESSIVE COEFFICIENT IN AN ARMA(, ) MODEL WITH VAGUE INFORMATION ON THE MA COMPONENT M. Ould Haye School

More information

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky Empirical likelihood with right censored data were studied by Thomas and Grunkmier (1975), Li (1995),

More information

PAijpam.eu ADAPTIVE K-S TESTS FOR WHITE NOISE IN THE FREQUENCY DOMAIN Hossein Arsham University of Baltimore Baltimore, MD 21201, USA

PAijpam.eu ADAPTIVE K-S TESTS FOR WHITE NOISE IN THE FREQUENCY DOMAIN Hossein Arsham University of Baltimore Baltimore, MD 21201, USA International Journal of Pure and Applied Mathematics Volume 82 No. 4 2013, 521-529 ISSN: 1311-8080 (printed version); ISSN: 1314-3395 (on-line version) url: http://www.ijpam.eu doi: http://dx.doi.org/10.12732/ijpam.v82i4.2

More information

Do Markov-Switching Models Capture Nonlinearities in the Data? Tests using Nonparametric Methods

Do Markov-Switching Models Capture Nonlinearities in the Data? Tests using Nonparametric Methods Do Markov-Switching Models Capture Nonlinearities in the Data? Tests using Nonparametric Methods Robert V. Breunig Centre for Economic Policy Research, Research School of Social Sciences and School of

More information

Chapter 2 Asymptotics

Chapter 2 Asymptotics Chapter Asymptotics.1 Asymptotic Behavior of Student s Pdf Proposition.1 For each x R d,asν, f ν,,a x g a, x..1 Proof Let a = 0. Using the well-known formula that Ɣz = π z e z z z 1 + O 1, as z,. z we

More information

Multivariate generalized Pareto distributions

Multivariate generalized Pareto distributions Multivariate generalized Pareto distributions Holger Rootzén and Nader Tajvidi Abstract Statistical inference for extremes has been a subject of intensive research during the past couple of decades. One

More information

1 Random walks and data

1 Random walks and data Inference, Models and Simulation for Complex Systems CSCI 7-1 Lecture 7 15 September 11 Prof. Aaron Clauset 1 Random walks and data Supposeyou have some time-series data x 1,x,x 3,...,x T and you want

More information

interval forecasting

interval forecasting Interval Forecasting Based on Chapter 7 of the Time Series Forecasting by Chatfield Econometric Forecasting, January 2008 Outline 1 2 3 4 5 Terminology Interval Forecasts Density Forecast Fan Chart Most

More information

If we want to analyze experimental or simulated data we might encounter the following tasks:

If we want to analyze experimental or simulated data we might encounter the following tasks: Chapter 1 Introduction If we want to analyze experimental or simulated data we might encounter the following tasks: Characterization of the source of the signal and diagnosis Studying dependencies Prediction

More information

The Slow Convergence of OLS Estimators of α, β and Portfolio. β and Portfolio Weights under Long Memory Stochastic Volatility

The Slow Convergence of OLS Estimators of α, β and Portfolio. β and Portfolio Weights under Long Memory Stochastic Volatility The Slow Convergence of OLS Estimators of α, β and Portfolio Weights under Long Memory Stochastic Volatility New York University Stern School of Business June 21, 2018 Introduction Bivariate long memory

More information

Generalized least-squares estimators for the thickness of heavy tails

Generalized least-squares estimators for the thickness of heavy tails Journal of Statistical Planning and Inference 119 (2004) 341 352 www.elsevier.com/locate/jspi Generalized least-squares estimators for the thickness of heavy tails Inmaculada B. Aban ;1, Mark M. Meerschaert

More information

STA 2201/442 Assignment 2

STA 2201/442 Assignment 2 STA 2201/442 Assignment 2 1. This is about how to simulate from a continuous univariate distribution. Let the random variable X have a continuous distribution with density f X (x) and cumulative distribution

More information

Inferring from data. Theory of estimators

Inferring from data. Theory of estimators Inferring from data Theory of estimators 1 Estimators Estimator is any function of the data e(x) used to provide an estimate ( a measurement ) of an unknown parameter. Because estimators are functions

More information

Normal Probability Plot Probability Probability

Normal Probability Plot Probability Probability Modelling multivariate returns Stefano Herzel Department ofeconomics, University of Perugia 1 Catalin Starica Department of Mathematical Statistics, Chalmers University of Technology Reha Tutuncu Department

More information

Multivariate Heavy Tails, Asymptotic Independence and Beyond

Multivariate Heavy Tails, Asymptotic Independence and Beyond Multivariate Heavy Tails, endence and Beyond Sidney Resnick School of Operations Research and Industrial Engineering Rhodes Hall Cornell University Ithaca NY 14853 USA http://www.orie.cornell.edu/ sid

More information