Fitting the generalized Pareto distribution to data using maximum goodness-of-fit estimators

Size: px
Start display at page:

Download "Fitting the generalized Pareto distribution to data using maximum goodness-of-fit estimators"

Transcription

1 Computational Statistics & Data Analysis 51 (26) Fitting the generalized Pareto distribution to data using maximum goodness-of-fit estimators Alberto Luceño E.T.S. de Ingenieros de Caminos, University of Cantabria, 395 Santander, Spain Received 15 January 25; received in revised form 12 July 25; accepted 2 September 25 Available online 12 October 25 Abstract Some of the most powerful techniques currently available to test the goodness of fit of a hypothesized continuous cumulative distribution function (CDF) use statistics based on the empirical distribution function (EDF), such as those of Kolmogorov, Cramer von Mises and Anderson Darling, among others. The use of EDF statistics was analyzed for estimation purposes. In this approach, maximum goodness-of-fit estimators (also called minimum distance estimators) of the parameters of the CDF can be obtained by minimizing any of the EDF statistics with respect to the unknown parameters. The results showed that there is no unique EDF statistic that can be considered most efficient for all situations. Consequently, the possibility of defining new EDF statistics is entertained; in particular, an Anderson Darling statistic of degree two and one-sided Anderson Darling statistics of degree one and two appear to be notable in some situations. The procedure is shown to be able to deal successfully with the estimation of the parameters of homogeneous and heterogeneous generalized Pareto distributions, even when maximum likelihood and other estimation methods fail. 25 Elsevier B.V. All rights reserved. Keywords: Anderson Darling statistic; Cramer von Mises statistic; Empirical distribution function; Generalized linear models; Kolmogorov distance; Minimum distance estimator 1. Introduction The maximum likelihood (ML) estimation method is known to be asymptotically optimal to estimate the parameters of many discrete and continuous distributions. However, there is a considerable number of continuous distributions for which the probability density function and, subsequently, the likelihood function can be made arbitrarily large at some point, and hence maximum likelihood estimators (MLEs) of the parameters of these distributions do not generally exist. Moreover, if the range of the distribution depends on unknown parameters, MLEs may not possess their classical statistical properties because Cramer s regularity conditions often fail to hold in this situation. There are also some distributions such that the likelihood function may not have a local maximum for some sample values so that no MLE exists. When the ML method cannot be used, an alternative estimation method is the method of moments (MOM). However, there is also a considerable number of distributions for which some of the first few moments are not finite, with the consequence that moment estimates do not exist. Even when the moments exist, the MOM method may produce inefficient estimators. In addition, the MOM method cannot be used in the context of generalized linear models. Tel.: ; fax: address: lucenoa@unican.es /$ - see front matter 25 Elsevier B.V. All rights reserved. doi:1.116/j.csda

2 A. Luceño / Computational Statistics & Data Analysis 51 (26) An interesting example of a distribution that poses these difficulties is provided by the generalized Pareto CDF given by { 1 (1 kx/θ) 1/k if k =, F θ,k (x) = 1 exp( x/θ) if k =, (1) where θ > is a scale parameter and k is a shape parameter. The range of x is x for k and x θ/k for k>. When k>1, MLEs do not exist because the probability density corresponding to (1) tends to infinity when x tends to θ/k. Moreover, Cramer s regularity conditions do not hold for k> 1 3. The mean and variance are μ = θ/(1 + k) and σ 2 = θ 2 / { (1 + k) 2 (1 + 2k) }, so that μ and σ 2 are finite only for k> 1 and k> 2 1, respectively. Consequently, moments estimators do not exist for k 2 1. The problem of fitting the generalized Pareto distribution (GPD) to data has been approached by several authors including Hosking et al. (1985), Hosking and Wallis (1987), Davison and Smith (199), Walshaw (199), Grimshaw (1993), Castillo and Hadi (1997), and Castillo et al. (25), among others. Goodness-of-fit tests for the GPD have been suggested by Choulakian and Stephens (21). The GPD is also important because it contains the exponential distribution with mean θ as a limiting case when k tends to, the uniform distribution in the range [, θ] when k = 1, and the standard Pareto distribution when k<. Moreover, its relevance has recently increased considerably (see Appendix A) because as shown by Pickands (1975) it can be put in connection with the generalized extreme value distribution (GEVD)having CDF { { exp (1 k(x μ)/ψ) 1/k } if k =, F μ,ψ,k (x) = exp[ exp{ (x μ)/ψ)}] if k =. (2) In this paper we analyze a method for estimating the parameters of continuous CDFs, which is based on minimizing empirical distribution function (EDF) statistics and can be used as an alternative or as a complement to other estimation methods. Because EDF statistics are used to test the goodness of fit of continuous distributions, we call this method the maximum goodness of fit (MGF) estimation method. The estimators provided by the MGF method will be called maximum goodness-of-fit estimators (MGFEs). The origin of the MGF method goes back to Wolfowitz (1953, 1957) and Kac et al. (1955), under the name of minimum distance method. Moreover, Pollard (198) proved the n-consistency of the minimum distance estimators and found its asymptotic distribution. Because the name minimum distance method is often used in other contexts not related to EDF statistics (for example, when minimizing functions of the sample and population autocorrelations of the residuals in time series, or the distance between sample and predicted moments), we prefer to use the name MGF throughout the paper. One important property of the MGF method is that it can be used in situations in which there are no MOM or ML estimators. In contrast with the MOM or ML methods which lead to unique estimators, the MGF method provides several estimators depending on the particular EDF statistic chosen, thus providing a wider inductive basis. For instance, one particular EDF statistic could provide more weight to the left tail of the distribution, whereas a second EDF statistic could assign more weight to the right tail or to the central part of the distribution, and a third statistic could assign equal weight to every part of the distribution. Even though the MGF method seems to have been disregarded as a useful estimation method to fit the GPD to data (see, e.g., Castillo et al., 25; Coles, 21; Smith, 23), we shall show throughout the paper that the MGF method can be successfully used to estimate the parameters of the GPD (and of generalized linear models based on the GPD) even for very extreme values of the shape parameter. Section 2 compiles the classical EDF statistics used throughout the paper together with some new EDF statistics that are useful for estimation purposes. Section 3 describes the MGF estimation method. The performance of MGF estimators is analyzed for homogeneous GPDs in Section 4 and for generalized linear models based on GPDs in Section 5; a real example of application to ocean engineering is also considered in Section 4. Concluding remarks are given in Section Some EDF statistics useful for estimation Let (x 1,...,x n ) be a sample of n IID observations on a continuous random variable X with CDF F(x). Let x (1) x (n) be the corresponding order statistics and S n (x) be the empirical distribution function (see Rao, 1973).

3 96 A. Luceño / Computational Statistics & Data Analysis 51 (26) Table 1 Three classical EDF statistics Statistic Acronym Formula Kolmogorov distance KS D n = sup x F(x) S n (x) Cramer von Mises CM Wn 2 = n {F(x) S n(x)} 2 df(x) Anderson Darling AD A 2 n = n {F(x) S n (x)} 2 F(x){1 F(x)} df(x) Table 2 Modified Anderson Darling statistics Statistic Acronym Formula Right-tail AD ADR Rn 2 = n {F(x) S n (x)} 2 df(x) 1 F(x) Left-tail AD ADL L 2 n = n {F(x) S n (x)} 2 df(x) F(x) Right-tail AD of second degree AD2R rn 2 = n {F(x) S n (x)} 2 {1 F(x)} 2 df(x) Left-tail AD of second degree AD2L ln 2 = n {F(x) S n (x)} 2 {F(x)} 2 df(x) AD of second degree AD2 an 2 = r2 n + l2 n The purpose of EDF statistics is to measure the distance between F(x) and S n (x). Throughout the paper, the three classical EDF statistics in Table 1, as well as the five modified EDF statistics introduced in Table 2, will be considered. The AD statistic A 2 n gives more weight to the tails of the CDF than the CM statistic W n 2. Similarly, the ADR and ADL statistics assign more weight to the selected tail of the CDF than the CM statistic. Either of the tails, or both of them, can receive even larger weights by using second degree Anderson Darling statistics. Note also that an 2 is defined in analogy with the relationship A 2 n = R2 n + L2 n satisfied by the AD, ADR and ADL statistics. Because S n(x) is a step function with jumps at the order statistics, the EDF statistics in Tables 1 and 2 can be written in alternative forms, which are more useful for computational purposes (see Appendix B). 3. Maximum goodness-of-fit estimation 3.1. Homogeneous populations Suppose that (x 1,...,x n ) is a sample of n IID observations on a continuous random variable X having CDF F Θ (x), where Θ is a vector of unknown parameters. MGFEs for Θ can be obtained minimizing any one of the EDF statistics provided in Section Heterogeneous populations Now suppose that (y 1,...,y n ) is a sample of independent but not necessarily identically distributed observations on n continuous random variables (Y 1,...,Y n ) having CDFs F i,θ (y i ), i = 1,...,n, where the functions F i,θ ( ) share the same set of unknown parameters Θ. In this case one cannot directly evaluate EDF statistics for (y 1,...,y n ) because these data come from different CDFs. However, any one of the random variables (U 1,...,U n ) defined by the transformations U i = F i,θ (Y i ), where U i 1 and i = 1,...,n, must have the same uniform U[, 1] distribution if F i,θ ( ) is the true CDF of Y i. This can be easily demonstrated as follows: u i = F i,θ (y i ) = Pr (Y i y i ) = Pr [ F i,θ (Y i ) F i,θ (y i ) ] = Pr (U i u i ) = F Ui (u i ),

4 A. Luceño / Computational Statistics & Data Analysis 51 (26) where u i 1. Hence S Θ (u 1,...,u n ) is a sample of n IID realizations on a U[, 1] random variable, for which one can evaluate EDF statistics. MGFEs for Θ can be computed by minimizing with respect to Θ any one of the EDF statistics obtained for the transformed samples S Θ. 4. Performance of the MGF estimators in homogeneous populations 4.1. Uniform distribution The U[, θ] CDF is a particular case of the GPD, obtained using k = 1 in (1). Suppose that (x 1,...,x n ) is a sample of n IID observations on a U[, θ] random variable X. Although Cramer s regularity conditions do not hold, the scale parameter θ can be estimated using the ML method. The MLE of θ is ˆθ = max (x 1,...,x n ). (3) ) ) This estimator is biased because E (ˆθ =nθ/(n+1) = θ, but superefficient because var (ˆθ =nθ 2 /{ (n + 2)(n + 1) 2}. The root mean squared error (RMSE) of ˆθ ) is RMSE (ˆθ = θ{(n + 1)(n + 2)/2} 1/2. We have simulated 1 samples of size n = 1 on a U[, 1] distribution and have computed MGFEs θ for θ by minimizing all the EDF statistics in Section 2. (The results are, obviously, invariant with respect to the value of θ.) The minimizations have been performed subject to the boundary condition θ max (x 1,...,x n ), which is required when no outliers are present. Table 3 provides the estimated bias and RMSEs for the MGFEs corresponding to the eight EDF statistics of Section 2, which have been ordered in ascending order of their RMSE estimates. The table also provides the bias and RMSE corresponding to the MLE ˆθ in ordered position. Table 3 shows that the MGFEs obtained minimizing AD2R and AD2 statistics have smaller bias and RMSE than the MLE ˆθ. The ) ADR and AD statistics provide estimators with smaller bias than ˆθ, although their RMSEs are larger than RMSE (ˆθ. Although the MGFEs using AD2R and AD2 statistics outperform the MLE of θ for the U[, θ] distribution, one cannot expect that MGFEs will generally outperform MLEs in terms of the RMSE they produce, particularly if Cramer s regularity conditions hold, because in this case the ML method provides asymptotically optimal estimators. However, we shall provide evidence that MGFEs are often close to MLEs in terms of RMSEs, thus providing a justification for the use of MGFEs when the ML method lacks its optimality properties Some exponential distributions The exponential Exp(θ) CDF is a particular case of the GPD, obtained using k = in (1). Suppose that (x 1,...,x n ) is a sample) of n IID observations on an Exp(θ) random variable X. The MLE ˆθ = x is unbiased, asymptotically optimal and var (ˆθ = θ 2 /n. Therefore, MGFEs are expected to be inferior to ˆθ in terms of RMSEs. We have simulated 1 samples of size 1 for the Exp(1) distribution and have computed the corresponding MGFEs. (The results are, obviously, invariant with respect to the value of θ.) Table 4 provides the estimated bias and RMSEs for the MGFEs corresponding to the eight EDF statistics of Section 2, together with the bias and RMSE of the MLE, ordered according to their RMSEs. Although the MLE is optimal, the difference is small with respect to the MGFEs using ADR and AD statistics. Suppose now that the samples of size 1 are contaminated so that 95 observations come from the Exp(θ) distribution with θ = 1, but five observations correspond to the Exp(4θ) distribution. Table 5 provides the new values of the bias and RMSEs for the ML and MGF estimators of θ. One can see that the MLE is now outperformed by five of the eight MGFEs. Consequently, MGFEs may be more robust than MLEs. As a third model, suppose that the random variable X has the shifted exponential CDF { 1 exp{ (x α)/θ} if x α, F α,θ (x) = if x<α.

5 98 A. Luceño / Computational Statistics & Data Analysis 51 (26) Table 3 Estimated bias and RMSEs for the MGFEs corresponding to the eight EDF statistics of Section 2, together with the bias and RMSE of the MLE, based on 1 samples of size 1 on a uniform U[, θ] distribution with θ = 1, in ascending order of the RMSEs AD2R AD2 MLE ADR AD CM KS ADL AD2L Bias RMSE Table 4 Estimated bias and RMSEs for the MGFEs corresponding to the eight EDF statistics of Section 2, together with the bias and RMSE of the MLE, based on 1 samples of size 1 on an exponential Exp(θ) distribution with θ = 1, in ascending order of the RMSEs MLE ADR AD CM KS AD2 ADL AD2R AD2L Bias RMSE Table 5 Estimated bias and RMSEs for the MGFEs corresponding to the eight EDF statistics of Section 2, together with the bias and RMSE of the MLE, based on 1 samples of size 1 on the contaminated exponential distribution of Section 4.2 with θ = 1 AD CM KS ADR ADL MLE AD2L AD2 AD2R Bias RMSE Table 6 Estimated bias and RMSEs for the MGFEs corresponding to the eight EDF statistics of Section 2, together with the bias and RMSE of the MLE, based on 1 samples of size 1 on the shifted exponential distribution of Section 4.2 with α = 1 and θ = 1 α AD2L ADL MLE AD2 AD CM KS ADR AD2R Bias RMSE θ MLE ADR AD CM KS AD2 ADL AD2L AD2R Bias RMSE The MLEs are ˆα = min (x 1,...,x n ) and ˆθ = x ˆα. These estimators are biased, because E (ˆα ) ) = α + θ/n and E (ˆθ = θ(n 1)/n, and have variances var (ˆα ) ) = θ 2 /n 2 and var (ˆθ = θ 2 (n 1)/n 2 so that ˆα is superefficient for the location parameter α. Cramer s regularity conditions do not hold because of α. Table 6 provides the values of the bias and RMSEs for the MGF and ML estimators based on 1 samples of size 1 with α=1 and θ=1. The minimizations of the EDF statistics have been performed subject to the boundary condition α min (x 1,...,x n ). The MLE of θ is better than the corresponding MGFEs. However, the MGFEs for α based on AD2L and ADL statistics outperform the MLE ˆα both in terms of bias and RMSE; in addition, most of the MGFEs for α have smaller bias than the MLE ˆα Normal distribution For completeness, Table 7 shows the results corresponding to a normal N ( μ, σ 2) distribution with mean μ = 1 and standard deviation σ = 1. Obviously, the MLEs of μ and σ are optimal in this situation. Nevertheless, the difference between the RMSEs provided by the MLEs and several MGFEs (e.g., using the AD statistic) is very small, particularly for the estimation of μ.

6 A. Luceño / Computational Statistics & Data Analysis 51 (26) Table 7 Estimated bias and RMSEs for the MGFEs corresponding to the eight EDF statistics of Section 2, together with the bias and RMSE of the MLE, based on 1 samples of size 1 on a normal N(1, 1) distribution, in ascending order of the RMSEs μ MLE CM AD ADR ADL AD2 KS AD2R AD2L Bias RMSE σ MLE AD ADL ADR KS CM AD2 AD2R AD2L Bias RMSE Generalized Pareto distribution As shown in Section 1, the ML method fails to provide estimators for the parameters k and θ of the GPD, given in (1), for most positive values of the parameter k. Thus, Hosking and Wallis (1987) compare MLEs with MOM estimates and with the estimates provided by a method of probability-weighted moments (PWM), but only consider a small range of values of k, namely, k <.5. Castillo and Hadi (1997) introduce the elemental percentile method (EPM) and expand the range of values of k to k 2, but disregard MLEs. Following these authors, we shall compare MGF estimators with MOM, PWM and EPM estimators, and also with the estimator provided by a quasi-ml (QML) method. QML method: Loosely speaking, our QML method uses a combination of the standard ML method when k<.75 and a modified ML method when k.75. But, because it is not possible to know the exact value of k using solely the information in a random sample (x 1,...,x n ), the decision about whether to use the standard or the modified ML method must be taken on empirical grounds. Therefore, we have adopted the QML method having the following steps: (1) Compute k = 1 n 1 ( ) x (i) ln 1 n 1 max (x i=1 1,...,x n ) and ni=1 xi 2 Z = 1 /n 2 x 2. (5) (2) If k<.75 and Z<, compute standard MLEs for k and θ. (3) Otherwise, estimate k using Eq. (4) and estimate θ using θ = k max (x 1,...,x n ). (6) (4) The justification of this method is as follows. When k is large, the range of the GPD is x θ/k and the ML method fails. Therefore, for large values of k, one can use estimators such that θ/ k = max (x 1,...,x n ), which is in agreement with the MLE provided by Eq. (3) for k = 1. This justifies (6). Moreover, introducing (6) in the log-likelihood function of the parameters k and θ given the remaining sample ( ) x (1),...,x (n 1), and maximizing the resulting expression with respect to k leads to (4). The additional criterium based on Eq. (5) can be justified on intuitive grounds because the asymptotic value of Z is 1 3 for the uniform distribution (k = 1) and for the exponential distribution (k = ). Eq. (5) can also be justified because nz is the slope of the log-likelihood function in the direction of k when k tends to and ˆθ = x (for k =, the MLE of θ is ˆθ = x, so that nz is related to the sign of ˆk in the neighborhood of k = ). Simulation results: Table 8 provides estimated bias and RMSEs for the MGF estimators obtained using the eight EDF statistics of Section 2 together with the estimators provided by the methods referred to in this section (QML or ML, MOM, PWM and EPM). These estimated bias and RMSEs are based on 1 samples of size 1 on the GPD with θ = 1 and k ={ 2, 1,, 1, 2}. The minimizations of the EDF statistics have been performed subject to the boundary condition θ/ k max (x 1,...,x n ) whenever k>. (The results are, obviously, invariant with respect to the value of θ.)

7 91 A. Luceño / Computational Statistics & Data Analysis 51 (26) Table 8 Estimated bias and RMSEs for the MGFEs corresponding to the eight EDF statistics of Section 2 and the QML, ML, MOM, PWM and EPM methods of Section 4.4, based on 1 samples of size 1 on the GPD with θ = 1 and k ={ 2, 1,, 1, 2}, in ascending order of the RMSEs k = 2 θ QML ADR EPM AD2R AD CM KS AD2 ADL PWM MOM AD2L Bias RMSE k QML ADR AD2R EPM AD CM KS ADL AD2 PWM MOM AD2L Bias RMSE k = 1 θ QML AD2R ADR EPM AD CM KS AD2 ADL MOM PWM AD2L Bias RMSE k QML AD2R EPM ADR AD CM KS AD2 ADL PWM MOM AD2L Bias RMSE k = θ MOM EPM ADR PWM QML AD CM AD2R KS ADL AD2 AD2L Bias RMSE k MOM PWM ADR AD2R QML EPM AD CM KS ADL AD2 AD2L Bias RMSE k = 1 θ AD MLE ADR CM KS ADL EPM AD2R AD2 AD2L PWM MOM Bias RMSE k MLE ADR AD CM AD2R PWM KS ADL EPM AD2 MOM AD2L Bias RMSE k = 2 θ AD ADL MLE CM KS ADR AD2 EPM AD2R AD2L PWM MOM Bias RMSE k MLE ADR AD CM KS ADL AD2R AD2 EPM PWM AD2L MOM Bias RMSE One can see that the QML method appears to provide the best estimators for θ and k when k = 2 and 1, even though this method has not been previously considered in the literature. The second best estimator appears to be a MGF estimator, namely, the ADR statistic for k = 2 and the AD2R statistic for k = 1. The MOM estimator appears to be the best for θ and k when k =, whereas the ADR statistic provides the third best estimator for θ and k in this case; the second place is shared by the EPM method for θ and the PWM method for k. The QML occupies the fifth position whereas the MLE cannot be used yet (because Eqs. (4) and (5) do not always suggest using the ML method when the unknown value of k equals ). Note that the MOM estimator appears to be among the worst estimators of θ and k for k ={ 2, 1, 1, 2}. The AD statistic appears to provide the best estimator for θ when k = 1 and 2, even though the ML method is always (i.e., with sampling frequency equal to 1) chosen by Eqs. (4) and (5). The MLE appears, however, in the first position as an estimator of k when k = 1 and 2, closely followed by the ADR and AD statistics in the second and third places. The fact that the MLE for θ is outperformed by the MGF estimator based on the AD statistic when k = 1 and 2 can probably be explained because MLEs do not display their asymptotic efficiency for samples of size 1 on the GPD, which was shown by Hosking and Wallis (1987) for k <.5. Although Hosking and Wallis showed that the

8 A. Luceño / Computational Statistics & Data Analysis 51 (26) standard ML method gives worse performance than the MOM and PWM methods for samples sizes around 1 and k <.5, this does not hold for k = 1or 2. Example: To illustrate the use of MGFEs, we consider the Bilbao waves data X analyzed by Castillo and Hadi (1997). These are 179 zero-crossing hourly mean periods (in seconds) of the sea waves measured in a Bilbao buoy in January According to Pickands (1975), for large enough threshold u, the CDF of X u, conditional on X>u, converges to the GPD as u increases (see Appendix A). Hence Castillo and Hadi (1997) analyze the data using thresholds at u = 7, 7.5, 8, 8.5, 9 and 9.5. Fig. 1 shows the empirical and estimated CDFs obtained for u = 7 with the eight MGFEs in Section 2. One can see that the GPD does not provide a good fit of the data, particularly in the left tail of the distribution; apparently, the number of small values of X 7 contained in the sample is smaller than would be expected under the GPD. Therefore, the threshold u = 7 is not large enough for Pickands asymptotic results to hold approximately. Fig. 2 is a reproduction of Fig. 1 using now threshold u = 7.5 (the resulting sample size is 154). One can see that the fit provided by the GPD improves drastically, particularly for the estimates obtained using KS, CM, AD, ADR and ADL statistics. Although one could increase the threshold value and repeat the analysis, the results obtained would be based on smaller sample sizes as the threshold increased, with the consequence that important information in the sample would be disregarded. Careful study of Figs. 1 and 2 reveals subtle differences among the estimated CDFs obtained with the eight MGFEs, depending on the weights they assign to each tail of the CDF. The upper part of Fig. 3 is a Cartesian representation of the estimates ( θ, k) obtained using all the estimation methods considered in this section. The lower part of Fig. 3 shows the corresponding values of the mean μ and range θ/ k estimated for {X 7.5 X>7.5}. The standard errors of θ and k can be obtained by simulation; for example, using u = 7.5 and 1 random samples, the resulting standard errors for the MGFEs based on the ADR statistic are.168 and.86, respectively. Fig. 3 shows that the MGFEs provided by the AD, ADR and ADL statistics and the MOM and PWM estimates are very close to each other, thus appearing as a compact cluster of estimates in the figure. The KS and CM statistics provide close but somewhat smaller estimates for θ and k, and larger estimates for θ/k. The remaining estimates (EPM, QML, AD2, AD2R and AD2L) are more distant apart from each other showing larger values of θ and k, and smaller values of θ/ k; note that most of the methods in this group assign large weights to the data in the right tail (QML, AD2 and AD2R) and/or to the data in the left tail (AD2 and AD2L). 5. Performance of the MGF estimators in heterogeneous populations One important property of the MGF estimation method is that it can be used to fit generalized linear models using the procedure in Section 3.2. This is in contrast with the MOM, PWM, EPM and QML methods of Section 4.4, which cannot be used when the information about the unknown parameters is contained in a sample (Y 1,...,Y n ) of independent but not necessarily identically distributed random variables Y i, i = 1,...,n. To illustrate the performance of the MGF method, we use two generalized linear models based on the GPD. According to the first model, the CDF of Y i is given by { 1 {1 ky/ (θ1 + θ 2 X i )} 1/k if k =, F θ1,θ 2,k(y) = 1 exp { y/(θ 1 + θ 2 X i )} if k =, where X i is a covariate. We have taken k = 2 so that none of the methods in Section 4.4, including the ML method, can be used to estimate θ 1, θ 2 and k with the exception of the MGF methods. We have also taken θ 1 = 1, θ 2 = 1 and X i = i/1, for i = 1,...,1, so that the values of the scale parameters θ 1 + θ 2 X i range from 1.1 to 11. Table 9 shows the estimated bias and RMSEs of the MGF estimators θ 1, θ2 and k corresponding to the eight EDF statistics of Section 2, based on 1 samples on the random vector (Y 1,...,Y 1 ). The AD statistic appears to provide the most efficient estimator for the three unknown parameters. The second and third places are occupied by the ADL and CM statistics, respectively.

9 912 A. Luceño / Computational Statistics & Data Analysis 51 (26) KS 1 CM 1 AD 1 ADR 1 ADL 1 AD2 1 AD2R 1 AD2L Fig. 1. Empirical and estimated generalized Pareto CDFs for the Bilbao wave exceedances data using threshold u = 7 and the MGF estimators in Section 2.

10 A. Luceño / Computational Statistics & Data Analysis 51 (26) KS 1 CM 1 AD 1 ADR 1 ADL 1 AD2 1 AD2R 1 AD2L Fig. 2. Empirical and estimated generalized Pareto CDFs for the Bilbao wave exceedances data using threshold u = 7.5 and the MGF estimators in Section 2.

11 914 A. Luceño / Computational Statistics & Data Analysis 51 (26) Threshold = k.7 KS CM AD 5 ADR ADL AD2 AD2R AD2L EPM MOM PWM QML θ θ/k Threshold = 7.5 KS CM AD ADR ADL AD2 AD2R AD2L EPM MOM PWM QML µ Fig. 3. Estimated values of θ versus k (upper part) and the mean μ versus the range θ/k (lower part) for {X 7.5 X>7.5}, obtained using the GPD and all the estimation methods in Section 4.4 for the Bilbao wave exceedances data. The second model we have considered is defined by the following CDFs for the projections of the random vector (Y 1,...,Y n ): { 1 {1 (k1 + k 2 X i ) y/θ} 1/(k 1+k 2 X i ) if k 1 + k 2 X i =, F θ,k1,k2(y) = 1 exp{ y/θ} if k 1 + k 2 X i =, where X i is a covariate. For illustration, we have taken k 1 = 1, k 2 = 1, θ = 1, n = 1 and X i = 2ln{(i.5)/2)} so that the shape parameters k 1 + k 2 X i range from to 4.29 (note that this range is considerably larger than those used previously in the literature). Consequently, the MOM, PWM, EPM, QML and ML methods in Section 4.4 cannot be used to estimate θ, k 1 and k 2. Table 1 shows the estimated bias and RMSEs of the MGF estimators θ, k 1 and k 2 corresponding to the eight EDF statistics of Section 2, based on 1 samples on the random vector (Y 1,...,Y 1 ).

12 A. Luceño / Computational Statistics & Data Analysis 51 (26) Table 9 Estimated bias and RMSEs for the MGFEs corresponding to the eight EDF statistics of Section 2, based on 1 samples (Y 1,...,Y 1 ) on independent generalized Pareto random variables Y i, i = 1,...,1, having parameters k = 2 and θ i = θ 1 + θ 2 X i, where θ 1 = 1, θ 2 = 1 and X i = i/1, in ascending order of the RMSEs θ 1 AD ADL CM KS AD2 AD2R ADR AD2L Bias RMSE θ 2 AD ADL CM KS ADR AD2 AD2R AD2L Bias RMSE k AD ADL CM ADR AD2 AD2R KS AD2L Bias RMSE Table 1 Estimated bias and RMSEs for the MGFEs corresponding to the eight EDF statistics of Section 2, based on 1 samples (Y 1,...,Y 1 ) on independent generalized Pareto random variables Y i, i = 1,...,1, having parameters θ = 1 and k i = k 1 + k 2 X i, where k 1 = 1, k 2 = 1 and X i = 2ln{(i.5)/2}, in ascending order of the RMSEs θ AD ADL CM ADR AD2L AD2 KS AD2R Bias RMSE k 1 AD ADL CM ADR KS AD2 AD2R AD2L Bias RMSE k 2 AD ADL ADR CM AD2 AD2L AD2R KS Bias RMSE The AD statistic appears again to provide the most efficient estimator for the three unknown parameters, followed by the ADL statistic in second place. 6. Concluding remarks In this article, we propose using statistics based on the empirical distribution function to estimate parameters of probability distributions in homogenous populations and parameters of generalized linear models in heterogeneous populations. Among these statistics, we consider the classical Kolmogorov Smirnov, Cramer von Mises and Anderson Darling statistics, which are routinely used to perform goodness-of-fit tests in homogeneous populations. In addition, we introduce modified statistics such as the left-tail and right-tail Anderson Darling statistics and the Anderson Darling statistics of second degree and illustrate their usefulness in the estimation context. We have shown that some of the new estimation methods, generically called maximum goodness-of-fit methods, outperform the ML method to estimate the parameters of some distributions; for example, the scale parameter of the uniform distribution in Section 4.1, the location parameter of the shifted exponential distribution in Section 4.2, and the scale parameter of the generalized Pareto distribution in Section 4.4. We have carefully considered the generalized Pareto distribution (GPD), which has recently received considerably attention after the discovery by Pickands (1975) of its close connection with the generalized extreme value distribution. The GPD has since become one of the most important distributions to model extreme values in financial, insurance, environmental, hydrological and ocean statistics, among others. It is also very interesting from the estimation point of view, because classical estimation methods such as the ML and moment methods fail for important ranges of the shape parameter of the GPD. However, we have shown in Section 4.4 that the maximum goodness-of-fit estimation methods

13 916 A. Luceño / Computational Statistics & Data Analysis 51 (26) can always be used, whichever the values of the shape parameter might be, and also that one can always find some maximum goodness-of-fit estimators having the best or close to the best efficiency among the estimators included in the analysis. In this context we have introduced a quasi-maximum likelihood method, which appears to have very good efficiency for positive values of the shape parameter. Moreover, we have shown in Sections 3 and 5 that maximum goodness-of-fit estimation methods can be used in the context of generalized linear models. In particular, we have used two generalized Pareto models for heterogeneous populations, whose parameters cannot be estimated by any of the methods considered, including ML, quasi-maximum likelihood, moment, probability-weighted moment and elemental percentile methods. However, these generalized Pareto models can be fitted using maximum goodness-of-fit estimators. The efficiency attained by these estimators in this context compares well with the efficiency attained for homogeneous populations. Acknowledgements This research was partially supported by the Spanish DGI Grant MTM I thank Professor George E.P. Box for great help during my repeated visits to the University of Wisconsin-Madison, where part of the research was performed. I am also grateful to a co-editor, an associate editor and two referees for their very helpful suggestions, which have led to an improved manuscript. Appendix A. Some connections between the GPD and the GEVD Pickands (1975) shows that classical limit results for sample maxima lead to parallel limit results for exceedances over thresholds. In particular, suppose that X 1,X 2,...is a sequence of independent and identically distributed (IID) random variables such that an appropriately normalized CDF of M n =max (X 1,...,X n ) converges to the GEVD. Then, for large enough threshold u, the CDF of X i u, conditional on X i >u, converges to the GPD as u increases. Both distributions share the same shape parameter k. A connection among the remaining parameters, which also embraces the intensity of a Poisson process, can be established using a Poisson-GPD process (see Coles, 21; Smith, 23) in which the number N of exceedances over the level u in any particular period (e.g., any year) has a Poisson distribution with mean λ and, conditional on N 1, the exceedances X 1,...,X N are IID random variables following the GPD. Then Prob {max (X 1,...,X N ) x} = e λ λ i e λ + {1 (1 k(x u)/θ) 1/k} i i! i=1 { = exp λ(1 k(x u)/θ) 1/k}. Identifying Eqs. (2) and (A.1) leads to the following relationships among the threshold u, the Poisson mean λ and the parameters of the GPD and GEVD: θ = ψλ k = ψ k(u μ). In addition, if u changes, the scale of the GPD becomes a function of u, say θ u, such that θ u + ku stays constant independently of u. Clearly, the GPD provides a wider inductive basis than the GEVD because the number of exceedances over interesting thresholds in any given period is usually much larger than one, which is the number of maxima in the period. This applies to many natural phenomena such as floods, waves, winds, temperatures, or earthquakes, among others. (A.1) Appendix B. Computational forms for the EDF statistics Using the notation z i = F ( x (i) ) and considering that Sn (x) is a step function with jumps at the order statistics, the EDF statistics in Tables 1 and 2 can be written in the forms of Table B.1.

14 A. Luceño / Computational Statistics & Data Analysis 51 (26) Table B.1 Computational forms for the EDF statistics Acronym Formula KS D n = 1 2n + max 1 i n z i i 1/2 n CM Wn 2 = 1 12n + n ( z i i 1/2 ) 2 i=1 n AD A 2 n = n 1 n (2i 1) {ln z i + ln (1 z n+1 i )} n i=1 ADR Rn 2 = n 2 2 n z i 1 n (2i 1) ln (1 z n+1 i ) i=1 n i=1 ADL L 2 n = 3n n z i 1 n (2i 1) ln z i i=1 n i=1 AD2R rn 2 = 2 n n 2i 1 AD2L AD2 ln (1 z i ) + 1 i=1 n i=1 1 z n+1 i ln 2 = 2 n ln z i + 1 n 2i 1 i=1 n i=1 z i an 2 = 2 n {ln z i + ln (1 z i )} + 1 n i=1 n i=1 ( 2i 1 + 2i 1 ) z i 1 z n+1 i References Castillo, E., Hadi, A.S., Fitting the generalized Pareto distribution to data. J. Amer. Statist. Assoc. 92, Castillo, E., Hadi, A.S., Balakrishnan, N., Sarabia, J.M., 25. Extreme Value and Related Models with Applications in Engineering and Science. Wiley, New York. Choulakian, V., Stephens, M.A., 21. Goodness-of-fit tests for the generalized Pareto distribution. Technometrics 43, Coles, S., 21. An Introduction to Statistical Modeling of Extreme Values. Springer, London. Davison, A.C., Smith, R.L., 199. Models for exceedances over high thresholds (with comments). J. Roy. Statist. Soc. B 52, Grimshaw, S.D., Computing maximum likelihood estimates for the generalized Pareto distribution. Technometrics 35, Hosking, J.R.M., Wallis, J.R., Parameter and quantile estimation for the generalized Pareto distribution. Technometrics 29, Hosking, J.R.M., Wallis, J.R., Wood, E.F., Estimation of the generalized extreme-value distribution by the method of probability-weighted moments. Technometrics 27, Kac, M., Fiefer, J., Wolfowitz, J., On tests of normality and other tests of goodness of fit based on distance methods. Ann. Math. Statist. 26, Pickands, J., Statistical inference using extreme order statistics. Ann. Statist. 3, Pollard, D., 198. The minimum distance method of testing. Metrika 27, Rao, C.R., Linear Statistical Inference and its Applications. second ed. Wiley, New York. Smith, R.L., 23. Statistics of Extremes, with Applications in Environment, Insurance and Finance. Department of Statistics, University of North Carolina, Chapel Hill, NC. Walshaw, D., 199. Discussion of Models for exceedances over high thresholds by A. C. Davison and R. L. Smith. J. Roy. Statist. Soc. B 52, Wolfowitz, J., Estimation by the minimum distance method. Ann. Inst. Statist. Math. 5, Wolfowitz, J., The minimum distance method. Ann. Math. Statist. 28,

A TEST OF FIT FOR THE GENERALIZED PARETO DISTRIBUTION BASED ON TRANSFORMS

A TEST OF FIT FOR THE GENERALIZED PARETO DISTRIBUTION BASED ON TRANSFORMS A TEST OF FIT FOR THE GENERALIZED PARETO DISTRIBUTION BASED ON TRANSFORMS Dimitrios Konstantinides, Simos G. Meintanis Department of Statistics and Acturial Science, University of the Aegean, Karlovassi,

More information

The Goodness-of-fit Test for Gumbel Distribution: A Comparative Study

The Goodness-of-fit Test for Gumbel Distribution: A Comparative Study MATEMATIKA, 2012, Volume 28, Number 1, 35 48 c Department of Mathematics, UTM. The Goodness-of-fit Test for Gumbel Distribution: A Comparative Study 1 Nahdiya Zainal Abidin, 2 Mohd Bakri Adam and 3 Habshah

More information

Zwiers FW and Kharin VV Changes in the extremes of the climate simulated by CCC GCM2 under CO 2 doubling. J. Climate 11:

Zwiers FW and Kharin VV Changes in the extremes of the climate simulated by CCC GCM2 under CO 2 doubling. J. Climate 11: Statistical Analysis of EXTREMES in GEOPHYSICS Zwiers FW and Kharin VV. 1998. Changes in the extremes of the climate simulated by CCC GCM2 under CO 2 doubling. J. Climate 11:2200 2222. http://www.ral.ucar.edu/staff/ericg/readinggroup.html

More information

Modified Kolmogorov-Smirnov Test of Goodness of Fit. Catalonia-BarcelonaTECH, Spain

Modified Kolmogorov-Smirnov Test of Goodness of Fit. Catalonia-BarcelonaTECH, Spain 152/304 CoDaWork 2017 Abbadia San Salvatore (IT) Modified Kolmogorov-Smirnov Test of Goodness of Fit G.S. Monti 1, G. Mateu-Figueras 2, M. I. Ortego 3, V. Pawlowsky-Glahn 2 and J. J. Egozcue 3 1 Department

More information

PROPERTIES OF THE GENERALIZED NONLINEAR LEAST SQUARES METHOD APPLIED FOR FITTING DISTRIBUTION TO DATA

PROPERTIES OF THE GENERALIZED NONLINEAR LEAST SQUARES METHOD APPLIED FOR FITTING DISTRIBUTION TO DATA Discussiones Mathematicae Probability and Statistics 35 (2015) 75 94 doi:10.7151/dmps.1172 PROPERTIES OF THE GENERALIZED NONLINEAR LEAST SQUARES METHOD APPLIED FOR FITTING DISTRIBUTION TO DATA Mirta Benšić

More information

Overview of Extreme Value Theory. Dr. Sawsan Hilal space

Overview of Extreme Value Theory. Dr. Sawsan Hilal space Overview of Extreme Value Theory Dr. Sawsan Hilal space Maths Department - University of Bahrain space November 2010 Outline Part-1: Univariate Extremes Motivation Threshold Exceedances Part-2: Bivariate

More information

Physics and Chemistry of the Earth

Physics and Chemistry of the Earth Physics and Chemistry of the Earth 34 (2009) 626 634 Contents lists available at ScienceDirect Physics and Chemistry of the Earth journal homepage: www.elsevier.com/locate/pce Performances of some parameter

More information

Abstract: In this short note, I comment on the research of Pisarenko et al. (2014) regarding the

Abstract: In this short note, I comment on the research of Pisarenko et al. (2014) regarding the Comment on Pisarenko et al. Characterization of the Tail of the Distribution of Earthquake Magnitudes by Combining the GEV and GPD Descriptions of Extreme Value Theory Mathias Raschke Institution: freelancer

More information

Recall the Basics of Hypothesis Testing

Recall the Basics of Hypothesis Testing Recall the Basics of Hypothesis Testing The level of significance α, (size of test) is defined as the probability of X falling in w (rejecting H 0 ) when H 0 is true: P(X w H 0 ) = α. H 0 TRUE H 1 TRUE

More information

Regional Estimation from Spatially Dependent Data

Regional Estimation from Spatially Dependent Data Regional Estimation from Spatially Dependent Data R.L. Smith Department of Statistics University of North Carolina Chapel Hill, NC 27599-3260, USA December 4 1990 Summary Regional estimation methods are

More information

Variable inspection plans for continuous populations with unknown short tail distributions

Variable inspection plans for continuous populations with unknown short tail distributions Variable inspection plans for continuous populations with unknown short tail distributions Wolfgang Kössler Abstract The ordinary variable inspection plans are sensitive to deviations from the normality

More information

ON THE TWO STEP THRESHOLD SELECTION FOR OVER-THRESHOLD MODELLING

ON THE TWO STEP THRESHOLD SELECTION FOR OVER-THRESHOLD MODELLING ON THE TWO STEP THRESHOLD SELECTION FOR OVER-THRESHOLD MODELLING Pietro Bernardara (1,2), Franck Mazas (3), Jérôme Weiss (1,2), Marc Andreewsky (1), Xavier Kergadallan (4), Michel Benoît (1,2), Luc Hamm

More information

Financial Econometrics and Volatility Models Extreme Value Theory

Financial Econometrics and Volatility Models Extreme Value Theory Financial Econometrics and Volatility Models Extreme Value Theory Eric Zivot May 3, 2010 1 Lecture Outline Modeling Maxima and Worst Cases The Generalized Extreme Value Distribution Modeling Extremes Over

More information

arxiv: v1 [stat.me] 25 May 2018

arxiv: v1 [stat.me] 25 May 2018 Body and Tail arxiv:1805.10040v1 [stat.me] 5 May 018 Separating the distribution function by an efficient tail-detecting procedure in risk management Abstract Ingo Hoffmann a,, Christoph J. Börner a a

More information

UNIVERSITY OF CALGARY. A New Hybrid Estimation Method for the. Generalized Exponential Distribution. Shan Zhu A THESIS

UNIVERSITY OF CALGARY. A New Hybrid Estimation Method for the. Generalized Exponential Distribution. Shan Zhu A THESIS UNIVERSITY OF CALGARY A New Hybrid Estimation Method for the Generalized Exponential Distribution by Shan Zhu A THESIS SUBMITTED TO THE FACULTY OF GRADUATE STUDIES IN PARTIAL FULFILLMENT OF THE REQUIREMENTS

More information

Journal of Environmental Statistics

Journal of Environmental Statistics jes Journal of Environmental Statistics February 2010, Volume 1, Issue 3. http://www.jenvstat.org Exponentiated Gumbel Distribution for Estimation of Return Levels of Significant Wave Height Klara Persson

More information

Extreme Precipitation: An Application Modeling N-Year Return Levels at the Station Level

Extreme Precipitation: An Application Modeling N-Year Return Levels at the Station Level Extreme Precipitation: An Application Modeling N-Year Return Levels at the Station Level Presented by: Elizabeth Shamseldin Joint work with: Richard Smith, Doug Nychka, Steve Sain, Dan Cooley Statistics

More information

Lecture 2: CDF and EDF

Lecture 2: CDF and EDF STAT 425: Introduction to Nonparametric Statistics Winter 2018 Instructor: Yen-Chi Chen Lecture 2: CDF and EDF 2.1 CDF: Cumulative Distribution Function For a random variable X, its CDF F () contains all

More information

A comparison of inverse transform and composition methods of data simulation from the Lindley distribution

A comparison of inverse transform and composition methods of data simulation from the Lindley distribution Communications for Statistical Applications and Methods 2016, Vol. 23, No. 6, 517 529 http://dx.doi.org/10.5351/csam.2016.23.6.517 Print ISSN 2287-7843 / Online ISSN 2383-4757 A comparison of inverse transform

More information

Asymptotic inference for a nonstationary double ar(1) model

Asymptotic inference for a nonstationary double ar(1) model Asymptotic inference for a nonstationary double ar() model By SHIQING LING and DONG LI Department of Mathematics, Hong Kong University of Science and Technology, Hong Kong maling@ust.hk malidong@ust.hk

More information

Richard L. Smith Department of Statistics and Operations Research University of North Carolina Chapel Hill, NC

Richard L. Smith Department of Statistics and Operations Research University of North Carolina Chapel Hill, NC EXTREME VALUE THEORY Richard L. Smith Department of Statistics and Operations Research University of North Carolina Chapel Hill, NC 27599-3260 rls@email.unc.edu AMS Committee on Probability and Statistics

More information

Package homtest. February 20, 2015

Package homtest. February 20, 2015 Version 1.0-5 Date 2009-03-26 Package homtest February 20, 2015 Title Homogeneity tests for Regional Frequency Analysis Author Alberto Viglione Maintainer Alberto Viglione

More information

Testing Goodness-of-Fit for Exponential Distribution Based on Cumulative Residual Entropy

Testing Goodness-of-Fit for Exponential Distribution Based on Cumulative Residual Entropy This article was downloaded by: [Ferdowsi University] On: 16 April 212, At: 4:53 Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 172954 Registered office: Mortimer

More information

HIERARCHICAL MODELS IN EXTREME VALUE THEORY

HIERARCHICAL MODELS IN EXTREME VALUE THEORY HIERARCHICAL MODELS IN EXTREME VALUE THEORY Richard L. Smith Department of Statistics and Operations Research, University of North Carolina, Chapel Hill and Statistical and Applied Mathematical Sciences

More information

American Society for Quality

American Society for Quality American Society for Quality Parameter and Quantile Estimation for the Generalized Pareto Distribution Author(s): J. R. M. Hosking and J. R. Wallis Reviewed work(s): Source: Technometrics, Vol. 29, No.

More information

MFM Practitioner Module: Quantitiative Risk Management. John Dodson. October 14, 2015

MFM Practitioner Module: Quantitiative Risk Management. John Dodson. October 14, 2015 MFM Practitioner Module: Quantitiative Risk Management October 14, 2015 The n-block maxima 1 is a random variable defined as M n max (X 1,..., X n ) for i.i.d. random variables X i with distribution function

More information

Statistics. Lecture 2 August 7, 2000 Frank Porter Caltech. The Fundamentals; Point Estimation. Maximum Likelihood, Least Squares and All That

Statistics. Lecture 2 August 7, 2000 Frank Porter Caltech. The Fundamentals; Point Estimation. Maximum Likelihood, Least Squares and All That Statistics Lecture 2 August 7, 2000 Frank Porter Caltech The plan for these lectures: The Fundamentals; Point Estimation Maximum Likelihood, Least Squares and All That What is a Confidence Interval? Interval

More information

HANDBOOK OF APPLICABLE MATHEMATICS

HANDBOOK OF APPLICABLE MATHEMATICS HANDBOOK OF APPLICABLE MATHEMATICS Chief Editor: Walter Ledermann Volume VI: Statistics PART A Edited by Emlyn Lloyd University of Lancaster A Wiley-Interscience Publication JOHN WILEY & SONS Chichester

More information

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables THE UNIVERSITY OF MANCHESTER. 21 June :45 11:45

Two hours. To be supplied by the Examinations Office: Mathematical Formula Tables THE UNIVERSITY OF MANCHESTER. 21 June :45 11:45 Two hours MATH20802 To be supplied by the Examinations Office: Mathematical Formula Tables THE UNIVERSITY OF MANCHESTER STATISTICAL METHODS 21 June 2010 9:45 11:45 Answer any FOUR of the questions. University-approved

More information

Extreme Value Analysis and Spatial Extremes

Extreme Value Analysis and Spatial Extremes Extreme Value Analysis and Department of Statistics Purdue University 11/07/2013 Outline Motivation 1 Motivation 2 Extreme Value Theorem and 3 Bayesian Hierarchical Models Copula Models Max-stable Models

More information

LQ-Moments for Statistical Analysis of Extreme Events

LQ-Moments for Statistical Analysis of Extreme Events Journal of Modern Applied Statistical Methods Volume 6 Issue Article 5--007 LQ-Moments for Statistical Analysis of Extreme Events Ani Shabri Universiti Teknologi Malaysia Abdul Aziz Jemain Universiti Kebangsaan

More information

Max. Likelihood Estimation. Outline. Econometrics II. Ricardo Mora. Notes. Notes

Max. Likelihood Estimation. Outline. Econometrics II. Ricardo Mora. Notes. Notes Maximum Likelihood Estimation Econometrics II Department of Economics Universidad Carlos III de Madrid Máster Universitario en Desarrollo y Crecimiento Económico Outline 1 3 4 General Approaches to Parameter

More information

A General Overview of Parametric Estimation and Inference Techniques.

A General Overview of Parametric Estimation and Inference Techniques. A General Overview of Parametric Estimation and Inference Techniques. Moulinath Banerjee University of Michigan September 11, 2012 The object of statistical inference is to glean information about an underlying

More information

Recall that in order to prove Theorem 8.8, we argued that under certain regularity conditions, the following facts are true under H 0 : 1 n

Recall that in order to prove Theorem 8.8, we argued that under certain regularity conditions, the following facts are true under H 0 : 1 n Chapter 9 Hypothesis Testing 9.1 Wald, Rao, and Likelihood Ratio Tests Suppose we wish to test H 0 : θ = θ 0 against H 1 : θ θ 0. The likelihood-based results of Chapter 8 give rise to several possible

More information

Mathematics Ph.D. Qualifying Examination Stat Probability, January 2018

Mathematics Ph.D. Qualifying Examination Stat Probability, January 2018 Mathematics Ph.D. Qualifying Examination Stat 52800 Probability, January 2018 NOTE: Answers all questions completely. Justify every step. Time allowed: 3 hours. 1. Let X 1,..., X n be a random sample from

More information

Estimation of Quantiles

Estimation of Quantiles 9 Estimation of Quantiles The notion of quantiles was introduced in Section 3.2: recall that a quantile x α for an r.v. X is a constant such that P(X x α )=1 α. (9.1) In this chapter we examine quantiles

More information

Keywords: Estimator, Bias, Mean-squared error, normality, generalized Pareto distribution

Keywords: Estimator, Bias, Mean-squared error, normality, generalized Pareto distribution Testing approxiate norality of an estiator using the estiated MSE and bias with an application to the shape paraeter of the generalized Pareto distribution J. Martin van Zyl Abstract In this work the norality

More information

Robust and Efficient Estimation for the Generalized Pareto Distribution

Robust and Efficient Estimation for the Generalized Pareto Distribution Robust and Efficient Estimation for the Generalized Pareto Distribution Sergio F. Juárez Faculty of Statistics and Informatics Veracruzana University, Xalapa, Ver, México email: sejuarez@uv.mx and William

More information

ASTIN Colloquium 1-4 October 2012, Mexico City

ASTIN Colloquium 1-4 October 2012, Mexico City ASTIN Colloquium 1-4 October 2012, Mexico City Modelling and calibration for non-life underwriting risk: from empirical data to risk capital evaluation Gian Paolo Clemente Dept. Mathematics, Finance Mathematics

More information

Spring 2012 Math 541A Exam 1. X i, S 2 = 1 n. n 1. X i I(X i < c), T n =

Spring 2012 Math 541A Exam 1. X i, S 2 = 1 n. n 1. X i I(X i < c), T n = Spring 2012 Math 541A Exam 1 1. (a) Let Z i be independent N(0, 1), i = 1, 2,, n. Are Z = 1 n n Z i and S 2 Z = 1 n 1 n (Z i Z) 2 independent? Prove your claim. (b) Let X 1, X 2,, X n be independent identically

More information

Parameter Estimation

Parameter Estimation Parameter Estimation Consider a sample of observations on a random variable Y. his generates random variables: (y 1, y 2,, y ). A random sample is a sample (y 1, y 2,, y ) where the random variables y

More information

Spatial and temporal extremes of wildfire sizes in Portugal ( )

Spatial and temporal extremes of wildfire sizes in Portugal ( ) International Journal of Wildland Fire 2009, 18, 983 991. doi:10.1071/wf07044_ac Accessory publication Spatial and temporal extremes of wildfire sizes in Portugal (1984 2004) P. de Zea Bermudez A, J. Mendes

More information

Bayesian Inference for Clustered Extremes

Bayesian Inference for Clustered Extremes Newcastle University, Newcastle-upon-Tyne, U.K. lee.fawcett@ncl.ac.uk 20th TIES Conference: Bologna, Italy, July 2009 Structure of this talk 1. Motivation and background 2. Review of existing methods Limitations/difficulties

More information

L-momenty s rušivou regresí

L-momenty s rušivou regresí L-momenty s rušivou regresí Jan Picek, Martin Schindler e-mail: jan.picek@tul.cz TECHNICKÁ UNIVERZITA V LIBERCI ROBUST 2016 J. Picek, M. Schindler, TUL L-momenty s rušivou regresí 1/26 Motivation 1 Development

More information

Module 6: Model Diagnostics

Module 6: Model Diagnostics St@tmaster 02429/MIXED LINEAR MODELS PREPARED BY THE STATISTICS GROUPS AT IMM, DTU AND KU-LIFE Module 6: Model Diagnostics 6.1 Introduction............................... 1 6.2 Linear model diagnostics........................

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

Theory of Maximum Likelihood Estimation. Konstantin Kashin

Theory of Maximum Likelihood Estimation. Konstantin Kashin Gov 2001 Section 5: Theory of Maximum Likelihood Estimation Konstantin Kashin February 28, 2013 Outline Introduction Likelihood Examples of MLE Variance of MLE Asymptotic Properties What is Statistical

More information

Goodness-of-fit tests for randomly censored Weibull distributions with estimated parameters

Goodness-of-fit tests for randomly censored Weibull distributions with estimated parameters Communications for Statistical Applications and Methods 2017, Vol. 24, No. 5, 519 531 https://doi.org/10.5351/csam.2017.24.5.519 Print ISSN 2287-7843 / Online ISSN 2383-4757 Goodness-of-fit tests for randomly

More information

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata

Testing Hypothesis. Maura Mezzetti. Department of Economics and Finance Università Tor Vergata Maura Department of Economics and Finance Università Tor Vergata Hypothesis Testing Outline It is a mistake to confound strangeness with mystery Sherlock Holmes A Study in Scarlet Outline 1 The Power Function

More information

Introduction to Algorithmic Trading Strategies Lecture 10

Introduction to Algorithmic Trading Strategies Lecture 10 Introduction to Algorithmic Trading Strategies Lecture 10 Risk Management Haksun Li haksun.li@numericalmethod.com www.numericalmethod.com Outline Value at Risk (VaR) Extreme Value Theory (EVT) References

More information

Bayesian Modelling of Extreme Rainfall Data

Bayesian Modelling of Extreme Rainfall Data Bayesian Modelling of Extreme Rainfall Data Elizabeth Smith A thesis submitted for the degree of Doctor of Philosophy at the University of Newcastle upon Tyne September 2005 UNIVERSITY OF NEWCASTLE Bayesian

More information

A Very Brief Summary of Statistical Inference, and Examples

A Very Brief Summary of Statistical Inference, and Examples A Very Brief Summary of Statistical Inference, and Examples Trinity Term 2009 Prof. Gesine Reinert Our standard situation is that we have data x = x 1, x 2,..., x n, which we view as realisations of random

More information

Statistics - Lecture One. Outline. Charlotte Wickham 1. Basic ideas about estimation

Statistics - Lecture One. Outline. Charlotte Wickham  1. Basic ideas about estimation Statistics - Lecture One Charlotte Wickham wickham@stat.berkeley.edu http://www.stat.berkeley.edu/~wickham/ Outline 1. Basic ideas about estimation 2. Method of Moments 3. Maximum Likelihood 4. Confidence

More information

Exact Linear Likelihood Inference for Laplace

Exact Linear Likelihood Inference for Laplace Exact Linear Likelihood Inference for Laplace Prof. N. Balakrishnan McMaster University, Hamilton, Canada bala@mcmaster.ca p. 1/52 Pierre-Simon Laplace 1749 1827 p. 2/52 Laplace s Biography Born: On March

More information

Practice Exam 1. (A) (B) (C) (D) (E) You are given the following data on loss sizes:

Practice Exam 1. (A) (B) (C) (D) (E) You are given the following data on loss sizes: Practice Exam 1 1. Losses for an insurance coverage have the following cumulative distribution function: F(0) = 0 F(1,000) = 0.2 F(5,000) = 0.4 F(10,000) = 0.9 F(100,000) = 1 with linear interpolation

More information

Sharp statistical tools Statistics for extremes

Sharp statistical tools Statistics for extremes Sharp statistical tools Statistics for extremes Georg Lindgren Lund University October 18, 2012 SARMA Background Motivation We want to predict outside the range of observations Sums, averages and proportions

More information

Different methods of estimation for generalized inverse Lindley distribution

Different methods of estimation for generalized inverse Lindley distribution Different methods of estimation for generalized inverse Lindley distribution Arbër Qoshja & Fatmir Hoxha Department of Applied Mathematics, Faculty of Natural Science, University of Tirana, Albania,, e-mail:

More information

Weighted empirical likelihood estimates and their robustness properties

Weighted empirical likelihood estimates and their robustness properties Computational Statistics & Data Analysis ( ) www.elsevier.com/locate/csda Weighted empirical likelihood estimates and their robustness properties N.L. Glenn a,, Yichuan Zhao b a Department of Statistics,

More information

Mathematics Qualifying Examination January 2015 STAT Mathematical Statistics

Mathematics Qualifying Examination January 2015 STAT Mathematical Statistics Mathematics Qualifying Examination January 2015 STAT 52800 - Mathematical Statistics NOTE: Answer all questions completely and justify your derivations and steps. A calculator and statistical tables (normal,

More information

A scale-free goodness-of-t statistic for the exponential distribution based on maximum correlations

A scale-free goodness-of-t statistic for the exponential distribution based on maximum correlations Journal of Statistical Planning and Inference 18 (22) 85 97 www.elsevier.com/locate/jspi A scale-free goodness-of-t statistic for the exponential distribution based on maximum correlations J. Fortiana,

More information

Robust and Efficient Fitting of the Generalized Pareto Distribution with Actuarial Applications in View

Robust and Efficient Fitting of the Generalized Pareto Distribution with Actuarial Applications in View Robust and Efficient Fitting of the Generalized Pareto Distribution with Actuarial Applications in View Vytaras Brazauskas 1 University of Wisconsin-Milwaukee Andreas Kleefeld 2 University of Wisconsin-Milwaukee

More information

H 2 : otherwise. that is simply the proportion of the sample points below level x. For any fixed point x the law of large numbers gives that

H 2 : otherwise. that is simply the proportion of the sample points below level x. For any fixed point x the law of large numbers gives that Lecture 28 28.1 Kolmogorov-Smirnov test. Suppose that we have an i.i.d. sample X 1,..., X n with some unknown distribution and we would like to test the hypothesis that is equal to a particular distribution

More information

On the Comparison of Fisher Information of the Weibull and GE Distributions

On the Comparison of Fisher Information of the Weibull and GE Distributions On the Comparison of Fisher Information of the Weibull and GE Distributions Rameshwar D. Gupta Debasis Kundu Abstract In this paper we consider the Fisher information matrices of the generalized exponential

More information

Department of Econometrics and Business Statistics

Department of Econometrics and Business Statistics Australia Department of Econometrics and Business Statistics http://www.buseco.monash.edu.au/depts/ebs/pubs/wpapers/ Minimum Variance Unbiased Maximum Lielihood Estimation of the Extreme Value Index Roger

More information

Statistic Distribution Models for Some Nonparametric Goodness-of-Fit Tests in Testing Composite Hypotheses

Statistic Distribution Models for Some Nonparametric Goodness-of-Fit Tests in Testing Composite Hypotheses Communications in Statistics - Theory and Methods ISSN: 36-926 (Print) 532-45X (Online) Journal homepage: http://www.tandfonline.com/loi/lsta2 Statistic Distribution Models for Some Nonparametric Goodness-of-Fit

More information

On the Application of the Generalized Pareto Distribution for Statistical Extrapolation in the Assessment of Dynamic Stability in Irregular Waves

On the Application of the Generalized Pareto Distribution for Statistical Extrapolation in the Assessment of Dynamic Stability in Irregular Waves On the Application of the Generalized Pareto Distribution for Statistical Extrapolation in the Assessment of Dynamic Stability in Irregular Waves Bradley Campbell 1, Vadim Belenky 1, Vladas Pipiras 2 1.

More information

Notes, March 4, 2013, R. Dudley Maximum likelihood estimation: actual or supposed

Notes, March 4, 2013, R. Dudley Maximum likelihood estimation: actual or supposed 18.466 Notes, March 4, 2013, R. Dudley Maximum likelihood estimation: actual or supposed 1. MLEs in exponential families Let f(x,θ) for x X and θ Θ be a likelihood function, that is, for present purposes,

More information

A Very Brief Summary of Statistical Inference, and Examples

A Very Brief Summary of Statistical Inference, and Examples A Very Brief Summary of Statistical Inference, and Examples Trinity Term 2008 Prof. Gesine Reinert 1 Data x = x 1, x 2,..., x n, realisations of random variables X 1, X 2,..., X n with distribution (model)

More information

Generalized fiducial confidence intervals for extremes

Generalized fiducial confidence intervals for extremes Extremes (2012) 15:67 87 DOI 10.1007/s10687-011-0127-9 Generalized fiducial confidence intervals for extremes Damian V. Wandler Jan Hannig Received: 1 December 2009 / Revised: 13 December 2010 / Accepted:

More information

A goodness of fit test for the Pareto distribution

A goodness of fit test for the Pareto distribution Chilean Journal of Statistics Vol. 9, No. 1, April 2018, 33-46 A goodness of fit test for the Pareto distribution Javier Suárez-Espinosa, José A. Villaseñor-Alva, Annel Hurtado-Jaramillo and Paulino Pérez-Rodríguez

More information

WEIGHTED LIKELIHOOD NEGATIVE BINOMIAL REGRESSION

WEIGHTED LIKELIHOOD NEGATIVE BINOMIAL REGRESSION WEIGHTED LIKELIHOOD NEGATIVE BINOMIAL REGRESSION Michael Amiguet 1, Alfio Marazzi 1, Victor Yohai 2 1 - University of Lausanne, Institute for Social and Preventive Medicine, Lausanne, Switzerland 2 - University

More information

Statistics GIDP Ph.D. Qualifying Exam Theory Jan 11, 2016, 9:00am-1:00pm

Statistics GIDP Ph.D. Qualifying Exam Theory Jan 11, 2016, 9:00am-1:00pm Statistics GIDP Ph.D. Qualifying Exam Theory Jan, 06, 9:00am-:00pm Instructions: Provide answers on the supplied pads of paper; write on only one side of each sheet. Complete exactly 5 of the 6 problems.

More information

Practice Problems Section Problems

Practice Problems Section Problems Practice Problems Section 4-4-3 4-4 4-5 4-6 4-7 4-8 4-10 Supplemental Problems 4-1 to 4-9 4-13, 14, 15, 17, 19, 0 4-3, 34, 36, 38 4-47, 49, 5, 54, 55 4-59, 60, 63 4-66, 68, 69, 70, 74 4-79, 81, 84 4-85,

More information

ON THE FAILURE RATE ESTIMATION OF THE INVERSE GAUSSIAN DISTRIBUTION

ON THE FAILURE RATE ESTIMATION OF THE INVERSE GAUSSIAN DISTRIBUTION ON THE FAILURE RATE ESTIMATION OF THE INVERSE GAUSSIAN DISTRIBUTION ZHENLINYANGandRONNIET.C.LEE Department of Statistics and Applied Probability, National University of Singapore, 3 Science Drive 2, Singapore

More information

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at American Society for Quality Parameter and Quantile Estimation for the Generalized Pareto Distribution Author(s): J. R. M. Hosking and J. R. Wallis Source: Technometrics, Vol. 29, No. 3 (Aug., 1987), pp.

More information

Skew Generalized Extreme Value Distribution: Probability Weighted Moments Estimation and Application to Block Maxima Procedure

Skew Generalized Extreme Value Distribution: Probability Weighted Moments Estimation and Application to Block Maxima Procedure Skew Generalized Extreme Value Distribution: Probability Weighted Moments Estimation and Application to Block Maxima Procedure Pierre Ribereau ( ), Esterina Masiello ( ), Philippe Naveau ( ) ( ) Université

More information

Extreme Value Theory as a Theoretical Background for Power Law Behavior

Extreme Value Theory as a Theoretical Background for Power Law Behavior Extreme Value Theory as a Theoretical Background for Power Law Behavior Simone Alfarano 1 and Thomas Lux 2 1 Department of Economics, University of Kiel, alfarano@bwl.uni-kiel.de 2 Department of Economics,

More information

Financial Econometrics and Quantitative Risk Managenent Return Properties

Financial Econometrics and Quantitative Risk Managenent Return Properties Financial Econometrics and Quantitative Risk Managenent Return Properties Eric Zivot Updated: April 1, 2013 Lecture Outline Course introduction Return definitions Empirical properties of returns Reading

More information

Overview of Extreme Value Analysis (EVA)

Overview of Extreme Value Analysis (EVA) Overview of Extreme Value Analysis (EVA) Brian Reich North Carolina State University July 26, 2016 Rossbypalooza Chicago, IL Brian Reich Overview of Extreme Value Analysis (EVA) 1 / 24 Importance of extremes

More information

simple if it completely specifies the density of x

simple if it completely specifies the density of x 3. Hypothesis Testing Pure significance tests Data x = (x 1,..., x n ) from f(x, θ) Hypothesis H 0 : restricts f(x, θ) Are the data consistent with H 0? H 0 is called the null hypothesis simple if it completely

More information

Journal of Biostatistics and Epidemiology

Journal of Biostatistics and Epidemiology Journal of Biostatistics and Epidemiology Original Article Robust correlation coefficient goodness-of-fit test for the Gumbel distribution Abbas Mahdavi 1* 1 Department of Statistics, School of Mathematical

More information

ECE531 Lecture 10b: Maximum Likelihood Estimation

ECE531 Lecture 10b: Maximum Likelihood Estimation ECE531 Lecture 10b: Maximum Likelihood Estimation D. Richard Brown III Worcester Polytechnic Institute 05-Apr-2011 Worcester Polytechnic Institute D. Richard Brown III 05-Apr-2011 1 / 23 Introduction So

More information

Application of Homogeneity Tests: Problems and Solution

Application of Homogeneity Tests: Problems and Solution Application of Homogeneity Tests: Problems and Solution Boris Yu. Lemeshko (B), Irina V. Veretelnikova, Stanislav B. Lemeshko, and Alena Yu. Novikova Novosibirsk State Technical University, Novosibirsk,

More information

Central Limit Theorem ( 5.3)

Central Limit Theorem ( 5.3) Central Limit Theorem ( 5.3) Let X 1, X 2,... be a sequence of independent random variables, each having n mean µ and variance σ 2. Then the distribution of the partial sum S n = X i i=1 becomes approximately

More information

COMPARISON OF THE ESTIMATORS OF THE LOCATION AND SCALE PARAMETERS UNDER THE MIXTURE AND OUTLIER MODELS VIA SIMULATION

COMPARISON OF THE ESTIMATORS OF THE LOCATION AND SCALE PARAMETERS UNDER THE MIXTURE AND OUTLIER MODELS VIA SIMULATION (REFEREED RESEARCH) COMPARISON OF THE ESTIMATORS OF THE LOCATION AND SCALE PARAMETERS UNDER THE MIXTURE AND OUTLIER MODELS VIA SIMULATION Hakan S. Sazak 1, *, Hülya Yılmaz 2 1 Ege University, Department

More information

ENTROPY-BASED GOODNESS OF FIT TEST FOR A COMPOSITE HYPOTHESIS

ENTROPY-BASED GOODNESS OF FIT TEST FOR A COMPOSITE HYPOTHESIS Bull. Korean Math. Soc. 53 (2016), No. 2, pp. 351 363 http://dx.doi.org/10.4134/bkms.2016.53.2.351 ENTROPY-BASED GOODNESS OF FIT TEST FOR A COMPOSITE HYPOTHESIS Sangyeol Lee Abstract. In this paper, we

More information

CHARACTERIZATION OF THE TAIL OF RIVER FLOW DATA BY GENERALIZED PARETO DISTRIBUTION

CHARACTERIZATION OF THE TAIL OF RIVER FLOW DATA BY GENERALIZED PARETO DISTRIBUTION Journal of Statistical Research 2016, Vol. 48-50, No. 2, pp. 55-70 ISSN 0256-422 X CHARACTERIZATION OF THE TAIL OF RIVER FLOW DATA BY GENERALIZED PARETO DISTRIBUTION KUMER PIAL DAS Department of Mathematics,

More information

A class of probability distributions for application to non-negative annual maxima

A class of probability distributions for application to non-negative annual maxima Hydrol. Earth Syst. Sci. Discuss., doi:.94/hess-7-98, 7 A class of probability distributions for application to non-negative annual maxima Earl Bardsley School of Science, University of Waikato, Hamilton

More information

Exact goodness-of-fit tests for censored data

Exact goodness-of-fit tests for censored data Ann Inst Stat Math ) 64:87 3 DOI.7/s463--356-y Exact goodness-of-fit tests for censored data Aurea Grané Received: February / Revised: 5 November / Published online: 7 April The Institute of Statistical

More information

Chapter 31 Application of Nonparametric Goodness-of-Fit Tests for Composite Hypotheses in Case of Unknown Distributions of Statistics

Chapter 31 Application of Nonparametric Goodness-of-Fit Tests for Composite Hypotheses in Case of Unknown Distributions of Statistics Chapter Application of Nonparametric Goodness-of-Fit Tests for Composite Hypotheses in Case of Unknown Distributions of Statistics Boris Yu. Lemeshko, Alisa A. Gorbunova, Stanislav B. Lemeshko, and Andrey

More information

Theory of Statistical Tests

Theory of Statistical Tests Ch 9. Theory of Statistical Tests 9.1 Certain Best Tests How to construct good testing. For simple hypothesis H 0 : θ = θ, H 1 : θ = θ, Page 1 of 100 where Θ = {θ, θ } 1. Define the best test for H 0 H

More information

A MODIFICATION OF HILL S TAIL INDEX ESTIMATOR

A MODIFICATION OF HILL S TAIL INDEX ESTIMATOR L. GLAVAŠ 1 J. JOCKOVIĆ 2 A MODIFICATION OF HILL S TAIL INDEX ESTIMATOR P. MLADENOVIĆ 3 1, 2, 3 University of Belgrade, Faculty of Mathematics, Belgrade, Serbia Abstract: In this paper, we study a class

More information

First Year Examination Department of Statistics, University of Florida

First Year Examination Department of Statistics, University of Florida First Year Examination Department of Statistics, University of Florida August 19, 010, 8:00 am - 1:00 noon Instructions: 1. You have four hours to answer questions in this examination.. You must show your

More information

STATISTICS/ECONOMETRICS PREP COURSE PROF. MASSIMO GUIDOLIN

STATISTICS/ECONOMETRICS PREP COURSE PROF. MASSIMO GUIDOLIN Massimo Guidolin Massimo.Guidolin@unibocconi.it Dept. of Finance STATISTICS/ECONOMETRICS PREP COURSE PROF. MASSIMO GUIDOLIN SECOND PART, LECTURE 2: MODES OF CONVERGENCE AND POINT ESTIMATION Lecture 2:

More information

Terminology Suppose we have N observations {x(n)} N 1. Estimators as Random Variables. {x(n)} N 1

Terminology Suppose we have N observations {x(n)} N 1. Estimators as Random Variables. {x(n)} N 1 Estimation Theory Overview Properties Bias, Variance, and Mean Square Error Cramér-Rao lower bound Maximum likelihood Consistency Confidence intervals Properties of the mean estimator Properties of the

More information

Estimation, Inference, and Hypothesis Testing

Estimation, Inference, and Hypothesis Testing Chapter 2 Estimation, Inference, and Hypothesis Testing Note: The primary reference for these notes is Ch. 7 and 8 of Casella & Berger 2. This text may be challenging if new to this topic and Ch. 7 of

More information

Lecture 7 Introduction to Statistical Decision Theory

Lecture 7 Introduction to Statistical Decision Theory Lecture 7 Introduction to Statistical Decision Theory I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw December 20, 2016 1 / 55 I-Hsiang Wang IT Lecture 7

More information

Spline Density Estimation and Inference with Model-Based Penalities

Spline Density Estimation and Inference with Model-Based Penalities Spline Density Estimation and Inference with Model-Based Penalities December 7, 016 Abstract In this paper we propose model-based penalties for smoothing spline density estimation and inference. These

More information

SYSM 6303: Quantitative Introduction to Risk and Uncertainty in Business Lecture 4: Fitting Data to Distributions

SYSM 6303: Quantitative Introduction to Risk and Uncertainty in Business Lecture 4: Fitting Data to Distributions SYSM 6303: Quantitative Introduction to Risk and Uncertainty in Business Lecture 4: Fitting Data to Distributions M. Vidyasagar Cecil & Ida Green Chair The University of Texas at Dallas Email: M.Vidyasagar@utdallas.edu

More information

Inferring from data. Theory of estimators

Inferring from data. Theory of estimators Inferring from data Theory of estimators 1 Estimators Estimator is any function of the data e(x) used to provide an estimate ( a measurement ) of an unknown parameter. Because estimators are functions

More information