A central limit theorem for the Hellinger loss of Grenander-type estimators

Size: px
Start display at page:

Download "A central limit theorem for the Hellinger loss of Grenander-type estimators"

Transcription

1 Delft University of Technology A central limit theorem for the Hellinger loss of Grenander-type estimators Lopuhaa, Rik; Musta, Eni DOI./stan.53 Publication date 8 Document Version Other version Published in Statistica Neerlandica Citation APA) Lopuhaä, H. P., & Musta, E. 8). A central limit theorem for the Hellinger loss of Grenander-type estimators. Statistica Neerlandica, Important note To cite this publication, please use the final published version if applicable). Please check the document version above. Copyright Other than for strictly personal use, it is not permitted to download, forward or distribute the text or part of it, without the consent of the authors) and/or copyright holders), unless the work is under an open content license such as Creative Commons. Takedown policy Please contact us and provide details if you believe this document breaches copyrights. We will remove access to the work immediately and investigate your claim. This work is downloaded from Delft University of Technology. For technical reasons the number of authors shown on this cover page is limited to a maximum of.

2 Received: December 6 Revised: 6 November 7 Accepted: July 8 DOI:./stan.53 ORIGINAL ARTICLE A central limit theorem for the Hellinger loss of Grenander-type estimators Hendrik P. Lopuhaä Eni Musta Delft Institute of Applied Mathematics, Delft University of Technology, Delft, The Netherlands Correspondence Eni Musta, Delft Institute of Applied Mathematics, Delft University of Technology, 68 XE Delft, The Netherlands. E.Musta@tudelft.nl We consider Grenander-type estimators for a monotone function λ [, ] R +, obtained as the slope of a concave convex) estimate of the primitive of λ. Our main result is a central limit theorem for the Hellinger loss, which applies to estimation of a probability density, a regression function or a failure rate. In the case of density estimation, the limiting variance of the Hellinger loss turns out to be independent of λ. KEYWORDS central limit theorem, Grenander estimator, Hellinger distance, isotonic estimation INTRODUCTION One of the problems in shape-constrained nonparametric statistics is to estimate a real-valued function under monotonicity constraints. Early references for this type of problem can be found in Grenander 956), Brunk 958), and Marshall and Proschan 965), concerning the estimation of a probability density, a regression function, and a failure rate under monotonicity constraints. The asymptotic distribution of these types of estimators was first obtained by Prakasa Rao 969, 97) and reproved by Groeneboom 985), who introduced a more accessible approach based on inverses. The latter approach initiated a stream of research on isotonic estimators, for example, see Groeneboom and Wellner 99), Huang and Zhang 994), Huang and Wellner 995), and Lopuhaä and Nane 3). Typically, the pointwise asymptotic behavior of isotonic estimators is characterized by a cube-root n rate of convergence and a nonnormal limit distribution. The situation is different for global distances. In Groeneboom 985), a central limit theorem was obtained for the L -error of the Grenander estimator of a monotone density Groeneboom, Hooghiemstra, & Lopuhaä, 999), and a similar result was established in Durot ) for the regression context. Extensions to general L p -errors can be found in Kulikov and Lopuhaä 5) and Durot 7), where the latter provides a unified approach that applies to a variety of statistical... This is an open access article under the terms of the Creative Commons Attribution License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. 8 The Authors. Statistica Neerlandica published by John Wiley & Sons Ltd on behalf of VVS. Statistica Neerlandica. 8; 7. wileyonlinelibrary.com/journal/stan

3 LOPUHAÄ AND MUSTA models. For the same general setup, an extremal limit theorem for the supremum distance was obtained in Durot, Kulikov, and Lopuhaä ). Another widely used global measure of departure from the true parameter of interest is the Hellinger distance. It is a convenient metric in maximum likelihood problems, which goes back to the works of LeCam 97, 973), and it has nice connections with Bernstein norms and empirical process theory methods to obtain rates of convergence, due fundamentally to the works of Birgé and Massart 993), Wong and Shen 995), and others; see section 3.4 of van der Vaart and Wellner 996) or chapter 4 in van de Geer ) for a more detailed overview. Consistency in Hellinger distance of shape-constrained maximum likelihood estimators was investigated by Pal, Woodroofe, and Meyer 7), Seregin and Wellner ), and Doss and Wellner 6), whereas rates on Hellinger risk measures were obtained in Seregin and Wellner ), Kim and Samworth 6), and Kim, Guntuboyina, and Samworth 6). In contrast with L p -distances or the supremum distance, there is no distribution theory available for the Hellinger loss of shape-constrained nonparametric estimators. In this paper, we present a first result in this direction, that is, a central limit theorem for the Hellinger loss of Grenander-type estimators for a monotone function λ. This type of isotonic estimator was also considered by Durot 7) and is defined as the left-hand slope of a concave or convex) estimate of the primitive of λ,basedonn observations. We will establish our results under the same general setup of Durot 7), which includes estimation of a probability density, a regression function, or a failure rate under monotonicity constraints. In fact, after approximating the squared Hellinger distance by a weighted L -distance, a central limit theorem can be obtained by mimicking the approach introduced in the work of Durot 7). An interesting feature of our main result is that, in the monotone density model, the variance of the limiting normal distribution for the Hellinger distance does not depend on the underlying density. This phenomena was also encountered for the L -distance in Groeneboom 985) and Groeneboom et al. 999). In Section, we define the setup and approximate the squared Hellinger loss by a weighted L -distance. A central limit theorem for the Hellinger distance is established in Section 3. We end this paper by a short discussion on the consequences for particular statistical models and a simulation study on testing exponentiality against a nonincreasing density. DEFINITIONS AND PREPARATORY RESULTS Consider the problem of estimating a nonincreasing or nondecreasing) function λ [, ] R + on the basis of n observations. Suppose that we have at hand a cadlag step estimator Λ n for t Λt) = λu)du, t [, ]. If λ is nonincreasing, then the Grenander-type estimator λ n for λ is defined as the left-hand slope of the least concave majorant of Λ n,with λ n ) = lim t λ n t). Ifλ is nondecreasing, then the Grenander-type estimator λ n for λ is defined as the left-hand slope of the greatest convex minorant of Λ n,with λ n ) = lim t λ n t). We aim at proving the asymptotic normality of the Hellinger distance between λ n and λ defined by H λ n,λ)= λ n t) λt)) dt). )

4 LOPUHAÄ AND MUSTA 3 We will consider the same general setup as in the work of Durot 7), that is, we will assume the following conditions: A) λ is monotone and differentiable on [, ] with < inf t λ t) sup t λ t) <. A') Let M n =Λ n Λ.ThereexistC > such that, for all x > andt =,, [ ] E sup M n u) M n t)) Cx u [,],x t u x n. ) Durot 7) also considered an additional condition A) in order to obtain bounds on pth moments; see theorem and corollary in Durot 7). However, we only need condition A') for our purposes. A3) λ n ) and λ n ) are stochastically bounded. A4) Let B n be either a Brownian bridge or a Brownian motion. There exists q >, C q >, L [, ] R, and versions of M n =Λ n Λand B n,suchthat P n q sup M nt) n B n Lt) ) > x C q x q t [,] for x, n]. Moreover, L is increasing and twice differentiable on [, ] with sup t L t) < and inf t L t) >. In Durot 7), a variety of statistical models are discussed for which the above assumptions are satisfied, such as estimation of a monotone probability density, a monotone regression function, and a monotone failure rate under right censoring. In Section 4, we briefly discuss the consequence of our main result for these models. We restrict ourselves to the case of a nonincreasing function λ. The case of nondecreasing λ can be treated similarly. Note that, even if this may not be a natural assumption, for example, in the regression setting, we need to assume that λ is positive for the Hellinger distance to be well defined. The reason that one can expect a central limit theorem for the Hellinger distance is the fact that the squared Hellinger distance can be approximated by a weighted squared L -distance. This can be seen as follows: λ n t) λt)) dt = λ n t) λt) ) λ n t)+ λt)) dt λ n t) λt) ) 4λt)) dt. 3) Because L -distances for Grenander-type estimators obey a central limit theorem e.g., Durot, 7; Kulikov & Lopuhaä, 5), similar behavior might be expected for the squared Hellinger distance. An application of the delta method will then do the rest. The next lemma makes the approximation in 3) precise. Lemma. Assume A), A'), A3), and A4). Moreover, suppose that there are C > and s > 3 4 with λ t) λ x) C t x s, for all t, x [, ]. 4) If λ is strictly positive, we have that λ n t) λt)) dt = λ n t) λt) ) 4λt)) dt + o p n 5 6 ).

5 4 LOPUHAÄ AND MUSTA In order to prove Lemma, we need the preparatory lemma below. To this end, we introduce the inverse of λ n, defined by { Û n a) =argmax Λ + n u) au }, for all a R, 5) u [,] where { } Λ + n t) =max Λ n t), lim Λ n u) u t. Note that λ n t) a Û n a) t. 6) Furthermore, let g denotetheinverseofλ. Wethenhavethefollowingresult. Lemma. Under the conditions of Lemma, it holds λ n t) λt) 3 dt = op n 5 6 ). Proof. We follow the line of reasoning in the first step of the proof of theorem in Durot 7) with p = 3. For completeness, we briefly sketch the main steps. We will first show that λ n t) λt) 3 λ) dt = 3 λ) Ûnb) gb) λ gb)) db + o P n 5 6 ). To this end, consider I = λ n t) λt) ) 3 dt, I + = λt) λ n t) ) 3 dt, + where x + = max{x, }. We approximate I by J = { λ n t) λt)+a 3 } dadt. λ) λt)) 3 From the reasoning on page,9 of Durot 7), we deduce that n 3 log n I J λ n t) λt) ) 3 dt + λ + n ) λ) 3 {n 3 Û n λ))>log n}. Because the λ n ) is stochastically bounded and λ) is bounded, together with lemma 4 in Durot 7), the second term is of the order o p n 5/6 ). Furthermore, for the first term, we can choose p [, ) such that the first term on the right-hand side is bounded by n 3 log n λ n ) λ) 3 p λ n t) λt) p dt. As in Durot 7), we get [ ] n 3 log n E λ n t) λt) p dt Kn +p ) 3 log n = o n 5 6), by choosing p 3, ). It follows that I = J + o P n 5/6 ). By a change of variable b = λt) +a /3,wefind λ) Û n b) I = 3b λt)) {gb)<ûn b)} dtdb + o p n 5 6 ). λ) gb)

6 LOPUHAÄ AND MUSTA 5 Then, by a Taylor expansion, A), and 4), there exists a K >, such that b λt)) { gb) t) λ gb)) } Kt gb)) +s, 7) for all b λ),λ)) and t gb), ].Wefind λ) Û n b) I = 3t gb)) λ gb)) λ) {gb)<ûn b)} dtdb + R n + o p n 5 6 ), 8) gb) where λ) Û n b) R n 3Kt gb)) +s {gb)<ûn b)} dtdb λ) gb) λ) 3K 3+s 3 + s λ) Ûnb) gb) db = Op n 3+s) 3 ) = o p n 5 6 ), by using 3) in Durot 7), that is, for every q < 3q ),thereexistsk q > suchthat It follows that I = λ) E [ n 3 Û n a) ga) ) q ] K q, for all a R. 9) λ) Ûn b) gb) ) 3 λ gb)) {gb)<ûn b)} db + o p n 5 6 ). In the same way, one finds λ) I = gb) Ûn b) ) 3 λ gb)) {gb)>ûn b)} db + o p n 5 6 ), λ) and it follows that λ n t) λt) 3 λ) dt = I + I = 3 λ) Ûnb) gb) λ gb)) db + o p n 5 6 ). Now, because λ is bounded, by Markov's inequality, for each ε>, we can write λ) P n 5 6 ) 3 λ) Ûnb) gb) λ gb)) db >ε λ) [ E n Û cεn 6 λ) n b) gb) 3 ] db Kn 6. For the last inequality, we again used 9) with q = 3. It follows that which finishes the proof. λ) λ) 3 Ûnb) gb) λ gb)) db = o P n 5 6 ), ) Proof of Lemma. Similar to 3), we write λ n t) λt)) dt = λ n t) λt) ) 4λt)) dt + R n, where R n = λ n t) λt) ) { λ n t)+ } λt)) 4λt)) dt.

7 6 LOPUHAÄ AND MUSTA Write 4λt) λ n t)+ λt)) = λt) λ n t) λt) λ n t) ) λt) = λt) λ n t) ) λt) + λ n t)+. λt) Because <λ) λt) λ) <, this implies that R n λ n t) λt) ) 4λt) λ n t)+ ) λt) 4λt) λ n t)+ ) dt C λ n t) λt) 3 dt λt) for some positive constant C only depending on λ) and λ). Then, from Lemma, it follows that n 5/6 R n = o P ). 3 MAIN RESULT In order to formulate the central limit theorem for the Hellinger distance, we introduce the process X, defined as { Xa) =argmax Wu) u a) }, a R, ) u R with W being a standard two-sided Brownian motion. This process was introduced and investigated by Groeneboom 985, 989) and plays a key role in the asymptotic behavior of isotonic estimators. The distribution of the random variable X) is the pointwise limiting distribution of several isotonic estimators, and the constant k = cov X), Xa) a ) da ) appears in the limit variance of the L p -error of isotonic estimators e.g., Durot,, 7; Groeneboom, 985; Groeneboom et al., 999; Kulikov & Lopuhaä, 5). We then have the following central limit theorem for the squared Hellinger loss. Theorem. Assume A), A'), A3), A4), and 4). Moreover, suppose that λ is strictly positive. Then, the following holds: { n 6 n 3 λ n t) ) } λt) dt μ N,σ ), where μ = E [ X) ] where k is defined in ). λ t)l t) 3 dt, 3 λt) σ = 3 k λ t)l t) 3 L t) dt, λt)

8 LOPUHAÄ AND MUSTA 7 Proof. According to Lemma, it is sufficient to show that n 6 n 3 I n μ ) N,σ ),with I n = λ n t) λt) ) 4λt)) dt. Again, we follow the same line of reasoning as in the proof of theorem in Durot 7). We briefly sketch the main steps of the proof. We first express I n in terms of the inverse process Û n, defined in 5). To this end, similar to the proof of Lemma, consider Ĩ = λ n t) λt) ) + 4λt)) dt, Ĩ = For the first integral, we can now write Then, if we introduce we obtain J = Ĩ = λ) λt)) 4λt) Ĩ J Û n λ)) { λ n t) λt)+ 4aλt)} dadt. λ) λt)) 4aλt) 4λ) Û n λ)) λt) λ n t) ) + 4λt)) dt. { λ n t) λt)+ 4aλt)} dadt, 3) λ n t) λt) ) + dt. { λ n t) λt)+ 4aλt)} dadt Similar to the reasoning in the proof of Lemma, we conclude that Ĩ = J + o p n 5 6 ). Next, the change of variable b = λt)+ 4aλt) yields J = λ) λ) = λ) λ) + λ) λ) Û n b) gb) Û n b) gb) Û n b) gb) b λt) λt) { Û n b)>gb)} dtdb b λt) b { Û n b)>gb)} dtdb b λt)) bλt) { Û n b)>gb)} dtdb. 4) Let us first consider the second integral on the right-hand side of 4). We then have λ) λ) Û n b) gb) λ) = λ) 6λ) b λt)) bλt) { Û n b)>gb)} dtdb λ) λ) sup x [,] sup x [,] Û n b) gb) b λt)) { Û n b)>gb)} dtdb λ) Û λ n b) x) { Û n b)>gb)} t gb)) dtdb λ) gb) λ) λ x) { Û n b)>gb)}ûn b) gb) ) 3 db = op n 5 6 ), λ)

9 8 LOPUHAÄ AND MUSTA again by using 9) with q = 3. Then, consider the first integral on the right-hand side of 4). Similar to 7), there exists K > suchthat b λt) gb) t)λ gb))) Kt gb))+s, for all b λ),λ)) and t gb), ]. Taking into account that λ gb)) <, similar to 8), it follows that where Ĩ = λ) λ) R n λ) λ) Û n b) gb) Û n b) gb) λ gb)) t gb)) b { Û n b)>gb)} dtdb + R n + o p n 5 6 ), K λ) t gb))+s {gb)<ûn b)} dtdb λ) K +s λ) + s) λ) Ûnb) gb) db = Op n +s) 3 ) = o p n 5 6 ), by using 9) once more, and the fact that s > 3 4. It follows that In the same way, Ĩ = λ) λ) Ĩ = λ) λ) λ gb))ûn b) gb) ) { Û 4b n b)>gb)} db + o p n 5 6 ). λ gb))ûn b) gb) ) { Û 4b n b)<gb)} db + o p n 5 6 ), so that λ) I n = Ĩ + Ĩ = Ûn b) gb) ) λ gb)) db + o P n 5 6 ). λ) 4b We then mimic step in the proof of theorem in Durot 7). Consider the representation B n t) =W n t) ξ n t, where W n is a standard Brownian motion, ξ n = ifb n is a Brownian motion, and ξ n is a standard normal random variable independent of B n if B n is a Brownian bridge. Then, define W t u) =n 6 { W n Lt)+n 3 u ) W n Lt)) }, for t [, ], which has the same distribution as a standard Brownian motion. Now, for t [, ], let dt) =λ t) L t) ) and define { Ṽt) =argmax Wt u) dt)u }. 5) u log n Then, similar to 6) in Durot 7), we will obtain n 3 I n = Ṽt) n 6 ξ n λ t) dt) L t) 4λt) dt + o P n 6 ). 6) To prove 6), by using the approximation Û n a) ga) LÛ n a)) Lga)) L ga))

10 LOPUHAÄ AND MUSTA 9 and a change of variable a ξ = a n / ξ n L ga)), we first obtain n 3 I n = n 3 λ) δ n λ)+δ n LÛ n a ξ )) Lga ξ )) λ ga)) L ga))) 4a da + o p n 6 ), where δ n = n 6 log n. Apart from the factor 4a, the integral on the right-hand side is the same as in the proof of theorem in Durot 7) for p =. This means that we can apply the same series of succeeding approximations for LÛ n a ξ )) Lga ξ )) as in Durot 7), which yields λ) δ n 3 I n = n 3 n Ṽga)) n 6 ξ n λ ga)) λ)+δ n dga)) L ga))) 4a da + o p n 6 ). Finally, because the integrals over [λ),λ)+δ n ] and [λ) δ n,λ)] are of the order o p n /6 ), this yields 6) by a change of variables t = ga). Thenextstepistoshowthatthetermwithξ n can be removed from 6). This can be done exactly as in Durot 7), because the only difference with the corresponding integral in Durot 7) is the factor 4λt), which is bounded and does not influence the argument in Durot 7). We find that n 3 I n = Ṽt) λ t) L t) 4λt) dt + o P n 6 ). Then, define Y n t) = Ṽt) E [ Ṽt) ]) λ t) L t) 4λt). 7) By approximating Ṽt) by { Vt) =argmax Wt u) dt)u }, u R and using that, by Brownian scaling, dt) /3 Vt) has the same distribution as X) see Durot, 7, for details), we have that E [ Ṽt) ] λ t) L t) 4λt) dt = E [ X) ] dt) 4 3 λ t) L t) = μ + o n 6). 4λt) dt + o n 6) It follows that We then first show that n 6 I n μ )=n 6 Var n 6 Y n t)dt + o P ). ) Y n t)dt σ. 8) Once more, following the proof in Durot 7), we have ) v n = Var Y n t)dt = s λ t) λ s) L t) L s) 4λt) 4λs) cov Ṽt), Ṽs) ) dtds.

11 LOPUHAÄ AND MUSTA After the same sort of approximations as in Durot 7), we get min,s+c n ) v n = λ s) 4 s L s) 4λs)) cov V t s), V s s) ) dtds + o n 3), where c n = n 3 log n inf t L t) and where, for all s and t, { V t s) =argmax Wt u) ds)u }. u R Then, use that ds) /3 V t s) has the same distribution as X n 3 ds) 3 Lt) Ls)) ) n 3 ds)lt) Ls)) so that the change of variable a = n /3 ds) /3 Lt) Ls)) in v n leads to n 3 v n λ s) 4 L s) 4λs)) ds) 3 L s) cov Xa), X) ) dads k λ s) 4 3 L s) 7 3 L s) ds = σ, 4λs)) λ s) 3 which proves 8). Finally, asymptotic normality of n 6 Y nt)dt follows by Bernstein's method of big blocks and small blocks in the same way as in step 6 of the proof of theorem in Durot 7). Corollary. Assume A), A' ), A3), A4), and 4) and let H λ n,λ) be the Hellinger distance defined in ). Moreover, suppose that λ is strictly positive. Then, n 6 { n 3 H λ n,λ) μ } N, σ ). μ = μ and σ = σ 8μ, where μ and σ are defined in Theorem. Proof. This follows immediately by applying the delta method with φx) = x to the result in Theorem. 4 EXAMPLES The type of scaling for the Hellinger distance in Corollary is similar to that in the central limit theorem for L p -distances. This could be expected in view of the approximation in terms of a weighted squared L -distance see Lemma ), and the results, for example, in Kulikov and Lopuhaä 5) and Durot 7). Actually, this is not always the case. The phenomenon of observing different speeds of convergence for the Hellinger distance from those for the L and L norms was considered by Birgé 986). In fact, this is related to the existence of a lower bound for the function we are estimating. If the function of interest is bounded from below, which is the case considered in this paper, then the approximation 3) holds; see Birgé 986) for an explanation. When we insert the expressions for μ and σ from Theorem, then we get σ k = 4E [ λ t)l t) 3 L t)λt) dt X) ], λ t)l t) 3 λt) dt

12 LOPUHAÄ AND MUSTA where k is defined in ). This means that, in statistical models where L =Λin condition A4) and, hence, L = λ, the limiting variance σ = k 4E[X) ]) does not depend on λ. One such a model is estimation of the common monotone density λ on [, ] of independent random variables X,, X n.then,λ n is the empirical distribution function of X,, X n,and λ n is Grenander's estimator Grenander, 956). In that case, if inf t λt) >, the conditions of Corollary are satisfied with L = Λ see theorem 6 in Durot, 7), so that the limiting variance of the Hellinger loss for the Grenander estimator does not depend on the underlying density. This behavior was conjectured in Wellner 5) and coincides with that of the limiting variance in the central limit theorem for the L -error for the Grenander estimator, first discovered by Groeneboom 985); see also Durot, 7), Groeneboom et al. 999), and Kulikov and Lopuhaä 5). Another example is when we observe independent identically distributed inhomogeneous Poisson processes N,, N n with common mean function Λ on [, ] with derivative λ,forwhich Λ) <.Then,Λ n is the restriction of N + + N n ) n to [, ]. Also in that case, the conditions of Corollary are satisfied with L =Λsee theorem 4 in Durot, 7), so that the limiting variance of the Hellinger loss for λ n does not depend on the common underlying intensity λ. However, note that, for this model, the L -loss for λ n is asymptotically normal according to theorem in Durot 7) but with limiting variance depending on the value Λ) Λ). Consider the monotone regression model y i,n = λi n)+ε i,n,fori =,, n,wheretheε i,n 's are i.i.d. random variables with mean zero and variance σ >. Let Λ n t) = y i,n, t [, ] n i nt be the empirical distribution function. Then, λ n is a slight modification of) Brunk's 958) estimator. Under appropriate moment conditions on the ε i,n, the conditions of Corollary are satisfied with Lt) =tσ see theorem 5 in Durot, 7). In this case, the limiting variance of the Hellinger loss for λ n depends on both λ and σ,whereasthel -loss for λ n is asymptotically normal according to theorem in Durot 7) but with limiting variance only depending on σ. Suppose we observe a right-censored sample X, Δ ),, X n, Δ n ),wherex i = mint i, Y i ) and Δ i = {Ti Y i },withthet i 's being nonnegative i.i.d. failure times and the Y i 's are i.i.d. censoring times independent of the T i 's. Let F be the distribution function of the T i 's with density f and let G be the distribution function of the Y i 's. The parameter of interest is the monotone failure rate λ = f F) on [, ].Inthiscase,Λ n is the restriction of the Nelson Aalen estimator to [, ]. If we assume A) and inf t λt) >, then, under suitable assumptions on F and G, the conditions of Corollary hold with Lt) = t λu) du, t [, ]; Fu)) Gu)) see theorem 3 in Durot 7). This means that the limiting variance of the Hellinger loss depends on λ, F, andg, whereas the limiting variance of the L -loss depends only on their values at and. In particular, in the case of nonrandom censoring times, L = F ), the limiting variance of the Hellinger loss depends on λ and F, whereas the limiting variance of the L -loss depends only on the value F).

13 LOPUHAÄ AND MUSTA 5 TESTING EXPONENTIALITY AGAINST A NONDECREASING DENSITY In this section, we investigate a possible application of Theorem, that is, testing for an exponential density against a nonincreasing alternative by means of the Hellinger loss. The exponential distribution is one of the most used and well-known distributions. It plays a very important role in reliability, survival analysis, and renewal process theory, when modeling random times until some event. As a result, a lot of attention has been given in the literature to testing for exponentiality against a wide variety of alternatives, by making use of different properties and characterizations of the exponential distribution Alizadeh Noughabi & Arghami, ; Haywood & Khmaladze, 8; Jammalamadaka & Taufer, 3; Meintanis, 7). In this section, we consider a test for exponentiality, assuming that data come from a decreasing density. The test is based on the Hellinger distance between the parametric estimator of the exponential density and the Grenander-type estimator of a general decreasing density. In order to be able to apply the result of Corollary, we first investigate a test whether the data is exponentially distributed with a fixed parameter λ >. Because such a test may not be very interesting from a practical point of view, we also investigate testing exponentiality, leaving the parameter λ> unspecified. 5. Testing a simple null hypothesis of exponentiality Let f λ x) =λe λx {x } be the exponential density with parameter λ >. Assume we have a sample of i.i.d. observations X,, X n from some distribution with density f and for λ > fixed, we want to test H f = f λ against H f is nonincreasing. Under the alternative hypothesis, we can estimate f on an interval [,τ] by the Grenander-type estimator f n from Section. Then, as a test statistic, we take T n = H f n,f λ ), the Hellinger distance on [,τ] between f n and f λ, and at level α, we reject the null hypothesis if T n > c n,α,λ,for some critical value c n,α,λ >. According to Corollary, it follows that T n is asymptotically normally distributed, but the mean and the variance depend on the constant k defined in ). To avoid computation of k,we estimate the mean and the variance of T n empirically. We generate B =, samples from f λ. For each of these samples, we compute the Grenander estimator f n,i and the Hellinger distance T n,i = H f n,i,f λ ),fori =,,, B. Finally, we compute the mean T and the variance s T of the values T n,,, T n,b. For the critical value of the test, we take c n,α,λ = T+q α s T,whereq α is the α)% quantile of the standard normal distribution. Note that, even if in the density model the asymptotic variance is independent of the underlying distribution, the asymptotic mean does depend on λ, that is, the test is not distribution free. Another possibility, instead of the normal approximation, is to take as a critical value c n,α,λ the empirical α)% quantile of the values T n,,, T n,b. To investigate the performance of the test, we generate N =, samples from f λ.for each sample, we compute the value of the test statistic T n = H f n,f λ ) and we reject the null hypothesis if T n > c n,α,λ or if T n > c n,α,λ ). The percentage of rejections gives an approximation of the level of the test. Table shows the results of the simulations for different sample sizes n and two values of λ and α =.,.5,.. Here, we take τ = 5 because the mass of the exponential distribution with parameter one or five outside the interval [, 5] is negligible. We observe that the percentage of rejections is close to the nominal level if we use c n,α,λ as a critical value for the

14 LOPUHAÄ AND MUSTA 3 TABLE Simulated levels of T n using top) c n,α,λ and bottom) c n,α,λ,with α =.,.5,., under the null hypothesis varying the sample size n and the parameter λ λ = λ = 5 α =. α =.5 α =. α =. α =.5 α =. n = n = n = FIGURE Simulated powers using solid) c n,α,λ and dashed) c n,α,λ,withα =.5, of T n and the power of dotted) the likelihood ratio test for λ =, ν =.4,.45,,, and n = test, but it is a bit higher if we use c n,α,λ. This is due to the fact that, for small sample sizes, the normal approximation of Corollary is not very precise. Moreover, to investigate the power, we generate a sample from the Weibull distribution with shape parameter ν and scale parameter λ. Recall that Weibull,λ ) corresponds to the exponential distribution with parameter λ and that a Weibull distribution with ν<has a decreasing density. We compute the Hellinger distance T n = H f n,f λ ) and we reject the null hypothesis if T n > c n,α,λ or if T n > c n,α,λ ). After repeating the procedure N =, times, we compute the percentage of times that we reject the null hypothesis, which gives an approximation of the power of the test. The results of the simulations, done with n =, λ =, α =.5, and alternatives for which ν varies between.4 and by steps of.5, are shown in Figure. As a benchmark, we compute the power of the likelihood ratio LR) test statistic for each ν. As expected, our test is less powerful with respect to the LR test, which is designed to test against a particular alternative. However, as the sample size increases, the performance improves significantly and the difference of the results when using c n,α,λ or c n,α,λ becomes smaller.

15 4 LOPUHAÄ AND MUSTA TABLE Simulated levels of R n under the null hypothesis varying the sample size n and the parameter λ λ n = n = n = n =, Testing a composite null hypothesis of exponentiality Assume we have a sample of i.i.d. observations X,, X n from some distribution with density f and we want to test H f = f λ, for some λ> against H f is nonincreasing. Under the null hypothesis, we can construct a parametric estimator of the density that is given by f λ n,where λ n = n n i= X i is the maximum likelihood estimator of λ. On the other hand, under the alternative hypothesis, we can estimate f on aninterval [,τ] by the Grenander-type estimator f n from Section. Then, as a test statistic, we take R n = H f n,f λ n ), the Hellinger distance on [,τ] between the two estimators, and at level α, we reject the null hypothesis if R n > d n,α for some critical value d n,α >. Because the limit distribution of the test statistic is not known, we use a bootstrap procedure to calibrate the test. We generate B =, bootstrap samples of size n from f λ n, and for each of them, we compute the estimators f λ f n,i, and the test statistic n,i R = Hf n,i λ f n,i, n,i ),fori =,,, B. Then, we determine the αth upper percentile d n,α of the values R n,,, R n,b. Finally, we reject the null hypothesis if R n > d n,α. To investigate the level of the test, for α =.5 and λ>fixed, we start with a sample from an exponential distribution with parameter λ and repeat the above procedure N =, times. We count the number of times we reject the null hypothesis, that is, the number of times the value of the test statistics exceeds the corresponding 5th upper percentile. Dividing this number by N gives an approximation of the level. Table shows the results of the simulations for different sample sizes n and different values of λ. The rejection probabilities are close to.5 for all the values of λ, which shows that the test performs well in the different scenarios slightly and strongly decreasing densities). To investigate the power, for α =.5 and fixed < ν < andλ >, we now start with a sample from a Weibull distribution with shape parameter ν and scale parameter λ and compute the value R n = Hf λ n, f n ). In order to calibrate the test, we treat this sample as if it were an exponential sample and estimate λ by λ n = n n i= X i. Next, we generate B =, bootstrap samples of size n from the exponential density with parameter λ n. For each bootstrap sample, we compute the test statistic R = Hf n,i λ f n,i, ),fori =,,, B, and we determine n,i the 5th upper percentile d n,.5 of the values R n,,, R. Finally, we reject the null hypothesis n,b if R n > d n,.5. After repeating the above procedure N =, times, each time starting with a Weibull sample, we compute the percentage of times that we reject the null hypothesis, which gives an approximation of the power of the test. We compare the Hellinger distance test to some of the tests from Alizadeh Noughabi and Arghami ), which are designed to test exponentiality against all the possible alternatives, that is, not only against decreasing densities. These tests are all distribution free, which means that their critical values can be computed independently of λ. Then, for each of the Weibull samples generated before, we count the percentage of times that the tests

16 LOPUHAÄ AND MUSTA a) b) FIGURE Simulated powers of black solid) the Hellinger distance test and some other competitor tests, that is, blue) T, green) T, yellow) ω n, brown) S n,red)ep n,purple)kl mn, and orange) CO n, and the power of black dotted) the likelihood ratio test for left) n =, λ =,.4 ν and right) β 8. a) Weibull. b) Beta T, T,ω n, S n, EP n, KL mn, and CO n see Alizadeh Noughabi & Arghami,, for a precise definition) reject the null hypothesis. Finally, we also compare the power of our test with the LR test for each ν. The results of the simulations, done with n =, λ =, and alternatives for which ν varies between.4 and, are shown in the left panel in Figure. Actually, we also investigated the power for different choices of λ, and we observed similar behavior as for λ =. The figure shows that the test based on the Hellinger distance performs worse than the other tests. In this case, the test of Cox and Oakes CO n has greater power. However, Alizadeh Noughabi and Arghami ) concluded that none of the tests is uniformly most powerful with respect to the others. We repeated the experiment taking, instead of the Weibull distribution, the beta distribution with parameters α = and β 8 as alternative. Note that it has a nonincreasing density on [, ] proportional to x) β, and the extreme case β = corresponds to the uniform distribution. Results are shown in the right panel in Figure. We observe that, for small values of β, the Hellinger distance test again behaves worse than the others, and in this case, R n and EP n have greater power. However, for larger β, the Hellinger distance test outperforms all the others. ORCID Eni Musta REFERENCES Alizadeh Noughabi, H., & Arghami, N. R. ). Testing exponentiality based on characterizations of the exponential distribution. Journal of Statistical Computation and Simulation, 8), Birgé, L. 986). On estimating a density using Hellinger distance and some other strange facts. Probability Theory and Related Fields, 7), 7 9.

17 6 LOPUHAÄ AND MUSTA Birgé, L., & Massart, P. 993). Rates of convergence for minimum contrast estimators. Probability Theory and Related Fields, 97 ), 3 5. Brunk, H. D. 958). On the estimation of parameters restricted by inequalities. Annals of Mathematical Statistics, 9, Doss, C. R., & Wellner, J. A. 6). Global rates of convergence of the MLEs of logconcave and s-concave densities. The Annals of Statistics, 443), Durot, C. ). Sharp asymptotics for isotonic regression. Probability Theory and Related Fields, ), 4. Durot, C. 7). On the L p -error of monotonicity constrained estimators. The Annals of Statistics, 353), 8 4. Durot, C., Kulikov, V. N., & Lopuhaä, H. P. ). The limit distribution of the L -error of Grenander-type estimators. The Annals of Statistics, 43), van de Geer, S. A. ). Applications of empirical process theory. Vol. 6. Cambridge Series in Statistical and Probabilistic Mathematics pp. xii 86). Cambridge, UK: Cambridge University Press. Grenander, U. 956). On the theory of mortality measurement II. Skand Aktuarietidskr, 39, Groeneboom, P. 985). Estimating a monotone density. In Berkeley Conference in Honor of Jerzy Neyman and Jack Kiefer, Vol. II. The Wadsworth & Brooks/Cole Statistics/Probability Series pp ). Groeneboom, P. 989). Brownian motion with a parabolic drift and airy functions. Probability Theory Related Fields, 8), Groeneboom, P., Hooghiemstra, G., & Lopuhaä, H. P. 999). Asymptotic normality of the L error of the Grenander estimator. The Annals of Statistics, 74), Groeneboom, P., & Wellner, J. A. 99). Information bounds and nonparametric maximum likelihood estimation. Vol. 9). Basel, Switzerland: Birkhäuser Verlag. DMV Seminar. Haywood, J., & Khmaladze, E. 8). On distribution-free goodness-of-fit testing of exponentiality. Journal of Economics, 43), 5 8. Huang, J., & Wellner, J. A. 995). Estimation of a monotone density or monotone hazard under random censoring. Scandinavian Journal of Statistics Theory and Applications, ), Huang, Y., & Zhang, C. H. 994). Estimating a monotone density from censored observations. The Annals of Statistics, 3), Jammalamadaka, S. R., & Taufer, E. 3). Testing exponentiality by comparing the empirical distribution function of the normalized spacings with that of the original data. Journal of Nonparametric Statistics, 56), Kim, A. K. H., Guntuboyina, A., & Samworth, R. J. 6). Adaptation in log-concave density estimation. arxiv: Kim, A. K. H., & Samworth, R. J. 6). Global rates of convergence in log-concave density estimation. The Annals of Statistics, 446), Kulikov, V. N., & Lopuhaä, H. P. 5). Asymptotic normality of the L k -error of the Grenander estimator. The Annals of Statistics, 335), LeCam, L. 97). On the weak convergence of probability measures. Annals of Mathematical Statistics, 4, LeCam, L. 973). Convergence of estimates under dimensionality restrictions. The Annals of Statistics,, Lopuhaä, H. P., & Nane, G. F. 3). Shape constrained non-parametric estimators of the baseline distribution in Cox proportional hazards model. Scandinavian Journal of Statistics Theory and Applications, 43), Marshall, A. W., & Proschan, F. 965). Maximum likelihood estimation for distributions with monotone failure rate. Annals of Mathematical Statistics, 36, Meintanis, S. G. 7). Test for exponentiality against Weibull and gamma decreasing hazard rate alternatives. Kybernetika Prague), 433), Pal, J. K., Woodroofe, M., & Meyer, M. 7). Estimating a Polya frequency function. Complex datasets and inverse problems. IMS Lecture Notes Monograph Series, 54, Prakasa Rao, B. L. S. 969). Estimation of a unimodal density. Sankhyā Statistics) The Indian Journal of Statistics Series A, 3, Prakasa Rao, B. L. S. 97). Estimation for distributions with monotone failure rate. Annals of Mathematical Statistics, 4, Seregin, A., & Wellner, J. A. ). Nonparametric estimation of multivariate convextransformed densities. The Annals of Statistics, 386), With supplementary material available online.

18 LOPUHAÄ AND MUSTA 7 van der Vaart, A. W., & Wellner, J. A. 996). Weak convergence and empirical processes: With applications to statistics. Springer Series in Statistics, pp. xvi 58. New York, NY: Springer-Verlag. Wellner, J. A. 5). Musings about shape constrained estimation and inference: Some problems. Paper presented at the workshop of Shape Constrained Inference: Open Problems and New Directions, Lorentz Centre, Leiden, The Netherlands. Wong, W. H., & Shen, X. 995). Probability inequalities for likelihood ratios and convergence rates of sieve MLEs. The Annals of Statistics, 3), How to cite this article: Lopuhaä HP, Musta E. A central limit theorem for the Hellinger loss of Grenander-type estimators. Statistica Neerlandica. 8; 7.

Asymptotic normality of the L k -error of the Grenander estimator

Asymptotic normality of the L k -error of the Grenander estimator Asymptotic normality of the L k -error of the Grenander estimator Vladimir N. Kulikov Hendrik P. Lopuhaä 1.7.24 Abstract: We investigate the limit behavior of the L k -distance between a decreasing density

More information

Nonparametric estimation under shape constraints, part 2

Nonparametric estimation under shape constraints, part 2 Nonparametric estimation under shape constraints, part 2 Piet Groeneboom, Delft University August 7, 2013 What to expect? Theory and open problems for interval censoring, case 2. Same for the bivariate

More information

arxiv:math/ v1 [math.st] 11 Feb 2006

arxiv:math/ v1 [math.st] 11 Feb 2006 The Annals of Statistics 25, Vol. 33, No. 5, 2228 2255 DOI: 1.1214/95365462 c Institute of Mathematical Statistics, 25 arxiv:math/62244v1 [math.st] 11 Feb 26 ASYMPTOTIC NORMALITY OF THE L K -ERROR OF THE

More information

The International Journal of Biostatistics

The International Journal of Biostatistics The International Journal of Biostatistics Volume 1, Issue 1 2005 Article 3 Score Statistics for Current Status Data: Comparisons with Likelihood Ratio and Wald Statistics Moulinath Banerjee Jon A. Wellner

More information

Maximum likelihood: counterexamples, examples, and open problems

Maximum likelihood: counterexamples, examples, and open problems Maximum likelihood: counterexamples, examples, and open problems Jon A. Wellner University of Washington visiting Vrije Universiteit, Amsterdam Talk at BeNeLuxFra Mathematics Meeting 21 May, 2005 Email:

More information

Nonparametric estimation under Shape Restrictions

Nonparametric estimation under Shape Restrictions Nonparametric estimation under Shape Restrictions Jon A. Wellner University of Washington, Seattle Statistical Seminar, Frejus, France August 30 - September 3, 2010 Outline: Five Lectures on Shape Restrictions

More information

STATISTICAL INFERENCE UNDER ORDER RESTRICTIONS LIMIT THEORY FOR THE GRENANDER ESTIMATOR UNDER ALTERNATIVE HYPOTHESES

STATISTICAL INFERENCE UNDER ORDER RESTRICTIONS LIMIT THEORY FOR THE GRENANDER ESTIMATOR UNDER ALTERNATIVE HYPOTHESES STATISTICAL INFERENCE UNDER ORDER RESTRICTIONS LIMIT THEORY FOR THE GRENANDER ESTIMATOR UNDER ALTERNATIVE HYPOTHESES By Jon A. Wellner University of Washington 1. Limit theory for the Grenander estimator.

More information

Maximum likelihood: counterexamples, examples, and open problems. Jon A. Wellner. University of Washington. Maximum likelihood: p.

Maximum likelihood: counterexamples, examples, and open problems. Jon A. Wellner. University of Washington. Maximum likelihood: p. Maximum likelihood: counterexamples, examples, and open problems Jon A. Wellner University of Washington Maximum likelihood: p. 1/7 Talk at University of Idaho, Department of Mathematics, September 15,

More information

Score Statistics for Current Status Data: Comparisons with Likelihood Ratio and Wald Statistics

Score Statistics for Current Status Data: Comparisons with Likelihood Ratio and Wald Statistics Score Statistics for Current Status Data: Comparisons with Likelihood Ratio and Wald Statistics Moulinath Banerjee 1 and Jon A. Wellner 2 1 Department of Statistics, Department of Statistics, 439, West

More information

arxiv: v3 [math.st] 29 Nov 2018

arxiv: v3 [math.st] 29 Nov 2018 A unified study of nonparametric inference for monotone functions arxiv:186.1928v3 math.st] 29 Nov 218 Ted Westling Center for Causal Inference University of Pennsylvania tgwest@pennmedicine.upenn.edu

More information

STAT Sample Problem: General Asymptotic Results

STAT Sample Problem: General Asymptotic Results STAT331 1-Sample Problem: General Asymptotic Results In this unit we will consider the 1-sample problem and prove the consistency and asymptotic normality of the Nelson-Aalen estimator of the cumulative

More information

NONPARAMETRIC CONFIDENCE INTERVALS FOR MONOTONE FUNCTIONS. By Piet Groeneboom and Geurt Jongbloed Delft University of Technology

NONPARAMETRIC CONFIDENCE INTERVALS FOR MONOTONE FUNCTIONS. By Piet Groeneboom and Geurt Jongbloed Delft University of Technology NONPARAMETRIC CONFIDENCE INTERVALS FOR MONOTONE FUNCTIONS By Piet Groeneboom and Geurt Jongbloed Delft University of Technology We study nonparametric isotonic confidence intervals for monotone functions.

More information

Shape constrained estimation: a brief introduction

Shape constrained estimation: a brief introduction Shape constrained estimation: a brief introduction Jon A. Wellner University of Washington, Seattle Cowles Foundation, Yale November 7, 2016 Econometrics lunch seminar, Yale Based on joint work with: Fadoua

More information

Maximum likelihood estimation of a log-concave density based on censored data

Maximum likelihood estimation of a log-concave density based on censored data Maximum likelihood estimation of a log-concave density based on censored data Dominic Schuhmacher Institute of Mathematical Statistics and Actuarial Science University of Bern Joint work with Lutz Dümbgen

More information

Lecture 22 Survival Analysis: An Introduction

Lecture 22 Survival Analysis: An Introduction University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 22 Survival Analysis: An Introduction There is considerable interest among economists in models of durations, which

More information

Nonparametric estimation under shape constraints, part 1

Nonparametric estimation under shape constraints, part 1 Nonparametric estimation under shape constraints, part 1 Piet Groeneboom, Delft University August 6, 2013 What to expect? History of the subject Distribution of order restricted monotone estimates. Connection

More information

Likelihood Based Inference for Monotone Response Models

Likelihood Based Inference for Monotone Response Models Likelihood Based Inference for Monotone Response Models Moulinath Banerjee University of Michigan September 5, 25 Abstract The behavior of maximum likelihood estimates (MLE s) the likelihood ratio statistic

More information

Les deux complices. Piet Groeneboom, Delft University. Juin 28, 2018

Les deux complices. Piet Groeneboom, Delft University. Juin 28, 2018 Les deux complices Piet Groeneboom, Delft University Juin 28, 2018 Monsieur Teste (Paul Valéry) Je me suis rarement perdu de vue Je me suis détesté je me suis adoré Puis, nous avons vieilli ensemble (La

More information

A STATISTICAL TEST FOR MONOTONIC AND NON-MONOTONIC TREND IN REPAIRABLE SYSTEMS

A STATISTICAL TEST FOR MONOTONIC AND NON-MONOTONIC TREND IN REPAIRABLE SYSTEMS A STATISTICAL TEST FOR MONOTONIC AND NON-MONOTONIC TREND IN REPAIRABLE SYSTEMS Jan Terje Kvaløy Department of Mathematics and Science, Stavanger University College, P.O. Box 2557 Ullandhaug, N-491 Stavanger,

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 24 Paper 153 A Note on Empirical Likelihood Inference of Residual Life Regression Ying Qing Chen Yichuan

More information

Nonparametric estimation of log-concave densities

Nonparametric estimation of log-concave densities Nonparametric estimation of log-concave densities Jon A. Wellner University of Washington, Seattle Seminaire, Institut de Mathématiques de Toulouse 5 March 2012 Seminaire, Toulouse Based on joint work

More information

arxiv:math/ v1 [math.st] 21 Jul 2005

arxiv:math/ v1 [math.st] 21 Jul 2005 The Annals of Statistics 2005, Vol. 33, No. 3, 1330 1356 DOI: 10.1214/009053605000000138 c Institute of Mathematical Statistics, 2005 arxiv:math/0507427v1 [math.st] 21 Jul 2005 ESTIMATION OF A FUNCTION

More information

Chapter 2 Inference on Mean Residual Life-Overview

Chapter 2 Inference on Mean Residual Life-Overview Chapter 2 Inference on Mean Residual Life-Overview Statistical inference based on the remaining lifetimes would be intuitively more appealing than the popular hazard function defined as the risk of immediate

More information

11 Survival Analysis and Empirical Likelihood

11 Survival Analysis and Empirical Likelihood 11 Survival Analysis and Empirical Likelihood The first paper of empirical likelihood is actually about confidence intervals with the Kaplan-Meier estimator (Thomas and Grunkmeier 1979), i.e. deals with

More information

Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances

Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances Advances in Decision Sciences Volume 211, Article ID 74858, 8 pages doi:1.1155/211/74858 Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances David Allingham 1 andj.c.w.rayner

More information

Concentration behavior of the penalized least squares estimator

Concentration behavior of the penalized least squares estimator Concentration behavior of the penalized least squares estimator Penalized least squares behavior arxiv:1511.08698v2 [math.st] 19 Oct 2016 Alan Muro and Sara van de Geer {muro,geer}@stat.math.ethz.ch Seminar

More information

arxiv:submit/ [math.st] 6 May 2011

arxiv:submit/ [math.st] 6 May 2011 A Continuous Mapping Theorem for the Smallest Argmax Functional arxiv:submit/0243372 [math.st] 6 May 2011 Emilio Seijo and Bodhisattva Sen Columbia University Abstract This paper introduces a version of

More information

Likelihood Based Inference for Monotone Response Models

Likelihood Based Inference for Monotone Response Models Likelihood Based Inference for Monotone Response Models Moulinath Banerjee University of Michigan September 11, 2006 Abstract The behavior of maximum likelihood estimates (MLEs) and the likelihood ratio

More information

ISOTONIC MAXIMUM LIKELIHOOD ESTIMATION FOR THE CHANGE POINT OF A HAZARD RATE

ISOTONIC MAXIMUM LIKELIHOOD ESTIMATION FOR THE CHANGE POINT OF A HAZARD RATE Sankhyā : The Indian Journal of Statistics 1997, Volume 59, Series A, Pt. 3, pp. 392-407 ISOTONIC MAXIMUM LIKELIHOOD ESTIMATION FOR THE CHANGE POINT OF A HAZARD RATE By S.N. JOSHI Indian Statistical Institute,

More information

Product-limit estimators of the survival function with left or right censored data

Product-limit estimators of the survival function with left or right censored data Product-limit estimators of the survival function with left or right censored data 1 CREST-ENSAI Campus de Ker-Lann Rue Blaise Pascal - BP 37203 35172 Bruz cedex, France (e-mail: patilea@ensai.fr) 2 Institut

More information

Goodness-of-Fit Test for Monotone Functions

Goodness-of-Fit Test for Monotone Functions Scandinavian Journal of Statistics doi: 1.1111/j.1467-9469.21.688.x 21 Board of the Foundation of the Scandinavian Journal of Statistics. Published by Blackwell Publishing Ltd. Goodness-of-Fit Test for

More information

Chernoff s distribution is log-concave. But why? (And why does it matter?)

Chernoff s distribution is log-concave. But why? (And why does it matter?) Chernoff s distribution is log-concave But why? (And why does it matter?) Jon A. Wellner University of Washington, Seattle University of Michigan April 1, 2011 Woodroofe Seminar Based on joint work with:

More information

Least squares under convex constraint

Least squares under convex constraint Stanford University Questions Let Z be an n-dimensional standard Gaussian random vector. Let µ be a point in R n and let Y = Z + µ. We are interested in estimating µ from the data vector Y, under the assumption

More information

Maximum Likelihood Estimation under Shape Constraints

Maximum Likelihood Estimation under Shape Constraints Maximum Likelihood Estimation under Shape Constraints Hanna K. Jankowski June 2 3, 29 Contents 1 Introduction 2 2 The MLE of a decreasing density 3 2.1 Finding the Estimator..............................

More information

University of Groningen. Statistical inference via fiducial methods Salomé, Diemer

University of Groningen. Statistical inference via fiducial methods Salomé, Diemer University of Groningen Statistical inference via fiducial methods Salomé, Diemer IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's PDF) if you wish to cite from it. Please

More information

asymptotic normality of nonparametric M-estimators with applications to hypothesis testing for panel count data

asymptotic normality of nonparametric M-estimators with applications to hypothesis testing for panel count data asymptotic normality of nonparametric M-estimators with applications to hypothesis testing for panel count data Xingqiu Zhao and Ying Zhang The Hong Kong Polytechnic University and Indiana University Abstract:

More information

A nonparametric two-sample wald test of equality of variances

A nonparametric two-sample wald test of equality of variances University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 211 A nonparametric two-sample wald test of equality of variances David

More information

Smooth nonparametric estimation of a quantile function under right censoring using beta kernels

Smooth nonparametric estimation of a quantile function under right censoring using beta kernels Smooth nonparametric estimation of a quantile function under right censoring using beta kernels Chanseok Park 1 Department of Mathematical Sciences, Clemson University, Clemson, SC 29634 Short Title: Smooth

More information

A Law of the Iterated Logarithm. for Grenander s Estimator

A Law of the Iterated Logarithm. for Grenander s Estimator A Law of the Iterated Logarithm for Grenander s Estimator Jon A. Wellner University of Washington, Seattle Probability Seminar October 24, 2016 Based on joint work with: Lutz Dümbgen and Malcolm Wolff

More information

STAT 331. Martingale Central Limit Theorem and Related Results

STAT 331. Martingale Central Limit Theorem and Related Results STAT 331 Martingale Central Limit Theorem and Related Results In this unit we discuss a version of the martingale central limit theorem, which states that under certain conditions, a sum of orthogonal

More information

Exercises. (a) Prove that m(t) =

Exercises. (a) Prove that m(t) = Exercises 1. Lack of memory. Verify that the exponential distribution has the lack of memory property, that is, if T is exponentially distributed with parameter λ > then so is T t given that T > t for

More information

PROBABILITY DISTRIBUTIONS

PROBABILITY DISTRIBUTIONS Review of PROBABILITY DISTRIBUTIONS Hideaki Shimazaki, Ph.D. http://goo.gl/visng Poisson process 1 Probability distribution Probability that a (continuous) random variable X is in (x,x+dx). ( ) P x < X

More information

Survival Analysis. Lu Tian and Richard Olshen Stanford University

Survival Analysis. Lu Tian and Richard Olshen Stanford University 1 Survival Analysis Lu Tian and Richard Olshen Stanford University 2 Survival Time/ Failure Time/Event Time We will introduce various statistical methods for analyzing survival outcomes What is the survival

More information

Nonparametric estimation under Shape Restrictions

Nonparametric estimation under Shape Restrictions Nonparametric estimation under Shape Restrictions Jon A. Wellner University of Washington, Seattle Statistical Seminar, Frejus, France August 30 - September 3, 2010 Outline: Five Lectures on Shape Restrictions

More information

Lecture 7 Introduction to Statistical Decision Theory

Lecture 7 Introduction to Statistical Decision Theory Lecture 7 Introduction to Statistical Decision Theory I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw December 20, 2016 1 / 55 I-Hsiang Wang IT Lecture 7

More information

Monte Carlo Studies. The response in a Monte Carlo study is a random variable.

Monte Carlo Studies. The response in a Monte Carlo study is a random variable. Monte Carlo Studies The response in a Monte Carlo study is a random variable. The response in a Monte Carlo study has a variance that comes from the variance of the stochastic elements in the data-generating

More information

Testing Goodness-of-Fit for Exponential Distribution Based on Cumulative Residual Entropy

Testing Goodness-of-Fit for Exponential Distribution Based on Cumulative Residual Entropy This article was downloaded by: [Ferdowsi University] On: 16 April 212, At: 4:53 Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 172954 Registered office: Mortimer

More information

Research Article Exponential Inequalities for Positively Associated Random Variables and Applications

Research Article Exponential Inequalities for Positively Associated Random Variables and Applications Hindawi Publishing Corporation Journal of Inequalities and Applications Volume 008, Article ID 38536, 11 pages doi:10.1155/008/38536 Research Article Exponential Inequalities for Positively Associated

More information

Asymptotic Nonequivalence of Nonparametric Experiments When the Smoothness Index is ½

Asymptotic Nonequivalence of Nonparametric Experiments When the Smoothness Index is ½ University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 1998 Asymptotic Nonequivalence of Nonparametric Experiments When the Smoothness Index is ½ Lawrence D. Brown University

More information

Estimation and Inference of Quantile Regression. for Survival Data under Biased Sampling

Estimation and Inference of Quantile Regression. for Survival Data under Biased Sampling Estimation and Inference of Quantile Regression for Survival Data under Biased Sampling Supplementary Materials: Proofs of the Main Results S1 Verification of the weight function v i (t) for the lengthbiased

More information

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates Communications in Statistics - Theory and Methods ISSN: 0361-0926 (Print) 1532-415X (Online) Journal homepage: http://www.tandfonline.com/loi/lsta20 Analysis of Gamma and Weibull Lifetime Data under a

More information

Estimation of the Bivariate and Marginal Distributions with Censored Data

Estimation of the Bivariate and Marginal Distributions with Censored Data Estimation of the Bivariate and Marginal Distributions with Censored Data Michael Akritas and Ingrid Van Keilegom Penn State University and Eindhoven University of Technology May 22, 2 Abstract Two new

More information

DA Freedman Notes on the MLE Fall 2003

DA Freedman Notes on the MLE Fall 2003 DA Freedman Notes on the MLE Fall 2003 The object here is to provide a sketch of the theory of the MLE. Rigorous presentations can be found in the references cited below. Calculus. Let f be a smooth, scalar

More information

Stochastic Comparisons of Order Statistics from Generalized Normal Distributions

Stochastic Comparisons of Order Statistics from Generalized Normal Distributions A^VÇÚO 1 33 ò 1 6 Ï 2017 c 12 Chinese Journal of Applied Probability and Statistics Dec. 2017 Vol. 33 No. 6 pp. 591-607 doi: 10.3969/j.issn.1001-4268.2017.06.004 Stochastic Comparisons of Order Statistics

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

FULL LIKELIHOOD INFERENCES IN THE COX MODEL October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach

More information

Nonparametric estimation: s concave and log-concave densities: alternatives to maximum likelihood

Nonparametric estimation: s concave and log-concave densities: alternatives to maximum likelihood Nonparametric estimation: s concave and log-concave densities: alternatives to maximum likelihood Jon A. Wellner University of Washington, Seattle Statistics Seminar, York October 15, 2015 Statistics Seminar,

More information

Research Article Least Squares Estimators for Unit Root Processes with Locally Stationary Disturbance

Research Article Least Squares Estimators for Unit Root Processes with Locally Stationary Disturbance Advances in Decision Sciences Volume, Article ID 893497, 6 pages doi:.55//893497 Research Article Least Squares Estimators for Unit Root Processes with Locally Stationary Disturbance Junichi Hirukawa and

More information

Refining the Central Limit Theorem Approximation via Extreme Value Theory

Refining the Central Limit Theorem Approximation via Extreme Value Theory Refining the Central Limit Theorem Approximation via Extreme Value Theory Ulrich K. Müller Economics Department Princeton University February 2018 Abstract We suggest approximating the distribution of

More information

EMPIRICAL ENVELOPE MLE AND LR TESTS. Mai Zhou University of Kentucky

EMPIRICAL ENVELOPE MLE AND LR TESTS. Mai Zhou University of Kentucky EMPIRICAL ENVELOPE MLE AND LR TESTS Mai Zhou University of Kentucky Summary We study in this paper some nonparametric inference problems where the nonparametric maximum likelihood estimator (NPMLE) are

More information

1. Introduction In many biomedical studies, the random survival time of interest is never observed and is only known to lie before an inspection time

1. Introduction In many biomedical studies, the random survival time of interest is never observed and is only known to lie before an inspection time ASYMPTOTIC PROPERTIES OF THE GMLE WITH CASE 2 INTERVAL-CENSORED DATA By Qiqing Yu a;1 Anton Schick a, Linxiong Li b;2 and George Y. C. Wong c;3 a Dept. of Mathematical Sciences, Binghamton University,

More information

Nonparametric estimation of log-concave densities

Nonparametric estimation of log-concave densities Nonparametric estimation of log-concave densities Jon A. Wellner University of Washington, Seattle Northwestern University November 5, 2010 Conference on Shape Restrictions in Non- and Semi-Parametric

More information

Asymptotic statistics using the Functional Delta Method

Asymptotic statistics using the Functional Delta Method Quantiles, Order Statistics and L-Statsitics TU Kaiserslautern 15. Februar 2015 Motivation Functional The delta method introduced in chapter 3 is an useful technique to turn the weak convergence of random

More information

Modelling the risk process

Modelling the risk process Modelling the risk process Krzysztof Burnecki Hugo Steinhaus Center Wroc law University of Technology www.im.pwr.wroc.pl/ hugo Modelling the risk process 1 Risk process If (Ω, F, P) is a probability space

More information

Asymptotic Distributions for the Nelson-Aalen and Kaplan-Meier estimators and for test statistics.

Asymptotic Distributions for the Nelson-Aalen and Kaplan-Meier estimators and for test statistics. Asymptotic Distributions for the Nelson-Aalen and Kaplan-Meier estimators and for test statistics. Dragi Anevski Mathematical Sciences und University November 25, 21 1 Asymptotic distributions for statistical

More information

Examination paper for TMA4275 Lifetime Analysis

Examination paper for TMA4275 Lifetime Analysis Department of Mathematical Sciences Examination paper for TMA4275 Lifetime Analysis Academic contact during examination: Ioannis Vardaxis Phone: 95 36 00 26 Examination date: Saturday May 30 2015 Examination

More information

Extremum Sieve Estimation in k-out-of-n Systems. 1

Extremum Sieve Estimation in k-out-of-n Systems. 1 Extremum Sieve Estimation in k-out-of-n Systems. 1 Tatiana Komarova, 2 This version: July 24 2013 ABSTRACT The paper considers nonparametric estimation of absolutely continuous distribution functions of

More information

ORDER RESTRICTED STATISTICAL INFERENCE ON LORENZ CURVES OF PARETO DISTRIBUTIONS. Myongsik Oh. 1. Introduction

ORDER RESTRICTED STATISTICAL INFERENCE ON LORENZ CURVES OF PARETO DISTRIBUTIONS. Myongsik Oh. 1. Introduction J. Appl. Math & Computing Vol. 13(2003), No. 1-2, pp. 457-470 ORDER RESTRICTED STATISTICAL INFERENCE ON LORENZ CURVES OF PARETO DISTRIBUTIONS Myongsik Oh Abstract. The comparison of two or more Lorenz

More information

NETWORK-REGULARIZED HIGH-DIMENSIONAL COX REGRESSION FOR ANALYSIS OF GENOMIC DATA

NETWORK-REGULARIZED HIGH-DIMENSIONAL COX REGRESSION FOR ANALYSIS OF GENOMIC DATA Statistica Sinica 213): Supplement NETWORK-REGULARIZED HIGH-DIMENSIONAL COX REGRESSION FOR ANALYSIS OF GENOMIC DATA Hokeun Sun 1, Wei Lin 2, Rui Feng 2 and Hongzhe Li 2 1 Columbia University and 2 University

More information

Nonparametric estimation of. s concave and log-concave densities: alternatives to maximum likelihood

Nonparametric estimation of. s concave and log-concave densities: alternatives to maximum likelihood Nonparametric estimation of s concave and log-concave densities: alternatives to maximum likelihood Jon A. Wellner University of Washington, Seattle Cowles Foundation Seminar, Yale University November

More information

ST745: Survival Analysis: Nonparametric methods

ST745: Survival Analysis: Nonparametric methods ST745: Survival Analysis: Nonparametric methods Eric B. Laber Department of Statistics, North Carolina State University February 5, 2015 The KM estimator is used ubiquitously in medical studies to estimate

More information

SOME CONVERSE LIMIT THEOREMS FOR EXCHANGEABLE BOOTSTRAPS

SOME CONVERSE LIMIT THEOREMS FOR EXCHANGEABLE BOOTSTRAPS SOME CONVERSE LIMIT THEOREMS OR EXCHANGEABLE BOOTSTRAPS Jon A. Wellner University of Washington The bootstrap Glivenko-Cantelli and bootstrap Donsker theorems of Giné and Zinn (990) contain both necessary

More information

and Comparison with NPMLE

and Comparison with NPMLE NONPARAMETRIC BAYES ESTIMATOR OF SURVIVAL FUNCTIONS FOR DOUBLY/INTERVAL CENSORED DATA and Comparison with NPMLE Mai Zhou Department of Statistics, University of Kentucky, Lexington, KY 40506 USA http://ms.uky.edu/

More information

Minimum distance tests and estimates based on ranks

Minimum distance tests and estimates based on ranks Minimum distance tests and estimates based on ranks Authors: Radim Navrátil Department of Mathematics and Statistics, Masaryk University Brno, Czech Republic (navratil@math.muni.cz) Abstract: It is well

More information

Understanding Regressions with Observations Collected at High Frequency over Long Span

Understanding Regressions with Observations Collected at High Frequency over Long Span Understanding Regressions with Observations Collected at High Frequency over Long Span Yoosoon Chang Department of Economics, Indiana University Joon Y. Park Department of Economics, Indiana University

More information

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model Other Survival Models (1) Non-PH models We briefly discussed the non-proportional hazards (non-ph) model λ(t Z) = λ 0 (t) exp{β(t) Z}, where β(t) can be estimated by: piecewise constants (recall how);

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

Asymptotic Statistics-VI. Changliang Zou

Asymptotic Statistics-VI. Changliang Zou Asymptotic Statistics-VI Changliang Zou Kolmogorov-Smirnov distance Example (Kolmogorov-Smirnov confidence intervals) We know given α (0, 1), there is a well-defined d = d α,n such that, for any continuous

More information

Efficiency of Profile/Partial Likelihood in the Cox Model

Efficiency of Profile/Partial Likelihood in the Cox Model Efficiency of Profile/Partial Likelihood in the Cox Model Yuichi Hirose School of Mathematics, Statistics and Operations Research, Victoria University of Wellington, New Zealand Summary. This paper shows

More information

Multistate Modeling and Applications

Multistate Modeling and Applications Multistate Modeling and Applications Yang Yang Department of Statistics University of Michigan, Ann Arbor IBM Research Graduate Student Workshop: Statistics for a Smarter Planet Yang Yang (UM, Ann Arbor)

More information

Empirical Power of Four Statistical Tests in One Way Layout

Empirical Power of Four Statistical Tests in One Way Layout International Mathematical Forum, Vol. 9, 2014, no. 28, 1347-1356 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/imf.2014.47128 Empirical Power of Four Statistical Tests in One Way Layout Lorenzo

More information

Parameter addition to a family of multivariate exponential and weibull distribution

Parameter addition to a family of multivariate exponential and weibull distribution ISSN: 2455-216X Impact Factor: RJIF 5.12 www.allnationaljournal.com Volume 4; Issue 3; September 2018; Page No. 31-38 Parameter addition to a family of multivariate exponential and weibull distribution

More information

3 Integration and Expectation

3 Integration and Expectation 3 Integration and Expectation 3.1 Construction of the Lebesgue Integral Let (, F, µ) be a measure space (not necessarily a probability space). Our objective will be to define the Lebesgue integral R fdµ

More information

The properties of L p -GMM estimators

The properties of L p -GMM estimators The properties of L p -GMM estimators Robert de Jong and Chirok Han Michigan State University February 2000 Abstract This paper considers Generalized Method of Moment-type estimators for which a criterion

More information

Approximate Self Consistency for Middle-Censored Data

Approximate Self Consistency for Middle-Censored Data Approximate Self Consistency for Middle-Censored Data by S. Rao Jammalamadaka Department of Statistics & Applied Probability, University of California, Santa Barbara, CA 93106, USA. and Srikanth K. Iyer

More information

Poisson Processes. Particles arriving over time at a particle detector. Several ways to describe most common model.

Poisson Processes. Particles arriving over time at a particle detector. Several ways to describe most common model. Poisson Processes Particles arriving over time at a particle detector. Several ways to describe most common model. Approach 1: a) numbers of particles arriving in an interval has Poisson distribution,

More information

Functional Limit theorems for the quadratic variation of a continuous time random walk and for certain stochastic integrals

Functional Limit theorems for the quadratic variation of a continuous time random walk and for certain stochastic integrals Functional Limit theorems for the quadratic variation of a continuous time random walk and for certain stochastic integrals Noèlia Viles Cuadros BCAM- Basque Center of Applied Mathematics with Prof. Enrico

More information

ROBUST - September 10-14, 2012

ROBUST - September 10-14, 2012 Charles University in Prague ROBUST - September 10-14, 2012 Linear equations We observe couples (y 1, x 1 ), (y 2, x 2 ), (y 3, x 3 ),......, where y t R, x t R d t N. We suppose that members of couples

More information

Estimating monotone, unimodal and U shaped failure rates using asymptotic pivots

Estimating monotone, unimodal and U shaped failure rates using asymptotic pivots Estimating monotone, unimodal and U shaped failure rates using asymptotic pivots Moulinath Banerjee University of Michigan March 17, 2006 Abstract We propose a new method for pointwise estimation of monotone,

More information

Interval Estimation. Chapter 9

Interval Estimation. Chapter 9 Chapter 9 Interval Estimation 9.1 Introduction Definition 9.1.1 An interval estimate of a real-values parameter θ is any pair of functions, L(x 1,..., x n ) and U(x 1,..., x n ), of a sample that satisfy

More information

Mathematics Department, Birjand Branch, Islamic Azad University, Birjand, Iran

Mathematics Department, Birjand Branch, Islamic Azad University, Birjand, Iran REVSTAT Statistical Journal Volume 14, Number 3, June 216, 229 244 PROPERTIES OF n-laplace TRANSFORM RATIO ORDER AND -CLASS OF LIFE DISTRIBUTIONS Authors: Jalil Jarrahiferiz Mathematics Department, Birjand

More information

On probabilities of large and moderate deviations for L-statistics: a survey of some recent developments

On probabilities of large and moderate deviations for L-statistics: a survey of some recent developments UDC 519.2 On probabilities of large and moderate deviations for L-statistics: a survey of some recent developments N. V. Gribkova Department of Probability Theory and Mathematical Statistics, St.-Petersburg

More information

Theoretical Statistics. Lecture 1.

Theoretical Statistics. Lecture 1. 1. Organizational issues. 2. Overview. 3. Stochastic convergence. Theoretical Statistics. Lecture 1. eter Bartlett 1 Organizational Issues Lectures: Tue/Thu 11am 12:30pm, 332 Evans. eter Bartlett. bartlett@stat.

More information

Closest Moment Estimation under General Conditions

Closest Moment Estimation under General Conditions Closest Moment Estimation under General Conditions Chirok Han and Robert de Jong January 28, 2002 Abstract This paper considers Closest Moment (CM) estimation with a general distance function, and avoids

More information

δ -method and M-estimation

δ -method and M-estimation Econ 2110, fall 2016, Part IVb Asymptotic Theory: δ -method and M-estimation Maximilian Kasy Department of Economics, Harvard University 1 / 40 Example Suppose we estimate the average effect of class size

More information

A comparison study of the nonparametric tests based on the empirical distributions

A comparison study of the nonparametric tests based on the empirical distributions 통계연구 (2015), 제 20 권제 3 호, 1-12 A comparison study of the nonparametric tests based on the empirical distributions Hyo-Il Park 1) Abstract In this study, we propose a nonparametric test based on the empirical

More information

Goodness-of-Fit Tests for Time Series Models: A Score-Marked Empirical Process Approach

Goodness-of-Fit Tests for Time Series Models: A Score-Marked Empirical Process Approach Goodness-of-Fit Tests for Time Series Models: A Score-Marked Empirical Process Approach By Shiqing Ling Department of Mathematics Hong Kong University of Science and Technology Let {y t : t = 0, ±1, ±2,

More information

Theorem 2.1 (Caratheodory). A (countably additive) probability measure on a field has an extension. n=1

Theorem 2.1 (Caratheodory). A (countably additive) probability measure on a field has an extension. n=1 Chapter 2 Probability measures 1. Existence Theorem 2.1 (Caratheodory). A (countably additive) probability measure on a field has an extension to the generated σ-field Proof of Theorem 2.1. Let F 0 be

More information

THE DVORETZKY KIEFER WOLFOWITZ INEQUALITY WITH SHARP CONSTANT: MASSART S 1990 PROOF SEMINAR, SEPT. 28, R. M. Dudley

THE DVORETZKY KIEFER WOLFOWITZ INEQUALITY WITH SHARP CONSTANT: MASSART S 1990 PROOF SEMINAR, SEPT. 28, R. M. Dudley THE DVORETZKY KIEFER WOLFOWITZ INEQUALITY WITH SHARP CONSTANT: MASSART S 1990 PROOF SEMINAR, SEPT. 28, 2011 R. M. Dudley 1 A. Dvoretzky, J. Kiefer, and J. Wolfowitz 1956 proved the Dvoretzky Kiefer Wolfowitz

More information

Analysis of Type-II Progressively Hybrid Censored Data

Analysis of Type-II Progressively Hybrid Censored Data Analysis of Type-II Progressively Hybrid Censored Data Debasis Kundu & Avijit Joarder Abstract The mixture of Type-I and Type-II censoring schemes, called the hybrid censoring scheme is quite common in

More information

Survival Analysis Math 434 Fall 2011

Survival Analysis Math 434 Fall 2011 Survival Analysis Math 434 Fall 2011 Part IV: Chap. 8,9.2,9.3,11: Semiparametric Proportional Hazards Regression Jimin Ding Math Dept. www.math.wustl.edu/ jmding/math434/fall09/index.html Basic Model Setup

More information