Comparing Distribution Functions via Empirical Likelihood

Size: px
Start display at page:

Download "Comparing Distribution Functions via Empirical Likelihood"

Transcription

1 Georgia State University Georgia State University Mathematics and Statistics Faculty Publications Department of Mathematics and Statistics 25 Comparing Distribution Functions via Empirical Likelihood Yichuan Zhao Georgia State University, yichuan@gsu.edu Ian W. McKeague Follow this and additional works at: Part of the Biostatistics Commons, and the Mathematics Commons Recommended Citation McKeague, Ian W. and Zhao, Yichuan 25 "Comparing Distribution Functions via Empirical Likelihood," The International Journal of Biostatistics: Vol. 1 : Iss. 1, Article 5. This Article is brought to you for free and open access by the Department of Mathematics and Statistics at Georgia State University. It has been accepted for inclusion in Mathematics and Statistics Faculty Publications by an authorized administrator of Georgia State University. For more information, please contact scholarworks@gsu.edu.

2 The International Journal of Biostatistics Volume 1, Issue 1 25 Article 5 Comparing Distribution Functions via Empirical Likelihood Ian W. McKeague Yichuan Zhao Columbia University, im2131@columbia.edu Georgia State University, dz27@gmail.com Copyright c 25 The Berkeley Electronic Press. All rights reserved.

3 Comparing Distribution Functions via Empirical Likelihood Ian W. McKeague and Yichuan Zhao Abstract This paper develops empirical likelihood based simultaneous confidence bands for differences and ratios of two distribution functions from independent samples of right-censored survival data. The proposed confidence bands provide a flexible way of comparing treatments in biomedical settings, and bring empirical likelihood methods to bear on important target functions for which only Wald-type confidence bands have been available in the literature. The approach is illustrated with a real data example. KEYWORDS: Kaplan Meier estimator, Nelson Aalen estimator, plug-in, right censoring

4 1 Introduction McKeague and Zhao: Comparing Distribution Functions The purpose of this paper is to develop simultaneous confidence bands for the difference and the ratio of two distribution functions using the empirical likelihood approach. The proposed confidence bands provide an attractive graphical comparison of treatment and control groups in biomedical studies on the basis of independent right-censored survival time data from each group. The graphical comparison of two survival distributions can be done in various ways. Empirical likelihood EL techniques have been used to provide confidence bands for Q- Q plots Einmahl and McKeague, 1999, ratios of survival functions McKeague and Zhao, 22, and P-P plots Claeskens et al., 23; references to the earlier literature may be found in these papers. The simplest and most natural way to carry out such a comparison, however, is to target the difference and the ratio of the two distribution functions, which represent directly interpretable measures of treatment effect and relative risk. The difference is suitable when an absolute measure of treatment effect is needed, the ratio when a relative measure is needed. Parzen et al constructed a Waldtype simultaneous confidence band for a difference between two distribution functions, but, as far as we know, there is no EL band available in the literature. We develop our approach in terms of differences and ratios of linear functionals of the cumulative hazard functions: αt = t g 1 s da 1 s t g 2 s da 2 s and t / t βt = g 1 s da 1 s g 2 s da 2 s, where A j t is the cumulative hazard function for group j, and g j t =S j t =1 F j tis the survival function for group j. The difference between the two distribution functions is then seen to be αt =F 1 t F 2 t =S 2 t S 1 t, and the ratio of the two distribution functions is βt =F 1 t/f 2 t. The difference and the ratio of the cumulative hazard functions are obtained by taking g j 1, and our approach extends essentially without change to that case as well. These various ways of comparing the two distributions provide greater flexibility than what is currently available. Published by The Berkeley Electronic Press, 25 1

5 The International Journal of Biostatistics, Vol. 1 [25], Iss. 1, Art. 5 The unknown survival functions g j = S j in the above representations of αt and βt can be seen as nuisance parameters in the EL statistic, so we take the approach of plugging-in the corresponding Kaplan Meier estimates. Plug-in for unknown parameters in estimating equations has been used extensively in conjunction with EL, and typically perturbs the usual chi-squared limit distribution into a more complicated form, see, e.g., Hjort et al. 25. The present case is no exception: we find that the empirical likelihood statistic with plug-in of the Kaplan Meier estimates is not asymptotically distribution free; a bootstrap procedure is thus needed to determine critical values for the EL confidence bands. An EL confidence interval for a linear functional gs das of a cumulative hazard function A, where g is known, has been developed by Pan and Zhou 22. Their approach is based on a Poisson extension of the likelihood cf. Murphy, 1995, but we found that it is not easy to deal with the target functions αt and βt using a likelihood of this form. Instead, our approach is based on the standard nonparametric likelihood for S 1,S 2, cf. McKeague and Zhao 22. The EL function or nonparametric likelihood ratio is constructed by substituting A j t = log S j t in the estimating equation that defines the target function of interest, and we find that this leads to a mathematically tractable formulation. The main results underlying our derivation of the proposed confidence bands are presented in Section 2. In Section 3 we develop the bootstrap procedure needed to construct the bands in practice. In Section 4 we give an illustrative example. Some concluding remarks are given in Section 5. Proofs are contained in the Appendix. 2 Main results 2.1 Preliminaries We consider the standard two-sample framework with independent right censoring. That is, we have two independent samples of i.i.d. observations of the form Z ji,δ ji, where j =1, 2 indexes the sample, i =1,...,n j indexes the observations within each sample, and Z ji = X ji Y ji, δ ji =1 {Xji Y ji }. The distribution functions of X ji and Y ji are DOI: 1.222/

6 McKeague and Zhao: Comparing Distribution Functions denoted F j and G j, respectively. The survival functions S j =1 F j are assumed to be continuous. The total sample size is n = n 1 + n 2. We work with independent and non-negative X ji and Y ji. The nonparametric likelihood is given by L S 1, S 2 = n 2 j { S j Z ji S j Z ji } δ ji Sj Z ji 1 δ ji, 2.1 j=1 where S j belongs to Γ, the space of all survival functions on [,. The target functions αt and βt may be written in the general form θt = θt, S 1,S 2,g 1,g 2 by substitution of the cumulative hazard functions A j t = log S j t. The empirical likelihood ratio for θt, with plug-in of estimators ĝ j for the g j, is then given by R θt,t,ĝ 1, ĝ 2 = sup{l S 1, S 2 :θt, S 1, S 2, ĝ 1, ĝ 2 = θt, S 1, S 2 Γ Γ} sup{l S 1, S 2 : S 1, S, 2 Γ Γ} 2.2 which can be expressed more explicitly in the case of θt = αt as follows. The ordered uncensored survival times, i.e., the X ji with corresponding δ ji = 1, are written T j1 T jnj <, and r ji = n j k=1 1 {Z jk T ji } denotes the size of the risk set at T ji, d ji = n j k=1 1 {Z jk =T ji,δ jk =1} denotes the number of deaths occurring at time T ji. Define K j t =#{i : T ji t} and D j = max i:tji td ji r ji /ĝ j T ji 1 {ĝj T ji >}. Using Lagrange s method [cf. Thomas and Grunkemeier 1975 or Li 1995], it can be shown that = 2 2 log Rαt,t,ĝ 1, ĝ 2 2 K 1 t K 2 t d 1i log 1+ λ nĝ 1 T 1i log 1+ λ nĝ 1 T 1i d 1i r 2i d 2i log 1 λ nĝ 2 T 2i r 2i log 1 λ nĝ 2 T 2i r 2i d 2i r 2i where the Lagrange multiplier D 1 <λ n < D 2 satisfies the equation K 1 t log 1 d 1i ĝ 1 T 1i + λ n ĝ 1 T 1i K 2 t log 1 d 2i ĝ 2 T 2i r 2i λ n ĝ 2 T 2i 2.3 = αt. 2.4 Published by The Berkeley Electronic Press, 25 3

7 The International Journal of Biostatistics, Vol. 1 [25], Iss. 1, Art. 5 Here we are suppressing the dependence of the Lagrange multiplier on t. The equation 2.4 has a unique solution λ n provided D j <, ĝ j T ji >, i =1,...,N j, j =1, 2, because as a function of λ n the l.h.s. of 2.4 is continuous, strictly increasing and tends to ± as λ n D 2 or λ n D 1. Similarly, in the case of θt =βt, we have 2 log Rβt,t,ĝ 1, ĝ 2 = 2 2 j=1 K j t r ji log { r ji d ji log 1+ βtj 1 λ n ĝ j T ji r ji d ji } 1+ βtj 1 λ n ĝ j T ji, 2.5 r ji where the Lagrange multiplier D 1 <λ n < D 2 /βt satisfies the equation K 1 t log 1 K 2 t βt d 1i ĝ 1 T 1i + λ n ĝ 1 T 1i log 1 d 2i ĝ 2 T 2i =. 2.6 r 2i βtλ n ĝ 2 T 2i If βt >, the equation 2.6 has a unique solution λ n provided ĝ j T ji >, i = 1,...,K j t, j =1, 2, because, as a function of λ n, the l.h.s. of 2.6 is continuous, strictly increasing and tends to ± as λ n D 2 /βt orλ n D 1. We assume throughout that n j /n p j > asn. The plug-in estimate of g j t =S j t is specified by ĝ j t =S j,nj t, where S j,nj t is the Kaplan Meier estimator of S j t. Define H j s =S j s1 G j s, let τ 1 be such that S j τ 1 < 1, and let τ 2 τ 1 be such that H j τ 2 >, j =1, 2. For future convenience, we define γ j t = t df j s 1 G j s, 2.7 σdiff 2 t =γ 1t/p 1 + γ 2 t/p 2 and σratiot 2 =γ 1 t/p 1 + βt 2 γ 2 t/p 2. These functions appear in the limiting distributions of the likelihood ratio statistics and need to be estimated. It can be shown see Lemma A.3 that ˆσ diff 2 t =n[ˆγ 1t/n 1 +ˆγ 2 t/n 2 ] and ˆσ ratiot 2 =n[ˆγ 1 t/n 1 + ˆβt 2ˆγ 2 t/n 2 ] are uniformly consistent estimators of σdiff 2 t and σratiot 2 over [τ 1,τ 2 ], where ˆγ j t = DOI: 1.222/ t df j,nj s 1 G j,nj s, 2.8 4

8 McKeague and Zhao: Comparing Distribution Functions F j,nj and G j,nj are the Kaplan Meier estimators of F j and G j, and ˆβt = F 1,n1 t/f 2,n2 t. 2.2 Confidence bands We now state our main results and explain how they can be used to construct the proposed simultaneous confidence bands. Theorem 2.1. The process 2 log Rαt,t,ĝ 1, ĝ 2, t [τ 1,τ 2 ], converges in distribution to U1 2 t/σdiff 2 t in D[τ 1,τ 2 ], where U 1 t is a Gaussian process with mean zero and covariance function covu 1 s,u 1 t = S 1 ss 1 tσ1s 2 t/p 1 + S 2 ss 2 tσ2s 2 t/p 2 and σ 2 j t = t df js/s j sh j s. It follows that sup 2 log Rαt,t,ĝ 1, ĝ 2 t [τ 1,τ 2 ] D U1 2 t sup t [τ 1,τ 2 ] σdiff 2 t by the continuous mapping theorem, and we obtain B diff = {t, αt : 2 log R αt,t,ĝ 1, ĝ 2 c α [τ 1,τ 2 ], t [τ 1,τ 2 ]} 2.9 as an asymptotic 11 α% confidence band for αt over[τ 1,τ 2 ], where the critical value c α [τ 1,τ 2 ] is the upper α-quantile of the distribution of sup t [τ1,τ 2 ] U1 2 t/σdiff 2 t. A simulation method is developed in Section 3 to obtain this critical value. Implementation. We now explain how to compute the confidence band B diff. For fixed t, let φλ n denote the r.h.s. of 2.3. Its derivative is φ λ n = + K 1 t K 2 t 2λ n ĝ 1 T 1i d 1i d 1i + λ n ĝ 1 T 1i + λ n ĝ 1 T 1i 2λ n ĝ 2 T 2i d 2i r 2i d 2i λ n ĝ 2 T 2i r 2i λ n ĝ 2 T 2i As in McKeague and Zhao 22, there exist exactly two roots λ L < < λ U for φλ L =φλ U =c α [τ 1,τ 2 ] and {λ n : φλ n c α [τ 1,τ 2 ]} =[λ L,λ U ]. Published by The Berkeley Electronic Press, 25 5

9 The International Journal of Biostatistics, Vol. 1 [25], Iss. 1, Art. 5 The confidence set for αt is a closed interval [ α L, α U ], where α L = K 1 t log 1 d 1i / + λ L ĝ 1 T 1i ĝ 1 T 1i + K2 t log 1 d 2i /r 2i λ L ĝ 2 T 2i ĝ 2 T 2i and α U is the same as α L, but with λ L replaced by λ U. Next we state a parallel result for βt. Theorem 2.2. The process 2 log Rβt,t,ĝ 1, ĝ 2, t [τ 1,τ 2 ], converges in distribution to U2 2 t/σratiot 2 in D[τ 1,τ 2 ], where U 2 t is a Gaussian process with mean zero and covariance function covu 2 s,u 2 t = S 1 ss 1 tσ1s t/p 2 1 +S 2 ss 2 tβsβtσ2s 2 t/p 2. It follows that an asymptotic 11 α% confidence band for βt is given by B ratio = {t, βt : 2 log R βt,t,ĝ 1, ĝ 2 K α [τ 1,τ 2 ], t [τ 1,τ 2 ]}, where K α [τ 1,τ 2 ] is the upper α-quantile of the distribution of sup t [τ1,τ 2 ] U2 2 t/σratiot, 2 which can be obtained by simulation, see Section 3. 3 Bootstrap procedure The limiting distributions of the EL statistics in Section 2 are complicated and include unknown parameters, so we need to develop a Monte Carlo method to determine the critical values. To that end we adapt the Gaussian multiplier bootstrap procedure of Lin et al for checking the adequacy of the Cox proportional hazards model. First we define some standard counting process notation: N ji t =1 {Zji t,δ ji =1}, Y ji t =1 {Zji t}, and Y j t = n j Y jit is the size of the risk set at t. The processes M ji s =N ji s s α jsy ji s ds are orthogonal locally square integrable martingales, where α j s is hazard rate corresponding to F j [see Andersen et al. 1993, II.4]. From the proof of Theorem 2.1 in the Appendix, and using the martingale representation of n j S j,nj t S j t, the process U 1 t/σ diff t is seen to be asymptotically equivalent to n 1/2 t n1 S 1,n1 t dm t n2 1is + S 2,n2 t dm 2is. ˆσ diff t Y 1 s Y 2 s DOI: 1.222/

10 McKeague and Zhao: Comparing Distribution Functions A version of this process that can be simulated is t U1 t = n1/2 n1 S 1,n1 t G 1idN 1i s t + S 2,n2 t ˆσ diff t Y 1 s n2 G 2idN 2i s Y 2 s where G 11,...,G 1n1,G 21,...,G 2n2 are i.i.d. N, 1 random variables independent of the data. Conditional on the data, the limiting distribution of U1 t coincides with that of U 1 t/σ diff t for almost all data sequences. This can be shown by noting that U1 is a Gaussian process with independent components whose covariance converges to that of U 1 /σ diff with probability 1, and by verifying tightness by applying the argument of Lin et al. 1993, Appendix 1 to each of the terms. The bootstrap resampling method is then used to obtain c α [τ 1,τ 2 ]: generate a large number, say L = 3, independent copies U11,...,U 1L of U 1 and take c α [τ 1,τ 2 ]tobe the upper α-quantile of sup U11 2 t,, sup U1Lt. 2 t [τ 1,τ 2 ] t [τ 1,τ 2 ] A similar method gives the critical value K α [τ 1,τ 2 ]: use U 2 t = n 1/2 t S 1,n1 t ˆσ ratio t n1 G 1i dn 1i s t + S 2,n2 t Y 1 s ˆβt as the bootstrap approximation for U 2 t/σ ratio t. n2, G 2i dn 2i s Y 2 s, 4 Numerical example The data come from a Mayo Clinic trial involving a treatment for primary biliary cirrhosis of the liver, see Fleming and Harrington 1991 for discussion. A total of n = 312 patients participated in the randomized clinical trial, 158 receiving the treatment Dpenicillamine and 154 receiving a placebo. Censoring is heavy 187 of the 312 observations are censored. We use τ 1 = 34 and τ 2 = 4427, respectively. Figure 1 displays the proposed confidence band B diff for the difference of the distribution functions placebo minus treatment. The corresponding difference of the Kaplan Meier curves is also displayed. Note that the simultaneous band contains the horizontal line difference =, so there is no evidence of a difference between treatment Published by The Berkeley Electronic Press, 25 7

11 The International Journal of Biostatistics, Vol. 1 [25], Iss. 1, Art. 5 and placebo. As expected, the confidence band is much narrower in the left tail than in the right. Difference of distribution functions Empirical likelihood confidence band Estimated difference of distribution functions Days after start of treatment Figure 1: Mayo Clinic trial, 95% simultaneous confidence band for the difference of distribution functions placebo treatment. Empirical likelihood confidence band Estimated ratio of distribution functions Ratio of distribution functions Days after start of treatment Figure 2: Mayo Clinic trial, 9% simultaneous confidence band for the ratio of distribution functions placebo/treatment. Figure 2 displays the proposed confidence band B ratio for the ratio of the distribu- DOI: 1.222/

12 McKeague and Zhao: Comparing Distribution Functions tion functions placebo over treatment. The corresponding estimate of the ratio of the distribution functions is also displayed. Note that the simultaneous band contains the horizontal line ratio = 1, so there is no evidence of a difference between treatment and placebo on the basis of this analysis. The lower bound of confidence band is greater than zero, which is within the range of parameter βt. As expected, the confidence band is much narrower in the right tail than in the left: the distribution function tends to zero in the left tail, so the variance of the ratio estimate blows up. 5 Discussion This article has developed an empirical likelihood approach for comparing the distributions of survival times from two independent samples of right censored survival data in terms of ratios, differences, and other functionals of the underlying distribution functions. Our methods have potential application in epidemiological cohort studies, and in randomized clinical trials for the comparison of treatment and placebo groups. Standard approaches to making such comparisons have been via pointwise confidence intervals expressed in terms of Kaplan Meier estimates, with the standard errors computed by the bootstrap or Greenwood s formula in conjunction with the delta method; see, e.g., Brenner and Hakulinen 25 on the estimation of relative survival rates of cancer patients. We have shown, on the other hand, that it is possible to obtain simultaneous confidence bands in this setting without relying on the Wald approach of centering the confidence band on a point estimate of the target function cf., Parzen et al., Our approach gives an added perspective to that obtained from other EL-type confidence bands for the comparison of survival distributions: Q-Q plots Einmahl and McKeague, 1999, ratios of survival functions McKeague and Zhao, 22, and P-P plots Claeskens et al., 23, the latter only being available in the absence of censoring. Our proposed confidence bands are computationally intensive compared with the closed form of Wald-type bands because they require the solution of a nonlinear equation at each uncensored survival time, and rely on the Gaussian multiplier simulation technique. For this reason, a simulation study to assess their performance would be Published by The Berkeley Electronic Press, 25 9

13 The International Journal of Biostatistics, Vol. 1 [25], Iss. 1, Art. 5 time-consuming, and we have not carried out such a study for the present article. Nevertheless, based on a previous simulation study of an EL-type confidence band in a survival analysis setting Hollander et al., 1997, we expect that the present EL bands will have significant advantages over their Wald-type counterparts in terms of coverage accuracy and adaption to skewness in the sampling distribution of the point estimates. Acknowledgements Ian McKeague acknowledges partial support under NSF grant DMS The authors are grateful to the editor and two referees for their helpful comments. Appendix: Proofs We need the following lemma to prove Theorem 2.1. Lemma A.1. Under the assumptions of Theorem 2.1, the Lagrange multiplier solving 2.4 satisfies λ n = λ n t =O P n 1/2 uniformly over [τ 1,τ 2 ]. Proof. First assume λ n t <. Along similar lines as Li 1995, p , it can be shown that K 1 t and K 2 t log 1 log 1 d 1i ĝ 1 T 1i + λ n tĝ 1 T 1i K 1 t d 1i K 2 t d 2i d 2i ĝ 2 T 2i r 2i λ n tĝ 2 T 2i r 2i + K 2 t log Combining the above two inequalities and 2.4 we get n 1 n 1 λ n t ĝ 1 T 1i n 2 n 2 + λ n t ĝ 2 T 2i ĝ 1 T 1i, ĝ 2 T 2i 1 d 2i + d 2i ĝ 2 T 2i. r 2i r 2i DOI: 1.222/

14 McKeague and Zhao: Comparing Distribution Functions αt + K 1 t K 2 t K 2 t d 1i ĝ 1 T 1i d 2i ĝ 2 T 2i r 2i n 1 n 1 λ n ĝ 1 T 1i n 2 n 2 + λ n ĝ 2 T 2i K 2 t + d 2i ĝ 2 T 2i r 2i ĝ 2 T 2i log 1 d 2i. A.1 r 2i Denote T =[τ 1,τ 2 ], ˆθ j t = K j t ĝ j T ji d ji /r ji, and θ j t = t g jsda j s = F j t, for t T. Let θ j t = K j t log1 d ji /r ji ĝ j T ji. Using 1/1 + x 1 for x and 1/1 x 1+x for x<1, from A.1 we obtain αt+ θ 2 t ˆθ 1 t K 1 t d 1i ĝ 2 1T 1i λ n n 1. A.2 In the case that λ n t, a similar argument leads to K 1 t αt + K 1 t d 1i ĝ 1 T 1i ĝ 1 T 1i log n 1 n 1 + λ n ĝ 1 T 1i K 1 t + 1 d K 2 t 1i d 2i ĝ 2 T 2i + r 2i From A.3, in a similar fashion to A.2, we obtain d 1i ĝ 1 T 1i n 2. A.3 n 2 λ n ĝ 2 T 2i αt+ θ 1 t ˆθ 2 t K 2 t d 2i ĝ 2 2T 2i r 2i λ n n 2. A.4 Next, in terms of the Nelson Aalen estimator Âj of A j,wehave nj ˆθ j t θ j t = t t n j S j,nj s dâjs S j s da j s = n j S j t S j,nj t, from the Volterra integral equation that relates the Nelson Aalen estimator and the Kaplan Meier estimator see Andersen et al., 1993, p. 92, and hence n j ˆθ j t D θ j t,j =1, 2 S j tu j t,j =1, 2 in D[τ 1,τ 2 ] D[τ 1,τ 2 ], where the U j t are Published by The Berkeley Electronic Press, 25 11

15 The International Journal of Biostatistics, Vol. 1 [25], Iss. 1, Art. 5 independent Gaussian martingales with mean zero and varu j s = σj 2 s Andersen et al., 1993, p By n j /n p j >, it follows that n{[ˆθ1 t θ 1 t] [ˆθ 2 t θ 2 t]} or in terms of θ 1 t θ 2 t =αt, we have D S 1tU 1 t p1 + S 2tU 2 t p2, nˆθ1 t ˆθ 2 t αt D S 1tU 1 t p1 + S 2tU 2 t p2. A.5 Using log1 x+x x 2 for x<1, we have nj ˆθ j t θ j t = K j t n j = max i K j t max i K j t log 1 d K j t ji d ji ĝ j T ji + r ji d ji nj r ji nj r ji Kj t ˆθ j t d ji ĝ j T ji r ji P ĝ j T ji r ji A.6 uniformly in t T, where in the last equality we use d ji = 1 a.s., which is a consequence of the continuity of S j, and in the final step we used the Glivenko Cantelli theorem and the uniform convergence in probability of ˆθ j t toθ j t ont. Combining A.5 and A.6, we have nαt ˆθ1 t+ θ 2 t D S 1tU 1 t p1 + S 2tU 2 t p2, nαt θ1 t+ˆθ 2 t D S 1tU 1 t p1 + S 2tU 2 t p2. Thus the l.h.s. of A.2 and A.4 are O P n 1/2 uniformly for t T. Applying Lenglart s inequality to the martingale integral t ĝk j s dâj A j s cf. Andersen et al., 1993, p. 19, where k 1, shows that it converges to zero uniformly in probability over t T. Since S j,nj t is a uniformly consistent estimator of S j t, we have that ĝj k t is a uniformly consistent estimator of gj k t. Thus, t ĝk j s gj k s da j s uniformly in probability over t T, and K j t d ji ĝ k j T ji r ji P t g k j sda j s A.7 DOI: 1.222/

16 McKeague and Zhao: Comparing Distribution Functions uniformly over t T, for j =1, 2 and any fixed k 1. Using n j /n p j >, A.2, A.4, A.7, we find that λ n = O P n 1/2 uniformly for t T. Proof of Theorem 2.1. Let f j λ n = K j t log1 d ji /r ji +λ n ĝ j T ji ĝ j T ji, and recall θ j t = K j t log1 d ji /r ji ĝ j T ji. Then f j = θ j t, f j = γ j t/n j, where γ j t = n j i:t ji t ĝ 2 j T ji d ji r ji r ji d ji. A.8 is a uniformly consistent estimator of γ j t over t [τ 1,τ 2 ]. This is proved in Lemma A.3. For any λ n = O P n 1/2, by a Taylor expansion we have f j 1 j 1 λ n = θ j t+ γ jt 1 j 1 λ n + f j ξ jn λ 2 n, A.9 n j 2 where ξ jn λ n. By A.7 with k =3,f j ξ jn λ 2 n = O P n 1 j. Using n j /n p j >, 2.4 and A.9 we then obtain αt = θ 1 t+ θ 2 t+ γ 1tλ n n 1 + γ 2tλ n n 2 + O P 1. A.1 It follows from A.1 that λ n = nˆσ 2 diff αt θ 1 t+ θ 2 t+o P n 1. A.11 Since λ n ĝ j T ji /r ji d ji = o P 1 and λ n ĝ j T ji /r ji = o P 1 uniformly in i = 1,...,K j τ 2, using a Taylor expansion for 2.3 and A.11 we obtain K 1 t 2 log Rαt,t,ĝ 1, ĝ 2 = λ 2 ĝ 2 K 2 t 1T 1i d 1i n r 1i d 1i + ĝ2t 2 2i d 2i r 2i r 2i d 2i 2λ3 n 3 K 1 t ĝ1t 3 1i 1 d 1i 1 2 r1i 2 Published by The Berkeley Electronic Press, 25 13

17 + 2λ3 K 2 t n 3 + λ4 n 2 + λ4 n 2 K 1 t K 2 t ĝ2t 3 2i ĝ1t 4 1i ĝ2t 4 2i 1 r 2i d 2i 2 1 r 2 2i 1 d 1i 1 3 r1i 3 1 r 2i d 2i 1 3 r2i 3 + o P 1 = nˆσ 2 diff αt θ 1 t+ θ 2 t+o P n o P 1, where in the last equality we use A.7 for k =3, 4. Combining A.5, A.6 and the uniform consistency of ˆσ diff 2 t shows that the above process has the limiting distribution indicated in the theorem. In order to prove Theorem 2.2 we need the following lemma. Lemma A.2. Under the assumptions of Theorem 2.2, the Lagrange multiplier solving 2.5 satisfies λ n = λ n t =O P n 1/2 uniformly over [τ 1,τ 2 ]. Proof. The proof follows a similar pattern to the proof of Lemma A.1. First assume λ n t <. Then, as in Li 1995, p , K 1 t log 1 The International Journal of Biostatistics, Vol. 1 [25], Iss. 1, Art. 5 d 1i ĝ 1 T 1i + λ n tĝ 1 T 1i K 1 t d 1i n 1 n 1 λ n t ĝ 1 T 1i ĝ 1 T 1i and K 2 t log 1 d 2i ĝ 2 T 2i r 2i βtλ n tĝ 2 T 2i K 2 t K 2 t log 1 d 2i + d 2i ĝ 2 T 2i r 2i r 2i d 2i r 2i n 2 ĝ 2 T 2i n 2 + βt λ n t ĝ 2 T 2i. Combining the above two inequalities and 2.6, using 1/1 + x 1 for x and 1/1 x 1+x for x<1, we obtain ˆθ 1 t+βt θ 2 t K 1 t d 1i ĝ 2 1T 1i λ n n 1. A.12 DOI: 1.222/

18 McKeague and Zhao: Comparing Distribution Functions Second, supposing λ n t, a similar argument leads to K 2 t βt d 2i ĝ 2 T 2i r 2i n 2 n 2 βt λ n ĝ 2 T 2i + + K 1 t K 1 t K 1 t In a similar fashion to A.12, from A.13 we obtain K 2 t θ 1 t βtˆθ 2 t β 2 t By n j /n p j >, we have n{[ˆθ1 t θ 1 t] βt[ˆθ 2 t θ 2 t]} D d 1i ĝ 1 T 1i d 1i ĝ 1 T 1i n 1 n 1 + λ n ĝ 1 T 1i ĝ 1 T 1i log 1 d 1i. A.13 d 2i ĝ 2 2T 2i r 2i λ n n 2. S 1tU 1 t p1 + βts 2tU 2 t p2, A.14 or in terms of θ 1 t/θ 2 t =βt, we have D nˆθ1 t βtˆθ 2 t S 1tU 1 t p1 + βts 2tU 2 t p2. A.15 Combining A.6 and A.15 gives nˆθ1 t βt θ 2 t n θ1 t βtˆθ 2 t D D S 1tU 1 t p1 S 1tU 1 t p1 + βts 2tU 2 t p2, + βts 2tU 2 t p2. Thus the l.h.s. of A.12 and A.14 are O P n 1/2. Using n j /n p j >, βt θ 1 τ 1 /θ 2 τ 2 =F 1 τ 1 /F 2 τ 2 >, A.7, A.12, and A.14, we find that λ n = O P n 1/2 uniformly for t T. Proof of Theorem 2.2. This proof is a variation of the proof of Theorem 2.1. Recall f j = γ j t/n j and note that βt θ 1 τ 2 /θ 2 τ 1, t T. For any λ n = O P n 1/2, by a Taylor expansion we have f j β j 1 tλ n = θ j t+ γ jt βt j 1 λ n + f j βt j 1 ξ jn βt 2j 1 λ 2 n, n j 2 A.16 Published by The Berkeley Electronic Press, 25 15

19 The International Journal of Biostatistics, Vol. 1 [25], Iss. 1, Art. 5 where ξ jn λ n. By A.7 with k =3,f j ξ jn λ 2 n = O P n 1 j, and using n j /n p j >, 2.6 and A.16, we obtain = θ 1 t+βt θ 2 t+ γ 1tλ n n 1 + γ 2tβ 2 tλ n n 2 + O P 1. A.17 It follows from A.17 that λ n = nˆσ 2 ratio θ 1 t βt θ 2 t+o P n 1. A.18 Since λ n ĝ j T ji /r ji d ji = o P 1 and λ n ĝ j T ji /r ji = o P 1 uniformly in i = 1,...,K j τ 2, using a Taylor expansion for 2.5 and A.18 we have K 1 t 2 log Rβt,t,ĝ 1, ĝ 2 = λ 2 ĝ 2 K 2 t 1T 1i d 1i n r 1i d 1i + β 2 tĝ2t 2 2i d 2i r 2i r 2i d 2i 2λ3 n 3 K 1 t + 2β3 tλ 3 n 3 + λ4 n 2 K 1 t + β4 tλ 4 n 2 ĝ1t 3 1i K 2 t ĝ1t 4 1i K 2 t 1 d 1i 1 2 r1i 2 ĝ2t 3 2i 1 r 2i d 2i 1 2 r2i 2 1 d 1i 1 3 r1i 3 ĝ2t 4 2i 1 r 2i d 2i 1 3 r2i 3 + o P 1 = nˆσ 2 ratio θ 1 t βt θ 2 t+o P n o P 1, where in the last equality we use A.7 for k =3, 4. Combining A.6, A.15 and the uniform consistency of ˆσ ratiot 2 completes the proof. Lemma A.3. The estimators ˆγ j and γ j defined in 2.8 and A.8, respectively, converge uniformly in probability to γ j over [τ 1,τ 2 ]. Proof. Note that φf j,g j t = t df js/1 G j s, t [,τ 2 ], is a continuous functional of cdfs F j and G j in supremum norm Andersen et al., 1993, Proposition DOI: 1.222/

20 McKeague and Zhao: Comparing Distribution Functions II.8.6., p. 113, so the result for ˆγ j follows from the uniform consistency of the Kaplan Meier estimators of F j and G j on [,τ 2 ]. The result for γ j can be proved by adapting the argument on pages of Andersen et al References Andersen, P.K., Borgan, O., Gill, R.D., and Keiding, N Statistical Models Based on Counting Processes, New York: Springer-Verlag. Brenner, H., and Hakulinen, T. 25 Substantial overestimation of standard errors of relative survival rates of cancer patients. Amer. J. Epidemiol., 161, Claeskens, G., Jing, B.-Y., Peng, L., and Zhou, W. 23 Empirical likelihood confidence regions for comparison distributions and ROC curves. Canadian J. Statist., 31, Einmahl, J.H., and McKeague, I.W Confidence tubes for multiple quantile plots via empirical likelihood. Ann. Statist., 27, Fleming, T.R., and Harrington, D.P Counting Processes and Survival Analysis, New York: Wiley. Hjort, N. L., McKeague, I. W., and Van Keilegom, I. 25 Extending the scope of empirical likelihood. Unpublished manuscript. Hollander, M., McKeague, I. W., and Yang, J Likelihood ratio based confidence bands for survival functions. J. Amer. Statist. Assoc., 92, Li, G On nonparametric likelihood ratio estimation of survival probabilities for censored data. Statist. Probab. Lett., 25, Lin, D.Y., Wei, L.J., and Ying, Z Checking the Cox model with cumulative sums of martingale-based residuals. Biometrika, 8, McKeague, I.W., and Zhao, Y. 22 Simultaneous confidence bands for ratios of survival functions via empirical likelihood. Statist. Probab. Lett., 6, Murphy, S Likelihood ratio-based confidence intervals in survival analysis. J. Amer. Statist. Assoc., 9, Owen, A Empirical likelihood ratio confidence intervals for a single functional. Biometrika, 75, Owen, A. 199 Empirical likelihood and confidence regions. Ann. Statist., 18, Owen, A. 21 Empirical Likelihood, Chapman & Hall: New York. Published by The Berkeley Electronic Press, 25 17

21 The International Journal of Biostatistics, Vol. 1 [25], Iss. 1, Art. 5 Pan, X. R., and Zhou, M. 22 Empirical likelihood ratio in terms of cumulative hazard function for censored data. J. Multi. Anal., 8, Parzen, M.I., Wei, L.J., and Ying, Z Simultaneous confidence intervals for the difference of two survival functions. Scand. J. Statist., 24, Thomas, D.R., and Grunkemeier, G.L Confidence interval estimation for survival probabilities for censored data. J. Amer. Statist. Assoc., 7, DOI: 1.222/

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 24 Paper 153 A Note on Empirical Likelihood Inference of Residual Life Regression Ying Qing Chen Yichuan

More information

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky Empirical likelihood with right censored data were studied by Thomas and Grunkmier (1975), Li (1995),

More information

Likelihood ratio confidence bands in nonparametric regression with censored data

Likelihood ratio confidence bands in nonparametric regression with censored data Likelihood ratio confidence bands in nonparametric regression with censored data Gang Li University of California at Los Angeles Department of Biostatistics Ingrid Van Keilegom Eindhoven University of

More information

STAT Sample Problem: General Asymptotic Results

STAT Sample Problem: General Asymptotic Results STAT331 1-Sample Problem: General Asymptotic Results In this unit we will consider the 1-sample problem and prove the consistency and asymptotic normality of the Nelson-Aalen estimator of the cumulative

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

FULL LIKELIHOOD INFERENCES IN THE COX MODEL October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach

More information

Size and Shape of Confidence Regions from Extended Empirical Likelihood Tests

Size and Shape of Confidence Regions from Extended Empirical Likelihood Tests Biometrika (2014),,, pp. 1 13 C 2014 Biometrika Trust Printed in Great Britain Size and Shape of Confidence Regions from Extended Empirical Likelihood Tests BY M. ZHOU Department of Statistics, University

More information

EMPIRICAL ENVELOPE MLE AND LR TESTS. Mai Zhou University of Kentucky

EMPIRICAL ENVELOPE MLE AND LR TESTS. Mai Zhou University of Kentucky EMPIRICAL ENVELOPE MLE AND LR TESTS Mai Zhou University of Kentucky Summary We study in this paper some nonparametric inference problems where the nonparametric maximum likelihood estimator (NPMLE) are

More information

Quantile Regression for Residual Life and Empirical Likelihood

Quantile Regression for Residual Life and Empirical Likelihood Quantile Regression for Residual Life and Empirical Likelihood Mai Zhou email: mai@ms.uky.edu Department of Statistics, University of Kentucky, Lexington, KY 40506-0027, USA Jong-Hyeon Jeong email: jeong@nsabp.pitt.edu

More information

Empirical Likelihood Confidence Intervals for ROC Curves Under Right Censorship

Empirical Likelihood Confidence Intervals for ROC Curves Under Right Censorship Georgia State University ScholarWorks @ Georgia State University Mathematics Theses Department of Mathematics and Statistics 9-16-2010 Empirical Likelihood Confidence Intervals for ROC Curves Under Right

More information

Empirical Likelihood in Survival Analysis

Empirical Likelihood in Survival Analysis Empirical Likelihood in Survival Analysis Gang Li 1, Runze Li 2, and Mai Zhou 3 1 Department of Biostatistics, University of California, Los Angeles, CA 90095 vli@ucla.edu 2 Department of Statistics, The

More information

11 Survival Analysis and Empirical Likelihood

11 Survival Analysis and Empirical Likelihood 11 Survival Analysis and Empirical Likelihood The first paper of empirical likelihood is actually about confidence intervals with the Kaplan-Meier estimator (Thomas and Grunkmeier 1979), i.e. deals with

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH Jian-Jian Ren 1 and Mai Zhou 2 University of Central Florida and University of Kentucky Abstract: For the regression parameter

More information

Product-limit estimators of the survival function with left or right censored data

Product-limit estimators of the survival function with left or right censored data Product-limit estimators of the survival function with left or right censored data 1 CREST-ENSAI Campus de Ker-Lann Rue Blaise Pascal - BP 37203 35172 Bruz cedex, France (e-mail: patilea@ensai.fr) 2 Institut

More information

On the Breslow estimator

On the Breslow estimator Lifetime Data Anal (27) 13:471 48 DOI 1.17/s1985-7-948-y On the Breslow estimator D. Y. Lin Received: 5 April 27 / Accepted: 16 July 27 / Published online: 2 September 27 Springer Science+Business Media,

More information

Jackknife Empirical Likelihood for the Variance in the Linear Regression Model

Jackknife Empirical Likelihood for the Variance in the Linear Regression Model Georgia State University ScholarWorks @ Georgia State University Mathematics Theses Department of Mathematics and Statistics Summer 7-25-2013 Jackknife Empirical Likelihood for the Variance in the Linear

More information

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints Noname manuscript No. (will be inserted by the editor) A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints Mai Zhou Yifan Yang Received: date / Accepted: date Abstract In this note

More information

Treatment Comparison in Biomedical Studies Using Survival Function

Treatment Comparison in Biomedical Studies Using Survival Function Georgia State University ScholarWorks @ Georgia State University Mathematics Dissertations Department of Mathematics and Statistics Spring 5-3-2 Treatment Comparison in Biomedical Studies Using Survival

More information

Harvard University. Harvard University Biostatistics Working Paper Series

Harvard University. Harvard University Biostatistics Working Paper Series Harvard University Harvard University Biostatistics Working Paper Series Year 2008 Paper 94 The Highest Confidence Density Region and Its Usage for Inferences about the Survival Function with Censored

More information

Estimation and Inference of Quantile Regression. for Survival Data under Biased Sampling

Estimation and Inference of Quantile Regression. for Survival Data under Biased Sampling Estimation and Inference of Quantile Regression for Survival Data under Biased Sampling Supplementary Materials: Proofs of the Main Results S1 Verification of the weight function v i (t) for the lengthbiased

More information

Consistency of bootstrap procedures for the nonparametric assessment of noninferiority with random censorship

Consistency of bootstrap procedures for the nonparametric assessment of noninferiority with random censorship Consistency of bootstrap procedures for the nonparametric assessment of noninferiority with random censorship Gudrun Freitag 1 and Axel Mun Institut für Mathematische Stochasti Georg-August-Universität

More information

Visualizing multiple quantile plots

Visualizing multiple quantile plots Visualizing multiple quantile plots Marko A.A. Boon Eindhoven University of Technology John H.J. Einmahl Tilburg University Ian W. McKeague Columbia University January 18, 2012 Abstract. Multiple quantile

More information

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Jonathan Taylor & Kristin Cobb Statistics 262: Intermediate Biostatistics p.1/?? Overview of today s class Kaplan-Meier Curve

More information

ST745: Survival Analysis: Nonparametric methods

ST745: Survival Analysis: Nonparametric methods ST745: Survival Analysis: Nonparametric methods Eric B. Laber Department of Statistics, North Carolina State University February 5, 2015 The KM estimator is used ubiquitously in medical studies to estimate

More information

1 Glivenko-Cantelli type theorems

1 Glivenko-Cantelli type theorems STA79 Lecture Spring Semester Glivenko-Cantelli type theorems Given i.i.d. observations X,..., X n with unknown distribution function F (t, consider the empirical (sample CDF ˆF n (t = I [Xi t]. n Then

More information

Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm

Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm Mai Zhou 1 University of Kentucky, Lexington, KY 40506 USA Summary. Empirical likelihood ratio method (Thomas and Grunkmier

More information

Extending the two-sample empirical likelihood method

Extending the two-sample empirical likelihood method Extending the two-sample empirical likelihood method Jānis Valeinis University of Latvia Edmunds Cers University of Latvia August 28, 2011 Abstract In this paper we establish the empirical likelihood method

More information

CONFIDENCE TUBES FOR MULTIPLE QUANTILE PLOTS VIA EMPIRICAL LIKELIHOOD

CONFIDENCE TUBES FOR MULTIPLE QUANTILE PLOTS VIA EMPIRICAL LIKELIHOOD The Annals of Statistics 999, Vol. 7, No. 4, 348367 CONFIDENCE TUBES FOR MULTIPLE QUANTILE PLOTS VIA EMPIRICAL LIKELIHOOD BY JOHN H. J. EINMAHL AND IAN W. MCKEAGUE Eindhoven University of Technology and

More information

Efficiency of Profile/Partial Likelihood in the Cox Model

Efficiency of Profile/Partial Likelihood in the Cox Model Efficiency of Profile/Partial Likelihood in the Cox Model Yuichi Hirose School of Mathematics, Statistics and Operations Research, Victoria University of Wellington, New Zealand Summary. This paper shows

More information

Full likelihood inferences in the Cox model: an empirical likelihood approach

Full likelihood inferences in the Cox model: an empirical likelihood approach Ann Inst Stat Math 2011) 63:1005 1018 DOI 10.1007/s10463-010-0272-y Full likelihood inferences in the Cox model: an empirical likelihood approach Jian-Jian Ren Mai Zhou Received: 22 September 2008 / Revised:

More information

Estimation of the Bivariate and Marginal Distributions with Censored Data

Estimation of the Bivariate and Marginal Distributions with Censored Data Estimation of the Bivariate and Marginal Distributions with Censored Data Michael Akritas and Ingrid Van Keilegom Penn State University and Eindhoven University of Technology May 22, 2 Abstract Two new

More information

A GENERALIZED ADDITIVE REGRESSION MODEL FOR SURVIVAL TIMES 1. By Thomas H. Scheike University of Copenhagen

A GENERALIZED ADDITIVE REGRESSION MODEL FOR SURVIVAL TIMES 1. By Thomas H. Scheike University of Copenhagen The Annals of Statistics 21, Vol. 29, No. 5, 1344 136 A GENERALIZED ADDITIVE REGRESSION MODEL FOR SURVIVAL TIMES 1 By Thomas H. Scheike University of Copenhagen We present a non-parametric survival model

More information

UNIVERSITY OF CALIFORNIA, SAN DIEGO

UNIVERSITY OF CALIFORNIA, SAN DIEGO UNIVERSITY OF CALIFORNIA, SAN DIEGO Estimation of the primary hazard ratio in the presence of a secondary covariate with non-proportional hazards An undergraduate honors thesis submitted to the Department

More information

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Takeshi Emura and Hisayuki Tsukuma Abstract For testing the regression parameter in multivariate

More information

Visualizing multiple quantile plots

Visualizing multiple quantile plots Visualizing multiple quantile plots Boon, M.A.A.; Einmahl, J.H.J.; McKeague, I.W. Published: 01/01/2011 Document Version Publisher s PDF, also known as Version of Record (includes final page, issue and

More information

Asymptotic Properties of Kaplan-Meier Estimator. for Censored Dependent Data. Zongwu Cai. Department of Mathematics

Asymptotic Properties of Kaplan-Meier Estimator. for Censored Dependent Data. Zongwu Cai. Department of Mathematics To appear in Statist. Probab. Letters, 997 Asymptotic Properties of Kaplan-Meier Estimator for Censored Dependent Data by Zongwu Cai Department of Mathematics Southwest Missouri State University Springeld,

More information

TESTINGGOODNESSOFFITINTHECOX AALEN MODEL

TESTINGGOODNESSOFFITINTHECOX AALEN MODEL ROBUST 24 c JČMF 24 TESTINGGOODNESSOFFITINTHECOX AALEN MODEL David Kraus Keywords: Counting process, Cox Aalen model, goodness-of-fit, martingale, residual, survival analysis. Abstract: The Cox Aalen regression

More information

AFT Models and Empirical Likelihood

AFT Models and Empirical Likelihood AFT Models and Empirical Likelihood Mai Zhou Department of Statistics, University of Kentucky Collaborators: Gang Li (UCLA); A. Bathke; M. Kim (Kentucky) Accelerated Failure Time (AFT) models: Y = log(t

More information

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints and Its Application to Empirical Likelihood

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints and Its Application to Empirical Likelihood Noname manuscript No. (will be inserted by the editor) A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints and Its Application to Empirical Likelihood Mai Zhou Yifan Yang Received:

More information

Empirical Processes & Survival Analysis. The Functional Delta Method

Empirical Processes & Survival Analysis. The Functional Delta Method STAT/BMI 741 University of Wisconsin-Madison Empirical Processes & Survival Analysis Lecture 3 The Functional Delta Method Lu Mao lmao@biostat.wisc.edu 3-1 Objectives By the end of this lecture, you will

More information

SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES

SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES Statistica Sinica 19 (2009), 71-81 SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES Song Xi Chen 1,2 and Chiu Min Wong 3 1 Iowa State University, 2 Peking University and

More information

Goodness-Of-Fit for Cox s Regression Model. Extensions of Cox s Regression Model. Survival Analysis Fall 2004, Copenhagen

Goodness-Of-Fit for Cox s Regression Model. Extensions of Cox s Regression Model. Survival Analysis Fall 2004, Copenhagen Outline Cox s proportional hazards model. Goodness-of-fit tools More flexible models R-package timereg Forthcoming book, Martinussen and Scheike. 2/38 University of Copenhagen http://www.biostat.ku.dk

More information

EMPIRICAL LIKELIHOOD ANALYSIS FOR THE HETEROSCEDASTIC ACCELERATED FAILURE TIME MODEL

EMPIRICAL LIKELIHOOD ANALYSIS FOR THE HETEROSCEDASTIC ACCELERATED FAILURE TIME MODEL Statistica Sinica 22 (2012), 295-316 doi:http://dx.doi.org/10.5705/ss.2010.190 EMPIRICAL LIKELIHOOD ANALYSIS FOR THE HETEROSCEDASTIC ACCELERATED FAILURE TIME MODEL Mai Zhou 1, Mi-Ok Kim 2, and Arne C.

More information

On the generalized maximum likelihood estimator of survival function under Koziol Green model

On the generalized maximum likelihood estimator of survival function under Koziol Green model On the generalized maximum likelihood estimator of survival function under Koziol Green model By: Haimeng Zhang, M. Bhaskara Rao, Rupa C. Mitra Zhang, H., Rao, M.B., and Mitra, R.C. (2006). On the generalized

More information

A Simulation Study on Confidence Interval Procedures of Some Mean Cumulative Function Estimators

A Simulation Study on Confidence Interval Procedures of Some Mean Cumulative Function Estimators Statistics Preprints Statistics -00 A Simulation Study on Confidence Interval Procedures of Some Mean Cumulative Function Estimators Jianying Zuo Iowa State University, jiyizu@iastate.edu William Q. Meeker

More information

University of California San Diego and Stanford University and

University of California San Diego and Stanford University and First International Workshop on Functional and Operatorial Statistics. Toulouse, June 19-21, 2008 K-sample Subsampling Dimitris N. olitis andjoseph.romano University of California San Diego and Stanford

More information

The International Journal of Biostatistics

The International Journal of Biostatistics The International Journal of Biostatistics Volume 1, Issue 1 2005 Article 3 Score Statistics for Current Status Data: Comparisons with Likelihood Ratio and Wald Statistics Moulinath Banerjee Jon A. Wellner

More information

Survival Analysis for Case-Cohort Studies

Survival Analysis for Case-Cohort Studies Survival Analysis for ase-ohort Studies Petr Klášterecký Dept. of Probability and Mathematical Statistics, Faculty of Mathematics and Physics, harles University, Prague, zech Republic e-mail: petr.klasterecky@matfyz.cz

More information

Logistic regression model for survival time analysis using time-varying coefficients

Logistic regression model for survival time analysis using time-varying coefficients Logistic regression model for survival time analysis using time-varying coefficients Accepted in American Journal of Mathematical and Management Sciences, 2016 Kenichi SATOH ksatoh@hiroshima-u.ac.jp Research

More information

and Comparison with NPMLE

and Comparison with NPMLE NONPARAMETRIC BAYES ESTIMATOR OF SURVIVAL FUNCTIONS FOR DOUBLY/INTERVAL CENSORED DATA and Comparison with NPMLE Mai Zhou Department of Statistics, University of Kentucky, Lexington, KY 40506 USA http://ms.uky.edu/

More information

Multistate Modeling and Applications

Multistate Modeling and Applications Multistate Modeling and Applications Yang Yang Department of Statistics University of Michigan, Ann Arbor IBM Research Graduate Student Workshop: Statistics for a Smarter Planet Yang Yang (UM, Ann Arbor)

More information

BARTLETT IDENTITIES AND LARGE DEVIATIONS IN LIKELIHOOD THEORY 1. By Per Aslak Mykland University of Chicago

BARTLETT IDENTITIES AND LARGE DEVIATIONS IN LIKELIHOOD THEORY 1. By Per Aslak Mykland University of Chicago The Annals of Statistics 1999, Vol. 27, No. 3, 1105 1117 BARTLETT IDENTITIES AND LARGE DEVIATIONS IN LIKELIHOOD THEORY 1 By Per Aslak Mykland University of Chicago The connection between large and small

More information

Investigation of goodness-of-fit test statistic distributions by random censored samples

Investigation of goodness-of-fit test statistic distributions by random censored samples d samples Investigation of goodness-of-fit test statistic distributions by random censored samples Novosibirsk State Technical University November 22, 2010 d samples Outline 1 Nonparametric goodness-of-fit

More information

Empirical Likelihood Confidence Band

Empirical Likelihood Confidence Band University of Kentucky UKnowledge Theses and Dissertations--Statistics Statistics 2015 Empirical Likelihood Confidence Band Shihong Zhu University of Kentucky, shihong2015@gmail.com Click here to let us

More information

Chapter 4 Fall Notations: t 1 < t 2 < < t D, D unique death times. d j = # deaths at t j = n. Y j = # at risk /alive at t j = n

Chapter 4 Fall Notations: t 1 < t 2 < < t D, D unique death times. d j = # deaths at t j = n. Y j = # at risk /alive at t j = n Bios 323: Applied Survival Analysis Qingxia (Cindy) Chen Chapter 4 Fall 2012 4.2 Estimators of the survival and cumulative hazard functions for RC data Suppose X is a continuous random failure time with

More information

Tests of independence for censored bivariate failure time data

Tests of independence for censored bivariate failure time data Tests of independence for censored bivariate failure time data Abstract Bivariate failure time data is widely used in survival analysis, for example, in twins study. This article presents a class of χ

More information

Analysis of Time-to-Event Data: Chapter 2 - Nonparametric estimation of functions of survival time

Analysis of Time-to-Event Data: Chapter 2 - Nonparametric estimation of functions of survival time Analysis of Time-to-Event Data: Chapter 2 - Nonparametric estimation of functions of survival time Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term

More information

Goodness-of-Fit Tests With Right-Censored Data by Edsel A. Pe~na Department of Statistics University of South Carolina Colloquium Talk August 31, 2 Research supported by an NIH Grant 1 1. Practical Problem

More information

Exercises. (a) Prove that m(t) =

Exercises. (a) Prove that m(t) = Exercises 1. Lack of memory. Verify that the exponential distribution has the lack of memory property, that is, if T is exponentially distributed with parameter λ > then so is T t given that T > t for

More information

Lecture 3. Truncation, length-bias and prevalence sampling

Lecture 3. Truncation, length-bias and prevalence sampling Lecture 3. Truncation, length-bias and prevalence sampling 3.1 Prevalent sampling Statistical techniques for truncated data have been integrated into survival analysis in last two decades. Truncation in

More information

Median regression using inverse censoring weights

Median regression using inverse censoring weights Median regression using inverse censoring weights Sundarraman Subramanian 1 Department of Mathematical Sciences New Jersey Institute of Technology Newark, New Jersey Gerhard Dikta Department of Applied

More information

1 Introduction. 2 Residuals in PH model

1 Introduction. 2 Residuals in PH model Supplementary Material for Diagnostic Plotting Methods for Proportional Hazards Models With Time-dependent Covariates or Time-varying Regression Coefficients BY QIQING YU, JUNYI DONG Department of Mathematical

More information

Analytical Bootstrap Methods for Censored Data

Analytical Bootstrap Methods for Censored Data JOURNAL OF APPLIED MATHEMATICS AND DECISION SCIENCES, 6(2, 129 141 Copyright c 2002, Lawrence Erlbaum Associates, Inc. Analytical Bootstrap Methods for Censored Data ALAN D. HUTSON Division of Biostatistics,

More information

Asymptotic Distributions for the Nelson-Aalen and Kaplan-Meier estimators and for test statistics.

Asymptotic Distributions for the Nelson-Aalen and Kaplan-Meier estimators and for test statistics. Asymptotic Distributions for the Nelson-Aalen and Kaplan-Meier estimators and for test statistics. Dragi Anevski Mathematical Sciences und University November 25, 21 1 Asymptotic distributions for statistical

More information

Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models

Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models NIH Talk, September 03 Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models Eric Slud, Math Dept, Univ of Maryland Ongoing joint project with Ilia

More information

Marko A. A. Boon a, John H. J. Einmahl b & Ian W. McKeague c a Department of Mathematics and Computer Science, Eindhoven

Marko A. A. Boon a, John H. J. Einmahl b & Ian W. McKeague c a Department of Mathematics and Computer Science, Eindhoven This article was downloaded by: [Columbia University] On: 8 March 013, At: 1: Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 107954 Registered office: Mortimer

More information

STAT 331. Martingale Central Limit Theorem and Related Results

STAT 331. Martingale Central Limit Theorem and Related Results STAT 331 Martingale Central Limit Theorem and Related Results In this unit we discuss a version of the martingale central limit theorem, which states that under certain conditions, a sum of orthogonal

More information

Definitions and examples Simple estimation and testing Regression models Goodness of fit for the Cox model. Recap of Part 1. Per Kragh Andersen

Definitions and examples Simple estimation and testing Regression models Goodness of fit for the Cox model. Recap of Part 1. Per Kragh Andersen Recap of Part 1 Per Kragh Andersen Section of Biostatistics, University of Copenhagen DSBS Course Survival Analysis in Clinical Trials January 2018 1 / 65 Overview Definitions and examples Simple estimation

More information

Efficiency Comparison Between Mean and Log-rank Tests for. Recurrent Event Time Data

Efficiency Comparison Between Mean and Log-rank Tests for. Recurrent Event Time Data Efficiency Comparison Between Mean and Log-rank Tests for Recurrent Event Time Data Wenbin Lu Department of Statistics, North Carolina State University, Raleigh, NC 27695 Email: lu@stat.ncsu.edu Summary.

More information

COMPUTATION OF THE EMPIRICAL LIKELIHOOD RATIO FROM CENSORED DATA. Kun Chen and Mai Zhou 1 Bayer Pharmaceuticals and University of Kentucky

COMPUTATION OF THE EMPIRICAL LIKELIHOOD RATIO FROM CENSORED DATA. Kun Chen and Mai Zhou 1 Bayer Pharmaceuticals and University of Kentucky COMPUTATION OF THE EMPIRICAL LIKELIHOOD RATIO FROM CENSORED DATA Kun Chen and Mai Zhou 1 Bayer Pharmaceuticals and University of Kentucky Summary The empirical likelihood ratio method is a general nonparametric

More information

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA Kasun Rathnayake ; A/Prof Jun Ma Department of Statistics Faculty of Science and Engineering Macquarie University

More information

Statistical Analysis of Competing Risks With Missing Causes of Failure

Statistical Analysis of Competing Risks With Missing Causes of Failure Proceedings 59th ISI World Statistics Congress, 25-3 August 213, Hong Kong (Session STS9) p.1223 Statistical Analysis of Competing Risks With Missing Causes of Failure Isha Dewan 1,3 and Uttara V. Naik-Nimbalkar

More information

GROUPED SURVIVAL DATA. Florida State University and Medical College of Wisconsin

GROUPED SURVIVAL DATA. Florida State University and Medical College of Wisconsin FITTING COX'S PROPORTIONAL HAZARDS MODEL USING GROUPED SURVIVAL DATA Ian W. McKeague and Mei-Jie Zhang Florida State University and Medical College of Wisconsin Cox's proportional hazard model is often

More information

Log-linearity for Cox s regression model. Thesis for the Degree Master of Science

Log-linearity for Cox s regression model. Thesis for the Degree Master of Science Log-linearity for Cox s regression model Thesis for the Degree Master of Science Zaki Amini Master s Thesis, Spring 2015 i Abstract Cox s regression model is one of the most applied methods in medical

More information

Smoothing the Nelson-Aalen Estimtor Biostat 277 presentation Chi-hong Tseng

Smoothing the Nelson-Aalen Estimtor Biostat 277 presentation Chi-hong Tseng Smoothing the Nelson-Aalen Estimtor Biostat 277 presentation Chi-hong seng Reference: 1. Andersen, Borgan, Gill, and Keiding (1993). Statistical Model Based on Counting Processes, Springer-Verlag, p.229-255

More information

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model Other Survival Models (1) Non-PH models We briefly discussed the non-proportional hazards (non-ph) model λ(t Z) = λ 0 (t) exp{β(t) Z}, where β(t) can be estimated by: piecewise constants (recall how);

More information

Bahadur representations for bootstrap quantiles 1

Bahadur representations for bootstrap quantiles 1 Bahadur representations for bootstrap quantiles 1 Yijun Zuo Department of Statistics and Probability, Michigan State University East Lansing, MI 48824, USA zuo@msu.edu 1 Research partially supported by

More information

The Central Limit Theorem Under Random Truncation

The Central Limit Theorem Under Random Truncation The Central Limit Theorem Under Random Truncation WINFRIED STUTE and JANE-LING WANG Mathematical Institute, University of Giessen, Arndtstr., D-3539 Giessen, Germany. winfried.stute@math.uni-giessen.de

More information

Asymptotic statistics using the Functional Delta Method

Asymptotic statistics using the Functional Delta Method Quantiles, Order Statistics and L-Statsitics TU Kaiserslautern 15. Februar 2015 Motivation Functional The delta method introduced in chapter 3 is an useful technique to turn the weak convergence of random

More information

LEAST ABSOLUTE DEVIATIONS ESTIMATION FOR THE ACCELERATED FAILURE TIME MODEL

LEAST ABSOLUTE DEVIATIONS ESTIMATION FOR THE ACCELERATED FAILURE TIME MODEL Statistica Sinica 17(2007), 1533-1548 LEAST ABSOLUTE DEVIATIONS ESTIMATION FOR THE ACCELERATED FAILURE TIME MODEL Jian Huang 1, Shuangge Ma 2 and Huiliang Xie 1 1 University of Iowa and 2 Yale University

More information

Large sample theory for merged data from multiple sources

Large sample theory for merged data from multiple sources Large sample theory for merged data from multiple sources Takumi Saegusa University of Maryland Division of Statistics August 22 2018 Section 1 Introduction Problem: Data Integration Massive data are collected

More information

Least Absolute Deviations Estimation for the Accelerated Failure Time Model. University of Iowa. *

Least Absolute Deviations Estimation for the Accelerated Failure Time Model. University of Iowa. * Least Absolute Deviations Estimation for the Accelerated Failure Time Model Jian Huang 1,2, Shuangge Ma 3, and Huiliang Xie 1 1 Department of Statistics and Actuarial Science, and 2 Program in Public Health

More information

Power and Sample Size Calculations with the Additive Hazards Model

Power and Sample Size Calculations with the Additive Hazards Model Journal of Data Science 10(2012), 143-155 Power and Sample Size Calculations with the Additive Hazards Model Ling Chen, Chengjie Xiong, J. Philip Miller and Feng Gao Washington University School of Medicine

More information

Product-limit estimators of the gap time distribution of a renewal process under different sampling patterns

Product-limit estimators of the gap time distribution of a renewal process under different sampling patterns Product-limit estimators of the gap time distribution of a renewal process under different sampling patterns arxiv:13.182v1 [stat.ap] 28 Feb 21 Richard D. Gill Department of Mathematics University of Leiden

More information

Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials

Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials Progress, Updates, Problems William Jen Hoe Koh May 9, 2013 Overview Marginal vs Conditional What is TMLE? Key Estimation

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 01: Introduction and Overview

Introduction to Empirical Processes and Semiparametric Inference Lecture 01: Introduction and Overview Introduction to Empirical Processes and Semiparametric Inference Lecture 01: Introduction and Overview Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations

More information

Package Rsurrogate. October 20, 2016

Package Rsurrogate. October 20, 2016 Type Package Package Rsurrogate October 20, 2016 Title Robust Estimation of the Proportion of Treatment Effect Explained by Surrogate Marker Information Version 2.0 Date 2016-10-19 Author Layla Parast

More information

Chapter 2 Inference on Mean Residual Life-Overview

Chapter 2 Inference on Mean Residual Life-Overview Chapter 2 Inference on Mean Residual Life-Overview Statistical inference based on the remaining lifetimes would be intuitively more appealing than the popular hazard function defined as the risk of immediate

More information

A comparison study of the nonparametric tests based on the empirical distributions

A comparison study of the nonparametric tests based on the empirical distributions 통계연구 (2015), 제 20 권제 3 호, 1-12 A comparison study of the nonparametric tests based on the empirical distributions Hyo-Il Park 1) Abstract In this study, we propose a nonparametric test based on the empirical

More information

Multi-state models: prediction

Multi-state models: prediction Department of Medical Statistics and Bioinformatics Leiden University Medical Center Course on advanced survival analysis, Copenhagen Outline Prediction Theory Aalen-Johansen Computational aspects Applications

More information

ICSA Applied Statistics Symposium 1. Balanced adjusted empirical likelihood

ICSA Applied Statistics Symposium 1. Balanced adjusted empirical likelihood ICSA Applied Statistics Symposium 1 Balanced adjusted empirical likelihood Art B. Owen Stanford University Sarah Emerson Oregon State University ICSA Applied Statistics Symposium 2 Empirical likelihood

More information

NONPARAMETRIC TESTING FOR MULTIPLE SURVIVAL FUNCTIONS WITH NON-INFERIORITY MARGINS

NONPARAMETRIC TESTING FOR MULTIPLE SURVIVAL FUNCTIONS WITH NON-INFERIORITY MARGINS Submitted to the Annals of Statistics NONPARAMETRIC TESTING FOR MULTIPLE SURVIVAL FUNCTIONS WITH NON-INFERIORITY MARGINS By Hsin-Wen Chang, Ian W. McKeague Academia Sinica and Columbia University New nonparametric

More information

Symmetric Tests and Condence Intervals for Survival Probabilities and Quantiles of Censored Survival Data Stuart Barber and Christopher Jennison Depar

Symmetric Tests and Condence Intervals for Survival Probabilities and Quantiles of Censored Survival Data Stuart Barber and Christopher Jennison Depar Symmetric Tests and Condence Intervals for Survival Probabilities and Quantiles of Censored Survival Data Stuart Barber and Christopher Jennison Department of Mathematical Sciences, University of Bath,

More information

PhD course in Advanced survival analysis. One-sample tests. Properties. Idea: (ABGK, sect. V.1.1) Counting process N(t)

PhD course in Advanced survival analysis. One-sample tests. Properties. Idea: (ABGK, sect. V.1.1) Counting process N(t) PhD course in Advanced survival analysis. (ABGK, sect. V.1.1) One-sample tests. Counting process N(t) Non-parametric hypothesis tests. Parametric models. Intensity process λ(t) = α(t)y (t) satisfying Aalen

More information

Chapter 7 Fall Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample

Chapter 7 Fall Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample Bios 323: Applied Survival Analysis Qingxia (Cindy) Chen Chapter 7 Fall 2012 Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample H 0 : S(t) = S 0 (t), where S 0 ( ) is known survival function,

More information

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS Statistica Sinica 20 (2010), 441-453 GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS Antai Wang Georgetown University Medical Center Abstract: In this paper, we propose two tests for parametric models

More information

A note on profile likelihood for exponential tilt mixture models

A note on profile likelihood for exponential tilt mixture models Biometrika (2009), 96, 1,pp. 229 236 C 2009 Biometrika Trust Printed in Great Britain doi: 10.1093/biomet/asn059 Advance Access publication 22 January 2009 A note on profile likelihood for exponential

More information

Survival Times (in months) Survival Times (in months) Relative Frequency. Relative Frequency

Survival Times (in months) Survival Times (in months) Relative Frequency. Relative Frequency Smooth Goodness-of-Fit Tests in Hazard-Based Models by Edsel A. Pe~na Department of Statistics University of South Carolina at Columbia E-Mail: pena@stat.sc.edu Univ. of Virginia Colloquium Talk Department

More information

A Primer on Asymptotics

A Primer on Asymptotics A Primer on Asymptotics Eric Zivot Department of Economics University of Washington September 30, 2003 Revised: October 7, 2009 Introduction The two main concepts in asymptotic theory covered in these

More information

Empirical likelihood based tests for stochastic ordering under right censorship

Empirical likelihood based tests for stochastic ordering under right censorship Electronic Journal of Statistics Vol. 10 (2016) 2511 2536 ISSN: 1935-7524 DOI: 10.1214/16-EJS1180 Empirical likelihood based tests for stochastic ordering under right censorship Hsin-wen Chang Institute

More information

Survival Times (in months) Survival Times (in months) Relative Frequency. Relative Frequency

Survival Times (in months) Survival Times (in months) Relative Frequency. Relative Frequency Smooth Goodness-of-Fit Tests in Hazard-Based Models by Edsel A. Pe~na Department of Statistics University of South Carolina at Columbia E-Mail: pena@stat.sc.edu Univ. of Georgia Colloquium Talk Department

More information