Median regression using inverse censoring weights

Size: px
Start display at page:

Download "Median regression using inverse censoring weights"

Transcription

1 Median regression using inverse censoring weights Sundarraman Subramanian 1 Department of Mathematical Sciences New Jersey Institute of Technology Newark, New Jersey Gerhard Dikta Department of Applied Science and Technology Fachhochschule Aachen Ginsterweg 1, D Jülich, Germany Abstract We implement semiparametric random censorship model aided inference for censored median regression models. This is based on the idea that, when the censoring is specified by a common distribution, a semiparametric survival function estimator acts as an improved weight in the so-called inverse censoring weighted estimating function. We show that the proposed method will always produce estimates of the model parameters that are as good as or better than an existing estimator based on the traditional Kaplan Meier weights. We also provide an illustration of the method through an analysis of a lung cancer data set. KEY WORDS: Asymptotically normal, Least absolute deviation, Local linearity, Loewner ordering, Lung cancer study, Minimum dispersion statistic. 1 Corresponding author s address: Sundar Subramanian, Associate Professor, Department of Mathematical Sciences, New Jersey Institute of Technology, Newark, New Jersey ( sundars@njit.edu). 1

2 1 Introduction Semiparametric random censorship (SRC) models provide a more compelling framework than fully nonparametric models because they not only improve the precision of estimates but also, in some cases, obviate smoothing. Dikta (1998) proved that an SRC estimator of a survival function has improved efficiency in comparison with the Kaplan Meier (KM) estimator. On the other hand, when there are missing censoring indicators (MCIs), SRC should be the approach of choice (Subramanian 2004b), since nonparametric estimation would require pre-smoothing (Subramanian 2004a), which, apart from requiring data-driven bandwidths, can also lead to less precise fits when the censoring proportion is high. The SRC approach requires specifying a parametric model for m(t), the conditional expectation of the censoring indicator given the observed survival time (possibly censored), and estimating the model parameter using maximum likelihood. In this article, we demonstrate the utility of the SRC approach for the fitting of median regression (MR) models from right censored data. Specifically, when the censoring has a covariate free distribution, its KM estimator figures as a key element in the approach of inverse censoring weighted (ICW) median regression (Ying, Jung, and Wei 1995; Subramanian 2007b; Yin, Zeng, and Li 2008). Instead, as we will show, using a SRC estimator of the censoring distribution produces at least as good or better inference. Denote by Z a p + 1 dimensional covariate with first component 1, and suppose T = β Z + ɛ, where β is an unknown parameter. The joint distribution of ɛ and Z is unspecified, but the conditional median of ɛ given Z is known to be 0. The data are n iid copies of (X, δ, Z), where X = min(t, C), δ = I(X = T ), and C is a censoring variable independent 2

3 of T. The goal is inference for β. Censored median and, more generally, quantile regression models have received much attention (Powell 1986; Newey and Powell 1990; Ying et al. 1995; Lindgren 1997; Yang 1999; McKeague, Subramanian, and Sun 2001; Subramanian 2002, 2007a, 2007b; Honoré, Khan, and Powell 2002; Portnoy 2003; Qin and Tsao 2003; Zhou and Wang 2005; Peng and Huang 2008; Yin et al. 2008). Indeed, in many lifetime data analysis situations, an MR model is perhaps preferable over other competing models such as the accelerated failure time and Cox proportional hazards models because it possesses the twin advantages of robustness and ease of extraction of target information. A key assumption of the approaches of Ying et al. (1995) and Yin et al. (2008) is the independence between the censoring C and covariate Z, mainly designed to avoid curse of dimensionality; see also Honoré et al. (2002). Under this assumption and that of random censoring, the global KM estimator of the distribution of C is typically plugged into a censoring-adjusted least absolute deviation (LAD) estimating function, arrived at by an application of the ICW approach; see also Koul, Susarla, and Van Ryzin (1981), Robins and Rotnitzky (1992), Bang and Tsiatis (2002), and Subramanian (2007a, 2007b). The asymptotic distribution of the estimator of β is normal but with a variance-covariance matrix that depends on the unknown error density, which is difficult to estimate. Ying et al. (1995) and, later, Subramanian (2007a) obtained confidence regions via Basawa and Koul s (1988) minimum dispersion (MD) statistic. Subramanian (2007b) implemented profile empirical likelihood for censored MR models. Yin et al. (2008), on the other hand, employed a resampling procedure based on a perturbation method (Jin, Wei, and Ying 2001). 3

4 Our proposed modification is driven by the rationale that, as evidenced in the homogeneous case (Dikta 1998), with the right choice of a model, m(t, θ), θ IR k, for m(t) = P (δ = 0 X = t), the SRC paradigm would improve the precision of the parameter estimates. We plug an SRC estimate of G(t), the survival function of C, into the ICW estimating function and choose the minimizer as our estimate of β. We show that the estimator of β (denoted by ˆβ), obtained using the SRC weights in the ICW estimating function, is as good as or better than the estimator obtained by employing the usual KM weights. In practice, specification of a model for m(t) should pose no serious difficulty, since a plethora of choices are available for binary data (Cox and Snell 1989; Dikta, Kvesic, and Schmidt 2006). Indeed, for our illustration involving a lung cancer study, we are able to zero in on an appropriate choice, namely a generalized proportional hazards candidate, see Section 3. Justifying that the chosen model is an appropriate one can be accomplished via the bootstrap method proposed by Dikta et al. (2006), see Section 3. Finally, the proposed approach using SRC weights extends readily even when there are MCIs, since, under the assumption of missing at random, θ can be estimated based on only the complete cases (Lu and Tsiatis 2001; Subramanian 2004b). This is not possible with the ICW approach of Ying et al. (1995). The article is organized as follows. In the next section, we investigate SRC-based ICW inference for the MR parameter. In Section 3, we present an illustration of our proposed methodology using data from a lung cancer study. We complete the article with a conclusion section. Technical details are given in the three Appendixes that appear after Section 4. 4

5 2 Inference for the regression parameter This section begins with a review of the approach proposed by Ying et al. (1995). We then present our modification and carry out a theoretical comparison between the asymptotic variance-covariance matrices of the estimators from the two competing approaches. 2.1 The ICW estimating function with KM weights Let β 0 denote the true value of β and let Λ G (t) denote the censoring cumulative hazard function. Ying et al. (1995) used KM weights in the estimating function for β given by T n (β) = { } I(X i β Z i ) Ĝ KM (β Z i ) 1 Z i, (1) 2 where ĜKM(t) is the KM estimator of the censoring distribution, and obtained their estimator ˆβ Y JW as the minimizer of T n (β), where denotes the Euclidean norm. For any vector a, let a 2 = aa, and let r(t) and y 1 (t) denote the limits of n 1 n I(β 0Z i t)z i and Y 1 (t) = n 1 n I(X i t). Ying et al. (1995) proved that, under certain regularity conditions (see next subsection), n 1/2 T n (β 0 ) is asymptotically normal with zero mean and variance-covariance matrix given by 1 Γ = lim n n [ I(Xi β 0Z i ) 1 G(β 0Z i ) 2 ] 2 Z 2 i 1 4 r(t)r T (t) dλ G (t). (2) y 1 (t) Let f ɛ (t z) = f(t + β 0z z) denote the conditional density of ɛ given Z = z, where f(t z) is the conditional density of T given Z. The estimator ˆβ Y JW is consistent and n 1/2 ( ˆβ Y JW β 0 ) D N (0, Υ 1 ΓΥ 1 ), where Υ = E[Z 2 f ɛ (0 Z)]. Let ˆΓ denote a consistent estimator of Γ. A test-based approach of computing confidence regions for β 0 uses the statistic 5

6 V (β 0 ) = n 1 T n(β 0 )ˆΓ 1 T n (β 0 ), which is approximately chi-square distributed with p + 1 degrees of freedom, leading to a 100(1 α)% confidence region {β : V (β) < χ 2 p+1(1 α)} for β 0. Frequently, however, inference for a specific sub-vector is desired, so letting β = ( β (1), β (2)), ( ) β (1) 0, β (2) 0, the MD where β (1) is the q 1 parameter sub-vector of interest, and β 0 = ( ) { statistic given by D β (1) 0 = min β2 n 1 T n ( β (1) 0, β (2) ) ˆΓ 1 S n ( β (1) 0, β (2) )} is used to construct confidence regions for β (1) 0. The MD statistic is approximately chi-square distributed with q degrees of freedom, from which a 100(1 α)% confidence region for β (1) 0 is given by { β (1) : D ( β (1)) < χ 2 q(1 α) }. In particular, this test-based approach also provides confidence regions for individual regression effects. 2.2 The ICW estimating function with SRC weights Define m(x) = P (δ = 0 X = x) and specify m(x) = m(x, θ), where m is known up to the k-dimensional parameter θ. Let θ 0 denote the true value of θ and m 0 (s) = m(s, θ 0 ). Denote the MLE of θ by ˆθ and write ˆm(s) = m(s, ˆθ). Let Ĥ(s) denote the empirical distribution based on X 1,..., X n. Our proposed modification employs SRC weights in the ICW estimating function for β and is given by S n (β) = { I(X i β Z i ) Ĝ SP (β Z i ) 1 2 } Z i, (3) where ĜSP (t) is the SRC model-based estimator of G proposed and studied by Dikta (1998). Note that ĜSP (t) is the product integral of the cumulative hazard estimator given by ˆΛ SP (t) = t ˆm(s) dĥ(s). (4) Y 1 (s) To investigate the large sample properties of ˆβ, we shall assume all the SRC model 6

7 regularity conditions (Dikta 1998), as well as the following (Ying et al. 1995): 1. The true value β 0 is in the interior of a known bounded convex region D. 2. The covariate vector Z is bounded, say Z L. 3. For β D, there exists a τ such that P (X > τ Z) > 0 and β Z τ almost surely. 4. The derivatives of the conditional survival function F (t z) of T given Z and G(t z) with respect to t are uniformly bounded in (t, z) F = (, τ] [ L, L]. 5. The matrix A = E[Z 2 f ɛ (0 Z)] is positive definite. We first investigate the asymptotic distribution of n 1/2 S n (β 0 ). We shall need additional notation. For r = 1,..., k, write m r (s, θ 0 ) = m(s, θ)/ θ r θ=θ0 and M 0 (s) = [m 1 (s, θ 0 ),..., m k (s, θ 0 )]. Let p 0 (x) = m 0 (x)(1 m 0 (x)) and define the information matrix I 0 = E[M 2 0 (X)/p 0 (X)]. Also, let α(s, t) = M 0(s)I 1 0 M 0 (t) and r(t) = E (I(β 0Z t)z). Write Â(t) = n 1 (1 δ i ) m 0 (X i ) α(t, X i ). p 0 (X i ) Let K( ) be such that K(s)dH(s) <. From Lemma 3.5 of Dikta (1998), we have that n 1/2 { ˆm(s) m 0 (s)} = n 1/2 Â(s) + R n (s), (5) where, uniformly for t τ, the term n 1/2 R n (t) is bounded above by K(t) 1 r,s k (ˆθ r θ 0,r )(ˆθ s θ 0,s ) = K(t) ( O p (n 1/2 ) + o p (n 1/2 ) ) 2 = K(t)Op (n 1 ). Then we have the following representation [Dikta 1998], uniformly for t τ: ˆΛ SP (t) Λ G (t) = 1 n + 1 n t 1 y 1 (s) {m 0(s)d I(X i s) I(X i s)dλ G (s)} (1 δ i ) m 0 (X i ) p 0 (X i ) 7 t α(s, X i ) y 1 (s) dh(s) + o p(n 1/2 ). (6)

8 Define the following random quantities [Note that E(U) = E(W ) = E(V 1 + V 2 ) = 0]: { I(X β U = 0 Z) 1 G(β 0Z) 2 V 1 = 1 r(x) 2 y 1 (X) m 0(X), V 2 = 1 2 } Z, W = 1 (1 δ) m 0 (X) 2 p 0 (X) r(t) y 1 (t) I(X t)dλ G(t). r(t)α(t, X) dh(t), y 1 (t) Theorem 1 Under the conditions stated above, the distribution of n 1/2 S n (β 0 ) is asymptotically normal with zero mean and variance-covariance matrix given by Σ = Γ where = { 1 4 r(t)r (t) (1 m 0 (t))dλ G (t) 1 y 1 (t) 4 Proof We have that S n (β 0 ) = S n (β 0 ) + o p (n 1/2 ) where S n (β 0 ) = { I(Xi β 0Z i ) 1 } Z G(β 0Z i i ) 2 } r(s)r (t) α(s, t)dh(s)dh(t). (7) y 1 (s)y 1 (t) Z i I(X i β 0Z (β 0Z i ) G(β 0Z i ) i )ĜSP. G 2 (β 0Z i ) Let R(t) = n 1 n Z ii(β 0Z i X i t). Denote the second term of Sn (β 0 ) by Q(β 0 ). Applying the Duhamel equation (Andersen, Borgan, Gill, and Keiding 1992), interchanging integrals, and then applying Lenglart s inequality (Andersen et al. 1992), we obtain Q(β 0 ) = n [ t = n = n = n Ĝ SP (t) G(t) dr(t) G 2 (t) Ĝ SP (s ) G(s) [ ] dr(s) ĜSP (t ) t G(s) G(t) [ dr(s) G(s) t ) ] dr(t) d (ˆΛSP (s) Λ G (s) G(t) ] d (ˆΛSP (t) Λ G (t) ) d (ˆΛSP (t) Λ G (t) ) + o p (n 1/2 ). Recall that r(t) = E (I(β 0Z t)z). The integral inside the square brackets may be expressed as 1 n ( ) I(X i β 0Z i ) I(X β I(β G(β 0Z i ) 0Z i t)z i = E 0 Z) I(β G(β 0Z t)z + o 0Z) p (1) = r(t) 2 + o p(1). 8

9 Then, 2Q(β 0 ) = n ) (ˆΛSP r(t) d (t) Λ G (t) + o p (n 1/2 ). From Eq. (6), we have Q(β 0 ) = n 1 n (V i + W i ) + o p (n 1/2 ), where V i and W i are independent copies of V = V 1 + V 2 and W respectively. It follows that n 1/2 S n (β 0 ) is asymptotically normal with influence function U + V + W. The covariance matrix of the limiting distribution is Σ, see Appendix A. It is clear from Theorem 1 that is the difference between the asymptotic variancecovariance matrices of n 1/2 T n (β 0 ) and n 1/2 S n (β 0 ). Let a IR p+1 and write b (t) = a r(t). Then c(t) = b (t)b(t) IR. In this case, a a equals the right hand side of Eq. (7), after replacing r(t)r (t) throughout with c(t) in that equation. It follows from Corollary 2.7 of Dikta (1998), that a a 0. That is, the covariance matrices are ordered with regard to the Loewner ordering (Shklyar and Schneeweiss, 2005) in the sense that Γ Σ. The proof of consistency of ˆβ will follow as for ˆβ Y JW, if we can show that, for any ɛ > 0, sup ĜSP (t) G(t) = o ( n 1/2+ɛ), almost surely. (8) t τ A proof of Eq. (8) is given in Appendix B. Finally, asymptotic normality of ˆβ would follow from a property called the local linearity for S n (β). This means that for β β 0 = O(n 1/3 ), the estimating function S n (β) should satisfy (cf. Ying et al. 1995) S n (β) = S n (β 0 ) + na(β β 0 ) + o p (max(n 1/2, n β β 0 )). (9) From Eq. (9), we can deduce that ˆβ is asymptotically normal with mean zero and covariance matrix given by na 1 ΓA 1. A proof of Eq. (9) is given in Appendix C. The covariance matrices of ˆβ, denoted by Σ, and that of ˆβ Y JW, denoted by Γ, also obey the Loewner ordering in the sense that Γ Σ. This means that both the estimating function 9

10 S n (β 0 ) and the estimator ˆβ are each as good as or more efficient than the corresponding Ying et al. (1995) counterparts. This shows that, when the model for m(t) is chosen well, our modification would yield the desired improved inference for the regression parameter β. 3 Illustration using a lung cancer data set We now analyze data from a lung cancer study (Ying et al. 1995), in which patients with small cell lung cancer were assigned randomly to two treatments. The response is the base 10 log failure time, and age and treatment indicator are the two covariates. Letting θ = (θ 1, θ 2 ), we fitted the model m(x, θ) = (10 x /365) θ 2 /(θ 1 + (10 x /365) θ 2 ) to the data (X i, 1 δ i ), i = 1,..., n. This model is just one minus the generalized proportional hazards model, see Eq. (13) of Dikta et al. (2006). Note that 10 x /365 is just the original failure time expressed in years. We used the genetic optimization algorithm of Mebane and Sekhon (1998) and obtained the estimates ˆθ 1 = 610 and ˆθ 2 = The fitted model along with the (X, 1 δ) data are plotted in figure 1. The KM and SRC model-based survival curves are shown in figure 2. We also tested H 0 : m( ) M = { m(x, θ) = (10 x /365) θ 2 /(θ 1 + (10 x /365) θ 2 ) } against H 1 : m( ) / M. We used the model-based resampling test of Dikta et al. (2006), which produced a p-value of 0.28 based on 1000 resamples, implying that H 0 cannot be rejected. It was reported elsewhere (Subramanian, 2007) that, using the KM weights for the censoring, the point estimates of the intercept, age, and treatment effects were respectively 2.90, 0.002, Using the SRC weights for the censoring, our analysis using the proposed methodology produced the respective point estimates 3.16, 0.006, Treating the intercept β (1) and the treatment effect β (2) as nuisance parameters, a 95% confidence 10

11 interval for the age effect β (3) 0 using the MD statistic and KM weights is ( , ). The proposed methodology using the the MD statistic and SRC weights produced an interval estimate of ( , ) for the age effect. The null hypothesis of no age effect is still not rejected but now only barely so. Treating the intercept β (1) and the age effect β (3) as nuisance parameters, a 95% confidence interval for the treatment effect β (2) 0 using the MD statistic and KM weights is ( , ). The proposed methodology produced an interval estimate of ( , ) for the treatment effect. 4 Conclusion Much like the existing method, the modified ICW estimating function for censored MR models investigated in this paper is tailored for the case that the censoring distribution is free of the covariate. As such, the proposed method applies when the subjects are administratively censored, as in the case of the lung cancer study above, or when a preliminary analysis reveals that the censoring is not likely to be related to the covariates, as in a data set analyzed by Yin et al. (2008). Censored MR methods that address the general case of covariate dependent censoring perhaps do not possess the simple motivation and formulation of the ICW approach, and hence may not be appealing to a practitioner when it is known in advance that censoring is in fact free of the covariate. Our method relies on a properly chosen parametric model for the conditional probability of censoring given the covariate and, given the right choice, is shown to work as well or better than the approach that employs KM weights. For the lung cancer data analyzed in this paper, our approach suggests a possible negative association of entry age with survival time, a result also corroborated by 11

12 Yang s (1999) analysis. Interestingly, the approach using KM weights was unable to detect this possible negative association. We conclude that the proposed modification is a useful complement to the existing censored MR methodology, including for the case of MCIs. Appendix A Covariance calculations Recall the definitions of U, V = V 1 + V 2, and W from the expressions following Eq. (6) in section 2.2. The assumption of a common censoring distribution implies that E(UV ) = E {ZV I(X β 0Z)/G(β 0Z)}. We calculate { } I(X β E 0 Z) ZV G(β 0Z) 1 = 1 [ ] I(X β 2 E 0 Z)m 0 (X)Zr (X) G(β 0Z)y 1 (X) [ = 1 ] 2 E Z r (t)dλ G(β 0Z) G (t) β 0 Z = 1 [ ] I(β E 0 Z t)z r (t)dλ 2 G(β 0Z) G (t). Next, since I(X β 0Z)(I(X t)z = ZI(β 0Z < t)i(x t) + ZI(β 0Z t)i(x β 0Z), we can show using the assumption of a common censoring distribution that { } I(X β E 0 Z) V G(β 0Z) 2 = = 1 4 [ ] I(X β E 0 Z)I(β 0Z t)z r (t) G(β 0Z) y 1 (t) dλ G(t) [ ] I(β E 0 Z < t)z r (t)dλ G(β 0Z) G (t) r(t)r (t) dλ G (t) 1 [ I(β E 0 Z < t)z y 1 (t) 2 G(β 0Z) ] r (t)dλ G (t). Therefore, under assumption that the covariate Z is continuous, we have E(UV ) = 1 4 r(t)r (t) dλ G (t). y 1 (t) 12

13 Likewise, we can show that E(V U ) = E(UV ). Also, it is clear that E(UW ) = E(W U ) = 0. Furthermore, we can show that the following expressions hold: E [V 1 V 1] = 1 4 E [W W ] = 1 4 r(t)r (t) y 1 (t) m 0 (t)dλ G (t) E [V 2 V 2] = E [V 1 V 2] E [V 2 V 1] r(s)r (t) α(s, t)dh(s)dh(t). y 1 (s)y 1 (t) After some algebra, the expression for Σ follows. Appendix B Proof of Eq. (8) Write H 0 (t) = P (X t, δ = 0) and denote its empirical estimator by Ĥ0(t). By the Duhamel equation and Lenglart s inequality it suffices to show that sup t τ ˆΛ SP (t) Λ G (t) = o ( n 1/2+ɛ) almost surely. From Eq. (6), we can write sup t τ ˆΛ SP (t) Λ G (t) = I 1 + I 2 + o p (n 1/2 ), where I 1 = sup t τ sup t τ 1 n t t 1 y 1 (s) {m 0(s)d I(X i s) I(X i s)dλ G (s)} m 0 (s) } {Ĥ(s) y 1 (s) d t H(s) + sup (Ĥ(s) H(s))dΛ G(s) t τ y 1 (s). The second term of I 1 is bounded above by the quantity sup t τ Ĥ(t) H(t) Λ G(τ)/y 1 (τ), which is O ( (n/ log n) 1/2) almost surely. After integration by parts, we can easily show that the first term of I 1 is bounded above by the quantity sup t τ Ĥ(t) H(t) /y 1(τ) plus sup t τ Ĥ(t) H(t) [ sup t τ m 0(t)/y 1 (τ) + sup t τ m 0 (t) sup t τ y 1(t)/y 2 1(τ) ], which would be O ( (n/ log n) 1/2) almost surely, provided m 0 (t), its first derivative, the density functions f and g (of F and G respectively) are all bounded uniformly for t τ. Next we have I 2 = sup t τ 1 n (1 δ i ) m 0 (X i ) p 0 (X i ) 13 t α(s, X i ) y 1 (s) dh(s).

14 Write ρ(t, u) = t α(s, u)dh(s)/y 1(s). Note that m 0 (u)dh(u) = dh 0 (u). Then I 2 = sup ρ(t, u) t τ p 0 (u) dĥ0(u) m 0 (u)ρ(t, u) p 0 (u) dĥ(u) = sup ρ(t, u) t τ p 0 (u) d{ĥ0(u) m 0 (u)ρ(t, u) H 0 (u)} p 0 (u) d{ĥ(u) H(u)}. Each term can be shown to be O ( (n/ log n) 1/2) almost surely, by integration by parts coupled with an appropriate uniform boundedness assumption on ρ(t, u) and the assumption that m 0 (t) is bounded away from 0 and 1. The proof is completed. Appendix C Proof of local linearity of proposed estimating function It can be shown that S n (β) T n (β) = [ĜKM Z i I(X i β (β Z i ) G(β Z i ) Z i ) G 2 (β Z i ) + o p (n 1/2 ), ĜSP (β Z i ) G(β Z i ) G 2 (β Z i ) ] uniformly for β D. It follows that S n (β) S n (β 0 ) = T n (β) T n (β 0 ) + R 1 + R 2, where R 2 is just R 1 defined below but ĜSP replacing ĜKM: } {ĜKM R 1 = Z i I(X i β (β Z i ) G(β Z i ) Z i ) G 2 (β Z i ) } {ĜKM Z i I(X i β 0Z (β i ) 0Z i ) G(β 0Z i ), G 2 (β 0Z i ) which, by lemmas 1 and 2 of Ying et al. (1995), can be shown to be o p (n 1/2 ) when β β 0 = O(n 1/3 ). Define the following limit: 1 r β (t) = lim n n I(X i β Z i ) Z i I(β Z G(β i t). Z i ) Note that r β0 (t) is just r(t)/2. Write d(t) = r β (t) r β0 (t). Following the method of obtaining the asymptotic representation for Q(β 0 ) in the proof of Theorem 1, we can show that R 2 has 14

15 influence function Ṽ1 + Ṽ2 + W, where Ṽi and W are defined like V i and W but r(t) in those definitions replaced with d(t). The asymptotic covariance matrix of R 2 can be shown to be the same as the expression given by Eq. (7), but r(t) there as well replaced with d(t). Since β is in the n 1/3 neighborhood of β 0, the covariance matrix is arbitrarily close to the null matrix. It follows that R(t) = o p (n 1/2 ). Therefore we have S n (β) S n (β 0 ) = T n (β) T n (β 0 )+o p (n 1/2 ). Using the local linearity for T n (β), we have that S n (β) = S n (β 0 ) + S n (β) + o p (n 1/2 ), where S n (β) = n {0.5 P (T i β Z i )}Z i = n {0.5 F i((β β 0 ) Z i )}Z i. Local linearity follows by taking a Taylor s expansion of S n (β) about β 0. References 1. P.K. Andersen, O.Borgan, R.D. Gill, N. Keiding, Statistical Models Based on Counting Processes, Springer-Verlag, New York, H. Bang, A.A. Tsiatis, Median regression with censored cost data, Biometrics 58 (2002) I.V. Basawa, H.L. Koul, Large-sample statistics based on quadratic dispersion, Int. Statist. Rev. 56 (1980) D.R. Cox, E.J. Snell, Analysis of binary data, Chapman and Hall, London, S. Csörgő, L. Horváth, The rate of strong uniform consistency for the product-limit estimator, Z. Wahrsch. Verw. Gebiete. 62 (1983) G. Dikta, On semiparametric random censorship models, J. Statist. Plann. Inference 66 (1998)

16 7. G. Dikta, M. Kvesic, C. Schmidt, Bootstrap Approximations in Model Checks for Binary Data, J. Amer. Statist. Assoc. 101 (2006) B. Honoré, S. Khan, J.L. Powell, Quantile regression under random censoring, J. Econometrics 109 (2002) Z. Jin, Z. Ying, L.J. Wei, A simple resampling method by perturbing the minimand, Biometrika 88 (2001) H. Koul, V. Susarla, J. Van Ryzin, Regression analysis with randomly right censored data, Ann. Statist. 9 (1981) A. Lindgren, Quantile regression with censored data using generalized L 1 minimization, Comput. Statist. Data Anal. 23 (1997) K. Lu, A.A. Tsiatis, Multiple imputation methods for estimating regression coefficients in proportional hazards models with missing cause of failure, Biometrics 57 (2001) I.W. McKeague, S. Subramanian, Y. Sun, Median regression and the missing information principle, J. Nonparametr. Stat. 13 (2001) W.R. Mebane, Jr., J.S. Sekhon, Genetic optimization using derivatives: The rgenoud package for R, To appear in J. Statist. Software. 15. J.L. Powell, Censored Regression Quantiles, J. Econometrics 32 (1986) S. Portnoy, Censored Regression Quantiles, J. Amer. Statist. Assoc. 98 (2003)

17 17. L. Peng, Y. Huang, Survival analysis with quantile regression models, J. Ameri. Statist. Assoc. 103 (2008) G. Qin, M. Tsao, Empirical likelihood inference for median regression models for censored survival data, J. Multivar. Anal. 85 (2003) J.M. Robins, A. Rotnitzky, Recovery of information and adjustment for dependent censoring using surrogate markers, in AIDS Epidemiology - Methodological Issues, eds. N. Jewell, K. Dietz, and V. Farewell, Boston: Birkhauser, (1992) S. Shklyar, H. Schneeweiss, A comparison of asymptotic covariance matrices of three consistent estimators in the Poisson regression model with measurement errors, J. Multivar. Anal. 94 (2005) S. Subramanian, Median regression using nonparametric kernel estimation, J. Nonparametr. Statist. 14 (2002) S. Subramanian, Asymptotically efficient estimation of a survival function in the missing censoring indicator model, J. Nonparametr. Stat. 16 (2004a) S. Subramanian, The missing censoring-indicator model of random censorship, Handbook of Statistics 23: Advances in Survival Analysis. Eds. N. Balakrishnan and C.R.Rao (2004b) S. Subramanian, Median regression analysis from data with left and right censored observations, Stat. Methodol. 4 (2007a) S. Subramanian, Censored median regression and profile empirical likelihood, Stat. Methodol. 4 (2007b)

18 26. S. Yang, Censored median regression using weighted empirical survival and hazard functions, J. Amer. Statist. Assoc. 94 (1999) G. Yin, D. Zeng, H.Li, Power-transformed linear quantile regression with censored data, J. Ameri. Statist. Assoc. 103 (2008) Z. Ying, S. Jung, L.J. Wei, Survival analysis with median regression models, J. Amer. Statist. Assoc. 90 (1995) Estimated m(t) c c c cc c cc c cc cccc c ccc ccccc c c ccccccc c ccccccccccccc cc ccc c c c t Figure 1: Plot of the fitted model for m(t) for the lung cancer data. 18

19 Estimated G(t) Kaplan Meier estimator Semiparametric estimator t Figure 2: Plot of the Kaplan Meier and Semiparametric random censorship model based survival curves of the censoring distribution for lung cancer data. 19

An augmented inverse probability weighted survival function estimator

An augmented inverse probability weighted survival function estimator An augmented inverse probability weighted survival function estimator Sundarraman Subramanian & Dipankar Bandyopadhyay Abstract We analyze an augmented inverse probability of non-missingness weighted estimator

More information

AFT Models and Empirical Likelihood

AFT Models and Empirical Likelihood AFT Models and Empirical Likelihood Mai Zhou Department of Statistics, University of Kentucky Collaborators: Gang Li (UCLA); A. Bathke; M. Kim (Kentucky) Accelerated Failure Time (AFT) models: Y = log(t

More information

Quantile Regression for Residual Life and Empirical Likelihood

Quantile Regression for Residual Life and Empirical Likelihood Quantile Regression for Residual Life and Empirical Likelihood Mai Zhou email: mai@ms.uky.edu Department of Statistics, University of Kentucky, Lexington, KY 40506-0027, USA Jong-Hyeon Jeong email: jeong@nsabp.pitt.edu

More information

Model checks via bootstrap when there are missing binary data.

Model checks via bootstrap when there are missing binary data. 1 INTRODUCTION Model checks via bootstrap when there are missing binary data. Gerhard Dikta a, Sundarraman Subramanian b, Thorsten Winkler a a Department of Medizintechnik und Technomathematik, Fachhochschule

More information

Efficiency of Profile/Partial Likelihood in the Cox Model

Efficiency of Profile/Partial Likelihood in the Cox Model Efficiency of Profile/Partial Likelihood in the Cox Model Yuichi Hirose School of Mathematics, Statistics and Operations Research, Victoria University of Wellington, New Zealand Summary. This paper shows

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

FULL LIKELIHOOD INFERENCES IN THE COX MODEL October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach

More information

STAT Sample Problem: General Asymptotic Results

STAT Sample Problem: General Asymptotic Results STAT331 1-Sample Problem: General Asymptotic Results In this unit we will consider the 1-sample problem and prove the consistency and asymptotic normality of the Nelson-Aalen estimator of the cumulative

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 24 Paper 153 A Note on Empirical Likelihood Inference of Residual Life Regression Ying Qing Chen Yichuan

More information

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky Empirical likelihood with right censored data were studied by Thomas and Grunkmier (1975), Li (1995),

More information

Tests of independence for censored bivariate failure time data

Tests of independence for censored bivariate failure time data Tests of independence for censored bivariate failure time data Abstract Bivariate failure time data is widely used in survival analysis, for example, in twins study. This article presents a class of χ

More information

EMPIRICAL ENVELOPE MLE AND LR TESTS. Mai Zhou University of Kentucky

EMPIRICAL ENVELOPE MLE AND LR TESTS. Mai Zhou University of Kentucky EMPIRICAL ENVELOPE MLE AND LR TESTS Mai Zhou University of Kentucky Summary We study in this paper some nonparametric inference problems where the nonparametric maximum likelihood estimator (NPMLE) are

More information

Size and Shape of Confidence Regions from Extended Empirical Likelihood Tests

Size and Shape of Confidence Regions from Extended Empirical Likelihood Tests Biometrika (2014),,, pp. 1 13 C 2014 Biometrika Trust Printed in Great Britain Size and Shape of Confidence Regions from Extended Empirical Likelihood Tests BY M. ZHOU Department of Statistics, University

More information

EMPIRICAL LIKELIHOOD ANALYSIS FOR THE HETEROSCEDASTIC ACCELERATED FAILURE TIME MODEL

EMPIRICAL LIKELIHOOD ANALYSIS FOR THE HETEROSCEDASTIC ACCELERATED FAILURE TIME MODEL Statistica Sinica 22 (2012), 295-316 doi:http://dx.doi.org/10.5705/ss.2010.190 EMPIRICAL LIKELIHOOD ANALYSIS FOR THE HETEROSCEDASTIC ACCELERATED FAILURE TIME MODEL Mai Zhou 1, Mi-Ok Kim 2, and Arne C.

More information

Statistical Analysis of Competing Risks With Missing Causes of Failure

Statistical Analysis of Competing Risks With Missing Causes of Failure Proceedings 59th ISI World Statistics Congress, 25-3 August 213, Hong Kong (Session STS9) p.1223 Statistical Analysis of Competing Risks With Missing Causes of Failure Isha Dewan 1,3 and Uttara V. Naik-Nimbalkar

More information

11 Survival Analysis and Empirical Likelihood

11 Survival Analysis and Empirical Likelihood 11 Survival Analysis and Empirical Likelihood The first paper of empirical likelihood is actually about confidence intervals with the Kaplan-Meier estimator (Thomas and Grunkmeier 1979), i.e. deals with

More information

Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models

Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models NIH Talk, September 03 Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models Eric Slud, Math Dept, Univ of Maryland Ongoing joint project with Ilia

More information

Estimation and Inference of Quantile Regression. for Survival Data under Biased Sampling

Estimation and Inference of Quantile Regression. for Survival Data under Biased Sampling Estimation and Inference of Quantile Regression for Survival Data under Biased Sampling Supplementary Materials: Proofs of the Main Results S1 Verification of the weight function v i (t) for the lengthbiased

More information

Empirical Likelihood in Survival Analysis

Empirical Likelihood in Survival Analysis Empirical Likelihood in Survival Analysis Gang Li 1, Runze Li 2, and Mai Zhou 3 1 Department of Biostatistics, University of California, Los Angeles, CA 90095 vli@ucla.edu 2 Department of Statistics, The

More information

Likelihood ratio confidence bands in nonparametric regression with censored data

Likelihood ratio confidence bands in nonparametric regression with censored data Likelihood ratio confidence bands in nonparametric regression with censored data Gang Li University of California at Los Angeles Department of Biostatistics Ingrid Van Keilegom Eindhoven University of

More information

Product-limit estimators of the survival function with left or right censored data

Product-limit estimators of the survival function with left or right censored data Product-limit estimators of the survival function with left or right censored data 1 CREST-ENSAI Campus de Ker-Lann Rue Blaise Pascal - BP 37203 35172 Bruz cedex, France (e-mail: patilea@ensai.fr) 2 Institut

More information

Comparing Distribution Functions via Empirical Likelihood

Comparing Distribution Functions via Empirical Likelihood Georgia State University ScholarWorks @ Georgia State University Mathematics and Statistics Faculty Publications Department of Mathematics and Statistics 25 Comparing Distribution Functions via Empirical

More information

1 Glivenko-Cantelli type theorems

1 Glivenko-Cantelli type theorems STA79 Lecture Spring Semester Glivenko-Cantelli type theorems Given i.i.d. observations X,..., X n with unknown distribution function F (t, consider the empirical (sample CDF ˆF n (t = I [Xi t]. n Then

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH Jian-Jian Ren 1 and Mai Zhou 2 University of Central Florida and University of Kentucky Abstract: For the regression parameter

More information

CENSORED QUANTILE REGRESSION WITH COVARIATE MEASUREMENT ERRORS

CENSORED QUANTILE REGRESSION WITH COVARIATE MEASUREMENT ERRORS Statistica Sinica 21 211, 949-971 CENSORED QUANTILE REGRESSION WITH COVARIATE MEASUREMENT ERRORS Yanyuan Ma and Guosheng Yin Texas A&M University and The University of Hong Kong Abstract: We study censored

More information

Least Absolute Deviations Estimation for the Accelerated Failure Time Model. University of Iowa. *

Least Absolute Deviations Estimation for the Accelerated Failure Time Model. University of Iowa. * Least Absolute Deviations Estimation for the Accelerated Failure Time Model Jian Huang 1,2, Shuangge Ma 3, and Huiliang Xie 1 1 Department of Statistics and Actuarial Science, and 2 Program in Public Health

More information

LEAST ABSOLUTE DEVIATIONS ESTIMATION FOR THE ACCELERATED FAILURE TIME MODEL

LEAST ABSOLUTE DEVIATIONS ESTIMATION FOR THE ACCELERATED FAILURE TIME MODEL Statistica Sinica 17(2007), 1533-1548 LEAST ABSOLUTE DEVIATIONS ESTIMATION FOR THE ACCELERATED FAILURE TIME MODEL Jian Huang 1, Shuangge Ma 2 and Huiliang Xie 1 1 University of Iowa and 2 Yale University

More information

Estimation of the Bivariate and Marginal Distributions with Censored Data

Estimation of the Bivariate and Marginal Distributions with Censored Data Estimation of the Bivariate and Marginal Distributions with Censored Data Michael Akritas and Ingrid Van Keilegom Penn State University and Eindhoven University of Technology May 22, 2 Abstract Two new

More information

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model Other Survival Models (1) Non-PH models We briefly discussed the non-proportional hazards (non-ph) model λ(t Z) = λ 0 (t) exp{β(t) Z}, where β(t) can be estimated by: piecewise constants (recall how);

More information

Chapter 2 Inference on Mean Residual Life-Overview

Chapter 2 Inference on Mean Residual Life-Overview Chapter 2 Inference on Mean Residual Life-Overview Statistical inference based on the remaining lifetimes would be intuitively more appealing than the popular hazard function defined as the risk of immediate

More information

Package Rsurrogate. October 20, 2016

Package Rsurrogate. October 20, 2016 Type Package Package Rsurrogate October 20, 2016 Title Robust Estimation of the Proportion of Treatment Effect Explained by Surrogate Marker Information Version 2.0 Date 2016-10-19 Author Layla Parast

More information

log T = β T Z + ɛ Zi Z(u; β) } dn i (ue βzi ) = 0,

log T = β T Z + ɛ Zi Z(u; β) } dn i (ue βzi ) = 0, Accelerated failure time model: log T = β T Z + ɛ β estimation: solve where S n ( β) = n i=1 { Zi Z(u; β) } dn i (ue βzi ) = 0, Z(u; β) = j Z j Y j (ue βz j) j Y j (ue βz j) How do we show the asymptotics

More information

Empirical Processes & Survival Analysis. The Functional Delta Method

Empirical Processes & Survival Analysis. The Functional Delta Method STAT/BMI 741 University of Wisconsin-Madison Empirical Processes & Survival Analysis Lecture 3 The Functional Delta Method Lu Mao lmao@biostat.wisc.edu 3-1 Objectives By the end of this lecture, you will

More information

Lecture 5 Models and methods for recurrent event data

Lecture 5 Models and methods for recurrent event data Lecture 5 Models and methods for recurrent event data Recurrent and multiple events are commonly encountered in longitudinal studies. In this chapter we consider ordered recurrent and multiple events.

More information

Asymptotic Properties of Kaplan-Meier Estimator. for Censored Dependent Data. Zongwu Cai. Department of Mathematics

Asymptotic Properties of Kaplan-Meier Estimator. for Censored Dependent Data. Zongwu Cai. Department of Mathematics To appear in Statist. Probab. Letters, 997 Asymptotic Properties of Kaplan-Meier Estimator for Censored Dependent Data by Zongwu Cai Department of Mathematics Southwest Missouri State University Springeld,

More information

Power-Transformed Linear Quantile Regression With Censored Data

Power-Transformed Linear Quantile Regression With Censored Data Power-Transformed Linear Quantile Regression With Censored Data Guosheng YIN, Donglin ZENG, and Hui LI We propose a class of power-transformed linear quantile regression models for survival data subject

More information

UNIVERSITY OF CALIFORNIA, SAN DIEGO

UNIVERSITY OF CALIFORNIA, SAN DIEGO UNIVERSITY OF CALIFORNIA, SAN DIEGO Estimation of the primary hazard ratio in the presence of a secondary covariate with non-proportional hazards An undergraduate honors thesis submitted to the Department

More information

Smoothed and Corrected Score Approach to Censored Quantile Regression With Measurement Errors

Smoothed and Corrected Score Approach to Censored Quantile Regression With Measurement Errors Smoothed and Corrected Score Approach to Censored Quantile Regression With Measurement Errors Yuanshan Wu, Yanyuan Ma, and Guosheng Yin Abstract Censored quantile regression is an important alternative

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 01: Introduction and Overview

Introduction to Empirical Processes and Semiparametric Inference Lecture 01: Introduction and Overview Introduction to Empirical Processes and Semiparametric Inference Lecture 01: Introduction and Overview Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations

More information

Likelihood Construction, Inference for Parametric Survival Distributions

Likelihood Construction, Inference for Parametric Survival Distributions Week 1 Likelihood Construction, Inference for Parametric Survival Distributions In this section we obtain the likelihood function for noninformatively rightcensored survival data and indicate how to make

More information

TMA 4275 Lifetime Analysis June 2004 Solution

TMA 4275 Lifetime Analysis June 2004 Solution TMA 4275 Lifetime Analysis June 2004 Solution Problem 1 a) Observation of the outcome is censored, if the time of the outcome is not known exactly and only the last time when it was observed being intact,

More information

Accelerated Failure Time Models: A Review

Accelerated Failure Time Models: A Review International Journal of Performability Engineering, Vol. 10, No. 01, 2014, pp.23-29. RAMS Consultants Printed in India Accelerated Failure Time Models: A Review JEAN-FRANÇOIS DUPUY * IRMAR/INSA of Rennes,

More information

Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm

Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm Mai Zhou 1 University of Kentucky, Lexington, KY 40506 USA Summary. Empirical likelihood ratio method (Thomas and Grunkmier

More information

STAT331. Cox s Proportional Hazards Model

STAT331. Cox s Proportional Hazards Model STAT331 Cox s Proportional Hazards Model In this unit we introduce Cox s proportional hazards (Cox s PH) model, give a heuristic development of the partial likelihood function, and discuss adaptations

More information

TESTINGGOODNESSOFFITINTHECOX AALEN MODEL

TESTINGGOODNESSOFFITINTHECOX AALEN MODEL ROBUST 24 c JČMF 24 TESTINGGOODNESSOFFITINTHECOX AALEN MODEL David Kraus Keywords: Counting process, Cox Aalen model, goodness-of-fit, martingale, residual, survival analysis. Abstract: The Cox Aalen regression

More information

STAT 331. Martingale Central Limit Theorem and Related Results

STAT 331. Martingale Central Limit Theorem and Related Results STAT 331 Martingale Central Limit Theorem and Related Results In this unit we discuss a version of the martingale central limit theorem, which states that under certain conditions, a sum of orthogonal

More information

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Takeshi Emura and Hisayuki Tsukuma Abstract For testing the regression parameter in multivariate

More information

Smooth nonparametric estimation of a quantile function under right censoring using beta kernels

Smooth nonparametric estimation of a quantile function under right censoring using beta kernels Smooth nonparametric estimation of a quantile function under right censoring using beta kernels Chanseok Park 1 Department of Mathematical Sciences, Clemson University, Clemson, SC 29634 Short Title: Smooth

More information

Full likelihood inferences in the Cox model: an empirical likelihood approach

Full likelihood inferences in the Cox model: an empirical likelihood approach Ann Inst Stat Math 2011) 63:1005 1018 DOI 10.1007/s10463-010-0272-y Full likelihood inferences in the Cox model: an empirical likelihood approach Jian-Jian Ren Mai Zhou Received: 22 September 2008 / Revised:

More information

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS Statistica Sinica 20 (2010), 441-453 GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS Antai Wang Georgetown University Medical Center Abstract: In this paper, we propose two tests for parametric models

More information

Quantile Regression for Recurrent Gap Time Data

Quantile Regression for Recurrent Gap Time Data Biometrics 000, 1 21 DOI: 000 000 0000 Quantile Regression for Recurrent Gap Time Data Xianghua Luo 1,, Chiung-Yu Huang 2, and Lan Wang 3 1 Division of Biostatistics, School of Public Health, University

More information

1 Introduction. 2 Residuals in PH model

1 Introduction. 2 Residuals in PH model Supplementary Material for Diagnostic Plotting Methods for Proportional Hazards Models With Time-dependent Covariates or Time-varying Regression Coefficients BY QIQING YU, JUNYI DONG Department of Mathematical

More information

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints Noname manuscript No. (will be inserted by the editor) A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints Mai Zhou Yifan Yang Received: date / Accepted: date Abstract In this note

More information

A note on L convergence of Neumann series approximation in missing data problems

A note on L convergence of Neumann series approximation in missing data problems A note on L convergence of Neumann series approximation in missing data problems Hua Yun Chen Division of Epidemiology & Biostatistics School of Public Health University of Illinois at Chicago 1603 West

More information

Lecture 3. Truncation, length-bias and prevalence sampling

Lecture 3. Truncation, length-bias and prevalence sampling Lecture 3. Truncation, length-bias and prevalence sampling 3.1 Prevalent sampling Statistical techniques for truncated data have been integrated into survival analysis in last two decades. Truncation in

More information

Power and Sample Size Calculations with the Additive Hazards Model

Power and Sample Size Calculations with the Additive Hazards Model Journal of Data Science 10(2012), 143-155 Power and Sample Size Calculations with the Additive Hazards Model Ling Chen, Chengjie Xiong, J. Philip Miller and Feng Gao Washington University School of Medicine

More information

and Comparison with NPMLE

and Comparison with NPMLE NONPARAMETRIC BAYES ESTIMATOR OF SURVIVAL FUNCTIONS FOR DOUBLY/INTERVAL CENSORED DATA and Comparison with NPMLE Mai Zhou Department of Statistics, University of Kentucky, Lexington, KY 40506 USA http://ms.uky.edu/

More information

asymptotic normality of nonparametric M-estimators with applications to hypothesis testing for panel count data

asymptotic normality of nonparametric M-estimators with applications to hypothesis testing for panel count data asymptotic normality of nonparametric M-estimators with applications to hypothesis testing for panel count data Xingqiu Zhao and Ying Zhang The Hong Kong Polytechnic University and Indiana University Abstract:

More information

CALCULATION METHOD FOR NONLINEAR DYNAMIC LEAST-ABSOLUTE DEVIATIONS ESTIMATOR

CALCULATION METHOD FOR NONLINEAR DYNAMIC LEAST-ABSOLUTE DEVIATIONS ESTIMATOR J. Japan Statist. Soc. Vol. 3 No. 200 39 5 CALCULAION MEHOD FOR NONLINEAR DYNAMIC LEAS-ABSOLUE DEVIAIONS ESIMAOR Kohtaro Hitomi * and Masato Kagihara ** In a nonlinear dynamic model, the consistency and

More information

Two-stage Adaptive Randomization for Delayed Response in Clinical Trials

Two-stage Adaptive Randomization for Delayed Response in Clinical Trials Two-stage Adaptive Randomization for Delayed Response in Clinical Trials Guosheng Yin Department of Statistics and Actuarial Science The University of Hong Kong Joint work with J. Xu PSI and RSS Journal

More information

Practical Bayesian Quantile Regression. Keming Yu University of Plymouth, UK

Practical Bayesian Quantile Regression. Keming Yu University of Plymouth, UK Practical Bayesian Quantile Regression Keming Yu University of Plymouth, UK (kyu@plymouth.ac.uk) A brief summary of some recent work of us (Keming Yu, Rana Moyeed and Julian Stander). Summary We develops

More information

SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES

SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES Statistica Sinica 19 (2009), 71-81 SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES Song Xi Chen 1,2 and Chiu Min Wong 3 1 Iowa State University, 2 Peking University and

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models

Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations

More information

Smoothing the Nelson-Aalen Estimtor Biostat 277 presentation Chi-hong Tseng

Smoothing the Nelson-Aalen Estimtor Biostat 277 presentation Chi-hong Tseng Smoothing the Nelson-Aalen Estimtor Biostat 277 presentation Chi-hong seng Reference: 1. Andersen, Borgan, Gill, and Keiding (1993). Statistical Model Based on Counting Processes, Springer-Verlag, p.229-255

More information

Longitudinal + Reliability = Joint Modeling

Longitudinal + Reliability = Joint Modeling Longitudinal + Reliability = Joint Modeling Carles Serrat Institute of Statistics and Mathematics Applied to Building CYTED-HAROSA International Workshop November 21-22, 2013 Barcelona Mainly from Rizopoulos,

More information

Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon

Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon Discussion of the paper Inference for Semiparametric Models: Some Questions and an Answer by Bickel and Kwon Jianqing Fan Department of Statistics Chinese University of Hong Kong AND Department of Statistics

More information

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data Malaysian Journal of Mathematical Sciences 11(3): 33 315 (217) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal homepage: http://einspem.upm.edu.my/journal Approximation of Survival Function by Taylor

More information

Rank Regression Analysis of Multivariate Failure Time Data Based on Marginal Linear Models

Rank Regression Analysis of Multivariate Failure Time Data Based on Marginal Linear Models doi: 10.1111/j.1467-9469.2005.00487.x Published by Blacwell Publishing Ltd, 9600 Garsington Road, Oxford OX4 2DQ, UK and 350 Main Street, Malden, MA 02148, USA Vol 33: 1 23, 2006 Ran Regression Analysis

More information

On the Breslow estimator

On the Breslow estimator Lifetime Data Anal (27) 13:471 48 DOI 1.17/s1985-7-948-y On the Breslow estimator D. Y. Lin Received: 5 April 27 / Accepted: 16 July 27 / Published online: 2 September 27 Springer Science+Business Media,

More information

STAT 331. Accelerated Failure Time Models. Previously, we have focused on multiplicative intensity models, where

STAT 331. Accelerated Failure Time Models. Previously, we have focused on multiplicative intensity models, where STAT 331 Accelerated Failure Time Models Previously, we have focused on multiplicative intensity models, where h t z) = h 0 t) g z). These can also be expressed as H t z) = H 0 t) g z) or S t z) = e Ht

More information

BARTLETT IDENTITIES AND LARGE DEVIATIONS IN LIKELIHOOD THEORY 1. By Per Aslak Mykland University of Chicago

BARTLETT IDENTITIES AND LARGE DEVIATIONS IN LIKELIHOOD THEORY 1. By Per Aslak Mykland University of Chicago The Annals of Statistics 1999, Vol. 27, No. 3, 1105 1117 BARTLETT IDENTITIES AND LARGE DEVIATIONS IN LIKELIHOOD THEORY 1 By Per Aslak Mykland University of Chicago The connection between large and small

More information

ANALYSIS OF COMPETING RISKS DATA WITH MISSING CAUSE OF FAILURE UNDER ADDITIVE HAZARDS MODEL

ANALYSIS OF COMPETING RISKS DATA WITH MISSING CAUSE OF FAILURE UNDER ADDITIVE HAZARDS MODEL Statistica Sinica 18(28, 219-234 ANALYSIS OF COMPETING RISKS DATA WITH MISSING CAUSE OF FAILURE UNDER ADDITIVE HAZARDS MODEL Wenbin Lu and Yu Liang North Carolina State University and SAS Institute Inc.

More information

Analysis of least absolute deviation

Analysis of least absolute deviation Analysis of least absolute deviation By KANI CHEN Department of Mathematics, HKUST, Kowloon, Hong Kong makchen@ust.hk ZHILIANG YING Department of Statistics, Columbia University, NY, NY, 10027, U.S.A.

More information

4. Comparison of Two (K) Samples

4. Comparison of Two (K) Samples 4. Comparison of Two (K) Samples K=2 Problem: compare the survival distributions between two groups. E: comparing treatments on patients with a particular disease. Z: Treatment indicator, i.e. Z = 1 for

More information

Competing risks data analysis under the accelerated failure time model with missing cause of failure

Competing risks data analysis under the accelerated failure time model with missing cause of failure Ann Inst Stat Math 2016 68:855 876 DOI 10.1007/s10463-015-0516-y Competing risks data analysis under the accelerated failure time model with missing cause of failure Ming Zheng Renxin Lin Wen Yu Received:

More information

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Anastasios (Butch) Tsiatis Department of Statistics North Carolina State University http://www.stat.ncsu.edu/

More information

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA Kasun Rathnayake ; A/Prof Jun Ma Department of Statistics Faculty of Science and Engineering Macquarie University

More information

The International Journal of Biostatistics

The International Journal of Biostatistics The International Journal of Biostatistics Volume 1, Issue 1 2005 Article 3 Score Statistics for Current Status Data: Comparisons with Likelihood Ratio and Wald Statistics Moulinath Banerjee Jon A. Wellner

More information

Goodness-of-fit test for the Cox Proportional Hazard Model

Goodness-of-fit test for the Cox Proportional Hazard Model Goodness-of-fit test for the Cox Proportional Hazard Model Rui Cui rcui@eco.uc3m.es Department of Economics, UC3M Abstract In this paper, we develop new goodness-of-fit tests for the Cox proportional hazard

More information

Censored quantile regression with varying coefficients

Censored quantile regression with varying coefficients Title Censored quantile regression with varying coefficients Author(s) Yin, G; Zeng, D; Li, H Citation Statistica Sinica, 2013, p. 1-24 Issued Date 2013 URL http://hdl.handle.net/10722/189454 Rights Statistica

More information

On robust and efficient estimation of the center of. Symmetry.

On robust and efficient estimation of the center of. Symmetry. On robust and efficient estimation of the center of symmetry Howard D. Bondell Department of Statistics, North Carolina State University Raleigh, NC 27695-8203, U.S.A (email: bondell@stat.ncsu.edu) Abstract

More information

Goodness-of-fit tests for the cure rate in a mixture cure model

Goodness-of-fit tests for the cure rate in a mixture cure model Biometrika (217), 13, 1, pp. 1 7 Printed in Great Britain Advance Access publication on 31 July 216 Goodness-of-fit tests for the cure rate in a mixture cure model BY U.U. MÜLLER Department of Statistics,

More information

Regression and Statistical Inference

Regression and Statistical Inference Regression and Statistical Inference Walid Mnif wmnif@uwo.ca Department of Applied Mathematics The University of Western Ontario, London, Canada 1 Elements of Probability 2 Elements of Probability CDF&PDF

More information

Approximate Self Consistency for Middle-Censored Data

Approximate Self Consistency for Middle-Censored Data Approximate Self Consistency for Middle-Censored Data by S. Rao Jammalamadaka Department of Statistics & Applied Probability, University of California, Santa Barbara, CA 93106, USA. and Srikanth K. Iyer

More information

Modification and Improvement of Empirical Likelihood for Missing Response Problem

Modification and Improvement of Empirical Likelihood for Missing Response Problem UW Biostatistics Working Paper Series 12-30-2010 Modification and Improvement of Empirical Likelihood for Missing Response Problem Kwun Chuen Gary Chan University of Washington - Seattle Campus, kcgchan@u.washington.edu

More information

3 Joint Distributions 71

3 Joint Distributions 71 2.2.3 The Normal Distribution 54 2.2.4 The Beta Density 58 2.3 Functions of a Random Variable 58 2.4 Concluding Remarks 64 2.5 Problems 64 3 Joint Distributions 71 3.1 Introduction 71 3.2 Discrete Random

More information

Partial Rank Estimation of Transformation Models with General forms of Censoring

Partial Rank Estimation of Transformation Models with General forms of Censoring Partial Rank Estimation of Transformation Models with General forms of Censoring Shakeeb Khan a and Elie Tamer b a Department of Economics, University of Rochester, Rochester NY 14627, USA b Department

More information

Lecture 22 Survival Analysis: An Introduction

Lecture 22 Survival Analysis: An Introduction University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 22 Survival Analysis: An Introduction There is considerable interest among economists in models of durations, which

More information

TESTS FOR LOCATION WITH K SAMPLES UNDER THE KOZIOL-GREEN MODEL OF RANDOM CENSORSHIP Key Words: Ke Wu Department of Mathematics University of Mississip

TESTS FOR LOCATION WITH K SAMPLES UNDER THE KOZIOL-GREEN MODEL OF RANDOM CENSORSHIP Key Words: Ke Wu Department of Mathematics University of Mississip TESTS FOR LOCATION WITH K SAMPLES UNDER THE KOIOL-GREEN MODEL OF RANDOM CENSORSHIP Key Words: Ke Wu Department of Mathematics University of Mississippi University, MS38677 K-sample location test, Koziol-Green

More information

Supplement to Quantile-Based Nonparametric Inference for First-Price Auctions

Supplement to Quantile-Based Nonparametric Inference for First-Price Auctions Supplement to Quantile-Based Nonparametric Inference for First-Price Auctions Vadim Marmer University of British Columbia Artyom Shneyerov CIRANO, CIREQ, and Concordia University August 30, 2010 Abstract

More information

Nonparametric Estimation for Semi-Markov Processes Based on K-Sample Paths with Application to Reliability

Nonparametric Estimation for Semi-Markov Processes Based on K-Sample Paths with Application to Reliability Nonparametric Estimation for Semi-Markov Processes Based on K-Sample Paths with Application to Reliability Nikolaos Limnios 1 and Brahim Ouhbi 2 1 Laboratoire de Mathématiques Appliquées, Université de

More information

Marginal Screening and Post-Selection Inference

Marginal Screening and Post-Selection Inference Marginal Screening and Post-Selection Inference Ian McKeague August 13, 2017 Ian McKeague (Columbia University) Marginal Screening August 13, 2017 1 / 29 Outline 1 Background on Marginal Screening 2 2

More information

Nonparametric Bayes Estimator of Survival Function for Right-Censoring and Left-Truncation Data

Nonparametric Bayes Estimator of Survival Function for Right-Censoring and Left-Truncation Data Nonparametric Bayes Estimator of Survival Function for Right-Censoring and Left-Truncation Data Mai Zhou and Julia Luan Department of Statistics University of Kentucky Lexington, KY 40506-0027, U.S.A.

More information

Frailty Models and Copulas: Similarities and Differences

Frailty Models and Copulas: Similarities and Differences Frailty Models and Copulas: Similarities and Differences KLARA GOETHALS, PAUL JANSSEN & LUC DUCHATEAU Department of Physiology and Biometrics, Ghent University, Belgium; Center for Statistics, Hasselt

More information

UNIVERSITÄT POTSDAM Institut für Mathematik

UNIVERSITÄT POTSDAM Institut für Mathematik UNIVERSITÄT POTSDAM Institut für Mathematik Testing the Acceleration Function in Life Time Models Hannelore Liero Matthias Liero Mathematische Statistik und Wahrscheinlichkeitstheorie Universität Potsdam

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2009 Paper 248 Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials Kelly

More information

High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data

High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data Song Xi CHEN Guanghua School of Management and Center for Statistical Science, Peking University Department

More information

Survival Analysis for Case-Cohort Studies

Survival Analysis for Case-Cohort Studies Survival Analysis for ase-ohort Studies Petr Klášterecký Dept. of Probability and Mathematical Statistics, Faculty of Mathematics and Physics, harles University, Prague, zech Republic e-mail: petr.klasterecky@matfyz.cz

More information

A GENERALIZED ADDITIVE REGRESSION MODEL FOR SURVIVAL TIMES 1. By Thomas H. Scheike University of Copenhagen

A GENERALIZED ADDITIVE REGRESSION MODEL FOR SURVIVAL TIMES 1. By Thomas H. Scheike University of Copenhagen The Annals of Statistics 21, Vol. 29, No. 5, 1344 136 A GENERALIZED ADDITIVE REGRESSION MODEL FOR SURVIVAL TIMES 1 By Thomas H. Scheike University of Copenhagen We present a non-parametric survival model

More information

Efficiency Comparison Between Mean and Log-rank Tests for. Recurrent Event Time Data

Efficiency Comparison Between Mean and Log-rank Tests for. Recurrent Event Time Data Efficiency Comparison Between Mean and Log-rank Tests for Recurrent Event Time Data Wenbin Lu Department of Statistics, North Carolina State University, Raleigh, NC 27695 Email: lu@stat.ncsu.edu Summary.

More information

Published online: 10 Apr 2012.

Published online: 10 Apr 2012. This article was downloaded by: Columbia University] On: 23 March 215, At: 12:7 Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 172954 Registered office: Mortimer

More information