Consistency of the Semi-parametric MLE under the Cox Model with Linearly Time-dependent Covariates and Interval-censored Data

Size: px
Start display at page:

Download "Consistency of the Semi-parametric MLE under the Cox Model with Linearly Time-dependent Covariates and Interval-censored Data"

Transcription

1 Journal of Advanced Statistics, Vol. 4, No. 1, March Consistency of the Semi-parametric MLE under the Cox Model with Linearly Time-dependent Covariates and Interval-censored Data Qiqing Yu * and Qinggang Diao Department of Mathematical Sciences, SUNY, Binghamton, NY 13902, USA qyu@math.binghamton.edu Abstract Yu and Diao [1] studied the estimation problem under the Cox model with linearly time-dependent covariates and with interval-censored (IC) data under the distribution-free set-up. They proposed a modified semi-parametric MLE (MSMLE) and the simulation results suggest that the MSMLE is consistent. In this paper, we establish the consistency of the MSMLE. Keywords: Cox s model, time-dependent covariates, semi-parametric MLE, consistency. 1 Introduction In this paper, we establish the consistency of the semiparametric estimator proposed in Yu and Diao [1] under the proportional hazards (PH) model with a special continuous time-dependent covariates and with interval-censored data. Let Y be a continuous random variable, with the cumulative distribution function (cdf) F Y, where F Y (t) = P (Y t). Its survival function is denoted by S Y (t) = 1 F Y (t), its density function by f Y (t), f Y (t) S Y (t ) and its hazard function by h Y (t) =. Let z be a p 1 covariate vector. We say that (z, Y ) follows the PH (or Cox) model if the conditional hazard satisfies h(t z) = h Y z(t z) = h o (t)e βz, for t < τ, (1.1) where βz = β z, β is the transpose of the p 1 vector β, τ = supt : h o (t) > 0}, and h o is the hazard function of Y (z = 0). The PH model has been extended to the time-dependent covariates PH (TDCPH) model (see [2] p.113). For instance, replace z in (1.1) by z = z(t) = uβg(t), where u is a time-independent covariate, and g(t) is a function of the time t, e.g., g(t) = (t a)1(t a), where 1(A) is the indicator function of an event A (see [2] p.113). We shall call the latter case the PH model with linearly timedependent covariates (LDCPH model). Interval-censored (IC) data are (L i, R i ), i = 1,..., n, where the true survival time Y i (L i, R i ]. A realistic model for the IC data is the mixed case interval censorship model (see [3]). One of its special case is the case 2 interval censorship model (C2 model) (see [4]). The TDCPH model including the LDCPH model has been commonly used for right-censored (RC) data. The semi-parametric estimation with IC data under the LDCPH model was first studied by Yu and Diao [1], that is, h(t z(t)) satisfies Eq. (1.1) with Z(t) = U (t a)1(t a), where U is a time-independent covariate vector, (1.2) a is a real number and both β and S o are unknown. As explained in their paper, this covariate Z(t) is very typical and share the light on how to estimate (β, h o ) under the PH model with IC data and with other types of time-dependent covariates. Under the semi-parametric set-up, the typical estimation approach for right-censored data is the partial likelihood estimation. However, it is well known that this approach only works for right-censored data, but not for IC data (see, for example, Wong and Yu [5]). The main findings about the LDCPH model in Yu and Diao [1] are as follows. (1) Even if the parameter β is not a vector, β may not be identifiable if the support set (of the observable random vector) contains only finitely many elements. This is quite different from the case of the PH model with time-independent covariates, under which β is identifiable even if the support set contains only one point.

2 2 Journal of Advanced Statistics, Vol. 4, No. 1, March 2019 (2) The generalized likelihood function needs to be modified, as it must be of the form of hazard functions under the semi-parametric set-up in (1.2). Otherwise, there is no consistent estimator of β. (3) Several naive modifications on the generalized likelihood function do not lead to consistent estimators. (4) A modified semi-parametric MLE (MSMLE) was proposed and their simulation studies suggested that the MSMLE of β is consistent. In this paper, we prove the consistency of the MSMLE. The main difficulty in the proof is that there may not exist a convergent subsequence of the estimates of h o in (1.1), but the MSMLE is in the form of the estimate of h o. The paper is orgainized as follows. The MSMLE is introduced in Section 2. The consistency of the MSMLE is established in Section 3. The proofs of some lemmas are put in Section 4. 2 The MSMLE The generalized likelihood function with IC data (L i, R i ) s is often given by L = n i=1 µ S( )(I i Z i ), where µ S( ) (I i Z i ) = S(L i Z i ) S(R i Z i ) and I i = (L i, R i ]. (2.1) L depends on the survival function S(t z). Yu and Diao [1] showed that if S(t z) satisfies the PH model and is absolutely continuous, and Z = Z(t) = (t a)u1(t a), then t S(t z(t)) = exp( e βu(x a)1(t a) h o (x)dx) 0 S o (t) if t a or u = 0 = S o (a) exp( t a eβu(x a) h o (x)dx) if t > a and u 0. (2.2) Hereafter, abusing notations, we write S(t u) = S(t z(t)) and h(t u) = h o (t) exp(βu1(t a)(t a)). Notice that the two hazard functions h o (t) = 1(t > 0) and h 1 = 1(t (0, 2) (2, )) both lead to S o (t) = exp( t) if t > 0. Since h o (or f o ) can differ on a set A satisfying A df o(t) = 0, where F F o = 1 S o, we define f o (t) = o(t) if F o(t) exists for identifiability of f o and h o. Let S F be the 0 otherwise support set of the cdf F or a measure df, in the sense that x S F iff F (x + ϵ) F (x ϵ) > 0, ϵ > 0. Proposition 1. (Yu and Diao [1]). The survival function S(t u) is identifiable if t S FL S FR. An identifiability condition for β under the mixed case IC model is that S FL S FR contains infinitely many points t j } j 1 with a limiting point, say t o j t j in (a, ), provided that F o(t o ) > 0. An interval A is called an innermost interval (II) if it is an intersection of the observed intervals I 1,..., I n, and if A I i = A or for each I i. It is well known (see [8]) that under the PH model with time-independent covariates in order to maximize L, it suffices to put the weights of S o to the II s. Moreover, the weight to each II is uniquely determined, but not the distribution of the weight within the II. Let A 1,..., A m be all the II s induced by I i s and let (v j, w j ) be the pair of endpoints of A j, where w 0 = < w 1 < w 2 < < w m. For each i, let ξ i = 1(R i < w m ) and define l i and r i by w ri R i < w ri +1 and w li L i < w li +1. Then the likelihood function in (2.1) becomes L 1 (β, S o ) = (S o (w li ) S o (w ri )) R i a or u i=0 L i<a<r i,u i 0 [exp( w j (a,l i ] S o (w li ) ξ i S o (a) exp( wj w j [a,r i ] wj v j e β(x a) h o (x)dx) ξ i exp( v j e β(x a) h o (x)dx)} w j (a,r i ] wj v j L i>a,u i 0 e β(x a) h o (x)dx)]. S o (a) Since h o is a function of x and needs to be properly defined on [v j, w j ] for all j < m (note that S o (w m ) = 0), Yu and Diao proposed to modify the likelihood as follows. First, let h o (x) = 0 if x / k (v k, w k ], where (v 1, w 1 ],..., (v m, w m ] are all the II s. Moreover, notice that each (v k, w k ] will be contained by several modified observed intervals I i s (see Remark 2 in Yu and (2.3)

3 Journal of Advanced Statistics, Vol. 4, No. 1, March Diao [1]) with J ( 1) distinct values of u i s, where J depends on k. There are two types of (v k, w k ): (1) w k v k 0, or w k a, or w k a 0 and a (v k, w k ]; (2) otherwise. For k < m, define constant on (v k, w k ] if (v k, w k ) belongs to type (1) h o (x) = J-piece-wise constant on (v k, w k ] if (v k, w k ) belongs to type (2) (in particular, if (v k, w k ) belongs to type (2), then h o (x) = J h kj 1(x (v kj, w kj ]) for x (v k, w k ], (2.4) j=1 wk v k where v k = v k1, w k1 = v k2, w k2 = v k3,..., w kj = w k, w kj v kj = J if a (v k, w k ], j 1,..., J} w k v k J 1 if a / (v k, w k ], j 2,..., J} and h kj s are constant. If k = m, simply define S o (w m ) = 0 (h o can be arbitrary on (v m, w m ], provided that h o 0 and wm v m h o (x)dx = ). Abusing notations, let (a j, b j ] be the interval in which h o is constant, as specified in (2.4). Then h o (x) = j h j1(x (a j, b j ]), where h j is a constant and (a j, b j ] may not be an II. Then L 1 in (2.3) becomes L(β, S o ) = n (S(L i U i ) S(R i U i )) i=1 = exp( } h j [b j a j ])[1 exp( h j [b j a j ])] ξ i R i a, or u i=0 b j L i b j (L i,r i] S o (a) exp( e auiβ h j [e uiβbj e uiβaj ]) u i β L i a,u i 0 b j (a,l i ] [1 exp( e au iβ } h j [e u iβb j e u iβa j ])] u i β b j (L i,r i] exp( h j [b j a j ] ξi ) L i <a<r i,u i 0 b j L i [1 exp( h j (b j a j ) e au iβ } h j [e u iβb j e u iβa j ])] ξ i. u i β b j (L i,a] b j (L i,r i ] (2.5) The MSMLE maximizes L over all h j s and β. It is well known that the Newton Raphson method does not work for deriving the SMLE or the MSMLE under the PH model with IC data (see Wong and Yu [5]). Yu and Diao [1] suggested to use the steepest decent method. 3 Consistency In order to simplify the presentation of the proof of consistency, we shall only make use of the C2 model and assume that β (, ). The C2 model assumes that C 1 and C 2 are two random followup times, (L, R) = (, C 1 )1(Y C 1 ) + (C 1, C 2 )1(Y (C 1, C 2 ]) + (C 2, )1(Y > C 2 ), (C 1, C 2 ) and (Y, U) are independent, and P (C 1 < C 2 ) = 1. Define a measure µ on the Borel σ-field B on R 1 by µ(b) = P (C 1 B)+P (C 2 B), B B. Assume that (L i, R i, U i ) s are i.i.d. from F L,R,U ( def = Q). Denote ˆQ = ˆQ(x, y, z) = n i=1 1(L i x, R i y, U i z)/n. Theorem 1. Assume that the censoring satisfies the C2 model, S(t u) satisfies (2.2), the identifiable condition in Proposition 1 holds, and U takes on at least two distinct values. Then Ŝn S o dµ a.s. 0 and a.s. ˆβ n β, where (Ŝn, ˆβ n ) is obtained by maximizing L in (2.5). Let Ω be the sample space. In order to prove Theorem 1, we shall make use of the next 3 lemmas.

4 4 Journal of Advanced Statistics, Vol. 4, No. 1, March 2019 Lemma 1. Suppose that µ n } n 1 is a sequence of measures on the measurable space (A, Σ) such that µ n (B) µ(b), B Σ. Let f n } n 1 be a sequence of non-negative integrable functions. Then A lim f n dµ lim A f n dµ n. n n Lemma 2. P (Ω ˆQ) = 1, where Ω ˆQ = ω Ω : sup x,y,z ˆQ(x, y, z)(ω) Q(x, y, z)(ω) 0}. Lemma 3. Given ω Ω, for each sub-sequence of Ŝn( )} n 1, there exists a further subsequnce, say Ŝn j ( )} j 1 and a function S ( ) such that Ŝn j (t u) S (t u) for each (t, u). Lemma 1 is Fatou s Lemma with varying measures. It is almost the same as Proposition 17 in [7] (page 231), and so is its proof. Thus we skip the proof of Lemma 1. Lemma 2 is the multvariate version of the Glivenko - Cantelli theorem. Eddy and Hartigan [8] presented a similar version of Lemma 2 with certain addtional regularity conditions such as P ((L, R, U) B) = 0, where B is the boundary of S FL,R,U. However, P (B) > 0 under the assumptions in this paper. Thus their result is not applicable here. Lemma 3 is a variation of Helly s selection theorem. The proofs of Lemmas 2 and 3 are relegated to Section 4 for a better presentation. Proof of Theorem 1. We shall give the proof in 3 steps. Step 1 (preliminary). Let S (ub) (t) = S (t u) = exp( t 0 eu(s a)b1(s>a) h (s)ds), where h is a hazard function and let S = S (0) and F = 1 S. By Eq. (2.5), the normalized generalized log-likelihood is L n (S, b) = 1 n n j=1 log(s (ujb) (L j ) S (ujb) (R j )). By the strong law of large numbers (SLLN), L n (S, b) a.s. L o (S, b)( def = E(L n (S, b))). Let w S (c 1, c 2, U) = F (c 1 U) log F (c 1 U) + S(c 2 U) log S (c 2 U) + (S(c 1 U) S(c 2 U)) log(s (c 1 U) S (c 2 U)). Then L o (S, b) =E(L n (S, b)) = E(log(S (L U) S (R U))) =E(E(log(S (L U) S (R U))) U) =E(E(w S (C 1, C 2, U) U)). Step 2 : L o (S, b) is maximized iff S (t u) = S(t u) for each (t, u) S µ S FU and b = β. It is easy to check that the expression w S (c 1, c 2, u) is maximized by a conditional survival function S ( ), if and only if S (c i u) = S(c i u), i 1, 2}. Since sup p log p : 0 p 1} 1, w S (c 1, c 2, u) is bounded by 3, we see that L o (S, b) is finite. Thus S( ) maximizes L o (S, b). Moreover, by Eq. (2.2), S(t u) = exp( t 0 eβu(x a)1(t a) h o (x)dx) is continuous in (t, u), thus any other S ( ) that maximizes L o (S, b) satisfies S ( u) = S( u) on S µ S FU. Furthermore, notice that one can write S(t u) = exp t 0 e(u u o)β(s c)1(s>c) h 1 (s)ds}, where u o S FU and h 1 (s) = e u oβ(s c)1(s>c) h o (s). Without loss of generality (WLOG), we can assume that 0 S FU. Thus S (t 0) = S(t 0) for t S µ. Now, S (t u) = exp t 0 eub(s c)1(s>c) h o (s)ds}, We shall show that b = β. Since the identifiable conditions in Proposition 1 hold, for t o and t k S FC1 S FC2 satisfying t k t o and F o(t o ) > 0, log(s h (t o ) (t k 0)) ( log(s (t o 0)) log(s o (t k )) ( log(s o (t o )) tk 0 h o(s)ds t o 0 h o(s)ds = h o (t o ).

5 Journal of Advanced Statistics, Vol. 4, No. 1, March By assumption, there exists some u 0, then e ub(t o a)1(t o a) log(s(t k 0)) ( log(s(t o 0)) h (t o ) log(s o (ub) (t k )) ( log(s o (ub) (t o )) tk 0 eu(s a)β1(s>a) h o (s)ds t o 0 eu(s a)β1(s>a) h o (s)ds = e u(t o a)β1(t o a) h o (t o ). Thus, we must have e u(to a)b1(to a) h(t o ) = e u(to a)β1(to a) h o (t o ) and h(t o ) = h o (t o ), as F o(t o ) > 0. Then h o (t o ) > 0, which means b = β. That is, L o (S, b) is maximized by (S o, β). This proves the claim. Step 3 (final conclusion). Let Ω o = ω : lim n L n (S o, β) = L o (S o, β)} Ω ˆQ. It follows from the SLLN and Lemma 2 that P (Ω o ) = 1. Hereafter, fix an ω Ω o. By Lemma 3, given any subsequence n i } i 1 of n} n 1, there exist a futher subsequence, say itself such that Ŝn i (t u) S (t u) for each (t, u). Moreover, since ˆβ ni (, ), there is a subsequence which converges to b [, ]. WLOG, we can assume that Ŝn( ) S ( ) and ˆβ n b. Then lim L n(ŝn, ˆβ n ) log(ŝn(l u) Ŝn(r u))d ˆQ(l, r, u) n n = lim log(ŝn(l u) Ŝn(r u))d ˆQ(l, r, u)) (3.1) n lim log(ŝn(l u) Ŝn(r u))d ˆQ(l, r, u) (by Lemma 1, n as (1) ˆQ(η) Q(η) for every Borel subset η of = (l, r, u) : l < r, u (, )} and (2) log(ŝn(l u) Ŝn(r u)) 0). Consequently, L o (S o, β) lim L n(ŝn, ˆβ n ) (as L n (S o, β) L n (Ŝn, ˆβ n )) n lim log(ŝn(l u) Ŝn(r u))d ˆQ(l, r, u) (by (3.1)) n = log(s (l u) S (r u))dq(l, r, u) (as Ŝn(t u) S (t u)) = L o (S, b) L o (S o, b) (by the claim in Step 2), ω Ω o. Notice that P (Ω o ) = 1. Thus S = S o a.s. µ and b = β by the claim in Step Proofs Proof of Lemma 2. Given ϵ = 1/k, there exist x 1,..., x m, y 1,..., y m, z 1,..., z m (, ] such that maxp (L (x i 1, x i )), P (R (y i 1, y i ))P (U (z i 1, z i ))} < ϵ for i 1,..., m}, where x 0 = y 0 = z 0 = and x m = y m = z m =. Let Ω k be the event that max i,j,h ˆQ(x i, y j, z h ) Q(x i, y j, z h ) + ˆF L (x i ) F L (x i ) + ˆF R (y j ) F R (y j ) + ˆF U (z h ) F U (z h ) } 0. Since m is finite, P (Ω k ) = 1. Let ω Ω k. There exists n o such that ˆQ(x i, y j, z h ) Q(x i, y j, z h ) < ϵ, ˆQ(x i,, ) Q(x i,, ) < ϵ, ˆQ(, y j, ) Q(, y j, ) < ϵ, ˆQ(,, z h ) Q(,, z h ) < ϵ, i, j, h 0, 1, 2,..., m}.

6 6 Journal of Advanced Statistics, Vol. 4, No. 1, March 2019 Now (x, y, z), some (i, j, h) such that (x, y, z) (x i 1, x i ] (y j 1, y j ] (z h 1, z h ]. If (x, y, z) = (x i, y j, z h ), then ˆQ(x, y, z) Q(x, y, z) < ϵ. Otherwise, there are 12 disjoint cases such as: (1) x (x i 1, x i ), y (y j 1, y j ) and z (z h 1, z h ); (2) x (x i 1, x i ), y = y j and z (z h 1, z h ); (3) x (x i 1, x i ), y (y j 1, y j ) and z = z h ; (4) x (x i 1, x i ), y = y j and z = z h ; etc. In each of the 12 cases, we can also show that ˆQ(x, y, z) Q(x, y, z) < ϵ. For instance, in Case (1), ˆQ(x, y, z) Q(x, y, z) ˆQ(x i, y j, z h ) Q(x i 1, y j 1, z h 1 ) + Q(x i, y j, z h ) ˆQ(x i 1, y j 1, z h 1 ) ˆQ(x i, y j, z h ) Q(x i, y j, z h ) + Q(x i, y j, z h ) Q(x i 1, y j 1, z h 1 ) + ˆQ(x i 1, y j 1, z h 1 ) Q(x i 1,zh 1, y j 1, z h 1 ) + Q(x i 1, y j 1, z h 1 ) Q(x i, y j, z h ) ˆQ(x i, y j, z h ) Q(x i, y j, z h ) + }} Q(x i, y j, z h ) Q(x i 1, y j 1, z h 1 ) }} ϵ (F L (x i ) F L (x i 1))+(F R (y j ) F R (y j 1))+(F U (z h ) F U (z h 1 )) + ˆQ(x i 1, y j 1, z h 1 ) Q(x i 1, y j 1, z h 1 ) + Q(x i 1, y j 1, z h 1 ) Q(x i, y j, z j ) 8ϵ, (as ω Ω k ). In Case (4), ˆQ(x, y, z) Q(x, y, z) ˆQ(x i, y j, z h ) Q(x i 1, y j, z h ) + Q(x i, y j, z h ) ˆQ(x i 1, y j, z h ) ˆQ(x i, y j, z h ) Q(x i, y j, z h ) + Q(x i, y j, z h ) Q(x i 1, y j, z h ) + ˆQ(x i 1, y j, z h ) Q(x i 1,zh, y j, z h ) + Q(x i 1, y j, z h ) Q(x i, y j, z h ) ˆQ(x i, y j, z h ) Q(x i, y j, z h ) + Q(x i, y j, z h ) Q(x i 1, y j, z h ) }}}} ϵ (F L (x i ) F L (x i 1 )) 4ϵ. + ˆQ(x i 1, y j, z h ) Q(x i 1, y j, z h ) + Q(x i 1, y j, z h ) Q(x i, y j, z j ) The proofs for the other 10 cases are similar and are skipped. Notice ϵ = 1/k 0, P (Ω k ) = 1 and thus P ( k=1 Ω k) = 1. Proof of Lemma 3. Let u j } j 1 be the collection of all rational numbers. Since Ŝn(t u 1 ) is a sequence of bounded decreasing functions in t, by Helly s selection theorem, given any subsequence of n} n 1, there exists a further subsequence, say n 1i } i 1 such that Ŝn 1i (t u 1 ) converges for each t. Moreover, for j 2, there exists a subsequence, say n ji } i 1 of n (j 1)i } i 1 such that Ŝn ji (t u j ) converges for each t. It is easy to verify that Ŝn ii (t u j ) converges for each (t, u j ), say Ŝn ii (t u j ) S 1 (t u j ). In view of Eq. (2.2), write Ŝn(t u) = ˆβ n) Ŝ(u n (t) = exp( t 0 eu ˆβ n(s a)1(s>a) ĥ n (s)ds). Ŝ n (t u) is a bounded decreasing function of u ˆβ n. Thus S 1 (t u j ) is a bounded decreasing function of bu j. Since the set of rational numbers u j } j 1 is dense in (, ), S 1 (t u j ) can be extended to S 1 (t u) for each u (, ) by letting S 1 (t u) uj u S 1 (t u j ) if u is not a rational number. WLOG, we can assume b 0, then S 1 (t u) is a bounded decreasing function in u and is thus continuous except on a countable subset, say D. We shall show that S nii (t u) S 1 (t u) t and u / D. (A.1) It suffices to consider the case that u / D and u is not a rational number. Let w and v be two rationals such that w < u < v. Since the set of rationals are dense, w and v can be selected as close to u as possible. as i as v u Then S nii (t v) S nii (t u) S nii (t w). Verify that S nii (t v) S 1 (t v) S 1 (t u), as x = u is a continuous point of S(t x). Moreover, S nii (t w) S nii (t u) S 1 (t u). That is, (A.1) holds. as i S 1 (t w) as w u S 1 (t u). By the sandwich theorem,

7 Journal of Advanced Statistics, Vol. 4, No. 1, March Since D is countable, using the arguement similar to showing that Ŝn ii (t u j ) converges for each (t, u j ), we can show that there is a subsequence of n ii } i 1, say m k } k 1 n ii } i 1 such that Ŝm k (t u) converges for each u D. Consequently, Ŝm k (t u) converges for each (t, u). Acknowledgments. The authors thank the editor and a referee for their invaluable comments. References 1. Q.Q. Yu, and Q.G. Diao, The Proportional Hazards Model with linearly time-dependent covariates and interval-censored data", Journal of Advanced Statistics vol. 3, no. 4, 58-70, D.R. Cox, and D. Oakes, Analysis of Survival Data, Chapman & Hall, A. Schick, and Q.Q. Yu, Consistency of the GMLE with mixed case interval-censored data", Scandinavian Journal of Statistics, vol. 27, pp , P. Groeneboom, and J.A. Wellner, Information bounds and nonparametric maximum likelihood estimation, Birkhäuser Verlag, Basel, G.Y.C. Wong, and Q.Q. Yu, Estimation under the Lehmann regression model with interval-censored data", Communications in Statistics Simulation and Computation, vol. 41, , G.Y.C. Wong, Q.G. Diao, and Q.Q. Yu, Piecewise proportional hazards models with interval-censored data", Journal of Statistical Computation and Simulation, vol. 88, pp , H.L. Royden, Real Analysis, Second edition, New York: Macmillan, W. F. Eddy, and J. A. Hartigan, Uniform Convergence of the Empirical Distribution Function Over Convex Sets", The Annals of Statistics vol. 5, no. 2, pp , 1977.

1. Introduction In many biomedical studies, the random survival time of interest is never observed and is only known to lie before an inspection time

1. Introduction In many biomedical studies, the random survival time of interest is never observed and is only known to lie before an inspection time ASYMPTOTIC PROPERTIES OF THE GMLE WITH CASE 2 INTERVAL-CENSORED DATA By Qiqing Yu a;1 Anton Schick a, Linxiong Li b;2 and George Y. C. Wong c;3 a Dept. of Mathematical Sciences, Binghamton University,

More information

(Y; I[X Y ]), where I[A] is the indicator function of the set A. Examples of the current status data are mentioned in Ayer et al. (1955), Keiding (199

(Y; I[X Y ]), where I[A] is the indicator function of the set A. Examples of the current status data are mentioned in Ayer et al. (1955), Keiding (199 CONSISTENCY OF THE GMLE WITH MIXED CASE INTERVAL-CENSORED DATA By Anton Schick and Qiqing Yu Binghamton University April 1997. Revised December 1997, Revised July 1998 Abstract. In this paper we consider

More information

1 Glivenko-Cantelli type theorems

1 Glivenko-Cantelli type theorems STA79 Lecture Spring Semester Glivenko-Cantelli type theorems Given i.i.d. observations X,..., X n with unknown distribution function F (t, consider the empirical (sample CDF ˆF n (t = I [Xi t]. n Then

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

FULL LIKELIHOOD INFERENCES IN THE COX MODEL October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach

More information

1 Introduction. 2 Residuals in PH model

1 Introduction. 2 Residuals in PH model Supplementary Material for Diagnostic Plotting Methods for Proportional Hazards Models With Time-dependent Covariates or Time-varying Regression Coefficients BY QIQING YU, JUNYI DONG Department of Mathematical

More information

Full likelihood inferences in the Cox model: an empirical likelihood approach

Full likelihood inferences in the Cox model: an empirical likelihood approach Ann Inst Stat Math 2011) 63:1005 1018 DOI 10.1007/s10463-010-0272-y Full likelihood inferences in the Cox model: an empirical likelihood approach Jian-Jian Ren Mai Zhou Received: 22 September 2008 / Revised:

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH Jian-Jian Ren 1 and Mai Zhou 2 University of Central Florida and University of Kentucky Abstract: For the regression parameter

More information

Efficiency of Profile/Partial Likelihood in the Cox Model

Efficiency of Profile/Partial Likelihood in the Cox Model Efficiency of Profile/Partial Likelihood in the Cox Model Yuichi Hirose School of Mathematics, Statistics and Operations Research, Victoria University of Wellington, New Zealand Summary. This paper shows

More information

and Comparison with NPMLE

and Comparison with NPMLE NONPARAMETRIC BAYES ESTIMATOR OF SURVIVAL FUNCTIONS FOR DOUBLY/INTERVAL CENSORED DATA and Comparison with NPMLE Mai Zhou Department of Statistics, University of Kentucky, Lexington, KY 40506 USA http://ms.uky.edu/

More information

STAT331. Cox s Proportional Hazards Model

STAT331. Cox s Proportional Hazards Model STAT331 Cox s Proportional Hazards Model In this unit we introduce Cox s proportional hazards (Cox s PH) model, give a heuristic development of the partial likelihood function, and discuss adaptations

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 02: Overview Continued

Introduction to Empirical Processes and Semiparametric Inference Lecture 02: Overview Continued Introduction to Empirical Processes and Semiparametric Inference Lecture 02: Overview Continued Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations Research

More information

UNIVERSITY OF CALIFORNIA, SAN DIEGO

UNIVERSITY OF CALIFORNIA, SAN DIEGO UNIVERSITY OF CALIFORNIA, SAN DIEGO Estimation of the primary hazard ratio in the presence of a secondary covariate with non-proportional hazards An undergraduate honors thesis submitted to the Department

More information

Survival Analysis: Weeks 2-3. Lu Tian and Richard Olshen Stanford University

Survival Analysis: Weeks 2-3. Lu Tian and Richard Olshen Stanford University Survival Analysis: Weeks 2-3 Lu Tian and Richard Olshen Stanford University 2 Kaplan-Meier(KM) Estimator Nonparametric estimation of the survival function S(t) = pr(t > t) The nonparametric estimation

More information

Piecewise Cox Models With Right-Censored Data. a Department of Integrative Medicine, Mount Sinai Beth Israel, New York, NY 10003

Piecewise Cox Models With Right-Censored Data. a Department of Integrative Medicine, Mount Sinai Beth Israel, New York, NY 10003 Piecewise Cox Models With Right-Censored Data By George Y. C. Wong, a Michael P. Osborne b, Qinggang Diao c and Qiqing Yu c a Department of Integrative Medicine, Mount Sinai Beth Israel, New York, NY 10003

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 24 Paper 153 A Note on Empirical Likelihood Inference of Residual Life Regression Ying Qing Chen Yichuan

More information

STAT 331. Accelerated Failure Time Models. Previously, we have focused on multiplicative intensity models, where

STAT 331. Accelerated Failure Time Models. Previously, we have focused on multiplicative intensity models, where STAT 331 Accelerated Failure Time Models Previously, we have focused on multiplicative intensity models, where h t z) = h 0 t) g z). These can also be expressed as H t z) = H 0 t) g z) or S t z) = e Ht

More information

Regularization in Cox Frailty Models

Regularization in Cox Frailty Models Regularization in Cox Frailty Models Andreas Groll 1, Trevor Hastie 2, Gerhard Tutz 3 1 Ludwig-Maximilians-Universität Munich, Department of Mathematics, Theresienstraße 39, 80333 Munich, Germany 2 University

More information

Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models

Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models NIH Talk, September 03 Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models Eric Slud, Math Dept, Univ of Maryland Ongoing joint project with Ilia

More information

Infinitely Imbalanced Logistic Regression

Infinitely Imbalanced Logistic Regression p. 1/1 Infinitely Imbalanced Logistic Regression Art B. Owen Journal of Machine Learning Research, April 2007 Presenter: Ivo D. Shterev p. 2/1 Outline Motivation Introduction Numerical Examples Notation

More information

Survival Analysis. Stat 526. April 13, 2018

Survival Analysis. Stat 526. April 13, 2018 Survival Analysis Stat 526 April 13, 2018 1 Functions of Survival Time Let T be the survival time for a subject Then P [T < 0] = 0 and T is a continuous random variable The Survival function is defined

More information

Approximate Self Consistency for Middle-Censored Data

Approximate Self Consistency for Middle-Censored Data Approximate Self Consistency for Middle-Censored Data by S. Rao Jammalamadaka Department of Statistics & Applied Probability, University of California, Santa Barbara, CA 93106, USA. and Srikanth K. Iyer

More information

asymptotic normality of nonparametric M-estimators with applications to hypothesis testing for panel count data

asymptotic normality of nonparametric M-estimators with applications to hypothesis testing for panel count data asymptotic normality of nonparametric M-estimators with applications to hypothesis testing for panel count data Xingqiu Zhao and Ying Zhang The Hong Kong Polytechnic University and Indiana University Abstract:

More information

Random partition masking model for censored and masked competing risks data

Random partition masking model for censored and masked competing risks data Ann Inst Stat Math (2012) 64:69 85 DOI 10.1007/s10463-010-0303-8 Random partition masking model for censored and masked competing risks data Qiqing Yu G. Y. C. Wong Hao Qin Jiaping Wang Received: 13 May

More information

Preservation Theorems for Glivenko-Cantelli and Uniform Glivenko-Cantelli Classes

Preservation Theorems for Glivenko-Cantelli and Uniform Glivenko-Cantelli Classes Preservation Theorems for Glivenko-Cantelli and Uniform Glivenko-Cantelli Classes This is page 5 Printer: Opaque this Aad van der Vaart and Jon A. Wellner ABSTRACT We show that the P Glivenko property

More information

Survival Analysis Math 434 Fall 2011

Survival Analysis Math 434 Fall 2011 Survival Analysis Math 434 Fall 2011 Part IV: Chap. 8,9.2,9.3,11: Semiparametric Proportional Hazards Regression Jimin Ding Math Dept. www.math.wustl.edu/ jmding/math434/fall09/index.html Basic Model Setup

More information

Inconsistency of the MLE for the joint distribution. of interval censored survival times. and continuous marks

Inconsistency of the MLE for the joint distribution. of interval censored survival times. and continuous marks Inconsistency of the MLE for the joint distribution of interval censored survival times and continuous marks By M.H. Maathuis and J.A. Wellner Department of Statistics, University of Washington, Seattle,

More information

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA Kasun Rathnayake ; A/Prof Jun Ma Department of Statistics Faculty of Science and Engineering Macquarie University

More information

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model Other Survival Models (1) Non-PH models We briefly discussed the non-proportional hazards (non-ph) model λ(t Z) = λ 0 (t) exp{β(t) Z}, where β(t) can be estimated by: piecewise constants (recall how);

More information

Large Sample Theory. Consider a sequence of random variables Z 1, Z 2,..., Z n. Convergence in probability: Z n

Large Sample Theory. Consider a sequence of random variables Z 1, Z 2,..., Z n. Convergence in probability: Z n Large Sample Theory In statistics, we are interested in the properties of particular random variables (or estimators ), which are functions of our data. In ymptotic analysis, we focus on describing the

More information

Supplementary Online Material for Ideologues or Pragmatists?

Supplementary Online Material for Ideologues or Pragmatists? Supplementary Online Material for Ideologues or Pragmatists? Ethan Bueno de Mesquita Amanda Friedenberg September 24, 2009 Harris School, University of Chicago, 1155 E. 60 th St., Chicago, IL 60637, bdm@uchicago.edu.

More information

Supplementary Materials for Residuals and Diagnostics for Ordinal Regression Models: A Surrogate Approach

Supplementary Materials for Residuals and Diagnostics for Ordinal Regression Models: A Surrogate Approach Supplementary Materials for Residuals and Diagnostics for Ordinal Regression Models: A Surrogate Approach Part A: Figures and tables Figure 2: An illustration of the sampling procedure to generate a surrogate

More information

The main results about probability measures are the following two facts:

The main results about probability measures are the following two facts: Chapter 2 Probability measures The main results about probability measures are the following two facts: Theorem 2.1 (extension). If P is a (continuous) probability measure on a field F 0 then it has a

More information

REAL AND COMPLEX ANALYSIS

REAL AND COMPLEX ANALYSIS REAL AND COMPLE ANALYSIS Third Edition Walter Rudin Professor of Mathematics University of Wisconsin, Madison Version 1.1 No rights reserved. Any part of this work can be reproduced or transmitted in any

More information

Smooth nonparametric estimation of a quantile function under right censoring using beta kernels

Smooth nonparametric estimation of a quantile function under right censoring using beta kernels Smooth nonparametric estimation of a quantile function under right censoring using beta kernels Chanseok Park 1 Department of Mathematical Sciences, Clemson University, Clemson, SC 29634 Short Title: Smooth

More information

Lecture 22 Survival Analysis: An Introduction

Lecture 22 Survival Analysis: An Introduction University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 22 Survival Analysis: An Introduction There is considerable interest among economists in models of durations, which

More information

Survival Analysis. Lu Tian and Richard Olshen Stanford University

Survival Analysis. Lu Tian and Richard Olshen Stanford University 1 Survival Analysis Lu Tian and Richard Olshen Stanford University 2 Survival Time/ Failure Time/Event Time We will introduce various statistical methods for analyzing survival outcomes What is the survival

More information

Cox s proportional hazards model and Cox s partial likelihood

Cox s proportional hazards model and Cox s partial likelihood Cox s proportional hazards model and Cox s partial likelihood Rasmus Waagepetersen October 12, 2018 1 / 27 Non-parametric vs. parametric Suppose we want to estimate unknown function, e.g. survival function.

More information

Nonparametric estimation of log-concave densities

Nonparametric estimation of log-concave densities Nonparametric estimation of log-concave densities Jon A. Wellner University of Washington, Seattle Seminaire, Institut de Mathématiques de Toulouse 5 March 2012 Seminaire, Toulouse Based on joint work

More information

arxiv:math/ v2 [math.st] 17 Jun 2008

arxiv:math/ v2 [math.st] 17 Jun 2008 The Annals of Statistics 2008, Vol. 36, No. 3, 1031 1063 DOI: 10.1214/009053607000000974 c Institute of Mathematical Statistics, 2008 arxiv:math/0609020v2 [math.st] 17 Jun 2008 CURRENT STATUS DATA WITH

More information

Cox s proportional hazards/regression model - model assessment

Cox s proportional hazards/regression model - model assessment Cox s proportional hazards/regression model - model assessment Rasmus Waagepetersen September 27, 2017 Topics: Plots based on estimated cumulative hazards Cox-Snell residuals: overall check of fit Martingale

More information

Spring 2014 Advanced Probability Overview. Lecture Notes Set 1: Course Overview, σ-fields, and Measures

Spring 2014 Advanced Probability Overview. Lecture Notes Set 1: Course Overview, σ-fields, and Measures 36-752 Spring 2014 Advanced Probability Overview Lecture Notes Set 1: Course Overview, σ-fields, and Measures Instructor: Jing Lei Associated reading: Sec 1.1-1.4 of Ash and Doléans-Dade; Sec 1.1 and A.1

More information

STAT 479: Short Term Actuarial Models

STAT 479: Short Term Actuarial Models STAT 479: Short Term Actuarial Models Jianxi Su, FSA, ACIA Purdue University, Department of Statistics Week 1. Some important things about this course you may want to know... The course covers a large

More information

Empirical Likelihood in Survival Analysis

Empirical Likelihood in Survival Analysis Empirical Likelihood in Survival Analysis Gang Li 1, Runze Li 2, and Mai Zhou 3 1 Department of Biostatistics, University of California, Los Angeles, CA 90095 vli@ucla.edu 2 Department of Statistics, The

More information

Inference For High Dimensional M-estimates. Fixed Design Results

Inference For High Dimensional M-estimates. Fixed Design Results : Fixed Design Results Lihua Lei Advisors: Peter J. Bickel, Michael I. Jordan joint work with Peter J. Bickel and Noureddine El Karoui Dec. 8, 2016 1/57 Table of Contents 1 Background 2 Main Results and

More information

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky Empirical likelihood with right censored data were studied by Thomas and Grunkmier (1975), Li (1995),

More information

Serena Doria. Department of Sciences, University G.d Annunzio, Via dei Vestini, 31, Chieti, Italy. Received 7 July 2008; Revised 25 December 2008

Serena Doria. Department of Sciences, University G.d Annunzio, Via dei Vestini, 31, Chieti, Italy. Received 7 July 2008; Revised 25 December 2008 Journal of Uncertain Systems Vol.4, No.1, pp.73-80, 2010 Online at: www.jus.org.uk Different Types of Convergence for Random Variables with Respect to Separately Coherent Upper Conditional Probabilities

More information

Likelihood Construction, Inference for Parametric Survival Distributions

Likelihood Construction, Inference for Parametric Survival Distributions Week 1 Likelihood Construction, Inference for Parametric Survival Distributions In this section we obtain the likelihood function for noninformatively rightcensored survival data and indicate how to make

More information

Problem set 1, Real Analysis I, Spring, 2015.

Problem set 1, Real Analysis I, Spring, 2015. Problem set 1, Real Analysis I, Spring, 015. (1) Let f n : D R be a sequence of functions with domain D R n. Recall that f n f uniformly if and only if for all ɛ > 0, there is an N = N(ɛ) so that if n

More information

Ph.D. Qualifying Exam Friday Saturday, January 3 4, 2014

Ph.D. Qualifying Exam Friday Saturday, January 3 4, 2014 Ph.D. Qualifying Exam Friday Saturday, January 3 4, 2014 Put your solution to each problem on a separate sheet of paper. Problem 1. (5166) Assume that two random samples {x i } and {y i } are independently

More information

Efficiency Comparison Between Mean and Log-rank Tests for. Recurrent Event Time Data

Efficiency Comparison Between Mean and Log-rank Tests for. Recurrent Event Time Data Efficiency Comparison Between Mean and Log-rank Tests for Recurrent Event Time Data Wenbin Lu Department of Statistics, North Carolina State University, Raleigh, NC 27695 Email: lu@stat.ncsu.edu Summary.

More information

STAT Sample Problem: General Asymptotic Results

STAT Sample Problem: General Asymptotic Results STAT331 1-Sample Problem: General Asymptotic Results In this unit we will consider the 1-sample problem and prove the consistency and asymptotic normality of the Nelson-Aalen estimator of the cumulative

More information

Theorem 2.1 (Caratheodory). A (countably additive) probability measure on a field has an extension. n=1

Theorem 2.1 (Caratheodory). A (countably additive) probability measure on a field has an extension. n=1 Chapter 2 Probability measures 1. Existence Theorem 2.1 (Caratheodory). A (countably additive) probability measure on a field has an extension to the generated σ-field Proof of Theorem 2.1. Let F 0 be

More information

CURE MODEL WITH CURRENT STATUS DATA

CURE MODEL WITH CURRENT STATUS DATA Statistica Sinica 19 (2009), 233-249 CURE MODEL WITH CURRENT STATUS DATA Shuangge Ma Yale University Abstract: Current status data arise when only random censoring time and event status at censoring are

More information

Estimation and Inference of Quantile Regression. for Survival Data under Biased Sampling

Estimation and Inference of Quantile Regression. for Survival Data under Biased Sampling Estimation and Inference of Quantile Regression for Survival Data under Biased Sampling Supplementary Materials: Proofs of the Main Results S1 Verification of the weight function v i (t) for the lengthbiased

More information

LECTURE 15: COMPLETENESS AND CONVEXITY

LECTURE 15: COMPLETENESS AND CONVEXITY LECTURE 15: COMPLETENESS AND CONVEXITY 1. The Hopf-Rinow Theorem Recall that a Riemannian manifold (M, g) is called geodesically complete if the maximal defining interval of any geodesic is R. On the other

More information

Nonconcave Penalized Likelihood with A Diverging Number of Parameters

Nonconcave Penalized Likelihood with A Diverging Number of Parameters Nonconcave Penalized Likelihood with A Diverging Number of Parameters Jianqing Fan and Heng Peng Presenter: Jiale Xu March 12, 2010 Jianqing Fan and Heng Peng Presenter: JialeNonconcave Xu () Penalized

More information

A Law of the Iterated Logarithm. for Grenander s Estimator

A Law of the Iterated Logarithm. for Grenander s Estimator A Law of the Iterated Logarithm for Grenander s Estimator Jon A. Wellner University of Washington, Seattle Probability Seminar October 24, 2016 Based on joint work with: Lutz Dümbgen and Malcolm Wolff

More information

Lecture 5 Models and methods for recurrent event data

Lecture 5 Models and methods for recurrent event data Lecture 5 Models and methods for recurrent event data Recurrent and multiple events are commonly encountered in longitudinal studies. In this chapter we consider ordered recurrent and multiple events.

More information

3 Integration and Expectation

3 Integration and Expectation 3 Integration and Expectation 3.1 Construction of the Lebesgue Integral Let (, F, µ) be a measure space (not necessarily a probability space). Our objective will be to define the Lebesgue integral R fdµ

More information

log T = β T Z + ɛ Zi Z(u; β) } dn i (ue βzi ) = 0,

log T = β T Z + ɛ Zi Z(u; β) } dn i (ue βzi ) = 0, Accelerated failure time model: log T = β T Z + ɛ β estimation: solve where S n ( β) = n i=1 { Zi Z(u; β) } dn i (ue βzi ) = 0, Z(u; β) = j Z j Y j (ue βz j) j Y j (ue βz j) How do we show the asymptotics

More information

MATH5011 Real Analysis I. Exercise 1 Suggested Solution

MATH5011 Real Analysis I. Exercise 1 Suggested Solution MATH5011 Real Analysis I Exercise 1 Suggested Solution Notations in the notes are used. (1) Show that every open set in R can be written as a countable union of mutually disjoint open intervals. Hint:

More information

Zdzis law Brzeźniak and Tomasz Zastawniak

Zdzis law Brzeźniak and Tomasz Zastawniak Basic Stochastic Processes by Zdzis law Brzeźniak and Tomasz Zastawniak Springer-Verlag, London 1999 Corrections in the 2nd printing Version: 21 May 2005 Page and line numbers refer to the 2nd printing

More information

Lecture 2: Martingale theory for univariate survival analysis

Lecture 2: Martingale theory for univariate survival analysis Lecture 2: Martingale theory for univariate survival analysis In this lecture T is assumed to be a continuous failure time. A core question in this lecture is how to develop asymptotic properties when

More information

Preservation Theorems for Glivenko-Cantelli and Uniform Glivenko-Cantelli Classes

Preservation Theorems for Glivenko-Cantelli and Uniform Glivenko-Cantelli Classes Preservation Theorems for Glivenko-Cantelli and Uniform Glivenko-Cantelli Classes Aad van der Vaart and Jon A. Wellner Free University and University of Washington ABSTRACT We show that the P Glivenko

More information

Appendix B for The Evolution of Strategic Sophistication (Intended for Online Publication)

Appendix B for The Evolution of Strategic Sophistication (Intended for Online Publication) Appendix B for The Evolution of Strategic Sophistication (Intended for Online Publication) Nikolaus Robalino and Arthur Robson Appendix B: Proof of Theorem 2 This appendix contains the proof of Theorem

More information

Semiparametric Regression

Semiparametric Regression Semiparametric Regression Patrick Breheny October 22 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/23 Introduction Over the past few weeks, we ve introduced a variety of regression models under

More information

PRESERVATION THEOREMS FOR GLIVENKO-CANTELLI AND UNIFORM GLIVENKO-CANTELLI CLASSES AAD VAN DER VAART AND JON A. WELLNER ABSTRACT We show that the P Gli

PRESERVATION THEOREMS FOR GLIVENKO-CANTELLI AND UNIFORM GLIVENKO-CANTELLI CLASSES AAD VAN DER VAART AND JON A. WELLNER ABSTRACT We show that the P Gli PRESERVATION THEOREMS FOR GLIVENKO-CANTELLI AND UNIFORM GLIVENKO-CANTELLI CLASSES AAD VAN DER VAART AND JON A. WELLNER ABSTRACT We show that the P Glivenko property of classes of functions F ;::: ;F k

More information

arxiv:submit/ [math.st] 6 May 2011

arxiv:submit/ [math.st] 6 May 2011 A Continuous Mapping Theorem for the Smallest Argmax Functional arxiv:submit/0243372 [math.st] 6 May 2011 Emilio Seijo and Bodhisattva Sen Columbia University Abstract This paper introduces a version of

More information

Analysing geoadditive regression data: a mixed model approach

Analysing geoadditive regression data: a mixed model approach Analysing geoadditive regression data: a mixed model approach Institut für Statistik, Ludwig-Maximilians-Universität München Joint work with Ludwig Fahrmeir & Stefan Lang 25.11.2005 Spatio-temporal regression

More information

Estimation of the Bivariate and Marginal Distributions with Censored Data

Estimation of the Bivariate and Marginal Distributions with Censored Data Estimation of the Bivariate and Marginal Distributions with Censored Data Michael Akritas and Ingrid Van Keilegom Penn State University and Eindhoven University of Technology May 22, 2 Abstract Two new

More information

AFT Models and Empirical Likelihood

AFT Models and Empirical Likelihood AFT Models and Empirical Likelihood Mai Zhou Department of Statistics, University of Kentucky Collaborators: Gang Li (UCLA); A. Bathke; M. Kim (Kentucky) Accelerated Failure Time (AFT) models: Y = log(t

More information

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data Malaysian Journal of Mathematical Sciences 11(3): 33 315 (217) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal homepage: http://einspem.upm.edu.my/journal Approximation of Survival Function by Taylor

More information

Quantile Regression for Residual Life and Empirical Likelihood

Quantile Regression for Residual Life and Empirical Likelihood Quantile Regression for Residual Life and Empirical Likelihood Mai Zhou email: mai@ms.uky.edu Department of Statistics, University of Kentucky, Lexington, KY 40506-0027, USA Jong-Hyeon Jeong email: jeong@nsabp.pitt.edu

More information

Qualifying Exam in Probability and Statistics. https://www.soa.org/files/edu/edu-exam-p-sample-quest.pdf

Qualifying Exam in Probability and Statistics. https://www.soa.org/files/edu/edu-exam-p-sample-quest.pdf Part 1: Sample Problems for the Elementary Section of Qualifying Exam in Probability and Statistics https://www.soa.org/files/edu/edu-exam-p-sample-quest.pdf Part 2: Sample Problems for the Advanced Section

More information

Statistics 581 Revision of Section 4.4: Consistency of Maximum Likelihood Estimates Wellner; 11/30/2001

Statistics 581 Revision of Section 4.4: Consistency of Maximum Likelihood Estimates Wellner; 11/30/2001 Statistics 581 Revision of Section 4.4: Consistency of Maximum Likelihood Estimates Wellner; 11/30/2001 Some Uniform Strong Laws of Large Numbers Suppose that: A. X, X 1,...,X n are i.i.d. P on the measurable

More information

STATISTICAL BEHAVIOR AND CONSISTENCY OF CLASSIFICATION METHODS BASED ON CONVEX RISK MINIMIZATION

STATISTICAL BEHAVIOR AND CONSISTENCY OF CLASSIFICATION METHODS BASED ON CONVEX RISK MINIMIZATION STATISTICAL BEHAVIOR AND CONSISTENCY OF CLASSIFICATION METHODS BASED ON CONVEX RISK MINIMIZATION Tong Zhang The Annals of Statistics, 2004 Outline Motivation Approximation error under convex risk minimization

More information

Measures. 1 Introduction. These preliminary lecture notes are partly based on textbooks by Athreya and Lahiri, Capinski and Kopp, and Folland.

Measures. 1 Introduction. These preliminary lecture notes are partly based on textbooks by Athreya and Lahiri, Capinski and Kopp, and Folland. Measures These preliminary lecture notes are partly based on textbooks by Athreya and Lahiri, Capinski and Kopp, and Folland. 1 Introduction Our motivation for studying measure theory is to lay a foundation

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models

Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations

More information

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 3 9/10/2008 CONDITIONING AND INDEPENDENCE

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 3 9/10/2008 CONDITIONING AND INDEPENDENCE MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 3 9/10/2008 CONDITIONING AND INDEPENDENCE Most of the material in this lecture is covered in [Bertsekas & Tsitsiklis] Sections 1.3-1.5

More information

ABSTRACT INTEGRATION CHAPTER ONE

ABSTRACT INTEGRATION CHAPTER ONE CHAPTER ONE ABSTRACT INTEGRATION Version 1.1 No rights reserved. Any part of this work can be reproduced or transmitted in any form or by any means. Suggestions and errors are invited and can be mailed

More information

TMA 4275 Lifetime Analysis June 2004 Solution

TMA 4275 Lifetime Analysis June 2004 Solution TMA 4275 Lifetime Analysis June 2004 Solution Problem 1 a) Observation of the outcome is censored, if the time of the outcome is not known exactly and only the last time when it was observed being intact,

More information

STAT 7032 Probability Spring Wlodek Bryc

STAT 7032 Probability Spring Wlodek Bryc STAT 7032 Probability Spring 2018 Wlodek Bryc Created: Friday, Jan 2, 2014 Revised for Spring 2018 Printed: January 9, 2018 File: Grad-Prob-2018.TEX Department of Mathematical Sciences, University of Cincinnati,

More information

Asymptotic normality of the L k -error of the Grenander estimator

Asymptotic normality of the L k -error of the Grenander estimator Asymptotic normality of the L k -error of the Grenander estimator Vladimir N. Kulikov Hendrik P. Lopuhaä 1.7.24 Abstract: We investigate the limit behavior of the L k -distance between a decreasing density

More information

Econometrica Supplementary Material

Econometrica Supplementary Material Econometrica Supplementary Material SUPPLEMENT TO USING INSTRUMENTAL VARIABLES FOR INFERENCE ABOUT POLICY RELEVANT TREATMENT PARAMETERS Econometrica, Vol. 86, No. 5, September 2018, 1589 1619 MAGNE MOGSTAD

More information

WELL-POSEDNESS FOR HYPERBOLIC PROBLEMS (0.2)

WELL-POSEDNESS FOR HYPERBOLIC PROBLEMS (0.2) WELL-POSEDNESS FOR HYPERBOLIC PROBLEMS We will use the familiar Hilbert spaces H = L 2 (Ω) and V = H 1 (Ω). We consider the Cauchy problem (.1) c u = ( 2 t c )u = f L 2 ((, T ) Ω) on [, T ] Ω u() = u H

More information

(U) =, if 0 U, 1 U, (U) = X, if 0 U, and 1 U. (U) = E, if 0 U, but 1 U. (U) = X \ E if 0 U, but 1 U. n=1 A n, then A M.

(U) =, if 0 U, 1 U, (U) = X, if 0 U, and 1 U. (U) = E, if 0 U, but 1 U. (U) = X \ E if 0 U, but 1 U. n=1 A n, then A M. 1. Abstract Integration The main reference for this section is Rudin s Real and Complex Analysis. The purpose of developing an abstract theory of integration is to emphasize the difference between the

More information

A note on profile likelihood for exponential tilt mixture models

A note on profile likelihood for exponential tilt mixture models Biometrika (2009), 96, 1,pp. 229 236 C 2009 Biometrika Trust Printed in Great Britain doi: 10.1093/biomet/asn059 Advance Access publication 22 January 2009 A note on profile likelihood for exponential

More information

On Laplace Finite Marchi Fasulo Transform Of Generalized Functions. A.M.Mahajan

On Laplace Finite Marchi Fasulo Transform Of Generalized Functions. A.M.Mahajan On Laplace Finite Marchi Fasulo Transform Of Generalized Functions A.M.Mahajan Department of Mathematics, Walchand College of Arts and Science, Solapur-43 006 email:-ammahajan9@gmail.com Abstract : In

More information

The International Journal of Biostatistics

The International Journal of Biostatistics The International Journal of Biostatistics Volume 1, Issue 1 2005 Article 3 Score Statistics for Current Status Data: Comparisons with Likelihood Ratio and Wald Statistics Moulinath Banerjee Jon A. Wellner

More information

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis STAT 6350 Analysis of Lifetime Data Failure-time Regression Analysis Explanatory Variables for Failure Times Usually explanatory variables explain/predict why some units fail quickly and some units survive

More information

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Takeshi Emura and Hisayuki Tsukuma Abstract For testing the regression parameter in multivariate

More information

h(x) lim H(x) = lim Since h is nondecreasing then h(x) 0 for all x, and if h is discontinuous at a point x then H(x) > 0. Denote

h(x) lim H(x) = lim Since h is nondecreasing then h(x) 0 for all x, and if h is discontinuous at a point x then H(x) > 0. Denote Real Variables, Fall 4 Problem set 4 Solution suggestions Exercise. Let f be of bounded variation on [a, b]. Show that for each c (a, b), lim x c f(x) and lim x c f(x) exist. Prove that a monotone function

More information

1 Local Asymptotic Normality of Ranks and Covariates in Transformation Models

1 Local Asymptotic Normality of Ranks and Covariates in Transformation Models Draft: February 17, 1998 1 Local Asymptotic Normality of Ranks and Covariates in Transformation Models P.J. Bickel 1 and Y. Ritov 2 1.1 Introduction Le Cam and Yang (1988) addressed broadly the following

More information

1. Propositional Calculus

1. Propositional Calculus 1. Propositional Calculus Some notes for Math 601, Fall 2010 based on Elliott Mendelson, Introduction to Mathematical Logic, Fifth edition, 2010, Chapman & Hall. 2. Syntax ( grammar ). 1.1, p. 1. Given:

More information

Survival Distributions, Hazard Functions, Cumulative Hazards

Survival Distributions, Hazard Functions, Cumulative Hazards BIO 244: Unit 1 Survival Distributions, Hazard Functions, Cumulative Hazards 1.1 Definitions: The goals of this unit are to introduce notation, discuss ways of probabilistically describing the distribution

More information

11 Survival Analysis and Empirical Likelihood

11 Survival Analysis and Empirical Likelihood 11 Survival Analysis and Empirical Likelihood The first paper of empirical likelihood is actually about confidence intervals with the Kaplan-Meier estimator (Thomas and Grunkmeier 1979), i.e. deals with

More information

CURRENT STATUS LINEAR REGRESSION. By Piet Groeneboom and Kim Hendrickx Delft University of Technology and Hasselt University

CURRENT STATUS LINEAR REGRESSION. By Piet Groeneboom and Kim Hendrickx Delft University of Technology and Hasselt University CURRENT STATUS LINEAR REGRESSION By Piet Groeneboom and Kim Hendrickx Delft University of Technology and Hasselt University We construct n-consistent and asymptotically normal estimates for the finite

More information

II - REAL ANALYSIS. This property gives us a way to extend the notion of content to finite unions of rectangles: we define

II - REAL ANALYSIS. This property gives us a way to extend the notion of content to finite unions of rectangles: we define 1 Measures 1.1 Jordan content in R N II - REAL ANALYSIS Let I be an interval in R. Then its 1-content is defined as c 1 (I) := b a if I is bounded with endpoints a, b. If I is unbounded, we define c 1

More information

Qualifying Exam in Probability and Statistics. https://www.soa.org/files/edu/edu-exam-p-sample-quest.pdf

Qualifying Exam in Probability and Statistics. https://www.soa.org/files/edu/edu-exam-p-sample-quest.pdf Part : Sample Problems for the Elementary Section of Qualifying Exam in Probability and Statistics https://www.soa.org/files/edu/edu-exam-p-sample-quest.pdf Part 2: Sample Problems for the Advanced Section

More information

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Anastasios (Butch) Tsiatis Department of Statistics North Carolina State University http://www.stat.ncsu.edu/

More information