Score tests for dependent censoring with survival data

Size: px
Start display at page:

Download "Score tests for dependent censoring with survival data"

Transcription

1 Score tests for dependent censoring with survival data Mériem Saïd, Nadia Ghazzali & Louis-Paul Rivest Département de mathématiques et de statistique, Université Laval, Québec G1K 7P4 Canada Abstract In a standard survival data analysis, the observed time is the minimum of the survival time T and a censoring time independent of T. In this paper, we consider models involving two censoring times U and V. The distribution of U is possibly related with that of T, while the second censoring time V is independent of both T and U. These models involve an Archimedean copula to incorporate a possible dependency between T and U. Score tests for dependent censoring are derived when a parametric model, e.g. Weibull or exponential, is assumed for T. One is fully parametric; it assumes that the marginal distributions of T, U, and V are either exponential or Weibull. The other is semiparametric; no assumptions are made on neither U nor V. The relative efficiencies of these two tests are compared with that of tests involving uncensored data. A numerical example is presented. 1 Introduction A standard assumption underlying most statistical models for the analysis of lifetime data is the independence between survival and censoring times. This assumption is acceptable for administrative censoring associated with the termination of a study. However it is questionable in follow-up studies with self-selected removal where patients leave for causes possibly associated with the study variable. For such withdrawals, Frangakis & Rubin (21) use 1

2 the term dropout censoring. It is convenient to distinguish these two kinds of censoring. Administrative censoring is referred to as being independent while dropout censoring is called dependent. The goal of this paper is to develop score tests to ascertain whether dropout censoring is truly dependent. In a typical study the data set consists of the minimum (X) of survival, dependent censoring and independent censoring times T, U and V, respectively, is observed, with an indicator function δ taking value 1 if X = T, if X = U and -1 if X = V. Tsiatis (1975) showed that the independence between T, U and V is needed for their marginal distributions to be identifiable. This suggests that it might be difficult, not to say impossible, to test the independence between T and U without assuming a parametric model for at least one of their two marginal distributions. Nonparametric tests of independence might be feasible using additional follow-up of patients that either dropped out (see Lee & Wolfe, 1995, 1998, Lee, 1996) or died (see Lin, Robins & Wei, 1996) before the study termination. When only {(X i, δ i )} is observed, Zheng & Klein (1995) acknowledge that the level of dependency between T and U is not estimable from the data. It has to be determined a priori before implementing their survival distribution estimator that accounts for this dependency. Parametric models featuring distributions for T and U and a copula for their association have been considered by Emoto & Matthews (199) and Bhattacharyya (1997). We consider models similar to theirs. The joint survival function S T,U,V (t, u, v) of (T, U, V ) is written in terms of S T (t), S U (u) and S V (v), the marginal survival functions, and a copula C{x, y} for the dependency between T and U (Nelsen, 1999), as S T,U,V (t, u, v) = C {S T (t), S U (u)} S V (v), t, u, v >. (1.1) Our derivations use the Archimedean copulas introduced, in a statistical context, by Genest & McKay (1986) which are defined by C a {y 1, y 2 } = φ 1 a {φ a (y 1 ) + φ a (y 2 )}, (y 1, y 2 ) [, 1] 2, (1.2) where φ a ( ) is a strictly decreasing and convex function defined on (, 1] satisfying φ a (1) =. Parameter a measures the dependency with a = when T and U are independent. Thus at a =, (1.2) gives C {y 1, y 2 } = y 1 y 2 corresponding to φ (t) = ln(t). We construct two kinds of score tests for independence (H : a = ). The first one is fully parametric; each marginal distribution is assumed to 2

3 belong to a parametric family. The second test is semiparametric, a parametric model being assumed only for the distribution of survival time T. The asymptotic normality of these tests is established, and expressions for their respective asymptotic variances are derived. For all the models considered in this paper, the parametric test is shown to have a Pitman efficiency larger than 1 when compared with the semiparametric test. In section 2, we motivate the choice of Archimedean copulas to model the dependence in survival analyses. Likelihoods underlying the score tests, parametric and semiparametric, are then constructed. These two tests are described in sections 3 and 4, respectively. Section 5 is devoted to the study of their relative performances using Pitman efficiencies. A numerical example is presented in section 6. Section 7 provides some conclusions and perspectives. 2 Models and likelihoods This section presents background informations about Archimedean copulas. Two parametric copula families, considered for the construction of score tests, are also introduced. The likelihoods for the parametric and semiparametric score tests are presented. A more detailed presentation of Archimedean copulas is available in Genest & McKay (1986) and Nelsen (1999). 2.1 Dependence framework and Archimedean copula An attractive characteristic of Archimedean copulas is their close relationship to frailty models, as shown in Oakes (1989). Assuming that there exists a latent unobservable variable, say Y, such that the joint survival function of (T, U), conditioned by Y = y, is {ST (t) S U (u)}y, where ST (t) and S U (u) denote baseline survival functions, then the unconditional survival function of (T, U) is C {S T (t), S U (u)}, where C(x, y) is an Archimedean copula. To study independence between survival and censoring, one needs to distinguish between net and crude hazard rates, say λ T (t) and λ # T (t), defined as 1 λ T (t) = lim h h λ # T (t) = lim h 1 h {t T < t + h T t}, {t T < t + h T t, U t}. 3

4 Rivest & Wells (21) showed that when the joint survival distribution of T and U is given by φ 1 a {φ a(s T (t)) + φ a (S U (u))}, the net and crude hazard rates are proportional as stated formally in the next proposition. Proposition 2.1 Let π a (t) = φ 1 a {φ a(s T (t)) + φ a (S U (t))} be the survival function of min(t, U) and ψ a (t) = tφ a(t); then λ # at (t) = λ T (t) ψ a [S T (t)] ψ a [π a (t)]. There exists a similar relation for net and crude hazard rates associated with U, say λ U (u) and λ # au (u). The next section reviews some basic properties for the two Archimedean copula families used in this paper. 2.2 Two families of Archimedean copulas Example 2.1 The Clayton family is generated by φ a (t) = 1 a { t a 1 }, a [ 1, ) \ {}. Independence corresponds to a. For this family, { C a {S T (t), S U (u)} = max, { [S T (t)] a + [S U (u)] a 1 } } 1/a (2.1) and When a is close to λ # at (t) = λ T (t) λ # at (t) λ T (t)s U (t) a. { } a ST (t). (2.2) π a (t) A dependency à la Clayton adds a decreasing component to the risk function when a is positive. The following results, derived from (2.1), are used in the derivations presented later, ln π a (x) a 2 ln π a (x) a 2 2 ln π a (x) a θ a ln S T (x) ln S U (x), a ln S T (x) ln S U (x) {ln S T (x) + ln S U (x)}, a [S T (x)] 1 Ṡ T (x) ln S U (x) + [S U (x)] 1 Ṡ U (x) ln S T (x), 4

5 where θ denotes the vector of marginal parameters, and a point denotes a derivative with respect to θ. Example 2.2 The Ali-Mikhail-Haq family is generated by φ a (t) = 1 { } 1 + a(t 1) 1 a ln, a [ 1, 1), t and can be used to model a limited dependence; independence corresponds to a =. For this family, C a {S T (t), S U (u)} = S T (t)s U (u) 1 a [1 S T (t)] [1 S U (u)] (2.3) and λ # at (t) = λ T (t) { } 1 + a [πa (t) 1]. (2.4) 1 + a [S T (t) 1] When a is close to, λ # at (t) λ T (t)[1 as T (t){1 S U (t)}]. A Ali-Mikhail-Haq dependency multiplies the risk function by a bathtub shape function equal to 1 when t is either small or large. The following results, derived from (2.3), are used in the sequel ln π a (x) a = {1 S T (x)} {1 S U (x)}, a= 2 ln π a (x) a 2 = {1 S T (x)} 2 {1 S U (x)} 2, a= 2 ln π a (x) a θ = ṠT (x) {1 S U (x)} ṠU(x) {1 S T (x)}. a= This copula is closely related to Frank s (Genest, 1987): these two copulas have the same behavior in the neighborhood of independence. The score test for independence for Frank s family is the same as that for Ali-Mikhail-Haq; thus all the results derived in this paper for Ali-Mikhail-Haq s copula also apply to Frank s copula. 5

6 2.3 Likelihoods This section constructs two likelihoods for inference on dependence parameter a using the sample {(x i, δ i ); i = 1,..., n}, where x i = min(t i, U i, V i ) and δ i takes the value 1 if T i is observed, if x i = U i and -1 if x i = V i Fully parametric model The complete likelihood can be written in terms of the joint survival function S T,U,V (t, u, v) as L(a, θ) = { S } { T,U,V (t, x i, x i ) t S } T,U,V (x i, u, x i ) δ i =1 t=xi u δ i = u=xi { S } T,U,V (x i, x i, v). v δ i = 1 v=xi For model (1.1), proposition 2.1 yields t S T,U,V (t, x i, x i ) = λ # at (x i) π a (x i ) S V (x i ). t=xi The two equations above yield L(a, θ) = δ i =1 λ # at (x i) δ i = λ # au (x i) δ i = 1 λ V (x i ) Define the crude survival functions, for W = T or W = U, as t } S # aw { (t) = exp λ # aw (s) ds. n π a (x i )S V (x i ). (2.5) Now π a ( ) can be expressed as the product of the crude survival functions, π a (x) = S # at (x) S# au (x). Using this relation in (2.5) yields an alternative form for the likelihood, { } { } n n L(a, θ) = λ # at (x i) S # at (x i) λ # au (x i) S # au (x i) δ i =1 { δ i = 1 λ V (x i ) δ i = } n S V (x i ). (2.6) 6

7 Expressions (2.5) and (2.6) for the likelihood do not rely on the assumption that the copula is Archimedean. The likelihood for the parameters of (1.1) has this form even when the dependency between T and U is modeled using a non-archimedean copula Semiparametric model When nothing is known about the marginal distributions of U and V, inference about the dependence parameter a can be carried out with a pseudolikelihood, comprising only the first term of (2.6). This first term depends on π a ( ). The pseudo-likelihood is constructed by replacing π a ( ) by a nonparametric estimator, say ˆπ( ). This pseudo-likelihood is then given by L (a, θ T ) = δ i =1 λ at (x i ) n SaT (x i ), (2.7) where θ T is the vector of parameters for the marginal distribution of T, λ at (t) = λ T (t) ψ a [S T (t)] ψ a [ˆπ(t)], (2.8) and { t } SaT (t) = exp λ at (s) ds. Observe that (2.7) is a pseudo-likelihood that can be used to obtain consistent estimators of a and of the marginal parameters for T. This is however not pursued here; we use this pseudo-likelihood in section 4 to derive tests of independence. In the derivation of Section 4, ˆπ( ) needs to be a left continuous function. It can be set equal to a left continuous Kaplan-Meier estimator for the survival function of min(t, U) calculated using the sample {(x i, δi ) : i = 1,..., n}, where δi = 1 if either δ i = 1 or δ i =. When only dependent censoring is observed, this estimator reduces to the empirical estimator ˆπ(x) = (1/n) n 11 [x i x]. 3 Parametric tests In this section, we construct score tests for independence, i.e. for H : a =, using expression (2.5) for the likelihood. The marginals for T, U and 7

8 V are assumed to be known up to a parameter θ. Two marginal models, the exponential and the Weibull, are used in the derivation. The parametrization for each one is given in Table 1. For the Weibull model, the parameter β is assumed to be estimated independently for T, U, and V, however the calculations are carried out assuming, as stated in Table 1, that β is the same for these three distributions. Our primary goal, which is to compare the Pitman efficiencies of tests for H : a = for several marginal models for T, explains this somewhat artificial treatment for the Weibull distribution. Table 1: Marginal models Model λ T (t) λ U (t) λ V (t) Exponential γ T γ U γ V Weibull γ T β t β 1 γ U β t β 1 γ V β t β 1 General results about the construction of score tests are first reviewed. The results of the derivations for Clayton and Ali-Mikhail-Haq copulas and for the two marginal models of Table 1 are then presented. 3.1 General results In a parametric framework, with a finite-dimensional parameter, a score test for H is constructed using the statistic U P (, θ) = a a=,θ= θ l(a, θ), (3.1) where l(a, θ) = ln L(a, θ) is the log-likelihood for (2.5) and θ denotes the estimator of marginal parameter θ under H ; index P stands for parametric test. From (2.6), neglecting terms that do not depend on a, the log-likelihood l(a, θ) is equal to ln λ # at (x i) + ln λ # au (x i) + δ i =1 δ i = ln π a (x i ), where λ # at (x i), λ # au (x i), and π a (x i ) depend on the models for the marginals and for the copula. Let T P = n 1 U P (, θ) denote the normalized score statistic. 8

9 Classical asymptotic results imply that score statistic T P has a limiting normal distribution as the sample size n goes to (see Cox & Hinkley, 1979, ch. 9). To complete the derivation, we have now to estimate its variance. Let I(a, θ) be the observed information matrix, partitioned as [ ] 2 l(a,θ) 2 l(a,θ) [ ] I(a, θ) = a 2 Iaa (a, θ) I a θ = aθ (a, θ), (3.2) I θa (a, θ) I θθ (a, θ) 2 l(a,θ) θ a 2 l(a,θ) θ 2 and let I(, θ) be this matrix evaluated under H. To evaluate the variance of (3.1), we use the Fisher s information matrix I(, θ) =,θ {I(, θ)}. Partitioning I(a, θ) as (3.2), the variance of n 1/2 T P is estimated by σ 2 P = n 1 { I aa (, θ) I aθ (, θ) I 1 θθ (, θ) I θa(, θ) }, (3.3) evaluated at θ = θ. The null hypothesis is rejected at level α if n 1/2 T P / ˆσ P 2 is larger than z 1 α/2, the 1α/2 percentile of the normal distribution. From expression (2.6) for the likelihood, it is clear that I θθ (, θ) depends only on marginal parameters. Note also that this matrix is block diagonal, with blocks I θt θ T (, θ), I θu θ U (, θ) and I θv θ V (, θ), corresponding to components of the null log-likelihood associated with the marginal distributions of T, U and V, respectively. When the marginal distribution for T is exponential, θ T = γ T while θ T = (β, γ T ) for a Weibull margin. The parameters θ U and θ V are defined in a similar way. The inverse I 1 θθ (, θ) is also block diagonal. The next proposition gives I 1 θ T θ T (, θ) for the parameters of the distribution of T in table 1. The blocks for U and V are deduced in a similar way. The proof is technical; a sketch is provided in Appendix A. Proposition 3.1 Let T, U and V be independent. (i) Let T, U and V exponentially distributed, with hazard rate γ T, γ U and γ V, respectively; then n I 1 θ T θ T (, θ) = γ T (γ T + γ U + γ V ). (ii) Let T, U and V Weibull distributed, with common shape parameter β and scale parameters γ T, γ U and γ V, respectively. Let c.5772 denote Euler s constant and b = 1 c ln γ where γ = γ T + γ U + γ V ; then ] [ n I 1 θ T θ T (, θ) = 6 γ β 2 γ T π 2 b β ( b β γ T π b2 ). 9

10 3.2 Score test for Clayton family This section shows how to carry out the score test for Clayton s copula. All the derivations are presented in the appendix. Example 2.1 (continued) For any marginal model, statistic (3.1) is U P (, θ) = ln S T (x i ) + ln S U (x i ) + ln S T (x i ) ln S U (x i ). δ i = δ i =1 For exponential margins, the variance is and for the Weibull model σ 2 P = γ T γ U (γ T + γ U ) (γ T + γ U + γ V ) 3, σ 2 P = γ T γ U (γ T + γ U ) (γ T + γ U + γ V ) 3 {1 6π 2 }. 3.3 Score test for Ali-Mikhail-Haq family This section shows how to carry out the score test for Ali-Mikhail-Haq s copula. All the derivations are presented in the appendix. Example 2.2 (continued) For any marginal model, the statistic (3.1) is U P (, θ) = {1 S T (x i )} {1 S U (x i )} δ i = δ i =1 + 2 {1 S T (x i )} {1 S U (x i )} δ i = 1 {1 S T (x i )} {1 S U (x i )}. For this copula the variance formulas are more complex than for Clayton s. They are expressed as functions k 1 ( ) and k 2 ( ) of the marginal parameters defined by k 1 (γ T, γ U, γ V ) = 2 γ T γu 2 (2 γ T + γ)(2 γ T + γ U + γ)(2 γ T + 2 γ U + γ) γ T γu 2 γ (3.4) (γ T + γ) 2 (γ T + γ U + γ) 2 1

11 and, k 2 (γ T, γ U, γ V ) = k 1 (γ T, γ U, γ V ) 6 γ γ { T ln [γ/(γt + γ)] ln [γ/(γ } 2 T + γ U + γ)], (3.5) π 2 γ T + γ γ T + γ U + γ where γ = γ T + γ U + γ V. Lengthy derivations sketched in appendix yield the following variance formula, σ 2 P = k 1(γ T, γ U, γ V ) + k 1 (γ U, γ T, γ V ), for exponential margins, and for the Weibull model. σ 2 P = k 2(γ T, γ U, γ V ) + k 2 (γ U, γ T, γ V ) 4 Semiparametric tests In this section, we construct score tests for independence using the pseudolikelihood (2.7), assuming that the marginal distribution for T is in a parametric family. These tests do not assume any parametric model for censoring times U and V. 4.1 General results Let l (a, θ T ) = ln L (a, θ T ) be the pseudo log-likelihood for (2.7). The semiparametric test is constructed using the statistic U SP (, θ T ) = a l (a, θ T ), (4.1) a=,θt = θ T where θ T denotes the estimator of θ T under H. To study the asymptotic properties of this statistic, we write the pseudo log-likelihood using counting processes N(t) = n 11 [X i t,δ i =1] and Ȳ (t) = n 11 [X i t] as l (a, θ T ) = τ ln λ at (t) d N(t) 11 τ λ at (t) Ȳ (t) dt, (4.2)

12 where τ = inf { t : Ȳ (t) = } and λ at (t) is given by (2.8). Let and H a,θ T (t) = ln λ at (t) a M a,θt (t) = N(t) t be the martingale associated with counting process Harrington, 1991, Theorem 1.3.1). Using (4.2)-(4.4), statistic (4.1) becomes U SP (, θ T ) = τ (4.3) λ # at (s) Ȳ (s) ds (4.4) N(t) (see Fleming & H, θ T (t) d M, θt (t). (4.5) Note that M, θt (t), i.e. Ma,θT (t) evaluated at a = and θ T = θ T, is not a martingale. This fact motivates the proof of the following proposition which is presented in the appendix. Proposition 4.1 Let T SP (, θ T ) = n 1 U SP (, θ T ), S X ( ) be the survival function of X = min(t, U, V ), and I θt θ T (, θ) be the Fisher s information matrix associated with θ T under H. Suppose that H,θ T (t) satisfies the assumptions of Rebolledo s theorem (Andersen et al., 1993, p. 83). Then, under H : a =, n 1/2 T SP (, θ d T ) N(, σsp 2 ), where where and σsp 2 = [G,θT (t)] 2 S X (t) λ T (t) dt, (4.6) G,θT (t) = H,θT (t) A [I θt θ T (, θ)] 1 λ T (t) λ T (t), (4.7) A = where λ T (t) = λ T (t)/ θ T. H,θT (t) S X (t) λ T (t) dt, (4.8) This semiparametric test is easily implemented for the two copulas considered in this work, for any marginal model for T. For Clayton s, from (4.3) and (2.2), H,θ T (t) = ln ˆπ(t) ln S T (t), (4.9) 12

13 while for Ali-Mikhail-Haq s, (4.3) and (2.4) yield H,θ T (t) = ˆπ(t) S T (t). (4.1) We estimate σ 2 SP nonparametrically by ˆσ SP 2 = 1 { 2 G n, θ T (x i )}, (4.11) δ i =1 where G (t) is obtained from G, θ,θt (t), replacing π(t) by ˆπ(t) and θ T T θ T, and  = 1 1 H n, θ T (x i ) [ λt (x i )] λt (x i ) δ i =1 For the model with exponential margin, the observed information matrix I θt θ T (, θ) and the estimated covariance matrix of the score function coincide, so we obtain in this case ˆσ SP 2 = 1 n δ i =1 { } 2 H 1, θ T (x i ) D T { } 2 H, θ T (x i ) δ i =1 by (4.12) To calculate relative efficiencies of this test with respect to the fully parametric tests of Section 3, one needs to evaluate the variance σsp 2 for the parametric models of Section 3. This is done next. 4.2 Evaluation of σsp 2 for the copulas of Clayton and of Ali-Mikhail-Haq Example 2.1 (continued) For Clayton s family with any specified marginal distributions, (4.9) yields H,θT (t) = ln S U (t). Using the expressions for I 1 θ T θ T (, θ) given in proposition 3.1 and evaluating (4.6)-(4.8), with exponential margins for T, U and V as defined in table 1, yields σ 2 SP = γ T γ 2 U (γ T + γ U + γ V ) 3. When T, U and V have the Weibull margins given in table 1, similar manipulations give σsp 2 = γ T γu {1 2 6π }. (γ T + γ U + γ V )

14 Example 2.2 (continued) For the Ali-Mikhail-Haq family with any specified marginal distributions, (4.1) yields H,θT = S U (t){1 S T (t)}. Along the lines of the derivations for Clayton s family, we prove that, when T, U and V have exponential margins, σ 2 SP = k 1(γ T, γ U, γ V ); with Weibull margins for T, U and V, σ 2 SP = k 2(γ T, γ U, γ V ), where k 1 ( ) and k 2 ( ) are defined by (3.4) and (3.5), respectively. Calculations are more tedious than those for Clayton s copula. They are similar to those underlying the derivation of the parametric tests. 5 Relative efficiencies 5.1 General results In this section, we compare fully parametric and semiparametric tests of independence, using Pitman efficiencies. Consider two test statistics for H : a =, say T 1 and T 2, and suppose that the limiting distribution of n 1/2 (T j µ j (a n ))/σ j (a n ) is, for j = 1, 2, a standard normal distribution, where a n is O(n 1/2 ) and µ j (a n ) and σj 2(a n)/n are the asymptotic mean and variance of T j when the dependence parameter is a n. The relative efficiency or Pitman efficiency of T 1 with respect to T 2 for testing H : a = is defined by ( ) µ 2 E(T 1, T 2 ) = 1 ()/σ 1 () µ 2 ()/σ. 2() A detailed discussion of the conditions for Pitman s efficiency to be well defined is given in van der Vaart (2, p ). The efficiency E(T 1, T 2 ) gives the ratio of the sample sizes n 2 /n 1 needed for the tests based on T 1 and T 2 to achieve asymptotically the same local power. An efficiency larger than 1 means that the test based on T 1 is locally more powerful than the one based on T 2. To calculate the efficiencies of score tests, it is convenient to consider estimators â of the dependence parameter obtained by solving U P (a, θ) =. 14

15 When a = and as the sample size goes to, the asymptotic distribution of â is normal with mean and variance given by the (a, a) entry of I(, θ) 1. Standard results on the inverses of partitioned matrices can be used to demonstrate that Var(n 1/2 â) = n{i aa (, θ) I aθ (, θ) I 1 θθ (, θ) I θa(, θ)} 1 = 1/σ 2 P (). According to Kendall & Stuart (1979, ch. 25), the above variance satisfies 1/σP 2 () = [σ P ()/µ P ()/]2 ; in other words µ P () = σ P () 2. Therefore the Pitman efficiency of one score test with respect to another is equal to the ratio of their asymptotic variances. For the semiparametric test, µ SP () is evaluated in appendix C and is shown to satisfy µ SP () = σ2 SP (). This implies that the efficiency of the parametric test with respect to the semiparametric test is given by E(T P, T SP ) = σp 2 /σsp 2, where σp 2 and σ2 SP are given by (3.3) and (4.6), respectively. In the next section, we recall some properties of the standard score tests for independence, constructed with uncensored samples of bivariate observations for (T, U). Pitman efficiencies of the parametric tests for censored sample with respect to those for uncensored data provide information about the loss of information due to censoring. 5.2 Score tests of independence for uncensored data Let T C be a normalized score statistic to test for independence in an uncensored data set (index C stands for complete samples). The score function for a is independent of the score function for the marginal parameters when a = (see Genest, Ghoudi & Rivest, 1995). Thus functions µ C (a) and σc 2 (a) needed to calculate Pitman efficiencies do not depend on the margins for T and U. For score statistic T C, the (a, θ) submatrix of the Fisher information matrix for uncensored samples is null in (3.3). Thus the asymptotic variance of nt C is the (a, a) term of the above Fisher information matrix. In other words, σc 2 = 1 { } 2 l C (a) n a 2, a= where l C (a) = n ln f a(y 1i, y 2i ), {(y 1i, y 2i ); i = 1,..., n} is a bivariate sample of independent random variables uniformly distributed on [, 1], and 15

16 f a (y 1, y 2 ) is the density for copula C a defined in (1.2), f a (y 1, y 2 ) = φ a [C a (y 1, y 2 )] φ a(y 1 ) φ a(y 2 ) {φ a [C a(y 1, y 2 )]} 3. The arguments of Kendall & Stuart (1979, ch. 25) mentioned above, to prove µ P () = σ2 P, can be used here to obtain µ C () = σ2 C. Example 2.1 (continued) For the Clayton family, the joint density is f a (y 1, y 2 ) = (1 + a) y a 1 1 y a 1 2 {C a (y 1, y 2 )} 2 a+1 ; using results of section 2.2, we prove that 1 n 2 l C (a) a 2 and finally, taking expectation under H, a 1 ln y 1 ln y 2 {4 + ln y 1 + ln y 2 }, σ 2 C = 1. Example 2.2 (continued) For the Ali-Mikhail-Haq family, the joint density is f a (y 1, y 2 ) = C a(y 1, y 2 ) {1 + a [C a (y 1, y 2 ) 1]} {1 + a [2 C a (y 1, y 2 ) 1]} ; y 1 y 2 {1 + a(y 1 1)} {1 + a(y 2 1)} using results of section 2.2 yields 1 2 l C (a) n a 2 = 6 y 1 y 2 (1 y 1 )(1 y 2 ) (1 y 1 ) 2 (1 y 2 ) 2 a= and finally (1 y 1 ) 2 (1 y 2 ) 2 + (y 1 y 2 1) 2 + (2 y 1 y 2 1) 2, σ 2 C = 1/9. In the next section, we give some numerical results about E(T P, T SP ) and E(T C, T P ) for Clayton s and Ali-Mikhail-Haq s families. 16

17 5.3 Calculations of relative efficiencies The efficiencies have the following simple form, E(T P, T SP ) = γ T + γ U γ U for both exponential and Weibull margins under Clayton s model. For Ali- Mikhail-Haq s they are written in terms of the functions k 1 ( ) and k 2 ( ) (see (3.4)-(3.5)) as E(T P, T SP ) = 1 + k 1(γ U, γ T, γ V ) k 1 (γ T, γ U, γ V ) and E(T P, T SP ) = 1 + k 2(γ U, γ T, γ V ) k 2 (γ T, γ U, γ V ), for the exponential and Weibull models respectively. Observe that in all cases E(T P, T SP ) > 1, as expected. The semiparametric test relies on fewer assumptions; this results in a loss of power. Also when γ T = γ U, E(T P, T SP ) = 2. These efficiencies depend only on the expected proportions for the three possible outcomes, say γ T /γ, γ U /γ and γ V /γ. Table 2 and 3 give E(T P, T SP ) and E(T C, T P ) for Clayton s and Ali-Mikhail-Haq s copula for several patterns of outcomes. Table 2: Relative efficiencies for the Clayton family Exponential margins Weibull margins γ T /γ γ U /γ γ V /γ E(T P, T SP ) E(T C, T P ) E(T P, T SP ) E(T C, T P ) In table 2, when γ V /γ = (as defined in section 3, γ = γ T + γ U + γ V ), E(T P, T SP ) decreases as the proportion of dependent censoring γ U /γ increases, and E(T C, T P ) is minimal when γ T /γ = γ U /γ. When the proportion γ T /γ is fixed, E(T P, T SP ) and E(T C, T P ) decrease as γ U /γ increases. Comparing E(T C, T P ) for exponential and Weibull models shows that, the additional 17

18 Table 3: Relative efficiencies for the Ali-Mikhail-Haq family Exponential margins Weibull margins γ T /γ γ U /γ γ V /γ E(T P, T SP ) E(T C, T P ) E(T P, T SP ) E(T C, T P ) parameter of the Weibull margin requires an increase of 1{6/(π 2 6)} = 155% in the sample size to reach the local power obtained with exponential margins. In table 3, the loss of efficiency with respect to tests for complete samples is much more important than in table 2. A possible explanation for this is that under Ali-Mikhail-Haq s model, when a is small, λ # at (t) λ T (t){1 as T (t)[1 S U (t)]}. Thus the crude and net hazard are equal for small and large values of t. They differ only for medium values of t. Discriminating between λ # at (t) and λ T (t) is therefore more difficult than for Clayton s copula for which λ # at (t)/λ T (t) decreases in t when a is small. Remark that E(T P, T SP ) is greater for exponential margins as compared to Weibull s. When γ V /γ =, E(T P, T SP ) decreases as γ U /γ increases; a similar behaviour was noted for Clayton s family. For Weibull margins, E(T C, T P ) is minimal when γ T /γ = γ U /γ. However, for exponential margins, γ T /γ = γ U /γ corresponds in fact to a local maximum for E(T C, T P ); there are two local minima at γ U /γ.24,.76. When the proportion γ T /γ is fixed, E(T P, T SP ) and E(T C, T P ) decrease as γ U /γ increases. Comparing E(T C, T P ) for exponential and Weibull models shows that, for Weibull margins, much more data are needed to reach a given level of local power. 6 A numerical example This section considers the survival times, in weeks, of 61 patients with unoperable lung cancer treated with cyclophosphamide considered in Lagakos 18

19 & Williams (1978) and in Lee & Wolfe (1998). The data set is:.14 +,.14 +,.29 +,.43 +,.43,.57 +,.57 +, , 2.86, 3. +, 3. +, 3.14, 3.14, , , 3.43, 3.43, 3.71, 3.86, 6. +, 6. +, , 6.14, 6.86, , 9., 9.43, , 1.71, 1.86, 11.14, , 13., 14.43, , 15.71, , , 18.43, 18.57, , 2.71, , , 26. +, , 29.14, 29.71, , , 4.57, , 48.57, 49.43, 53.86, 61.86, 66.57, 68.71, 68.96, 72.86, The exponent + identifies the 28 censored observations. They correspond to patients whose treatment was terminated because of a deteriorating condition. Following Lagakos & Williams (1978), we fit exponential distributions to both the survival and the censoring distributions. The parameter estimates are γ T =.276, (s.e. =.48) and γ U =.234, (s.e. =.44). This section investigates a possible dependency between these two variables. All the results of the previous sections apply; the lack of independent censoring simplifies the calculations. The form of the statistic U P (, θ) and of the estimated variance ˆσ P 2 for the parametric tests are easily deduced from the results of subsections 3.2 and 3.3. For the semiparametric tests with exponential margin, the estimator of σsp 2 is given by (4.12), but some technical manipulations are needed to evaluate U SP (, θ T ) from (4.1). From (4.5), we obtain U SP (, θ T ) = δ i =1 H, θ T (x i ) γ T τ H, θ T (t) Ȳ (t) dt, (6.1) where H (t) = f[ˆπ(t)] f[exp( γ, θ T t)], with f(y) = log y for Clayton s T copula, and f(y) = y for Ali-Mikhail-Haq s copula. Assume that all the X i s are distinct, and let = X () < X (1) < < X (n) denote the corresponding order statistics. Since ˆπ(x) = [n (i 1)]/n, x (X (i 1), X (i) ], the following holds, τ f[ˆπ(t)] Ȳ (t) dt = = = X(i) f[ˆπ(t)]dt i [ n (j 1) f n j=1 [ n (i 1) (n i + 1)f n ] (X (j) X (j 1) ) ] (X (i) X (i 1) ) 19

20 = n ˆπ(X (i) )f[π(x (i) )](X (i) X (i 1) ) This allows to simplify score statistics (6.1). For Clayton s copula, U SP (, θ T ) = log ˆπ(X i ) + γ T X i δ i =1 n γ T while for Ali-Mikhail-Haq, ˆπ(X (i) ) log[π(x (i) )] ( X (i) X (i 1) ) + γ 2 T 2 x 2 i, U SP (, θ T ) = δ i =1 ˆπ(X i ) exp( γ T X i ) n γ T ˆπ(X (i) ) ( ) 2 X (i) X (i 1) + [1 exp( γ T X i )]. The table below gives the p-values for the four tests considered here. There is a signicant dependency for the parametric test constructed using Ali-Mikhail- Haq s copula. Table 4: P -values of four tests of independence for Lagakos & Williams data Model Parametric test Semiparametric test Clayton Ali-Mikhail-Haq Given the poor showing of the score test for Ali-Mikhail-Haq family in the efficiency comparisons presented in section 5, it is unexpected that this test gives the only significant result. To investigate this further, one can estimate the failure rate of the survival time by fitting a piecewise exponential distribution. This reveals that the hazard rate has a bathtub shape: it is high in the first 2 weeks and after the 6th and relatively low in between. According to the discussion in section 2.2 Ali-Mikhail-Haq copula is more appropriate to model this type of discrepancy than Clayton s. This explains the significant outcome of Table 4. The increase in p-value when one goes to the semiparametric test is in line with the findings of section 5. 2

21 7 Discussion In many instances a positive dependence between T and U is the most interesting alternative to H : a =. One-sided versions of the tests presented here could then be considered. These tests would reject the null hypothesis if the score statistics is too large. This would permit a much needed increase in power for the independence tests. The efficiency comparisons of section 5 suggest that dependent censoring is hard to detect using standard lifetime data; additional follow-ups of censored or dead units might be needed to make such a detection. Testing for a dependency between T and U amounts to testing the validity of the parametric models for the margins. As shown in section 2, a Clayton s dependency multiplies the failure rate by a decreasing function while for Ali-Mikhail-Haq, the multiplicative term has a bathtub shape. Tables 2 and 3, and some simulations not presented here, suggest that a dependency is much more difficult to detect for Ali-Mikhail-Haq s model that for Clayton s. The expressions for λ # at (t) presented in section shed some light on this finding: for Ali-Mikhail-Haq s model λ # at (t) is much closer to a Weibull failure rate that for Clayton s. This highlights that in competing risk models certain types of dependency might be nearly impossible to detect. Still, as noted by Zheng & Klein (1995) and Rivest & Wells (21), once a dependency is detected and its level ascertained the estimator of the marginal survival function does not depend too much on the family of copulas chosen to model the dependency. This paper leaves the door open for future work. Improvements over the semiparametric tests presented in Section 4 could possibly be achieved by using the full score statistics evaluated at the Kaplan-Meier estimator of S U (t). The investigation of the estimators of the parameters of the marginal distribution of T and of the dependence parameter a obtained by maximizing the pseudo-likelihood (2.7) would be of interest. The sampling distribution of the resulting estimators could possibly be derived following the arguments presented in Section 4. Acknowledgements We are grateful to Bas Werker for stimulating discussion on semiparametric estimation. Research funds in partial support of this work were granted by the Natural Sciences and Engineering Research Council of Canada and 21

22 by the Fonds pour la formation de chercheurs et l aide à la recherche du Gouvernement du Québec. Appendix A Proofs for the parametric test Recall that X = min(t, U, V ) and δ indicates what kind of event is observed, say δ = 1 if X = T, δ = if X = U and δ = 1 if X = V. Let D T, D U and D V the respective frequencies of each kind of event among the n units in the sample. The evaluations of the Fisher information matrices needed to construct the score tests rely on the following lemma. Lemma A.1 Let T, U and V be independent. (i) If T, U and V are exponentially distributed, with hazard rate γ T, γ U and γ V, respectively, then X and (D T, D U, D V ) are independent, and (a) X Exponential(γ), where γ = γ T + γ U + γ V ; (b) (D T, D U, D V ) Multinomial(γ T /γ, γ U /γ, γ V /γ). (ii) If T, U and V are Weibull distributed, with common shape parameter β, and respective scale parameter γ T, γ U and γ V, then X and (D T, D U, D V ) are independent and (a) X W eibull(β, γ), where γ = γ T + γ U + γ V ; (b) (D T, D U, D V ) Multinomial(γ T /γ, γ U /γ, γ V /γ); (c) Y = γx β Exponential(1), and ln Y has an extreme value distribution, so {ln Y } = c, where c.5772 is Euler s constant, and Var{ln Y } = π 2 /6 (see Lawless, 1982, section 1.3); in addition, we have {Y ln Y } = 1 c, { Y 2 ln Y } = 3 2c, { Y (ln Y ) 2 } = π2 6 + c(c 2), 22

23 and, for any λ >, { e (λ/γ)y } = { Y e (λ/γ)y } = { Y ln Y e (λ/γ)y } = γ λ + γ, ( ) 2 γ, λ + γ ( ) 2 { ( )} γ γ 1 c + ln. λ + γ λ + γ A.1 Proof of Proposition 3.1 Using the complete likelihood (2.6), at a =, we have the general form I θθ (, θ) = { } 2 ln λ T (x i ) { } 2 ln S T (x i ) θ 2 θ 2 δ i =1 { } 2 ln λ U (x i ) { } 2 ln S U (x i ) θ 2 θ 2 δ i = { } 2 ln λ V (x i ) { } 2 ln S V (x i ). (A.1) θ 2 θ 2 δ i = 1 From (A.1), it is clear that I θθ (, θ) is block diagonal, with blocks I θt θ T (, θ), I θu θ U (, θ) and I θv θ V (, θ). Exponential model The three blocks of I θθ (, γ) are 1 1 matrices. From (A.1), the component for survival time T is I θt θ T (, θ) = D T /γt 2 (see Lawless, 1982, p. 17). Part (i) of proposition 3.1 follows immediately from lemma A.1 (i). Weibull model From (A.1), the entries of the 2 2 observed information matrix for the parameters of survival time T are 2 l(, θ) = D T β 2 β + γ 2 T x β i (ln x i) 2, 2 l(, θ) γ 2 T 2 l(, θ) β γ T = = D T, γ 2 T x β i ln x i. 23

24 Using { lemma A.1 (ii), and noting b = 1 c ln γ (c.5772), we have X β ln X } { = b/(βγ) and X β (ln X) 2} = {π 2 /6 1 + b 2 } /(β 2 γ), and we deduce n 1 I θt θ T (, θ) = 1 γ [ γt β 2 ( π b2 ) Inverting this matrix yields part (ii) of proposition 3.1. b β b β 1 γ T ]. A.2 Derivations underlying tests for Clayton s family From expression (2.5) for the likelihood, and using (2.1)-(2.2), we have U P (a, θ) = ln S T (x i ) ln S U (x i ) + ln π(x i ) δ i =1 δ i = δ i 1 { } ln π(xi ) + + a { } ln π(xi ). (A.2) a a δ i 1 Using the results of section 2.2, example 2.1, completes the calculations of U P (, θ) for Clayton s family. Taking the partial derivatives of (A.2), at a =, and the results in section 2.2 yields, for any marginal model, I aa (, θ) = ln S T (x i ) ln S U (x i ) {ln S T (x i ) + ln S U (x i )} 2 I θa (, θ) = δ i = δ i 1 ln S T (x i ) ln S U (x i ), [S T (x i )] 1 Ṡ T (x i ) δ i =1 [S U (x i )] 1 Ṡ U (x i ) (A.3) [S T (x i )] 1 Ṡ T (x i ) ln S U (x i ) [S U (x i )] 1 Ṡ U (x i ) ln S T (x i ). (A.4) Exponential model Using lemma A.1(i), (A.3)-(A.4) yields n 1 I aa (, θ) = 2 γ T γ U (γ T + γ U ) γ 3 24

25 and ( n 1 γu I aθ (, θ) = γ 2, γ ) T γ,, 2 where γ = γ T +γ U +γ V, and proposition 3.1 (i) yields σp 2 with exponential margins. for Clayton s family Weibull model Using lemma A.1 (ii), we deduce from (A.3)-(A.4) that n 1 I aa (, θ) = 2 γ T γ U (γ T + γ U ) γ 3 and n 1 I aθ (, θ) = 1 γ 2 ( γt γ U (1 + b) β, γ U, γ ) T γ U (1 + b), γ T,,, β where γ = γ T + γ U + γ V and b = 1 c ln γ, and using proposition 3.1 (ii), we deduce σp 2 for Clayton s family with Weibull margins. A.3 Derivations underlying tests for Ali-Mikhail-Haq s family From likelihood (2.5), and using (2.3)-(2.4), we have U P (a, θ) = 1 S T (x i ) 1 + a [S T (x i ) 1] + 1 S U (x i ) 1 + a [S U (x i ) 1] δ i =1 δ i = ( ) ln π(x i ) + + π(x i ) 1 + a π(xi ) a. (A.5) a 1 + a [π(x i ) 1] δ i 1 Using the results of section 2.2, example 2.2, completes the proof of U P (, θ) for the Ali-Mikhail-Haq s family. Expressing the partial derivatives of (A.5), at a =, in terms of 1 S T (x i ) and 1 S U (x i ), and using the results of section 2.2 yields, for any marginal model, I aa (, θ) = δ i = {1 S T (x i )} 2 + δ i =1 {1 S U (x i )} 2 25

26 2 {1 S T (x i )} 2 {1 S U (x i )} 2 + δ i = 1 {1 S T (x i )} 2 {1 S U (x i )} 2, (A.6) I θa (, θ) = Ṡ T (x i ) Ṡ U (x i ) δ i = δ i =1 + 2 {ṠT (x i ) [1 S U (x i )] + ṠU(x i ) [1 S T (x i )]} δ i = 1 {ṠT (x i ) [1 S U (x i )] + ṠU(x i ) [1 S T (x i )]}. (A.7) Exponential model Using lemma A.1 (i), we deduce from (A.6)-(A.7) that n 1 I aa (, θ) = 2 γ T γu 2 (2 γ T + γ)(2 γ T + γ U + γ)(2 γ T + 2 γ U + γ) and + n 1 I aθ (, θ) = 2 γ 2 T γ U (2 γ U + γ)(γ T + 2 γ U + γ)(2 γ T + 2 γ U + γ) ( ) 1 γu γ T + γ U + γ γ T + γ, γ T γ U + γ,, where γ = γ T + γ U + γ V, and using the proposition 3.1 (i), we deduce σ 2 P for Ali-Mikhail-Haq s family with exponential margins. Note that proof I aθ (, θ) is straightforward, however tedious calculations are needed to obtain I aa (, θ). Weibull model Using lemma A.1 (ii), we deduce easily from (A.6)-(A.7) that I aa (, θ) is the same as that for the exponential model. The contribution, I aθt (, θ), of T to I aθ (, θ) = (I aθt (, θ), I aθu (, θ),, ) is } n γ T β { 1 c ln(γt + γ) 1 c ln(γ T + γ U + γ) γ T + γ γ T + γ U + γ γ U (γ T + γ)(γ T + γ U + γ) where γ = γ T + γ U + γ V. Using the proposition 3.1 (ii), we deduce σp 2 for Ali-Mikhail-Haq s family with Weibull margins. Lengthy derivations are required to obtain the first term of I aθt (, θ); calculation of the final variance σp 2 requires additional technical manipulations. 26,

27 B Proof of proposition 4.1 Writing M, θt (t) = M,θT (t) t { λ T (s) λ T (s)}ȳ (s) ds, where M,θT (t) is a martingale under H (see (4.4)), and using first order expansions for H (t) H, θ T,θ T (t) and λ T (s) λ T (s), (4.5) becomes τ T SP (, θ T ) = 1 H,θ n T (t)d M,θT (t) ) 1 τ ( θt θ T H,θ n T (t) Ȳ (t) λ T (t) dt ) 1 τ + ( θt θ T Ḣ,θT (t) d n M,θT (t). (B.1) Under classical regularity conditions (see Andersen et al., 1993, p ), one has n( θ T θ T ) d N (, [I θt θ T (, θ)/n] 1 ); furthermore the strong law of large numbers yields that n 1 τ Ḣ,θT (t) d M,θT (t). Therefore the third term in (B.1) is negligible. Note that Ȳ (t)/n is the empirical estimator of the survival distribution S X ( ) of X = min(t, U, V ). Assuming that H,θ T (t) is uniformly bounded (see (4.3) and (2.8)), one has 1 n τ and (B.1) becomes where under H, H,θ T (t) Ȳ (t) λ T (t) dt A = T SP (, θ T ) 1 n τ H,θT (t) S X (t) λ T (t) dt, H,θ T (t)d M,θT (t) A ( θt θ T ), (B.2) τ θ T θ T = [I θt θ T (, θ)] 1 [λ T (t)] 1 λt (t) d M,θT (t) + R, (B.3) with n 1/2 R. By (B.2)-(B.3), we have now that where T SP (, θ τ T ) n 1 G,θ T (t) d M,θT (t), G,θ T (t) = H,θ T (t) A [I θt θ T (, θ)] 1 λ T (t) λ T (t). (B.4) 27

28 Convergence in law of n 1/2 T SP (, θ T ) results now from the classical martingale limit central theorem (see Andersen et al., 1993, th. II.5.3). The asymptotic variance given in proposition 4.1 comes from the evaluation of {[T SP (, θ } T )] 2, using expression (B.4) (see Fleming & Harrington, 1991, th ). C Evaluation of µ SP () Writing M,θT (t) = M a,θt (t) t {λ T (s) λ # at (s)}ȳ (s) ds, where M a,θt (t) is defined by (4.4), when the value of the dependence parameter is a, one can write (B.4) as n 1 U SP (, θ T ) 1 n 1 n τ τ G,θ T (t) d M a,θt (t) { } G,θ T (t) λ T (t) λ # at (t) Ȳ (t) dt, where M a,θt (t) is a martingale when a is non zero. The expectation of the first term of the right hand side is null. Since Ȳ (t)/n is an estimator for S X (t), the second term yields µ SP () = G,θT (t) H,θT (t) λ T (t) S X (t) dt, where G,θT (t) = H,θT (t) A [I θt θ T (, θ)] 1 λt (t)/λ T (t). above expression yields µ SP () = σ2 SP (). Evaluating the References [1] Andersen, P. K., Borgan, O., Gill, R. D. & Keiding, N. (1993) Statistical Models Based on Counting Processes. Springer-Verlag, New-York. [2] Bhattacharyya, A. (1997) Modeling exponential survival data with dependent censoring. Sankhyā, Series A, Indian Journal of Statistics 59: [3] Cox, D. R. & Hinkley, D. V. (1979) Theoretical Statistics. Chapman & Hall. 28

29 [4] Emoto, S. & Matthews, P. (199) A Weibull model for dependent censoring. The Annals of Statistics 18: [5] Fleming, T. R. & Harrington, D. P. (1991) Counting processes and survival analysis. Wiley. [6] Frangakis, C. E. & Rubin, D. B. (21) Addressing an idiosyncrasy in estimating survival curves using double sampling in the presence of selfselected right censoring. Biometrics 57: [7] Genest, C. (1987) Frank s family of bivariate distributions. Biometrika 74: [8] Genest, C. & McKay, R. (1986) Copules archimédiennes et familles bidimensionnelles dont les marges sont données. The Canadian Journal of Statistics 14: [9] Genest, C., Ghoudi, K. & Rivest, L.-P. (1995) A semiparametric estimation procedure of dependence parameters in multivariate families of distributions. Biometrika 82: [1] Hougaard, P. (2) Analysis of multivariate survival data. Springer. [11] Kendall, M. & Stuart, A. (1979) The advanced theory of statistics, vol. 2. Hafner Publishing Company. [12] Lagakos, S. W. & Williams, J. S. (1978) Models for censored survival analysis: A cone class of variable-sum models. Biometrika 65: [13] Lawless, J. F. (1982) Statistical models and methods for lifetime data. Wiley. [14] Lee, S. & Wolfe, R. A. (1995) A class of linear nonparametric rank tests for dependent censoring. Communications in Statistics, Part A Theory and Methods 24: [15] Lee, S. & Wolfe, R. A. (1998) A simple test for independent censoring under the proportional hazards model. Biometrics 54: [16] Lee, S. (1996) Power calculation for a score test in the dependent censoring model. Statistics in Medicine 15:

30 [17] Lin, D. Y., Robins, J. M. & Wei, L. J. (1996) Comparing two failure time distributions in the presence of dependent censoring. Biometrika 83: [18] Nelsen, R. (1999) An introduction to copula. Lecture Notes in Statistics 139. Springer-Verlag, New-York. [19] Oakes, D. (1989) Bivariate survival models induced by frailties. JASA 84: [2] Rivest, L.-P. & Wells, M. T. (21) A martingale approach to the copulagraphic estimator for the survival function under dependent censoring. Journal of Multivariate Analysis 79: [21] Tsiatis, A. (1975) A nonidentifiability aspect of the problem of competing risks. Proceedings of the National Academy of Sciences of the United States of America. 72: [22] van der Vaart, A. W. (2) Asymptotic statistics. Cambridge University Press. [23] Zheng, M. & Klein, J.(1995) Estimates of marginal survival for dependent competing risks based on an assumed copula. Biometrika 82:

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS Statistica Sinica 20 (2010), 441-453 GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS Antai Wang Georgetown University Medical Center Abstract: In this paper, we propose two tests for parametric models

More information

Multivariate Survival Data With Censoring.

Multivariate Survival Data With Censoring. 1 Multivariate Survival Data With Censoring. Shulamith Gross and Catherine Huber-Carol Baruch College of the City University of New York, Dept of Statistics and CIS, Box 11-220, 1 Baruch way, 10010 NY.

More information

Frailty Models and Copulas: Similarities and Differences

Frailty Models and Copulas: Similarities and Differences Frailty Models and Copulas: Similarities and Differences KLARA GOETHALS, PAUL JANSSEN & LUC DUCHATEAU Department of Physiology and Biometrics, Ghent University, Belgium; Center for Statistics, Hasselt

More information

A Measure of Association for Bivariate Frailty Distributions

A Measure of Association for Bivariate Frailty Distributions journal of multivariate analysis 56, 6074 (996) article no. 0004 A Measure of Association for Bivariate Frailty Distributions Amita K. Manatunga Emory University and David Oakes University of Rochester

More information

Lecture 5 Models and methods for recurrent event data

Lecture 5 Models and methods for recurrent event data Lecture 5 Models and methods for recurrent event data Recurrent and multiple events are commonly encountered in longitudinal studies. In this chapter we consider ordered recurrent and multiple events.

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

FULL LIKELIHOOD INFERENCES IN THE COX MODEL October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach

More information

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data 1 Part III. Hypothesis Testing III.1. Log-rank Test for Right-censored Failure Time Data Consider a survival study consisting of n independent subjects from p different populations with survival functions

More information

STAT 331. Accelerated Failure Time Models. Previously, we have focused on multiplicative intensity models, where

STAT 331. Accelerated Failure Time Models. Previously, we have focused on multiplicative intensity models, where STAT 331 Accelerated Failure Time Models Previously, we have focused on multiplicative intensity models, where h t z) = h 0 t) g z). These can also be expressed as H t z) = H 0 t) g z) or S t z) = e Ht

More information

Estimation and Goodness of Fit for Multivariate Survival Models Based on Copulas

Estimation and Goodness of Fit for Multivariate Survival Models Based on Copulas Estimation and Goodness of Fit for Multivariate Survival Models Based on Copulas by Yildiz Elif Yilmaz A thesis presented to the University of Waterloo in fulfillment of the thesis requirement for the

More information

STAT Sample Problem: General Asymptotic Results

STAT Sample Problem: General Asymptotic Results STAT331 1-Sample Problem: General Asymptotic Results In this unit we will consider the 1-sample problem and prove the consistency and asymptotic normality of the Nelson-Aalen estimator of the cumulative

More information

Survival Analysis using Bivariate Archimedean Copulas. Krishnendu Chandra

Survival Analysis using Bivariate Archimedean Copulas. Krishnendu Chandra Survival Analysis using Bivariate Archimedean Copulas Krishnendu Chandra Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy under the Executive Committee of the

More information

Imputation Algorithm Using Copulas

Imputation Algorithm Using Copulas Metodološki zvezki, Vol. 3, No. 1, 2006, 109-120 Imputation Algorithm Using Copulas Ene Käärik 1 Abstract In this paper the author demonstrates how the copulas approach can be used to find algorithms for

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 24 Paper 153 A Note on Empirical Likelihood Inference of Residual Life Regression Ying Qing Chen Yichuan

More information

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model Other Survival Models (1) Non-PH models We briefly discussed the non-proportional hazards (non-ph) model λ(t Z) = λ 0 (t) exp{β(t) Z}, where β(t) can be estimated by: piecewise constants (recall how);

More information

Empirical Processes & Survival Analysis. The Functional Delta Method

Empirical Processes & Survival Analysis. The Functional Delta Method STAT/BMI 741 University of Wisconsin-Madison Empirical Processes & Survival Analysis Lecture 3 The Functional Delta Method Lu Mao lmao@biostat.wisc.edu 3-1 Objectives By the end of this lecture, you will

More information

Statistical Inference and Methods

Statistical Inference and Methods Department of Mathematics Imperial College London d.stephens@imperial.ac.uk http://stats.ma.ic.ac.uk/ das01/ 31st January 2006 Part VI Session 6: Filtering and Time to Event Data Session 6: Filtering and

More information

A copula model for dependent competing risks.

A copula model for dependent competing risks. A copula model for dependent competing risks. Simon M. S. Lo Ralf A. Wilke January 29 Abstract Many popular estimators for duration models require independent competing risks or independent censoring.

More information

Survival Analysis for Case-Cohort Studies

Survival Analysis for Case-Cohort Studies Survival Analysis for ase-ohort Studies Petr Klášterecký Dept. of Probability and Mathematical Statistics, Faculty of Mathematics and Physics, harles University, Prague, zech Republic e-mail: petr.klasterecky@matfyz.cz

More information

Lifetime Dependence Modelling using a Generalized Multivariate Pareto Distribution

Lifetime Dependence Modelling using a Generalized Multivariate Pareto Distribution Lifetime Dependence Modelling using a Generalized Multivariate Pareto Distribution Daniel Alai Zinoviy Landsman Centre of Excellence in Population Ageing Research (CEPAR) School of Mathematics, Statistics

More information

Robustness of a semiparametric estimator of a copula

Robustness of a semiparametric estimator of a copula Robustness of a semiparametric estimator of a copula Gunky Kim a, Mervyn J. Silvapulle b and Paramsothy Silvapulle c a Department of Econometrics and Business Statistics, Monash University, c Caulfield

More information

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates Communications in Statistics - Theory and Methods ISSN: 0361-0926 (Print) 1532-415X (Online) Journal homepage: http://www.tandfonline.com/loi/lsta20 Analysis of Gamma and Weibull Lifetime Data under a

More information

Likelihood Construction, Inference for Parametric Survival Distributions

Likelihood Construction, Inference for Parametric Survival Distributions Week 1 Likelihood Construction, Inference for Parametric Survival Distributions In this section we obtain the likelihood function for noninformatively rightcensored survival data and indicate how to make

More information

UNIVERSITY OF CALIFORNIA, SAN DIEGO

UNIVERSITY OF CALIFORNIA, SAN DIEGO UNIVERSITY OF CALIFORNIA, SAN DIEGO Estimation of the primary hazard ratio in the presence of a secondary covariate with non-proportional hazards An undergraduate honors thesis submitted to the Department

More information

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data Communications for Statistical Applications and Methods 2015, Vol. 22, No. 6, 599 604 DOI: http://dx.doi.org/10.5351/csam.2015.22.6.599 Print ISSN 2287-7843 / Online ISSN 2383-4757 Estimation of Conditional

More information

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky Empirical likelihood with right censored data were studied by Thomas and Grunkmier (1975), Li (1995),

More information

Lecture 22 Survival Analysis: An Introduction

Lecture 22 Survival Analysis: An Introduction University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 22 Survival Analysis: An Introduction There is considerable interest among economists in models of durations, which

More information

Politecnico di Torino. Porto Institutional Repository

Politecnico di Torino. Porto Institutional Repository Politecnico di Torino Porto Institutional Repository [Article] On preservation of ageing under minimum for dependent random lifetimes Original Citation: Pellerey F.; Zalzadeh S. (204). On preservation

More information

Exercises. (a) Prove that m(t) =

Exercises. (a) Prove that m(t) = Exercises 1. Lack of memory. Verify that the exponential distribution has the lack of memory property, that is, if T is exponentially distributed with parameter λ > then so is T t given that T > t for

More information

Power and Sample Size Calculations with the Additive Hazards Model

Power and Sample Size Calculations with the Additive Hazards Model Journal of Data Science 10(2012), 143-155 Power and Sample Size Calculations with the Additive Hazards Model Ling Chen, Chengjie Xiong, J. Philip Miller and Feng Gao Washington University School of Medicine

More information

From semi- to non-parametric inference in general time scale models

From semi- to non-parametric inference in general time scale models From semi- to non-parametric inference in general time scale models Thierry DUCHESNE duchesne@matulavalca Département de mathématiques et de statistique Université Laval Québec, Québec, Canada Research

More information

Lecture 3. Truncation, length-bias and prevalence sampling

Lecture 3. Truncation, length-bias and prevalence sampling Lecture 3. Truncation, length-bias and prevalence sampling 3.1 Prevalent sampling Statistical techniques for truncated data have been integrated into survival analysis in last two decades. Truncation in

More information

Multivariate survival modelling: a unified approach with copulas

Multivariate survival modelling: a unified approach with copulas Multivariate survival modelling: a unified approach with copulas P. Georges, A-G. Lamy, E. Nicolas, G. Quibel & T. Roncalli Groupe de Recherche Opérationnelle Crédit Lyonnais France May 28, 2001 Abstract

More information

On consistency of Kendall s tau under censoring

On consistency of Kendall s tau under censoring Biometria (28), 95, 4,pp. 997 11 C 28 Biometria Trust Printed in Great Britain doi: 1.193/biomet/asn37 Advance Access publication 17 September 28 On consistency of Kendall s tau under censoring BY DAVID

More information

Efficiency Comparison Between Mean and Log-rank Tests for. Recurrent Event Time Data

Efficiency Comparison Between Mean and Log-rank Tests for. Recurrent Event Time Data Efficiency Comparison Between Mean and Log-rank Tests for Recurrent Event Time Data Wenbin Lu Department of Statistics, North Carolina State University, Raleigh, NC 27695 Email: lu@stat.ncsu.edu Summary.

More information

Efficiency of Profile/Partial Likelihood in the Cox Model

Efficiency of Profile/Partial Likelihood in the Cox Model Efficiency of Profile/Partial Likelihood in the Cox Model Yuichi Hirose School of Mathematics, Statistics and Operations Research, Victoria University of Wellington, New Zealand Summary. This paper shows

More information

Accounting for extreme-value dependence in multivariate data

Accounting for extreme-value dependence in multivariate data Accounting for extreme-value dependence in multivariate data 38th ASTIN Colloquium Manchester, July 15, 2008 Outline 1. Dependence modeling through copulas 2. Rank-based inference 3. Extreme-value dependence

More information

On the Breslow estimator

On the Breslow estimator Lifetime Data Anal (27) 13:471 48 DOI 1.17/s1985-7-948-y On the Breslow estimator D. Y. Lin Received: 5 April 27 / Accepted: 16 July 27 / Published online: 2 September 27 Springer Science+Business Media,

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH Jian-Jian Ren 1 and Mai Zhou 2 University of Central Florida and University of Kentucky Abstract: For the regression parameter

More information

A Goodness-of-fit Test for Copulas

A Goodness-of-fit Test for Copulas A Goodness-of-fit Test for Copulas Artem Prokhorov August 2008 Abstract A new goodness-of-fit test for copulas is proposed. It is based on restrictions on certain elements of the information matrix and

More information

Parameter addition to a family of multivariate exponential and weibull distribution

Parameter addition to a family of multivariate exponential and weibull distribution ISSN: 2455-216X Impact Factor: RJIF 5.12 www.allnationaljournal.com Volume 4; Issue 3; September 2018; Page No. 31-38 Parameter addition to a family of multivariate exponential and weibull distribution

More information

EMPIRICAL ENVELOPE MLE AND LR TESTS. Mai Zhou University of Kentucky

EMPIRICAL ENVELOPE MLE AND LR TESTS. Mai Zhou University of Kentucky EMPIRICAL ENVELOPE MLE AND LR TESTS Mai Zhou University of Kentucky Summary We study in this paper some nonparametric inference problems where the nonparametric maximum likelihood estimator (NPMLE) are

More information

Simulation of Tail Dependence in Cot-copula

Simulation of Tail Dependence in Cot-copula Int Statistical Inst: Proc 58th World Statistical Congress, 0, Dublin (Session CPS08) p477 Simulation of Tail Dependence in Cot-copula Pirmoradian, Azam Institute of Mathematical Sciences, Faculty of Science,

More information

CHAPTER 1 A MAINTENANCE MODEL FOR COMPONENTS EXPOSED TO SEVERAL FAILURE MECHANISMS AND IMPERFECT REPAIR

CHAPTER 1 A MAINTENANCE MODEL FOR COMPONENTS EXPOSED TO SEVERAL FAILURE MECHANISMS AND IMPERFECT REPAIR CHAPTER 1 A MAINTENANCE MODEL FOR COMPONENTS EXPOSED TO SEVERAL FAILURE MECHANISMS AND IMPERFECT REPAIR Helge Langseth and Bo Henry Lindqvist Department of Mathematical Sciences Norwegian University of

More information

Composite likelihood and two-stage estimation in family studies

Composite likelihood and two-stage estimation in family studies Biostatistics (2004), 5, 1,pp. 15 30 Printed in Great Britain Composite likelihood and two-stage estimation in family studies ELISABETH WREFORD ANDERSEN The Danish Epidemiology Science Centre, Statens

More information

Semi-Competing Risks on A Trivariate Weibull Survival Model

Semi-Competing Risks on A Trivariate Weibull Survival Model Semi-Competing Risks on A Trivariate Weibull Survival Model Cheng K. Lee Department of Targeting Modeling Insight & Innovation Marketing Division Wachovia Corporation Charlotte NC 28244 Jenq-Daw Lee Graduate

More information

A semiparametrie estimation procedure of dependence parameters in multivariate families of distributions

A semiparametrie estimation procedure of dependence parameters in multivariate families of distributions Biometrika (1995), 82, 3, pp. 543-52 Primed in Great Britain A semiparametrie estimation procedure of dependence parameters in multivariate families of distributions BY C. GENEST, K. GHOUDI AND L.-P. RIVEST

More information

A Regression Model for the Copula Graphic Estimator

A Regression Model for the Copula Graphic Estimator Discussion Papers in Economics Discussion Paper No. 11/04 A Regression Model for the Copula Graphic Estimator S.M.S. Lo and R.A. Wilke April 2011 2011 DP 11/04 A Regression Model for the Copula Graphic

More information

MAS3301 / MAS8311 Biostatistics Part II: Survival

MAS3301 / MAS8311 Biostatistics Part II: Survival MAS3301 / MAS8311 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-10 1 13 The Cox proportional hazards model 13.1 Introduction In the

More information

Product-limit estimators of the survival function with left or right censored data

Product-limit estimators of the survival function with left or right censored data Product-limit estimators of the survival function with left or right censored data 1 CREST-ENSAI Campus de Ker-Lann Rue Blaise Pascal - BP 37203 35172 Bruz cedex, France (e-mail: patilea@ensai.fr) 2 Institut

More information

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA Kasun Rathnayake ; A/Prof Jun Ma Department of Statistics Faculty of Science and Engineering Macquarie University

More information

Full likelihood inferences in the Cox model: an empirical likelihood approach

Full likelihood inferences in the Cox model: an empirical likelihood approach Ann Inst Stat Math 2011) 63:1005 1018 DOI 10.1007/s10463-010-0272-y Full likelihood inferences in the Cox model: an empirical likelihood approach Jian-Jian Ren Mai Zhou Received: 22 September 2008 / Revised:

More information

ST745: Survival Analysis: Nonparametric methods

ST745: Survival Analysis: Nonparametric methods ST745: Survival Analysis: Nonparametric methods Eric B. Laber Department of Statistics, North Carolina State University February 5, 2015 The KM estimator is used ubiquitously in medical studies to estimate

More information

On the Comparison of Fisher Information of the Weibull and GE Distributions

On the Comparison of Fisher Information of the Weibull and GE Distributions On the Comparison of Fisher Information of the Weibull and GE Distributions Rameshwar D. Gupta Debasis Kundu Abstract In this paper we consider the Fisher information matrices of the generalized exponential

More information

Tests of independence for censored bivariate failure time data

Tests of independence for censored bivariate failure time data Tests of independence for censored bivariate failure time data Abstract Bivariate failure time data is widely used in survival analysis, for example, in twins study. This article presents a class of χ

More information

Pairwise dependence diagnostics for clustered failure-time data

Pairwise dependence diagnostics for clustered failure-time data Biometrika Advance Access published May 13, 27 Biometrika (27), pp. 1 15 27 Biometrika Trust Printed in Great Britain doi:1.193/biomet/asm24 Pairwise dependence diagnostics for clustered failure-time data

More information

A Goodness-of-fit Test for Semi-parametric Copula Models of Right-Censored Bivariate Survival Times

A Goodness-of-fit Test for Semi-parametric Copula Models of Right-Censored Bivariate Survival Times A Goodness-of-fit Test for Semi-parametric Copula Models of Right-Censored Bivariate Survival Times by Moyan Mei B.Sc. (Honors), Dalhousie University, 2014 Project Submitted in Partial Fulfillment of the

More information

11 Survival Analysis and Empirical Likelihood

11 Survival Analysis and Empirical Likelihood 11 Survival Analysis and Empirical Likelihood The first paper of empirical likelihood is actually about confidence intervals with the Kaplan-Meier estimator (Thomas and Grunkmeier 1979), i.e. deals with

More information

log T = β T Z + ɛ Zi Z(u; β) } dn i (ue βzi ) = 0,

log T = β T Z + ɛ Zi Z(u; β) } dn i (ue βzi ) = 0, Accelerated failure time model: log T = β T Z + ɛ β estimation: solve where S n ( β) = n i=1 { Zi Z(u; β) } dn i (ue βzi ) = 0, Z(u; β) = j Z j Y j (ue βz j) j Y j (ue βz j) How do we show the asymptotics

More information

1 Glivenko-Cantelli type theorems

1 Glivenko-Cantelli type theorems STA79 Lecture Spring Semester Glivenko-Cantelli type theorems Given i.i.d. observations X,..., X n with unknown distribution function F (t, consider the empirical (sample CDF ˆF n (t = I [Xi t]. n Then

More information

Modelling Dropouts by Conditional Distribution, a Copula-Based Approach

Modelling Dropouts by Conditional Distribution, a Copula-Based Approach The 8th Tartu Conference on MULTIVARIATE STATISTICS, The 6th Conference on MULTIVARIATE DISTRIBUTIONS with Fixed Marginals Modelling Dropouts by Conditional Distribution, a Copula-Based Approach Ene Käärik

More information

Inference based on the em algorithm for the competing risk model with masked causes of failure

Inference based on the em algorithm for the competing risk model with masked causes of failure Inference based on the em algorithm for the competing risk model with masked causes of failure By RADU V. CRAIU Department of Statistics, University of Toronto, 100 St. George Street, Toronto Ontario,

More information

A Bayesian Nonparametric Approach to Causal Inference for Semi-competing risks

A Bayesian Nonparametric Approach to Causal Inference for Semi-competing risks A Bayesian Nonparametric Approach to Causal Inference for Semi-competing risks Y. Xu, D. Scharfstein, P. Mueller, M. Daniels Johns Hopkins, Johns Hopkins, UT-Austin, UF JSM 2018, Vancouver 1 What are semi-competing

More information

Likelihood and p-value functions in the composite likelihood context

Likelihood and p-value functions in the composite likelihood context Likelihood and p-value functions in the composite likelihood context D.A.S. Fraser and N. Reid Department of Statistical Sciences University of Toronto November 19, 2016 Abstract The need for combining

More information

Statistical Analysis of Competing Risks With Missing Causes of Failure

Statistical Analysis of Competing Risks With Missing Causes of Failure Proceedings 59th ISI World Statistics Congress, 25-3 August 213, Hong Kong (Session STS9) p.1223 Statistical Analysis of Competing Risks With Missing Causes of Failure Isha Dewan 1,3 and Uttara V. Naik-Nimbalkar

More information

Chapter 2 Inference on Mean Residual Life-Overview

Chapter 2 Inference on Mean Residual Life-Overview Chapter 2 Inference on Mean Residual Life-Overview Statistical inference based on the remaining lifetimes would be intuitively more appealing than the popular hazard function defined as the risk of immediate

More information

A STATISTICAL TEST FOR MONOTONIC AND NON-MONOTONIC TREND IN REPAIRABLE SYSTEMS

A STATISTICAL TEST FOR MONOTONIC AND NON-MONOTONIC TREND IN REPAIRABLE SYSTEMS A STATISTICAL TEST FOR MONOTONIC AND NON-MONOTONIC TREND IN REPAIRABLE SYSTEMS Jan Terje Kvaløy Department of Mathematics and Science, Stavanger University College, P.O. Box 2557 Ullandhaug, N-491 Stavanger,

More information

Survival Distributions, Hazard Functions, Cumulative Hazards

Survival Distributions, Hazard Functions, Cumulative Hazards BIO 244: Unit 1 Survival Distributions, Hazard Functions, Cumulative Hazards 1.1 Definitions: The goals of this unit are to introduce notation, discuss ways of probabilistically describing the distribution

More information

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Jonathan Taylor & Kristin Cobb Statistics 262: Intermediate Biostatistics p.1/?? Overview of today s class Kaplan-Meier Curve

More information

Probability Distributions and Estimation of Ali-Mikhail-Haq Copula

Probability Distributions and Estimation of Ali-Mikhail-Haq Copula Applied Mathematical Sciences, Vol. 4, 2010, no. 14, 657-666 Probability Distributions and Estimation of Ali-Mikhail-Haq Copula Pranesh Kumar Mathematics Department University of Northern British Columbia

More information

THE SINGULARITY OF THE INFORMATION MATRIX OF THE MIXED PROPORTIONAL HAZARD MODEL

THE SINGULARITY OF THE INFORMATION MATRIX OF THE MIXED PROPORTIONAL HAZARD MODEL Econometrica, Vol. 71, No. 5 (September, 2003), 1579 1589 THE SINGULARITY OF THE INFORMATION MATRIX OF THE MIXED PROPORTIONAL HAZARD MODEL BY GEERT RIDDER AND TIEMEN M. WOUTERSEN 1 This paper presents

More information

Semiparametric Regression

Semiparametric Regression Semiparametric Regression Patrick Breheny October 22 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/23 Introduction Over the past few weeks, we ve introduced a variety of regression models under

More information

Conditional Copula Models for Right-Censored Clustered Event Time Data

Conditional Copula Models for Right-Censored Clustered Event Time Data Conditional Copula Models for Right-Censored Clustered Event Time Data arxiv:1606.01385v1 [stat.me] 4 Jun 2016 Candida Geerdens 1, Elif F. Acar 2, and Paul Janssen 1 1 Center for Statistics, Universiteit

More information

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Takeshi Emura and Hisayuki Tsukuma Abstract For testing the regression parameter in multivariate

More information

1 Local Asymptotic Normality of Ranks and Covariates in Transformation Models

1 Local Asymptotic Normality of Ranks and Covariates in Transformation Models Draft: February 17, 1998 1 Local Asymptotic Normality of Ranks and Covariates in Transformation Models P.J. Bickel 1 and Y. Ritov 2 1.1 Introduction Le Cam and Yang (1988) addressed broadly the following

More information

Interval Estimation for Parameters of a Bivariate Time Varying Covariate Model

Interval Estimation for Parameters of a Bivariate Time Varying Covariate Model Pertanika J. Sci. & Technol. 17 (2): 313 323 (2009) ISSN: 0128-7680 Universiti Putra Malaysia Press Interval Estimation for Parameters of a Bivariate Time Varying Covariate Model Jayanthi Arasan Department

More information

Department of Econometrics and Business Statistics

Department of Econometrics and Business Statistics ISSN 1440-771X Australia Department of Econometrics and Business Statistics http://www.buseco.monash.edu.au/depts/ebs/pubs/wpapers/ Semiparametric estimation of the dependence parameter of the error terms

More information

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis STAT 6350 Analysis of Lifetime Data Failure-time Regression Analysis Explanatory Variables for Failure Times Usually explanatory variables explain/predict why some units fail quickly and some units survive

More information

A Bivariate Weibull Regression Model

A Bivariate Weibull Regression Model c Heldermann Verlag Economic Quality Control ISSN 0940-5151 Vol 20 (2005), No. 1, 1 A Bivariate Weibull Regression Model David D. Hanagal Abstract: In this paper, we propose a new bivariate Weibull regression

More information

Quantile Regression for Residual Life and Empirical Likelihood

Quantile Regression for Residual Life and Empirical Likelihood Quantile Regression for Residual Life and Empirical Likelihood Mai Zhou email: mai@ms.uky.edu Department of Statistics, University of Kentucky, Lexington, KY 40506-0027, USA Jong-Hyeon Jeong email: jeong@nsabp.pitt.edu

More information

A measure of radial asymmetry for bivariate copulas based on Sobolev norm

A measure of radial asymmetry for bivariate copulas based on Sobolev norm A measure of radial asymmetry for bivariate copulas based on Sobolev norm Ahmad Alikhani-Vafa Ali Dolati Abstract The modified Sobolev norm is used to construct an index for measuring the degree of radial

More information

Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models

Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models NIH Talk, September 03 Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models Eric Slud, Math Dept, Univ of Maryland Ongoing joint project with Ilia

More information

Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL

Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL The Cox PH model: λ(t Z) = λ 0 (t) exp(β Z). How do we estimate the survival probability, S z (t) = S(t Z) = P (T > t Z), for an individual with covariates

More information

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Anastasios (Butch) Tsiatis Department of Statistics North Carolina State University http://www.stat.ncsu.edu/

More information

Survival Prediction Under Dependent Censoring: A Copula-based Approach

Survival Prediction Under Dependent Censoring: A Copula-based Approach Survival Prediction Under Dependent Censoring: A Copula-based Approach Yi-Hau Chen Institute of Statistical Science, Academia Sinica 2013 AMMS, National Sun Yat-Sen University December 7 2013 Joint work

More information

Goodness-of-Fit Tests With Right-Censored Data by Edsel A. Pe~na Department of Statistics University of South Carolina Colloquium Talk August 31, 2 Research supported by an NIH Grant 1 1. Practical Problem

More information

Accelerated Failure Time Models: A Review

Accelerated Failure Time Models: A Review International Journal of Performability Engineering, Vol. 10, No. 01, 2014, pp.23-29. RAMS Consultants Printed in India Accelerated Failure Time Models: A Review JEAN-FRANÇOIS DUPUY * IRMAR/INSA of Rennes,

More information

PhD course in Advanced survival analysis. One-sample tests. Properties. Idea: (ABGK, sect. V.1.1) Counting process N(t)

PhD course in Advanced survival analysis. One-sample tests. Properties. Idea: (ABGK, sect. V.1.1) Counting process N(t) PhD course in Advanced survival analysis. (ABGK, sect. V.1.1) One-sample tests. Counting process N(t) Non-parametric hypothesis tests. Parametric models. Intensity process λ(t) = α(t)y (t) satisfying Aalen

More information

Consistent Bivariate Distribution

Consistent Bivariate Distribution A Characterization of the Normal Conditional Distributions MATSUNO 79 Therefore, the function ( ) = G( : a/(1 b2)) = N(0, a/(1 b2)) is a solu- tion for the integral equation (10). The constant times of

More information

Continuous Time Survival in Latent Variable Models

Continuous Time Survival in Latent Variable Models Continuous Time Survival in Latent Variable Models Tihomir Asparouhov 1, Katherine Masyn 2, Bengt Muthen 3 Muthen & Muthen 1 University of California, Davis 2 University of California, Los Angeles 3 Abstract

More information

STAT331. Cox s Proportional Hazards Model

STAT331. Cox s Proportional Hazards Model STAT331 Cox s Proportional Hazards Model In this unit we introduce Cox s proportional hazards (Cox s PH) model, give a heuristic development of the partial likelihood function, and discuss adaptations

More information

Meei Pyng Ng 1 and Ray Watson 1

Meei Pyng Ng 1 and Ray Watson 1 Aust N Z J Stat 444), 2002, 467 478 DEALING WITH TIES IN FAILURE TIME DATA Meei Pyng Ng 1 and Ray Watson 1 University of Melbourne Summary In dealing with ties in failure time data the mechanism by which

More information

Comparing Distribution Functions via Empirical Likelihood

Comparing Distribution Functions via Empirical Likelihood Georgia State University ScholarWorks @ Georgia State University Mathematics and Statistics Faculty Publications Department of Mathematics and Statistics 25 Comparing Distribution Functions via Empirical

More information

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints

A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints Noname manuscript No. (will be inserted by the editor) A Recursive Formula for the Kaplan-Meier Estimator with Mean Constraints Mai Zhou Yifan Yang Received: date / Accepted: date Abstract In this note

More information

Two-stage Adaptive Randomization for Delayed Response in Clinical Trials

Two-stage Adaptive Randomization for Delayed Response in Clinical Trials Two-stage Adaptive Randomization for Delayed Response in Clinical Trials Guosheng Yin Department of Statistics and Actuarial Science The University of Hong Kong Joint work with J. Xu PSI and RSS Journal

More information

Tail Dependence of Multivariate Pareto Distributions

Tail Dependence of Multivariate Pareto Distributions !#"%$ & ' ") * +!-,#. /10 243537698:6 ;=@?A BCDBFEHGIBJEHKLB MONQP RS?UTV=XW>YZ=eda gihjlknmcoqprj stmfovuxw yy z {} ~ ƒ }ˆŠ ~Œ~Ž f ˆ ` š œžÿ~ ~Ÿ œ } ƒ œ ˆŠ~ œ

More information

Analysis of Time-to-Event Data: Chapter 4 - Parametric regression models

Analysis of Time-to-Event Data: Chapter 4 - Parametric regression models Analysis of Time-to-Event Data: Chapter 4 - Parametric regression models Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/25 Right censored

More information

Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm

Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm Empirical likelihood ratio with arbitrarily censored/truncated data by EM algorithm Mai Zhou 1 University of Kentucky, Lexington, KY 40506 USA Summary. Empirical likelihood ratio method (Thomas and Grunkmier

More information

Inference for the Positive Stable Laws Based on a Special Quadratic Distance

Inference for the Positive Stable Laws Based on a Special Quadratic Distance Inference for the Positive Stable Laws Based on a Special Quadratic Distance Andrew Luong* and Louis G. Doray** 1 *École d actuariat, Université Laval, Cité Universitaire, Ste-Foy, Québec, Canada G1K 7P4

More information

UNIVERSITÄT POTSDAM Institut für Mathematik

UNIVERSITÄT POTSDAM Institut für Mathematik UNIVERSITÄT POTSDAM Institut für Mathematik Testing the Acceleration Function in Life Time Models Hannelore Liero Matthias Liero Mathematische Statistik und Wahrscheinlichkeitstheorie Universität Potsdam

More information

Multistate models and recurrent event models

Multistate models and recurrent event models Multistate models Multistate models and recurrent event models Patrick Breheny December 10 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/22 Introduction Multistate models In this final lecture,

More information