Score tests for dependent censoring with survival data

Size: px

Start display at page:

Download "Score tests for dependent censoring with survival data"

Elizabeth Barrett
5 years ago
Views:

1 Score tests for dependent censoring with survival data Mériem Saïd, Nadia Ghazzali & Louis-Paul Rivest Département de mathématiques et de statistique, Université Laval, Québec G1K 7P4 Canada Abstract In a standard survival data analysis, the observed time is the minimum of the survival time T and a censoring time independent of T. In this paper, we consider models involving two censoring times U and V. The distribution of U is possibly related with that of T, while the second censoring time V is independent of both T and U. These models involve an Archimedean copula to incorporate a possible dependency between T and U. Score tests for dependent censoring are derived when a parametric model, e.g. Weibull or exponential, is assumed for T. One is fully parametric; it assumes that the marginal distributions of T, U, and V are either exponential or Weibull. The other is semiparametric; no assumptions are made on neither U nor V. The relative efficiencies of these two tests are compared with that of tests involving uncensored data. A numerical example is presented. 1 Introduction A standard assumption underlying most statistical models for the analysis of lifetime data is the independence between survival and censoring times. This assumption is acceptable for administrative censoring associated with the termination of a study. However it is questionable in follow-up studies with self-selected removal where patients leave for causes possibly associated with the study variable. For such withdrawals, Frangakis & Rubin (21) use 1

2 the term dropout censoring. It is convenient to distinguish these two kinds of censoring. Administrative censoring is referred to as being independent while dropout censoring is called dependent. The goal of this paper is to develop score tests to ascertain whether dropout censoring is truly dependent. In a typical study the data set consists of the minimum (X) of survival, dependent censoring and independent censoring times T, U and V, respectively, is observed, with an indicator function δ taking value 1 if X = T, if X = U and -1 if X = V. Tsiatis (1975) showed that the independence between T, U and V is needed for their marginal distributions to be identifiable. This suggests that it might be difficult, not to say impossible, to test the independence between T and U without assuming a parametric model for at least one of their two marginal distributions. Nonparametric tests of independence might be feasible using additional follow-up of patients that either dropped out (see Lee & Wolfe, 1995, 1998, Lee, 1996) or died (see Lin, Robins & Wei, 1996) before the study termination. When only {(X i, δ i )} is observed, Zheng & Klein (1995) acknowledge that the level of dependency between T and U is not estimable from the data. It has to be determined a priori before implementing their survival distribution estimator that accounts for this dependency. Parametric models featuring distributions for T and U and a copula for their association have been considered by Emoto & Matthews (199) and Bhattacharyya (1997). We consider models similar to theirs. The joint survival function S T,U,V (t, u, v) of (T, U, V ) is written in terms of S T (t), S U (u) and S V (v), the marginal survival functions, and a copula C{x, y} for the dependency between T and U (Nelsen, 1999), as S T,U,V (t, u, v) = C {S T (t), S U (u)} S V (v), t, u, v >. (1.1) Our derivations use the Archimedean copulas introduced, in a statistical context, by Genest & McKay (1986) which are defined by C a {y 1, y 2 } = φ 1 a {φ a (y 1 ) + φ a (y 2 )}, (y 1, y 2 ) [, 1] 2, (1.2) where φ a ( ) is a strictly decreasing and convex function defined on (, 1] satisfying φ a (1) =. Parameter a measures the dependency with a = when T and U are independent. Thus at a =, (1.2) gives C {y 1, y 2 } = y 1 y 2 corresponding to φ (t) = ln(t). We construct two kinds of score tests for independence (H : a = ). The first one is fully parametric; each marginal distribution is assumed to 2

3 belong to a parametric family. The second test is semiparametric, a parametric model being assumed only for the distribution of survival time T. The asymptotic normality of these tests is established, and expressions for their respective asymptotic variances are derived. For all the models considered in this paper, the parametric test is shown to have a Pitman efficiency larger than 1 when compared with the semiparametric test. In section 2, we motivate the choice of Archimedean copulas to model the dependence in survival analyses. Likelihoods underlying the score tests, parametric and semiparametric, are then constructed. These two tests are described in sections 3 and 4, respectively. Section 5 is devoted to the study of their relative performances using Pitman efficiencies. A numerical example is presented in section 6. Section 7 provides some conclusions and perspectives. 2 Models and likelihoods This section presents background informations about Archimedean copulas. Two parametric copula families, considered for the construction of score tests, are also introduced. The likelihoods for the parametric and semiparametric score tests are presented. A more detailed presentation of Archimedean copulas is available in Genest & McKay (1986) and Nelsen (1999). 2.1 Dependence framework and Archimedean copula An attractive characteristic of Archimedean copulas is their close relationship to frailty models, as shown in Oakes (1989). Assuming that there exists a latent unobservable variable, say Y, such that the joint survival function of (T, U), conditioned by Y = y, is {ST (t) S U (u)}y, where ST (t) and S U (u) denote baseline survival functions, then the unconditional survival function of (T, U) is C {S T (t), S U (u)}, where C(x, y) is an Archimedean copula. To study independence between survival and censoring, one needs to distinguish between net and crude hazard rates, say λ T (t) and λ # T (t), defined as 1 λ T (t) = lim h h λ # T (t) = lim h 1 h {t T < t + h T t}, {t T < t + h T t, U t}. 3

4 Rivest & Wells (21) showed that when the joint survival distribution of T and U is given by φ 1 a {φ a(s T (t)) + φ a (S U (u))}, the net and crude hazard rates are proportional as stated formally in the next proposition. Proposition 2.1 Let π a (t) = φ 1 a {φ a(s T (t)) + φ a (S U (t))} be the survival function of min(t, U) and ψ a (t) = tφ a(t); then λ # at (t) = λ T (t) ψ a [S T (t)] ψ a [π a (t)]. There exists a similar relation for net and crude hazard rates associated with U, say λ U (u) and λ # au (u). The next section reviews some basic properties for the two Archimedean copula families used in this paper. 2.2 Two families of Archimedean copulas Example 2.1 The Clayton family is generated by φ a (t) = 1 a { t a 1 }, a [ 1, ) \ {}. Independence corresponds to a. For this family, { C a {S T (t), S U (u)} = max, { [S T (t)] a + [S U (u)] a 1 } } 1/a (2.1) and When a is close to λ # at (t) = λ T (t) λ # at (t) λ T (t)s U (t) a. { } a ST (t). (2.2) π a (t) A dependency à la Clayton adds a decreasing component to the risk function when a is positive. The following results, derived from (2.1), are used in the derivations presented later, ln π a (x) a 2 ln π a (x) a 2 2 ln π a (x) a θ a ln S T (x) ln S U (x), a ln S T (x) ln S U (x) {ln S T (x) + ln S U (x)}, a [S T (x)] 1 Ṡ T (x) ln S U (x) + [S U (x)] 1 Ṡ U (x) ln S T (x), 4

5 where θ denotes the vector of marginal parameters, and a point denotes a derivative with respect to θ. Example 2.2 The Ali-Mikhail-Haq family is generated by φ a (t) = 1 { } 1 + a(t 1) 1 a ln, a [ 1, 1), t and can be used to model a limited dependence; independence corresponds to a =. For this family, C a {S T (t), S U (u)} = S T (t)s U (u) 1 a [1 S T (t)] [1 S U (u)] (2.3) and λ # at (t) = λ T (t) { } 1 + a [πa (t) 1]. (2.4) 1 + a [S T (t) 1] When a is close to, λ # at (t) λ T (t)[1 as T (t){1 S U (t)}]. A Ali-Mikhail-Haq dependency multiplies the risk function by a bathtub shape function equal to 1 when t is either small or large. The following results, derived from (2.3), are used in the sequel ln π a (x) a = {1 S T (x)} {1 S U (x)}, a= 2 ln π a (x) a 2 = {1 S T (x)} 2 {1 S U (x)} 2, a= 2 ln π a (x) a θ = ṠT (x) {1 S U (x)} ṠU(x) {1 S T (x)}. a= This copula is closely related to Frank s (Genest, 1987): these two copulas have the same behavior in the neighborhood of independence. The score test for independence for Frank s family is the same as that for Ali-Mikhail-Haq; thus all the results derived in this paper for Ali-Mikhail-Haq s copula also apply to Frank s copula. 5

6 2.3 Likelihoods This section constructs two likelihoods for inference on dependence parameter a using the sample {(x i, δ i ); i = 1,..., n}, where x i = min(t i, U i, V i ) and δ i takes the value 1 if T i is observed, if x i = U i and -1 if x i = V i Fully parametric model The complete likelihood can be written in terms of the joint survival function S T,U,V (t, u, v) as L(a, θ) = { S } { T,U,V (t, x i, x i ) t S } T,U,V (x i, u, x i ) δ i =1 t=xi u δ i = u=xi { S } T,U,V (x i, x i, v). v δ i = 1 v=xi For model (1.1), proposition 2.1 yields t S T,U,V (t, x i, x i ) = λ # at (x i) π a (x i ) S V (x i ). t=xi The two equations above yield L(a, θ) = δ i =1 λ # at (x i) δ i = λ # au (x i) δ i = 1 λ V (x i ) Define the crude survival functions, for W = T or W = U, as t } S # aw { (t) = exp λ # aw (s) ds. n π a (x i )S V (x i ). (2.5) Now π a ( ) can be expressed as the product of the crude survival functions, π a (x) = S # at (x) S# au (x). Using this relation in (2.5) yields an alternative form for the likelihood, { } { } n n L(a, θ) = λ # at (x i) S # at (x i) λ # au (x i) S # au (x i) δ i =1 { δ i = 1 λ V (x i ) δ i = } n S V (x i ). (2.6) 6

7 Expressions (2.5) and (2.6) for the likelihood do not rely on the assumption that the copula is Archimedean. The likelihood for the parameters of (1.1) has this form even when the dependency between T and U is modeled using a non-archimedean copula Semiparametric model When nothing is known about the marginal distributions of U and V, inference about the dependence parameter a can be carried out with a pseudolikelihood, comprising only the first term of (2.6). This first term depends on π a ( ). The pseudo-likelihood is constructed by replacing π a ( ) by a nonparametric estimator, say ˆπ( ). This pseudo-likelihood is then given by L (a, θ T ) = δ i =1 λ at (x i ) n SaT (x i ), (2.7) where θ T is the vector of parameters for the marginal distribution of T, λ at (t) = λ T (t) ψ a [S T (t)] ψ a [ˆπ(t)], (2.8) and { t } SaT (t) = exp λ at (s) ds. Observe that (2.7) is a pseudo-likelihood that can be used to obtain consistent estimators of a and of the marginal parameters for T. This is however not pursued here; we use this pseudo-likelihood in section 4 to derive tests of independence. In the derivation of Section 4, ˆπ( ) needs to be a left continuous function. It can be set equal to a left continuous Kaplan-Meier estimator for the survival function of min(t, U) calculated using the sample {(x i, δi ) : i = 1,..., n}, where δi = 1 if either δ i = 1 or δ i =. When only dependent censoring is observed, this estimator reduces to the empirical estimator ˆπ(x) = (1/n) n 11 [x i x]. 3 Parametric tests In this section, we construct score tests for independence, i.e. for H : a =, using expression (2.5) for the likelihood. The marginals for T, U and 7

8 V are assumed to be known up to a parameter θ. Two marginal models, the exponential and the Weibull, are used in the derivation. The parametrization for each one is given in Table 1. For the Weibull model, the parameter β is assumed to be estimated independently for T, U, and V, however the calculations are carried out assuming, as stated in Table 1, that β is the same for these three distributions. Our primary goal, which is to compare the Pitman efficiencies of tests for H : a = for several marginal models for T, explains this somewhat artificial treatment for the Weibull distribution. Table 1: Marginal models Model λ T (t) λ U (t) λ V (t) Exponential γ T γ U γ V Weibull γ T β t β 1 γ U β t β 1 γ V β t β 1 General results about the construction of score tests are first reviewed. The results of the derivations for Clayton and Ali-Mikhail-Haq copulas and for the two marginal models of Table 1 are then presented. 3.1 General results In a parametric framework, with a finite-dimensional parameter, a score test for H is constructed using the statistic U P (, θ) = a a=,θ= θ l(a, θ), (3.1) where l(a, θ) = ln L(a, θ) is the log-likelihood for (2.5) and θ denotes the estimator of marginal parameter θ under H ; index P stands for parametric test. From (2.6), neglecting terms that do not depend on a, the log-likelihood l(a, θ) is equal to ln λ # at (x i) + ln λ # au (x i) + δ i =1 δ i = ln π a (x i ), where λ # at (x i), λ # au (x i), and π a (x i ) depend on the models for the marginals and for the copula. Let T P = n 1 U P (, θ) denote the normalized score statistic. 8

9 Classical asymptotic results imply that score statistic T P has a limiting normal distribution as the sample size n goes to (see Cox & Hinkley, 1979, ch. 9). To complete the derivation, we have now to estimate its variance. Let I(a, θ) be the observed information matrix, partitioned as [ ] 2 l(a,θ) 2 l(a,θ) [ ] I(a, θ) = a 2 Iaa (a, θ) I a θ = aθ (a, θ), (3.2) I θa (a, θ) I θθ (a, θ) 2 l(a,θ) θ a 2 l(a,θ) θ 2 and let I(, θ) be this matrix evaluated under H. To evaluate the variance of (3.1), we use the Fisher s information matrix I(, θ) =,θ {I(, θ)}. Partitioning I(a, θ) as (3.2), the variance of n 1/2 T P is estimated by σ 2 P = n 1 { I aa (, θ) I aθ (, θ) I 1 θθ (, θ) I θa(, θ) }, (3.3) evaluated at θ = θ. The null hypothesis is rejected at level α if n 1/2 T P / ˆσ P 2 is larger than z 1 α/2, the 1α/2 percentile of the normal distribution. From expression (2.6) for the likelihood, it is clear that I θθ (, θ) depends only on marginal parameters. Note also that this matrix is block diagonal, with blocks I θt θ T (, θ), I θu θ U (, θ) and I θv θ V (, θ), corresponding to components of the null log-likelihood associated with the marginal distributions of T, U and V, respectively. When the marginal distribution for T is exponential, θ T = γ T while θ T = (β, γ T ) for a Weibull margin. The parameters θ U and θ V are defined in a similar way. The inverse I 1 θθ (, θ) is also block diagonal. The next proposition gives I 1 θ T θ T (, θ) for the parameters of the distribution of T in table 1. The blocks for U and V are deduced in a similar way. The proof is technical; a sketch is provided in Appendix A. Proposition 3.1 Let T, U and V be independent. (i) Let T, U and V exponentially distributed, with hazard rate γ T, γ U and γ V, respectively; then n I 1 θ T θ T (, θ) = γ T (γ T + γ U + γ V ). (ii) Let T, U and V Weibull distributed, with common shape parameter β and scale parameters γ T, γ U and γ V, respectively. Let c.5772 denote Euler s constant and b = 1 c ln γ where γ = γ T + γ U + γ V ; then ] [ n I 1 θ T θ T (, θ) = 6 γ β 2 γ T π 2 b β ( b β γ T π b2 ). 9

10 3.2 Score test for Clayton family This section shows how to carry out the score test for Clayton s copula. All the derivations are presented in the appendix. Example 2.1 (continued) For any marginal model, statistic (3.1) is U P (, θ) = ln S T (x i ) + ln S U (x i ) + ln S T (x i ) ln S U (x i ). δ i = δ i =1 For exponential margins, the variance is and for the Weibull model σ 2 P = γ T γ U (γ T + γ U ) (γ T + γ U + γ V ) 3, σ 2 P = γ T γ U (γ T + γ U ) (γ T + γ U + γ V ) 3 {1 6π 2 }. 3.3 Score test for Ali-Mikhail-Haq family This section shows how to carry out the score test for Ali-Mikhail-Haq s copula. All the derivations are presented in the appendix. Example 2.2 (continued) For any marginal model, the statistic (3.1) is U P (, θ) = {1 S T (x i )} {1 S U (x i )} δ i = δ i =1 + 2 {1 S T (x i )} {1 S U (x i )} δ i = 1 {1 S T (x i )} {1 S U (x i )}. For this copula the variance formulas are more complex than for Clayton s. They are expressed as functions k 1 ( ) and k 2 ( ) of the marginal parameters defined by k 1 (γ T, γ U, γ V ) = 2 γ T γu 2 (2 γ T + γ)(2 γ T + γ U + γ)(2 γ T + 2 γ U + γ) γ T γu 2 γ (3.4) (γ T + γ) 2 (γ T + γ U + γ) 2 1

11 and, k 2 (γ T, γ U, γ V ) = k 1 (γ T, γ U, γ V ) 6 γ γ { T ln [γ/(γt + γ)] ln [γ/(γ } 2 T + γ U + γ)], (3.5) π 2 γ T + γ γ T + γ U + γ where γ = γ T + γ U + γ V. Lengthy derivations sketched in appendix yield the following variance formula, σ 2 P = k 1(γ T, γ U, γ V ) + k 1 (γ U, γ T, γ V ), for exponential margins, and for the Weibull model. σ 2 P = k 2(γ T, γ U, γ V ) + k 2 (γ U, γ T, γ V ) 4 Semiparametric tests In this section, we construct score tests for independence using the pseudolikelihood (2.7), assuming that the marginal distribution for T is in a parametric family. These tests do not assume any parametric model for censoring times U and V. 4.1 General results Let l (a, θ T ) = ln L (a, θ T ) be the pseudo log-likelihood for (2.7). The semiparametric test is constructed using the statistic U SP (, θ T ) = a l (a, θ T ), (4.1) a=,θt = θ T where θ T denotes the estimator of θ T under H. To study the asymptotic properties of this statistic, we write the pseudo log-likelihood using counting processes N(t) = n 11 [X i t,δ i =1] and Ȳ (t) = n 11 [X i t] as l (a, θ T ) = τ ln λ at (t) d N(t) 11 τ λ at (t) Ȳ (t) dt, (4.2)

12 where τ = inf { t : Ȳ (t) = } and λ at (t) is given by (2.8). Let and H a,θ T (t) = ln λ at (t) a M a,θt (t) = N(t) t be the martingale associated with counting process Harrington, 1991, Theorem 1.3.1). Using (4.2)-(4.4), statistic (4.1) becomes U SP (, θ T ) = τ (4.3) λ # at (s) Ȳ (s) ds (4.4) N(t) (see Fleming & H, θ T (t) d M, θt (t). (4.5) Note that M, θt (t), i.e. Ma,θT (t) evaluated at a = and θ T = θ T, is not a martingale. This fact motivates the proof of the following proposition which is presented in the appendix. Proposition 4.1 Let T SP (, θ T ) = n 1 U SP (, θ T ), S X ( ) be the survival function of X = min(t, U, V ), and I θt θ T (, θ) be the Fisher s information matrix associated with θ T under H. Suppose that H,θ T (t) satisfies the assumptions of Rebolledo s theorem (Andersen et al., 1993, p. 83). Then, under H : a =, n 1/2 T SP (, θ d T ) N(, σsp 2 ), where where and σsp 2 = [G,θT (t)] 2 S X (t) λ T (t) dt, (4.6) G,θT (t) = H,θT (t) A [I θt θ T (, θ)] 1 λ T (t) λ T (t), (4.7) A = where λ T (t) = λ T (t)/ θ T. H,θT (t) S X (t) λ T (t) dt, (4.8) This semiparametric test is easily implemented for the two copulas considered in this work, for any marginal model for T. For Clayton s, from (4.3) and (2.2), H,θ T (t) = ln ˆπ(t) ln S T (t), (4.9) 12

13 while for Ali-Mikhail-Haq s, (4.3) and (2.4) yield H,θ T (t) = ˆπ(t) S T (t). (4.1) We estimate σ 2 SP nonparametrically by ˆσ SP 2 = 1 { 2 G n, θ T (x i )}, (4.11) δ i =1 where G (t) is obtained from G, θ,θt (t), replacing π(t) by ˆπ(t) and θ T T θ T, and Â = 1 1 H n, θ T (x i ) [ λt (x i )] λt (x i ) δ i =1 For the model with exponential margin, the observed information matrix I θt θ T (, θ) and the estimated covariance matrix of the score function coincide, so we obtain in this case ˆσ SP 2 = 1 n δ i =1 { } 2 H 1, θ T (x i ) D T { } 2 H, θ T (x i ) δ i =1 by (4.12) To calculate relative efficiencies of this test with respect to the fully parametric tests of Section 3, one needs to evaluate the variance σsp 2 for the parametric models of Section 3. This is done next. 4.2 Evaluation of σsp 2 for the copulas of Clayton and of Ali-Mikhail-Haq Example 2.1 (continued) For Clayton s family with any specified marginal distributions, (4.9) yields H,θT (t) = ln S U (t). Using the expressions for I 1 θ T θ T (, θ) given in proposition 3.1 and evaluating (4.6)-(4.8), with exponential margins for T, U and V as defined in table 1, yields σ 2 SP = γ T γ 2 U (γ T + γ U + γ V ) 3. When T, U and V have the Weibull margins given in table 1, similar manipulations give σsp 2 = γ T γu {1 2 6π }. (γ T + γ U + γ V )

14 Example 2.2 (continued) For the Ali-Mikhail-Haq family with any specified marginal distributions, (4.1) yields H,θT = S U (t){1 S T (t)}. Along the lines of the derivations for Clayton s family, we prove that, when T, U and V have exponential margins, σ 2 SP = k 1(γ T, γ U, γ V ); with Weibull margins for T, U and V, σ 2 SP = k 2(γ T, γ U, γ V ), where k 1 ( ) and k 2 ( ) are defined by (3.4) and (3.5), respectively. Calculations are more tedious than those for Clayton s copula. They are similar to those underlying the derivation of the parametric tests. 5 Relative efficiencies 5.1 General results In this section, we compare fully parametric and semiparametric tests of independence, using Pitman efficiencies. Consider two test statistics for H : a =, say T 1 and T 2, and suppose that the limiting distribution of n 1/2 (T j µ j (a n ))/σ j (a n ) is, for j = 1, 2, a standard normal distribution, where a n is O(n 1/2 ) and µ j (a n ) and σj 2(a n)/n are the asymptotic mean and variance of T j when the dependence parameter is a n. The relative efficiency or Pitman efficiency of T 1 with respect to T 2 for testing H : a = is defined by ( ) µ 2 E(T 1, T 2 ) = 1 ()/σ 1 () µ 2 ()/σ. 2() A detailed discussion of the conditions for Pitman s efficiency to be well defined is given in van der Vaart (2, p ). The efficiency E(T 1, T 2 ) gives the ratio of the sample sizes n 2 /n 1 needed for the tests based on T 1 and T 2 to achieve asymptotically the same local power. An efficiency larger than 1 means that the test based on T 1 is locally more powerful than the one based on T 2. To calculate the efficiencies of score tests, it is convenient to consider estimators â of the dependence parameter obtained by solving U P (a, θ) =. 14

15 When a = and as the sample size goes to, the asymptotic distribution of â is normal with mean and variance given by the (a, a) entry of I(, θ) 1. Standard results on the inverses of partitioned matrices can be used to demonstrate that Var(n 1/2 â) = n{i aa (, θ) I aθ (, θ) I 1 θθ (, θ) I θa(, θ)} 1 = 1/σ 2 P (). According to Kendall & Stuart (1979, ch. 25), the above variance satisfies 1/σP 2 () = [σ P ()/µ P ()/]2 ; in other words µ P () = σ P () 2. Therefore the Pitman efficiency of one score test with respect to another is equal to the ratio of their asymptotic variances. For the semiparametric test, µ SP () is evaluated in appendix C and is shown to satisfy µ SP () = σ2 SP (). This implies that the efficiency of the parametric test with respect to the semiparametric test is given by E(T P, T SP ) = σp 2 /σsp 2, where σp 2 and σ2 SP are given by (3.3) and (4.6), respectively. In the next section, we recall some properties of the standard score tests for independence, constructed with uncensored samples of bivariate observations for (T, U). Pitman efficiencies of the parametric tests for censored sample with respect to those for uncensored data provide information about the loss of information due to censoring. 5.2 Score tests of independence for uncensored data Let T C be a normalized score statistic to test for independence in an uncensored data set (index C stands for complete samples). The score function for a is independent of the score function for the marginal parameters when a = (see Genest, Ghoudi & Rivest, 1995). Thus functions µ C (a) and σc 2 (a) needed to calculate Pitman efficiencies do not depend on the margins for T and U. For score statistic T C, the (a, θ) submatrix of the Fisher information matrix for uncensored samples is null in (3.3). Thus the asymptotic variance of nt C is the (a, a) term of the above Fisher information matrix. In other words, σc 2 = 1 { } 2 l C (a) n a 2, a= where l C (a) = n ln f a(y 1i, y 2i ), {(y 1i, y 2i ); i = 1,..., n} is a bivariate sample of independent random variables uniformly distributed on [, 1], and 15

16 f a (y 1, y 2 ) is the density for copula C a defined in (1.2), f a (y 1, y 2 ) = φ a [C a (y 1, y 2 )] φ a(y 1 ) φ a(y 2 ) {φ a [C a(y 1, y 2 )]} 3. The arguments of Kendall & Stuart (1979, ch. 25) mentioned above, to prove µ P () = σ2 P, can be used here to obtain µ C () = σ2 C. Example 2.1 (continued) For the Clayton family, the joint density is f a (y 1, y 2 ) = (1 + a) y a 1 1 y a 1 2 {C a (y 1, y 2 )} 2 a+1 ; using results of section 2.2, we prove that 1 n 2 l C (a) a 2 and finally, taking expectation under H, a 1 ln y 1 ln y 2 {4 + ln y 1 + ln y 2 }, σ 2 C = 1. Example 2.2 (continued) For the Ali-Mikhail-Haq family, the joint density is f a (y 1, y 2 ) = C a(y 1, y 2 ) {1 + a [C a (y 1, y 2 ) 1]} {1 + a [2 C a (y 1, y 2 ) 1]} ; y 1 y 2 {1 + a(y 1 1)} {1 + a(y 2 1)} using results of section 2.2 yields 1 2 l C (a) n a 2 = 6 y 1 y 2 (1 y 1 )(1 y 2 ) (1 y 1 ) 2 (1 y 2 ) 2 a= and finally (1 y 1 ) 2 (1 y 2 ) 2 + (y 1 y 2 1) 2 + (2 y 1 y 2 1) 2, σ 2 C = 1/9. In the next section, we give some numerical results about E(T P, T SP ) and E(T C, T P ) for Clayton s and Ali-Mikhail-Haq s families. 16

17 5.3 Calculations of relative efficiencies The efficiencies have the following simple form, E(T P, T SP ) = γ T + γ U γ U for both exponential and Weibull margins under Clayton s model. For Ali- Mikhail-Haq s they are written in terms of the functions k 1 ( ) and k 2 ( ) (see (3.4)-(3.5)) as E(T P, T SP ) = 1 + k 1(γ U, γ T, γ V ) k 1 (γ T, γ U, γ V ) and E(T P, T SP ) = 1 + k 2(γ U, γ T, γ V ) k 2 (γ T, γ U, γ V ), for the exponential and Weibull models respectively. Observe that in all cases E(T P, T SP ) > 1, as expected. The semiparametric test relies on fewer assumptions; this results in a loss of power. Also when γ T = γ U, E(T P, T SP ) = 2. These efficiencies depend only on the expected proportions for the three possible outcomes, say γ T /γ, γ U /γ and γ V /γ. Table 2 and 3 give E(T P, T SP ) and E(T C, T P ) for Clayton s and Ali-Mikhail-Haq s copula for several patterns of outcomes. Table 2: Relative efficiencies for the Clayton family Exponential margins Weibull margins γ T /γ γ U /γ γ V /γ E(T P, T SP ) E(T C, T P ) E(T P, T SP ) E(T C, T P ) In table 2, when γ V /γ = (as defined in section 3, γ = γ T + γ U + γ V ), E(T P, T SP ) decreases as the proportion of dependent censoring γ U /γ increases, and E(T C, T P ) is minimal when γ T /γ = γ U /γ. When the proportion γ T /γ is fixed, E(T P, T SP ) and E(T C, T P ) decrease as γ U /γ increases. Comparing E(T C, T P ) for exponential and Weibull models shows that, the additional 17

18 Table 3: Relative efficiencies for the Ali-Mikhail-Haq family Exponential margins Weibull margins γ T /γ γ U /γ γ V /γ E(T P, T SP ) E(T C, T P ) E(T P, T SP ) E(T C, T P ) parameter of the Weibull margin requires an increase of 1{6/(π 2 6)} = 155% in the sample size to reach the local power obtained with exponential margins. In table 3, the loss of efficiency with respect to tests for complete samples is much more important than in table 2. A possible explanation for this is that under Ali-Mikhail-Haq s model, when a is small, λ # at (t) λ T (t){1 as T (t)[1 S U (t)]}. Thus the crude and net hazard are equal for small and large values of t. They differ only for medium values of t. Discriminating between λ # at (t) and λ T (t) is therefore more difficult than for Clayton s copula for which λ # at (t)/λ T (t) decreases in t when a is small. Remark that E(T P, T SP ) is greater for exponential margins as compared to Weibull s. When γ V /γ =, E(T P, T SP ) decreases as γ U /γ increases; a similar behaviour was noted for Clayton s family. For Weibull margins, E(T C, T P ) is minimal when γ T /γ = γ U /γ. However, for exponential margins, γ T /γ = γ U /γ corresponds in fact to a local maximum for E(T C, T P ); there are two local minima at γ U /γ.24,.76. When the proportion γ T /γ is fixed, E(T P, T SP ) and E(T C, T P ) decrease as γ U /γ increases. Comparing E(T C, T P ) for exponential and Weibull models shows that, for Weibull margins, much more data are needed to reach a given level of local power. 6 A numerical example This section considers the survival times, in weeks, of 61 patients with unoperable lung cancer treated with cyclophosphamide considered in Lagakos 18

19 & Williams (1978) and in Lee & Wolfe (1998). The data set is:.14 +,.14 +,.29 +,.43 +,.43,.57 +,.57 +, , 2.86, 3. +, 3. +, 3.14, 3.14, , , 3.43, 3.43, 3.71, 3.86, 6. +, 6. +, , 6.14, 6.86, , 9., 9.43, , 1.71, 1.86, 11.14, , 13., 14.43, , 15.71, , , 18.43, 18.57, , 2.71, , , 26. +, , 29.14, 29.71, , , 4.57, , 48.57, 49.43, 53.86, 61.86, 66.57, 68.71, 68.96, 72.86, The exponent + identifies the 28 censored observations. They correspond to patients whose treatment was terminated because of a deteriorating condition. Following Lagakos & Williams (1978), we fit exponential distributions to both the survival and the censoring distributions. The parameter estimates are γ T =.276, (s.e. =.48) and γ U =.234, (s.e. =.44). This section investigates a possible dependency between these two variables. All the results of the previous sections apply; the lack of independent censoring simplifies the calculations. The form of the statistic U P (, θ) and of the estimated variance ˆσ P 2 for the parametric tests are easily deduced from the results of subsections 3.2 and 3.3. For the semiparametric tests with exponential margin, the estimator of σsp 2 is given by (4.12), but some technical manipulations are needed to evaluate U SP (, θ T ) from (4.1). From (4.5), we obtain U SP (, θ T ) = δ i =1 H, θ T (x i ) γ T τ H, θ T (t) Ȳ (t) dt, (6.1) where H (t) = f[ˆπ(t)] f[exp( γ, θ T t)], with f(y) = log y for Clayton s T copula, and f(y) = y for Ali-Mikhail-Haq s copula. Assume that all the X i s are distinct, and let = X () < X (1) < < X (n) denote the corresponding order statistics. Since ˆπ(x) = [n (i 1)]/n, x (X (i 1), X (i) ], the following holds, τ f[ˆπ(t)] Ȳ (t) dt = = = X(i) f[ˆπ(t)]dt i [ n (j 1) f n j=1 [ n (i 1) (n i + 1)f n ] (X (j) X (j 1) ) ] (X (i) X (i 1) ) 19

20 = n ˆπ(X (i) )f[π(x (i) )](X (i) X (i 1) ) This allows to simplify score statistics (6.1). For Clayton s copula, U SP (, θ T ) = log ˆπ(X i ) + γ T X i δ i =1 n γ T while for Ali-Mikhail-Haq, ˆπ(X (i) ) log[π(x (i) )] ( X (i) X (i 1) ) + γ 2 T 2 x 2 i, U SP (, θ T ) = δ i =1 ˆπ(X i ) exp( γ T X i ) n γ T ˆπ(X (i) ) ( ) 2 X (i) X (i 1) + [1 exp( γ T X i )]. The table below gives the p-values for the four tests considered here. There is a signicant dependency for the parametric test constructed using Ali-Mikhail- Haq s copula. Table 4: P -values of four tests of independence for Lagakos & Williams data Model Parametric test Semiparametric test Clayton Ali-Mikhail-Haq Given the poor showing of the score test for Ali-Mikhail-Haq family in the efficiency comparisons presented in section 5, it is unexpected that this test gives the only significant result. To investigate this further, one can estimate the failure rate of the survival time by fitting a piecewise exponential distribution. This reveals that the hazard rate has a bathtub shape: it is high in the first 2 weeks and after the 6th and relatively low in between. According to the discussion in section 2.2 Ali-Mikhail-Haq copula is more appropriate to model this type of discrepancy than Clayton s. This explains the significant outcome of Table 4. The increase in p-value when one goes to the semiparametric test is in line with the findings of section 5. 2

21 7 Discussion In many instances a positive dependence between T and U is the most interesting alternative to H : a =. One-sided versions of the tests presented here could then be considered. These tests would reject the null hypothesis if the score statistics is too large. This would permit a much needed increase in power for the independence tests. The efficiency comparisons of section 5 suggest that dependent censoring is hard to detect using standard lifetime data; additional follow-ups of censored or dead units might be needed to make such a detection. Testing for a dependency between T and U amounts to testing the validity of the parametric models for the margins. As shown in section 2, a Clayton s dependency multiplies the failure rate by a decreasing function while for Ali-Mikhail-Haq, the multiplicative term has a bathtub shape. Tables 2 and 3, and some simulations not presented here, suggest that a dependency is much more difficult to detect for Ali-Mikhail-Haq s model that for Clayton s. The expressions for λ # at (t) presented in section shed some light on this finding: for Ali-Mikhail-Haq s model λ # at (t) is much closer to a Weibull failure rate that for Clayton s. This highlights that in competing risk models certain types of dependency might be nearly impossible to detect. Still, as noted by Zheng & Klein (1995) and Rivest & Wells (21), once a dependency is detected and its level ascertained the estimator of the marginal survival function does not depend too much on the family of copulas chosen to model the dependency. This paper leaves the door open for future work. Improvements over the semiparametric tests presented in Section 4 could possibly be achieved by using the full score statistics evaluated at the Kaplan-Meier estimator of S U (t). The investigation of the estimators of the parameters of the marginal distribution of T and of the dependence parameter a obtained by maximizing the pseudo-likelihood (2.7) would be of interest. The sampling distribution of the resulting estimators could possibly be derived following the arguments presented in Section 4. Acknowledgements We are grateful to Bas Werker for stimulating discussion on semiparametric estimation. Research funds in partial support of this work were granted by the Natural Sciences and Engineering Research Council of Canada and 21

22 by the Fonds pour la formation de chercheurs et l aide à la recherche du Gouvernement du Québec. Appendix A Proofs for the parametric test Recall that X = min(t, U, V ) and δ indicates what kind of event is observed, say δ = 1 if X = T, δ = if X = U and δ = 1 if X = V. Let D T, D U and D V the respective frequencies of each kind of event among the n units in the sample. The evaluations of the Fisher information matrices needed to construct the score tests rely on the following lemma. Lemma A.1 Let T, U and V be independent. (i) If T, U and V are exponentially distributed, with hazard rate γ T, γ U and γ V, respectively, then X and (D T, D U, D V ) are independent, and (a) X Exponential(γ), where γ = γ T + γ U + γ V ; (b) (D T, D U, D V ) Multinomial(γ T /γ, γ U /γ, γ V /γ). (ii) If T, U and V are Weibull distributed, with common shape parameter β, and respective scale parameter γ T, γ U and γ V, then X and (D T, D U, D V ) are independent and (a) X W eibull(β, γ), where γ = γ T + γ U + γ V ; (b) (D T, D U, D V ) Multinomial(γ T /γ, γ U /γ, γ V /γ); (c) Y = γx β Exponential(1), and ln Y has an extreme value distribution, so {ln Y } = c, where c.5772 is Euler s constant, and Var{ln Y } = π 2 /6 (see Lawless, 1982, section 1.3); in addition, we have {Y ln Y } = 1 c, { Y 2 ln Y } = 3 2c, { Y (ln Y ) 2 } = π2 6 + c(c 2), 22

23 and, for any λ >, { e (λ/γ)y } = { Y e (λ/γ)y } = { Y ln Y e (λ/γ)y } = γ λ + γ, ( ) 2 γ, λ + γ ( ) 2 { ( )} γ γ 1 c + ln. λ + γ λ + γ A.1 Proof of Proposition 3.1 Using the complete likelihood (2.6), at a =, we have the general form I θθ (, θ) = { } 2 ln λ T (x i ) { } 2 ln S T (x i ) θ 2 θ 2 δ i =1 { } 2 ln λ U (x i ) { } 2 ln S U (x i ) θ 2 θ 2 δ i = { } 2 ln λ V (x i ) { } 2 ln S V (x i ). (A.1) θ 2 θ 2 δ i = 1 From (A.1), it is clear that I θθ (, θ) is block diagonal, with blocks I θt θ T (, θ), I θu θ U (, θ) and I θv θ V (, θ). Exponential model The three blocks of I θθ (, γ) are 1 1 matrices. From (A.1), the component for survival time T is I θt θ T (, θ) = D T /γt 2 (see Lawless, 1982, p. 17). Part (i) of proposition 3.1 follows immediately from lemma A.1 (i). Weibull model From (A.1), the entries of the 2 2 observed information matrix for the parameters of survival time T are 2 l(, θ) = D T β 2 β + γ 2 T x β i (ln x i) 2, 2 l(, θ) γ 2 T 2 l(, θ) β γ T = = D T, γ 2 T x β i ln x i. 23

24 Using { lemma A.1 (ii), and noting b = 1 c ln γ (c.5772), we have X β ln X } { = b/(βγ) and X β (ln X) 2} = {π 2 /6 1 + b 2 } /(β 2 γ), and we deduce n 1 I θt θ T (, θ) = 1 γ [ γt β 2 ( π b2 ) Inverting this matrix yields part (ii) of proposition 3.1. b β b β 1 γ T ]. A.2 Derivations underlying tests for Clayton s family From expression (2.5) for the likelihood, and using (2.1)-(2.2), we have U P (a, θ) = ln S T (x i ) ln S U (x i ) + ln π(x i ) δ i =1 δ i = δ i 1 { } ln π(xi ) + + a { } ln π(xi ). (A.2) a a δ i 1 Using the results of section 2.2, example 2.1, completes the calculations of U P (, θ) for Clayton s family. Taking the partial derivatives of (A.2), at a =, and the results in section 2.2 yields, for any marginal model, I aa (, θ) = ln S T (x i ) ln S U (x i ) {ln S T (x i ) + ln S U (x i )} 2 I θa (, θ) = δ i = δ i 1 ln S T (x i ) ln S U (x i ), [S T (x i )] 1 Ṡ T (x i ) δ i =1 [S U (x i )] 1 Ṡ U (x i ) (A.3) [S T (x i )] 1 Ṡ T (x i ) ln S U (x i ) [S U (x i )] 1 Ṡ U (x i ) ln S T (x i ). (A.4) Exponential model Using lemma A.1(i), (A.3)-(A.4) yields n 1 I aa (, θ) = 2 γ T γ U (γ T + γ U ) γ 3 24

25 and ( n 1 γu I aθ (, θ) = γ 2, γ ) T γ,, 2 where γ = γ T +γ U +γ V, and proposition 3.1 (i) yields σp 2 with exponential margins. for Clayton s family Weibull model Using lemma A.1 (ii), we deduce from (A.3)-(A.4) that n 1 I aa (, θ) = 2 γ T γ U (γ T + γ U ) γ 3 and n 1 I aθ (, θ) = 1 γ 2 ( γt γ U (1 + b) β, γ U, γ ) T γ U (1 + b), γ T,,, β where γ = γ T + γ U + γ V and b = 1 c ln γ, and using proposition 3.1 (ii), we deduce σp 2 for Clayton s family with Weibull margins. A.3 Derivations underlying tests for Ali-Mikhail-Haq s family From likelihood (2.5), and using (2.3)-(2.4), we have U P (a, θ) = 1 S T (x i ) 1 + a [S T (x i ) 1] + 1 S U (x i ) 1 + a [S U (x i ) 1] δ i =1 δ i = ( ) ln π(x i ) + + π(x i ) 1 + a π(xi ) a. (A.5) a 1 + a [π(x i ) 1] δ i 1 Using the results of section 2.2, example 2.2, completes the proof of U P (, θ) for the Ali-Mikhail-Haq s family. Expressing the partial derivatives of (A.5), at a =, in terms of 1 S T (x i ) and 1 S U (x i ), and using the results of section 2.2 yields, for any marginal model, I aa (, θ) = δ i = {1 S T (x i )} 2 + δ i =1 {1 S U (x i )} 2 25

26 2 {1 S T (x i )} 2 {1 S U (x i )} 2 + δ i = 1 {1 S T (x i )} 2 {1 S U (x i )} 2, (A.6) I θa (, θ) = Ṡ T (x i ) Ṡ U (x i ) δ i = δ i =1 + 2 {ṠT (x i ) [1 S U (x i )] + ṠU(x i ) [1 S T (x i )]} δ i = 1 {ṠT (x i ) [1 S U (x i )] + ṠU(x i ) [1 S T (x i )]}. (A.7) Exponential model Using lemma A.1 (i), we deduce from (A.6)-(A.7) that n 1 I aa (, θ) = 2 γ T γu 2 (2 γ T + γ)(2 γ T + γ U + γ)(2 γ T + 2 γ U + γ) and + n 1 I aθ (, θ) = 2 γ 2 T γ U (2 γ U + γ)(γ T + 2 γ U + γ)(2 γ T + 2 γ U + γ) ( ) 1 γu γ T + γ U + γ γ T + γ, γ T γ U + γ,, where γ = γ T + γ U + γ V, and using the proposition 3.1 (i), we deduce σ 2 P for Ali-Mikhail-Haq s family with exponential margins. Note that proof I aθ (, θ) is straightforward, however tedious calculations are needed to obtain I aa (, θ). Weibull model Using lemma A.1 (ii), we deduce easily from (A.6)-(A.7) that I aa (, θ) is the same as that for the exponential model. The contribution, I aθt (, θ), of T to I aθ (, θ) = (I aθt (, θ), I aθu (, θ),, ) is } n γ T β { 1 c ln(γt + γ) 1 c ln(γ T + γ U + γ) γ T + γ γ T + γ U + γ γ U (γ T + γ)(γ T + γ U + γ) where γ = γ T + γ U + γ V. Using the proposition 3.1 (ii), we deduce σp 2 for Ali-Mikhail-Haq s family with Weibull margins. Lengthy derivations are required to obtain the first term of I aθt (, θ); calculation of the final variance σp 2 requires additional technical manipulations. 26,

27 B Proof of proposition 4.1 Writing M, θt (t) = M,θT (t) t { λ T (s) λ T (s)}ȳ (s) ds, where M,θT (t) is a martingale under H (see (4.4)), and using first order expansions for H (t) H, θ T,θ T (t) and λ T (s) λ T (s), (4.5) becomes τ T SP (, θ T ) = 1 H,θ n T (t)d M,θT (t) ) 1 τ ( θt θ T H,θ n T (t) Ȳ (t) λ T (t) dt ) 1 τ + ( θt θ T Ḣ,θT (t) d n M,θT (t). (B.1) Under classical regularity conditions (see Andersen et al., 1993, p ), one has n( θ T θ T ) d N (, [I θt θ T (, θ)/n] 1 ); furthermore the strong law of large numbers yields that n 1 τ Ḣ,θT (t) d M,θT (t). Therefore the third term in (B.1) is negligible. Note that Ȳ (t)/n is the empirical estimator of the survival distribution S X ( ) of X = min(t, U, V ). Assuming that H,θ T (t) is uniformly bounded (see (4.3) and (2.8)), one has 1 n τ and (B.1) becomes where under H, H,θ T (t) Ȳ (t) λ T (t) dt A = T SP (, θ T ) 1 n τ H,θT (t) S X (t) λ T (t) dt, H,θ T (t)d M,θT (t) A ( θt θ T ), (B.2) τ θ T θ T = [I θt θ T (, θ)] 1 [λ T (t)] 1 λt (t) d M,θT (t) + R, (B.3) with n 1/2 R. By (B.2)-(B.3), we have now that where T SP (, θ τ T ) n 1 G,θ T (t) d M,θT (t), G,θ T (t) = H,θ T (t) A [I θt θ T (, θ)] 1 λ T (t) λ T (t). (B.4) 27

28 Convergence in law of n 1/2 T SP (, θ T ) results now from the classical martingale limit central theorem (see Andersen et al., 1993, th. II.5.3). The asymptotic variance given in proposition 4.1 comes from the evaluation of {[T SP (, θ } T )] 2, using expression (B.4) (see Fleming & Harrington, 1991, th ). C Evaluation of µ SP () Writing M,θT (t) = M a,θt (t) t {λ T (s) λ # at (s)}ȳ (s) ds, where M a,θt (t) is defined by (4.4), when the value of the dependence parameter is a, one can write (B.4) as n 1 U SP (, θ T ) 1 n 1 n τ τ G,θ T (t) d M a,θt (t) { } G,θ T (t) λ T (t) λ # at (t) Ȳ (t) dt, where M a,θt (t) is a martingale when a is non zero. The expectation of the first term of the right hand side is null. Since Ȳ (t)/n is an estimator for S X (t), the second term yields µ SP () = G,θT (t) H,θT (t) λ T (t) S X (t) dt, where G,θT (t) = H,θT (t) A [I θt θ T (, θ)] 1 λt (t)/λ T (t). above expression yields µ SP () = σ2 SP (). Evaluating the References [1] Andersen, P. K., Borgan, O., Gill, R. D. & Keiding, N. (1993) Statistical Models Based on Counting Processes. Springer-Verlag, New-York. [2] Bhattacharyya, A. (1997) Modeling exponential survival data with dependent censoring. Sankhyā, Series A, Indian Journal of Statistics 59: [3] Cox, D. R. & Hinkley, D. V. (1979) Theoretical Statistics. Chapman & Hall. 28

29 [4] Emoto, S. & Matthews, P. (199) A Weibull model for dependent censoring. The Annals of Statistics 18: [5] Fleming, T. R. & Harrington, D. P. (1991) Counting processes and survival analysis. Wiley. [6] Frangakis, C. E. & Rubin, D. B. (21) Addressing an idiosyncrasy in estimating survival curves using double sampling in the presence of selfselected right censoring. Biometrics 57: [7] Genest, C. (1987) Frank s family of bivariate distributions. Biometrika 74: [8] Genest, C. & McKay, R. (1986) Copules archimédiennes et familles bidimensionnelles dont les marges sont données. The Canadian Journal of Statistics 14: [9] Genest, C., Ghoudi, K. & Rivest, L.-P. (1995) A semiparametric estimation procedure of dependence parameters in multivariate families of distributions. Biometrika 82: [1] Hougaard, P. (2) Analysis of multivariate survival data. Springer. [11] Kendall, M. & Stuart, A. (1979) The advanced theory of statistics, vol. 2. Hafner Publishing Company. [12] Lagakos, S. W. & Williams, J. S. (1978) Models for censored survival analysis: A cone class of variable-sum models. Biometrika 65: [13] Lawless, J. F. (1982) Statistical models and methods for lifetime data. Wiley. [14] Lee, S. & Wolfe, R. A. (1995) A class of linear nonparametric rank tests for dependent censoring. Communications in Statistics, Part A Theory and Methods 24: [15] Lee, S. & Wolfe, R. A. (1998) A simple test for independent censoring under the proportional hazards model. Biometrics 54: [16] Lee, S. (1996) Power calculation for a score test in the dependent censoring model. Statistics in Medicine 15:

30 [17] Lin, D. Y., Robins, J. M. & Wei, L. J. (1996) Comparing two failure time distributions in the presence of dependent censoring. Biometrika 83: [18] Nelsen, R. (1999) An introduction to copula. Lecture Notes in Statistics 139. Springer-Verlag, New-York. [19] Oakes, D. (1989) Bivariate survival models induced by frailties. JASA 84: [2] Rivest, L.-P. & Wells, M. T. (21) A martingale approach to the copulagraphic estimator for the survival function under dependent censoring. Journal of Multivariate Analysis 79: [21] Tsiatis, A. (1975) A nonidentifiability aspect of the problem of competing risks. Proceedings of the National Academy of Sciences of the United States of America. 72: [22] van der Vaart, A. W. (2) Asymptotic statistics. Cambridge University Press. [23] Zheng, M. & Klein, J.(1995) Estimates of marginal survival for dependent competing risks based on an assumed copula. Biometrika 82:

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS

Statistica Sinica 20 (2010), 441-453 GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS Antai Wang Georgetown University Medical Center Abstract: In this paper, we propose two tests for parametric models