Meei Pyng Ng 1 and Ray Watson 1

Size: px
Start display at page:

Download "Meei Pyng Ng 1 and Ray Watson 1"

Transcription

1 Aust N Z J Stat 444), 2002, DEALING WITH TIES IN FAILURE TIME DATA Meei Pyng Ng 1 and Ray Watson 1 University of Melbourne Summary In dealing with ties in failure time data the mechanism by which the data are observed should be considered If the data are discrete, the process is relatively simple and is determined by what is actually observed With continuous data, ties are not supposed to occur, but they do because the data are grouped into intervals even if only rounding intervals) In this case there is actually a non-identifiability problem which can only be resolved by modelling the process Various reasonable modelling assumptions are investigated in this paper They lead to better ways of dealing with ties between observed failure times and censoring times of different individuals The current practice is to assume that the censoring times occur after all the failures with which they are tied Key words: competing risk; discrete failure time; Kaplan Meier estimator; non-identifiability; tied observations 1 Introduction Discrete failure time data occur naturally and frequently The data may be genuinely discrete, such as counting data The failure time may be the number of trials till the occurrence of an event of interest, such as pregnancy in an in-vitro fertilization program IVF) or component failure in an industrial life-testing experiment Continuous data are sometimes modelled as discrete data; see eg Willett & Singer 1993) They can be regarded as continuous data grouped into intervals Indeed, all recorded continuous data are rounded or truncated, and so might be regarded as discrete data The importance of the discreteness is determined by the relative frequency of ties in the data Discrete failure time data have been modelled in the literature; see, for example, Cox 1972), Kalbfleisch & Prentice 1980) or Adams & Watson 1989) However, there has been little discussion on how tied failure times and censoring times should be handled When time is treated as continuous, tied observations supposedly cannot occur and so their occurrence cannot be handled theoretically In practice, tied observations do occur owing to the crudeness of measurements The usual convention, when there are failures and censorings occurring at the same time t, is to shift the censoring times infinitesimally to the right so that they occur after all the failures at time t This is commonly used when computing, for example, a Kaplan Meier estimate for the survival function Kaplan & Meier, 1958) When time is treated as discrete, then ties can occur with non-zero probability, and should be part of the modelling process However, it is usual to apply the convention used for continuous data to discrete data; see, for example, Kalbfleish & Prentice 1980 p 99) or Lawless Received July 2001; accepted September 2001 Author to whom correspondence should be addressed 1 Dept of Mathematics and Statistics, University of Melbourne, Parkville, Vic 3010, Australia mpn@unimelbeduau c Australian Statistical Publishing Association Inc 2002 Published by Blackwell Publishing Ltd

2 468 MEEI PYNG NG AND RAY WATSON 1982 p 372) When applied to continuous data grouped into intervals, this convention in effect is equivalent to allowing censoring to occur only ust before the end of the interval If the interval is wide, there can be serious overestimation of the survival function The failure-before-censoring convention for continuous data makes the hazard estimate as small as possible among possible orderings of the apparently tied observations Another aspect of discrete failure time that has escaped attention is the operation of the censoring mechanism when the potential failure time and the potential censoring time are tied for an individual In a random censorship model, each individual has a latent failure time T and a latent censoring time W What is actually observed when T = W? The answer to this question affects how ties between failure times and censoring times of different individuals should be handled In Section 2, we discuss the various censoring mechanisms that can occur in situations when the failure time and potential censoring time are tied for an individual In one of these situations there is a non-identifiability problem, which is discussed in Section 3 Section 4 deals with estimation and various reasonable assumptions that can be used to overcome this problem The failure-before-censoring convention creates an inconsistency when applied to competing-risk problems In a competing-risk problem, failures are classified into types, according to the cause of failure A cause-specific hazard rate, λ c t), for failure type cc= 1,,l) is commonly estimated by a Kaplan Meier estimate, treating all failures of type not equal to c as censored observations A consequence of this is that the relation Ft) = l c=1 F c t), where F c t) = exp t 0 λ c u) du) and Ft) is the overall survival function, does not hold for the corresponding Kaplan Meier estimates when there are ties between different failure types This is because different orders of occurrence of the tied observations are used, depending on which of the F c t) is being estimated The competing-risk problem is considered further in Section 5, where we describe an approach that overcomes the inconsistency problem described above In this paper, we investigate how ties in failure time data should be handled It is shown that there are more appropriate ways of handling ties than the standard failure-before-censoring method 2 Censoring mechanism We consider a random censorship model for discrete failure time With each individual in the population is associated a failure time T and a potential censoring time W, and T and W are independent When T < W, T is observed; when T > W, W is observed What happens when T = W depends on the context of the problem We consider three cases Case 1 When T = W, both T and W are observed This situation can occur if T and W correspond to two failure types T and W may be time to two types of failure that correspond to two different components in a piece of equipment under test The equipment is put through cycles of use until failure occurs The data can be represented as Y i,δ i ), i = 1,,n, where Y i = mint i,w i ) and δ i = 0ifY i = W i <T i, δ i = 1ifY i = T i <W i and δ i = 2ifY i = W i = T i In this case therefore and only in this case) we actually observe ties: observations for which δ i = 2 These data are expressed per individual, but when it comes to estimation of the hazard c Australian Statistical Publishing Association Inc 2002

3 DEALING WITH TIES IN FAILURE TIME DATA 469 and survivor functions the data are aggregated at each time point Thus in this case we have observations on: w = freqy = t,δ = 0) = freqt > t,w = t ), d = freqy = t,δ = 1) = freqt = t,w >t ), m = freqy = t,δ = 2) = freqt = t,w = t ) Case 2 When T = W, T is observed and W is not observed This means that, having observed T = t, all we know about the censoring time is that W t This situation is actually a special case of Case 3, but it is important and occurs often It can occur, for example, in data from an IVF program, where T is the number of completed cycles of treatment before pregnancy, and W is the number of completed cycles of treatment before withdrawal from the program If T = W = t, then pregnancy would be observed after t cycles and it would not be known that the patient would have withdrawn from the program had the last treatment not succeeded The data can be represented, as in Case 1, by Y i,δ i ), i = 1,,n But here, δ i takes only values 0 and 1: δ i = 0ifY i = W i <T i and δ i = 1ifY i = T i W i In this case we have observations on: w = freqy = t,δ = 0) = freqt > t,w = t ), d = freqy = t,δ = 1) = freqt = t,w t ) Case 3 When T = W, one and only one of them is observed, and which of them is observed is a random event for any particular individual This situation models the case when the discrete time arises from continuous time grouped into intervals Let T and W denote the underlying continuous failure time and censoring time respectively Suppose the time axis is partitioned into k intervals [a 1,a ), = 1,,k, where a 0 = 0 and a k = When T and W both lie in the interval t = [a 1,a ), then T = W = t However, T is observed if T W ; otherwise, W is observed That is, an individual with both failure time and censoring time in the interval t = [a 1,a ) could be censored at time t because the individual is censored before it fails if W <T ) and it is never known that it would have failed by time a had it not been censored On the other hand, it could be observed to fail at t,ift <W, the probability of which is denoted by π = PrT <W a 1 T <a,a 1 W <a ) The data can be represented as in Case 2, with δ i = 1 if failure is observed and δ i = 0 if the datum is censored The situation is represented in Figure 1: d = freqt = t,w >t ), u = freqt = t,w = t ; T observed), d = d + u ; w = freqt > t,w = t ), v = freqt = t,w = t ; W observed), w = w + v ; m = freqt = t,w = t ) = u + v ; n = freqt t,w t ), the number in the risk set at time t ; so that k n = d + u + v + w + n +1 = d + w + n +1 and n = d l + w l ) l= c Australian Statistical Publishing Association Inc 2002

4 470 MEEI PYNG NG AND RAY WATSON W d =d' +u m = u + v d' t u w' w = w' + v v T t Figure 1 Observed data in relation to failure time and censoring time Usually we observe only n, w and d = 1,,k), as is assumed for Case 3 Case 1 is unusual as it models the truly discrete case where ties between individuals failure times and censoring times can be observed, so that m is available It is more common for only one of failure or censoring to be observed In Case 2, it is assumed that v = 0 which brings about a simplification of the data and the subsequent analysis The difficulty with most discrete or grouped data is the generally unknown) subdivision of the m tied observations Of course, if there are few ties then this is not a serious problem However, if ties are common, we need at least to consider the possible influence of various assumptions about the subdivision between failure times and censoring times, and if they are potentially influential they should be allowed for by appropriate modelling 3 Non-identifiability Let t 1 <t 2 < <t k be the mass points of T and W They may represent intervals to which continuous-time variables have been grouped Let P = PrT t ), p = PrT = t ) and λ = PrT = t T t ), for = 1,,k We note that P = 1 l=1 1 λ l ), p = 1 λ l=1 1 λ l ), and P 1 = k l=1 p l = 1 = λ k In a non-parametric setting, λ 1,,λ k 1 are the parameters of the distribution of T Similarly, let Q = PrW t ), q = PrW = t ) and κ = PrW = t W t ) In a survival study, the primary interest is in the distribution of the failure time T Since only censored observations Y, δ) are available, the distribution of T is said to be nonidentifiable if it is not uniquely determined by the oint distribution of Y, δ) It is well-known that the distribution of T is not identifiable unless T and W are independent However, it is not generally known that in discrete-time survival analysis the distribution of T is nonidentifiable when the censoring mechanism has a random element, as in Case 3 above We consider the case when T = W and either T or W is observed at random; T and W are assumed to be independent Let π denote the probability that T is observed, when T = W = t Then the oint distribution of Y, δ) is p 1 = PrY = t,δ = 1) = λ 1 1 π )κ ) P Q, p 0 = PrY = t,δ = 0) = κ 1 π λ ) P Q, 1) c Australian Statistical Publishing Association Inc 2002

5 DEALING WITH TIES IN FAILURE TIME DATA 471 where k =1 p 0 + p 1 ) = 1 Letting p0 = p 0 /P Q ) and p 1 = p 1 /P Q ), the above can be written as: p 1 = λ 1 1 π )κ ), p 0 = κ 1 π λ ) These can be solved for λ and κ to give 1 λ = 1 p 0 + π p 0 + p 1 ) p 0 + π p 0 + p 1 )) 2 4π p1, 2π κ = p 0 + p 1 λ 1 λ Noting that P and Q can be expressed in terms of {λ l,κ l ; l = 1,, 1}, the above are solutions of λ and κ in terms of {π l,p l0,p l1 ; l = 1,, 1} We note in particular that, for fixed values of {p l0,p l1 ; l = 1,,k}, we can vary the values of π and obtain different solutions of λ That is, the distribution of T is not determined uniquely by the oint distribution of Y, δ) For identifiability, we need to make assumptions on π Some reasonable assumptions are considered in Section 4 Another indication of the non-identifiability of the situation can be gleaned from considering the likelihood function L, which is given in the appendix It is seen that L is a function of the parameters {λ,κ,π : = 1,,k} It can be shown that the score equations log L λ = log L κ = log L π = 0 = 1, 2,,k) are dependent and hence cannot produce a unique solution for all the λ,κ and π Indeed, they contain only 2k independent equations and so only 2k parameters can be estimated 4 Estimation Interest usually centres on estimating λ and P Maximum likelihood principles can be applied We treat the three cases described in Section 2 separately, using the notation given in Section 2 Case 1 In this case, when T = W = t, both T and W are observed to occur and so m is observed Standard likelihood methods give: ˆλ = d + m n and seˆλ ) = ˆλ 1 ˆλ ) n These are the usual results for binomial estimates Case 2 Here, when T = W = t, only T is observed, and W is not: this is the failurebefore-censoring scenario The case can be handled in much the same way as Case 1, since d = d + m = freqt = t,w t ) is observed We obtain: ˆλ = d and seˆλ n ) = ˆλ 1 ˆλ ) n c Australian Statistical Publishing Association Inc 2002

6 472 MEEI PYNG NG AND RAY WATSON Case 3 In this case, when T = W = t, only one of T and W is observed and it may be either one We assume that T is observed with probability π Whether T or W is observed is really a separate sub-experiment: if u and v were observed then we would have ˆλ = d + u + v )/n, as above As u and v are not available, this approach fails; and we need to re-write the likelihood in terms of d and w Also, as explained above, we have insufficient data to estimate the π, and so we need to make assumptions about the π One straightforward assumption is to give π a specified value: π = π The likelihood equations then give If π For π n π λ2 n w + π d + w )) λ + d = 0 = 0, the equation is linear and we have > 0, the quadratic equation gives ˆλ 0) d = n w n ˆλ = n w + π d + w ) w + π d + w )) 2 4 π n d 2 π n Then, P can be estimated by ˆλ 1) = d /n, as in Case 2; and P ˆ 1) estimate of the survivor function For π Pˆ = 1 l=1 1 ˆλ l ) For π = 1 l=1 n l d l )/n l, we obtain: = 1 2 2n ˆλ h) + d w 2n + d w ) 2 8n d =, 2n = 1, the above simplifies to which is the Kaplan Meier which is the maximum likelihood estimate given by Elandt-Johnson & Johnson 1980 p 164) under the assumption that the underlying continuous failure time is uniformly distributed in the interval and that all censoring occurs at the mid-point of the interval For an approximate standard error for this estimate, refer to Elandt-Johnson & Johnson 1980 p164) Another approach is to assume an explicit expression for π as a function of λ and κ As Case 3 is intended to model grouped continuous data, a plausible approach is to make assumptions about the underlying continuous failure time T and censoring time W and derive the consequent expression for π One such assumption is that T and W are uniformly distributed in the interval t = [a 1,a ) This gives π = 2 1, which has been dealt with above Another assumption is that both T and W have hazard functions that are constant in the interval [a 1,a ) These can be considered as piecewise approximations to the unknown hazard functions of T and W More generally, one can assume that the hazards of T and W are proportional over the interval [a 1,a ) Then π can be expressed as a function of λ and κ : π = λ )1 κ ) ) log1 κ )) κ λ κ log 1 λ )1 κ ) ) 2) c Australian Statistical Publishing Association Inc 2002

7 DEALING WITH TIES IN FAILURE TIME DATA 473 Table 1 Survivor function estimates for suicide ideation data age in years): S 0 denotes the survivor function estimate for π = 0, S J the oint risk estimate, S h the estimate for π = 1 2, and S 1 the Kaplan Meier estimate π = 1) Age, t years) n t d t w t S 0 t) S J t) S h t) S 1 t) If this relation is assumed then the problem is identifiable It is shown in the appendix that the resulting maximum likelihood estimate of λ is given by with standard error given by ˆλ J ) n d w ) d /d +w ) = 1, 3) n seˆλ J ) ) 2 = 1 ˆλ ) 2 d 2 n d + w )n d w ) + d w n d w )) d + w ) 3 log2 n The corresponding survivor function estimate given by se ˆ P J ) ) 2 = ˆ P J ) 1 ) 2 l=1 P ˆ J ) 4) = 1 l=1 1 ˆλ J ) l ) has standard error d 2 n d + w )n d w ) + d w n d w )) d + w ) 3 log2 n 5) The estimate in 3) is the oint-risk estimate of λ cited by Kaplan & Meier 1958) They indicate that the oint-risk estimate is the maximum likelihood estimate when the censoring is random with rate proportional to the failure rate This is precisely the assumption made here Further, Ng 1993) showed the oint-risk estimate to be the geometric mean of the Kaplan Meier estimates corresponding to all the possible orderings of the tied failure and censoring times: thus it represents an average over the possible orderings The derivations of the oint-risk estimate and its standard error are given in the appendix When w = 0, the estimate in 3) reduces to the usual Kaplan Meier estimate, and the variance given in 5) also reduces to the usual Greenwood formula used as variance of the Kaplan Meier estimate of the survival function Kaplan & Meier, 1958) Indeed, all the estimates λ i), i = 0, 1,h,J reduce to the same expression when w = 0 In practice, w = 0 means that no censoring time may occur at the same time as any other failure time: censorings are assumed to occur between the failure times Example 1 The data on first suicide ideation used by Willett & Singer 1993) are significantly grouped, with many ties between failure times The data were collected retrospectively from c Australian Statistical Publishing Association Inc 2002

8 474 MEEI PYNG NG AND RAY WATSON Table 2 Survivor function estimates for Stanford heart transplant data grouped into months: S 0 denotes the survivor function estimate for π = 0, S J the oint risk estimate, S h the estimate for π = 1 2 and S 1 the Kaplan Meier estimate π = 1) S JD denotes the oint risk estimate based on the daily data and S KD the Kaplan Meier estimate based on the daily data t months) S 0 t) S J t) S h t) S 1 t) S JD t) S KD t) undergraduates, who reported the age at which they first thought about suicide An observation was censored if the student had not thought of suicide by the date of data collection Of the 417 students, 287 had considered suicide by the date of data collection, and 130 had not; the latter are censored observations There was no censoring until age 16 years The data and the survivor function estimates for t 16 are given in Table 1 The full dataset can be found in Willett & Singer 1993) who report the Kaplan Meier estimate of the survivor function Table 1 shows that there is a difference between the survivor function estimates For these data, the oint risk estimate or the estimate for π = 2 1 would seem more appropriate, as they lie mid-way between the two extremes Example 2 The survival times in days for 249 patients in the Stanford heart transplant trial are a commonly used dataset Here we use pre-transplant survival time, and patients given transplants are considered censored The data are available in Kleinbaum 1996 p 297) In this form there are few ties and no problem with using the standard Kaplan Meier survivor function estimate However, if these data were recorded in months 30-day intervals), there would be a significant proportion of ties and a corresponding discrepancy in the survivor function estimates, as indicated in Table 2 If the degree of grouping approaches this level then significant errors can be made by using the Kaplan Meier survivor function estimate Here S J and S h provide closer estimates; in this case S h is closest 5 Competing-risk problem A competing-risk problem gives rise to data of the form {T i,δ i,c i ); i = 1,,n} where T i is the observed failure time, δ i the censoring indicator and C i is the failure type, which does not enter into the likelihood if δ i = 0, assuming a homogeneous population Kalbfleisch & Prentice, 1980 Chapter 7) Let λ c t) = lim t 0 Prt T<t+ t, C = c T t)/ t denote the cause-specific hazard rate, c = 1,,lThen λt) = lim t 0 Prt T<t+ t T t)/ t is the overall hazard rate The hazard rates are related by λt) = l c=1 λ c t), assuming each study unit can c Australian Statistical Publishing Association Inc 2002

9 DEALING WITH TIES IN FAILURE TIME DATA 475 fail because of only one failure type The overall survival function Ft) = exp t 0 λu)du) is related to the functions F c t) = exp t 0 λ c u)du) by Ft) = l c=1 F c t) We consider failure time as a continuous variable grouped into intervals; all observed continuous data are of this nature Then a competing-risk problem is similar to Case 3 as described in Section 2 with the added complication of failures being classified into types Suppose the observed data are d c = freqt = t,δ = 1,C = c), w = freqt = t,δ = 0), for = 1,,k and c = 1,,l Let n denote the number at risk at t, and d = c d c denote the total number of failures at time t The estimation of λt) and Ft) are as given in Section 4 for Case 3 The generalized maximum likelihood estimate of λ c t ) is obtained by treating all failures of types not equal to c as censored data Let d c = w + d c d d The estimates given for Case 3 can be extended to a competing-risk problem The extension of λ 0) and λ 1) is straightforward; λ h) becomes l ˆλ h) c t ) = l + 1)n ld c + d c ) 2 + 1)n ld c + d c 4l + 1)n d c 2n In this, we have replaced π = 2 1 by π censoring; λ J ) becomes = 1/l + 1) to account for the l failure types and The corresponding estimate of F c t) is n ˆλ J c ) d w ) dc/d +w ) t ) = 1 n ˆF c t) = t <t 1 ˆλ c t ) ) The relation Ft) = l c=1 F c t) holds for their estimates if there are no ties between the failure times of different types If ties occur, then the relation still holds for the oint-risk estimate ˆλ J c ) t ) but not for the other three The reason for this internal consistency of the oint-risk estimates can be seen by considering latent failure times of different failure types Let T c denote the time to failure of type c Assuming that T c c = 1,,l) are independent, the hazard function of T c is λ c t) Suppose G is a group of failure types to be aggregated into one type Let T G denote the time to failure of any type in G Then T G = min c G T c and its hazard rate is λ G t) = c G λ c t) This means that if all T c have proportional hazards in the interval [a,b), their hazards are also proportional to λ G t) for any group G In particular, by assuming all cause-specific hazard rates to be proportional and proportional to the hazard rate of censoring time W, we see that T = min T c has hazard proportional to W and T c has hazard proportional to T c = min{w,t d ; d c} Thus, the oint-risk formula is applicable to the estimation of λt ) and λ c t ) and the results are consistent within themselves On the other hand, if we assume c Australian Statistical Publishing Association Inc 2002

10 476 MEEI PYNG NG AND RAY WATSON T c and W to have uniform distribution within an interval [a,b), then T G does not have a uniform distribution within [a,b) Thus, λ h) does not extend to give internally consistent estimates of Ft) and F c t) 6 Discussion When failure time is genuinely discrete, the censoring mechanism can be identified and incorporated in the modelling process Estimation and inference are then straightforward In the more common case where the times are made discrete through rounding or grouping, assumptions are necessary to achieve identifiability of the survival function The various known estimates of the hazard rate Elandt-Johnson & Johnson, 1980 p 164; Kaplan & Meier, 1958; oint risk) can be derived under the same general structure by making different assumptions about π, the probability that failure is observed when there is a tie All the estimates considered are identical when there are no ties between failure times and censoring times Thus, they can be thought of as different ways of handling such ties The two extreme assumptions are π = 0 and π = 1 and they give bounds to the plausible estimates of the hazard rates and hence the survival function The Kaplan Meier corresponds to one extreme π = 1) In practice, if there are few such ties, all the estimates given in Section 4 are close and the Kaplan Meier estimate is a sensible simple choice, given that it is the current standard practice However, if there is a significant proportion of ties then this standard estimate can be quite wrong In that situation, ˆλ J ) or ˆλ h) are more moderate and are usually close to each other Either could be used We recommend that, as a matter of prudent practice, the estimates of the survival function corresponding to ˆλ 0) and ˆλ 1) should be routinely computed to check that they are not very different If the difference is small then the Kaplan Meier estimate ˆλ 1) ) can be used Otherwise ˆλ J ) should be used It is our view that ˆλ J ) has advantages over ˆλ h) It gives internally consistent estimates for the cause-specific hazards, when applied to competing-risk problems, as explained in Section 5 It is also the geometric mean of the Kaplan Meier estimates corresponding to all the possible orderings of the tied failure and censoring times see Ng, 1993) Further, it has a simpler algebraic expression than ˆλ h) and the expression for its standard error is more tractable For these reasons, ˆλ J ) is our preferred hazard rate estimate and might be better used as the default Nevertheless there is merit in computing the bounding estimates based on ˆλ 0) and ˆλ 1) to give an indication of possible error Appendix The maximum likelihood estimate of λ and its standard error as given in 3) and 4) are derived here Using the data description given for Case 3, the likelihood function is L = = k ) λ 1 ) d π )κ 1 λ l )1 κ l ) κ 1 π λ ) 1 λ l )1 κ l ) =1 l=1 k )) d λ 1 1 π )κ κ 1 π λ ) ) w 1 λ )1 κ ) ) n d w =1 l=1 ) w c Australian Statistical Publishing Association Inc 2002

11 DEALING WITH TIES IN FAILURE TIME DATA 477 Using the expression for π given in 2), 1 1 λ )1 κ ) ) log1 κ ) κ 1 π λ ) = log 1 λ )1 κ ) ) = α 1 β ), A1) log1 λ ) where α = 1 1 λ )1 κ ) and β = log 1 λ )1 κ ) ) By 1), α 1 β ) is the probability that an individual is censored at t given that the individual is at risk at t Similarly, 1 1 λ )1 κ ) ) log1 λ ) λ 1 1 π )κ ) = log 1 λ )1 κ ) ) = α β A2) This is the probability that an individual is observed to fail at t, given that the individual is still at risk at t Asn is the number of individuals at risk at time t,wehave d d d d n = Bin,α β ), w n = Bin,α 1 β )) and d + w n = Bin,α ) Using A1) and A2), the likelihood function simplifies to k L = 1 α ) n d w α d +w β d 1 β )w =1 from which we obtain the maximum likelihood estimates ˆα = d + w d and ˆβ n = d + w The estimate in 3) is then obtained using the relation λ = 1 1 α ) β The asymptotic variances and covariance of ˆα and ˆβ can be found by evaluating the information matrix We note that 2 log L α 2 = n d w 1 α ) 2 d + w α 2, 2 log L β 2 = d β 2 w 1 β ) 2, and all other second-order partial derivatives are zero Further, conditional on n, the number at risk at time t, 2 log L ) n 2 E α 2 = α 1 α ), E log L ) n α β 2 = β 1 β ) Thus the information matrix is diagonal and its inverse gives var ˆα ) α 1 α ), var ˆβ n ) β 1 β ) and cov ˆα α n, ˆβ ) 0 Then using the relation ˆλ = 1 1 ˆα ) ˆβ, the asymptotic variance of ˆλ is found to be as given by 5) References Adams, GG & Watson, RK 1989) A discrete time parametric model for the analysis of failure time data Aust J Statist 31, Cox, DR 1972) Regression models and life tables with discussion) J Roy Stat Soc Ser B 34, c Australian Statistical Publishing Association Inc 2002

12 478 MEEI PYNG NG AND RAY WATSON Elandt-Johnson, RC & Johnson, NL 1980) Survival Models and Data Analysis New York: Wiley Kalbfleisch, JD & Prentice, RL 1980) The Statistical Analysis of Failure Time Data New York: Wiley Kaplan, EL & Meier, P 1958) Non-parametric estimation from incomplete observations J Amer Statist Assoc 53, Kleinbaum, DG 1996) Survival Analysis New York: Springer Lawless, JF 1982) Statistical Models and Methods for Lifetime Data New York: Wiley Ng, MP 1993) A relation between product-limit estimate and oint-risk estimate Comm Statist Theory Methods 22, Willett, JB & Singer, JD 1993) Investigating onset, cessation, relapse and recovery: why you should, and how you can, use discrete-time survival analysis to examine event occurrence J Consulting and Clinical Psychology 61, c Australian Statistical Publishing Association Inc 2002

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

FULL LIKELIHOOD INFERENCES IN THE COX MODEL October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach

More information

Survival Analysis. Stat 526. April 13, 2018

Survival Analysis. Stat 526. April 13, 2018 Survival Analysis Stat 526 April 13, 2018 1 Functions of Survival Time Let T be the survival time for a subject Then P [T < 0] = 0 and T is a continuous random variable The Survival function is defined

More information

Analysis of Survival Data Using Cox Model (Continuous Type)

Analysis of Survival Data Using Cox Model (Continuous Type) Australian Journal of Basic and Alied Sciences, 7(0): 60-607, 03 ISSN 99-878 Analysis of Survival Data Using Cox Model (Continuous Type) Khawla Mustafa Sadiq Department of Mathematics, Education College,

More information

UNIVERSITY OF CALIFORNIA, SAN DIEGO

UNIVERSITY OF CALIFORNIA, SAN DIEGO UNIVERSITY OF CALIFORNIA, SAN DIEGO Estimation of the primary hazard ratio in the presence of a secondary covariate with non-proportional hazards An undergraduate honors thesis submitted to the Department

More information

Survival Analysis for Case-Cohort Studies

Survival Analysis for Case-Cohort Studies Survival Analysis for ase-ohort Studies Petr Klášterecký Dept. of Probability and Mathematical Statistics, Faculty of Mathematics and Physics, harles University, Prague, zech Republic e-mail: petr.klasterecky@matfyz.cz

More information

Typical Survival Data Arising From a Clinical Trial. Censoring. The Survivor Function. Mathematical Definitions Introduction

Typical Survival Data Arising From a Clinical Trial. Censoring. The Survivor Function. Mathematical Definitions Introduction Outline CHL 5225H Advanced Statistical Methods for Clinical Trials: Survival Analysis Prof. Kevin E. Thorpe Defining Survival Data Mathematical Definitions Non-parametric Estimates of Survival Comparing

More information

Survival Distributions, Hazard Functions, Cumulative Hazards

Survival Distributions, Hazard Functions, Cumulative Hazards BIO 244: Unit 1 Survival Distributions, Hazard Functions, Cumulative Hazards 1.1 Definitions: The goals of this unit are to introduce notation, discuss ways of probabilistically describing the distribution

More information

Multistate Modeling and Applications

Multistate Modeling and Applications Multistate Modeling and Applications Yang Yang Department of Statistics University of Michigan, Ann Arbor IBM Research Graduate Student Workshop: Statistics for a Smarter Planet Yang Yang (UM, Ann Arbor)

More information

STAT331. Cox s Proportional Hazards Model

STAT331. Cox s Proportional Hazards Model STAT331 Cox s Proportional Hazards Model In this unit we introduce Cox s proportional hazards (Cox s PH) model, give a heuristic development of the partial likelihood function, and discuss adaptations

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 24 Paper 153 A Note on Empirical Likelihood Inference of Residual Life Regression Ying Qing Chen Yichuan

More information

TMA 4275 Lifetime Analysis June 2004 Solution

TMA 4275 Lifetime Analysis June 2004 Solution TMA 4275 Lifetime Analysis June 2004 Solution Problem 1 a) Observation of the outcome is censored, if the time of the outcome is not known exactly and only the last time when it was observed being intact,

More information

Exercises. (a) Prove that m(t) =

Exercises. (a) Prove that m(t) = Exercises 1. Lack of memory. Verify that the exponential distribution has the lack of memory property, that is, if T is exponentially distributed with parameter λ > then so is T t given that T > t for

More information

Lecture 22 Survival Analysis: An Introduction

Lecture 22 Survival Analysis: An Introduction University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 22 Survival Analysis: An Introduction There is considerable interest among economists in models of durations, which

More information

Multi-state Models: An Overview

Multi-state Models: An Overview Multi-state Models: An Overview Andrew Titman Lancaster University 14 April 2016 Overview Introduction to multi-state modelling Examples of applications Continuously observed processes Intermittently observed

More information

Key Words: survival analysis; bathtub hazard; accelerated failure time (AFT) regression; power-law distribution.

Key Words: survival analysis; bathtub hazard; accelerated failure time (AFT) regression; power-law distribution. POWER-LAW ADJUSTED SURVIVAL MODELS William J. Reed Department of Mathematics & Statistics University of Victoria PO Box 3060 STN CSC Victoria, B.C. Canada V8W 3R4 reed@math.uvic.ca Key Words: survival

More information

Multi-state models: prediction

Multi-state models: prediction Department of Medical Statistics and Bioinformatics Leiden University Medical Center Course on advanced survival analysis, Copenhagen Outline Prediction Theory Aalen-Johansen Computational aspects Applications

More information

Product-limit estimators of the survival function with left or right censored data

Product-limit estimators of the survival function with left or right censored data Product-limit estimators of the survival function with left or right censored data 1 CREST-ENSAI Campus de Ker-Lann Rue Blaise Pascal - BP 37203 35172 Bruz cedex, France (e-mail: patilea@ensai.fr) 2 Institut

More information

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data 1 Part III. Hypothesis Testing III.1. Log-rank Test for Right-censored Failure Time Data Consider a survival study consisting of n independent subjects from p different populations with survival functions

More information

4. Comparison of Two (K) Samples

4. Comparison of Two (K) Samples 4. Comparison of Two (K) Samples K=2 Problem: compare the survival distributions between two groups. E: comparing treatments on patients with a particular disease. Z: Treatment indicator, i.e. Z = 1 for

More information

MAS3301 / MAS8311 Biostatistics Part II: Survival

MAS3301 / MAS8311 Biostatistics Part II: Survival MAS330 / MAS83 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-0 8 Parametric models 8. Introduction In the last few sections (the KM

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH Jian-Jian Ren 1 and Mai Zhou 2 University of Central Florida and University of Kentucky Abstract: For the regression parameter

More information

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky Empirical likelihood with right censored data were studied by Thomas and Grunkmier (1975), Li (1995),

More information

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model Other Survival Models (1) Non-PH models We briefly discussed the non-proportional hazards (non-ph) model λ(t Z) = λ 0 (t) exp{β(t) Z}, where β(t) can be estimated by: piecewise constants (recall how);

More information

Lecture 3. Truncation, length-bias and prevalence sampling

Lecture 3. Truncation, length-bias and prevalence sampling Lecture 3. Truncation, length-bias and prevalence sampling 3.1 Prevalent sampling Statistical techniques for truncated data have been integrated into survival analysis in last two decades. Truncation in

More information

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis STAT 6350 Analysis of Lifetime Data Failure-time Regression Analysis Explanatory Variables for Failure Times Usually explanatory variables explain/predict why some units fail quickly and some units survive

More information

Chapter 4 Fall Notations: t 1 < t 2 < < t D, D unique death times. d j = # deaths at t j = n. Y j = # at risk /alive at t j = n

Chapter 4 Fall Notations: t 1 < t 2 < < t D, D unique death times. d j = # deaths at t j = n. Y j = # at risk /alive at t j = n Bios 323: Applied Survival Analysis Qingxia (Cindy) Chen Chapter 4 Fall 2012 4.2 Estimators of the survival and cumulative hazard functions for RC data Suppose X is a continuous random failure time with

More information

Survival Analysis I (CHL5209H)

Survival Analysis I (CHL5209H) Survival Analysis Dalla Lana School of Public Health University of Toronto olli.saarela@utoronto.ca January 7, 2015 31-1 Literature Clayton D & Hills M (1993): Statistical Models in Epidemiology. Not really

More information

ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables

ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES Cox s regression analysis Time dependent explanatory variables Henrik Ravn Bandim Health Project, Statens Serum Institut 4 November 2011 1 / 53

More information

Survival Analysis: Weeks 2-3. Lu Tian and Richard Olshen Stanford University

Survival Analysis: Weeks 2-3. Lu Tian and Richard Olshen Stanford University Survival Analysis: Weeks 2-3 Lu Tian and Richard Olshen Stanford University 2 Kaplan-Meier(KM) Estimator Nonparametric estimation of the survival function S(t) = pr(t > t) The nonparametric estimation

More information

Lecture 7. Proportional Hazards Model - Handling Ties and Survival Estimation Statistics Survival Analysis. Presented February 4, 2016

Lecture 7. Proportional Hazards Model - Handling Ties and Survival Estimation Statistics Survival Analysis. Presented February 4, 2016 Proportional Hazards Model - Handling Ties and Survival Estimation Statistics 255 - Survival Analysis Presented February 4, 2016 likelihood - Discrete Dan Gillen Department of Statistics University of

More information

Censoring and Truncation - Highlighting the Differences

Censoring and Truncation - Highlighting the Differences Censoring and Truncation - Highlighting the Differences Micha Mandel The Hebrew University of Jerusalem, Jerusalem, Israel, 91905 July 9, 2007 Micha Mandel is a Lecturer, Department of Statistics, The

More information

MAS3301 / MAS8311 Biostatistics Part II: Survival

MAS3301 / MAS8311 Biostatistics Part II: Survival MAS3301 / MAS8311 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-10 1 13 The Cox proportional hazards model 13.1 Introduction In the

More information

In contrast, parametric techniques (fitting exponential or Weibull, for example) are more focussed, can handle general covariates, but require

In contrast, parametric techniques (fitting exponential or Weibull, for example) are more focussed, can handle general covariates, but require Chapter 5 modelling Semi parametric We have considered parametric and nonparametric techniques for comparing survival distributions between different treatment groups. Nonparametric techniques, such as

More information

Cox s proportional hazards model and Cox s partial likelihood

Cox s proportional hazards model and Cox s partial likelihood Cox s proportional hazards model and Cox s partial likelihood Rasmus Waagepetersen October 12, 2018 1 / 27 Non-parametric vs. parametric Suppose we want to estimate unknown function, e.g. survival function.

More information

Analysis of competing risks data and simulation of data following predened subdistribution hazards

Analysis of competing risks data and simulation of data following predened subdistribution hazards Analysis of competing risks data and simulation of data following predened subdistribution hazards Bernhard Haller Institut für Medizinische Statistik und Epidemiologie Technische Universität München 27.05.2013

More information

Technical Report - 7/87 AN APPLICATION OF COX REGRESSION MODEL TO THE ANALYSIS OF GROUPED PULMONARY TUBERCULOSIS SURVIVAL DATA

Technical Report - 7/87 AN APPLICATION OF COX REGRESSION MODEL TO THE ANALYSIS OF GROUPED PULMONARY TUBERCULOSIS SURVIVAL DATA Technical Report - 7/87 AN APPLICATION OF COX REGRESSION MODEL TO THE ANALYSIS OF GROUPED PULMONARY TUBERCULOSIS SURVIVAL DATA P. VENKATESAN* K. VISWANATHAN + R. PRABHAKAR* * Tuberculosis Research Centre,

More information

Linear rank statistics

Linear rank statistics Linear rank statistics Comparison of two groups. Consider the failure time T ij of j-th subject in the i-th group for i = 1 or ; the first group is often called control, and the second treatment. Let n

More information

Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials

Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials Progress, Updates, Problems William Jen Hoe Koh May 9, 2013 Overview Marginal vs Conditional What is TMLE? Key Estimation

More information

Tests of independence for censored bivariate failure time data

Tests of independence for censored bivariate failure time data Tests of independence for censored bivariate failure time data Abstract Bivariate failure time data is widely used in survival analysis, for example, in twins study. This article presents a class of χ

More information

Semi-Competing Risks on A Trivariate Weibull Survival Model

Semi-Competing Risks on A Trivariate Weibull Survival Model Semi-Competing Risks on A Trivariate Weibull Survival Model Cheng K. Lee Department of Targeting Modeling Insight & Innovation Marketing Division Wachovia Corporation Charlotte NC 28244 Jenq-Daw Lee Graduate

More information

Estimation for Modified Data

Estimation for Modified Data Definition. Estimation for Modified Data 1. Empirical distribution for complete individual data (section 11.) An observation X is truncated from below ( left truncated) at d if when it is at or below d

More information

A SMOOTHED VERSION OF THE KAPLAN-MEIER ESTIMATOR. Agnieszka Rossa

A SMOOTHED VERSION OF THE KAPLAN-MEIER ESTIMATOR. Agnieszka Rossa A SMOOTHED VERSION OF THE KAPLAN-MEIER ESTIMATOR Agnieszka Rossa Dept. of Stat. Methods, University of Lódź, Poland Rewolucji 1905, 41, Lódź e-mail: agrossa@krysia.uni.lodz.pl and Ryszard Zieliński Inst.

More information

Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL

Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL The Cox PH model: λ(t Z) = λ 0 (t) exp(β Z). How do we estimate the survival probability, S z (t) = S(t Z) = P (T > t Z), for an individual with covariates

More information

A Regression Model For Recurrent Events With Distribution Free Correlation Structure

A Regression Model For Recurrent Events With Distribution Free Correlation Structure A Regression Model For Recurrent Events With Distribution Free Correlation Structure J. Pénichoux(1), A. Latouche(2), T. Moreau(1) (1) INSERM U780 (2) Université de Versailles, EA2506 ISCB - 2009 - Prague

More information

Package threg. August 10, 2015

Package threg. August 10, 2015 Package threg August 10, 2015 Title Threshold Regression Version 1.0.3 Date 2015-08-10 Author Tao Xiao Maintainer Tao Xiao Depends R (>= 2.10), survival, Formula Fit a threshold regression

More information

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Jonathan Taylor & Kristin Cobb Statistics 262: Intermediate Biostatistics p.1/?? Overview of today s class Kaplan-Meier Curve

More information

Likelihood Construction, Inference for Parametric Survival Distributions

Likelihood Construction, Inference for Parametric Survival Distributions Week 1 Likelihood Construction, Inference for Parametric Survival Distributions In this section we obtain the likelihood function for noninformatively rightcensored survival data and indicate how to make

More information

Simple techniques for comparing survival functions with interval-censored data

Simple techniques for comparing survival functions with interval-censored data Simple techniques for comparing survival functions with interval-censored data Jinheum Kim, joint with Chung Mo Nam jinhkim@suwon.ac.kr Department of Applied Statistics University of Suwon Comparing survival

More information

Ph.D. course: Regression models. Introduction. 19 April 2012

Ph.D. course: Regression models. Introduction. 19 April 2012 Ph.D. course: Regression models Introduction PKA & LTS Sect. 1.1, 1.2, 1.4 19 April 2012 www.biostat.ku.dk/~pka/regrmodels12 Per Kragh Andersen 1 Regression models The distribution of one outcome variable

More information

BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY

BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY Ingo Langner 1, Ralf Bender 2, Rebecca Lenz-Tönjes 1, Helmut Küchenhoff 2, Maria Blettner 2 1

More information

Survival Regression Models

Survival Regression Models Survival Regression Models David M. Rocke May 18, 2017 David M. Rocke Survival Regression Models May 18, 2017 1 / 32 Background on the Proportional Hazards Model The exponential distribution has constant

More information

ST745: Survival Analysis: Cox-PH!

ST745: Survival Analysis: Cox-PH! ST745: Survival Analysis: Cox-PH! Eric B. Laber Department of Statistics, North Carolina State University April 20, 2015 Rien n est plus dangereux qu une idee, quand on n a qu une idee. (Nothing is more

More information

Survival Analysis. Lu Tian and Richard Olshen Stanford University

Survival Analysis. Lu Tian and Richard Olshen Stanford University 1 Survival Analysis Lu Tian and Richard Olshen Stanford University 2 Survival Time/ Failure Time/Event Time We will introduce various statistical methods for analyzing survival outcomes What is the survival

More information

STAT Sample Problem: General Asymptotic Results

STAT Sample Problem: General Asymptotic Results STAT331 1-Sample Problem: General Asymptotic Results In this unit we will consider the 1-sample problem and prove the consistency and asymptotic normality of the Nelson-Aalen estimator of the cumulative

More information

Notes largely based on Statistical Methods for Reliability Data by W.Q. Meeker and L. A. Escobar, Wiley, 1998 and on their class notes.

Notes largely based on Statistical Methods for Reliability Data by W.Q. Meeker and L. A. Escobar, Wiley, 1998 and on their class notes. Unit 2: Models, Censoring, and Likelihood for Failure-Time Data Notes largely based on Statistical Methods for Reliability Data by W.Q. Meeker and L. A. Escobar, Wiley, 1998 and on their class notes. Ramón

More information

Full likelihood inferences in the Cox model: an empirical likelihood approach

Full likelihood inferences in the Cox model: an empirical likelihood approach Ann Inst Stat Math 2011) 63:1005 1018 DOI 10.1007/s10463-010-0272-y Full likelihood inferences in the Cox model: an empirical likelihood approach Jian-Jian Ren Mai Zhou Received: 22 September 2008 / Revised:

More information

Ph.D. course: Regression models. Regression models. Explanatory variables. Example 1.1: Body mass index and vitamin D status

Ph.D. course: Regression models. Regression models. Explanatory variables. Example 1.1: Body mass index and vitamin D status Ph.D. course: Regression models Introduction PKA & LTS Sect. 1.1, 1.2, 1.4 25 April 2013 www.biostat.ku.dk/~pka/regrmodels13 Per Kragh Andersen Regression models The distribution of one outcome variable

More information

Philosophy and Features of the mstate package

Philosophy and Features of the mstate package Introduction Mathematical theory Practice Discussion Philosophy and Features of the mstate package Liesbeth de Wreede, Hein Putter Department of Medical Statistics and Bioinformatics Leiden University

More information

Investigation of goodness-of-fit test statistic distributions by random censored samples

Investigation of goodness-of-fit test statistic distributions by random censored samples d samples Investigation of goodness-of-fit test statistic distributions by random censored samples Novosibirsk State Technical University November 22, 2010 d samples Outline 1 Nonparametric goodness-of-fit

More information

1 Introduction. 2 Residuals in PH model

1 Introduction. 2 Residuals in PH model Supplementary Material for Diagnostic Plotting Methods for Proportional Hazards Models With Time-dependent Covariates or Time-varying Regression Coefficients BY QIQING YU, JUNYI DONG Department of Mathematical

More information

Model Based Clustering of Count Processes Data

Model Based Clustering of Count Processes Data Model Based Clustering of Count Processes Data Tin Lok James Ng, Brendan Murphy Insight Centre for Data Analytics School of Mathematics and Statistics May 15, 2017 Tin Lok James Ng, Brendan Murphy (Insight)

More information

Censoring mechanisms

Censoring mechanisms Censoring mechanisms Patrick Breheny September 3 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/23 Fixed vs. random censoring In the previous lecture, we derived the contribution to the likelihood

More information

Parameters Estimation for a Linear Exponential Distribution Based on Grouped Data

Parameters Estimation for a Linear Exponential Distribution Based on Grouped Data International Mathematical Forum, 3, 2008, no. 33, 1643-1654 Parameters Estimation for a Linear Exponential Distribution Based on Grouped Data A. Al-khedhairi Department of Statistics and O.R. Faculty

More information

Power and Sample Size Calculations with the Additive Hazards Model

Power and Sample Size Calculations with the Additive Hazards Model Journal of Data Science 10(2012), 143-155 Power and Sample Size Calculations with the Additive Hazards Model Ling Chen, Chengjie Xiong, J. Philip Miller and Feng Gao Washington University School of Medicine

More information

Lecture 12: Application of Maximum Likelihood Estimation:Truncation, Censoring, and Corner Solutions

Lecture 12: Application of Maximum Likelihood Estimation:Truncation, Censoring, and Corner Solutions Econ 513, USC, Department of Economics Lecture 12: Application of Maximum Likelihood Estimation:Truncation, Censoring, and Corner Solutions I Introduction Here we look at a set of complications with the

More information

Interval Estimation for Parameters of a Bivariate Time Varying Covariate Model

Interval Estimation for Parameters of a Bivariate Time Varying Covariate Model Pertanika J. Sci. & Technol. 17 (2): 313 323 (2009) ISSN: 0128-7680 Universiti Putra Malaysia Press Interval Estimation for Parameters of a Bivariate Time Varying Covariate Model Jayanthi Arasan Department

More information

TGDR: An Introduction

TGDR: An Introduction TGDR: An Introduction Julian Wolfson Student Seminar March 28, 2007 1 Variable Selection 2 Penalization, Solution Paths and TGDR 3 Applying TGDR 4 Extensions 5 Final Thoughts Some motivating examples We

More information

Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals. John W. Mac McDonald & Alessandro Rosina

Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals. John W. Mac McDonald & Alessandro Rosina Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals John W. Mac McDonald & Alessandro Rosina Quantitative Methods in the Social Sciences Seminar -

More information

Statistical Inference and Methods

Statistical Inference and Methods Department of Mathematics Imperial College London d.stephens@imperial.ac.uk http://stats.ma.ic.ac.uk/ das01/ 31st January 2006 Part VI Session 6: Filtering and Time to Event Data Session 6: Filtering and

More information

Credit risk and survival analysis: Estimation of Conditional Cure Rate

Credit risk and survival analysis: Estimation of Conditional Cure Rate Stochastics and Financial Mathematics Master Thesis Credit risk and survival analysis: Estimation of Conditional Cure Rate Author: Just Bajželj Examination date: August 30, 2018 Supervisor: dr. A.J. van

More information

Longitudinal + Reliability = Joint Modeling

Longitudinal + Reliability = Joint Modeling Longitudinal + Reliability = Joint Modeling Carles Serrat Institute of Statistics and Mathematics Applied to Building CYTED-HAROSA International Workshop November 21-22, 2013 Barcelona Mainly from Rizopoulos,

More information

Hacettepe Journal of Mathematics and Statistics Volume 45 (5) (2016), Abstract

Hacettepe Journal of Mathematics and Statistics Volume 45 (5) (2016), Abstract Hacettepe Journal of Mathematics and Statistics Volume 45 (5) (2016), 1605 1620 Comparing of some estimation methods for parameters of the Marshall-Olkin generalized exponential distribution under progressive

More information

A SIMPLE IMPROVEMENT OF THE KAPLAN-MEIER ESTIMATOR. Agnieszka Rossa

A SIMPLE IMPROVEMENT OF THE KAPLAN-MEIER ESTIMATOR. Agnieszka Rossa A SIMPLE IMPROVEMENT OF THE KAPLAN-MEIER ESTIMATOR Agnieszka Rossa Dept of Stat Methods, University of Lódź, Poland Rewolucji 1905, 41, Lódź e-mail: agrossa@krysiaunilodzpl and Ryszard Zieliński Inst Math

More information

Goodness-of-fit tests for randomly censored Weibull distributions with estimated parameters

Goodness-of-fit tests for randomly censored Weibull distributions with estimated parameters Communications for Statistical Applications and Methods 2017, Vol. 24, No. 5, 519 531 https://doi.org/10.5351/csam.2017.24.5.519 Print ISSN 2287-7843 / Online ISSN 2383-4757 Goodness-of-fit tests for randomly

More information

Statistical Methods for Alzheimer s Disease Studies

Statistical Methods for Alzheimer s Disease Studies Statistical Methods for Alzheimer s Disease Studies Rebecca A. Betensky, Ph.D. Department of Biostatistics, Harvard T.H. Chan School of Public Health July 19, 2016 1/37 OUTLINE 1 Statistical collaborations

More information

STAT 526 Spring Final Exam. Thursday May 5, 2011

STAT 526 Spring Final Exam. Thursday May 5, 2011 STAT 526 Spring 2011 Final Exam Thursday May 5, 2011 Time: 2 hours Name (please print): Show all your work and calculations. Partial credit will be given for work that is partially correct. Points will

More information

Lecture 7 Time-dependent Covariates in Cox Regression

Lecture 7 Time-dependent Covariates in Cox Regression Lecture 7 Time-dependent Covariates in Cox Regression So far, we ve been considering the following Cox PH model: λ(t Z) = λ 0 (t) exp(β Z) = λ 0 (t) exp( β j Z j ) where β j is the parameter for the the

More information

Survival Analysis Math 434 Fall 2011

Survival Analysis Math 434 Fall 2011 Survival Analysis Math 434 Fall 2011 Part IV: Chap. 8,9.2,9.3,11: Semiparametric Proportional Hazards Regression Jimin Ding Math Dept. www.math.wustl.edu/ jmding/math434/fall09/index.html Basic Model Setup

More information

ST495: Survival Analysis: Maximum likelihood

ST495: Survival Analysis: Maximum likelihood ST495: Survival Analysis: Maximum likelihood Eric B. Laber Department of Statistics, North Carolina State University February 11, 2014 Everything is deception: seeking the minimum of illusion, keeping

More information

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data Malaysian Journal of Mathematical Sciences 11(3): 33 315 (217) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal homepage: http://einspem.upm.edu.my/journal Approximation of Survival Function by Taylor

More information

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data Communications for Statistical Applications and Methods 2015, Vol. 22, No. 6, 599 604 DOI: http://dx.doi.org/10.5351/csam.2015.22.6.599 Print ISSN 2287-7843 / Online ISSN 2383-4757 Estimation of Conditional

More information

Multistate models and recurrent event models

Multistate models and recurrent event models Multistate models Multistate models and recurrent event models Patrick Breheny December 10 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/22 Introduction Multistate models In this final lecture,

More information

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis CIMAT Taller de Modelos de Capture y Recaptura 2010 Known Fate urvival Analysis B D BALANCE MODEL implest population model N = λ t+ 1 N t Deeper understanding of dynamics can be gained by identifying variation

More information

Analysis of transformation models with censored data

Analysis of transformation models with censored data Biometrika (1995), 82,4, pp. 835-45 Printed in Great Britain Analysis of transformation models with censored data BY S. C. CHENG Department of Biomathematics, M. D. Anderson Cancer Center, University of

More information

DAGStat Event History Analysis.

DAGStat Event History Analysis. DAGStat 2016 Event History Analysis Robin.Henderson@ncl.ac.uk 1 / 75 Schedule 9.00 Introduction 10.30 Break 11.00 Regression Models, Frailty and Multivariate Survival 12.30 Lunch 13.30 Time-Variation and

More information

FAILURE-TIME WITH DELAYED ONSET

FAILURE-TIME WITH DELAYED ONSET REVSTAT Statistical Journal Volume 13 Number 3 November 2015 227 231 FAILURE-TIME WITH DELAYED ONSET Authors: Man Yu Wong Department of Mathematics Hong Kong University of Science and Technology Hong Kong

More information

4 Testing Hypotheses. 4.1 Tests in the regression setting. 4.2 Non-parametric testing of survival between groups

4 Testing Hypotheses. 4.1 Tests in the regression setting. 4.2 Non-parametric testing of survival between groups 4 Testing Hypotheses The next lectures will look at tests, some in an actuarial setting, and in the last subsection we will also consider tests applied to graduation 4 Tests in the regression setting )

More information

Multivariate Survival Analysis

Multivariate Survival Analysis Multivariate Survival Analysis Previously we have assumed that either (X i, δ i ) or (X i, δ i, Z i ), i = 1,..., n, are i.i.d.. This may not always be the case. Multivariate survival data can arise in

More information

Analysis of Progressive Type-II Censoring. in the Weibull Model for Competing Risks Data. with Binomial Removals

Analysis of Progressive Type-II Censoring. in the Weibull Model for Competing Risks Data. with Binomial Removals Applied Mathematical Sciences, Vol. 5, 2011, no. 22, 1073-1087 Analysis of Progressive Type-II Censoring in the Weibull Model for Competing Risks Data with Binomial Removals Reza Hashemi and Leila Amiri

More information

49th European Organization for Quality Congress. Topic: Quality Improvement. Service Reliability in Electrical Distribution Networks

49th European Organization for Quality Congress. Topic: Quality Improvement. Service Reliability in Electrical Distribution Networks 49th European Organization for Quality Congress Topic: Quality Improvement Service Reliability in Electrical Distribution Networks José Mendonça Dias, Rogério Puga Leal and Zulema Lopes Pereira Department

More information

Group Sequential Tests for Delayed Responses. Christopher Jennison. Lisa Hampson. Workshop on Special Topics on Sequential Methodology

Group Sequential Tests for Delayed Responses. Christopher Jennison. Lisa Hampson. Workshop on Special Topics on Sequential Methodology Group Sequential Tests for Delayed Responses Christopher Jennison Department of Mathematical Sciences, University of Bath, UK http://people.bath.ac.uk/mascj Lisa Hampson Department of Mathematics and Statistics,

More information

Survival Analysis. STAT 526 Professor Olga Vitek

Survival Analysis. STAT 526 Professor Olga Vitek Survival Analysis STAT 526 Professor Olga Vitek May 4, 2011 9 Survival Data and Survival Functions Statistical analysis of time-to-event data Lifetime of machines and/or parts (called failure time analysis

More information

Logistic regression model for survival time analysis using time-varying coefficients

Logistic regression model for survival time analysis using time-varying coefficients Logistic regression model for survival time analysis using time-varying coefficients Accepted in American Journal of Mathematical and Management Sciences, 2016 Kenichi SATOH ksatoh@hiroshima-u.ac.jp Research

More information

TESTS FOR LOCATION WITH K SAMPLES UNDER THE KOZIOL-GREEN MODEL OF RANDOM CENSORSHIP Key Words: Ke Wu Department of Mathematics University of Mississip

TESTS FOR LOCATION WITH K SAMPLES UNDER THE KOZIOL-GREEN MODEL OF RANDOM CENSORSHIP Key Words: Ke Wu Department of Mathematics University of Mississip TESTS FOR LOCATION WITH K SAMPLES UNDER THE KOIOL-GREEN MODEL OF RANDOM CENSORSHIP Key Words: Ke Wu Department of Mathematics University of Mississippi University, MS38677 K-sample location test, Koziol-Green

More information

Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective. Anastasios (Butch) Tsiatis and Xiaofei Bai

Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective. Anastasios (Butch) Tsiatis and Xiaofei Bai Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective Anastasios (Butch) Tsiatis and Xiaofei Bai Department of Statistics North Carolina State University 1/35 Optimal Treatment

More information

Extensions of Cox Model for Non-Proportional Hazards Purpose

Extensions of Cox Model for Non-Proportional Hazards Purpose PhUSE 2013 Paper SP07 Extensions of Cox Model for Non-Proportional Hazards Purpose Jadwiga Borucka, PAREXEL, Warsaw, Poland ABSTRACT Cox proportional hazard model is one of the most common methods used

More information

Statistical Issues in the Use of Composite Endpoints in Clinical Trials

Statistical Issues in the Use of Composite Endpoints in Clinical Trials Statistical Issues in the Use of Composite Endpoints in Clinical Trials Longyang Wu and Richard Cook Statistics and Actuarial Science, University of Waterloo May 14, 2010 Outline Introduction: A brief

More information

Lecture 5 Models and methods for recurrent event data

Lecture 5 Models and methods for recurrent event data Lecture 5 Models and methods for recurrent event data Recurrent and multiple events are commonly encountered in longitudinal studies. In this chapter we consider ordered recurrent and multiple events.

More information

Inference for the dependent competing risks model with masked causes of

Inference for the dependent competing risks model with masked causes of Inference for the dependent competing risks model with masked causes of failure Radu V. Craiu and Benjamin Reiser Abstract. The competing risks model is useful in settings in which individuals/units may

More information

Multistate models in survival and event history analysis

Multistate models in survival and event history analysis Multistate models in survival and event history analysis Dorota M. Dabrowska UCLA November 8, 2011 Research supported by the grant R01 AI067943 from NIAID. The content is solely the responsibility of the

More information