Rank-based inference for the accelerated failure time model

Size: px
Start display at page:

Download "Rank-based inference for the accelerated failure time model"

Transcription

1 Biometrika (2003), 90, 2, pp Biometrika Trust Printed in reat Britain Rank-based inference for the accelerated failure time model BY ZHEZHEN JIN Department of Biostatistics, Columbia University, Ne York, Ne York 10032, U.S.A. D. Y. LIN Department of Biostatistics, University of North Carolina, Chapel Hill, North Carolina , U.S.A. L. J. WEI Department of Biostatistics, Harvard University, Boston, Massachusetts 02115, U.S.A. AND ZHILIAN YIN Department of Statistics, Columbia University, Ne York, Ne York 10027, U.S.A. SUMMARY A broad class of rank-based monotone estimating functions is developed for the semiparametric accelerated failure time model ith censored observations. The corresponding estimators can be obtained via linear programming, and are shon to be consistent and asymptotically normal. The limiting covariance matrices can be estimated by a resampling technique, hich does not involve nonparametric density estimation or numerical derivatives. The ne estimators represent consistent roots of the non-monotone estimating equations based on the familiar eighted log-rank statistics. Simulation studies demonstrate that the proposed methods perform ell in practical settings. To real examples are provided. Some key ords: Accelerated life model; Censoring; ehan statistic; Linear programming; Rank estimator; Survival data; Weighted log-rank statistic. 1. INTRODUCTION The accelerated failure time model or accelerated life model relates the logarithm of the failure time linearly to the covariates (Kalbfleisch & Prentice, 1980, pp. 32 4; Cox & Oakes, 1984, pp. 64 5). As a result of its direct physical interpretation, this model provides an attractive alternative to the popular Cox (1972) proportional hazards model for the regression analysis of censored failure time data. The presence of censoring in failure time data creates a serious challenge in the semiparametric analysis of the accelerated failure time model. Several semiparametric estimators ere proposed around 1980, Buckley & James (1979) providing a modification of the Donloaded from at Columbia University Libraries on March 27, 2015

2 342 Z. JIN, D. Y. LIN, L. J. WEI AND Z. YIN least-squares estimator to accommodate censoring and Prentice (1978) proposing rank estimators based on the ell-knon eighted log-rank statistics. The asymptotic properties of the Buckley James and rank estimators ere rigorously studied by Ritov (1990), Tsiatis (1990), Lai & Ying (1991a, b) and Ying (1993) among others. Despite the theoretical advances, semiparametric methods for the accelerated failure time model have rarely been used in applications, mainly because of the lack of efficient and reliable computational methods. The existing semiparametric estimating functions are step functions of the regression parameters ith potentially multiple roots, and the corresponding estimators may not be ell defined. Furthermore, analytical evaluations of the covariance matrices of the estimators ould require nonparametric density function estimation. To bypass the variance-covariance estimation, Wei et al. (1990) developed an inference procedure based on the so-called minimum-dispersion statistic. Calculation of the minimum-dispersion statistics and of the semiparametric estimators involves minimisations of discrete objective functions ith potentially multiple local minima. Such minimisation problems cannot be solved by conventional optimisation algorithms. Lin & eyer (1992) suggested a computational method based on simulated annealing (Kirkpatrick et al., 1983), but their algorithm is not guaranteed to yield the true minimum. In the present paper, e provide simple and reliable methods for implementing the aforementioned rank estimators. We first sho that the rank estimator ith the ehan (1965)-type eight function can be readily obtained by minimising a convex objective function through a standard linear programming technique. We then introduce a class of monotone estimating functions to approximate the possibly non-monotone eighted logrank estimating functions around the true values of the regression parameters. These estimating equations are solved through an iterative algorithm ith the ehan-type estimator as the initial value. Each iteration can also be executed via linear programming. This procedure yields a consistent root of the eighted log-rank estimating equation, hich potentially contains inconsistent roots. The covariance matrices for both the ehantype and iterative estimators can be easily estimated by a resampling technique, hich does not involve density estimation. 2. PROPOSED METHODS 2 1. Accelerated failure time model and rank estimators For,...,n, let T be the failure time for the ith subject and let X be the associated i i p-vector of covariates. The accelerated failure time model specifies that log T i =b 0 X i +e i (,...,n), (2 1) here b is a p-vector of unknon regression parameters and e (,...,n)are independent error terms ith a common, but completely unspecified, distribution. 0 i Let C be the censoring time for T. Assume that T and C are independent conditionally i i i i on X. We impose Conditions 1 4 of Ying (1993, p. 80). The data consist of (TB, D, X ) i i i i (,...,n), here TB =T mc and D =1. Here and in the sequel, amb=min(a, b), i i i i {Ti C i } and 1 is the indicator function. {.} Define e (b)=log TB b X,N(b; t)=d 1 i i i i i {ei (b) t} and Y i (b; t)=1 {e i (b)t}. Note that N i and Y are the counting process and at-risk process on the time scale of the residual. Write i S(0)(b; t)=n 1 n Y (b; t), i S(1)(b; t)=n 1 n Y (b; t)x. i i Donloaded from at Columbia University Libraries on March 27, 2015

3 Accelerated failure time model The eighted log-rank estimating function for b takes the form 0 U (b)= n D {b; e(b)}[x X9 {b; e(b)}], i i i i or 343 U (b)= n P 2 (b; t){x X9 (b; t)} dn (b; t), (2 2) i i 2 here X9 (b; t)=s(1)(b; t)/s(0)(b; t), and is a possibly data-dependent eight function satisfying Condition 5 of Ying (1993, p. 90). The choices of =1 and =S(0) correspond to the log-rank (Mantel, 1966) and ehan (1965) statistics, respectively. Let b@ be a root of the estimating function U (b). It has been established that the random vector nd b ) is asymptotically zero-mean normal ith covariance matrix 0 A 1 B A 1, here A = lim n 1 n P 2 (b ; t){x X9 (b ;t)}e2{l<(t)/l(t)} dn (b ; t), 0 i 0 i 0 n2 2 B = lim n 1 n P 2 2(b ; t){x X9 (b ;t)}e2 dn (b ; t), 0 i 0 i 0 n2 2 l(.) is the common hazard function of the error terms, and l<(t)=dl(t)/dt (Tsiatis, 1990; Lai & Ying, 1991b; Ying, 1993). The foregoing result requires that A be nonsingular. This condition is assumed to hold in the present paper. In general, U (b) is neither continuous nor componentise monotone in b. Thus, it is difficult to solve the equation U (b)=0, especially hen b is high-dimensional. In fact, there are potentially multiple solutions to the equation, some of hich are inconsistent. Although one may define the estimator to be a minimiser of the norm of U (b), itisnot easy to locate a minimiser either, because of the discontinuity and non-monotonicity. Another difficulty lies in the variance estimation. Since A involves the derivative of the hazard function, direct evaluation of the covariance matrix ould require nonparametric estimation of the density function, hich cannot be done reliably in practical samples ith censored observations. One might estimate A by the numerical derivative of n 1U ; hoever, this kind of estimator can be highly unstable ehan-type eight function Considerable simplification arises in the special case of (b; t)=s(0)(b; t), hich is referred to as the ehan-type eight function. In this case, U can be ritten as or U (b)= n D S(0){b; e(b)}[x X9 {b; e(b)}], i i i i U (b)=n 1 n n D (X X )1 i i j {ei (b) e j=1 j (b)}, (2 3) hich is monotone in each component of b (Fygenson & Ritov, 1994). It is not difficult to see that (X X )1 i j {ei (b) e j (b)} is the gradient in b of {e i (b) e (b)}, here j Donloaded from at Columbia University Libraries on March 27, 2015

4 344 Z. JIN, D. Y. LIN, L. J. WEI AND Z. YIN a = a 1 {a<0}. Thus, the right-hand side of (2 3) is the gradient of the convex function L (b))n 1 n n D {e (b) e (b)}. i i j j=1 We denote a minimiser of L (b) by b@. Although b@ is not necessarily unique, all the minimisers of L (b) are asymptotically equivalent. The minimisation of L (b) can be carried out by linear programming: e minimise the linear function Wn Wn D u subject to the linear constraints u 0 and j=1 i ij ij u ij {e i (b) e j (b)} (i, j=1,...,n). This type of linear programming has been used extensively in the econometrics literature, especially in connection ith the regression quantiles introduced by Koenker & Bassett (1978), and as previously considered by Lin et al. (1998) for the rank estimation of the accelerated time model for counting processes. The minimisation of L (b) is equivalent to the minimisation of n n D e (b) e (b) + n n D (X X ) K i i j KM b, k l k j=1 k=1 l=1 here M is an extremely large number. The latter type of minimisation can be implemented ith the algorithm of Koenker & D Orey (1987), hich is available in S-Plus and other softare. According to the general asymptotic theory for the rank estimators stated in 2 1, the random vector nd b ) is asymptotically zero-mean normal ith covariance matrix 0 A 1 B A 1, here A and B are A and B evaluated at =. As discussed before, A involves the unknon hazard function, so that it is difficult to estimate the covariance matrix analytically. We ill develop a resampling scheme similar to those of Rao & Zhao (1992), Parzen et al. (1994) and Jin et al. (2001) to approximate the distribution of b@, particularly its variance-covariance matrix. To be specific, e define a ne loss function L *(b))n 1 n n D {e (b) e (b)} Z, i i j i j=1 here Z (,...,n)are independent positive random variables ith E(Z )=var(z )=1, i i i and are independent of the data (TB, D, X )(,...,n).this loss function is essentially i i i a perturbed version of the original L (b), perturbed by random variables Z (,...,n). i Minimisation of L * is also a linear programming problem. Let b@* be a minimiser of L *, that is a root of the estimating function U* (b)) n D S(0){b; e(b)}[x X9 {b; e(b)}]z. i i i i i Note that U* has the same mean and approximately the same variance as U. We sho in the Appendix that the asymptotic distribution of nd b ) can be approximated by 0 the conditional distribution of nd* b@ ) given the data (TB, D,X)(,...,n). i i i Conditional on the data (TB, D, X )(,...,n), the only random elements in L *(b) i i i are the Z s. To approximate the distribution of b@, e produce a large number of realisations of b@* by repeatedly generating the random sample (Z,...,Z ) hile holding the 1 n i data (TB, D, X )(,...,n) at their observed values. The covariance matrix of b@ can i i i then be approximated by the empirical covariance matrix of b@*. Confidence intervals Donloaded from at Columbia University Libraries on March 27, 2015

5 Accelerated failure time model for individual components of b 0 can be obtained from the percentiles of the empirical distribution of b@* or by the Wald method eneral eight functions The efficiency of the ehan statistic and the corresponding rank estimator b@ depends on the censoring distribution. The variance of b@ is minimised if the limit of (t) is proportional to l<(t)/l(t). Thus, it is desirable to use general eight functions. The simple conversion of the ehan estimating function U to an L 1 -type loss function does not extend directly to the general eighted log-rank estimating functions. On the basis of b@, hoever, it is possible to construct eighted ehan-type loss functions, hose minimisers can be readily obtained by the linear programming algorithm. We consider the folloing modification of (2 2): UB (b; b@)) n P 2 y{b@; t+(b b@) X }S(0)(b; t){x X9 (b; t)} dn (b; t), (2 4) i i i 2 here y(b; x)=(b; x)/s(0)(b; x), and b@ is a preliminary consistent estimator of b, b@ say. 0 Clearly, UB (b; b@)= n D y{b@; e(b)+(b b@) X }S(0){b; e(b)}[x i i i i i X9 {b; e(b)}] i =n 1 n n y{b@; e)}d (X X )1 i i i j {ei (b) e j=1 j (b)}. This is the same as (2 3) except for the eights y{b@; e)}, hich are free of b. Thus, i UB (b; b@) is monotone in each component of b and is the gradient of the folloing extension of L (b): L (b; b@))n 1 n n y{b@; e)}d {e (b) e (b)}. i i i j j=1 As in the case of L, minimisation of L (.; b@) can be implemented via linear programming. We propose the folloing iterative algorithm for estimating b : 0 b@ =b@, b@ =arg min L (b; b@ ) (k1). (0) b By comparing (2 2) and (2 4), e see that, if b@ converges to a limit as the number of iterations k2, then the limit must satisfy U (b)=0. In such situations, the proposed algorithm constitutes a numerical method for solving the original estimating equation U (b)=0. The resulting estimator is consistent although the original estimating equation may contain inconsistent roots. In all the simulated and real datasets e have tested, b@ converges as k increases and, for k as small as 3, b@ is already close to the limit. It is important to point out that b@ itself is a legitimate estimator for any k and the convergence of the algorithm is not essential to the validity of the proposed method. We sho in the Appendix that, for any k, the estimator b@ is consistent and asymptotically normal. In fact, it follos from (A 5) in the Appendix that Donloaded from at Columbia University Libraries on March 27, 2015 b@ ={(A +D ) 1D }kb@ +[I {(A +D ) 1D }k]b@ +o p (n D),

6 346 Z. JIN, D. Y. LIN, L. J. WEI AND Z. YIN here I is the p p identity matrix, D = lim n 1 n P y < (t)s(0)(b ; t){x X9 (b ;t)}e2 dn (b ; t), (2 5) 0 0 i 0 i 0 n2 and y< (t) is the derivative of y (t), the limit of y(b ;t)as n2. Thus, b@ is asymptotically a eighted average of b@ and b@, and can be regarded as an approximation to b@ If {(A +D ) 1D }k0 as k2, then the limit of b@ as k2, denoted by b@,is (2) asymptotically equivalent to b@. Note that {(A +D ) 1D }k0 as k2 is equivalent to all the eigenvalues of (A +D ) 1D being less than 1 in absolute value. This condition holds if A is positive definite and D is nonnegative definite. The most commonly used eight functions, including the log-rank, Prentice Wilcoxon (Prentice, 1978) and the more general r class of Harrington & Fleming (1982), are all nonnegative and nonincreasing. For such eight functions, A is indeed positive definite provided that the distribution of X does not concentrate on a (p 1)-dimensional hyperplane (Ying, 1993, pp. 81 2). It is evident from (2 5) that D ill be nonnegative definite if y< (.)0, hich is true of all commonly used eight functions. Therefore, b@ is asymptotically equivalent to b@, at least for most important eight functions. (2) 0 We sho in the Appendix that, under mild conditions, b@ b@ =o (n D) for any k of p order log(n) provided that {(A +D ) 1D }k0 as k2. This implies that, ith a finite number of iterations, b@ is asymptotically equivalent to the consistent roots of the original estimating function U (b). In practice, one ould terminate the iteration process hen the difference beteen successive estimates is less than a certain bound. This convergence criterion may be set on the basis of the observed b@ and its estimated standard error, as demonstrated in the next section. Although b@ can be easily obtained, it is again difficult to estimate the limiting covariance matrix analytically. As in the case of b@, e appeal to the resampling approach. To be specific, e construct the perturbed objective function L *(b; b))n 1 n n y{b; e (b)}d {e (b) e (b)} Z, i i i j i j=1 and define b@* =b@* and b@* =arg min L *(b; b@* )(k1). We sho in the Appendix (0) b that the asymptotic distribution of nd b ) can be approximated by the conditional 0 distribution of nd* b@ ) given the data (TB, D, X )(,...,n). Inference about b i i i 0 can then be carried out on the basis of the empirical distribution of b@*. 3. NUMERICAL RESULTS 3 1. Real examples The results reported here and in 3 2 are based on y=1 and y=1/s(0), hich are referred to as the ehan and log-rank eight functions. For the resampling procedure, e simulated the Z i from the unit exponential distribution and generated 500 realisations. The results ere similar hen other distributions of the Z i ere used. The Wald method as used to construct the confidence intervals. We first applied the proposed methods to a study on multiple myeloma reported by Krall et al. (1975). Out of a total of 65 patients ho ere treated ith alkylating agents, 48 died during the study and 17 survived. This is the main example in the PROC PHRE of the SAS/STAT User s uide (1999, pp , ), hich focused on the Cox model ith haemoglobin, HB, and the logarithm of blood urea nitrogen, BUN, as the Donloaded from at Columbia University Libraries on March 27, 2015

7 Accelerated failure time model covariates. To improve numerical efficiencies, e standardised the to covariates log(bun) and HB in our analysis. The parameter estimates for standardised log(bun) and HB based on the ehan eight function are and ith estimated standard errors of and The corresponding 95% confidence intervals are ( 0 818, 0 246) and ( 0 039, 0 623). We used the iterative algorithm to obtain the estimates based on the log-rank eight function. The results from the ehan eight function suggested that it is not necessary to have accurate numbers beyond the second decimal point. Thus, e set the convergence criterion for the iteration to be 0 01 beteen successive estimates. Based on this criterion, convergence as achieved at the third iteration. The resulting estimates are and ith 95% confidence intervals ( 0 880, 0 130) and ( 0 017, 0 553). The estimates are identical up to the eighth decimal point after 5 iterations. For further illustration, e considered the ell-knon Mayo primary biliary cirrhosis data (Fleming & Harrington, 1991, Appendix D.1). The database contains information about the survival time and prognostic factors for 418 patients. The data ere used by Dickson et al. (1989) to build a Cox proportional hazards model for the natural history of the disease ith five covariates, age, log(albumin), log(bilirubin), oedema and log(protime); see Fleming & Harrington (1991, Table 4.6.3). We considered the same set of covariates in our analysis. We again standardised the covariates. The estimates based on the ehan eight function are 0 270, 0 204, 0 593, and ith estimated standard errors of 0 062, 0 069, 0 071, and These results also suggested that the convergence criterion of 0 01 can be used for calculating the log-rank-type estimates. This criterion as again met at the third iteration. The estimates ere identical up to the eighth decimal point after 12 iterations. We display in Table 1 the results for the unstandardised covariates. Although the conclusions are not qualitatively different, the analysis under the accelerated failure time model provides an alternative and more direct interpretation of the effects of covariates on the survival time, as compared to the Cox regression analysis. The results based on the to eight functions are fairly similar. The point estimates for the log-rank eight function are similar to those of Lin & eyer (1992), hich ere obtained by a simulated annealing algorithm. Table 1. Accelerated failure time regression for the Mayo primary biliary cirrhosis data ehan eight function Log-rank eight function Parameter Est SE 95% CI Est SE 95% CI Age ( 0 037, 0 014) ( 0 035, 0 018) log(albumin) (0 542, 2 639) (0 934, 2 378) log(bilirubin) ( 0 716, 0 442) ( 0 674, 0 496) Oedema ( 1 421, 0 336) ( 1 083, 0 385) log(protime) ( 4 549, 0 987) ( 2 850, 1 038) Est, estimate; SE, standard error; CI, confidence interval. 347 Donloaded from at Columbia University Libraries on March 27, Simulation studies Extensive simulation studies ere conducted to assess the operating characteristics of the proposed methods in practical settings. One series of the experiments mimicked the foregoing multiple myeloma study. The failure times ere generated from the model log T = log(bun) HB+e, (3 1)

8 348 Z. JIN, D. Y. LIN, L. J. WEI AND Z. YIN here e is a normal random variable ith mean 0 and standard deviation The regression coefficients and the standard deviation of e in model (3 1) ere estimated from the parametric accelerated failure time model. The censoring times ere generated from the Un(0, t) distribution, here t as chosen to yield a desired level of censoring. We first examined the difference beteen log-rank-type estimates obtained after a fixed number of iterations versus those obtained after convergence. We generated 1000 datasets ith sample size of 65 from model (3 1). The standardised covariate values from the original myeloma study ere used. The algorithm as considered convergent if the difference beteen to consecutive estimates as less than ; convergence as usually achieved ithin 10 iterations. Figure 1(a) plots the estimates obtained after convergence versus those obtained after three iterations for the estimation of the first regression coefficient under 20% censoring; the to sets of estimates are clearly very similar. Figure 1(b) displays the corresponding plot for the log-rank estimates versus the ehan estimates, the to of hich are noticeably different. Thus, the phenomenon in Fig.1(a) is not an artifact of the closeness of the ehan and log-rank estimates. The plots for the estimation of the second regression coefficient and for other censoring levels revealed similar results. The values of the log-rank estimating functions at convergence versus after three iterations ere similar: under 20% censoring, the medians of the L 1 -norms of n 1UB among the 1000 datasets ere at convergence and after three iterations; the corresponding maxima ere 0 07 and These results demonstrated that a small number of iterations is sufficient. Log-rank-type est. after conv (a) Log-rank-type est. after 3 iter. Log-rank-type est. after conv (b) ehan-type est. Fig. 1. Comparisons of different estimates of b 01 under 20% censoring: (a) log-rank-type estimates after convergence versus log-rank-type estimates after three iterations; (b) log-rank-type estimates after convergence versus ehan-type estimates. Donloaded from at Columbia University Libraries on March 27, 2015 Most of the simulation studies ere concerned ith the performance of the proposed inference procedures based on the ehan-type estimator and the log-rank-type estimator ith three iterations. We considered sample sizes of 65 and 130. For n=65, the standardised covariate values from the multiple myeloma dataset ere used; for n=130, each covariate value in the original dataset as duplicated. For each configuration of the simulation parameters, e generated 500 datasets ith the same set of covariate values. The results of the simulation studies are summarised in Table 2. The parameter estimators

9 Accelerated failure time model 349 Table 2. Summary statistics for the simulation studies: regression coeycients b 01 and for log(bun) and HB, respectively n Censoring Weight Parameter Bias SE SEE 90% CP 95% CP 65 0% ehan b Log-rank b % ehan b Log-rank b % ehan b Log-rank b % ehan b Log-rank b % ehan b Log-rank b % ehan b Log-rank b Bias, bias of the estimator; SE, standard error of the estimator; SEE, mean of the standard error estimator; 90% CP, coverage probability of the 90% Wald-type confidence interval; 95% CP, coverage probability of the 95% Wald-type confidence interval appear to be virtually unbiased. The standard error estimators reflect ell the true variabilities of the parameter estimators, and the corresponding confidence intervals have satisfactory coverage probabilities. 4. REMARKS Fygenson & Ritov (1994) characterised the class of eight functions in (2 2) hich yields monotone estimating functions. The class is quite restrictive and involves the censoring time distribution. By contrast, the class of monotone estimating functions developed in the present paper is very broad and yields consistent roots of the original eighted logrank estimating equations. On the basis of (2 4), e can obtain a one-step approximation to b@ for any eight function, as follos. Define Donloaded from at Columbia University Libraries on March 27, 2015 ya (b; x)=s(0)(b; x) 1(b; x)i D A 1, and replace y{b@; t+(b b@) X i } in (2 4) by ya {b@ ;t+(b b@ ) X i }. It then follos from (A 4) in the Appendix that the resulting estimator is asymptotically equivalent to b@.to

10 350 Z. JIN, D. Y. LIN, L. J. WEI AND Z. YIN implement this one-step estimator, one ould need to estimate A and D. A consistent estimator of A can be easily obtained because A 1 B A 1 and B can be consistently estimated from the data. The estimation of D ould require an estimator of l(.), hich can be obtained either nonparametrically or parametrically. The proposed estimators do not achieve the semiparametric efficiency bound. By adopting the approach of Lai & Ying (1991b, 3), one can construct data-dependent eight functions hich yield asymptotically efficient estimators provided that the iterative procedure converges. ACKNOWLEDEMENT This research as supported by the National Institutes of Health, the National Science Foundation and the Ne York City Council Speaker s Fund for Public Health Research. The authors thank the revieers for their helpful comments. Softare implementing the proposed methods is available from the authors. APPENDIX Derivation of asymptotic results Asymptotic properties of b@*. Both n 1L and n 1L * are convex functions and, by the strong la of large numbers, converge almost surely to the same limiting function. The first derivative of the limiting function is zero at b and its second derivative at b is A, hich is nonsingular. Thus, 0 0 this function has a unique minimiser b. It then follos from convex analysis that b@ b and 0 0 b@* b almost surely. The latter convergence is ith respect to the joint probability space of 0 (TB, D, X )(,...,n) and Z (,...,n). We shall use F to denote the s-field generated by i i i i the original data. It follos from Theorem 2 of Ying (1993) that almost surely By similar arguments, U )=U (b 0 )+na )+o(nd+ndb@ d). (A 1) U* * )=U* )+na * b@ )+o(nd+ndb@* b@ d) (A 2) almost surely. The functions U* and U have the same asymptotic slope matrix A in (A 1) and (A 2) because E{U* (b) F}=U (b) and b@ and b@* are consistent. We have that U* )=U* ) U )+o(nd) since b@ is a root of U (b). Thus n DU* )=n D n P 2 S(0) ; t){x X9 ; t)} dn ;t)(z 1)+o(1). i i i 2 Conditionally on F, the right-hand side of the above display is a normalised sum of independent zero-mean random vectors. Since its conditional covariance matrix converges almost surely to B, the multivariate central limit theorem implies that n DU* ) converges in distribution to N(0, B ) almost surely. It then follos from (A 2) that the conditional distribution of nd* b@ ) given F converges almost surely to N(0, A 1 B A 1 ), hich is the limiting distribution of b ). 0 Asymptotic properties of b@. We first sho that b@ is strongly consistent for any given k. Recall that b@ =b@. As a result of the strong consistency of b@, the function n 1L (b; b@ ) (0) (0) (0) converges almost surely to lim n 1L (b; b ), hich reaches its minimum at b=b. Since 0 0 n 1L (b; b@ ) is convex, b@ must converge to b almost surely. Applying these arguments successively for k=2,3,...,e see that b@ b almost surely as n2 for all k. 0 (0) (1) 0 To avoid possible tail instabilities, e assume that, for any b and g hich converge almost n n Donloaded from at Columbia University Libraries on March 27, 2015

11 surely to b 0 and 0, respectively, Accelerated failure time model y(b n ;t+g n )=y(b n ;t)+y< 0 (t)g n +o(n D+g n ) uniformly in t. This condition holds for virtually all the existing eight functions for t aay from the endpoint. For it to hold for all t, e typically ould have to use hich is truncated at the tail, that is (t)=0 for t near the endpoint. Under this condition and Conditions 1 5 of Ying (1993), e can expand UB (b; b@ ) to yield 351 UB (b; b@ )= n P ; t)s(0)(b; t){x i X9 (b; t)} dn i (b; t) + n P y < (t)s(0)(b; t){x X9 (b; t)}x dn (b; t)(b b@ ) 0 i i i +o(nd+ndb b@ d) as n2 and bb 0. By Theorem 2 of Ying (1993), the first term on the right-hand side of the above display has the linear expansion U (b 0 )+na (b )+o(nd+ndb d). Thus, UB ; b@ )=U (b 0 )+n(a +D ) ) nd ) +o(nd+ndb@ d+ndb@ d). Suppose that A,A and A +D are nonsingular. By combining (A 1) and (A 3) ith k=1, e obtain nd (1) )= n D(A +D ) 1{U (b 0 )+D A 1 U (b 0 )} +o(1+nd db@ (1) d+nddb@ d). It then follos from recursive use of (A 3) that in general or nd b )= n D k {(A +D ) 1D }j 1(A +D ) 1U (b ) 0 0 j=1 n D{(A +D ) 1D }ka 1 U (b 0 )+o 1+nD k db@ b d A B (j) 0 j=0 nd b )= n D[I {(A +D ) 1D }k]a 1 0 U (b 0 ) n D{(A +D ) 1D }ka 1 U (b 0 )+o 1+nD k db@ b d A (j) 0 j=0 B. (A 5) The asymptotic normality of nd b ) then follos from that of n DU (b ) and n DU (b ) Suppose that is truncated at the tail and the error density is sufficiently smooth. Then it follos from equation (4.5) of Lai & Ying (1991b) that, for some g>0 and c>0, (A 3) holds ith o(nd g) as the remainder term uniformly for b@ and b@ in an O(n D+c) neighbourhood of b. 0 Thus, if {(A +D ) 1D }k0 as k2, then all the b@ (j k) stay in an O(n D+c) neighbourhood (j) of b so that the remainder term in (A 5) becomes o(kn g)=o(1) provided that k2 and 0 k=o(log n). Hence, nd b )=nd b )+o (1) for k2 and k=o(log n). 0 0 p Asymptotic properties of b@*. We impose the same regularity conditions as in the previous section. The convexity arguments and the strong la of large numbers can again be employed to establish the strong consistency of b@*. By definition, b@* is a root of (A 3) (A 4) Donloaded from at Columbia University Libraries on March 27, 2015 UB * (b; b@* ) ) n P ;t+(b b@* ) X i }S(0)(b; t){x i X9 (b; t)} dn i (b; t)z i.

12 352 Z. JIN, D. Y. LIN, L. J. WEI AND Z. YIN By asymptotic linear expansions similar to those used in establishing (A 3), e have UB * * ; b@* )= n P ;t+ b@ ) X }S(0) ; t){x X9 ; t)} dn ; t)z i i i i +n(a +D )* b@ ) nd * b@ )+d*, (A 6) n here d* n =o(nd+n Wk j=0 db@ (j) d+n Wk j=0 db@* (j) d). We can replace the Z i in the first term on the right-hand side of (A 6) by Z i 1 since b@ is a root of UB (b; b@ ). With this replacement, the first term has mean 0 conditionally on F, so that the integral can be approximated by (b 0 ; t){x i X9 (b 0 ; t)} dn i (b 0 ; t). Thus, hich yields Therefore, UB * * ; b@* )= n nd* b@ )= n D(A +D ) 1 n P (b 0 ; t){x i X9 (b 0 ; t)} dn i (b 0 ;t)(z i 1) +n(a +D )* b@ ) nd * b@ )+d* n, P (b 0 ; t){x i X9 (b 0 ; t)} dn i (b 0 ;t)(z i 1) +(A +D ) 1D nd* b@ )+n Dd* n. nd* b@ )= n D [I {(A +D ) 1D }k]a 1 n P (b 0 ; t){x i X9 (b 0 ; t)} dn i (b 0 ;t)(z i 1) n D{(A +D ) 1D }ka 1 n P S(0)(b 0 ; t){x i X9 (b 0 ; t)} dn i (b 0 ;t)(z i 1)+n Dd* n. Comparing the above equation ith (A 5), e see that the conditional distribution of nd* b@ ) given F converges almost surely to the limiting distribution of nd b ). 0 Under the additional conditions stated in the last paragraph of the previous section, (A 6) can be strengthened ith d* =o(n j) for some j>0. Hence, for k2 and k=o(log n), n nd* b@ ) has the same asymptotic distribution as nd b ). 0 REFERENCES BUCKLEY, J. & JAMES, I. (1979). Linear regression ith censored data. Biometrika 66, COX, D. R. (1972). Regression models and life-tables (ith Discussion). J. R. Statist. Soc. B 34, COX, D. R. & OAKES, D. (1984). Analysis of Survival Data. London: Chapman and Hall. DICKSON, E. R., RAMBSCH, P. M., FLEMIN, T. R., FISHER, L. D. & LANWORTHY, A. (1989). Prognosis in primary biliary cirrhosis: model for decision making. Hepatology 10, 1 7. FLEMIN, T. R. & HARRINTON, D. P. (1991). Counting Processes and Survival Analysis. Ne York: Wiley. FYENSON, M. & RITOV, Y. (1994). Monotone estimating equations for censored data. Ann. Statist. 22, EHAN, E. A. (1965). A generalized Wilcoxon test for comparing arbitrarily single-censored samples. Biometrika 52, HARRINTON, D. P. & FLEMIN, T. R. (1982). A class of rank test procedures for censored survival data. Biometrika 69, JIN, Z., YIN, Z. & WEI, L. J. (2001). A simple resampling method by perturbing the minimand. Biometrika 88, KALBFIEISCH, J. D. & PRENTICE, R. L. (1980). T he Statistical Analysis of Failure T ime Data. Ne York: Wiley. KIRKPATRICK, S., ELATT, C. D. & VECCHI, M. P. (1983). Optimisation by simulated annealing. Science 220, Donloaded from at Columbia University Libraries on March 27, 2015

13 Accelerated failure time model KOENKER, R. & BASSETT,. S. (1978). Regression quantiles. Econometrica 46, KOENKER, R. & D OREY, V. (1987). Computing regression quantiles. Appl. Statist. 36, KRALL, J. M., UTHOFF, V. A. & HARLEY, J. B. (1975). A step-up procedure for selecting variables associated ith survival. Biometrics 31, LAI, T. L. & YIN, Z. (1991a). Large sample theory of a modified Buckley James estimator for regression analysis ith censored data. Ann. Statist. 19, LAI, T. L. & YIN, Z. (1991b). Rank regression methods for left-truncated and right-censored data. Ann. Statist. 19, LIN, D. Y. & EYER, C. J. (1992). Computational methods for semiparametric linear regression ith censored data. J. Comp. raph. Statist. 1, LIN, D. Y., WEI, L. J. & YIN, Z. (1998). Accelerated failure time models for counting processes. Biometrika 85, MANTEL, N. (1966). Evaluation of survival data and to ne rank order statistics arising in its considerations. Cancer Chemo. Rep. 50, PARZEN, M. I., WEI, L. J. & YIN, Z. (1994). A resampling method based on pivotal estimating functions. Biometrika 81, PRENTICE, R. L. (1978). Linear rank tests ith right censored data. Biometrika 65, RAO, C. R. & ZHAO, L. C. (1992). Approximation to the distribution of M-estimates in linear models by randomly eighted bootstrap. Sankhya: A 54, RITOV, Y. (1990). Estimation in a linear regression model ith censored data. Ann. Statist. 18, SAS INSTITUTE, INC. (1999). SAS/STAT User s uide, Version 8. Cary, NC: SAS Institute Inc. TSIATIS, A. A. (1990). Estimating regression parameters using linear rank tests for censored data. Ann. Statist. 18, WEI, L. J., YIN, Z. & LIN, D. Y. (1990). Linear regression analysis of censored survival data based on rank tests. Biometrika 77, YIN, Z. (1993). A large sample study of rank estimation for censored regression data. Ann. Statist. 21, [Received August Revised November 2002] 353 Donloaded from at Columbia University Libraries on March 27, 2015

On least-squares regression with censored data

On least-squares regression with censored data Biometrika (2006), 93, 1, pp. 147 161 2006 Biometrika Trust Printed in Great Britain On least-squares regression with censored data BY ZHEZHEN JIN Department of Biostatistics, Columbia University, New

More information

Statistical inference based on non-smooth estimating functions

Statistical inference based on non-smooth estimating functions Biometrika (2004), 91, 4, pp. 943 954 2004 Biometrika Trust Printed in Great Britain Statistical inference based on non-smooth estimating functions BY L. TIAN Department of Preventive Medicine, Northwestern

More information

Tests of independence for censored bivariate failure time data

Tests of independence for censored bivariate failure time data Tests of independence for censored bivariate failure time data Abstract Bivariate failure time data is widely used in survival analysis, for example, in twins study. This article presents a class of χ

More information

LSS: An S-Plus/R program for the accelerated failure time model to right censored data based on least-squares principle

LSS: An S-Plus/R program for the accelerated failure time model to right censored data based on least-squares principle computer methods and programs in biomedicine 86 (2007) 45 50 journal homepage: www.intl.elsevierhealth.com/journals/cmpb LSS: An S-Plus/R program for the accelerated failure time model to right censored

More information

Rank Regression Analysis of Multivariate Failure Time Data Based on Marginal Linear Models

Rank Regression Analysis of Multivariate Failure Time Data Based on Marginal Linear Models doi: 10.1111/j.1467-9469.2005.00487.x Published by Blacwell Publishing Ltd, 9600 Garsington Road, Oxford OX4 2DQ, UK and 350 Main Street, Malden, MA 02148, USA Vol 33: 1 23, 2006 Ran Regression Analysis

More information

Least Absolute Deviations Estimation for the Accelerated Failure Time Model. University of Iowa. *

Least Absolute Deviations Estimation for the Accelerated Failure Time Model. University of Iowa. * Least Absolute Deviations Estimation for the Accelerated Failure Time Model Jian Huang 1,2, Shuangge Ma 3, and Huiliang Xie 1 1 Department of Statistics and Actuarial Science, and 2 Program in Public Health

More information

Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika.

Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika. Biometrika Trust A Simple Resampling Method by Perturbing the Minimand Author(s): Zhezhen Jin, Zhiliang Ying and L. J. Wei Source: Biometrika, Vol. 88, No. 2 (Jun., 2001), pp. 381-390 Published by: Biometrika

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 24 Paper 153 A Note on Empirical Likelihood Inference of Residual Life Regression Ying Qing Chen Yichuan

More information

AFT Models and Empirical Likelihood

AFT Models and Empirical Likelihood AFT Models and Empirical Likelihood Mai Zhou Department of Statistics, University of Kentucky Collaborators: Gang Li (UCLA); A. Bathke; M. Kim (Kentucky) Accelerated Failure Time (AFT) models: Y = log(t

More information

LEAST ABSOLUTE DEVIATIONS ESTIMATION FOR THE ACCELERATED FAILURE TIME MODEL

LEAST ABSOLUTE DEVIATIONS ESTIMATION FOR THE ACCELERATED FAILURE TIME MODEL Statistica Sinica 17(2007), 1533-1548 LEAST ABSOLUTE DEVIATIONS ESTIMATION FOR THE ACCELERATED FAILURE TIME MODEL Jian Huang 1, Shuangge Ma 2 and Huiliang Xie 1 1 University of Iowa and 2 Yale University

More information

STAT 331. Accelerated Failure Time Models. Previously, we have focused on multiplicative intensity models, where

STAT 331. Accelerated Failure Time Models. Previously, we have focused on multiplicative intensity models, where STAT 331 Accelerated Failure Time Models Previously, we have focused on multiplicative intensity models, where h t z) = h 0 t) g z). These can also be expressed as H t z) = H 0 t) g z) or S t z) = e Ht

More information

Published online: 10 Apr 2012.

Published online: 10 Apr 2012. This article was downloaded by: Columbia University] On: 23 March 215, At: 12:7 Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 172954 Registered office: Mortimer

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

FULL LIKELIHOOD INFERENCES IN THE COX MODEL October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach

More information

On the Breslow estimator

On the Breslow estimator Lifetime Data Anal (27) 13:471 48 DOI 1.17/s1985-7-948-y On the Breslow estimator D. Y. Lin Received: 5 April 27 / Accepted: 16 July 27 / Published online: 2 September 27 Springer Science+Business Media,

More information

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA Kasun Rathnayake ; A/Prof Jun Ma Department of Statistics Faculty of Science and Engineering Macquarie University

More information

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model Other Survival Models (1) Non-PH models We briefly discussed the non-proportional hazards (non-ph) model λ(t Z) = λ 0 (t) exp{β(t) Z}, where β(t) can be estimated by: piecewise constants (recall how);

More information

Approximate Median Regression via the Box-Cox Transformation

Approximate Median Regression via the Box-Cox Transformation Approximate Median Regression via the Box-Cox Transformation Garrett M. Fitzmaurice,StuartR.Lipsitz, and Michael Parzen Median regression is used increasingly in many different areas of applications. The

More information

Estimation and Inference of Quantile Regression. for Survival Data under Biased Sampling

Estimation and Inference of Quantile Regression. for Survival Data under Biased Sampling Estimation and Inference of Quantile Regression for Survival Data under Biased Sampling Supplementary Materials: Proofs of the Main Results S1 Verification of the weight function v i (t) for the lengthbiased

More information

Semiparametric analysis of transformation models with censored data

Semiparametric analysis of transformation models with censored data Biometrika (22), 89, 3, pp. 659 668 22 Biometrika Trust Printed in Great Britain Semiparametric analysis of transformation models with censored data BY KANI CHEN Department of Mathematics, Hong Kong University

More information

Quantile Regression for Residual Life and Empirical Likelihood

Quantile Regression for Residual Life and Empirical Likelihood Quantile Regression for Residual Life and Empirical Likelihood Mai Zhou email: mai@ms.uky.edu Department of Statistics, University of Kentucky, Lexington, KY 40506-0027, USA Jong-Hyeon Jeong email: jeong@nsabp.pitt.edu

More information

STAT331. Cox s Proportional Hazards Model

STAT331. Cox s Proportional Hazards Model STAT331 Cox s Proportional Hazards Model In this unit we introduce Cox s proportional hazards (Cox s PH) model, give a heuristic development of the partial likelihood function, and discuss adaptations

More information

Power and Sample Size Calculations with the Additive Hazards Model

Power and Sample Size Calculations with the Additive Hazards Model Journal of Data Science 10(2012), 143-155 Power and Sample Size Calculations with the Additive Hazards Model Ling Chen, Chengjie Xiong, J. Philip Miller and Feng Gao Washington University School of Medicine

More information

SIMULTANEOUS CONFIDENCE BANDS FOR THE PTH PERCENTILE AND THE MEAN LIFETIME IN EXPONENTIAL AND WEIBULL REGRESSION MODELS. Ping Sa and S.J.

SIMULTANEOUS CONFIDENCE BANDS FOR THE PTH PERCENTILE AND THE MEAN LIFETIME IN EXPONENTIAL AND WEIBULL REGRESSION MODELS. Ping Sa and S.J. SIMULTANEOUS CONFIDENCE BANDS FOR THE PTH PERCENTILE AND THE MEAN LIFETIME IN EXPONENTIAL AND WEIBULL REGRESSION MODELS " # Ping Sa and S.J. Lee " Dept. of Mathematics and Statistics, U. of North Florida,

More information

Smoothed and Corrected Score Approach to Censored Quantile Regression With Measurement Errors

Smoothed and Corrected Score Approach to Censored Quantile Regression With Measurement Errors Smoothed and Corrected Score Approach to Censored Quantile Regression With Measurement Errors Yuanshan Wu, Yanyuan Ma, and Guosheng Yin Abstract Censored quantile regression is an important alternative

More information

y(x) = x w + ε(x), (1)

y(x) = x w + ε(x), (1) Linear regression We are ready to consider our first machine-learning problem: linear regression. Suppose that e are interested in the values of a function y(x): R d R, here x is a d-dimensional vector-valued

More information

CHAPTER 3 THE COMMON FACTOR MODEL IN THE POPULATION. From Exploratory Factor Analysis Ledyard R Tucker and Robert C. MacCallum

CHAPTER 3 THE COMMON FACTOR MODEL IN THE POPULATION. From Exploratory Factor Analysis Ledyard R Tucker and Robert C. MacCallum CHAPTER 3 THE COMMON FACTOR MODEL IN THE POPULATION From Exploratory Factor Analysis Ledyard R Tucker and Robert C. MacCallum 1997 19 CHAPTER 3 THE COMMON FACTOR MODEL IN THE POPULATION 3.0. Introduction

More information

Efficiency Comparison Between Mean and Log-rank Tests for. Recurrent Event Time Data

Efficiency Comparison Between Mean and Log-rank Tests for. Recurrent Event Time Data Efficiency Comparison Between Mean and Log-rank Tests for Recurrent Event Time Data Wenbin Lu Department of Statistics, North Carolina State University, Raleigh, NC 27695 Email: lu@stat.ncsu.edu Summary.

More information

Efficient Estimation of Censored Linear Regression Model

Efficient Estimation of Censored Linear Regression Model 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 24 25 26 27 28 29 3 3 32 33 34 35 36 37 38 39 4 4 42 43 44 45 46 47 48 Biometrika (2), xx, x, pp. 4 C 28 Biometrika Trust Printed in Great Britain Efficient Estimation

More information

A Resampling Method on Pivotal Estimating Functions

A Resampling Method on Pivotal Estimating Functions A Resampling Method on Pivotal Estimating Functions Kun Nie Biostat 277,Winter 2004 March 17, 2004 Outline Introduction A General Resampling Method Examples - Quantile Regression -Rank Regression -Simulation

More information

Competing risks data analysis under the accelerated failure time model with missing cause of failure

Competing risks data analysis under the accelerated failure time model with missing cause of failure Ann Inst Stat Math 2016 68:855 876 DOI 10.1007/s10463-015-0516-y Competing risks data analysis under the accelerated failure time model with missing cause of failure Ming Zheng Renxin Lin Wen Yu Received:

More information

log T = β T Z + ɛ Zi Z(u; β) } dn i (ue βzi ) = 0,

log T = β T Z + ɛ Zi Z(u; β) } dn i (ue βzi ) = 0, Accelerated failure time model: log T = β T Z + ɛ β estimation: solve where S n ( β) = n i=1 { Zi Z(u; β) } dn i (ue βzi ) = 0, Z(u; β) = j Z j Y j (ue βz j) j Y j (ue βz j) How do we show the asymptotics

More information

ST745: Survival Analysis: Nonparametric methods

ST745: Survival Analysis: Nonparametric methods ST745: Survival Analysis: Nonparametric methods Eric B. Laber Department of Statistics, North Carolina State University February 5, 2015 The KM estimator is used ubiquitously in medical studies to estimate

More information

A TWO-STAGE LINEAR MIXED-EFFECTS/COX MODEL FOR LONGITUDINAL DATA WITH MEASUREMENT ERROR AND SURVIVAL

A TWO-STAGE LINEAR MIXED-EFFECTS/COX MODEL FOR LONGITUDINAL DATA WITH MEASUREMENT ERROR AND SURVIVAL A TWO-STAGE LINEAR MIXED-EFFECTS/COX MODEL FOR LONGITUDINAL DATA WITH MEASUREMENT ERROR AND SURVIVAL Christopher H. Morrell, Loyola College in Maryland, and Larry J. Brant, NIA Christopher H. Morrell,

More information

Analysis of least absolute deviation

Analysis of least absolute deviation Analysis of least absolute deviation By KANI CHEN Department of Mathematics, HKUST, Kowloon, Hong Kong makchen@ust.hk ZHILIANG YING Department of Statistics, Columbia University, NY, NY, 10027, U.S.A.

More information

UNIVERSITY OF CALIFORNIA, SAN DIEGO

UNIVERSITY OF CALIFORNIA, SAN DIEGO UNIVERSITY OF CALIFORNIA, SAN DIEGO Estimation of the primary hazard ratio in the presence of a secondary covariate with non-proportional hazards An undergraduate honors thesis submitted to the Department

More information

Simple and Fast Overidentified Rank Estimation for Right-Censored Length-Biased Data and Backward Recurrence Time

Simple and Fast Overidentified Rank Estimation for Right-Censored Length-Biased Data and Backward Recurrence Time Biometrics 74, 77 85 March 2018 DOI: 10.1111/biom.12727 Simple and Fast Overidentified Rank Estimation for Right-Censored Length-Biased Data and Backward Recurrence Time Yifei Sun, 1, * Kwun Chuen Gary

More information

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Takeshi Emura and Hisayuki Tsukuma Abstract For testing the regression parameter in multivariate

More information

Adaptive Noise Cancellation

Adaptive Noise Cancellation Adaptive Noise Cancellation P. Comon and V. Zarzoso January 5, 2010 1 Introduction In numerous application areas, including biomedical engineering, radar, sonar and digital communications, the goal is

More information

Power-Transformed Linear Quantile Regression With Censored Data

Power-Transformed Linear Quantile Regression With Censored Data Power-Transformed Linear Quantile Regression With Censored Data Guosheng YIN, Donglin ZENG, and Hui LI We propose a class of power-transformed linear quantile regression models for survival data subject

More information

Semiparametric Regression

Semiparametric Regression Semiparametric Regression Patrick Breheny October 22 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/23 Introduction Over the past few weeks, we ve introduced a variety of regression models under

More information

Analysis of transformation models with censored data

Analysis of transformation models with censored data Biometrika (1995), 82,4, pp. 835-45 Printed in Great Britain Analysis of transformation models with censored data BY S. C. CHENG Department of Biomathematics, M. D. Anderson Cancer Center, University of

More information

Efficiency of Profile/Partial Likelihood in the Cox Model

Efficiency of Profile/Partial Likelihood in the Cox Model Efficiency of Profile/Partial Likelihood in the Cox Model Yuichi Hirose School of Mathematics, Statistics and Operations Research, Victoria University of Wellington, New Zealand Summary. This paper shows

More information

Quantile Regression for Recurrent Gap Time Data

Quantile Regression for Recurrent Gap Time Data Biometrics 000, 1 21 DOI: 000 000 0000 Quantile Regression for Recurrent Gap Time Data Xianghua Luo 1,, Chiung-Yu Huang 2, and Lan Wang 3 1 Division of Biostatistics, School of Public Health, University

More information

A comparison study of the nonparametric tests based on the empirical distributions

A comparison study of the nonparametric tests based on the empirical distributions 통계연구 (2015), 제 20 권제 3 호, 1-12 A comparison study of the nonparametric tests based on the empirical distributions Hyo-Il Park 1) Abstract In this study, we propose a nonparametric test based on the empirical

More information

Introduction to Statistical Analysis

Introduction to Statistical Analysis Introduction to Statistical Analysis Changyu Shen Richard A. and Susan F. Smith Center for Outcomes Research in Cardiology Beth Israel Deaconess Medical Center Harvard Medical School Objectives Descriptive

More information

Working Paper No Maximum score type estimators

Working Paper No Maximum score type estimators Warsaw School of Economics Institute of Econometrics Department of Applied Econometrics Department of Applied Econometrics Working Papers Warsaw School of Economics Al. iepodleglosci 64 02-554 Warszawa,

More information

Double Bootstrap Confidence Interval Estimates with Censored and Truncated Data

Double Bootstrap Confidence Interval Estimates with Censored and Truncated Data Journal of Modern Applied Statistical Methods Volume 13 Issue 2 Article 22 11-2014 Double Bootstrap Confidence Interval Estimates with Censored and Truncated Data Jayanthi Arasan University Putra Malaysia,

More information

Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective. Anastasios (Butch) Tsiatis and Xiaofei Bai

Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective. Anastasios (Butch) Tsiatis and Xiaofei Bai Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective Anastasios (Butch) Tsiatis and Xiaofei Bai Department of Statistics North Carolina State University 1/35 Optimal Treatment

More information

Sample size calculations for logistic and Poisson regression models

Sample size calculations for logistic and Poisson regression models Biometrika (2), 88, 4, pp. 93 99 2 Biometrika Trust Printed in Great Britain Sample size calculations for logistic and Poisson regression models BY GWOWEN SHIEH Department of Management Science, National

More information

Linear life expectancy regression with censored data

Linear life expectancy regression with censored data Linear life expectancy regression with censored data By Y. Q. CHEN Program in Biostatistics, Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, U.S.A.

More information

Attributable Risk Function in the Proportional Hazards Model

Attributable Risk Function in the Proportional Hazards Model UW Biostatistics Working Paper Series 5-31-2005 Attributable Risk Function in the Proportional Hazards Model Ying Qing Chen Fred Hutchinson Cancer Research Center, yqchen@u.washington.edu Chengcheng Hu

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 01: Introduction and Overview

Introduction to Empirical Processes and Semiparametric Inference Lecture 01: Introduction and Overview Introduction to Empirical Processes and Semiparametric Inference Lecture 01: Introduction and Overview Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations

More information

Survival Analysis I (CHL5209H)

Survival Analysis I (CHL5209H) Survival Analysis Dalla Lana School of Public Health University of Toronto olli.saarela@utoronto.ca January 7, 2015 31-1 Literature Clayton D & Hills M (1993): Statistical Models in Epidemiology. Not really

More information

Survival Analysis for Case-Cohort Studies

Survival Analysis for Case-Cohort Studies Survival Analysis for ase-ohort Studies Petr Klášterecký Dept. of Probability and Mathematical Statistics, Faculty of Mathematics and Physics, harles University, Prague, zech Republic e-mail: petr.klasterecky@matfyz.cz

More information

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California Texts in Statistical Science Bayesian Ideas and Data Analysis An Introduction for Scientists and Statisticians Ronald Christensen University of New Mexico Albuquerque, New Mexico Wesley Johnson University

More information

MAS3301 / MAS8311 Biostatistics Part II: Survival

MAS3301 / MAS8311 Biostatistics Part II: Survival MAS3301 / MAS8311 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-10 1 13 The Cox proportional hazards model 13.1 Introduction In the

More information

GROUPED SURVIVAL DATA. Florida State University and Medical College of Wisconsin

GROUPED SURVIVAL DATA. Florida State University and Medical College of Wisconsin FITTING COX'S PROPORTIONAL HAZARDS MODEL USING GROUPED SURVIVAL DATA Ian W. McKeague and Mei-Jie Zhang Florida State University and Medical College of Wisconsin Cox's proportional hazard model is often

More information

Bivariate Uniqueness in the Logistic Recursive Distributional Equation

Bivariate Uniqueness in the Logistic Recursive Distributional Equation Bivariate Uniqueness in the Logistic Recursive Distributional Equation Antar Bandyopadhyay Technical Report # 629 University of California Department of Statistics 367 Evans Hall # 3860 Berkeley CA 94720-3860

More information

Survival Analysis Math 434 Fall 2011

Survival Analysis Math 434 Fall 2011 Survival Analysis Math 434 Fall 2011 Part IV: Chap. 8,9.2,9.3,11: Semiparametric Proportional Hazards Regression Jimin Ding Math Dept. www.math.wustl.edu/ jmding/math434/fall09/index.html Basic Model Setup

More information

Simulation-based robust IV inference for lifetime data

Simulation-based robust IV inference for lifetime data Simulation-based robust IV inference for lifetime data Anand Acharya 1 Lynda Khalaf 1 Marcel Voia 1 Myra Yazbeck 2 David Wensley 3 1 Department of Economics Carleton University 2 Department of Economics

More information

Censored quantile regression with varying coefficients

Censored quantile regression with varying coefficients Title Censored quantile regression with varying coefficients Author(s) Yin, G; Zeng, D; Li, H Citation Statistica Sinica, 2013, p. 1-24 Issued Date 2013 URL http://hdl.handle.net/10722/189454 Rights Statistica

More information

EMPIRICAL LIKELIHOOD ANALYSIS FOR THE HETEROSCEDASTIC ACCELERATED FAILURE TIME MODEL

EMPIRICAL LIKELIHOOD ANALYSIS FOR THE HETEROSCEDASTIC ACCELERATED FAILURE TIME MODEL Statistica Sinica 22 (2012), 295-316 doi:http://dx.doi.org/10.5705/ss.2010.190 EMPIRICAL LIKELIHOOD ANALYSIS FOR THE HETEROSCEDASTIC ACCELERATED FAILURE TIME MODEL Mai Zhou 1, Mi-Ok Kim 2, and Arne C.

More information

Quantile methods. Class Notes Manuel Arellano December 1, Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be

Quantile methods. Class Notes Manuel Arellano December 1, Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be Quantile methods Class Notes Manuel Arellano December 1, 2009 1 Unconditional quantiles Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be Q τ (Y ) q τ F 1 (τ) =inf{r : F

More information

A note on profile likelihood for exponential tilt mixture models

A note on profile likelihood for exponential tilt mixture models Biometrika (2009), 96, 1,pp. 229 236 C 2009 Biometrika Trust Printed in Great Britain doi: 10.1093/biomet/asn059 Advance Access publication 22 January 2009 A note on profile likelihood for exponential

More information

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data Communications for Statistical Applications and Methods 2015, Vol. 22, No. 6, 599 604 DOI: http://dx.doi.org/10.5351/csam.2015.22.6.599 Print ISSN 2287-7843 / Online ISSN 2383-4757 Estimation of Conditional

More information

Nuisance Parameter Estimation in Survival Models

Nuisance Parameter Estimation in Survival Models Nuisance Parameter Estimation in Survival Models A Dissertation submitted to the faculty of the University of Minnesota by Andrew Wey In partial fulfillment of the requirements for the degree of Doctor

More information

On a connection between the Bradley-Terry model and the Cox proportional hazards model

On a connection between the Bradley-Terry model and the Cox proportional hazards model On a connection between the Bradley-Terry model and the Cox proportional hazards model Yuhua Su and Mai Zhou Department of Statistics University of Kentucky Lexington, KY 40506-0027, U.S.A. SUMMARY This

More information

Stat 8112 Lecture Notes Weak Convergence in Metric Spaces Charles J. Geyer January 23, Metric Spaces

Stat 8112 Lecture Notes Weak Convergence in Metric Spaces Charles J. Geyer January 23, Metric Spaces Stat 8112 Lecture Notes Weak Convergence in Metric Spaces Charles J. Geyer January 23, 2013 1 Metric Spaces Let X be an arbitrary set. A function d : X X R is called a metric if it satisfies the folloing

More information

The Restricted Likelihood Ratio Test at the Boundary in Autoregressive Series

The Restricted Likelihood Ratio Test at the Boundary in Autoregressive Series The Restricted Likelihood Ratio Test at the Boundary in Autoregressive Series Willa W. Chen Rohit S. Deo July 6, 009 Abstract. The restricted likelihood ratio test, RLRT, for the autoregressive coefficient

More information

Issues on quantile autoregression

Issues on quantile autoregression Issues on quantile autoregression Jianqing Fan and Yingying Fan We congratulate Koenker and Xiao on their interesting and important contribution to the quantile autoregression (QAR). The paper provides

More information

Goodness-Of-Fit for Cox s Regression Model. Extensions of Cox s Regression Model. Survival Analysis Fall 2004, Copenhagen

Goodness-Of-Fit for Cox s Regression Model. Extensions of Cox s Regression Model. Survival Analysis Fall 2004, Copenhagen Outline Cox s proportional hazards model. Goodness-of-fit tools More flexible models R-package timereg Forthcoming book, Martinussen and Scheike. 2/38 University of Copenhagen http://www.biostat.ku.dk

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH Jian-Jian Ren 1 and Mai Zhou 2 University of Central Florida and University of Kentucky Abstract: For the regression parameter

More information

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky Empirical likelihood with right censored data were studied by Thomas and Grunkmier (1975), Li (1995),

More information

Harvard University. Harvard University Biostatistics Working Paper Series

Harvard University. Harvard University Biostatistics Working Paper Series Harvard University Harvard University Biostatistics Working Paper Series Year 2008 Paper 94 The Highest Confidence Density Region and Its Usage for Inferences about the Survival Function with Censored

More information

Lecture 22 Survival Analysis: An Introduction

Lecture 22 Survival Analysis: An Introduction University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 22 Survival Analysis: An Introduction There is considerable interest among economists in models of durations, which

More information

Unit 14: Nonparametric Statistical Methods

Unit 14: Nonparametric Statistical Methods Unit 14: Nonparametric Statistical Methods Statistics 571: Statistical Methods Ramón V. León 8/8/2003 Unit 14 - Stat 571 - Ramón V. León 1 Introductory Remarks Most methods studied so far have been based

More information

The Accelerated Failure Time Model Under Biased. Sampling

The Accelerated Failure Time Model Under Biased. Sampling The Accelerated Failure Time Model Under Biased Sampling Micha Mandel and Ya akov Ritov Department of Statistics, The Hebrew University of Jerusalem, Israel July 13, 2009 Abstract Chen (2009, Biometrics)

More information

Model Selection, Estimation, and Bootstrap Smoothing. Bradley Efron Stanford University

Model Selection, Estimation, and Bootstrap Smoothing. Bradley Efron Stanford University Model Selection, Estimation, and Bootstrap Smoothing Bradley Efron Stanford University Estimation After Model Selection Usually: (a) look at data (b) choose model (linear, quad, cubic...?) (c) fit estimates

More information

CALCULATION METHOD FOR NONLINEAR DYNAMIC LEAST-ABSOLUTE DEVIATIONS ESTIMATOR

CALCULATION METHOD FOR NONLINEAR DYNAMIC LEAST-ABSOLUTE DEVIATIONS ESTIMATOR J. Japan Statist. Soc. Vol. 3 No. 200 39 5 CALCULAION MEHOD FOR NONLINEAR DYNAMIC LEAS-ABSOLUE DEVIAIONS ESIMAOR Kohtaro Hitomi * and Masato Kagihara ** In a nonlinear dynamic model, the consistency and

More information

Linear Regression Linear Regression with Shrinkage

Linear Regression Linear Regression with Shrinkage Linear Regression Linear Regression ith Shrinkage Introduction Regression means predicting a continuous (usually scalar) output y from a vector of continuous inputs (features) x. Example: Predicting vehicle

More information

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Jonathan Taylor & Kristin Cobb Statistics 262: Intermediate Biostatistics p.1/?? Overview of today s class Kaplan-Meier Curve

More information

CHARACTERIZATION OF ULTRASONIC IMMERSION TRANSDUCERS

CHARACTERIZATION OF ULTRASONIC IMMERSION TRANSDUCERS CHARACTERIZATION OF ULTRASONIC IMMERSION TRANSDUCERS INTRODUCTION David D. Bennink, Center for NDE Anna L. Pate, Engineering Science and Mechanics Ioa State University Ames, Ioa 50011 In any ultrasonic

More information

Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems

Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems Jeremy S. Conner and Dale E. Seborg Department of Chemical Engineering University of California, Santa Barbara, CA

More information

Enhancing Generalization Capability of SVM Classifiers with Feature Weight Adjustment

Enhancing Generalization Capability of SVM Classifiers with Feature Weight Adjustment Enhancing Generalization Capability of SVM Classifiers ith Feature Weight Adjustment Xizhao Wang and Qiang He College of Mathematics and Computer Science, Hebei University, Baoding 07002, Hebei, China

More information

BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY

BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY Ingo Langner 1, Ralf Bender 2, Rebecca Lenz-Tönjes 1, Helmut Küchenhoff 2, Maria Blettner 2 1

More information

Rank Regression with Normal Residuals using the Gibbs Sampler

Rank Regression with Normal Residuals using the Gibbs Sampler Rank Regression with Normal Residuals using the Gibbs Sampler Stephen P Smith email: hucklebird@aol.com, 2018 Abstract Yu (2000) described the use of the Gibbs sampler to estimate regression parameters

More information

A nonparametric two-sample wald test of equality of variances

A nonparametric two-sample wald test of equality of variances University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 211 A nonparametric two-sample wald test of equality of variances David

More information

Comparing Distribution Functions via Empirical Likelihood

Comparing Distribution Functions via Empirical Likelihood Georgia State University ScholarWorks @ Georgia State University Mathematics and Statistics Faculty Publications Department of Mathematics and Statistics 25 Comparing Distribution Functions via Empirical

More information

Max-Margin Ratio Machine

Max-Margin Ratio Machine JMLR: Workshop and Conference Proceedings 25:1 13, 2012 Asian Conference on Machine Learning Max-Margin Ratio Machine Suicheng Gu and Yuhong Guo Department of Computer and Information Sciences Temple University,

More information

A turbulence closure based on the maximum entropy method

A turbulence closure based on the maximum entropy method Advances in Fluid Mechanics IX 547 A turbulence closure based on the maximum entropy method R. W. Derksen Department of Mechanical and Manufacturing Engineering University of Manitoba Winnipeg Canada Abstract

More information

A Bootstrap Test for Conditional Symmetry

A Bootstrap Test for Conditional Symmetry ANNALS OF ECONOMICS AND FINANCE 6, 51 61 005) A Bootstrap Test for Conditional Symmetry Liangjun Su Guanghua School of Management, Peking University E-mail: lsu@gsm.pku.edu.cn and Sainan Jin Guanghua School

More information

Simulating Properties of the Likelihood Ratio Test for a Unit Root in an Explosive Second Order Autoregression

Simulating Properties of the Likelihood Ratio Test for a Unit Root in an Explosive Second Order Autoregression Simulating Properties of the Likelihood Ratio est for a Unit Root in an Explosive Second Order Autoregression Bent Nielsen Nuffield College, University of Oxford J James Reade St Cross College, University

More information

What s New in Econometrics? Lecture 14 Quantile Methods

What s New in Econometrics? Lecture 14 Quantile Methods What s New in Econometrics? Lecture 14 Quantile Methods Jeff Wooldridge NBER Summer Institute, 2007 1. Reminders About Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile Regression

More information

A Non-parametric bootstrap for multilevel models

A Non-parametric bootstrap for multilevel models A Non-parametric bootstrap for multilevel models By James Carpenter London School of Hygiene and ropical Medicine Harvey Goldstein and Jon asbash Institute of Education 1. Introduction Bootstrapping is

More information

Censored partial regression

Censored partial regression Biostatistics (2003), 4, 1,pp. 109 121 Printed in Great Britain Censored partial regression JESUS ORBE, EVA FERREIRA, VICENTE NÚÑEZ-ANTÓN Departamento de Econometría y Estadística, Facultad de Ciencias

More information

Multivariate Survival Analysis

Multivariate Survival Analysis Multivariate Survival Analysis Previously we have assumed that either (X i, δ i ) or (X i, δ i, Z i ), i = 1,..., n, are i.i.d.. This may not always be the case. Multivariate survival data can arise in

More information

COMPARING TRANSFORMATIONS USING TESTS OF SEPARATE FAMILIES. Department of Biostatistics, University of North Carolina at Chapel Hill, NC.

COMPARING TRANSFORMATIONS USING TESTS OF SEPARATE FAMILIES. Department of Biostatistics, University of North Carolina at Chapel Hill, NC. COMPARING TRANSFORMATIONS USING TESTS OF SEPARATE FAMILIES by Lloyd J. Edwards and Ronald W. Helms Department of Biostatistics, University of North Carolina at Chapel Hill, NC. Institute of Statistics

More information

Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion

Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion Glenn Heller and Jing Qin Department of Epidemiology and Biostatistics Memorial

More information

Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances

Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances Advances in Decision Sciences Volume 211, Article ID 74858, 8 pages doi:1.1155/211/74858 Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances David Allingham 1 andj.c.w.rayner

More information

Homework 6 Solutions

Homework 6 Solutions Homeork 6 Solutions Igor Yanovsky (Math 151B TA) Section 114, Problem 1: For the boundary-value problem y (y ) y + log x, 1 x, y(1) 0, y() log, (1) rite the nonlinear system and formulas for Neton s method

More information