Rank-based inference for the accelerated failure time model
|
|
- Rosamond Parker
- 6 years ago
- Views:
Transcription
1 Biometrika (2003), 90, 2, pp Biometrika Trust Printed in reat Britain Rank-based inference for the accelerated failure time model BY ZHEZHEN JIN Department of Biostatistics, Columbia University, Ne York, Ne York 10032, U.S.A. D. Y. LIN Department of Biostatistics, University of North Carolina, Chapel Hill, North Carolina , U.S.A. L. J. WEI Department of Biostatistics, Harvard University, Boston, Massachusetts 02115, U.S.A. AND ZHILIAN YIN Department of Statistics, Columbia University, Ne York, Ne York 10027, U.S.A. SUMMARY A broad class of rank-based monotone estimating functions is developed for the semiparametric accelerated failure time model ith censored observations. The corresponding estimators can be obtained via linear programming, and are shon to be consistent and asymptotically normal. The limiting covariance matrices can be estimated by a resampling technique, hich does not involve nonparametric density estimation or numerical derivatives. The ne estimators represent consistent roots of the non-monotone estimating equations based on the familiar eighted log-rank statistics. Simulation studies demonstrate that the proposed methods perform ell in practical settings. To real examples are provided. Some key ords: Accelerated life model; Censoring; ehan statistic; Linear programming; Rank estimator; Survival data; Weighted log-rank statistic. 1. INTRODUCTION The accelerated failure time model or accelerated life model relates the logarithm of the failure time linearly to the covariates (Kalbfleisch & Prentice, 1980, pp. 32 4; Cox & Oakes, 1984, pp. 64 5). As a result of its direct physical interpretation, this model provides an attractive alternative to the popular Cox (1972) proportional hazards model for the regression analysis of censored failure time data. The presence of censoring in failure time data creates a serious challenge in the semiparametric analysis of the accelerated failure time model. Several semiparametric estimators ere proposed around 1980, Buckley & James (1979) providing a modification of the Donloaded from at Columbia University Libraries on March 27, 2015
2 342 Z. JIN, D. Y. LIN, L. J. WEI AND Z. YIN least-squares estimator to accommodate censoring and Prentice (1978) proposing rank estimators based on the ell-knon eighted log-rank statistics. The asymptotic properties of the Buckley James and rank estimators ere rigorously studied by Ritov (1990), Tsiatis (1990), Lai & Ying (1991a, b) and Ying (1993) among others. Despite the theoretical advances, semiparametric methods for the accelerated failure time model have rarely been used in applications, mainly because of the lack of efficient and reliable computational methods. The existing semiparametric estimating functions are step functions of the regression parameters ith potentially multiple roots, and the corresponding estimators may not be ell defined. Furthermore, analytical evaluations of the covariance matrices of the estimators ould require nonparametric density function estimation. To bypass the variance-covariance estimation, Wei et al. (1990) developed an inference procedure based on the so-called minimum-dispersion statistic. Calculation of the minimum-dispersion statistics and of the semiparametric estimators involves minimisations of discrete objective functions ith potentially multiple local minima. Such minimisation problems cannot be solved by conventional optimisation algorithms. Lin & eyer (1992) suggested a computational method based on simulated annealing (Kirkpatrick et al., 1983), but their algorithm is not guaranteed to yield the true minimum. In the present paper, e provide simple and reliable methods for implementing the aforementioned rank estimators. We first sho that the rank estimator ith the ehan (1965)-type eight function can be readily obtained by minimising a convex objective function through a standard linear programming technique. We then introduce a class of monotone estimating functions to approximate the possibly non-monotone eighted logrank estimating functions around the true values of the regression parameters. These estimating equations are solved through an iterative algorithm ith the ehan-type estimator as the initial value. Each iteration can also be executed via linear programming. This procedure yields a consistent root of the eighted log-rank estimating equation, hich potentially contains inconsistent roots. The covariance matrices for both the ehantype and iterative estimators can be easily estimated by a resampling technique, hich does not involve density estimation. 2. PROPOSED METHODS 2 1. Accelerated failure time model and rank estimators For,...,n, let T be the failure time for the ith subject and let X be the associated i i p-vector of covariates. The accelerated failure time model specifies that log T i =b 0 X i +e i (,...,n), (2 1) here b is a p-vector of unknon regression parameters and e (,...,n)are independent error terms ith a common, but completely unspecified, distribution. 0 i Let C be the censoring time for T. Assume that T and C are independent conditionally i i i i on X. We impose Conditions 1 4 of Ying (1993, p. 80). The data consist of (TB, D, X ) i i i i (,...,n), here TB =T mc and D =1. Here and in the sequel, amb=min(a, b), i i i i {Ti C i } and 1 is the indicator function. {.} Define e (b)=log TB b X,N(b; t)=d 1 i i i i i {ei (b) t} and Y i (b; t)=1 {e i (b)t}. Note that N i and Y are the counting process and at-risk process on the time scale of the residual. Write i S(0)(b; t)=n 1 n Y (b; t), i S(1)(b; t)=n 1 n Y (b; t)x. i i Donloaded from at Columbia University Libraries on March 27, 2015
3 Accelerated failure time model The eighted log-rank estimating function for b takes the form 0 U (b)= n D {b; e(b)}[x X9 {b; e(b)}], i i i i or 343 U (b)= n P 2 (b; t){x X9 (b; t)} dn (b; t), (2 2) i i 2 here X9 (b; t)=s(1)(b; t)/s(0)(b; t), and is a possibly data-dependent eight function satisfying Condition 5 of Ying (1993, p. 90). The choices of =1 and =S(0) correspond to the log-rank (Mantel, 1966) and ehan (1965) statistics, respectively. Let b@ be a root of the estimating function U (b). It has been established that the random vector nd b ) is asymptotically zero-mean normal ith covariance matrix 0 A 1 B A 1, here A = lim n 1 n P 2 (b ; t){x X9 (b ;t)}e2{l<(t)/l(t)} dn (b ; t), 0 i 0 i 0 n2 2 B = lim n 1 n P 2 2(b ; t){x X9 (b ;t)}e2 dn (b ; t), 0 i 0 i 0 n2 2 l(.) is the common hazard function of the error terms, and l<(t)=dl(t)/dt (Tsiatis, 1990; Lai & Ying, 1991b; Ying, 1993). The foregoing result requires that A be nonsingular. This condition is assumed to hold in the present paper. In general, U (b) is neither continuous nor componentise monotone in b. Thus, it is difficult to solve the equation U (b)=0, especially hen b is high-dimensional. In fact, there are potentially multiple solutions to the equation, some of hich are inconsistent. Although one may define the estimator to be a minimiser of the norm of U (b), itisnot easy to locate a minimiser either, because of the discontinuity and non-monotonicity. Another difficulty lies in the variance estimation. Since A involves the derivative of the hazard function, direct evaluation of the covariance matrix ould require nonparametric estimation of the density function, hich cannot be done reliably in practical samples ith censored observations. One might estimate A by the numerical derivative of n 1U ; hoever, this kind of estimator can be highly unstable ehan-type eight function Considerable simplification arises in the special case of (b; t)=s(0)(b; t), hich is referred to as the ehan-type eight function. In this case, U can be ritten as or U (b)= n D S(0){b; e(b)}[x X9 {b; e(b)}], i i i i U (b)=n 1 n n D (X X )1 i i j {ei (b) e j=1 j (b)}, (2 3) hich is monotone in each component of b (Fygenson & Ritov, 1994). It is not difficult to see that (X X )1 i j {ei (b) e j (b)} is the gradient in b of {e i (b) e (b)}, here j Donloaded from at Columbia University Libraries on March 27, 2015
4 344 Z. JIN, D. Y. LIN, L. J. WEI AND Z. YIN a = a 1 {a<0}. Thus, the right-hand side of (2 3) is the gradient of the convex function L (b))n 1 n n D {e (b) e (b)}. i i j j=1 We denote a minimiser of L (b) by b@. Although b@ is not necessarily unique, all the minimisers of L (b) are asymptotically equivalent. The minimisation of L (b) can be carried out by linear programming: e minimise the linear function Wn Wn D u subject to the linear constraints u 0 and j=1 i ij ij u ij {e i (b) e j (b)} (i, j=1,...,n). This type of linear programming has been used extensively in the econometrics literature, especially in connection ith the regression quantiles introduced by Koenker & Bassett (1978), and as previously considered by Lin et al. (1998) for the rank estimation of the accelerated time model for counting processes. The minimisation of L (b) is equivalent to the minimisation of n n D e (b) e (b) + n n D (X X ) K i i j KM b, k l k j=1 k=1 l=1 here M is an extremely large number. The latter type of minimisation can be implemented ith the algorithm of Koenker & D Orey (1987), hich is available in S-Plus and other softare. According to the general asymptotic theory for the rank estimators stated in 2 1, the random vector nd b ) is asymptotically zero-mean normal ith covariance matrix 0 A 1 B A 1, here A and B are A and B evaluated at =. As discussed before, A involves the unknon hazard function, so that it is difficult to estimate the covariance matrix analytically. We ill develop a resampling scheme similar to those of Rao & Zhao (1992), Parzen et al. (1994) and Jin et al. (2001) to approximate the distribution of b@, particularly its variance-covariance matrix. To be specific, e define a ne loss function L *(b))n 1 n n D {e (b) e (b)} Z, i i j i j=1 here Z (,...,n)are independent positive random variables ith E(Z )=var(z )=1, i i i and are independent of the data (TB, D, X )(,...,n).this loss function is essentially i i i a perturbed version of the original L (b), perturbed by random variables Z (,...,n). i Minimisation of L * is also a linear programming problem. Let b@* be a minimiser of L *, that is a root of the estimating function U* (b)) n D S(0){b; e(b)}[x X9 {b; e(b)}]z. i i i i i Note that U* has the same mean and approximately the same variance as U. We sho in the Appendix that the asymptotic distribution of nd b ) can be approximated by 0 the conditional distribution of nd* b@ ) given the data (TB, D,X)(,...,n). i i i Conditional on the data (TB, D, X )(,...,n), the only random elements in L *(b) i i i are the Z s. To approximate the distribution of b@, e produce a large number of realisations of b@* by repeatedly generating the random sample (Z,...,Z ) hile holding the 1 n i data (TB, D, X )(,...,n) at their observed values. The covariance matrix of b@ can i i i then be approximated by the empirical covariance matrix of b@*. Confidence intervals Donloaded from at Columbia University Libraries on March 27, 2015
5 Accelerated failure time model for individual components of b 0 can be obtained from the percentiles of the empirical distribution of b@* or by the Wald method eneral eight functions The efficiency of the ehan statistic and the corresponding rank estimator b@ depends on the censoring distribution. The variance of b@ is minimised if the limit of (t) is proportional to l<(t)/l(t). Thus, it is desirable to use general eight functions. The simple conversion of the ehan estimating function U to an L 1 -type loss function does not extend directly to the general eighted log-rank estimating functions. On the basis of b@, hoever, it is possible to construct eighted ehan-type loss functions, hose minimisers can be readily obtained by the linear programming algorithm. We consider the folloing modification of (2 2): UB (b; b@)) n P 2 y{b@; t+(b b@) X }S(0)(b; t){x X9 (b; t)} dn (b; t), (2 4) i i i 2 here y(b; x)=(b; x)/s(0)(b; x), and b@ is a preliminary consistent estimator of b, b@ say. 0 Clearly, UB (b; b@)= n D y{b@; e(b)+(b b@) X }S(0){b; e(b)}[x i i i i i X9 {b; e(b)}] i =n 1 n n y{b@; e)}d (X X )1 i i i j {ei (b) e j=1 j (b)}. This is the same as (2 3) except for the eights y{b@; e)}, hich are free of b. Thus, i UB (b; b@) is monotone in each component of b and is the gradient of the folloing extension of L (b): L (b; b@))n 1 n n y{b@; e)}d {e (b) e (b)}. i i i j j=1 As in the case of L, minimisation of L (.; b@) can be implemented via linear programming. We propose the folloing iterative algorithm for estimating b : 0 b@ =b@, b@ =arg min L (b; b@ ) (k1). (0) b By comparing (2 2) and (2 4), e see that, if b@ converges to a limit as the number of iterations k2, then the limit must satisfy U (b)=0. In such situations, the proposed algorithm constitutes a numerical method for solving the original estimating equation U (b)=0. The resulting estimator is consistent although the original estimating equation may contain inconsistent roots. In all the simulated and real datasets e have tested, b@ converges as k increases and, for k as small as 3, b@ is already close to the limit. It is important to point out that b@ itself is a legitimate estimator for any k and the convergence of the algorithm is not essential to the validity of the proposed method. We sho in the Appendix that, for any k, the estimator b@ is consistent and asymptotically normal. In fact, it follos from (A 5) in the Appendix that Donloaded from at Columbia University Libraries on March 27, 2015 b@ ={(A +D ) 1D }kb@ +[I {(A +D ) 1D }k]b@ +o p (n D),
6 346 Z. JIN, D. Y. LIN, L. J. WEI AND Z. YIN here I is the p p identity matrix, D = lim n 1 n P y < (t)s(0)(b ; t){x X9 (b ;t)}e2 dn (b ; t), (2 5) 0 0 i 0 i 0 n2 and y< (t) is the derivative of y (t), the limit of y(b ;t)as n2. Thus, b@ is asymptotically a eighted average of b@ and b@, and can be regarded as an approximation to b@ If {(A +D ) 1D }k0 as k2, then the limit of b@ as k2, denoted by b@,is (2) asymptotically equivalent to b@. Note that {(A +D ) 1D }k0 as k2 is equivalent to all the eigenvalues of (A +D ) 1D being less than 1 in absolute value. This condition holds if A is positive definite and D is nonnegative definite. The most commonly used eight functions, including the log-rank, Prentice Wilcoxon (Prentice, 1978) and the more general r class of Harrington & Fleming (1982), are all nonnegative and nonincreasing. For such eight functions, A is indeed positive definite provided that the distribution of X does not concentrate on a (p 1)-dimensional hyperplane (Ying, 1993, pp. 81 2). It is evident from (2 5) that D ill be nonnegative definite if y< (.)0, hich is true of all commonly used eight functions. Therefore, b@ is asymptotically equivalent to b@, at least for most important eight functions. (2) 0 We sho in the Appendix that, under mild conditions, b@ b@ =o (n D) for any k of p order log(n) provided that {(A +D ) 1D }k0 as k2. This implies that, ith a finite number of iterations, b@ is asymptotically equivalent to the consistent roots of the original estimating function U (b). In practice, one ould terminate the iteration process hen the difference beteen successive estimates is less than a certain bound. This convergence criterion may be set on the basis of the observed b@ and its estimated standard error, as demonstrated in the next section. Although b@ can be easily obtained, it is again difficult to estimate the limiting covariance matrix analytically. As in the case of b@, e appeal to the resampling approach. To be specific, e construct the perturbed objective function L *(b; b))n 1 n n y{b; e (b)}d {e (b) e (b)} Z, i i i j i j=1 and define b@* =b@* and b@* =arg min L *(b; b@* )(k1). We sho in the Appendix (0) b that the asymptotic distribution of nd b ) can be approximated by the conditional 0 distribution of nd* b@ ) given the data (TB, D, X )(,...,n). Inference about b i i i 0 can then be carried out on the basis of the empirical distribution of b@*. 3. NUMERICAL RESULTS 3 1. Real examples The results reported here and in 3 2 are based on y=1 and y=1/s(0), hich are referred to as the ehan and log-rank eight functions. For the resampling procedure, e simulated the Z i from the unit exponential distribution and generated 500 realisations. The results ere similar hen other distributions of the Z i ere used. The Wald method as used to construct the confidence intervals. We first applied the proposed methods to a study on multiple myeloma reported by Krall et al. (1975). Out of a total of 65 patients ho ere treated ith alkylating agents, 48 died during the study and 17 survived. This is the main example in the PROC PHRE of the SAS/STAT User s uide (1999, pp , ), hich focused on the Cox model ith haemoglobin, HB, and the logarithm of blood urea nitrogen, BUN, as the Donloaded from at Columbia University Libraries on March 27, 2015
7 Accelerated failure time model covariates. To improve numerical efficiencies, e standardised the to covariates log(bun) and HB in our analysis. The parameter estimates for standardised log(bun) and HB based on the ehan eight function are and ith estimated standard errors of and The corresponding 95% confidence intervals are ( 0 818, 0 246) and ( 0 039, 0 623). We used the iterative algorithm to obtain the estimates based on the log-rank eight function. The results from the ehan eight function suggested that it is not necessary to have accurate numbers beyond the second decimal point. Thus, e set the convergence criterion for the iteration to be 0 01 beteen successive estimates. Based on this criterion, convergence as achieved at the third iteration. The resulting estimates are and ith 95% confidence intervals ( 0 880, 0 130) and ( 0 017, 0 553). The estimates are identical up to the eighth decimal point after 5 iterations. For further illustration, e considered the ell-knon Mayo primary biliary cirrhosis data (Fleming & Harrington, 1991, Appendix D.1). The database contains information about the survival time and prognostic factors for 418 patients. The data ere used by Dickson et al. (1989) to build a Cox proportional hazards model for the natural history of the disease ith five covariates, age, log(albumin), log(bilirubin), oedema and log(protime); see Fleming & Harrington (1991, Table 4.6.3). We considered the same set of covariates in our analysis. We again standardised the covariates. The estimates based on the ehan eight function are 0 270, 0 204, 0 593, and ith estimated standard errors of 0 062, 0 069, 0 071, and These results also suggested that the convergence criterion of 0 01 can be used for calculating the log-rank-type estimates. This criterion as again met at the third iteration. The estimates ere identical up to the eighth decimal point after 12 iterations. We display in Table 1 the results for the unstandardised covariates. Although the conclusions are not qualitatively different, the analysis under the accelerated failure time model provides an alternative and more direct interpretation of the effects of covariates on the survival time, as compared to the Cox regression analysis. The results based on the to eight functions are fairly similar. The point estimates for the log-rank eight function are similar to those of Lin & eyer (1992), hich ere obtained by a simulated annealing algorithm. Table 1. Accelerated failure time regression for the Mayo primary biliary cirrhosis data ehan eight function Log-rank eight function Parameter Est SE 95% CI Est SE 95% CI Age ( 0 037, 0 014) ( 0 035, 0 018) log(albumin) (0 542, 2 639) (0 934, 2 378) log(bilirubin) ( 0 716, 0 442) ( 0 674, 0 496) Oedema ( 1 421, 0 336) ( 1 083, 0 385) log(protime) ( 4 549, 0 987) ( 2 850, 1 038) Est, estimate; SE, standard error; CI, confidence interval. 347 Donloaded from at Columbia University Libraries on March 27, Simulation studies Extensive simulation studies ere conducted to assess the operating characteristics of the proposed methods in practical settings. One series of the experiments mimicked the foregoing multiple myeloma study. The failure times ere generated from the model log T = log(bun) HB+e, (3 1)
8 348 Z. JIN, D. Y. LIN, L. J. WEI AND Z. YIN here e is a normal random variable ith mean 0 and standard deviation The regression coefficients and the standard deviation of e in model (3 1) ere estimated from the parametric accelerated failure time model. The censoring times ere generated from the Un(0, t) distribution, here t as chosen to yield a desired level of censoring. We first examined the difference beteen log-rank-type estimates obtained after a fixed number of iterations versus those obtained after convergence. We generated 1000 datasets ith sample size of 65 from model (3 1). The standardised covariate values from the original myeloma study ere used. The algorithm as considered convergent if the difference beteen to consecutive estimates as less than ; convergence as usually achieved ithin 10 iterations. Figure 1(a) plots the estimates obtained after convergence versus those obtained after three iterations for the estimation of the first regression coefficient under 20% censoring; the to sets of estimates are clearly very similar. Figure 1(b) displays the corresponding plot for the log-rank estimates versus the ehan estimates, the to of hich are noticeably different. Thus, the phenomenon in Fig.1(a) is not an artifact of the closeness of the ehan and log-rank estimates. The plots for the estimation of the second regression coefficient and for other censoring levels revealed similar results. The values of the log-rank estimating functions at convergence versus after three iterations ere similar: under 20% censoring, the medians of the L 1 -norms of n 1UB among the 1000 datasets ere at convergence and after three iterations; the corresponding maxima ere 0 07 and These results demonstrated that a small number of iterations is sufficient. Log-rank-type est. after conv (a) Log-rank-type est. after 3 iter. Log-rank-type est. after conv (b) ehan-type est. Fig. 1. Comparisons of different estimates of b 01 under 20% censoring: (a) log-rank-type estimates after convergence versus log-rank-type estimates after three iterations; (b) log-rank-type estimates after convergence versus ehan-type estimates. Donloaded from at Columbia University Libraries on March 27, 2015 Most of the simulation studies ere concerned ith the performance of the proposed inference procedures based on the ehan-type estimator and the log-rank-type estimator ith three iterations. We considered sample sizes of 65 and 130. For n=65, the standardised covariate values from the multiple myeloma dataset ere used; for n=130, each covariate value in the original dataset as duplicated. For each configuration of the simulation parameters, e generated 500 datasets ith the same set of covariate values. The results of the simulation studies are summarised in Table 2. The parameter estimators
9 Accelerated failure time model 349 Table 2. Summary statistics for the simulation studies: regression coeycients b 01 and for log(bun) and HB, respectively n Censoring Weight Parameter Bias SE SEE 90% CP 95% CP 65 0% ehan b Log-rank b % ehan b Log-rank b % ehan b Log-rank b % ehan b Log-rank b % ehan b Log-rank b % ehan b Log-rank b Bias, bias of the estimator; SE, standard error of the estimator; SEE, mean of the standard error estimator; 90% CP, coverage probability of the 90% Wald-type confidence interval; 95% CP, coverage probability of the 95% Wald-type confidence interval appear to be virtually unbiased. The standard error estimators reflect ell the true variabilities of the parameter estimators, and the corresponding confidence intervals have satisfactory coverage probabilities. 4. REMARKS Fygenson & Ritov (1994) characterised the class of eight functions in (2 2) hich yields monotone estimating functions. The class is quite restrictive and involves the censoring time distribution. By contrast, the class of monotone estimating functions developed in the present paper is very broad and yields consistent roots of the original eighted logrank estimating equations. On the basis of (2 4), e can obtain a one-step approximation to b@ for any eight function, as follos. Define Donloaded from at Columbia University Libraries on March 27, 2015 ya (b; x)=s(0)(b; x) 1(b; x)i D A 1, and replace y{b@; t+(b b@) X i } in (2 4) by ya {b@ ;t+(b b@ ) X i }. It then follos from (A 4) in the Appendix that the resulting estimator is asymptotically equivalent to b@.to
10 350 Z. JIN, D. Y. LIN, L. J. WEI AND Z. YIN implement this one-step estimator, one ould need to estimate A and D. A consistent estimator of A can be easily obtained because A 1 B A 1 and B can be consistently estimated from the data. The estimation of D ould require an estimator of l(.), hich can be obtained either nonparametrically or parametrically. The proposed estimators do not achieve the semiparametric efficiency bound. By adopting the approach of Lai & Ying (1991b, 3), one can construct data-dependent eight functions hich yield asymptotically efficient estimators provided that the iterative procedure converges. ACKNOWLEDEMENT This research as supported by the National Institutes of Health, the National Science Foundation and the Ne York City Council Speaker s Fund for Public Health Research. The authors thank the revieers for their helpful comments. Softare implementing the proposed methods is available from the authors. APPENDIX Derivation of asymptotic results Asymptotic properties of b@*. Both n 1L and n 1L * are convex functions and, by the strong la of large numbers, converge almost surely to the same limiting function. The first derivative of the limiting function is zero at b and its second derivative at b is A, hich is nonsingular. Thus, 0 0 this function has a unique minimiser b. It then follos from convex analysis that b@ b and 0 0 b@* b almost surely. The latter convergence is ith respect to the joint probability space of 0 (TB, D, X )(,...,n) and Z (,...,n). We shall use F to denote the s-field generated by i i i i the original data. It follos from Theorem 2 of Ying (1993) that almost surely By similar arguments, U )=U (b 0 )+na )+o(nd+ndb@ d). (A 1) U* * )=U* )+na * b@ )+o(nd+ndb@* b@ d) (A 2) almost surely. The functions U* and U have the same asymptotic slope matrix A in (A 1) and (A 2) because E{U* (b) F}=U (b) and b@ and b@* are consistent. We have that U* )=U* ) U )+o(nd) since b@ is a root of U (b). Thus n DU* )=n D n P 2 S(0) ; t){x X9 ; t)} dn ;t)(z 1)+o(1). i i i 2 Conditionally on F, the right-hand side of the above display is a normalised sum of independent zero-mean random vectors. Since its conditional covariance matrix converges almost surely to B, the multivariate central limit theorem implies that n DU* ) converges in distribution to N(0, B ) almost surely. It then follos from (A 2) that the conditional distribution of nd* b@ ) given F converges almost surely to N(0, A 1 B A 1 ), hich is the limiting distribution of b ). 0 Asymptotic properties of b@. We first sho that b@ is strongly consistent for any given k. Recall that b@ =b@. As a result of the strong consistency of b@, the function n 1L (b; b@ ) (0) (0) (0) converges almost surely to lim n 1L (b; b ), hich reaches its minimum at b=b. Since 0 0 n 1L (b; b@ ) is convex, b@ must converge to b almost surely. Applying these arguments successively for k=2,3,...,e see that b@ b almost surely as n2 for all k. 0 (0) (1) 0 To avoid possible tail instabilities, e assume that, for any b and g hich converge almost n n Donloaded from at Columbia University Libraries on March 27, 2015
11 surely to b 0 and 0, respectively, Accelerated failure time model y(b n ;t+g n )=y(b n ;t)+y< 0 (t)g n +o(n D+g n ) uniformly in t. This condition holds for virtually all the existing eight functions for t aay from the endpoint. For it to hold for all t, e typically ould have to use hich is truncated at the tail, that is (t)=0 for t near the endpoint. Under this condition and Conditions 1 5 of Ying (1993), e can expand UB (b; b@ ) to yield 351 UB (b; b@ )= n P ; t)s(0)(b; t){x i X9 (b; t)} dn i (b; t) + n P y < (t)s(0)(b; t){x X9 (b; t)}x dn (b; t)(b b@ ) 0 i i i +o(nd+ndb b@ d) as n2 and bb 0. By Theorem 2 of Ying (1993), the first term on the right-hand side of the above display has the linear expansion U (b 0 )+na (b )+o(nd+ndb d). Thus, UB ; b@ )=U (b 0 )+n(a +D ) ) nd ) +o(nd+ndb@ d+ndb@ d). Suppose that A,A and A +D are nonsingular. By combining (A 1) and (A 3) ith k=1, e obtain nd (1) )= n D(A +D ) 1{U (b 0 )+D A 1 U (b 0 )} +o(1+nd db@ (1) d+nddb@ d). It then follos from recursive use of (A 3) that in general or nd b )= n D k {(A +D ) 1D }j 1(A +D ) 1U (b ) 0 0 j=1 n D{(A +D ) 1D }ka 1 U (b 0 )+o 1+nD k db@ b d A B (j) 0 j=0 nd b )= n D[I {(A +D ) 1D }k]a 1 0 U (b 0 ) n D{(A +D ) 1D }ka 1 U (b 0 )+o 1+nD k db@ b d A (j) 0 j=0 B. (A 5) The asymptotic normality of nd b ) then follos from that of n DU (b ) and n DU (b ) Suppose that is truncated at the tail and the error density is sufficiently smooth. Then it follos from equation (4.5) of Lai & Ying (1991b) that, for some g>0 and c>0, (A 3) holds ith o(nd g) as the remainder term uniformly for b@ and b@ in an O(n D+c) neighbourhood of b. 0 Thus, if {(A +D ) 1D }k0 as k2, then all the b@ (j k) stay in an O(n D+c) neighbourhood (j) of b so that the remainder term in (A 5) becomes o(kn g)=o(1) provided that k2 and 0 k=o(log n). Hence, nd b )=nd b )+o (1) for k2 and k=o(log n). 0 0 p Asymptotic properties of b@*. We impose the same regularity conditions as in the previous section. The convexity arguments and the strong la of large numbers can again be employed to establish the strong consistency of b@*. By definition, b@* is a root of (A 3) (A 4) Donloaded from at Columbia University Libraries on March 27, 2015 UB * (b; b@* ) ) n P ;t+(b b@* ) X i }S(0)(b; t){x i X9 (b; t)} dn i (b; t)z i.
12 352 Z. JIN, D. Y. LIN, L. J. WEI AND Z. YIN By asymptotic linear expansions similar to those used in establishing (A 3), e have UB * * ; b@* )= n P ;t+ b@ ) X }S(0) ; t){x X9 ; t)} dn ; t)z i i i i +n(a +D )* b@ ) nd * b@ )+d*, (A 6) n here d* n =o(nd+n Wk j=0 db@ (j) d+n Wk j=0 db@* (j) d). We can replace the Z i in the first term on the right-hand side of (A 6) by Z i 1 since b@ is a root of UB (b; b@ ). With this replacement, the first term has mean 0 conditionally on F, so that the integral can be approximated by (b 0 ; t){x i X9 (b 0 ; t)} dn i (b 0 ; t). Thus, hich yields Therefore, UB * * ; b@* )= n nd* b@ )= n D(A +D ) 1 n P (b 0 ; t){x i X9 (b 0 ; t)} dn i (b 0 ;t)(z i 1) +n(a +D )* b@ ) nd * b@ )+d* n, P (b 0 ; t){x i X9 (b 0 ; t)} dn i (b 0 ;t)(z i 1) +(A +D ) 1D nd* b@ )+n Dd* n. nd* b@ )= n D [I {(A +D ) 1D }k]a 1 n P (b 0 ; t){x i X9 (b 0 ; t)} dn i (b 0 ;t)(z i 1) n D{(A +D ) 1D }ka 1 n P S(0)(b 0 ; t){x i X9 (b 0 ; t)} dn i (b 0 ;t)(z i 1)+n Dd* n. Comparing the above equation ith (A 5), e see that the conditional distribution of nd* b@ ) given F converges almost surely to the limiting distribution of nd b ). 0 Under the additional conditions stated in the last paragraph of the previous section, (A 6) can be strengthened ith d* =o(n j) for some j>0. Hence, for k2 and k=o(log n), n nd* b@ ) has the same asymptotic distribution as nd b ). 0 REFERENCES BUCKLEY, J. & JAMES, I. (1979). Linear regression ith censored data. Biometrika 66, COX, D. R. (1972). Regression models and life-tables (ith Discussion). J. R. Statist. Soc. B 34, COX, D. R. & OAKES, D. (1984). Analysis of Survival Data. London: Chapman and Hall. DICKSON, E. R., RAMBSCH, P. M., FLEMIN, T. R., FISHER, L. D. & LANWORTHY, A. (1989). Prognosis in primary biliary cirrhosis: model for decision making. Hepatology 10, 1 7. FLEMIN, T. R. & HARRINTON, D. P. (1991). Counting Processes and Survival Analysis. Ne York: Wiley. FYENSON, M. & RITOV, Y. (1994). Monotone estimating equations for censored data. Ann. Statist. 22, EHAN, E. A. (1965). A generalized Wilcoxon test for comparing arbitrarily single-censored samples. Biometrika 52, HARRINTON, D. P. & FLEMIN, T. R. (1982). A class of rank test procedures for censored survival data. Biometrika 69, JIN, Z., YIN, Z. & WEI, L. J. (2001). A simple resampling method by perturbing the minimand. Biometrika 88, KALBFIEISCH, J. D. & PRENTICE, R. L. (1980). T he Statistical Analysis of Failure T ime Data. Ne York: Wiley. KIRKPATRICK, S., ELATT, C. D. & VECCHI, M. P. (1983). Optimisation by simulated annealing. Science 220, Donloaded from at Columbia University Libraries on March 27, 2015
13 Accelerated failure time model KOENKER, R. & BASSETT,. S. (1978). Regression quantiles. Econometrica 46, KOENKER, R. & D OREY, V. (1987). Computing regression quantiles. Appl. Statist. 36, KRALL, J. M., UTHOFF, V. A. & HARLEY, J. B. (1975). A step-up procedure for selecting variables associated ith survival. Biometrics 31, LAI, T. L. & YIN, Z. (1991a). Large sample theory of a modified Buckley James estimator for regression analysis ith censored data. Ann. Statist. 19, LAI, T. L. & YIN, Z. (1991b). Rank regression methods for left-truncated and right-censored data. Ann. Statist. 19, LIN, D. Y. & EYER, C. J. (1992). Computational methods for semiparametric linear regression ith censored data. J. Comp. raph. Statist. 1, LIN, D. Y., WEI, L. J. & YIN, Z. (1998). Accelerated failure time models for counting processes. Biometrika 85, MANTEL, N. (1966). Evaluation of survival data and to ne rank order statistics arising in its considerations. Cancer Chemo. Rep. 50, PARZEN, M. I., WEI, L. J. & YIN, Z. (1994). A resampling method based on pivotal estimating functions. Biometrika 81, PRENTICE, R. L. (1978). Linear rank tests ith right censored data. Biometrika 65, RAO, C. R. & ZHAO, L. C. (1992). Approximation to the distribution of M-estimates in linear models by randomly eighted bootstrap. Sankhya: A 54, RITOV, Y. (1990). Estimation in a linear regression model ith censored data. Ann. Statist. 18, SAS INSTITUTE, INC. (1999). SAS/STAT User s uide, Version 8. Cary, NC: SAS Institute Inc. TSIATIS, A. A. (1990). Estimating regression parameters using linear rank tests for censored data. Ann. Statist. 18, WEI, L. J., YIN, Z. & LIN, D. Y. (1990). Linear regression analysis of censored survival data based on rank tests. Biometrika 77, YIN, Z. (1993). A large sample study of rank estimation for censored regression data. Ann. Statist. 21, [Received August Revised November 2002] 353 Donloaded from at Columbia University Libraries on March 27, 2015
On least-squares regression with censored data
Biometrika (2006), 93, 1, pp. 147 161 2006 Biometrika Trust Printed in Great Britain On least-squares regression with censored data BY ZHEZHEN JIN Department of Biostatistics, Columbia University, New
More informationStatistical inference based on non-smooth estimating functions
Biometrika (2004), 91, 4, pp. 943 954 2004 Biometrika Trust Printed in Great Britain Statistical inference based on non-smooth estimating functions BY L. TIAN Department of Preventive Medicine, Northwestern
More informationTests of independence for censored bivariate failure time data
Tests of independence for censored bivariate failure time data Abstract Bivariate failure time data is widely used in survival analysis, for example, in twins study. This article presents a class of χ
More informationLSS: An S-Plus/R program for the accelerated failure time model to right censored data based on least-squares principle
computer methods and programs in biomedicine 86 (2007) 45 50 journal homepage: www.intl.elsevierhealth.com/journals/cmpb LSS: An S-Plus/R program for the accelerated failure time model to right censored
More informationRank Regression Analysis of Multivariate Failure Time Data Based on Marginal Linear Models
doi: 10.1111/j.1467-9469.2005.00487.x Published by Blacwell Publishing Ltd, 9600 Garsington Road, Oxford OX4 2DQ, UK and 350 Main Street, Malden, MA 02148, USA Vol 33: 1 23, 2006 Ran Regression Analysis
More informationLeast Absolute Deviations Estimation for the Accelerated Failure Time Model. University of Iowa. *
Least Absolute Deviations Estimation for the Accelerated Failure Time Model Jian Huang 1,2, Shuangge Ma 3, and Huiliang Xie 1 1 Department of Statistics and Actuarial Science, and 2 Program in Public Health
More informationBiometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika.
Biometrika Trust A Simple Resampling Method by Perturbing the Minimand Author(s): Zhezhen Jin, Zhiliang Ying and L. J. Wei Source: Biometrika, Vol. 88, No. 2 (Jun., 2001), pp. 381-390 Published by: Biometrika
More informationUniversity of California, Berkeley
University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 24 Paper 153 A Note on Empirical Likelihood Inference of Residual Life Regression Ying Qing Chen Yichuan
More informationAFT Models and Empirical Likelihood
AFT Models and Empirical Likelihood Mai Zhou Department of Statistics, University of Kentucky Collaborators: Gang Li (UCLA); A. Bathke; M. Kim (Kentucky) Accelerated Failure Time (AFT) models: Y = log(t
More informationLEAST ABSOLUTE DEVIATIONS ESTIMATION FOR THE ACCELERATED FAILURE TIME MODEL
Statistica Sinica 17(2007), 1533-1548 LEAST ABSOLUTE DEVIATIONS ESTIMATION FOR THE ACCELERATED FAILURE TIME MODEL Jian Huang 1, Shuangge Ma 2 and Huiliang Xie 1 1 University of Iowa and 2 Yale University
More informationSTAT 331. Accelerated Failure Time Models. Previously, we have focused on multiplicative intensity models, where
STAT 331 Accelerated Failure Time Models Previously, we have focused on multiplicative intensity models, where h t z) = h 0 t) g z). These can also be expressed as H t z) = H 0 t) g z) or S t z) = e Ht
More informationPublished online: 10 Apr 2012.
This article was downloaded by: Columbia University] On: 23 March 215, At: 12:7 Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 172954 Registered office: Mortimer
More informationFULL LIKELIHOOD INFERENCES IN THE COX MODEL
October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach
More informationOn the Breslow estimator
Lifetime Data Anal (27) 13:471 48 DOI 1.17/s1985-7-948-y On the Breslow estimator D. Y. Lin Received: 5 April 27 / Accepted: 16 July 27 / Published online: 2 September 27 Springer Science+Business Media,
More informationPENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA
PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA Kasun Rathnayake ; A/Prof Jun Ma Department of Statistics Faculty of Science and Engineering Macquarie University
More informationOther Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model
Other Survival Models (1) Non-PH models We briefly discussed the non-proportional hazards (non-ph) model λ(t Z) = λ 0 (t) exp{β(t) Z}, where β(t) can be estimated by: piecewise constants (recall how);
More informationApproximate Median Regression via the Box-Cox Transformation
Approximate Median Regression via the Box-Cox Transformation Garrett M. Fitzmaurice,StuartR.Lipsitz, and Michael Parzen Median regression is used increasingly in many different areas of applications. The
More informationEstimation and Inference of Quantile Regression. for Survival Data under Biased Sampling
Estimation and Inference of Quantile Regression for Survival Data under Biased Sampling Supplementary Materials: Proofs of the Main Results S1 Verification of the weight function v i (t) for the lengthbiased
More informationSemiparametric analysis of transformation models with censored data
Biometrika (22), 89, 3, pp. 659 668 22 Biometrika Trust Printed in Great Britain Semiparametric analysis of transformation models with censored data BY KANI CHEN Department of Mathematics, Hong Kong University
More informationQuantile Regression for Residual Life and Empirical Likelihood
Quantile Regression for Residual Life and Empirical Likelihood Mai Zhou email: mai@ms.uky.edu Department of Statistics, University of Kentucky, Lexington, KY 40506-0027, USA Jong-Hyeon Jeong email: jeong@nsabp.pitt.edu
More informationSTAT331. Cox s Proportional Hazards Model
STAT331 Cox s Proportional Hazards Model In this unit we introduce Cox s proportional hazards (Cox s PH) model, give a heuristic development of the partial likelihood function, and discuss adaptations
More informationPower and Sample Size Calculations with the Additive Hazards Model
Journal of Data Science 10(2012), 143-155 Power and Sample Size Calculations with the Additive Hazards Model Ling Chen, Chengjie Xiong, J. Philip Miller and Feng Gao Washington University School of Medicine
More informationSIMULTANEOUS CONFIDENCE BANDS FOR THE PTH PERCENTILE AND THE MEAN LIFETIME IN EXPONENTIAL AND WEIBULL REGRESSION MODELS. Ping Sa and S.J.
SIMULTANEOUS CONFIDENCE BANDS FOR THE PTH PERCENTILE AND THE MEAN LIFETIME IN EXPONENTIAL AND WEIBULL REGRESSION MODELS " # Ping Sa and S.J. Lee " Dept. of Mathematics and Statistics, U. of North Florida,
More informationSmoothed and Corrected Score Approach to Censored Quantile Regression With Measurement Errors
Smoothed and Corrected Score Approach to Censored Quantile Regression With Measurement Errors Yuanshan Wu, Yanyuan Ma, and Guosheng Yin Abstract Censored quantile regression is an important alternative
More informationy(x) = x w + ε(x), (1)
Linear regression We are ready to consider our first machine-learning problem: linear regression. Suppose that e are interested in the values of a function y(x): R d R, here x is a d-dimensional vector-valued
More informationCHAPTER 3 THE COMMON FACTOR MODEL IN THE POPULATION. From Exploratory Factor Analysis Ledyard R Tucker and Robert C. MacCallum
CHAPTER 3 THE COMMON FACTOR MODEL IN THE POPULATION From Exploratory Factor Analysis Ledyard R Tucker and Robert C. MacCallum 1997 19 CHAPTER 3 THE COMMON FACTOR MODEL IN THE POPULATION 3.0. Introduction
More informationEfficiency Comparison Between Mean and Log-rank Tests for. Recurrent Event Time Data
Efficiency Comparison Between Mean and Log-rank Tests for Recurrent Event Time Data Wenbin Lu Department of Statistics, North Carolina State University, Raleigh, NC 27695 Email: lu@stat.ncsu.edu Summary.
More informationEfficient Estimation of Censored Linear Regression Model
2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 24 25 26 27 28 29 3 3 32 33 34 35 36 37 38 39 4 4 42 43 44 45 46 47 48 Biometrika (2), xx, x, pp. 4 C 28 Biometrika Trust Printed in Great Britain Efficient Estimation
More informationA Resampling Method on Pivotal Estimating Functions
A Resampling Method on Pivotal Estimating Functions Kun Nie Biostat 277,Winter 2004 March 17, 2004 Outline Introduction A General Resampling Method Examples - Quantile Regression -Rank Regression -Simulation
More informationCompeting risks data analysis under the accelerated failure time model with missing cause of failure
Ann Inst Stat Math 2016 68:855 876 DOI 10.1007/s10463-015-0516-y Competing risks data analysis under the accelerated failure time model with missing cause of failure Ming Zheng Renxin Lin Wen Yu Received:
More informationlog T = β T Z + ɛ Zi Z(u; β) } dn i (ue βzi ) = 0,
Accelerated failure time model: log T = β T Z + ɛ β estimation: solve where S n ( β) = n i=1 { Zi Z(u; β) } dn i (ue βzi ) = 0, Z(u; β) = j Z j Y j (ue βz j) j Y j (ue βz j) How do we show the asymptotics
More informationST745: Survival Analysis: Nonparametric methods
ST745: Survival Analysis: Nonparametric methods Eric B. Laber Department of Statistics, North Carolina State University February 5, 2015 The KM estimator is used ubiquitously in medical studies to estimate
More informationA TWO-STAGE LINEAR MIXED-EFFECTS/COX MODEL FOR LONGITUDINAL DATA WITH MEASUREMENT ERROR AND SURVIVAL
A TWO-STAGE LINEAR MIXED-EFFECTS/COX MODEL FOR LONGITUDINAL DATA WITH MEASUREMENT ERROR AND SURVIVAL Christopher H. Morrell, Loyola College in Maryland, and Larry J. Brant, NIA Christopher H. Morrell,
More informationAnalysis of least absolute deviation
Analysis of least absolute deviation By KANI CHEN Department of Mathematics, HKUST, Kowloon, Hong Kong makchen@ust.hk ZHILIANG YING Department of Statistics, Columbia University, NY, NY, 10027, U.S.A.
More informationUNIVERSITY OF CALIFORNIA, SAN DIEGO
UNIVERSITY OF CALIFORNIA, SAN DIEGO Estimation of the primary hazard ratio in the presence of a secondary covariate with non-proportional hazards An undergraduate honors thesis submitted to the Department
More informationSimple and Fast Overidentified Rank Estimation for Right-Censored Length-Biased Data and Backward Recurrence Time
Biometrics 74, 77 85 March 2018 DOI: 10.1111/biom.12727 Simple and Fast Overidentified Rank Estimation for Right-Censored Length-Biased Data and Backward Recurrence Time Yifei Sun, 1, * Kwun Chuen Gary
More informationHypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations
Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Takeshi Emura and Hisayuki Tsukuma Abstract For testing the regression parameter in multivariate
More informationAdaptive Noise Cancellation
Adaptive Noise Cancellation P. Comon and V. Zarzoso January 5, 2010 1 Introduction In numerous application areas, including biomedical engineering, radar, sonar and digital communications, the goal is
More informationPower-Transformed Linear Quantile Regression With Censored Data
Power-Transformed Linear Quantile Regression With Censored Data Guosheng YIN, Donglin ZENG, and Hui LI We propose a class of power-transformed linear quantile regression models for survival data subject
More informationSemiparametric Regression
Semiparametric Regression Patrick Breheny October 22 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/23 Introduction Over the past few weeks, we ve introduced a variety of regression models under
More informationAnalysis of transformation models with censored data
Biometrika (1995), 82,4, pp. 835-45 Printed in Great Britain Analysis of transformation models with censored data BY S. C. CHENG Department of Biomathematics, M. D. Anderson Cancer Center, University of
More informationEfficiency of Profile/Partial Likelihood in the Cox Model
Efficiency of Profile/Partial Likelihood in the Cox Model Yuichi Hirose School of Mathematics, Statistics and Operations Research, Victoria University of Wellington, New Zealand Summary. This paper shows
More informationQuantile Regression for Recurrent Gap Time Data
Biometrics 000, 1 21 DOI: 000 000 0000 Quantile Regression for Recurrent Gap Time Data Xianghua Luo 1,, Chiung-Yu Huang 2, and Lan Wang 3 1 Division of Biostatistics, School of Public Health, University
More informationA comparison study of the nonparametric tests based on the empirical distributions
통계연구 (2015), 제 20 권제 3 호, 1-12 A comparison study of the nonparametric tests based on the empirical distributions Hyo-Il Park 1) Abstract In this study, we propose a nonparametric test based on the empirical
More informationIntroduction to Statistical Analysis
Introduction to Statistical Analysis Changyu Shen Richard A. and Susan F. Smith Center for Outcomes Research in Cardiology Beth Israel Deaconess Medical Center Harvard Medical School Objectives Descriptive
More informationWorking Paper No Maximum score type estimators
Warsaw School of Economics Institute of Econometrics Department of Applied Econometrics Department of Applied Econometrics Working Papers Warsaw School of Economics Al. iepodleglosci 64 02-554 Warszawa,
More informationDouble Bootstrap Confidence Interval Estimates with Censored and Truncated Data
Journal of Modern Applied Statistical Methods Volume 13 Issue 2 Article 22 11-2014 Double Bootstrap Confidence Interval Estimates with Censored and Truncated Data Jayanthi Arasan University Putra Malaysia,
More informationOptimal Treatment Regimes for Survival Endpoints from a Classification Perspective. Anastasios (Butch) Tsiatis and Xiaofei Bai
Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective Anastasios (Butch) Tsiatis and Xiaofei Bai Department of Statistics North Carolina State University 1/35 Optimal Treatment
More informationSample size calculations for logistic and Poisson regression models
Biometrika (2), 88, 4, pp. 93 99 2 Biometrika Trust Printed in Great Britain Sample size calculations for logistic and Poisson regression models BY GWOWEN SHIEH Department of Management Science, National
More informationLinear life expectancy regression with censored data
Linear life expectancy regression with censored data By Y. Q. CHEN Program in Biostatistics, Division of Public Health Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington 98109, U.S.A.
More informationAttributable Risk Function in the Proportional Hazards Model
UW Biostatistics Working Paper Series 5-31-2005 Attributable Risk Function in the Proportional Hazards Model Ying Qing Chen Fred Hutchinson Cancer Research Center, yqchen@u.washington.edu Chengcheng Hu
More informationIntroduction to Empirical Processes and Semiparametric Inference Lecture 01: Introduction and Overview
Introduction to Empirical Processes and Semiparametric Inference Lecture 01: Introduction and Overview Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations
More informationSurvival Analysis I (CHL5209H)
Survival Analysis Dalla Lana School of Public Health University of Toronto olli.saarela@utoronto.ca January 7, 2015 31-1 Literature Clayton D & Hills M (1993): Statistical Models in Epidemiology. Not really
More informationSurvival Analysis for Case-Cohort Studies
Survival Analysis for ase-ohort Studies Petr Klášterecký Dept. of Probability and Mathematical Statistics, Faculty of Mathematics and Physics, harles University, Prague, zech Republic e-mail: petr.klasterecky@matfyz.cz
More informationRonald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California
Texts in Statistical Science Bayesian Ideas and Data Analysis An Introduction for Scientists and Statisticians Ronald Christensen University of New Mexico Albuquerque, New Mexico Wesley Johnson University
More informationMAS3301 / MAS8311 Biostatistics Part II: Survival
MAS3301 / MAS8311 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-10 1 13 The Cox proportional hazards model 13.1 Introduction In the
More informationGROUPED SURVIVAL DATA. Florida State University and Medical College of Wisconsin
FITTING COX'S PROPORTIONAL HAZARDS MODEL USING GROUPED SURVIVAL DATA Ian W. McKeague and Mei-Jie Zhang Florida State University and Medical College of Wisconsin Cox's proportional hazard model is often
More informationBivariate Uniqueness in the Logistic Recursive Distributional Equation
Bivariate Uniqueness in the Logistic Recursive Distributional Equation Antar Bandyopadhyay Technical Report # 629 University of California Department of Statistics 367 Evans Hall # 3860 Berkeley CA 94720-3860
More informationSurvival Analysis Math 434 Fall 2011
Survival Analysis Math 434 Fall 2011 Part IV: Chap. 8,9.2,9.3,11: Semiparametric Proportional Hazards Regression Jimin Ding Math Dept. www.math.wustl.edu/ jmding/math434/fall09/index.html Basic Model Setup
More informationSimulation-based robust IV inference for lifetime data
Simulation-based robust IV inference for lifetime data Anand Acharya 1 Lynda Khalaf 1 Marcel Voia 1 Myra Yazbeck 2 David Wensley 3 1 Department of Economics Carleton University 2 Department of Economics
More informationCensored quantile regression with varying coefficients
Title Censored quantile regression with varying coefficients Author(s) Yin, G; Zeng, D; Li, H Citation Statistica Sinica, 2013, p. 1-24 Issued Date 2013 URL http://hdl.handle.net/10722/189454 Rights Statistica
More informationEMPIRICAL LIKELIHOOD ANALYSIS FOR THE HETEROSCEDASTIC ACCELERATED FAILURE TIME MODEL
Statistica Sinica 22 (2012), 295-316 doi:http://dx.doi.org/10.5705/ss.2010.190 EMPIRICAL LIKELIHOOD ANALYSIS FOR THE HETEROSCEDASTIC ACCELERATED FAILURE TIME MODEL Mai Zhou 1, Mi-Ok Kim 2, and Arne C.
More informationQuantile methods. Class Notes Manuel Arellano December 1, Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be
Quantile methods Class Notes Manuel Arellano December 1, 2009 1 Unconditional quantiles Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be Q τ (Y ) q τ F 1 (τ) =inf{r : F
More informationA note on profile likelihood for exponential tilt mixture models
Biometrika (2009), 96, 1,pp. 229 236 C 2009 Biometrika Trust Printed in Great Britain doi: 10.1093/biomet/asn059 Advance Access publication 22 January 2009 A note on profile likelihood for exponential
More informationEstimation of Conditional Kendall s Tau for Bivariate Interval Censored Data
Communications for Statistical Applications and Methods 2015, Vol. 22, No. 6, 599 604 DOI: http://dx.doi.org/10.5351/csam.2015.22.6.599 Print ISSN 2287-7843 / Online ISSN 2383-4757 Estimation of Conditional
More informationNuisance Parameter Estimation in Survival Models
Nuisance Parameter Estimation in Survival Models A Dissertation submitted to the faculty of the University of Minnesota by Andrew Wey In partial fulfillment of the requirements for the degree of Doctor
More informationOn a connection between the Bradley-Terry model and the Cox proportional hazards model
On a connection between the Bradley-Terry model and the Cox proportional hazards model Yuhua Su and Mai Zhou Department of Statistics University of Kentucky Lexington, KY 40506-0027, U.S.A. SUMMARY This
More informationStat 8112 Lecture Notes Weak Convergence in Metric Spaces Charles J. Geyer January 23, Metric Spaces
Stat 8112 Lecture Notes Weak Convergence in Metric Spaces Charles J. Geyer January 23, 2013 1 Metric Spaces Let X be an arbitrary set. A function d : X X R is called a metric if it satisfies the folloing
More informationThe Restricted Likelihood Ratio Test at the Boundary in Autoregressive Series
The Restricted Likelihood Ratio Test at the Boundary in Autoregressive Series Willa W. Chen Rohit S. Deo July 6, 009 Abstract. The restricted likelihood ratio test, RLRT, for the autoregressive coefficient
More informationIssues on quantile autoregression
Issues on quantile autoregression Jianqing Fan and Yingying Fan We congratulate Koenker and Xiao on their interesting and important contribution to the quantile autoregression (QAR). The paper provides
More informationGoodness-Of-Fit for Cox s Regression Model. Extensions of Cox s Regression Model. Survival Analysis Fall 2004, Copenhagen
Outline Cox s proportional hazards model. Goodness-of-fit tools More flexible models R-package timereg Forthcoming book, Martinussen and Scheike. 2/38 University of Copenhagen http://www.biostat.ku.dk
More informationFULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH
FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH Jian-Jian Ren 1 and Mai Zhou 2 University of Central Florida and University of Kentucky Abstract: For the regression parameter
More informationA COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky
A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky Empirical likelihood with right censored data were studied by Thomas and Grunkmier (1975), Li (1995),
More informationHarvard University. Harvard University Biostatistics Working Paper Series
Harvard University Harvard University Biostatistics Working Paper Series Year 2008 Paper 94 The Highest Confidence Density Region and Its Usage for Inferences about the Survival Function with Censored
More informationLecture 22 Survival Analysis: An Introduction
University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 22 Survival Analysis: An Introduction There is considerable interest among economists in models of durations, which
More informationUnit 14: Nonparametric Statistical Methods
Unit 14: Nonparametric Statistical Methods Statistics 571: Statistical Methods Ramón V. León 8/8/2003 Unit 14 - Stat 571 - Ramón V. León 1 Introductory Remarks Most methods studied so far have been based
More informationThe Accelerated Failure Time Model Under Biased. Sampling
The Accelerated Failure Time Model Under Biased Sampling Micha Mandel and Ya akov Ritov Department of Statistics, The Hebrew University of Jerusalem, Israel July 13, 2009 Abstract Chen (2009, Biometrics)
More informationModel Selection, Estimation, and Bootstrap Smoothing. Bradley Efron Stanford University
Model Selection, Estimation, and Bootstrap Smoothing Bradley Efron Stanford University Estimation After Model Selection Usually: (a) look at data (b) choose model (linear, quad, cubic...?) (c) fit estimates
More informationCALCULATION METHOD FOR NONLINEAR DYNAMIC LEAST-ABSOLUTE DEVIATIONS ESTIMATOR
J. Japan Statist. Soc. Vol. 3 No. 200 39 5 CALCULAION MEHOD FOR NONLINEAR DYNAMIC LEAS-ABSOLUE DEVIAIONS ESIMAOR Kohtaro Hitomi * and Masato Kagihara ** In a nonlinear dynamic model, the consistency and
More informationLinear Regression Linear Regression with Shrinkage
Linear Regression Linear Regression ith Shrinkage Introduction Regression means predicting a continuous (usually scalar) output y from a vector of continuous inputs (features) x. Example: Predicting vehicle
More informationStatistics 262: Intermediate Biostatistics Non-parametric Survival Analysis
Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Jonathan Taylor & Kristin Cobb Statistics 262: Intermediate Biostatistics p.1/?? Overview of today s class Kaplan-Meier Curve
More informationCHARACTERIZATION OF ULTRASONIC IMMERSION TRANSDUCERS
CHARACTERIZATION OF ULTRASONIC IMMERSION TRANSDUCERS INTRODUCTION David D. Bennink, Center for NDE Anna L. Pate, Engineering Science and Mechanics Ioa State University Ames, Ioa 50011 In any ultrasonic
More informationAnalysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems
Analysis of the AIC Statistic for Optimal Detection of Small Changes in Dynamic Systems Jeremy S. Conner and Dale E. Seborg Department of Chemical Engineering University of California, Santa Barbara, CA
More informationEnhancing Generalization Capability of SVM Classifiers with Feature Weight Adjustment
Enhancing Generalization Capability of SVM Classifiers ith Feature Weight Adjustment Xizhao Wang and Qiang He College of Mathematics and Computer Science, Hebei University, Baoding 07002, Hebei, China
More informationBIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY
BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY Ingo Langner 1, Ralf Bender 2, Rebecca Lenz-Tönjes 1, Helmut Küchenhoff 2, Maria Blettner 2 1
More informationRank Regression with Normal Residuals using the Gibbs Sampler
Rank Regression with Normal Residuals using the Gibbs Sampler Stephen P Smith email: hucklebird@aol.com, 2018 Abstract Yu (2000) described the use of the Gibbs sampler to estimate regression parameters
More informationA nonparametric two-sample wald test of equality of variances
University of Wollongong Research Online Faculty of Informatics - Papers (Archive) Faculty of Engineering and Information Sciences 211 A nonparametric two-sample wald test of equality of variances David
More informationComparing Distribution Functions via Empirical Likelihood
Georgia State University ScholarWorks @ Georgia State University Mathematics and Statistics Faculty Publications Department of Mathematics and Statistics 25 Comparing Distribution Functions via Empirical
More informationMax-Margin Ratio Machine
JMLR: Workshop and Conference Proceedings 25:1 13, 2012 Asian Conference on Machine Learning Max-Margin Ratio Machine Suicheng Gu and Yuhong Guo Department of Computer and Information Sciences Temple University,
More informationA turbulence closure based on the maximum entropy method
Advances in Fluid Mechanics IX 547 A turbulence closure based on the maximum entropy method R. W. Derksen Department of Mechanical and Manufacturing Engineering University of Manitoba Winnipeg Canada Abstract
More informationA Bootstrap Test for Conditional Symmetry
ANNALS OF ECONOMICS AND FINANCE 6, 51 61 005) A Bootstrap Test for Conditional Symmetry Liangjun Su Guanghua School of Management, Peking University E-mail: lsu@gsm.pku.edu.cn and Sainan Jin Guanghua School
More informationSimulating Properties of the Likelihood Ratio Test for a Unit Root in an Explosive Second Order Autoregression
Simulating Properties of the Likelihood Ratio est for a Unit Root in an Explosive Second Order Autoregression Bent Nielsen Nuffield College, University of Oxford J James Reade St Cross College, University
More informationWhat s New in Econometrics? Lecture 14 Quantile Methods
What s New in Econometrics? Lecture 14 Quantile Methods Jeff Wooldridge NBER Summer Institute, 2007 1. Reminders About Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile Regression
More informationA Non-parametric bootstrap for multilevel models
A Non-parametric bootstrap for multilevel models By James Carpenter London School of Hygiene and ropical Medicine Harvey Goldstein and Jon asbash Institute of Education 1. Introduction Bootstrapping is
More informationCensored partial regression
Biostatistics (2003), 4, 1,pp. 109 121 Printed in Great Britain Censored partial regression JESUS ORBE, EVA FERREIRA, VICENTE NÚÑEZ-ANTÓN Departamento de Econometría y Estadística, Facultad de Ciencias
More informationMultivariate Survival Analysis
Multivariate Survival Analysis Previously we have assumed that either (X i, δ i ) or (X i, δ i, Z i ), i = 1,..., n, are i.i.d.. This may not always be the case. Multivariate survival data can arise in
More informationCOMPARING TRANSFORMATIONS USING TESTS OF SEPARATE FAMILIES. Department of Biostatistics, University of North Carolina at Chapel Hill, NC.
COMPARING TRANSFORMATIONS USING TESTS OF SEPARATE FAMILIES by Lloyd J. Edwards and Ronald W. Helms Department of Biostatistics, University of North Carolina at Chapel Hill, NC. Institute of Statistics
More informationPairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion
Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion Glenn Heller and Jing Qin Department of Epidemiology and Biostatistics Memorial
More informationResearch Article A Nonparametric Two-Sample Wald Test of Equality of Variances
Advances in Decision Sciences Volume 211, Article ID 74858, 8 pages doi:1.1155/211/74858 Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances David Allingham 1 andj.c.w.rayner
More informationHomework 6 Solutions
Homeork 6 Solutions Igor Yanovsky (Math 151B TA) Section 114, Problem 1: For the boundary-value problem y (y ) y + log x, 1 x, y(1) 0, y() log, (1) rite the nonlinear system and formulas for Neton s method
More information