Outline. Cox's regression model Goodness-of-t methods. Cox's proportional hazards model: Survival analysis

Size: px
Start display at page:

Download "Outline. Cox's regression model Goodness-of-t methods. Cox's proportional hazards model: Survival analysis"

Transcription

1 Outline Cox's regression model Goodness-of-t methods Giuliana Cortese Advanced Survival Analysis 21 Copenhagen Cox's proportional hazards model: assumptions Goodness-of-t for Cox's model: graphical procedures and tests. Possible solutions to lack-of-t and extensions of Cox's model. Survival analysis Standard setup for right-censored survival data. T1,..., T i,..., T n are i.i.d. copies of (T, D) where T = T C D = I (T C) with T the true survival time and C the (potential) censoring time. Hazard-function Counting process Martingale 1 α(t) = lim h h P(t T < t + h T t, F ). N i(t) = I (T i t, D i = 1) M i(t) = N i(t) Λ i(t) where Λ i(t) = Y i(s)α(s) ds, Y i(t) = I (t T i), (compensator) (at risk process). Cox's proportional hazards model One studies a covariate vector: X i(t) (p-dimensional). Hazard conditional on covariates: α i(t, X i). Cox's proportional hazards model: α i(t) = α(t) exp (β T X i) where α(t) is unspecied baseline hazard (hazard for X i = ). This model has the nice property that the hazards ratios are constant over time. For a covariate X1 (other covariates being xed) then the relative risk α(t, X1 + 1) = exp (β1) α(t, X1) is not depending on time (key assumption)!

2 Cox's proportional hazards model Assumptions in Cox's model are proportional hazards (regression coecients are time-constant), functional form of continuous covariates, Examples: X (linear), X 2, log(x ), link function between hazard and linear predictor (log-linearity). If these conditions are not violated, then Cox's model ts our data well. One needs to check goodness-of-t of a regression model! Usually each assumption is tested at a time, while assuming the other two hold true (dicult to test simultaneously!). PBC data (primary biliary cirrhosis): 418 patients are followed until death or censoring. PBC is a fatal chronic liver disease. Important explanatory variables: Age Albumin Bilirubin Edema Prothrombin time Fitting Cox's model in R. Graphical procedure for testing proportional hazards > library(survival) > data(pbc) > attach(pbc) > cbind(time,status,age,edema,bili,protime,alb)[1:5,] time status age edema bili protime alb [1,] [2,] [3,] [4,] [5,] > sum(status) [1] 161 > fit.pbc<-coxph(surv(time/365, status) ~ age+edema+log(bili)+log(protime)+log(alb)) > fit.pbc Call: coxph(formula = Surv(time/365, status) ~ age +edema +log(bili) +log(protime) +log(alb)) coef exp(coef) se(coef) z p e-7 age edema e-3 log(bili) e+ log(protime) e-3 log(alb) e-4 Likelihood ratio test=234 on 5 df, p= n= 418 Stratify according to X1, categorical with k levels, λ(t) = Y (t)λk(t) exp(β2x2(t) + + β px p(t)), with k = 1,..., K denoting the K possible strata. Now, if Cox's model is valid the stratied baselines will be (approximately) proportional λk(t) = λ(t)θ k. Graphical representation of log(ˆλk(t)) for k = 1,..., K! If hazards are proportional, plots of estimated curves versus t should be approximately parallel. It helps to plot dierences log(ˆλk(t)) log(ˆλ1(t)).

3 Graphical procedure Graphical procedure with condence intervals Edema Alb: 1.quartile Log(Cumulative baseline hazard) Log(Cumulative baseline hazard) Difference in log cumulative hazard Difference in log cumulative hazard Alb: 2.quartile Difference in log cumulative hazard Difference in log cumulative hazard Alb: 3.quartile Graphical procedure with condence bands broken=pointwise, Red = correct uniform band, blue = without variation in Cox estimate Drawbacks of graphical procedure Difference in log cumulative hazard betahat Edema Graphical methods based on cumulative hazards are useful but also have some limitations: They can suggest type of deviation from proportionality, only if a single covariate is in Cox model. Continuous covariates need to be categorized arbitrarily. How much can estimated curves deviate from being parallel? Dicult to see in practice! Help in plotting dierences log(ˆλk(t)) log(ˆλ1(t)). Useful to plot condence intervals/bands, but hard in practice. It is not simple to work with asymptotic properties of ˆΛk(t), but i.i.d. decomposition can be used to do this by resampling techniques

4 Alternative methods for checking proportionality Cumulative martingale residuals There exists alternative methods based on both graphical procedures and test. Graphical procedures based on Schoenfeld residuals, smoothing. Tests ad hoc where specic deviations from proportionality are veried ( β1(t) = β1 + θlog(t) ), coe. being time-constant if θ = ). Tests and plots based on cumulative martingale residuals. As alternative: Cumulative martingale residuals (Lin et al., 1993). The martingales under the Cox regression model can be written as M i(t) = N i(t) Y i(s) exp(x T β)dλ(s) T ˆM i(t) = N i(t) Y i(s) exp(x ˆβ)d ˆΛ(s) = N i(t) Y i(s) exp(x T i (s) ˆβ) 1 dn (s). S(s, ˆβ) One idea is now to look at dierent groupings of the these residuals and see if they behave as they should under the model. Cumulative martingale residuals Cumulative martingale residuals: iid decomposition The score function, evaluated in the estimate ˆβ and seen as a function of time (observed score process), can be written as U( ˆβ, t) = X i(s)d ˆMi(s) = (X i(s) E(t, ˆβ))d ˆMi(s). where S(t, β) = Yi (t) exp(x T i (t)β), S1(t, β) = Yi (t) exp(x T i (t)β)xi (t), S2(t, β) = Yi (t) exp(x T i (t)β)xi (t) 2 S1(t, β) E(t, β) = S(t, β). and with minus the derivative of the score ( ) S2(s, β) I (t, β) = E(s, β) 2 dn i(s) S(s, β) n 1/2 U( ˆβ, t) is asymptotically equivalent to the process n 1/2 W (t, β) = n (M1(t) 1/2 I (t, ˆβ) I 1 (τ, ˆβ) ) M1(τ), where M1(t) = M1i(t) = (X i(s) e(s, β)) dm i(s) with e(t, β) = lim p E(t, β). M1(t) is a sum of iid terms with mean zero, and n 1/2 W (t, β) converges to a zero-mean Gaussian process.

5 Cumulative martingale residuals: resampling The asymptotic distribution of n 1/2 W (t, β) can be obtained by resampling procedures. The distribution of the process n 1/2 M1(t) (t [, τ]) is asymptotically equivalent to n 1/2 (Xi (s) E(s, ˆβ))dNi (s)gi where G1,..., Gn are independent standard normals. Since Mi 's are i.i.d. with variance E(Ni ), they are equivalent to processes Ni Gi with known distribution. Alternatively to this resampling approach, it is also shown (Lin et al., 2) that n 1/2 M1(t) is asympt. equivalent to n 1/2 ˆM1i (t)gi, with ˆM1i (t) = (Xi (s) E(s, ˆβ))d ˆMi (s). Therefore it can be shown that n 1/2 U( ˆβ, t) is asymptotically equivalent to ( n 1/2 Ŵ (t, ˆβ) = n 1/2 ˆM1i (t) I (t, ˆβ) I 1 (τ, ˆβ) ˆM1i (τ)) for almost any sequence of the counting processes that determines ˆM and I. Gi, Cumulative martingale residuals: test for proportional hazards A summary test statistic can be sup Uj( ˆβ, t) or t [,τ] sup t [δ,τ δ] Uj( ˆβ, t) var(uj( ˆ ˆβ, t)) = sup t [δ,τ δ] U w j ( ˆβ, t) j = 1..., p, where δ is a small positive number to avoid numerical problems at the edges, and var(u( ˆ ˆβ, t)) = ( ˆM1i (t) I (t, ˆβ)I 1 (τ, ˆβ) ) 2 ˆM1i (τ), is a consistent estimator of the variance of the observed score process (but also other choices). Graphical procedures can suggest type of departure from proportionality: plots of Uj( ˆβ, t) or U w j ( ˆβ, t) versus t, for j = 1,..., p, together with simulated processes Ŵ h(t, ˆβ) or Ŵ h(t, ˆβ)/ var(uj( ˆ ˆβ, t)) (h = 1,..., 1) under the model. > library(timereg) > fit<-cox.aalen(surv(time/365,status==2)~ +prop(edema)+prop(logbili)+ + +, weighted.score=,pbc) Right censored survival times, Cox-Aalen Survival Model, Num Simulations = 5 > summary(ourcox) Score Test for Proportionality sup hat U(t) p-value H_ prop(edema) prop(logbili) prop(edema) prop(logbili) > plot(fitw,score=t,xlab="",ylab="") > fitw<-cox.aalen(surv(time/365,status==2)~ +prop(edema)+prop(logbili)+ + +, weighted.score=1,pbc) > summary(ourcox) Score Test for Proportionality sup hat U(t) p-value H_ prop(edema) prop(logbili) > plot(fitw,score=t,xlab="",ylab="")

6 Schoenfeld residuals With prop(edema) prop(logbili) { } 2 S2(t, β) S1(t, E(t, β) = S1(t, β)/s(t, β); V (t, β) = S(t, β) β), S(t, β) the Schoenfeld residuals (Schoenfeld,1982) are dened as r k(β) = X (k)(t k) E(t k, β), and the scaled Schoenfeld residuals (Grambsch and Therneau, 1994) as r k = V 1 (t k, β)r k(β) with X (k) the covariate vector of the subject failing at t k. Given Cox estimates ˆβ, if proportional hazards hold, the estimated residuals ˆr = r ( ˆβ) have E(ˆr ) = k. k k k Plots of ˆr kj versus t k (j = 1,..., p) should be centered around the zero line. Schoenfeld residuals Schoenfeld residuals Assume a Cox model with time-varying coecients of type β j(t) = β j + θ jg j(t), j = 1,..., p, (1) with g j(t) as known functions (predictable). Interest in testing H : θ j = (β j(t) = β j) for all covariates j. Under (1), using a Taylor-series expansion of E(t, β(t)) about β(t) = β, it is E(rkj(β)) θ jg j(t), j. It follows that E(ˆr kj) + ˆβ j β j(t). Plots and smoothing of (r + ˆβ) gives a way of estimating β(t). k ˆθ can be obtained as one-step Newton-Raphson maximum likelihood estimate, based on the constant start β(t) = ˆβ. Under a Cox's model with β(t) = β + θg(t), standard asymptotics are still valid with parameter (β, θ) and covariates (Xi, Xi g(t)). The score test statistic gives many goodness-of-t tests for H : θj =, for dierent choices of g(t). The score function around (ˆβ, ) becomes U( ˆβ, ) = g(t)(xi E( ˆβ, s))dni (s), i therefore if this score is equal to for all g(t), it follows that the cumulative Schoenfeld residuals and the Schoenfeld residuals must be (Xi E( ˆβ, s))dni (s) = t or (Xi E( ˆβ, s))dni (s) =. Under the null, the score statistic is T (g(t)) = U( ˆβ, )J 1 22 U( ˆβ, ), with J22( ˆβ, ) being the block for θ in the observed information matrix. What if H : θj = is not rejected for some choices of g(t)?

7 > library(survival) > fit.pbc <- coxph(surv(time/365, status==2) ~ age+as.factor(edema) +log(bili)+ + log(protime)+log(albumin)) > stest= cox.zph(fit.pbc,transform="log") > stest > par(mfrow=c(2,3)) > for (i in 1:6) { > plot(stest[i]) > abline(,, lty=3) > } rho chisq p e age as.factor(edema) e as.factor(edema) e log(bili) e log(protime) e+.24 log(albumin) e GLOBAL NA 1.57e > plot(stest$x, stest$y[,2]) Add the linear fit of regression line r^*_k2 ~ log(time) > abline(lm(stest$y[,2] ~ stest$x)$coefficients, lty=4, col=3) Beta(t) for log(bili) Beta(t) for age Beta(t) for as.factor(edema) Beta(t) for log(protime) Beta(t) for as.factor(edema) Beta(t) for log(albumin) for checking other assumptions Lin et.al (1993) suggested a more general cumulated sum over t and z: Mc(t, z) = K T z (s)d ˆM(s) where Kz(t) = f (Xi ) I (Xi (t) z). Some special cases of Mc(t, z) are: Global model assessment. Kz(t) is an n 1 matrix with elements I (Xi1(t) z) for i = 1,..., n focusing here on the continuous covariate X1. Thus one has cumulative residuals versus both time and the covariate values. Checking functional form of covariates. A 1-dimensional process can be considered: τ Mc(z) = K T z (t)d ˆM(t), (2) where Kz(t) has still elements I (Xi1(t) z). Plots of Mc(z) versus z can reveal if the functional form is appropriate. Checking the link function As in (2), but Kz(t) is an n 1 matrix with elements I (X T i (t) ˆβ z). These processes can also be resampled. Results can be used for graphical procedures and for estimating p-values for supremum tests. PBC data: To show if the there are problems for the levels of a continuous covariate over time one may also group the residuals after its level. > fit.pbc <- cox.aalen(surv(time/365, status==2) ~ +prop(edema)+prop(logbili)++, + residuals=1, n.sim=,pbc) > X <- model.matrix( ~ -1+ cut(bili,quantile(bili),include.lowest=t),pbc) > colnames(x) <- c("1.quartile","2.quartile","3.quartile","4.quartile") > resids <- cum.residuals(fit.pbc,pbc,x,n.sim=1,cum.resid=1) > summary(resids) Test for cumulative MG-residuals Grouped Residuals consistent with model sup hat B(t) p-value H_: B(t)= quartile 2. quartile quartile quartile int ( B(t) )^2 dt p-value H_: B(t)= quartile 2. quartile quartile quartile Residual versus covariates consistent with model sup hat B(t) p-value H_: B(t)= prop(edema) prop(logbili)

8 PBC data: levels of Bilirubin PBC data: martingale residuals for functional form 1. quartile 2. quartile prop(logbili) quartile 4. quartile prop(logbili) PBC data: martingale residuals for functional form PBC data: levels of Bilirubin > fit.pbc <- cox.aalen(surv(time/365, status==2) ~ +prop(edema)+prop(bili)+ + +, residuals=1, n.sim=,pbc) 1. quartile 2. quartile > resids <- cum.residuals(fit.pbc,pbc,x,n.sim=1,cum.resid=1) > summary(resids) Test for cumulative MG-residuals Grouped Residuals consistent with model sup hat B(t) p-value H_: B(t)= 1. quartile quartile quartile quartile int ( B(t) )^2 dt p-value H_: B(t)= 1. quartile quartile quartile 4. quartile 3. quartile quartile Residual versus covariates consistent with model sup hat B(t) p-value H_: B(t)= prop(edema) prop(bili)

9 PBC data: functional form General cumulative residuals process prop(bili) Dene a cumulative residual process by M U(t) = U T (t)d ˆM(s) = U T (t)g( ˆβ, s)dm(s) with G(β, t) = I Y (β, t)y (β, t) Y (β, t) = diag(exp(x T i (t)β))y (t), Y (β, t) = (Y T (t)diag(exp(x T i (t)β))y (t)) 1 Y T (t). prop(bili) The residual is asymptotically equivalent to [ U T (t)g(β, s)dm(s) U T (s)g(s)diag { Y (β, s)y (β, s)dn(s) } ] Z(s) ( ˆβ β) and the last expression is [ U T (s)g(s)diag { Y (β, s)y (β, s)dn(s) } Z(s) Denote the second integral in the latter display by B U(β, t). ] I 1 (β, τ)u(β, τ) + op(n 1/2 ) processes: estimated variance, IID decomposition The variance of MU(t) can be estimated by the optional variation process [MU] (t) = U T (s)g( ˆβ, s)diag(dn(s))u(s)g( ˆβ, s) + BU( ˆβ, t) [U] (τ)b T U ( ˆβ, t) BU( ˆβ, t) [MU, U] (t) [U, MU] (t)b T U ( ˆβ, t). MU(t) is asymptotically equivalent to { 1 Ui (t) U T (s)y (β, s) Y T (β, s)w (s)y (β, s)} Yi (s)dmi (s) [ U T (s)g(s)diag { Y (β, s)y (β, s)dn(s) } ] I 1 (β, τ)z(s) W1,i + op(n 1/ = Wi (t) + op(n 1/2 ), where Wi (t) are i.i.d. an W1,i is an i.i.d. decomposition of ˆβ β. Resample construction is given as Ŵi (t)gi, where G1,..., Gn are standard normals, and Ŵi (t) is obtained by using ˆMi instead of Mi. This resampled process has the same asymptotic distribution of MU(t).

10 Extensions of Cox's model First, check if an extended Cox's model with strata or/and time-dependent covariates ts well the data! (and responds to the clinical questions of interest) Cox's model can be extended as λ(t) exp(z T β(t)). It can be tted by Sieves (Murphy & Sen, 1991) or local likelihood methods (Pons, 2; Hastie & Tibshirani, 1993; Cai & Sun, 23; Tian et al., 25) or by smoothing splines (Zucker & Karr, 199). There exists dierent methods to estimate β(t) directly, or to estimate cumulative coecients B(t) = β(s)ds (Martinussen et al., 22). Spline-penalized likelihood approach : β(s) splines on [, τ] with baseline λ τ PL(t, β()) + b β (s) 2 ds, parametric problem. Extension of Cox's model Local Likelihood : β(s) C 2 [, τ] with baseline λ PL(t, β()) = τ (X T i (s)β(s) log( Y k(s)exp(x T k (s)β(s))))dni (s) i k Now, a local linear approximation of β(s) yields (for xed t) β(s) = β(t) + θ(t)(s t) such that a local PL is given by ( ( )) PL l (t, ν) = τ 1 K((s t)/b) b i X T i (s)νt log T Y k(s) exp( X k k with ν = (β(t), β (t)), covariates Xi (s) = (Xi, (s t) Xi ), and K( ) a Kernel function. PL l(t, ν) is concave! Newton-Raphson yields consistent estimators such that n 2/5 ( ˆβ(t) β(t)) N(Bt, Vt). Cumulative estimate B(t) = ˆβ(s)ds based on this method have the same asymptotic properties proved by other approaches (Martinussen et al., 22). dn i (s Resampling exersise GOF test Exercise Consider the PBC data and t a Cox model. Run the code from the slides and repeat the arguments from the lectures. Obtain the observed score process (based on martingale residuals) for checking proportional hazards. By considering the iid decomposition, construct 1 resampled score processes under the model, by using the two dierent approaches by Lin et al. (1993) and Lin et al. (2). Construct the supremum test statistics (weighted and unweighted) and compute p-values from the resampled processes. Compare with results from the program output: > fit.pbc<-cox.aalen(surv(time/365, status==2) ~ +prop(edema)+ + prop(logbili)+ +, weighted.test=, + resample.iid=1, residuals=1, pbc) > summary(fit.pbc) > plot(fit.pbc, score=t,xlab="",ylab="") > fit.pbc.w <-cox.aalen(surv(time/365, status==2) ~ +prop(edema)+ + prop(logbili)+ +, weighted.test=1, + resample.iid=1, residuals=1, pbc) > summary(fit.pbc.w) > par(mfrow=c(2,3)) > plot(fit.pbc.w, score=t,xlab="",ylab="") Fit a Cox model to the PBC data with all continuous covariates in linear form. data(pbc); attach(pbc) time <- time/365; fit<-coxph(surv(time,status==2)~age + as.factor(edema) + bili + protime + alb, pbc) Check proportional hazards assumption by using Schoenfeld residuals and cumulative residuals, compare results. # : time <- time+.1*runif(418) fit<-cox.aalen(surv(time,status==2)~+prop(edema)+ prop(bili)+prop(protime)+prop(alb),pbc,residuals=1) par(mfrow=c(2,3)) plot(fit.pbc, score=t, xlab="",ylab="") plot(fit)

11 GOF test Exercise GOF test Exercise Check functional form of continuous covariates with cumulative martingale residuals fit.mg <- cum.residuals(fit,pbc,n.sim=2,cum.resid=1) summary(fit.mg) par(mfrow=c(2,2)) plot(fit.mg,score=2) X <- model.matrix( ~ -1+factor(cut(protime,quantile(protime), include.lowest=t)),pbc) colnames(x) <- c("1. quartile","2. quartile","3. quartile","4. quartile") resids <- cum.residuals(fit,pbc,x,n.sim=1,cum.resid=1) summary(resids) plot(resids, score=1) plot(resids, score=2,specific.comps=c(1,3:5)) Compare condence intervals and condence bands of the observed processes of cumulative residuals grouped after the covariate levels Consider the TRACE data on myocardial infarction (data(trace)) Make a goodness-of-t test in a Cox model with covariates "chf", "age", "sex", "diabetes", "vf". Compare with the conclusions from the extended Cox model with time-varying coecients: fit<-timecox(surv(time,status>=7)~ chf+age+sex+diabetes+ vf, TRACE) par(mfrow=c(2,3)) plot(fit) How could this model be simplied? plot(resids,pointwise.c,hw.ci=2,sim.ci=3) What is the conclusion? How should the model be adapted? Does it provide a satisfactory goodness-of-t to the data? References References Cai, Z. and Sun, Y. (23). Local linear estimation for time-dependent coecients in Cox s regression models. Scand. J. Statist. 3, Grambsch, P.M. and Therneau, T.M. (1994). Proportional hazards tests and diagnostics based on weighted residuals (corr: 95v82 p668). Biometrika 81, Hastie, T. and Tibshirani, R. (1993). Varying-coecient models. J. Roy. Stat. Soc. Ser. B 55, Lin, D.Y., Wei, L., Yang, I., and Ying, Z. (2). Semiparametric regression for the mean and rate functions of recurrent events. J. Roy. Stat. Soc. Ser. B 62, Lin, D.Y., Wei, L.J., and Ying, Z. (1993). Checking the Cox model with cumulative sums of martingale-based residuals. Biometrika 8, Martinussen, T., Scheike, T.H. and Skovgaard, I.M. (22). Ecient estimation of xed and time-varying covariate eects in multiplicative intensity models. Scand. J. Statistics 29, Murphy, S. A. and Sen, P. K. (1991). Cox regression model with time dependent covariates. Stochastic Processes and their Applications 39, Pons, O. (2). Nonparametric estimation in a varying-coecient Cox model. Math. Meth. Statist. 9, Schoenfeld, D. (1982). Partial residuals for the proportional hazards regression model. Biometrika 69, Tian, L., Zucker, D., and Wei, L. (25). On the Cox model with time-varying regression coecients. J. Amer. Statist. Assoc. 1, Zucker, D.M. and Karr, A.F. (199). Nonparametric survival analysis with time-dependent covariate eects: A penalized partial likelihood approach. Ann. Statist. 18,

Goodness-Of-Fit for Cox s Regression Model. Extensions of Cox s Regression Model. Survival Analysis Fall 2004, Copenhagen

Goodness-Of-Fit for Cox s Regression Model. Extensions of Cox s Regression Model. Survival Analysis Fall 2004, Copenhagen Outline Cox s proportional hazards model. Goodness-of-fit tools More flexible models R-package timereg Forthcoming book, Martinussen and Scheike. 2/38 University of Copenhagen http://www.biostat.ku.dk

More information

Faculty of Health Sciences. Cox regression. Torben Martinussen. Department of Biostatistics University of Copenhagen. 20. september 2012 Slide 1/51

Faculty of Health Sciences. Cox regression. Torben Martinussen. Department of Biostatistics University of Copenhagen. 20. september 2012 Slide 1/51 Faculty of Health Sciences Cox regression Torben Martinussen Department of Biostatistics University of Copenhagen 2. september 212 Slide 1/51 Survival analysis Standard setup for right-censored survival

More information

Survival analysis in R

Survival analysis in R Survival analysis in R Niels Richard Hansen This note describes a few elementary aspects of practical analysis of survival data in R. For further information we refer to the book Introductory Statistics

More information

Survival analysis in R

Survival analysis in R Survival analysis in R Niels Richard Hansen This note describes a few elementary aspects of practical analysis of survival data in R. For further information we refer to the book Introductory Statistics

More information

1 Introduction. 2 Residuals in PH model

1 Introduction. 2 Residuals in PH model Supplementary Material for Diagnostic Plotting Methods for Proportional Hazards Models With Time-dependent Covariates or Time-varying Regression Coefficients BY QIQING YU, JUNYI DONG Department of Mathematical

More information

Lecture 10. Diagnostics. Statistics Survival Analysis. Presented March 1, 2016

Lecture 10. Diagnostics. Statistics Survival Analysis. Presented March 1, 2016 Statistics 255 - Survival Analysis Presented March 1, 2016 Dan Gillen Department of Statistics University of California, Irvine 10.1 Are model assumptions correct? Is the proportional hazards assumption

More information

Residuals and model diagnostics

Residuals and model diagnostics Residuals and model diagnostics Patrick Breheny November 10 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/42 Introduction Residuals Many assumptions go into regression models, and the Cox proportional

More information

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data Outline Frailty modelling of Multivariate Survival Data Thomas Scheike ts@biostat.ku.dk Department of Biostatistics University of Copenhagen Marginal versus Frailty models. Two-stage frailty models: copula

More information

Analysis of Time-to-Event Data: Chapter 6 - Regression diagnostics

Analysis of Time-to-Event Data: Chapter 6 - Regression diagnostics Analysis of Time-to-Event Data: Chapter 6 - Regression diagnostics Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/25 Residuals for the

More information

Time-dependent coefficients

Time-dependent coefficients Time-dependent coefficients Patrick Breheny December 1 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/20 Introduction As we discussed previously, stratification allows one to handle variables that

More information

( t) Cox regression part 2. Outline: Recapitulation. Estimation of cumulative hazards and survival probabilites. Ørnulf Borgan

( t) Cox regression part 2. Outline: Recapitulation. Estimation of cumulative hazards and survival probabilites. Ørnulf Borgan Outline: Cox regression part 2 Ørnulf Borgan Department of Mathematics University of Oslo Recapitulation Estimation of cumulative hazards and survival probabilites Assumptions for Cox regression and check

More information

The coxvc_1-1-1 package

The coxvc_1-1-1 package Appendix A The coxvc_1-1-1 package A.1 Introduction The coxvc_1-1-1 package is a set of functions for survival analysis that run under R2.1.1 [81]. This package contains a set of routines to fit Cox models

More information

Goodness-of-Fit Tests With Right-Censored Data by Edsel A. Pe~na Department of Statistics University of South Carolina Colloquium Talk August 31, 2 Research supported by an NIH Grant 1 1. Practical Problem

More information

A GENERALIZED ADDITIVE REGRESSION MODEL FOR SURVIVAL TIMES 1. By Thomas H. Scheike University of Copenhagen

A GENERALIZED ADDITIVE REGRESSION MODEL FOR SURVIVAL TIMES 1. By Thomas H. Scheike University of Copenhagen The Annals of Statistics 21, Vol. 29, No. 5, 1344 136 A GENERALIZED ADDITIVE REGRESSION MODEL FOR SURVIVAL TIMES 1 By Thomas H. Scheike University of Copenhagen We present a non-parametric survival model

More information

Goodness-of-fit test for the Cox Proportional Hazard Model

Goodness-of-fit test for the Cox Proportional Hazard Model Goodness-of-fit test for the Cox Proportional Hazard Model Rui Cui rcui@eco.uc3m.es Department of Economics, UC3M Abstract In this paper, we develop new goodness-of-fit tests for the Cox proportional hazard

More information

Cox s proportional hazards/regression model - model assessment

Cox s proportional hazards/regression model - model assessment Cox s proportional hazards/regression model - model assessment Rasmus Waagepetersen September 27, 2017 Topics: Plots based on estimated cumulative hazards Cox-Snell residuals: overall check of fit Martingale

More information

More on Cox-regression

More on Cox-regression More on Cox-regression p. 1/45 More on Cox-regression STK4080 H16 1. Repetition 2. Left truncation 3. Time-dependent covariates 4. Stratified Cox-regression 5. Residuals - Model check 6. How to handle

More information

Tests of independence for censored bivariate failure time data

Tests of independence for censored bivariate failure time data Tests of independence for censored bivariate failure time data Abstract Bivariate failure time data is widely used in survival analysis, for example, in twins study. This article presents a class of χ

More information

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA Kasun Rathnayake ; A/Prof Jun Ma Department of Statistics Faculty of Science and Engineering Macquarie University

More information

Comparing Distribution Functions via Empirical Likelihood

Comparing Distribution Functions via Empirical Likelihood Georgia State University ScholarWorks @ Georgia State University Mathematics and Statistics Faculty Publications Department of Mathematics and Statistics 25 Comparing Distribution Functions via Empirical

More information

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations Takeshi Emura and Hisayuki Tsukuma Abstract For testing the regression parameter in multivariate

More information

Lecture 12. Multivariate Survival Data Statistics Survival Analysis. Presented March 8, 2016

Lecture 12. Multivariate Survival Data Statistics Survival Analysis. Presented March 8, 2016 Statistics 255 - Survival Analysis Presented March 8, 2016 Dan Gillen Department of Statistics University of California, Irvine 12.1 Examples Clustered or correlated survival times Disease onset in family

More information

Definitions and examples Simple estimation and testing Regression models Goodness of fit for the Cox model. Recap of Part 1. Per Kragh Andersen

Definitions and examples Simple estimation and testing Regression models Goodness of fit for the Cox model. Recap of Part 1. Per Kragh Andersen Recap of Part 1 Per Kragh Andersen Section of Biostatistics, University of Copenhagen DSBS Course Survival Analysis in Clinical Trials January 2018 1 / 65 Overview Definitions and examples Simple estimation

More information

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data Outline Frailty modelling of Multivariate Survival Data Thomas Scheike ts@biostat.ku.dk Department of Biostatistics University of Copenhagen Marginal versus Frailty models. Two-stage frailty models: copula

More information

MAS3301 / MAS8311 Biostatistics Part II: Survival

MAS3301 / MAS8311 Biostatistics Part II: Survival MAS3301 / MAS8311 Biostatistics Part II: Survival M. Farrow School of Mathematics and Statistics Newcastle University Semester 2, 2009-10 1 13 The Cox proportional hazards model 13.1 Introduction In the

More information

SSUI: Presentation Hints 2 My Perspective Software Examples Reliability Areas that need work

SSUI: Presentation Hints 2 My Perspective Software Examples Reliability Areas that need work SSUI: Presentation Hints 1 Comparing Marginal and Random Eects (Frailty) Models Terry M. Therneau Mayo Clinic April 1998 SSUI: Presentation Hints 2 My Perspective Software Examples Reliability Areas that

More information

REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520

REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520 REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520 Department of Statistics North Carolina State University Presented by: Butch Tsiatis, Department of Statistics, NCSU

More information

Survival Analysis Math 434 Fall 2011

Survival Analysis Math 434 Fall 2011 Survival Analysis Math 434 Fall 2011 Part IV: Chap. 8,9.2,9.3,11: Semiparametric Proportional Hazards Regression Jimin Ding Math Dept. www.math.wustl.edu/ jmding/math434/fall09/index.html Basic Model Setup

More information

TESTINGGOODNESSOFFITINTHECOX AALEN MODEL

TESTINGGOODNESSOFFITINTHECOX AALEN MODEL ROBUST 24 c JČMF 24 TESTINGGOODNESSOFFITINTHECOX AALEN MODEL David Kraus Keywords: Counting process, Cox Aalen model, goodness-of-fit, martingale, residual, survival analysis. Abstract: The Cox Aalen regression

More information

Analysis of competing risks data and simulation of data following predened subdistribution hazards

Analysis of competing risks data and simulation of data following predened subdistribution hazards Analysis of competing risks data and simulation of data following predened subdistribution hazards Bernhard Haller Institut für Medizinische Statistik und Epidemiologie Technische Universität München 27.05.2013

More information

Nonparametric two-sample tests of longitudinal data in the presence of a terminal event

Nonparametric two-sample tests of longitudinal data in the presence of a terminal event Nonparametric two-sample tests of longitudinal data in the presence of a terminal event Jinheum Kim 1, Yang-Jin Kim, 2 & Chung Mo Nam 3 1 Department of Applied Statistics, University of Suwon, 2 Department

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 24 Paper 153 A Note on Empirical Likelihood Inference of Residual Life Regression Ying Qing Chen Yichuan

More information

Power and Sample Size Calculations with the Additive Hazards Model

Power and Sample Size Calculations with the Additive Hazards Model Journal of Data Science 10(2012), 143-155 Power and Sample Size Calculations with the Additive Hazards Model Ling Chen, Chengjie Xiong, J. Philip Miller and Feng Gao Washington University School of Medicine

More information

UNIVERSITY OF CALIFORNIA, SAN DIEGO

UNIVERSITY OF CALIFORNIA, SAN DIEGO UNIVERSITY OF CALIFORNIA, SAN DIEGO Estimation of the primary hazard ratio in the presence of a secondary covariate with non-proportional hazards An undergraduate honors thesis submitted to the Department

More information

Survival Regression Models

Survival Regression Models Survival Regression Models David M. Rocke May 18, 2017 David M. Rocke Survival Regression Models May 18, 2017 1 / 32 Background on the Proportional Hazards Model The exponential distribution has constant

More information

Reduced-rank hazard regression

Reduced-rank hazard regression Chapter 2 Reduced-rank hazard regression Abstract The Cox proportional hazards model is the most common method to analyze survival data. However, the proportional hazards assumption might not hold. The

More information

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model Other Survival Models (1) Non-PH models We briefly discussed the non-proportional hazards (non-ph) model λ(t Z) = λ 0 (t) exp{β(t) Z}, where β(t) can be estimated by: piecewise constants (recall how);

More information

β j = coefficient of x j in the model; β = ( β1, β2,

β j = coefficient of x j in the model; β = ( β1, β2, Regression Modeling of Survival Time Data Why regression models? Groups similar except for the treatment under study use the nonparametric methods discussed earlier. Groups differ in variables (covariates)

More information

SEMIPARAMETRIC REGRESSION WITH TIME-DEPENDENT COEFFICIENTS FOR FAILURE TIME DATA ANALYSIS

SEMIPARAMETRIC REGRESSION WITH TIME-DEPENDENT COEFFICIENTS FOR FAILURE TIME DATA ANALYSIS Statistica Sinica 2 (21), 853-869 SEMIPARAMETRIC REGRESSION WITH TIME-DEPENDENT COEFFICIENTS FOR FAILURE TIME DATA ANALYSIS Zhangsheng Yu and Xihong Lin Indiana University and Harvard School of Public

More information

Lecture 7 Time-dependent Covariates in Cox Regression

Lecture 7 Time-dependent Covariates in Cox Regression Lecture 7 Time-dependent Covariates in Cox Regression So far, we ve been considering the following Cox PH model: λ(t Z) = λ 0 (t) exp(β Z) = λ 0 (t) exp( β j Z j ) where β j is the parameter for the the

More information

Lecture 8 Stat D. Gillen

Lecture 8 Stat D. Gillen Statistics 255 - Survival Analysis Presented February 23, 2016 Dan Gillen Department of Statistics University of California, Irvine 8.1 Example of two ways to stratify Suppose a confounder C has 3 levels

More information

Log-linearity for Cox s regression model. Thesis for the Degree Master of Science

Log-linearity for Cox s regression model. Thesis for the Degree Master of Science Log-linearity for Cox s regression model Thesis for the Degree Master of Science Zaki Amini Master s Thesis, Spring 2015 i Abstract Cox s regression model is one of the most applied methods in medical

More information

4. Comparison of Two (K) Samples

4. Comparison of Two (K) Samples 4. Comparison of Two (K) Samples K=2 Problem: compare the survival distributions between two groups. E: comparing treatments on patients with a particular disease. Z: Treatment indicator, i.e. Z = 1 for

More information

Part III Measures of Classification Accuracy for the Prediction of Survival Times

Part III Measures of Classification Accuracy for the Prediction of Survival Times Part III Measures of Classification Accuracy for the Prediction of Survival Times Patrick J Heagerty PhD Department of Biostatistics University of Washington 102 ISCB 2010 Session Three Outline Examples

More information

In contrast, parametric techniques (fitting exponential or Weibull, for example) are more focussed, can handle general covariates, but require

In contrast, parametric techniques (fitting exponential or Weibull, for example) are more focussed, can handle general covariates, but require Chapter 5 modelling Semi parametric We have considered parametric and nonparametric techniques for comparing survival distributions between different treatment groups. Nonparametric techniques, such as

More information

Lecture 5 Models and methods for recurrent event data

Lecture 5 Models and methods for recurrent event data Lecture 5 Models and methods for recurrent event data Recurrent and multiple events are commonly encountered in longitudinal studies. In this chapter we consider ordered recurrent and multiple events.

More information

Issues on quantile autoregression

Issues on quantile autoregression Issues on quantile autoregression Jianqing Fan and Yingying Fan We congratulate Koenker and Xiao on their interesting and important contribution to the quantile autoregression (QAR). The paper provides

More information

Part [1.0] Measures of Classification Accuracy for the Prediction of Survival Times

Part [1.0] Measures of Classification Accuracy for the Prediction of Survival Times Part [1.0] Measures of Classification Accuracy for the Prediction of Survival Times Patrick J. Heagerty PhD Department of Biostatistics University of Washington 1 Biomarkers Review: Cox Regression Model

More information

Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models

Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations

More information

Modelling Survival Events with Longitudinal Data Measured with Error

Modelling Survival Events with Longitudinal Data Measured with Error Modelling Survival Events with Longitudinal Data Measured with Error Hongsheng Dai, Jianxin Pan & Yanchun Bao First version: 14 December 29 Research Report No. 16, 29, Probability and Statistics Group

More information

Modelling geoadditive survival data

Modelling geoadditive survival data Modelling geoadditive survival data Thomas Kneib & Ludwig Fahrmeir Department of Statistics, Ludwig-Maximilians-University Munich 1. Leukemia survival data 2. Structured hazard regression 3. Mixed model

More information

LSS: An S-Plus/R program for the accelerated failure time model to right censored data based on least-squares principle

LSS: An S-Plus/R program for the accelerated failure time model to right censored data based on least-squares principle computer methods and programs in biomedicine 86 (2007) 45 50 journal homepage: www.intl.elsevierhealth.com/journals/cmpb LSS: An S-Plus/R program for the accelerated failure time model to right censored

More information

A Regression Model For Recurrent Events With Distribution Free Correlation Structure

A Regression Model For Recurrent Events With Distribution Free Correlation Structure A Regression Model For Recurrent Events With Distribution Free Correlation Structure J. Pénichoux(1), A. Latouche(2), T. Moreau(1) (1) INSERM U780 (2) Université de Versailles, EA2506 ISCB - 2009 - Prague

More information

Survival Analysis for Case-Cohort Studies

Survival Analysis for Case-Cohort Studies Survival Analysis for ase-ohort Studies Petr Klášterecký Dept. of Probability and Mathematical Statistics, Faculty of Mathematics and Physics, harles University, Prague, zech Republic e-mail: petr.klasterecky@matfyz.cz

More information

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

FULL LIKELIHOOD INFERENCES IN THE COX MODEL October 20, 2007 FULL LIKELIHOOD INFERENCES IN THE COX MODEL BY JIAN-JIAN REN 1 AND MAI ZHOU 2 University of Central Florida and University of Kentucky Abstract We use the empirical likelihood approach

More information

Survival Analysis. STAT 526 Professor Olga Vitek

Survival Analysis. STAT 526 Professor Olga Vitek Survival Analysis STAT 526 Professor Olga Vitek May 4, 2011 9 Survival Data and Survival Functions Statistical analysis of time-to-event data Lifetime of machines and/or parts (called failure time analysis

More information

Validation. Terry M Therneau. Dec 2015

Validation. Terry M Therneau. Dec 2015 Validation Terry M Therneau Dec 205 Introduction When I use a word, Humpty Dumpty said, in rather a scornful tone, it means just what I choose it to mean - neither more nor less. The question is, said

More information

Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models

Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models NIH Talk, September 03 Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models Eric Slud, Math Dept, Univ of Maryland Ongoing joint project with Ilia

More information

Chapter 7 Fall Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample

Chapter 7 Fall Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample Bios 323: Applied Survival Analysis Qingxia (Cindy) Chen Chapter 7 Fall 2012 Chapter 7 Hypothesis testing Hypotheses of interest: (A) 1-sample H 0 : S(t) = S 0 (t), where S 0 ( ) is known survival function,

More information

STAT 331. Accelerated Failure Time Models. Previously, we have focused on multiplicative intensity models, where

STAT 331. Accelerated Failure Time Models. Previously, we have focused on multiplicative intensity models, where STAT 331 Accelerated Failure Time Models Previously, we have focused on multiplicative intensity models, where h t z) = h 0 t) g z). These can also be expressed as H t z) = H 0 t) g z) or S t z) = e Ht

More information

STAT331. Combining Martingales, Stochastic Integrals, and Applications to Logrank Test & Cox s Model

STAT331. Combining Martingales, Stochastic Integrals, and Applications to Logrank Test & Cox s Model STAT331 Combining Martingales, Stochastic Integrals, and Applications to Logrank Test & Cox s Model Because of Theorem 2.5.1 in Fleming and Harrington, see Unit 11: For counting process martingales with

More information

Müller: Goodness-of-fit criteria for survival data

Müller: Goodness-of-fit criteria for survival data Müller: Goodness-of-fit criteria for survival data Sonderforschungsbereich 386, Paper 382 (2004) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner Goodness of fit criteria for survival data

More information

From semi- to non-parametric inference in general time scale models

From semi- to non-parametric inference in general time scale models From semi- to non-parametric inference in general time scale models Thierry DUCHESNE duchesne@matulavalca Département de mathématiques et de statistique Université Laval Québec, Québec, Canada Research

More information

Survival Model Predictive Accuracy and ROC Curves

Survival Model Predictive Accuracy and ROC Curves UW Biostatistics Working Paper Series 12-19-2003 Survival Model Predictive Accuracy and ROC Curves Patrick Heagerty University of Washington, heagerty@u.washington.edu Yingye Zheng Fred Hutchinson Cancer

More information

Survival Analysis. Stat 526. April 13, 2018

Survival Analysis. Stat 526. April 13, 2018 Survival Analysis Stat 526 April 13, 2018 1 Functions of Survival Time Let T be the survival time for a subject Then P [T < 0] = 0 and T is a continuous random variable The Survival function is defined

More information

Regularization in Cox Frailty Models

Regularization in Cox Frailty Models Regularization in Cox Frailty Models Andreas Groll 1, Trevor Hastie 2, Gerhard Tutz 3 1 Ludwig-Maximilians-Universität Munich, Department of Mathematics, Theresienstraße 39, 80333 Munich, Germany 2 University

More information

Proportional hazards regression

Proportional hazards regression Proportional hazards regression Patrick Breheny October 8 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/28 Introduction The model Solving for the MLE Inference Today we will begin discussing regression

More information

DAGStat Event History Analysis.

DAGStat Event History Analysis. DAGStat 2016 Event History Analysis Robin.Henderson@ncl.ac.uk 1 / 75 Schedule 9.00 Introduction 10.30 Break 11.00 Regression Models, Frailty and Multivariate Survival 12.30 Lunch 13.30 Time-Variation and

More information

Efficiency of Profile/Partial Likelihood in the Cox Model

Efficiency of Profile/Partial Likelihood in the Cox Model Efficiency of Profile/Partial Likelihood in the Cox Model Yuichi Hirose School of Mathematics, Statistics and Operations Research, Victoria University of Wellington, New Zealand Summary. This paper shows

More information

e 4β e 4β + e β ˆβ =0.765

e 4β e 4β + e β ˆβ =0.765 SIMPLE EXAMPLE COX-REGRESSION i Y i x i δ i 1 5 12 0 2 10 10 1 3 40 3 0 4 80 5 0 5 120 3 1 6 400 4 1 7 600 1 0 Model: z(t x) =z 0 (t) exp{βx} Partial likelihood: L(β) = e 10β e 10β + e 3β + e 5β + e 3β

More information

Package Rsurrogate. October 20, 2016

Package Rsurrogate. October 20, 2016 Type Package Package Rsurrogate October 20, 2016 Title Robust Estimation of the Proportion of Treatment Effect Explained by Surrogate Marker Information Version 2.0 Date 2016-10-19 Author Layla Parast

More information

Journal of Statistical Software

Journal of Statistical Software JSS Journal of Statistical Software January 2011, Volume 38, Issue 2. http://www.jstatsoft.org/ Analyzing Competing Risk Data Using the R timereg Package Thomas H. Scheike University of Copenhagen Mei-Jie

More information

ST745: Survival Analysis: Cox-PH!

ST745: Survival Analysis: Cox-PH! ST745: Survival Analysis: Cox-PH! Eric B. Laber Department of Statistics, North Carolina State University April 20, 2015 Rien n est plus dangereux qu une idee, quand on n a qu une idee. (Nothing is more

More information

Cox s proportional hazards model and Cox s partial likelihood

Cox s proportional hazards model and Cox s partial likelihood Cox s proportional hazards model and Cox s partial likelihood Rasmus Waagepetersen October 12, 2018 1 / 27 Non-parametric vs. parametric Suppose we want to estimate unknown function, e.g. survival function.

More information

On the Breslow estimator

On the Breslow estimator Lifetime Data Anal (27) 13:471 48 DOI 1.17/s1985-7-948-y On the Breslow estimator D. Y. Lin Received: 5 April 27 / Accepted: 16 July 27 / Published online: 2 September 27 Springer Science+Business Media,

More information

Maximum likelihood estimation for Cox s regression model under nested case-control sampling

Maximum likelihood estimation for Cox s regression model under nested case-control sampling Biostatistics (2004), 5, 2,pp. 193 206 Printed in Great Britain Maximum likelihood estimation for Cox s regression model under nested case-control sampling THOMAS H. SCHEIKE Department of Biostatistics,

More information

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis STAT 6350 Analysis of Lifetime Data Failure-time Regression Analysis Explanatory Variables for Failure Times Usually explanatory variables explain/predict why some units fail quickly and some units survive

More information

Lecture 2: Martingale theory for univariate survival analysis

Lecture 2: Martingale theory for univariate survival analysis Lecture 2: Martingale theory for univariate survival analysis In this lecture T is assumed to be a continuous failure time. A core question in this lecture is how to develop asymptotic properties when

More information

SEMIPARAMETRIC METHODS FOR ESTIMATING CUMULATIVE TREATMENT EFFECTS IN THE PRESENCE OF NON-PROPORTIONAL HAZARDS AND DEPENDENT CENSORING

SEMIPARAMETRIC METHODS FOR ESTIMATING CUMULATIVE TREATMENT EFFECTS IN THE PRESENCE OF NON-PROPORTIONAL HAZARDS AND DEPENDENT CENSORING SEMIPARAMETRIC METHODS FOR ESTIMATING CUMULATIVE TREATMENT EFFECTS IN THE PRESENCE OF NON-PROPORTIONAL HAZARDS AND DEPENDENT CENSORING by Guanghui Wei A dissertation submitted in partial fulfillment of

More information

PhD course in Advanced survival analysis. One-sample tests. Properties. Idea: (ABGK, sect. V.1.1) Counting process N(t)

PhD course in Advanced survival analysis. One-sample tests. Properties. Idea: (ABGK, sect. V.1.1) Counting process N(t) PhD course in Advanced survival analysis. (ABGK, sect. V.1.1) One-sample tests. Counting process N(t) Non-parametric hypothesis tests. Parametric models. Intensity process λ(t) = α(t)y (t) satisfying Aalen

More information

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Jonathan Taylor & Kristin Cobb Statistics 262: Intermediate Biostatistics p.1/?? Overview of today s class Kaplan-Meier Curve

More information

GROUPED SURVIVAL DATA. Florida State University and Medical College of Wisconsin

GROUPED SURVIVAL DATA. Florida State University and Medical College of Wisconsin FITTING COX'S PROPORTIONAL HAZARDS MODEL USING GROUPED SURVIVAL DATA Ian W. McKeague and Mei-Jie Zhang Florida State University and Medical College of Wisconsin Cox's proportional hazard model is often

More information

STAT 331. Martingale Central Limit Theorem and Related Results

STAT 331. Martingale Central Limit Theorem and Related Results STAT 331 Martingale Central Limit Theorem and Related Results In this unit we discuss a version of the martingale central limit theorem, which states that under certain conditions, a sum of orthogonal

More information

Smoothing Spline-based Score Tests for Proportional Hazards Models

Smoothing Spline-based Score Tests for Proportional Hazards Models Smoothing Spline-based Score Tests for Proportional Hazards Models Jiang Lin 1, Daowen Zhang 2,, and Marie Davidian 2 1 GlaxoSmithKline, P.O. Box 13398, Research Triangle Park, North Carolina 27709, U.S.A.

More information

POWER AND SAMPLE SIZE DETERMINATIONS IN DYNAMIC RISK PREDICTION. by Zhaowen Sun M.S., University of Pittsburgh, 2012

POWER AND SAMPLE SIZE DETERMINATIONS IN DYNAMIC RISK PREDICTION. by Zhaowen Sun M.S., University of Pittsburgh, 2012 POWER AND SAMPLE SIZE DETERMINATIONS IN DYNAMIC RISK PREDICTION by Zhaowen Sun M.S., University of Pittsburgh, 2012 B.S.N., Wuhan University, China, 2010 Submitted to the Graduate Faculty of the Graduate

More information

8. Parametric models in survival analysis General accelerated failure time models for parametric regression

8. Parametric models in survival analysis General accelerated failure time models for parametric regression 8. Parametric models in survival analysis 8.1. General accelerated failure time models for parametric regression The accelerated failure time model Let T be the time to event and x be a vector of covariates.

More information

Product-limit estimators of the survival function with left or right censored data

Product-limit estimators of the survival function with left or right censored data Product-limit estimators of the survival function with left or right censored data 1 CREST-ENSAI Campus de Ker-Lann Rue Blaise Pascal - BP 37203 35172 Bruz cedex, France (e-mail: patilea@ensai.fr) 2 Institut

More information

Fahrmeir: Discrete failure time models

Fahrmeir: Discrete failure time models Fahrmeir: Discrete failure time models Sonderforschungsbereich 386, Paper 91 (1997) Online unter: http://epub.ub.uni-muenchen.de/ Projektpartner Discrete failure time models Ludwig Fahrmeir, Universitat

More information

Efficient Estimation of Censored Linear Regression Model

Efficient Estimation of Censored Linear Regression Model 2 3 4 5 6 7 8 9 2 3 4 5 6 7 8 9 2 2 22 23 24 25 26 27 28 29 3 3 32 33 34 35 36 37 38 39 4 4 42 43 44 45 46 47 48 Biometrika (2), xx, x, pp. 4 C 28 Biometrika Trust Printed in Great Britain Efficient Estimation

More information

Estimation and Inference of Quantile Regression. for Survival Data under Biased Sampling

Estimation and Inference of Quantile Regression. for Survival Data under Biased Sampling Estimation and Inference of Quantile Regression for Survival Data under Biased Sampling Supplementary Materials: Proofs of the Main Results S1 Verification of the weight function v i (t) for the lengthbiased

More information

Relative-risk regression and model diagnostics. 16 November, 2015

Relative-risk regression and model diagnostics. 16 November, 2015 Relative-risk regression and model diagnostics 16 November, 2015 Relative risk regression More general multiplicative intensity model: Intensity for individual i at time t is i(t) =Y i (t)r(x i, ; t) 0

More information

Beyond GLM and likelihood

Beyond GLM and likelihood Stat 6620: Applied Linear Models Department of Statistics Western Michigan University Statistics curriculum Core knowledge (modeling and estimation) Math stat 1 (probability, distributions, convergence

More information

Extensions of Cox Model for Non-Proportional Hazards Purpose

Extensions of Cox Model for Non-Proportional Hazards Purpose PhUSE Annual Conference 2013 Paper SP07 Extensions of Cox Model for Non-Proportional Hazards Purpose Author: Jadwiga Borucka PAREXEL, Warsaw, Poland Brussels 13 th - 16 th October 2013 Presentation Plan

More information

Survival Analysis. Lu Tian and Richard Olshen Stanford University

Survival Analysis. Lu Tian and Richard Olshen Stanford University 1 Survival Analysis Lu Tian and Richard Olshen Stanford University 2 Survival Time/ Failure Time/Event Time We will introduce various statistical methods for analyzing survival outcomes What is the survival

More information

TMA 4275 Lifetime Analysis June 2004 Solution

TMA 4275 Lifetime Analysis June 2004 Solution TMA 4275 Lifetime Analysis June 2004 Solution Problem 1 a) Observation of the outcome is censored, if the time of the outcome is not known exactly and only the last time when it was observed being intact,

More information

USING MARTINGALE RESIDUALS TO ASSESS GOODNESS-OF-FIT FOR SAMPLED RISK SET DATA

USING MARTINGALE RESIDUALS TO ASSESS GOODNESS-OF-FIT FOR SAMPLED RISK SET DATA USING MARTINGALE RESIDUALS TO ASSESS GOODNESS-OF-FIT FOR SAMPLED RISK SET DATA Ørnulf Borgan Bryan Langholz Abstract Standard use of Cox s regression model and other relative risk regression models for

More information

[Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements

[Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements [Part 2] Model Development for the Prediction of Survival Times using Longitudinal Measurements Aasthaa Bansal PhD Pharmaceutical Outcomes Research & Policy Program University of Washington 69 Biomarkers

More information

STAT Sample Problem: General Asymptotic Results

STAT Sample Problem: General Asymptotic Results STAT331 1-Sample Problem: General Asymptotic Results In this unit we will consider the 1-sample problem and prove the consistency and asymptotic normality of the Nelson-Aalen estimator of the cumulative

More information

Lecture 22 Survival Analysis: An Introduction

Lecture 22 Survival Analysis: An Introduction University of Illinois Department of Economics Spring 2017 Econ 574 Roger Koenker Lecture 22 Survival Analysis: An Introduction There is considerable interest among economists in models of durations, which

More information

Chapter 7: Hypothesis testing

Chapter 7: Hypothesis testing Chapter 7: Hypothesis testing Hypothesis testing is typically done based on the cumulative hazard function. Here we ll use the Nelson-Aalen estimate of the cumulative hazard. The survival function is used

More information