Survival Regression Models

Similar documents
β j = coefficient of x j in the model; β = ( β1, β2,

MAS3301 / MAS8311 Biostatistics Part II: Survival

Lecture 8 Stat D. Gillen

Part [1.0] Measures of Classification Accuracy for the Prediction of Survival Times

Lecture 7 Time-dependent Covariates in Cox Regression

Lecture 12. Multivariate Survival Data Statistics Survival Analysis. Presented March 8, 2016

Lecture 9. Statistics Survival Analysis. Presented February 23, Dan Gillen Department of Statistics University of California, Irvine

Survival Analysis Math 434 Fall 2011

Lecture 7. Proportional Hazards Model - Handling Ties and Survival Estimation Statistics Survival Analysis. Presented February 4, 2016

In contrast, parametric techniques (fitting exponential or Weibull, for example) are more focussed, can handle general covariates, but require

Relative-risk regression and model diagnostics. 16 November, 2015

Multivariable Fractional Polynomials

Survival analysis in R

Cox s proportional hazards model and Cox s partial likelihood

Survival Analysis. STAT 526 Professor Olga Vitek

Survival analysis in R

ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis

REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520

Multivariable Fractional Polynomials

Chapter 4 Regression Models

Lecture 11. Interval Censored and. Discrete-Time Data. Statistics Survival Analysis. Presented March 3, 2016

Analysis of Time-to-Event Data: Chapter 6 - Regression diagnostics

Tied survival times; estimation of survival probabilities

Time-dependent coefficients

Applied Survival Analysis Lab 10: Analysis of multiple failures

Residuals and model diagnostics

Survival Analysis. Stat 526. April 13, 2018

Beyond GLM and likelihood

Definitions and examples Simple estimation and testing Regression models Goodness of fit for the Cox model. Recap of Part 1. Per Kragh Andersen

Multistate models and recurrent event models

The nltm Package. July 24, 2006

The coxvc_1-1-1 package

Multistate models and recurrent event models

Analysis of Time-to-Event Data: Chapter 4 - Parametric regression models

9 Estimating the Underlying Survival Distribution for a

Multistate Modeling and Applications

Fitting Cox Regression Models

Extensions of Cox Model for Non-Proportional Hazards Purpose

Power and Sample Size Calculations with the Additive Hazards Model

STAT331. Cox s Proportional Hazards Model

You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?

SSUI: Presentation Hints 2 My Perspective Software Examples Reliability Areas that need work

Chapter 7: Hypothesis testing

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

Philosophy and Features of the mstate package

Longitudinal Modeling with Logistic Regression

The Proportional Hazard Model and the Modelling of Recurrent Failure Data: Analysis of a Disconnector Population in Sweden. Sweden

On a connection between the Bradley-Terry model and the Cox proportional hazards model

A fast routine for fitting Cox models with time varying effects

Semiparametric Regression

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

8. Parametric models in survival analysis General accelerated failure time models for parametric regression

TMA 4275 Lifetime Analysis June 2004 Solution

Case-control studies

UNIVERSITY OF CALIFORNIA, SAN DIEGO

Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL

Longitudinal + Reliability = Joint Modeling

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Cox s proportional hazards/regression model - model assessment

Typical Survival Data Arising From a Clinical Trial. Censoring. The Survivor Function. Mathematical Definitions Introduction

Survival Analysis I (CHL5209H)

Statistics in medicine

Introduction to Statistical Analysis

STAT 331. Accelerated Failure Time Models. Previously, we have focused on multiplicative intensity models, where

Model Based Statistics in Biology. Part V. The Generalized Linear Model. Chapter 18.1 Logistic Regression (Dose - Response)

Multivariate Survival Analysis

ST745: Survival Analysis: Cox-PH!

5. Parametric Regression Model

Extensions of Cox Model for Non-Proportional Hazards Purpose

Accelerated Failure Time Models

Booklet of Code and Output for STAD29/STA 1007 Midterm Exam

Validation. Terry M Therneau. Dec 2015

11 November 2011 Department of Biostatistics, University of Copengen. 9:15 10:00 Recap of case-control studies. Frequency-matched studies.

You can specify the response in the form of a single variable or in the form of a ratio of two variables denoted events/trials.

Logistic Regression. Interpretation of linear regression. Other types of outcomes. 0-1 response variable: Wound infection. Usual linear regression

Survival Analysis I (CHL5209H)

MAS3301 / MAS8311 Biostatistics Part II: Survival

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data

Simple logistic regression

A Survival Analysis of GMO vs Non-GMO Corn Hybrid Persistence Using Simulated Time Dependent Covariates in SAS

Multi-state models: prediction

e 4β e 4β + e β ˆβ =0.765

( t) Cox regression part 2. Outline: Recapitulation. Estimation of cumulative hazards and survival probabilites. Ørnulf Borgan

Generalized linear models

Proportional hazards regression

DAGStat Event History Analysis.

1 Glivenko-Cantelli type theorems

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data

SAS Analysis Examples Replication C8. * SAS Analysis Examples Replication for ASDA 2nd Edition * Berglund April 2017 * Chapter 8 ;

11 Survival Analysis and Empirical Likelihood

THESIS for the degree of MASTER OF SCIENCE. Modelling and Data Analysis

Description Syntax for predict Menu for predict Options for predict Remarks and examples Methods and formulas References Also see

Lecture 5 Models and methods for recurrent event data

Lecture 22 Survival Analysis: An Introduction

Chapter 1 Statistical Inference

Stat 642, Lecture notes for 04/12/05 96

Package crrsc. R topics documented: February 19, 2015

Multi-state Models: An Overview

Transcription:

Survival Regression Models David M. Rocke May 18, 2017 David M. Rocke Survival Regression Models May 18, 2017 1 / 32

Background on the Proportional Hazards Model The exponential distribution has constant hazard f (t) = λe λt S(t) = e λt h(t) = λ Let s make two generalizations. First, let s let the hazard depend on covariates x 1, x 2,... x p. Second, let s let the base hazard depend on t. We can do this using either parametric or semi-parametric approaches. David M. Rocke Survival Regression Models May 18, 2017 2 / 32

The Cox Model The generalization is that the hazard function is η = β 1 x 1 + + β p x p h(t covariates) = h 0 (t)e η This has a log link as in a generalized linear model. It is semi-parametric because the linear predictor depends on estimated parameters but the base hazard function is unspecified. There is no constant term because it is absorbed in the base hazard. Note that for two different individuals with possibly different covariates, the ratio of the hazard functions is exp(η 1 )/ exp(η 2 ) = exp(η 1 η 2 ) which does not depend on t. David M. Rocke Survival Regression Models May 18, 2017 3 / 32

The Cox Model How do we fit this model? We need to estimate the coefficients of the covariates, and we need to estimate the base hazard h 0 (t). For the covariates, supposing for simplicity that there are no tied event times, let the event times for the whole data set be t 1, t 2,..., t D. Let the risk set at time t i be R(t i ) and η j = β 1 x j1 + + β p x jp θ j = e η j h(t covariates) = h 0 (t)e η = θh 0 (t) David M. Rocke Survival Regression Models May 18, 2017 4 / 32

The Cox Model Conditional on a single failure at time t i, the probability that the event is due to subject j R(t i ) is Pr(j fails 1 failure at t i ) = = h 0 (t i )e η j k R(t i ) h 0(t i )e η k θ j k R(t i ) θ k David M. Rocke Survival Regression Models May 18, 2017 5 / 32

The Cox Model If subject j(i) is the one who fails at time t i, then the partial likelihood is L(β T ) = i θ j(i) k R(t i ) θ k and we can numerically maximize this with respect to the coefficients. When there are tied event times adjustments need to be made, but the likelihood is still similar. Note that we don t need to know the base hazard to solve for the coefficients. David M. Rocke Survival Regression Models May 18, 2017 6 / 32

The Cox Model Once we have coefficient estimates ˆβ 1,..., ˆβ p, this also defines ˆη j and ˆθ j and then the estimated base cumulative hazard function is Ĥ(t) = t i <t d i k R(t i ) θ k which reduces to the Nelson-Aalen estimate when there are no covariates. There are numerous other estimates that have been proposed as well. David M. Rocke Survival Regression Models May 18, 2017 7 / 32

Adjustment for Ties At each time t i at which more than one of the subjects has an event, let d i be the number of events at that time, D i the set of subjects with events at that time, and let s i be a covariate vector for an artificial subject obtained by adding up the covariate values for the subjects with an event at time t i. Let and θ i = exp( η i ). η i = β 1 s i1 + + β p s ip David M. Rocke Survival Regression Models May 18, 2017 8 / 32

Adjustment for Ties Let s i be a covariate vector for an artificial subject obtained by adding up the covariate values for the subjects with an event at time t i. Note that η i = j D i β 1 x j1 + + β p x jp = β 1 s i1 + + β p s ip θ i = exp( η i ) = j D i θ i David M. Rocke Survival Regression Models May 18, 2017 9 / 32

Adjustment for Ties Breslow s method estimates the partial likelihood as L(β T ) = i = i θ i [ k R(t i ) θ k] d i θ j k R(t i ) θ k j D i This method is equivalent to treating each event as distinct and using the non-ties formula. It works best when the number of ties is small. It is the default in many statistical packages, including PROC PHREG in SAS. David M. Rocke Survival Regression Models May 18, 2017 10 / 32

Adjustment for Ties The other common method is Efron s, which is the default in R. L(β T ) = i θ i di j=1 [ k R(t i ) θ k j 1 d i k D i θ k ] This is closer to the exact discrete partial likelihood when there are many ties. The third option in R (and an option also in SAS as discrete ) is the exact method, which is the same one used for matched logistic regression. David M. Rocke Survival Regression Models May 18, 2017 11 / 32

Suppose as an example we have a time t where there are 20 individuals at risk and three failures. Let the three individuals have risk parameters θ 1, θ 2, θ 3 and let the sum of the risk parameters of the remaining 17 individuals be θ R. Then the factor in the partial likelihood at time t using Breslow s method is ( θ 1 ) ( θ 2 ) ( θ 3 ) θ R + θ 1 + θ 2 + θ 3 θ R + θ 1 + θ 2 + θ 3 θ R + θ 1 + θ 2 + θ 3 David M. Rocke Survival Regression Models May 18, 2017 12 / 32

If on the other hand, they had died in the order 1,2, 3, then the contribution to the partial likelihood would be ( θ 1 ) ( θ 2 ) ( ) θ3 θ R + θ 1 + θ 2 + θ 3 θ R + θ 2 + θ 3 θ R + θ 3 as the risk set got smaller with each failure. But we don t know the order they failed in, so instead of reducing the denominator by one risk coefficient each time, we reduce it by the same fraction. This is Efron s method. David M. Rocke Survival Regression Models May 18, 2017 13 / 32

Efron s method: ( θ 1 ) ( θ 2 ) ( θ 3 ) θ R + θ 1 + θ 2 + θ 3 θ R + 2(θ 1 + θ 2 + θ 3 )/3 θ R + (θ 1 + θ 2 + θ 3 )/3 David M. Rocke Survival Regression Models May 18, 2017 14 / 32

The coxph() Function in R coxph {survival} R Documentation Fit Proportional Hazards Regression Model Description Fits a Cox proportional hazards regression model. Time dependent variables, time dependent strata, multiple events per subject, and other extensions are incorporated using the counting process formulation of Andersen and Gill. Usage coxph(formula, data=, weights, subset, na.action, init, control, ties=c("efron","breslow","exact"), singular.ok=true, robust=false, model=false, x=false, y=true, tt, method,...) David M. Rocke Survival Regression Models May 18, 2017 15 / 32

The coxph() Function in R coxph(formula, data=, weights, subset, na.action, init, control, ties=c("efron","breslow","exact"), singular.ok=true, robust=false, model=false, x=false, y=true, tt, method,...) Arguments formula data weights subset a formula object, with the response on the left of a ~ operator, and the terms on the right. The response must be a survival object as returned by the Surv function. a data.frame in which to interpret the variables named in the formula, or in the subset and the weights argument. vector of case weights. If weights is a vector of integers, then the estimated coefficients are equivalent to estimating the model from data with the individual cases replicated as many times as indicated by weights. expression indicating which subset of the rows of data should be used in the fit. All observations are included by default. David M. Rocke Survival Regression Models May 18, 2017 16 / 32

The coxph() Function in R coxph(formula, data=, weights, subset, na.action, init, control, ties=c("efron","breslow","exact"), singular.ok=true, robust=false, model=false, x=false, y=true, tt, method,...) Arguments ties a character string specifying the method for tie handling. If there are no tied death times all the methods are equivalent. Nearly all Cox regression programs use the Breslow method by default, but not this one. The Efron approximation is used as the default here, it is more accurate when dealing with tied death times, and is as efficient computationally. The exact partial likelihood is equivalent to a conditional logistic model, and is appropriate when the times are a small set of discrete values. See further below. David M. Rocke Survival Regression Models May 18, 2017 17 / 32

The proportional hazards model is usually expressed in terms of a single survival time value for each person, with possible censoring. Andersen and Gill reformulated the same problem as a counting process; as time marches onward we observe the events for a subject, rather like watching a Geiger counter. The data for a subject is presented as multiple rows or observations, each of which applies to an interval of observation (start, stop]. David M. Rocke Survival Regression Models May 18, 2017 18 / 32

There are special terms that may be used in the model equation. A strata term identifies a stratified Cox model; separate baseline hazard functions are fit for each stratum. The cluster term is used to compute a robust variance for the model. The term + cluster(id) where each value of id is unique is equivalent to specifying the robust=t argument. If the id variable is not unique, it is assumed that it identifies clusters of correlated observations. The robust estimate arises from many different arguments and thus has had many labels. It is variously known as the Huber sandwich estimator, White s estimate (linear models/econometrics), the Horvitz-Thompson estimate (survey sampling), the working independence variance (generalized estimating equations), the infinitesimal jackknife, and the Wei, Lin, Weissfeld (WLW) estimate. David M. Rocke Survival Regression Models May 18, 2017 19 / 32

Cox Model for bmt Survival vs. Group > dfsurv <- Surv(bmt$t2,bmt$d3) > bmt.cox <- coxph(dfsurv~factor(group),data=bmt) > summary(bmt.cox) n= 137, number of events= 83 coef exp(coef) se(coef) z Pr(> z ) factor(group)2-0.5742 0.5632 0.2873-1.999 0.0457 * factor(group)3 0.3834 1.4673 0.2674 1.434 0.1516 --- Signif. codes: 0 *** 0.001 ** 0.01 * 0.05. 0.1 1 exp(coef) exp(-coef) lower.95 upper.95 factor(group)2 0.5632 1.7757 0.3207 0.989 factor(group)3 1.4673 0.6815 0.8688 2.478 Concordance= 0.625 (se = 0.031 ) Rsquare= 0.094 (max possible= 0.996 ) Likelihood ratio test= 13.45 on 2 df, p=0.001199 Wald test = 13.03 on 2 df, p=0.00148 Score (logrank) test = 13.81 on 2 df, p=0.001004 David M. Rocke Survival Regression Models May 18, 2017 20 / 32

Cox Model for bmt Survival vs. Group Hypothesis tests for factor levels comparing groups 2 and 3 to group 1. Group 3 has the highest hazard so the most significant comparison is not directly shown. The coefficient 0.3834 is on the log-hazard-ratio scale, as in log-risk-ratio. The next column gives the risk ratio 1.4673, and a hypothesis (Wald) test. The (not shown) group 3 vs. group 2 log hazard ratio is 0.3834 + 0.5742 = 0.9576. The hazard ratio is then exp(0.9576) or 2.605. Inference on all coefficients and combinations can be constructed using coef(bmt.cox) and vcov(bmt.cox) as with logistic regression. coef exp(coef) se(coef) z Pr(> z ) factor(group)2-0.5742 0.5632 0.2873-1.999 0.0457 * factor(group)3 0.3834 1.4673 0.2674 1.434 0.1516 --- Signif. codes: 0 *** 0.001 ** 0.01 * 0.05. 0.1 1 David M. Rocke Survival Regression Models May 18, 2017 21 / 32

Cox Model for bmt Survival vs. Group The next part of the output gives 95% confidence intervals for the relative risk. For the difference between groups 2 and 3 we need to use coef(bmt.cox) and vcov(bmt.cox). exp(coef) exp(-coef) lower.95 upper.95 factor(group)2 0.5632 1.7757 0.3207 0.989 factor(group)3 1.4673 0.6815 0.8688 2.478 > c1 <- coef(bmt.cox) > v1 <- vcov(bmt.cox) > cont1 <- c(-1,1) > t(cont1) %*% c1 [,1] [1,] 0.9576104 log hazard ratio group 3 to group2 > t(cont1) %*% v1 %*% cont1 [,1] [1,] 0.07040675 > sqrt(t(cont1) %*% v1 %*% cont1) [,1] [1,] 0.2653427 standard error of log hazard ratio David M. Rocke Survival Regression Models May 18, 2017 22 / 32

Cox Model for bmt Survival vs. Group Concordance is agreement of first failure between pairs of subjects and higher predicted risk between those subjects, omitting non-informative pairs. The Rsquare value is Cox and Snell s pseudo R-squared and is not very useful. There are three tests for whether the model with the group covariate is better than the one without --Usual likelihood ratio chi-squared --Wald test chi-squared, obtained by adding the squares of the z-scores --score = log-rank test, as with comparison of survival functions. The likelihood ratio test is probably best in smaller samples, followed by the Wald test. Concordance= 0.625 (se = 0.031 ) Rsquare= 0.094 (max possible= 0.996 ) Likelihood ratio test= 13.45 on 2 df, p=0.001199 Wald test = 13.03 on 2 df, p=0.00148 Score (logrank) test = 13.81 on 2 df, p=0.001004 David M. Rocke Survival Regression Models May 18, 2017 23 / 32

Cox Model for bmt Survival vs. Group The anova function performs the likelihood ratio test for comparing models. One can use drop1(), add1(), step(), or compare two explicit models. > anova(bmt.cox) Analysis of Deviance Table Cox model: response is dfsurv Terms added sequentially (first to last) loglik Chisq Df Pr(> Chi ) NULL -373.30 factor(group) -366.57 13.452 2 0.001199 ** --- Signif. codes: 0 *** 0.001 ** 0.01 * 0.05. 0.1 1 David M. Rocke Survival Regression Models May 18, 2017 24 / 32

Survival Curves from the Cox Model We defined the base hazard but then did not use it to estimate the coefficients. In fact, there is no meaningful base hazard. In this case, it is the hazard for group 1. We can use survfit to get survival functions, but by default it produces the hazard for an average individual with average covariates. This is like the family with 1.5 children it does not exist. Always it is best to specify the covariate level(s) for which you want the survival curve(s). In this case, we can plot the Cox model survival curves, which are in fact proportional along with the individual KM curves for the groups. David M. Rocke Survival Regression Models May 18, 2017 25 / 32

plot(survfit(dfsurv~group,data=bmt)) lines(survfit(bmt.cox,data.frame(group=1:3),conf.int=f),col="red") legend("bottomleft",c("kaplan-meier","cox Model"),col=c("black","red"),lwd=1) title("survival Functions for Three Groups by KM and Cox Model") When we use survfit() with a Cox model, we should include a data frame with the same named columns as the predictors in the Cox model and one or more levels of each. For example > data.frame(group=1:3,age=50) group age 1 1 50 2 2 50 3 3 50 > data.frame(group=rep(1:3,2),age=rep(c(50,60),each=3)) group age 1 1 50 2 2 50 3 3 50 4 1 60 5 2 60 6 3 60 David M. Rocke Survival Regression Models May 18, 2017 26 / 32

Survival Functions for Three Groups by KM and Cox Model 0.0 0.2 0.4 0.6 0.8 1.0 Kaplan Meier Cox Model 0 500 1000 1500 2000 2500 David M. Rocke Survival Regression Models May 18, 2017 27 / 32

Homework The addicts data set from the text (see page 91) is from a study by Caplehorn et al. (Methadone Dosage and Retention of Patients in Maintenance Treatment, Med. J. Aust., 1991). These data comprise the times in days spent by heroin addicts from entry to departure from one of two methadone clinics. There are two further covariates, namely, prison record and methadone dose, believed to affect the survival times. David M. Rocke Survival Regression Models May 18, 2017 28 / 32

Homework The data set and R input code are on the website. The variables are as follows: id: Subject ID clinic: Clinic (1 or 2) status: Survival status (0 = censored, 1 = departed from clinic) time: Survival time in days prison: Prison record (0 = none, 1 = any) methadone: Methadone dose (mg/day) David M. Rocke Survival Regression Models May 18, 2017 29 / 32

Homework 1 Plot the Kaplan-Meier survival curves for the two clinics. 2 Test for whether the two survival curves could have come from the same process using survdiff. 3 Plot the cumulative hazards from the Nelson-Aalen estimator. David M. Rocke Survival Regression Models May 18, 2017 30 / 32

Homework 4 A common comparison plot for proportional hazards is the complimentary log-log survival plot which plots ln( ln[ŝ(x)]) = ln Ĥ(t) against ln(t). If the hazards are proportional, then so are the cumulative hazards, and after taking logs, the curves should be parallel. If the lines are straight, then the Weibull model may be appropriate. Make this plot for the two clinics using the Nelson-Aalen estimator and comment on the results. You will use the cloglog option in the plot command. David M. Rocke Survival Regression Models May 18, 2017 31 / 32

Homework 5 Construct a Cox model using only the clinic variable. Is the survival different at the two clinics? What is the estimated hazard (risk) ratio, a test for significance, and a 95% confidence interval? 6 Pick one test for the null hypothesis that the clinics do not differ. Why would you depend on this test more than the others? 7 Consider adding the prison and methodone variables. Which of these covariates will improve the model? 8 Plot the two survival curves from your chosen Cox model and add the two KM survival curves. What do you think? David M. Rocke Survival Regression Models May 18, 2017 32 / 32