Part [1.0] Measures of Classification Accuracy for the Prediction of Survival Times

Similar documents
Survival analysis in R

Survival Regression Models

Lecture 8 Stat D. Gillen

Survival analysis in R

β j = coefficient of x j in the model; β = ( β1, β2,

Tied survival times; estimation of survival probabilities

MAS3301 / MAS8311 Biostatistics Part II: Survival

STAT331. Cox s Proportional Hazards Model

Lecture 7 Time-dependent Covariates in Cox Regression

Cox s proportional hazards model and Cox s partial likelihood

Lecture 7. Proportional Hazards Model - Handling Ties and Survival Estimation Statistics Survival Analysis. Presented February 4, 2016

REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520

In contrast, parametric techniques (fitting exponential or Weibull, for example) are more focussed, can handle general covariates, but require

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis

Relative-risk regression and model diagnostics. 16 November, 2015

ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables

Residuals and model diagnostics

Lecture 12. Multivariate Survival Data Statistics Survival Analysis. Presented March 8, 2016

Proportional hazards regression

Survival Analysis. STAT 526 Professor Olga Vitek

Fitting Cox Regression Models

Multistate models and recurrent event models

Survival Analysis. Stat 526. April 13, 2018

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

Multistate models and recurrent event models

Statistics in medicine

Semiparametric Regression

Cox regression: Estimation

Chapter 4 Regression Models

Survival Analysis Math 434 Fall 2011

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation

Part III Measures of Classification Accuracy for the Prediction of Survival Times

Faculty of Health Sciences. Cox regression. Torben Martinussen. Department of Biostatistics University of Copenhagen. 20. september 2012 Slide 1/51

Survival Analysis I (CHL5209H)

Extensions of Cox Model for Non-Proportional Hazards Purpose

Extensions of Cox Model for Non-Proportional Hazards Purpose

Survival Analysis. 732G34 Statistisk analys av komplexa data. Krzysztof Bartoszek

Goodness-Of-Fit for Cox s Regression Model. Extensions of Cox s Regression Model. Survival Analysis Fall 2004, Copenhagen

Lecture 10. Diagnostics. Statistics Survival Analysis. Presented March 1, 2016

THESIS for the degree of MASTER OF SCIENCE. Modelling and Data Analysis

Case-control studies

Lecture 11. Interval Censored and. Discrete-Time Data. Statistics Survival Analysis. Presented March 3, 2016

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal

11 November 2011 Department of Biostatistics, University of Copengen. 9:15 10:00 Recap of case-control studies. Frequency-matched studies.

Multivariable Fractional Polynomials

Lecture 9. Statistics Survival Analysis. Presented February 23, Dan Gillen Department of Statistics University of California, Irvine

Typical Survival Data Arising From a Clinical Trial. Censoring. The Survivor Function. Mathematical Definitions Introduction

UNIVERSITY OF CALIFORNIA, SAN DIEGO

Survival Model Predictive Accuracy and ROC Curves

Lecture 22 Survival Analysis: An Introduction

Time-dependent coefficients

Multivariate Survival Analysis

On a connection between the Bradley-Terry model and the Cox proportional hazards model

Multivariable Fractional Polynomials

LSS: An S-Plus/R program for the accelerated failure time model to right censored data based on least-squares principle

Dynamic Prediction of Disease Progression Using Longitudinal Biomarker Data

You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?

Power and Sample Size Calculations with the Additive Hazards Model

4. Comparison of Two (K) Samples

PhD course in Advanced survival analysis. One-sample tests. Properties. Idea: (ABGK, sect. V.1.1) Counting process N(t)

ST745: Survival Analysis: Cox-PH!

Ph.D. course: Regression models. Introduction. 19 April 2012

Instrumental variables estimation in the Cox Proportional Hazard regression model

Definitions and examples Simple estimation and testing Regression models Goodness of fit for the Cox model. Recap of Part 1. Per Kragh Andersen

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Introduction to Statistical Analysis

Ph.D. course: Regression models. Regression models. Explanatory variables. Example 1.1: Body mass index and vitamin D status

Analysis of Time-to-Event Data: Chapter 4 - Parametric regression models

Log-linearity for Cox s regression model. Thesis for the Degree Master of Science

Lecture Outline. Biost 518 Applied Biostatistics II. Choice of Model for Analysis. Choice of Model. Choice of Model. Lecture 10: Multiple Regression:

Consider Table 1 (Note connection to start-stop process).

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis

Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL

Lecture 12: Effect modification, and confounding in logistic regression

Missing covariate data in matched case-control studies: Do the usual paradigms apply?

Survival Analysis. Lu Tian and Richard Olshen Stanford University

Analysis of Time-to-Event Data: Chapter 6 - Regression diagnostics

Full likelihood inferences in the Cox model: an empirical likelihood approach

Ph.D. course: Regression models

A new strategy for meta-analysis of continuous covariates in observational studies with IPD. Willi Sauerbrei & Patrick Royston

ECLT 5810 Linear Regression and Logistic Regression for Classification. Prof. Wai Lam

BIOSTATS Intermediate Biostatistics Spring 2017 Exam 2 (Units 3, 4 & 5) Practice Problems SOLUTIONS

Booklet of Code and Output for STAD29/STA 1007 Midterm Exam

FULL LIKELIHOOD INFERENCES IN THE COX MODEL: AN EMPIRICAL LIKELIHOOD APPROACH

PubHlth Intermediate Biostatistics Spring 2015 Exam 2 (Units 3, 4 & 5) Study Guide

Calculating Effect-Sizes. David B. Wilson, PhD George Mason University

4 Testing Hypotheses. 4.1 Tests in the regression setting. 4.2 Non-parametric testing of survival between groups

GOV 2001/ 1002/ E-2001 Section 10 1 Duration II and Matching

3003 Cure. F. P. Treasure

Logistic regression model for survival time analysis using time-varying coefficients

The influence of categorising survival time on parameter estimates in a Cox model

Lab 8. Matched Case Control Studies

Accelerated Failure Time Models

Logistic Regression. Continued Psy 524 Ainsworth

Logistic regression. 11 Nov Logistic regression (EPFL) Applied Statistics 11 Nov / 20

Generalized Linear Modeling - Logistic Regression

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis

Lecture 8. Poisson models for counts

Survival Analysis for Case-Cohort Studies

Matched Pair Data. Stat 557 Heike Hofmann

Transcription:

Part [1.0] Measures of Classification Accuracy for the Prediction of Survival Times Patrick J. Heagerty PhD Department of Biostatistics University of Washington 1 Biomarkers

Review: Cox Regression Model Introduction Cox (1972) Model hazard model log hazard, survival models PH assumption Interpretation of coefficients Specific examples 2 Biomarkers

Cox Regression Model Estimation Coefficients Partial likelihood Approximation for ties Survival curve(s) Hazard curve(s) Stratification Using covariate Using true stratification 3 Biomarkers

Cox (1972) D.R. Cox (1972) Regression Models and Life-Tables (with discussion) JRSS-B, 74: 187-220. The present paper is largely concerned with the extension of the results of Kaplan and Meier to the comparison of life tables and more generally to the incorporation of regression-like arguments into life-table analysis. (p. 187) Model proposed: λ(t X) = λ 0 (t) exp(xβ) In the present paper we shall, however, concentrate on exploring the consequence of allowing λ 0 (t) to be arbitrary, main interest being in the regression parameters. (p. 190) A Conditional Likelihood later called Partial Likelihood. Score Test = LogRank Test 4 Biomarkers

Discussion: Impact: Mr. Richard Peto (Oxford University): I have greatly enjoyed Professor Cox s paper. It seems to me to formulate and to solve the problem of regression of prognosis on other factors perfectly, and it is very pretty. Science Citation Index: 29,140 citations (29 June 2015) David R. Cox is knighted in 1985 in recognition of his scientific contributions. 5 Biomarkers

Sir David R. Cox 6 Biomarkers

Cox Regression Model Response Variable: Observed: (Y i, δ i ) Of Interest: T i, or λ(t) T i survival, with distribution given by: Survival function: S(t) Hazard function: λ(t) Observed Covariates: X 1, X 2,..., X k For subject j we observe: (Y j, δ j ), X 1j, X 2j,..., X kj IDEA: same as with other regression models Model relates the covariates X 1,..., X k to the distribution (either S(t) or λ(t)) of the response variable of interest, T. 7 Biomarkers

Cox Regression Model Model: λ(t X 1, X 2,..., X k ) = λ 0 (t) exp(β 1 X 1 + β 2 X 2 +... + β k X k ) Model: alternatively expressed as log λ(t X 1,..., X k ) = log λ 0 (t) + β 1 X 1 + β 2 X 2 +... + β k X k S(t X 1,..., X k ) = [S 0 (t)] [exp(β 1X 1 +β 2 X 2 +...+β k X k )] Note definitions: λ 0 (t) = λ(t X 1 = 0, X 2 = 0,..., X k = 0) S 0 (t) = S(t X 1 = 0, X 2 = 0,..., X k = 0) 8 Biomarkers

Interpreting Cox Regression Coefficients Proportional Hazards: RR = λ(t X 1, X 2,..., X k ) λ(t X 1 = 0, X 2 = 0,..., X k = 0) = exp(β 1 X 1 + β 2 X 2 +... + β k X k ) RR above is: Relative risk, or hazard, of death comparing subjects with covariate values (X 1, X 2,..., X k ) to subjects with covariate values (0, 0,..., 0). 9 Biomarkers

Interpreting Cox Regression Coefficients In General: β m is the log RR (or log hazard ratio, log HR) comparing subjects with X m = (x + 1) to subjects with X m = x, given that all other covariates are constant (ie. the same for the groups compared). here {}}{ λ(t X 1,..., X m = (x + 1)..., X k ) λ(t X 1,..., X m = ( x ),..., X k ) }{{} here λ 0 (t) exp(β 1 X 1 +... β m (x + 1) +... + β k X k ) λ 0 (t) exp(β 1 X 1 +... β m (x) +... + β k X k ) = = exp(β m ) 10 Biomarkers

Interpreting Cox Regression Coefficients The RR Comparing 2 Covariate Values (vectors): RR comparing (X 1, X 2,..., X k ) to (X 1, X 2,..., X k ). RR(X vs. X ) = λ(t X 1, X 2,..., X k ) λ(t X 1, X 2..., X k ) = exp [ β 1 (X 1 X 1) + β 2 (X 2 X 2) +... + β k (X k X k) ] 11 Biomarkers

Cox Model Examples 1: One dichotomous covariate X E = 1 if exposed; X E = 0 if not exposed. λ(t X E ) = λ 0 (t) exp(βx E ) Hazard Functions log Hazard Functions hazard (lambda) 0.0 0.5 1.0 1.5 2.0 log hazard (log lambda) 0.5 0.0 0.5 0.0 0.2 0.4 0.6 0.8 1.0 Time 0.0 0.2 0.4 0.6 0.8 1.0 Time 12 Biomarkers

Cox Model Examples 2: Dichotomous covariate; Dichotomous confounder X C = 1 if level 2; X C = 0 if level 1. λ(t X E, X C ) = λ 0 (t) exp(β 1 X E + β 2 X C ) log hazard (log lambda) 0.5 0.0 0.5 1.0 1.5 2.0 0.0 0.2 0.4 0.6 0.8 1.0 Time 13 Biomarkers

Cox Model Examples 3: Dichotomous covariate; confounder; (interaction) With interaction λ(t X E, X C ) = λ 0 (t) exp(β 1 X E + β 2 X C + β 3 X E X C ) log hazard (log lambda) 0.5 0.0 0.5 1.0 1.5 2.0 0.0 0.2 0.4 0.6 0.8 1.0 Time 14 Biomarkers

Cox Models: Comments In each example the hazard functions are parallel that is, the change in hazard over time was the same for each covariate value. For regression models there are different possible tests for a hypothesis about coefficients: likelihood ratio; score; Wald. (more later!) The score test for example (1) with H 0 : β = 0 is the LogRank Test. The score test using dummy variables to code (4) groups with H 0 : β 2 = β 3 = β 4 = 0 is the same as the K-sample Heterogeneity test (generalization of LogRank). 15 Biomarkers

Fitting the Cox Model Obtain estimates of β 1, β 2,..., β k by maximizing the partial likelihood function: P L(β 1, β 2,..., β k ). β 1, β 2,..., β k are MPLE s CI s for β j using: β j ± Z 1 α/2 SE( β j ). CI s for hazard ratio (HR) using: exp[ β j Z 1 α/2 SE( β j )], exp[ β j +Z 1 α/2 SE( β j )] Wald test, score test, and likelihood ratio test similar to logistic regression. Now using the partial likelihood. 16 Biomarkers

Partial Likelihood Model: λ(t X 1,..., X k ) = λ 0 (t) exp(β 1 X 1 +... + β k X k ) Order Data: t (i) is the ith ordered failure time. Assume no ties, and let X (i) = (X 1(i), X 2(i),..., X k(i) ) be the covariates for the subject who dies at time t (i). Let R i denote the risk set at time t (i), which denotes all subjects with Y j t (i). Partial Likelihood: (no ties) P L(β 1,..., β k ) = J i=1 exp(β 1 X 1(i) + β 2 X 2(i) +... + β k X k(i) ) j R i exp(β 1 X 1j + β 2 X 2j +... + β k X kj ) 17 Biomarkers

Risk Set Illustration D=death, L=lost, A=alive Subject 0 2 4 6 D L D D D L 0 2 4 6 8 Study Time 18 Biomarkers

Risk Set Illustration Failure times: t (1) = 1, t (2) = 3, t (3) = 4, t (4) = 6. Risk sets: R 1 = { } R 2 = { } R 3 = { } R 4 = { } 19 Biomarkers

Partial Likelihood Justification Cox (1972) No information can be contributed about β by time intervals in which no failures occur because the component λ 0 (t) might conceivably be identically zero in such intervals. Cox (1972) We therefore argue conditionally on the set {t (i) } of instants at which failure occur. Cox (1972) For the particular failure at time t (i) conditional on the risk set, R i, the probability that the failure is on the individual as observed is: exp(β 1 X 1(i) + β 2 X 2(i) +... + β k X k(i) ) j R i exp(β 1 X 1j + β 2 X 2j +... + β k X kj ). Note: This likelihood contribution has the exact same form as a (matched) logistic regression conditional likelihood. 20 Biomarkers

Partial Likelihood Justification Q: What is the probability of the observed data at time t (i) given that one person was observed to die among the risk set? Note : P [T (t, t + t] T t] λ(t) t Person who died : λ 0 (t) exp(β 1 X 1(i) +... + β k X k(i) ) t = P (i) Generic j in R i : λ 0 (t) exp(β 1 X 1j +... + β k X kj ) t = P j 21 Biomarkers

Partial Likelihood Justification Probability One Death, Was (i) : P (i) (1 P 1 ) (1 P 2 )... skip(i) (1 P k ) Probability of One Death: P( One Death ) P( j died, others lived ) = = P( 1 died, others lived ) + P( 2 died, others lived ) +... + P( k died, others lived ) P j k j(1 P k ) Note: (1 P j ) 1 for small t. 22 Biomarkers

Partial Likelihood Justification Now calculate the desired quantity: P( Observed Data 1 death ) = = P( Only (i) Dies ) P( One Death ) P (i) k (i) (1 P k) j R i P j k j (1 P k) P (i) j R i P j P (i) j R i P j = = λ 0 (t) exp(β 1 X 1(i) + β 2 X 2(i) +... + β k X k(i) ) t j R i λ 0 (t) exp(β 1 X 1j + β 2 X 2j +... + β k X kj ) t exp(β 1 X 1(i) + β 2 X 2(i) +... + β k X k(i) ) j R i exp(β 1 X 1j + β 2 X 2j +... + β k X kj ) 23 Biomarkers

Partial Likelihood Comments Notice that our model is equivalent to log λ(t X 1... X k ) = α(t) + β 1 X 1 +... β k X k where α(t) = log λ 0 (t), but the PL does not depend on α(t). Using the partial likelihood (PL) to estimate parameters provides estimates of the regression coefficients, β j, only. The model is called semi-parametric since we only need to parameterize the effect of covariates, and do not say anything about the baseline hazard. Q: Why not just use standard maximum likelihood? A: To do so would require choosing a model for the baseline hazard, but we actually don t need to do that! 24 Biomarkers

Partial Likelihood and Ties If there is more than one death at time t (i) then the denominator for the partial likelihood contribution will involve a large number of terms. For example if there are 20 people at risk at time t (i) and 3 die then there are 20 choose 3 = 1140 terms. Approximation (Breslow, Peto) default in STATA The numerator can be calculated and represented using: Sum X 1 for deaths: s 1i = j:y j =t (i),δ j =1 X 1j Sum X 2 for deaths: s 2i = j:y j =t (i),δ j =1 X 2j etc. The approximation with D i deaths at time t (i) is: P L A = J i=1 exp(β 1 s 1i + β 2 s 2i +... + β k s ki ) [ j Ri exp(β 1X 1j + β 2 X 2j +... + β k X kj )] Di 25 Biomarkers

Comments on Ties If continuous times, T i, then ties should not be an issue. Time recorded in (days,minutes). Modest sample size. If discrete times, T i [t k, t k+1 ), recorded then consider methods appropriate for discrete-time data (e.g. variants on logistic regression) See Singer & Willett (2003) chpts 10 12; H& L pp. 268-9. 25-1 Biomarkers

Comments on Ties However, there is plenty of room between continuous and discrete. Example: USRDS Data = 200,000 subjects. 25% annual mortality = 50,000 deaths/year. 50,000 deaths/365 days = 137 deaths/day. Kalbfleisch & Prentice (2002), section 4.2.3 summarize options and relative pros/cons. Breslow method simple to implement/justify; some bias if discrete. Efron method also simple comp; performs well. exact method justified; comp challenge. Should be minor issue in general, and if not then perhaps a discrete-time approach should be considered. 25-2 Biomarkers

(Partial) Likelihood Ratio Tests Full Model: λ(t X) = λ 0 (t) exp(β 1 X 1 +... + β p X p + β p+1 X p+1 +... β k X k ) }{{} extra Reduced Model: In order to test: λ(t X) = λ 0 (t) exp(β 1 X 1 +... + β p X p ) H 0 : Reduced model H 0 : β p+1 =... = β k = 0 H 1 : Full model H 1 : extra coeff 0 somewhere Use the partial likelihood ratio statistic X 2 P LR = [2 log P L(FullModel) 2 log P L(ReducedModel)] 26 Biomarkers

(Partial) Likelihood Ratio Tests Under H 0 (reduced is correct) then X 2 P LR χ2 (df = (k p)) Degrees of freedom, df = (k p), equals the number of parameters set to 0 by the null hypothesis. Application is for situations where the models are nested the reduced model is a special case of the full model. Also can use Wald tests, and/or score tests. The PLR (Partial Likelihood Ratio) test is particularly useful when df> 1. The PLR statistic is equivalent (using a double negative ) to: X 2 P LR = {[ 2 log P L(ReducedModel)] [ 2 log P L(FullModel)]} 27 Biomarkers

Example of Cox Regression Primary Billiary Cirrhosis (PBC) Data: A randomized trial with n = 312 subjects. Long-term follow-up (10 years!) Objective: Could the available clinical information be used to construct a predictive model that could be used to guide medical decisions? 28 Biomarkers

Example Analysis using R PBC pbc.data <- read.table( "pbc-data.txt", header=t ) # pbc.data$survival.time <- pbc.data$fudays pbc.data$survival.status <- as.integer( pbc.data$status==2 ) # ##### library(survival) ##### # ##### Kaplan-Meier curve # Sout <- survfit( Surv(survival.time,survival.status) ~ 1, data=pbc.data plot( Sout, mark.time=f, col="blue", xlab="time (days)", ylab="survival" lwd=2 ) 29 Biomarkers

PBC Data Kaplan Meier Estimate of Survival Survival 0.0 0.2 0.4 0.6 0.8 1.0 0 1000 2000 3000 4000 Time (days) 30 Biomarkers

Example of Cox Regression PBC ###### Cox model with log(bili), log(protime), edema, albumin, age ###### generate linear predictor for ordinary cox model: eta5 # fit <- coxph( Surv(survival.time,survival.status) ~ log(bili) + log(protime) + edema + albumin + age, data=pbc.data ) summary( fit ) # ##### get the risk score # eta5 <- fit$linear.predictors 31 Biomarkers

Example of Cox Regression PBC n= 312, number of events= 125 coef exp(coef) se(coef) z Pr(> z ) log(bili) 8.773e-01 2.404e+00 9.895e-02 8.866 < 2e-16 *** log(protime) 3.013e+00 2.035e+01 1.025e+00 2.939 0.003296 ** edema 7.846e-01 2.192e+00 2.998e-01 2.617 0.008872 ** albumin -9.445e-01 3.889e-01 2.370e-01-3.985 6.73e-05 *** age 9.169e-05 1.000e+00 2.363e-05 3.881 0.000104 *** --- Rsquare= 0.471 (max possible= 0.983 ) Likelihood ratio test= 198.5 on 5 df, p=0 32 Biomarkers

Example of Cox Regression PBC KM Estimates by Model(5) Tertiles Survival 0.0 0.2 0.4 0.6 0.8 1.0 0 1000 2000 3000 4000 Time (days) 33 Biomarkers

Example of Cox Regression PBC ##### generate linear predictor for ordinary cox model: eta4 # # fit <- coxph( Surv(survival.time,survival.status) ~ log(protime) + edema + albumin + age, data=pbc.data ) summary( fit ) # # get the risk score # eta4 <- fit$linear.predictors 34 Biomarkers

Example of Cox Regression PBC n= 312, number of events= 125 coef exp(coef) se(coef) z Pr(> z ) log(protime) 4.140e+00 6.283e+01 8.703e-01 4.758 1.96e-06 *** edema 1.190e+00 3.288e+00 2.953e-01 4.031 5.56e-05 *** albumin -1.314e+00 2.687e-01 2.228e-01-5.897 3.71e-09 *** age 6.689e-05 1.000e+00 2.515e-05 2.660 0.00782 ** --- Rsquare= 0.32 (max possible= 0.983 ) Likelihood ratio test= 120.5 on 4 df, p=0 35 Biomarkers

Example of Cox Regression PBC KM Estimates by Model(4) Tertiles Survival 0.0 0.2 0.4 0.6 0.8 1.0 0 1000 2000 3000 4000 Time (days) 36 Biomarkers

Example of Cox Regression PBC Compare eta5 and eta4 eta5 2 0 2 4 2 1 0 1 2 3 4 eta4 37 Biomarkers

Summary Cox regression is semi-parametric (and popular!) Covariates can be modeled in standard ways and inference performed using partial likelihood, score, and Wald tests. An important idea is the risk set at time t which usually includes a single CASE and multiple CONTROLS (at that time). Q: How well does the predictive model perform? Q: How to link the model to making medical decisions? 38 Biomarkers