Time-dependent coefficients

Similar documents
Residuals and model diagnostics

Multistate models and recurrent event models

Multistate models and recurrent event models

Proportional hazards regression

Survival analysis in R

Accelerated Failure Time Models

The coxvc_1-1-1 package

Tied survival times; estimation of survival probabilities

Survival analysis in R

Semiparametric Regression

Survival Regression Models

Cox regression: Estimation

Cox s proportional hazards/regression model - model assessment

SSUI: Presentation Hints 2 My Perspective Software Examples Reliability Areas that need work

Lecture 8 Stat D. Gillen

Lecture 9. Statistics Survival Analysis. Presented February 23, Dan Gillen Department of Statistics University of California, Irvine

Lecture 12. Multivariate Survival Data Statistics Survival Analysis. Presented March 8, 2016

Reduced-rank hazard regression

The Weibull Distribution

Lecture 7 Time-dependent Covariates in Cox Regression

Lecture 10. Diagnostics. Statistics Survival Analysis. Presented March 1, 2016

Analysis of Time-to-Event Data: Chapter 6 - Regression diagnostics

UNIVERSITY OF CALIFORNIA, SAN DIEGO

On a connection between the Bradley-Terry model and the Cox proportional hazards model

Continuous case Discrete case General case. Hazard functions. Patrick Breheny. August 27. Patrick Breheny Survival Data Analysis (BIOS 7210) 1/21

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data

In contrast, parametric techniques (fitting exponential or Weibull, for example) are more focussed, can handle general covariates, but require

Censoring mechanisms

Checking model assumptions with regression diagnostics

On a connection between the Bradley Terry model and the Cox proportional hazards model

Relative-risk regression and model diagnostics. 16 November, 2015

Chapter 4 Regression Models

Part [1.0] Measures of Classification Accuracy for the Prediction of Survival Times

Lecture 7. Proportional Hazards Model - Handling Ties and Survival Estimation Statistics Survival Analysis. Presented February 4, 2016

Validation. Terry M Therneau. Dec 2015

More on Cox-regression

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data

ST745: Survival Analysis: Cox-PH!

Goodness-Of-Fit for Cox s Regression Model. Extensions of Cox s Regression Model. Survival Analysis Fall 2004, Copenhagen

Multivariate Survival Analysis

Time-varying covariates and coefficients in Cox regression models

Fitting Cox Regression Models

Other likelihoods. Patrick Breheny. April 25. Multinomial regression Robust regression Cox regression

Extensions of Cox Model for Non-Proportional Hazards Purpose

Stat 642, Lecture notes for 04/12/05 96

Survival Analysis. STAT 526 Professor Olga Vitek

A fast routine for fitting Cox models with time varying effects

Time-dependent covariates

Multivariable Fractional Polynomials

Joint Modeling of Longitudinal Item Response Data and Survival

Package crrsc. R topics documented: February 19, 2015

Stability and the elastic net

REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520

Regularization in Cox Frailty Models

You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?

Multivariable Fractional Polynomials

Survival Model Predictive Accuracy and ROC Curves

Faculty of Health Sciences. Cox regression. Torben Martinussen. Department of Biostatistics University of Copenhagen. 20. september 2012 Slide 1/51

Theoretical results for lasso, MCP, and SCAD

Multinomial Logistic Regression Models

Modelling geoadditive survival data

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

Survival Analysis Math 434 Fall 2011

β j = coefficient of x j in the model; β = ( β1, β2,

1 Introduction. 2 Residuals in PH model

Part III Measures of Classification Accuracy for the Prediction of Survival Times

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis

Survival Analysis. 732G34 Statistisk analys av komplexa data. Krzysztof Bartoszek

Longitudinal + Reliability = Joint Modeling

Müller: Goodness-of-fit criteria for survival data

Cox Proportional-Hazards Regression for Survival Data in R

Outline. Cox's regression model Goodness-of-t methods. Cox's proportional hazards model: Survival analysis

Luke B Smith and Brian J Reich North Carolina State University May 21, 2013

Extensions of Cox Model for Non-Proportional Hazards Purpose

Understanding the Cox Regression Models with Time-Change Covariates

FULL LIKELIHOOD INFERENCES IN THE COX MODEL

STAT 525 Fall Final exam. Tuesday December 14, 2010

Chapter 7: Hypothesis testing

USING MARTINGALE RESIDUALS TO ASSESS GOODNESS-OF-FIT FOR SAMPLED RISK SET DATA

MAS3301 / MAS8311 Biostatistics Part II: Survival

Lecture 11. Interval Censored and. Discrete-Time Data. Statistics Survival Analysis. Presented March 3, 2016

Package penmsm. R topics documented: February 20, Type Package

Lecture 22 Survival Analysis: An Introduction

Smoothing Spline-based Score Tests for Proportional Hazards Models

One-sample categorical data: approximate inference

Covariance test Selective inference. Selective inference. Patrick Breheny. April 18. Patrick Breheny High-Dimensional Data Analysis (BIOS 7600) 1/20

MAS3301 / MAS8311 Biostatistics Part II: Survival

7.1 The Hazard and Survival Functions

Survival Prediction Under Dependent Censoring: A Copula-based Approach

Some general observations.

The lasso. Patrick Breheny. February 15. The lasso Convex optimization Soft thresholding

Survival Models for the Social and Political Sciences Week 6: More on Cox Regression

Variable Selection and Model Choice in Survival Models with Time-Varying Effects

Amendments and Errata for Cook and Lawless (2007)

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations

Package CoxRidge. February 27, 2015

Description Syntax for predict Menu for predict Options for predict Remarks and examples Methods and formulas References Also see

Chapter 4: Generalized Linear Models-II

Dynamic Models Part 1

49th European Organization for Quality Congress. Topic: Quality Improvement. Service Reliability in Electrical Distribution Networks

Transcription:

Time-dependent coefficients Patrick Breheny December 1 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/20

Introduction As we discussed previously, stratification allows one to handle variables that do not satisfy the proportional hazards assumption, but this comes at a cost Namely, the variables no longer act upon the likelihood in a regular way, and we cannot use any of the methods we have developed in this course to carry out inference or model selection concerning them An alternative approach is to directly model the dependence of those variables upon time Patrick Breheny Survival Data Analysis (BIOS 7210) 2/20

Schoenfeld residuals We will start with exploratory methods for examining the effect of a dependence on time; the most widely used residuals for this purpose are Schoenfeld residuals The Schoenfeld residual s ij is, in a sense, the contribution that subject i makes to the jth score function: s ij = d i (x ij π T i x j ) = d i (x ij E i X j ); note that s ij = 0 for all censored observations In general, the Schoenfeld residuals have similar uses as delta-beta residuals in terms of identifying influential observations with respect to estimating β j Patrick Breheny Survival Data Analysis (BIOS 7210) 3/20

Schoenfeld residuals and time-varying effects However, Schoenfeld residuals have a very useful property in terms of investigating the dependence of an effect on time Suppose that the effect of covariate j is not necessarily constant, but varies with time: η i (t) = x i1 β 1 + x i2 β 2 + + x ij β j (t) Then one can define a scaled Schoenfeld residual, s ij = β j + s ij /a, where a is an appropriate scaling factor, such that the scaled residuals possess the following property: E s ij = β j (t) Patrick Breheny Survival Data Analysis (BIOS 7210) 4/20

Schoenfeld residuals and time-varying effects (cont d) In the interest of time, we ll have to skip the details of this derivation, but the details are in Grambsch and Therneau (1994), Proportional hazards tests and diagnostics based on weighted residuals Both Schoenfeld residuals (type='schoenfeld') and scaled Schoenfeld residuals (type='scaledsch') are available as options from residuals.coxph Patrick Breheny Survival Data Analysis (BIOS 7210) 5/20

GVHD data: Treatment 3 2 Scaled Schoenfield residuals 1 0 1 2 3 10 20 30 40 50 Time (Days) Patrick Breheny Survival Data Analysis (BIOS 7210) 6/20

VA Lung: Karnofsky 0.0 0.5 1.0 1.5 0.20 0.15 0.10 0.05 0.00 0.05 0.10 Time (Years) Scaled Schoenfield residuals Patrick Breheny Survival Data Analysis (BIOS 7210) 7/20

GVHD data: Age 0.20 0.15 Scaled Schoenfield residuals 0.10 0.05 0.00 0.05 0.10 0.15 10 20 30 40 50 Time (Days) Patrick Breheny Survival Data Analysis (BIOS 7210) 8/20

Testing; R code One can also carry out a formal test for non-proportionality by testing whether the correlation between time and Schoenfeld residuals is zero These methods (the exploratory plot and the test) are available from the cox.zph function: > fit <- coxph(surv(time, Status) ~ Group, GVHD) > cox.zph(fit) rho chisq p GroupMTX+CSP -0.537 6.08 0.0137 > plot(cox.zph(fit)) Patrick Breheny Survival Data Analysis (BIOS 7210) 9/20

Results For the VA lung data, Karnofsky status is highly non-proportional (p = 0.0003) For the GVHD data, the proportional hazards assumption is violated for treatment (p = 0.01) but not for age (p = 0.6) This would suggest that while a constant effect β is adequate for some terms, like age, we might wise to consider allowing time-dependent effects, β(t), for other variables Patrick Breheny Survival Data Analysis (BIOS 7210) 10/20

Derivation R code Time-varying coefficients: Main idea To illustrate the basic idea, let s consider a simple linear time dependence: β j (t) = β j0 + β j1 t; β j1 = 0 would correspond to the usual proportional hazards structure In principle, this poses no great challenge to estimation: the Cox partial likelihood revolves around calculating π ij values, the probability of subject i failing at time t j Patrick Breheny Survival Data Analysis (BIOS 7210) 11/20

Specifics Derivation R code With a time-dependent effect, we simply need to plug the appropriate t j into β j (t) when calculating π ij Let s assume for the sake of simplicity that we only have one term in the model; we then have π ij = exp(x iβ 0 + x i β 1 t j ) k exp(x kβ 0 + x k β 1 t j ) The score and information are simply functions of {π ij }, so this is all we need to modify in order to estimate β 0 and β 1 and perform the usual inferential procedures Patrick Breheny Survival Data Analysis (BIOS 7210) 12/20

GVHD data: Treatment effect Derivation R code 2 0 Treatment effect 2 4 6 8 10 10 20 30 40 50 Time (Days) Patrick Breheny Survival Data Analysis (BIOS 7210) 13/20

What not to do Derivation R code On the surface, the idea of a time-varying effect may seem equivalent to an interaction between treatment and time In particular, you may be tempted to code it as coxph(s ~ Trt + Trt:Time, GVHD) This is completely wrong, though; it results in η i = x i β 0 + x i t i β 1 ; where η i is static across time, the effect being frozen at the point where subject i leaves the study Patrick Breheny Survival Data Analysis (BIOS 7210) 14/20

The tt function Derivation R code Instead, we need some way to tell coxph to recalculate the linear predictors at each distinct time t j This is accomplished through the tt function (for time transform ): coxph(s ~ Trt + tt(trt), GVHD, tt=function(x, t,...) {x*t}) tt can be more complicated than this; in particular, it can return a matrix to represent a multiparameter function of time Patrick Breheny Survival Data Analysis (BIOS 7210) 15/20

Summary of tt models Derivation R code summary will provide the usual summary of the coefficients, for both the proportional hazard coefficients and the coefficients that represent the time-varying effect: > summary(fit) n= 64, number of events= 20 coef exp(coef) se(coef) z Pr(> z ) Trt 2.01256 7.48242 1.51909 1.325 0.1852 tt(trt) -0.15904 0.85296 0.08242-1.930 0.0537.... Concordance= 0.594 (se = 0.221 ) Rsquare= 0.171 (max possible= 0.915 ) Likelihood ratio test= 12.03 on 2 df, p=0.002437 In this case, the coefficient for the departure from proportionality is only borderline significant, but the overall test is highly significant Patrick Breheny Survival Data Analysis (BIOS 7210) 16/20

Derivation R code VA Lung data: Karnofsky effect (linear) 0.06 Treatment effect 0.04 0.02 0.00 0.02 0.04 0.06 0.0 0.5 1.0 1.5 Time (Years) Patrick Breheny Survival Data Analysis (BIOS 7210) 17/20

Exponential decay Derivation R code Our exploratory plots, however, indicated that a linear term would not accurately capture the time-dependent effect of Karnofsky status A more appropriate model might be something like exponential decay: β(t) = β 0 + β 1 exp( t/τ), where β(t) gradually decays to a constant β 0 in the long run, and τ controls the rate of that decay Now, the τ parameter is not a typical linear coefficient, so the standard Cox Newton-Raphson method will not be able to estimate it; we will have to employ a grid search Patrick Breheny Survival Data Analysis (BIOS 7210) 18/20

Exponential decay rate Derivation R code 468.0 468.5 Log likelihood 469.0 469.5 470.0 0.5 1.0 1.5 2.0 τ For comparison, the log likelihood of the linear model is -471.4 Patrick Breheny Survival Data Analysis (BIOS 7210) 19/20

Derivation R code VA Lung data: Karnofsky effect (exponential decay) 0.02 Treatment effect 0.00 0.02 0.04 0.06 0.08 0.0 0.5 1.0 1.5 Time (Years) Patrick Breheny Survival Data Analysis (BIOS 7210) 20/20