Residuals and model diagnostics

Similar documents
Proportional hazards regression

Tied survival times; estimation of survival probabilities

Accelerated Failure Time Models

Time-dependent coefficients

Cox regression: Estimation

Survival Analysis Math 434 Fall 2011

The Weibull Distribution

Cox s proportional hazards/regression model - model assessment

Lecture 7 Time-dependent Covariates in Cox Regression

Censoring mechanisms

Ph.D. course: Regression models

Analysis of Time-to-Event Data: Chapter 6 - Regression diagnostics

Survival Regression Models

Semiparametric Regression

You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?

Lecture 10. Diagnostics. Statistics Survival Analysis. Presented March 1, 2016

Part [1.0] Measures of Classification Accuracy for the Prediction of Survival Times

Multistate models and recurrent event models

Part III Measures of Classification Accuracy for the Prediction of Survival Times

Survival analysis in R

Goodness-Of-Fit for Cox s Regression Model. Extensions of Cox s Regression Model. Survival Analysis Fall 2004, Copenhagen

Extensions of Cox Model for Non-Proportional Hazards Purpose

Multistate models and recurrent event models

Continuous case Discrete case General case. Hazard functions. Patrick Breheny. August 27. Patrick Breheny Survival Data Analysis (BIOS 7210) 1/21

Ph.D. course: Regression models. Introduction. 19 April 2012

MAS3301 / MAS8311 Biostatistics Part II: Survival

Ph.D. course: Regression models. Regression models. Explanatory variables. Example 1.1: Body mass index and vitamin D status

Support Vector Hazard Regression (SVHR) for Predicting Survival Outcomes. Donglin Zeng, Department of Biostatistics, University of North Carolina

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA

Müller: Goodness-of-fit criteria for survival data

Definitions and examples Simple estimation and testing Regression models Goodness of fit for the Cox model. Recap of Part 1. Per Kragh Andersen

Survival Analysis. Stat 526. April 13, 2018

Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL

Lecture 7. Proportional Hazards Model - Handling Ties and Survival Estimation Statistics Survival Analysis. Presented February 4, 2016

Lecture 8 Stat D. Gillen

Faculty of Health Sciences. Cox regression. Torben Martinussen. Department of Biostatistics University of Copenhagen. 20. september 2012 Slide 1/51

Other likelihoods. Patrick Breheny. April 25. Multinomial regression Robust regression Cox regression

Lecture 5 Models and methods for recurrent event data

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Time-dependent covariates

Extensions of Cox Model for Non-Proportional Hazards Purpose

Faculty of Health Sciences. Regression models. Counts, Poisson regression, Lene Theil Skovgaard. Dept. of Biostatistics

Multivariate Survival Analysis

Survival analysis in R

1 Introduction. 2 Residuals in PH model

STAT Sample Problem: General Asymptotic Results

Survival Analysis. STAT 526 Professor Olga Vitek

8. Parametric models in survival analysis General accelerated failure time models for parametric regression

Chapter 4 Regression Models

Checking model assumptions with regression diagnostics

Survival Analysis I (CHL5209H)

Consider Table 1 (Note connection to start-stop process).

Log-linearity for Cox s regression model. Thesis for the Degree Master of Science

Building a Prognostic Biomarker

Chapter 1 Statistical Inference

Longitudinal + Reliability = Joint Modeling

Joint longitudinal and time-to-event models via Stan

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis

Description Syntax for predict Menu for predict Options for predict Remarks and examples Methods and formulas References Also see

1 Glivenko-Cantelli type theorems

β j = coefficient of x j in the model; β = ( β1, β2,

Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation

Power and Sample Size Calculations with the Additive Hazards Model

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data

Correlation and regression

ST745: Survival Analysis: Cox-PH!

Exercises. (a) Prove that m(t) =

1 The problem of survival analysis

Individualized Treatment Effects with Censored Data via Nonparametric Accelerated Failure Time Models

( t) Cox regression part 2. Outline: Recapitulation. Estimation of cumulative hazards and survival probabilites. Ørnulf Borgan

The coxvc_1-1-1 package

Reduced-rank hazard regression

REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520

Survival Analysis. Lu Tian and Richard Olshen Stanford University

Poisson regression: Further topics

Survival Analysis. 732G34 Statistisk analys av komplexa data. Krzysztof Bartoszek

In contrast, parametric techniques (fitting exponential or Weibull, for example) are more focussed, can handle general covariates, but require

A NOTE ON ROBUST ESTIMATION IN LOGISTIC REGRESSION MODEL

Philosophy and Features of the mstate package

TMA 4275 Lifetime Analysis June 2004 Solution

9 Estimating the Underlying Survival Distribution for a

Model comparison: Deviance-based approaches

Validation. Terry M Therneau. Dec 2015

Multi-state Models: An Overview

Joint Modeling of Longitudinal Item Response Data and Survival

Modelling geoadditive survival data

Multi-state models: prediction

Relative-risk regression and model diagnostics. 16 November, 2015

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang

Survival Analysis for Case-Cohort Studies

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations

4. Comparison of Two (K) Samples

Logistic regression: Miscellaneous topics

STAT331. Combining Martingales, Stochastic Integrals, and Applications to Logrank Test & Cox s Model

UNIVERSITY OF CALIFORNIA, SAN DIEGO

Lecture 9. Statistics Survival Analysis. Presented February 23, Dan Gillen Department of Statistics University of California, Irvine

ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables

STAT331 Lebesgue-Stieltjes Integrals, Martingales, Counting Processes

Transcription:

Residuals and model diagnostics Patrick Breheny November 10 Patrick Breheny Survival Data Analysis (BIOS 7210) 1/42

Introduction Residuals Many assumptions go into regression models, and the Cox proportional hazards model, despite making no assumptions about the baseline hazard, is no exception Diagnostic methods are useful in all types of regression models to investigate the validity of those assumptions and identify ways in which they might be violated Residuals play a big role in regression method diagnostics To build model diagnostics for Cox regression, we first need to discuss methods for extending residuals to the case of censored data Patrick Breheny Survival Data Analysis (BIOS 7210) 2/42

Cumulative hazard transformation We begin with the following useful theorem: Theorem: Suppose T is a continuous nonnegative random variable with cumulative hazard function Λ. Then the random variable Y = Λ(T ) follows an exponential distribution with rate λ = 1. Thus, one way of checking the validity of a model is by comparing the model s estimates {ˆΛ(t i )} against the standard exponential distribution Patrick Breheny Survival Data Analysis (BIOS 7210) 3/42

In the context of the proportional hazards model, we have ê i = ˆΛ 0 (t i ) exp(x T i β), although the idea is very general and can be applied to any kind of model The terms {ê i } are called the, although residual is perhaps a misnomer, in that it s more of a transformation than a residual Patrick Breheny Survival Data Analysis (BIOS 7210) 4/42

Diagnostic plot of : PBC data Diagnostics based on are based on fitting a Kaplan-Meier (or Nelson-Aalen) curve to {ê i } and comparing it to that of the standard exponential For the PBC data with trt, stage, hepato, and bili included, we have 2.0 Cumulative hazard 1.5 1.0 0.5 0.0 0.0 0.5 1.0 1.5 2.0 Residual Patrick Breheny Survival Data Analysis (BIOS 7210) 5/42

Simulated example: Lack of fit To illustrate the utility of the plot, let s simulate some data from a lognormal AFT model and fit a Cox model (note that the PH assumption is violated here): 6 5 Cumulative hazard 4 3 2 1 0 0 1 2 3 4 5 6 Residual Patrick Breheny Survival Data Analysis (BIOS 7210) 6/42

Zooming in Residuals One shortcoming of this plot is that violations at small e values tend to be hidden: 0.25 0.20 Cumulative hazard 0.15 0.10 0.05 0.00 0.00 0.05 0.10 0.15 0.20 0.25 Residual Patrick Breheny Survival Data Analysis (BIOS 7210) 7/42

Plotting the standardized difference Alternatively, one may consider plotting the standardized difference between the fitted cumulative hazard and the standard exponential, which reveals the lack of fit much more clearly: 3 2 (Λ^(e) e) SE 1 0 1 2 3 0 0.02 0.14 1 7.39 Residual Patrick Breheny Survival Data Analysis (BIOS 7210) 8/42

Standardized differences: PBC data Here is what the standardized plot looks like for the PBC data: 3 2 (Λ^(e) e) SE 1 0 1 2 3 0.02 0.1 0.51 2.72 Residual Patrick Breheny Survival Data Analysis (BIOS 7210) 9/42

GVHD data Residuals Stratification of the residuals according to one of the variables in the model can also help to discover model violations For example, in the GVHD data, here is what the overall plot looks like: 0.6 0.5 Cumulative hazard 0.4 0.3 0.2 0.1 0.0 0.0 0.1 0.2 0.3 0.4 0.5 0.6 Residual Patrick Breheny Survival Data Analysis (BIOS 7210) 10/42

GVHD data: Stratifying by treatment And here is the plot stratifying by treatment: 0.6 0.5 Cumulative hazard 0.4 0.3 0.2 0.1 0.0 0.0 0.1 0.2 0.3 0.4 0.5 0.6 Residual We will discuss stratification in more detail in the next lecture Patrick Breheny Survival Data Analysis (BIOS 7210) 11/42

Shortcomings of the Cox-Snell residual One drawback to the is that they don t provide much insight into why the model s assumptions are violated It would be more appealing if each residual took on a positive or negative value indicating whether they patient survived longer, as opposed to shorter, that the model predicts, and by how much Patrick Breheny Survival Data Analysis (BIOS 7210) 12/42

Consider, then, the following residual: ˆm i = d i ˆΛ i (t i ) = d i ê i ; we have seen this quantity a few times already (one-sample logrank test, score function for exponential regression) This represents the discrepancy between the observed value of a subject s failure indicator and its expected value, integrated over the time for which that patient was at risk Positive values mean that the patient died sooner than expected (according to the model); negative values mean that the patient lived longer than expected (or were censored) Patrick Breheny Survival Data Analysis (BIOS 7210) 13/42

Martingales Residuals A stochastic process M(t) that satisfies (i) EM(t) = 0 for all t and (ii) E{M(t) M(s)} = M(s)} for all s < t is known in statistics as a martingale The stochastic process N(t) t 0 Y (t)dλ(t), where N(t) is the counting process that records whether the subject has failed by time t or not, satisfies these two properties For this reason, the residuals { ˆm i } defined on the previous slide are known as martingale residuals Patrick Breheny Survival Data Analysis (BIOS 7210) 14/42

for the PBC model Censored Died / Liver failure Martingale residual 1 0 1 2 3 4 0 2 4 6 8 10 12 Time The large outlier is a patient with stage 4 cirrhosis and a bilirubin concentration of 14.4 (96th percentile), yet survived 7 years Patrick Breheny Survival Data Analysis (BIOS 7210) 15/42

residuals Residuals can be obtained from the survival package by calling residuals(fit), where fit is a fitted coxph model (resid(fit) also works as a shortcut) The martingale residuals are returned by default, although seven other options are available and can be requested by specifying the type option It may be noted that type= coxsnell is not one of the options, although you can easily calculate the Cox-Snell residuals from the martingale residuals Patrick Breheny Survival Data Analysis (BIOS 7210) 16/42

Comments Residuals are very useful and can be used for many of the usual purposes that we use residuals for in other models (identifying outliers, choosing a functional form for the covariate, etc.) However, the primary drawback to the martingale residual is its clear asymmetry (its upper bound is 1, but it has no lower bound) For this reason, I ll hold off on these plots until we discuss a more symmetric, normally distributed residual Patrick Breheny Survival Data Analysis (BIOS 7210) 17/42

: Motivation A technique for creating symmetric, normalized residuals that is widely used in generalized linear modeling is to construct a deviance residual The idea behind the deviance residual is to examine the difference between the log-likelihood for subject i under a given model and the maximum possible log-likelihood for that subject: 2( l i l i ), in a sense, constructing a miniature likelihood ratio test for individual i Patrick Breheny Survival Data Analysis (BIOS 7210) 18/42

: Definition As it is essentially a likelihood ratio test, the quantity on the previous slide should approximately follow a χ 2 1 distribution To turn it into a quantity that approximately follows a normal distribution, we can use ˆd i = sign( ˆm i ) 2( l i l i ); this is known as the deviance residual I am leaving the details of deriving l i and working out a simple expression for the deviance residual as homework In R: residuals(fit, type= deviance ) Patrick Breheny Survival Data Analysis (BIOS 7210) 19/42

for the PBC data The deviance residuals are much more symmetric: 0 2 4 6 8 10 12 2 1 0 1 2 Time Deviance residual Censored Died / Liver failure Patrick Breheny Survival Data Analysis (BIOS 7210) 20/42

Outliers Residuals have several uses that we will now illustrate One is identifying outliers For the PBC data, there are no extreme outliers; the largest residuals are only 2.5 SDs away from zero Note that the skewness of the martingale residual makes one subject look like an extreme outlier; according to the deviance residuals, however, that subject is only the 5th largest outlier by absolute value Patrick Breheny Survival Data Analysis (BIOS 7210) 21/42

Outliers (cont d) The five largest negative residuals: time status trt stage hepato bili ˆd 6.95 1 1 4 1 14.40-2.44 12.19 0 2 4 1 2.10-2.18 12.38 0 2 4 1 1.80-2.14 11.09 0 1 4 1 1.30-2.09 11.49 0 2 4 1 1.20-2.05 And the five largest positive residuals: time status trt stage hepato bili ˆd 0.38 1 1 3 0 2.40 2.64 0.54 1 1 3 0 1.10 2.54 0.30 1 2 4 1 2.50 2.51 0.14 1 2 4 0 12.60 2.49 0.11 1 1 4 0 17.90 2.43 Patrick Breheny Survival Data Analysis (BIOS 7210) 22/42

Residuals vs. covariates The other main use of residuals is to plot them against covariates to assess the relationship between a covariate and unexplained variation This can be done using covariates that are already in the model as well as new covariates that one is considering adding to the model; we will consider examples of both Patrick Breheny Survival Data Analysis (BIOS 7210) 23/42

Residuals vs. albumin Albumin is a protein synthesized by the liver and often used as a marker of liver disease: 2.0 2.5 3.0 3.5 4.0 4.5 2 1 0 1 2 Albumin Deviance residual Patrick Breheny Survival Data Analysis (BIOS 7210) 24/42

Assessing functional forms Albumin Bilirubin Stage As the previous figure shows, deviance residuals are helpful not only for checking whether new variables should be added to a model, but also for assessing whether the relationship between the predictor and the (log) hazard is linear In the case of albumin, the exploratory plot, as well as biological insight (the normal range of serum albumin is 3.4-5.4 g/dl) suggest a piecewise linear model with a change point (sometimes called a knot ) at 3.4 Patrick Breheny Survival Data Analysis (BIOS 7210) 25/42

Implementation details Albumin Bilirubin Stage This can be implemented in various ways; here is one: f <- function(x) {pmin(x, 3.5)} fit <- coxph(s ~... + f(albumin), PBC) For nonlinear functional forms, it is typically helpful to plot the modeled relationship between a covariate and the outcome Again, this can be done in various ways, but the visreg package is useful here: visreg(fit, 'albumin') Patrick Breheny Survival Data Analysis (BIOS 7210) 26/42

Albumin Bilirubin Stage Changepoint model for albumin: Illustration The dots here are the partial deviance residuals, η i + ˆd i 2.0 2.5 3.0 3.5 4.0 4.5 2 1 0 1 2 3 Albumin Linear predictor Patrick Breheny Survival Data Analysis (BIOS 7210) 27/42

Albumin summary Albumin Bilirubin Stage The AIC values for four possible models: No albumin: 1352.3 Linear: 1334.9 Changepoint, no effect in normal range: 1332.2 Changepoint, linear effect in normal: 1333.8 The changepoint model that assumes a flat line after 3.5 is the reasonably clear choice based on AIC, as well as being a very reasonable model from a biological perspective Patrick Breheny Survival Data Analysis (BIOS 7210) 28/42

Albumin Bilirubin Stage Residuals for bilirubin: In and out of the model 0 5 10 15 20 25 2 1 0 1 2 Bilirubin Deviance residual without bili 0 5 10 15 20 25 2 1 0 1 2 Bilirubin Deviance residual with bili Patrick Breheny Survival Data Analysis (BIOS 7210) 29/42

Albumin Bilirubin Stage log(bilirubin) The residual plots on the previous page suggest log(bili) as a functional form: 0 5 10 15 20 25 2 1 0 1 2 3 4 Bilirubin Linear predictor Patrick Breheny Survival Data Analysis (BIOS 7210) 30/42

More on logs Residuals Albumin Bilirubin Stage One advantage of the log scale as a functional form is that it has a convenient interpretation in proportional hazards models Consider comparing two individuals, one with double the bilirubin concentration of the other, but all other covariates equal; the hazard ratio comparing the two is: HR = 2 β j For log(bili), β j = 0.895; thus, doubling the bilirubin concentration increases the hazard by 86% (2 0.895 = 1.86) Patrick Breheny Survival Data Analysis (BIOS 7210) 31/42

Splines Residuals Albumin Bilirubin Stage Splines are also an attractive option for incorporating nonlinear functional forms The basic idea of splines is similar to our albumin model from earlier: fit a piecewise polynomial model, with restrictions that make the resulting functional form smooth and continuous The details of this are very interesting, but unfortunately we don t have time to get into them in this course Still, it is worth knowing that the survival package has a nice utility built in for fitting Cox models with spline terms: fit <- coxph(s ~... + pspline(bili), PBC) Patrick Breheny Survival Data Analysis (BIOS 7210) 32/42

Albumin Bilirubin Stage Spline illustration 0 5 10 15 20 25 2 1 0 1 2 3 4 Bilirubin Linear predictor Patrick Breheny Survival Data Analysis (BIOS 7210) 33/42

Bilirubin: Summary Albumin Bilirubin Stage The AIC values for four possible models: No bilirubin: 1383.9 Linear: 1332.2 Log(bilirubin) 1301.0 Splines: 1319.8 The log model seems to be the clear choice here Patrick Breheny Survival Data Analysis (BIOS 7210) 34/42

Albumin Bilirubin Stage Stage Before moving on, let s quickly revisit the effect of stage: 4 3 2 1 0 1 2 Stage Linear predictor 1 2 3 4 Patrick Breheny Survival Data Analysis (BIOS 7210) 35/42

Remarks Residuals Albumin Bilirubin Stage The effect of stage seems to be very linear among stages 2, 3, and 4 Stage 1 patients, however, seem to be at substantially lower risk Still, it is difficult to tell from the data alone whether this phenomenon is real, since we have few stage 1 patients in the study (just 16 out of 312 patients), which explains why AIC and the likelihood ratio test prefer the simpler model Patrick Breheny Survival Data Analysis (BIOS 7210) 36/42

Influence Residuals One final issue on the topic of regression diagnostics is the assessment of influence An influential measurement is one that has a large effect on the model fit; this can be measured in a variety of ways, both absolute and relative A variety of residuals (score residuals, Schoenfeld residuals, delta-beta residuals) have been proposed as ways of quantifying and assessing influence; we ll focus on delta-beta residuals Patrick Breheny Survival Data Analysis (BIOS 7210) 37/42

Delta-beta plots The idea behind delta-beta residuals is very simple: let β (i) j denote the estimate of β j obtained if we leave subject i out of the model The delta-beta residual for coefficient j, subject i is therefore defined as ˆ ij = β j β (i) j This might seem computer intensive, but there are various computational tricks that allow one to fairly quickly refit models leaving individual observations out In R: resid(fit, type= dfbeta ) returns a matrix of delta-beta residuals Patrick Breheny Survival Data Analysis (BIOS 7210) 38/42

Treatment Residuals Treatment: 1 2 0.04 0.02 ^ij 0.00 0.02 0 50 100 150 200 250 300 Index (ordered by time on study) Patrick Breheny Survival Data Analysis (BIOS 7210) 39/42

Stage Residuals Stage: 1 2 3 4 0.03 0.02 0.01 ^ij 0.00 0.01 0.02 0.03 0 50 100 150 200 250 300 Index (ordered by time on study) Patrick Breheny Survival Data Analysis (BIOS 7210) 40/42

Bilirubin Residuals High bili Low bili 0.01 0.00 0.01 ^ij 0.02 0.03 0.04 0 50 100 150 200 250 300 Index (ordered by time on study) Patrick Breheny Survival Data Analysis (BIOS 7210) 41/42

Remarks Residuals Delta-beta plots are always interesting to look at and offer a great deal of insight into the inner mechanics of complicated models Furthermore, they can indicate whether the estimation of a coefficient is dominated by just a few individuals, which would be clear cause for concern What action to take in the presence of influential observations is often a complicated decision; my advice, however, is to strongly prefer modifying the model to fit the data as opposed to manipulating the data to fit the model (i.e., by removing outliers) Patrick Breheny Survival Data Analysis (BIOS 7210) 42/42