SSUI: Presentation Hints 2 My Perspective Software Examples Reliability Areas that need work

Similar documents
Lecture 12. Multivariate Survival Data Statistics Survival Analysis. Presented March 8, 2016

Multistate models and recurrent event models

Multivariate Survival Analysis

Multistate models and recurrent event models

Time-dependent coefficients

The coxvc_1-1-1 package

Survival Regression Models

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data

Modelling geoadditive survival data

\bad" values of!. For the moment assume that, avector of tuning parameters, is known and constant. Following Gray [9], leti be the usual Cox model inf

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data

β j = coefficient of x j in the model; β = ( β1, β2,

Lecture 8 Stat D. Gillen

frailtyem: An R Package for Estimating Semiparametric Shared Frailty Models

Applied Survival Analysis Lab 10: Analysis of multiple failures

Lecture 7 Time-dependent Covariates in Cox Regression

Multivariable Fractional Polynomials

Chapter 4 Regression Models

Survival analysis in R

Generalized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science.

Package crrsc. R topics documented: February 19, 2015

Regularization in Cox Frailty Models

Mantel-Haenszel Test Statistics. for Correlated Binary Data. Department of Statistics, North Carolina State University. Raleigh, NC

Multivariable Fractional Polynomials

Frailty Modeling for clustered survival data: a simulation study

Survival analysis in R

Reduced-rank hazard regression

Relative-risk regression and model diagnostics. 16 November, 2015

CTDL-Positive Stable Frailty Model

Contrasting Marginal and Mixed Effects Models Recall: two approaches to handling dependence in Generalized Linear Models:

Subject-specific observed profiles of log(fev1) vs age First 50 subjects in Six Cities Study

Multi-state Models: An Overview

TGDR: An Introduction

Philosophy and Features of the mstate package

A Handbook of Statistical Analyses Using R 2nd Edition. Brian S. Everitt and Torsten Hothorn

Longitudinal Modeling with Logistic Regression

Cox regression: Estimation

Statistics 203: Introduction to Regression and Analysis of Variance Course review

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis


Modelling and Analysis of Recurrent Event Data

A fast routine for fitting Cox models with time varying effects

On a connection between the Bradley-Terry model and the Cox proportional hazards model

Validation. Terry M Therneau. Dec 2015

Bayesian linear regression

REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520

Two-level lognormal frailty model and competing risks model with missing cause of failure

Semiparametric Regression

Statistical Methods III Statistics 212. Problem Set 2 - Answer Key

4. Comparison of Two (K) Samples

Fahrmeir: Discrete failure time models

Lecture 3 Linear random intercept models

Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Mela. P.

Exam Applied Statistical Regression. Good Luck!

Goodness-Of-Fit for Cox s Regression Model. Extensions of Cox s Regression Model. Survival Analysis Fall 2004, Copenhagen

More Accurately Analyze Complex Relationships

Inversion Base Height. Daggot Pressure Gradient Visibility (miles)

* * * * * * * * * * * * * * * ** * **

Lecture 7. Proportional Hazards Model - Handling Ties and Survival Estimation Statistics Survival Analysis. Presented February 4, 2016

A Regression Model For Recurrent Events With Distribution Free Correlation Structure

Package penmsm. R topics documented: February 20, Type Package

Outline. Linear OLS Models vs: Linear Marginal Models Linear Conditional Models. Random Intercepts Random Intercepts & Slopes

Residuals and model diagnostics

Package JointModel. R topics documented: June 9, Title Semiparametric Joint Models for Longitudinal and Counting Processes Version 1.

Software for GEE: PROC GENMOD and SUDAAN Babubhai V. Shah, Research Triangle Institute, Research Triangle Park, NC

STAT 526 Advanced Statistical Methodology

Part [1.0] Measures of Classification Accuracy for the Prediction of Survival Times

Lecture 9. Statistics Survival Analysis. Presented February 23, Dan Gillen Department of Statistics University of California, Irvine

Analysis of competing risks data and simulation of data following predened subdistribution hazards

STAT 526 Spring Final Exam. Thursday May 5, 2011

Simple Regression Model Setup Estimation Inference Prediction. Model Diagnostic. Multiple Regression. Model Setup and Estimation.

Finite Population Sampling and Inference

Bayesian variable selection via. Penalized credible regions. Brian Reich, NCSU. Joint work with. Howard Bondell and Ander Wilson

Analysis of Longitudinal Data. Patrick J. Heagerty PhD Department of Biostatistics University of Washington

Models for Multivariate Panel Count Data

Inference Methods for the Conditional Logistic Regression Model with Longitudinal Data Arising from Animal Habitat Selection Studies

Amendments and Errata for Cook and Lawless (2007)

dqd: A command for treatment effect estimation under alternative assumptions

UNIVERSITY of PENNSYLVANIA CIS 520: Machine Learning Final, Fall 2014

Part 7: Hierarchical Modeling

On a connection between the Bradley Terry model and the Cox proportional hazards model

Theorems. Least squares regression

Extensions of Cox Model for Non-Proportional Hazards Purpose

Package CoxRidge. February 27, 2015

SAS Analysis Examples Replication C8. * SAS Analysis Examples Replication for ASDA 2nd Edition * Berglund April 2017 * Chapter 8 ;

Analysis of Time-to-Event Data: Chapter 6 - Regression diagnostics

Sampling bias in logistic models

Petr Volf. Model for Difference of Two Series of Poisson-like Count Data

Extensions of Cox Model for Non-Proportional Hazards Purpose

Midterm exam CS 189/289, Fall 2015

Consistent high-dimensional Bayesian variable selection via penalized credible regions

Rene Tabanera y Palacios 4. Danish Epidemiology Science Center. Novo Nordisk A/S Gentofte. September 1, 1995

Model Selection in Bayesian Survival Analysis for a Multi-country Cluster Randomized Trial

Generalized Boosted Models: A guide to the gbm package

ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables

Multilevel Statistical Models: 3 rd edition, 2003 Contents

Practical considerations for survival models

A Modern Look at Classical Multivariate Techniques

The Proportional Hazard Model and the Modelling of Recurrent Failure Data: Analysis of a Disconnector Population in Sweden. Sweden

Lecture 2: Poisson and logistic regression

Transcription:

SSUI: Presentation Hints 1 Comparing Marginal and Random Eects (Frailty) Models Terry M. Therneau Mayo Clinic April 1998

SSUI: Presentation Hints 2 My Perspective Software Examples Reliability Areas that need work

SSUI: Presentation Hints 3 Software Maturity Exists but you wouldn't want it Worked application(s) in a paper software shared with friends General program/ Splus function { available from author and/or Statlib { advertised Available in a package In SAS

SSUI: Presentation Hints 4 The models attempt to deal with correlated data Multiple events per subject Multiple event types per subject Correlated family members Correlation due to sampling plan.

SSUI: Presentation Hints 5 Hueristic rationale: Marginal models Consider a weighted linear model, either the case of subject weights or a fully known variance matrix. If we t the model ignoring the weights ^ usually changes very little Var( ^) may be badly incorrect

SSUI: Presentation Hints 6 Marginal Cox models There are three steps Set up the data set appropriately. Fit the data as though it were independent observations. Fix up the variance, post t, using a grouped jackknife. { Compute the dfbeta residual for each observation { Add these up to get the D j: = eect of the jth group { V = D 0 D { (Identical to the working-independence method of GEE)

SSUI: Presentation Hints 7 Frailty Models A good overview of problems and motivation is given in Aalen, \Heterogeneity in survival analysis", Statistics in Medicine, 1988. Assume that subject i from family j has an inherent, and unmeasured risk (frailty)! j, such that i (t) = 0 (t)! j e X i = 0 (t)e X i+z i where Z ij = 1 i subject i is a member of family j. Further assume The subjects are conditionally independent given. is a random variable from a known distribution.

then a solution can be computed.

SSUI: Presentation Hints 8 Issues Dierent models available Distribution of the frailty { gamma { log-normal { positive stable { inverse Gaussian { :::Does it make any dierence? Pattern of random eects { 1 per subject or family { Nested! block diagonal variance for { Crossed Each combination has it's own, unique variant of the EM algorithm.

SSUI: Presentation Hints 9 Penalized models PPL = Penalized Partial Likelihood = PL - penalty Formally, let the Cox model hazard be i (t) = 0 (t)e X i+z i where = unpenalized eects = penalized eects So PPL = PL( data) ; p( ) A simple example is \ridge regression" with p( ) = X 2 j :

SSUI: Presentation Hints 10 Solution If the ancillary parameter(s) is known The solution of the Cox model for ( ) is simple. { Straightforward addition to a standard Cox program. { Need p, p 0 and p 00 Well-recognized variance formulas. { H ;1, where H ij = @ 2 PLL=@ i @ j { H ;1 IH ;1, where I ij = @ 2 PL=@ i @ j Denitions for degrees of freedom. Connections to other techniques such as GCV, AIC. An array of numerical techniques available.

SSUI: Presentation Hints 11 Implementation This has been added to the S-Plus coxph function. A given method requires dene the penalty p, and its derivatives p 0 and p 00 dene the logic for determining setup These can be user-dened functions. The gamma frailty (Nielson et al 1992) and log-normal frailty (McGilchrist 93) map exactly onto this structure. See ftp://ftp.mayo.edu/pub/therneau

SSUI: Presentation Hints 12 Rat data Female rats from Mantel et. al. 3 rats/litter, one treated + two placebo 50 litters Marginal analysis The events are unordered: the time scale for each observation starts at zero and goes to event/censoring. The event types (death of rat A, death of rat B) are identical: only one stratum. The data set will have 150 observations, identical to an \iid" data set. Variables time, status, rx and litter. > coxph(surv(time, status) rx + cluster(litter), data=rats) coef exp(coef) se(coef) robust se z p rx 0.898 2.46 0.317 0.3 2.99 0.0028 Likelihood ratio test=7.87 on 1 df, p=0.00503 n= 150

SSUI: Presentation Hints 13 Gamma frailty > coxph(surv(time, status) rx + gfrail(litter), rats) coef se(coef) se2 Chisq DF p rx 0.906 0.323.319 7.88 1.0 0.005 Iterations: 6 outer, 19 Newton-Raphson Variance of random effect= 0.474 Degrees of freedom for terms= 1.0 13.8 Likelihood ratio test=9.4 on 14.82 df, p=0.847 n= 150 Gaussian (normal) frailty > coxph(surv(time, status) rx + nfrail(litter), rats) coef se(coef) se2 Chisq DF p rx 0.904 0.322 0.318 7.89 1.0 0.005 Iterations: 5 outer, 14 Newton-Raphson Variance of random effect= 0.39 Degrees of freedom for terms= 1.0 11.4 Likelihood ratio test=34 on 12.39 df, p=0.000864 n= 150

SSUI: Presentation Hints 14 Colon cancer data 929 patients { 315 Observation { 310 Levamisole { 304 Levamisole + 5FU Time to death and progression for each subject { 423 No events { 92 One event { 414 Two events Up to 9 years of follow-up

SSUI: Presentation Hints 15 Marginal t > coxph(surv(time, status) rx + extent + node4 + cluster(id) + strata(etype), data=colon) coef exp(coef) se(coef) robust se z p rxlev -0.0362 0.964 0.0768 0.1056-0.343 7.3e-01 rxlev+5fu -0.4488 0.638 0.0840 0.1168-3.842 1.2e-04 extent 0.5155 1.674 0.0796 0.1097 4.701 2.6e-06 node4 0.8799 2.411 0.0681 0.0961 9.160 0.0e+00 Gamma Gamma Normal WLW = 1 Frailty Frailty Lev vs Obs -0.04-0.04 0.04-0.03 Lev/5FU vs Obs -0.45-0.57-0.51-0.79 Extent 0.52 0.81 1.34 1.13 Nodes > 4 0.88 1.48 2.33 2.12 2 1 ; 8.06 6.95 iterations 4 1/5 10/76 8/103 `df' 4 513 1377 931

SSUI: Presentation Hints 16 Hidden Covariate Example True hazard for a subject is (t) = exp(x 1 ; x 2 ). X 1 is uniform(-2,2) and unknown. X 2 is the 0/1 treatment variable, known. All observations have the same total follow-up. Multiple sequential events per subject. We purposely have chosen that the hidden variate have a larger eect than treatment. The true coecients are +1 and -1. 250 replications, 100 subjects per arm (606 events) Number of events 0 1-2 3-5 6-10 Control 11 21 34 34 Treatment 29 46 26 1

SSUI: Presentation Hints 17 Fit two marginal models: Events are sequential, so time is (0, rst event], (rst, second],... A subject is not at risk for a second event until they have had their rst. Conditional model: First event, second event, etc are each in a separate strata. (Events may change the subject). AG model: All observations in one strata. (Poisson process). fit1 <- coxph(surv(start, stop, status) rx + strata(enum) + cluster( fit2 <- coxph(surv(start, stop, status) rx + cluster(id)) fit3 <- coxph(surv(start, stop, status) rx + strata(enum) + gfrail(i

SSUI: Presentation Hints 18 Summary Seems to work. The precision of the random eect is poor. The connection to degrees of freedom is unclear. Less exible than marginal models. Not yet reliable.

SSUI: Presentation Hints 19 Open Issues Code { Beta version available on ftp://ftp.mayo.edu/pub/therneau { Will be a part of Splus proper eventually. Variance { Which estimate is better? { Should we bootstrap it? Frailty { What other distributions can be accomodated? { Crossed or nested random eects. { How well do the more general, extensible rules work? AIC, BIC, PRESS, CIC, :::