Frailty Modeling for clustered survival data: a simulation study

Similar documents
Multivariate Survival Analysis

CTDL-Positive Stable Frailty Model

BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY

Frailty Modeling for Spatially Correlated Survival Data, with Application to Infant Mortality in Minnesota By: Sudipto Banerjee, Mela. P.

A Regression Model For Recurrent Events With Distribution Free Correlation Structure

Analysing geoadditive regression data: a mixed model approach

Model Selection in Bayesian Survival Analysis for a Multi-country Cluster Randomized Trial

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis

A general mixed model approach for spatio-temporal regression data

Joint Modeling of Longitudinal Item Response Data and Survival

Survival Analysis Math 434 Fall 2011

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Lecture 5 Models and methods for recurrent event data

Analysis of Time-to-Event Data: Chapter 4 - Parametric regression models

Modelling geoadditive survival data

You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?

Frailty Models and Copulas: Similarities and Differences

FRAILTY MODELS FOR MODELLING HETEROGENEITY

Lecture 12. Multivariate Survival Data Statistics Survival Analysis. Presented March 8, 2016

TMA 4275 Lifetime Analysis June 2004 Solution

Lecture 6 PREDICTING SURVIVAL UNDER THE PH MODEL

Survival Analysis. Stat 526. April 13, 2018

REGRESSION ANALYSIS FOR TIME-TO-EVENT DATA THE PROPORTIONAL HAZARDS (COX) MODEL ST520

Integrated likelihoods in survival models for highlystratified

Now consider the case where E(Y) = µ = Xβ and V (Y) = σ 2 G, where G is diagonal, but unknown.

Regularization in Cox Frailty Models

MAS3301 / MAS8311 Biostatistics Part II: Survival

Models for Multivariate Panel Count Data

Duration Analysis. Joan Llull

Proportional hazards model for matched failure time data

3003 Cure. F. P. Treasure

Mixture modelling of recurrent event times with long-term survivors: Analysis of Hutterite birth intervals. John W. Mac McDonald & Alessandro Rosina

Power and Sample Size Calculations with the Additive Hazards Model

Lecture 7 Time-dependent Covariates in Cox Regression

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates

Introduction to Statistical Analysis

The Weibull Distribution

Research & Reviews: Journal of Statistics and Mathematical Sciences

Statistical Inference and Methods

Instrumental variables estimation in the Cox Proportional Hazard regression model

Multistate models and recurrent event models

Censoring mechanisms

Dependence structures with applications to actuarial science

Continuous Time Survival in Latent Variable Models

STAT 6350 Analysis of Lifetime Data. Failure-time Regression Analysis

Practical considerations for survival models

Creating New Distributions

Lecture 9. Statistics Survival Analysis. Presented February 23, Dan Gillen Department of Statistics University of California, Irvine

Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials

Introduction to repairable systems STK4400 Spring 2011

UNIVERSITY OF CALIFORNIA, SAN DIEGO

Multistate models and recurrent event models

FAILURE-TIME WITH DELAYED ONSET

Cox s proportional hazards/regression model - model assessment

ASYMPTOTIC PROPERTIES AND EMPIRICAL EVALUATION OF THE NPMLE IN THE PROPORTIONAL HAZARDS MIXED-EFFECTS MODEL

Exercises. (a) Prove that m(t) =

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data

Lecture 22 Survival Analysis: An Introduction

Lecture 3. Truncation, length-bias and prevalence sampling

Unobserved Heterogeneity

Analysis of competing risks data and simulation of data following predened subdistribution hazards

Faculty of Health Sciences. Regression models. Counts, Poisson regression, Lene Theil Skovgaard. Dept. of Biostatistics

Survival Analysis I (CHL5209H)

SSUI: Presentation Hints 2 My Perspective Software Examples Reliability Areas that need work

Institute of Actuaries of India

Analysis of Time-to-Event Data: Chapter 6 - Regression diagnostics

Time-dependent covariates

Multistate Modeling and Applications

University of California, Berkeley

Package SimSCRPiecewise

Multivariate Survival Data With Censoring.

Stock Sampling with Interval-Censored Elapsed Duration: A Monte Carlo Analysis

Nonparametric Model Construction

Practice Exam 1. (A) (B) (C) (D) (E) You are given the following data on loss sizes:

GOV 2001/ 1002/ E-2001 Section 10 1 Duration II and Matching

THESIS for the degree of MASTER OF SCIENCE. Modelling and Data Analysis

Dynamic Models Part 1

Modeling Purchases as Repeated Events

7.1 The Hazard and Survival Functions

Inference on the Univariate Frailty Model: An Approach Bayesian Reference Analysis

Statistics in medicine

Survival Analysis for Case-Cohort Studies

SAMPLE SIZE ESTIMATION FOR SURVIVAL OUTCOMES IN CLUSTER-RANDOMIZED STUDIES WITH SMALL CLUSTER SIZES BIOMETRICS (JUNE 2000)

Estimation in Generalized Linear Models with Heterogeneous Random Effects. Woncheol Jang Johan Lim. May 19, 2004

Multivariate spatial modeling

Frailty Probit model for multivariate and clustered interval-censor

Semiparametric Mixed Effects Models with Flexible Random Effects Distribution

Dynamic Prediction of Disease Progression Using Longitudinal Biomarker Data

Hakone Seminar Recent Developments in Statistics

Two-level lognormal frailty model and competing risks model with missing cause of failure

Part III Measures of Classification Accuracy for the Prediction of Survival Times

Probability Distributions Columns (a) through (d)

A Joint Model with Marginal Interpretation for Longitudinal Continuous and Time-to-event Outcomes

IE 303 Discrete-Event Simulation

Extensions of Cox Model for Non-Proportional Hazards Purpose

Modelling the risk process

Parameter Estimation

Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data

Transcription:

Frailty Modeling for clustered survival data: a simulation study IAA Oslo 2015 Souad ROMDHANE LaREMFiQ - IHEC University of Sousse (Tunisia) souad_romdhane@yahoo.fr Lotfi BELKACEM LaREMFiQ - IHEC University of Sousse (Tunisia) Monday, June 8, 2015

Introduction Motivation The current regulatory context of Solvency II > partial internal model > the construction of experience table as an alternative of the national life table. Risks within an insured portfolio are heterogeneous > difference in mortality rates and life expectancy between : Men / women Smokers / Non-smokers Chronically-diseases dependant (diabetes, obesity.. ) / Insurers with good health (etc.) (Souad ROMDHANE - LaREMFiQ) Simulation Study 2 / 18

Introduction Motivation First approach (+) Modeling the behavior of each specific group in an independent way. ( ) Problems of insufficient data, problems of choice of optimal segmentation and raise the risk estimation. (Souad ROMDHANE - LaREMFiQ) Simulation Study 3 / 18

Introduction Motivation First approach (+) Modeling the behavior of each specific group in an independent way. ( ) Problems of insufficient data, problems of choice of optimal segmentation and raise the risk estimation. Second approach (+) Turning to models integrating the observable factors of heterogeneity starting from explanatory variables : (Souad ROMDHANE - LaREMFiQ) Simulation Study 3 / 18

Introduction Motivation First approach (+) Modeling the behavior of each specific group in an independent way. ( ) Problems of insufficient data, problems of choice of optimal segmentation and raise the risk estimation. Second approach (+) Turning to models integrating the observable factors of heterogeneity starting from explanatory variables : "Proportional Hazards Cox model" (Souad ROMDHANE - LaREMFiQ) Simulation Study 3 / 18

Introduction Motivation First approach (+) Modeling the behavior of each specific group in an independent way. ( ) Problems of insufficient data, problems of choice of optimal segmentation and raise the risk estimation. Second approach (+) Turning to models integrating the observable factors of heterogeneity starting from explanatory variables : "Proportional Hazards Cox model" ( )Unobservable risk factors have been disregarded. In such case of human life expectancy studies : No matter how many covariates we want to add, it will never be the complete one. (Souad ROMDHANE - LaREMFiQ) Simulation Study 3 / 18

Introduction Motivation First approach (+) Modeling the behavior of each specific group in an independent way. ( ) Problems of insufficient data, problems of choice of optimal segmentation and raise the risk estimation. Second approach (+) Turning to models integrating the observable factors of heterogeneity starting from explanatory variables : "Proportional Hazards Cox model" ( )Unobservable risk factors have been disregarded. In such case of human life expectancy studies : No matter how many covariates we want to add, it will never be the complete one. Third approach (+) Integrating the unobservable factors of heterogeneity :"Frailty models" (Souad ROMDHANE - LaREMFiQ) Simulation Study 3 / 18

Introduction Objective We are interested in the models of the 3 rd approach which integrates observable and non observable factors of heterogeneity as an explanatory variables. Investigate, by means of simulation studies in a comprehensive way, the particular situations where this shared frailty model becomes applicable and preferable to classical Cox proportional hazards model and when it becomes inaccurate to use such model. Explore in more details the link between fixed factors estimates and some parameters combinations such as percent censoring, group size, number of groups, magnitude of the variance parameters associated with the distribution of the random effects etc. Compare several frequentist estimation approaches of frailty models and studying their limitations for different parameters combinations (e.g EM, PPL, REML, marginal likelihood approach, MCEM). (Souad ROMDHANE - LaREMFiQ) Simulation Study 4 / 18

Theoretical consideration Survival Analysis Survival analysis is concerned with studying the time between entry to a study (birth date) and a subsequent event (death date). A Cox model is a statistical technique for exploring the relationship between the survival of an insured and several explanotory variable. A significant feature : censored survival times : The period of observation was cut off before the event of interest occurred. (Souad ROMDHANE - LaREMFiQ) Simulation Study 5 / 18

Theoretical consideration Survival Analysis Survival analysis is concerned with studying the time between entry to a study (birth date) and a subsequent event (death date). A Cox model is a statistical technique for exploring the relationship between the survival of an insured and several explanotory variable. A significant feature : censored survival times : The period of observation was cut off before the event of interest occurred. Proportional Hazards assumption : This means that the hazard functions for any two individuals at any point in time are proportional and time independent. (Souad ROMDHANE - LaREMFiQ) Simulation Study 5 / 18

Theoretical consideration Survival Analysis Survival analysis is concerned with studying the time between entry to a study (birth date) and a subsequent event (death date). A Cox model is a statistical technique for exploring the relationship between the survival of an insured and several explanotory variable. A significant feature : censored survival times : The period of observation was cut off before the event of interest occurred. Proportional Hazards assumption : This means that the hazard functions for any two individuals at any point in time are proportional and time independent. Unknown baseline hazard function. (Souad ROMDHANE - LaREMFiQ) Simulation Study 5 / 18

Theoretical consideration Survival models : Cox model Heterogeneity only due to observable risk factors λ(t i β) = λ0 (t) exp ( x t i β) Assumption : The hazard function for each individual is proportional to the basline hazard λ 0 (t). This assumption implies that the hazard function is fully determined by the covariate vector. Problem : There may be unobserved covariates that cause this assumption to be violated. (Souad ROMDHANE - LaREMFiQ) Simulation Study 6 / 18

Theoretical consideration Survival models :Shared Frailty model Heterogeneity due to observable and unobservable risk factors. βi λ(t ij, b i ) = λ 0 (t ij ) w ij exp ( x t ij β ) i = λ 0 (t ij ) exp ( x t ij β i + z t ij b i ) δij = 1 {Tij D ij } i.e δ ij equal to 1 if T ij = Y ij, and 0 if T ij = C ij. W ij = e zt ij b i with b i N(0, θ) If b i = 0 i.e. W ij = 1, Shared frailty model ==> Cox s proportional hazards model. (Souad ROMDHANE - LaREMFiQ) Simulation Study 7 / 18

Simulation Study Generation survival times Bender(2005) where : T = H 1 0 log(u) w ij exp(β t i x ij) U Uni[0, 1], H 1 0 is the inverse of a cumulative baseline hazard function. H0 can take Exponential, Weibull or Gompertz distribution. (Souad ROMDHANE - LaREMFiQ) Simulation Study 8 / 18

Simulation Study Data simulation Each simulated dataset is composed of (x = 1000) i.i.d. replications of the quartet (T ij, δ ij, X ij, Z ij ) : where : Tij = min(y ij, C ij ) with Y ij is the failure-time variable corresponding to individual j from cluster i. C ij is non-informative right-censoring time, independent of Y ij. δij is the censoring indicator (The indicator of failure) with δ ij equal to 1 if T ij = Y ij, and 0 if T ij = C ij. Xij is a vector of covariates associated with the fixed-effect parameters. Zij is a vector of covariates associated with the random-effect parameter b i. (Souad ROMDHANE - LaREMFiQ) Simulation Study 9 / 18

Simulation Study Data simulation Figure: Mechanism of the simulation study for different datasets combination. (Souad ROMDHANE - LaREMFiQ) Simulation Study 10 / 18

Simulation Study Data simulation Cluster size :20 clusters of 60 observations - 60 clusters of 20 observations. Fixed censoring effects : 0%, 30%, 60% and 90% Variance of frailty : Classical Cox model (θ = 0) 2 Shared frailty models with various levels of heterogeneity (θ = 0.1, θ = 0.5, θ = 1). 3 Covariates : Covariates Distribution regression coefficients X 1 Binomial B[n, p = 0.5] B 1 = 1 X 2 Normal N[0, 1] B 2 = 1 X 3 Uniform U[0, 2] B 3 = 0.3 Table: Simulation with an example of 3 covariates (Souad ROMDHANE - LaREMFiQ) Simulation Study 11 / 18

Simulation Study Estimation methods To compare the performance of different inference procedures for shared frailty : Fit1 : method PPL Fit2 : method PPL Fit3 : method EM Fit4 : method REML Fit5 : method PPL Fit6 : method MCEM (Souad ROMDHANE - LaREMFiQ) Simulation Study 12 / 18

Illustration Results : σ 2 (θ) = 0 Censoring : 30%(left colun), 90%(right colun)

Illustration Results : Gamma frailty with θ = 0.5 (left colun) log-normal frailty with θ = 0.5 (right column)

Illustration Results : lognormal frailty with θ = 0.1 (1st line),θ = 0.5 (2nd line) ; θ = 1 (3rd line) Censoring : 30%(left colun), 90%(right colun)

Simulation Study Conclusions The empirical variability of the parameter estimates, by means of EM, REML, PPL or MCEM methods, was in general similar for both gamma and lognormal frailty survival data. A misspecification of the distribution of the frailties has no significant impact on the estimates of β. The bias increases with increasing the censoring rates the variance of random effects θ, if we do not take it into account through an appropriate frailty model. for survival data without frailty effect (θ = 0), Cox s model provides better results in terms of bias and MAE than the estimates of any frailty approaches. for further increase in heterogeneity effects, the bias in the estimates obtained from Cox s model increases. However, those from the frailty model decrease and converge towards true parameter. (Souad ROMDHANE - LaREMFiQ) Simulation Study 16 / 18

Simulation Study Conclusions when survival data includes a some level of heterogeneity whatever its distribution, the frailty model approach to estimate β coefficients performs better than the standard Cox s proportional hazard model. > Cox approach has lost its usefulness when heterogeneity parameter increases more than 0.1. > Cox model is still better than any frailty models when the heterogeneity parameter is close to zero. for all models, the bias for the fixed-effect estimates seems to reduce with the increase in cluster size and for the random-effect estimates with the increasing of the number of clusters. (Souad ROMDHANE - LaREMFiQ) Simulation Study 17 / 18

Thank You