SAMPLE SIZE ESTIMATION FOR SURVIVAL OUTCOMES IN CLUSTER-RANDOMIZED STUDIES WITH SMALL CLUSTER SIZES BIOMETRICS (JUNE 2000)

Similar documents
Lecture 5 Models and methods for recurrent event data

GOODNESS-OF-FIT TESTS FOR ARCHIMEDEAN COPULA MODELS

Estimating Causal Effects of Organ Transplantation Treatment Regimes

Multivariate Survival Data With Censoring.

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation

Model Selection in Bayesian Survival Analysis for a Multi-country Cluster Randomized Trial

Two-stage Adaptive Randomization for Delayed Response in Clinical Trials

Statistical Methods for Alzheimer s Disease Studies

Frailty Models and Copulas: Similarities and Differences

Quantile Regression for Residual Life and Empirical Likelihood

Semiparametric Regression

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates

Power and Sample Size Calculations with the Additive Hazards Model

Survival Analysis Math 434 Fall 2011

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data

Hypothesis Testing Based on the Maximum of Two Statistics from Weighted and Unweighted Estimating Equations

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data

Likelihood Construction, Inference for Parametric Survival Distributions

Instrumental variables estimation in the Cox Proportional Hazard regression model

Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials

Accurate Sample Size Calculations in Trials with Non-Proportional Hazards

Practical considerations for survival models

STAT331. Cox s Proportional Hazards Model

Frailty Modeling for clustered survival data: a simulation study

Sample size and robust marginal methods for cluster-randomized trials with censored event times

Censoring mechanisms

Lecture 22 Survival Analysis: An Introduction

Tests of independence for censored bivariate failure time data

Group Sequential Tests for Delayed Responses. Christopher Jennison. Lisa Hampson. Workshop on Special Topics on Sequential Methodology

On consistency of Kendall s tau under censoring

Let us use the term failure time to indicate the time of the event of interest in either a survival analysis or reliability analysis.

Fundamentals of Reliability Engineering and Applications

Pairwise dependence diagnostics for clustered failure-time data

Multistate models and recurrent event models

Estimating Bivariate Survival Function by Volterra Estimator Using Dynamic Programming Techniques

Survival Analysis. Stat 526. April 13, 2018

A joint modeling approach for multivariate survival data with random length

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data

Sample Size and Power Considerations for Longitudinal Studies

Multistate Modeling and Applications

Extensions of Cox Model for Non-Proportional Hazards Purpose

Analysis of competing risks data and simulation of data following predened subdistribution hazards

TMA 4275 Lifetime Analysis June 2004 Solution

A Generalized Global Rank Test for Multiple, Possibly Censored, Outcomes

Typical Survival Data Arising From a Clinical Trial. Censoring. The Survivor Function. Mathematical Definitions Introduction

Multistate models and recurrent event models

Lecture 3. Truncation, length-bias and prevalence sampling

Ph.D. course: Regression models. Introduction. 19 April 2012

University of California, Berkeley

A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky

Part IV Extensions: Competing Risks Endpoints and Non-Parametric AUC(t) Estimation

NONPARAMETRIC ADJUSTMENT FOR MEASUREMENT ERROR IN TIME TO EVENT DATA: APPLICATION TO RISK PREDICTION MODELS

BIOS 312: Precision of Statistical Inference

ADVANCED STATISTICAL ANALYSIS OF EPIDEMIOLOGICAL STUDIES. Cox s regression analysis Time dependent explanatory variables

Harvard University. Harvard University Biostatistics Working Paper Series

Multivariate Survival Analysis

Lecture 4 - Survival Models

Joint Modeling of Longitudinal Item Response Data and Survival

Proportional hazards model for matched failure time data

Ph.D. course: Regression models. Regression models. Explanatory variables. Example 1.1: Body mass index and vitamin D status

Step-Stress Models and Associated Inference

Part III Measures of Classification Accuracy for the Prediction of Survival Times

Analysis of Time-to-Event Data: Chapter 4 - Parametric regression models

GOV 2001/ 1002/ E-2001 Section 10 1 Duration II and Matching

Semiparametric maximum likelihood estimation in normal transformation models for bivariate survival data

Multi-state Models: An Overview

Published online: 10 Apr 2012.

Introduction to Empirical Processes and Semiparametric Inference Lecture 01: Introduction and Overview

BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY

Implementing Response-Adaptive Randomization in Multi-Armed Survival Trials

A TWO-STAGE LINEAR MIXED-EFFECTS/COX MODEL FOR LONGITUDINAL DATA WITH MEASUREMENT ERROR AND SURVIVAL

Unit 10: Planning Life Tests

Statistics in medicine

PENALIZED LIKELIHOOD PARAMETER ESTIMATION FOR ADDITIVE HAZARD MODELS WITH INTERVAL CENSORED DATA

Survival Prediction Under Dependent Censoring: A Copula-based Approach

Harvard University. Harvard University Biostatistics Working Paper Series. Year 2016 Paper 208

Survival Regression Models

Part [1.0] Measures of Classification Accuracy for the Prediction of Survival Times

Conditional Copula Models for Right-Censored Clustered Event Time Data

Survival Analysis I (CHL5209H)

Introduction to Statistical Analysis

and Comparison with NPMLE

A Measure of Association for Bivariate Frailty Distributions

GROUPED SURVIVAL DATA. Florida State University and Medical College of Wisconsin

3003 Cure. F. P. Treasure

Exercises. (a) Prove that m(t) =

Pairwise rank based likelihood for estimating the relationship between two homogeneous populations and their mixture proportion

Investigation of goodness-of-fit test statistic distributions by random censored samples

Support Vector Hazard Regression (SVHR) for Predicting Survival Outcomes. Donglin Zeng, Department of Biostatistics, University of North Carolina

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis

Analysing Survival Endpoints in Randomized Clinical Trials using Generalized Pairwise Comparisons

Longitudinal + Reliability = Joint Modeling

Harvard University. The use of permutation tests for the analysis of parallel and stepped-wedge cluster randomized trials. Rui Wang Victor DeGruttola

Analysis of Survival Data Using Cox Model (Continuous Type)

Constrained estimation for binary and survival data

Regularization in Cox Frailty Models

4. Comparison of Two (K) Samples

Transcription:

SAMPLE SIZE ESTIMATION FOR SURVIVAL OUTCOMES IN CLUSTER-RANDOMIZED STUDIES WITH SMALL CLUSTER SIZES BIOMETRICS (JUNE 2000) AMITA K. MANATUNGA THE ROLLINS SCHOOL OF PUBLIC HEALTH OF EMORY UNIVERSITY SHANDE CHEN RUSH-PRESBYTEIAN-ST. LUKE S MEDICAL CENTER PRESENTATION BY EVALYN VAERA BREIKŠS DUKE UNIVERSITY 24 MARCH 2017 CURRENT PROBLEMS IN BIOSTATISTICS (BIOS 900)

OUTLINE Review of Survival Data Power Calculation Differences Non-CRT Sample Size Estimation Methods How CRT Changes Things Discussion of Simulation Studies

WHAT IS SURVIVAL DATA? Interested in Time-to-Event outcomes rather than typical measurements We observe Y ik = min T ik, C ik and δ ik, where T ik is the survival time, C ik the censoring time, and δ ik the failure indicator of the ith individual in treatment arm k, a 1 indicating T ik C ik and a 0 otherwise. Censoring is when an individual s survival time is not observed due to leaving the study for other reasons, including the conclusion of the study. Generally fit Cox-Proportional Hazards models using the Kaplan-Meier Product-Limit estimator to estimate the survivorship curve, that is, modeling the proportion of individuals that have not yet experienced a failure event as a function of time and other covariates.

HOW POWER CALCULATIONS IN SURVIVORSHIP STUDIES DIFFER Power is a function of number of events rather than of number of sampled individuals. Our treatment effect of interest is the difference in rates of survival, that is, the hazard ratios, rather than some directly measureable difference between groups using means. Necessary sample sizes are estimated by making assumptions of the event rates.

CURRENT METHODS FOR SAMPLE SIZE ESTIMATION Required number of events: Events = z 2 α/2+z β, where π π 1 π 2 log HR 2 i is the proportion of events in treatment arm i, HR is the assumed Hazard Ratio between the two groups, and the z s are from the standard normal distribution. Need an event rate, P(event) Work with bivariate marginal distributions P event = 1 (π 1 S 1 T + π 2 S 2 T ) S i (T) is the survivorship function (1-CDF), often presumed to be exponential Often historical or pilot data aids in estimating the parameter(s) of S i (T), or just assume some

HOW DOES CRT CHANGE THINGS? Now have clusters, naïve method involves averaging their sizes Observe Y ijk = min T ijk, C ijk and δ ijk, where T ijk is the survival time, C ijk the censoring time, and δ ijk the failure indicator of the jth individual in cluster i in treatment arm k, a 1 indicating T ijk C ijk and a 0 otherwise. Generalize to the Clayton-Oakes model S t 1, t 2 = S(t 1 ) 1 θ + S(t 2 ) 1 θ 1 1/(θ 1) θ is the measure of association between T 1 and T 2. θ = 1 means T s independent, θ means they approach maximal (+) dependence

HOW DOES CRT CHANGE THINGS? (CON T) Asymptotic normality assumption, λ s are the hazard ratios for each treatment arm Where Λ k = U k(λ k ) Γ k λ k 2, U k λ k = E n k λ k λ k N(0, Λ k ) 1 λ k i j δ ijk i j y ijk 2, Γk λk = 1 λ k 2 E i j δ ijk. Much algebra later, in the framework of a hypothesis test, we arrive at N log λ 1 λ 2 = z α γ 2 U 2 λ 2 + γ 1 U 1 (λ 1 ) γ 1 λ 1 Γ 1 λ 1 + γ 2 λ 2 Γ 2 λ 2 2 1 + 1 1 + z γ 1 γ β 2 2 λ Λ 2 + 1 2 2 λ Λ 1 1 Solve for N or z β as desired. Where N = n 1 + n 2, γ 1 = n 1 /N, γ 2 = n 2 /N.

SIMULATION STUDY DISCUSSION Compare their expected power of 90% to empirical power in simulation Numerically solved for N, then used a uniform distribution to simulate censorship and compute empirical power to detect imposed clinical difference in hazard ratios Authors used a Weibull distribution for S(T), varying shape parameter to obtain different event rates, more events = more power Overall their method is more conservative, especially with smaller cluster sizes and lower risk ratios (that is, smaller clinical differences) They chalked it up to non-normality conformity in small samples

REFERENCES Manatunga, Amita K., and Shande Chen. "Sample Size Estimation for Survival Outcomes in Cluster Randomized Studies with Small Cluster Sizes." Biometrics 56.2 (2000): 616-621. Weaver, Mark A., PhD. "Sample Size Calculations for Survival Analysis." Family Health International. India, Goa. Web. 24 Mar. 2017. <http://www.icssc.org/documents/advbiosgoa/tab%2026.00_survss.pdf>.