Two-stage Adaptive Randomization for Delayed Response in Clinical Trials

Similar documents
The Design of a Survival Study

Group Sequential Tests for Delayed Responses. Christopher Jennison. Lisa Hampson. Workshop on Special Topics on Sequential Methodology

Part III. Hypothesis Testing. III.1. Log-rank Test for Right-censored Failure Time Data

A simulation study for comparing testing statistics in response-adaptive randomization

Monitoring clinical trial outcomes with delayed response: incorporating pipeline data in group sequential designs. Christopher Jennison

Sample size re-estimation in clinical trials. Dealing with those unknowns. Chris Jennison. University of Kyoto, January 2018

Sample Size and Power I: Binary Outcomes. James Ware, PhD Harvard School of Public Health Boston, MA

Group Sequential Tests for Delayed Responses

Survival Analysis: Weeks 2-3. Lu Tian and Richard Olshen Stanford University

Bayesian Enhancement Two-Stage Design for Single-Arm Phase II Clinical. Trials with Binary and Time-to-Event Endpoints

Hypothesis Test. The opposite of the null hypothesis, called an alternative hypothesis, becomes

Bios 6649: Clinical Trials - Statistical Design and Monitoring

Sequential Monitoring of Clinical Trials Session 4 - Bayesian Evaluation of Group Sequential Designs

Group Sequential Designs: Theory, Computation and Optimisation

TMA 4275 Lifetime Analysis June 2004 Solution

Statistics and Probability Letters. Using randomization tests to preserve type I error with response adaptive and covariate adaptive randomization

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH

Power and Sample Size Calculations with the Additive Hazards Model

Adaptive Designs: Why, How and When?

Adaptive Prediction of Event Times in Clinical Trials

Play-the-winner rule in clinical trials: Models for adaptative designs and Bayesian methods

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates

Pubh 8482: Sequential Analysis

Uniformly Most Powerful Bayesian Tests and Standards for Statistical Evidence

Implementing Response-Adaptive Randomization in Multi-Armed Survival Trials

Statistical Methods for Alzheimer s Disease Studies

TUTORIAL 8 SOLUTIONS #

Likelihood Construction, Inference for Parametric Survival Distributions

Statistical Inference

SPRING 2007 EXAM C SOLUTIONS

6 Sample Size Calculations

STAT 135 Lab 5 Bootstrapping and Hypothesis Testing

The University of Hong Kong Department of Statistics and Actuarial Science STAT2802 Statistical Models Tutorial Solutions Solutions to Problems 71-80

Detection theory. H 0 : x[n] = w[n]

Type I error rate control in adaptive designs for confirmatory clinical trials with treatment selection at interim

Distribution Fitting (Censored Data)

Adaptive designs to maximize power. trials with multiple treatments

Individualized Treatment Effects with Censored Data via Nonparametric Accelerated Failure Time Models

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3

Investigation of goodness-of-fit test statistic distributions by random censored samples

Survival Analysis. Lu Tian and Richard Olshen Stanford University

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates

Interim Monitoring of Clinical Trials: Decision Theory, Dynamic Programming. and Optimal Stopping

Statistics 135 Fall 2008 Final Exam

SOLUTION FOR HOMEWORK 8, STAT 4372

Review. December 4 th, Review

Hypothesis testing: theory and methods

(1) Introduction to Bayesian statistics

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data

Sample Size Determination

Harvard University. Harvard University Biostatistics Working Paper Series

Introductory Econometrics

BIOS 312: Precision of Statistical Inference

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

Response Adaptive Randomization using Surrogate and Primary Endpoints

Optimising Group Sequential Designs. Decision Theory, Dynamic Programming. and Optimal Stopping

A Bayesian Nonparametric Approach to Causal Inference for Semi-competing risks

Practice Exam 1. (A) (B) (C) (D) (E) You are given the following data on loss sizes:

Sequential Procedure for Testing Hypothesis about Mean of Latent Gaussian Process

Statistical Inference on Constant Stress Accelerated Life Tests Under Generalized Gamma Lifetime Distributions

University of California, Berkeley

Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials

Exam 2 Practice Questions, 18.05, Spring 2014

Beyond GLM and likelihood

Clinical Trial Design Using A Stopped Negative Binomial Distribution. Michelle DeVeaux and Michael J. Kane and Daniel Zelterman

Robust estimates of state occupancy and transition probabilities for Non-Markov multi-state models

Institute of Actuaries of India

RESPONSE ADAPTIVE CLINICAL TRIALS WITH CENSORED LIFETIMES

Technical Manual. 1 Introduction. 1.1 Version. 1.2 Developer

Adaptive designs beyond p-value combination methods. Ekkehard Glimm, Novartis Pharma EAST user group meeting Basel, 31 May 2013

Summary of Chapters 7-9

arxiv: v1 [stat.me] 24 Jul 2013

Mixtures of multiple testing procedures for gatekeeping applications in clinical trials

SAMPLE SIZE ESTIMATION FOR SURVIVAL OUTCOMES IN CLUSTER-RANDOMIZED STUDIES WITH SMALL CLUSTER SIZES BIOMETRICS (JUNE 2000)

Step-Stress Models and Associated Inference

DIFFERENT APPROACHES TO STATISTICAL INFERENCE: HYPOTHESIS TESTING VERSUS BAYESIAN ANALYSIS

Typical Survival Data Arising From a Clinical Trial. Censoring. The Survivor Function. Mathematical Definitions Introduction

Reliability Engineering I

Exam C Solutions Spring 2005

Semiparametric Regression

A Sequential Bayesian Approach with Applications to Circadian Rhythm Microarray Gene Expression Data

EECS564 Estimation, Filtering, and Detection Exam 2 Week of April 20, 2015

18.05 Practice Final Exam

Topic 19 Extensions on the Likelihood Ratio

Goodness-of-fit tests for randomly censored Weibull distributions with estimated parameters

4. Comparison of Two (K) Samples

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

Estimation MLE-Pandemic data MLE-Financial crisis data Evaluating estimators. Estimation. September 24, STAT 151 Class 6 Slide 1

ORF 245 Fundamentals of Statistics Chapter 9 Hypothesis Testing

the unification of statistics its uses in practice and its role in Objective Bayesian Analysis:

CIMAT Taller de Modelos de Capture y Recaptura Known Fate Survival Analysis

Logistic regression model for survival time analysis using time-varying coefficients

Bayes Factor Single Arm Time-to-event User s Guide (Version 1.0.0)

Introduction to Statistical Analysis

Central Limit Theorem ( 5.3)

Parameters Estimation for a Linear Exponential Distribution Based on Grouped Data

4 Testing Hypotheses. 4.1 Tests in the regression setting. 4.2 Non-parametric testing of survival between groups

Pubh 8482: Sequential Analysis

Monitoring clinical trial outcomes with delayed response: Incorporating pipeline data in group sequential and adaptive designs. Christopher Jennison

Political Science 236 Hypothesis Testing: Review and Bootstrapping

Transcription:

Two-stage Adaptive Randomization for Delayed Response in Clinical Trials Guosheng Yin Department of Statistics and Actuarial Science The University of Hong Kong Joint work with J. Xu PSI and RSS Journal Club May 21, 2015 1 / 29

Equal/Adaptive Randomization In a clinical trial with multiple treatments, the goal is to identify the superior treatment quickly, as well as treating patients effectively. Equal randomization (ER) is a simple and efficient way for patient allocation. Response-based adaptive randomization (AR) tends to assign more patients to better treatments based on the information accumulated in the trial. Prior to the implementation of AR, a prerun of ER is typically used to stabilize the parameter estimates. However, it is not clear how large the prerun sample size should be, and it is often chosen arbitrarily in practice. 2 / 29

Adaptive Randomization Pioneering work can be traced back to Thompson (1933), Robbins (1952), and Feldman (1962) etc. Play-the-winner rule (Zelen, 1969): Continue using the same treatment if a success response is observed; otherwise switch to the other treatment. Randomized play-the-winner rule (Wei and Durham, 1978): A higher randomization probability is given to the treatment that has produced a success response. Bandit problems and Bayesian adaptive randomization (Berry and Eick, 1995). 3 / 29

Play-the-Winner Rule S S S F S F S S F F Sequence of enrollment A A A A B B A B A B A A B A B A B A B A Figure 1: Play-the-winner rule and urn model with treatments A and B. 4 / 29

Optimal Allocation in AR (Binary Data) We can calculate the optimal allocation ratio by minimizing the variance (equivalently, maximizing power), or by minimizing the expected number of nonresponders in a trial. Let p 1 and p 2 denote the response rates of treatments 1 and 2. By minimizing the variance of the difference between ˆp 1 and ˆp 2, the allocation ratio between arm 1 and arm 2 is p1 (1 p 1 ) p2 (1 p 2 ), which is known as Neyman s allocation. By minimizing the number of nonresponders while fixing the variance (Rosenberger et al., 2001), the allocation ratio is p1 p2. 5 / 29

Continuous Data For continuous data, let µ 1 and µ 2 denote the means of two normal distributions, and let σ1 2 and σ2 2 denote the corresponding variances. Neyman s allocation ratio is which minimizes the variance. σ 1 σ 2, For the case where a smaller response is preferred, Zhang and Rosenberger (2005) proposed an optimal allocation ratio of σ 1 µ2 σ 2 µ1, by minimizing the total expected response from all patients. 6 / 29

Bayesian Adaptive Randomization In the Bayesian approach, we may naturally assign patients to treatment 1 with a probability of λ = Pr(p 1 > p 2 y 1, y 2 ), where y 1 and y 2 represent the accumulated data in the two arms (Yin, 2012). By comparing the posterior distributions of p 1 and p 2, it automatically accounts for both the point and variance estimates of the treatment response rates. 7 / 29

Bayesian Estimates (Early vs. Late Stages) Posterior distribution 0 1 2 3 4 5 6 Earlier stage Posterior distribution 0 1 2 3 4 5 6 Later stage 0.0 0.4 0.8 0.0 0.4 0.8 Response rate Response rate Figure 2: Posterior distributions of the response rates at the earlier and later stages of a trial. 8 / 29

Power Transformation We can explore a class of randomization probabilities, π(λ, γ) = λ γ λ γ + (1 λ) γ. If γ = 0, the randomization scheme reduces to ER with an equal assigning probability of 0.5 regardless of the value of λ; and if γ = 1, π(λ, γ) = λ. It may depend on the accumulating sample size n, γ n = where N is the total sample size. n 2N, 9 / 29

Delayed Response with τ = 6a Cohort 2 Cohort 1 0 a 2a 3a 4a 5a 6a Time Assessment period τ Figure 3: By the time a new cohort is ready for treatment, some of the patients in the trial may be partially followed and their efficacy outcomes have not yet been observed. 10 / 29

Parametric Survival Model Zhang and Rosenberger (2007) developed an optimal allocation scheme under the assumption of parametric survival models. Let T denote the survival time; under an exponential model the survival function of T is given by ) S j (t) = exp( λ j t) = exp ( tθj, j = 1, 2, where λ j is the constant hazard rate for treatment arm j, and θ j = 1/λ j is the mean survival time. Let 1i and 2i be the censoring indicators in group 1 and group 2, respectively. Denote δ 1 = E( 1i ) and δ 2 = E( 2i ). 11 / 29

Time-to-Event Endpoint Consider the hypothesis test The variance of ˆθ 1 ˆθ 2 is H 0 : θ 1 = θ 2 versus H 1 : θ 1 θ 2. Var(ˆθ 1 ˆθ 2 ) = θ2 1 n 1 δ 1 + θ2 2 n 2 δ 2. Zhang and Rosenberger (2007) obtained the optimal allocation ratio by minimizing the total expected hazard n 1 θ1 1 + n 2 θ2 1, subject to fixing the variance as a constant, θ1 3 r θ = δ 2. θ2 3δ 1 12 / 29

Allocation Ratio with Survival Function If the patient response is a good event, then the sooner patients experience the event, the better. We minimize the total number of patients who have not responded within the assessment window (0, τ). We derive the optimal allocation ratio by minimizing n 1 S 1 (τ, λ 1 ) + n 2 S 2 (τ, λ 2 ) subject to fixing Var{S 1 (τ, ˆλ 1 ) S 2 (τ, ˆλ 2 )} = K. The optimal allocation ratio is r S = λ 1 δ2 exp( λ 1 τ) λ 2 δ1 exp( λ 2 τ). When the sample size is large and both p 1 and p 2 are small, r S reduces to that of the binary case, i.e., r S p 1 / p 2. 13 / 29

Two-stage Response-Adaptive Randomization We consider a two-arm clinical trial with binary endpoints, Y 1i Bernoulli(p 1 ), i = 1,..., n 1, Y 2i Bernoulli(p 2 ), i = 1,..., n 2. The null and alternative hypotheses are formulated as H 0 : p 1 = p 2 versus H 1 : p 1 p 2. The trial starts with ER, and continuously makes decisions on when to switch to AR as more data are collected. 14 / 29

Likelihood Ratio Test With m patients in each arm, the likelihood ratio test statistic is max H p m i=1 (y 1i +y 2i ) (1 p) m 0:p 1=p 2=p T m = 2log max 1 (1 p 1 ) m m i=1 (1 y 1i p ) i=1 y 2i 2 (1 p 2 ) m m i=1 p y 1i p 1,p 2 i=1 (2 y 1i y 2i ) i=1 (1 y 2i ) Under the null hypothesis, the likelihood ratio test statistic follows a chi-squared distribution with one degree of freedom, i.e., T m χ 2 (1).. We can compute ˆT m by plugging in the MLEs of p 1 and p 2, and the rejection region is defined as ˆT m > χ 2 (1)(1 α). 15 / 29

Role of α As a threshold level for switching from ER to AR, α should be greater than the trial s type I error rate α. If the treatment difference is large, n E would be small so that the trial moves to AR quickly; and if the treatment difference is small, n E would be large as ER and AR are not much different so that it would take a longer time before switching to AR. By controlling α, the two-stage design can automatically adapt to the true difference between p 1 and p 2. In contrast, if we fix the sample size n E in the ER stage, it would not be adjustable to the treatment difference. 16 / 29

Two-stage Procedure In stage 1, the trial begins with equal randomization, and continuously updates the likelihood ratio test statistic after enrolling every new patient. If ˆT m < χ 2 (1)(1 α), equal randomization remains; otherwise, the trial proceeds to stage 2. In stage 2, we start to implement response-adaptive randomization for each patient based on an optimal allocation ratio, e.g., using p 1 / p 2 as the allocation ratio to minimize the number of nonresponders. 17 / 29

Nonparametric Fractional Model for Delayed Response The missing or censoring of response poses immense difficulties when applying response-adaptive randomization during the trial conduct. If we view the efficacy endpoint as an event of interest, we can model the time to efficacy using the Kaplan-Meier estimator, and fractionize the censored observations based on patients exposure times in the trial. If a drug-related efficacy event occurs, it is expected to occur within the observation window [0, τ]. { 0 if the subject does not respond within [0, τ], Y = 1 if the subject responded within [0, τ]. 18 / 29

Fractional Contribution for Censored Data Let T 1i denote the time to efficacy, and let u 1i (u 1i τ) denote the actual follow-up time for subject i in arm 1. The patient s response is censored if he/she has not responded (u 1i < T 1i ) and also has not been fully followed up to τ (u 1i < τ). If we observe a censored event before τ, i.e., efficacy has not occurred yet, we can obtain a fraction of 1 as the contribution of the censored observation to the response probability. Censoring Failure 0 u i t i τ 19 / 29

Redistribution to the Right (Kaplan Meier Estimator) If subject i is censored by the decision-making time u 1i, we take the fractional contribution as Pr(T 1i < τ T 1i > u 1i ) = Pr(u 1i < T 1i < τ). Pr(T 1i > u 1i ) A fractional contribution for a censored observation is ŷ 1i = Ŝ1(u 1i ) Ŝ 1 (τ), Ŝ 1 (u 1i ) where Ŝ 1 ( ) is the Kaplan Meier estimator for arm 1. The estimated response rate is ˆp 1 = n 1 i=1 r 1i/n 1, where 0 if patient i does not respond, r 1i = 1 if patient i has responded, ŷ 1i if the response of patient i is censored. 20 / 29

Simulations Our simulation studies considered a two-arm trial with binary outcomes for investigating the operating characteristics of the proposed two-stage fractional AR design. The assessment period for efficacy was τ = 12 weeks. The accrual time interval between two consecutive cohorts was a = 1 week, i.e., every week a new cohort (4 patients) would enter the trial. The sample size was calculated based on the type I error and type II error rates, α = 0.1 and β = 0.2 for a two-sided test. We fixed the threshold level for ER/AR switching α = 0.3. For each configuration, we replicated 10,000 trials. 21 / 29

Weibull Distributions 0.6 Arm 1: T Weibull (γ=5, λ=0.0729) Arm 2: T Weibull (γ=1, λ=0.0186) 0.5 Response probability 0.4 0.3 0.2 0.1 0 0 2 4 6 8 10 12 14 Time (weeks) Figure 4: Weibull CDFs with the response probability at time τ being 0.4 for arm 1 and 0.2 for arm 2. The response probability of arm 2 is clearly higher than that of arm 1 before week 10. 22 / 29

Comparison of Three Methods Complete-data AR follows each subject till the occurrence of response or the end of the assessment period prior to randomizing each new patient. Fractional AR utilizes the scheme of redistribution to the right for censored data, so that each patient would be immediately randomized upon arrival. Observed-data AR is based on the observed efficacy data only, while treating censored patients (who have not responded or have been fully followed yet) as nonresponders. 23 / 29

Comparison of AR Designs Table 1: Comparison of the two-stage observed-data, complete-data, and fractional AR designs with p 2 = 0.2 and n = 132. Two-stage Allocation Allocation Number of Rejection Trial ER p 1 design arm 1 (%) S.D. responders rate (%) duration n E 0.2 Observed 47.4 0.06 26.5 10.1 52.7 56.9 Complete 50.0 0.06 26.4 10.5 362.1 53.9 Fractional 50.3 0.07 26.4 9.8 52.7 45.2 0.4 Observed 53.5 0.06 40.5 81.1 53.4 44.5 Complete 57.8 0.06 41.6 80.5 370.8 27.5 Fractional 57.5 0.07 41.6 80.9 53.4 30.1 0.6 Observed 57.4 0.06 56.7 99.9 53.6 30.8 Complete 62.5 0.06 59.3 99.9 372.7 16.1 Fractional 61.8 0.06 59.0 99.9 53.6 22.1 0.8 Observed 60.7 0.06 74.4 100.0 53.5 22.4 Complete 65.7 0.05 78.4 100.0 370.0 12.5 Fractional 64.8 0.06 77.7 100.0 53.5 17.7 24 / 29

Simulation Results For p 1 = 0.2, the two treatments have the same response rate, all three designs maintained the type I error rate at α = 0.1. Since a much higher response rate in arm 2 was observed at the beginning of the follow-up, the observed-data AR design falsely assigned more patients to arm 2. For p 1 = 0.4, it corresponds to the alternative hypothesis, which thus has the targeting power of 80% under all three designs. The fractional and the complete-data designs yielded similar allocation ratios, while both are better than the observed-data design. 25 / 29

Simulation Results As the difference between the two response rates increases, the sample size of ER becomes smaller because fewer patients are needed to detect a larger difference. For p 1 = 0.8, fractional AR increased the number of responders by more than three patients over the observed-data design. Comparing the duration of the trial between the proposed fractional design and the complete-data design, the trial time was dramatically reduced from 370 weeks to 53 weeks. 26 / 29

Discussion The two-stage fractional design addresses two practical issues for response-adaptive randomization: (a) the number of patients in the ER stage is not clearly defined, (b) patient response cannot be observed quickly enough for real-time AR. In the new design, unobserved efficacy outcomes are naturally treated as censored data, and their fractional point masses are calculated to help making decisions on treatment assignment. The nonparametric fractional design is robust and easy to implement, as it only uses the Kaplan Meier estimator. The likelihood ratio test with α is only used for deciding when to switch from ER to AR. 27 / 29

Main References Xu, J. and Yin, G. (2014). Two-stage adaptive randomization for delayed response in clinical trials. Journal of Royal Statistical Society C Applied Statistics, 63, 559 578. Yin, G. (2012). Clinical Trial Design: Bayesian and Frequentist Adaptive Methods, John Wiley & Sons (Wiley Series in Probability and Statistics). Berry, D. A. and Eick, S. G. (1995). Adaptive assignment versus balanced randomization in clinical trials: A decision analysis. Statistics in Medicine, 14, 231 246. Efron, B. (1967). The two sample problem with censored data. In Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, volume 4, pages 831 853. University of California Press, Berkeley. Rosenberger, W. F., Stallard, N., Ivanova, A., Harper, C. N. and Ricks, M. L. (2001). Optimal adaptive designs for binary response trials. Biometrics, 57, 909 913. Zelen, M. (1969). Play the winner rule and the controlled clinical trial. Journal of the American Statistical Association, 64, 131 146. 28 / 29

Questions? 29 / 29