Group sequential designs for Clinical Trials with multiple treatment arms

Similar documents
Type I error rate control in adaptive designs for confirmatory clinical trials with treatment selection at interim

Testing a secondary endpoint after a group sequential test. Chris Jennison. 9th Annual Adaptive Designs in Clinical Trials

Adaptive Dunnett Tests for Treatment Selection

Adaptive, graph based multiple testing procedures and a uniform improvement of Bonferroni type tests.

clinical trials Abstract Multi-arm multi-stage (MAMS) trials can improve the efficiency of the drug

Power assessment in group sequential design with multiple biomarker subgroups for multiplicity problem

Adaptive designs beyond p-value combination methods. Ekkehard Glimm, Novartis Pharma EAST user group meeting Basel, 31 May 2013

Superchain Procedures in Clinical Trials. George Kordzakhia FDA, CDER, Office of Biostatistics Alex Dmitrienko Quintiles Innovation

Designing multi-arm multi-stage clinical trials using a risk-benefit criterion for treatment selection

The Design of Group Sequential Clinical Trials that Test Multiple Endpoints

Adaptive Treatment Selection with Survival Endpoints

A Gatekeeping Test on a Primary and a Secondary Endpoint in a Group Sequential Design with Multiple Interim Looks

Adaptive clinical trials with subgroup selection

Estimation in Flexible Adaptive Designs

Multiple Testing in Group Sequential Clinical Trials

Adaptive Extensions of a Two-Stage Group Sequential Procedure for Testing a Primary and a Secondary Endpoint (II): Sample Size Re-estimation

Mixtures of multiple testing procedures for gatekeeping applications in clinical trials

OPTIMAL, TWO STAGE, ADAPTIVE ENRICHMENT DESIGNS FOR RANDOMIZED TRIALS USING SPARSE LINEAR PROGRAMMING

Citation for final published version: Publishers page:

ADAPTIVE SEAMLESS DESIGNS: SELECTION AND PROSPECTIVE TESTING OF HYPOTHESES

Pubh 8482: Sequential Analysis

A Mixture Gatekeeping Procedure Based on the Hommel Test for Clinical Trial Applications

University of California, Berkeley

Adaptive Survival Trials

CHL 5225H Advanced Statistical Methods for Clinical Trials: Multiplicity

On Generalized Fixed Sequence Procedures for Controlling the FWER

Maximum type 1 error rate inflation in multiarmed clinical trials with adaptive interim sample size modifications

Two-Phase, Three-Stage Adaptive Designs in Clinical Trials

Testing a Primary and a Secondary Endpoint in a Confirmatory Group Sequential Clinical Trial

Group sequential designs with negative binomial data

MULTISTAGE AND MIXTURE PARALLEL GATEKEEPING PROCEDURES IN CLINICAL TRIALS

Finding Critical Values with Prefixed Early. Stopping Boundaries and Controlled Type I. Error for A Two-Stage Adaptive Design

Pubh 8482: Sequential Analysis

The Design of a Survival Study

Sample Size and Power I: Binary Outcomes. James Ware, PhD Harvard School of Public Health Boston, MA

Overrunning in Clinical Trials: a Methodological Review

The Limb-Leaf Design:

Statistical Aspects of Futility Analyses. Kevin J Carroll. nd 2013

Adaptive Designs: Why, How and When?

Pubh 8482: Sequential Analysis

Optimizing trial designs for targeted therapies - A decision theoretic approach comparing sponsor and public health perspectives

SAMPLE SIZE RE-ESTIMATION FOR ADAPTIVE SEQUENTIAL DESIGN IN CLINICAL TRIALS

A note on tree gatekeeping procedures in clinical trials

Group sequential designs for negative binomial outcomes

Sequential/Adaptive Benchmarking

Use of frequentist and Bayesian approaches for extrapolating from adult efficacy data to design and interpret confirmatory trials in children

Optimising Group Sequential Designs. Decision Theory, Dynamic Programming. and Optimal Stopping

New Developments in East

Comparing Adaptive Designs and the. Classical Group Sequential Approach. to Clinical Trial Design

Sample size re-estimation in clinical trials. Dealing with those unknowns. Chris Jennison. University of Kyoto, January 2018

Group-Sequential Tests for One Proportion in a Fleming Design

Optimal group sequential designs for simultaneous testing of superiority and non-inferiority

A Type of Sample Size Planning for Mean Comparison in Clinical Trials

Control of Directional Errors in Fixed Sequence Multiple Testing

Family-wise Error Rate Control in QTL Mapping and Gene Ontology Graphs

METHODS OF EVALUATION OF PERFORMANCE OF ADAPTIVE DESIGNS ON TREATMENT EFFECT INTERVALS AND METHODS OF

An Alpha-Exhaustive Multiple Testing Procedure

Monitoring clinical trial outcomes with delayed response: incorporating pipeline data in group sequential designs. Christopher Jennison

Optimal exact tests for multiple binary endpoints

Repeated confidence intervals for adaptive group sequential trials

ROI ANALYSIS OF PHARMAFMRI DATA:

Approaches to sample size calculation for clinical trials in small populations

Interim Monitoring of Clinical Trials: Decision Theory, Dynamic Programming. and Optimal Stopping


Bonferroni - based gatekeeping procedure with retesting option

Bios 6649: Clinical Trials - Statistical Design and Monitoring

Generalized Likelihood Ratio Statistics and Uncertainty Adjustments in Efficient Adaptive Design of Clinical Trials

Comparison of Different Methods of Sample Size Re-estimation for Therapeutic Equivalence (TE) Studies Protecting the Overall Type 1 Error

An Adaptive Futility Monitoring Method with Time-Varying Conditional Power Boundary

Group Sequential Trial with a Biomarker Subpopulation

A superiority-equivalence approach to one-sided tests on multiple endpoints in clinical trials

Step-down FDR Procedures for Large Numbers of Hypotheses

Pubh 8482: Sequential Analysis

Group Sequential Tests for Delayed Responses. Christopher Jennison. Lisa Hampson. Workshop on Special Topics on Sequential Methodology

Stepwise Gatekeeping Procedures in Clinical Trial Applications

Multiple Endpoints: A Review and New. Developments. Ajit C. Tamhane. (Joint work with Brent R. Logan) Department of IE/MS and Statistics

c Copyright 2014 Navneet R. Hakhu

Designs for Clinical Trials

The SEQDESIGN Procedure

Precision of maximum likelihood estimation in adaptive designs

Analysis of Multiple Endpoints in Clinical Trials

Technical Manual. 1 Introduction. 1.1 Version. 1.2 Developer

An extrapolation framework to specify requirements for drug development in children

Group Sequential Methods. for Clinical Trials. Christopher Jennison, Dept of Mathematical Sciences, University of Bath, UK

UNIFORMLY MOST POWERFUL TESTS FOR SIMULTANEOUSLY DETECTING A TREATMENT EFFECT IN THE OVERALL POPULATION AND AT LEAST ONE SUBPOPULATION

Alpha-recycling for the analyses of primary and secondary endpoints of. clinical trials

Adaptive Enrichment Designs for Randomized Trials with Delayed Endpoints, using Locally Efficient Estimators to Improve Precision

Statistica Sinica Preprint No: SS R1

Exact Inference for Adaptive Group Sequential Designs

Evidence synthesis for a single randomized controlled trial and observational data in small populations

The International Journal of Biostatistics

Clinical Trial Design Using A Stopped Negative Binomial Distribution. Michelle DeVeaux and Michael J. Kane and Daniel Zelterman

Two-Stage Sequential Designs Regulatory Perspective

BLINDED EVALUATIONS OF EFFECT SIZES IN CLINICAL TRIALS: COMPARISONS BETWEEN BAYESIAN AND EM ANALYSES

Approximating mixture distributions using finite numbers of components

arxiv: v2 [stat.me] 6 Sep 2013

Group Sequential Tests for Delayed Responses

Pubh 8482: Sequential Analysis

Group Sequential Designs: Theory, Computation and Optimisation

SAS/STAT 15.1 User s Guide The SEQDESIGN Procedure

Transcription:

Group sequential designs for Clinical Trials with multiple treatment arms Susanne Urach, Martin Posch Cologne, June 26, 2015 This project has received funding from the European Union s Seventh Framework Programme for research, technological development and demonstration under grant agreement number FP HEALTH 2013-603160. ASTERIX Project - http://www.asterix-fp7.eu/ 1 / 20

Objectives of multi-arm multi-stage trials Aim: Comparison of several treatments to a common control Advantages: less patients needed than for separate controlled clinical trials especially important for limited set of patients (rare diseases, children) larger number of patients are randomised to experimental treatments allows changes to be made during the trial using the trial data so far, e.g. stopping for efficacy or futility Objective: Identify all treatments that are superior to control Objective: Identify at least one treatment that is superior to control different kind of stopping rules!! 2 / 20

Multi-arm multi-stage trials Design setup: group sequential Dunnett test control of the FamilyWise Error Rate (FWER) = 0.025 comparison of two treatments to a control normal endpoints, variance known one sided tests: H A : µ A µ C 0 and H B : µ B µ C 0 two stage group sequential trial: one interim analysis at Nmax 2 power to reject at least one hypothesis = 0.8 Z A,i, Z B,i are the cumulative z-statistics at stage i=1,2 3 / 20

Classical group sequential Dunnett tests with separate stopping Classical group sequential Dunnett tests with separate stopping 4 / 20

Classical group sequential Dunnett tests with separate stopping Classical group sequential Dunnett tests Objective: Identify all treatments that are superior to control separate stopping rule : Treatment arms, for which a stopping boundary is crossed, stop. E.g.: H B is rejected at interim A can go on and is tested again at the end Magirr, Jaki, Whitehead (2012) 5 / 20

Classical group sequential Dunnett tests with separate stopping Closed testing - sequentially rejective tests Local group sequential tests for H A H B and H A, H B are needed!!! H A H B group sequential test for H A H B H A H B group sequential test for H A group sequential test for H B A hypothesis is rejected with FWER α if the intersection hypothesis and the corresponding elementary hypothesis are rejected locally at level α. Xi, Tamhane (2015) Maurer, Bretz (2013) 6 / 20

Classical group sequential Dunnett tests with separate stopping Closed testing - sequentially rejective tests H A H B Reject if max(z A,1, Z B,1 ) > u 1 or max(z A,2, Z B,2 ) > u 2 H A H B Reject if Z A,1 > v 1 or Z A,2 > v 2 Reject if Z B,1 > v 1 or Z B,2 > v 2 u 1, u 2...global boundaries v 1, v 2...elementary boundaries Koenig, Brannath, Bretz and Posch (2008) 6 / 20

Group sequential Dunnett tests with simultaneous stopping Group sequential Dunnett tests with simultaneous stopping 7 / 20

Group sequential Dunnett tests with simultaneous stopping Group sequential simultaneous stopping designs simultaneous stopping rule : If at least one rejection boundary is crossed, the whole trial stops. Objective: Identify at least one treatment that is superior to control E.g.: H B is rejected at interim There is no second stage! 8 / 20

Group sequential Dunnett tests with simultaneous stopping Simultaneous versus Separate stopping FWER is controlled using the separate stopping design boundaries. Lower expected sample size compared to separate stopping designs. The power to reject any null hypothesis is the same as for separate stopping designs. both null hypotheses is lower than for separate stopping designs. Trade-off between ESS and conjunctive power!!! 9 / 20

Group sequential Dunnett tests with simultaneous stopping Construction of efficient simultaneous stopping designs 1 Can one relax the interim boundaries when stopping simultaneously? 2 How large is the impact on ESS and power when stopping simultaneously or separately? 3 How to optimize the critical boundaries for either stopping rule? 10 / 20

Question 1: Relaxation of interim boundaries? Question 1: Relaxation of interim boundaries? For simultaneous stopping: The boundaries u 1, u 2 for the local test of H A H B cannot be relaxed. The boundaries v 1, v 2 for the local test of H j can be relaxed. Intuitive explanation If, e.g., H B is rejected at interim, but H A not, H A is no longer tested at the final analysis and not all α is spent. It s possible to choose improved boundaries for the elementary tests. 11 / 20

Question 1: Relaxation of interim boundaries? Example: O Brien Flemming boundaries What changes when stopping simultaneously? H A H B Reject if max(z A,1, Z B,1 ) > u 1 or max(z A,2, Z B,2 ) > u 2 u 1 = 3.14, u 2 = 2.22 H A H B Reject if Z A,1 > v 1 or Z A,2 > v 2 Reject if Z B,1 > v 1 or Z B,2 > v 2 v 1 = 2.80, v 2 = 1.98 v 1 = 2.80, v 2 = 1.98 12 / 20

Question 1: Relaxation of interim boundaries? Example: O Brien Flemming boundaries H A H B Reject if max(z A,1, Z B,1 ) > u 1 or max(z A,2, Z B,2 ) > u 2 u 1 = 3.14, u 2 = 2.22 H A H B Reject if Z A,1 > v 1 or Z A,2 > v 2 Reject if Z B,1 > v 1 or Z B,2 > v 2 v 1 = 2.80, v 2 = 1.98 v 1 = 2.80, v 2 = 1.98 For simultaneous stopping there is no second stage test if one of the null hypotheses can already be rejected at interim. 12 / 20

Question 1: Relaxation of interim boundaries? FWER for simultaneous stopping if only H A holds (δ A = 0) 13 / 20

Question 1: Relaxation of interim boundaries? FWER for simultaneous stopping if only H A holds (δ A = 0) 13 / 20

Question 1: Relaxation of interim boundaries? Example: O Brien Flemming form of rejection boundaries Improved boundary at interim for simultaneous stopping: H A H B Reject if max(z A,1, Z B,1 ) > u 1 or max(z A,2, Z B,2 ) > u 2 u 1 = 3.14, u 2 = 2.22 H A H B Reject if Z A,1 > v 1 or Z A,2 > v 2 Reject if Z B,1 > v 1 or Z B,2 > v 2 v 1 = 2.08, v 2 = 1.98 v 1 = 2.08, v 2 = 1.98 For simultaneous stopping there is no second stage test if one of the null hypotheses can already be rejected at interim. 14 / 20

Question 2: Impact on ESS and power? Question 2: Impact on ESS and power? global boundaries u 1 = 3.14, u 2 = 2.22 local α for test of H A H B α = 0.025 separate simultaneous improved stopping rule stopping rule simultan. local α for test of H j 0.025 0.019 0.025 interim boundary v 1 2.80 2.80 2.08 final boundary v 2 1.98 1.98 1.98 15 / 20

Question 2: Impact on ESS and power? Question 2: Impact on ESS and power? global boundaries u 1 = 3.14, u 2 = 2.22 local α for test of H A H B α = 0.025 separate simultaneous improved stopping rule stopping rule simultan. local α for test of H j 0.025 0.019 0.025 interim boundary v 1 2.80 2.80 2.08 final boundary v 2 1.98 1.98 1.98 disj. power 0.8 0.8 0.8 N 162 162 162 15 / 20

Question 2: Impact on ESS and power? Question 2: Impact on ESS and power? global boundaries u 1 = 3.14, u 2 = 2.22 local α for test of H A H B α = 0.025 separate simultaneous improved stopping rule stopping rule simultan. local α for test of H j 0.025 0.019 0.025 interim boundary v 1 2.80 2.80 2.08 final boundary v 2 1.98 1.98 1.98 disj. power 0.8 0.8 0.8 N for δ A = δ B = 0.5 162 162 162 ESS for δ A = δ B = 0.5 154 149 149 conj. power for δ A = δ B = 0.5 0.59 0.50 0.56 15 / 20

Question 3: Optimizing stopping boundaries Optimized multi-arm multi-stage designs 16 / 20

Question 3: Optimizing stopping boundaries Optimal designs Scenario Separate Simultaneous Improved simult. stopping stopping stopping Boundaries classical group classical group improved group sequential sequential sequential Stopping rule separate simultaneous simultaneous stopping rule stopping rule stopping rule 17 / 20

Question 3: Optimizing stopping boundaries Optimal designs Scenario Separate Simultaneous Improved simult. stopping stopping stopping Boundaries classical group classical group improved group sequential sequential sequential Stopping rule separate simultaneous simultaneous stopping rule stopping rule stopping rule N max chosen to achieve disjunctive power of 0.8 Obj. function minimize ESS under certain optimize u 1, u 2 parameter configuration 17 / 20

Question 3: Optimizing stopping boundaries Optimal designs Scenario Separate Simultaneous Improved simult. stopping stopping stopping Boundaries classical group classical group improved group sequential sequential sequential Stopping rule separate simultaneous simultaneous stopping rule stopping rule stopping rule N max chosen to achieve disjunctive power of 0.8 Obj. function minimize ESS under certain optimize u 1, u 2 parameter configuration Obj. function minimize maximize conjunctive optimize v 1, v 2 ESS power 17 / 20

Question 3: Optimizing stopping boundaries Power to reject both null hypotheses Power to reject at least one hypothesis = 0.8 18 / 20

Question 3: Optimizing stopping boundaries Power to reject both null hypotheses Power to reject at least one hypothesis = 0.8 18 / 20

Question 3: Optimizing stopping boundaries Power to reject both null hypotheses Power to reject at least one hypothesis = 0.8 18 / 20

Question 3: Optimizing stopping boundaries Optimal expected sample size (ESS) Remarks: Percentual reduction gets bigger, but stays between 5 and 12% 19 / 20

Discussion Summary The optimal design depends on the objective: Reject all hypotheses Reject at least one hypothesis Simultaneous stopping compared to separate stopping leads to lower expected sample size the same power to reject any hypothesis lower power to reject both hypotheses Improved boundaries can be used to rescue some of the power to reject both null hypotheses. Limitation: If improved boundaries are used, the simultaneous stopping rule must be adhered to! Extensions: unknown variance: t-test: p-value approach more treatment arms, stopping for futility optimal choice of first stage sample size/allocation ratio 20 / 20

Discussion References Thall et al. (1989): one treatment continues, futility stopping, two stages, power comparisons under LFC Follmann et al. (1994): Pocock and OBF MAMS designs, Dunnett and Tukey generalisations, several stages Stallard & Todd (2003): only one treatment is taken forward, several stages, power comparisons Stallard & Friede (2008): stagewise prespecified number of treatments Magirr, Jaki, Whitehead (2012): FWER of generalised Dunnett Koenig, Brannath, Bretz (2008): closure principle for Dunnett test, adaptive Dunnett test Magirr, Stallard, Jaki (2014): Flexible sequential designs Di Scala & Glimm (2011): Time to event endpoints Wason & Jaki (2012): Optimal MAMS designs Tamhane & Xi (2013): multiple hypotheses and closure principle Maurer & Bretz (2013): Multiple testing using graphical approaches 20 / 20

Appendix FWER inflation when u 1 = z 1 α=1.96 20 / 20

Appendix Difference in expected sample size: OBF design 20 / 20

Appendix Difference in conjunctive power: OBF design 20 / 20

Appendix Unknown variance: Extension to the t test p-value approach = quantile substitution (Pocock (1977)): z-score boundaries are converted to p-value boundaries and then converted to t-score boundaries: u i = T 2ni 2(Φ 1 (u i )) for known variance: sample size per arm per stage n of 8 for a power to reject at least one of 0.8 at δ A = δ B = 1 (separate: ESS=32/power=0.61; improved simultaneous: ESS=30/power=0.51) Simulation of t-statistics for p-value approach (δ A = δ B = 1) Design n α power at least one power both ESS separate 8 0.0260 0.80 0.56 34 separate 10 0.0258 0.89 0.70 43 imp. sim. 8 0.0260 0.79 0.49 32 imp. sim. 10 0.0258 0.88 0.61 39 20 / 20

Appendix Optimal boundaries δ A = 0.5, δ B = 0.5 Design separate simultaneous improved simult. u 1 2.69 2.48 2.48 u 2 2.28 2.37 2.37 v 1 2.18 2.17 2.06 v 2 2.18 2.18 1.96 conj. power 0.54 0.44 0.51 ESS 147 137 137 20 / 20