Testing a Primary and a Secondary Endpoint in a Confirmatory Group Sequential Clinical Trial

Similar documents
Testing a secondary endpoint after a group sequential test. Chris Jennison. 9th Annual Adaptive Designs in Clinical Trials

A Gatekeeping Test on a Primary and a Secondary Endpoint in a Group Sequential Design with Multiple Interim Looks

Adaptive Extensions of a Two-Stage Group Sequential Procedure for Testing a Primary and a Secondary Endpoint (II): Sample Size Re-estimation

The Design of Group Sequential Clinical Trials that Test Multiple Endpoints

Power assessment in group sequential design with multiple biomarker subgroups for multiplicity problem

A note on tree gatekeeping procedures in clinical trials

Superchain Procedures in Clinical Trials. George Kordzakhia FDA, CDER, Office of Biostatistics Alex Dmitrienko Quintiles Innovation

Group Sequential Trial with a Biomarker Subpopulation

MULTISTAGE AND MIXTURE PARALLEL GATEKEEPING PROCEDURES IN CLINICAL TRIALS

SAMPLE SIZE RE-ESTIMATION FOR ADAPTIVE SEQUENTIAL DESIGN IN CLINICAL TRIALS

Group sequential designs for Clinical Trials with multiple treatment arms

Type I error rate control in adaptive designs for confirmatory clinical trials with treatment selection at interim

Group Sequential Tests for Delayed Responses. Christopher Jennison. Lisa Hampson. Workshop on Special Topics on Sequential Methodology

Confidence intervals and point estimates for adaptive group sequential trials

Monitoring clinical trial outcomes with delayed response: incorporating pipeline data in group sequential designs. Christopher Jennison

Exact Inference for Adaptive Group Sequential Designs

Repeated confidence intervals for adaptive group sequential trials

A Mixture Gatekeeping Procedure Based on the Hommel Test for Clinical Trial Applications

Two-Phase, Three-Stage Adaptive Designs in Clinical Trials

Adaptive, graph based multiple testing procedures and a uniform improvement of Bonferroni type tests.

Group sequential designs with negative binomial data

CHL 5225H Advanced Statistical Methods for Clinical Trials: Multiplicity

Adaptive Designs: Why, How and When?

Adaptive Dunnett Tests for Treatment Selection

Mixtures of multiple testing procedures for gatekeeping applications in clinical trials

On Generalized Fixed Sequence Procedures for Controlling the FWER

Adaptive designs beyond p-value combination methods. Ekkehard Glimm, Novartis Pharma EAST user group meeting Basel, 31 May 2013

Multiple Testing in Group Sequential Clinical Trials

Pubh 8482: Sequential Analysis

Group Sequential Tests for Delayed Responses

Division of Pharmacoepidemiology And Pharmacoeconomics Technical Report Series

The SEQDESIGN Procedure

Group Sequential Designs: Theory, Computation and Optimisation

Adaptive clinical trials with subgroup selection

Monitoring clinical trial outcomes with delayed response: Incorporating pipeline data in group sequential and adaptive designs. Christopher Jennison

Overrunning in Clinical Trials: a Methodological Review

Finding Critical Values with Prefixed Early. Stopping Boundaries and Controlled Type I. Error for A Two-Stage Adaptive Design

Estimation in Flexible Adaptive Designs

The Design of a Survival Study

Analysis of Multiple Endpoints in Clinical Trials

Alpha-recycling for the analyses of primary and secondary endpoints of. clinical trials

Sample size re-estimation in clinical trials. Dealing with those unknowns. Chris Jennison. University of Kyoto, January 2018

Adaptive Treatment Selection with Survival Endpoints

SAS/STAT 15.1 User s Guide The SEQDESIGN Procedure

On weighted Hochberg procedures

Statistica Sinica Preprint No: SS R1

clinical trials Abstract Multi-arm multi-stage (MAMS) trials can improve the efficiency of the drug

Optimal group sequential designs for simultaneous testing of superiority and non-inferiority

A Type of Sample Size Planning for Mean Comparison in Clinical Trials

Bonferroni - based gatekeeping procedure with retesting option

Statistics and Probability Letters. Using randomization tests to preserve type I error with response adaptive and covariate adaptive randomization

Designing multi-arm multi-stage clinical trials using a risk-benefit criterion for treatment selection

Comparing Adaptive Designs and the. Classical Group Sequential Approach. to Clinical Trial Design

Two-Stage Sequential Designs Regulatory Perspective

Pubh 8482: Sequential Analysis

BLINDED EVALUATIONS OF EFFECT SIZES IN CLINICAL TRIALS: COMPARISONS BETWEEN BAYESIAN AND EM ANALYSES

Group sequential designs for negative binomial outcomes

Interim Monitoring of Clinical Trials: Decision Theory, Dynamic Programming. and Optimal Stopping

Bayesian methods for sample size determination and their use in clinical trials

An Adaptive Futility Monitoring Method with Time-Varying Conditional Power Boundary

Stepwise Gatekeeping Procedures in Clinical Trial Applications

New Developments in East

Control of Directional Errors in Fixed Sequence Multiple Testing

Pubh 8482: Sequential Analysis

Adaptive Trial Designs

Group Sequential Methods. for Clinical Trials. Christopher Jennison, Dept of Mathematical Sciences, University of Bath, UK

On the Inefficiency of the Adaptive Design for Monitoring Clinical Trials

Pubh 8482: Sequential Analysis

Estimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis.

University of California, Berkeley

Optimal exact tests for multiple binary endpoints

Chapter 2 Continuous Co-primary Endpoints

Power and Sample Sizes in RCTs Cape Town 2007

AN INVESTIGATION OF TWO-STAGE TESTS. Marc Vandemeulebroecke. Otto-von-Guericke-Universität Magdeburg. and. Schering AG

Biometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika.

Sample Size and Power I: Binary Outcomes. James Ware, PhD Harvard School of Public Health Boston, MA

Optimising Group Sequential Designs. Decision Theory, Dynamic Programming. and Optimal Stopping

Adaptive Prediction of Event Times in Clinical Trials

METHODS OF EVALUATION OF PERFORMANCE OF ADAPTIVE DESIGNS ON TREATMENT EFFECT INTERVALS AND METHODS OF

Structured testing of 2 2 factorial effects: an analytic plan requiring fewer observations

MULTIPLE TESTING TO ESTABLISH SUPERIORITY/EQUIVALENCE OF A NEW TREATMENT COMPARED WITH k STANDARD TREATMENTS

Calculation of Efficacy and Futiltiy Boundaries using GrpSeqBnds (Version 2.3)

Step-down FDR Procedures for Large Numbers of Hypotheses

Multiple Testing. Anjana Grandhi. BARDS, Merck Research Laboratories. Rahway, NJ Wenge Guo. Department of Mathematical Sciences

Multiple Endpoints: A Review and New. Developments. Ajit C. Tamhane. (Joint work with Brent R. Logan) Department of IE/MS and Statistics

Bios 6649: Clinical Trials - Statistical Design and Monitoring

In Defence of Score Intervals for Proportions and their Differences

Compatible simultaneous lower confidence bounds for the Holm procedure and other Bonferroni based closed tests

INTERIM MONITORING AND CONDITIONAL POWER IN CLINICAL TRIALS. by Yanjie Ren BS, Southeast University, Nanjing, China, 2013

Interim Monitoring of. Clinical Trials. Christopher Jennison, Dept of Mathematical Sciences, University of Bath, UK

Sequential tests controlling generalized familywise error rates

Bayesian Optimal Interval Design for Phase I Clinical Trials

A Gate-keeping Approach for Selecting Adaptive Interventions under General SMART Designs

Two-stage k-sample designs for the ordered alternative problem

Comparison of Different Methods of Sample Size Re-estimation for Therapeutic Equivalence (TE) Studies Protecting the Overall Type 1 Error

This paper has been submitted for consideration for publication in Biometrics

PubH 7470: STATISTICS FOR TRANSLATIONAL & CLINICAL RESEARCH

Statistical Aspects of Futility Analyses. Kevin J Carroll. nd 2013

Blinded sample size reestimation with count data

Bios 6649: Clinical Trials - Statistical Design and Monitoring

Transcription:

Testing a Primary and a Secondary Endpoint in a Confirmatory Group Sequential Clinical Trial ExL Pharma Workshop 2010 Rockville, MD Cyrus R. Mehta President, Cytel Corporation January 26, 2010 email: mehta@cytel.com web: www.cytel.com tel: 617-661-2011 1 ExL Pharma. 26 Jan 2010 Acknowledgements Joint work with Ajit Tamhane and Lingyun Liu Accepted for publication in Biometrics 2 ExL Pharma. 26 Jan 2010

Motivating Example CAPTURE: clinical trial of placebo vs. abciximab for coronary intervention in refractory unstable angina (1997) Primary endpoint was a composite of death, MI or urgent intervention for recurrent ischemia within 30 days Secondary endpoint was death or MI within 30 days Expect 15% event rate for placebo on primary endpoint Enroll 1400 patients; 80% power for a 1-sided level 0.025 test to detect a 5% drop in event rate with abciximab One interim analysis planned for possible early stopping Test primary and secondary endpoints hierarchically 3 ExL Pharma. 26 Jan 2010 Two-Stage Design:Notation Let (n 1,n 2 ) be the cumulative sample sizes at (Stage 1, Stage 2) Primary Statistics: (X 1,X 2 ) at (Stage 1, Stage 2) are distributed thus: X 1 N(δ 1 n1, 1); X 2 N(δ 1 n1 + n 2, 1) Secondary Statistics: (Y 1,Y 2 ) at (Stage 1, Stage 2) are distributed thus: Y 1 N(δ 2 n1, 1); Y 2 N(δ 2 n1 + n 2, 1) corr(x 1,X 2 )=corr(y 1,Y 2 )= n1 n 1 +n 2 corr(x 1,Y 1 )=corr(x 2,Y 2 )=ρ Interested in testing H j : δ j =0,forj =1, 2 and controlling FWER 4 ExL Pharma. 26 Jan 2010

Hierarchical Group Sequential Testing Procedure 5 ExL Pharma. 26 Jan 2010 Control of FWER Suppose interim analysis planned after 50% of the data arrive O Brien-Fleming 1-sided level-0.025 boundaries are adopted for the primary efficacy endpoint: c 1 =1.98 2, c 2 =1.98 With these boundaries it can be shown that P H1 (X 1 c 1 )+P H1 (X 1 <c 1,X 2 c 2 )=0.025 FWER is controlled under H 1 H 2 or H 1 H 2 Question: How to select (d 1,d 2 ) for the secondary efficacy endpoint such that FWER 0.025 under H 1 H 2? 6 ExL Pharma. 26 Jan 2010

OF Boundaries at Level-0.025 for Primary Endpoint If H 1 is true, FWER =0.025 regardless of truth or falsity of H 2 7 ExL Pharma. 26 Jan 2010 Protecting FWER under H 1 H 2 Hereafter, assume that all probabilities are computed under δ 1 > 0 and δ 2 =0 Define Δ 1 = δ 1 n1, Δ 2 = δ 1 n1 + n 2 Thus, for j =1, 2, X j N(Δ j, 1), Y j N(0, 1), corr(x j,y j )=ρ The FWER is given by FWER = P (X 1 c 1,Y 1 d 1 )+P (X 1 <c 1,X 2 c 2,Y 2 d 2 ) Given (c 1,c 2 ) is level-α, find (d 1,d 2 ) such that FWER 0.025 8 ExL Pharma. 26 Jan 2010

General Expression for FWER + c 2 Δ 2 Φ ( ) d1 + ρu FWER = Φ φ(u)du c 1 Δ 1 1 ρ 2 ( ) ( ) c1 Δ 1 γu d2 + ρu Φ φ(u)du 1 γ 2 1 ρ 2 where Δ 1 = δ 1 n1, Δ 2 = δ 1 n1 + n 2,andγ = n1 n 1 +n 2 FWER depends on δ 1 and ρ, both of which are unknown Find (d 1,d 2 ) such that FWER α regardless of δ 1 and ρ 9 ExL Pharma. 26 Jan 2010 1. d 1 = d 2 = z α won t work In single-stage designs we may test H 2 at level α provided a level-α test of H 1 acts as a gatekeeper It is natural to try the same strategy for two-stage designs; i.e., test H 2 at level α the first time that H 1 rejects This strategy fails. We have proven that: FWER α if ρ =0 max Δ1 FWER >αif ρ =1 FWER α as Δ 1 for all ρ 0 10 ExL Pharma. 26 Jan 2010

FWER with (c 1 =1.98 2,c 2 =1.98), and (d 1 = d 2 =1.96) 11 ExL Pharma. 26 Jan 2010 2. FWER α if (d 1,d 2 ) is level-α We consider three cases: 1. If c 1 = d 1, c 2 = d 2, then for ρ =1, max Δ1 FWER = α and is attained at Δ 1 =0 2. If c 1 >d 1, c 2 <d 2 (OF primary, PO secondary), then for ρ =1, max Δ1 FWER = α and is attained at Δ 1 = c 1 d 1 3. If c 1 <d 1, c 2 >d 2 (PO primary, OF secondary), then for ρ =1, max Δ1 FWER <αand is attained at Δ 1 =(c 1 d 1 ) n1 n 1 +n 2 Assume c 1 =1.98 2,c 2 =1.98 (OF boundry at level 0.025) 12 ExL Pharma. 26 Jan 2010

FWER with (c 1 =1.98 2,c 2 =1.98), and (d 1 =1.98 2,d 2 =1.98) 13 ExL Pharma. 26 Jan 2010 FWER with (c 1 =1.98 2,c 2 =1.98), and (d 1 = d 2 =2.18) 14 ExL Pharma. 26 Jan 2010

FWER with (c 1 = c 2 =2.18), and (d 1 =1.98 2,d 2 =1.98) 15 ExL Pharma. 26 Jan 2010 Review what we have learned The FWER depends on ρ and Δ 1 = δ 1 n1,bothofwhich are unknown; denote it by FWER(Δ 1,ρ) For fixed ρ, let FWER(Δ 1 (ρ),ρ)=max Δ 1 FWER(Δ 1,ρ) FWER(Δ 1 (ρ),ρ) increases with ρ Worst-case FWER occurs at FWER(Δ 1 (1), 1) If (c 1,c 2 ) are level-α primary boundaries and (d 1,d 2 ) are level-α secondary boundaries then FWER(Δ 1 (1), 1) = α if c 1 >d 1,c 2 <d 2 FWER(Δ 1 (1), 1) <αif c 1 <d 1,c 2 >d 2 In either case preservation of FWER α is guaranteed 16 ExL Pharma. 26 Jan 2010

Whydowecatertoworstcase? Wedesignfortheworstcase,ρ =1,becauseρ is unknown But suppose ρ were known. Then, for the case c 1 =1.98 2,c 2 =1.98 (level-0.025 OF boundary for primary bdry) d 1 =2.18,d 2 =2.18 (level-0.025 PO boundary for secondary bdry) the FWER is tabulated below as a function of ρ ρ 1 0.8 0.6 0.4 FWER(Δ 1 (ρ),ρ) 0.025 0.019 0.017 0.016 If ρ<1, then FWER(Δ 1 (ρ),ρ) < 0.025. Therefore we can generate a more liberal secondary boundary (d 1 = d 2 < 2.18) such that FWER(Δ 1 (ρ),ρ)=0.025 17 ExL Pharma. 26 Jan 2010 Example: suppose we knew for a fact that ρ =0.8 If ρ =0.8, then FWER(Δ 1 (0.8), 0.8) = 0.019 at d 1 = d 2 =2.18. There still remains some alpha (0.025 0.019 = 0.006) to be utilized Wemaygoondecreasingd 1 and d 2 until we exhaust all available alpha so that FWER(Δ 1 (0.8), 0.8) = 0.025 This occurs at d 1 = d 2 =2.07 which is a level-0.032 Pocock boundary By increasing the level of the Pocock boundary for the secondary endpoint from 0.025 to 0.032, we have increased overall power However, it is unrealistic to assume that ρ is known with certainty 18 ExL Pharma. 26 Jan 2010

Estimate ρ from the stage 1 data Obtain 100 (1 ɛ) upper bound for ρ, sayρ Suppose (Δ (0) 1,ρ(0) ) are the true (unknown) values of (Δ 1,ρ). Then we can show that FWER(Δ (0) 1,ρ0 ) < FWER(Δ 1 (ρ ),ρ ) (1 ɛ) +ɛ The FWER will be preserved at level α if we select secondary boundaries (d 1,d 2 ) which are such that FWER(Δ 1 (ρ ),ρ ) (1 ɛ) +ɛ = α If ρ < 1, the secondary boundaries (d 1,d 2 ) can have type-1 error greater than α 19 ExL Pharma. 26 Jan 2010 Some Results OF primary boundary at level-0.025: c 1 =1.98 2,c 2 =1.98 Allowable Error for PO Secondary Boundary given ρ Boundaries P-Values Error Spent ρ d 1 d 2 p 1 p 2 Stage 1 Stage 2 Total 1 2.18 2.18 0.015 0.015 0.015 0.010 0.025 0.8 2.08 2.08 0.019 0.019 0.019 0.013 0.032 0.6 2.04 2.04 0.021 0.021 0.021 0.014 0.035 0.4 2.01 2.01 0.022 0.022 0.022 0.015 0.037 ρ is the 99.9% upper confidence bound on ρ 20 ExL Pharma. 26 Jan 2010

Revisit the CAPTURE example If sponsor wants secondary indication included in product label, he must pre-specify α-spending functions for both primary and secondary endpoints Conservative Strategy: pre-specify level-0.025 OF spending function for primary endpoint and level-0.025 OF spending function for secondary endpoint Aggressive Stragegy: pre-specify level-0.025 OF spending function for primary endpoint and level-0.025 PO spending function for secondary endpoint 21 ExL Pharma. 26 Jan 2010 Table 1: Interim Results and Boundaries Under Conservative Strategy (OF1 OF2) Event Rates Interim Stopping Boundaries Endpoint Placebo Abciximab Test Statistic Interim Final Primary 84/532 (16%) 55/518 (11%) X 1 =2.49 c 1 =2.34 c 2 =2.01 Secondary 44/532 (8%) 26/518 (5%) Y 1 =2.12 d 1 =2.34 d 2 =2.01 Table 2: Interim Results and Boundaries for CAPTURE Under Strategy 2 (Aggressive) Event Rates Test Stopping Boundaries Endpoint Placebo Abciximab Statistic Interim Final Primary 84/532 (16%) 55/518 (11%) X 1 =2.49 c 1 =2.34 c 2 =2.01 Secondary 44/532 (8%) 26/518 (5%) Y 1 =2.12 d 1 =2.04 d 2 =2.26 In this situation, the aggressive strategy would have paid off 22 ExL Pharma. 26 Jan 2010

Concluding Remarks Primary endpoint boundaries (c 1,c 2 ) must be level-α Cannot use d 1 = d 2 = z α for secondary endpoint Any choice of level-α boundaries (d 1,d 2 ) for secondary endpoint will guarantee FWER α Under certain conditions, it is possible to use secondary boundaries (d 1,d 2 ) whose level exceeds α 1. If c 1 <d 1 and c 2 >d 2 2. If we replace ρ by its 100 (1 ɛ) upper bound ρ Secondary boundaries at a level that exceeds α are likely to face regulatory obstacles despite their statistical validity 23 ExL Pharma. 26 Jan 2010 Related References 1. Glimm E, Maurer, Bretz (2010). Hierarchical testing of multiple endpoints in group-sequential trials. Statistics in Medicine, 29, 219-228. 2. Hung HJM, Wang S-J, O Neill R (2007). Statistical considerations for testing multiple endpoints in group sequential or adaptive clinical trials. J. Biopharm. Statist., 17: 1201-1210. 3. Tamhane AC, Mehta CR, Liu L (2010). Testing a primary and a secondary endpoint in a group sequential design. Biometrics (in press). 4. The CAPTURE Investigators (1997). Randomized placebo-controlled trial of abciximab before and during conronary intervention in refractory unstabel angina. Lancet, 349, 1429-1435. 24 ExL Pharma. 26 Jan 2010