Compatible simultaneous lower confidence bounds for the Holm procedure and other Bonferroni based closed tests

Similar documents
Superchain Procedures in Clinical Trials. George Kordzakhia FDA, CDER, Office of Biostatistics Alex Dmitrienko Quintiles Innovation

On Generalized Fixed Sequence Procedures for Controlling the FWER

Mixtures of multiple testing procedures for gatekeeping applications in clinical trials

Testing a secondary endpoint after a group sequential test. Chris Jennison. 9th Annual Adaptive Designs in Clinical Trials

Closure properties of classes of multiple testing procedures

Adaptive, graph based multiple testing procedures and a uniform improvement of Bonferroni type tests.

Hochberg Multiple Test Procedure Under Negative Dependence

Applying the Benjamini Hochberg procedure to a set of generalized p-values

Lecture 6 April

Adaptive Dunnett Tests for Treatment Selection

A note on tree gatekeeping procedures in clinical trials

Multiple Testing in Group Sequential Clinical Trials

Type I error rate control in adaptive designs for confirmatory clinical trials with treatment selection at interim

Control of Directional Errors in Fixed Sequence Multiple Testing

INTRODUCTION TO INTERSECTION-UNION TESTS

MULTISTAGE AND MIXTURE PARALLEL GATEKEEPING PROCEDURES IN CLINICAL TRIALS

A Mixture Gatekeeping Procedure Based on the Hommel Test for Clinical Trial Applications

Family-wise Error Rate Control in QTL Mapping and Gene Ontology Graphs

IMPROVING TWO RESULTS IN MULTIPLE TESTING

The Design of Group Sequential Clinical Trials that Test Multiple Endpoints

Statistica Sinica Preprint No: SS R1

arxiv: v1 [math.st] 14 Nov 2012

ON TWO RESULTS IN MULTIPLE TESTING

Structured testing of 2 2 factorial effects: an analytic plan requiring fewer observations

A class of improved hybrid Hochberg Hommel type step-up multiple test procedures

Optimal exact tests for multiple binary endpoints

Bonferroni - based gatekeeping procedure with retesting option

ON STEPWISE CONTROL OF THE GENERALIZED FAMILYWISE ERROR RATE. By Wenge Guo and M. Bhaskara Rao

IEOR165 Discussion Week 12

The International Journal of Biostatistics

On weighted Hochberg procedures

Familywise Error Rate Controlling Procedures for Discrete Data

Adaptive clinical trials with subgroup selection

An Alpha-Exhaustive Multiple Testing Procedure

Multiple testing: Intro & FWER 1

Multiple Testing. Anjana Grandhi. BARDS, Merck Research Laboratories. Rahway, NJ Wenge Guo. Department of Mathematical Sciences

An Application of the Closed Testing Principle to Enhance One-Sided Confidence Regions for a Multivariate Location Parameter

A Brief Introduction to Intersection-Union Tests. Jimmy Akira Doi. North Carolina State University Department of Statistics

QED. Queen s Economics Department Working Paper No Hypothesis Testing for Arbitrary Bounds. Jeffrey Penney Queen s University

Accepted for publication in: Comm. Statist. Theory and Methods April 14, 2015 ASYMPTOTICS OF GOODNESS-OF-FIT TESTS BASED ON MINIMUM P-VALUE STATISTICS

Sanat Sarkar Department of Statistics, Temple University Philadelphia, PA 19122, U.S.A. September 11, Abstract

Multiple Comparison Methods for Means

Consonance and the Closure Method in Multiple Testing. Institute for Empirical Research in Economics University of Zurich

On adaptive procedures controlling the familywise error rate

Chapter Seven: Multi-Sample Methods 1/52

Multiple Testing of General Contrasts: Truncated Closure and the Extended Shaffer-Royen Method

Chapter 7. Hypothesis Testing

Control of Generalized Error Rates in Multiple Testing

Group sequential designs for Clinical Trials with multiple treatment arms

STAT22200 Spring 2014 Chapter 5

How to analyze many contingency tables simultaneously?

Linear Combinations. Comparison of treatment means. Bruce A Craig. Department of Statistics Purdue University. STAT 514 Topic 6 1

Biometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika.

Exact and Approximate Stepdown Methods For Multiple Hypothesis Testing

STAT 135 Lab 5 Bootstrapping and Hypothesis Testing

Looking at the Other Side of Bonferroni

Introductory Econometrics

Solution: First note that the power function of the test is given as follows,

Topic 3: Hypothesis Testing

Exact and Approximate Stepdown Methods for Multiple Hypothesis Testing

Lecture 13: p-values and union intersection tests

Statistical Inference

Summary and discussion of: Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing

SIMULTANEOUS CONFIDENCE BOUNDS WITH APPLICATIONS TO DRUG STABILITY STUDIES. Xiaojing Lu. A Dissertation

Study Ch. 9.3, #47 53 (45 51), 55 61, (55 59)

UNIFORMLY MOST POWERFUL TESTS FOR SIMULTANEOUSLY DETECTING A TREATMENT EFFECT IN THE OVERALL POPULATION AND AT LEAST ONE SUBPOPULATION

Finding Critical Values with Prefixed Early. Stopping Boundaries and Controlled Type I. Error for A Two-Stage Adaptive Design

Optimal testing of multiple hypotheses with common effect direction

Partitioning the Parameter Space. Topic 18 Composite Hypotheses

Statistical Applications in Genetics and Molecular Biology

ROI ANALYSIS OF PHARMAFMRI DATA:

Estimation in Flexible Adaptive Designs

Adaptive Treatment Selection with Survival Endpoints

David Giles Bayesian Econometrics

STEPDOWN PROCEDURES CONTROLLING A GENERALIZED FALSE DISCOVERY RATE. National Institute of Environmental Health Sciences and Temple University

Compatible Confidence Regions

Lecture 12 November 3

Visual interpretation with normal approximation

Adaptive Extensions of a Two-Stage Group Sequential Procedure for Testing a Primary and a Secondary Endpoint (II): Sample Size Re-estimation

Step-down FDR Procedures for Large Numbers of Hypotheses

Stat 710: Mathematical Statistics Lecture 40

Statistical Inference

On Procedures Controlling the FDR for Testing Hierarchically Ordered Hypotheses

Multistage Tests of Multiple Hypotheses

Multiple Testing. Hoang Tran. Department of Statistics, Florida State University

Sequential tests controlling generalized familywise error rates

Tests about a population mean

Stepwise Gatekeeping Procedures in Clinical Trial Applications

STAT 5200 Handout #7a Contrasts & Post hoc Means Comparisons (Ch. 4-5)

MULTIPLE TESTING PROCEDURES AND SIMULTANEOUS INTERVAL ESTIMATES WITH THE INTERVAL PROPERTY

Multiple Endpoints: A Review and New. Developments. Ajit C. Tamhane. (Joint work with Brent R. Logan) Department of IE/MS and Statistics

Simultaneous Confidence Intervals and Multiple Contrast Tests

PROCEDURES CONTROLLING THE k-fdr USING. BIVARIATE DISTRIBUTIONS OF THE NULL p-values. Sanat K. Sarkar and Wenge Guo

Specific Differences. Lukas Meier, Seminar für Statistik

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

Lecture 21: October 19

FDR-CONTROLLING STEPWISE PROCEDURES AND THEIR FALSE NEGATIVES RATES

Two-stage stepup procedures controlling FDR

Topic 15: Simple Hypotheses

(Foundation of Medical Statistics)

Transcription:

Compatible simultaneous lower confidence bounds for the Holm procedure and other Bonferroni based closed tests K. Strassburger 1, F. Bretz 2 1 Institute of Biometrics & Epidemiology German Diabetes Center, Düsseldorf 2 Novartis Pharma AG Tuesday, July 10/ MCP 2007, Vienna

Notation Parameter of interest: Data: Thresholds : ϑ = (ϑ 1,..., ϑ k ) Θ IR k X X (δ 1,..., δ k ) Θ Hypotheses: H i : ϑ i δ i, i I = {1,..., k} Corresponding p-values : p i = p i (X), i I

The Closure Principle Construct level-α tests for all intersection hypotheses H J = j J H j with = J I Reject H i, iff all H J with i J can be rejected. Such closed test procedures control the familywise error rate (FWER) at level α.

A Class (B) of Bonferroni Based Closed Tests Basis: Local α-levels α i (J) 0, i I, J I with α i (J) α, J I. i I An intersection hypothesis H J is rejected, if min j J (p j α j (J)) 0. Bonferroni s inequality ensures the control of the FWER at level α. Closure test: Reject all those hypotheses H i with max J:i J I min j J (p j α j (J)) 0

Simultaneous Compatible Confidence Bounds Let R = R(x) be the index set of rejected hypotheses H i R = {i I : max min(p j α j (J)) 0} J I:i J j J Problem: Find lower confidence bounds L i = L i (X), i I, such that P ϑ (L i (X) < ϑ i, for all i I) 1 α and x X : i R(x) L i (x) δ i.

Examples (Single-step) Bonferroni test R = {i : p i α/k} Local α levels: α i (J) = α/k, i I, J I (Step-down) Bonferroni-Holm test Ordered p-values: p (1) p (2) p (k) R = {(i) : max j=1,...,i (k j + 1)p (j) α} { α/ J if i J, Local α levels: α i (J) = 0 else. Simultaneous confidence bounds?

Construction Principles Bofinger (1987). Step down procedures for comparison with a control. Aust. J. Stat. 29, 348-364. Finner (1994). Testing Multiple Hypotheses: General Theory, Specific Problems, and Relationships to Other Multiple Decision Procedures. Habilitationsschrift. Universität Trier, FB IV Mathematik/Statistik. Finner & Straßburger (2006). On equivalence with the best in k-sample models. J. Am. Stat. Assoc. 101, 737-746. Finner & Straßburger (2007). Step-up related simultaneous confidence intervals for MCC and MCB. Biometrical J 49, 40-5 Hayter & Hsu (1994). On the relationship between stepwise decision procedures and confidence sets. Journal of the American Statistical Association 89, 128-136. Stefansson, Kim & Hsu (1988). On confidence sets in multiple comparisons. Statistical decision theory and related topics IV, Pap. 4th Purdue Symp.; West Lafayette/Indiana 1986, Vol. 2, 89-104

Basis: p-value Compatible Local Confidence Bounds For the construction of simultaneous confidence bounds one needs local, lower confidence bounds L i (α ) = L i (X, α ) fulfilling: For all i I and α [0, α] it holds, P ϑ (L i (X, α ) < ϑ i ) 1 α, x X : L i (x, α ) δ i p i (x) α.

Compatible Confidence Bounds for Bonferroni Based Closure Tests Compatible confidence bounds for multiple tests in B can be constructed as follows: min J J :i J max{δ i, L i (α i(j))} if i R L i = min J J :i J L i (α i(j)) if i I \ R, where J J J = {J I : min j J (p j α j (J)) > 0}. intersection hypothesis H J can not be rejected

The Subclass (SCB) of so Called Short-Cut Procedures Hommel, Bretz und Maurer (Stat. Med. 2007) showed that there exists an easy implementable short-cut for all procedures in B with α i (J) α i (J ) for all i I and all J, J with i J J I. Examples: (weighted) Bonferroni-Holm test, Test of apriori ordered hypotheses, Fallback and Gatekeeping procedures.

Simplified Confidence Bounds for Short-Cut Procedures Simultaneous compatible confidence bounds for multiple tests in SCB reduce to: min J I\R max{δ i, L i (α i(j))} if i R, L i = L i (α i(i \ R)) if i I \ R.

Further Reduction for α-exhaustive Multiple Tests Necessary condition for maximizing the number of rejected hypotheses: α i (J) = 0, for all i, J with i J and = J I. Reduced confidence bounds: max{δ i, L i (α i( )) if R = I, L i = δ i if i R I, L i (α i(i \ R)) if i I \ R. In case of i R I the lower bound L i only reflects the test decision ϑ i > δ i.

A Numerical Example Observations: X = (X 1, X 2,..., X 5 ), X i N(ϑ i, 1) Hypotheses: H i : ϑ i 0, i = 1,..., 5 FWER: α = 5% p-values: p i = 1 Φ(X i ) Loc. conf. bounds: L i (α ) = X i u 1 α i 1 2 3 4 5 x i 4.261 3.530 2.252 0.417 1.338 p i 0.00001 0.0002 0.0122 0.338 0.910

(Single-step) Bonferroni Test Local levels: α i (J) = 0.01, i I, J I i 1 2 3 4 5 x i 4.261 3.530 2.252 0.417 1.338 p i 0.00001 0.0002 0.0122 0.338 0.910 α/k 0.01 0.01 0.01 0.01 0.01 L i 1.935 1.203 0.074 1.909 3.664 Rejected hypotheses: R = {1, 2} Confidence bounds: L i = x i u 1 α/5 = x i 2.326

(Step-down) Bonferroni-Holm Test Local levels: α i (J) = 0.05/ J, i I, J I i 1 2 3 4 5 x i 4.261 3.530 2.252 0.417 1.338 p i 0.00001 0.0002 0.0122 0.338 0.910 α/i 0.01 0.0125 0.0166 0.025 0.05 L i 0 0 0 1.543 3.298 Rejected hypotheses: R = {1, 2, 3} Confidence bounds: α i (I \ R) = α/ I \ R = α/2 { 0 if i R L i = x i u 1 α/2 = x i 1.960 if i I \ R

Comparison of Single-step & Step-down Confidence Bounds i 1 2 3 4 5 Single-step L i 1.935 1.203 0.074 1.909 3.664 Step-down L i 0 0 0 1.543 3.298 Step-down versus single-step bounds / Assets and drawbacks: (+) H 3 can be rejected by the step-down procedure. (+) (moderate) larger confidence bounds for ϑ 3, ϑ 4 and ϑ 5 ( ) much smaller confidence bounds for ϑ 1 and ϑ 2