Hochberg Multiple Test Procedure Under Negative Dependence

Similar documents
A Mixture Gatekeeping Procedure Based on the Hommel Test for Clinical Trial Applications

A class of improved hybrid Hochberg Hommel type step-up multiple test procedures

On weighted Hochberg procedures

GENERALIZING SIMES TEST AND HOCHBERG S STEPUP PROCEDURE 1. BY SANAT K. SARKAR Temple University

arxiv: v1 [math.st] 13 Mar 2008

Mixtures of multiple testing procedures for gatekeeping applications in clinical trials

MULTISTAGE AND MIXTURE PARALLEL GATEKEEPING PROCEDURES IN CLINICAL TRIALS

Lecture 6 April

Applying the Benjamini Hochberg procedure to a set of generalized p-values

PROCEDURES CONTROLLING THE k-fdr USING. BIVARIATE DISTRIBUTIONS OF THE NULL p-values. Sanat K. Sarkar and Wenge Guo

A note on tree gatekeeping procedures in clinical trials

On Generalized Fixed Sequence Procedures for Controlling the FWER

Multiple Endpoints: A Review and New. Developments. Ajit C. Tamhane. (Joint work with Brent R. Logan) Department of IE/MS and Statistics

ON STEPWISE CONTROL OF THE GENERALIZED FAMILYWISE ERROR RATE. By Wenge Guo and M. Bhaskara Rao

On adaptive procedures controlling the familywise error rate

Modified Simes Critical Values Under Positive Dependence

Control of Generalized Error Rates in Multiple Testing

Adaptive, graph based multiple testing procedures and a uniform improvement of Bonferroni type tests.

Superchain Procedures in Clinical Trials. George Kordzakhia FDA, CDER, Office of Biostatistics Alex Dmitrienko Quintiles Innovation

Familywise Error Rate Controlling Procedures for Discrete Data

The International Journal of Biostatistics

Compatible simultaneous lower confidence bounds for the Holm procedure and other Bonferroni based closed tests

Statistica Sinica Preprint No: SS R1

Control of Directional Errors in Fixed Sequence Multiple Testing

Multiple Testing. Anjana Grandhi. BARDS, Merck Research Laboratories. Rahway, NJ Wenge Guo. Department of Mathematical Sciences

FDR-CONTROLLING STEPWISE PROCEDURES AND THEIR FALSE NEGATIVES RATES

On Methods Controlling the False Discovery Rate 1

Procedures controlling generalized false discovery rate

Exact and Approximate Stepdown Methods For Multiple Hypothesis Testing

STEPDOWN PROCEDURES CONTROLLING A GENERALIZED FALSE DISCOVERY RATE. National Institute of Environmental Health Sciences and Temple University

Exact and Approximate Stepdown Methods for Multiple Hypothesis Testing

Consonance and the Closure Method in Multiple Testing. Institute for Empirical Research in Economics University of Zurich

Multiple Random Variables

IMPROVING TWO RESULTS IN MULTIPLE TESTING

Adaptive Extensions of a Two-Stage Group Sequential Procedure for Testing a Primary and a Secondary Endpoint (II): Sample Size Re-estimation

ON TWO RESULTS IN MULTIPLE TESTING

An Alpha-Exhaustive Multiple Testing Procedure

arxiv: v1 [math.st] 14 Nov 2012

Stepwise Gatekeeping Procedures in Clinical Trial Applications

Testing a secondary endpoint after a group sequential test. Chris Jennison. 9th Annual Adaptive Designs in Clinical Trials

Sanat Sarkar Department of Statistics, Temple University Philadelphia, PA 19122, U.S.A. September 11, Abstract

On Procedures Controlling the FDR for Testing Hierarchically Ordered Hypotheses

Multiple Testing in Group Sequential Clinical Trials

MULTIPLE TESTING TO ESTABLISH SUPERIORITY/EQUIVALENCE OF A NEW TREATMENT COMPARED WITH k STANDARD TREATMENTS

Optimal exact tests for multiple binary endpoints

Chapter 1. Stepdown Procedures Controlling A Generalized False Discovery Rate

Chapter 5 continued. Chapter 5 sections

Two-stage stepup procedures controlling FDR

c 2011 Kuo-mei Chen ALL RIGHTS RESERVED

The Design of Group Sequential Clinical Trials that Test Multiple Endpoints

Family-wise Error Rate Control in QTL Mapping and Gene Ontology Graphs

Multiple Testing. Hoang Tran. Department of Statistics, Florida State University

FALSE DISCOVERY AND FALSE NONDISCOVERY RATES IN SINGLE-STEP MULTIPLE TESTING PROCEDURES 1. BY SANAT K. SARKAR Temple University

A Gatekeeping Test on a Primary and a Secondary Endpoint in a Group Sequential Design with Multiple Interim Looks

Biometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika.

Multiple Testing of One-Sided Hypotheses: Combining Bonferroni and the Bootstrap

arxiv: v1 [math.st] 17 Jun 2009

Closure properties of classes of multiple testing procedures

Part IA Probability. Definitions. Based on lectures by R. Weber Notes taken by Dexter Chua. Lent 2015

A NEW APPROACH FOR LARGE SCALE MULTIPLE TESTING WITH APPLICATION TO FDR CONTROL FOR GRAPHICALLY STRUCTURED HYPOTHESES

Chapter 5. Chapter 5 sections

GOTEBORG UNIVERSITY. Department of Statistics

Power and sample size determination for a stepwise test procedure for finding the maximum safe dose

Probability signatures of multistate systems made up of two-state components

Expectation Maximization (EM) Algorithm. Each has it s own probability of seeing H on any one flip. Let. p 1 = P ( H on Coin 1 )

Spring 2012 Math 541B Exam 1

Rejoinder on: Control of the false discovery rate under dependence using the bootstrap and subsampling

First Year Examination Department of Statistics, University of Florida

Multivariate Random Variable

Random Variables and Their Distributions

STEPUP PROCEDURES FOR CONTROL OF GENERALIZATIONS OF THE FAMILYWISE ERROR RATE

Step-down FDR Procedures for Large Numbers of Hypotheses

Statistical Applications in Genetics and Molecular Biology

Precept 4: Hypothesis Testing

STAT Chapter 5 Continuous Distributions

Exam 2. Jeremy Morris. March 23, 2006

Design of the Fuzzy Rank Tests Package

Structured testing of 2 2 factorial effects: an analytic plan requiring fewer observations

MULTIPLE TESTING PROCEDURES AND SIMULTANEOUS INTERVAL ESTIMATES WITH THE INTERVAL PROPERTY

arxiv: v2 [stat.me] 6 Sep 2013

An Application of the Closed Testing Principle to Enhance One-Sided Confidence Regions for a Multivariate Location Parameter

High-Throughput Sequencing Course. Introduction. Introduction. Multiple Testing. Biostatistics and Bioinformatics. Summer 2018

Random vectors X 1 X 2. Recall that a random vector X = is made up of, say, k. X k. random variables.

Permutation Tests and Multiple Testing

Lecture 13: p-values and union intersection tests

UNIFORMLY MOST POWERFUL TESTS FOR SIMULTANEOUSLY DETECTING A TREATMENT EFFECT IN THE OVERALL POPULATION AND AT LEAST ONE SUBPOPULATION

Optimal testing of multiple hypotheses with common effect direction

4. Distributions of Functions of Random Variables

Machine learning - HT Maximum Likelihood

18 Bivariate normal distribution I

Probability Models. 4. What is the definition of the expectation of a discrete random variable?

Multiple Comparison Methods for Means

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

PROBABILITY DISTRIBUTIONS. J. Elder CSE 6390/PSYC 6225 Computational Modeling of Visual Perception

Random Variables. Saravanan Vijayakumaran Department of Electrical Engineering Indian Institute of Technology Bombay

Joint Probability Distributions and Random Samples (Devore Chapter Five)

Estimation of a Two-component Mixture Model

Multistage Tests of Multiple Hypotheses

Chapter 3 sections. SKIP: 3.10 Markov Chains. SKIP: pages Chapter 3 - continued

Probability and Distributions

Transcription:

Hochberg Multiple Test Procedure Under Negative Dependence Ajit C. Tamhane Northwestern University Joint work with Jiangtao Gou (Northwestern University) IMPACT Symposium, Cary (NC), November 20, 2014 1 / 27

Outline Preliminaries Conservative Simes Test Multivariate Uniform Distribution Models Error Rate Control for n = 2 Error Rate Control for n 3 Error Rate Control Under Negative Quadrant Dependence Simulation Results Conclusions 2 / 27

Basic Setup Test hypotheses H 1, H 2,..., H n based on their observed marginal p-values: p 1, p 2,..., p n. Label the ordered p-values: p (1) p (n) and the corresponding hypotheses: H (1),..., H (n). Denote the corresponding random variables by P (1) P (n). Familywise error rate (FWER) strong control (Hochberg & Tamhane 1987): FWER = Pr{Reject at least one true H i } α, for all combinations of the true and false H i s. 3 / 27

Hochberg Procedure Step-up Procedure: Start by testing H (n). If at the ith step p (n i+1) α/i then stop & reject H (n i+1),..., H (1) ; else accept H (n i+1) and continue testing. H (1) H (2) H (n 1) H (n) p (1) p (2) p (n 1) p (n) α α α α n n 1 2 1 Known to control FWER under independence and (certain types of) positive dependence among the p-values. Holm (1979) procedure operates exactly in reverse (step-down) manner and requires no dependence assumption (since it is based on the Bonferroni test), but is less powerful. 4 / 27

Closure Method Marcus, Peritz & Gabriel (1976). Test all nonempty intersection hypotheses H(I) = i I H i, using local α-level tests where I {1, 2,..., n}. Reject H(I) iff all H(J) for J I are rejected, in particular, reject H i iff all H(I) with i I are rejected. Strongly controls FWER α. Ensures coherence (Gabriel 1969): If I J then acceptance of H(J) implies acceptance of H(I). Stepwise shortcuts to closed MTPs exist under certain conditions. If the Bonferroni test is used as local α-level test then the resulting shortcut is the Holm step-down procedure. 5 / 27

Closure Method: Example for n = 3 6 / 27

Simes Test Simes Test: Reject H 0 = n i=1 H i at level α if p (i) iα n for some i = 1,..., n. More powerful than the Bonferroni test. Based on the Simes identity: If the P i s are independent then under H 0 : Pr (P (i) iαn ) for some i = α. Simes test is conservative under (certain types of) positive dependence: Sarkar & Chang (1997) and Sarkar (1998). Simes test is anti-conservative under (certain types of) negative dependence: Hochberg & Rom (1995), Samuel-Cahn (1996), Block, Savits & Wang (2008). 7 / 27

Hommel Procedure Under Negative Dependence When the Simes test is used as a local α-level test for all intersection hypotheses, the exact shortcut to the closure procedure is the Hommel (1988) multiple test procedure. So the Hommel procedure is more powerful than the Holm procedure. Since the Simes test controls α under independence/positive dependence but not under negative dependence, the Hommel procedure also controls/does not control FWER under the same conditions. Hochberg derived his procedure as a conservative shortcut to the exact shortcut to the closure procedure (i.e., Hommel procedure), so it also controls FWER under independence/positive dependence. 8 / 27

Hochberg Procedure Under Negative Dependence The common perception is that the Hochberg procedure may not control FWER under negative dependence. So practitioners are reluctant to use it if negative correlations are expected. They use the less powerful but more generally applicable Holm procedure. But the Hochberg procedure is conservative by construction. So, does it control FWER under also under negative dependence? 9 / 27

Conservative Simes Test Better to think of the Hochberg procedure as an exact stepwise shortcut to the closure procedure which uses a conservative Simes local α-level test (Wei 1996). Conservative Simes test: Reject H 0 = n i=1 H i at level α if p (i) α n i + 1 for some i = 1,..., n. It is conservative because α/(n i + 1) iα/n with equalities iff i = 1 and i = n. So the question of FWER control under negative dependence by the Hochberg procedure reduces to showing Pr ( P (i) under negative dependence. α n i + 1 ) for some i α 10 / 27

Conservative Simes Test For n = 2, the exact Simes test and the conservative Simes test are the same. So both are anti-conservative under negative dependence. Does the conservative Simes test remain conservative under negative dependence for n > 2? 11 / 27

Multivariate Uniform Distribution Models for P -Values Sarkar s (1998) method, used by Block & Wang (2008) to show the anti-conservatism of the Simes test, does not work for the conservative Simes test since that method requires the critical constants c n i+1 used to compare with p (i) to have the monotonicity property that c n i+1 /i must be nondecreasing in i. But for the conservative Simes test, c n i+1 /i = 1/i(n i + 1) are decreasing (resp., increasing) in i for i (n + 1)/2 (resp., i > (n + 1)/2). To study the performance of the Simes/conservative Simes test under negative dependence we chose to use a multivariate uniform distribution for P -values. The distribution should be tractable enough to deal with ordered correlated multivariate uniform random variables. 12 / 27

Normal Model Let X 1,..., X n be multivariate normal with E(X i ) = 0, Var(X i ) = 1 and Corr(X i, X j ) = γ ij (1 i < j n). Define P i = Φ(X i ) where Φ( ) is the standard normal c.d.f.: one-sided marginal P -value. Then P i U[0, 1] with ρ ij = Corr(P i, P j ) a monotone and symmetric (around zero) function of γ ij (1 i < j n). γ ij = γ 0 0.1 0.3 0.5 0.7 0.9 1 ρ ij = ρ 0 0.0955 0.2876 0.4826 0.6829 0.8915 1 This model is not analytically tractable. 13 / 27

Mixture Model U 1,..., U n i.i.d. U[0, β], V 1,..., V n i.i.d. U[β, 1], where β (0, 1) is fixed. Independent of the U i s and V i s, W is Bernoulli with parameter β. Define X i = U i W + V i (1 W ) (1 i n). Let Y i be independent Bernoulli with parameters π i and define P i = X i Y i + (1 X i )(1 Y i ) (1 i n). Then the P i are U[0, 1] distributed with Corr(P i, P j ) = ρ ij = 3β(1 β)(2π i 1)(2π j 1) (1 i < j n). Note that 3/4 ρ ij +3/4 and ρ ij > 0 π i, π j > 1/2 or < 1/2. This model is also not analytically tractable. 14 / 27

Ferguson s Model for n = 2 Ferguson (1995) Theorem: Suppose X is a continuous random variable with p.d.f. g(x) on x [0, 1]. Let the joint p.d.f. of (P 1, P 2 ) be given by f(p 1, p 2 ) = 1 2 [g( p 1 p 2 )+g(1 1 (p 1 +p 2 ) )] for p 1, p 2 (0, 1). Then P 1, P 2 are jointly distributed on the unit square with U[0, 1] marginals and ρ = Corr(P 1, P 2 ) = 1 6E(X 2 ) + 4E(X 3 ). 15 / 27

Ferguson s Model for n = 2 We chose g(x) = { U[0, θ] ρ = (1 θ)(1 + θ θ 2 ) > 0 U[1 θ, 1] ρ = (1 θ)(1 + θ θ 2 ) < 0. If θ = 1, i.e., X U[0, 1], then ρ = 0 for both models. If θ = 0 then ρ = +1 if g(x) = U[0, θ] and ρ = 1 if g(x) = U[1 θ, 1]: point mass distributions with all mass at (0, 0) and (1, 1), respectively. 16 / 27

Ferguson s Model for Bivariate Uniform Distribution p 2 p 2 1 θ 0 1 1/θ 1/θ 0 0 1/2θ 0 1/θ θ 1 p 1 1 θ 1/2θ 0 1/θ 0 1 θ 1 Left Panel: Positive correlation, Right Panel: Negative correlation p 1 17 / 27

Ferguson s Model for Multivariate Uniform Distribution Define the joint p.d.f. as f(p 1,..., p n ) = 1 i<j n w ij f ij (p i, p j ) for p i, p j [0, 1] where the w ij are the mixing probabilities which sum to 1. We use g ij (x) = U[0, θ ij ] or g ij (x) = U[1 θ ij, 1] for +ve and ve correlations, respectively. Corr(P i, P j ) = ρ ij are given by ρ ij = ±w ij (1 θ ij )(1 + θ ij θ 2 ij). 18 / 27

Type I Error of the Simes Test for n = 2 Theorem: For the Simes test, P = Pr(Type I Error) α for all ρ 0 under the Ferguson model with negative dependence. max P = 1 ( 1 + α ) 1 2α + α 2 2 /2 > α, and is achieved at θ = 1 2α + α 2 /2. For α = 0.05, max P = 0.0503 when Corr(P 1, P 2 ) = 0.053. For the bivariate normal model max P = 0.0501 when Corr(P 1, P 2 ) = 0.184. These excesses are negligible. We can choose ( ) 1 1 α c 1 = 1, c 2 = 1 + < 1 1 1.5α 2 to control Pr(Type I Error) α for all ρ 0 at a negligible loss of power. 19 / 27

Idea of the Proof for n = 2 f(p 1, p 2 ) = 0 f(p 1, p 2 ) = 1/2θ (a) (b) (c) (d) f(p 1, p 2 ) = 1/θ α (a): 0 < θ 1 2α α + (1 θ 2α)2 4θ (b): 1 2α < θ 1 3 2 P = α α + 1 2 α2 (1 θ α) 2 4θ (c): 1 3 2 α < θ 1 1 2 α α + (1 θ)2 4θ (d): 1 1 2 α < θ < 1. 20 / 27

Type I Error of Conservative Simes Test for n = 2 0.102 0.1 0.098 0.096 type I error 0.094 0.092 0.09 0.088 0.086 0.084 0.082 1 0.8 0.6 0.4 0.2 0 0.2 0.4 0.6 0.8 1 correlation Plot of type I error vs. Corr(P 1, P 2 ) in the bivariate case for Ferguson s model (solid curve) and Normal model (dashed curve) (α = 0.10) 21 / 27

Type I Error of the Conservative Simes Test for n 3 Proof of max P α for all negative correlations under the Ferguson model proceeds in two steps. First show that the result is true for n = 3. This is quite a laborious proof. Then use an induction argument to extend the result to all n > 3. 22 / 27

Idea of the Proof for n = 3 The rejection region { p (3) α/1 } { p (2) α/2 } { p (1) α/3 } for n = 3: p 2 p 3 p 1 23 / 27

Idea of the Proof for n = 3 0 p 3 α/3 α/3 p 3 α/2 α/2 p 3 α α p 3 1 Slice the rejection region along the p 3 -axis as shown above and find the probability of each two-dimensional slice using the results from the n = 2 case. This results in nine different expressions depending on the θ value for the bivariate distribution. Show that all nine expressions α. Hence their weighted sum (weighted by the probabilities of the slices) is α. 24 / 27

Error Rate Control Under Negative Quadrant Dependence Theorem: If (P 1,..., P n ) follow a multivariate uniform distribution which is a mixture of bivariate distributions f ij (p i, p j ) with mixing probabilities w ij > 0 where all pairs (P i, P j ) are negatively quadrant dependent then the conservative Simes test controls the type I error at level α < 1/2 for n 4. Negative Quadrant Dependence (Lehmann 1966): Two random variables, X and Y, are said to be negatively quadrant dependent if Pr {(X x) (Y y)} Pr (X x) Pr (Y y). The proof uses an upper bound on P (Type I error) from Hochberg & Rom (1995). 25 / 27

Simulation Results We performed simulations of type I error of the conservative Simes test for n = 3, 5, 7 for the following cases. Equicorrelated normal model for γ = 0.1/(n 1), 0.5/(n 1), 0.9/(n 1). Mixture model with β = 0.1, 0.3, 0.5 and each π i = 0.5 ± δ with δ = 0.1, 0.25, 0.4 (more than half of the ρ ij < 0). Product-correlated normal model with the same correlation matrix as the mixture model. Ferguson model with the same correlation matrix as the mixture model: Uniform distribution: g ij (x) = U[0, θ] or g ij (x) = U[1 θ, 1]. Beta distribution: g ij (x) = Beta(r, s). All simulations show that the conservative Simes test and hence the Hochberg procedure remain conservative under negative dependence for n 3. 26 / 27

Conclusions Showed that the Simes test is anti-conservative under negative dependence using Ferguson s model for n = 2. The amount of anti-conservatism is negligibly small. Showed that the critical constant c 2 of this test can be made slightly smaller than 1/2 to control P (Type I error) with negligible loss of power. Showed that the conservative Simes test remains conservative under negative dependence using Ferguson s model for n 3. The amount of conservatism increases with n. Future research: Show that the conservative Simes test remains conservative under other negative dependence models, especially under the normal model. 27 / 27