Biometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika.

Similar documents
Biometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika.

ON STEPWISE CONTROL OF THE GENERALIZED FAMILYWISE ERROR RATE. By Wenge Guo and M. Bhaskara Rao

On adaptive procedures controlling the familywise error rate

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Biometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

On weighted Hochberg procedures

A class of improved hybrid Hochberg Hommel type step-up multiple test procedures

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

A Note on a Method for the Analysis of Significances en masse. Paul Seeger. Technometrics, Vol. 10, No. 3. (Aug., 1968), pp

Applying the Benjamini Hochberg procedure to a set of generalized p-values

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

International Biometric Society is collaborating with JSTOR to digitize, preserve and extend access to Biometrics.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Mind Association. Oxford University Press and Mind Association are collaborating with JSTOR to digitize, preserve and extend access to Mind.

Detection of Influential Observation in Linear Regression. R. Dennis Cook. Technometrics, Vol. 19, No. 1. (Feb., 1977), pp

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

A note on tree gatekeeping procedures in clinical trials

Testing hypotheses in order

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Journal of Applied Probability, Vol. 13, No. 3. (Sep., 1976), pp

An Application of the Closed Testing Principle to Enhance One-Sided Confidence Regions for a Multivariate Location Parameter

The American Mathematical Monthly, Vol. 100, No. 8. (Oct., 1993), pp

The American Mathematical Monthly, Vol. 104, No. 8. (Oct., 1997), pp

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Hochberg Multiple Test Procedure Under Negative Dependence

Journal of the American Mathematical Society, Vol. 2, No. 2. (Apr., 1989), pp

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

A Mixture Gatekeeping Procedure Based on the Hommel Test for Clinical Trial Applications

Adaptive, graph based multiple testing procedures and a uniform improvement of Bonferroni type tests.

Variance of Lipschitz Functions and an Isoperimetric Problem for a Class of Product Measures

MULTISTAGE AND MIXTURE PARALLEL GATEKEEPING PROCEDURES IN CLINICAL TRIALS

On the Asymptotic Power of Tests for Independence in Contingency Tables from Stratified Samples

Proceedings of the American Mathematical Society, Vol. 17, No. 5. (Oct., 1966), pp

Mathematics of Computation, Vol. 17, No. 83. (Jul., 1963), pp

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Mixtures of multiple testing procedures for gatekeeping applications in clinical trials

The College Mathematics Journal, Vol. 16, No. 2. (Mar., 1985), pp

The College Mathematics Journal, Vol. 24, No. 4. (Sep., 1993), pp

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Control of Directional Errors in Fixed Sequence Multiple Testing

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

An Improved Approximate Formula for Calculating Sample Sizes for Comparing Two Binomial Distributions

Lecture 6 April

Covering Lemmas and an Application to Nodal Geometry on Riemannian Manifolds

Consonance and the Closure Method in Multiple Testing. Institute for Empirical Research in Economics University of Zurich

The College Mathematics Journal, Vol. 21, No. 4. (Sep., 1990), pp

GOTEBORG UNIVERSITY. Department of Statistics

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

11] Index Number Which Shall Meet Certain of Fisher's Tests 397

ON TWO RESULTS IN MULTIPLE TESTING

On Generalized Fixed Sequence Procedures for Controlling the FWER

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

E. DROR, W. G. DWYER AND D. M. KAN

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

IMPROVING TWO RESULTS IN MULTIPLE TESTING

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

The Econometric Society is collaborating with JSTOR to digitize, preserve and extend access to Econometrica.

Annals of Mathematics

Philosophy of Science, Vol. 43, No. 4. (Dec., 1976), pp

Philosophy of Science Association

Proceedings of the American Mathematical Society, Vol. 88, No. 1. (May, 1983), pp

arxiv: v4 [math.st] 21 Apr 2018

Proceedings of the American Mathematical Society, Vol. 57, No. 2. (Jun., 1976), pp

Multistage Tests of Multiple Hypotheses

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

American Journal of Mathematics, Vol. 109, No. 1. (Feb., 1987), pp

Annals of Mathematics

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

The International Journal of Biostatistics

The Review of Economic Studies, Ltd.

Department of Biostatistics University of North Carolina. Institute of Statistics Mimeo Series No Oct 1997

Annals of Mathematics

Multiple Testing of General Contrasts: Truncated Closure and the Extended Shaffer-Royen Method

Compatible simultaneous lower confidence bounds for the Holm procedure and other Bonferroni based closed tests

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

High-Throughput Sequencing Course. Introduction. Introduction. Multiple Testing. Biostatistics and Bioinformatics. Summer 2018

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

American Society for Quality

Mathematical Association of America

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

False discovery control for multiple tests of association under general dependence

Mathematics of Operations Research, Vol. 2, No. 2. (May, 1977), pp

are equal to zero, where, q = p 1. For each gene j, the pairwise null and alternative hypotheses are,

The Periodogram and its Optical Analogy.

The Econometric Society is collaborating with JSTOR to digitize, preserve and extend access to Econometrica.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

econstor Make Your Publications Visible.

QED. Queen s Economics Department Working Paper No Hypothesis Testing for Arbitrary Bounds. Jeffrey Penney Queen s University

Closure properties of classes of multiple testing procedures

Transcription:

Biometrika Trust A Stagewise Rejective Multiple Test Procedure Based on a Modified Bonferroni Test Author(s): G. Hommel Source: Biometrika, Vol. 75, No. 2 (Jun., 1988), pp. 383-386 Published by: Biometrika Trust Stable URL: http://www.jstor.org/stable/2336190 Accessed: 06/07/2010 05:47 Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at http://www.jstor.org/page/info/about/policies/terms.jsp. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you may use content in the JSTOR archive only for your personal, non-commercial use. Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at http://www.jstor.org/action/showpublisher?publishercode=bio. Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission. JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact support@jstor.org. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika. http://www.jstor.org

Biometrika (1988), 75, 2, pp. 383-6 Printed in Great Britain A stagewise rejective multiple test procedure based on a modified Bonferroni test BY G. HOMMEL Institutfiir Medizinische Statistik und Dokumentation, University of Mainz, D-6500 Mainz, Federal Republic of Germany SUMMARY Simes (1986) has proposed a modified Bonferroni procedure for the test of an overall hypothesis which is the combination of n individual hypotheses. In contrast to the classical Bonferroni procedure, it is not obvious how statements about individual hypotheses are to be made for this procedure. In the present paper a multiple test procedure allowing statements on individual hypotheses is proposed. It is based on the principle of closed test procedures (Marcus, Peritz & Gabriel, 1976) and controls the multiple level a. Some key words: Closed test procedure; Control of multiple level; Modified Bonferroni procedure; Multiple test procedure; Quasi-coherence. 1. INTRODUCTION When n hypotheses HI,..., Hn with associated test statistics T1,..., Tn are to be tested, one can make use of the corresponding p-values PI,-.., Pn. A first step to aim for an overall statement can be based on a test of the overall hypothesis Ho= ri{hi: i = 1,..., n}. Application of the Bonferroni inequality leads to a very simple level a test of Ho: reject Ho, if P(,) - a/ n, where P(,) is the smallest one of the p-values. A disadvantage of this procedure is that it may be very conservative, in particular, if the test statistics are highly correlated; moreover, it is often inappropriate to use only the smallest p-value. Another level a test which might avoid this disadvantage is based on Riiger's (1978) inequality: reject Ho, if P(k) - ka/n, where P(k) is the kth smallest of the p-values; here k (2< k s n) has to be determined before performing the n tests. If one wishes to avoid the problem of choosing k in advance, one can combine the Bonferroni test and all (n - 1) possible Riiger tests and obtain the following level a test of Ho (Hommel, 1983): reject Ho, if P(k) < ka/(ncn) for at least one k (1 - k - n), where Cn = 1 +2+... + 1/n. A very similar test of Ho which is less conservative because of omitting the constant Cn has been proposed by Simes (1986): reject Ho, if P(k)jka/n for at least one k (1sk n). Since the inequalities of Bonferroni, Riiger and Hommel are all strict, there will be constellations of dependencies among the test statistics where the test of Ho has exactly the level a; it seems, however, that these situations are rather pathological. In practical applications, the corresponding tests of Ho can be expected to be conservative. As Simes pointed out, his procedure does not always lead to a level a test of Ho; nevertheless, he suggested by a simulation study that the level of his procedure is less than or equal to a for a large family of multivariate distributions of (T1,..., T,,), and he proved that the level is exactly equal to a if the test statistics are independent. Therefore, in such cases application of Simes's procedure is recommended since it is strictly more powerful than each of the other three procedures. When, by any of these procedures, Ho has been rejected, the question remains which of the individual hypotheses Hi (i = 1,..., n) should be rejected. An answer is easy for the Bonferroni procedure, where one can reject all Hi with Pi S a/n. For the other procedures, however, it is not quite clear which of the Hi should be rejected. Simes has proposed for his procedure to reject

384 G. HOMMEL in an exploratory sense the individual hypotheses H(l),..., H(j), wherej = max {k: P(k) - ka/ n}, H(i) being the hypotheses corresponding to P(i) for i = 1,...,j. However, this procedure is not always satisfactory. Suppose that the test statistics Ti (i= 1,..., n) are independent, that m individual hypotheses are true, and that the other (n - m) hypotheses Hi are false to such an extent that pr (Pi - a/n) is nearly equal to 1. Then the probability of rejecting at least one of the m true Hi is nearly equal to 1-{1-(n-m+1)a/n}I'. If, for example, a=005, n=100 and m =50, then the probability of committing a type I error is 0 725, and it tends to 1 for m =2n and n -oo. In the following, multiple test procedures are proposed which are based on the described tests of the overall hypothesis and keep the probability of committing a type I error less than or equal to a. 2. CLOSED TEST PROCEDURES We apply the following modification of the principle of 'closed test procedures' (Marcus et al., 1976; Sonnemann, 1982). THEOREM (Hommel, 1986). Let there be given, for n > 1, n individual hypotheses H1,..., Hn, and define HI = n{hi: i E I} for all I c K, where K is the set of all nonempty subsets of {1,..., n}. Assume that there exists for each I c K a level a test based on a test statistic T,. Reject HI if it is rejected by T, and if all HJ with J D I, J c K, are rejected by TJ, too. Then this multiple test procedure controls the multiple level a; that is the probability of committing any type I error when testing all HI, I E K, is at most a irrespective of which of the HI are true. Since every HI is the intersection of the individual hypotheses Hi, i c I, it can be interpreted as a possibly 'small' overall hypothesis. Hence a level a test of HI which is only based on the p-values Pi, i E I, can be found as described in? 1. If one chooses Bonferroni tests for testing HI with the decision rule 'reject HI if there is at least one Pi, i c I, with Pi - a/lil', then the application of the Theorem leads directly to Holm's (1979) sequentially rejective procedure which is an improvement of the classical multiple Bonferroni procedure. Test strategies arising when the theorem is applied to overall tests based on the inequalities of Riiger or of Hommel, are described by Hommel (1986). When all overall tests are tests as proposed by Simes, the arising multiple test procedure can be presented by the flow chart of Hommel (1986, Fig. 3) with the choice 8kj = kalj. The decisions for the individual hypotheses can be performed in the following simpler way: compute j = max {i c{1,..., n}: P(n-i+k)> ka/ i for k = 1,..., i}. If the maximum does not exist, reject all Hi (i = 1,..., n), otherwise reject all Hi with Pi - acxj. It follows that this procedure controls the multiple level a provided each of Simes's tests for HI is a level a test. In particular, the multiple level a is kept if the n tests are independent. 3. USE OF LOGICAL RELATIONS AMONG THE HYPOTHESES Shaffer (1986) gives the following improvement of Holm's (1979) general procedure. Let, for a given system of hypotheses HI,..., Hn, S be the set of all j c {1,..., n} such that it can occur that exactly j of the n hypotheses are true and the remaining (n -j) are false. Define ti = max {j c S: j - n - i + 1} for i = 1,..., n. Then the stagewise rejective procedure using the stepwise significance bounds a/ti instead of Holm's bounds a/(n - i+ 1) controls the multiple level a. As an example, consider all n = 10 pairwise comparisons of 5 distributions. Then S= {1, 2,3,4,6, 1O} (Shaffer, 1986, Table 2), and ti= 10, t2= t3= t4= t5=6, t6= t7=4, t8=3, tg=2, t1o= 1. An analogous improvement can be found for the multiple Simes procedure provided each of Simes's tests for HI is a level a test. For this improved procedure, one has to compute j= max {i E S: P(n-i?k) > ka! i for k = 1,..., i}; then the decisions for the individual hypotheses can be taken as described for the general procedure.

Miscellanea 385 4. NUMERICAL EXAMPLE Assume that in a study with n =10 statistical tests the p-values are ordered as following: PI = 0-0021, P2= 00074, P3= 0-0093, P4= 0*0106, P5s 00121, P6=0 0218, P7= 0-0238, P8= 0-0352, P9 = 0-0466, Plo = 0-0605. Let a = 0-05 be chosen as the multiple level. In this example, because of P1 - a/10 and P2> a/9, Holm's procedure rejects HI as the only individual hypothesis. If the general multiple Simes procedure is applied, one obtains j = 5, and therefore all Hi with Pi - a/5; that is H1, H2, H3 are rejected. If it is known that the 10 tests are all pairwise comparison tests of 5 distributions, because of 5 E S one obtains j = 4. Therefore all Hi with Pi - a/4, that is H,..., H5, are rejected by the improved procedure. In order to ensure that the multiple level a for this procedure is kept, it is sufficient that Simes's test for each HI is a level a test. A simulation study was performed for the case that each pairwise comparison test is based on a test statistic Ti= XI -XmI/2, where XI, Xm (1, 1< m <5) are independently N(0, 1)-distributed, which is fulfilled asymptotically for many types of pairwise comparisons. Simulations were carried out at an IBM/AT personal computer using the SAS function RANNOR for generating random normal variables. From the results in Table 1, the level a = 0 05 is exceeded for no type of hypothesis, and the type I error rates of Simes's tests are slightly higher than those of the Bonferroni tests. Table 1. Type I error rates* for Simes's, S, and Bonferroni, B, overall testsfor all types of intersection hypotheses HI, for 10 pairwise comparisons of 5 distribution parameters il, /2, /13, /14, I15; III = number of individual hypotheses implied by HI 'Typical' hypothesis III S B 'Typical' hypothesis III S B 1 = 2= 3 = 4= L5 10 0-044 0-040 Hk = t2= 3 3 0-046 0-044 /11 = /12 = /13 = 14 6 0-045 0-041 1= 2 and 3= 4 2 0 05t 0-049375t 1= 2= 3 and 4= 5 4 0 047 0 045 l= 2 1 0 05t 0 05t * Based on 20000 simulations each; estimated standard error -0-0015. t Exact error rates. 5. DiscussION The proposed multiple test procedure is strictly not less powerful than Holm's procedure as well as all other procedures mentioned by Hommel (1986); in many cases, it seems to be considerably more powerful. The computations needed for testing the individual hypotheses are very simple. If decisions for all HI, I E K, are to be taken, it is recommended to use a computer program based on Hommel (1986, Fig. 3). This can be performed also for a large n, since the computational time is proportional to n2. An important logical property of multiple test procedures is coherence (Gabriel, 1969); i.e. if a hypothesis is retained, all its implications also have to be retained. As Hommel (1986) pointed out, general multiple test procedures, as the Bonferroni or Holm's procedure, need not be coherent, but they should be quasi-coherent; i.e. if HI = r{hi: i c I} is retained, all HJ with J c I are retained. Since the theorem is applied, the proposed procedure is quasi-coherent. In? 3 it is shown how logical dependencies in a given system of hypotheses can lead to an improvement of the procedure. Another question is how one can make use of stochastical dependencies between the test statistics. A solution of this problem seems to be more difficult; on the other hand, the overall tests according to Simes are much more flexible against different structures of stochastical dependence than, for example, Bonferroni overall tests. ACKNOWLEDGEMENT The author would like to thank Peter Bauer for helpful comments and suggestions.

386 G. HOMMEL REFERENCES GABRIEL, K. R. (1969). Simultaneous test procedures-some theory of multiple comparisons. Ann. Math. Statist. 40, 224-50. HOLM, S. (1979). A simple sequentially rejective multiple test procedure. Scand. J. Statist. 6, 65-70. HOMMEL, G. (1983). Tests of the overall hypothesis for arbitrary dependence structures. Biom. J. 25, 423-30. HOMMEL, G. (1986). Multiple test procedures for arbitrary dependence structures. Metrika 33, 321-36. MARCUS, R., PERITZ, E. & GABRIEL, K. R. (1976). On closed testing procedures with special reference to ordered analysis of variance. Biometrika 63, 655-60. RUGER, B. (1978). Das maximale Signifikanzniveau des Tests "Lehne Ho ab, wenn k unter n gegebenen Tests zur Ablehnung fuihren". Metrika 25, 171-8. SHAFFER, J. P. (1986). Modified sequentially rejective multiple test procedures. J. Am. Statist. Assoc. 81, 826-31. SIMES, R. J. (1986). An improved Bonferroni procedure for multiple tests of significance. Biometrika 73, 751-4. SONNEMANN, E. (1982). Allgemeine Losungen multipler Testprobleme. EDV in Med. u. Biol. 13, 120-8. [Received July 1987. Revised October 1987]