Biometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika.

Similar documents
Biometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika.

International Biometric Society is collaborating with JSTOR to digitize, preserve and extend access to Biometrics.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

A Note on a Method for the Analysis of Significances en masse. Paul Seeger. Technometrics, Vol. 10, No. 3. (Aug., 1968), pp

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

An Improved Approximate Formula for Calculating Sample Sizes for Comparing Two Binomial Distributions

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Variance of Lipschitz Functions and an Isoperimetric Problem for a Class of Product Measures

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

On the Asymptotic Power of Tests for Independence in Contingency Tables from Stratified Samples

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

The College Mathematics Journal, Vol. 16, No. 2. (Mar., 1985), pp

The American Mathematical Monthly, Vol. 100, No. 8. (Oct., 1993), pp

Mind Association. Oxford University Press and Mind Association are collaborating with JSTOR to digitize, preserve and extend access to Mind.

Detection of Influential Observation in Linear Regression. R. Dennis Cook. Technometrics, Vol. 19, No. 1. (Feb., 1977), pp

The Econometric Society is collaborating with JSTOR to digitize, preserve and extend access to Econometrica.

Mathematics of Computation, Vol. 17, No. 83. (Jul., 1963), pp

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

The College Mathematics Journal, Vol. 24, No. 4. (Sep., 1993), pp

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Biometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika.

Journal of the American Mathematical Society, Vol. 2, No. 2. (Apr., 1989), pp

E. DROR, W. G. DWYER AND D. M. KAN

Journal of Applied Probability, Vol. 13, No. 3. (Sep., 1976), pp

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Inferences on a Normal Covariance Matrix and Generalized Variance with Monotone Missing Data

The College Mathematics Journal, Vol. 21, No. 4. (Sep., 1990), pp

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

American Journal of Mathematics, Vol. 109, No. 1. (Feb., 1987), pp

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Philosophy of Science Association

On adaptive procedures controlling the familywise error rate

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Applied Multivariate and Longitudinal Data Analysis

Hypothesis Tests and Estimation for Population Variances. Copyright 2014 Pearson Education, Inc.

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

A note on profile likelihood for exponential tilt mixture models

Visual interpretation with normal approximation

The American Mathematical Monthly, Vol. 104, No. 8. (Oct., 1997), pp

Ecological Society of America is collaborating with JSTOR to digitize, preserve and extend access to Ecology.

Annals of Mathematics

On Selecting Tests for Equality of Two Normal Mean Vectors

Distribution Theory. Comparison Between Two Quantiles: The Normal and Exponential Cases

On weighted Hochberg procedures

Philosophy of Science, Vol. 43, No. 4. (Dec., 1976), pp

Covering Lemmas and an Application to Nodal Geometry on Riemannian Manifolds

A Monte-Carlo study of asymptotically robust tests for correlation coefficients

Week 5 Video 1 Relationship Mining Correlation Mining

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Proceedings of the American Mathematical Society, Vol. 17, No. 5. (Oct., 1966), pp

ON STEPWISE CONTROL OF THE GENERALIZED FAMILYWISE ERROR RATE. By Wenge Guo and M. Bhaskara Rao

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

Testing hypotheses in order

Mathematics of Operations Research, Vol. 2, No. 2. (May, 1977), pp

Applying the Benjamini Hochberg procedure to a set of generalized p-values

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Biometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Proceedings of the American Mathematical Society, Vol. 88, No. 1. (May, 1983), pp

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Proceedings of the American Mathematical Society, Vol. 57, No. 2. (Jun., 1976), pp

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Multivariate Statistical Analysis

Summary and discussion of: Controlling the False Discovery Rate: A Practical and Powerful Approach to Multiple Testing

The Review of Economic Studies, Ltd.

A class of improved hybrid Hochberg Hommel type step-up multiple test procedures

11] Index Number Which Shall Meet Certain of Fisher's Tests 397

PSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests

Multivariate Process Control Chart for Controlling the False Discovery Rate

Statistical Inference

The Periodogram and its Optical Analogy.

Karl-Rudolf Koch Introduction to Bayesian Statistics Second Edition

Annals of Mathematics

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

EC2001 Econometrics 1 Dr. Jose Olmo Room D309

Confidence Interval Estimation

Modified Simes Critical Values Under Positive Dependence

HOTELLING'S T 2 APPROXIMATION FOR BIVARIATE DICHOTOMOUS DATA

i=1 X i/n i=1 (X i X) 2 /(n 1). Find the constant c so that the statistic c(x X n+1 )/S has a t-distribution. If n = 8, determine k such that

Preface Introduction to Statistics and Data Analysis Overview: Statistical Inference, Samples, Populations, and Experimental Design The Role of

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

The American Mathematical Monthly, Vol. 81, No. 4. (Apr., 1974), pp

COMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS

INFORMS is collaborating with JSTOR to digitize, preserve and extend access to Management Science.

Transcription:

Biometrika Trust An Improved Bonferroni Procedure for Multiple Tests of Significance Author(s): R. J. Simes Source: Biometrika, Vol. 73, No. 3 (Dec., 1986), pp. 751-754 Published by: Biometrika Trust Stable URL: http://www.jstor.org/stable/2336545 Accessed: 14/04/2010 09:33 Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at http://www.jstor.org/page/info/about/policies/terms.jsp. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you may use content in the JSTOR archive only for your personal, non-commercial use. Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at http://www.jstor.org/action/showpublisher?publishercode=bio. Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission. JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact support@jstor.org. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika. http://www.jstor.org

Biometrika (1986), 73, 3, pp. 751-4 Printed in Great Britain An improved Bonferroni procedure for multiple tests of significance BY R. J. SIMES Ludwig Institute for Cancer Research, University of Sydney, Sydney, N.S. W. 2006, Australia SUMMARY A modification of the Bonferroni procedure for testing multiple hypotheses is presented. The method, based on the ordered p-values of the individual tests, is less conservative than the classical Bonferroni procedure but is still simple to apply. A simulation study shows that the probability of a type I error of the procedure does not exceed the nominal significance level, a, for a variety of multivariate normal and multivariate gamma test statistics. For independent tests the procedure has type I error probability equal to a. The method appears particularly advantageous over the classical Bonferroni procedure when several highly-correlated test statistics are involved. Some key words: Bonferroninequality; Multiple comparisons; Simultaneous test procedures. 1. INTRODUCTION The Bonferroni inequality is often used when conducting multiple tests of significance to set an upper bound on the overall significance level a (Miller, 1981, pp. 67-70). If TI,..., T" is a set of n statistics with corresponding p-values Pl,..., P,, for testing hypotheses H,..., 1H, the classical Bonferroni multiple test procedure is usually performed by rejecting Ho= {H1,..., Hn} if any p-value is less than a/ n. Furthermore the specific hypothesis Hi is rejected for each Pi S a/ n (i = 1,..., n). The Bonferroni inequality, n pr,u (Pi -c / n) --< (O -< a,< ), i=l ensures that the probability of rejecting at least one hypothesis when all are true is no greater than a. Although several multivariate methods have been developed for multiple statistical inference, the Bonferroni procedure is still valuable, being simple to use, requiring no distributional assumptions and enabling individual alternative hypotheses to be identified. Nevertheless, the procedure is conservative and lacks power if several highly correlated tests are undertaken. This paper introduces a modified Bonferroni procedure, based on the ordered p-values of the individual tests, which has an actual significance level closer to the nominal level in a wide range of circumstances and which has a lower type II error rate for a given nominal significance level than the classical procedure. Section 2 describes the procedure and shows that the probability of a type I error for the test procedure equals a for independent test stastistics. Simulation studies in? 3 show that a is an upper bound on the type I error probability for a variety of multivariate normal and chi-squared distributions. The powers of the classical and modified procedures are compared for some alternative hypotheses in? 4. 2. MODIFIED BONFERRONI PROCEDURE Let P(l),..., P(n) be the ordered p-values for testing hypotheses Ho = {H(1)... H(n)} Then Ho is rejected if P(j) ja /n for any j = 1,..., n. This test procedure has type I error probability equal to a for independent tests as shown by the following result.

752 R. J. SIMES THEOREM. Let P(),... P(n) be the order statistics of n independent uniform (0, 1) random variables and let An(a)= pr {P(j) >jal/n; j = 1,..., n} (Os ae - 1). Then An(a) = 1 - a. Proof. The result is clearly true for n = 1. For n > 1, {P(1)/P(n),..., P(n-1)/P(n)} are the order statistics of n -1 independent uniform random variables on (0, 1), independent of P(n), and P(n) has distribution function pfn (O <p < 1). Hence An(a) = An-1{pn 1 npn- dp. If An_l(a) = 1 - a then An(a) = 1 - a follows. Hence the result is proved by induction. O The modified test procedure is conservative provided pr {U P(j) j a /n} < a. This inequality is not true in general as counterexamples, albeit pathological, can be found. Nevertheless, it may well be true for a large family of iriultivariate distributions as suggested by the simulation studies below. 3. SIMULATION STUDIES Test statistics T1,..., Tn were simulated from an n-variate normal distribution, N(O, fl) with unit variances and common correlation coefficients p (O< p < 1). Two-sided p-values were obtained from each univariate normal statistic as Pi = 2 min (Yi, 1- Y1), where Yi = CD(Ti), and cd is the standard normal distribution function. Then the classical and modified Bonferroni test procedures were applied to each set of simulated p-values. Random variables with a multivariate gamma distribution were constructed from linear combinations of independent gamma variables. The degree of dependence between the resulting random variables was determined by the number of gamma variables used in common for each sum. Let {Xij} be a set of independent gamma (01, 02) variables. Then T1,..., Tn, defined by I m-i Ti = Lxoi+ xii (i=l.. n), j=l j=l is a set of gamma (mo1, 02) variables with common correlation coefficient p = I/ m. Individual chi-squared test statistics with 1 and 5 degrees of freedom were constructed by choosing m = 10 and 01 = 005 and 0-25 respectively. Then P-values corresponding to the right-hand tail of each gamma variate were obtained: Pi = 1- G( Y,) for i = 1,..., n, where G is the gamma (mo1, 02) distribution function. Simulations were carried out on a VAX 11/780 computer using IMSL subroutines GGNSM for multivariate normal and GGAMR for gamma variables. The results of the type I error rates for the modified and classical Bonferroni procedures are shown in Table 1 using n = 5 and 10 simultaneous Table 1. Type I error rates* for modified, M, and classical, c, Bonferroni test procedures; p, correlation coefficient Distribution Distribution ~~T1 ~~ 2 2 L I2 2 p Normal Xi X5 Normal Xi X5 M C M C M C M C M C M C Number of tests = 5 Number of tests = 10 0.0 0 049 0.048 0 050 0 049 0-049 0.048 0 049 0.048 0*049 0.048 0 049 0.048 0 3 0 049 0.048 0 045 0 040 0-044 0 040 0*047 0 045 0.043 0 037 0-042 0 039 0.6 0.043 0.039 0.044 0.029 0-039 0.033 0-039 0.034 0042 0.026 0*035 0029 09 0A024 O033 0A048 0A016 0-041 0019 0Q028 0@017 O0047 0A012 0039 0A014 * Based on 100000 simulations each; estimated standard error <0'007

Miscellanea 753 tests, p = 00, 0 3, 0-6 and 0 9 and a = 0'05. The results for the modified procedure, based on 100000 simulations in each case, are consistent with an upper bound on the type I error probability of 0*05. The estimated error rate drops as low as 0-028 for highly correlated multivariate normal statistics but is in the range 0 04-0 05 for most conditions simulated. The results for the classical Bonferroni procedure demonstrate that it has a similar type I error rate for independent tests but is appreciably more conservative than the modified procedure for highly correlated tests. This is particularly so for the y2 distribution. 4. POWER COMPARISONS Since the modified Bonferroni test procedure contains the classical Bonferroni procedure it is clear that the power of the modified procedure is greater than the classical procedure at the same nominal significance level. A simulation study, undertaken to evaluate the relative power of the procedures for a range of alternative hypotheses, is illustrated in Table 2 for the multivariate normal case with 10 simultaneous tests. Alternative hypotheses examined were of the form HA: =,u (i = 1,..., k inm), 0t, = 0 otherwise, with choices of 8 = l, 1 or 1l. The results are expressed as the ratio of the powers of the classical to modified procedures for each alternative. Table 2. Power of classical Bonferroni test procedure relative to modified procedure*; multivariate normal, n= 10; p, correlation coefficient No. of correct No. of correct alternatives p =2 2,=1,U2=12 alternatives =2 2,=1,U2=12 5 0.0 0*97 0*97 0*96 10 097 095 094 0*3 0*96 0.95 0*96 0*94 0.92 0*94 0*6 0*90 0*92 0*94 0-87 0 90 0.91 09 073 081 085 0-64 0-71 075 * Based on 15000 simulations; each estimate of relative power has standard error less than M%. The results for multivariate normal tests indicate little advantage to the modified test procedure over the classical method when the test statistics are independent or poorly correlated. However, the modified procedure is considerably more powerful when the test statistics are highly correlated and several alternative hypotheses are correct, particularly when the magnitude of the alternatives is small. Presumably, this is due mainly to the unduly small true type I error of the classical procedure. 5. DiSCUSSION A criticism of the classical Bonferroni test procedure is that it is too conservative for highlycorrelated test statistics. Then the modified procedure should be advantageous by having an actual significance level much closer to the nominal level and a consequent lower type II error probability. Even when the advantage is small, the only disadvantage seems to be a slight increase in computation. Since the Bonferroni inequality leads to a conservative test procedure, there have been several attempts to improve on the method. Sidaik (1968, 1971) has shown that the significance level for each test a/ n can be improved by using 1- (1 -a) l/n under certain conditions, although the degree of improvement for n < 10 and a = 0 05 is slight. Worsley (1982) found an upper bound on the probability of a type I error which is an improvement over both the Bonferroni and Sidiak upper bounds, but it requires knowledge of the joint probabilities of pairs of events and thus is not directly applicable here.

754 R. J. SIMES If the overall null hypothesis Ho is rejected, statements about individual hypotheses can be made using the classical Bonferroni test procedure: any individual null hypothesis H, can be rejected provided Pi6 a/n. An improvement on this method suggested by Holm (1979) is the sequentially rejective Bonferroni test. This procedure rejects the specific hypothesis H(i) for i= 1,..., n, provided both P(i)-<a/(n-i+1) and H(1),...,H(1l have all been rejected. The sequential test procedure has multiple level of significance a for free combinations of null hypotheses. A question arises as to what statements about individual hypotheses can be made using the modified Bonferroni test procedure. One possiblity is to reject the individual hypotheses H(1)9..., H(j), wherej = max {j: P(j) < ja/ n}. However, since there is no formal basis for rejecting the individual hypotheses H(i+1),..., H(j) not rejected by the sequentially rejective test, statements about these latter hypotheses should be considered exploratory and preferably confirmed in subsequent studies. ACKNOWLEDGEMENTS The work was supported by an NH&MRC Applied Health Sciences Research Fellowship and by a National Cancer Institute grant and undertaken in part at the Harvard School of Public Health. The author would like to thank Malcolm Hudson, Marvin Zelen and a referee for constructive suggestions and a shorter proof of the theorem. REFERENCES HOLM, S. (1979). A simple sequentially rejective multiple test procedure. Scand. J. Statist. 6, 65-70. MILLER, R. G. (1981). Simultaneous Statistical Inference, 2nd ed. New York: Springer-Verlag. SIDAK, Z. (1968). On multivariate normal probabilities of rectangles: their dependence on correlations. Ann. Math. Statist. 39, 1425-34. SIDAK, Z. (1971). On probabilities of rectangles in multivariate Student distributions: their dependence on correlations. Ann. Math. Statist. 42, 169-75. WORSLEY, K. J. (1982). An improved Bonferroni inequality and applications. Biometrika 69, 297-302. [Received June 1985. Revised February 1986]