An Improved Approximate Formula for Calculating Sample Sizes for Comparing Two Binomial Distributions

Similar documents
Biometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika.

Detection of Influential Observation in Linear Regression. R. Dennis Cook. Technometrics, Vol. 19, No. 1. (Feb., 1977), pp

The College Mathematics Journal, Vol. 16, No. 2. (Mar., 1985), pp

The College Mathematics Journal, Vol. 24, No. 4. (Sep., 1993), pp

On the Asymptotic Power of Tests for Independence in Contingency Tables from Stratified Samples

The College Mathematics Journal, Vol. 21, No. 4. (Sep., 1990), pp

Journal of Applied Probability, Vol. 13, No. 3. (Sep., 1976), pp

Proceedings of the American Mathematical Society, Vol. 17, No. 5. (Oct., 1966), pp

A Note on a Method for the Analysis of Significances en masse. Paul Seeger. Technometrics, Vol. 10, No. 3. (Aug., 1968), pp

The American Mathematical Monthly, Vol. 100, No. 8. (Oct., 1993), pp

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

American Journal of Mathematics, Vol. 109, No. 1. (Feb., 1987), pp

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Journal of the American Mathematical Society, Vol. 2, No. 2. (Apr., 1989), pp

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Variance of Lipschitz Functions and an Isoperimetric Problem for a Class of Product Measures

Proceedings of the American Mathematical Society, Vol. 88, No. 1. (May, 1983), pp

The American Mathematical Monthly, Vol. 104, No. 8. (Oct., 1997), pp

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Mathematics of Operations Research, Vol. 2, No. 2. (May, 1977), pp

Mathematics of Computation, Vol. 17, No. 83. (Jul., 1963), pp

Biometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika.

Philosophy of Science Association

Proceedings of the American Mathematical Society, Vol. 57, No. 2. (Jun., 1976), pp

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Mind Association. Oxford University Press and Mind Association are collaborating with JSTOR to digitize, preserve and extend access to Mind.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Proceedings of the National Academy of Sciences of the United States of America, Vol. 15, No. 2. (Feb. 15, 1929), pp

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Statistics in medicine

A SAS/AF Application For Sample Size And Power Determination

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Confidence Intervals of the Simple Difference between the Proportions of a Primary Infection and a Secondary Infection, Given the Primary Infection

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Philosophy of Science, Vol. 43, No. 4. (Dec., 1976), pp

Clinical Trials. Olli Saarela. September 18, Dalla Lana School of Public Health University of Toronto.

Inverse Sampling for McNemar s Test

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

The College Mathematics Journal, Vol. 32, No. 5. (Nov., 2001), pp

International Biometric Society is collaborating with JSTOR to digitize, preserve and extend access to Biometrics.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Covering Lemmas and an Application to Nodal Geometry on Riemannian Manifolds

Asymptotically Efficient Estimation of Covariance Matrices with Linear Structure. The Annals of Statistics, Vol. 1, No. 1. (Jan., 1973), pp

Duke University. Duke Biostatistics and Bioinformatics (B&B) Working Paper Series. Randomized Phase II Clinical Trials using Fisher s Exact Test

The American Mathematical Monthly, Vol. 81, No. 4. (Apr., 1974), pp

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

The Use of Confidence or Fiducial Limits Illustrated in the Case of the Binomial

APPENDIX B Sample-Size Calculation Methods: Classical Design

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Biometrika Trust. Biometrika Trust is collaborating with JSTOR to digitize, preserve and extend access to Biometrika.

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

SAMPLE SIZE RE-ESTIMATION FOR ADAPTIVE SEQUENTIAL DESIGN IN CLINICAL TRIALS

The Annals of Human Genetics has an archive of material originally published in print format by the Annals of Eugenics ( ).

A comparison of inverse transform and composition methods of data simulation from the Lindley distribution

The Limiting Similarity, Convergence, and Divergence of Coexisting Species

CHAPTER 9, 10. Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities:

MATH 240. Chapter 8 Outlines of Hypothesis Tests

Introduction to Probability and Statistics Twelfth Edition

DISCRETE RANDOM VARIABLES EXCEL LAB #3

1 Introduction A common problem in categorical data analysis is to determine the effect of explanatory variables V on a binary outcome D of interest.

Derivation of Monotone Likelihood Ratio Using Two Sided Uniformly Normal Distribution Techniques

1 Inverse Transform Method and some alternative algorithms

CONTINUOUS RANDOM VARIABLES

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

STATISTICAL ANALYSIS WITH MISSING DATA

Journal of Educational and Behavioral Statistics

Statistics Handbook. All statistical tables were computed by the author.

Sensitivity analysis and distributional assumptions

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Arrow Pushing in Organic Chemistry

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

The Econometric Society is collaborating with JSTOR to digitize, preserve and extend access to Econometrica.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

ON THE MONOTONE LIKELIHOOD RATIO OFA "FAULTY INSPECTION" DISTRIBUTION ABSTRACT

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

In Defence of Score Intervals for Proportions and their Differences

Epidemiology Wonders of Biostatistics Chapter 11 (continued) - probability in a single population. John Koval

Journal of the American Statistical Association, Vol. 91, No (Jun., 1996), pp

Chapitre 3. 5: Several Useful Discrete Distributions

Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at

Formulas and Tables by Mario F. Triola

SAMPLE SAMPLE SAMPLE SAMPLE

Model Based Statistics in Biology. Part V. The Generalized Linear Model. Chapter 16 Introduction

Random Signals and Systems. Chapter 1. Jitendra K Tugnait. Department of Electrical & Computer Engineering. James B Davis Professor.

CHAPTER 7. Parameters are numerical descriptive measures for populations.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

THE OPTICAL SCIENCES COMPANY P.O. Box 446, Placentia, California Phone: (714) Pulsed Laser Backscatter. Glenn A. Tyler.

The Suntory and Toyota International Centres for Economics and Related Disciplines

Optimal rejection regions for testing multiple binary endpoints in small samples

American Society for Quality

Correlation. Martin Bland. Correlation. Correlation coefficient. Clinical Biostatistics

Transcription:

An Improved Approximate Formula for Calculating Sample Sizes for Comparing Two Binomial Distributions J. T. Casagrande; M. C. Pike; P. G. Smith Biometrics, Vol. 34, No. 3. (Sep., 1978), pp. 483-486. Stable URL: http://links.jstor.org/sici?sici=0006-341x%28197809%2934%3a3%3c483%3aaiaffc%3e2.0.co%3b2-t Biometrics is currently published by International Biometric Society. Your use of the JSTOR archive indicates your acceptance of JSTOR's Terms and Conditions of Use, available at http://www.jstor.org/about/terms.html. JSTOR's Terms and Conditions of Use provides, in part, that unless you have obtained prior permission, you may not download an entire issue of a journal or multiple copies of articles, and you may use content in the JSTOR archive only for your personal, non-commercial use. Please contact the publisher regarding any further use of this work. Publisher contact information may be obtained at http://www.jstor.org/journals/ibs.html. Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission. The JSTOR Archive is a trusted digital repository providing for long-term preservation and access to leading academic journals and scholarly literature from around the world. The Archive is supported by libraries, scholarly societies, publishers, and foundations. It is an initiative of JSTOR, a not-for-profit organization with a mission to help the scholarly community take advantage of advances in technology. For more information regarding JSTOR, please contact support@jstor.org. http://www.jstor.org Tue Oct 30 13:03:30 2007

BIOMETRICS 34, 483-486 September, 1978 An Improved Approximate Formula for Calculating Sample Sizes for Comparing Two Binomial Distributions J. T. CASAGRANDE and M. C. PIKE Department of Community and Family Medicine, University of Southern California School of Medicine, 2025 Zonal Avenue, Los Angeles, California 90033, U.S.A. P. G. SMITH D.H.S.S.Cancer Epidemiology and Clinical Trials Unit, Oxford University, Oxford, England Summary An improt'ed approximate formula is deriuedfor determination of the sample sizes required for detecting the diference between two binomial parameters with specijied type I and type II errors. I. Introduction If x and y are each binomially distributed with index n and parameters p, and p, respectively, then the comparison of these two binomial distributions is usually displayed as a 2 X 2 table and Fisher's "exact" (one-sided) test may be used to test the null hypothesis,p, = p,, against the alternative hypothesis that p, > p,. The exact test is based on arguing conditionally on the observed number of "successes", i.e., x + y, see Yates (1934) and Fisher (1935). The distribution of x with x + y = m fixed depends on p, and p, only through the odds ratio 8 = @,q,)/(q,p,), where q, = 1 - p,. 'l'he conditional distribution is Pr (x / 8) = C (n,x)c (n,y) fix/z,c(n,i)c(n,m - i)ol, where i takes the values L = max(0, m - n) to U = min(n, m). Let x, be the critical value of x for the exact test ofp, > p, (i.e., 8 > 1) against the null hypothesis 8 = 1 with type I error a, so that 2 Pr(il8 = 1) <a and Pr(il0 = l)>n. i=xc I=XC-1 The conditional power is thus P(8I m) = Z,=xc Pr(i) 8) and the expected power is P(8, p,) = Zm P(8 ( mpr(nz), where m takes the values 1 to 2n - 1, and To find the minimum n to achieve a power of loop percent an iterative procedure is required. This involves very extensive calculations and numerous approximations have thus bev suggested. The two most commonly employed are: {l) The "arcsin formula" as given, for example, in Cochran and Cox (1957), - Key Words: Sample size; 2 X 2 table.

484 BIOMETRICS, SEPTEMBER 1978 n = (z1-, + ~p)~/[2(arcsin ;pl - arcsin ~p,)~], (1) where @(z,) = y and @ is the cumulative normal distribution. (2) The "uncorrected x2formula" as given, for example, in Fleiss (1973), n = [z1-, J(2D4) + zp ;'(plql + p2q2)i2/(p1- p2i2, (2 where p = (pl + p2)/2, q = 1 - p. In general formulae (1) and (2) give very similar answers, which are too low (Haseman 1978). A "corrected X2 method" given by Kramer and Greenhouse (1959) is where A = [zl-,,'(2pq) + zp 4(plql + p2q,)i2. It can be seen that equation (3) is equation (2) with a correction factor. Unlike equations (1) and (2), equation (3) is conservative. Table 1 shows the exact values compared to the values obtained by the three approximations for several values of p1 and p,. Inspection of this table shows that an average of the corrected and uncorrected x2approximations would give a very close estimate of the exact values. This led us to reassess the derivation of the Kramer-Greenhouse corrected x2method. 2. Derivation of Corrected x2 Consider the statistic d = 0.5(x - y). The usual x2test with Yates' correction may be written where the variance of d (on the null hypothesis) is V(d) = 0.51 [(x + y)/2n][l - (x + y)/2n]. Now simplify (4) by replacing x + y by its expectation, and then solve it ford* with X set to the specified type I error, i.e., z,-,~ = (d* - 0.5)2/(0.5n~q), thus If this n and d* are to lead to a test with power (I then Pr(d 2 d*) = (I. But d has expected value 0.5n(pl - p,) and variance 0.25(p1q, + p2q2), so that Pr(d 2 d*) 1 - @(w), where @ is the cumulative normal distribution function, w = [d* - 0.5n(p1 - p,) - c]/,'[0.25n(plql + p2q2)], and c is a correction factor to allow for the discreteness in the distribution of d. Thus we obtain a second equation for d* to satisfy, i.e., -ZP = [d* - 0.5n(p1 - pn) - c]/~[0.25n(p~q~ + p2q2)]. TABLE 1 Comparison of Exact Values with Those of Three Approximations with a 90% Chance of Declaring a Significant Result on a One-Sided Test at the 5% Level Exact method 178 71 445 47 Arcsin Approximation 165 63 420 41 Uncorrected x2 166 63 423 42 Kramer-Greenhousex2 191 78 462 54

SAMPLE SIZE FOR COMPARING TWO BINOMIALS 485 TABLE 2 Comparison of 'Exact' Values with Those Obtained by an Improved Approximate Formula for a Power of 90% and a One-Sided Test at the 5% Level 6 = p, - p, where p, > p, p, 0.05 0.10 0.15 0.20 0.25 0.30 0.35 0.40 0.45 0.50 0.55 0.60 0.65 0.70 Upper figure = Exact Value Thus d* = 0.5n(pl - p2) + c - zp,[0.25(nplql + np,q,)]. (6) TO solve for n we set (5) = (6). This leads to Setting c = -0.5 in (7), one obtains the Kramer-Greenhouse formula (equation 3) and setting c = 0.5, one obtains the uncorrected formula (2). For c = 0 a value for n roughly midway between (2) and (3) is obtained. The choice of c = 0 would appear to be correct since the equations ford* are not integral and the correction factor has already been employed in its calculation (equation 5). The choice c = -0.5 will clearly lead to a conservative test. We therefore suggest as an improved X2 formula We have tested this approximation over a wide range of values of p, and p,. Table 2 compares the exact values with those obtained with the improved approximation at the onesided significance level a = 0.05 for @ = 0.90. The values of p, and p, were chosen to correspond to those of Cochran and Cox (1957). Formula (8) is clearly an excellent approximation and effectively eliminates the necessity of ever requiring the exact values. Isographs calculated using Formula (1) were recently presented by Feigl (1978) with the statement that care should be exercised in their use due to the known inaccuracy of Formula (1); recomputed isographs using Formula (8) could be used without such a precaution.

486 BIOMETRICS, SEPTEMBER 1978 A cknowledgments This research is supported by Contract N01-CP-53500 and Grant Pol-04-17054 from the National Cancer Institute. National Institutes of Health. Une formulation d'approximation ame'liore'e est de'termine'e pour les tailles d'e'chantillons nicessaires a la se'lection des d~re'rences entre deux paramptres binomiaux aaec des erreurs de type I et II spe'cifib. References Cochran, W. G,and Cox, 6.M. (1957). Experimental Designs, John Wiley and Sons, Inc., New York. Feigl, P. (1978). A graphical aid for determining sample size when comparing two independent proportions. Biornetrics 34, 111-122. Fisher, R. A. (1935). The logic of inductive inference. Journal of the Royal Statistical Society, Series A 98, 39-54. Fleiss, J. L. (1973). Statistical Methods for Rates and Proportions, John Wiley and Sons, Inc., New York. Haseman, J. K. (1978). Exact sample sizes for use with the Fisher-Irwin Test for 2 X 2 tables. Biotnetrics 34, 106-109. Kramer, M, and Greenhouse, S. W. (1959). Determination of sample size and selection of cases. In Psychophartr~acology: Problems in Evaluation. J. 0. Cole and R. W. Gerard (eds.), National Academy of Sciences, National Research Council, Washington, D. C., Publication 583, 356-37 1. Yates, F. (1934). Contingency tables involving small numbers and the X2 test. Journal of the Royal Statistical Society, Supplement 1, 217-235. Received November 1977; Revised March 1978