On the coverage probability of the Clopper-Pearson confidence interval

Similar documents
Two results in statistical decision theory for detecting signals with unknown distributions and priors in white Gaussian noise.

arxiv: v1 [math.st] 5 Jul 2007

William C.L. Stewart 1,2,3 and Susan E. Hodge 1,2

Basic building blocks for a triple-double intermediate format

Charles Geyer University of Minnesota. joint work with. Glen Meeden University of Minnesota.

Laboratoire de l Informatique du Parallélisme

Centre d Economie de la Sorbonne UMR 8174

Outils de Recherche Opérationnelle en Génie MTH Astuce de modélisation en Programmation Linéaire

arxiv: v1 [stat.ap] 17 Mar 2012

Lawrence D. Brown, T. Tony Cai and Anirban DasGupta

The Core of a coalitional exchange economy

Random variables. Florence Perronnin. Univ. Grenoble Alpes, LIG, Inria. September 28, 2018

A Simple Approximate Procedure for Constructing Binomial and Poisson Tolerance Intervals

Weizhen Wang & Zhongzhan Zhang

A generalization of Cramér large deviations for martingales

Multiplication by an Integer Constant: Lower Bounds on the Code Length

arxiv: v1 [math.st] 26 Jun 2011

Replicator Dynamics and Correlated Equilibrium

Kato s inequality when u is a measure. L inégalité de Kato lorsque u est une mesure

RN-coding of numbers: definition and some properties

Best linear unbiased prediction when error vector is correlated with other random vectors in the model

THE SHORTEST CONFIDENCE INTERVAL FOR PROPORTION IN FINITE POPULATIONS

ANNALES. FLORENT BALACHEFF, ERAN MAKOVER, HUGO PARLIER Systole growth for finite area hyperbolic surfaces

Compositions and samples of geometric random variables with constrained multiplicities

Numerical analysis for an interpretation of the pressuremeter test in cohesive soil

STATISTICAL ANALYSIS AND COMPARISON OF SIMULATION MODELS OF HIGHLY DEPENDABLE SYSTEMS - AN EXPERIMENTAL STUDY. Peter Buchholz Dennis Müller

Estimation of Gutenberg-Richter seismicity parameters for the Bundaberg region using piecewise extended Gumbel analysis

A note on the moving hyperplane method

Journal de la Société Française de Statistique Vol. 153 No. 3 (2014) Interval reliability for semi-markov systems in discrete time

Statistics for Applications Spring 2009

Minimum sizes of identifying codes in graphs differing by one edge or one vertex

On the existence of maximum likelihood estimators in a Poissongamma HGLM and a negative binomial regression model

Regression on Parametric Manifolds: Estimation of Spatial Fields, Functional Outputs, and Parameters from Noisy Data

Delaunay triangulation of a random sample of a good sample has linear size

Mesurer des déplacements ET des contraintes par correlation mécanique d images

Interval estimation via tail functions

Good Confidence Intervals for Categorical Data Analyses. Alan Agresti

On the uniform Poincaré inequality

Stickelberger s congruences for absolute norms of relative discriminants

Bound on Run of Zeros and Ones for Images of Floating-Point Numbers by Algebraic Functions

Chromatic roots as algebraic integers

Bounded Normal Approximation in Highly Reliable Markovian Systems

ON THE ASYMPTOTICS OF GREEN S FUNCTIONS OF ELLIPTIC OPERATORS WITH CONSTANT COEFFICIENTS. Shmuel Agmon

The impact of heterogeneity on master-slave on-line scheduling

Maximizing the Cohesion is NP-hard

Coverage Properties of Confidence Intervals for Proportions in Complex Sample Surveys

ANNALES DE LA FACULTÉ DES SCIENCES DE TOULOUSE

Convex Sets Strict Separation in Hilbert Spaces

Tolerance Intervals With Improved Coverage Probabilities for Binomial and Poisson Variables

Estimating binomial proportions

Apprentissage automatique Méthodes à noyaux - motivation

ADDITION BEHAVIOR OF A NUMERICAL SEMIGROUP. Maria Bras-Amorós

A new lack-of-fit test for quantile regression models using logistic regression

SEPARABILITY AND COMPLETENESS FOR THE WASSERSTEIN DISTANCE

A set of formulas for primes

Irrationality exponent and rational approximations with prescribed growth

About partial probabilistic information

Odd perfect polynomials over F 2

OPTIMAL SURE PARAMETERS FOR SIGMOIDAL WAVELET SHRINKAGE

On a simple construction of bivariate probability functions with fixed marginals 1

DETERMINING HIGH VOLTAGE CABLE CONDUCTOR TEMPERATURES. Guy Van der Veken. Euromold, Belgium. INVESTIGATIONS. INTRODUCTION.

Generalized Budan-Fourier theorem and virtual roots

Design of the Fuzzy Rank Tests Package

University of Lisbon, Portugal

MGDA II: A direct method for calculating a descent direction common to several criteria

Evaluating uncertainty in measurements of fish shoal aggregate backscattering cross-section caused by small shoal size relative to beam width

On Polynomial Pairs of Integers

Expression of Dirichlet boundary conditions in terms of the strain tensor in linearized elasticity

A Convenient Way of Generating Gamma Random Variables Using Generalized Exponential Distribution

Generalized Budan-Fourier theorem and virtual roots

Sur des classes de fonctions à seuil caractérisables par des contraintes relationnelles

Contrôle adaptatif backstepping de structures à base isolée hystérétiques

ISSN X Quadratic loss estimation of a location parameter when a subset of its components is unknown

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Numerical modification of atmospheric models to include the feedback of oceanic currents on air-sea fluxes in ocean-atmosphere coupled models

OPTIMIZATION OF AN ULTRATHIN CENTRIFUGAL FAN BASED ON THE TAGUCHI METHOD WITH FUZZY LOGICS. Kuang-Hung Hsien and Shyh-Chour Huang

Capillary rise between closely spaced plates : effect of Van der Waals forces

cedram Article mis en ligne dans le cadre du Centre de diffusion des revues académiques de mathématiques

An non-ad hoc decision rule for Automatic Target Identification using Data Fusion of Dissimilar Sensors

Permuting the partitions of a prime

Confidence intervals for a binomial proportion in the presence of ties

A set of formulas for primes

The FLRW cosmological model revisited: relation of the local time with th e local curvature and consequences on the Heisenberg uncertainty principle

PROD. TYPE: COM. Simple improved condence intervals for comparing matched proportions. Alan Agresti ; and Yongyi Min UNCORRECTED PROOF

A set of formulas for primes

CURRICULUM VITÆ. Mathematical interests : Number Theory, Logic (Model Theory), Algebraic Geometry, Complex and p-adic Analysis.

On a multilinear character sum of Burgess

Optimal and asymptotically optimal CUSUM rules for change point detection in the Brownian Motion model with multiple alternatives

Laboratoire de l Informatique du Parallélisme. École Normale Supérieure de Lyon Unité Mixte de Recherche CNRS-INRIA-ENS LYON n o 8512

Adsorption of chain molecules with a polar head a scaling description

ISSN: (Print) (Online) Journal homepage:

Lecture 2: The figures of the city, 1: interactions

On the survival and Irreductibility Assumptions for Financial Markets with Nominal Assets

Generalized Ricci Bounds and Convergence of Metric Measure Spaces

Finding good 2-partitions of digraphs II. Enumerable properties

A set of formulas for primes

BERGE VAISMAN AND NASH EQUILIBRIA: TRANSFORMATION OF GAMES

The 0-1 Outcomes Feature Selection Problem : a Chi-2 Approach

A New Confidence Interval for the Difference Between Two Binomial Proportions of Paired Data

Markov Decision Processes

Transcription:

On the coverage probability of the Clopper-Pearson confidence interval Sur la probabilité de couverture de l intervalle de confiance de Clopper-Pearson Rapport Interne GET / ENST Bretagne Dominique Pastor Ecole Nationale Supérieure des Télécommunications de Bretagne, CNRS TAMCIC (UMR 2872), Technopôle de Brest Iroise, CS 83818, 29238 BREST Cedex, FRANCE e-mail: dominique.pastor@enst-bretagne.fr 1

Abstract Given α (0, 1), the coverage probability of the 100(1 α)% Clopper-Pearson Confidence Interval (CPCI) for estimating a binomial parameter p is proved to be larger than or equal to 1 α/2 for sample sizes less than a bound that depends on p. This is a mathematical evidence that, as noticed in recent papers on the basis of numerical results, the CPCI coverage probability can be much higher than the desired confidence level and thence, that the Clopper- Pearson approach is mostly inappropriate for forming confidence intervals with coverage probabilities close to the desired confidence level. Keywords Binomial proportion, Clopper-Pearson Confidence Interval, Confidence interval, Coverage probability, Incomplete beta function. 2

Résumé Soit α (0, 1). Considérons l intervalle de confiance de Clopper-Pearson de niveau de confiance 100(1 α)%. On prouve que pour estimer une proportion binomiale p, la probabilité de couverture de cet intervalle est supérieure ou égale à 1 α/2 lorsque la taille de l échantillon est inférieure à une borne qui dépend de p. Ceci montre que, comme l ont remarqué certains auteurs sur la base de résultats numériques, la probabilité de couverture de l intervalle de confiance de Clopper-Pearson peut être largement supérieure au niveau de confiance 1 α. Aussi, l intervalle de confiance de Clopper-Pearson n est pas adéquat dès que l on veut garantir une probabilité de couverture proche du niveau de confiance spécifié. Mots-clés Intervalle de confiance, Intervalle de confiance de Clopper-Pearson, Fonction beta incomplète, Proportion binomiale, Probabilité de couverture. 3

Contents 1 Introduction 5 2 Theoretical results 5 3 Proof 7 4

1 Introduction Interval estimation of a binomial proportion p is one of the most basic and methodologically important problems in practical statistics. A considerable literature exists on the topic and manifold methods have been suggested for bracketing a binomial proportion within a confidence interval. Among those methods, the Clopper-Pearson Confidence Interval (CPCI), originally introduced in 1934 by Clopper and Pearson ([4]), is often referred as the exact procedure by some authors for it derives from the binomial distribution without resorting to any approximation to the binomial. However, despite this exacteness, the CPCI is basically conservative because of the discreteness of the binomial distribution: given α (0, 1), the coverage probability of the 100(1 α)% CPCI is above or equal to the nominal confidence level (1 α). In this paper, it is proved that this coverage probability is actually larger than or equal to 1 α/2 if the sample size is less than ln(α/2)/ ln(max(p, 1 p)). This bound basically depends on the proportion itself. Thence, as suggested by numerical results presented in [2] and [3], the CPCI is not suitable in practical situations when getting a coverage probability close to the specified confidence level is more desirable than guaranteeing a coverage probability above or equal to the said confidence level. 2 Theoretical results Throughout the rest of this paper, α stands for some real value in the interval (0, 1) and n for some natural number. We start by giving a description of the 100(1 α)% CPCI, that is the CPCI with level of confidence (1 α) where α (0, 1). This description is purposely brief for it focuses only on the material needed for stating proposition 1 below. For further details on the construction of the CPCI, the reader can refer to numerous papers and textbooks on statistics ([2], [3], [?], [7] amongst many others). Let (l k ) k {0,...,n} be the sequence defined as follows. We put i=k l 0 := 0. (1) For k {1,..., n}, l k is defined as the unique solution in (0, 1) for θ in the equation n ( ) n θ i (1 θ) n i = α/2. (2) i 5

Similarly, the sequence (u k ) k {0,...,n} is defined as follows. We set i=0 u n := 1 (3) and, for k {0,..., n 1}, u k is the unique solution in (0, 1) for θ in the equation k ( ) n θ i (1 θ) n i = α/2. (4) i Let X henceforth stand for a binomial variate for sample size n and binomial parameter p [0, 1]. In other words, X stands for the total number of successes in n independent trials with constant probability p of success. Since X is valued in {0, 1,..., n}, the random variables l X and u X are well-defined. The 100(1 α)% CPCI for size n is then the random interval (l X, u X ) whose lower and upper endpoints are l X and u X respectively. The CPCI coverage probability is defined as the probability P ({l X < p < u X }) that the CPCI actually brackets the true value p of the proportion. The 100(1 α)% CPCI is known to be conservative in the sense that its coverage probability is always above or equal to the confidence level ([2], [3], [4]). This standard result is refined by the following proposition whose proof is given in section 3. Proposition 2.1 With the same notations as above, (i) If n < ln(α/2)/ ln(2), then P ({l X < p < u X }) = 0 for p = 0, 1 α/2 for 0 < p (α/2) 1/n, = 1 for (α/2) 1/n < p < 1 (α/2) 1/n, 1 α/2 for 1 (α/2) 1/n p < 1, = 0 for p = 1. (ii) If n ln(α/2)/ ln(2), then P ({l X < p < u X }) = 0 for p = 0, 1 α/2 for 0 < p < 1 (α/2) 1/n, 1 α for 1 (α/2) 1/n p (α/2) 1/n, 1 α/2 for (α/2) 1/n < p < 1, = 0 for p = 1. The following result then easily derives from the proposition above. 6

Lemma 2.2 With the same notations as above, for every proportion p (0, 1) and every natural number n less than ln(α/2)/ ln(max(p, 1 p)), the CPCI coverage probability is larger than or equal to 1 α/2. Proof: Since max(p, 1 p) is larger than or equal to 1/2, ln(α/2)/ ln(2) is less than ln(α/2)/ ln(max(p, 1 p)). If n < ln(α/2)/ ln(2), the result then follows from proposition 1, statement (i). If ln(α/2)/ ln(2) n < ln(α/2)/ ln(max(p, 1 p)), p is either larger than (α/2) 1/n or smaller than 1 (α/2) 1/n and the result follows from proposition 1, statement (ii). The theoretical results above are thus mathematical evidences that, as suggested in [2] and [3], the CPCI is inaccurate in the sense that [...]its actual coverage probability can be much larger than 1 α unless the sample size n is quite large. ([3, Sec. 4.2.1]). 4.2.1). Figures 1, 2 and 3 illustrate the foregoing by displaying the coverage probability of the 95% CPCI for p = 0.1, p = 0.05 and p = 0.01 respectively and sample sizes ranging from 1 to 500. In each figure, the value of ln(α/2)/ ln(max(p, 1 p)) is represented by a vertical red line. On the left hand side of this line, coverage probabilities are all larger than or equal to 97.5%; on the right hand side of this same line, coverage probabilities can be less than 97.5% and even close to 95%. 1 0.995 0.99 0.985 Coverage probabilities 0.98 0.975 0.97 0.965 0.96 0.955 0.95 0 50 100 150 200 250 300 350 400 450 500 ln(δ/2)/ln(max(p,1 p)) = 35.0120 Sample sizes Figure 1: The 95% CPCI-BS Coverage probabilities for proportion p = 0.1 3 Proof Given θ [0, 1], let P θ stand for the distribution of a binomial variate for sample size n with binomial parameter θ and let F θ be the distribution function defined for every real value x by F θ (x) = P θ ((, x]). It is then convenient to set G θ (x) = P θ ([x, )) for every real value x. According to the definitions 7

1 0.995 0.99 0.985 Coverage probabilities 0.98 0.975 0.97 0.965 0.96 0.955 0.95 0 50 100 150 200 250 300 350 400 450 500 ln(δ/2)/log(max(p,1 p)) = 71.9174 Sample sizes Figure 2: The 95% CPCI-BS Coverage probabilities for proportion p = 0.05 1 0.995 0.99 0.985 Coverage probabilities 0.98 0.975 0.97 0.965 0.96 0.955 0 50 100 150 200 250 300 350 400 450 500 Sample sizes ln(δ/2)/ln(max(p,1 p)) = 367.0404 Figure 3: The 95% CPCI-BS Coverage probabilities for proportion p = 0.01 of F θ and G θ, the left hand sides in (2) and (4) are trivially equal to G θ (k) and F θ (k) respectively. Therefore, by definition of l k and u k, and G lk (k) = α/2 for every k {1,..., n} (5) F uk (k) = α/2 for every k {0,..., n 1}. (6) According to [1, Eq. 6.6.4, p. 263], for every given θ [0, 1], G θ (k) = I θ (k, n k + 1) for k {1,..., n}, (7) and F θ (k) = 1 I θ (k + 1, n k) for k {0,..., n 1}. (8) 8

where I x (a, b) stands for the Incomplete Beta Function ([1, Eq. 6.2.1, p. 258, Eq. 6.6.2, p. 263, Eq. 26.5.1, p. 944]). Trivially, the maps θ [0, 1] I θ (k + 1, n k) and θ [0, 1] I θ (k, n k + 1) are strictly increasing. Thereby, we can straightforwardly state the following result. Lemma 3.1 Given θ [0, 1], (i) for every k {0,..., n 1}, the map θ [0, 1] F θ (k) [0, 1] is strictly decreasing; (ii) for every k {1,..., n}, the map θ [0, 1] G θ (k) [0, 1] is strictly increasing. The subsequent lemma states useful properties concerning the real values l k and u k. Lemma 3.2 With the same notations as above, (i) the sequence (l k ) k {0,...,n} is strictly increasing and l n = (α/2) 1/n ; (ii) the sequence (u k ) k {0,...,n} is strictly increasing and u 0 = 1 (α/2) 1/n. (iii) for every given k {0,..., n}, l k < u k. Proof: Proof of statement (i). Given k {1,..., n}, l k (0, 1) and is therefore larger than l 0 = 0. It then suffices to prove that l k < l k+1 for k {1,..., n 1} to establish the strict increasingness of the sequence (l k ) k {0,...,n}. Let k {1,..., n 1}. Trivially, G lk+1 (k + 1) < G lk+1 (k). According to (5), G lk+1 (k+1) = G lk (k). It thus follows that G lk (k) < G lk+1 (k). According to lemma 1, statement (ii), we can conclude that l k < l k+1. For every θ [0, 1], G θ (n) = θ n. It then follows from (5) that l n n = α/2, which completes the proof of statement (i). Proof of statement (ii). The strict increasingness of the sequence (u k ) k {0,...,n} derives from the same type of arguments as those used for proving statement (i). Given k {0,..., n 1}, u k belongs to (0, 1) and is therefore less than u n = 1. It then suffices to show that u k < u k+1 for every k {0,..., n 2} to establish the strict increasingness of the sequence. Let k {0,..., n 2}. Trivially, F uk+1 (k + 1) > F uk+1 (k). According to (6), F uk+1 (k + 1) = F uk (k). Therefore, F uk (k) > F uk+1 (k). The result then follows from lemma 1, statement (i). 9

Given θ (0, 1), F θ (0) = (1 θ) n. Therefore, by (6), we obtain that F u0 (0) = (1 u 0 ) n = α/2. Proof of statement (iii). According to (1), (3) and the values of l n and u 0, statement (iii) trivially holds true for k = 0 and k = n. Consider any k {1,..., n 1}. We can write that F lk (k 1) = 1 G lk (k) = 1 (α/2) > α/2. This follows from the definition of G θ, (5) and the fact that α < 1. Since F lk (k) > F lk (k 1) and α/2 = F uk (k) according to (6), we finally obtain that F lk (k) > F uk (k). The fact that l k < u k then straightforwardly derives from lemma 1, statement (i). Lemma 3.3 With the same notations as above, (i) if p = 0, P ({ l X p }) = 1, (ii) if 0 < p (α/2) 1/n, P ({ l X p }) α/2 (iii) if (α/2) 1/n < p 1, P ({ l X p }) = 0. Proof: Statement (i) is trivial since l X 0. As far as statements (ii) and (iii) are concerned, it is convenient to introduce the set E = {k {0,..., n} : l k p}. Statement (i) of lemma 2 implies the following facts. First, E is non empty if and only if p (α/2) 1/n ; second, if 0 < p (α/2) 1/n, E = {m,..., n} where m 1 according to (1). Thereby, l X p if and only if X m. We therefore have that P ({l X p}) = G p (m). Now, since p l m, it follows from lemma 1, statement (ii), that G p (m) G lm (m). According to (5), the right hand side in this inequality equals α/2. Therefore, we obtain that P ({l X p}) α/2. If (α/2) 1/n < p 1, E is empty. Therefore, l X < p and statement (iii) follows. Lemma 3.4 With the same notations as above, (i) if 0 p < 1 (α/2) 1/n, P ({ u X p }) = 0, (ii) if 1 (α/2) 1/n p < 1, P ({ u X p }) α/2, (iii) if p = 1, P ({ u X p }) = 1. 10

Proof: Set F = {k {0,..., n} : u k p}. It follows from statement (ii) of lemma 2 that F is empty if and only if 0 p < 1 (α/2) 1/n. Therefore, if 0 p < 1 (α/2) 1/n, u X is larger than p and statement (i) holds true. Under the condition 1 (α/2) 1/n p < 1, F is not empty. According to statement (ii) of lemma 2 and (3), we obtain that F = {0,..., L} with L n 1. Therefore, the event {u X p} is the event {X L} so that P ({u X p}) = F p (L). Since u L p, it follows from statement (i) of lemma 1 that F p (L) F ul (L). According to (6), the right hand side in this inequality equals α/2 and statement (ii) follows. Statement (iii) trivially holds true since u X 1. We now complete the proof of proposition 2.1. According to lemma 3.2, statement (iii), l X < u X. Thereby, {p l X } {p < u X } so that P ({l X < p < u X }) = P ({p < u X }) P ({p l X }) = 1 P ({p u X }) P ({p l X }) Since the condition 1 (α/2) 1/n (α/2) 1/n is equivalent to n ln(α/2)/ ln(2), proposition 1 straightforwardly follows from lemmas 3.3 and 3.4. References [1] Abramowitz, M. and Stegun, I. (1972). Handbook of Mathematical Functions, Dover Publications, Inc., New York, Ninth printing. [2] Agresti, A. and Coull, B. A. (1998). Approximate is better than exact for interval estimation of binomial proportions, The American Statistican, 52, NO. 2, 119-126. [3] Brown, L. D. and Cai, T. and DasGupta, A. (2001). Interval Estimation for a Binomial Distribution, Statist. Sci., 16, 101-133. [4] Clopper, C. J. and Pearson, E. S. (1934). The use of confidence of fiducial limits illustrated in the case of the binomial, Biometrika, 26, 404-413. [5] Johnson, N. L. and Kotz, S. (1970). Distributions in Statistics, Continuous Univariate Distributions-2, John Wiley & Sons. [6] Johnson, N. L. and Kotz, S. (1970). Distributions in Statistics, Discrete Distributions, John Wiley & Sons. [7] Stuart, A. and Ord, J. K. (1987). Kendall s Advanced Theory of Statistics, Fifth Edition, Charles Griffin. 11