William C.L. Stewart 1,2,3 and Susan E. Hodge 1,2
|
|
- Damian Leonard
- 6 years ago
- Views:
Transcription
1 A Targeted Investigation into Clopper-Pearson Confidence Intervals William C.L. Stewart 1,2,3 and Susan E. Hodge 1,2 1Battelle Center for Mathematical Medicine, The Research Institute, Nationwide Children s Hospital, Columbus, Ohio, Department of Pediatrics, 3 Department of Statistics, The Ohio State University, Columbus, Ohio Corresponding author: William C.L. Stewart, Ph.D., Research Building III, 575 Children s Crossroad, Columbus, OH william.stewart@nationwidechildrens.org
2 Abstract In this special report, we consider Clopper-Pearson (CP) confidence intervals (CIs) for the binomial probability distribution. We review their direct construction from coverage probabilities, and review how this construction corresponds to the more common derivation based on hypothesis testing. We then explore some features of CP CIs: We use the direct construction to elucidate their bizarre patterns of coverage and to understand the origin of these patterns. We present an argument for constructing closed rather than open CIs; and we show how poorly the uncorrected normal approximation performs relative to CP CIs, even when n is large (e.g., n = 500). We briefly discuss the difference between CP and fiducial CIs. These insights should provide users with a different perspective on and appreciation of Clopper- Pearson CIs. Keywords: confidence intervals, Clopper-Pearson, fiducial intervals, and coverage
3 1. Introduction In addition to parameter estimation, one is often concerned with the question: How close is my estimate to the truth? Although confidence intervals (CIs) do not answer this question, they do provide random intervals that contain the truth with some minimum probability (e.g. 95%). As such, they are widely used and favored (by many) over hypothesis tests [e.g., Poole, 2001; Rothman et al., 2012; Rotondi and Donner, 2012; Cocks and Torgerson, 2013]. In this paper, we highlight some intriguing features of Clopper-Pearson (CP) CIs (Clopper and Pearson, 1934), which are both exact 1 and applicable to discrete distributions that are not amenable to the most common method of CI construction (Peña et al., 1992). CP CIs are usually derived from the acceptance regions of hypothesis tests, but they can be derived directly from the definition of coverage as shown in Appendix A; see also Schilling and Doi (2014). The direct derivation sheds light on the seemingly bizarre coverage pattern of CP CIs (Section 3), provides yet another rationale for reporting closed rather than open CIs (Section 4), and illustrates an important difference between fiducial intervals and CP CIs (Section 5). In Section 6, we demonstrate the surprisingly poor coverage of the normal approximation, even when CP CIs are constructed from fairly large numbers of independent Bernoulli trials. 2. Clopper-Pearson (CP) Confidence Intervals (CIs)! Consider the binomial probability mass function p(x;n,θ) = # n$ " x% & θ x (1 θ) n x with success probability θ. To construct an exact (1 α) CI for θ, we seek a lower confidence limit U(X) and an upper confidence limit V(X) defined on [0, 1] such that, for any fixed α in (0,1) (θ (U,V )) 1 α, θ (0,1). (1) For CP CIs, the coverage of the two outer intervals is also constrained, so that (θ [0,U]) α 2 and (θ [V,1]) α 2. (2) We refer to the probabilities in (2) as the left interval (LI) and right interval (RI) coverage, respectively, and we refer to their sum as the outer coverage; then one minus the outer coverage yields the CI coverage in (1). Given n trials and X = x observed successes, the lower and upper limits for the CP CI are defined as 1 Exact refers to the exact probabilities used to calculate the CIs, which are conservative, with coverage 1 α.
4 " u x = # 0 for x = 0 $ for 0 < x n θ L " and v x = θ U for 0 x < n #, respectively, where θ L and θ U are the implicit $ 1 for x = n solutions to n! n # $ " j % & θ j (1 θ) n j = α x! 2, and n # $ " j % & θ j (1 θ) n j = α 2, (3) j=x j=0 respectively. The equations in (3) are the standard ones for constructing CP CIs, and they yield the following left interval (LI) and right interval (RI) coverages, respectively: $ (0,α 2] for u (U θ) : 0 <θ u n % & = 0 for u n <θ 1, # (V θ) = = 0 for 0 θ < v 0 $ % (0,α 2] for v 0 θ < v n (4) Accordingly, the CP CI coverage is 1 minus the sum of LI and RI coverage probabilities in (4): $ [1 α 2, 1) for 0 <θ < v 0 & (U <θ < V ) = % [1 α, 1) for v 0 θ u n & ' [1 α 2, 1) for u n <θ <1 (5) The special cases in which θ = 0 and θ =1 are discussed in Section 4. (Pastor (2005) has similar equations but breaks down the cases differently, depending on the value of n.) 3. Shapes of coverage curves Figure 1 graphs LI and RI coverages as separate functions of θ, for n = 10 and 20. The LI (RI) coverage function has discontinuities whenever θ happens to equal u j (v j,) for j in {0,, n}. For example, if θ happens to equals u j for j in {1,, n}, then the LI coverage equals α 2. However, because the LI coverage decreases smoothly and monotonically in the direction of u j 1, there are discontinuities at u j 1 where the coverage jumps to α 2. 2 Also, because the LI coverage is zero whenever θ exceeds u n, the function is discontinuous at u n as well. A similar line of reasoning explains the discontinuities in the RI coverage function. This leads to the forward and backward J -shaped segments of Figure 1. The forward segments correspond to the LI coverage function, while the backward segments correspond to the RI coverage function. Moreover, as n increases, the total number of segments increases, and the LI segments spread to the right, while the RI segments spread to the left. Because the coverage of the CP CI is simply 1 2 LI coverage jumps to 1 when θ = 0, and RI coverage jumps to 1 when θ =1 ; see Section 4.
5 minus the outer coverage, the interplay between forward and backward segments explains why the coverage functions for CP CIs are increasingly strange and discontinuous as n increases (Figure 2). For n > 5, the forward and backward segments overlap symmetrically about θ =.5, and the amount of overlap also increases with increasing n. (a) Figure 1. (b) (U θ) (V θ) θ Figure 1. LI (left interval) coverage, in red, and RI (right interval) coverage, in blue, as functions of, α =.05 θ = u j θ = v j (a) for n = 10 and (b) for n = 20 (with ). Black marks indicate those points ( for the LIs, for the RIs) at which coverage is exactly.025. Probabilities are plotted for a dense set of discrete values along (0, 1), so as not to obscure the discontinuities.
6 (a) Figure 2. (b) (U <θ < V ) Figure 2. CI coverage, as function of, (a) for n = 10 and (b) for n = 20. These curves result from subtracting the sum of Figure 1 s red and blue curves from 1 at each value of. θ 4. Open versus closed Cis Some authors have advocated for open CP CIs, whereas others suggest that the intervals should be closed. Casella (1986) states that CIs are always defined as closed, but points out that, in practice, there is no real difference between considering the intervals open or closed, and has advocated for half-open θ
7 CIs, i.e., in our notation, (u x, v x ]. Some authors, e.g., Zwillinger and Koskosa (2000), Fleiss et al. (2003), and Pastor (2005) report open CIs, whereas others, e.g., Lehmann and Romano (1959) and Casella and Berger (2002), imply the CI is closed. Stuart and Ord (1991) imply that both the CI and the outer intervals are closed, which seems logically inconsistent. Moreover, the entire issue of whether one should report open or closed CIs is rarely discussed. Nevertheless, there are good reasons for reporting closed CP CIs. First, when θ happens to equal one of the confidence limits, the open CP CI does not include the true value. For example, if n = 10 and θ =.1216, the event X = 4 (which occurs with probability.0201) yields the open CI (.1216,.7376), which does not include the true value. In general, when θ equals u j or v j there is a small but non-negligible probability (.0201 in this example) that the endpoint of the interval will equal θ. Whenever this happens, [u x, v x ] which is only infinitesimally larger than (u x, v x ), will have greater coverage. This is especially important when θ equals 0, since the open CP CI, (0,v 0 ), never includes θ, and its coverage is zero. In contrast, the closed CP CI, [0, v 0 ], always includes θ, so its coverage is one. A similar pathology exists when θ equals 1: (u n, 1) has coverage zero, whereas [u n, 1] has coverage one. Second, the special relationship between hypothesis testing and confidence intervals seems to suggest that CP CIs should be closed. Briefly, the validity of an exact CI constructed from an inverted hypothesis test is rooted in the following tautology: θ C(x 1,, x n ) (x 1,, x n ) A(θ), where C is a confidence interval that depends on the observed data, and A is the acceptance region of a level α hypothesis test (Lehmann and Romano, 1959; Casella and Berger, 2002). Moreover, because the equations in (3) provide unique solutions: θ L and θ U, and because the binomial distribution permits a stochastic ordering, the confidence interval C is implicitly closed. Hence, from a set theoretic perspective, and from the vantage point of robust coverage, it is both legitimate and logically more consistent to report closed CIs rather than open ones. 5. Behavior of the normal approximation for large n The normal approximation arises from the central limit theorem applied to ˆ θ = x = x n, and as a general rule of thumb, the approximation is presumed to be reasonably close for n 30. We briefly
8 examine CI coverage when the normal approximation is used for large samples. 3 We compare the normal approximation confidence limits with the exact CP CI limits (Figure 3). For n = 30 and α =.05, the two sets of limits appear to be quite close, with the normal-approximation limits a little low for observations less than n/2 and a little high for observations greater than n/2. However, the coverage probabilities from the normal approximation deviate considerably from the exact coverage probabilities of the CP CIs, with coverage well below 95% for most of the parameter space when n = 30 and α =.05 (Figure 4). We define Figure 3. Lower and upper 95% confidence limits, for n = 30, calculated by both Clopper-Pearson method and normal approximation. Q as the proportion of the parameter space that has coverage of at least 95%. Figure 5 shows that even increasing the sample size to n = 500, yields a Q of only.258. Moreover, even when the coverage criterion 3 We considered only the uncorrected normal approximation, where the 95% CI s lower limit is x 1.96 SEP and the upper limit is x SEP, with SEP = x(1 x) n.
9 is relaxed (requiring that CI coverage only exceed 94%, for example), it takes nearly 100 independent samples to attain a Q-value of 50% (details not shown). 6. Fiducial intervals One type of fiducial interval is defined by the set of [parameter] values that could have given rise to the observed value with the specified probability [greater than or equal to] 1 α (Wang, 2000; (a) Figure 4. (b) Figure 4. CI coverage, as function of, compared between (a) Clopper-Pearson intervals and (b) normal approximations, for n = 30,. The vertical axis is shown from 0.9 to 1. CP coverage is for all values of, whereas normal approximation coverage is for most values of. θ α = θ <.95 θ
10 Hannig et al., 2016). This contrasts with the focus of the CP CI on coverage probabilities (Equation 1). The two approaches: CP CIs and fiducial intervals, yield identical intervals for all outcomes except X = 0 and X = n. When X = n, one finds from Equation (3) that θ L equals α 2. However, the fiducial interval { } ( ) 1 n would yield θ L = α 1 n, since this set θ : α 1 n θ 1 satisfies the fiducial definition above. (Similarly, Figure 5. Figure 5. Graph of Q (proportion of the parameter space with at least 95% coverage) versus sample size n, when the normal approximation (without continuity correction) is used, for sample sizes from 20 to 500. ( ) 1 n 1 α θ when X = 0 the fiducial upper limit is θ U =1 α 1 n instead of θ U =1 α 2, which is the corresponding CP CI upper limit.) Thus, the fiducial approach does not ensure coverage of at least for all. Specifically, for values of θ between ( α 2) 1 n and α 1 n, LI (left interval) coverage will be greater than α 2 ; it will increase from α 2 to α as θ moves from ( α 2) 1 n to α 1 n ; and similarly for RI coverage for θ between 1 α 1 n and 1 ( α 2) 1 n. This removes the guarantee that CI coverage will never dip below 1 α for all θ. (Whether the CI coverage is actually <1 α for a given θ depends on the interplay between the RI and LI coverages at θ.) Figure 6 shows the differences in coverage for n =10.
11 Figure 6. (a) (b) (c) (d) Figure 6. LI and RI coverages and CI coverage, compared between CP vs. fiducial CIs, for n = 10 and. (a) LI and RI coverages from CP CIs; same as Figure 1(a), but on different vertical scale. (b) CI coverage from CP CIs; same as Figure 2(a), but on different vertical scale. (c) LI and RI coverages from fiducial intervals. LI coverage exceeds.025 for between ~.692 and ~.741; RI coverage exceeds.025 for θ θ between ~.259 and ~.308. (d) CI coverages from fiducial intervals. Coverage differs from that in (b) for between ~.259 and ~.308, and between ~.692 and ~.741. α = Conclusions We examine Clopper-Pearson binomial confidence intervals through the lens of coverage probabilities (see Appendix A for details). The bizarre and discontinuous patterns of CI coverage, as a function of θ, are more clearly understood as arising from the overlaps between the forward and backward J -shaped LI (left interval) and RI (right interval) coverage functions. After considering settheoretic arguments, and after carefully considering the impact of infinitesimally larger CP CIs, a clear and logically consistent rationale for reporting closed CIs (i.e., CIs that include the endpoints) emerges. Comparing CP intervals with those obtained from the (uncorrected) normal approximation reveals the θ
12 surprising result that even at large sample sizes, e.g., n = 500, the normal approximation yields conservative coverage for only a small proportion of the parameter space. While it is well known that the uncorrected normal approximation does not behave well for small n (e.g., Brown et al. 2001, 2002), it is perhaps less widely appreciated how poorly this approximation continues to perform for larger sample sizes as well. Finally, the distinction between one type of fiducial intervals and CP CIs is demonstrated by the differences in coverage when X = 0 or X = n. Acknowledgments This project was supported by The Research Institute, Nationwide Children s Hospital, Columbus, Ohio. References Brown, L.D., Cai, T.T., DasGupta, A. (2001). Interval estimation for a binomial proportion. Statistical Sci. 16, Brown, L.D., Cai, T.T., DasGupta, A. (2002). Confidence intervals for a binomial proportion and asymptotic expansions. Ann. Stat. 30, Casella, G. (1986). Refining binomial confidence intervals. Can. J. Stat. 14, Casella, G., Berger, R.L. (2002). Statistical Inference. Duxbury, Wadsworth Group, Boston. Clopper, C., Pearson, E.S. (1934). The use of confidence or fiducial limits illustrated in the case of the binomial. Biometrika 26, Cocks, K., Torgerson, D.J. (2013). J. Clin. Epidemiol. 66, Fleiss, J.L., Levin, B., Paik, M.C. (2003). Statistical Methods for Rates and Proportions, 3 rd ed. Wiley Interscience, John Wiley & Sons, Hoboken NJ. Hannig, J., Iyer, H.K., Lai, R.C.S., Lee, T.C.M. (2016) Generalized fiducial infrence: A review and new results. J Am Stat Assoc 111: Lehmann, E.L., Romano, J.P. (1959). Testing Statistical Hypotheses, 3 rd ed., Springer, New York, NY. Pastor, D. (2005). On the coverage probability of the Clopper-Pearson confidence interval. Technical report, ENST Bretagne. Peña, E.A., Rohatgi, V.K., Székely, G.J. (1992). On the non-existence of ancillary statistics. Stat. Prob. Letters 15, Poole, C. Low P-values or narrow confidence intervals: which are more durable? (2001) Epidemiology 12:
13 Rothman, K.J., Greenland, S., Lash, T. (2012) Modern Epidemiology, 3 rd edition. Wolters Kluwer / Lippincott Williams & Wilkins. Rotondi, M.A., Donner, A. (2012) A confidence interval approach to sample size estimation for interobserver agreement studies with multiple raters and outcomes. J Clin Epidemiol 65: Schilling, M.F., Doi J.A. (2014). A coverage probability approach to finding an optimal binomial confidence procedure. Am. Statist. 68, Stuart, A., Ord, J.K. (1991). Kendall s Advanced Theory of Statistics. Vol 2. Classical Inference and Relationship, 5 th ed. Oxford U Press, New York. Wang, Y.H. (2000). Fiducial intervals: what are they? Am Statist 54, Zwillinger, D., Kokoska, S. (2000). Standard Probability and Statistics Tables and Formulae. Chapman & Hall/CRC, New York. Appendix A: Direct construction of CP CIs Let θ denote the true but unknown value of the binomial parameter in the probability mass! function p(x;n,θ) = # n$. To construct an exact confidence interval (CI) for the binomial " x% & θ x (1 θ) n x (1 α) success probability, we seek random variables U < V, with U and V defined on [0, 1] such that the open interval (U, V) has probability of at least 1 α of covering θ : (θ (U,V )) 1 α. There is an additional constraint of symmetry on the two outer intervals: (θ [0,U]) α 2 and (θ [V,1]) α. (A.1) 2 We refer to the probabilities in (A.1) as the left interval (LI) and right interval (RI) coverage, respectively; the random variables U and V are the lower and upper confidence limits. The CI coverage equals 1 minus the sum of the two probabilities in (A.1): (θ (U,V )) =1 (U θ) (V θ) (A.2) Note that U is a discrete random variable, defined on a set of n +1 values: U :{ 0 = u 0 < u 1 < u n <1}. (A.3) Each value of u corresponds to exactly one value of x, where the random variable X is defined on { } u j x j X : 0, 1,, n. There is a one-to-one relationship between and for j = 0, n. Therefore,
14 (U = u j ) = (X = j). Similarly, V is a random variable, defined on V : 0 < v 0 < v 1 < v n =1. By u 0 convention, is set to 0 and is set to 1. v n The LI coverage in (A.1) can be expressed as an expectation: { } (U θ) n = I { u j θ} (X = j), (A.4) j=0 { } u j θ u j where I u j θ takes the value 1 if, and 0 otherwise. Thus, only the values of that are greater than or equal to θ contribute to the sum in (A.4). Although we do not know the values, we do know that they are discrete points in [0, 1], and that they are ordered as in (A.3). Therefore, (U θ) = (X 1) for 0 <θ u 1. (A.5) u j Equation (A.5) converts a probability concerning U to one concerning X. Moreover, if we set u 1 to θ *, where θ * is the unique solution to (X 1) = α, then for 0 <θ u 1 the LI coverage in (A.5) is less 2 n! n than or equal to α 2. In general, if we define u j implicitly as the solution to # $, then " j % & θ j (1 θ) n j = α 2 the LI coverage will be less than or equal to α 2 for 0 <θ 1. Thus, we have found the lower confidence limit for the CP CI directly from the LI coverage criterion in (A.1). For upper limits V, follow the same reasoning, but with reversed inequalities: P[V θ 0 θ =θ 0 ] = P[X j θ =θ 0 ], for v j θ 0 < v j+1, j = 0,, n 1, where each v x is defined implicitly as x! n the solution to # $. " j % & θ j (1 θ) n j = α 2 j=0 In summary, this approach arrives at the standard equations used to construct the CP CI, yet without using the standard test inversion reasoning. The corresponding left and right interval coverage probabilities respectively are: j=x $ (0,α 2] for 0 <θ u (U θ) : n % & = 0 for u n <θ 1 and # = 0 for 0 θ < v P. (A.6) θ (V θ) : 0 $ % (0,α 2] for v 0 θ <1 Accordingly, as in (A.2), the CI coverage is 1 minus the sum of LI and RI coverage probabilities in (A.6):
15 $ [1 α 2, 1) for 0 <θ < v 0 & (U <θ < V ) = % [1 α, 1) for v 0 θ u n & ' [1 α 2, 1) for u n <θ <1, which is the same as Equation (5) in the text.
arxiv: v1 [math.st] 5 Jul 2007
EXPLICIT FORMULA FOR COSTRUCTIG BIOMIAL COFIDECE ITERVAL WITH GUARATEED COVERAGE PROBABILITY arxiv:77.837v [math.st] 5 Jul 27 XIJIA CHE, KEMI ZHOU AD JORGE L. ARAVEA Abstract. In this paper, we derive
More informationTesting Simple Hypotheses R.L. Wolpert Institute of Statistics and Decision Sciences Duke University, Box Durham, NC 27708, USA
Testing Simple Hypotheses R.L. Wolpert Institute of Statistics and Decision Sciences Duke University, Box 90251 Durham, NC 27708, USA Summary: Pre-experimental Frequentist error probabilities do not summarize
More informationLawrence D. Brown, T. Tony Cai and Anirban DasGupta
Statistical Science 2005, Vol. 20, No. 4, 375 379 DOI 10.1214/088342305000000395 Institute of Mathematical Statistics, 2005 Comment: Fuzzy and Randomized Confidence Intervals and P -Values Lawrence D.
More informationFiducial Inference and Generalizations
Fiducial Inference and Generalizations Jan Hannig Department of Statistics and Operations Research The University of North Carolina at Chapel Hill Hari Iyer Department of Statistics, Colorado State University
More informationApproximate and Fiducial Confidence Intervals for the Difference Between Two Binomial Proportions
Approximate and Fiducial Confidence Intervals for the Difference Between Two Binomial Proportions K. Krishnamoorthy 1 and Dan Zhang University of Louisiana at Lafayette, Lafayette, LA 70504, USA SUMMARY
More informationWeizhen Wang & Zhongzhan Zhang
Asymptotic infimum coverage probability for interval estimation of proportions Weizhen Wang & Zhongzhan Zhang Metrika International Journal for Theoretical and Applied Statistics ISSN 006-1335 Volume 77
More informationarxiv: v1 [stat.ap] 17 Mar 2012
A Note On the Use of Fiducial Limits for Control Charts arxiv:1203.3882v1 [stat.ap] 17 Mar 2012 Mokin Lee School of Mechanical Automotive Engineering University of Ulsan Ulsan, South Korea Chanseok Park
More informationOn the coverage probability of the Clopper-Pearson confidence interval
On the coverage probability of the Clopper-Pearson confidence interval Sur la probabilité de couverture de l intervalle de confiance de Clopper-Pearson Rapport Interne GET / ENST Bretagne Dominique Pastor
More informationA New Confidence Interval for the Difference Between Two Binomial Proportions of Paired Data
UW Biostatistics Working Paper Series 6-2-2003 A New Confidence Interval for the Difference Between Two Binomial Proportions of Paired Data Xiao-Hua Zhou University of Washington, azhou@u.washington.edu
More informationCharles Geyer University of Minnesota. joint work with. Glen Meeden University of Minnesota.
Fuzzy Confidence Intervals and P -values Charles Geyer University of Minnesota joint work with Glen Meeden University of Minnesota http://www.stat.umn.edu/geyer/fuzz 1 Ordinary Confidence Intervals OK
More informationCommentary on Basu (1956)
Commentary on Basu (1956) Robert Serfling University of Texas at Dallas March 2010 Prepared for forthcoming Selected Works of Debabrata Basu (Anirban DasGupta, Ed.), Springer Series on Selected Works in
More informationStatistics for Applications Spring 2009
MIT OpenCourseWare http://ocw.mit.edu 18.443 Statistics for Applications Spring 2009 For information about citing these materials or our Terms of Use, visit: http://ocw.mit.edu/terms. CONFIDENCE INTERVALS
More informationTHE SHORTEST CLOPPER PEARSON RANDOM- IZED CONFIDENCE INTERVAL FOR BINOMIAL PROBABILITY
REVSTAT Statistical Journal Volume 15, Number 1, January 2017, 141 153 THE SHORTEST CLOPPER PEARSON RANDOM- IZED CONFIDENCE INTERVAL FOR BINOMIAL PROBABILITY Author: Wojciech Zieliński Department of Econometrics
More informationSuperiority by a Margin Tests for One Proportion
Chapter 103 Superiority by a Margin Tests for One Proportion Introduction This module provides power analysis and sample size calculation for one-sample proportion tests in which the researcher is testing
More informationConstructing Ensembles of Pseudo-Experiments
Constructing Ensembles of Pseudo-Experiments Luc Demortier The Rockefeller University, New York, NY 10021, USA The frequentist interpretation of measurement results requires the specification of an ensemble
More informationTesting Statistical Hypotheses
E.L. Lehmann Joseph P. Romano Testing Statistical Hypotheses Third Edition 4y Springer Preface vii I Small-Sample Theory 1 1 The General Decision Problem 3 1.1 Statistical Inference and Statistical Decisions
More informationTheory and Methods of Statistical Inference. PART I Frequentist theory and methods
PhD School in Statistics cycle XXVI, 2011 Theory and Methods of Statistical Inference PART I Frequentist theory and methods (A. Salvan, N. Sartori, L. Pace) Syllabus Some prerequisites: Empirical distribution
More informationUnit 14: Nonparametric Statistical Methods
Unit 14: Nonparametric Statistical Methods Statistics 571: Statistical Methods Ramón V. León 8/8/2003 Unit 14 - Stat 571 - Ramón V. León 1 Introductory Remarks Most methods studied so far have been based
More informationTheory and Methods of Statistical Inference
PhD School in Statistics cycle XXIX, 2014 Theory and Methods of Statistical Inference Instructors: B. Liseo, L. Pace, A. Salvan (course coordinator), N. Sartori, A. Tancredi, L. Ventura Syllabus Some prerequisites:
More informationPROD. TYPE: COM. Simple improved condence intervals for comparing matched proportions. Alan Agresti ; and Yongyi Min UNCORRECTED PROOF
pp: --2 (col.fig.: Nil) STATISTICS IN MEDICINE Statist. Med. 2004; 2:000 000 (DOI: 0.002/sim.8) PROD. TYPE: COM ED: Chandra PAGN: Vidya -- SCAN: Nil Simple improved condence intervals for comparing matched
More informationIn Defence of Score Intervals for Proportions and their Differences
In Defence of Score Intervals for Proportions and their Differences Robert G. Newcombe a ; Markku M. Nurminen b a Department of Primary Care & Public Health, Cardiff University, Cardiff, United Kingdom
More informationUncertainty due to Finite Resolution Measurements
Uncertainty due to Finite Resolution Measurements S.D. Phillips, B. Tolman, T.W. Estler National Institute of Standards and Technology Gaithersburg, MD 899 Steven.Phillips@NIST.gov Abstract We investigate
More informationSimple Sensitivity Analysis for Differential Measurement Error. By Tyler J. VanderWeele and Yige Li Harvard University, Cambridge, MA, U.S.A.
Simple Sensitivity Analysis for Differential Measurement Error By Tyler J. VanderWeele and Yige Li Harvard University, Cambridge, MA, U.S.A. Abstract Simple sensitivity analysis results are given for differential
More informationTheory and Methods of Statistical Inference. PART I Frequentist likelihood methods
PhD School in Statistics XXV cycle, 2010 Theory and Methods of Statistical Inference PART I Frequentist likelihood methods (A. Salvan, N. Sartori, L. Pace) Syllabus Some prerequisites: Empirical distribution
More informationA Simple Approximate Procedure for Constructing Binomial and Poisson Tolerance Intervals
This article was downloaded by: [Kalimuthu Krishnamoorthy] On: 11 February 01, At: 08:40 Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 107954 Registered office:
More informationInterval estimation via tail functions
Page 1 of 42 Interval estimation via tail functions Borek Puza 1,2 and Terence O'Neill 1, 6 May 25 1 School of Finance and Applied Statistics, Australian National University, ACT 2, Australia. 2 Corresponding
More informationBasic Concepts of Inference
Basic Concepts of Inference Corresponds to Chapter 6 of Tamhane and Dunlop Slides prepared by Elizabeth Newton (MIT) with some slides by Jacqueline Telford (Johns Hopkins University) and Roy Welsch (MIT).
More informationTest Volume 11, Number 1. June 2002
Sociedad Española de Estadística e Investigación Operativa Test Volume 11, Number 1. June 2002 Optimal confidence sets for testing average bioequivalence Yu-Ling Tseng Department of Applied Math Dong Hwa
More informationTesting Statistical Hypotheses
E.L. Lehmann Joseph P. Romano, 02LEu1 ttd ~Lt~S Testing Statistical Hypotheses Third Edition With 6 Illustrations ~Springer 2 The Probability Background 28 2.1 Probability and Measure 28 2.2 Integration.........
More informationUnderstanding Ding s Apparent Paradox
Submitted to Statistical Science Understanding Ding s Apparent Paradox Peter M. Aronow and Molly R. Offer-Westort Yale University 1. INTRODUCTION We are grateful for the opportunity to comment on A Paradox
More informationIgnoring the matching variables in cohort studies - when is it valid, and why?
Ignoring the matching variables in cohort studies - when is it valid, and why? Arvid Sjölander Abstract In observational studies of the effect of an exposure on an outcome, the exposure-outcome association
More informationWaves in a Shock Tube
Waves in a Shock Tube Ivan Christov c February 5, 005 Abstract. This paper discusses linear-wave solutions and simple-wave solutions to the Navier Stokes equations for an inviscid and compressible fluid
More informationAsymptotic Relative Efficiency in Estimation
Asymptotic Relative Efficiency in Estimation Robert Serfling University of Texas at Dallas October 2009 Prepared for forthcoming INTERNATIONAL ENCYCLOPEDIA OF STATISTICAL SCIENCES, to be published by Springer
More informationINVARIANT SMALL SAMPLE CONFIDENCE INTERVALS FOR THE DIFFERENCE OF TWO SUCCESS PROBABILITIES
INVARIANT SMALL SAMPLE CONFIDENCE INTERVALS FOR THE DIFFERENCE OF TWO SUCCESS PROBABILITIES Thomas J Santner Department of Statistics 1958 Neil Avenue Ohio State University Columbus, OH 4210 Shin Yamagami
More informationGaussian Estimation under Attack Uncertainty
Gaussian Estimation under Attack Uncertainty Tara Javidi Yonatan Kaspi Himanshu Tyagi Abstract We consider the estimation of a standard Gaussian random variable under an observation attack where an adversary
More informationStat 5421 Lecture Notes Fuzzy P-Values and Confidence Intervals Charles J. Geyer March 12, Discreteness versus Hypothesis Tests
Stat 5421 Lecture Notes Fuzzy P-Values and Confidence Intervals Charles J. Geyer March 12, 2016 1 Discreteness versus Hypothesis Tests You cannot do an exact level α test for any α when the data are discrete.
More informationConfidence Intervals of the Simple Difference between the Proportions of a Primary Infection and a Secondary Infection, Given the Primary Infection
Biometrical Journal 42 (2000) 1, 59±69 Confidence Intervals of the Simple Difference between the Proportions of a Primary Infection and a Secondary Infection, Given the Primary Infection Kung-Jong Lui
More informationHypothesis Testing. 1 Definitions of test statistics. CB: chapter 8; section 10.3
Hypothesis Testing CB: chapter 8; section 0.3 Hypothesis: statement about an unknown population parameter Examples: The average age of males in Sweden is 7. (statement about population mean) The lowest
More informationModern Likelihood-Frequentist Inference. Donald A Pierce, OHSU and Ruggero Bellio, Univ of Udine
Modern Likelihood-Frequentist Inference Donald A Pierce, OHSU and Ruggero Bellio, Univ of Udine Shortly before 1980, important developments in frequency theory of inference were in the air. Strictly, this
More informationINTRODUCTION TO INTERSECTION-UNION TESTS
INTRODUCTION TO INTERSECTION-UNION TESTS Jimmy A. Doi, Cal Poly State University San Luis Obispo Department of Statistics (jdoi@calpoly.edu Key Words: Intersection-Union Tests; Multiple Comparisons; Acceptance
More informationModified Large Sample Confidence Intervals for Poisson Distributions: Ratio, Weighted Average and Product of Means
Modified Large Sample Confidence Intervals for Poisson Distributions: Ratio, Weighted Average and Product of Means K. KRISHNAMOORTHY a, JIE PENG b AND DAN ZHANG a a Department of Mathematics, University
More informationK. Krishnamoorthy & Shanshan Lv
Highest posterior mass prediction intervals for binomial and poisson distributions K. Krishnamoorthy & Shanshan Lv Metrika International Journal for Theoretical and Applied Statistics ISSN 0026-1335 Volume
More informationSolution: chapter 2, problem 5, part a:
Learning Chap. 4/8/ 5:38 page Solution: chapter, problem 5, part a: Let y be the observed value of a sampling from a normal distribution with mean µ and standard deviation. We ll reserve µ for the estimator
More informationAn Introduction to Multivariate Statistical Analysis
An Introduction to Multivariate Statistical Analysis Third Edition T. W. ANDERSON Stanford University Department of Statistics Stanford, CA WILEY- INTERSCIENCE A JOHN WILEY & SONS, INC., PUBLICATION Contents
More informationAnalysis of Thompson Sampling for the multi-armed bandit problem
Analysis of Thompson Sampling for the multi-armed bandit problem Shipra Agrawal Microsoft Research India shipra@microsoft.com avin Goyal Microsoft Research India navingo@microsoft.com Abstract We show
More informationExact unconditional tests for a 2 2 matched-pairs design
Statistical Methods in Medical Research 2003; 12: 91^108 Exact unconditional tests for a 2 2 matched-pairs design RL Berger Statistics Department, North Carolina State University, Raleigh, NC, USA and
More informationProbability and Statistics
Probability and Statistics Kristel Van Steen, PhD 2 Montefiore Institute - Systems and Modeling GIGA - Bioinformatics ULg kristel.vansteen@ulg.ac.be CHAPTER 4: IT IS ALL ABOUT DATA 4a - 1 CHAPTER 4: IT
More informationPower Comparison of Exact Unconditional Tests for Comparing Two Binomial Proportions
Power Comparison of Exact Unconditional Tests for Comparing Two Binomial Proportions Roger L. Berger Department of Statistics North Carolina State University Raleigh, NC 27695-8203 June 29, 1994 Institute
More informationFuzzy and Randomized Confidence Intervals and P-values
Fuzzy and Randomized Confidence Intervals and P-values Charles J. Geyer and Glen D. Meeden June 15, 2004 Abstract. The optimal hypothesis tests for the binomial distribution and some other discrete distributions
More informationWeek 8: Correlation and Regression
Health Sciences M.Sc. Programme Applied Biostatistics Week 8: Correlation and Regression The correlation coefficient Correlation coefficients are used to measure the strength of the relationship or association
More informationBayesian methods for sample size determination and their use in clinical trials
Bayesian methods for sample size determination and their use in clinical trials Stefania Gubbiotti Abstract This paper deals with determination of a sample size that guarantees the success of a trial.
More informationEstimation and sample size calculations for correlated binary error rates of biometric identification devices
Estimation and sample size calculations for correlated binary error rates of biometric identification devices Michael E. Schuckers,11 Valentine Hall, Department of Mathematics Saint Lawrence University,
More informationPlugin Confidence Intervals in Discrete Distributions
Plugin Confidence Intervals in Discrete Distributions T. Tony Cai Department of Statistics The Wharton School University of Pennsylvania Philadelphia, PA 19104 Abstract The standard Wald interval is widely
More informationChapter 15 Confidence Intervals for Mean Difference Between Two Delta-Distributions
Chapter 15 Confidence Intervals for Mean Difference Between Two Delta-Distributions Karen V. Rosales and Joshua D. Naranjo Abstract Traditional two-sample estimation procedures like pooled-t, Welch s t,
More informationCoverage Properties of Confidence Intervals for Proportions in Complex Sample Surveys
Coverage Properties of Confidence Intervals for Proportions in Complex Sample Surveys Carolina Franco Roderick J. A. Little Thomas A. Louis Eric V. Slud Abstract The challenges of constructing confidence
More informationA COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky
A COMPARISON OF POISSON AND BINOMIAL EMPIRICAL LIKELIHOOD Mai Zhou and Hui Fang University of Kentucky Empirical likelihood with right censored data were studied by Thomas and Grunkmier (1975), Li (1995),
More informationInterval Estimation. Chapter 9
Chapter 9 Interval Estimation 9.1 Introduction Definition 9.1.1 An interval estimate of a real-values parameter θ is any pair of functions, L(x 1,..., x n ) and U(x 1,..., x n ), of a sample that satisfy
More informationTHE SHORTEST CONFIDENCE INTERVAL FOR PROPORTION IN FINITE POPULATIONS
APPLICATIOES MATHEMATICAE Online First version Wojciech Zieliński (Warszawa THE SHORTEST COFIDECE ITERVAL FOR PROPORTIO I FIITE POPULATIOS Abstract. Consider a finite population. Let θ (0, 1 denote the
More informationTest Sample and Size. Synonyms. Definition. Main Body Text. Michael E. Schuckers 1. Sample Size; Crew designs
Test Sample and Size Michael E. Schuckers 1 St. Lawrence University, Canton, NY 13617, USA schuckers@stlawu.edu Synonyms Sample Size; Crew designs Definition The testing and evaluation of biometrics is
More informationAPPENDIX B Sample-Size Calculation Methods: Classical Design
APPENDIX B Sample-Size Calculation Methods: Classical Design One/Paired - Sample Hypothesis Test for the Mean Sign test for median difference for a paired sample Wilcoxon signed - rank test for one or
More informationGeneralized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence
Generalized Linear Model under the Extended Negative Multinomial Model and Cancer Incidence Sunil Kumar Dhar Center for Applied Mathematics and Statistics, Department of Mathematical Sciences, New Jersey
More informationa. See the textbook for examples of proving logical equivalence using truth tables. b. There is a real number x for which f (x) < 0. (x 1) 2 > 0.
For some problems, several sample proofs are given here. Problem 1. a. See the textbook for examples of proving logical equivalence using truth tables. b. There is a real number x for which f (x) < 0.
More informationReports of the Institute of Biostatistics
Reports of the Institute of Biostatistics No 02 / 2008 Leibniz University of Hannover Natural Sciences Faculty Title: Properties of confidence intervals for the comparison of small binomial proportions
More informationHYPOTHESIS TESTING: FREQUENTIST APPROACH.
HYPOTHESIS TESTING: FREQUENTIST APPROACH. These notes summarize the lectures on (the frequentist approach to) hypothesis testing. You should be familiar with the standard hypothesis testing from previous
More informationMathematical Statistics
Mathematical Statistics MAS 713 Chapter 8 Previous lecture: 1 Bayesian Inference 2 Decision theory 3 Bayesian Vs. Frequentist 4 Loss functions 5 Conjugate priors Any questions? Mathematical Statistics
More informationF79SM STATISTICAL METHODS
F79SM STATISTICAL METHODS SUMMARY NOTES 9 Hypothesis testing 9.1 Introduction As before we have a random sample x of size n of a population r.v. X with pdf/pf f(x;θ). The distribution we assign to X is
More informationOne-sample categorical data: approximate inference
One-sample categorical data: approximate inference Patrick Breheny October 6 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/25 Introduction It is relatively easy to think about the distribution
More informationSection Inference for a Single Proportion
Section 8.1 - Inference for a Single Proportion Statistics 104 Autumn 2004 Copyright c 2004 by Mark E. Irwin Inference for a Single Proportion For most of what follows, we will be making two assumptions
More informationApplying the Benjamini Hochberg procedure to a set of generalized p-values
U.U.D.M. Report 20:22 Applying the Benjamini Hochberg procedure to a set of generalized p-values Fredrik Jonsson Department of Mathematics Uppsala University Applying the Benjamini Hochberg procedure
More informationAdjusted Empirical Likelihood for Long-memory Time Series Models
Adjusted Empirical Likelihood for Long-memory Time Series Models arxiv:1604.06170v1 [stat.me] 21 Apr 2016 Ramadha D. Piyadi Gamage, Wei Ning and Arjun K. Gupta Department of Mathematics and Statistics
More informationRecall the Basics of Hypothesis Testing
Recall the Basics of Hypothesis Testing The level of significance α, (size of test) is defined as the probability of X falling in w (rejecting H 0 ) when H 0 is true: P(X w H 0 ) = α. H 0 TRUE H 1 TRUE
More informationON SMALL SAMPLE PROPERTIES OF PERMUTATION TESTS: INDEPENDENCE BETWEEN TWO SAMPLES
ON SMALL SAMPLE PROPERTIES OF PERMUTATION TESTS: INDEPENDENCE BETWEEN TWO SAMPLES Hisashi Tanizaki Graduate School of Economics, Kobe University, Kobe 657-8501, Japan e-mail: tanizaki@kobe-u.ac.jp Abstract:
More informationInference for Binomial Parameters
Inference for Binomial Parameters Dipankar Bandyopadhyay, Ph.D. Department of Biostatistics, Virginia Commonwealth University D. Bandyopadhyay (VCU) BIOS 625: Categorical Data & GLM 1 / 58 Inference for
More informationLoglikelihood and Confidence Intervals
Stat 504, Lecture 2 1 Loglikelihood and Confidence Intervals The loglikelihood function is defined to be the natural logarithm of the likelihood function, l(θ ; x) = log L(θ ; x). For a variety of reasons,
More informationFiducial Generalized Pivots for a Variance Component vs. an Approximate Confidence Interval
Fiducial Generalized Pivots for a Variance Component vs. an Approximate Confidence Interval B. Arendacká Institute of Measurement Science, Slovak Academy of Sciences, Dúbravská cesta 9, Bratislava, 84
More informationA Note on UMPI F Tests
A Note on UMPI F Tests Ronald Christensen Professor of Statistics Department of Mathematics and Statistics University of New Mexico May 22, 2015 Abstract We examine the transformations necessary for establishing
More informationLETTER TO THE EDITOR
STATISTICS IN MEDICINE Statist. Med. 2004; 23:685 693 LETTER TO THE EDITOR Comment on: Condence limits for the ratio of two rates based on likelihood scores: non-iterative method by P. L. Graham, K. Mengersen
More informationAsymptotic efficiency of simple decisions for the compound decision problem
Asymptotic efficiency of simple decisions for the compound decision problem Eitan Greenshtein and Ya acov Ritov Department of Statistical Sciences Duke University Durham, NC 27708-0251, USA e-mail: eitan.greenshtein@gmail.com
More informationResearch Article A Nonparametric Two-Sample Wald Test of Equality of Variances
Advances in Decision Sciences Volume 211, Article ID 74858, 8 pages doi:1.1155/211/74858 Research Article A Nonparametric Two-Sample Wald Test of Equality of Variances David Allingham 1 andj.c.w.rayner
More information1 Hypothesis testing for a single mean
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
More informationSTATISTICAL ANALYSIS AND COMPARISON OF SIMULATION MODELS OF HIGHLY DEPENDABLE SYSTEMS - AN EXPERIMENTAL STUDY. Peter Buchholz Dennis Müller
Proceedings of the 2009 Winter Simulation Conference M. D. Rossetti, R. R. Hill, B. Johansson, A. Dunkin, and R. G. Ingalls, eds. STATISTICAL ANALYSIS AND COMPARISON OF SIMULATION MODELS OF HIGHLY DEPENDABLE
More informationStatistical Inference Using Maximum Likelihood Estimation and the Generalized Likelihood Ratio
\"( Statistical Inference Using Maximum Likelihood Estimation and the Generalized Likelihood Ratio When the True Parameter Is on the Boundary of the Parameter Space by Ziding Feng 1 and Charles E. McCulloch2
More informationSAMPLE SIZE RE-ESTIMATION FOR ADAPTIVE SEQUENTIAL DESIGN IN CLINICAL TRIALS
Journal of Biopharmaceutical Statistics, 18: 1184 1196, 2008 Copyright Taylor & Francis Group, LLC ISSN: 1054-3406 print/1520-5711 online DOI: 10.1080/10543400802369053 SAMPLE SIZE RE-ESTIMATION FOR ADAPTIVE
More informationA CONDITION TO OBTAIN THE SAME DECISION IN THE HOMOGENEITY TEST- ING PROBLEM FROM THE FREQUENTIST AND BAYESIAN POINT OF VIEW
A CONDITION TO OBTAIN THE SAME DECISION IN THE HOMOGENEITY TEST- ING PROBLEM FROM THE FREQUENTIST AND BAYESIAN POINT OF VIEW Miguel A Gómez-Villegas and Beatriz González-Pérez Departamento de Estadística
More information1 Statistical inference for a population mean
1 Statistical inference for a population mean 1. Inference for a large sample, known variance Suppose X 1,..., X n represents a large random sample of data from a population with unknown mean µ and known
More informationJourneys of an Accidental Statistician
Journeys of an Accidental Statistician A partially anecdotal account of A Unified Approach to the Classical Statistical Analysis of Small Signals, GJF and Robert D. Cousins, Phys. Rev. D 57, 3873 (1998)
More informationarxiv: v1 [stat.co] 26 May 2009
MAXIMUM LIKELIHOOD ESTIMATION FOR MARKOV CHAINS arxiv:0905.4131v1 [stat.co] 6 May 009 IULIANA TEODORESCU Abstract. A new approach for optimal estimation of Markov chains with sparse transition matrices
More informationGENERAL PROBLEMS OF METROLOGY AND MEASUREMENT TECHNIQUE
DOI 10.1007/s11018-017-1141-3 Measurement Techniques, Vol. 60, No. 1, April, 2017 GENERAL PROBLEMS OF METROLOGY AND MEASUREMENT TECHNIQUE APPLICATION AND POWER OF PARAMETRIC CRITERIA FOR TESTING THE HOMOGENEITY
More informationBFF Four: Are we Converging?
BFF Four: Are we Converging? Nancy Reid May 2, 2017 Classical Approaches: A Look Way Back Nature of Probability BFF one to three: a look back Comparisons Are we getting there? BFF Four Harvard, May 2017
More informationInference on reliability in two-parameter exponential stress strength model
Metrika DOI 10.1007/s00184-006-0074-7 Inference on reliability in two-parameter exponential stress strength model K. Krishnamoorthy Shubhabrata Mukherjee Huizhen Guo Received: 19 January 2005 Springer-Verlag
More informationWooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics
Wooldridge, Introductory Econometrics, 4th ed. Appendix C: Fundamentals of mathematical statistics A short review of the principles of mathematical statistics (or, what you should have learned in EC 151).
More informationInference for the mean of a population. Testing hypotheses about a single mean (the one sample t-test). The sign test for matched pairs
Stat 528 (Autumn 2008) Inference for the mean of a population (One sample t procedures) Reading: Section 7.1. Inference for the mean of a population. The t distribution for a normal population. Small sample
More informationCHAPTER 1: Functions
CHAPTER 1: Functions 1.1: Functions 1.2: Graphs of Functions 1.3: Basic Graphs and Symmetry 1.4: Transformations 1.5: Piecewise-Defined Functions; Limits and Continuity in Calculus 1.6: Combining Functions
More informationA MONTE CARLO INVESTIGATION INTO THE PROPERTIES OF A PROPOSED ROBUST ONE-SAMPLE TEST OF LOCATION. Bercedis Peterson
A MONTE CARLO INVESTIGATION INTO THE PROPERTIES OF A PROPOSED ROBUST ONE-SAMPLE TEST OF LOCATION by Bercedis Peterson Department of Biostatistics University of North Carolina at Chapel Hill Institute of
More informationLikelihood Asymptotics for Changepoint Problem
Likelihood Asymptotics for Changepoint Problem K. O. Obisesan Department of Statistics University of Ibadan Nigeria email:ko.obisesan@ui.edu.ng ; obidairo@gmail.com ABSTRACT Changepoint problems are often
More informationSemantics and Inference for Probabilistic Ontologies
Semantics and Inference for Probabilistic Ontologies Fabrizio Riguzzi, Elena Bellodi, Evelina Lamma, and Riccardo Zese ENDIF University of Ferrara, Italy, email: {fabrizio.riguzzi, elena.bellodi, evelina.lamma}@unife.it,
More informationCount data and the Poisson distribution
Count data and the Poisson distribution Patrick Breheny November 5 Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/13 Construction Properties Poisson model: A spatial illustration Consider the following
More informationWhy Bellman-Zadeh Approach to Fuzzy Optimization
Applied Mathematical Sciences, Vol. 12, 2018, no. 11, 517-522 HIKARI Ltd, www.m-hikari.com https://doi.org/10.12988/ams.2018.8456 Why Bellman-Zadeh Approach to Fuzzy Optimization Olga Kosheleva 1 and Vladik
More informationOn the GLR and UMP tests in the family with support dependent on the parameter
STATISTICS, OPTIMIZATION AND INFORMATION COMPUTING Stat., Optim. Inf. Comput., Vol. 3, September 2015, pp 221 228. Published online in International Academic Press (www.iapress.org On the GLR and UMP tests
More informationLAW OF LARGE NUMBERS FOR THE SIRS EPIDEMIC
LAW OF LARGE NUMBERS FOR THE SIRS EPIDEMIC R. G. DOLGOARSHINNYKH Abstract. We establish law of large numbers for SIRS stochastic epidemic processes: as the population size increases the paths of SIRS epidemic
More information