( µ /σ)ζ/(ζ+1) µ /σ ( µ /σ)ζ/(ζ 1)

Similar documents
5. Likelihood Ratio Tests

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Problem Set 4 Due Oct, 12

Lecture 6 Simple alternatives and the Neyman-Pearson lemma

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

6.3 Testing Series With Positive Terms

x = Pr ( X (n) βx ) =

Statistics 511 Additional Materials

7.1 Convergence of sequences of random variables


BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

1.010 Uncertainty in Engineering Fall 2008

Estimation for Complete Data

Department of Mathematics

Topic 9: Sampling Distributions of Estimators

Output Analysis and Run-Length Control

The standard deviation of the mean

Stat 421-SP2012 Interval Estimation Section

INFINITE SEQUENCES AND SERIES

SOME THEORY AND PRACTICE OF STATISTICS by Howard G. Tucker

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

Topic 9: Sampling Distributions of Estimators

HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018

Lecture 2: Monte Carlo Simulation

Topic 9: Sampling Distributions of Estimators

1 Introduction to reducing variance in Monte Carlo simulations

Chapter 6 Sampling Distributions

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence

CS434a/541a: Pattern Recognition Prof. Olga Veksler. Lecture 5

Data Analysis and Statistical Methods Statistics 651

Frequentist Inference

7.1 Convergence of sequences of random variables

Properties and Hypothesis Testing

Economics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

Math 10A final exam, December 16, 2016

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Statistical Intervals for a Single Sample

STATISTICAL PROPERTIES OF LEAST SQUARES ESTIMATORS. Comments:

Convergence of random variables. (telegram style notes) P.J.C. Spreij

MATH/STAT 352: Lecture 15

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Seunghee Ye Ma 8: Week 5 Oct 28

Distribution of Random Samples & Limit theorems

CHAPTER 8 FUNDAMENTAL SAMPLING DISTRIBUTIONS AND DATA DESCRIPTIONS. 8.1 Random Sampling. 8.2 Some Important Statistics

Economics Spring 2015

Random Variables, Sampling and Estimation

1 Inferential Methods for Correlation and Regression Analysis

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

Monte Carlo Integration

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

Lecture 10 October Minimaxity and least favorable prior sequences

Double Stage Shrinkage Estimator of Two Parameters. Generalized Exponential Distribution

Probability and statistics: basic terms

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS

Power and Type II Error

KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions

Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p

This exam contains 19 pages (including this cover page) and 10 questions. A Formulae sheet is provided with the exam.

R. van Zyl 1, A.J. van der Merwe 2. Quintiles International, University of the Free State

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9

Confidence interval for the two-parameter exponentiated Gumbel distribution based on record values

Statistical inference: example 1. Inferential Statistics

The picture in figure 1.1 helps us to see that the area represents the distance traveled. Figure 1: Area represents distance travelled

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Study the bias (due to the nite dimensional approximation) and variance of the estimators

WHAT IS THE PROBABILITY FUNCTION FOR LARGE TSUNAMI WAVES? ABSTRACT

Binomial Distribution

First Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise

Chapter 6 Principles of Data Reduction

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1

Estimation of a population proportion March 23,

Summary. Recap ... Last Lecture. Summary. Theorem

A statistical method to determine sample size to estimate characteristic value of soil parameters

LECTURE 14 NOTES. A sequence of α-level tests {ϕ n (x)} is consistent if

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

4.1 Sigma Notation and Riemann Sums

5. INEQUALITIES, LIMIT THEOREMS AND GEOMETRIC PROBABILITY

ECE 8527: Introduction to Machine Learning and Pattern Recognition Midterm # 1. Vaishali Amin Fall, 2015

Exam II Covers. STA 291 Lecture 19. Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm 9:15pm Location CB 234

Lecture 11 October 27

Lecture 12: September 27

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency

Statisticians use the word population to refer the total number of (potential) observations under consideration

Parameter, Statistic and Random Samples

STAT Homework 1 - Solutions

Expectation and Variance of a random variable

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER / Statistics

Chapter 13: Tests of Hypothesis Section 13.1 Introduction

Sample Size Estimation in the Proportional Hazards Model for K-sample or Regression Settings Scott S. Emerson, M.D., Ph.D.

Z ß cos x + si x R du We start with the substitutio u = si(x), so du = cos(x). The itegral becomes but +u we should chage the limits to go with the ew

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Infinite Sequences and Series

Topic 18: Composite Hypotheses

Transcription:

A eective CI for the mea with samples of size 1 ad Melaie Wall James Boe ad Richard Tweedie 1 Abstract It is couterituitive that with a sample of oly oe value from a ormal distributio oe ca costruct a ite codece iterval of ay size for the mea. It goes just as much agaist stadard teachig that from a sample of size two such a CI might be shorter tha that based o the t statistic. We ree a earlier versio of this rst result ad use it to prove the secod. For samples of three ad larger we show that the t-based iterval caot be improved usig this approach. A \partly-bayes" approach idicates that i some istaces with such small samples a quite reasoable codece iterval ca be costructed. keywords locatio estimatio Studet's t ivariat estimators codece itervals 1 Itroductio It has bee kow but ot well kow sice the 1960's that from a sigle observatio x from a ormal distributio with ukow mea ad ukow stadard deviatio it is possible to create a codece iterval (CI) for with ite legth [1 8 9 10]. This remarkable result seems to completely cotradict the stadard statistical ituitio that at least two observatios are ecessary i order to have some idea about variability. Noetheless it is a special case of a eve more surprisig result [ 4 11] that for ay (0; 1) there exists a ite 100(1 )% CI for the mea of ay uimodal distributio of the form x jxj: (1) Based o icreasigly strog assumptios about the uderlyig distributio (uimodal uimodalsymmetric or ormal) the value of decreases; ad for each of the three classes the relevat value of is solely a fuctio of the size 1 ad thus is appropriate for all distributios i that class. I this paper we discuss the case where the uderlyig distributio is ormal. We rst ree the results i [4] usig the methods of [] ad give a formula for that results i shorter codece itervals tha those i [4]. We the show that it is possible to use (1) with samples of size >1 ad that give a sample of size = from a ormal distributio this CI ca yield better results i some cases tha the usual Studet's t iterval. For > however the usual Studet's t iterval is always better. We the discuss itervals of the form x jx Aj () where A is some xed predetermied costat. These have the same coverage properties as do those i (1) but if (usig a prior assessmet of the likely locatio of ) we are able to pick A appropriately we will get shorter itervals from this approach. 1 Divisio of Biostatistics School of Public Health Uiversity of Miesota Mieapolis MN 55455. 1

We coclude with a discussio of the ituitive reasos behid these results ad i particular we discuss the apparet cotradictio with the kow optimality of the CI based o Studet's t. A codece iterval for = 1 Let X be a ormal radom variable with mea ad stadard deviatio. To derive (1) we wat to d a value such that P (X jxj X + jxj) 1 ; o matter what the values of ad. For such aiterval the coverage probability for >1is P (X jxj X + jxj) = 1 P (jx j jxj) X = 1 P X + = 1 1 +1 where is the stadard ormal CDF. Note that this depeds oly o the ratio. I order to set the miimum coverage probability equal to (1 ) eeds to be chose so that sup 1 +1 = : (3) We shall show that is possible to solve umerically for the value of that satises (3). Table 1 gives the exact values of associated with several values of assessed usig (7) below. Table 1: Values of for costructig 100(1 )% CIs for usig (1).0.15.10.05.01.4 3.3 4.84 9.68 48.39 So for example a 90% codece iterval for is give by x 4:84jxj. Note that this is arrower tha the rage x 5:84jxj give by the approximatio (4) from [4]. Prior to derivig the results i Table 1 we preset two simple closed-form approximatios for usig Figure 1 which displays the regio where the codece iterval fails to cover. Edelma [4] derives as a approximatio for the value ecessary to esure the rectagle formed with the dotted lies i Figure 1 has area less tha or equal to. This gives (1) +1 (4) where is the stadard ormal desity. Sice this area is always greater tha the area uder the ormal desity eclosed by the same upper ad lower limit this value for is typically rather coservative.

0.1 0. 0.3 0.4 ( µ /σ)ζ/(ζ+1) µ /σ ( µ /σ)ζ/(ζ 1) Figure 1: Regio uder the stadard ormal desity where the codece iterval (1) fails to cover. Rectagle formed by the dotted lie relates to (4) ad trapezoid formed by the bold lie relates to (5). Blachma ad Machol [] derive a approximatio usig the value of eeded so that the trapezoid formed by takig the rst order approximatio to the ormal desity at the poit (bold lie i Figure 1) has area less tha or equal to. This gives (1) : (5) The formula (5) is remarkably close to the exact umerical solutio to (3) for. Numerical methods idicate that for all <:3 it is withi.005 of the exact solutio. We ow idicate how to derive the exact value of which solves (3). For each y deote by h y the \least favorable value" which maximizes the area ( y y ) ( i. ) I the Appedix y1 y+1 followig the reasoig i [] we show that this least favorable value is = 1 1 s y y y y log 1 : (6) Thus the which solves (3) is the same as the which solves!! 1 +1 = ; (7) ad (7) ca easily be solved umerically usig ay software that ca calculate quatiles of the ormal distributio. The results are show for commo values of i Table 1. 3

3 Codece itervals for > 1 The codece iterval (1) was origially derived with the itetio of beig used whe there was oly oe observatio from a distributio. Of course (1) ca also be used to form codece itervals for whe we have more tha oe observatio. If x 1 ;x ;:::x are iid observatios from N(; ) the x = P x i=1 i= is distributed N(; ) ad therefore based o the results of the last sectio we ca form a 100(1 )% codece iterval for by takig x jxj (8) where is from Table 1. Note that the value of does ot deped o so that these itervals do ot get shorter as more data is collected. The use of (8) whe >1may be of practical use if it ca provide shorter codece itervals tha the usual codece iterval for formed with the sample stadard deviatio s ad the Studet's t-distributio i.e. s x t 1 p ; (9) where t 1 is the quatile associated with = from the Studet's t distributio with 1 degrees of freedom. To compare the margi of error of (8) versus (9) we compare E(j Xj) S toe(t1 p ). We derive a expressio for E(j Xj) i the Appedix for completeess ad a expressio for E(S) ca be foud i [3]. These lead to E(j Xj) =E(j Xj) = "!! 1 p r + # e (10) ad E(t 1 " S E(S). p )=t 1 p = t 1 p 1 r # 1 : (11) To see that it is plausible that the margi of error for (8) might be smaller tha that for (9) whe = compare the quatiles of Studet's t distributio give i Table to the values for give i Table 1 for some values of jxj ad s. We see that t 1 > for all although t < for all. Table : Quatile values of Studet's t-distributio with oe degree of freedom t 1.0.15.10.05.01 t 1 3.08 4.17 6.31 1.71 63.66 t 1.89.8.9 4.30 9.9 A careful compariso of E(j Xj) ad E(S) p shows that whe =there is ideed a regio of the parameter space where (10) is smaller tha (11). Figure shows a perspective plot of E(j Xj) ad E(t S 1 p ) as fuctios of ad for the case whe = ad = :05. I order to compare these surfaces we show their dierece over the parameter space i Figure 3 (left plot). The regio of iterest is that where the E(j Xj) S <E(t1 p ). The plot o the right of Figure 3 represets this regio i.e. where the expected margi of error of (9) is larger tha that of (8). We ca see that 4

0 50 100 150 00 50 0 50 100 150 00 50 30 30 5 0 15 σ 10 5 5 µ 0 5 10 5 0 15 10 σ 5 5 µ 0 5 10 Figure : Give = the height i each of these plots represets the expected margi of error for a 95% CI at a give ad. O the left is E(j Xj) ad o the right ise(t1 S p ) whe = ad is less tha approximately 1 (8) leads to a shorter (i expectatio) codece iterval tha (9). The amout of improvemet i accuracy icreases as sigma icreases. O the other had if >> 1 the average margi of error of (8) ca become much worse tha that of (9). To see this cosider what happes to the margi of error i Figure 3 (left plot) whe is held costat ad icreases. Outperformig the CI (9) usig iid ormal observatios might seem like a impossible task give the well kow ad widely taught optimality properties of (9). The key fact to recall is that the CI (9) is the Uiformly Most Accurate Equivariat codece iterval for the parameter ([7] p. 39). Note that the radom variable X jxj used i formig the codece iterval (8) is ot a pivotal quatity sice it depeds o both ad. Thus the CI x jxj is ot i the equivariat class of codece itervals ad so it is ideed possible that for some poits i the parameter space of ad (8) ca be more accurate tha (9). All of the above refers to the situatio =. Whe > the average margi of error for (8) ca be show to be uiformly larger tha the average margi of error for (9) ad so this o-equivariat method is ot practical for larger samples where the stadard methods are better. The followig propositio demostrates this result for the most useful rage of. Propositio: For > ad (0;:0] E(j Xj) >E(t1 S p ) (1) Proof: Note that mi E(j Xj) occurs whe =0. This ca be see by examiig the derivative of E(j Xj) with respect to ad observig that it oly ca equal zero whe =0. Thus E(j Xj) > r p >:797 p : 5

100 50 0 50 diff 30 5 0 15 10 5 σ 5 µ 0 5 10 σ 0 5 10 15 0 5 30 40 30 0 10 0 10 5 0 5 10 µ Figure 3: The height of the plot o the left is the dierece E(t 1 S p ) E(j Xj) based o a 95% CI with =ad the plot o the right is a cotour plot of the dierece show o the left but oly for the regio where (8) is more accurate tha (9) Now.h q 1 i 1 < 1 for all ad so for 3 E(t 1 S p ) <t 1 p t p : q (1) Hece we ca coclude (1) if we ca show that t <:797 for all (0;:0]. Now t = ([6] p. 11). If we use the umerically q demostrated fact that (5) is withi.005 from q the true (1) for (0;:0] it suces to show :797 ( (1) (1) :005). The quatity is mootoically icreasig sice the the rst derivative w.r.t is always positive whe (0;:0]. t Thus from Table 1 ad we coclude that max = 1:89 (0;:0] <:797 as required. :4 4 A partially Bayesia approach Cosider ow the more geeral codece iterval of the form x jx Aj (13) where A is some xed predetermied costat. I our developmet so far we have assumed A =0. Sice A is xed the coverage probability of (13) for is the same as that developed i Sectio ad thus the values for give i Table 1 will still apply. O the other had the choice of A will clearly aect the margi of error of the codece iterval. For the example we posed above whe = ad = :05 the regio i the parameter space where (13) would provide o average a smaller margi of error tha (9) is where jaj is less tha approximately 1. 6

Thus whe we ca use some form of prior iformatio cocerig the mea ad stadard deviatio oly (but ot ecessarily ay other prior distributioal properties) to choose A to be withi a half a stadard deviatio of we ca provide a better 95% codece iterval with = usig x jx Aj tha usig (9). There have bee a umber of approaches to elicitig prior iformatio o momets [5]. Give the full prior distributios o ad we also ca calculate the prior probability of evets such as for some A ad thus of the prior probability that (8) will be more accurate tha (9). jaj < 1 5 Coclusio Why does this totally implausible approach work? To try ad give a ituitio for this cosider a sigle observatio of x = 10. This could come from a ormal distributio with = 0 ad = 10 or from a ormal distributio with = 10 ad = 1. However \commo sese" suggest that it ca hardly come from = 1000 ad = 0. This reasoig idicates that some pairs are highly ulikely. Thus ay codece regio i (; )-space should exclude some pairs ad should ot be iite. The rather surprisig result eve give this is that there is margially a ite CI for which is correct o matter what the ukow value of. Agai however oe might argue that if x =10 the o matter what the value of there is little probability that will be 100000 or more. Thus we are ituitively workig i a \rectagle" i (; )-space ad the margial result of that calculatio leads to the ite CI for aloe. The proof above formalizes this argumet showig that without ay appeal to prior distributios we ca calculate the actual size of the ite CI. Whe is this sort of work useful? Clearly there is a good use for this example i the classroom. We teach that oe eeds a idea of variability i order to do estimatio: it is useful to hoe this ituitio with examples such as this to make us realize that x itself tells us somethig about the variability as well as the mea. We also teach that Studet's t leads to a uiformly most accurate CI. We do ot always metio ad certaily do ot always stress that this oly applies if we restrict ourselves to equivariat CIs. It is valuable to have a o-articial example that shows the eed for the equivariace i this statemet. Whether the approach is useful i practice is dicult to judge. The authors who largely developed this approach [ 8 9 10] are from NASA ad it appears plausible that i their work small samples of 1 or really do exist. There are may other experimets where such sample sizes also apply. I these cases usig (8) may well lead to greater accuracy of estimatio. Appedix The result of our rst lemma is essetially give i [] without proof but we give it here with proof because [] is ot trivial to read. h Lemma: Let a = ( i y y ) ( ). The the value of which maximizes a is y1 y+1 = 1 1 s y y y y log 1 : 7

Proof: Dee v @a. Settig @v yv y 1 v e v equal to zero ad solvig for v gives successively: y yv y y 1 h y y i y1 y+1 " y(y +1) y 1 y(y 1) # y 1 v 0 4y 3 (y 1) log v v y 1 y 1 log y 1 1 1 y s y y log 1 1 1 y py archcot y Our secod lemma is stadard but we give it for completeess. Lemma: Give X N(; ) Proof: E(jXj) = = = E(jXj) = 1 r + Z 1 jxj 1 Z 1 0 Z 1 X 1 1 p e (X) dx p e (X) dx Z 0 1 ( + U) p e U du 1 + = = 1 + p r 1 Z Z 1 e 1 e X p 1 e (X) dx 1 ( + U) p e U du e V dv p Z 1 e V dv Refereces [1] Abbot J.H. ad Roseblatt J. (1963). Two stage estimatio with oe observatio o the rst stage. Aals of the Istitute of Statistical Mathematics 14 9-35. [] Blachma N.M. ad Machol R.E. (1987). Codece itervals based o oe or more observatios. IEEE Trasactios o Iformatio Theory IT-33 No. 3 373-38. [3] Cureto E.E. (1968). Ubiased estimatio of the stadard deviatio. The America Statisticia No. 1. 8

[4] Edelma D. (1990). A codece iterval for the ceter of a ukow uimodal distributio based o a sample size 1. The America Statisticia 44 No. 4 85-87. [5] Goldstei M. (1981). Revisig previsios: A geometric iterpretatio. J.R. Statistical Society B 43 105-130. [6] Johso N. ad Kotz S. (1970). Distributios i Statistics: Cotiuous Uivariate Distributios -. New York: Joh Wiley ad Sos. [7] Lehma E.L. (1986). Testig statistical hypotheses d ed. New York: Joh Wiley ad Sos. [8] Machol R.E. ad Roseblatt J. (1966). Codece iterval based o sigle observatio. Proceedigs of IEEE 54 1087-1088. [9] Machol R.E. (1966). Codece iterval based o sigle observatio. Proceedigs of IEEE 54 1976. [10] Machol R.E. (1967). Codece iterval based o sigle observatio. Proceedigs of IEEE 55 1. [11] Rodriguez C.C. (1998). Codece itervals from oe observatio. Public webpage: http://omega.albay.edu:8008/cot.html. 9