9.2 Confidence Intervals for Means
|
|
- Lucy Newman
- 5 years ago
- Views:
Transcription
1 202 CHAPTER 9. ESTIMATION 9.2 Cofidece Itervals for Meas We are give X 1, X 2,..., X that are a S RS ( from a orm(mea = µ, sd = σ distributio, where µ is ukow. We kow that we may estimate µ with X, ad we have see that this estimator is the MLE. But how good is our estimate? We kow that X µ σ/ orm(mea = 0, sd = 1. (9.2.1 For a big probability 1 α, for istace, 95%, we ca calculate the quatile z α/2. The IP z α/2 X µ σ/ z α/2 = 1 α. (9.2.2 But ow cosider the followig strig of equivalet iequalities: z α/2 ( σ X z α/2 ( σ X z α/2 ( σ z α/2 X µ σ/ z α/2, X µ z α/2 ( σ, µ X + z α/2 ( σ, µ X + z α/2 ( σ. That is, σ σ IP (X z α/2 µ X + z α/2 = 1 α. (9.2.3 Defiitio The iterval [X z α/2 σ, X + z α/2 σ ] (9.2.4 is a 100(1 α% cofidece iterval for µ. The quatity 1 α is called the cofidece coefficiet. Remark The iterval is also sometimes writte more compactly as X ± z α/2 σ. (9.2.5 The iterpretatio of cofidece itervals is tricky ad ofte mistake by ovices. Whe I am teachig the cocept live durig class, I usually ask the studets to imagie that my piece of chalk represets the ukow parameter, ad I lay it dow o the desk i frot of me. Oce the chalk has bee lai, it is fixed; it does ot move. Our goal is to estimate the parameter. For the estimator I pick up a sheet of loose paper lyig earby. The estimatio procedure is to radomly drop the piece of paper from above, ad observe where it lads. If the piece of paper covers the piece of chalk, the we are successful our estimator covers the parameter. If it falls off to oe side or the other, the we are usuccessful; our iterval fails to cover the parameter.
2 9.2. CONFIDENCE INTERVALS FOR MEANS 203 The I ask them: suppose we were to repeat this procedure hudreds, thousads, millios of times. Suppose we kept track of how may times we covered ad how may times we did ot. What percetage of the time would we be successful? I the demostratio, the parameter correspods to the chalk, the sheet of paper correspods to the cofidece iterval, ad the radom experimet correspods to droppig the sheet of paper. The percetage of the time that we are successful exactly correspods to the cofidece coefficiet. That is, if we use a 95% cofidece iterval, the we ca say that, i the log ru, approximately 95% of our itervals will cover the true parameter (which is fixed, but ukow. See Figure 9.2.1, which is a graphical display of these ideas. Uder the above framework, we ca reaso that a iterval with a larger cofidece coefficiet correspods to a wider sheet of paper. Furthermore, the width of the cofidece iterval (sheet of paper should be somehow related to the amout of iformatio cotaied i the radom sample, X 1, X 2,..., X. The followig remarks makes these otios precise. Remark For a fixed cofidece coefficiet 1 α, if icreases, the the cofidece iterval gets SHORTER. (9.2.6 Remark For a fixed sample size, if 1 α icreases, the the cofidece iterval gets WIDER. (9.2.7 Example Results from a Experimet o Plat Growth. The PlatGrowth data frame gives the results of a experimet to measure plat yield (as measured by the weight of the plat. We would like to a 95% cofidece iterval for the mea weight of the plats. Suppose that we kow from prior research that the true populatio stadard deviatio of the plat weights is 0.7 g. The parameter of iterest is µ, which represets the true mea weight of the populatio of all plats of the particular species i the study. We will first take a look at a stemplot of the data: > library(aplpack > with(platgrowth, stem.leaf(weight 1 2: represets 1.2 leaf uit: 0.1 : 30 1 f 5 s * 11 5 t 3 8 f s (4 5* t f 555 s * t 3
3 204 CHAPTER 9. ESTIMATION Cofidece itervals based o z distributio Idex Cofidece Iterval Figure 9.2.1: Simulated cofidece itervals The graph was geerated by the ci.examp fuctio from the TeachigDemos package. Fifty (50 samples of size twety five (25 were geerated from a orm(mea = 100, sd = 10 distributio, ad each sample was used to fid a 95% cofidece iterval for the populatio mea usig Equatio The 50 cofidece itervals are represeted above by horizotal lies, ad the respective sample meas are deoted by vertical slashes. Cofidece itervals that cover the true mea value of 100 are plotted i black; those that fail to cover are plotted i a lighter color. I the plot we see that oly oe (1 of the simulated itervals out of the 50 failed to cover µ = 100, which is a success rate of 98%. If the umber of geerated samples were to icrease from 50 to 500 to 50000,..., the we would expect our success rate to approach the exact value of 95%.
4 9.2. CONFIDENCE INTERVALS FOR MEANS 205 The data appear to be approximately ormal with o extreme values. The data come from a desiged experimet, so it is reasoable to suppose that the observatios costitute a simple radom sample of weights 3. We kow the populatio stadard deviatio σ = 0.70 from prior research. We are goig to use the oe-sample z-iterval. > dim(platgrowth # sample size is first etry [1] 30 2 > with(platgrowth, mea(weight [1] > qorm(0.975 [1] We fid the sample mea of the data to be x = ad z α/2 = z Our iterval is therefore x ± z α/2 σ = ± , which comes out to approximately [4.823, 5.323]. I coclusio, we are 95% cofidet that the true mea weight µ of all plats of this species lies somewhere betwee g ad g, that is, we are 95% cofidet that the iterval [4.823, 5.323] covers µ. See Figure Example Give some data with X 1, X 2,..., X a S RS ( from a orm(mea = µ, sd = σ distributio. Maybe small sample? 1. What is the parameter of iterest? i the cotext of the problem. Give a poit estimate for µ. 2. What are the assumptios beig made i the problem? Do they meet the coditios of the iterval? 3. Calculate the iterval. 4. Draw the coclusio. Remark What if σ is ukow? We istead use the iterval X ± z α/2 S, (9.2.8 where S is the sample stadard deviatio. If is large, the X will have a approximately ormal distributio regardless of the uderlyig populatio (by the CLT ad S will be very close to the parameter σ (by the SLLN; thus the above iterval will have approximately 100(1 α% cofidece of coverig µ. If is small, the 3 Actually we will see later that there is reaso to believe that the observatios are simple radom samples from three distict populatios. See Sectio 10.6.
5 206 CHAPTER 9. ESTIMATION 95% Normal Cofidece Limits: σ x = 0.128, = 30 µ x f(z g( x = f(( x µ i σ x σ x x x z z shaded area Cof Level= Figure 9.2.2: Cofidece iterval plot for the PlatGrowth data The shaded portio represets 95% of the total area uder the curve, ad the upper ad lower bouds are the limits of the oe-sample 95% cofidece iterval. The graph is cetered at the observed sample mea. It was geerated by computig a z.test from the TeachigDemos package, storig the resultig htest object, ad plottig it with the ormal.ad.t.dist fuctio from the HH package. See the remarks i the How to do it with R discussio later i this sectio.
6 9.2. CONFIDENCE INTERVALS FOR MEANS 207 If the uderlyig populatio is ormal the we may replace z α/2 with t α/2 (df = 1. The resultig 100(1 α% cofidece iterval is X ± t α/2 (df = 1 S (9.2.9 if the uderlyig populatio is ot ormal, but approximately ormal, the we may use the t iterval, Equatio The iterval will have approximately 100(1 α% cofidece of coverig µ. However, if the populatio is highly skewed or the data have outliers, the we should ask a professioal statisticia for advice. The author leared of a hady acroym from AP Statistics Exam graders that summarizes the importat parts of cofidece iterval estimatio, which is PANIC: Parameter, Assumptios, Name, Iterval, ad Coclusio. Parameter: idetify the parameter of iterest with the proper symbols. Write dow what the parameter meas i the cotext of the problem. Assumptios: list ay assumptios made i the experimet. If there are ay other assumptios eeded or that were ot checked, state what they are ad why they are importat. Name: choose a statistical procedure from your bag of tricks based o the aswers to the previous two parts. The assumptios of the procedure you choose should match those of the problem; if they do ot match the either pick a differet procedure or opely admit that the results may ot be reliable. Write dow ay uderlyig formulas used. Iterval: calculate the iterval from the sample data. This ca be doe by had but will more ofte be doe with the aid of a computer. Regardless of the method, all calculatios or code should be show so that the etire process is repeatable by a subsequet reader. Coclusio: state the fial results, usig laguage i the cotext of the problem. Iclude the appropriate iterpretatio of the iterval, makig referece to the cofidece coefficiet. Remark All of the above itervals for µ were two-sided, but there are also oe-sided itervals for µ. They look like [ σ X z α, or (, X + z α σ ] ( ad satisfy ( ( σ σ IP X z α µ = 1 α ad IP X + z α µ = 1 α. ( Example Small sample, some data with X 1, X 2,..., X a S RS ( from a orm(mea = µ, sd = σ distributio. 1. PANIC
7 208 CHAPTER 9. ESTIMATION How to do it with R We ca do Example 9.14 with the followig code. > library(teachigdemos > temp <- with(platgrowth, z.test(weight, stdev = 0.7 > temp Oe Sample z-test data: weight z = , = , Std. Dev. = 0.700, Std. Dev. of the sample mea = 0.128, p-value < 2.2e-16 alterative hypothesis: true mea is ot equal to 0 95 percet cofidece iterval: sample estimates: mea of weight The cofidece iterval bouds are show i the sixth lie dow of the output (please disregard all of the additioal output iformatio for ow we will use it i Chapter 10. We ca make the plot for Figure with > library(ipsur > plot(temp, "Cof" 9.3 Cofidece Itervals for Differeces of Meas Let X 1, X 2,..., X be a S RS ( from a orm(mea = µ X, sd = σ X distributio ad let Y 1, Y 2,..., Y m be a S RS (m from a orm(mea = µ Y, sd = σ Y distributio. Further, assume that the X 1, X 2,..., X sample is idepedet of the Y 1, Y 2,..., Y m sample. Suppose that σ X ad σ Y are kow. We would like a cofidece iterval for µ X µ Y. We kow that X Y orm mea = µ X µ Y, sd = σ 2 X + σ2 Y m. (9.3.1 Therefore, a 100(1 α% cofidece iterval for µ X µ Y is give by ( X Y ± zα/2 σ 2 X + σ2 Y m. (9.3.2 Ufortuately, most of the time the values of σ X ad σ Y are ukow. This leads us to the followig: If both sample sizes are large, the we may appeal to the CLT/SLLN (see 8.3 ad substitute S 2 X ad S Y 2 for σ2 X ad σ2 Y i the iterval The resultig cofidece iterval will have approximately 100(1 α% cofidece.
8 CONFIDENCE INTERVALS FOR DIFFERENCES OF MEANS 209 If oe or more of the sample sizes is small the we are i trouble, uless the uderlyig populatios are both ormal ad σ X = σ Y. I this case (settig σ = σ X = σ Y, 1 X Y orm mea = µ X µ Y, sd = σ + 1 m. (9.3.3 Now let U = 1 σ S 2 2 X + m 1 S σ Y. 2 ( The by Exercise 7.2 we kow that U chisq(df = + m 2 ad is ot a large leap to believe that U is idepedet of X Y; thus Z T = t(df = + m 2. (9.3.5 U/ ( + m 2 But X Y (µ X µ Y T = = σ 1 + m 1 S 2 σ 2 X + /, m 1 S 2 σ 2 Y ( + m 2 X Y (µ X µ Y (, ( 1S 2 X +(m 1S Y 2 +m 2 1 ( m t(df = + m 2. Therefore a 100(1 α% cofidece iterval for µ X µ Y is give by ( 1 X Y ± tα/2 (df = + m 2 S p + 1 m, (9.3.6 where ( 1S 2 X S p = + (m 1S Y 2 + m 2 is called the pooled estimator of σ. (9.3.7
9 210 CHAPTER 9. ESTIMATION How to do it with R The basic fuctio is t.test which has a var.equal argumet that may be set to TRUE or FALSE. The cofidece iterval is show as part of the output, although there is a lot of additioal iformatio that is ot eeded util Chapter 10. There is ot ay specific fuctioality to hadle the z-iterval for small samples, but if the samples are large the t.test with var.equal = FALSE will be essetially the same thig. The stadard deviatios are ever (? kow i advace ayway so it does ot really matter i practice. 9.4 Cofidece Itervals for Proportios We would like to kow p which is the proportio of successes. For istace, p could be: the proportio of U.S. citizes that support Obama, the proportio of smokers amog adults age 18 or over, the proportio of people worldwide ifected by the H1N1 virus. We are give a S RS ( X 1, X 2,..., X distributed biom(size = 1, prob = p. Recall from Sectio 5.3 that the commo mea of these variables is IE X = p ad the variace is IE(X p 2 = p(1 p. If we let Y = X i, the from Sectio 5.3 we kow that Y biom(size =, prob = p ad that X = Y p(1 p has IE X = p ad Var(X =. Thus if is large (here is the CLT the a approximate 100(1 α% cofidece iterval for p would be give by p(1 p X ± z α/2. (9.4.1 OOPS...! Equatio is of o use to us because the ukow parameter p is i the formula! (If we kew what p was to plug i the formula the we would ot eed a cofidece iterval i the first place. There are two solutios to this problem. 1. Replace p with ˆp = X. The a approximate 100(1 α% cofidece iterval for p is give by ˆp(1 ˆp ˆp ± z α/2. (9.4.2 This approach is called the Wald iterval ad is also kow as the asymptotic iterval because it appeals to the CLT for large sample sizes. 2. Go back to first priciples. Note that z α/2 Y/ p p(1 p/ z α/2 exactly whe the fuctio f defied by f (p = (Y/ p 2 z 2 α/2 p(1 p
10 For two proportios p 1 ad p 2, we may collect idepedet biom(size = 1, prob = p samples of size 1 ad 2, respectively. Let Y 1 ad Y 2 deote the umber of successes i the respective samples. We kow that Y 1 p1 (1 p 1 orm mea = p 1, sd = 1 ad Y 2 p2 (1 p 2 orm mea = p 2, sd = 2 so it stads to reaso that a approximate 100(1 α% cofidece iterval for p 1 p 2 is give by ˆp1 (1 ˆp 1 ( ˆp 1 ˆp 2 ± z α/2 + ˆp 2(1 ˆp 2, ( where ˆp 1 = Y 1 / 1 ad ˆp 2 = Y 2 / 2. Remark Whe estimatig a sigle proportio, oe-sided itervals are sometimes eeded. They take the form ˆp(1 ˆp 0, ˆp + z α/2 (9.4.5 or ˆp(1 ˆp ˆp z α/2, 1 (9.4.6 or i other words, we kow i advace that the true proportio is restricted to the iterval [0, 1], so we ca trucate our cofidece iterval to those values o either side How to do it with R > library(hmisc > bicof(x = 7, = 25, method = "asymptotic" PoitEst Lower Upper > bicof(x = 7, = 25, method = "wilso" PoitEst Lower Upper
11 212 CHAPTER 9. ESTIMATION The default value of the method argumet is wilso. A alterate way is > tab <- xtabs(~geder, data = RcmdrTestDrive > prop.test(rbid(tab, cof.level = 0.95, correct = FALSE 1-sample proportios test without cotiuity correctio data: rbid(tab, ull probability 0.5 X-squared = 2.881, df = 1, p-value = alterative hypothesis: true p is ot equal to percet cofidece iterval: sample estimates: p > A <- as.data.frame(titaic > library(reshape > B <- with(a, utable(a, Freq 9.5 Cofidece Itervals for Variaces I am thikig oe ad two sample problems here How to do it with R I am thikig about sigma.test i the TeachigDemos package ad var.test i base R here. 9.6 Fittig Distributios How to do it with R I am thikig about fitdistr from the MASS package [84]. 9.7 Sample Size ad Margi of Error Sectios 9.2 through 9.5 all bega the same way: we were give the sample size ad the cofidece coefficiet 1 α, ad our task was to fid a margi of error E so that Some examples we saw were: ˆθ ± E is a 100(1 α% cofidece iterval for θ. E = z α/2 σ/, i the oe-sample z-iterval, E = t α/2 (df = + m 2S p 1 + m 1, i the two-sample pooled t-iterval.
12 9.7. SAMPLE SIZE AND MARGIN OF ERROR 213 We already kow (we ca see i the formulas above that E decreases as icreases. Now we would like to use this iformatio to our advatage: suppose that we have a fixed margi of error E, say E = 3, ad we wat a 100(1 α% cofidece iterval for µ. The questio is: how big does have to be? For the case of a populatio mea the aswer is easy: we set up a equatio ad solve for. Example Give a situatio, give σ, give E, we would like to kow how big has to be to esure that X ± 5 is a 95% cofidece iterval for µ. Remark Always roud up ay decimal values of, o matter how small the decimal is. 2. Aother ame for E is the maximum error of the estimate. For proportios, recall that the asymptotic formula to estimate p was Reasoig as above we would wat ˆp(1 ˆp ˆp ± z α/2. ˆp(1 ˆp E = z α/2, or (9.7.1 = z 2 ˆp(1 ˆp α/2. (9.7.2 E 2 OOPS! Recall that ˆp = Y/, which would put the variable o both sides of Equatio Agai, there are two solutios to the problem. 1. If we have a good idea of what p is, say p the we ca plug it i to get = z 2 p (1 p α/2. (9.7.3 E 2 2. Eve if we have o idea what p is, we do kow from calculus that p(1 p 1/4 because the fuctio f (x = x(1 x is quadratic (so its graph is a parabola which opes dowward with maximum value attaied at x = 1/2. Therefore, regardless of our choice for p the sample size must satisfy = z 2 p (1 p α/2 z2 α/2 E 2 4E. ( The quatity z 2 α/2 /4E2 is large eough to guaratee 100(1 α% cofidece. Example Proportio example Remark For very small populatios sometimes the value of obtaied from the formula is too big. I this case we should use the hypergeometric distributio for a samplig model rather tha the biomial model. With this modificatio the formulas chage to the followig: if N deotes the populatio size the let m = z 2 p (1 p α/2 (9.7.5 E 2 ad the sample size eeded to esure 100(1 α% cofidece is achieved is m 1 + m 1 N If we do ot have a good value for the estimate p the we may use p = 1/2. =. (9.7.6
Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.
Chapter 22 Comparig Two Proportios Copyright 2010 Pearso Educatio, Ic. Comparig Two Proportios Comparisos betwee two percetages are much more commo tha questios about isolated percetages. Ad they are more
More informationChapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.
Chapter 22 Comparig Two Proportios Copyright 2010, 2007, 2004 Pearso Educatio, Ic. Comparig Two Proportios Read the first two paragraphs of pg 504. Comparisos betwee two percetages are much more commo
More informationStatistics 511 Additional Materials
Cofidece Itervals o mu Statistics 511 Additioal Materials This topic officially moves us from probability to statistics. We begi to discuss makig ifereces about the populatio. Oe way to differetiate probability
More informationFrequentist Inference
Frequetist Iferece The topics of the ext three sectios are useful applicatios of the Cetral Limit Theorem. Without kowig aythig about the uderlyig distributio of a sequece of radom variables {X i }, for
More informationAP Statistics Review Ch. 8
AP Statistics Review Ch. 8 Name 1. Each figure below displays the samplig distributio of a statistic used to estimate a parameter. The true value of the populatio parameter is marked o each samplig distributio.
More informationStatistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.
Statistical Iferece (Chapter 10) Statistical iferece = lear about a populatio based o the iformatio provided by a sample. Populatio: The set of all values of a radom variable X of iterest. Characterized
More informationApril 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE
April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE TERRY SOO Abstract These otes are adapted from whe I taught Math 526 ad meat to give a quick itroductio to cofidece
More informationDS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10
DS 00: Priciples ad Techiques of Data Sciece Date: April 3, 208 Name: Hypothesis Testig Discussio #0. Defie these terms below as they relate to hypothesis testig. a) Data Geeratio Model: Solutio: A set
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2016 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationChapter 8: Estimating with Confidence
Chapter 8: Estimatig with Cofidece Sectio 8.2 The Practice of Statistics, 4 th editio For AP* STARNES, YATES, MOORE Chapter 8 Estimatig with Cofidece 8.1 Cofidece Itervals: The Basics 8.2 8.3 Estimatig
More information1 Inferential Methods for Correlation and Regression Analysis
1 Iferetial Methods for Correlatio ad Regressio Aalysis I the chapter o Correlatio ad Regressio Aalysis tools for describig bivariate cotiuous data were itroduced. The sample Pearso Correlatio Coefficiet
More informationDiscrete Mathematics for CS Spring 2008 David Wagner Note 22
CS 70 Discrete Mathematics for CS Sprig 2008 David Wager Note 22 I.I.D. Radom Variables Estimatig the bias of a coi Questio: We wat to estimate the proportio p of Democrats i the US populatio, by takig
More informationResampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.
Jauary 1, 2019 Resamplig Methods Motivatio We have so may estimators with the property θ θ d N 0, σ 2 We ca also write θ a N θ, σ 2 /, where a meas approximately distributed as Oce we have a cosistet estimator
More informationSTA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:
STA 2023 Module 10 Comparig Two Proportios Learig Objectives Upo completig this module, you should be able to: 1. Perform large-sample ifereces (hypothesis test ad cofidece itervals) to compare two populatio
More information7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals
7-1 Chapter 4 Part I. Samplig Distributios ad Cofidece Itervals 1 7- Sectio 1. Samplig Distributio 7-3 Usig Statistics Statistical Iferece: Predict ad forecast values of populatio parameters... Test hypotheses
More informationChapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p
Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE Part 3: Summary of CI for µ Cofidece Iterval for a Populatio Proportio p Sectio 8-4 Summary for creatig a 100(1-α)% CI for µ: Whe σ 2 is kow ad paret
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationThis is an introductory course in Analysis of Variance and Design of Experiments.
1 Notes for M 384E, Wedesday, Jauary 21, 2009 (Please ote: I will ot pass out hard-copy class otes i future classes. If there are writte class otes, they will be posted o the web by the ight before class
More informationExam II Covers. STA 291 Lecture 19. Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm 9:15pm Location CB 234
STA 291 Lecture 19 Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm 9:15pm Locatio CB 234 STA 291 - Lecture 19 1 Exam II Covers Chapter 9 10.1; 10.2; 10.3; 10.4; 10.6
More informationA quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population
A quick activity - Cetral Limit Theorem ad Proportios Lecture 21: Testig Proportios Statistics 10 Coli Rudel Flip a coi 30 times this is goig to get loud! Record the umber of heads you obtaied ad calculate
More informationMATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4
MATH 30: Probability ad Statistics 9. Estimatio ad Testig of Parameters Estimatio ad Testig of Parameters We have bee dealig situatios i which we have full kowledge of the distributio of a radom variable.
More informationChapter 6 Sampling Distributions
Chapter 6 Samplig Distributios 1 I most experimets, we have more tha oe measuremet for ay give variable, each measuremet beig associated with oe radomly selected a member of a populatio. Hece we eed to
More informationHypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance
Hypothesis Testig Empirically evaluatig accuracy of hypotheses: importat activity i ML. Three questios: Give observed accuracy over a sample set, how well does this estimate apply over additioal samples?
More informationRecall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.
Testig Statistical Hypotheses Recall the study where we estimated the differece betwee mea systolic blood pressure levels of users of oral cotraceptives ad o-users, x - y. Such studies are sometimes viewed
More informationTests of Hypotheses Based on a Single Sample (Devore Chapter Eight)
Tests of Hypotheses Based o a Sigle Sample Devore Chapter Eight MATH-252-01: Probability ad Statistics II Sprig 2018 Cotets 1 Hypothesis Tests illustrated with z-tests 1 1.1 Overview of Hypothesis Testig..........
More informationTopic 9: Sampling Distributions of Estimators
Topic 9: Samplig Distributios of Estimators Course 003, 2018 Page 0 Samplig distributios of estimators Sice our estimators are statistics (particular fuctios of radom variables), their distributio ca be
More informationRead through these prior to coming to the test and follow them when you take your test.
Math 143 Sprig 2012 Test 2 Iformatio 1 Test 2 will be give i class o Thursday April 5. Material Covered The test is cummulative, but will emphasize the recet material (Chapters 6 8, 10 11, ad Sectios 12.1
More informationMATH/STAT 352: Lecture 15
MATH/STAT 352: Lecture 15 Sectios 5.2 ad 5.3. Large sample CI for a proportio ad small sample CI for a mea. 1 5.2: Cofidece Iterval for a Proportio Estimatig proportio of successes i a biomial experimet
More informationChapter 23: Inferences About Means
Chapter 23: Ifereces About Meas Eough Proportios! We ve spet the last two uits workig with proportios (or qualitative variables, at least) ow it s time to tur our attetios to quatitative variables. For
More informationData Analysis and Statistical Methods Statistics 651
Data Aalysis ad Statistical Methods Statistics 651 http://www.stat.tamu.edu/~suhasii/teachig.html Suhasii Subba Rao Review of testig: Example The admistrator of a ursig home wats to do a time ad motio
More informationConfidence Intervals for the Population Proportion p
Cofidece Itervals for the Populatio Proportio p The cocept of cofidece itervals for the populatio proportio p is the same as the oe for, the samplig distributio of the mea, x. The structure is idetical:
More informationEstimation of a population proportion March 23,
1 Social Studies 201 Notes for March 23, 2005 Estimatio of a populatio proportio Sectio 8.5, p. 521. For the most part, we have dealt with meas ad stadard deviatios this semester. This sectio of the otes
More informationAgreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times
Sigificace level vs. cofidece level Agreemet of CI ad HT Lecture 13 - Tests of Proportios Sta102 / BME102 Coli Rudel October 15, 2014 Cofidece itervals ad hypothesis tests (almost) always agree, as log
More informationSection 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis
Sectio 9.2 Tests About a Populatio Proportio P H A N T O M S Parameters Hypothesis Assess Coditios Name the Test Test Statistic (Calculate) Obtai P value Make a decisio State coclusio Sectio 9.2 Tests
More informationBinomial Distribution
0.0 0.5 1.0 1.5 2.0 2.5 3.0 0 1 2 3 4 5 6 7 0.0 0.5 1.0 1.5 2.0 2.5 3.0 Overview Example: coi tossed three times Defiitio Formula Recall that a r.v. is discrete if there are either a fiite umber of possible
More informationChapter 6. Sampling and Estimation
Samplig ad Estimatio - 34 Chapter 6. Samplig ad Estimatio 6.. Itroductio Frequetly the egieer is uable to completely characterize the etire populatio. She/he must be satisfied with examiig some subset
More informationCONFIDENCE INTERVALS STUDY GUIDE
CONFIDENCE INTERVALS STUDY UIDE Last uit, we discussed how sample statistics vary. Uder the right coditios, sample statistics like meas ad proportios follow a Normal distributio, which allows us to calculate
More informationEstimation for Complete Data
Estimatio for Complete Data complete data: there is o loss of iformatio durig study. complete idividual complete data= grouped data A complete idividual data is the oe i which the complete iformatio of
More informationRandom Variables, Sampling and Estimation
Chapter 1 Radom Variables, Samplig ad Estimatio 1.1 Itroductio This chapter will cover the most importat basic statistical theory you eed i order to uderstad the ecoometric material that will be comig
More informationStatistics 20: Final Exam Solutions Summer Session 2007
1. 20 poits Testig for Diabetes. Statistics 20: Fial Exam Solutios Summer Sessio 2007 (a) 3 poits Give estimates for the sesitivity of Test I ad of Test II. Solutio: 156 patiets out of total 223 patiets
More informationConfidence intervals summary Conservative and approximate confidence intervals for a binomial p Examples. MATH1005 Statistics. Lecture 24. M.
MATH1005 Statistics Lecture 24 M. Stewart School of Mathematics ad Statistics Uiversity of Sydey Outlie Cofidece itervals summary Coservative ad approximate cofidece itervals for a biomial p The aïve iterval
More informationBig Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.
5. Data, Estimates, ad Models: quatifyig the accuracy of estimates. 5. Estimatig a Normal Mea 5.2 The Distributio of the Normal Sample Mea 5.3 Normal data, cofidece iterval for, kow 5.4 Normal data, cofidece
More informationBIOSTATISTICS. Lecture 5 Interval Estimations for Mean and Proportion. dr. Petr Nazarov
Microarray Ceter BIOSTATISTICS Lecture 5 Iterval Estimatios for Mea ad Proportio dr. Petr Nazarov 15-03-013 petr.azarov@crp-sate.lu Lecture 5. Iterval estimatio for mea ad proportio OUTLINE Iterval estimatios
More informationStat 421-SP2012 Interval Estimation Section
Stat 41-SP01 Iterval Estimatio Sectio 11.1-11. We ow uderstad (Chapter 10) how to fid poit estimators of a ukow parameter. o However, a poit estimate does ot provide ay iformatio about the ucertaity (possible
More informationOctober 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1
October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 1 Populatio parameters ad Sample Statistics October 25, 2018 BIM 105 Probability ad Statistics for Biomedical Egieers 2 Ifereces
More informationExample: Find the SD of the set {x j } = {2, 4, 5, 8, 5, 11, 7}.
1 (*) If a lot of the data is far from the mea, the may of the (x j x) 2 terms will be quite large, so the mea of these terms will be large ad the SD of the data will be large. (*) I particular, outliers
More informationComputing Confidence Intervals for Sample Data
Computig Cofidece Itervals for Sample Data Topics Use of Statistics Sources of errors Accuracy, precisio, resolutio A mathematical model of errors Cofidece itervals For meas For variaces For proportios
More informationGoodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)
Goodess-of-Fit Tests ad Categorical Data Aalysis (Devore Chapter Fourtee) MATH-252-01: Probability ad Statistics II Sprig 2019 Cotets 1 Chi-Squared Tests with Kow Probabilities 1 1.1 Chi-Squared Testig................
More informationBIOS 4110: Introduction to Biostatistics. Breheny. Lab #9
BIOS 4110: Itroductio to Biostatistics Brehey Lab #9 The Cetral Limit Theorem is very importat i the realm of statistics, ad today's lab will explore the applicatio of it i both categorical ad cotiuous
More informationStat 200 -Testing Summary Page 1
Stat 00 -Testig Summary Page 1 Mathematicias are like Frechme; whatever you say to them, they traslate it ito their ow laguage ad forthwith it is somethig etirely differet Goethe 1 Large Sample Cofidece
More informationST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n.
ST 305: Exam 3 By hadig i this completed exam, I state that I have either give or received assistace from aother perso durig the exam period. I have used o resources other tha the exam itself ad the basic
More information(7 One- and Two-Sample Estimation Problem )
34 Stat Lecture Notes (7 Oe- ad Two-Sample Estimatio Problem ) ( Book*: Chapter 8,pg65) Probability& Statistics for Egieers & Scietists By Walpole, Myers, Myers, Ye Estimatio 1 ) ( ˆ S P i i Poit estimate:
More information- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion
1 Chapter 7 ad 8 Review for Exam Chapter 7 Estimates ad Sample Sizes 2 Defiitio Cofidece Iterval (or Iterval Estimate) a rage (or a iterval) of values used to estimate the true value of the populatio parameter
More informationComparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading
Topic 15 - Two Sample Iferece I STAT 511 Professor Bruce Craig Comparig Two Populatios Research ofte ivolves the compariso of two or more samples from differet populatios Graphical summaries provide visual
More informationt distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference
EXST30 Backgroud material Page From the textbook The Statistical Sleuth Mea [0]: I your text the word mea deotes a populatio mea (µ) while the work average deotes a sample average ( ). Variace [0]: The
More informationMBACATÓLICA. Quantitative Methods. Faculdade de Ciências Económicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS
MBACATÓLICA Quatitative Methods Miguel Gouveia Mauel Leite Moteiro Faculdade de Ciêcias Ecoómicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS MBACatólica 006/07 Métodos Quatitativos
More informationMath 140 Introductory Statistics
8.2 Testig a Proportio Math 1 Itroductory Statistics Professor B. Abrego Lecture 15 Sectios 8.2 People ofte make decisios with data by comparig the results from a sample to some predetermied stadard. These
More informationAAEC/ECON 5126 FINAL EXAM: SOLUTIONS
AAEC/ECON 5126 FINAL EXAM: SOLUTIONS SPRING 2015 / INSTRUCTOR: KLAUS MOELTNER This exam is ope-book, ope-otes, but please work strictly o your ow. Please make sure your ame is o every sheet you re hadig
More informationHomework 5 Solutions
Homework 5 Solutios p329 # 12 No. To estimate the chace you eed the expected value ad stadard error. To do get the expected value you eed the average of the box ad to get the stadard error you eed the
More informationBecause it tests for differences between multiple pairs of means in one test, it is called an omnibus test.
Math 308 Sprig 018 Classes 19 ad 0: Aalysis of Variace (ANOVA) Page 1 of 6 Itroductio ANOVA is a statistical procedure for determiig whether three or more sample meas were draw from populatios with equal
More informationFACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures
FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING Lectures MODULE 5 STATISTICS II. Mea ad stadard error of sample data. Biomial distributio. Normal distributio 4. Samplig 5. Cofidece itervals
More informationTABLES AND FORMULAS FOR MOORE Basic Practice of Statistics
TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics Explorig Data: Distributios Look for overall patter (shape, ceter, spread) ad deviatios (outliers). Mea (use a calculator): x = x 1 + x 2 + +
More informationStatistical Intervals for a Single Sample
3/5/06 Applied Statistics ad Probability for Egieers Sixth Editio Douglas C. Motgomery George C. Ruger Chapter 8 Statistical Itervals for a Sigle Sample 8 CHAPTER OUTLINE 8- Cofidece Iterval o the Mea
More informationENGI 4421 Confidence Intervals (Two Samples) Page 12-01
ENGI 44 Cofidece Itervals (Two Samples) Page -0 Two Sample Cofidece Iterval for a Differece i Populatio Meas [Navidi sectios 5.4-5.7; Devore chapter 9] From the cetral limit theorem, we kow that, for sufficietly
More informationOverview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions
Chapter 9 Slide Ifereces from Two Samples 9- Overview 9- Ifereces about Two Proportios 9- Ifereces about Two Meas: Idepedet Samples 9-4 Ifereces about Matched Pairs 9-5 Comparig Variatio i Two Samples
More informationA statistical method to determine sample size to estimate characteristic value of soil parameters
A statistical method to determie sample size to estimate characteristic value of soil parameters Y. Hojo, B. Setiawa 2 ad M. Suzuki 3 Abstract Sample size is a importat factor to be cosidered i determiig
More informationFinal Examination Solutions 17/6/2010
The Islamic Uiversity of Gaza Faculty of Commerce epartmet of Ecoomics ad Political Scieces A Itroductio to Statistics Course (ECOE 30) Sprig Semester 009-00 Fial Eamiatio Solutios 7/6/00 Name: I: Istructor:
More informationInstructor: Judith Canner Spring 2010 CONFIDENCE INTERVALS How do we make inferences about the population parameters?
CONFIDENCE INTERVALS How do we make ifereces about the populatio parameters? The samplig distributio allows us to quatify the variability i sample statistics icludig how they differ from the parameter
More informationParameter, Statistic and Random Samples
Parameter, Statistic ad Radom Samples A parameter is a umber that describes the populatio. It is a fixed umber, but i practice we do ot kow its value. A statistic is a fuctio of the sample data, i.e.,
More informationConfidence Intervals
Cofidece Itervals Berli Che Deartmet of Comuter Sciece & Iformatio Egieerig Natioal Taiwa Normal Uiversity Referece: 1. W. Navidi. Statistics for Egieerig ad Scietists. Chater 5 & Teachig Material Itroductio
More informationWHAT IS THE PROBABILITY FUNCTION FOR LARGE TSUNAMI WAVES? ABSTRACT
WHAT IS THE PROBABILITY FUNCTION FOR LARGE TSUNAMI WAVES? Harold G. Loomis Hoolulu, HI ABSTRACT Most coastal locatios have few if ay records of tsuami wave heights obtaied over various time periods. Still
More informationKLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions
We have previously leared: KLMED8004 Medical statistics Part I, autum 00 How kow probability distributios (e.g. biomial distributio, ormal distributio) with kow populatio parameters (mea, variace) ca give
More informationEcon 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara
Poit Estimator Eco 325 Notes o Poit Estimator ad Cofidece Iterval 1 By Hiro Kasahara Parameter, Estimator, ad Estimate The ormal probability desity fuctio is fully characterized by two costats: populatio
More informationMOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.
XI-1 (1074) MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND. R. E. D. WOOLSEY AND H. S. SWANSON XI-2 (1075) STATISTICAL DECISION MAKING Advaced
More informationLecture 2: Monte Carlo Simulation
STAT/Q SCI 43: Itroductio to Resamplig ethods Sprig 27 Istructor: Ye-Chi Che Lecture 2: ote Carlo Simulatio 2 ote Carlo Itegratio Assume we wat to evaluate the followig itegratio: e x3 dx What ca we do?
More informationStatisticians use the word population to refer the total number of (potential) observations under consideration
6 Samplig Distributios Statisticias use the word populatio to refer the total umber of (potetial) observatios uder cosideratio The populatio is just the set of all possible outcomes i our sample space
More informationOne-Sample Test for Proportion
Oe-Sample Test for Proportio Approximated Oe-Sample Z Test for Proportio CF Jeff Li, MD., PhD. November 1, 2005 c Jeff Li, MD., PhD. c Jeff Li, MD., PhD. Oe Sample Test for Proportio, 1 I DM-TKR Data,
More informationSTAT 350 Handout 19 Sampling Distribution, Central Limit Theorem (6.6)
STAT 350 Hadout 9 Samplig Distributio, Cetral Limit Theorem (6.6) A radom sample is a sequece of radom variables X, X 2,, X that are idepedet ad idetically distributed. o This property is ofte abbreviated
More informationSTAT431 Review. X = n. n )
STAT43 Review I. Results related to ormal distributio Expected value ad variace. (a) E(aXbY) = aex bey, Var(aXbY) = a VarX b VarY provided X ad Y are idepedet. Normal distributios: (a) Z N(, ) (b) X N(µ,
More informationHYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018
HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018 We are resposible for 2 types of hypothesis tests that produce ifereces about the ukow populatio mea, µ, each of which has 3 possible
More information1 of 7 7/16/2009 6:06 AM Virtual Laboratories > 6. Radom Samples > 1 2 3 4 5 6 7 6. Order Statistics Defiitios Suppose agai that we have a basic radom experimet, ad that X is a real-valued radom variable
More informationSolutions. Discussion D1. a. The middle 95% of a sampling distribution for a binomial proportion ˆp is cut off by the two points
formulas at the AP Exam. Studets should be able to derive it from the expressio for margi of error E i the formula for the cofidece iterval, amely, E z p(1 p) Modelig Good Aswers Oce studets have completed
More informationMathacle. PSet Stats, Concepts In Statistics Level Number Name: Date: Confidence Interval Guesswork with Confidence
PSet ----- Stats, Cocepts I Statistics Cofidece Iterval Guesswork with Cofidece VII. CONFIDENCE INTERVAL 7.1. Sigificace Level ad Cofidece Iterval (CI) The Sigificace Level The sigificace level, ofte deoted
More informationChapter 11: Asking and Answering Questions About the Difference of Two Proportions
Chapter 11: Askig ad Aswerig Questios About the Differece of Two Proportios These otes reflect material from our text, Statistics, Learig from Data, First Editio, by Roxy Peck, published by CENGAGE Learig,
More informationProblems from 9th edition of Probability and Statistical Inference by Hogg, Tanis and Zimmerman:
Math 224 Fall 2017 Homework 4 Drew Armstrog Problems from 9th editio of Probability ad Statistical Iferece by Hogg, Tais ad Zimmerma: Sectio 2.3, Exercises 16(a,d),18. Sectio 2.4, Exercises 13, 14. Sectio
More informationSTAC51: Categorical data Analysis
STAC51: Categorical data Aalysis Mahida Samarakoo Jauary 28, 2016 Mahida Samarakoo STAC51: Categorical data Aalysis 1 / 35 Table of cotets Iferece for Proportios 1 Iferece for Proportios Mahida Samarakoo
More informationANALYSIS OF EXPERIMENTAL ERRORS
ANALYSIS OF EXPERIMENTAL ERRORS All physical measuremets ecoutered i the verificatio of physics theories ad cocepts are subject to ucertaities that deped o the measurig istrumets used ad the coditios uder
More informationGUIDELINES ON REPRESENTATIVE SAMPLING
DRUGS WORKING GROUP VALIDATION OF THE GUIDELINES ON REPRESENTATIVE SAMPLING DOCUMENT TYPE : REF. CODE: ISSUE NO: ISSUE DATE: VALIDATION REPORT DWG-SGL-001 002 08 DECEMBER 2012 Ref code: DWG-SGL-001 Issue
More informationChapter 1 (Definitions)
FINAL EXAM REVIEW Chapter 1 (Defiitios) Qualitative: Nomial: Ordial: Quatitative: Ordial: Iterval: Ratio: Observatioal Study: Desiged Experimet: Samplig: Cluster: Stratified: Systematic: Coveiece: Simple
More informationTopic 10: Introduction to Estimation
Topic 0: Itroductio to Estimatio Jue, 0 Itroductio I the simplest possible terms, the goal of estimatio theory is to aswer the questio: What is that umber? What is the legth, the reactio rate, the fractio
More informationChi-Squared Tests Math 6070, Spring 2006
Chi-Squared Tests Math 6070, Sprig 2006 Davar Khoshevisa Uiversity of Utah February XXX, 2006 Cotets MLE for Goodess-of Fit 2 2 The Multiomial Distributio 3 3 Applicatio to Goodess-of-Fit 6 3 Testig for
More information2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2
Chapter 8 Comparig Two Treatmets Iferece about Two Populatio Meas We wat to compare the meas of two populatios to see whether they differ. There are two situatios to cosider, as show i the followig examples:
More informationInterval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),
Cofidece Iterval Estimatio Problems Suppose we have a populatio with some ukow parameter(s). Example: Normal(,) ad are parameters. We eed to draw coclusios (make ifereces) about the ukow parameters. We
More informationA Confidence Interval for μ
INFERENCES ABOUT μ Oe of the major objectives of statistics is to make ifereces about the distributio of the elemets i a populatio based o iformatio cotaied i a sample. Numerical summaries that characterize
More informationIf, for instance, we were required to test whether the population mean μ could be equal to a certain value μ
STATISTICAL INFERENCE INTRODUCTION Statistical iferece is that brach of Statistics i which oe typically makes a statemet about a populatio based upo the results of a sample. I oesample testig, we essetially
More informationSample Size Determination (Two or More Samples)
Sample Sie Determiatio (Two or More Samples) STATGRAPHICS Rev. 963 Summary... Data Iput... Aalysis Summary... 5 Power Curve... 5 Calculatios... 6 Summary This procedure determies a suitable sample sie
More informationSimulation. Two Rule For Inverting A Distribution Function
Simulatio Two Rule For Ivertig A Distributio Fuctio Rule 1. If F(x) = u is costat o a iterval [x 1, x 2 ), the the uiform value u is mapped oto x 2 through the iversio process. Rule 2. If there is a jump
More informationClass 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700
Class 23 Daiel B. Rowe, Ph.D. Departmet of Mathematics, Statistics, ad Computer Sciece Copyright 2017 by D.B. Rowe 1 Ageda: Recap Chapter 9.1 Lecture Chapter 9.2 Review Exam 6 Problem Solvig Sessio. 2
More information1 Models for Matched Pairs
1 Models for Matched Pairs Matched pairs occur whe we aalyse samples such that for each measuremet i oe of the samples there is a measuremet i the other sample that directly relates to the measuremet i
More informationACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER / Statistics
ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 1 MATH00030 SEMESTER 1 018/019 DR. ANTHONY BROWN 8. Statistics 8.1. Measures of Cetre: Mea, Media ad Mode. If we have a series of umbers the
More information