Confidence Interval for one population mean or one population proportion, continued. 1. Sample size estimation based on the large sample C.I.

Similar documents
Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

Lecture 5. Materials Covered: Chapter 6 Suggested Exercises: 6.7, 6.9, 6.17, 6.20, 6.21, 6.41, 6.49, 6.52, 6.53, 6.62, 6.63.

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

Statistics 300: Elementary Statistics

Topic 9: Sampling Distributions of Estimators

Statistics 511 Additional Materials

Statistical Intervals for a Single Sample

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date:

- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

UNIT 8: INTRODUCTION TO INTERVAL ESTIMATION

Topic 9: Sampling Distributions of Estimators

Exam II Covers. STA 291 Lecture 19. Exam II Next Tuesday 5-7pm Memorial Hall (Same place as exam I) Makeup Exam 7:15pm 9:15pm Location CB 234

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

Topic 9: Sampling Distributions of Estimators

Sample Size Determination (Two or More Samples)

1 Inferential Methods for Correlation and Regression Analysis

S160 #12. Sampling Distribution of the Proportion, Part 2. JC Wang. February 25, 2016

MATH/STAT 352: Lecture 15

S160 #12. Review of Large Sample Result for Sample Proportion

AP Statistics Review Ch. 8

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

Chapter 8: Estimating with Confidence

(7 One- and Two-Sample Estimation Problem )

CH19 Confidence Intervals for Proportions. Confidence intervals Construct confidence intervals for population proportions

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Confidence Intervals QMET103

MBACATÓLICA. Quantitative Methods. Faculdade de Ciências Económicas e Empresariais UNIVERSIDADE CATÓLICA PORTUGUESA 9. SAMPLING DISTRIBUTIONS

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

Topic 10: Introduction to Estimation

Instructor: Judith Canner Spring 2010 CONFIDENCE INTERVALS How do we make inferences about the population parameters?

MIT : Quantitative Reasoning and Statistical Methods for Planning I

STAT 155 Introductory Statistics Chapter 6: Introduction to Inference. Lecture 18: Estimation with Confidence

Confidence Interval for Standard Deviation of Normal Distribution with Known Coefficients of Variation

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Common Large/Small Sample Tests 1/55

Data Analysis and Statistical Methods Statistics 651

Stat 421-SP2012 Interval Estimation Section

Final Examination Solutions 17/6/2010

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis

Confidence Intervals for the Population Proportion p

Unit 6 Estimation Week #10 - Practice Problems SOLUTIONS

Chapter 6 Sampling Distributions

Big Picture. 5. Data, Estimates, and Models: quantifying the accuracy of estimates.

Stat 200 -Testing Summary Page 1

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

BIOSTATISTICS. Lecture 5 Interval Estimations for Mean and Proportion. dr. Petr Nazarov

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

Working with Two Populations. Comparing Two Means

Estimating the Population Mean - when a sample average is calculated we can create an interval centered on this average

Direction: This test is worth 250 points. You are required to complete this test within 50 minutes.

CONFIDENCE INTERVALS STUDY GUIDE

1 Models for Matched Pairs

UCLA STAT 13 Introduction to Statistical Methods for the Life and Health Sciences

Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date: Confidence Interval Guesswork with Confidence

ST 305: Exam 3 ( ) = P(A)P(B A) ( ) = P(A) + P(B) ( ) = 1 P( A) ( ) = P(A) P(B) σ X 2 = σ a+bx. σ ˆp. σ X +Y. σ X Y. σ X. σ Y. σ n.

Confidence Intervals รศ.ดร. อน นต ผลเพ ม Assoc.Prof. Anan Phonphoem, Ph.D. Intelligent Wireless Network Group (IWING Lab)

Econ 325/327 Notes on Sample Mean, Sample Proportion, Central Limit Theorem, Chi-square Distribution, Student s t distribution 1.

Expectation and Variance of a random variable

Stat 225 Lecture Notes Week 7, Chapter 8 and 11

Chapters 5 and 13: REGRESSION AND CORRELATION. Univariate data: x, Bivariate data (x,y).

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

Computing Confidence Intervals for Sample Data

Chapter 6. Sampling and Estimation

This chapter focuses on two experimental designs that are crucial to comparative studies: (1) independent samples and (2) matched pair samples.

Chapter 23: Inferences About Means

Chapter 8: STATISTICAL INTERVALS FOR A SINGLE SAMPLE. Part 3: Summary of CI for µ Confidence Interval for a Population Proportion p

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

KLMED8004 Medical statistics. Part I, autumn Estimation. We have previously learned: Population and sample. New questions

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

This is an introductory course in Analysis of Variance and Design of Experiments.

Chapter 1 (Definitions)

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

October 25, 2018 BIM 105 Probability and Statistics for Biomedical Engineers 1

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Properties and Hypothesis Testing

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

Hypothesis Testing. Evaluation of Performance of Learned h. Issues. Trade-off Between Bias and Variance

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 3

Statistical Inference Procedures

MidtermII Review. Sta Fall Office Hours Wednesday 12:30-2:30pm Watch linear regression videos before lab on Thursday

Estimation of a population proportion March 23,

SOLUTIONS y n. n 1 = 605, y 1 = 351. y1. p y n. n 2 = 195, y 2 = 41. y p H 0 : p 1 = p 2 vs. H 1 : p 1 p 2.

Math 140 Introductory Statistics

f(x i ; ) L(x; p) = i=1 To estimate the value of that maximizes L or equivalently ln L we will set =0, for i =1, 2,...,m p x i (1 p) 1 x i i=1

Announcements. Unit 5: Inference for Categorical Data Lecture 1: Inference for a single proportion

Probability and Statistics Estimation Chapter 7 Section 3 Estimating p in the Binomial Distribution

Multiple Comparisons Examples STAT 314

Homework 5 Solutions

Simulation. Two Rule For Inverting A Distribution Function

Parameter, Statistic and Random Samples

Chapter two: Hypothesis testing

Transcription:

Cofidece Iterval for oe populatio mea or oe populatio proportio, cotiued 1. ample size estimatio based o the large sample C.I. for p ˆ(1 ˆ) ˆ(1 ˆ) From the iterval ˆ p p Z p ˆ, p Z p p L legh of your 100(1 )% CI Z pˆ (1 pˆ ) L,, pˆ are give ad we are iterested i sample size. Therefore, 1 1 4( Z ) 1 4( Z ) p(1 p) ( Z ) L L L ˆ ˆ (Whe 1 pˆ, it has the maximum value.) Example 1. Example. L 0.0, 0.05, pˆ 0.54? L 0.0, 0.05, pˆ 0.5?. ample size calculatio for p based o the maximum error E. Defiitio. P( pˆ p E) 1 We wat to estimate p withi E with a probability of (1 ). Derive the formula for 1

( ) ( ) ( ( ) ( ) ( ) ) ice whe the sample size is large, by the CLT we have: ( ) ( ) Thus ( ( ) ( ) ) ( ) ( ) ( )

Whe p is ukow, we ca plug i the estimate of p,, ad obtai the followig formula: ( ) ( ) Recall we also derived based o L - the legth of the 100(1 ) % large sample cofidece iterval for p. Their relatioship (usig the secod formula for sample size calculatio because our CI formula was basd o the secod PQ where we estimated the p i the deomiator) is L E Recall we also derived based o L - the legth of the 100(1 ) % large sample cofidece iterval for p. Their relatioship is L E P( pˆ p E) 1 P( E pˆ p E) 1 P( E pˆ p E pˆ ) 1 P( pˆ E p pˆ E) 1 The 100(1 ) % cofidece iterval for p is pˆ E, pˆ E. L pˆ E pˆ E E The legth of the cofidece iterval is Example 3. I order to estimate the percet of childre with iadequate immuizatio to be withi 0.05 of the true proportio with a probability of 98% (a) How may childre should be sampled? olutio. E 0.05, 1 0.98 0.0 Z0.01 4 (0.05).33 4 (0.05) 543 3

(b) If the percetage of childre with iadequate immuizatio is estimated to be 0%, the? olutio. pˆ 0% Z 0.01 0. 1 0. 348 (0.05) ample size calculates for 1 populatio proportios based o the maximum error E. 3. Cofidece Iterval for 1 populatio mea There are 4 scearios we cover oly the first 3 scearios i our class. 1. Normal populatio, a. Poit Estimator : X is kow. X b. Pivotal Quatity : Z ~ N(0,1) c. 100(1 )% CI for : P( Z Z Z ) 1 X Z d. Legth of CI : L Z e. ample size based o L : f. ample size based o E P( X E) 1 P( E X E) 1 4 Z L P( X E X E) 1 L X E X E E Z E. Normal populatio, a. Poit Estimator : X is ukow. 4

X b. Pivotal Quatity : T ~ t 1 c. 100(1 )% CI for : P( t T t ) 1 X t 1, 1, 1, d. Legth of CI : L t 1, e. ample size based o L : f. ample size based o E P( X E) 1 t 1, 4 L P( E X E) 1 P( X E X E) 1 L X E X E E t 1, E 3. Ay populatios, large sample a. Poit Estimator : X b. Pivotal Quatity : X X Z ~ N(0,1) or Z ~ N(0,1) c. 100(1 )% CI for : P( Z Z Z ) 1 X Z X Z d. Legth of CI : L Z or L Z e. ample size based o L : 4 Z 4 Z or L L f. ample size based o E 5

P( X E) 1 P( E X E) 1 P( X E X E) 1 L X E X E E Z Z or E E 4. There also exist other cases, but we do t cover those i our class. Now I will preset more details for ceario metioed above. cearios 1 & 3 (easy) ceario : ormal populatio, ukow 1. Poit estimatio : X. Z ~ N(0,1) X ~ N(, ) 3. Theorem. amplig from ormal populatio a. Z ~ N (0,1) 1 b. W ~ 1 c. Z ad W are idepedet. Z X Defiitio. T ~ t 1 W ( 1) ------ Derivatio of CI, ormal populatio, is ukow ------ X ~ N(, ) is ot a pivotal quatity. X ~ N(0, ) is ot a pivotal quatity. 6

X Z ~ N(0,1) is ot a pivotal quatity. / Remove!!! X Therefore T ~ t 1 is a pivotal quatity. / Now we will use this pivotal quatity to derive the 100(1-α)% cofidece iterval for μ. We start by plottig the pdf of the t-distributio with -1 degrees of freedom as follows: The above pdf plot correspods to the followig probability statemet: P( t T t ) 1 1, / 1, / X P( t t ) 1 / => 1, / 1, / 7

P( t X t ) 1 => 1, / 1, / P( X t X t ) 1 => 1, / 1, / P( X t X t ) 1 => 1, / 1, / P( X t X t ) 1 => 1, / 1, / => Thus the 100(1 )% C.I. for whe is ukow is [ X t 1, /, X t 1, / ]. (*Please ote that t 1, / Z/ ) Example 4. I a radom sample of 36 parochial schools throughout the south, the average umber of pupils per school is 379. with a stadard deviatio of 14. Use the sample to costruct a 95% CI for, the mea umber of pupils per school for all parochial schools i the south. olutio. CI for, large sample 36, X 379., 14, =0.05 14 95% CI for is X Z0.05 379. 1.96 36 338.7, 419.7 Example 5. I a psychological depth-perceptio test, a radom sample of 14 airlie pilots were asked to judge the distace betwee markers at the other ed of a laboratory. The data (i test) are 8

.7,.4, 1.9,.4, 1.9,.3,.,.5,.3, 1.8,.5,.0,.,.6 Please costruct a 95% CI for, the average distace. olutio. (Note: we ca perform the hapiro-wilk test to examie whether the sample comes from a ormal populatio or ot. This test is ot required i our class. Here we simply assume the populatio is ormal. I will always give you such iformatio i the exams.) CI for, small sample, ormal populatio, populatio variace ukow. 14, X.6, 0.8, =0.05 0.8 95% CI for is X t 1,.6.16 14.10,.4 Example 6. A federal agecy has decided to ivestigate the advertised weight we prited o cartos of a certai brad of cereal. Historical data show that 0.75 ouce. If we wish to estimate the weight withi 0.5 ouce with 99% cofidece, how may cartos should be sampled? olutio. E 0.5, 0.75, 0.01 Z0.005 0.75 0.5 60 Example 7. (review of exact CI for mea whe the populatio is ormal ad the populatio variace is kow.) (PU). A radom sample of 16 police officers subjected to costat ihalatio of automobile exhaust fumes i dowtow Cairo had a average blood lead level cocetratio of 9. μg/dl. Assume X, the blood lead level of a radomly selected policema, is ormally distributed with a stadard deviatio of σ = 7.5 μg/dl. Historically, it is kow that the average blood lead level cocetratio of humas with o exposure to automobile exhaust is 18. μg/dl. Is there covicig evidece that policeme exposed to costat auto exhaust have 9

elevated blood lead level cocetratios? (Data source: Kamal, Eldamaty, ad Faris, "Blood lead level of Cairo traffic policeme," ciece of the Total Eviromet, 105(1991): 165-170.) olutio. Let's try to aswer the questio by calculatig a 95% cofidece iterval for the populatio mea. For a 95% cofidece iterval, 1 α = 0.95, so that α = 0.05 ad α/ = 0.05. Therefore, as the followig diagram illustrates the situatio, z 0.05 = 1.96: Now, substitutig i what we kow ( = 9., = 16, σ = 7.5, ad z 0.05 = 1.96) ito the the formula for a Z-iterval for a mea, we get: [ ] [ ] implifyig, we get a 95% cofidece iterval for the mea blood lead level cocetratio of all policeme exposed to costat auto exhaust: [7.89, 30.51] That is, we ca be 95% cofidet that the mea blood lead level cocetratio of all policeme exposed to costat auto exhaust is 10

betwee 7.9 μg/dl ad 30.5 μg/dl. Note that the iterval does ot cotai the value 18., the average blood lead level cocetratio of humas with o exposure to automobile exhaust. I fact, all of the values i the cofidece iterval are much greater tha 18.. Therefore, there is covicig evidece that policeme exposed to costat auto exhaust have elevated blood lead level cocetratios. Example 8. (Large sample CI for populatio mea, variace ukow) (BU) Descriptive statistics o variables measured i a sample of a =3,539 participats attedig the 7th examiatio of the offsprig i the Framigham Heart tudy are show below. Characteristic ample Mea tadard Deviatio (s) ystolic Blood Pressure Diastolic Blood Pressure Total erum Cholesterol 3,534 17.3 19.0 3,53 74.0 9.9 3,310 00.3 36.8 Weight 3,506 174.4 38.7 Height 3,36 65.957 3.749 Body Mass Idex 3,36 8.15 5.3 Because the sample is large, we ca geerate a 95% cofidece iterval for systolic blood pressure usig the followig formula: [ ] 11

ubstitutig the sample statistics ad the Z value for 95% cofidece,, we have [ ] [ ] Therefore, the poit estimate for the true mea systolic blood pressure i the populatio is 17.3, ad we are 95% cofidet that the true mea is betwee 16.7 ad 17.9. The margi of error is very small (the cofidece iterval is arrow), because the sample size is large. Example 9. (mall sample CI for populatio mea, variace ukow, NORMAL POPULATION) (BU) The table below shows data o a subsample of =10 participats i the 7th examiatio of the Framigham offsprig tudy. Characteristic ample Mea tadard Deviatio (s) ystolic Blood Pressure Diastolic Blood Pressure Total erum Cholesterol 10 11. 11.1 10 71.3 7. 10 0.3 37.7 Weight 10 176.0 33.0 Height 10 67.175 4.05 Body Mass Idex 10 7.6 3.10 uppose we compute a 95% cofidece iterval for the true systolic blood pressure usig data i the subsample. Because the sample size is small, we must ow use the cofidece iterval formula that ivolves t rather tha Z. [ ] 1

The sample size is =10, the degrees of freedom (df) = -1 = 9. The t value for 95% cofidece with df = 9 is =.6. ubstitutig the sample statistics ad the t value for 95% cofidece, we have. Iterpretatio: Based o this sample of size =10, our best estimate of the true mea systolic blood pressure i the populatio is 11.. Based o this sample, we are 95% cofidet that the true systolic blood pressure i the populatio is betwee 113.3 ad 19.1. Note that the margi of error is larger here primarily due to the small sample size. 13