Chapter 6 Estimation and Sample Sizes


This chapter presents the beginning of inferential statistics. The two major applications of inferential statistics are:
- Estimate the value of a population parameter.
- Test some claim (or hypothesis) about a population.

- Estimation of population parameters: proportions, means, and variances.
- Point estimate: population proportion (p). (today)
- Interval estimate: confidence interval for p.
- Calculate the sample sizes needed to estimate those parameters.

Point Estimate
- $p$ = population proportion
- $\hat{p} = x/n$ = sample proportion (pronounced "p-hat") of $x$ successes in a sample of size $n$. It is an unbiased estimate (the best point estimate) of $p$.
- $\hat{q} = 1 - \hat{p}$ = sample proportion of failures in a sample of size $n$

Example: Photo-Cop Survey Responses
829 adult Minnesotans were surveyed, and 51% of them are opposed to the use of the photo-cop for issuing traffic tickets. Using these survey results, find the best estimate of the proportion of all adult Minnesotans opposed to photo-cop use.
Best point estimate = sample proportion = 51%.

Confidence Interval
- Why? A point estimate changes from sample to sample, so on its own it does not show how close it is to the true value.
- A confidence interval (CI) is a range (or interval) of values used to estimate the true population parameter.

Confidence Level
- $\alpha$: a number between 0 and 1.
- The confidence level is $1 - \alpha$, or $100(1 - \alpha)\%$, e.g. 95%. It is the proportion of times that the confidence interval actually contains the population parameter, assuming that the estimation process is repeated a large number of times.
- Other names: degree of confidence, confidence coefficient.

The Critical Value $z_{\alpha/2}$

Finding $z_{\alpha/2}$ for a 95% Degree of Confidence
$\alpha = 5\%$, so $\alpha/2 = 2.5\% = 0.025$. Critical value: $z_{\alpha/2} = 1.96$.
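As a cross-check on the table lookup, the critical value can also be computed from the standard normal distribution. The sketch below uses Python's standard library; the function name `critical_z` is an illustrative choice, not something from the slides.

```python
from statistics import NormalDist

def critical_z(confidence):
    """Return z_{alpha/2} for a two-sided interval at the given confidence level."""
    alpha = 1.0 - confidence
    # z_{alpha/2} is the point with area alpha/2 to its right under N(0, 1),
    # i.e. the (1 - alpha/2) quantile of the standard normal distribution.
    return NormalDist().inv_cdf(1.0 - alpha / 2.0)

print(round(critical_z(0.95), 2))  # 1.96
```

Other confidence levels work the same way: pass 0.90 or 0.99 instead of 0.95.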

Sampling Distribution of $\hat{p}$
- The sampling distribution of the sample proportion can be approximated by a normal distribution if $np \ge 5$ and $nq \ge 5$: $\hat{p}$ is approximately $N(p, pq/n)$, where $q = 1 - p$.

Margin of Error, E
When data from a simple random sample are used to estimate a population proportion $p$, the margin of error, denoted by $E$, is the maximum likely (with probability $1 - \alpha$) difference between the observed proportion $\hat{p}$ and the true value of the population proportion $p$.

Margin of Error of $\hat{p}$
$$E = z_{\alpha/2} \sqrt{\frac{\hat{p}\hat{q}}{n}}$$

$100(1 - \alpha)\%$ Confidence Interval for Population Proportion
$$\hat{p} - E < p < \hat{p} + E, \quad \text{where } E = z_{\alpha/2} \sqrt{\frac{\hat{p}\hat{q}}{n}}$$

Confidence Interval for Population Proportion
The interval can be written in three equivalent forms:
- $\hat{p} - E < p < \hat{p} + E$
- $\hat{p} \pm E$
- $(\hat{p} - E,\ \hat{p} + E)$

Round-Off Rule for Confidence Interval Estimates of p
Round the confidence interval limits to three significant digits.

Procedure for Constructing a Confidence Interval for p
1. Verify that the required assumptions are satisfied: both $np \ge 5$ and $nq \ge 5$.
2. Refer to Table A-2 and find the critical value $z_{\alpha/2}$ that corresponds to the desired confidence level.
3. Evaluate the margin of error $E = z_{\alpha/2} \sqrt{\hat{p}\hat{q}/n}$.
4. Using the calculated margin of error $E$ and the value of the sample proportion $\hat{p}$, find the values of $\hat{p} - E$ and $\hat{p} + E$. Substitute those values in the general format for the confidence interval: $\hat{p} - E < p < \hat{p} + E$.
5. Round the resulting confidence interval limits to three significant digits.
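The five steps can be sketched as one short function. This is a minimal illustration rather than a reference implementation. The name `proportion_ci` is hypothetical, the assumption check uses $\hat{p}$ in place of the unknown $p$, and the critical value comes from the normal quantile function instead of Table A-2.

```python
import math
from statistics import NormalDist

def proportion_ci(p_hat, n, confidence=0.95):
    """Return (E, lower, upper) for a confidence interval on a proportion."""
    q_hat = 1.0 - p_hat
    # Step 1: check the normal-approximation assumptions (p_hat stands in for p).
    if n * p_hat < 5 or n * q_hat < 5:
        raise ValueError("need n*p >= 5 and n*q >= 5")
    # Step 2: critical value z_{alpha/2} (computed instead of read from a table).
    z = NormalDist().inv_cdf(1.0 - (1.0 - confidence) / 2.0)
    # Step 3: margin of error E = z * sqrt(p_hat * q_hat / n).
    e = z * math.sqrt(p_hat * q_hat / n)
    # Step 4: interval limits p_hat - E and p_hat + E.
    return e, p_hat - e, p_hat + e

# Photo-cop survey figures from this chapter: n = 829, p_hat = 0.51.
e, lower, upper = proportion_ci(0.51, 829)
# Step 5: for limits of this size, three decimals give three significant digits.
print(round(lower, 3), round(upper, 3))  # 0.476 0.544
```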

Example: In the Chapter Problem, we noted that 829 adult Minnesotans were surveyed, and 51% of them are opposed to the use of the photo-cop for issuing traffic tickets. Use these survey results.
a) Find the margin of error E that corresponds to a 95% confidence level.
b) Find the 95% confidence interval estimate of the population proportion p.
c) Based on the results, can we safely conclude that the majority of adult Minnesotans oppose use of the photo-cop?

a) Find the margin of error E that corresponds to a 95% confidence level.
First, we check the assumptions. We note that $np = 422.79 \ge 5$ and $nq = 406.21 \ge 5$.
Next, we calculate the margin of error. We have found that $\hat{p} = 0.51$, $\hat{q} = 1 - 0.51 = 0.49$, $z_{\alpha/2} = 1.96$, and $n = 829$.
$$E = 1.96 \sqrt{\frac{(0.51)(0.49)}{829}} = 0.03403$$

b) Find the 95% confidence interval for the population proportion p.
We substitute our values from part a) to obtain the 95% confidence interval for p:
$$\hat{p} - E < p < \hat{p} + E$$
$$0.51 - 0.03403 < p < 0.51 + 0.03403$$
$$0.476 < p < 0.544$$

c) Based on the results, can we safely conclude that the majority of adult Minnesotans oppose use of the photo-cop?
Based on the survey results, we are 95% confident that the limits of 47.6% and 54.4% contain the true percentage of adult Minnesotans opposed to the photo-cop. The percentage of opposed adult Minnesotans is likely to be any value between 47.6% and 54.4%. However, a majority requires a percentage greater than 50%, so we cannot safely conclude that the majority is opposed, because the confidence interval does not lie entirely above 50%.
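The phrase "95% confident" refers to the long-run behavior of the procedure: if the survey were repeated many times, about 95% of the resulting intervals would contain the true proportion. A small simulation illustrates this. The true proportion of 0.5, the number of trials, and the seed are all assumptions chosen for the illustration, not values from the survey.

```python
import math
import random

def coverage(p=0.5, n=829, trials=2000, z=1.96, seed=0):
    """Fraction of simulated intervals p_hat +/- E that contain the true p."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    hits = 0
    for _ in range(trials):
        # Draw one simulated survey of n respondents with true proportion p.
        x = sum(rng.random() < p for _ in range(n))
        p_hat = x / n
        e = z * math.sqrt(p_hat * (1.0 - p_hat) / n)
        if p_hat - e < p < p_hat + e:
            hits += 1
    return hits / trials

print(coverage())  # should land close to 0.95
```

The exact fraction depends on the seed and on the assumed true proportion, but it should stay near the nominal confidence level.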

Determining Sample Size
$$E = z_{\alpha/2} \sqrt{\frac{\hat{p}\hat{q}}{n}}$$
Solving for $n$ by algebra:
$$n = \frac{(z_{\alpha/2})^2 \, \hat{p}\hat{q}}{E^2}$$

Sample Size for Estimating Proportion p
When an estimate $\hat{p}$ is known:
$$n = \frac{(z_{\alpha/2})^2 \, \hat{p}\hat{q}}{E^2}$$
When no estimate $\hat{p}$ is known:
$$n = \frac{(z_{\alpha/2})^2 \cdot 0.25}{E^2}$$

Example: Suppose a sociologist wants to determine the current percentage of U.S. households using e-mail. How many households must be surveyed in order to be 95% confident that the sample percentage is in error by no more than four percentage points?
a) Use this result from an earlier study: In 1997, 16.9% of U.S. households used e-mail (based on data from The World Almanac and Book of Facts).
b) Assume that we have no prior information suggesting a possible value of p.

a) Use this result from an earlier study: In 1997, 16.9% of U.S. households used e-mail (based on data from The World Almanac and Book of Facts).
$$n = \frac{[z_{\alpha/2}]^2 \, \hat{p}\hat{q}}{E^2} = \frac{[1.96]^2 (0.169)(0.831)}{0.04^2} = 337.194$$
which rounds up to 338 households. To be 95% confident that our sample percentage is within four percentage points of the true percentage for all households, we should randomly select and survey 338 households.

b) Assume that we have no prior information suggesting a possible value of p.
$$n = \frac{[z_{\alpha/2}]^2 \cdot 0.25}{E^2} = \frac{(1.96)^2 (0.25)}{0.04^2} = 600.25$$
which rounds up to 601 households. With no prior information, we need a larger sample to achieve the same 95% confidence and an error of no more than four percentage points.
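Both cases of the sample-size formula can be checked with a few lines of code. This is a sketch: the function name `sample_size` and its parameter names are illustrative, and the result is always rounded up to the next whole household, as in the worked answers.

```python
import math

def sample_size(e, z=1.96, p_hat=None):
    """Smallest n whose margin of error for a proportion is at most e."""
    # With no prior estimate, use 0.25 in place of p_hat * q_hat.
    pq = 0.25 if p_hat is None else p_hat * (1.0 - p_hat)
    # Round up: rounding down would give a margin of error larger than e.
    return math.ceil(z ** 2 * pq / e ** 2)

print(sample_size(0.04, p_hat=0.169))  # 338
print(sample_size(0.04))               # 601
```

The value 0.25 is the largest possible product $\hat{p}\hat{q}$ (reached at $\hat{p} = 0.5$), which is why the no-information case never underestimates the required sample.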

Finding the Point Estimate and E from a Confidence Interval
Point estimate of p:
$$\hat{p} = \frac{(\text{upper confidence limit}) + (\text{lower confidence limit})}{2}$$
Margin of error:
$$E = \frac{(\text{upper confidence limit}) - (\text{lower confidence limit})}{2}$$
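As a quick check, these two formulas can be applied to the photo-cop interval $0.476 < p < 0.544$ found earlier. The helper name `from_interval` is hypothetical.

```python
def from_interval(lower, upper):
    """Recover the point estimate and margin of error from interval limits."""
    p_hat = (upper + lower) / 2.0  # midpoint of the interval
    e = (upper - lower) / 2.0      # half the interval width
    return p_hat, e

p_hat, e = from_interval(0.476, 0.544)
print(round(p_hat, 3), round(e, 3))  # 0.51 0.034
```

The recovered margin of error, 0.034, agrees with the original 0.03403 only to the precision of the rounded limits.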