Lecture #16 Thursday, October 13, 2016 Textbook: Sections 9.3, 9.4, 10.1, 10.2

Similar documents
STA Module 10 Comparing Two Proportions

Lecture Slides. Elementary Statistics. Tenth Edition. by Mario F. Triola. and the Triola Statistics Series

Chapter 7: Sampling Distributions

Math 124: Modules Overall Goal. Point Estimations. Interval Estimation. Math 124: Modules Overall Goal.

Interval estimation. October 3, Basic ideas CLT and CI CI for a population mean CI for a population proportion CI for a Normal mean

Lecture 11 - Tests of Proportions

Harvard University. Rigorous Research in Engineering Education

p = q ˆ = 1 -ˆp = sample proportion of failures in a sample size of n x n Chapter 7 Estimates and Sample Sizes

Review for Final Exam Stat 205: Statistics for the Life Sciences

Chapter 6 Estimation and Sample Sizes

ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12

Statistics for Business and Economics: Confidence Intervals for Proportions

Inference for Single Proportions and Means T.Scofield

One-sample categorical data: approximate inference

10: Crosstabs & Independent Proportions

Are data normally normally distributed?

10.1. Comparing Two Proportions. Section 10.1

CHAPTER 7. Parameters are numerical descriptive measures for populations.

Sections 7.1 and 7.2. This chapter presents the beginning of inferential statistics. The two major applications of inferential statistics

Chapter 9 Inferences from Two Samples

Estimation and Confidence Intervals

Chapter 5 Confidence Intervals

Introduction to Survey Analysis!

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series

Margin of Error for Proportions

Statistical inference provides methods for drawing conclusions about a population from sample data.

You are allowed 3? sheets of notes and a calculator.

Chapter 7: Sampling Distributions

Chapters 4-6: Estimation

Probability theory and inference statistics! Dr. Paola Grosso! SNE research group!! (preferred!)!!

Comparing Systems Using Sample Data

Business Statistics. Lecture 5: Confidence Intervals

Difference Between Pair Differences v. 2 Samples

DETERMINE whether the conditions for performing inference are met. CONSTRUCT and INTERPRET a confidence interval to compare two proportions.

A proportion is the fraction of individuals having a particular attribute. Can range from 0 to 1!

STAT Chapter 9: Two-Sample Problems. Paired Differences (Section 9.3)

Lecture Slides. Elementary Statistics Eleventh Edition. by Mario F. Triola. and the Triola Statistics Series 9.1-1

LECTURE 5. Introduction to Econometrics. Hypothesis testing

Stat 231 Exam 2 Fall 2013

Chapter 7. Inference for Distributions. Introduction to the Practice of STATISTICS SEVENTH. Moore / McCabe / Craig. Lecture Presentation Slides

1 Binomial Probability [15 points]

Lecture 10: Generalized likelihood ratio test

Stat 529 (Winter 2011) Experimental Design for the Two-Sample Problem. Motivation: Designing a new silver coins experiment

Chapter 22. Comparing Two Proportions. Bin Zou STAT 141 University of Alberta Winter / 15

Chapter 9. Correlation and Regression

Using Dice to Introduce Sampling Distributions Written by: Mary Richardson Grand Valley State University

CHAPTER 18 SAMPLING DISTRIBUTION MODELS STAT 203

Econ 325: Introduction to Empirical Economics

ECO220Y Hypothesis Testing: Type I and Type II Errors and Power Readings: Chapter 12,

Math 141. Lecture 10: Confidence Intervals. Albyn Jones 1. jones/courses/ Library 304. Albyn Jones Math 141

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

7.1: What is a Sampling Distribution?!?!

Operation and Supply Chain Management Prof. G. Srinivasan Department of Management Studies Indian Institute of Technology, Madras

LECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING

*Karle Laska s Sections: There is no class tomorrow and Friday! Have a good weekend! Scores will be posted in Compass early Friday morning

Inference for Proportions

The scatterplot is the basic tool for graphically displaying bivariate quantitative data.

Statistics for Business and Economics

Ch. 1: Data and Distributions

Section Inference for a Single Proportion

8.2 Margin of Error, Sample Size (old 8.3)

Statistics for IT Managers

Inference for Proportions, Variance and Standard Deviation

Inference for Regression Inference about the Regression Model and Using the Regression Line

2.830J / 6.780J / ESD.63J Control of Manufacturing Processes (SMA 6303) Spring 2008

PubH 5450 Biostatistics I Prof. Carlin. Lecture 13

Statistic: a that can be from a sample without making use of any unknown. In practice we will use to establish unknown parameters.

Ch Inference for Linear Regression

The t-test Pivots Summary. Pivots and t-tests. Patrick Breheny. October 15. Patrick Breheny Biostatistical Methods I (BIOS 5710) 1/18

Frequency table: Var2 (Spreadsheet1) Count Cumulative Percent Cumulative From To. Percent <x<=

Inference for Regression Simple Linear Regression

BINF702 SPRING 2015 Chapter 7 Hypothesis Testing: One-Sample Inference

Lecture 30. DATA 8 Summer Regression Inference

Tests for Population Proportion(s)

The t-statistic. Student s t Test

y = a + bx 12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation Review: Interpreting Computer Regression Output

CHAPTER 10 Comparing Two Populations or Groups

Bivariate Data: Graphical Display The scatterplot is the basic tool for graphically displaying bivariate quantitative data.

Lecture 7: Confidence interval and Normal approximation

Statistics and Quantitative Analysis U4320. Segment 5: Sampling and inference Prof. Sharyn O Halloran

Exam 2 (KEY) July 20, 2009

Inferences for Proportions and Count Data

ACMS Statistics for Life Sciences. Chapter 13: Sampling Distributions

Using Tables and Graphing Calculators in Math 11

Inference for the mean of a population. Testing hypotheses about a single mean (the one sample t-test). The sign test for matched pairs

Determining the Spread of a Distribution

Determining the Spread of a Distribution

Chapter 6. Estimates and Sample Sizes

Chapter 2: Looking at Data Relationships (Part 3)

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur

Chapter 15 Sampling Distribution Models

Two-sample inference: Continuous data

1 Matched pair comparison(p430-)

Topic 3: Sampling Distributions, Confidence Intervals & Hypothesis Testing. Road Map Sampling Distributions, Confidence Intervals & Hypothesis Testing

CH.8 Statistical Intervals for a Single Sample

Confidence Intervals for Population Mean

Occupy movement - Duke edition. Lecture 14: Large sample inference for proportions. Exploratory analysis. Another poll on the movement

Sampling Distribution of a Sample Proportion

Chapter 9: Sampling Distributions

Lab #11. Variable B. Variable A Y a b a+b N c d c+d a+c b+d N = a+b+c+d

Transcription:

STATISTICS 200 Lecture #16 Thursday, October 13, 2016 Textbook: Sections 9.3, 9.4, 10.1, 10.2 Objectives: Define standard error, relate it to both standard deviation and sampling distribution ideas. Describe the sampling distribution of a sample proportion. Reformulate confidence interval formula using general idea of estimate plus/minus (multiplier standard error) Interpret confidence level as a relative frequency Calculate new values of the multiplier for new confidence levels other than 95%

We now begin a strong focus on Inference Proportions Means One population proportion Two population proportions One population mean Difference between Means Mean difference This week

Motivation Eventual Goal: Use statistical inference to answer the question What is the percentage of Creamery customers who prefer chocolate ice cream over vanilla? Strategy: Get a random sample of 90 individuals and ask them this question. Use the answers to perform a hypothesis test to answer the question.

Comparison of Binomial-based statistics Variable Notation Mean St. Dev. Count of successes Chapter 8 X n p p np(1 p) Proportion of successes Chapter 9 and beyond ˆp = X n p r p(1 p) n

Binomial Distribution vs. approximate p-hat sampling distribution: n = 100 & p = 0.70 µ = = p np = 70 µ = p =0.70 r np(1 p) =4.58 p(1 p) = =0.05 n

A better confidence interval OLD: ˆp 1 p to ˆp n 1 p n Conservative margin of error: 1 p n NEW: ME = (multiplier)*(standard error) (ˆp ME to ˆp ME)

New formula for margin of error ME = (multiplier) (standard error) Z* Related to. Empirical rule Expresses level of confidence that the interval includes the. parameter s.e.(ˆp) Estimate of the Standard deviation of the sampling distribution of p-hat s.e.(ˆp) = r ˆp(1 n ˆp)

Z*-multiplier Use when the normal approximation is appropriate, i.e. when n*p > 10 and. n*(1-p) > 10 Confidence level 90% 1.65 Multiplier (z*) 95% 1.96 à 2 98% 2.33 99% 2.58 The z-multiplier for a 68% confidence level would be, 1 because we must go 1 standard deviation from the mean to capture 68% of the area. 0.90 0.95 0.98

Three Factors affect the width of a confidence interval ME = Z r ˆp(1 n ˆp) Page 382 textbook 1. Level of confidence 2. Sample size Level of confidence Z* ME sample size s.e.(ˆp) ME 3. ˆp as explained on the next page

3. ˆp in pˆp(1 ˆp) p^(1 p^) The scatterplot shows the variation is A. largest when p-hat = 1.0 B. largest when p-hat = 0.5 C. largest when p-hat = 0.25 D. smallest when p-hat = 0.9 E. smallest when p-hat = 0.2 0.0 0.1 0.2 0.3 0.4 0.5 0.0 0.2 0.4 0.6 0.8 1.0 p^

Factor 3: Value of p-hat impacts width of C.I. ME = 2 r ˆp(1 n ˆp) for a 95% CI At a given level of confidence and sample size, the confidence interval is the widest when p-hat equals 0.5 and it becomes narrower as p-hat moves away from 0.5 in either direction.

Confidence Intervals: Population Proportion When normal conditions aren t met, use this option Conservative Method: Chapter 1 & 5 ME = 1 n Normal Approximation: Chapter 10 ME = z* pˆ(1 pˆ) n Exact (Binomial) Need a computer to calculate the interval. Does not include a M.E. Minitab: provides both options Pages 389 & 390 in the textbook

13 Binomial distributions n fixed at 10, p increasing p fixed at 0.02, n increasing Values of n and p determine whether binomial is normal in shape

What does it mean to be 95% confident? Before the sample is drawn: We can say that P(conf. int. contains the true parameter) = 0.95. After the sample is drawn: There is no more randomness! (Both the CI and the parameter are now fixed.) So we cannot talk of probability any longer.

Interpreting 95% confidence: An example Suppose we have a sample of 200 students in STAT 100 and find that 28 of them are left handed. Our sample proportion is: 0.14 We now find the ME and construct a 95% CI.

Find the standard error: That is, estimate the standard deviation of the sample proportion based on a sample of size 200: r 0.14 (1 0.14) 200 =0.025 Hence, z* times the standard error = 2.025 =.05 On the following two slides, we'll pretend that the true population proportion is 0.12.

Normal curve of sample proportions based on sample size 200 The green curve is the true distrtibution of p- hat. Of course, ordinarily we don't know where it lies, but at least we know its approximate standard deviation. Thus, we can build a confidence interval around our 14% estimate (in red). 0.08 0.10 0.12 0.14 0.16 0.18 sample percents If we take another sample, the red line will move but the green curve will not!

30 confidence intervals based on sample size 200 If we repeat the sampling over and over, 95% of our confidence intervals will contain the true proportion of 0.12. This is why we use the term "95% confidence interval". 0.06 0.08 0.10 0.12 0.14 0.16 0.18 sample percents

Definition of "95% confidence interval for the true population proportion": An interval of values computed from a sample that will cover the true but unknown population proportion for 95% of the possible samples. To find a 95% CI: The center is at p-hat. The margin of error is 2 times the S.E., where the S.E. is the square root of [p-hat(1-p-hat)/n].

What does it mean to be 95% confident? A. There is a 95% probability that the one interval that I calculated contains the true value for the parameter. B. If I get 100 such intervals, about 95 of them will contain the true value for the parameter. C. The sample estimate has a 95% chance of being inside the calculated interval. D. The p-value has a 95% chance of being inside the interval.

If you understand today s lecture 9.25, 9.33, 9.35, 9.37, 10.1, 10.3, 10.7, 10.9, 10.11, 10.13, 10.15, 10.19, 10.21, 10.23, 10.25, 10.27, 10.33, 10.45 Objectives: Define standard error, relate it to both standard deviation and sampling distribution ideas. Describe the sampling distribution of a sample proportion. Reformulate confidence interval formula using general idea of estimate plus/minus (multiplier standard error) Interpret confidence level as a relative frequency Calculate new values of the multiplier for new confidence levels other than 95%