Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

Similar documents
HYPOTHESIS TESTS FOR ONE POPULATION MEAN WORKSHEET MTH 1210, FALL 2018

- E < p. ˆ p q ˆ E = q ˆ = 1 - p ˆ = sample proportion of x failures in a sample size of n. where. x n sample proportion. population proportion

Chapter 22. Comparing Two Proportions. Copyright 2010, 2007, 2004 Pearson Education, Inc.

This chapter focuses on two experimental designs that are crucial to comparative studies: (1) independent samples and (2) matched pair samples.

Common Large/Small Sample Tests 1/55

MOST PEOPLE WOULD RATHER LIVE WITH A PROBLEM THEY CAN'T SOLVE, THAN ACCEPT A SOLUTION THEY CAN'T UNDERSTAND.

STA Learning Objectives. Population Proportions. Module 10 Comparing Two Proportions. Upon completing this module, you should be able to:

Chapter 13, Part A Analysis of Variance and Experimental Design

BIOS 4110: Introduction to Biostatistics. Breheny. Lab #9

Chapter 22. Comparing Two Proportions. Copyright 2010 Pearson Education, Inc.

Section 9.2. Tests About a Population Proportion 12/17/2014. Carrying Out a Significance Test H A N T. Parameters & Hypothesis

Stat 200 -Testing Summary Page 1

Because it tests for differences between multiple pairs of means in one test, it is called an omnibus test.

Final Examination Solutions 17/6/2010

2 1. The r.s., of size n2, from population 2 will be. 2 and 2. 2) The two populations are independent. This implies that all of the n1 n2

1 Inferential Methods for Correlation and Regression Analysis

This is an introductory course in Analysis of Variance and Design of Experiments.

Recall the study where we estimated the difference between mean systolic blood pressure levels of users of oral contraceptives and non-users, x - y.

Working with Two Populations. Comparing Two Means

Properties and Hypothesis Testing

Statistics. Chapter 10 Two-Sample Tests. Copyright 2013 Pearson Education, Inc. publishing as Prentice Hall. Chap 10-1

University of California, Los Angeles Department of Statistics. Hypothesis testing

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

LESSON 20: HYPOTHESIS TESTING

Comparing Two Populations. Topic 15 - Two Sample Inference I. Comparing Two Means. Comparing Two Pop Means. Background Reading

April 18, 2017 CONFIDENCE INTERVALS AND HYPOTHESIS TESTING, UNDERGRADUATE MATH 526 STYLE

Successful HE applicants. Information sheet A Number of applicants. Gender Applicants Accepts Applicants Accepts. Age. Domicile

Regression, Inference, and Model Building

Sample Size Determination (Two or More Samples)

Math 140 Introductory Statistics

Frequentist Inference

Mathematical Notation Math Introduction to Applied Statistics

Math 152. Rumbos Fall Solutions to Review Problems for Exam #2. Number of Heads Frequency

Chapter 23: Inferences About Means

Interval Estimation (Confidence Interval = C.I.): An interval estimate of some population parameter is an interval of the form (, ),

3/3/2014. CDS M Phil Econometrics. Types of Relationships. Types of Relationships. Types of Relationships. Vijayamohanan Pillai N.

Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date:

Topic 18: Composite Hypotheses

7-1. Chapter 4. Part I. Sampling Distributions and Confidence Intervals

Sampling Distributions, Z-Tests, Power

Chapter 8: Estimating with Confidence

Statistical Inference About Means and Proportions With Two Populations

Class 23. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

FACULTY OF MATHEMATICAL STUDIES MATHEMATICS FOR PART I ENGINEERING. Lectures

Topic 9: Sampling Distributions of Estimators

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

Chapter 4 Tests of Hypothesis

STAT431 Review. X = n. n )

MA238 Assignment 4 Solutions (part a)

Last Lecture. Wald Test

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

Chapter 20. Comparing Two Proportions. BPS - 5th Ed. Chapter 20 1

Lecture 5: Parametric Hypothesis Testing: Comparing Means. GENOME 560, Spring 2016 Doug Fowler, GS

UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS

1 Models for Matched Pairs

Topic 9: Sampling Distributions of Estimators

Chapter 13: Tests of Hypothesis Section 13.1 Introduction

Topic 9: Sampling Distributions of Estimators

MIT : Quantitative Reasoning and Statistical Methods for Planning I

STATISTICAL INFERENCE

Class 27. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Lecture 5. Materials Covered: Chapter 6 Suggested Exercises: 6.7, 6.9, 6.17, 6.20, 6.21, 6.41, 6.49, 6.52, 6.53, 6.62, 6.63.

1036: Probability & Statistics

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

GG313 GEOLOGICAL DATA ANALYSIS

Chapter 6 Sampling Distributions

Worksheet 23 ( ) Introduction to Simple Linear Regression (continued)

CH19 Confidence Intervals for Proportions. Confidence intervals Construct confidence intervals for population proportions

Estimation of a population proportion March 23,

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Data Analysis and Statistical Methods Statistics 651

Statistics 511 Additional Materials

Confidence Intervals QMET103

Stat 319 Theory of Statistics (2) Exercises

Estimating the Population Mean - when a sample average is calculated we can create an interval centered on this average

Direction: This test is worth 150 points. You are required to complete this test within 55 minutes.

Introduction to Econometrics (3 rd Updated Edition) Solutions to Odd- Numbered End- of- Chapter Exercises: Chapter 3

A Confidence Interval for μ

t distribution [34] : used to test a mean against an hypothesized value (H 0 : µ = µ 0 ) or the difference

If, for instance, we were required to test whether the population mean μ could be equal to a certain value μ

Chapter 22: What is a Test of Significance?

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

Comparing your lab results with the others by one-way ANOVA

6 Sample Size Calculations

PSYCHOLOGICAL RESEARCH (PYC 304-C) Lecture 9

Agreement of CI and HT. Lecture 13 - Tests of Proportions. Example - Waiting Times

Stat 139 Homework 7 Solutions, Fall 2015

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

A statistical method to determine sample size to estimate characteristic value of soil parameters

Hypothesis Testing (2) Barrow, Statistics for Economics, Accounting and Business Studies, 4 th edition Pearson Education Limited 2006

Inferential Statistics. Inference Process. Inferential Statistics and Probability a Holistic Approach. Inference Process.

Mathacle. PSet Stats, Concepts In Statistics Level Number Name: Date: Confidence Interval Guesswork with Confidence

Chapter 9. Key Ideas Hypothesis Test (Two Populations)

M227 Chapter 9 Section 1 Testing Two Parameters: Means, Variances, Proportions

INSTRUCTIONS (A) 1.22 (B) 0.74 (C) 4.93 (D) 1.18 (E) 2.43

independence of the random sample measurements, we have U = Z i ~ χ 2 (n) with σ / n 1. Now let W = σ 2. We then have σ 2 (x i µ + µ x ) 2 i =1 ( )

Statistical Analysis on Uncertainty for Autocorrelated Measurements and its Applications to Key Comparisons

Confidence Intervals for the Population Proportion p

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

Important Formulas. Expectation: E (X) = Σ [X P(X)] = n p q σ = n p q. P(X) = n! X1! X 2! X 3! X k! p X. Chapter 6 The Normal Distribution.

Transcription:

Chapter 9 Slide Ifereces from Two Samples 9- Overview 9- Ifereces about Two Proportios 9- Ifereces about Two Meas: Idepedet Samples 9-4 Ifereces about Matched Pairs 9-5 Comparig Variatio i Two Samples Overview Slide There are may importat ad meaigful situatios i which it becomes ecessary to compare two sets of sample data. Ifereces about Two Proportios Assumptios Slide. We have proportios from two idepedet simple radom samples.. For both samples, the coditios p 5 ad q 5 are satisfied. Notatio for Two Proportios q = p The correspodig meaigs are attached to p,, x, p. ad q, which come from populatio. Slide 4 For populatio, we let: p = populatio proportio = size of the sample x = umber of successes i the sample p = x (the sample proportio) The pooled estimate of p ad p is deoted by p. q = p Pooled Estimate of p ad p p = x x Slide 5 Test Statistic for Two Proportios For H 0 : p = p, H 0 : p p, H 0 : p p H : p p, H : p < p, H : p > p z = ( p p ) ( p p ) pq pq Slide 6

Test Statistic for Two Proportios For H 0 : p = p, H 0 : p p, H 0 : p p Slide 7 Example: For the sample data listed i Table 8-, use a 0.05 sigificace level to test Slide 8 the claim that the proportio of black drivers stopped by the police is greater tha the proportio of white drivers H : p p, H : p < p, H : p > p where p p = 0 (assumed i the ull hypothesis) p x = ad p x = p = x x ad q = p Example: For the sample data listed i Table 8-, use a 0.05 sigificace level to test Slide 9 the claim that the proportio of black drivers stopped by the police is greater tha the proportio of white drivers = 00 x = 4 p = x = 4 = 0.0 00 = 400 x = 47 p = x = 47 = 0.05 400 H 0 : p = p, H : p > p p = x x = 4 47 = 0.06875 00400 q = 0.06875 = 0.895. Example: For the sample data listed i Table 8-, use a 0.05 sigificace level to test Slide 0 the claim that the proportio of black drivers stopped by the police is greater tha the proportio of white drivers = 00 x = 4 p = x = 4 = 0.0 00 = 400 x = 47 p = x = 47 = 0.05 400 z = (0.0 0.05) 0 (0.06875)(0.895) (0.06875)(0.895) 00 400 z = 0.64 Example: For the sample data listed i Table 8-, use a 0.05 sigificace level to test Slide the claim that the proportio of black drivers stopped by the police is greater tha the proportio of white drivers = 00 x = 4 p = x = 4 = 0.0 00 = 400 x = 47 p = x = 47 = 0.05 400 z = 0.64 This is a right-tailed test, so the P- value is the area to the right of the test statistic z = 0.64. The P-value is 0.6. Because the P-value of 0.6 is greater tha the sigificace level of α = 0.05, we fail to reject the ull hypothesis. Example: For the sample data listed i Table 8-, use a 0.05 sigificace level to test Slide the claim that the proportio of black drivers stopped by the police is greater tha the proportio of white drivers = 00 x = 4 p = x = 4 = 0.0 00 = 400 x = 47 p = x = 47 = 0.05 400 z = 0.64 Because we fail to reject the ull hypothesis, we coclude that there is ot sufficiet evidece to support the claim that the proportio of black drivers stopped by police is greater tha that for white drivers. This does ot mea that racial profilig has bee disproved. The evidece might be strog eough with more data.

Example: For the sample data listed i Table 8-, use a 0.05 sigificace level to test Slide the claim that the proportio of black drivers stopped by the police is greater tha the proportio of white drivers = 00 x = 4 p = x = 4 = 0.0 00 = 400 x = 47 p = x = 47 = 0.05 400 Usig TI Calculator: Slide 4 4 Compare these results with the example preseted i the last few slides. Cofidece Iterval Estimate of p - p Slide 5 Example: For the sample data listed i Table 8-, fid a 90% cofidece iterval estimate of the differece betwee the two populatio proportios. Slide 6 ( p p ) E < ( p p ) < ( p p ) E where E = z α/ p q p q = 00 x = 4 p = x = 4 = 0.0 00 = 400 x = 47 p = x = 47 = 0.05 400 E = z α/ E =.645 E = 0.400 p q p q (.)(.88)(0.05)(0.895) 00 400 Example: For the sample data listed i Table 8-, use a 0.05 sigificace level to test Slide 7 the claim that the proportio of black drivers stopped by the police is greater tha the proportio of white drivers Usig TI Calculator: Slide 8 = 00 (0.0 0.05) 0.040 < ( p p ) < (0.0 0.05) 0.040 0.05 < ( p x = 4 p ) < 0.055 p = x = 4 = 0.0 00 = 400 x = 47 p = x = 47 = 0.05 400 4 Compare these results with the example preseted i the last few slides.

Defiitios Slide 9 Assumptios Slide 0 Two Samples: Idepedet The sample values selected from oe populatio are ot related or somehow paired with the sample values selected from the other populatio. If the values i oe sample are related to the values i the other sample, the samples are depedet. Such samples are ofte referred to as matched pairs or paired samples.. The two samples are idepedet.. Both samples are simple radom samples.. Either or both of these coditios are satisfied: The two sample sizes are both large (with > 0 ad > 0) or both samples come from populatios havig ormal distributios. Hypothesis Tests Slide Hypothesis Tests Slide Test Statistic for Two Meas: Test Statistic for Two Meas: t = (x x ) (µ µ ). s s 4 Degrees of freedom: I this book we use this estimate: df = smaller of ad. P-value: Critical values: Refer to Table A-. Use the procedure summarized i Figure 7-6. Refer to Table A-. McGwire Versus Bods Slide McGwire Versus Bods Slide 4 Data Set 0 i Appedix B icludes the distaces of the home rus hit i record-settig seasos by Mark McGwire ad Barry Bods. Sample statistics are show. Use a 0.05 sigificace level to test the claim that the distaces come from populatios with differet meas. McGwire Bods 70 7 x 48.5 40.7 s 45.5 0.6 Claim: μ μ H o : μ = μ H : μ μ α = 0.05 = 69 = 7 df = 69 t.05 =.994

McGwire Versus Bods McGwire Versus Bods Slide 5 Slide 6 Test Statistic for Two Meas: Test Statistic for Two Meas: t = (x x ) (µ µ ). s s t = (48.5 40.7) 0 45.5 0.6 70 7 =.7 Claim: μ μ H o : μ = μ H : μ μ α = 0.05 McGwire Versus Bods Slide 7 5 Claim: μ μ H o : μ = μ H : μ μ α = 0.05 McGwire Versus Bods Slide 8 There is sigificat evidece to support the claim that there is a differece betwee the mea home ru distaces of Mark McGwire ad Barry Bods. Reject Null Figure 8- Figure 8- Usig TI Calculator: Slide 9 Cofidece Itervals Slide 0 (x x ) E < (µ µ ) < (x x ) E 4 s s where E = tα/ Our textbook recommeds to use No Pooled optio at this time.

McGwire Versus Bods McGwire Versus Bods Slide Slide Usig the sample data give i the precedig example, costruct a 95%cofidece iterval estimate of the differece betwee the mea home ru distaces of Mark McGwire ad Barry Bods. E = t s s α/ E =.994 E =.0 45.5 0.6 70 7 Usig the sample data give i the precedig example, costruct a 95%cofidece iterval estimate of the differece betwee the mea home ru distaces of Mark McGwire ad Barry Bods. (48.5 40.7).0 < (μ μ ) < (48.5 40.7).0.8 < (μ μ ) < 7.8 We are 95% cofidet that the limits of.8 ft ad 7.8 ft actually do cotai the differece betwee the two populatio meas. Usig TI Calculator: Slide Assumptios Slide 4. The sample data cosist of matched pairs. 4 6. The samples are simple radom samples.. Either or both of these coditios is satisfied: The umber of matched pairs of sample data is ( > 0) or the pairs of values have differeces that are from a populatio havig a distributio that is approximately ormal. Slide 5 µ d = mea value of the differeces d for the populatio of paired data d Notatio for Matched Pairs = mea value of the differeces d for the paired sample data (equal to the mea of the x y values) Test Statistic for Matched Pairs of Sample Data t = d µ d s d Slide 6 s d = stadard deviatio of the differeces d for the paired sample data = umber of pairs of data. where degrees of freedom =

P-values ad Critical Values Slide 7 Cofidece Itervals Slide 8 Use Table A- (t-distributio). d E < µ d < d E where E = t α/ s d degrees of freedom = Slide 9 Slide 40 d =., s = 0.7, = 5 t α/ =.776 (foud from Table A- with 4 degrees of freedom ad 0.05 i two tails) 7 H 0 : μ d = 0 H : μ d 0 t = d µ d =. 0 =.759 s d 0.7 5 Because the test statistic does ot fall i the critical regio, we fail to reject the ull hypothesis. Slide 4 Slide 4 H 0 : μ d = 0 H : μ d 0 t = d µ d =. 0 =.759 s d 0.7 5 The sample data i Table 8- do ot provide sufficiet evidece to support the claim that actual ad five-day forecast low temperatures are differet.

Slide 4 Slide 44 Usig the same sample matched pairs i Table 8-, costruct a 95% cofidece iterval estimate of μ d, which is the mea of the differeces betwee actual low temperatures ad five-day forecasts. E = t α/ s d 0.7 E = (.776)( 5 ) =. Slide 45 Slide 46 d E < μ d < d E.. < μ d <.. 6.5 < μ d < 0. 8 I the log ru, 95% of such samples will lead to cofidece itervals that actually do cotai the true populatio mea of the differeces. Usig TI Calculator: Slide 47 Usig TI Calculator: Slide 48 Eter actual low ito L, predicted values ito L, ad the differece ito L as explaied i class. 5 6 4 7 8 Compare these results with the example preseted i the last few slides.

Measures of Variatio Slide 49 Assumptios Slide 50 s = stadard deviatio of sample σ = stadard deviatio of populatio s = variace of sample σ = variace of populatio. The two populatios are idepedet of each other.. The two populatios are each ormally distributed. s σ Notatio for Hypothesis Tests with Two Variaces = larger of the two sample variaces = size of the sample with the larger variace Slide 5 = variace of the populatio from which the sample with the larger variace was draw The symbols s,, ad σ are used for the other sample ad populatio. 9 Test Statistic for Hypothesis Tests with Two Variaces s F = s Critical Values: Usig Table A-5, we obtai critical F values that are determied by the followig three values: Slide 5. The sigificace level α.. Numerator degrees of freedom (df ) =. Deomiator degrees of freedom (df ) = Slide 5 All oe-tailed tests will be right-tailed. All two-tailed tests will eed oly the critical value to the right. Whe degrees of freedom is ot listed exactly, use the critical values o either side as a iterval. Use iterpolatio oly if the test statistic falls withi the iterval. Slide 54 If the two populatios do have equal s variaces, the F= s will be close to because s ad s are close i value. If the two populatios have radically differet variaces, the F will be a large umber. Remember, the larger sample variace will be s.

Slide 55 Coke Versus Pepsi Slide 56 Cosequetly, a value of F ear will be evidece i favor of the coclusio that σ = σ. But a large value of F will be evidece agaist the coclusio of equality of the populatio variaces. Data Set 7 i Appedix B icludes the weights (i pouds) of samples of regular Coke ad regular Pepsi. Sample statistics are show. Use the 0.05 sigificace level to test the claim that the weights of regular Coke ad the weights of regular Pepsi have the same stadard deviatio. Regular Coke Regular Pepsi 6 6 x 0.868 0.840 s 0.007507 0.00570 Coke Versus Pepsi Slide 57 Coke Versus Pepsi Slide 58 Claim: σ = σ Claim: σ = σ H o : σ = σ H : σ σ α = 0.05 0 H o : σ = σ H : σ σ α = 0.05 Value of F = s s 0.007507 = 0.00570 =.79 There is ot sufficiet evidece to warrat rejectio of the claim that the two variaces are equal. Usig TI Calculator: Slide 59 Fidig Lower Critical F Values Slide 60 4Draw ) Use F R idicates the critical value for right tail ad F L idicates the critical value for the left tail. ) Iterchage the degrees of freedom. ) F L is the reciprocal of the F value foud i the table. For example: =7, =0, α=0.05 Aswer: F L =0.80, F R =4.97