Difference between means - t-test /25

Similar documents
Chapter 24. Comparing Means

Chapter 23. Inference About Means

Test 3 Practice Test A. NOTE: Ignore Q10 (not covered)

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

POLI 443 Applied Political Research

9.5 t test: one μ, σ unknown

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

Two-Sample Inferential Statistics

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

Visual interpretation with normal approximation

Classroom Activity 7 Math 113 Name : 10 pts Intro to Applied Stats

Inferences About Two Proportions

The Chi-Square Distributions

Using Tables and Graphing Calculators in Math 11

Study Ch. 9.3, #47 53 (45 51), 55 61, (55 59)

Inferences for Regression

Chapter 24. Comparing Means. Copyright 2010 Pearson Education, Inc.

Introduction to Business Statistics QM 220 Chapter 12

Lab #12: Exam 3 Review Key

Chapter 22. Comparing Two Proportions 1 /29

MATH 240. Chapter 8 Outlines of Hypothesis Tests

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

EXAM 3 Math 1342 Elementary Statistics 6-7

Null Hypothesis Significance Testing p-values, significance level, power, t-tests Spring 2017

9-6. Testing the difference between proportions /20

6.4 Type I and Type II Errors

CHAPTER 7. Hypothesis Testing

1 Hypothesis testing for a single mean

M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1

Two Sample Hypothesis Tests

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă

The Chi-Square Distributions

8.1-4 Test of Hypotheses Based on a Single Sample

INTERVAL ESTIMATION AND HYPOTHESES TESTING

Last two weeks: Sample, population and sampling distributions finished with estimation & confidence intervals

STAT Chapter 8: Hypothesis Tests

Hypothesis testing. Data to decisions

Chapter 23: Inferences About Means

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

STAT 515 fa 2016 Lec Statistical inference - hypothesis testing

Table 1: Fish Biomass data set on 26 streams

Inference About Two Means: Independent Samples

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV

χ test statistics of 2.5? χ we see that: χ indicate agreement between the two sets of frequencies.

Inferential Statistics and Distributions

Can you tell the relationship between students SAT scores and their college grades?

Business Statistics. Lecture 10: Course Review

Institute of Actuaries of India

COGS 14B: INTRODUCTION TO STATISTICAL ANALYSIS

Is Yawning Contagious video

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. describes the.

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

Section 10.1 (Part 2 of 2) Significance Tests: Power of a Test

An inferential procedure to use sample data to understand a population Procedures

Distribution-Free Procedures (Devore Chapter Fifteen)

Chapter 7 Comparison of two independent samples

10.4 Hypothesis Testing: Two Independent Samples Proportion

COSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan

Sampling Distributions: Central Limit Theorem

Sampling Distribution of a Sample Proportion

Chapter 9 Inferences from Two Samples

CHAPTER 13: F PROBABILITY DISTRIBUTION

POLI 443 Applied Political Research

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong

The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions.

PHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1

Stats Review Chapter 14. Mary Stangler Center for Academic Success Revised 8/16

Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc.

CHAPTER 13: F PROBABILITY DISTRIBUTION

Chapter 9. Inferences from Two Samples. Objective. Notation. Section 9.2. Definition. Notation. q = 1 p. Inferences About Two Proportions

Mathematical Notation Math Introduction to Applied Statistics

hypotheses. P-value Test for a 2 Sample z-test (Large Independent Samples) n > 30 P-value Test for a 2 Sample t-test (Small Samples) n < 30 Identify α

determine whether or not this relationship is.

z and t tests for the mean of a normal distribution Confidence intervals for the mean Binomial tests

280 CHAPTER 9 TESTS OF HYPOTHESES FOR A SINGLE SAMPLE Tests of Statistical Hypotheses

PSY 305. Module 3. Page Title. Introduction to Hypothesis Testing Z-tests. Five steps in hypothesis testing

STA Module 11 Inferences for Two Population Means

STA Rev. F Learning Objectives. Two Population Means. Module 11 Inferences for Two Population Means

HYPOTHESIS TESTING. Hypothesis Testing

The t-statistic. Student s t Test

Hypothesis testing: Steps

STAT FINAL EXAM

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

Last week: Sample, population and sampling distributions finished with estimation & confidence intervals

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

We know from STAT.1030 that the relevant test statistic for equality of proportions is:

Exam 2 (KEY) July 20, 2009

Review. One-way ANOVA, I. What s coming up. Multiple comparisons

Lecture 28 Chi-Square Analysis

hypothesis a claim about the value of some parameter (like p)

Single Sample Means. SOCY601 Alan Neustadtl

MBA 605, Business Analytics Donald D. Conant, Ph.D. Master of Business Administration

Note: k = the # of conditions n = # of data points in a condition N = total # of data points

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.

Hypothesis testing: Steps

1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

Transcription:

Difference between means - t-test 1

Discussion Question p492 Ex 9-4 p492 1-3, 6-8, 12 Assume all variances are not equal. Ignore the test for variance. 2

Students will perform hypothesis tests for two sample means using the t statistic. Compares the means of two small, independent samples. Two choices, variances are equal, variances are not equal. We will assume variances are not equal. 3

4 t-test When variances are assumed to be equal Test Statistic = Observed Value - Expected Value standard error t = X - µ, d.f. = n 1 s n With two samples things need adjusting... t = ( ) X X µ µ 1 2 1 2 ( ) (n 1)s 2 + (n 1)s 2 1 1 1 2 2, d.f. = n 1 + n 2 2 n 1 + n 2 2 n 1 + 1 n 2

5 t-test When variances are assumed to be equal With two samples from populations of equal variances, the variances can be weighted according to the size of the samples. The larger sample (greater d.f.) is weighted more than the smaller sample. This creates a pooled variance. t = X - µ s t = ( ) X X µ µ 1 2 1 2 ( ) (n 1)s 2 + (n 1)s 2 1 1 1 2 2 n n 1 + n 2 2 n 1 + 1 n 2

6 t-test When variances are assumed to be not equal Test Statistic = Observed Value - Expected Value standard error t = X - µ, d.f. = n 1 s n This time the formula is more familiar... ( ) ( ) µ µ 1 2 X X 1 2 t Z = σs 2 2 1 σs + 2 n 1 n 2 ( ) X X 1 2 = σs 2 2 1 σs + 2 n 1 n 2 ( ) *, d.f. = small n 1 * Not really

7 t-test When variances are assumed to be not equal Now that I told you all that, simply to tell you this... Do not bother to pool the variance. First, we have not done the test for homogeneity of variance, and second, the gain from pooling is not worth the work.

8 Confidence Interval As with the previous hypothesis testing, the t statistic can determine confidence intervals. We find the interval within which we expect to find the difference between means. With the t statistic, we will always assume the variances are not equal and we will NOT pool variances.

9 Confidence Interval Since we assume the population variances are not equal. ( X X ) t 1 2 α 2 σ 2 2 1 ( ) + t α 2 + σ 2 < µ µ < X X n n 1 2 1 2 1 2 σ 1 2 2 n 1 + σ 2 n 2 Your book uses d.f. = (smaller n) - 1, but I prefer you not. We will let the calculator find the degrees of freedom for us.

10 Steps for Hypothesis Testing 1. Formulate all hypotheses: H0, Ha 2. Determine the test statistic and the critical value of the test statistic (based on α) that will assess the evidence against the null hypothesis. (Currently Z) 2a. Draw, label, and appropriately shade the curve representing H0 3. Find value(s) of the test statistic based on the data and the p-value for that statistic. 4. Make a decision to Reject or Fail to reject H0. 5. Tell someone about your conclusion.

11 Ejemplo A study was conducted to compare college majors requiring 12 or more hours of mathematics to majors that required less than 12 hours of mathematics. The mean income of 8 majors requiring more than 12 hours of math was $45,692 per year, with a standard deviation of 1236. The mean income of 10 majors that require less than 12 hours of math was $40,416, with a standard deviation of 5324. Test for a difference in the incomes at the.05 level. Sample M; x = 45692, s = 1236, n = 8 Sample L; x = 40416, s = 5324, n = 10

12 TI-84 Sample M; x = 45692, s = 1236, n = 8 Sample L; x = 40416, s = 5324, n = 10 We will conduct a 2-tailed 2-Sample T-test. STAT TESTS 4:2-SampleTTest Inpt: Stats x1: 45692 Sx1: 1236 n1: 8 x2: 40416 Sx2: 5324 n2: 10 µ2 Result µ1 µ2 t= 3.033256297 p=.0123463074 df= 10.19403292 x1= 45692 x2: 40416 Sx2: 5324 n1= 8 Pooled: No n2: 10

13 Sample M; x = 45692, s = 1236, n = 8 Sample L; x = 40416, s = 5324, n = 10 1. Hypotheses - Means H 0 : µ M = µ L and H 1 : µ M µ L H 0 : µ M µ L = 0 and H 1 : µ M µ L 0 M = more than 12 hours L = less than 12 hours 2. Test Statistic and Critical Value If you choose to use the book choice for d.f. Since our samples are small and we do not have -2.3646 2.3646 the population σ we will use the t-statistic..025 95%.025 α =.05 α/2 =.025, d.f. = 8-1 = 7 t c = invt(.975, 7) = 2.3646-3σ -2σ -1σ 0 1σ 2σ 3σ

14 2. Test Statistic and Critical Value Sample M; x = 45692, s = 1236, n = 8 Sample L; x = 40416, s = 5324, n = 10 Since our samples are small and we do not have the population σ we will use the t-statistic with the df provided by calculator. If you choose to use the calculator definition of d.f. α =.05 α/2 =.025, d.f. = 10.19 t c = invt(.975, 10.19) = 2.2225.025-2.2225 95% 2.2225.025-3σ -2σ -1σ 0 1σ 2σ 3σ

15 3. Calculate t & p-value Sample M; x = 45692, s = 1236, n = 8 Sample L; x = 40416, s = 5324, n = 10 t = ( ) 45692 40416 0 = 3.033 1236 2 8 + 53242 10 p-value: p(t > 3.033) = p(x 1 - x 2 > 5246) If you choose to use the small d.f. you will need to use tcdf (or normalcdf) to find the p-value. d.f. = n - 1 = 8-1 = 7 tcdf(3.033, 99, 7) =.0095 x 2 =.0190 d.f. = 10.1940 2-SampTTest(stats, 45692, 1236, 8, 40416, 5324, 10, µ 2, No) =.0123

16 Sample M; x = 45692, s = 1236, n = 8 Sample L; x = 40416, s = 5324, n = 10 4. Decision 3.033 > 2.3646 (2.2225),.019 <.05, we reject the null. 5. Conclusion There is sufficient evidence to suggest there is a difference in the mean incomes -2.3646 2.3646-2.2225 2.2225 for majors requiring 12 units of math and those not requiring 12 units of math..025.0095 95%.025.0095-3.033 3.033-3σ -2σ -1σ 0 1σ 2σ 3σ

17 Confidence Interval TI-84 Sample M; x = 45692, s = 1236, n = 8 Sample L; x = 40416, s = 5324, n = 10 STAT TESTS 9:2-SampTInt Inpt: Data Stats x1: 45692 Sx1: 1236 n1: 8 x2: 40416 Sx2: 5324 n2: 10 C-Level:.95 Pooled: No Yes Calculate 2-SampZInt (1410.4, 9141.6) x1: 45692 x2: 40416 Sx1: 1236 Sx2: 5324 n1: 8 n2: 10 Based on data from our sample of 8 and 10, we believe the true difference in incomes is between $1410.40 and $9141.60

18 Confidence Interval Sample M; x = 45692, s = 1236, n = 8 Sample L; x = 40416, s = 5324, n = 10 Since we conclude it is plausible to believe there is a difference in the mean incomes, what is that difference? X M X L = 45692 40416 = 5276 d.f. = 10.19 d.f. = 7 5276 ± 2.2224 12362 8 + 53242 10 5276 ± 2.3646 12362 8 + 53242 10 (1410.36, 9141.61) (1163.05, 9388.95) Based on data from our sample of 8 and 10, we believe the true difference in incomes is between $1410 and $9141 ($1163 and $9389)

19 Another Example Two groups were randomly assigned to two speed-reading classes that use different teaching techniques. Test the claim that the Orozco Oral Method gives better results than the VanVoorhis Vision Method. Use α =.01. Orozco Oral VV Vision n = 16 n = 12 x = 44.0 x = 36.5 s = 13.2 s = 10.2 1. Hypotheses H0: µ O µ VV ; H1: µ O > µ VV H0: µ O µ VV 0; H1: µ O µ VV > 0 O = Orozco VV = VanVoorhis

20 Orozco Oral VV Vision 2. Test Statistic and Critical Value n = 16 n = 12 x = 44.0 x = 36.5 s = 13.2 s = 10.2 Samples are small and no σ, we will use the t-statistic. We will run a one tail t-test. t c = invt(.99, 25.9567) = 2.4789 t c = invt(.99, 11) = 2.7181 99% 2.7181 2.4789.01-3σ -2σ -1σ 0 1σ 2σ 3σ Slide 24

21 TI-84 STAT TESTS 4:2-SampleTTest n = 16 n = 12 Orozco Oral VV Vision Results x = 44.0 x = 36.5 Input s = 13.2 s = 10.2 2-SampleTTest Stats µ1 > µ2 x1: 44 sx1: 13.2 n1: 16 x2: 36.5 sx2: 10.2 t=1.6958 p=.0509 df=25.9567 x1=44 x2=36.5 99% 1.6958 2.7181 2.4789.01 n2: 12 sx1: 13.2-3σ -2σ -1σ 0 1σ 2σ 3σ µ1: >µ2 sx2=10.2 Pooled: No n1=16 Calculate n2=12

22 Results Orozco Oral VV Vision 3. Calculate t & p-value 2-SampleTTest µ1 > µ2 t=1.6958 n = 16 n = 12 x = 44.0 x = 36.5 s = 13.2 s = 10.2 p=.0509 ( ) 44 36.5 0 df=25.9567 t = = 1.6958 13.2 2 10.2 + 16 12 2 x1=44 x2=36.5 sx1: 13.2 p(t 1.6958) = p(x 1 - x 2 7.5) sx2=10.2 n1=16 1.6958 2.7181 2.4789 n2=12 99% tcdf(1.6958, 99, 11) =.0590.01-3σ -2σ -1σ 0 1σ 2σ 3σ

23 Results Orozco Oral VV Vision 4. Decision 1.6958 < 2.4789,.0509 >.01, we fail to reject the null. 5. Conclusion 2-SampleTTest µ1 > µ2 t=1.6958 p=.0509 df=25.9567 x1=44 x2=36.5 n = 16 n = 12 x = 44.0 x = 36.5 s = 13.2 s = 10.2 sx1: 13.2 There is not sufficient evidence sx2=10.2 to suggest that the Orozco Oral n1=16 1.6958 method of reading produces n2=12 2.4789 significantly better results than the VanVoorhis Vision method. 99%.01-3σ -2σ -1σ 0 1σ 2σ 3σ

24 Confidence Interval For this example there is no reason to do so, but we can find the 98% confidence interval within Orozco Oral VV Vision n = 16 n = 12 x = 44.0 x = 36.5 s = 13.2 s = 10.2 which we expect to find the true difference (0) between the reading methods. Stat - Tests - 0:2-SampTInt X M X L = 44 36.5 = 7.5 13.22 10.2 2 7.5 ± 2.4789 + 16 12 Slide 24 ( 3.4634, 18.4634) 13.22 10.2 2 7.5 ± 2.7181 + 16 12 ( 4.5213, 19.5213) Stats x1: 44 sx1: 13.2 n1: 16 x2: 36.5 sx2: 10.2 n2: 12 C-Level:.98 Pooled: No (-3.463, 18.463) df=25.9567 x1: 44 x2: 36.5 sx1: 13.2 sx2: 10.2 n1: 16 n2: 12

25 Conclusion ( 3.4634, 18.4634) We are 98% confident that the true difference in reading performance between the Baldon Bio Method and DeAlba Decoding Method is between -3.5 and 18.5. Since 0 is found within the interval there is not sufficient evidence to believe there is a significant difference.