STAT Chapter 9: Two-Sample Problems. Paired Differences (Section 9.3)

Similar documents
STAT Chapter 8: Hypothesis Tests

Chapter 9 Inferences from Two Samples

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

2011 Pearson Education, Inc

Midterm 1 and 2 results

STAT Section 3.4: The Sign Test. The sign test, as we will typically use it, is a method for analyzing paired data.

Chapter 7: Statistical Inference (Two Samples)

Chapter 12 - Lecture 2 Inferences about regression coefficient

The t-statistic. Student s t Test

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

A proportion is the fraction of individuals having a particular attribute. Can range from 0 to 1!

Chapter 22. Comparing Two Proportions. Bin Zou STAT 141 University of Alberta Winter / 15

Two-Sample Inferential Statistics

Population 1 Population 2

Chapter 8. Inferences Based on a Two Samples Confidence Intervals and Tests of Hypothesis

Harvard University. Rigorous Research in Engineering Education

LECTURE 12 CONFIDENCE INTERVAL AND HYPOTHESIS TESTING

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.

ECO220Y Review and Introduction to Hypothesis Testing Readings: Chapter 12

Problem Set 4 - Solutions

Inferences About Two Proportions

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

COSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan

Chapter 9. Inferences from Two Samples. Objective. Notation. Section 9.2. Definition. Notation. q = 1 p. Inferences About Two Proportions

CHAPTER 7. Hypothesis Testing

Inferences About Two Population Proportions

SMAM 314 Exam 42 Name

Data Analysis and Statistical Methods Statistics 651

Objectives Simple linear regression. Statistical model for linear regression. Estimating the regression parameters

10.4 Hypothesis Testing: Two Independent Samples Proportion

INTERVAL ESTIMATION OF THE DIFFERENCE BETWEEN TWO POPULATION PARAMETERS

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =

Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

MBA 605, Business Analytics Donald D. Conant, Ph.D. Master of Business Administration

Chapter 9. Hypothesis testing. 9.1 Introduction

Chapter Six: Two Independent Samples Methods 1/51

Introduction to Survey Analysis!

STAT 328 (Statistical Packages)

Lecture 10: Comparing two populations: proportions

Single Sample Means. SOCY601 Alan Neustadtl

Chapter 22. Comparing Two Proportions 1 /29

Chapter 22. Comparing Two Proportions 1 /30

Exam 2 (KEY) July 20, 2009

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

T test for two Independent Samples. Raja, BSc.N, DCHN, RN Nursing Instructor Acknowledgement: Ms. Saima Hirani June 07, 2016

COGS 14B: INTRODUCTION TO STATISTICAL ANALYSIS

PHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1

Two Sample Problems. Two sample problems

Mock Exam - 2 hours - use of basic (non-programmable) calculator is allowed - all exercises carry the same marks - exam is strictly individual

5 Basic Steps in Any Hypothesis Test

Keppel, G. & Wickens, T.D. Design and Analysis Chapter 2: Sources of Variability and Sums of Squares

STAT Section 2.1: Basic Inference. Basic Definitions

Chapter 7 Comparison of two independent samples

The Components of a Statistical Hypothesis Testing Problem

THE ROYAL STATISTICAL SOCIETY 2015 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE MODULE 3

Probability and Samples. Sampling. Point Estimates

Sampling Distributions: Central Limit Theorem

Lecture 11 - Tests of Proportions

Inference for Proportions, Variance and Standard Deviation

Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t

# of 6s # of times Test the null hypthesis that the dice are fair at α =.01 significance

The independent-means t-test:

Stat 427/527: Advanced Data Analysis I

Probability and Statistics

TABLES AND FORMULAS FOR MOORE Basic Practice of Statistics

2.57 when the critical value is 1.96, what decision should be made?

Difference between means - t-test /25

QUEEN S UNIVERSITY FINAL EXAMINATION FACULTY OF ARTS AND SCIENCE DEPARTMENT OF ECONOMICS APRIL 2018

Statistical Methods in Natural Resources Management ESRM 304

Midterm 2. Math 205 Spring 2015 Dr. Lily Yen

hp calculators HP 20b Probability Distributions The HP 20b probability distributions Practice solving problems involving probability distributions

Chapters 4-6: Inference with two samples Read sections 4.2.5, 5.2, 5.3, 6.2

Inferences Based on Two Samples

their contents. If the sample mean is 15.2 oz. and the sample standard deviation is 0.50 oz., find the 95% confidence interval of the true mean.

Confidence intervals and Hypothesis testing

An inferential procedure to use sample data to understand a population Procedures

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong

One-factor analysis of variance (ANOVA)

Data Analysis and Statistical Methods Statistics 651

The Chi-Square Distributions

Hypothesis testing: Steps

CHAPTER 9, 10. Similar to a courtroom trial. In trying a person for a crime, the jury needs to decide between one of two possibilities:

Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12)

hypotheses. P-value Test for a 2 Sample z-test (Large Independent Samples) n > 30 P-value Test for a 2 Sample t-test (Small Samples) n < 30 Identify α

The Chi-Square Distributions

COSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan

Section I Questions with parts

7 Estimation. 7.1 Population and Sample (P.91-92)

Inference for Single Proportions and Means T.Scofield

Last two weeks: Sample, population and sampling distributions finished with estimation & confidence intervals

Review 6. n 1 = 85 n 2 = 75 x 1 = x 2 = s 1 = 38.7 s 2 = 39.2

Chapter 23: Inferences About Means

Chapter 6. Estimates and Sample Sizes

y response variable x 1, x 2,, x k -- a set of explanatory variables

EXAM 3 Math 1342 Elementary Statistics 6-7

Permutation Tests. Noa Haas Statistics M.Sc. Seminar, Spring 2017 Bootstrap and Resampling Methods

Independent Samples t tests. Background for Independent Samples t test

Confidence Intervals with σ unknown

Transcription:

STAT 515 -- Chapter 9: Two-Sample Problems Paired Differences (Section 9.3) Examples of Paired Differences studies: Similar subjects are paired off and one of two treatments is given to each subject in the pair. or We could have two observations on the same subject. The key: With paired data, the pairings cannot be switched around without affecting the analysis. We typically wish to perform inference about the mean of the differences, denoted D. Example 1: Six students are given two tests, one after being fed, and one on an empty stomach. Is there evidence that students perform better on a full stomach? (Assume normality of data, and use =.05.) Student Scores 1 3 4 5 6 X 1 (with food) 74 71 8 77 7 81 X (without food) 68 71 86 70 67 80

Calculate differences: D = X 1 X D: Example : Find a 98% CI for the mean difference in arm strength for right-handed people (measured by the number of seconds a certain weight can be held extended). Person 1 3 4 5 6 7 X 1 (Right) 6 35 17 47 16 3 X (Left) 0 31 10 38 3 16 9 D:

Interpretation: With 98% confidence, the mean rightarm strength is between 0.336 seconds less and 8.336 seconds greater than the mean left-arm strength. (We are 98% confident the mean difference is between -0.336 and 8.336 seconds.) Note: With paired data, the two-sample problem really reduces to a one-sample problem on the sample of differences.

Two Independent Samples (Section 9.) Sometimes there s no natural pairing between samples. Example 1: Collect sample of males and sample of females and ask their opinions on whether capital punishment should be legal. Example : Collect sample of iron pans and sample of copper pans and measure their resiliency at high temperatures. No attempt made to pair subjects we have two independent samples. We could rearrange the order of the data and it wouldn t affect the analysis at all.

Comparing Two Means Our goal is to compare the mean responses to two treatments, or to compare two population means (we have two separate samples). We assume both populations are normally distributed (or nearly normal). We re typically interested in the difference between the mean of population 1 ( 1 ) and the mean of population ( ). We may construct a CI for 1 or perform one of three types of hypothesis test: H 0 : 1 = H 0 : 1 = H 0 : 1 = H a : 1 H a : 1 < H a : 1 > Note: H 0 could be written H 0 : 1 = 0. The parameter of interest is Notation: X _ 1 = mean of Sample 1 X _ = mean of Sample 1 = standard deviation of Population 1 = standard deviation of Population s 1 = standard deviation of Sample 1

s = standard deviation of Sample n 1 = size of Sample 1 n = size of Sample The point estimate of 1 is This statistic has standard error but we use since 1, unknown. Since the data are normal, we can use the t-procedures for inference. Case I: Unequal population variances ( 1 ) In the case where the two populations have different variances, the t-procedures are only approximate. Formula for (1 )100% CI for 1 is: where the d.f. = the smaller of n 1 1 and n 1.

To test H 0 : 1 =, the test statistic is: H a Rejection region P-value 1 t < -t / or t > t / *(tail area) 1 < t < -t left tail area 1 > t > t right tail area where the d.f. = the smaller of n 1 1 and n 1. Case II: Equal population variances ( 1 = ) In the case where the two populations have equal variances, we can better estimate this population variance with the pooled sample variance: s p ( n 1 1) s1 ( n 1) s n n 1 Our t-procedures in this case are exact, not approximate.

Formula for (1 )100% CI for 1 is: where the d.f. = n 1 + n. To test H 0 : 1 =, the test statistic is: H a Rejection region P-value 1 t < -t / or t > t / *(tail area) 1 < t < -t left tail area 1 > t > t right tail area where the d.f. = n 1 + n.

Example: What is the difference in mean DVD prices at Best Buy and Walmart? Let 1 = mean DVD price at Best Buy and let = mean DVD price at Walmart. Find 99% CI for 1. Randomly sample 8 DVDs from Best Buy: X _ 1 = 17.93, s 1 = 10., s 1 = 104.45, n 1 = 8. Randomly sample 0 DVDs from Walmart: X _ = 5.70, s = 11.35, s = 18.8, n = 0. Does 1 =? Could test this formally using an F-test (Sec. 9.5) or could simply compare spreads of box plots for samples 1 and. When in doubt, assume 1. Let s assume 1 here. 99% CI for 1 :

Interpretation: We are 99% confident that Best Buy s mean DVD price is between $16.89 lower and $1.35 higher than Walmart s mean DVD price. Test: H 0 : 1 = vs. H a : 1 < (at =.10) Test statistic:

Inference about Two Proportions (Sec. 9.4) We now consider inference about p 1 p, the difference between two population proportions. Point estimate for p 1 p is For large samples, this statistic has an approximately normal distribution with mean p 1 p and standard p1 (1 p1) p(1 p ) deviation n n. 1 So a (1 )100% CI for p 1 p is ˆp 1 = sample proportion for Sample 1 ˆp = sample proportion for Sample n 1 = sample size of Sample 1 n = sample size of Sample Requires large samples: (1) Need n 1 0 and n 0. () Need number of successes and number of failures to be 5 or more in both samples.

Test of H 0 : p 1 = p Test statistic: (Use pooled proportion because under H 0, p 1 and p are the same.) Pooled sample proportion pˆ = Example: Let p 1 = the proportion of male USC students who park on campus and let p = the proportion of female students who park on campus. Find a 95% CI for the difference in the true proportion of males and the true proportion of females who park at USC. Take a random sample of 50 males; 3 park at USC. Take a random sample of 60 females; 34 park at USC.

Interpretation: We are 95% confident that the proportion of males who park at USC is between.110 lower and.56 higher than the proportion of females who park at USC. Hypothesis Test: Is the proportion of males who park greater than the proportion of females who park?