Two-Sample Inferential Statistics

Similar documents
The t-statistic. Student s t Test

Psychology 282 Lecture #4 Outline Inferences in SLR

An inferential procedure to use sample data to understand a population Procedures

Chapter 9 Inferences from Two Samples

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =

Sampling Distributions: Central Limit Theorem

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

Review. One-way ANOVA, I. What s coming up. Multiple comparisons

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

10/31/2012. One-Way ANOVA F-test

Single Sample Means. SOCY601 Alan Neustadtl

Hypothesis testing: Steps

Multiple t Tests. Introduction to Analysis of Variance. Experiments with More than 2 Conditions

One-factor analysis of variance (ANOVA)

HYPOTHESIS TESTING. Hypothesis Testing

Statistics Primer. ORC Staff: Jayme Palka Peter Boedeker Marcus Fagan Trey Dejong

Using SPSS for One Way Analysis of Variance

Hypothesis testing: Steps

Inferences About the Difference Between Two Means

Introduction to the Analysis of Variance (ANOVA)

p = q ˆ = 1 -ˆp = sample proportion of failures in a sample size of n x n Chapter 7 Estimates and Sample Sizes

ANOVA TESTING 4STEPS. 1. State the hypothesis. : H 0 : µ 1 =

Statistics for Managers Using Microsoft Excel Chapter 9 Two Sample Tests With Numerical Data

Chapter 7 Comparison of two independent samples

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

Inferences About Two Proportions

CBA4 is live in practice mode this week exam mode from Saturday!

Difference between means - t-test /25

HYPOTHESIS TESTING II TESTS ON MEANS. Sorana D. Bolboacă

Chapter 5 Confidence Intervals

Introduction to the Analysis of Variance (ANOVA) Computing One-Way Independent Measures (Between Subjects) ANOVAs

Last week: Sample, population and sampling distributions finished with estimation & confidence intervals

Harvard University. Rigorous Research in Engineering Education

Analysis of variance (ANOVA) Comparing the means of more than two groups

Data files. SATpassage.sav sel2elpbook.sav. Homework: - Word_recall.sav - LuckyCharm.sav

Lecture 5: Sampling Methods

MBA 605, Business Analytics Donald D. Conant, Ph.D. Master of Business Administration

PSY 216. Assignment 9 Answers. Under what circumstances is a t statistic used instead of a z-score for a hypothesis test

Last two weeks: Sample, population and sampling distributions finished with estimation & confidence intervals

41.2. Tests Concerning a Single Sample. Introduction. Prerequisites. Learning Outcomes

Chapter 24. Comparing Means. Copyright 2010 Pearson Education, Inc.

Inferential statistics

Chapter 24. Comparing Means

COSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

COGS 14B: INTRODUCTION TO STATISTICAL ANALYSIS

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

df=degrees of freedom = n - 1

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

8/23/2018. One-Way ANOVA F-test. 1. Situation/hypotheses. 2. Test statistic. 3.Distribution. 4. Assumptions

Visual interpretation with normal approximation

STA Module 10 Comparing Two Proportions

1. What does the alternate hypothesis ask for a one-way between-subjects analysis of variance?

10/4/2013. Hypothesis Testing & z-test. Hypothesis Testing. Hypothesis Testing

PSYC 331 STATISTICS FOR PSYCHOLOGISTS

INTRODUCTION TO ANALYSIS OF VARIANCE

STAT Chapter 9: Two-Sample Problems. Paired Differences (Section 9.3)

1 Descriptive statistics. 2 Scores and probability distributions. 3 Hypothesis testing and one-sample t-test. 4 More on t-tests

HYPOTHESIS TESTING SAMPLING DISTRIBUTION. the sampling distribution for di erences of means is. 2 is known. normal if.

Calculating Fobt for all possible combinations of variances for each sample Calculating the probability of (F) for each different value of Fobt

Student s t-distribution. The t-distribution, t-tests, & Measures of Effect Size

M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1

Introduction to Business Statistics QM 220 Chapter 12

Repeated Measures Analysis of Variance

In ANOVA the response variable is numerical and the explanatory variables are categorical.

Advanced Experimental Design

Content by Week Week of October 14 27

Inference for Distributions Inference for the Mean of a Population. Section 7.1

The independent-means t-test:

Statistics for Business and Economics

What p values really mean (and why I should care) Francis C. Dane, PhD

Independent Samples ANOVA

The Chi-Square Distributions

Chapters 4-6: Inference with two samples Read sections 4.2.5, 5.2, 5.3, 6.2

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series

Questions 3.83, 6.11, 6.12, 6.17, 6.25, 6.29, 6.33, 6.35, 6.50, 6.51, 6.53, 6.55, 6.59, 6.60, 6.65, 6.69, 6.70, 6.77, 6.79, 6.89, 6.

The Chi-Square Distributions

Inferential Statistics

10.4 Hypothesis Testing: Two Independent Samples Proportion

Statistics: CI, Tolerance Intervals, Exceedance, and Hypothesis Testing. Confidence intervals on mean. CL = x ± t * CL1- = exp

Quantitative Methods for Economics, Finance and Management (A86050 F86050)

BIO5312 Biostatistics Lecture 6: Statistical hypothesis testings

Statistical Inference for Means

POLI 443 Applied Political Research

COSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan

CHAPTER 10 ONE-WAY ANALYSIS OF VARIANCE. It would be very unusual for all the research one might conduct to be restricted to

Review of Statistics 101

Statistical Foundations:

Stat 705: Completely randomized and complete block designs

Classroom Activity 7 Math 113 Name : 10 pts Intro to Applied Stats

LAB 2. HYPOTHESIS TESTING IN THE BIOLOGICAL SCIENCES- Part 2

Example: Four levels of herbicide strength in an experiment on dry weight of treated plants.

Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc.

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)

Hypothesis Tests Solutions COR1-GB.1305 Statistics and Data Analysis

Lecture 3: Inference in SLR

Unit 27 One-Way Analysis of Variance

Transcription:

The t Test for Two Independent Samples 1 Two-Sample Inferential Statistics In an experiment there are two or more conditions One condition is often called the control condition in which the treatment is not administered The other condition is often called either the treatment condition or the experimental condition; the treatment is administered 2 Example of an Experiment Some people see a brief description of the subject of a passage Other people see nothing Both groups then hear a passage The groups then rate their comprehension of the passage and then recall it What is the control group? What is the experimental group? 3 1

Question Asked in an Experiment The basic question asked in such an experiment is whether the treatment caused an effect That is, are the values of the dependent variables (the measured variables) different in the treatment and control conditions? E.g., did having a context increase comprehension and recall of the passage relative to the control condition? 4 Two-Sample t-tests Inferential statistics are used to answer the primary question in an experiment The particular inferential statistic that you use depends on the experimental design of the study; more about that in Experimental Psychology When there are just two conditions (control and experimental), you often want to use a twosample t-test 5 Types of Two-Sample t Tests Two-sample t tests can be either Independent measures -- using a separate sample for the control and treatment groups, or Also known as a between-subjects design Repeated measures using the same (or matched) groups for the control and treatment conditions Also known as a within-subjects (or matched) design 6 2

Two-Sample t-tests The two-sample t-test is not fundamentally different from the single-sample t-test that we have already discussed There are two primary differences: You have two sample means instead of a single sample mean and a population mean The population standard deviation is unknown, and you have two estimates of it You have one estimate of the population standard deviation from each of the two sample standard deviations 7 Two-Sample t-tests The t-test formula says to take the difference of the means (M 1 M 2 ) and then divide that by the standard error of the difference of the means s (M 1 -M 2 ) Note: This formula only works if n 1 = n 2 8 n 1 n 2 Pooled variance: 9 3

An Example Two groups of students were asked to perform 5 simple tasks at specified times during the next hour One group tied a string (the external memory cue) around their finger to remind them that they had tasks to perform The other group had no external memory cue Did the memory cue change memory 10 performance? An Example The number of tasks completed was recorded for each group Memory Cue No Memory Cue 2 1 2 1 4 3 1 2 3 2 3 1 2 3 2 1 4 4 1 2 n = 10 n = 10 M = 2.8 M = 1.6 SS = 9.6 SS = 4.4 11 Step 1 Write the hypotheses: H 0 : µ memory cue µ no memory cue = 0 H 1 : µ memory cue - µ no memory cue 0 Is it a one-tailed or two-tailed test? Two-tailed the hypothesis does not specify the direction of the change Specify the α level α =.05 12 4

Step 2 Determine the critical regions df = df 1 + df 2 = (n 1 1) + (n 2 1) df = (10 1) + (10 1) = 9 + 9 = 18 Consult a table of critical t values with df = 18, two-tails, α =.05 t critical = 2.101 If the calculated t is larger than 2.101 or smaller than -2.101, reject H 0 13 Step 3 This is an independent samples design (two separate groups of participants) 14 Step 3 15 5

Step 4 Make a decision The calculated t = 3.04 is in the critical region cut off by t critical = 2.101. Reject H 0 It is unlikely that these two samples were drawn from the same population. That is, it is likely the case that the memory cue affected memory 16 Using a Computer Using a computer to perform t-tests is easy The computer will print out the value of t, its degrees of freedom, and a p value The p value is the probability of observing a difference between the means this large due to chance When the p value is less than or equal to the specified α level, you can reject H 0 17 Cohen s d: Effect Size 18 6

Effect Size Cohen s d: Magnitude of d d = 0.2 d = 0.5 d = 0.8 Evaluation of Effect Size Small effect Medium effect Large effect 19 Effect Size Percentage of Variance Explained, r 2 r 2 = 0.01 r 2 = 0.09 r 2 = 0.25 Small effect Medium effect Large effect Both the estimated Cohen s d and the proportion of variance explained claim that this is a large effect 20 Confidence Interval µ 1 µ 2 = M 1 M 2 ± t critical s (M1 M 2 ) 95% CI = 2.8 1.6 ± 2.101 0.3944 0.371 to 2.029 21 7

One-Tailed Example How would the previous example be different if the hypothesis was one-tailed? Did the memory cue improve memory performance? H 0 : µ memory cue µ no memory cue H 1 : µ memory cue > µ no memory cue This is a one-tailed test as the direction of the hypothesized effect is specified α =.05 22 Step 2 Determine the critical regions df = df 1 + df 2 = (n 1 1) + (n 2 1) df = (10 1) + (10 1) = 9 + 9 = 18 Consult a table of critical t values with df = 18, one-tails, α =.05 t critical = 1.734 If the calculated t is larger than 1.734, reject H 0 23 Steps 3 & 4 Step 3, calculating the t, is identical to the two-tailed example t = 3.04 Make a decision The calculated t = 3.04 is in the critical region cut off by t critical = 1.734 Reject H 0 It is likely the case that the memory cue improved memory 24 8

Effect Size & Confidence Interval The effect sizes (estimated Cohen s d and r 2 ) are not influenced by the number of tails Estimated d = 1.36, r 2 =.339 The confidence interval is affected (the critical t value is different) µ 1 µ 2 = M 1 M 2 ± t critical s (M1 M 2 ) 95% CI = 2.8 1.6 ± 1.734 0.3944 0.516 to 1.884 25 Assumptions of Student s t Student s t test makes several assumptions that should be verified prior to accepting the results of a t-test If the assumptions are violated, then you may not be safe in making any conclusions based on the t-value 26 Samples are Independent The data in the two samples should be independent of each other This assumption is violated in repeated measures designs and matched designs 27 9

Normal Distributions The data in each condition should be normally distributed This is often the case in the social sciences Unless the distribution is very non-normal, the t-test will still give good results That is, the t-test is robust -- it will give good results even when some of its assumptions are violated By the central limit theorem, large sample sizes will make the sampling distribution normal 28 Homogeneity of Variance The assumption of homogeneity of variance states that the variance in each condition (i.e. sample) should be equal It is possible to perform a statistical test to determine if the variances are probably different from each other If the variances are found to be different, then a modified form of the t-test should be used Most statistics programs automatically calculate both the normal t-test and the heterogeneous 29 variance t-test 10