Hypothesis Testing. We normally talk about two types of hypothesis: the null hypothesis and the research or alternative hypothesis.

Similar documents
Lab #12: Exam 3 Review Key

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =

1 Descriptive statistics. 2 Scores and probability distributions. 3 Hypothesis testing and one-sample t-test. 4 More on t-tests

Econometrics. 4) Statistical inference

Sampling Distributions: Central Limit Theorem

Chapter 9 Inferences from Two Samples

Your schedule of coming weeks. One-way ANOVA, II. Review from last time. Review from last time /22/2004. Create ANOVA table

Chapter 7 Comparison of two independent samples

Harvard University. Rigorous Research in Engineering Education

One sided tests. An example of a two sided alternative is what we ve been using for our two sample tests:

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)

Hypothesis testing: Steps

Last week: Sample, population and sampling distributions finished with estimation & confidence intervals

Student s t-distribution. The t-distribution, t-tests, & Measures of Effect Size

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

Last two weeks: Sample, population and sampling distributions finished with estimation & confidence intervals

PHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI

Hypothesis testing: Steps

Lecture 5: Hypothesis testing with the classical linear model

Introduction to the Analysis of Variance (ANOVA) Computing One-Way Independent Measures (Between Subjects) ANOVAs

Hypothesis Testing. Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true

Chapter 26: Comparing Counts (Chi Square)

HYPOTHESIS TESTING. Hypothesis Testing

Ordinary Least Squares Regression Explained: Vartanian

Review. One-way ANOVA, I. What s coming up. Multiple comparisons

Confidence Interval Estimation

review session gov 2000 gov 2000 () review session 1 / 38

Biostatistics and Design of Experiments Prof. Mukesh Doble Department of Biotechnology Indian Institute of Technology, Madras. Lecture 11 t- Tests

STA 101 Final Review

DESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya

Lecture 18: Analysis of variance: ANOVA

Hypothesis testing. Data to decisions

Lesson 21 Not So Dramatic Quadratics

Solving Equations. Another fact is that 3 x 4 = 12. This means that 4 x 3 = = 3 and 12 3 = will give us the missing number...

The One-Way Independent-Samples ANOVA. (For Between-Subjects Designs)

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

Econometrics Review questions for exam

Inferences for Correlation

Chapter 27 Summary Inferences for Regression

Power. January 12, 2019

Chapter 24. Comparing Means. Copyright 2010 Pearson Education, Inc.

CBA4 is live in practice mode this week exam mode from Saturday!

Regression, part II. I. What does it all mean? A) Notice that so far all we ve done is math.

Review 6. n 1 = 85 n 2 = 75 x 1 = x 2 = s 1 = 38.7 s 2 = 39.2

Sociology Research Statistics I Final Exam Answer Key December 15, 1993

Quantitative Methods for Economics, Finance and Management (A86050 F86050)

Statistical Inference. Why Use Statistical Inference. Point Estimates. Point Estimates. Greg C Elvers

Statistics Introductory Correlation

Last few slides from last time

MORE ON MULTIPLE REGRESSION

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

Sociology 593 Exam 2 Answer Key March 28, 2002

Ch. 16: Correlation and Regression

This is particularly true if you see long tails in your data. What are you testing? That the two distributions are the same!

1 Correlation and Inference from Regression

Module 7 Practice problem and Homework answers

appstats27.notebook April 06, 2017

Design & Analysis of Experiments 7E 2009 Montgomery

CHAPTER 9: HYPOTHESIS TESTING

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.

Warm-up Using the given data Create a scatterplot Find the regression line

1 Hypothesis testing for a single mean

INTERVAL ESTIMATION AND HYPOTHESES TESTING

Mathematical Notation Math Introduction to Applied Statistics

T-TEST FOR HYPOTHESIS ABOUT

Some Review and Hypothesis Tes4ng. Friday, March 15, 13

Multiple Regression Analysis

Linear Regression & Correlation

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

Chapter 1 Review of Equations and Inequalities

Two sided, two sample t-tests. a) IQ = 100 b) Average height for men = c) Average number of white blood cells per cubic millimeter is 7,000.

Ordinary Least Squares Regression Explained: Vartanian

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

Please bring the task to your first physics lesson and hand it to the teacher.

Originality in the Arts and Sciences: Lecture 2: Probability and Statistics

Math Lecture 3 Notes

STAT/SOC/CSSS 221 Statistical Concepts and Methods for the Social Sciences. Random Variables

Can you tell the relationship between students SAT scores and their college grades?

Business Statistics. Lecture 9: Simple Regression

STA2601. Tutorial Letter 104/1/2014. Applied Statistics II. Semester 1. Department of Statistics STA2601/104/1/2014 TRIAL EXAMINATION PAPER

Political Science 236 Hypothesis Testing: Review and Bootstrapping

Elementary Statistics Triola, Elementary Statistics 11/e Unit 17 The Basics of Hypotheses Testing

MS&E 226: Small Data

Econ 325: Introduction to Empirical Economics

Quantitative Analysis and Empirical Methods

MONT 105Q Mathematical Journeys Lecture Notes on Statistical Inference and Hypothesis Testing March-April, 2016

Chapter 9 Regression with a Binary Dependent Variable. Multiple Choice. 1) The binary dependent variable model is an example of a

Content by Week Week of October 14 27

Regression, Part I. - In correlation, it would be irrelevant if we changed the axes on our graph.

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.

Basic Probability Reference Sheet

Talk Science Professional Development

The t-statistic. Student s t Test

Solution to Proof Questions from September 1st

16.400/453J Human Factors Engineering. Design of Experiments II

Algebra Year 10. Language

Advanced Experimental Design

Statistical Modeling and Analysis of Scientific Inquiry: The Basics of Hypothesis Testing

Lecture 3: Inference in SLR

Transcription:

Hypothesis Testing Today, we are going to begin talking about the idea of hypothesis testing how we can use statistics to show that our causal models are valid or invalid. We normally talk about two types of hypothesis: the null hypothesis and the research or alternative hypothesis. Hypothesis Testing 1

The null hypothesis The hypothesis we actually test is called the null hypothesis. The null hypothesis means there is no difference. For example, if we want to test whether or not the means of two variables (x and y) are equal, our null hypothesis is that they are equal. We d write this as: H 0 : x = ȳ H 0 is just a fancy way to write the null hypothesis. Hypothesis Testing 2

The research hypothesis The other hypothesis is called the research or alternative hypothesis. This hypothesis means that there is a difference. For example, if we are testing the equality of two means, we would write the alternative hypothesis as: H A : x ȳ Again, H A is just another way to write the research hypothesis. Hypothesis Testing 3

An example Last time, we thought about the IQs of political science majors at Ole Miss. Let s assume we found that the mean IQ of all political science majors, based on a sample, was µ x = 107 ± 8. Now, imagine we found the mean IQ of the students in this class, x. Let s test whether the mean IQ of the class was the same as the mean of all political science majors. What would our null and research hypotheses be? H 0 : H A : x = µ x x µ x Hypothesis Testing 4

Why do we use the null? Why do we test the null hypothesis, if it s usually what we want to disprove? The reason is that it s easier to disprove a specific hypothesis (x is not, in fact, equal to y) than it is to prove a non-specific hypothesis. When we disprove the null hypothesis, we call that rejecting the null (or accepting the research hypothesis). Conversely, when we can t disprove the null (we don t find a statistical difference), we fail to reject the null. So, how do we decide whether or not to reject the null hypothesis? Hypothesis Testing 5

A short lesson in rejection It turns out we use the same concepts we use in confidence intervals: specifically, whether we reject the null depends on the alpha level. The alpha level is a statement of how confident we are that the null is correctly rejected; for example, when α =.05, we are 95% sure we correctly rejected the null, but 5% of the time we may have actually rejected it when the null is actually true. The other possibility is that we may accept (or fail to reject) the null hypothesis. If we fail to reject the null, that means we cannot prove our research hypothesis at a given alpha level we aren t confident that the research hypothesis is valid. Hypothesis Testing 6

Types of error We have Type I Error when we reject the null when the null is in fact true. We have Type II Error when we fail to reject the null when the null is in fact false. H 0 is true H 0 is false Reject H 0 Type I Error Correct rejection Fail to reject H 0 Correct fail Type II Error Hypothesis Testing 7

The Z Test Now, we can start talking about some actual statistical tests. For example, let s think about taking a (possibly non-random) sample from a population, and then trying to decide if our sample is typical of the population. This means we want to test H A : µ x. This sort of test is called a Z test. The formula for the Z test is pretty simple: Z ob = x µ σ/ n Hypothesis Testing 8

Z Test Example Let s revisit our example of Ole Miss students. Say the mean IQ of the class x = 115, the mean IQ of all political science undergrads µ = 107 with a standard deviation σ = 14, and we have 33 students in the class (n). Z ob = x µ σ/ n = 115 107 14/ 33 = 8 14/5.74 = 8 2.44 = 3.28 Hypothesis Testing 9

Comparing to the critical value Now we have Z ob = 3.28. We call this Z obtained. To decide whether the Z test is statistically significant, we have to compare Z ob to the critical value of Z for a given alpha level (Z crit ). It turns out that we find Z crit the same way we found critical values for confidence intervals. Specifically, we use Z α/2 : Z.05/2 = Z.025 = 1.96 and Z 0.01/2 = Z.005 = 2.58 Hypothesis Testing 10

The final part of the test So, now all we have to do is compare Z crit to Z ob : H 0 : is { rejected if Zcrit Z ob accepted if Z crit > Z ob Of course, the converse applies to H A ; if we reject H 0, we accept H A, and vice versa. In this example, we would reject the null at both α =.05 and α =.01, because 3.28 is greater than either critical value of Z. Hypothesis Testing 11

The dependent sample t test Often (like in the case of confidence intervals) we may know the population mean but be unaware of the population standard deviation. To solve this problem (again, testing whether µ = x), we use something called the dependent sample t test: t ob = x µ s/ n The formula looks a lot like the Z test, but since we only have the sample standard deviation we have to use the t distribution. Again, we use the same information we used to construct confidence intervals with the t distribution: an alpha level (usually.05 or.01) and a number of degrees of freedom (still n 1). Hypothesis Testing 12

Dependent sample t test example For example, let s think about the clothing expenditures of Greek students relative to other students. We know (somehow) that the average male student spends $50.00 a month on clothing, but we don t know the standard deviation. Now, imagine that we take a sample of 25 fraternity members, and find that the mean expenditure is $60.00 with a sample standard deviation of $20.00. Using this information, we can do a dependent sample t test, with an alpha level of.02: t ob = x µ s/ n = $60.00 $50.00 $20.00/ 25 = $10 $20/5 = $10 $4 = 2.5 Hypothesis Testing 13

Compare t obtained with t critical The next step, like the Z test, is to compare the obtained value of t with the critical value of t for the given alpha level and number of degrees of freedom. For α =.02 and df = 25 1 = 24, the critical value is 2.492. And, again, all we do is compare: H 0 : is { rejected if tcrit t ob accepted if t crit > t ob Since 2.492 2.5, we reject the null hypothesis and conclude that frat guys do spend a different amount than non-frat guys on clothing. Hypothesis Testing 14

The independent samples t test This is all well and good, but what if we want to compare two samples from the same or different populations? For example, what if we want to compare the views of blacks on affirmative action to the views of whites, rather than the views of the population as a whole? The solution to this problem is called the independent samples t test. The formula for the problem is pretty ugly, but bear in mind that there s nothing here you can t already do: it s just addition, subtraction, multiplication, division, and square roots. (Also bear in mind that it will be on a formula sheet on an exam!) Hypothesis Testing 15

The formula in all its glory The formula is generally divided into two parts: t ob = x 1 x 2 spooled (1/n 1 + 1/n 2 ) s pooled = (n 1 1)s 2 1 + (n 2 1)s 2 2 n 1 + n 2 2 The s pooled part weighs the two sample standard deviations to produce a weighted standard error. The formula is much simpler when n 1 and n 2 (and s 1 and s 2 ) are equal, but this is rare. Hypothesis Testing 16

How do we calculate this? It s probably easiest to calculate the s pooled part first, then plug the result into the larger formula. Let s say we obtain survey data on the number of social events attended by fraternity members and non-fraternity males in a given month, and want to see if frat guys go to more events than non-frat guys. Hypothesis Testing 17

Frat Data We find the following data (where group 1 is frat guys and group 2 is non-frat guys): x 1 = 15 x 2 = 11 s 1 = 3 s 2 = 4 n 1 = 33 n 2 = 29 Let s decide whether the difference is statistically significant, with α =.01. Hypothesis Testing 18

Calculations s pooled = (n 1 1)s 2 1 + (n 2 1)s 2 2 n 1 + n 2 2 (32)(9) + (28)(16) = = 60 x 1 x 2 t ob = spooled (1/n 1 + 1/n 2 ) = (33 1)32 + (29 1)4 2 33 + 29 2 288 + 448 60 = 73660 = 12.2666. = = 15 11 (12.2666)(1/33 + 1/29) = 4 (12.2666)(.0303 +.0344) = 4 (12.2666)(.0648) = 4 = 4 0.794 0.891 = 4.487 Hypothesis Testing 19

Determining t critical Again, we need to compare the t-obtained (t ob ) with the critical value of t. We already know the alpha level (.01), as it was given in the problem; what we don t know is the number of degrees of freedom to use (n 1 won t work, because we now have two n s!). It turns out that we use df = n 1 + n 2 2 in this case, because we are dealing with two independent samples. (In terms of the free parameters problem, we d only need n 1 + n 2 2 data points in addition to our statistics to figure out the data for both groups.) So, in this case, we use 29 + 33 2 = 60 degrees of freedom. It turns out that, after we look it up in Appendix 2, t crit = 2.66. Hypothesis Testing 20

Determining the validity of our hypothesis Now, we compare t ob and t crit the way we always do: H 0 : is { rejected if tcrit t ob accepted if t crit > t ob Since 2.66 4.487, we reject the null hypothesis and conclude that frat guys do have more fun. Hypothesis Testing 21

The difference (matched or paired) t test Now, let s imagine that instead of wanting to compare two groups, we want to compare the response of the same group of people in two situations. For example, we might want to compare voters evaluations of Al Gore before and after he grew his beard, or compare the response of Bill the Cat to heavy metal and easy listening music. This sort of comparison requires the use of the difference t test (or the matched or paired t test). This is the sort of test you d use in an experiment, quasiexperiment, or panel study, where you want to examine the same subjects on different occasions. The difference t test is a bit different in that we need to have the original data available to do the comparison; we can t use the mean and standard deviation of all the subjects from before and after the event. (Why not?) Hypothesis Testing 22

The formula The formula comes in three parts; luckily, they re simpler than the parts of the independent-samples t test: t ob = d s d / n, where d = d n, and s d = d2 ( d) 2 /n n 1 Note that the last two parts are really the sample mean and standard deviation formulas for d we just put d in there instead of x. But what is d? It s simply the difference between the subject s before and after values. For example, if the subject gave Al Gore a rating of 60 when he was beardless and 40 when he had a beard, d = 60 40 = 20. Hypothesis Testing 23

Al: Thanks for shaving Here s some example data: Individual Rating of Gore w/o beard Rating of Gore w/beard 1 80 60 2 85 87 3 17 21 4 18 27 5 50 40 6 10 10 7 98 54 Now, let s figure out whether there is a significant difference between people s attitudes towards Al Gore before and after he grew a beard, with a 95% confidence level (α =.05). Hypothesis Testing 24

Calculate d and d-squared The first step is to find the mean and standard deviation of d. That will require us to know d, d 2, d, and d 2 : Individual Rating of Gore w/o beard Rating of Gore w/beard d d 2 1 80 60 +20 400 2 85 87 2 4 3 17 21 4 16 4 18 27 9 81 5 50 40 +10 100 6 10 10 0 0 7 98 54 +44 1936 +59 2537 Hypothesis Testing 25

Calculations The mean difference is d = 59/7 = 8.42 and the standard deviation of the differences is: s d = = d2 ( d) 2 /n 2537 (59)2 /7 2537 3481/7 = = n 1 6 6 2537 497.2857 2039.7143 = = 339.95 = 18.438 6 6 Now, we can calculate the t ob value: t ob = d s d / n = 8.42 18.438/ 7 = 8.42 6.97 = 1.21 Hypothesis Testing 26

Determining the validity of our hypothesis We use the same number of degrees of freedom we d use for a dependent samples t test (n 1); so, with α =.05 and df = 7 1 = 6, t crit = 2.447. Since 2.447 > 1.21, we fail to reject the null hypothesis and thus conclude that Al Gore s beard did not significantly affect public opinion about him. Hypothesis Testing 27

Homework There was a lot of material in this chapter. I strongly recommend doing all of the practice exercises; however, I will not collect the assignment. If you have any questions on how to solve these problems, or want your work checked, see me. You may want to read pages 155 56 about one-tailed test statistics. These are used when you want to test nulls involving comparisons, like µ x. However, political scientists generally rely on two-tailed tests in our more sophisticated models. Hypothesis Testing 28

The Road Ahead Next time, we will begin discussing a technique called Analysis of Variance (ANOVA Chapter 11). We will return to bivariate regression and correlation (Chapter 10) after we discuss ANOVAs. Then, we will discuss how to interpret statistics presented in journal articles and briefly discuss the concepts behind other statistical techniques that are common in political science multivariate regression and models with limited dependent variables (logit, probit) with the goal of helping you understand the sorts of articles you might encounter in an upper-division course. Hypothesis Testing 29