Quantitative Analysis and Empirical Methods

Similar documents
POLI 443 Applied Political Research

Chi-Square. Heibatollah Baghi, and Mastee Badii

Lecture 41 Sections Mon, Apr 7, 2008

The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions.

Psych 230. Psychological Measurement and Statistics

Class 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

STP 226 ELEMENTARY STATISTICS NOTES

Chapter 8 Student Lecture Notes 8-1. Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance

HYPOTHESIS TESTING. Hypothesis Testing

Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance ECON 509. Dr.

INTERVAL ESTIMATION AND HYPOTHESES TESTING

POLI 443 Applied Political Research

HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC

Last two weeks: Sample, population and sampling distributions finished with estimation & confidence intervals

Review of Statistics 101

Econ 325: Introduction to Empirical Economics

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

Chi square test of independence

Last week: Sample, population and sampling distributions finished with estimation & confidence intervals

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

Chi square test of independence

Statistics for Managers Using Microsoft Excel

There are statistical tests that compare prediction of a model with reality and measures how significant the difference.

Black White Total Observed Expected χ 2 = (f observed f expected ) 2 f expected (83 126) 2 ( )2 126

Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution

An inferential procedure to use sample data to understand a population Procedures

23. MORE HYPOTHESIS TESTING

Statistical Inference for Means

The One-Way Repeated-Measures ANOVA. (For Within-Subjects Designs)

Hypothesis Testing and Confidence Intervals (Part 2): Cohen s d, Logic of Testing, and Confidence Intervals

15: CHI SQUARED TESTS

Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017

Rama Nada. -Ensherah Mokheemer. 1 P a g e

Inferential statistics

Confidence Interval Estimation

Acknowledge error Smaller samples, less spread

Class 19. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700

Statistics for IT Managers

Chapter Eight: Assessment of Relationships 1/42

Statistical methods for comparing multiple groups. Lecture 7: ANOVA. ANOVA: Definition. ANOVA: Concepts

Quantitative Methods for Economics, Finance and Management (A86050 F86050)

Weldon s dice. Lecture 15 - χ 2 Tests. Labby s dice. Labby s dice (cont.)

10.2 Hypothesis Testing with Two-Way Tables

Unit 27 One-Way Analysis of Variance

Multiple Regression Analysis

The Chi-Square Distributions

Hypothesis testing. Data to decisions

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

Testing Research and Statistical Hypotheses

Business Statistics: Lecture 8: Introduction to Estimation & Hypothesis Testing

Tables Table A Table B Table C Table D Table E 675

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

The t-statistic. Student s t Test

Single Sample Means. SOCY601 Alan Neustadtl

MBA 605, Business Analytics Donald D. Conant, Ph.D. Master of Business Administration

Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12)

12.10 (STUDENT CD-ROM TOPIC) CHI-SQUARE GOODNESS- OF-FIT TESTS

Example. χ 2 = Continued on the next page. All cells

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

Soc 3811 Basic Social Statistics Second Midterm Exam Spring Your Name [50 points]: ID #: ANSWERS

Module 10: Analysis of Categorical Data Statistics (OA3102)

Chapter 26: Comparing Counts (Chi Square)

COSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan

1 Descriptive statistics. 2 Scores and probability distributions. 3 Hypothesis testing and one-sample t-test. 4 More on t-tests

Quantitative Introduction ro Risk and Uncertainty in Business Module 5: Hypothesis Testing

Preliminary Statistics Lecture 5: Hypothesis Testing (Outline)

(x t. x t +1. TIME SERIES (Chapter 8 of Wilks)

CHAPTER 9: HYPOTHESIS TESTING

Introduction to Survey Analysis!

Dummies and Interactions

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Basic Business Statistics, 10/e

STP 226 EXAMPLE EXAM #3 INSTRUCTOR:

Understanding p Values

February 5, 2018 START HERE. measurement scale. Means. number of means. independent measures t-test Ch dependent measures t-test Ch 16.

Mathematical Notation Math Introduction to Applied Statistics

Statistics: revision

PHP2510: Principles of Biostatistics & Data Analysis. Lecture X: Hypothesis testing. PHP 2510 Lec 10: Hypothesis testing 1

Chapter 10. Chapter 10. Multinomial Experiments and. Multinomial Experiments and Contingency Tables. Contingency Tables.

Lecture 7: Hypothesis Testing and ANOVA

Business Statistics. Lecture 10: Course Review

STAT 515 fa 2016 Lec Statistical inference - hypothesis testing

Purposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions

The Chi-Square Distributions

χ 2 Test for Frequencies January 15, 2019 Contents

Sampling distribution of t. 2. Sampling distribution of t. 3. Example: Gas mileage investigation. II. Inferential Statistics (8) t =

Outline for Today. Review of In-class Exercise Bivariate Hypothesis Test 2: Difference of Means Bivariate Hypothesis Testing 3: Correla

CHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC

STAT Chapter 8: Hypothesis Tests

MA : Introductory Probability

UC Berkeley Math 10B, Spring 2015: Midterm 2 Prof. Sturmfels, April 9, SOLUTIONS

An Analysis of College Algebra Exam Scores December 14, James D Jones Math Section 01

CONTINUOUS RANDOM VARIABLES

MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. describes the.

General Certificate of Education Advanced Level Examination June 2014

Nominal Data. Parametric Statistics. Nonparametric Statistics. Parametric vs Nonparametric Tests. Greg C Elvers

Part III: Unstructured Data

10.4 Hypothesis Testing: Two Independent Samples Proportion

Transcription:

Hypothesis testing Sciences Po, Paris, CEE / LIEPP

Introduction Hypotheses Procedure of hypothesis testing Two-tailed and one-tailed tests Statistical tests with categorical variables

A hypothesis A testable statement about relationships between characteristics Since Karl Popper, scientific inquiry is not expected to prove facts, but rather to falsify or confirm theoretical postulates. The logic we take when testing hypotheses in statistical methods is thus a negative logic: Each hypothesis has a logical opposite which we call the null hypothesis and denote it H 0. In statistics we often set up a null hypothesis which we seek to reject. If we reject the null, then the hypothesis of interest is supported by our analysis.

Example from last lecture X N(5, 16) Is 7.5 significantly different from the mean of X?

Example from last lecture X N(5, 16) Is 7.5 significantly different from the mean of X? H 1 : 7.5 X H 0 : 7.5 = X

Example from last lecture X N(5, 16) Is 7.5 significantly different from the mean of X? H 1 : 7.5 X H 0 : 7.5 = X 7.5 in terms of Z-scores: Z = 7.5 5 4 =.625.625 is clearly within the [ 1.96, +1.96] interval, thus 7.5 is too close to X. We fail to reject the null hypothesis. That means that H 0 stands and H 1 is not supported. 7.5 is not significantly different from X.

Hypothesis Testing Procedure 1 State a null and alternative hypothesis: H 0 : µ = µ 0, H a : µ µ 0 2 Select a level of significance of interest: α =.05 (we want to be 95% sure.) 3 Determine the sampling distribution of the test statistic. (If we are dealing with a means test and we know σ, we use the standard normal distribution and its Z statistic, if we are dealing with a means test and we don t know σ we use Student s t distribution and the T statistic.) 4 Calculate the test statistic (for z: z = X µ σ ) 5 Find the critical value in the appropriate statistical table 6 Make a conclusion about the null hypothesis (reject or fail to reject)

Test of Statistical Significance Do men and women view gay marriage differently? A feeling thermometer on gay marriage 0=fully oppose; 100=fully support Poll: Women X = 51, s = 4; men X = 46, s = 8 Difference: 51 46 = 5; N=100 women, 100 men Does the sample difference reflect the population difference or just sampling error?

1. Stating the hypotheses H a : There is a difference in women s and men s feelings toward gay marriage in the population H 0 : There is NO difference in women s and men s feelings toward gay marriage in the population.

2. Deciding the significance level Two possible errors we can commit in statistics: Type I error: finding a relationship where there is none (false positive) Type II error: finding no relationship where there is one (false negative) Usually select significance level α = 0.05 (or 5%) Rejecting H 0 will commit Type I error (false positive) no more than 5 times in 100 Rejecting H 0 only if the observation (the difference of 5 between women and men) could have occurred by chance fewer than 5 times out of 100.

3. The sampling distribution Comparing two means CLT normal distribution T or Z? N < 1000, so prefer T

4. The test statistic As before we take the observed or expected value and subtract our null from it: T = Ha H0 se diff But need to calculate the s.e. of the difference se diff = se1 2 + se2 2 = sewomen 2 + semen 2 se w = s N = 4 100 = 0.4; se m = s N = 8 se diff = se1 2 + se2 2 = 0.4 2 + 0.8 2 = 0.894 Back to T: T = Ha H0 diff 0 se diff = se diff = 5 0 0.894 = 5.593 100 = 0.8

5. and 6. Critical value and Conclusion How likely are we to get a T value of 5.593 if H 0 were true? Same as asking: What is the probability of scoring 5.593 on the T-distribution? (df=n1+n2-2) T-table The cutoff at the 0.05 significance level is about 1.984, so it is extremely unlikely to get 5.593 by chance. Conclusion: Reject H 0. The difference of 5 is statistically significant. There is a significant difference between women s and men s feelings towards gay marriage. Women are significantly more in support.

5. and 6. Critical value and Conclusion Alternatively, using confidence intervals: A 95% confidence interval around the difference (5) would be X ± t se = 5 ± 1.984 0.894 = 5 ± 1.774 The 95% confidence interval is [3.226; 6.774] Conclusion: 95 times out of a 100, the sample difference in women s and men s feelings on gay marriage will lie between 3.226 and 6.774. We are thus confident (at the 0.05 level) that there is a true difference between their opinion in the population.

Two-Tailed v. One-Tailed Tests Until now, we have been doing our tests as if we had no expectation about the direction in which we expect 0 to lie. As a result, when we were testing whether our observed value is significantly different from, say, 0, we looked at both ends (or tails) of the distribution of our statistic of interest. This was a two-tailed test. In reality, we often have theoretical expectations about the direction where 0 lies. If we find a value of, say, 5 (such as in our example), and question whether it is significantly different from 0, why should we look for 0 on the right tail? It will not be there. Consequently, when testing whether a statistic is significantly different from 0, we would expect 0 to be on one particular side of the distribution. Here we can do a one-tailed test.

Two-Tailed v. One-Tailed Tests

Testing relationships between categorical variables We want to test how cases are dispersed across the dependent variable H 0 = every category of the IV should have the same distribution as the total, i.e. the IV does not matter. Party ID and career crosstabulation Law Politics Business Education Total Republican N 6 2 5 1 14 % 42.9 14.3 35.7 7.1 100 Democrat N 10 10 2 2 24 % 41.7 41.7 8.3 8.3 100 Other N 6 5 7 3 21 % 28.6 23.8 33.3 14.3 100 Total N 22 17 14 6 59 % 37.3 28.8 23.7 10.2 100

χ 2 Test To test H 0, we use the χ 2 (read chi-squared) test This test compares each observed frequency (fo) with the expected (total) frequency (fe) E.g. if H 0 is correct, 37.3% of the 14 republicans (=5.22) should want to go to into law; and 28.8% of the 14 Republicans (=4.03) should want to go into politics Test: sum the squared differences, divide by the expected frequency: χ 2 = N i=1 (fo i fe i ) 2 fe i ; N=number of cells (12) Party ID and career crosstabulation Law Politics Business Education Total Republican N 6 2 5 1 14 % 42.9 14.3 35.7 7.1 100 Democrat N 10 10 2 2 24 % 41.7 41.7 8.3 8.3 100 Other N 6 5 7 3 21 % 28.6 23.8 33.3 14.3 100 Total N 22 17 14 6 59 % 37.3 28.8 23.7 10.2 100

χ 2 Test The χ 2 test: χ 2 = N (fo i fe i ) 2 i=1 fe i = (6 5.2) 2 /5.2 + (2 4.0) 2 /4.0 +... = 7.87 Apply this value to the χ 2 distribution with appropriate degrees of freedom Df=(number of rows - 1)*(number of columns - 1) = (3-1)*(4-1)=6 Party ID and career crosstabulation Law Politics Business Education Total Republican N 6 2 5 1 14 exp N 5.2 4.0 3.3 1.4 14 % 42.9 14.3 35.7 7.1 100 Democrat N 10 10 2 2 24 exp N 8.9 6.9 5.7 2.4 24 % 41.7 41.7 8.3 8.3 100 Other N 6 5 7 3 21 exp N 7.8 6.1 5.0 2.1 21 % 28.6 23.8 33.3 14.3 100 Total N 22 17 14 6 59 % 37.3 28.8 23.7 10.2 100

χ 2 Test Our value of χ 2 is 7.78 What is the critical value of χ 2 at the 0.05 confidence level with 6 df? Chi2-table The answer is 12.592. Our χ 2 is smaller than the critical value, so it is possible that 7.87 could occur more than 5 times out of 100 by random chance. We fail to reject H 0 ; there is no statistically significant difference between party ID and career choice.