HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC

Similar documents
CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Chi-Square. Heibatollah Baghi, and Mastee Badii

Inferential statistics

Chapter 10. Chapter 10. Multinomial Experiments and. Multinomial Experiments and Contingency Tables. Contingency Tables.

The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions.

Nominal Data. Parametric Statistics. Nonparametric Statistics. Parametric vs Nonparametric Tests. Greg C Elvers

Lecture 28 Chi-Square Analysis

Chi Square Analysis M&M Statistics. Name Period Date

STP 226 ELEMENTARY STATISTICS NOTES

11-2 Multinomial Experiment

Statistics for Managers Using Microsoft Excel

Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017

THE PEARSON CORRELATION COEFFICIENT

10: Crosstabs & Independent Proportions

Frequency Distribution Cross-Tabulation

Quantitative Analysis and Empirical Methods

10.2: The Chi Square Test for Goodness of Fit

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

Lecture Slides. Section 13-1 Overview. Elementary Statistics Tenth Edition. Chapter 13 Nonparametric Statistics. by Mario F.

Visual interpretation with normal approximation

Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc.

10.2 Hypothesis Testing with Two-Way Tables

CHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC

Statistics in medicine

POLI 443 Applied Political Research

Bio 183 Statistics in Research. B. Cleaning up your data: getting rid of problems

Chapter Eight: Assessment of Relationships 1/42

Inference for Categorical Data. Chi-Square Tests for Goodness of Fit and Independence

Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution

HYPOTHESIS TESTING. Hypothesis Testing

LOOKING FOR RELATIONSHIPS

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

Chi-Squared Tests. Semester 1. Chi-Squared Tests

STAT Chapter 13: Categorical Data. Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure).

Example. χ 2 = Continued on the next page. All cells

Hypothesis Testing. Hypothesis: conjecture, proposition or statement based on published literature, data, or a theory that may or may not be true

Statistics - Lecture 04

STP 226 EXAMPLE EXAM #3 INSTRUCTOR:

Chapter Fifteen. Frequency Distribution, Cross-Tabulation, and Hypothesis Testing

Spearman Rho Correlation

Ch. 11 Inference for Distributions of Categorical Data

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)

Testing Independence

Contingency Tables. Safety equipment in use Fatal Non-fatal Total. None 1, , ,128 Seat belt , ,878

Stat 529 (Winter 2011) Experimental Design for the Two-Sample Problem. Motivation: Designing a new silver coins experiment

Bivariate Relationships Between Variables

12.10 (STUDENT CD-ROM TOPIC) CHI-SQUARE GOODNESS- OF-FIT TESTS

February 5, 2018 START HERE. measurement scale. Means. number of means. independent measures t-test Ch dependent measures t-test Ch 16.

Chapte The McGraw-Hill Companies, Inc. All rights reserved.

Lecture 41 Sections Mon, Apr 7, 2008

:the actual population proportion are equal to the hypothesized sample proportions 2. H a

Psych 230. Psychological Measurement and Statistics

ST4241 Design and Analysis of Clinical Trials Lecture 9: N. Lecture 9: Non-parametric procedures for CRBD

Analysis of Variance: Part 1

Nonparametric Statistics. Leah Wright, Tyler Ross, Taylor Brown

The Chi-Square Distributions

Finding Relationships Among Variables

Module 10: Analysis of Categorical Data Statistics (OA3102)

Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance ECON 509. Dr.

Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12)

χ 2 Test for Frequencies January 15, 2019 Contents

Math 152. Rumbos Fall Solutions to Exam #2

Topic 21 Goodness of Fit

PSY 307 Statistics for the Behavioral Sciences. Chapter 20 Tests for Ranked Data, Choosing Statistical Tests

Unit 9: Inferences for Proportions and Count Data

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV

Single Sample Means. SOCY601 Alan Neustadtl

Discrete Multivariate Statistics

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Tables Table A Table B Table C Table D Table E 675

Section VII. Chi-square test for comparing proportions and frequencies. F test for means

Unit 9: Inferences for Proportions and Count Data

Chapter 9 Inferences from Two Samples

Section 9.1 (Part 2) (pp ) Type I and Type II Errors

We know from STAT.1030 that the relevant test statistic for equality of proportions is:

Chapter 8 Student Lecture Notes 8-1. Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance

Data Analysis: Agonistic Display in Betta splendens I. Betta splendens Research: Parametric or Non-parametric Data?

Mathematical Notation Math Introduction to Applied Statistics

Nonparametric Statistics

Non-parametric Hypothesis Testing

Agonistic Display in Betta splendens: Data Analysis I. Betta splendens Research: Parametric or Non-parametric Data?

AP Statistics Cumulative AP Exam Study Guide

2.3 Analysis of Categorical Data

Module 5 Practice problem and Homework answers

Basic Business Statistics, 10/e

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

Module 4 Practice problem and Homework answers

Applied Statistics for the Behavioral Sciences

STATISTIKA INDUSTRI 2 TIN 4004

Inferences for Proportions and Count Data

Lecture 22. December 19, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.

Independent Samples ANOVA

Lecture 9. Selected material from: Ch. 12 The analysis of categorical data and goodness of fit tests

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics

Rama Nada. -Ensherah Mokheemer. 1 P a g e

Sociology 6Z03 Review II

Transcription:

1 HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC

7 steps of Hypothesis Testing 1. State the hypotheses 2. Identify level of significant 3. Identify the critical values 4. Calculate test statistics 5. Compare critical values with test statistics 6. Conclusion 7. Conclusion in words

3 Parametric and Nonparametric Tests Two non-parametric hypothesis tests using the chisquare statistic: the chi-square test for goodness of fit the chi-square test for independence

Parametric and Nonparametric Tests (cont.) The term "non-parametric" refers to the fact that the chi-square tests do not require assumptions about population parameters nor do they test hypotheses about population parameters. For chi-square, the data are frequencies rather than numerical scores. 4

5 The Chi-Square Test for Goodness-of-Fit The chi-square test for goodness-of-fit uses frequency data from a sample to test hypotheses about the shape or proportions of a population. Each individual in the sample is classified into one category on the scale of measurement. The data, called observed frequencies, simply count how many individuals from the sample are in each category.

The Chi-Square Test for Goodness-of-Fit (cont.) The null hypothesis specifies the proportion of the population that should be in each category. The proportions from the null hypothesis are used to compute expected frequencies that describe how the sample would appear if it were in perfect agreement with the null hypothesis. 6

Example 1 Test the hypothesis that eye colors spread evenly for each type at 1% level of significant

Step 1 State Hypothesis H 0 : The eyes color spread evenly for each type H 1 : The eyes color not spread evenly for each type

Step 2 & 3 : Level of significant and Critical Values Step 2 Alpha = 0.01 Step 3 df = n 1 Df = 4-1=3 χ 2 = 4.541

Step 4:Calculate the Test Statistic Formula to find χ 2 test statistics ( f e f ) 2 o f e 2 Since, total of observation is equal to 40 with 4 category. f e = 40 4 = 10

Step 4:Calculate the Test Statistic A computational table helps organize the computations. f o f e f o - f e (f o - f e ) 2 (f o - f e ) 2 /f e 12 10 21 10 3 10 4 10 40 40

Step 4:Calculate the Test Statistic Subtract each f e from each f o. The total of this column must be zero. f o f e f o - f e (f o - f e ) 2 (f o - f e ) 2 /f e 12 10 2 21 10 11 3 10-7 4 10-6 40 40 0

Step 4:Calculate the Test Statistic Square each of these values f o f e f o - f e (f o - f e ) 2 (f o - f e ) 2 /f e 12 10 2 4 21 10 11 121 3 10-7 49 4 10-6 36 40 40 0

Step 4:Calculate the Test Statistic Divide each of the squared values by the f e for that cell. The sum of this column is chi square f o f e f o - f e (f o - f e ) 2 (f o - f e ) 2 /f e 12 10 2 4 0.4 21 10 11 121 12.1 3 10-7 49 4.9 4 10-6 36 3.6 40 40 0 χ 2 = 21

Step 5 & 6: Compare and Conclude Critical value Reject H 0,when test statistics value > critical value Fail to Reject H 0,when test statistics value < critical value

Step 5 & 6: Compare and Conclude χ 2 (critical value) = 4.541 χ 2 (test statistics) = 21 The test statistic is in the Critical Region. Reject the H 0. There is enough evidence to conclude that the eyes color are differ for each types.

17 The Chi-Square Test for Independence Can be used and interpreted in two different ways: 1. Testing hypotheses about the relationship between two variables in a population, or 2. Testing hypotheses about differences between proportions for two or more populations.

The Chi-Square Test for Independence (cont.) 18 Testing hypotheses about the relationship between two variables in a population The null hypothesis : There is no relationship between the two variables; that is, the two variables are independent.

The Chi-Square Test for Independence (cont.) Testing hypotheses about differences between proportions for two or more populations 19 The null hypothesis: The proportions (the distribution across categories) are the same for all of the populations

The Chi-Square Test for Independence (cont.) The data (observed frequencies), show how many individuals are in each cell of the matrix. Both chi-square tests use the same test statistic.

The relationship of homicide rate and gun sales Low homicide High homicide Totals Low gun sales High gun sales 8 5 13 4 8 12 Totals 12 13 25

Tables Notice the following about these tables 1. Table must have a title 2. Independent variable must go into columns 3. Subtotals are called marginals. 4. N is reported at the intersection of row and column marginals.

Tables Title Rows Column 1 Column 2 Row 1 cell a cell b Row Marginal 1 Row 2 cell c cell d Row Marginal 2 Column Marginal 1 Column Marginal 2 N

Example 2 Test association the homicide rate and volume of gun sales related for a sample of 25 cities, with 5% level of significant? Low High High 8 5 13 Low 4 8 12 12 13 25

Step 1 State Hypothesis H 0 : The variables are independent Another way to state the H 0 : There is no relationship between the two variables H 1 : The variables are dependent Another way to state the H 1 : There is a relationship between the two variables

Step 2 & 3 : Level of significant and Critical Values Step 2 Alpha = 0.05 Step 3 where df = (r 1)(c 1) r = the number of rows c = the number of columns Df = (2-1)(2-1)=1 χ 2 = 3.841

Step 4:Calculate the Test Statistic Formula to find χ 2 test statistics ( f e f ) 2 o f e 2

Step 4:Calculate the Test Statistic f e = (column marginal)(row marginal) N

Step 4:Calculate the Test Statistic Expected frequencies: Low High High 6.24 6.76 13 Low 5.76 6.24 12 12 13 25

Step 4:Calculate the Test Statistic A computational table helps organize the computations. f o f e f o - f e (f o - f e ) 2 (f o - f e ) 2 /f e 8 6.24 5 6.76 4 5.76 8 6.24 25 25

Step 4:Calculate the Test Statistic Subtract each f e from each f o. The total of this column must be zero. f o f e f o - f e (f o - f e ) 2 (f o - f e ) 2 /f e 8 6.24 1.76 5 6.76-1.76 4 5.76-1.76 8 6.24 1.76 25 25 0

Step 4:Calculate the Test Statistic Square each of these values f o f e f o - f e (f o - f e ) 2 (f o - f e ) 2 /f e 8 6.24 1.76 3.10 5 6.76-1.76 3.10 4 5.76-1.76 3.10 8 6.24 1.76 3.10 25 25 0

Step 4:Calculate the Test Statistic Divide each of the squared values by the f e for that cell. The sum of this column is chi square f o f e f o - f e (f o - f e ) 2 (f o - f e ) 2 /f e 8 6.24 1.76 3.10.50 5 6.76-1.76 3.10.46 4 5.76-1.76 3.10.54 8 6.24 1.76 3.10.50 25 25 0 χ 2 = 2.00

Step 5 & 6: Compare and Conclude Critical value Reject H 0,when test statistics value > critical value Fail to Reject H 0,when test statistics value < critical value

Step 5 & 6: Compare and Conclude χ 2 (critical value) = 3.841 χ 2 (test statistics) = 2.00 The test statistic is not in the Critical Region. Fail to reject the H 0. There is no significant relationship between homicide rate and gun sales.