Hypothesis Testing: Chi-Square Test 1

Similar documents
Discrete Distributions: Poisson Distribution 1

Contingency Tables. Safety equipment in use Fatal Non-fatal Total. None 1, , ,128 Seat belt , ,878

Contingency Tables. Contingency tables are used when we want to looking at two (or more) factors. Each factor might have two more or levels.

What s for today. More on Binomial distribution Poisson distribution. c Mikyoung Jun (Texas A&M) stat211 lecture 7 February 8, / 16

Chapter 26: Comparing Counts (Chi Square)

Chapter 10. Chapter 10. Multinomial Experiments and. Multinomial Experiments and Contingency Tables. Contingency Tables.

Categorical Data Analysis. The data are often just counts of how many things each category has.

Example. χ 2 = Continued on the next page. All cells

4. Suppose that we roll two die and let X be equal to the maximum of the two rolls. Find P (X {1, 3, 5}) and draw the PMF for X.

Basic Concepts of Probability

Two sided, two sample t-tests. a) IQ = 100 b) Average height for men = c) Average number of white blood cells per cubic millimeter is 7,000.

Statistics 3858 : Contingency Tables

11-2 Multinomial Experiment

Chapter 10: Chi-Square and F Distributions

Chapter 10: Information Retrieval. See corresponding chapter in Manning&Schütze

Statistics Handbook. All statistical tables were computed by the author.

Section 9.5. Testing the Difference Between Two Variances. Bluman, Chapter 9 1

13.1 Categorical Data and the Multinomial Experiment

Statistics for Economists. Lectures 3 & 4

Inference for Categorical Data. Chi-Square Tests for Goodness of Fit and Independence

Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017

10: Crosstabs & Independent Proportions

STAT Chapter 13: Categorical Data. Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure).

Chi-Squared Tests. Semester 1. Chi-Squared Tests

Probability. Chapter 1 Probability. A Simple Example. Sample Space and Probability. Sample Space and Event. Sample Space (Two Dice) Probability

The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions.

16.400/453J Human Factors Engineering. Design of Experiments II

Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution

4 Hypothesis testing. 4.1 Types of hypothesis and types of error 4 HYPOTHESIS TESTING 49

CS 361: Probability & Statistics

What is Probability? Probability. Sample Spaces and Events. Simple Event

THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE

10.2: The Chi Square Test for Goodness of Fit

Lecture 5: ANOVA and Correlation

Lecture 9. Selected material from: Ch. 12 The analysis of categorical data and goodness of fit tests

STT 315 Problem Set #3

14.2 THREE IMPORTANT DISCRETE PROBABILITY MODELS

STEP Support Programme. Statistics STEP Questions

8/4/2009. Describing Data with Graphs

Basic Concepts of Probability

Intermediate Math Circles November 8, 2017 Probability II

37.3. The Poisson Distribution. Introduction. Prerequisites. Learning Outcomes

Probability Distributions

# of 6s # of times Test the null hypthesis that the dice are fair at α =.01 significance

Testing Independence

Chapter 10: STATISTICAL INFERENCE FOR TWO SAMPLES. Part 1: Hypothesis tests on a µ 1 µ 2 for independent groups

Chapter 10. Prof. Tesler. Math 186 Winter χ 2 tests for goodness of fit and independence

One-Way Tables and Goodness of Fit

Lecture 10: Generalized likelihood ratio test

Chapte The McGraw-Hill Companies, Inc. All rights reserved.

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI

Chi-squared (χ 2 ) (1.10.5) and F-tests (9.5.2) for the variance of a normal distribution ( )

Ling 289 Contingency Table Statistics

40.2. Interval Estimation for the Variance. Introduction. Prerequisites. Learning Outcomes

Chi Square Analysis M&M Statistics. Name Period Date

Frequency Distribution Cross-Tabulation

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur

Confidence Intervals, Testing and ANOVA Summary

Chapter 7. Inference for Distributions. Introduction to the Practice of STATISTICS SEVENTH. Moore / McCabe / Craig. Lecture Presentation Slides

Confidence Intervals 1

Inference for Proportions, Variance and Standard Deviation

CBA4 is live in practice mode this week exam mode from Saturday!

Chapter 4: An Introduction to Probability and Statistics

11 CHI-SQUARED Introduction. Objectives. How random are your numbers? After studying this chapter you should

Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /8/2016 1/38

Chi-square (χ 2 ) Tests

STP 226 EXAMPLE EXAM #3 INSTRUCTOR:

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

Goodness of Fit Tests

2.3 Analysis of Categorical Data

Lecture 41 Sections Wed, Nov 12, 2008

Math 50: Final. 1. [13 points] It was found that 35 out of 300 famous people have the star sign Sagittarius.

green green green/green green green yellow green/yellow green yellow green yellow/green green yellow yellow yellow/yellow yellow

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

2.4. Conditional Probability

HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC

Psych 230. Psychological Measurement and Statistics

While you wait: Enter the following in your calculator. Find the mean and sample variation of each group. Bluman, Chapter 12 1

EDEXCEL S2 PAPERS MARK SCHEMES AVAILABLE AT:

Probability (Devore Chapter Two)

12 Chi-squared (χ 2 ) Tests for Goodness-of-fit and Independence

Lecture 41 Sections Mon, Apr 7, 2008

SBAOD Statistical Methods & their Applications - II. Unit : I - V

One- and Two-Sample Tests of Hypotheses

Lecture 28 Chi-Square Analysis

Purposes of Data Analysis. Variables and Samples. Parameters and Statistics. Part 1: Probability Distributions

Exam details. Final Review Session. Things to Review

Section VII. Chi-square test for comparing proportions and frequencies. F test for means

Nominal Data. Parametric Statistics. Nonparametric Statistics. Parametric vs Nonparametric Tests. Greg C Elvers

15: CHI SQUARED TESTS

Engineering Mathematics IV(15MAT41) Module-V : SAMPLING THEORY and STOCHASTIC PROCESS

Relate Attributes and Counts

1) Answer the following questions with one or two short sentences.

3 PROBABILITY TOPICS

green green green/green green green yellow green/yellow green yellow green yellow/green green yellow yellow yellow/yellow yellow

Sociology 6Z03 Review II

PROBABILITY.

Transcription:

Hypothesis Testing: Chi-Square Test 1 November 9, 2017 1 HMS, 2017, v1.0

Chapter References Diez: Chapter 6.3 Navidi, Chapter 6.10 Chapter References 2

Chi-square Distributions Let X 1, X 2,... X n be independent normally distributed random variables. Form the sum of their squares: Q = Q is then distributed according to the χ 2 distribution. The chi-square distribution has one parameter, the degrees of freedom, k k i=1 f(x; k) = x(k/2 1) e x/2 2 k/2 Γ ( ) x > 0 k 2 Γ is the gamma function which generalizes the factorial to non-integer values. For every value for the degrees of freedom there is a unique curve described by f(x; k). X 2 i Mean = µ = n variance σ 2 = 2n. Hypothesis Testing 3

Chi-square Distributions Hypothesis Testing 4

Chi-square Distributions: Its Purpose The purpose of χ 2 is to compare a set of data against a specific distribution. For example, the likelihood of throwing a particular number on a die is 1/6 What if we had a die that was suspect? Throw a die 600 times and count the number of 1s, 2s, etc Category Observed 1 115 2 97 3 91 4 101 5 110 6 186 Hypothesis Testing 5

Chi-square Distributions: Its Purpose What is the expected number of 1s, 2s etc? Hypothesis Testing 6

Chi-square Distributions: Its Purpose What is the expected number of 1s, 2s etc? Category Observed Expected 1 115 100 2 97 100 3 91 100 4 101 100 5 110 100 6 186 100 This is an example where we have an expected outcome and the corresponding observed outcome. We d like to know if the die is loaded or not? Hypothesis Testing 7

Chi-square Distributions: Its Purpose How might we compare the observed and expected? If they are the same then the difference will be zero so we could compute: O i E i where O i is the observed and E i is the corresponding expected. We could sum over the square terms to eliminate negative terms to give: n (O i E i ) 2 i=1 Hypothesis Testing 8

Chi-square Distributions: Its Purpose The final thing to do is normalize by dividing by E i to yield: k (O i E i ) 2 i=1 That is we reduce the relative importance of large expected values to prevent them from dominating the sum. Note that the smaller the number the better the fit. We state that this measure is distributed according to a chi-square distribution. χ 2 = with degrees of freedom k 1. E i k (O i E i ) 2 i=1 Hypothesis Testing 9 E i

Chi-square Distributions: Its Purpose χ 2 = k (O i E i ) 2 i=1 Think of it this way, take the square root on both sides: E i x i = O i E i Ei If x i is a random variable that is normally distributed then x 2 i distributed. will be χ2 Hypothesis Testing 10

Chi-square Distributions: Its Purpose Given H o that there is no difference between the expected and observed, the larger the value of χ 2 the stronger the evidence against H o. χ 2 = k (O i E i ) 2 i=1 E i Hypothesis Testing 11

Chi-square Distributions: Tables As expected there are tables that describe the χ 2 distribution. Hypothesis Testing 12

Chi-square: Example Category Observed Expected 1 115 100 2 97 100 3 91 100 4 101 100 5 110 100 6 186 100 χ 2 (115 100)2 = +... = 6.12 100 We need to determine the p-value at say a critical value of 0.05 (95%). df = 6 1 = 5. Look up table with df = 5 and column χ 2 0.05 yields 11.07. This is bigger than 6.12 which means that 6.12 lies inside the H o areas. We conclude that the pattern of die throws is not unusual and therefore the die is probably fair. Hypothesis Testing 13

Chi-square: Limitation Only use the χ 2 test when ever all the expected values are greater than or equal to 5. In the case of the die they were. Hypothesis Testing 14

Chi-square: A Classic Example Mendel s Experiments A F 1 cross (heterozygous cross) yields 355 yellow and 123 green peas. The expected ratio is 3:1. Total number of peas = 478 H o : There is no difference between the results of the experiment and the expected ratio. Expected yellow peas: E 1 = 478 3/4 = 385.5 E 2 = 478 1/4 = 119.5 Observed Expected Yellow 355 385.5 Green 123 119.5 Hypothesis Testing 15

Chi-square: A Classic Example χ 2 (355 358.5)2 = + 385.5 Choose α = 0.05 (One tailed test). (123 119.5)2 119.5 = 0.137 df = 2 1 = 1 Hypothesis Testing 16

Chi-square: A Classic Example Therefore we do no reject H o. Hypothesis Testing 17

Class Exercise M&M come in six colors, Red, Orange, Yellow, Green, Blue, and Brown 600 M&Ms were obtained by purchasing a bunch of M&M bags and counting the individual colors. The following counts for each of the colors was obtained: Color Observed Expected Red 115 Orange 95 Yellow 120 Green 105 Blue 90 Brown 118 Sum 600 Determine whether bags of M&Ms are filled with equal amounts of each color. Hypothesis Testing 18

Class Exercise H o : evenly distributed. p i = 1/6. χ 2 = (115 100)2 Color Observed Expected Red 115 100 Orange 95 100 Yellow 120 100 Green 105 100 Blue 90 100 Brown 118 100 Sum 600 100 + (95 100)2 100 + (120 100)2 100 + (105 100)2 100 + (90 100)2 100 + (118 100) 2 100 = 10.99 Degrees of freedom = 6 1 = 5 Hypothesis Testing 19

Chi-square Distributions: Tables Look up 10.99 in the body of the χ 2 table. Note that it lies between 10% and 5%, that is the p-value is inside the H o region. Therefore we do not reject H o. The actual p-value = 0.052 Hypothesis Testing 20

Chi-square Distributions: Tables Computing the exact p-value using Python: from scipy import stats pvalue = 1 - stats.chi2.cdf (10.99, 5) print pvalue 0.051578614910744669 Hypothesis Testing 21

Restrictions All observations must be independent Expected counts should not be less than 5, eg Blue = 2 although if the number of categories is large some (< 20%) can be less than 5. It the last entries in the table are less than 5 then you should pool them (see next example). All counts must be > 0 Data should be frequency data, variables should be categorical. eg Heights are not categorical but you can use ranges: Range Height short (< 5) 10 Middle 5 6 50 Tall > 6 20 Hypothesis Testing 22

A Harder Problem A factory makes prosthetic limbs but it has been found that the manufacturing process produces defects. The factory owners want to know if the defects are purely random or whether there is some systematic non-random process such as a defective machine, or poor workmanship that is causing the defects. A random sample of n = 60 has been collected for inspection. The following data is the result: Number of Defects Observed Frequency 0 32 1 15 2 9 3 4 Show that the distribution of defects is purely random. Hypothesis Testing 23

Poisson Distribution The number of cars that pass under a road bridge during a given period of time. The number of spelling mistakes while typing a single page. The number of phone calls at a call center per minute. The number of times a web server is accessed per minute. The number of animals killed per unit length of road. Number of mutations per 100,000 base-pairs on DNA after a certain amount of radiation. The number of pine trees per unit area of mixed forest. The number of stars in a given volume of space. The number of soldiers killed by horse-kicks each year in each corps in the Prussian cavalry. The number of light bulbs that burn out in a certain amount of time. The number of viruses that can infect a cell in cell culture. The number of inventions invented over a span of time in an inventor s career. Number of particles that scatter off of a target in a nuclear experiment. The number of hurricanes in a year that originate in the Atlantic ocean. Number of defects per manufactured item. Hypothesis Testing 24

A Harder Problem If the number of defects follows a Poisson distribution then the events should be random. Our H o is therefore that the number of defects follows a Poisson distribution (95%) There is a single parameter for a Poisson distribution, the mean rate. The mean rate of defects per prosthetic limb is: λ = (32 0 + 15 1 + 9 2 + 4 3)/60 = 0.75 We can use the mean to compute the expected frequency of defects. Hypothesis Testing 25

A Harder Problem f(x) = λk e λ k! Num Defects Probability Expected defects (p 60) 0 (0.75) 0 e 0.75 /0! = 0.472 28.32 1 0.354 21.24 2 0.133 7.98 3 0.041 2.46 Hypothesis Testing 26

A Harder Problem Expected Defects Observed Expected defects 0 32 28.32 1 15 21.24 2 9 7.98 3 4 2.46 Pool the last two rows (< 5) Expected Defects Observed Expected defects 0 32 28.32 1 15 21.24 2 13 10.44 df = 3 1 = 2 α = 0.05 Hypothesis Testing 27

A Harder Problem Expected Defects Observed Expected defects 0 32 28.32 1 15 21.24 2 13 10.44 χ 2 = (32 28.32)2 28.32 + (15 21.24)2 24.24 + (13 10.44)2 10.44 = 2.94 Hypothesis Testing 28

A Harder Problem Look up 2.94 in the body of the χ 2 table. Note that it lies between 0.9 and 0.1, that is the p-value is well within the H o region. Therefore we do not reject H o. There is no evidence to suggest that the defects are non-random. The actual p-value = 0.229 The 5% cutoff point is 5.99 (compare to 2.94) Hypothesis Testing 29

Chi-square Distributions: Contingency Tables What if we had various types of categories and not just one? The text book cites the example of the manufacture of steel pins made by different machines. We can describe the pins either as too thin, ok or too thick. Too Thin OK Too Thick Total Machine 1 10 102 8 120 Machine 2 34 161 5 200 Machine 3 12 79 9 100 Machine 4 10 60 10 80 Total 66 402 32 500 These are called contingency or two-way tables. Hypothesis Testing 30

Chi-square Distributions: Contingency Tables Not a Smoker Smoker Total Male 96 39 135 Female 87 18 105 Total 183 57 240 For the columns you could have examples such as left or right-handed, or blond, brunette, red-head, Mouse colored hair, or six different genetic markers, or tall, medium or short height etc. Hypothesis Testing 31

Chi-square Distributions: Contingency Tables Cancer Fatal Heart Disease Non-fatal Heart Disease Healthy Total Healthy Diet 15 24 25 239 303 Mediterranean 7 14 8 273 302 Asian 4 3 16 260 283 Burger Diet 21 39 45 190 295 Total 47 80 94 962 1183 Check Diez for further examples. Hypothesis Testing 32

Chi-square Distributions: Contingency Tables In general: Column 1 Column 2... Column J Total Row 1 O 11 O 12... O 1J O 1 Row 2 O 21 O 22... O 2J O 2..... Total T 1 T 2... T J T T Contingency tables can be used to test if there is a relationship between the rows and columns. For example, does diet affect the likelihood of getting heart disease or cancer, or is diet immaterial? Hypothesis Testing 33

Contingency Tables: Example H o : The probability that the outcome of a trial falls into column j is the same for each row i. i.e it does not matter what row we pick, the probability of the outcomes in the rows are the same (eg diet does not matter). How do we compute these probabilities assuming there is not effect? These are the expected outcomes. Hypothesis Testing 34

Calculate Expected (E) Numbers Chocolate Strawberry Vanilla Total Female 70 35 60 165 Male 30 32 47 109 Total 100 67 107 274 100 or 36.5% liked chocolate 274 If there is no difference between males and females how many females would you expect to like chocolate? There are 165 females in all therefore we would expect 36.5% of 165 females to like chocolate = 60.2 females. By subtraction it must be the case that 100 60.2 makes would like chocolate = 39.8 Hypothesis Testing 35

Calculate Expected (E) Numbers We do the same for the other entries. Expected Numbers: Chocolate Strawberry Vanilla Total Female 60.2 40.4 64.4 165 Male 39.8 26.6 42.6 109 Total 100 67 107 274 If you want the formula then: Expected Count = Row Total Column Total Overall Total We now have a set of observed and expected numbers and we can carry out a chi-square test. We assume that the underlying distribution is normal. Hypothesis Testing 36

Calculate Expected (E) Numbers Degrees of freedom: = (nrows 1) (ncolumns 1) = (2 1)(3 1) = 2 χ 2 = i (O ij E ij ) 2 The χ 2 is summed over every entry in the table (except the totals) For the ice-cream table: j χ 2 = 6.58 The area to the left this represents = 0.037 This is less than 0.05 therefore we reject H o. There is evidence to suggests that males and females have different preferences for ice-cream. Hypothesis Testing 37 E ij

Try this Example Hypertension Low Fat Diet Average Fat Diet High Fat Diet Total Yes 24 33 46 103 No 109 101 87 297 Total 133 134 133 400 Is there a relationship between a fat diet and hypertension (high blood pressure)? Hypothesis Testing 38

F Test for Equality of Variance Test for variance equality. Hypothesis Testing 39

F Test for Equality of Variance What if you want to know if the variance of two independent populations is the same? This will become more important when we study the analysis of variance. Let X 1, X 2,..., X m be a sample from N(µ 1, σ 2 1) and Y 1, Y 2,..., Y n from N(µ 2, σ 2 2). Assume both samples were selected independently. The values for µ 1 and µ 2 are unimportant. Then if the ratio has a F distribution. F = σ2 1 σ 2 2 f(x) = ν1+ν2 Γ( 2 )( ν1 ν 2 ) ν 1 2 x ν 1 2 1 Γ( ν1 2 ν2 ν1x )Γ( 2 )(1 + ν 2 ) ν1+ν 2 2 Hypothesis Testing 40

F Test for Equality of Variance There are two degrees of freedom, one for the numerator (n 1 1) and another for the denominator (n 2 1): F 3,5 means an F distribution with 3 degrees of freedom for the numerator and 5 degrees of freedom for denominator. Hypothesis Testing 41

F Distribution: Table Hypothesis Testing 42

F Distribution: Table Hypothesis Testing 43

F Distribution: Table Hypothesis Testing 44

F Test for Equality of Variance Types of hypotheses that can be tested: H o : σ2 1 σ 2 2 1 H o : σ2 1 σ 2 2 1 H o : σ2 1 σ 2 2 We ll focus on the equality test. = 1 means σ 2 1 = σ 2 2 Hypothesis Testing 45

F Test for Equality of Variance: Example Consider two samples (n 1 = 12 and n 2 = 14) drawn from two normal populations. Assume that the variance for the two samples is: s 2 1 = 5 s 2 2 = 11 Our H o is s 2 1 = s 2 2 and H 1 is s 2 1 s 2 2 Choose a significance level of 0.05. Because we re dealing with an equality test we must use a two-tailed test. Compute the ratio of the variances: F = s2 1 s 2 2 Most tables only give the right-hand tail. Therefore arrange the ratio so that the larger variance is in the numerator. This ensures that the ratio will be > 1 and therefore the F statistics will be located on the right-had side of the distribution. Hypothesis Testing 46

F Test for Equality of Variance: Example Next look up the table. F = s2 1 s 2 = 11 2 5 = 2.2 df num = 14 1 df den = 12 1 Recall the significance level was set to 0.05 but we re doing a two tailed test, but tables only give the right hand tail. Therefore the we need to halve the significance level to 0.025. We must therefore use a table with a significance level of 0.025. Hypothesis Testing 47

F Test for Equality of Variance: Example F = 2.2 df num = 13 df den = 11 Hypothesis Testing 48

F Test for Equality of Variance: Example The critical value is 3.39 at 2.5% However the F value of 2.2 is smaller than 3.39, therefore we are within the H o region. We conclude that at the 5% level there is sufficient evidence to cast doubt on the hypothesis that the two variances are equal. Hypothesis Testing 49

Computing F distribution area using Python Computing the exact p-value using Python: from scipy import stats pvalue = stats.f.sf(2.2, dfn=11, dfd=13) print pvalue 0.089024550297109858 0.089 is greater than 0.025 therefore we do not reject H o. Hypothesis Testing 50