Chi Square Analysis M&M Statistics. Name Period Date

Similar documents
Lecture 41 Sections Wed, Nov 12, 2008

Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017

Chapter 10. Chapter 10. Multinomial Experiments and. Multinomial Experiments and Contingency Tables. Contingency Tables.

Chapter 26: Comparing Counts (Chi Square)

11-2 Multinomial Experiment

Contingency Tables. Safety equipment in use Fatal Non-fatal Total. None 1, , ,128 Seat belt , ,878

HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC

Hypothesis testing. Data to decisions

Lecture 41 Sections Mon, Apr 7, 2008

:the actual population proportion are equal to the hypothesized sample proportions 2. H a

Contingency Tables. Contingency tables are used when we want to looking at two (or more) factors. Each factor might have two more or levels.

Example. χ 2 = Continued on the next page. All cells

Multiple samples: Modeling and ANOVA

Ling 289 Contingency Table Statistics

Bag RED ORANGE GREEN YELLOW PURPLE Candies per Bag

Two-sample Categorical data: Testing

The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions.

POLI 443 Applied Political Research

Hypothesis Tests and Estimation for Population Variances. Copyright 2014 Pearson Education, Inc.

One sided tests. An example of a two sided alternative is what we ve been using for our two sample tests:

Chi-Squared Tests. Semester 1. Chi-Squared Tests

Ch. 11 Inference for Distributions of Categorical Data

Inferential statistics

9-7: THE POWER OF A TEST

10.2: The Chi Square Test for Goodness of Fit

Categorical Data Analysis. The data are often just counts of how many things each category has.

The Chi-Square Distributions

THE SAMPLING DISTRIBUTION OF THE MEAN

Testing Research and Statistical Hypotheses

Inference for Categorical Data. Chi-Square Tests for Goodness of Fit and Independence

Average weight of Eisenhower dollar: 23 grams

Chapte The McGraw-Hill Companies, Inc. All rights reserved.

Chapter 10. Prof. Tesler. Math 186 Winter χ 2 tests for goodness of fit and independence

General Certificate of Education Advanced Level Examination June 2013

Psych 230. Psychological Measurement and Statistics

Chapter 10: Chi-Square and F Distributions

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV

Regression, part II. I. What does it all mean? A) Notice that so far all we ve done is math.

10: Crosstabs & Independent Proportions

We know from STAT.1030 that the relevant test statistic for equality of proportions is:

Part 1.) We know that the probability of any specific x only given p ij = p i p j is just multinomial(n, p) where p k1 k 2

Inference for Proportions, Variance and Standard Deviation

The Purpose of Hypothesis Testing

Bio 183 Statistics in Research. B. Cleaning up your data: getting rid of problems

Background to Statistics

LAB 2. HYPOTHESIS TESTING IN THE BIOLOGICAL SCIENCES- Part 2

Lab #12: Exam 3 Review Key

General Certificate of Education Advanced Level Examination June 2014

Econ 325: Introduction to Empirical Economics

DESAIN EKSPERIMEN Analysis of Variances (ANOVA) Semester Genap 2017/2018 Jurusan Teknik Industri Universitas Brawijaya

Section VII. Chi-square test for comparing proportions and frequencies. F test for means

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

CONTINUOUS RANDOM VARIABLES

Practice Questions: Statistics W1111, Fall Solutions

PSY 305. Module 3. Page Title. Introduction to Hypothesis Testing Z-tests. Five steps in hypothesis testing

Mathematical Notation Math Introduction to Applied Statistics

Pre-Calculus Summer Homework

Lecture 5: ANOVA and Correlation

This gives us an upper and lower bound that capture our population mean.

Chapter 12: Inference about One Population

Discrete distribution. Fitting probability models to frequency data. Hypotheses for! 2 test. ! 2 Goodness-of-fit test

Statistical methods for comparing multiple groups. Lecture 7: ANOVA. ANOVA: Definition. ANOVA: Concepts

Inter-Rater Agreement

Summary of Chapter 7 (Sections ) and Chapter 8 (Section 8.1)

Lecture 11: Two Way Analysis of Variance

Chapter 6. Logistic Regression. 6.1 A linear model for the log odds

Poisson regression: Further topics

Classroom Activity 7 Math 113 Name : 10 pts Intro to Applied Stats

The Multinomial Model

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

Probability and Statistics

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

Lecture 28 Chi-Square Analysis

CHAPTER 4: SCIENCE SEMINAR

Statistical Inference. Why Use Statistical Inference. Point Estimates. Point Estimates. Greg C Elvers

The t-distribution. Patrick Breheny. October 13. z tests The χ 2 -distribution The t-distribution Summary

This is a multiple choice and short answer practice exam. It does not count towards your grade. You may use the tables in your book.

Review of One-way Tables and SAS

Module 03 Lecture 14 Inferential Statistics ANOVA and TOI

Probability Methods in Civil Engineering Prof. Dr. Rajib Maity Department of Civil Engineering Indian Institution of Technology, Kharagpur

0 0'0 2S ~~ Employment category

Goodness of Fit Goodness of fit - 2 classes

16.400/453J Human Factors Engineering. Design of Experiments II

Hypothesis testing I. - In particular, we are talking about statistical hypotheses. [get everyone s finger length!] n =

This activity will help students to differentiate between living and non-living and to identify characteristics of living things.

CHAPTER 5 STATISTICAL INFERENCE. 1.0 Hypothesis Testing. 2.0 Decision Errors. 3.0 How a Hypothesis is Tested. 4.0 Test for Goodness of Fit

How do we compare the relative performance among competing models?

Mathematician Preemptive Strike 2 Due on Day 1 of Chapter 2

Discrete Multivariate Statistics

Data Analysis: Agonistic Display in Betta splendens I. Betta splendens Research: Parametric or Non-parametric Data?

16.3 One-Way ANOVA: The Procedure

February 5, 2018 START HERE. measurement scale. Means. number of means. independent measures t-test Ch dependent measures t-test Ch 16.

LOOKING FOR RELATIONSHIPS

Harvard University. Rigorous Research in Engineering Education

Institute of Actuaries of India

STAT Chapter 13: Categorical Data. Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure).

Introduction to Statistics and Data Analysis

Lecture 9. Selected material from: Ch. 12 The analysis of categorical data and goodness of fit tests

STA1000F Summary. Mitch Myburgh MYBMIT001 May 28, Work Unit 1: Introducing Probability

20 Hypothesis Testing, Part I

Transcription:

Chi Square Analysis M&M Statistics Name Period Date

Have you ever wondered why the package of M&Ms you just bought never seems to have enough of your favorite color? Or, why is it that you always seem to get the package of mostly brown M&Ms? What s going on at the Mars Company? Is the number of the different colors of M&Ms in a package really different from one package to the next, or does the Mars Company do something to insure that each package gets the correct number of each color of M&M? One way that we could determine if the Mars Co. is true to its word is to sample a package of M&Ms and do a type of statistical test known as a goodness of fit test. This type of statistical test allow us to determine if any differences between our observed measurements (counts of colors from our M&M sample) and our expected (what the Mars Co. claims) are simply due to chance or some other reason (i.e. the Mars company s sorters aren t doing a very good job of putting the correct number of M&M s in each package). The goodness of fit test we will be using is called a Chi Square (X 2 ) Analysis. We will be calculating a statistical value and using a table to determine the probability that any difference between observed data and expected data is due to chance alone. We begin by stating the null hypothesis. A null hypothesis is the prediction that something is not present, that a treatment will have no effect, or that there is no difference between treatment and control. Another way of saying this is the hypothesis that an observed pattern of data and an expected pattern are effectively the same, differing only by chance, not because they are truly different. What is our null hypothesis for this experiment? To test this hypothesis we will need to calculate the X 2 statistic, which is calculated in the following way: X 2 = Σ(sum of) (O-E) 2 E where O is the observed (actual count) and E is the expected number for each color category. The main thing to note about this formula is that, when all else is equal, the value of X 2 increases as the difference between the observed and expected values increase. On to it!

1. Lay out a large sheet of paper you ll be sorting M&Ms on this. 2. Open up a bag of M&Ms. 3. DO NOT EAT ANY OF THE M&M S (for now!) 4. Separate the M&M s into color categories and count the number of each color of M&M you have. 5. Record your counts in Data Chart 1. 6. Determine the Chi square value for your data. What does the χ 2 value mean? If the observed numbers (O) were exactly equal to the theoretically expected numbers (E), the fit would be perfect and χ 2 would be zero. Thus, a small value for χ 2 indicates that the observed and expected numbers are in close agreement. A large value indicates marked deviation from the expected values. Because chance deviations from the theoretical values are expected to occur, the question to be answered is "Are the observed deviations within the limits expected by chance?" Generally, statisticians have agreed on the arbitrary limits of odds of 1 chance in 20 (probability =.05) for the drawing the line between acceptance and rejection of the data as a satisfactory fit to the expected ratio. A table of χ 2 values is given in the following Table. These values are the standards to which the calculated χ 2 values are compared. In order to use the table, the degrees of freedom (df) must be calculated. The degrees of freedom is one less than the number of categories used in the χ 2 calculation. In our example, the number of categories (colored m&m s) was six. Therefore, the degrees of freedom is: 6-1 or 5. The value of χ 2 at df = 5 and.05 probability is 11.07. This is the maximum value of χ 2 that we can accept and still attribute that the deviations observed are due to chance alone.

The comparison of the calculated χ 2 value from the data and the χ 2 table value can be summarized as follows: Accept the fit if: calculated χ 2 value is < χ 2 table value Reject the fit if: calculated χ 2 value is > χ 2 table value Based on your individual sample, should you accept or reject the null hypothesis? Why? If you rejected your null hypothesis, what might be some explanations for your outcome? Class Data Now that you completed this chi-square test for your data, let s do it for the entire class, as if we had one huge bag of M&Ms. Using the information reported on the board, complete Data Chart 2.

Based on the class data, should you accept or reject the null hypothesis? Why? If you rejected the null hypothesis based on the class data, what might be some of the explanations for your outcome? If you accepted the null hypothesis, how do you explain it particularly if you rejected the null based on individual group data? What is the purpose of collecting data from the entire group?