:the actual population proportion are equal to the hypothesized sample proportions 2. H a

Similar documents
10.2: The Chi Square Test for Goodness of Fit

The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions.

Ch. 11 Inference for Distributions of Categorical Data

Chapter 10. Chapter 10. Multinomial Experiments and. Multinomial Experiments and Contingency Tables. Contingency Tables.

Inferential statistics

11-2 Multinomial Experiment

Chi-Squared Tests. Semester 1. Chi-Squared Tests

Study Ch. 13.1, # 1 4 all Study Ch. 13.2, # 9 15, 25, 27, 31 [# 11 17, ~27, 29, ~33]

Chi Square Analysis M&M Statistics. Name Period Date

Chapter 10: Chi-Square and F Distributions

Lecture 41 Sections Wed, Nov 12, 2008

CHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Chapter 26: Comparing Counts (Chi Square)

χ test statistics of 2.5? χ we see that: χ indicate agreement between the two sets of frequencies.

Lecture 41 Sections Mon, Apr 7, 2008

CIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8

Department of Mathematics & Statistics STAT 2593 Final Examination 17 April, 2000

HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC

Chapter 12: Inference about One Population

9.5 t test: one μ, σ unknown

CHAPTER 13: F PROBABILITY DISTRIBUTION

Chi-square (χ 2 ) Tests

We know from STAT.1030 that the relevant test statistic for equality of proportions is:

Chi-square (χ 2 ) Tests

CHAPTER 13: F PROBABILITY DISTRIBUTION

hypotheses. P-value Test for a 2 Sample z-test (Large Independent Samples) n > 30 P-value Test for a 2 Sample t-test (Small Samples) n < 30 Identify α

Chapters 9 and 10. Review for Exam. Chapter 9. Correlation and Regression. Overview. Paired Data

Example. χ 2 = Continued on the next page. All cells

Module 10: Analysis of Categorical Data Statistics (OA3102)

This gives us an upper and lower bound that capture our population mean.

Chi-Square Analyses Stat 251

Sleep data, two drugs Ch13.xls

Sampling Distributions: Central Limit Theorem

Chapter 3 Multiple Regression Complete Example

11 CHI-SQUARED Introduction. Objectives. How random are your numbers? After studying this chapter you should

Tables Table A Table B Table C Table D Table E 675

STAT Chapter 13: Categorical Data. Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure).

Goodness of Fit Tests

Lecture 9. Selected material from: Ch. 12 The analysis of categorical data and goodness of fit tests

POLI 443 Applied Political Research

STP 226 ELEMENTARY STATISTICS NOTES

Chapte The McGraw-Hill Companies, Inc. All rights reserved.

Lecture 45 Sections Wed, Nov 19, 2008

15: CHI SQUARED TESTS

Hypothesis Testing. Week 04. Presented by : W. Rofianto

10.2 Hypothesis Testing with Two-Way Tables

Chi-Square. Heibatollah Baghi, and Mastee Badii

ME3620. Theory of Engineering Experimentation. Spring Chapter IV. Decision Making for a Single Sample. Chapter IV

Hypothesis Tests and Estimation for Population Variances. Copyright 2014 Pearson Education, Inc.

Statistics for Managers Using Microsoft Excel

STAT 135 Lab 6 Duality of Hypothesis Testing and Confidence Intervals, GLRT, Pearson χ 2 Tests and Q-Q plots. March 8, 2015

5.2 Frequency Tables, Histograms,

+ Specify 1 tail / 2 tail

Lecture 6 Probability

10: Crosstabs & Independent Proportions

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

Institute of Actuaries of India

Statistical inference provides methods for drawing conclusions about a population from sample data.

Lecture 28 Chi-Square Analysis

Test statistic P value Reject/fail to reject. Conclusion:

16.3 One-Way ANOVA: The Procedure

The Chi-Square Distributions

Chapter 15: Nonparametric Statistics Section 15.1: An Overview of Nonparametric Statistics

χ L = χ R =

Econometrics. 4) Statistical inference

INTRODUCTION TO FORECASTING (PART 2) AMAT 167

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

Mean, Median, Mode, and Range

Topic 2 Part 3 [189 marks]

Do students sleep the recommended 8 hours a night on average?

Objectives Simple linear regression. Statistical model for linear regression. Estimating the regression parameters

Types of Information. Topic 2 - Descriptive Statistics. Examples. Sample and Sample Size. Background Reading. Variables classified as STAT 511

Hypothesis testing: Steps

Statistics 3858 : Contingency Tables

STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression

Six Sigma Black Belt Study Guides

This document contains 3 sets of practice problems.

Unit 1 Writing and Evaluating Algebraic Expressions

Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017

Hypothesis testing: Steps

Section 3-3 Measures of Variation

STA 101 Final Review

Dover- Sherborn High School Mathematics Curriculum Probability and Statistics

Review of One-way Tables and SAS

Test 3 Practice Test A. NOTE: Ignore Q10 (not covered)

Degrees of freedom df=1. Limitations OR in SPSS LIM: Knowing σ and µ is unlikely in large

Announcements. Final Review: Units 1-7

Frequency Distribution Cross-Tabulation

STA Module 10 Comparing Two Proportions

INSTITUTE OF ACTUARIES OF INDIA

Unit5: Inferenceforcategoricaldata. 4. MT2 Review. Sta Fall Duke University, Department of Statistical Science

Inferences for Regression

Quantitative Analysis and Empirical Methods

2.3 Analysis of Categorical Data

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

Comparing Means from Two-Sample

Statistics for Managers Using Microsoft Excel/SPSS Chapter 8 Fundamentals of Hypothesis Testing: One-Sample Tests

Econ 325: Introduction to Empirical Economics

Inference About Means and Proportions with Two Populations. Chapter 10

Transcription:

AP Statistics Chapter 14 Chi- Square Distribution Procedures I. Chi- Square Distribution ( χ 2 ) The chi- square test is used when comparing categorical data or multiple proportions. a. Family of only positive values. b. Each curve begins on 0 and the horizontal axis, increases to a peak, and them approaches the horizontal axis asymptotically from above. c. A chi- square curve is skewed right. As the number of degrees of freedom increase, the curve becomes more and more symmetrical and looks more like a Normal curve. d. The total area under a chi- square curve is equal to 1. II. III. Tests a. Test for Goodness of Fit This is applied when you have one categorical variable with multiple categories from a single population. It is used when trying to determine if one sample is a good fit for the population. It is used to determine whether sample data are consistent with hypothesized distributions. b. Test for Homogeneity It is applied when you have one categorical variable with multiple categories from 2 different popuations and is used to determine whether frequency counts are distribution identically across different populations. c. Test for Independence/Association This is applied when you have two categorical variables with multiple categories from a single population. It is used to determine whether there is a significant association between the variables, i.e., dependent or independent? Conditions a. S - Random (Representative) Sample - simple random sample(s) or a representative sample(s). b. I - Independence - Individual observations are independent. When sampling without replacement, check that the population is at least 10 times as large as the sample (the 10% condition). c. C Counts (Large Sample) - You may use this test with critical values from the chi- square distribution when all individual expected counts are at least 1 and no more than 20% of the expect counts are less than 5. IV. Test for Goodness of Fit (comparing one sample distribution to a population distribution) - is used when comparing a sample distribution to a population distribution. You want to determine if your sample (observed counts) is a good fit to the population distribution (expected counts). Does your sample differ enough from the population distribution to determine if the claimed population distribution is in fact true? A Hypotheses - Test for Goodness of Fit 1. H o :the actual population proportion are equal to the hypothesized sample proportions 2. H a :at least one of the actual population proportions differ from their hypothesized sample proportions B. Conditions 1. S I C (see above)

C. Calculations 1. Observed Count (O) is the number per category that is observed or given. 2. Expected Count (E) n * hypothesized proportion 3. Name of Test 2 2 2 ( observed exp ected) ( O E) 4. Chi- Square Test Statistic - X = = with degrees of exp ected E freedom = # of categories 1. D. P- value = P( χ 2 X 2 ) The P- value is the area under the density curve to the right of X 2. Large values of X 2 are evidence against H o. E. Interpretation a. If we have a small P- Value we will reject the null hypothesis. That is, our sample produces statistically significant evidence to say that the population distribution is no longer true. b. If the test finds statistically significant result, do a follow- up analysis that compares the observed and expected counts and that looks for the largest components of the chi- square statistic. V. Calculator a. Enter observed values in List 1. b. Enter proportions from null hypothesis in List 2. c. List 3 = (List 1)(List 2) χ 2 d. Stats Tests GOF- Test Observed: L1 Expected: L3 df: # categories - 1 Calculate Draw

VI. Examples A. Acme Toy Company Acme Toy Company prints baseball cards. The company claims that 30% of the cards are rookies, 60% veterans, and 10% All- stars. The cards are sold in packages of 100. Suppose a randomly selected package of cards has 50 rookies, 45 veterans, and 5 All- stars. Is this consistent with Acme s claim? Use a 0.05 level of significance.

B. Cell Phones Are you likely to have a motor vehicle collision when using a cell phone? A study of 699 drivers who were using a cell phone when they were involved in a collision examined this question? These drivers made 26, 798 cell phone calls during a 14 month study period. Each of the 699 collisions was classified in various ways. We want to determine if the accidents are equally likely to occur on any day of the week? Here are the counts for each day of the week: Day Sun Mon Tues Wed Thur Fri Sat Total Number 20 133 126 159 136 113 12 699 Explain why is this is a Chi- Square Distribution problem.

C. Biology Grades A biology professor reports that historically grades in her introductory biology course have been distributed as follows: 15% A s, 30% B s, 40% C s, 10% D s and 5% F s. Test an appropriate hypothesis to decide if the professor s most recent grade distribution matches the historical distribution. Give statistical evidence to support your conclusion. Grades in her most recent course were distributed as follows: Grade A B C D F Frequency 89 121 78 25 12

D. The Moose Problem A study was conducted to determine where moose are found in a region containing a large burned area. A map of the study area was partitioned into the following four habitat types. The figure below shows these four habitat types. (1) Inside the burned area, not near the edge of the burned area, (2) Inside the burned area, near the edge, (3) Outside the burned area, near the edge, and (4) Outside the burned area, not near the edge. The proportion of total acreage in each of the habitat types was determined for the study area. Using an aerial survey, moose locations were observed and classified into one of the four habitat types. The results are given in the table below. Habitat Type Proportion of Total Acreage Number of Moose Observed 1 0.340 25 2 0.101 22 3 0.104 30 4 0.455 40 Total 1.000 117 (a) The researchers who are conducting the study expect the number of moose observed in a habitat type to be proportional to the amount of acreage of that type of habitat. Are the data consistent with this expectation? Conduct an appropriate statistical test to support your conclusion. Assume the conditions for inference are met. (b) Relative to the proportion of total acreage, which habitat types did the moose seem to prefer? Explain.