Lecture 9. Selected material from: Ch. 12 The analysis of categorical data and goodness of fit tests
|
|
- Warren Heath
- 5 years ago
- Views:
Transcription
1 Lecture 9 Selected material from: Ch. 12 The analysis of categorical data and goodness of fit tests
2 Univariate categorical data Univariate categorical data are best summarized in a one way frequency table. Example: Births and the Lunar cycle A common urban legend is that more babies than expected are born during certain phases of the lunar cycle, especially near the full moon. The paper The Effect of the Lunar Cycle on Frequency of Births and Birth Complications (American Journal of Obstetrics and Gynecology, 2005) reported data from a random sample of births that occurred during 24 lunar cycles. Perform a test of whether the lunar cycle has an effect on births at the.05 statistical level. 2
3 Example: Births and the Lunar cycle Data from 24 lunar cycles: Lunar phase Number of days Number of births New moon Waxing crescent First quarter Waxing gibbous Full moon Waning gibbous Last quarter Waning crescent
4 Example: Births and the Lunar cycle Notation: Lunar phase Number of days Proportion of births New moon 24 1 Waxing crescent First quarter 24 3 If there is no relationship between births and the lunar phase, the proportion of births in each lunar phase should just be: Waxing gibbous Full moon 24 5 Waning gibbous Last quarter 24 7 Waning crescent Total = 699 days Number of days in phase, Total number of days i. e., 24 / , / , /
5 Example: Births and the Lunar cycle Lunar phase Number of days Under H 0 ( i = #days/total days) New moon 24 1 =.0343 Waxing crescent =.2175 First quarter 24 3 =.0343 Waxing gibbous =.2132 Full moon 24 5 =.0343 Waning gibbous =.2146 Last quarter 24 7 =.0343 Waning crescent =.2175 Total = 699 days Total = 1.0 Null hypothesis H 0 Alternative hypothesis H a : H 0 not true, i.e., at least one of the proportions does not equal its null hypothesized value. 5
6 Example: Births and the Lunar cycle Data from 24 lunar cycles: Lunar phase Number of days Number of births New moon Waxing crescent First quarter Under H 0 : 1 =.0343 Expected number Waxing gibbous of births during the Full moon new moon = Waning gibbous , = Last quarter Waning crescent Total = 222,784 births 6
7 General formulation H 0 : 1 = hypothesized proportion for category 1 2 = hypothesized proportion for category 2 k = hypothesized proportion for category k H a : H 0 is not true, so at least one of the true category proportions differs from the corresponding hypothesized value. The expected counts for a one way frequency table are how many people you would expect in each category of the table if the null hypothesis was true. expected count of the category = total sample size hypothesized proportion for that category 7
8 Example: Births and the Lunar cycle Lunar phase Observed Number of births Expected Number under H0:no relationship between lunar cycle and births (n j ) New moon , = Waxing crescent , = First quarter , = Waxing gibbous , = Full moon , = Waning gibbous , = Last quarter , = Waning crescent , = n = 222,784 births 8
9 Example: Births and the Lunar cycle Lunar phase Observed Births (O) Expected Births (E) O E (O E) 2 /E New moon Waxing crescent First quarter Waxing gibbous Full moon Waning gibbous Last quarter Waning crescent
10 Goodness of fit statistic, 2 The goodness of fit statistic, 2, results from first computing the quantity 2 (observed cell count - expected cell count) expected cell count for each cell. The value of the 2 statistic is the sum of these terms. 2 (observed cell count - expected cell count) 2 expected cell count measures how good the fit of the hypothesized proportions under H 0 are to the observed proportions: large χ 2 poor fit if H 0 is true, it follows a chi square distribution, which can then be used to calculate p values. 10
11 Chi square distributions skewed non negative Chi-square Distributions df = 1 df = 2 df = 3 df = 4 df = 5 df = 8 df = 10 df = x
12 Upper tail areas for chi square distributions Right-tail area df = 1 df = 2 df = 3 df = 4 df = 5 >.100 < 2.70 < 4.60 < 6.25 < 7.77 < <.001 > > > > > Right-tail area df = 6 df = 7 df = 8 df = 9 df = 10 >.100 < < < < <
13 Upper tail areas for chi square distributions Right-tail area df = 1 df = 2 df = 3 df = 4 df = 5 >.100 < 2.70 < 4.60 < 6.25 < 7.77 < P( X > 5.05) = where X has a chi square distribution with 2 df <.001 > > > > > Right-tail area df = 6 df = 7 df = 8 df = 9 df = 10 >.100 < < < < <
14 Goodness of fit test procedure Assumptions: 1. Observed cell counts are based on a random sample. 2. The sample size is large: every expected count is at least 5. Hypotheses: H 0 : 1 = hypothesized proportion for category 1 2 = hypothesized proportion for category 2 k = hypothesized proportion for category k H a :H 0 is not true Test statistic: 2 (observed cell count - expected cell count) 2 expected cell count 2 ~ 1 k 14
15 Goodness of fit test procedure Significance level α of the test is set before the test is performed. p value: The area to the right of 2 under the df = k 1 chi square curve. p value H 0 is rejected in favor of H a if p value < α. 15
16 Example: Births and the Lunar cycle Lunar phase Observed Births (O) Expected Births (E) O E (O E) 2 /E New moon Waxing crescent First quarter Waxing gibbous Full moon Waning gibbous Last quarter Waning crescent k 1 = 8 1 = 7 df
17 Upper tail areas for chi square distributions Right-tail area df = 1 df = 2 df = 3 df = 4 df = 5 >.100 < 2.70 < 4.60 < 6.25 < 7.77 < P( ) 0.1 since Since 0.05 is statistical level of test, and the p - value, do not reject H Conclude there is no relationship between lunar cycle, births. 0. the the <.001 > > > > > Right-tail area df = 6 df = 7 df = 8 df = 9 df = 10 >.100 < < < < <
18 Two-way tables: Classroom data Code Age Weight Height Gender Vision Smoke Male Glasses No Male None No Male None No Male None No Male None No Male Contacts No Male Glasses No Male Contacts No Data from last 39 of the Male None Yes Male Contacts Yes students Male Glasses No Male Glasses No Male Glasses Yes Summarize the two Male Contacts No Female Glasses No categorical variable Female Glasses No Female Contacts No columns, gender and vision, Female Glasses No Female Glasses No Female Contacts No in a smaller two way table Female None Yes Female Glasses No and use that to test for a Female None No Female None No relationship between them Female Contacts Yes Female Contacts No Female Glasses No Female Contacts No Female Glasses Yes Female None No Female None No Female None No Female None No Female None No Female None No Female None No Female Glasses No Female None No Female Glasses No
19 Two way tables Contacts Glasses None Female Male This is a two way frequency table, or contingency table. The number entries in the table are the observed counts. Interpretation: 5 females who wore contacts 9 females who wore glasses 11 females with no vision correction 5 males who wore contacts 22 males with glasses 27 males with no correction 19
20 Code Age Weight Height Gender Vision Smoke Male None No Classroom data Male Glasses No Male None No Male Glasses No Male Glasses No Male None No Male None No Male None No Male None No Male Glasses No This table is made simply by Male None No Male Glasses Yes counting in which cell each of Male Glasses No Male Glasses No the 79 students falls in Male Glasses No Male None No Male None No Male None No Male None No Male Contacts No Male None No Male Glasses No Male None No Contacts Glasses None Male None No Male None No Female Male None No Male Glasses No Male Male Glasses No Male Glasses No Male None Yes Male Glasses No Male Glasses No Male None No Male None No Male None No Male Glasses No Male Glasses No = Male Glasses No Male None Yes Male None No
21 Two way tables Marginal totals are obtained by adding the observed cell counts in each row and also in each column. Row Marginal Total Contacts Glasses None Female Male Column Marginal Total The sum of the column marginal totals (or the row marginal totals) is called the grand total. 21
22 Two way tables Row Marginal Total Contacts Glasses None Female Male Column Marginal Total people in 31 glasses 38 no vision the class wore correction contacts 25 females in the class, 54 males 79 students total 22
23 Tests for homogeneity in a two way table Very often in a two way table the rows indicate different populations and the columns indicate different categories. Population of males and females Contacts Glasses None Female Male Categories of vision correction A test of homogeneity (sameness) is a test that the proportion of people in the different categories is the same for both populations. For example, here, whether the proportion of females wearing contacts, glasses or nothing is the same as the proportion of males. 23
24 Tests for homogeneity in a two way table H 0 : category proportions are the same for males and females (homogeneity) vs. H a : the category proportions are not all the same for males and females First step: get the expected counts under H 0 for each cell. Row Marginal Total Contacts Glasses None Female Male Column Marginal Total
25 Tests for homogeneity in a two way table Row Marginal Total Contacts Glasses None Female Male Column Marginal Total Expected count for this cell: Under H 0, the proportion of males and females wearing contacts is the same, so the expected proportion of males wearing contacts is just the overall proportion (proportion over both males and females). 10/79 = proportion of all students using contacts 54 = number of male students 10/79 54 = 6.84 = expected number of males that use contacts 25
26 Tests for homogeneity in a two way table This is equivalent to using the formula: Expected cell count = (Row total)(column total) Grand total Row Marginal Total Contacts Glasses None Female Male Column Marginal Total column total row total grand total
27 Tests for homogeneity in a two way table for each cell... Female Male Column Marginal Total Contacts Glasses None Row Marginal Total
28 Tests for homogeneity in a two way table Female Male Column Marginal Total Contacts Glasses None (3.16) (9.81) (12.03) (6.84) (21.19) (25.97) Row Marginal Total Expected number of males with no vision correction under H 0. The expected counts under the assumption that males and females have the same vision correction (H 0 ) look close to what was observed in the random sample, so maybe the data support H 0. 28
29 Comparing two or more populations using the 2 statistic Hypotheses: H 0 : The true category proportions are the same for all of the populations (homogeneity of populations). H a : The true category proportions are not all the same for all of the populations. Test statistic: 2 (observed cell count - expected cell count) 2 expected cell count The expected cell counts are estimated from the sample data (assuming that H 0 is true) using the formula Expected cell count = (Row total)(column total) Grand total 29
30 Comparing two or more populations Assumptions: 1. The data consist of independently chosen random samples. 2. The sample size is large: all expected counts are at least 5. If some expected counts are less than 5, rows or columns of the table may be combined to achieve a table with satisfactory expected counts. p value: When H 0 is true, 2 has approximately a chi square distribution with df = (number of rows 1)(number of columns 1) Calculate the area to the right of the observed 2 statistic. 30
31 Back to classroom data example Perform a test at the.05 level of significance: H 0 : category proportions are the same for males and females (homogeneity) vs. H a : the category proportions are not all the same for males and females Row Marginal Total Contacts Glasses None Female Male Column Marginal Total
32 Tests for homogeneity in a two way table Female Male Column Marginal Total Contacts Glasses None (3.16) (9.81) (12.03) (6.84) (21.19) (25.97) Row Marginal Total Assumptions Females, males randomly selected (known from before). All expected counts are at least 5. 32
33 Tests for homogeneity in a two way table Female Male Column Marginal Total Contacts Glasses None (3.16) (9.81) (12.03) (6.84) (21.19) (25.97) Row Marginal Total Solutions: Either collect a larger random sample or agree to collapse contacts and glasses into a single category, such as Any Vision Correction, and then do the test. Use Fisher s exact test (not covered in this course). For now, no test
34 Example: Shopping A student randomly observed 57 shoppers in a supermarket to see if males and females carried their items the same way. Device Row Marginal Total Gender Cart Basket Nothing Male Female Column Marginal Total Determine if the carrying device proportions are the same for both genders using a 0.05 level of significance. 34
35 Example: Shopping Hypotheses: H 0 : H a : The true proportions of the device used are the same for both genders. The true proportions of the device used are not the same for both genders. Significance level: = 0.05 Test statistic: 2 (observed cell count - expected cell count) 2 expected cell count 35
36 Example: Shopping A computer program such as minitab gives: Chi-Square Test: Basket, Cart, Nothing Expected counts are printed below observed counts (All > 5) Basket Cart Nothing Total Total Chi-Sq = = DF = 2, P-Value = > 0.05, do not reject H0. Exercise: Perform the calculations using the formulas to see if you get the same answer. 36
37 2 test for independence The exact same 2 test statistic and procedure can be used to investigate the association/independence between any two categorical variables on a single population (note the difference to the test of homogeneity, which tests for a difference between multiple populations). Hypotheses: H 0 : The two variables are independent. H a : The two variables are not independent. The expected counts under the null hypothesis of independence follow the same formula: Expected cell count = (Row total)(column total) Grand total 37
38 Example: Pain A medical paper examined the relationship between a nurse s assessment of a patient s facial expression and the patient s self reported level of pain. tennis elbow Data for 89 randomly selected patients. Patient facial expression Patient report No pain Pain 40 patients reported they had pain but their facial expression did not show any pain. No pain Pain patients reported no pain but their facial expression indicated pain. 38
39 Example: Pain The question: was there an association between whether or not the patient reported pain and the look of pain on his/her face? Or did they have nothing to do with each other? Patient facial expression Patient report No pain Pain No pain Perform a test at the.05 level of signficance. Pain
40 Example: Pain Hypotheses: H 0 : Patient self report and facial expression of pain are independent. H a : Patient self report and facial expression of pain are not independent. Significance level:.05 Test statistic: 2 (observed cell count - expected cell count) 2 expected cell count 40
41 Example: Pain Expected cell count = (Row total)(column total) Grand total Patient facial expression No pain Pain Patient report No pain 17 (12.81) 3 (7.19) Pain Row totals 40 (44.19) (24.81) 32 Assumptions Random sample All expected counts 5 Column totals
42 Example: Pain Calculation: 2 ( ) ( ) The table has 2 rows and 2 columns so df = (2 1)(2 1) = 1. The entry closest to 4.92 in the chi square table is 5.02, so the approximate p value is p value.025. Conclusion: Reject H 0 in favor of H a, that patient self report and facial expression of pain are not independent. 42
43 Tutorial 9: Smoking and Alcohol Do people who drink more tend to smoke more? A study in the New England Journal of Medicine in 1999 addressed this question by asking 21,870 male doctors ages years. They collected the following data. Smoking status Never smoked Past smoker Current smoker Alcohol: number of drinks per week < > ) To answer the question should we do a test of homogeneity or a test of independence? Why? 2.) Write the null and alternative hypotheses. 43
44 Tutorial 9: Smoking and Alcohol Smoking status Never smoked Past smoker Current smoker Alcohol: number of drinks per week < > ) Calculate row percentages by dividing each observed count by the corresponding row total. Graph your results for each smoking category separately as relative frequency barcharts (Lecture 1) and compare them. Comment on whether the graphs support the null versus alternative hypothesis? 44
45 Tutorial 9: Smoking and Alcohol Smoking status Never smoked Past smoker Current smoker Alcohol: number of drinks per week < > ) The chi square statistic for performing a test of the null hypothesis at the.05 level is calculated to be and the p value is less than State the degrees of freedom of the chi square distribution used for this test, sketch the chi square distribution itself, and indicate the statistic value and the p value on the distribution. 5.) What are the conclusions of the test? Do they agree with your graphs and what you would expect? 45
Lecture 41 Sections Mon, Apr 7, 2008
Lecture 41 Sections 14.1-14.3 Hampden-Sydney College Mon, Apr 7, 2008 Outline 1 2 3 4 5 one-proportion test that we just studied allows us to test a hypothesis concerning one proportion, or two categories,
More informationLecture 28 Chi-Square Analysis
Lecture 28 STAT 225 Introduction to Probability Models April 23, 2014 Whitney Huang Purdue University 28.1 χ 2 test for For a given contingency table, we want to test if two have a relationship or not
More informationContingency Tables. Safety equipment in use Fatal Non-fatal Total. None 1, , ,128 Seat belt , ,878
Contingency Tables I. Definition & Examples. A) Contingency tables are tables where we are looking at two (or more - but we won t cover three or more way tables, it s way too complicated) factors, each
More information10.2: The Chi Square Test for Goodness of Fit
10.2: The Chi Square Test for Goodness of Fit We can perform a hypothesis test to determine whether the distribution of a single categorical variable is following a proposed distribution. We call this
More informationStatistics for Managers Using Microsoft Excel
Statistics for Managers Using Microsoft Excel 7 th Edition Chapter 1 Chi-Square Tests and Nonparametric Tests Statistics for Managers Using Microsoft Excel 7e Copyright 014 Pearson Education, Inc. Chap
More informationSection 9.5. Testing the Difference Between Two Variances. Bluman, Chapter 9 1
Section 9.5 Testing the Difference Between Two Variances Bluman, Chapter 9 1 This the last day the class meets before spring break starts. Please make sure to be present for the test or make appropriate
More informationContingency Tables. Contingency tables are used when we want to looking at two (or more) factors. Each factor might have two more or levels.
Contingency Tables Definition & Examples. Contingency tables are used when we want to looking at two (or more) factors. Each factor might have two more or levels. (Using more than two factors gets complicated,
More informationGoodness of Fit Tests
Goodness of Fit Tests Marc H. Mehlman marcmehlman@yahoo.com University of New Haven (University of New Haven) Goodness of Fit Tests 1 / 38 Table of Contents 1 Goodness of Fit Chi Squared Test 2 Tests of
More information10.4 Hypothesis Testing: Two Independent Samples Proportion
10.4 Hypothesis Testing: Two Independent Samples Proportion Example 3: Smoking cigarettes has been known to cause cancer and other ailments. One politician believes that a higher tax should be imposed
More informationTopic 21 Goodness of Fit
Topic 21 Goodness of Fit Contingency Tables 1 / 11 Introduction Two-way Table Smoking Habits The Hypothesis The Test Statistic Degrees of Freedom Outline 2 / 11 Introduction Contingency tables, also known
More informationParametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami
Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami Parametric Assumptions The observations must be independent. Dependent variable should be continuous
More informationChapter 10. Chapter 10. Multinomial Experiments and. Multinomial Experiments and Contingency Tables. Contingency Tables.
Chapter 10 Multinomial Experiments and Contingency Tables 1 Chapter 10 Multinomial Experiments and Contingency Tables 10-1 1 Overview 10-2 2 Multinomial Experiments: of-fitfit 10-3 3 Contingency Tables:
More informationSTAT Chapter 13: Categorical Data. Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure).
STAT 515 -- Chapter 13: Categorical Data Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure). Many studies allow for more than 2 categories. Example
More informationBasic Business Statistics, 10/e
Chapter 1 1-1 Basic Business Statistics 11 th Edition Chapter 1 Chi-Square Tests and Nonparametric Tests Basic Business Statistics, 11e 009 Prentice-Hall, Inc. Chap 1-1 Learning Objectives In this chapter,
More informationWe know from STAT.1030 that the relevant test statistic for equality of proportions is:
2. Chi 2 -tests for equality of proportions Introduction: Two Samples Consider comparing the sample proportions p 1 and p 2 in independent random samples of size n 1 and n 2 out of two populations which
More informationModule 10: Analysis of Categorical Data Statistics (OA3102)
Module 10: Analysis of Categorical Data Statistics (OA3102) Professor Ron Fricker Naval Postgraduate School Monterey, California Reading assignment: WM&S chapter 14.1-14.7 Revision: 3-12 1 Goals for this
More informationThe goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions.
The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions. A common problem of this type is concerned with determining
More informationSTP 226 EXAMPLE EXAM #3 INSTRUCTOR:
STP 226 EXAMPLE EXAM #3 INSTRUCTOR: Honor Statement: I have neither given nor received information regarding this exam, and I will not do so until all exams have been graded and returned. Signed Date PRINTED
More information10: Crosstabs & Independent Proportions
10: Crosstabs & Independent Proportions p. 10.1 P Background < Two independent groups < Binary outcome < Compare binomial proportions P Illustrative example ( oswege.sav ) < Food poisoning following church
More informationLecture 5: ANOVA and Correlation
Lecture 5: ANOVA and Correlation Ani Manichaikul amanicha@jhsph.edu 23 April 2007 1 / 62 Comparing Multiple Groups Continous data: comparing means Analysis of variance Binary data: comparing proportions
More informationCh. 11 Inference for Distributions of Categorical Data
Ch. 11 Inference for Distributions of Categorical Data CH. 11 2 INFERENCES FOR RELATIONSHIPS The two sample z procedures from Ch. 10 allowed us to compare proportions of successes in two populations or
More informationPsych 230. Psychological Measurement and Statistics
Psych 230 Psychological Measurement and Statistics Pedro Wolf December 9, 2009 This Time. Non-Parametric statistics Chi-Square test One-way Two-way Statistical Testing 1. Decide which test to use 2. State
More information11-2 Multinomial Experiment
Chapter 11 Multinomial Experiments and Contingency Tables 1 Chapter 11 Multinomial Experiments and Contingency Tables 11-11 Overview 11-2 Multinomial Experiments: Goodness-of-fitfit 11-3 Contingency Tables:
More informationChapter 8 Student Lecture Notes 8-1. Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance
Chapter 8 Student Lecture Notes 8-1 Department of Economics Business Statistics Chapter 1 Chi-square test of independence & Analysis of Variance ECON 509 Dr. Mohammad Zainal Chapter Goals After completing
More information13.1 Categorical Data and the Multinomial Experiment
Chapter 13 Categorical Data Analysis 13.1 Categorical Data and the Multinomial Experiment Recall Variable: (numerical) variable (i.e. # of students, temperature, height,). (non-numerical, categorical)
More informationDepartment of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance ECON 509. Dr.
Department of Economics Business Statistics Chapter 1 Chi-square test of independence & Analysis of Variance ECON 509 Dr. Mohammad Zainal Chapter Goals After completing this chapter, you should be able
More informationSTP 226 ELEMENTARY STATISTICS NOTES
STP 226 ELEMENTARY STATISTICS NOTES PART 1V INFERENTIAL STATISTICS CHAPTER 12 CHI SQUARE PROCEDURES 12.1 The Chi Square Distribution A variable has a chi square distribution if the shape of its distribution
More information16.400/453J Human Factors Engineering. Design of Experiments II
J Human Factors Engineering Design of Experiments II Review Experiment Design and Descriptive Statistics Research question, independent and dependent variables, histograms, box plots, etc. Inferential
More informationFrequency Distribution Cross-Tabulation
Frequency Distribution Cross-Tabulation 1) Overview 2) Frequency Distribution 3) Statistics Associated with Frequency Distribution i. Measures of Location ii. Measures of Variability iii. Measures of Shape
More informationCategorical Data Analysis. The data are often just counts of how many things each category has.
Categorical Data Analysis So far we ve been looking at continuous data arranged into one or two groups, where each group has more than one observation. E.g., a series of measurements on one or two things.
More informationCHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC
CHI SQUARE ANALYSIS I N T R O D U C T I O N T O N O N - P A R A M E T R I C A N A L Y S E S HYPOTHESIS TESTS SO FAR We ve discussed One-sample t-test Dependent Sample t-tests Independent Samples t-tests
More informationSTAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression
STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression Rebecca Barter April 20, 2015 Fisher s Exact Test Fisher s Exact Test
More informationInferences About Two Proportions
Inferences About Two Proportions Quantitative Methods II Plan for Today Sampling two populations Confidence intervals for differences of two proportions Testing the difference of proportions Examples 1
More informationStatistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017
Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017 I. χ 2 or chi-square test Objectives: Compare how close an experimentally derived value agrees with an expected value. One method to
More informationTesting Independence
Testing Independence Dipankar Bandyopadhyay Department of Biostatistics, Virginia Commonwealth University BIOS 625: Categorical Data & GLM 1/50 Testing Independence Previously, we looked at RR = OR = 1
More informationCHAPTER 17 CHI-SQUARE AND OTHER NONPARAMETRIC TESTS FROM: PAGANO, R. R. (2007)
FROM: PAGANO, R. R. (007) I. INTRODUCTION: DISTINCTION BETWEEN PARAMETRIC AND NON-PARAMETRIC TESTS Statistical inference tests are often classified as to whether they are parametric or nonparametric Parameter
More informationChapter 26: Comparing Counts (Chi Square)
Chapter 6: Comparing Counts (Chi Square) We ve seen that you can turn a qualitative variable into a quantitative one (by counting the number of successes and failures), but that s a compromise it forces
More informationTA: Sheng Zhgang (Th 1:20) / 342 (W 1:20) / 343 (W 2:25) / 344 (W 12:05) Haoyang Fan (W 1:20) / 346 (Th 12:05) FINAL EXAM
STAT 301, Fall 2011 Name Lec 4: Ismor Fischer Discussion Section: Please circle one! TA: Sheng Zhgang... 341 (Th 1:20) / 342 (W 1:20) / 343 (W 2:25) / 344 (W 12:05) Haoyang Fan... 345 (W 1:20) / 346 (Th
More informationRelate Attributes and Counts
Relate Attributes and Counts This procedure is designed to summarize data that classifies observations according to two categorical factors. The data may consist of either: 1. Two Attribute variables.
More informationLecture 2: Categorical Variable. A nice book about categorical variable is An Introduction to Categorical Data Analysis authored by Alan Agresti
Lecture 2: Categorical Variable A nice book about categorical variable is An Introduction to Categorical Data Analysis authored by Alan Agresti 1 Categorical Variable Categorical variable is qualitative
More informationIntroduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution
Introduction to Statistical Data Analysis Lecture 7: The Chi-Square Distribution James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis
More informationChapter 10: Chi-Square and F Distributions
Chapter 10: Chi-Square and F Distributions Chapter Notes 1 Chi-Square: Tests of Independence 2 4 & of Homogeneity 2 Chi-Square: Goodness of Fit 5 6 3 Testing & Estimating a Single Variance 7 10 or Standard
More informationSection 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples
Objective Section 9.4 Inferences About Two Means (Matched Pairs) Compare of two matched-paired means using two samples from each population. Hypothesis Tests and Confidence Intervals of two dependent means
More informationLecture 41 Sections Wed, Nov 12, 2008
Lecture 41 Sections 14.1-14.3 Hampden-Sydney College Wed, Nov 12, 2008 Outline 1 2 3 4 5 6 7 one-proportion test that we just studied allows us to test a hypothesis concerning one proportion, or two categories,
More informationRelax and good luck! STP 231 Example EXAM #2. Instructor: Ela Jackiewicz
STP 31 Example EXAM # Instructor: Ela Jackiewicz Honor Statement: I have neither given nor received information regarding this exam, and I will not do so until all exams have been graded and returned.
More informationExample. χ 2 = Continued on the next page. All cells
Section 11.1 Chi Square Statistic k Categories 1 st 2 nd 3 rd k th Total Observed Frequencies O 1 O 2 O 3 O k n Expected Frequencies E 1 E 2 E 3 E k n O 1 + O 2 + O 3 + + O k = n E 1 + E 2 + E 3 + + E
More informationDiscrete Multivariate Statistics
Discrete Multivariate Statistics Univariate Discrete Random variables Let X be a discrete random variable which, in this module, will be assumed to take a finite number of t different values which are
More informationMAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1. MAT 2379, Introduction to Biostatistics
MAT 2379, Introduction to Biostatistics, Sample Calculator Questions 1 MAT 2379, Introduction to Biostatistics Sample Calculator Problems for the Final Exam Note: The exam will also contain some problems
More informationRon Heck, Fall Week 3: Notes Building a Two-Level Model
Ron Heck, Fall 2011 1 EDEP 768E: Seminar on Multilevel Modeling rev. 9/6/2011@11:27pm Week 3: Notes Building a Two-Level Model We will build a model to explain student math achievement using student-level
More informationModule 03 Lecture 14 Inferential Statistics ANOVA and TOI
Introduction of Data Analytics Prof. Nandan Sudarsanam and Prof. B Ravindran Department of Management Studies and Department of Computer Science and Engineering Indian Institute of Technology, Madras Module
More informationClass 24. Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science. Marquette University MATH 1700
Class 4 Daniel B. Rowe, Ph.D. Department of Mathematics, Statistics, and Computer Science Copyright 013 by D.B. Rowe 1 Agenda: Recap Chapter 9. and 9.3 Lecture Chapter 10.1-10.3 Review Exam 6 Problem Solving
More informationNominal Data. Parametric Statistics. Nonparametric Statistics. Parametric vs Nonparametric Tests. Greg C Elvers
Nominal Data Greg C Elvers 1 Parametric Statistics The inferential statistics that we have discussed, such as t and ANOVA, are parametric statistics A parametric statistic is a statistic that makes certain
More informationInferential statistics
Inferential statistics Inference involves making a Generalization about a larger group of individuals on the basis of a subset or sample. Ahmed-Refat-ZU Null and alternative hypotheses In hypotheses testing,
More information10.2 Hypothesis Testing with Two-Way Tables
10.2 Hypothesis Testing with Two-Way Tables Part 2: more examples 3x3 Two way table 2x3 Two-way table (worksheet) 1 Example 2: n Is there an association between the type of school area and the students'
More informationASTRO Fall 2012 LAB #5: Observing the Moon
ASTRO 1050 - Fall 2012 LAB #5: Observing the Moon ABSTRACT In this lab we will create a model of the earth-moon-sun system. This will allow us to explain the moon s rotation, why the moon has phases, and
More informationRama Nada. -Ensherah Mokheemer. 1 P a g e
- 9 - Rama Nada -Ensherah Mokheemer - 1 P a g e Quick revision: Remember from the last lecture that chi square is an example of nonparametric test, other examples include Kruskal Wallis, Mann Whitney and
More informationCIVL /8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8
CIVL - 7904/8904 T R A F F I C F L O W T H E O R Y L E C T U R E - 8 Chi-square Test How to determine the interval from a continuous distribution I = Range 1 + 3.322(logN) I-> Range of the class interval
More informationStatistical methods for comparing multiple groups. Lecture 7: ANOVA. ANOVA: Definition. ANOVA: Concepts
Statistical methods for comparing multiple groups Lecture 7: ANOVA Sandy Eckel seckel@jhsph.edu 30 April 2008 Continuous data: comparing multiple means Analysis of variance Binary data: comparing multiple
More informationTables Table A Table B Table C Table D Table E 675
BMTables.indd Page 675 11/15/11 4:25:16 PM user-s163 Tables Table A Standard Normal Probabilities Table B Random Digits Table C t Distribution Critical Values Table D Chi-square Distribution Critical Values
More informationStatistics 3858 : Contingency Tables
Statistics 3858 : Contingency Tables 1 Introduction Before proceeding with this topic the student should review generalized likelihood ratios ΛX) for multinomial distributions, its relation to Pearson
More informationDisadvantages of using many pooled t procedures. The sampling distribution of the sample means. The variability between the sample means
Stat 529 (Winter 2011) Analysis of Variance (ANOVA) Reading: Sections 5.1 5.3. Introduction and notation Birthweight example Disadvantages of using many pooled t procedures The analysis of variance procedure
More informationChapter 14. Multiple Regression Models. Multiple Regression Models. Multiple Regression Models
Chapter 14 Multiple Regression Models 1 Multiple Regression Models A general additive multiple regression model, which relates a dependent variable y to k predictor variables,,, is given by the model equation
More informationThe Difference in Proportions Test
Overview The Difference in Proportions Test Dr Tom Ilvento Department of Food and Resource Economics A Difference of Proportions test is based on large sample only Same strategy as for the mean We calculate
More informationχ test statistics of 2.5? χ we see that: χ indicate agreement between the two sets of frequencies.
I. T or F. (1 points each) 1. The χ -distribution is symmetric. F. The χ may be negative, zero, or positive F 3. The chi-square distribution is skewed to the right. T 4. The observed frequency of a cell
More informationST3241 Categorical Data Analysis I Two-way Contingency Tables. Odds Ratio and Tests of Independence
ST3241 Categorical Data Analysis I Two-way Contingency Tables Odds Ratio and Tests of Independence 1 Inference For Odds Ratio (p. 24) For small to moderate sample size, the distribution of sample odds
More informationHYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC
1 HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC 7 steps of Hypothesis Testing 1. State the hypotheses 2. Identify level of significant 3. Identify the critical values 4. Calculate test statistics 5. Compare
More informationReview of One-way Tables and SAS
Stat 504, Lecture 7 1 Review of One-way Tables and SAS In-class exercises: Ex1, Ex2, and Ex3 from http://v8doc.sas.com/sashtml/proc/z0146708.htm To calculate p-value for a X 2 or G 2 in SAS: http://v8doc.sas.com/sashtml/lgref/z0245929.htmz0845409
More informationMarketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12)
Marketing Research Session 10 Hypothesis Testing with Simple Random samples (Chapter 12) Remember: Z.05 = 1.645, Z.01 = 2.33 We will only cover one-sided hypothesis testing (cases 12.3, 12.4.2, 12.5.2,
More information:the actual population proportion are equal to the hypothesized sample proportions 2. H a
AP Statistics Chapter 14 Chi- Square Distribution Procedures I. Chi- Square Distribution ( χ 2 ) The chi- square test is used when comparing categorical data or multiple proportions. a. Family of only
More informationSalt Lake Community College MATH 1040 Final Exam Fall Semester 2011 Form E
Salt Lake Community College MATH 1040 Final Exam Fall Semester 011 Form E Name Instructor Time Limit: 10 minutes Any hand-held calculator may be used. Computers, cell phones, or other communication devices
More informationLecture 26 Section 8.4. Wed, Oct 14, 2009
PDFs n = Lecture 26 Section 8.4 Hampden-Sydney College Wed, Oct 14, 2009 Outline PDFs n = 1 2 PDFs n = 3 4 5 6 Outline PDFs n = 1 2 PDFs n = 3 4 5 6 PDFs n = Exercise 8.12, page 528. Suppose that 60% of
More informationPOLI 443 Applied Political Research
POLI 443 Applied Political Research Session 6: Tests of Hypotheses Contingency Analysis Lecturer: Prof. A. Essuman-Johnson, Dept. of Political Science Contact Information: aessuman-johnson@ug.edu.gh College
More informationThe Chi-Square Distributions
MATH 183 The Chi-Square Distributions Dr. Neal, WKU The chi-square distributions can be used in statistics to analyze the standard deviation σ of a normally distributed measurement and to test the goodness
More informationCBA4 is live in practice mode this week exam mode from Saturday!
Announcements CBA4 is live in practice mode this week exam mode from Saturday! Material covered: Confidence intervals (both cases) 1 sample hypothesis tests (both cases) Hypothesis tests for 2 means as
More informationPsych Jan. 5, 2005
Psych 124 1 Wee 1: Introductory Notes on Variables and Probability Distributions (1/5/05) (Reading: Aron & Aron, Chaps. 1, 14, and this Handout.) All handouts are available outside Mija s office. Lecture
More information11 CHI-SQUARED Introduction. Objectives. How random are your numbers? After studying this chapter you should
11 CHI-SQUARED Chapter 11 Chi-squared Objectives After studying this chapter you should be able to use the χ 2 distribution to test if a set of observations fits an appropriate model; know how to calculate
More informationChapter 9 Inferences from Two Samples
Chapter 9 Inferences from Two Samples 9-1 Review and Preview 9-2 Two Proportions 9-3 Two Means: Independent Samples 9-4 Two Dependent Samples (Matched Pairs) 9-5 Two Variances or Standard Deviations Review
More informationModule 4 Practice problem and Homework answers
Module 4 Practice problem and Homework answers Practice Problems, pages 2 3 OBSERVED cat no cat total over 3 7 10 under 9 3 12 total 12 10 22 The observed table simply summarizes the data from page 1.
More informationAP Final Review II Exploring Data (20% 30%)
AP Final Review II Exploring Data (20% 30%) Quantitative vs Categorical Variables Quantitative variables are numerical values for which arithmetic operations such as means make sense. It is usually a measure
More informationChapter Eight: Assessment of Relationships 1/42
Chapter Eight: Assessment of Relationships 1/42 8.1 Introduction 2/42 Background This chapter deals, primarily, with two topics. The Pearson product-moment correlation coefficient. The chi-square test
More informationSTAT 212 Business Statistics II 1
STAT 1 Business Statistics II 1 KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA STAT 1: BUSINESS STATISTICS II Semester 091 Final Exam Thursday Feb
More informationLecture 15 - ANOVA cont.
Lecture 15 - ANOVA cont. Statistics 102 Colin Rundel March 18, 2013 One-way ANOVA Example - Alfalfa Example - Alfalfa (11.6.1) Researchers were interested in the effect that acid has on the growth rate
More informationTwo-Sample Inferential Statistics
The t Test for Two Independent Samples 1 Two-Sample Inferential Statistics In an experiment there are two or more conditions One condition is often called the control condition in which the treatment is
More informationPS5: Two Variable Statistics LT3: Linear regression LT4: The test of independence.
PS5: Two Variable Statistics LT3: Linear regression LT4: The test of independence. Example by eye. On a hot day, nine cars were left in the sun in a car parking lot. The length of time each car was left
More informationQuantitative Analysis and Empirical Methods
Hypothesis testing Sciences Po, Paris, CEE / LIEPP Introduction Hypotheses Procedure of hypothesis testing Two-tailed and one-tailed tests Statistical tests with categorical variables A hypothesis A testable
More informationChi-square (χ 2 ) Tests
Math 442 - Mathematical Statistics II April 30, 2018 Chi-square (χ 2 ) Tests Common Uses of the χ 2 test. 1. Testing Goodness-of-fit. 2. Testing Equality of Several Proportions. 3. Homogeneity Test. 4.
More informationLecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t
Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t t Confidence Interval for Population Mean Comparing z and t Confidence Intervals When neither z nor t Applies
More informationThe t-statistic. Student s t Test
The t-statistic 1 Student s t Test When the population standard deviation is not known, you cannot use a z score hypothesis test Use Student s t test instead Student s t, or t test is, conceptually, very
More informationChapters 4-6: Inference with two samples Read sections 4.2.5, 5.2, 5.3, 6.2
Chapters 4-6: Inference with two samples Read sections 45, 5, 53, 6 COMPARING TWO POPULATION MEANS When presented with two samples that you wish to compare, there are two possibilities: I independent samples
More informationExample - Alfalfa (11.6.1) Lecture 14 - ANOVA cont. Alfalfa Hypotheses. Treatment Effect
(11.6.1) Lecture 14 - ANOVA cont. Sta102 / BME102 Colin Rundel March 19, 2014 Researchers were interested in the effect that acid has on the growth rate of alfalfa plants. They created three treatment
More informationSampling Distribution of a Sample Proportion
Sampling Distribution of a Sample Proportion Lecture 26 Section 8.4 Robb T. Koether Hampden-Sydney College Mon, Mar 1, 2010 Robb T. Koether (Hampden-Sydney College) Sampling Distribution of a Sample Proportion
More informationLing 289 Contingency Table Statistics
Ling 289 Contingency Table Statistics Roger Levy and Christopher Manning This is a summary of the material that we ve covered on contingency tables. Contingency tables: introduction Odds ratios Counting,
More informationT test for two Independent Samples. Raja, BSc.N, DCHN, RN Nursing Instructor Acknowledgement: Ms. Saima Hirani June 07, 2016
T test for two Independent Samples Raja, BSc.N, DCHN, RN Nursing Instructor Acknowledgement: Ms. Saima Hirani June 07, 2016 Q1. The mean serum creatinine level is measured in 36 patients after they received
More informationSuppose that we are concerned about the effects of smoking. How could we deal with this?
Suppose that we want to study the relationship between coffee drinking and heart attacks in adult males under 55. In particular, we want to know if there is an association between coffee drinking and heart
More informationMath 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010
Math 1040 Final Exam Form A Introduction to Statistics Fall Semester 2010 Instructor Name Time Limit: 120 minutes Any calculator is okay. Necessary tables and formulas are attached to the back of the exam.
More informationOutline for Today. Review of In-class Exercise Bivariate Hypothesis Test 2: Difference of Means Bivariate Hypothesis Testing 3: Correla
Outline for Today 1 Review of In-class Exercise 2 Bivariate hypothesis testing 2: difference of means 3 Bivariate hypothesis testing 3: correlation 2 / 51 Task for ext Week Any questions? 3 / 51 In-class
More information15: CHI SQUARED TESTS
15: CHI SQUARED ESS MULIPLE CHOICE QUESIONS In the following multiple choice questions, please circle the correct answer. 1. Which statistical technique is appropriate when we describe a single population
More informationLecture 22. December 19, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.
Lecture 22 Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University December 19, 2007 1 2 3 4 5 6 7 8 9 1 tests for equivalence of two binomial 2 tests for,
More informationCohen s s Kappa and Log-linear Models
Cohen s s Kappa and Log-linear Models HRP 261 03/03/03 10-11 11 am 1. Cohen s Kappa Actual agreement = sum of the proportions found on the diagonals. π ii Cohen: Compare the actual agreement with the chance
More informationMath 2000 Practice Final Exam: Homework problems to review. Problem numbers
Math 2000 Practice Final Exam: Homework problems to review Pages: Problem numbers 52 20 65 1 181 14 189 23, 30 245 56 256 13 280 4, 15 301 21 315 18 379 14 388 13 441 13 450 10 461 1 553 13, 16 561 13,
More information