Three-Way Contingency Tables

Similar documents
Longitudinal Modeling with Logistic Regression

Testing Independence

ST3241 Categorical Data Analysis I Two-way Contingency Tables. Odds Ratio and Tests of Independence

Ordinal Variables in 2 way Tables

Analysis of Categorical Data Three-Way Contingency Table

3 Way Tables Edpsy/Psych/Soc 589

Cohen s s Kappa and Log-linear Models

Multinomial Logistic Regression Models

Statistics 3858 : Contingency Tables

8 Nominal and Ordinal Logistic Regression

Multi-Level Test of Independence for 2 X 2 Contingency Table using Cochran and Mantel Haenszel Statistics

Analytic Methods for Applied Epidemiology: Framework and Contingency Table Analysis

Lecture 8: Summary Measures

13.1 Categorical Data and the Multinomial Experiment

ST3241 Categorical Data Analysis I Multicategory Logit Models. Logit Models For Nominal Responses

Analysis of categorical data S4. Michael Hauptmann Netherlands Cancer Institute Amsterdam, The Netherlands

Unit 9: Inferences for Proportions and Count Data

NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION (SOLUTIONS) ST3241 Categorical Data Analysis. (Semester II: )

Review. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).

Unit 9: Inferences for Proportions and Count Data

Epidemiology Wonders of Biostatistics Chapter 13 - Effect Measures. John Koval

BIOS 625 Fall 2015 Homework Set 3 Solutions

WORKSHOP 3 Measuring Association

NATIONAL UNIVERSITY OF SINGAPORE EXAMINATION. ST3241 Categorical Data Analysis. (Semester II: ) April/May, 2011 Time Allowed : 2 Hours

Discrete Multivariate Statistics

ij i j m ij n ij m ij n i j Suppose we denote the row variable by X and the column variable by Y ; We can then re-write the above expression as

Frequency Distribution Cross-Tabulation

Chi-Square. Heibatollah Baghi, and Mastee Badii

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

Log-linear Models for Contingency Tables

STA 4504/5503 Sample Exam 1 Spring 2011 Categorical Data Analysis. 1. Indicate whether each of the following is true (T) or false (F).

Model Estimation Example

Lecture 14: Introduction to Poisson Regression

Modelling counts. Lecture 14: Introduction to Poisson Regression. Overview

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

ST3241 Categorical Data Analysis I Two-way Contingency Tables. 2 2 Tables, Relative Risks and Odds Ratios

Analysis of Categorical Data. Nick Jackson University of Southern California Department of Psychology 10/11/2013

STA 303 H1S / 1002 HS Winter 2011 Test March 7, ab 1cde 2abcde 2fghij 3

Some comments on Partitioning

Contingency Tables. Safety equipment in use Fatal Non-fatal Total. None 1, , ,128 Seat belt , ,878

Describing Stratified Multiple Responses for Sparse Data

CDA Chapter 3 part II

Basic Medical Statistics Course

Single-level Models for Binary Responses

n y π y (1 π) n y +ylogπ +(n y)log(1 π).

Suppose that we are concerned about the effects of smoking. How could we deal with this?

STAT 7030: Categorical Data Analysis

Logistic Regression: Regression with a Binary Dependent Variable

Binary Logistic Regression

Review of Statistics 101

Describing Contingency tables

DIAGNOSTICS FOR STRATIFIED CLINICAL TRIALS IN PROPORTIONAL ODDS MODELS

Simple logistic regression

Stat 642, Lecture notes for 04/12/05 96

Sociology 6Z03 Review II

Lecture 23. November 15, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.

Utilization of Addictions Services

IX. Complete Block Designs (CBD s)

Categorical data analysis Chapter 5

STAT 525 Fall Final exam. Tuesday December 14, 2010

2 Describing Contingency Tables

Good Confidence Intervals for Categorical Data Analyses. Alan Agresti

Contingency Tables. Contingency tables are used when we want to looking at two (or more) factors. Each factor might have two more or levels.

Department of Economics. Business Statistics. Chapter 12 Chi-square test of independence & Analysis of Variance ECON 509. Dr.

Relative Risk Calculations in R

Part IV Statistics in Epidemiology

(Where does Ch. 7 on comparing 2 means or 2 proportions fit into this?)

Review: what is a linear model. Y = β 0 + β 1 X 1 + β 2 X 2 + A model of the following form:

Statistics in medicine

The Logit Model: Estimation, Testing and Interpretation

Analysis of data in square contingency tables

Sections 3.4, 3.5. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis

STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression

Research Methodology: Tools

(c) Interpret the estimated effect of temperature on the odds of thermal distress.

Computational Systems Biology: Biology X

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL 2010 EXAMINATIONS STA 303 H1S / STA 1002 HS. Duration - 3 hours. Aids Allowed: Calculator

Categorical Data Analysis Chapter 3

A class of latent marginal models for capture-recapture data with continuous covariates

Lecture 25: Models for Matched Pairs

Model Based Statistics in Biology. Part V. The Generalized Linear Model. Chapter 18.1 Logistic Regression (Dose - Response)

Logistic regression: Miscellaneous topics

SPSS LAB FILE 1

Deciphering Math Notation. Billy Skorupski Associate Professor, School of Education

Hypothesis Testing, Power, Sample Size and Confidence Intervals (Part 2)

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data

HYPOTHESIS TESTING: THE CHI-SQUARE STATISTIC

Relate Attributes and Counts

PROC LOGISTIC: Traps for the unwary Peter L. Flom, Independent statistical consultant, New York, NY

SAS Analysis Examples Replication C8. * SAS Analysis Examples Replication for ASDA 2nd Edition * Berglund April 2017 * Chapter 8 ;

Chapter 2: Describing Contingency Tables - II

Sections 2.3, 2.4. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis 1 / 21

Asymptotic equivalence of paired Hotelling test and conditional logistic regression

Investigating Models with Two or Three Categories

Repeated ordinal measurements: a generalised estimating equation approach

Three-Way Tables (continued):

Chapter Seven: Multi-Sample Methods 1/52

Section IX. Introduction to Logistic Regression for binary outcomes. Poisson regression

Bios 6649: Clinical Trials - Statistical Design and Monitoring

Transcription:

Newsom PSY 50/60 Categorical Data Analysis, Fall 06 Three-Way Contingency Tables Three-way contingency tables involve three binary or categorical variables. I will stick mostly to the binary case to keep things simple, but we can have three-way tables with any number of categories with each variable. Of course, higher dimensions are also possible, but they are uncommon in practice and there are few commonly available statistical tests for them. Notation We described two-way contingency tables earlier as having I rows and J columns, forming an I J matrix. If a third variable is included, the three-way contingency table is described as I J K, where I is the number of rows, J is the number of columns, and K is the number of superordinate columns each level containing a matrix. The indexes i, j, and k are used to enumerate the rows or columns for each dimension. Each within-cell count or joint proportion has three subscripts now, n ijk and p ijk. There are two general types of marginal counts (or proportions), with three specific marginals for each type. The first type of marginal count combines one of the three dimensions. Using the + notation from twoway tables, the marginals combining counts across either I, J, or K dimensions are n +jk, n i+k, or n ij+, respectively. The second general type of marginal collapses two dimensions and is given as n i++, n +j+, or n ++k. The total count (sample size) is n +++ (or sometimes just n). The simplest case is a shown below to illustrate the notation. n n n + n n n + n + n + n ++ k = k = Note: marginals collapsing across levels of K are not shown. n n n + n n n + n + n + n ++ In the Azen and Walker text as well as elsewhere, the three variables are referred to by letter names X, Y, and Z. Although we do not need to assign explanatory (independent) and response (dependent) roles to the variables for all hypotheses that might be tested, it can be easier to put the table into more familiar terms by imagining one response variable Y (with its J number of levels) and two explanatory variables, X and Z (with I and K number of levels, respectively). Thinking in this way, we now have the conceptual equivalent of a ANOVA design, where Y is the (binary) dependent variable and X and Z are two independent variables. One common hypothesis of interest is to examine the X effect on Y at different levels of Z, which is the same question asked in the test of the two-way factorial ANOVA (e.g., is the X group difference on Y the same for Z and Z?). It is important to note that this is the real analogy to ANOVA for at least one possible or common hypothesis that can be tested with three-way contingency table. Azen and Walker use the three-way ANOVA as an analogy throughout the chapter for a different reason. Their intention is to use the three-way ANOVA analogy for the three-way contingency table to describe the process by which the researcher should proceed in analysis steps (e.g., examining the three-way table first, and then depending on the results pursuing two-dimensional or one-dimensional comparisons). So, be careful not to confuse their process analogy to ANOVA with the statistical test analogy to ANOVA. Partial Tables There are two tables above that make up the three-way table an X Y table within Z and an X Y table within Z. These are known as partial tables. Although partial tables can be constructed for levels of the X or levels the Y variable also, it is most common to set up three-way tables in this manner and discuss the X Y partial tables within levels (or strata) of Z. Marginals for the two-way partial tables can be expressed as conditional proportions, similar to the simple conditional proportion in the two-way case. The notation p ij k denotes one type of conditional proportion, which is the within-cell count relative to the Many authors use A, B, and C, but this is easily confused with the fourway table notation.

Newsom PSY 50/60 Categorical Data Analysis, Fall 06 number of cases within one level of Z, p ij k = n ijk/n ++k. Of course, we could also compute other conditional proportions, such as p i jk or p jk i and so on. If the two I J contingency tables are considered separately, we could compute odds ratios as before, adding to the notation to indicate that the odds ratio is specific to one level of the Z variable. Following the notation in your text, we use X, Y, and Z and the Greek letter θ ( theta ) for the odds ratio. θ XY Z odds n / n n n p = = = = odds n / n n n p X = YZ X = YZ X = 0 YZ X = 0 YZ If we were to calculate these odds ratios for each level of Z and then average them taking into account the different number of cases in each stratum of Z, we get a common odds ratio called the Mantel- Haenszel (MH) odds ratio. θ MH K ( n ) kn k / n k = ++ k = K ( n / ) k = kn k n++ k The Mantel-Haenszel is the common odds ratio and assumes that the odds ratio does not differ across the levels of Z, so it may not be of interest itself in many circumstances. It is used, however, in test of whether the odds ratio does differ across levels of Z, which is a test of homogenous association. Breslow-Day Test Perhaps the most useful test is a test of the hypothesis that the X-Y association differs across levels of Z, known as the Brelow-Day test (Breslow & Day, 980). Later modified by Tarone (985), it is sometimes referred to as the Breslow-Day-Tarone test. If there is no difference across levels of Z, the X-Y relationship is homogeneous. Another way to state this hypothesis test is as an evaluation of whether the partial table odds ratios are equal across the levels of Z, that θxy Z = θ XY Z. χ BD K O = E( Y θ ) ( θ ) Y k k MH V Y k = k MH In the numerator, we have an alteration of the usual chi-squared statistics in which of the expected values are subtracted from the observed values, except that the observed and the expected values are for partial tables for each level of Z (indexed by subscript k for each stratum of Z), with the expected E Y θ, and values taking into account the Mantel-Haenszel odds ratio, θ MH. The expected values, ( k MH ) variance estimate, ( ) k MH V Y θ, in the denominator are considerably more complicated than for the twoway contingency table. The Breslow-Day test is only for k tables, so X and Y must be binary. The χ BD can be evaluated against the chi-squared distribution with df = K - if the Tarone modification is used. Significance implies θxy Z θ XY Z but also that θ XZ Y θ XZ Y and θ YZ X θ YZ X for the case. If there are many strata for Z (i.e., K is large) and data are sparse, then an exponential weighted average of the common odds ratio (Woolf, 955) is more efficient. Since you are not afraid to look, here is the computations for these two quantities (Hosmer & Lemeshow, 000): E Y n n n n n n n n n n n n n n n n n n n n { } { ( ) ( ) ( ) ( ) } ( ) ( )( ) ( k θmh ) = θmh ( k + k ) + ( k + k ) + ( k + k ) ( k + k ) ± θmh k + k + k + k + k + k k + k { 4 θmh θmh k + k k + k } ( θmh ) = + + + E( Y k θmh ) ( nk + n k) E( Y k θmh ) ( n k + n k ) E( Y k θmh ) ( n k + nk ) ( n k + n k ) + E( Y k θmh ) and V E( Y k θmh )

Newsom 3 PSY 50/60 Categorical Data Analysis, Fall 06 I suspect that most social scientist are interested in testing the hypothesis for which the Breslow-Day test is designed, that the odds ratio at each level of Z is equal. If this is the question of interest, here is an overview of the process you might use: sig Test partial tables within levels of Z sig ns Interpret, test cell differences with levels of Z if desired Test marginals χ BD ns Collapse across levels of Z and test x (i.e., test θ MH ) sig ns Interpret, test cell differences (collapsing across level of Z) if desired Test marginals Cochran-Mantel-Haenszel A special case of the homogeneity of association is the Cochran-Mantel-Haenszel (CMH) test that the odds ratios for all partial tables are equal to, which is referred to as a test of conditional independence. For the case, it is a test of the null hypothesis thatθxy Z = θ XY Z =, but also whether θxz Y = θ XZ Y = and θ YZ X = θ YZ X =. If the test is significant, it may be due to any one of the partial tables with an odds ratio above or below.0. The computation of the CMH follows the general Pearson chisquared equation and can be stated as a likelihood ratio, G, as well. The computation of the expected values is considerably simpler than for the Breslow-Day test. χ CMH I J K ( O ) ijk Eijk = or E i= j= k= ijk G CMH I J K E ijk = Oijk ln i= j= k= O ijk n n n where Eijk = n i++ + j+ ++ k +++ Both CMH tests are evaluated against the chi-squared distribution with df = IJK I J K +. Logistic Regression Logistic regression can be used to test the same hypotheses as those investigated with the Mantel- Haenszel odds ratio and the Breslow-Day tests, and the logistic regression approach is probably the more commonly employed. As you may have guessed, the corresponding hypothesis in logistic regression is investigated by testing the X*Z interaction in predicting Y. Such tests are not exactly equal even though they address the same hypothesis (see Hosmer & Lemeshow, 000, for a discussion). The association tests discussed above do not require assigning explanatory (independent) and response (dependent) roles to the variables per se. Logistic regression models assume that one of the three variables is a response (usually Y in our examples above) and the other two (X and Z in our examples above) are explanatory. The maximum likelihood estimation as well as the more general framework of the logistic regression modeling set the logistic approach apart somewhat from the Mantel-Haenszel odds ratio and Breslow-Day tests. We will return to how to test these hypotheses with logistic regression in a later section of the course. Other Three-Way Table Hypotheses The Breslow-Day and the Cochran-Mantel-Haenszel are the most commonly employed hypothesis tests, but, with a three-way contingency table, there are many possibilities. With the properly derived expected frequencies, a variety of tests can investigate whether variables X, Y, and Z are independent from one another. Wickens (989) discusses association tests can be formulated with loglinear models. Using bracket notation, different tests can be distinguished by which variables are independent from one another. For example, [X][Y][X] indicates that each of the three variables are independent from one

Newsom 4 PSY 50/60 Categorical Data Analysis, Fall 06 another, whereas [XY][Z] indicates that Z is unrelated to X and Y. A nodal diagram can also be used to depict these relations, with the diagram below reflecting the independence implied by [XY][Z]. X Z Loglinear modeling is a general and very flexible framework which will be introduced in the next section of the course. Software Examples To illustrate statistical tests of three-way contingency tables, I used some early presidential data from a Roper/CNN poll of 94 respondents conducted in 05, pitting Clinton and Trump against one another. 3 The Breslow-Day test examines whether independents were more likely to favor Clinton or Trump, comparing this association across gender. SPSS Note: SPSS results from previous version of the handout have been replaced. The earlier version used sampling weights, which were not used in the R and SAS runs. The corrected output is printed below. output comment text="males ONLY". temporary. select if sex eq. crosstabs /tables=ind by response /statistics=none /cells=count row. output comment text="females ONLY". temporary. select if sex eq. crosstabs /tables=ind by response /statistics=none /cells=count row. Y crosstabs /tables=ind by response by sex /statistics=cmh /cells=none. MALES ONLY ind * response GENERAL ELECTION: CLINTON VS. TRUMP Crosstabulation response GENERAL ELECTION: CLINTON VS. TRUMP CLINTON TRUMP Total ind Total.00 party affiliate.00 independent 00 4.8% 06 45.3% 06 43.6% 39 58.% 8 54.7% 67 56.4% 39 34 473 3 Data source: http://www.ropercenter.cornell.edu/cfide/cf/action/catalog/abstract.cfm?label=&keyword=usorccnn05+008&fromdate=&todate=&organiz ation=any&type=&keywordoptions=&start=&id=&exclude=&excludeoptions=&topic=any&sortby=desc&archno=usorccnn05-008&abstract=abstract&x=3&y= (access requires free registration)

Newsom 5 PSY 50/60 Categorical Data Analysis, Fall 06 FEMALES ONLY ind * response GENERAL ELECTION: CLINTON VS. TRUMP Crosstabulation response GENERAL ELECTION: CLINTON VS. TRUMP CLINTON TRUMP Total ind.00 party affiliate 57 40 97 5.9% 47.%.00 independent 89 77 66 53.6% 46.4% Total 46 7 463 53.% 46.9% Tests of Homogeneity of the Odds Ratio Chi-Squared df Asymptotic Significance (-sided) Breslow-Day.69.68 Tarone's.69.68 Tests of Conditional Independence Chi-Squared df Asymptotic Significance (-sided) Cochran's.430.5 Mantel-Haenszel.346.556 Under the conditional independence assumption, Cochran's statistic is asymptotically distributed as a df chi-squared distribution, only if the number of strata is fixed, while the Mantel-Haenszel statistic is always asymptotically distributed as a df chi-squared distribution. Note that the continuity correction is removed from the Mantel-Haenszel statistic when the sum of the differences between the observed and the expected is 0. Mantel-Haenszel Common Odds Ratio Estimate Estimate ln(estimate) Standardized Error of ln(estimate) Asymptotic Significance (-sided).96 -.088.34.5 Asymptotic 95% Confidence Interval Common Odds Ratio Lower Bound Upper Bound.704.9 ln(common Odds Ratio) Lower Bound -.35 Upper Bound.75 The Mantel-Haenszel common odds ratio estimate is asymptotically normally distributed under the common odds ratio of.000 assumption. So is the natural log of the estimate. R > mydata <- Subset(sex=="MALE") > #this lessr BarChart function produces a chi-square test by default > BarChart(ind, response, xlab="response: Trump=0, Clinton=",data=mydata) >>> Suggest: Plot(ind,response) ind:

Newsom 6 PSY 50/60 Categorical Data Analysis, Fall 06 - by levels of - response: GENERAL ELECTION: CLINTON VS. TRUMP Joint and Marginal Frequencies ------------------------------ ind response party affiliate independent Sum CLINTON 00 06 06 TRUMP 39 8 67 Sum 39 34 473 > mydata <- Subset(sex=="FEMALE") > #this lessr BarChart function prosduces a chi-square test by default > BarChart(ind, response, xlab="response: Trump=0, Clinton=",data=mydata) >>> Suggest: Plot(ind,response) ind: - by levels of - response: GENERAL ELECTION: CLINTON VS. TRUMP Joint and Marginal Frequencies ------------------------------ ind response party affiliate independent Sum CLINTON 57 89 46 TRUMP 40 77 7 Sum 97 66 463 > counts <-array( + c(00,39,06,8,57,40,89,77), + dim=c(,, ), + dimnames=list(sex=c("male", "female"), + ind =c("party aff", "independent"), + response =c("clinton", "trump")) + ) > > # counts <- matrix(c(00,39,06,8,57,40,89,77),ncol=,byrow=true) > #install.packages('desctools') #run on first use > library(desctools) > BreslowDayTest(counts,correct=FALSE) Breslow-Day test on Homogeneity of Odds Ratios data: counts X-squared = 0.694, df =, p-value = 0.6809 > mantelhaen.test(counts) Mantel-Haenszel chi-squared test with continuity correction data: counts Mantel-Haenszel X-squared = 0.34599, df =, p-value = 0.5564 alternative hypothesis: true common odds ratio is not equal to 95 percent confidence interval: 0.7040049.9537 sample estimates: common odds ratio 0.957775 SAS proc import datafile="c:\jason\spsswin\cdaclass\cnnpoll.sav" out=one dbms = sav replace; run; proc freq data=one; tables sex*ind*response /cmh ; title 'Test of 3-Way Table'; run;

Newsom 7 PSY 50/60 Categorical Data Analysis, Fall 06 Test of 3-Way Table 4:55 Saturday, October 5, 06 3 The FREQ Procedure Table of ind by response Controlling for sex=male ind(ind) response(general ELECTION: CLINTON VS. TRUMP) Frequency Percent Row Pct Col Pct CLINTON TRUMP Total party affiliate 00 39 39.4 9.39 50.53 4.84 58.6 48.54 5.06 independent 06 8 34.4 7.06 49.47 45.30 54.70 5.46 47.94 Total 06 67 473 43.55 56.45 00.00 Frequency Missing = 7 Table of ind by response Controlling for sex=female ind(ind) response(general ELECTION: CLINTON VS. TRUMP) Frequency Percent Row Pct Col Pct CLINTON TRUMP Total party affiliate 57 40 97 33.9 30.4 64.5 5.86 47.4 63.8 64.5 independent 89 77 66 9. 6.63 35.85 53.6 46.39 36.8 35.48 Total 46 7 463 53.3 46.87 00.00 Frequency Missing =

Newsom 8 PSY 50/60 Categorical Data Analysis, Fall 06 The FREQ Procedure Summary Statistics for ind by response Controlling for sex Cochran-Mantel-Haenszel Statistics (Based on Table Scores) Statistic Alternative Hypothesis DF Value Prob ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ Nonzero Correlation 0.493 0.53 Row Mean Scores Differ 0.493 0.53 3 General Association 0.493 0.53 Common Odds Ratio and Relative Risks Statistic Method Value 95% Confidence Limits ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ Odds Ratio Mantel-Haenszel 0.958 0.7040.93 Logit 0.958 0.7040.93 Relative Risk (Column ) Mantel-Haenszel 0.9558 0.835.0939 Logit 0.9588 0.8383.0967 Relative Risk (Column ) Mantel-Haenszel.049 0.997.85 Logit.045 0.93.843 Breslow-Day Test for Homogeneity of the Odds Ratios ƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒƒ Chi-Square 0.69 DF Pr > ChiSq 0.6809 Effective Sample Size = 936 Frequency Missing = 8 Sample Write-Up A Breslow-Day test was used to investigate whether the association between identification as an independent (identified with a major party vs. independent) and candidate preference (Clinton vs. Trump) was the same for men and women. Among men, 4.84% of respondents identifying with a major party favored Clinton, whereas 45.30% of respondents identifying as independent favored Clinton. Among women, 5.86% of respondents identifying with a major party favored Clinton, whereas 53.6% of respondents identifying with a major party favored Clinton. The Breslow-Day test indicated that the association between party identification and the candidate preference did not differ significantly between χ =.69, ns. The common odds ratio indicated no overall difference in candidate men and women, ( ) BD preference by party identification, OR MH =.96, 95% CI =.704,.9. Follow-up analyses [not shown above] indicated that there was no association between independent identification and the candidate favored within gender, Pearson χ () =.575, ns, for men, and Pearson χ () =.4, ns, for women. 4 References and Further Readings Breslow, N. E., & Day, N. E. (980). Statistical methods in cancer research. Vol.. The analysis of case-control studies (Vol., No. 3). Distributed for IARC by WHO, Geneva, Switzerland. Tarone, R. E. (985). On heterogeneity tests based on efficient scores. Biometrika, 7(), 9-95. Wickens, T. D. (989). Multiway contingency tables analysis for the social sciences. New York: Erlbaum. Woolf, B. (955). On estimating the relation between blood group and disease. Annals of Human Genetics, 9(4), 5-53. 4 It might be desirable to go one step further and test the marginals to determine whether Clinton or Trump was favored overall.