CHAPTER 2. Types of Effect size indices: An Overview of the Literature

Similar documents
Textbook Examples of. SPSS Procedure

Data Analysis as a Decision Making Process

Glossary. The ISI glossary of statistical terms provides definitions in a number of different languages:

Review of the General Linear Model

MANOVA is an extension of the univariate ANOVA as it involves more than one Dependent Variable (DV). The following are assumptions for using MANOVA:

Test Yourself! Methodological and Statistical Requirements for M.Sc. Early Childhood Research

Group comparison test for independent samples

psyc3010 lecture 2 factorial between-ps ANOVA I: omnibus tests

Principal component analysis

Repeated-Measures ANOVA in SPSS Correct data formatting for a repeated-measures ANOVA in SPSS involves having a single line of data for each

MANOVA MANOVA,$/,,# ANOVA ##$%'*!# 1. $!;' *$,$!;' (''

THE PRINCIPLES AND PRACTICE OF STATISTICS IN BIOLOGICAL RESEARCH. Robert R. SOKAL and F. James ROHLF. State University of New York at Stony Brook

ScienceDirect. Who s afraid of the effect size?

Chapter 7, continued: MANOVA

Experimental Design and Data Analysis for Biologists

Turning a research question into a statistical question.

Inferences About the Difference Between Two Means

Y (Nominal/Categorical) 1. Metric (interval/ratio) data for 2+ IVs, and categorical (nominal) data for a single DV

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Statistics Introductory Correlation

Chapter Fifteen. Frequency Distribution, Cross-Tabulation, and Hypothesis Testing

DETAILED CONTENTS PART I INTRODUCTION AND DESCRIPTIVE STATISTICS. 1. Introduction to Statistics

Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model

Multivariate analysis of variance and covariance

Finding Relationships Among Variables

Bivariate Relationships Between Variables

Chapter 16: Correlation

Logistic Regression: Regression with a Binary Dependent Variable

ANCOVA. Lecture 9 Andrew Ainsworth

STAT 501 Assignment 2 NAME Spring Chapter 5, and Sections in Johnson & Wichern.

INTRODUCTION TO DESIGN AND ANALYSIS OF EXPERIMENTS

Effect Size Statistics:

POWER AND TYPE I ERROR RATE COMPARISON OF MULTIVARIATE ANALYSIS OF VARIANCE

BIOMETRICS INFORMATION

Multivariate Statistical Analysis

Prepared by: Prof. Dr Bahaman Abu Samah Department of Professional Development and Continuing Education Faculty of Educational Studies Universiti

Small n, σ known or unknown, underlying nongaussian

DESIGNING EXPERIMENTS AND ANALYZING DATA A Model Comparison Perspective

Overview. Overview. Overview. Specific Examples. General Examples. Bivariate Regression & Correlation

HANDBOOK OF APPLICABLE MATHEMATICS

Categorical Predictor Variables

Readings Howitt & Cramer (2014) Overview

Neuendorf MANOVA /MANCOVA. Model: X1 (Factor A) X2 (Factor B) X1 x X2 (Interaction) Y4. Like ANOVA/ANCOVA:

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.

Correlation & Regression Chapter 5

COMPARING THREE EFFECT SIZES FOR LATENT CLASS ANALYSIS. Elvalicia A. Granado, B.S., M.S. Dissertation Prepared for the Degree of DOCTOR OF PHILOSOPHY

Investigating Models with Two or Three Categories

10/31/2012. One-Way ANOVA F-test

Readings Howitt & Cramer (2014)

Lecture 5: Hypothesis tests for more than one sample

Multiple Linear Regression for the Salary Data

Analysis of Covariance (ANCOVA) with Two Groups

T. Mark Beasley One-Way Repeated Measures ANOVA handout

Multivariate Statistics Summary and Comparison of Techniques. Multivariate Techniques

Practical Meta-Analysis -- Lipsey & Wilson

Review of Statistics 101

Repeated Measures ANOVA Multivariate ANOVA and Their Relationship to Linear Mixed Models

Applied Multivariate Statistical Analysis Richard Johnson Dean Wichern Sixth Edition

Analysis of Covariance (ANCOVA) Lecture Notes

Multivariate Statistical Analysis

This gives us an upper and lower bound that capture our population mean.

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables.

Introduction to Statistical Analysis using IBM SPSS Statistics (v24)

ECON1310 Quantitative Economic and Business Analysis A

Multivariate Regression (Chapter 10)

An Examination of the Robustness of the Empirical Bayes and Other Approaches. for Testing Main and Interaction Effects in Repeated Measures Designs

Psychology 205: Research Methods in Psychology

6348 Final, Fall 14. Closed book, closed notes, no electronic devices. Points (out of 200) in parentheses.

z = β βσβ Statistical Analysis of MV Data Example : µ=0 (Σ known) consider Y = β X~ N 1 (β µ, β Σβ) test statistic for H 0β is

Neuendorf MANOVA /MANCOVA. Model: MAIN EFFECTS: X1 (Factor A) X2 (Factor B) INTERACTIONS : X1 x X2 (A x B Interaction) Y4. Like ANOVA/ANCOVA:

Introducing Generalized Linear Models: Logistic Regression

Two-Sample Inferential Statistics

Contents. Acknowledgments. xix

Deciphering Math Notation. Billy Skorupski Associate Professor, School of Education

Data Analyses in Multivariate Regression Chii-Dean Joey Lin, SDSU, San Diego, CA

2. Sample representativeness. That means some type of probability/random sampling.

Chi-Square. Heibatollah Baghi, and Mastee Badii

One-way between-subjects ANOVA. Comparing three or more independent means

26:010:557 / 26:620:557 Social Science Research Methods

Ron Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)

COMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS

Multivariate Linear Regression Models

Glossary for the Triola Statistics Series

Multivariate Linear Models

CHI SQUARE ANALYSIS 8/18/2011 HYPOTHESIS TESTS SO FAR PARAMETRIC VS. NON-PARAMETRIC

104 Business Research Methods - MCQs

Introduction to inferential statistics. Alissa Melinger IGK summer school 2006 Edinburgh

SOME ASPECTS OF MULTIVARIATE BEHRENS-FISHER PROBLEM

M A N O V A. Multivariate ANOVA. Data

Correlation 1. December 4, HMS, 2017, v1.1

Multivariate Statistical Analysis

Neuendorf MANOVA /MANCOVA. Model: X1 (Factor A) X2 (Factor B) X1 x X2 (Interaction) Y4. Like ANOVA/ANCOVA:

ESP 178 Applied Research Methods. 2/23: Quantitative Analysis

Explanation of R 2, and Other Stories

Parametric versus Nonparametric Statistics-when to use them and which is more powerful? Dr Mahmoud Alhussami

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

Analysis of Longitudinal Data: Comparison Between PROC GLM and PROC MIXED. Maribeth Johnson Medical College of Georgia Augusta, GA

Discrete Multivariate Statistics

An Introduction to Multivariate Statistical Analysis

Transcription:

CHAPTER Types of Effect size indices: An Overview of the Literature There are different types of effect size indices as a result of their different interpretations. Huberty (00) names three different types: Group differences Relationships Group overlapping Before we provide an overview of these types, we will pause a moment to look at the measurement scales and the assumptions, each of which produce different indices..1 Measurement scales and assumptions The different effect size indices do not only go hand-in-hand with the statistical analyses or objectives to which one strives, but they are also dependent on the type of measurement. For continuous measurements (i.e., which can take on any value within a given interval, e.g., a person s height or IQ), sample means are used to compare different populations, while Pearson s correlation coefficient is used to indicate relationships between the different measurements that can be made in the sample. If we are working with categorical data (i.e., data whose measurements are divided into categories, e.g., language groups or gender of people), then proportions can be used to make comparisons between populations and measures obtained from two-way frequency tables can be used to determine relationships. In the discussion that follows we will distinguish between effect size indices based on the following measurement scales: Interval/ratio (continuous measurements, for example, IQ, temperature, blood pressure). 1

Ordinal (categorical measurement with ordered categories, e.g., employment position in a company or academic qualification of an individual). Nominal (categorical with no ordering, e.g., racial group or church membership of an individual). Dichotomous (nominal with only two categories, e.g., someone s HIVstatus as positive/negative or gender as Male/Female). In addition to the type of scale, the effect size indices also depend on the situation or assumptions underlying the populations or models being used. The situation which one finds oneself in is determined by: The number of populations or groups, If one is studying the entire population (for example, if one wants to only analyse the data from the respondents of a questionnaire and there is no interest in generalizing the results further) or if one is studying only a random sample, and If univariate or multivariate effect sizes must be calculated. The assumptions that one might consider include: The assumption of homogeneity of variances (i.e., the equality of standard deviations) of interval scaled measurements between different populations when effect size indices are based on means or contrasts. Particular attention will be paid to this in Chapters 4 and 6. Underlying models are usually assumed to consist of fixed factor effects or random factor effect or a combination of the two. Different effect size indices will be considered for each of these assumptions. Chapter 6 will provide further details on these topics. When confidence intervals for effect size indices are calculated, the assumption of normality of the interval scaled measurements can be important.

Independent groups (e.g., control and experimental groups) and dependent groups (e.g., a group before a treatment, and the same group after a treatment) each have their own specific effect size indices (see Chapter 4). Using covariates for correction (in analysis of covariance or ANCOVA) introduces its own modifications to effect size indices (see paragraph 6.4). When one is concerned with the reliability of a variable, then an attenuation correction for effect sizes can also be introduced (see paragraph 5.1.4).. Effect size indices for differences in groups In a situation where the means of two groups must be compared, Cohen (1969, 1977, 1988) proposed the use of the standardized difference of the means (d). In the 1970 s and 1980 s there was a great deal of discussion as to how the µ µ difference in the means ( ) should be standardized. Cohen suggested that 1 one should simply divide it by the pooled standard deviation ( σ ) of the two populations, since this assumes homogeneity of variances. Glass (1976) proposed that only the standard deviation of the control group should be used in the denominator. Hedges (1981), in contrast, modified d so that it is an unbiased estimator for random samples. For dichotomous variables Cohen (1969, 1977, 1988) compares two proportions with the index h, which is the difference between the arc-sine transformations of the proportions. For differences between two means where heterogeneity of variances is assumed, Steyn (000) proposes an adjusted index along with a conservative estimator a indices could also be applied when comparing proportions (Steyn 1999). ˆ. These a When more than two groups are compared, Cohen (1969, 1977, 1988) proposes an index f which represents the variance of the group means relative to the 3

pooled variance σ. He also recommends using δ ( µ µ ) index, where = σ as an max min / µ and µ are the largest and smallest population means max min respectively. A discussion concerning cases where the effect sizes of contrasts are important can be found in both Olejnik & Algina (000) and Kline (004a: Chapter 6). The former also contemplates cases where means are adjusted for covariates in an ANCOVA design, as for contrasts within individuals..3 Effect size indices based on Relationships The familiar Pearson product moment correlation coefficient ( r ) is a measure of the linear relationship between two continuous or interval scaled variables. As such, r can be used as an effect size index (Cohen, 1977, Chapter 3) to express the strength of the linear relationship. In cases where one requires the strength of the relationship between nominal scaled variables, Cohen (1977, Chapter 7) provides an effect size, based on the Chi-squared test statistic, called w. A special case of this effect size is called the φ -coefficient and it is used for dichotomous variables. The correlation between a grouping indicator variable (x), which only takes on two values (say 1 and ) and is used to distinguish between two groups, and the interval scale variable (y) denotes the relationship between y and the group membership (this is also known as a point bi-serial correlation, r ). This is another sort of interpretation of effect size, because the more the group means change relative to their standard deviations, the larger the value r becomes. Cohen (1969, 1977, 1988) also express r in terms of d, the standardized 4

differences in means. The squared value, r, in turn, represents the proportion of variation of y attributed to its population membership, which is a special case of Pearson s eta-squared ( η ). Eta-squared can serve as an effect size whenever more than two population means are compared, in which case it represents the ratio of the between-group (or model) sum of squares and the total sum of squares of a one-way ANOVA model. Hays (1963) made use of the omega-squared index ( ω ) which was recently redefined (Olejnik & Algina, 000). When the grouping variable can be considered to be a random factor, the inter-class correlation coefficient of ( ˆρ I ) is the recommended measure of this relationship and the square of this quantity, ˆρ I, serves as an estimator for η (Olejnik & Algina, 000). Effect size indices for situations where two or more grouping variables are used in various combinations of fixed and random factor effects are provided by Olejnik & Algina (000) as well as Kline (004a: Chapter 7)..4 Effect size indices based on Group overlapping Tilton (1937) suggested that the comparison between two means should be, as far as is possible, enhanced by making use of a measure of overlapping such as the percentage of area under the distribution common to both distributions. Cohen (1977:3) defines various overlapping indices when two normally distributed populations with equal standard deviations are used and also provides the relationship of this quantity to the standardized difference in means (d). This index could, for example, be used to help interpret the value d: a small degree of overlapping suggests large values of d (for further reading see Huberty & Lowman, 000, as well as Kline, 004a: Figure 4.1). 5

In this way group overlapping can be related to forecast discriminant analysis. Huberty & Holmes (1983) apply it in the univariate two-sample case, where they relate the probability of correct classification (percentage of overlapping) to the effect size index d. A more sensible better-than-chance-hit-rate index (I) is suggested by Huberty (1994), while Huberty & Lowman (000) made use of this idea in a multivariate context..5 Multivariate Effect size indices In the case where a dependent continuous variable s relationship with more than one independent variable has to be determined, (i.e., with multiple linear regression, abbreviated as MLR) the coefficient of multiple correlation (R) is the appropriate measure. In an attempt to relate MLR to ANOVA, Cohen (1977:410) introduced the f index which describes the explained variation ( ) the unexplained variation ( 1 R ) partial. He goes further to define R which in turn is a function of the semi-partial defined as the increase in set, A (see also Smithson, 001). R relative to f in terms of the R. The semi-partial R is R when a set of variables, B, is added to the existing In multivariate analysis of variance (MANOVA), Wilk s Λ is a well known statistic. The generalized η = 1 Λ is therefore a logical effect size index. Similarly, we could also use 1 s τ = 1 Λ, with s = min (p,q), where p is the number of variables and q is the hypothesized degrees of freedom (equal to k-1 if k populations are compared.) (Rencher, 1995:19). Other indices are ζ and ξ, which are based on the Hotelling-Lawley Trace statistic and Pilai s statistic (Rencher, 1995; Huberty, 1994; Olejnik & Algina, 6

000). Modifications were also introduced to estimate the generalized η (see Olejnik & Algina, 000:73 and Steyn & Ellis, 009 ). In the case where only two populations are compared with one another, the Mahalanobis distance, D, is used as an effect size index (Kline, 004b: 3-4 and M Steyn & Ellis, 009). This is a generalization of Cohen s d for more than one variable. In the case where one has more than two populations, D can also be defined for contrasts and expressed in terms of Wilk s Λ for the contrasts. (See Kline, 004b: 10). M The better-than-chance-hit-rate index by Huberty (1994) and Huberty & Lowman (000) is, by its very nature, a multivariate index and these authors show how it can be used as an index for more than two groups..6 Concluding remarks Huberty (00) provides the details and history of many other effect size indices, and goes on to summarize it graphically in a diagram of these indices along with dates (his Figure 1). The discussion up to this point has focused only on the most important indices. In the following chapters an attempt will be made to provide the full details of each of the effect sizes along with the methods used to calculate them and illustrative examples. When calculations are difficult to do by hand, an effort will be made to supply any available computer programs/packages/spread sheets which can be used to perform these calculations. 7