A note on structured means analysis for a single group. André Beauducel 1. October 3 rd, 2015

Similar documents
The Impact of Varying the Number of Measurement Invariance Constraints on. the Assessment of Between-Group Differences of Latent Means.

Retained-Components Factor Transformation: Factor Loadings and Factor Score Predictors in the Column Space of Retained Components

A Study of Statistical Power and Type I Errors in Testing a Factor Analytic. Model for Group Differences in Regression Intercepts

Chapter 14: Repeated-measures designs

Moving beyond testing means: Using MACS modeling to test group differences in variances and covariances

AN INVESTIGATION OF THE ALIGNMENT METHOD FOR DETECTING MEASUREMENT NON- INVARIANCE ACROSS MANY GROUPS WITH DICHOTOMOUS INDICATORS

ADVANCED C. MEASUREMENT INVARIANCE SEM REX B KLINE CONCORDIA

Confirmatory Factor Analysis. Psych 818 DeShon

Structural Equation Modeling and Confirmatory Factor Analysis. Types of Variables

Can Variances of Latent Variables be Scaled in Such a Way That They Correspond to Eigenvalues?

STAT 730 Chapter 9: Factor analysis

Measurement Invariance (MI) in CFA and Differential Item Functioning (DIF) in IRT/IFA

Testing Structural Equation Models: The Effect of Kurtosis

Chapter 4: Factor Analysis

General structural model Part 1: Covariance structure and identification. Psychology 588: Covariance structure and factor models

MICHAEL SCHREINER and KARL SCHWEIZER

A Threshold-Free Approach to the Study of the Structure of Binary Data

Factor Analysis. Qian-Li Xue

Model Invariance Testing Under Different Levels of Invariance. W. Holmes Finch Brian F. French

Longitudinal Invariance CFA (using MLR) Example in Mplus v. 7.4 (N = 151; 6 items over 3 occasions)

Introduction to Confirmatory Factor Analysis

Consequences of measurement error. Psychology 588: Covariance structure and factor models

sempower Manual Morten Moshagen

Measurement Invariance Testing with Many Groups: A Comparison of Five Approaches (Online Supplements)

Supplementary materials for: Exploratory Structural Equation Modeling

Introduction to Structural Equation Modeling

Exploratory Factor Analysis: dimensionality and factor scores. Psychology 588: Covariance structure and factor models

Multivariate and Multivariable Regression. Stella Babalola Johns Hopkins University

STRUCTURAL EQUATION MODELING. Khaled Bedair Statistics Department Virginia Tech LISA, Summer 2013

Introduction to Structural Equation Modeling Dominique Zephyr Applied Statistics Lab

Factor Analysis & Structural Equation Models. CS185 Human Computer Interaction

Intelligence 37 (2009) Contents lists available at ScienceDirect. Intelligence

ABSTRACT. Phillip Edward Gagné. priori information about population membership. There have, however, been

Nesting and Equivalence Testing

Reliability Estimates for Three Factor Score Estimators

Introduction to Factor Analysis

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

Running head: PERMUTATION INVARIANCE 1. Differential Item Functioning in Multiple-Group Confirmatory Factor Analysis

Factor Analysis: An Introduction. What is Factor Analysis? 100+ years of Factor Analysis FACTOR ANALYSIS AN INTRODUCTION NILAM RAM

FIT CRITERIA PERFORMANCE AND PARAMETER ESTIMATE BIAS IN LATENT GROWTH MODELS WITH SMALL SAMPLES

Comparing Change Scores with Lagged Dependent Variables in Models of the Effects of Parents Actions to Modify Children's Problem Behavior

2/26/2017. This is similar to canonical correlation in some ways. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2

Terrence D. Jorgensen*, Alexander M. Schoemann, Brent McPherson, Mijke Rhemtulla, Wei Wu, & Todd D. Little

Conventional And Robust Paired And Independent-Samples t Tests: Type I Error And Power Rates

Module 3. Latent Variable Statistical Models. y 1 y2

Factor Analysis. Robert L. Wolpert Department of Statistical Science Duke University, Durham, NC, USA

Introduction to Factor Analysis

MATH 644: Regression Analysis Methods

Problems with parallel analysis in data sets with oblique simple structure

LECTURE 4 PRINCIPAL COMPONENTS ANALYSIS / EXPLORATORY FACTOR ANALYSIS

Multiple Group CFA Invariance Example (data from Brown Chapter 7) using MLR Mplus 7.4: Major Depression Criteria across Men and Women (n = 345 each)

Evaluating Small Sample Approaches for Model Test Statistics in Structural Equation Modeling

Confirmatory Factor Analysis and Structural Equation Modeling Group Differences: Measurement Invariance.

Congeneric, parallel, and tau-equivalent measures. 6. Measurement Models

Specifying Latent Curve and Other Growth Models Using Mplus. (Revised )

The Sensitivity of Confirmatory Factor Analytic Fit Indices to. Violations of Factorial Invariance across Latent Classes: A Simulation.

Recovery of weak factor loadings in confirmatory factor analysis under conditions of model misspecification

A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data

Appendix A: The time series behavior of employment growth

Chapter 8. Models with Structural and Measurement Components. Overview. Characteristics of SR models. Analysis of SR models. Estimation of SR models

Investigating Population Heterogeneity With Factor Mixture Models

BIOL 458 BIOMETRY Lab 8 - Nested and Repeated Measures ANOVA

An Efficient State Space Approach to Estimate Univariate and Multivariate Multilevel Regression Models

Chapter 5. Introduction to Path Analysis. Overview. Correlation and causation. Specification of path models. Types of path models

M M Cross-Over Designs

ABSTRACT. Between-Subjects Design under Variance. Heterogeneity and Nonnormality. Evaluation

An Introduction to Mplus and Path Analysis

Exploratory Factor Analysis and Canonical Correlation

General linear models. One and Two-way ANOVA in SPSS Repeated measures ANOVA Multiple linear regression

A Cautionary Note on the Use of LISREL s Automatic Start Values in Confirmatory Factor Analysis Studies R. L. Brown University of Wisconsin

Statistical Analysis for QBIC Genetics Adapted by Ellen G. Dow 2017

Confirmatory Factor Analysis: Model comparison, respecification, and more. Psychology 588: Covariance structure and factor models

Moderation 調節 = 交互作用

Robust Means Modeling vs Traditional Robust Tests 1

An Introduction to Path Analysis

Robustness of factor analysis in analysis of data with discrete variables

Measuring Market Orientation: Are There Differences Between Business Marketers and Consumer Marketers?

Lecture 2. The Simple Linear Regression Model: Matrix Approach

TAMS39 Lecture 10 Principal Component Analysis Factor Analysis

26:010:557 / 26:620:557 Social Science Research Methods

COMPARISON OF FIVE TESTS FOR THE COMMON MEAN OF SEVERAL MULTIVARIATE NORMAL POPULATIONS

Inference using structural equations with latent variables

Appendix to The Life-Cycle and the Business-Cycle of Wage Risk - Cross-Country Comparisons

Re-Visiting Exploratory Methods for Construct Comparability: Is There Something to be Gained From the Ways of Old?

2/26/2017. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2

Introduction to Maximum Likelihood Estimation

Reliability-Constrained Latent Structure Models

UNIVERSITY OF CALGARY. The Influence of Model Components and Misspecification Type on the Performance of the

AN INTRODUCTION TO STRUCTURAL EQUATION MODELING WITH AN APPLICATION TO THE BLOGOSPHERE

A Rejoinder to Mackintosh and some Remarks on the. Concept of General Intelligence

1 A factor can be considered to be an underlying latent variable: (a) on which people differ. (b) that is explained by unknown variables

One-Way Repeated Measures Contrasts

ABSTRACT. Professor Gregory R. Hancock, Department of Measurement, Statistics & Evaluation

Inferences on missing information under multiple imputation and two-stage multiple imputation

IMPACT OF NOT FULLY ADDRESSING CROSS-CLASSIFIED MULTILEVEL STRUCTURE IN TESTING MEASUREMENT INVARIANCE AND CONDUCTING MULTILEVEL MIXTURE MODELING

Estimating Multilevel Linear Models as Structural Equation Models

Dimensionality Assessment: Additional Methods

An Equivalency Test for Model Fit. Craig S. Wells. University of Massachusetts Amherst. James. A. Wollack. Ronald C. Serlin

Goals for the Morning

The 3 Indeterminacies of Common Factor Analysis

Transcription:

Structured means analysis for a single group 1 A note on structured means analysis for a single group André Beauducel 1 October 3 rd, 2015 Abstract The calculation of common factor means in structured means analysis (SMM) is considered. The SMM equations imply that the unique factors are defined as having zero means. It was shown within the one factor solution that this definition implies larger absolute common factor loadings to co-occur with larger absolute expectations of the observed variables in the single group case. This result was illustrated by means of a small simulation study. It is argued that the proportionality of factor loadings and observed means should be critically examined in the context of SMM. It is recommended that researchers should freely estimate the observed expectations, whenever, the proportionality of loadings and observed means does not make sense. It was also shown that for a given size of observed expectations, smaller common factor loadings result in larger common factor mean estimates. It should therefore be checked whether variables with very small factor loadings occur when SMM results in very large absolute common factor means. Keywords: Structural equation modeling; structured means analysis; latent means analysis. 1 Dr. André Beauducel, Institute of Psychology, University of Bonn, Kaiser-Karl-Ring 9, 53111 Bonn, Germany, email: beauducel@uni-bonn.de

Structured means analysis for a single group 2 Sörbom (1974) proposed estimates of factor means for confirmatory factor analysis (CFA) and structural equation modeling (SEM). The method was called structured means modeling (SMM) and is widely used in order to test hypotheses on group differences (Cole, Maxwell, Arvey & Salas, 1993; Hancock, Lawrence, & Nevitt, 2000). Several methodological developments have been presented in order to improve SMM, especially for the testing of group differences (Hancock et al., 2000; Whittaker, 2013). However, the estimation of common factor means might also be considered in the single group case. The present note therefore refers to SMM for a single group of participants. Definitions Sörbom (1974) presented the original equations for SMM for multiple groups. Accordingly, the factor model for latent mean analysis for the gth group can be written as xg = μ+ Λξ g g + ε g, with g = 1,2,..., m, (1) where xg is a vector of p observed variables, µ is the constant (intercept) of the observed variables, Λg is a p x q matrix of factor loadings, ξg is a random vector of common factors, and εg is a random vector of unique factors. Equation 1 follows Sörbom s (1974) notation. The usual assumptions of the factor model are that the covariances of the unique factors are a diagonal matrix of unique factor variances (Cov[εg, ε g]= Ψg 2 ) and that the covariances of the common and unique factors are zero (Cov[ξg, εg]= 0). The expected values of the observed variables for the gth group are given by E ( x ) = μ+ Λθ, (2) g g g where θg is the mean vector of the common factors. This is the equation as it is usually presented (Sörbom, 1974; Cole et al., 1993). The expectancy refers to the population of individuals in the gth group. It should be noted that Equation 2 implies E ( ε ) = 0, i.e., that the unique factors have a zero expectation. Therefore, the definition of common factor means according to Equation (2) corresponds to the component model. This follows from the fact that the expectations of the observed variables are calculated as a linear combination from the common factor loadings and that the loadings of the unique factors are not taken into account. Accordingly, the common factor mean θg can be calculated from the observed expectations when Equation 2 is transformed into 1 ( g g) g( E( g) ) = g, g ΛΛ Λ x μ θ (3)

Structured means analysis for a single group 3 where the expectations of the observed variables result from the group specific expectations E( x g ) and the constant μ. The single group case When there is only a single group, it is impossible to distinguish between the overall constant μ and the group-specific expectations E( x g ) of the observed variables. Therefore, only the expectations of the observed variables can be considered, which implies 1 ( ) E( ) =, ΛΛ Λ x θ (4) since the group specific index is not necessary for the single group case. Equation 2 for a single group can be written as Considering a one-factor model yields E ( x) = Λθ. (5) E( x ) Λ = 1θ, (6) where denotes the Hadamard (elementwise) division and 1 is a p 1 unit-vector. Equation 6 implies that the ratio of the expectations of observed variables and the corresponding loadings yields θ. It follows that performing conventional SMM for single group data in order to estimate a single common factor model with a common factor mean implies a proportionality of common factor loadings and observed expectations. This effect is also present with more than one common factor, as long as the variables have only a single common factor loading. It might be less pronounced when each variable has more than one factor loading. However, several applications of the SMM are possible where this proportionality might not be reasonable: For example, why should a more easy task with a larger mean number of correct solutions have higher factor loadings than a more difficult task with a smaller mean number of correct solutions? The model implied by Equations 5 and 6 may fit to the data when floor effects occur for some of the observed variables: Floor effects typically result in smaller variances of the observed variables with smaller means which typically result in smaller inter-correlations of these variables. In such a context of variables, those variables with larger means will have larger factor loadings which will correspond to Equations 5 and 6. In contrast, a ceiling effect might cause the variables with higher means to have smaller variances so that they might have smaller factor loadings. However, a calculation of common factor means that is more compatible with a floor effect than with a ceiling effect may introduce an arbitrary bias into single group SSM. As an illustration for the implications of Equations 5 and 6, a population one-factor model (Model 1) with five observed variables and common factor loadings of.30,.40,.50,.60,

Structured means analysis for a single group 4 and.70 with expectations of the observed variables of 3, 4, 5, 6, and 7 was defined. Model 1 can be directly derived from Equation 5 when a common factor mean of 10 is assumed. Model 2 had exactly the same common factor loadings and a reversed order of expectations of the observed variables (7, 6, 5, 4, and 3; see Table 1). Thus, Model 2 cannot be derived from Equation 5 since there is no constant that can be entered as a common factor mean in order to get the respective expectations. For these two population models a small simulation study based on samples with 150, 300, and 900 cases based on 2,000 replications for each model and sample size was conducted (see Table 1). One-factor models were specified, common factor loadings were freely estimated and the factor variances were fixed at one. In order to be compatible with Equation 5, the intercepts of the observed variables were fixed to zero. As expected, Model 1 fits perfectly to the data with a mean χ²-value that is close to the degrees of freedom (see Table 1) and the estimated common factor mean is close to the value used for model generation. In contrast, Model 2 does not fit to the data since the mean χ²-value is much larger than the degrees of freedom. Although the degree of misfit of Model 2 is strongly affected by sample size, the averaged model parameters are very similar for all samples sizes (see Table 1). Moreover, the relative size of the estimated loadings is reversed for Model 2, so that the loadings are proportional to the observed variables expectations (although this was not the case in the population model). Overall, these results illustrate the consequences of common factor mean estimation in accordance with Equation 5. Model 1, which is based on a proportionality of common factor loadings and observed expectations has a considerably superior fit than Model 2 which is based on anti-proportional factor loadings and observed expectations.

Structured means analysis for a single group 5 Table 1. Factor loadings and factor means for two population models in the single group case Population data Item F1 (Model 1) F1 (Model 2) Λ µ Λ µ x1.30 3.00.30 7.00 x2.40 4.00.40 6.00 x3.50 5.00.50 5.00 x4.60 6.00.60 4.00 x5.70 7.00.70 3.00 2,000 samples, N = 900 Item F1 (Model 1) F1 (Model 2) ˆΛ µ * ˆΛ µ * x1.30 (0.01) 0.00.56 (0.03) 0.00 x2.40 (0.02) 0.00.48 (0.03) 0.00 x3.50 (0.02) 0.00.40 (0.02) 0.00 x4.60 (0.03) 0.00.32 (0.02) 0.00 x5.70 (0.03) 0.00.24 (0.01) 0.00 Factor mean 10.04 (0.43) 12.49 (0.66) Model fit χ² = 9.15 (4.26); df = 9 χ² = 126.82 (22.88); df = 9 2,000 samples, N = 300 Item F1 (Model 1) F1 (Model 2) x1.30 (0.02) 0.00.56 (0.05) 0.00 x2.40 (0.03) 0.00.48 (0.04) 0.00 x3.50 (0.04) 0.00.40 (0.04) 0.00 x4.60 (0.05) 0.00.32 (0.03) 0.00 x5.70 (0.05) 0.00.24 (0.02) 0.00 Factor mean 10.10 (0.79) 12.59 (1.21) Model fit χ² = 9.01 (4.28); df = 9 χ² = 48.47 (13.42); df = 9 2,000 samples, N = 150 Item F1 (Model 1) F1 (Model 2) x1.30 (0.03) 0.00.56 (0.07) 0.00 x2.40 (0.04) 0.00.48 (0.06) 0.00 x3.49 (0.05) 0.00.40 (0.05) 0.00 x4.59 (0.06) 0.00.32 (0.04) 0.00 x5.69 (0.07) 0.00.24 (0.03) 0.00 Factor 10.25 (1.16) 12.83 (1.89) mean Model fit χ² = 9.16 (4.32); df = 9 χ² = 28.46 (9.99); df = 9 Note. * Parameters were fixed for model specification. Standard deviations are given in brackets.

Structured means analysis for a single group 6 Note that the simple way out of the problem is not to freely estimate the common factor mean and all the observed variables means, because such a model would not be identified. It is, however, possible to fix one observed variable mean per factor and to estimate the common factor mean on this basis. This strategy would be incompatible with Equation 5 and it introduces the choice of the observed variables mean to be fixed. When a single variables mean is fixed to zero, the common factor mean is calculated on the basis of the empirical mean and the loading of this variable. When the expectations of all observed variables are equal, it follows from Equation 5 that choosing a variable with a small common factor loading will lead to a larger common factor mean than choosing a variable with a large common factor loading. When there is, moreover, variation of the loading sizes on one factor, the choice of the variable with the fixed mean will affect the common factor mean. For example, in Model 2 of the simulation study (Table 1), fixing the mean of the variable with the smallest loading (x1) to zero results in an averaged common factor mean of 23.82 (SD = 3.90) in the simulation based on 900 cases. In contrast, in the same condition, fixing the mean of variable with the largest loading (x5) to zero results in a common factor mean of 4.32 (SD = 0.37). Thus, the modeling strategy of fixing a single mean introduces considerably arbitrariness into the resulting common factor mean. The relation between common factor loadings and the resulting common factor mean can be more generally described. Theorem 1 shows the consequences of estimating factor means according to Equation 6, when factor models based on observed variables with the same E( x) but different Λ are compared. Consider a one-factor model with equal loadings on the common factor, so that Λ = w1, where 1 is a p 1 unit-vector, and w is a scalar. Theorem 1. w 0 implies abs( θ) for E( x) 0, with abs denoting absolute values. Proof. Entering Λ = w1 into Equation 6 yields This completes the proof. 1 1 1 ( w w) w E( ) = p w E( ) =. 11 1 x 1 x θ (7) Theorem 1 describes the following relationship for constant E( x ): The smaller the absolute size of the common factor loadings, the larger is the absolute size of the common factor mean. On this basis, one might question whether a modeling of factor means according to Equation 5 and 6 is reasonable in the single-group case, since lower factor loadings normally imply that a more careful interpretation of the factor would be reasonable.

Structured means analysis for a single group 7 Discussion The effect of SMM on the descriptive common factor means was investigated for the singlegroup case. It was found that conventional SMM, which is based on the equations of Sörbom (1974), implies that the expectations of the observed variables are proportional to the absolute size of the common factor loadings. There is, however, no theoretical reason for the expectations of observed variables being proportional to the common factor loadings. In several tests or inventories the relative size of the expectations depends on a more or less arbitrary scale of the variables. In other inventories, the expectations also depend on ceiling effects and floor effects. Interestingly, floor effects implying smaller variances, thereby smaller intercorrelations and factor loadings for variables with smaller expectations are compatible with the assumptions of the SMM, whereas ceiling effects are not. It follows that researchers should check, whether such effects are present in their data, in order to be aware in their effects on SMM. Moreover, even equal means of the observed variables could be a cause of model misfit, when the factor loadings are unequal. In order to avoid model evaluations that are based on the misfit that is caused by the proportionality assumption of the SMM, researchers should freely estimate some of the intercepts of the observed variables, whenever the proportionality assumption of the SMM does not make sense in the specific context of research. However, this leads to the problem of deciding which intercept has to be freely estimated. Since it was shown that larger absolute common factor means occur for smaller absolute common factor loadings when the observed expectations (intercepts) remain unchanged (Theorem 1), it is recommended to freely estimate the intercepts of the variables with the smallest loadings. This would ensure that the most important observed variables are used in order to estimate the common factor mean and that the common factor mean is not inflated due to the effect of small loadings.

Structured means analysis for a single group 8 References Cole, D. A., Maxwell, S. E., Arvey, R., & Salas, E. (1993). Multivariate group comparisons of variable systems: MANOVA and structural equation modeling. Psychological Bulletin, 114, 174 184. Hancock, G. R., Lawrence, F. R., & Nevitt, J. (2000). Type I error and power of latent mean methods and MANOVA in factorially invariant and noninvariant latent variable systems. Structural Equation Modeling, 7, 534-556. Sörbom, D. (1974). A general method for studying differences in factor means and factor structure between groups. British Journal of Mathematical and Statistical Psychology, 27, 229 239. Whittaker, T. A. (2013). The impact of noninvariant intercepts in latent means models. Structural Equation Modeling, 20, 108-130.