Mixture Modeling. Identifying the Correct Number of Classes in a Growth Mixture Model. Davood Tofighi Craig Enders Arizona State University

Similar documents
Chapter 19. Latent Variable Analysis. Growth Mixture Modeling and Related Techniques for Longitudinal Data. Bengt Muthén

The Bayesian Method of Estimation for the Number of Latent Classes in Growth Mixture Models

Testing the Limits of Latent Class Analysis. Ingrid Carlson Wurpts

Determining the number of components in mixture models for hierarchical data

Measurement Invariance Testing with Many Groups: A Comparison of Five Approaches (Online Supplements)

COMPARING THREE EFFECT SIZES FOR LATENT CLASS ANALYSIS. Elvalicia A. Granado, B.S., M.S. Dissertation Prepared for the Degree of DOCTOR OF PHILOSOPHY

Variable-Specific Entropy Contribution

Model Estimation Example

Using Mplus individual residual plots for. diagnostics and model evaluation in SEM

What is Latent Class Analysis. Tarani Chandola

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data

PERFORMANCE OF INFORMATION CRITERIA FOR MODEL SELECTION IN A LATENT GROWTH CURVE MIXTURE MODEL

General structural model Part 2: Categorical variables and beyond. Psychology 588: Covariance structure and factor models

ENUMERATING THE CORRECT NUMBER OF CLASSES IN A SEMIPARAMETRIC GROUP-BASED TRAJECTORY MODEL. Thomas James Blaze

Strati cation in Multivariate Modeling

Power analysis for the Bootstrap Likelihood Ratio Test in Latent Class Models

Latent Class Analysis

Bayesian Analysis of Latent Variable Models using Mplus

Analyzing Criminal Trajectory Profiles: Bridging Multilevel and Group-based Approaches Using Growth Mixture Modeling

Using Bayesian Priors for More Flexible Latent Class Analysis

Global Model Fit Test for Nonlinear SEM

Analyzing criminal trajectory profiles: Bridging multilevel and group-based approaches using growth mixture modeling

SRMR in Mplus. Tihomir Asparouhov and Bengt Muthén. May 2, 2018

Running head: DETECTING LONGITUDINAL HETEROGENEITY 1. Growth mixture models outperform simpler clustering algorithms when detecting

Auxiliary Variables in Mixture Modeling: Using the BCH Method in Mplus to Estimate a Distal Outcome Model and an Arbitrary Secondary Model

The Integration of Continuous and Discrete Latent Variable Models: Potential Problems and Promising Opportunities

Categorical and Zero Inflated Growth Models

Relating Latent Class Analysis Results to Variables not Included in the Analysis

Bayesian Mixture Modeling

INTRODUCTION TO STRUCTURAL EQUATION MODELS

Multiple Group CFA Invariance Example (data from Brown Chapter 7) using MLR Mplus 7.4: Major Depression Criteria across Men and Women (n = 345 each)

NELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation

A (Brief) Introduction to Crossed Random Effects Models for Repeated Measures Data

Advances in Mixture Modeling And More

Application of Plausible Values of Latent Variables to Analyzing BSI-18 Factors. Jichuan Wang, Ph.D

The Impact of Model Misspecification in Clustered and Continuous Growth Modeling

Signal Detection Theory With Finite Mixture Distributions: Theoretical Developments With Applications to Recognition Memory

Outline. Mixed models in R using the lme4 package Part 3: Longitudinal data. Sleep deprivation data. Simple longitudinal data

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models

Departamento de Economía Universidad de Chile

Introductory Econometrics

An Introduction to Path Analysis

All models are wrong but some are useful. George Box (1979)

An Introduction to Mplus and Path Analysis

DIC, AIC, BIC, PPL, MSPE Residuals Predictive residuals

Multilevel regression mixture analysis

Investigating the Feasibility of Using Mplus in the Estimation of Growth Mixture Models

Parameter Estimation. William H. Jefferys University of Texas at Austin Parameter Estimation 7/26/05 1

Ruth E. Mathiowetz. Chapel Hill 2010

A single-level random-effects cross-lagged panel model for longitudinal mediation analysis

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models

Multi-group analyses for measurement invariance parameter estimates and model fit (ML)

Growth Mixture Model

A Study of Statistical Power and Type I Errors in Testing a Factor Analytic. Model for Group Differences in Regression Intercepts

Maximum Likelihood Estimation; Robust Maximum Likelihood; Missing Data with Maximum Likelihood

ABSTRACT. Phillip Edward Gagné. priori information about population membership. There have, however, been

Basic Sampling Methods

Longitudinal Invariance CFA (using MLR) Example in Mplus v. 7.4 (N = 151; 6 items over 3 occasions)

Statistics 202: Data Mining. c Jonathan Taylor. Model-based clustering Based in part on slides from textbook, slides of Susan Holmes.

Model Invariance Testing Under Different Levels of Invariance. W. Holmes Finch Brian F. French

Latent Variable Centering of Predictors and Mediators in Multilevel and Time-Series Models

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010

Likelihood Ratio Criterion for Testing Sphericity from a Multivariate Normal Sample with 2-step Monotone Missing Data Pattern

Model comparison and selection

Political Science 236 Hypothesis Testing: Review and Bootstrapping

2/26/2017. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2

Conventional And Robust Paired And Independent-Samples t Tests: Type I Error And Power Rates

Akaike Information Criterion

Econometric Analysis of Cross Section and Panel Data

36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs)

Inference using structural equations with latent variables

Introduction to Statistical modeling: handout for Math 489/583

Confidence Intervals for a Semiparametric Approach to Modeling Nonlinear Relations among Latent Variables

A Comparison of Two Approaches For Selecting Covariance Structures in The Analysis of Repeated Measurements. H.J. Keselman University of Manitoba

Bayesian Analysis of Multivariate Normal Models when Dimensions are Absent

Lecture 5: Poisson and logistic regression

Nesting and Equivalence Testing

Structural Equation Modeling and Confirmatory Factor Analysis. Types of Variables

Evaluating Small Sample Approaches for Model Test Statistics in Structural Equation Modeling

Weighted tests of homogeneity for testing the number of components in a mixture

Example 7b: Generalized Models for Ordinal Longitudinal Data using SAS GLIMMIX, STATA MEOLOGIT, and MPLUS (last proportional odds model only)

5/3/2012 Moderator Effects with SAS 1

SC705: Advanced Statistics Instructor: Natasha Sarkisian Class notes: Introduction to Structural Equation Modeling (SEM)

Parametric Modelling of Over-dispersed Count Data. Part III / MMath (Applied Statistics) 1

Exploring Cultural Differences with Structural Equation Modelling

Long memory in the R$/US$ exchange rate: A robust analysis

On the Behavior of Marginal and Conditional Akaike Information Criteria in Linear Mixed Models

SEM for Categorical Outcomes

* Tuesday 17 January :30-16:30 (2 hours) Recored on ESSE3 General introduction to the course.

Lecture 2: Poisson and logistic regression

Inflation Revisited: New Evidence from Modified Unit Root Tests

Introduction to Within-Person Analysis and RM ANOVA

9. Model Selection. statistical models. overview of model selection. information criteria. goodness-of-fit measures

CHAPTER 6: SPECIFICATION VARIABLES

Mixtures of Rasch Models

Statistical Practice. Selecting the Best Linear Mixed Model Under REML. Matthew J. GURKA

Inferences on missing information under multiple imputation and two-stage multiple imputation

Multilevel Mixture with Known Mixing Proportions: Applications to School and Individual Level Overweight and Obesity Data from Birmingham, England

Moment and IV Selection Approaches: A Comparative Simulation Study

Growth Mixture Modeling and Causal Inference. Booil Jo Stanford University

Transcription:

Identifying the Correct Number of Classes in a Growth Mixture Model Davood Tofighi Craig Enders Arizona State University Mixture Modeling Heterogeneity exists such that the data are comprised of two or more latent classes with different distributions K k = Class Class Mixture Distribution ( y ) f ( y) = πφ ; µ, Σ k k k k

Growth Mixture Modeling (GMM) k latent classes with different growth trajectories and variance components Class membership may be related to covariates and distal outcomes GMM is analogous to a multiple-group growth model, but group membership is unobserved Religiousness Example McCullough, Enders, Brion (6) Three classes of religious development Religiousness 4 4 44 54 64 74 84 Age

Enumerating Latent Classes How many trajectory classes are there? Information-based criteria BIC, AIC, etc. Likelihood ratio tests Lo, Mendell, Rubin () Goodness of fit tests Tests based on model-implied skewness and kurtosis (Muthén, ) Bayesian Information Criterion Based on the log likelihood and penalty terms related to model complexity BIC = LL + pln( N) The sample-size size adjusted BIC (SABIC) replaces N with (N( + ) / 4

Akaike Information Criterion Similar idea as the BIC AIC = LL + p The consistent AIC (CAIC) is CAIC = LL + p(ln[ N] + ) A sample-size size adjusted CAIC (SACAIC) replaces N with (N( + ) / 4 Likelihood Ratio Tests Likelihood ratio tests can be used to compare a k versus k - class model LRT = ( LL LL ) k The LRT is not chi-square distributed Class probabilities for the nested k - class model are at the boundary (zero) k 4

Lo, Mendell,, and Rubin () Derived an appropriate reference distribution for the LRT, and an ad hoc adjustment to the test statistic LMR and adjusted LMR (ALMR) A small p value suggests that the k class model is favored over k - classes Bootstrapping The LRT The k and k - class models are fit to a number of bootstrap samples A p value for the LRT is obtained from the empirical reference distribution of bootstrapped LRT values See Nylund, Asparouhov,, and Muthén (6) for detailed simulation results 5

Fit Tests Based On Multivariate Skewness And Kurtosis Model-implied skewness and kurtosis from the k class model are compared to the sample moments (Muthén( Muthén,, ) Analogous to the GOF test in SEM A large p value indicates that the k class model accurately reproduces the higher- order moments Herein referred to as MST and MKT An Artificial Data Example An artificial data set (N( = ) was generated from a population with three trajectory classes A sequence of models was fit (k( = to 4) to illustrate the class extraction process 6

Analysis Results k = k = k = k = 4 BIC 7.77 668.68 66.59 6649.66 SABIC 7.5 66. 655.54 656.7 LMR N/A p <. p <. p =.9 BLRT N/A p <. p =. p =. MST p <. p =.6 p =.8 p =.86 MKT p <. p =.9 p =.49 p =.55 More On The LRT Multiple k - class models are possible For example, when testing a class model, three different class models could result Mplus discards the first class when fitting the k - class model Starting values must be used to ensure that classes are ordered properly 7

Example Correct ordering -class model LL = -64.7 -class model LL = -64.7 LL = -88.5 -class model LL = -88.5 LL = -.6 Incorrect ordering -class model LL = -64.7 -class model LL = -64.7 LL = -88.5 -class model LL = -65.7 LL = -.6 Purpose Of Study In the previous example, the fit indices did not agree on the number of classes Which index should be used to determine the number of classes? We designed a Monte Carlo simulation study to address this question 8

Simulation Procedure Data were generated from a population with three trajectory classes A sequence of GMMs was fit (k( = to 4) Extraction was performed with and without covariates In what proportion of replications was the k = class model recovered? Data Generation Model y y y5 y7 4 6 4 6 6 i s q c x x 9

Population Trajectory Classes 5 5 Class Class Class 5 4 5 6 7 Time Manipulated Variables Number of repeated measures t = 4, 7 Sample size N = 4, 7,, Mixing proportions %, %, 47% and 7%, 6%, 57% Within-class normality S =, K = and S =, K = Class separation High and Low

4 4 5 5 6 6 Manipulating Class Separation Definition of separation is subjective, but based on previous experience (e.g., McCullough, Enders, & Brion,, 5) Within-class variance components were increased in magnitude to create the low separation condition Mean growth trajectories did not change Class : High Versus Low Separation 6 5 4 High Separation 6 5 4 Low Separation

4 4 4 4 5 5 5 5 6 6 6 6 Class : High Versus Low Separation 6 5 4 High Separation 6 5 4 Low Separation Class : High Versus Low Separation 6 5 4 High Separation 6 5 4 Low Separation

Average Class Probabilities Low Separation High Separation.85...9.8...8.8.5.89.8.7..8...89 Sample Size (No Covariates) Normality, High Separation, t = 4, Proportions = ::47 % Correct Number of Classes.9.8.7.6.5.4... BIC SABIC AIC CAIC SACAIC LMR MST MKT N = 4 N = 7 N = N =

Sample Size (With Covariates) Normality, High Separation, t = 4, Proportions = ::47.9 % Correct Number of Classes.8.7.6.5.4... N = 4 N = 7 N = N = BIC SABIC AIC CAIC SACAIC LMR MST MKT Class Separation (No Covariates) Normality, N =, t = 4, Proportions = ::47.9 % Correct Number of Classes.8.7.6.5.4... Low Sep High Sep BIC SABIC AIC CAIC SACAIC LMR MST MKT 4

Class Separation (Covariates) Normality, N =, t = 4, Proportions = ::47.9 % Correct Number of Classes.8.7.6.5.4... Low Sep High Sep BIC SABIC AIC CAIC SACAIC LMR MST MKT Mixing Proportion (No Covariates) Normality, N =, t = 4, High Separation.9 % Correct Number of Classes.8.7.6.5.4... ::47 7:6:57 BIC SABIC AIC CAIC SACAIC LMR MST MKT 5

Mixing Proportion (Covariates) Normality, N =, t = 4, High Separation.9 % Correct Number of Classes.8.7.6.5.4... ::47 7:6:57 BIC SABIC AIC CAIC SACAIC LMR MST MKT Normality (No Covariates) N =, t = 4, Proportions = ::47, High Separation.9 % Correct Number of Classes.8.7.6.5.4... Normal Nonnormal BIC SABIC AIC CAIC SACAIC LMR MST MKT 6

Normality (Covariates) N =, t = 4, Proportions = ::47, High Separation.9.8.7.6.5.4... BIC SABIC AIC CAIC SACAIC LMR MST MKT % Correct Number of Classes Normal Nonnormal Information-Based Criteria The SABIC very accurately detected the number of latent classes At small Ns, it had a slight tendency to extract too few classes Note that the k class model was retained if the SABIC decreased by any amount Other information-based criteria performed poorly 7

Likelihood Ratio Tests The LMR and ALMR performed well, but were somewhat less powerful than the SABIC LMR tended to extract too few classes at small Ns, and too many classes at large Ns MSK and MKT MSK and MKT uniformly extracted too few classes The performance of these measures may be model-dependent dependent MSK was slightly more accurate than the LMR in a pilot study with a slightly different set of mixture distributions 8

The Use Of Covariates The inclusion of covariates dramatically decreased power, and resulted in the extraction of too few classes e.g., At N = 4, the SABIC extracted two classes 9% of the time, compared to 4% when covariates were excluded The use of covariates should be avoided unless N is very large Nonnormal Data Mild violations of within-class normality led to the extraction of too many classes None of the tests we studied was immune to this problem Is bootstrapping the LRT a solution? 9

Bootstrapping The LRT Preliminary results suggest that the bootstrap is less powerful than the LMR e.g., The k = class model was correctly rejected about 75% and % of the time in the high and low separation conditions, respectively See Nylund, Asparouhov,, and Muthén (6) for detailed simulation results How Likely Is Over-Extraction? Extracting too many classes frequently produced problematic solutions (e.g., negative variances, unstable solutions) Estimating class-specific specific variances probably prevents over-extraction extraction Invoking constraints to attain convergence (e.g., fixing variances to zero) is likely a sign of over-extraction extraction