SEM for Categorical Outcomes

Similar documents
Inference using structural equations with latent variables

General structural model Part 2: Categorical variables and beyond. Psychology 588: Covariance structure and factor models

Advanced Structural Equations Models I

Simple Linear Regression. John McGready Johns Hopkins University

Nesting and Equivalence Testing

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models

Factor Analysis. Qian-Li Xue

Lecture 27. December 13, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.

Introduction to Structural Equations

Centering Predictor and Mediator Variables in Multilevel and Time-Series Models

Sampling Variability and Confidence Intervals. John McGready Johns Hopkins University

Bayesian Analysis of Latent Variable Models using Mplus

CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA

Multilevel Structural Equation Modeling

Outline. 1. Define likelihood 2. Interpretations of likelihoods 3. Likelihood plots 4. Maximum likelihood 5. Likelihood ratio benchmarks

Lecture 23. November 15, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.

Linear Methods for Prediction

Model Estimation Example

Introduction to Matrix Algebra and the Multivariate Normal Distribution

Confirmatory Factor Analysis: Model comparison, respecification, and more. Psychology 588: Covariance structure and factor models

WHAT IS STRUCTURAL EQUATION MODELING (SEM)?

Uppsala University and Norwegian School of Management, b Uppsala University, Online publication date: 08 July 2010

SRMR in Mplus. Tihomir Asparouhov and Bengt Muthén. May 2, 2018

Lecture 21. December 19, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.

Multiple Group CFA Invariance Example (data from Brown Chapter 7) using MLR Mplus 7.4: Major Depression Criteria across Men and Women (n = 345 each)

STA 216, GLM, Lecture 16. October 29, 2007

Multiple group models for ordinal variables

Introduction to Statistical Analysis

Section B. The Theoretical Sampling Distribution of the Sample Mean and Its Estimate Based on a Single Sample

Assessment, analysis and interpretation of Patient Reported Outcomes (PROs)

Comparing IRT with Other Models

Testing Structural Equation Models: The Effect of Kurtosis

Psychology 282 Lecture #4 Outline Inferences in SLR

From last time... The equations

Plausible Values for Latent Variables Using Mplus

Structural Equation Modeling and Confirmatory Factor Analysis. Types of Variables

Measurement Invariance (MI) in CFA and Differential Item Functioning (DIF) in IRT/IFA

Specifying Latent Curve and Other Growth Models Using Mplus. (Revised )

Auxiliary Variables in Mixture Modeling: Using the BCH Method in Mplus to Estimate a Distal Outcome Model and an Arbitrary Secondary Model

1 Hypothesis testing for a single mean

Statistics for laboratory scientists II

Latent Class Analysis

Citation for published version (APA): Jak, S. (2013). Cluster bias: Testing measurement invariance in multilevel data

A Framework for the Study of Urban Health. Abdullah Baqui, DrPH, MPH, MBBS Johns Hopkins University

Latent variable interactions

NELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation

Online Appendix for Sterba, S.K. (2013). Understanding linkages among mixture models. Multivariate Behavioral Research, 48,

Toxicokinetics: Absorption, Distribution, and Excretion. Michael A. Trush, PhD Johns Hopkins University

Variable-Specific Entropy Contribution

Old and new approaches for the analysis of categorical data in a SEM framework

Assessing Factorial Invariance in Ordered-Categorical Measures

Investigating Models with Two or Three Categories

Global Model Fit Test for Nonlinear SEM

Probability measures A probability measure, P, is a real valued function from the collection of possible events so that the following

Maximum Likelihood Estimation; Robust Maximum Likelihood; Missing Data with Maximum Likelihood

Generalized Linear Models for Non-Normal Data

DEEP, University of Lausanne Lectures on Econometric Analysis of Count Data Pravin K. Trivedi May 2005

Computational Systems Biology: Biology X

Application of Plausible Values of Latent Variables to Analyzing BSI-18 Factors. Jichuan Wang, Ph.D

Multivariate Capability Analysis Using Statgraphics. Presented by Dr. Neil W. Polhemus

Contents. Acknowledgments. xix

Methods for Handling Missing Non-Normal Data in Structural Equation Modeling. Fan Jia

Estimation of Curvilinear Effects in SEM. Rex B. Kline, September 2009

Correlations with Categorical Data

An Introduction to SEM in Mplus

Exploring Cultural Differences with Structural Equation Modelling

Extending the Robust Means Modeling Framework. Alyssa Counsell, Phil Chalmers, Matt Sigal, Rob Cribbie

Moderation 調節 = 交互作用

Confirmatory Factor Analysis. Psych 818 DeShon

Consequences of measurement error. Psychology 588: Covariance structure and factor models

Probability Pr(A) 0, for any event A. 2. Pr(S) = 1, for the sample space S. 3. If A and B are mutually exclusive, Pr(A or B) = Pr(A) + Pr(B).

Selection endogenous dummy ordered probit, and selection endogenous dummy dynamic ordered probit models

Mixture Modeling in Mplus

Categorical and Zero Inflated Growth Models

What is in the Book: Outline

Parametric and semiparametric estimation of ordered response models with sample selection and individual-specific thresholds

Chapter Fifteen. Frequency Distribution, Cross-Tabulation, and Hypothesis Testing

A Note on Item Restscore Association in Rasch Models

Using Mplus individual residual plots for. diagnostics and model evaluation in SEM

EVALUATION OF STRUCTURAL EQUATION MODELS

Maximum Likelihood Estimation. only training data is available to design a classifier

Ridge Structural Equation Modeling with Correlation Matrices for Ordinal and Continuous Data. Ke-Hai Yuan University of Notre Dame

Online Appendices for: Modeling Latent Growth With Multiple Indicators: A Comparison of Three Approaches

Preface. List of examples

Statistics 203: Introduction to Regression and Analysis of Variance Course review

Package paramap. R topics documented: September 20, 2017

A Re-Introduction to General Linear Models (GLM)

Generalized Models: Part 1

Package threg. August 10, 2015

Indirect estimation of a simultaneous limited dependent variable model for patient costs and outcome

Assessing the relation between language comprehension and performance in general chemistry. Appendices

Frequency Distribution Cross-Tabulation

A Threshold-Free Approach to the Study of the Structure of Binary Data

Introduction to Maximum Likelihood Estimation

Nemours Biomedical Research Statistics Course. Li Xie Nemours Biostatistics Core October 14, 2014

Statistics in medicine

STRUCTURAL EQUATION MODELING. Khaled Bedair Statistics Department Virginia Tech LISA, Summer 2013

Application of Variance Homogeneity Tests Under Violation of Normality Assumption

Estimating chopit models in gllamm Political efficacy example from King et al. (2002)

Confidence Intervals, Testing and ANOVA Summary

Transcription:

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this site. Copyright 2007, The Johns Hopkins University and Qian-Li Xue. All rights reserved. Use of these materials permitted only in accordance with license rights granted. Materials provided AS IS ; no representations or warranties provided. User assumes all responsibility for use, and all liability related thereto, and must independently review all materials for accuracy and efficacy. May contain materials owned by others. User is responsible for obtaining permissions for use from third parties as needed.

SEM for Categorical Outcomes Statistics for Psychosocial Research II: Structural Models Qian-Li Xue

Outline Consequences of violating distributional assumptions with continuous observed variables SEM for categorical observed variables

Consequences of Violation of Multivariate Normality Assumption Observed Variable Dist. Consistency Properties of ML and GLS Estimators Asymptotic Efficiency ACOV( θˆ ) Chi-square Estimator Multivariate Normal Yes Yes Correct Correct No Kurtosis Yes Yes Correct Correct Arbitrary yes no Incorrect incorrect Adapted from Bollen s Structural Equations with Latent Variables, p. 46

Definition Tests of Non-normality For a random variable X with a population mean of μ The r th moment about the mean is μ r = r E( X μ ) for r > Population parameter μ Sample Statistic Normal Dist. Skewness 0 ( μ ) 3/ 2 2 μ Kurtosis 4 4 3 ( μ ) 2 2 3 m ( m ) 3/ 2 2 m ( m ) 2 2 3 Normal Normal m r ( X X ) N r = Σ /

Tests of Non-normality Univariate Test Calculate the first four sample moments of the observed variable Calculate skewness and kurtosis based on these sample moments Test H 0 : skewness=0 and H 0 : kurtosis=3 (D Agostino, 986, see Bollen, p.42) Joint test of skewness and kurtosis equal to that of a normal distribution, i.e. H 0 : skewness=0 and kurtosis=3 (if N 00) Using sktest command in STATA Multivariate Test for multivariate skewness and kurtosis Using univariate tests with a Bonferroni adjustment based on the fact: Multivariate normality univariate normality Mardia s multivariate test (see Bollen, pp.423-424) or Use multnorm in STATA

Solutions for Non-normality. Transformation of the observed variables to achieve approximate normality 2. Post-estimation adjustments to the usual test statistics and standard errors (Browne, 982, 984) 3. Nonparametric tests via bootstrap resampling procedures However, neither 2 nor 3 corrects the lack of asymptotic efficiency of θˆ

A Better Solution for Non-normality 4. Weighted Least Squares (WLS) Estimators To minimize the fitting function: = s σ ( θ ) W s σ ( θ ) F WLS [ ] [ ] where s is a vector of n(n+)/2 non-redundant elements in S, σ(θ) si the vector of corresponding elements in Σ(θ), and W - is a (n(n+)/2) (n(n+)/2) weight matrix Optimal choice for W: asymptotic covariance matrix of the sample covariances (i.e. s) With the optimal choice of W, the WLS fitting function is also termed arbitrary distribution function (ADF) It can be shown that F GSL, F MLS, and F ULS are special cases of F WLS

Pros and Cons of the WLS Estimator Pros Minimal assumptions about the distribution of the observed variables The WLS is a consistent and efficient estimator Provide valid estimates of asymptotic covariance matrix of θˆ and a chi-square test statistic Cons Computational burden Larger sample size requirement for convergence compared to other estimators Not clear about the degree to which WLS outperforms F GSL, F MLS, and F ULS in the case of minor violation of normality

SEM with Categorical Observed Variables So far, we have assumed that the observed and latent variables are continuous What happens if we have observed variables taking ordinal or binary values? Are the estimators and significance tests for continuous variables still valid for categorical variables? We will deal with categorical latent variables in next lecture

Consequences of Using Ordinal Indicators as if They were Continuous. y Λ η + ε y 2. x Λ x ξ +δ 3. Σ Σ(θ ) 4. ACOV ( s ij, s gh ) ACOV ( s * ij, s * gh )

Corrective Procedures for and 2 Define a nonlinear function relating the observed categorical variables (y and/or x) to the latent continuous variables (y* and/or x*) y * = y Λ η + ε Assume and x * = Λ x ξ + δ For example, Where a is the category threshold. y 0 if = if y y * * a > a Not Depressed a Depressed Y *

Corrective Procedures for and 2 In general, define Where c is the number of categories for y, a i (i=,2,,c-) is the category threshold, and y * is the latent continuous indicator > < < = * * 2 2 * * if if if 2 if c c c a y c a y a c a y a a y y M

Determine the Thresholds y* and x* ~ multivariate normal Such that each variable of y* and x* ~ univariate normal Standardize each variable to a mean of 0 and a variance of a a 2 2 3 y * y An estimate of the threshold is: a i i = Φ k = N N k Where Φ is the standardized normal distribution function Adapted from Bollen s Structural Equations with Latent Variables, p.440

Example: Industrialization and Political Democracy ε ε 2 ε 3 ε 4 ε 5 ε 6 ε 7 ε 8 y y 2 y 3 y 4 y 5 y 6 y 7 y 8 ζ λ λ λ 3 4 λ λ 2 3 λ 4 2 Y,Y 5 : freedom of press β 2 η Y 2,Y 6 : freedom of group opposition η 2 Y 3,Y 7 : fairness of election Y 4,Y 8 : effectiveness of legislative γ γ 2 body ζ x is GNP per capita ξ 2 x 2 is energy consumption per capita λ6 λ 7 x 3 is % labor force x x 2 x 3 δ δ 2 δ 3 Bollen pp322-323

Determine the Threshold Consider a categorized version of the 960 free press measure Y 2 3 4 5 6 7 8 Frequency 8 3 5 3 5 22 4 5 Proportion 0. 0.7 0.07 0.7 0.07 0.29 0.05 0.07 Cum.Prop. 0. 0.28 0.35 0.52 0.59 0.88 0.93.00 Threshold a a 2 a 3 a 4 a 5 a 6 a 7 Estimate -.24-0.58-0.39 0.05 0.22.7.50 Adapted from Bollen s Structural Equations with Latent Variables, p. 440-44

Corrective Procedures for 3 (i.e. Σ Σ(θ)) Assume: Σ*=Σ(θ), where Σ* is the covariance matrix of y* and x* y* and x* ~ multivariate normal Idea: estimate correlation between each pair of latent variables y i * and x j * If are both y i and x j are continuous, calculate Pearson correlation If are both y i and x j are ordinal, calculate polychoric correlation between y i * and x j * If are both y i and x j are binary, calculate tetrachoric correlation between y i * and x j * If one is ordinal and the other is continuous, calculate polyserial correlation between y i * and x j *

Pros and Cons of Polychoric and Tetrachoric Correlation (Pearson, 90) Pros In a familiar form of a correlation coefficient Separately quantify association and similarity of category definitions Cons Model assumptions are not always appropriate With only two variables, the assumptions of the tetrachoric correlation can not be tested Independent of number of categories Assumptions underlying the polychoric and tetrachoric correlation can be easily tested Estimation software is routinely available (Uebersax JS)

Maximum Likelihood Estimation of the Polychoric Correlation For example, the log likelihood for estimation of the polychoric correlation based on a I J table of two ordinal variables x and y is y ln L π ij Φ 2 ( a = = Φ i I i= j= 2, b ( a j J i N, b j ij ) + Φ ln( π ) + C 2 3 ) Φ 2 ( a 2 ij i ( a, b i j, b ) j ) where N ij is the frequency of observations in the ith and jth categories, C is a constant, a i and b j are thresholds for x and y, respectively, and Φ 2 is the bivariate normal distribution function with correlation ρ An iterative search algorithm tries different combinations for ai, bj and ρ to find a optimal combination for minimizing the difference between the expected counts to the observed counts x 2 3 (Olsson, 979)

A Few Important Facts The polychoric correlation matrix Σ p based on y and x is a consistent estimator of Σ* Analysis of Σ p via F ML, F GLS, or F ULS yields consistent estimators of θ However, standard errors, significant tests (e.g. chisquare tests) are incorrect!! A better choice is F WLS : ˆ = ρ σ ( θ ) W ˆ ρ σ ( θ ) F WLS [ ] [ ] where ρˆ is [n(n+)/2] vector of the polychoric correlations, σ(θ) is the implied covariance matrix, and W is the asymptotic covariance matrix of ρˆ (Muthen, 984).

MPLUS Fitting of CFA with Categorical Indicators TITLE: this is an example of a CFA with categorical factor indicators DATA: FILE IS ex5.2.dat; VARIABLE: NAMES ARE u-u6; CATEGORICAL ARE u-u6; MODEL: f BY u-u3; f2 BY u4-u6; U-u6 are binary indicators Declare U-u6 to be categorical indicators The default estimator is robust weighted least squares estimator

MPLUS Fitting of CFA with Continuous and Categorical Indicators TITLE: this is an example of a CFA with continuous and categorical factor indicators DATA: FILE IS ex5.3.dat; VARIABLE: NAMES ARE u-u3 y4-y6; CATEGORICAL ARE u u2 u3; MODEL: f BY u-u3; f2 BY y4-y6; By default, MPLUS treats y4-u6 as continuous indicators Declare only u-u3 to be categorical indicators

Example: Frailty and Disability Study Population: Women s Health and Aging Studies I; N = 002 Community-dwelling women 65-0 yrs; Represent one-third most disabled women Outcome: Frailty by 5 binary indicators Disability by 5 4-level ordinal indicators Predictor: Age, education, disease burden

Outcome Definitions Frailty Binary Criteria: Shrinking (weight loss) Weakness Poor endurance Slowed walking speed Low physical activity Mobility Disability Ordinal Criteria: Walk ¼ mile Climb up 0 steps Lift 0 lbs Transfer from bed to chair Heavy housework Classification: Non-frail: 0/5 criteria Pre-frail: or 2/5 criteria Frail: 3,4, or 5/5 criteria Each rated on a four-point scale: 0 no difficulty a little difficulty 2 some difficulty 3 a lot of difficulty/unable

Example: Frailty and Disability Study Aims ) Evaluate the association between frailty and mobility disability 2) Study potential risk factors of frailty and mobility disability Age, education, number of chronic diseases 3) Assess racial differences in ) and 2)

Example: Frailty and Disability ε ε 2 ε 3 ε 4 ε 5 ε 6 ε 7 ε 8 ε 9 ε 0 y y 2 y 3 y 4 y 5 y 6 y 7 y 8 y 9 y 0 λ λ λ 3 4 λ 5 λ λ 2 3 λ 4 λ 5 2 ζ Frailty γ γ 2 β 2 Disability γ 3 γ 2 γ 22 γ 23 ζ 2 Age Educ Disease

Example: Measurement Model for Mobility By default, MPLUS sets loadings and thresholds to be the same across groups (i.e. a more restricted model) TITLE: this is an example of a multiple group CFA with categorical factor indicators for mobility disability and a threshold structure DATA:FILE IS c:\teaching\40.658.2007\catna.dat; VARIABLE: NAMES ARE baseid age race educ disease shrink strength speed exhaust physical lift walk stairs transfer hhw; USEVARIABLES ARE race lift-hhw; CATEGORICAL ARE lift-hhw; GROUPING IS race (0=white =black); ANALYSIS: TYPE = MEANSTRUCTURE; DIFFTEST IS c:\teaching\40.658.2007\deriv.dat; MODEL: mobility BY lift* walk@ stairs-hhw; OUTPUT: SAMPSTAT; See output file: catcfad.out

Example: Measurement Model for Mobility Set loadings and thresholds for lift, stairs, and hhw to be different across groups (i.e. a less restricted model) (SAME AS BEFORE) ANALYSIS: TYPE = MEANSTRUCTURE; DIFFTEST IS c:\teaching\40.658.2007\deriv.dat; MODEL: mobility BY lift* walk@ stairs-hhw; MODEL black: mobility BY lift; [lift$ lift$2 lift$3]; {lift@}; mobility BY stairs; [stairs$ stairs$2 stairs$3]; {stairs@} See output file: catcfad.out mobility BY transfer; [transfer$ transfer$2 transfer$3]; {transfer@}; mobility BY hhw; [hhw$ hhw$2 hhw$3]; {hhw@}; SAVEDATA: DIFFTEST is c:\teaching\40.658.2007\deriv.dat; OUTPUT: SAMPSTAT;

Example: Structural Models for Mobility and Frailty TITLE: this is an example of a multiple group CFA with covariates and categorical factor indicators for mobility and frailty and a threshold structure DATA: FILE IS c:\teaching\40.658.2007\catna.dat; VARIABLE: NAMES ARE baseid age race educ disease shrink strength speed exhaust physical lift walk stairs transfer hhw; USEVARIABLES ARE race age educ disease shrink-hhw; CATEGORICAL ARE shrink-hhw; GROUPING IS race (0=white =black); ANALYSIS: TYPE = MEANSTRUCTURE; MODEL: frailty BY shrink-physical; mobility BY lift* walk@ stairs-hhw; mobility ON frailty; mobility frailty ON age educ disease; MODEL black: mobility BY lift; [lift$ lift$2 lift$3]; {lift@}; mobility BY stairs; [stairs$ stairs$2 stairs$3]; {stairs@}; mobility BY transfer; [transfer$ transfer$2 transfer$3]; {transfer@}; mobility BY hhw; [hhw$ hhw$2 hhw$3]; {hhw@}; See output file: catreg.out frailty BY strength; [strength$]; {strength@};