Advanced Structural Equations Models I

Similar documents
Inference using structural equations with latent variables

Simple Linear Regression. John McGready Johns Hopkins University

An Introduction to Mplus and Path Analysis

An Introduction to Path Analysis

Introduction to Structural Equations

Path Analysis. PRE 906: Structural Equation Modeling Lecture #5 February 18, PRE 906, SEM: Lecture 5 - Path Analysis

SEM for Categorical Outcomes

Sampling Variability and Confidence Intervals. John McGready Johns Hopkins University

Introduction to Structural Equation Modeling

Multiple Group CFA Invariance Example (data from Brown Chapter 7) using MLR Mplus 7.4: Major Depression Criteria across Men and Women (n = 345 each)

Section B. The Theoretical Sampling Distribution of the Sample Mean and Its Estimate Based on a Single Sample

ADVANCED C. MEASUREMENT INVARIANCE SEM REX B KLINE CONCORDIA

Lecture 27. December 13, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.

From last time... The equations

1 Hypothesis testing for a single mean

2/26/2017. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2

Introduction to Structural Equation Modeling Dominique Zephyr Applied Statistics Lab

Outline. 1. Define likelihood 2. Interpretations of likelihoods 3. Likelihood plots 4. Maximum likelihood 5. Likelihood ratio benchmarks

Longitudinal Invariance CFA (using MLR) Example in Mplus v. 7.4 (N = 151; 6 items over 3 occasions)

Chapter 5. Introduction to Path Analysis. Overview. Correlation and causation. Specification of path models. Types of path models

Factor Analysis & Structural Equation Models. CS185 Human Computer Interaction

Multilevel Structural Equation Modeling

Lecture 21. December 19, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University.

Statistics for laboratory scientists II

SC705: Advanced Statistics Instructor: Natasha Sarkisian Class notes: Introduction to Structural Equation Modeling (SEM)

Online Appendices for: Modeling Latent Growth With Multiple Indicators: A Comparison of Three Approaches

Structural Equation Modeling and Confirmatory Factor Analysis. Types of Variables

Factor Analysis. Qian-Li Xue

Introduction to Confirmatory Factor Analysis

Structural equation modeling

Chapter 8. Models with Structural and Measurement Components. Overview. Characteristics of SR models. Analysis of SR models. Estimation of SR models

Specifying Latent Curve and Other Growth Models Using Mplus. (Revised )

Title. Description. Remarks and examples. stata.com. stata.com. Variable notation. methods and formulas for sem Methods and formulas for sem

Estimation of Curvilinear Effects in SEM. Rex B. Kline, September 2009

4. Path Analysis. In the diagram: The technique of path analysis is originated by (American) geneticist Sewell Wright in early 1920.

Overview. 1. Terms and Definitions. 2. Model Identification. 3. Path Coefficients

SEM 2: Structural Equation Modeling

How to run the RI CLPM with Mplus By Ellen Hamaker March 21, 2018

2013 IAP. Chapter 2. Feedback Loops and Formative Measurement. Rex B. Kline

SEM with observed variables: parameterization and identification. Psychology 588: Covariance structure and factor models

Weighted Least Squares

Exploring Cultural Differences with Structural Equation Modelling

Review of CLDP 944: Multilevel Models for Longitudinal Data

Chapter 10. Regression. Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania

Confirmatory Factor Analysis. Psych 818 DeShon

Weighted Least Squares

SEM 2: Structural Equation Modeling

Model Estimation Example

An Introduction to Multilevel Models. PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 25: December 7, 2012

Supplemental material to accompany Preacher and Hayes (2008)

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.

Describing Change over Time: Adding Linear Trends

Path Analysis Examples. Outline

Model Assumptions; Predicting Heterogeneity of Variance

Variance. Standard deviation VAR = = value. Unbiased SD = SD = 10/23/2011. Functional Connectivity Correlation and Regression.

An Introduction to Path Analysis

A Re-Introduction to General Linear Models (GLM)

Misspecification in Nonrecursive SEMs 1. Nonrecursive Latent Variable Models under Misspecification

Outline

Latent Variable Analysis

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

Condition 9 and 10 Tests of Model Confirmation with SEM Techniques

Lecture 11 Multiple Linear Regression

General Linear Model (Chapter 4)

Probability measures A probability measure, P, is a real valued function from the collection of possible events so that the following

Dr. Junchao Xia Center of Biophysics and Computational Biology. Fall /1/2016 1/46

Variance component models part I

EVALUATION OF STRUCTURAL EQUATION MODELS

y response variable x 1, x 2,, x k -- a set of explanatory variables

Supplemental material for Autoregressive Latent Trajectory 1

How well do Fit Indices Distinguish Between the Two?

An Introduction to Structural Equation Modeling with the sem Package for R 1. John Fox

STAT 3A03 Applied Regression With SAS Fall 2017

Simple Linear Regression: One Quantitative IV

Comparing Change Scores with Lagged Dependent Variables in Models of the Effects of Parents Actions to Modify Children's Problem Behavior

Copyright 2013 The Guilford Press

Psychology 454: Latent Variable Modeling How do you know if a model works?

General structural model Part 1: Covariance structure and identification. Psychology 588: Covariance structure and factor models

Example. Test for a proportion

Hypothesis Testing for Var-Cov Components

Using Mplus individual residual plots for. diagnostics and model evaluation in SEM

Structural Equation Modeling Lab 5 In Class Modification Indices Example

Factor Analysis: An Introduction. What is Factor Analysis? 100+ years of Factor Analysis FACTOR ANALYSIS AN INTRODUCTION NILAM RAM

Longitudinal Modeling with Logistic Regression

Measurement Invariance (MI) in CFA and Differential Item Functioning (DIF) in IRT/IFA

Simple Linear Regression: One Qualitative IV

Mplus Short Courses Day 2. Growth Modeling With Latent Variables Using Mplus

Principal Component Analysis-I Geog 210C Introduction to Spatial Data Analysis. Chris Funk. Lecture 17

THE GENERAL STRUCTURAL EQUATION MODEL WITH LATENT VARIATES

Multilevel Structural Equation Model with. Gifi System in Understanding the. Satisfaction of Health Condition at Java

Confirmatory Factor Analysis

STRUCTURAL EQUATION MODELING. Khaled Bedair Statistics Department Virginia Tech LISA, Summer 2013

STAT 730 Chapter 9: Factor analysis

Nonrecursive Models Highlights Richard Williams, University of Notre Dame, Last revised April 6, 2015

Step 2: Select Analyze, Mixed Models, and Linear.

Structural Model Equivalence

36-309/749 Experimental Design for Behavioral and Social Sciences. Dec 1, 2015 Lecture 11: Mixed Models (HLMs)

A study on new customer satisfaction index model of smart grid

FinQuiz Notes

Inference for Regression

Transcription:

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this site. Copyright 2007, The Johns Hopkins University and Qian-Li Xue. All rights reserved. Use of these materials permitted only in accordance with license rights granted. Materials provided AS IS ; no representations or warranties provided. User assumes all responsibility for use, and all liability related thereto, and must independently review all materials for accuracy and efficacy. May contain materials owned by others. User is responsible for obtaining permissions for use from third parties as needed.

Advanced Structural Equations Models I Statistics for Psychosocial Research II: Structural Models Qian-Li Xue

No Ordinary Regression Test of causal hypotheses? Yes SEM (Origin: Path Models) Yes Continuous endogenous var. and Continuous LV? Yes No Categorical indicators and Categorical LV? No Classic SEM Latent Class Reg. Latent Trait Yes Longitudinal Data? No Latent Profile Adv. SEM I: latent growth curves) Yes Multilevel Data? No Adv. SEM II: Multilevel Models Classic SEM

Outline 1. Estimating means of observed and latent variables 2. Modeling repeated measures of outcome over time The Simplex-Growth Over Time 3. Non-Recursive Models 4. Modeling repeated measures of outcome and covariate over time Cross-Lag Panel Analysis Latent Growth Curve Models (Next Lecture)

1. Estimating Means of Observed and Latent Variables

Estimating Means of Observed and Latent Variables So far, we have largely ignored intercept terms in our analyses What has happened to the alpha coefficient?

Estimating Means of Observed and Latent Variables Up to now, information on means and intercepts has not been of interest It is possible to estimate levels of association without information on these parameters If of interest, these parameters can be estimated using a mean model. In addition to covariances, these models also require information on mean of variables These parameters are of key interest in group comparisons and growth curve models

Estimating Means of Observed and Latent Variables Does the mean score on the latent variable ξ (e.g. depression) differ between men and women? Man d 1 e Women ξ 11 a b c a b c ξ 12 0.6 0.8 0.7 0.6 0.8 0.7 X 11 X 12 X 13 X 21 Y 22 X 23.64.36.51.64.36.51 4.0 5.0 6.0 4.3 5.4 6.35 Resid. Var. Means (Loehlin p.139)

Estimating Means of Observed and Latent Variables Man ξ 11 d 1 a b c a b c e Women ξ 12 d=0 (reference) a=4.0,b=5.0,c=6.0 (baseline values, same across groups) 0.6 0.8 0.7 0.6 0.8 0.7 e difference between the means of the latent variable X 11 X 12 X 13 X 21 Y 22 X 23 e*0.6+a=4.3 e=0.5.64.36.51.64.36.51 4.0 5.0 6.0 4.3 5.4 6.35 Resid. Var. Means (Loehlin p.139)

Example: Stress, Resources, and Depression (Holahan & Moos, 1991) How do the high-stressor and the low-stressor groups compare on the two latent variables: depression (D) and resources (R) High-Stressor 1 h i a b f g j D k l r c d R e DM DF SC EG FS m n o p q

Example: Stress, Resources, and Depression (Holahan & Moos, 1991) High-stressor group: above diagonal (underlined) Low-stressor group: below diagonal DM DF SC EG FS SD M Depressed Mood 1.84 -.36 -.45 -.51 5.97 8.82 Depressive Features.71 1 -.32 -.41 -.50 7.98 13.87 Self-confidence -.35 -.16 1.26.47 3.97 15.24 Easygoingness -.35 -.21.11 1.34 2.27 7.92 Family support -.38 -.26.30.28 1 4.91 19.03 Standard Deviation 4.84 6.33 3.84 2.14 4.43 N 128 Mean 6.15 9.96 15.14 8.80 20.43 126

Example: Stress, Resources, and Depression (Holahan & Moos, 1991) Low-Stressor MPLUS code 1 h i a DM m f b DF n g j r D R c l d k e SC EG FS o p q TITLE: Stress, resources, and depression (Loehlin, p.142) DATA: FILE is c:/teaching/140.658.2007/depression.dat; TYPE IS CORRELATION MEANS STDEVIATIONS; NOBSERVATIONS ARE 126 128; NGROUPS=2; VARIABLE: NAMES ARE DM DF SC EG FS; USEVARIABLES ARE DM-FS; MODEL: D BY DM* DF; R BY SC* EG FS; DM (1); DF (2); SC (3); EG (4); FS (5); MODEL g1: [D@0 R@0]; D@1 R@1; OUTPUT: TECH1; Equate the measurement models across the groups Set reference group (i.e. lowstressor)

Example: Stress, Resources, and Depression (Holahan & Moos, 1991) Low- Stressor High- Stressor Latent Variables Path Coeff. Measurement Model Residual Var. Baseline means Depression: Mean f [0]* a 4.42 m 2.91 h 6.09 Resources: Mean g [0] b 5.22 n 16.04 i 10.27 Depression: SD [1] c 1.56 o 11.76 j 15.59 Resources: SD [1] d 1.01 p 3.61 k 8.61 correlation r -0.72 e 2.67 q 12.25 l 20.40 Depression: Mean f 0.63 Resources: Mean g -0.50 Depression: SD 1.30 Resources: SD 1.29 correlation r -0.78 Same as above * Numbers in [ ] are prefixed in order to make the model identified

Example: Stress, Resources, and Depression (Holahan & Moos, 1991) TESTS OF MODEL FIT Chi-Square Test of Model Fit Value 27.245 Degrees of Freedom 19 P-Value 0.0991 CFI/TLI CFI 0.979 TLI 0.978 RMSEA (Root Mean Square Error Of Approximation) Estimate 0.058 90 Percent C.I. 0.000 0.104 SRMR (Standardized Root Mean Square Residual) Value 0.055 The model fits reasonably well to the data!

2. Modeling Repeated Measures of Outcome Over Time

The Simplex-Growth Over Time Modeling growth over (e.g. height) Measurements taken repeatedly over time In general, measurements made closer together in time would be more highly correlated (called simplex by Guttman, 1954) E.g. Smaller Correlation 1 2 3 4 1 1 0.73 0.72 0.68 2 1 0.79 0.76 3 1 0.84 4 1

The Simplex-Growth Over Time Example: Scores on standardized tests of academic achievement at grades 1-7 (Bracht & Hopkins, 1972) Test score (Y) is a measure of the latent academic achievement (η) Achievement at grade t is a function of achievement at t-1 via β, and other factors ζ ζ 2 ζ 3 ζ 4 ζ 5 ζ 6 ζ 7 β 21 β 32 β 43 β 54 β 65 β 76 η1 η2 η3 η4 η5 η6 1 1 1 1 1 1 η7 1 Y1 Y2 Y3 Y4 Y5 Y6 Y7 ε 1 ε 2 ε 3 ε 4 ε 5 ε 6 ε 7 Loehlin p.125

The Simplex-Growth Over Time ζ 2 ζ 3 ζ 4 ζ 5 ζ 6 ζ 7 η1 β 21 η2 β 32 η3 β 43 η4 β 54 η5 β 65 η6 β 76 1 1 1 1 1 1 η7 1 Y1 Y2 Y3 Y4 Y5 Y6 Y7 ε 1 ε 2 ε 3 ε 4 ε 5 ε 6 ε 7 Y i η i = η + ε = i β η i i i 1 + ς i ε i are uncorrelated, ε i η i, and ζ i η i-1

The Simplex-Growth Over Time Var(η 1 ), Var(ζ 7 ), Var(ε 1 ), Var(ε 2 ), β 21 are unidentified To achieve identification, set Var(ε 1 )=Var(ε 2 ) AND Var(ε 6 )=Var(ε 7 ), reasonable if Ys are on the same scale # free parameters = 3p-3, where p=# of Ys For testing a simplex model, p>3!!! ζ 2 ζ 3 ζ 4 ζ 5 ζ 6 ζ 7 η1 β 21 η2 β 32 η3 β 43 η4 β 54 η5 β 65 η6 β 76 1 1 1 1 1 1 η7 1 Y1 Y2 Y3 Y4 Y5 Y6 Y7 ε 1 ε 2 ε 3 ε 4 ε 5 ε 6 ε 7

3. Non-Recursive Models

Non-Recursive Models So far, there has been little discussion of models with feedback loops Non-recursive models deal with reciprocal causal relationships Can not be analyzed by ordinary regression analysis due to correlated errors Non-recursive models may not be identified even if the T-rule is met

Non-Recursive Models Time 1 Time 2 A A A B B B Reciprocal Lagged What do you mean by reciprocal causation? Alternative: Lagged model Assumption: the principal of finite causal lag Roles of the variables in the bidirectional relationship change over time (e.g. A is a cause at Time 1, but effect at Time 2) The reciprocal causation model becomes the only choice if only cross-sectional data are available

Non-Recursive Models: Model Identification Recall: recursive path models without measurement error are always identified Not true for non-recursive models Definition: Instrumental variable a predictor is an instrument for an endogenous variable if it has a direct path to other endogenous variables but not the endogenous variable of interest X1 Y1 X2 X3 is an instrument for Y1 X3 Y2 Maruyama, 1998; p.106

Non-Recursive Models: Model Identification Order condition (necessary but not sufficient) For any system of N endogenous variables, a particular equation is identified only if at least N-1 variables are left out of that equation Rank condition (necessary AND sufficient) is met for a particular equation if there is at least one non-zero determinant of rank N-1 from the coefficients of the variables omitted from that equation X1 Y1 X2 X3 Y2 Maruyama, 1998; p.106

4. Modeling Repeated Measures of Outcome and Covariate Over Time

Cross-Lagged Panel Analysis: Terminology Time 1 Time 2 Synchronous correlations: Corr(X1,Y1) and Corr(X2,Y2) X1 e X1 X2 e X2 Autocorrelations (i.e. stability): Corr(X1,X2) and Corr(Y1,Y2) Cross-lagged: Corr(X1,Y2) and Corr(Y1,X2) e Y1 Y1 e Y2 Y2 Residual correlations (due to measure-specific variance): Corr(e x1,e X2 ) and Corr(e Y1,e Y2 ) Here Corr. denotes total correlation!

Cross-Lagged Panel Analysis: Identification Time 1 Time 2 Is this model identified? # equations = 4*5/2=10 e X1 # unknowns = 11 e X2 Not identified! X1 Y1 X2 Y2 e Y2 What is the problem? The repeated assessment of the same measure leads to two sources of common variance construct variance Measure-specific variance Model would be identified if delete residual correlations or Build multiple-indicator models e Y1

Cross-Lagged Panel Analysis: Key Issues (Maruyama, pp.112-120) Time 1 Time 2 1. Stability of a variable For example, if Y is perfectly stable, Y2 is perfectly determined by Y1 Y1 e X2 Y2 X2 e Y2 If data is only available at Time 2, then Y1 is not available Any variable correlated with Y or caused by Y could be included as predictors, leading to a misspecified model! Low stability over time may result from poor reliability (if so, we re in trouble!) or Real change in the measure

Cross-Lagged Panel Analysis: Key Issues Time 1 Time 2 2. Temporal Lags How long is the causal lag? e Y1 X1 Y1 e X1 e Y2 X2 Y2 e Y2 It the sampling interval > causal lag attenuated effect If the sampling interval < causal lag no effect or underestimated effect What if the causal lag from X1 to Y2 is different from Y1 to X2? Solution: three-wave data with different intervals

Cross-Lagged Panel Analysis: Key Issues 3. Growth Across Time When to use covariance vs. correlation data in SEM Covariance allows for growth by focusing on raw scores Correlation focuses on standardized relationships If no change in variability of any of the variables over time, the results are identical Using covariance is highly recommended!

Cross-Lagged Panel Analysis: Key Issues 3. Stability of Causal Process Causal dynamics between variables remain stable across time intervals of the same length If not true, the relationships would differ depending on the particular interval sampled On the other hand, modeling unstable processes may be warranted when studying Developmental processes Time-varying interventions

Cross-Lagged Panel Analysis with Latent Variables: Example 0.39 0.53 0.52 0.54 Nervous or upset Often get scared Nervous or upset Often get scared 0.63 0.73 Grade 7 Anxiety 0.51 Grade 8 Anxiety 0.69 0.73 0.63 0.72 0.73 Grade 9 Anxiety 0.64 Nervous or upset Often get scared 0.48 0.53 (Ma & Xu, Journal of Adolescence 27 (2): 165-179 APR 2004 )

Cross-Lagged Panel Analysis with Latent Variables: Example 0.77 0.31 0.46 0.64 Basic skills Algebra Geometry Literacy 0.88 0.56 0.68 0.80 Grade 7 Achieve 0.98 Grade 8 Achieve 0.92 0.89 0.88 0.85 0.90 Basic skills Algebra Geometry Literacy 0.79 0.77 0.72 0.81 (Ma & Xu, Journal of Adolescence 27 (2): 165-179 APR 2004 )

Anxiety Grade 0.39 0.39 0.55 0.55 0.57 0.57 0.59 0.59 0.57 0.57 7 8 9 10 10 11 11 12 12-0.05-0.05-0.01-0.02-0.02-0.20-0.12-0.14-0.15-0.11 7 7 8 9 10 10 11 11 12 12 0.98 0.98 0.91 0.91 0.95 0.95 0.97 0.97 0.97 0.97 Achievement Grade Example of cross-lagged panel analysis with latent variables. Structural equation model estimating the causal relationship between mathematics anxiety & mathematics achievement across Grades 7 12. Large ovals represent latent factors & unidirectional arrows represent casual links. All parameter estimates for unidirectional paths are standardized. Pink boxes indicated P < 0.001). Adapted from Ma & Xu, Journal of Adolescence 2004;27:165-179