Old and new approaches for the analysis of categorical data in a SEM framework
|
|
- Katherine Farmer
- 6 years ago
- Views:
Transcription
1 Old and new approaches for the analysis of categorical data in a SEM framework Yves Rosseel Department of Data Analysis Belgium Myrsini Katsikatsou Department of Statistics London Scool of Economics UK Meeting of the Working Group Structural Equation Modeling 26 February 2015 Freie Universität Berlin Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 1 / 32
2 two approaches for handling categorical data in a SEM framework limited information approach only univariate and bivariate information is used mainly developed in the SEM literature perhaps the best known method: three-stage least squares (in Mplus: estimator WLSMV) new approach: pairwise likelihood estimation full information approach all information is used frequentist approach: marginal maximum likelihood estimation requires numerical integration (number of dimensions = number of latent variables) mainly developed in the IRT literature (and GLMM literature) only recently incorporated in modern SEM software Bayesian approach Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 2 / 32
3 example SEM framework: u = binary, o = ordered, y = numeric u1 y1 y2 y3 u2 u3 u4 o1 o2 o3 f2 f3 f1 o4 x1 x2 Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 3 / 32
4 full information approach three approaches: 1. marginal maximum likelihood (MML) 2. latent response approach 3. (Bayesian estimation) Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 4 / 32
5 full information approach: marginal maximum likelihood origins: IRT models (eg Bock & Lieberman, 1970) and GLMMs the marginal likelihood for the response vector y i can be written as L i (θ) = f(y i x i ; θ) = f(y i η, x i ; θ)f(η x i ; θ)dη D(η) where y i are observed endogenous variables, x i are observed exogenous covariates, and η are latent variables; D(η) is the domain of integration; θ is the parameter vector numerical integration Gauss-Hermite quadrature adaptive quadrature Laplace approximation Monte Carlo integration some clever dimension reduction techniques exist for special cases Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 5 / 32
6 available software for the marginal maximum likelihood approach commercial software: SEM software: Mplus,... IRT software: BILOG-MG, MUTLILOG, PARSCALE, TESTFACT, EQSIRT, IRTPRO, flexmirt,... non-commercial, open-source software the Stata module gllamm R packages for IRT: TAM, mirt,... (see the CRAN Task View: Psychometric Models and Methods) and lme4 R packages for SEM: OpenMx, lavaan (since , but slow) Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 6 / 32
7 full information approach: latent response approach (1) an observed variable y can often be viewed as a partial observation of a latent continuous response y ; eg ordinal variable with K = 4 response categories: t2 t1 t3 y=1 y=2 y=3 y= latent continuous response y* Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 7 / 32
8 full information approach: latent response approach (2) assumption: both latent continuous responses (y ) and latent variables (η) are multivariate normal the likelihood contribution for observation i is given by L i (θ) = f(y i x i ; θ) = N [y i µ i (θ), Σ i (θ)] dy i T (y i ) where y i are observed endogenous variables, x i are observed exogenous covariates; T (y i ) is the integration region (defined by the thresholds) the order of integration equals the number of (non-continuous) observed variables some examples in the literature exist, up to 4 variables Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 8 / 32
9 available software for the full information latent response approach commercial software: none? non-commercial, open-source software R package lavaan (since version ) estimator="fml" integration is done by the sadmvn() function in the R package mnormt no analytical gradient (for now) just for fun Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 9 / 32
10 limited information approaches 1. three stage least squares (Mplus WLSMV) 2. pairwise likelihood estimation Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 10 / 32
11 the three stage least squares estimator developed by Bengt Muthén, in a series of papers; the seminal paper is Muthén, B. (1984). A general structural equation model with dichotomous, ordered categorical, and continuous latent variable indicators. Psychometrika, 49, this approach has been the golden standard in the SEM literature for almost three decades first available in LISCOMP (Linear Structural Equations using a Comprehensive Measurement Model), distributed by SSI, follow up program: Mplus (Version 1: 1998), currently version 7.3 other authors (Jöreskog 1994; Lee, Poon, Bentler 1992) have proposed similar approaches (implemented in LISREL and EQS respectively) another great program: MECOSA (Arminger, G., Wittenberg, J., Schepers, A.) written in the GAUSS language (mid 90 s) Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 11 / 32
12 stage 1 estimating the thresholds estimating the thresholds: maximum likelihood using univariate data if no exogenous variables, this is just # generate ordered data with 4 categories Y <- sample(1:4, size = 100, replace = TRUE) prop <- table(y)/sum(table(y)) cprop <- c(0, cumsum(prop)) th <- qnorm(cprop) in the presence of exogenous covariates, this is just ordered probit regression library(mass) X1 <- rnorm(100); X2 <- rnorm(100); X3 <- rnorm(100) fit <- polr(ordered(y) X1 + X2 + X3, method = "probit") fit$zeta Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 12 / 32
13 stage 2 estimating tetrachoric, polychoric,..., correlations estimate tetrachoric/polychoric/... correlation from bivariate data: tetrachoric (binary binary) polychoric (ordered ordered) polyserial (ordered numeric) biserial (binary numeric) pearson (numeric numeric) ML estimation is available (see eg. Olsson 1979 and 1982) two-step: first estimate thresholds using univariate information only; then, keeping the thresholds fixed, estimate the correlation one-step: estimate thresholds and correlation simultaneously if exogenous covariates are involved, the correlations are based on the residual values of y (eg bivariate probit regression) Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 13 / 32
14 stage 3 estimating the SEM model third stage uses weighted least squares: F W LS = (s ˆσ) W 1 (s ˆσ) where s and ˆσ are vectors containing all relevant sample-based and modelbased statistics respectively s contains: thresholds, correlations, optionally regression slopes of exogenous covariates, optionally variances and means of continuous variables the weight matrix W is (a consistent estimator of) the asymptotic covariance matrix of the sample statistics (s) robust version: WLSMV use the diagonal of W only for estimation (DWLS) use the full matrix for inference (standard errors and test statistic) MV stands for the Satterthwaite s mean and variance corrected test statistic Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 14 / 32
15 available software for the WLSMV estimator commercial software: golden standard: Mplus (since 1998) LISREL and EQS have similar capabilities (but less general) MECOSA (mid 90s, not available anymore) non-commercial, open-source software R package lavaan (since version 0.5) estimator="wlsmv" is the default estimator if some of the observed (endogenous) variables are categorical full implementation including delta and theta parameterization for multiple groups and/or longitudinal data Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 15 / 32
16 pairwise likelihood (PL) estimation special case of the broader framework of composite likelihood estimation key idea: the complex likelihood is broken down as a (weighted) product of component likelihoods which are easier to handle (computationally) composite ML estimators are asymptotically unbiased, consistent, and normally distributed key references: Lindsay, B. (1998). Composite likelihood methods. Contemporary Mathematics, 80, Varin, C. (2008). On composite marginal likelihoods. Advances in Statistical Analysis, 92(1), introduced in the SEM literature by Jöreskog & Moustaki (2001), De Leon (2005), Liu (2007), Xi (2011), Katsikatsou et al. (2012) computational complexity can be kept low regardless the number of observed and latent variables Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 16 / 32
17 pairwise likelihood (PL) estimation in SEM (1) in PL estimation, all model parameters are estimated in a single step for a random sample of N observations, the pairwise loglikelihood (pl) is defined as the sum of all bivariate log-likelihood functions given by: N pl (θ; y) = ln L (θ; (y in, y i n)) = n=1 i<i = c i c i n (y i = a, y i = b) ln π (y i = a, y i = b; θ), i<i where a=1 b=1 π (y i = a, y i = b; θ) = τi,a τ i,a 1 τi,b τ i,b 1 f (y i, y i ) dy i dy i. robust standard errors are based on the Godambe/sandwich information Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 17 / 32
18 pairwise likelihood (PL) estimation in SEM (2) a recent simulation study illustrates the many pleasant properties of PL: bias and MSE of PL estimators and their (sandwich type) standard errors are found to be small in all experimental conditions, and decreasing with the sample size Katsikatsou, M., Moustaki, I., Yang-Wallentin, F., & Jöreskog, K. G. (2012). Pairwise likelihood estimation for factor analysis models with ordinal data. Computational Statistics & Data Analysis, 56(12), a follow-up study illustrates how PL can be used in a large SEM setting (7 latent variables, many indicators) available in lavaan since (dec 2012) estimation and robust standard errors only no support for mixed item types no support for exogenous covariates Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 18 / 32
19 inference under PL in a SEM framework reference: inference: Katsikatsou, M., Moustaki, I. (under revision). Inference under pairwise likelihood for structural equation models with ordinal variables. Wald test Pairwise Likelihood Ratio Test (PLRT) for overall fit PL-AIC and PL-BIC PLRT for comparing nested models available in the development version of lavaan (0.5-18) Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 19 / 32
20 lavaan example simulated dataset (N = 500) 7 latent variables (2 endogenous, 5 exogenous) 45 ordinal indicators (4 response categories) structural part: timings: ξ 6 ξ 1 + ξ 2 + ξ 3 + ξ 4 + ξ 5 ξ 7 ξ 5 + ξ 6 lavaan estimator = PML : currently takes about 36 minutes (3 min estimation, 17min standard errors, 16min test statistic) lavaan estimator = WLSMV about 3 minutes Mplus estimator = ML, integration = montecarlo (700), default settings: 1h 17min, but failed with THE MODEL ESTIMATION DID NOT TERMINATE NORMALLY Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 20 / 32
21 lavaan input library(lavaan) Data <- read.csv("rx.ord") Data[,] <- lapply(data[,], ordered) simmodel <- # exogenous lv ksi1 = V1 + V2 + V3 + V4 + V5 ksi2 = V6 + V7 + V8 ksi3 = V9 + V10 + V11 + V12 + V13 ksi4 = V14 + V15 + V16 ksi5 = V17 + V18 + V19 + V20 + V21 # endogenous lv ksi6 = V22 + V23 + V24 + V25 + V26 + V27 + V28 + V29 + V30 + V31 + V32 + V33 + V34 + V35 + V36 + V37 + V38 ksi7 = V39 + V40 + V41 + V42 + V43 + V44 + V45 # structural model ksi6 ksi1 + ksi2 + ksi3 + ksi4 + ksi5 ksi7 ksi5 +ksi6 fit <- sem(model = simmodel, data = Data, estimator = "PML") Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 21 / 32
22 lavaan output (header only) lavaan ( ) converged normally after 105 iterations Number of observations 500 Estimator PML Robust Minimum Function Test Statistic Degrees of freedom P-value NA Scaling correction factor for the mean and variance adjusted correction Parameter estimates: Information Standard Errors Observed Robust.huber.white Estimate Std.err Z-value P(> z ) Latent variables: ksi1 = V V V V V ksi2 =... Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 22 / 32
23 last slide PL estimation is a (relatively) new approach for handling categorical data in a SEM framework PL can handle a large number of observed and latent variables PL has many attractive statistical properties support for the full SEM framework the R package lavaan: still catching-up in the full information ML area (but wait for 0.6!) full support for WLSMV (and friends) PL is currently only implemented in lavaan no support for inference in multiple groups yet! complete data only (PL with missing values is ongoing research) Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 23 / 32
24 Thank you! (questions?) Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 24 / 32
25 PL estimation: likelihood basic assumption: ( y i y i ) N 2 (( 0 0 ), ( 1 ρ i i 1 )) the pl for N independent observations: N pl (θ; y) = ln L (θ; (y in, y i n)) = n=1 i<i = c i c i n (y i = a, y i = b) ln π (y i = a, y i = b; θ), i<i where a=1 b=1 τi,a π (y i = a, y i = b; θ) = τ i,a 1 τi,b τ i,b 1 f (y i, y i ) dy i dy i. Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 25 / 32
26 properties of the pairwise likelihood estimator ˆθ P L = maxpl (θ; y) θ under regularity conditions upon the component likelihoods, N ( ˆθP L θ) N ( 0, G 1 (θ) ) where and G(θ) = H(θ)J 1 (θ)h(θ) H(θ) = E { 2 J(θ) = V ar } pl(θ; y) } θ θ { pl(θ; y) θ Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 26 / 32
27 PLRT for nested models (1) let θ be partitioned as θ = (ψ, ω ), where ψ : vector of parameters of interest of dimension d, ω : vector of nuisance parameters. consider the hypothesis: Pace et al. (2011), H 0 : ψ = ψ 0 vs H 1 : ψ ψ 0 P LRT (ψ 0 ) = 2 ( ( ) ( )) pl ˆθ pl θ, where ˆθ = ( ˆψ, ˆω ) and θ = (ψ 0, ω ψ 0 ) are the PL estimators under H 1 and H 0, respectively. the standard asymptotic result that, under H 0, P LRT χ 2 diff used. cannot be Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 27 / 32
28 PLRT for nested models (2) instead, we use a Satterthwaite approximation; under H 0, E (P LRT (ψ 0 )) 1 2 V ar (P LRT (ψ 0)) P LRT (ψ 0) χ 2 v [E(P LRT (ψ0))]2 1 2 V ar(p LRT (ψ0)), where v = ( E (P LRT (ψ 0 )) tr G [ ψψ H ψψ] ) 1, V ar (P LRT (ψ 0 )) 2 tr ( G ψψ [ H ψψ] 1 G ψψ [ H ψψ] 1 ), G ψψ and H ψψ are the parts of G 1 and H 1 that refer to ψ. Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 28 / 32
29 PL version of AIC and BIC AIC P L based on Varin & Vidoni (2005): ( ) AIC P L = 2 pl ˆθP L ; y + 2 tr(ĵ( ˆθ P L )Ĥ 1 ( ˆθ P L )) BIC P L based on Gao and Song (2009): ( ) BIC P L = 2 pl ˆθP L ; y + log N tr(ĵ( ˆθ P L )Ĥ 1 ( ˆθ P L )) Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 29 / 32
30 PLRT for overall fit (1) let θ be partitioned as θ = (ϕ, τ ), where τ : the vector of thresholds, ϕ : the vector of the rest SEM parameters, dimension d. recall P Corr (y ); let ρ = vech (P ), dimension p consider the hypothesis: where g : R d R p H 0 : ρ = g(ϕ) versus H 1 : ρ unconstrained under H 1, the parameter vector ϑ is partitioned as ϑ = (ρ, τ ) τ : nuisance parameter Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 30 / 32
31 PLRT for overall fit (2) let ( ( ) ( )) P LRT SEM = 2 pl ˆϑ pl ˆθ, where ˆϑ = ( ˆρ, ˆτ ) and ˆθ = ( ˆϕ, ˆτ ) are the PL estimates under H 1 and H 0, respectively under H 0, where E (P LRT SEM ) 1 2 V ar (P LRT SEM) P LRT SEM χ 2 v v = [E (P LRT SEM)] V ar (P LRT SEM) Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 31 / 32
32 PLRT for overall fit (3) mean: ( E (P LRT SEM ) tr G ρρ [H ρρ ] 1) tr (G ϕϕ [H ϕϕ ] 1) variance: V ar (P LRT SEM ) 2 tr (G ρρ [H ρρ ] 1 G ρρ [H ρρ ] 1) + 2 tr (G ϕϕ [H ϕϕ ] 1 G ϕϕ [H ϕϕ ] 1) 4 tr (M [H ρρ ] 1 MG ϕϕ [H ϕϕ ] 1 G ϕϕ) where M = ϕ g (ϕ) ϕ=ϕ0 Yves Rosseel Old and new approaches for the analysis of categorical data in a SEM framework 32 / 32
Pairwise Likelihood Estimation for factor analysis models with ordinal data
Working Paper 2011:4 Department of Statistics Pairwise Likelihood Estimation for factor analysis models with ordinal data Myrsini Katsikatsou Irini Moustaki Fan Yang-Wallentin Karl G. Jöreskog Working
More informationComputationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models
Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models Tihomir Asparouhov 1, Bengt Muthen 2 Muthen & Muthen 1 UCLA 2 Abstract Multilevel analysis often leads to modeling
More informationComposite Likelihood Estimation for Latent Variable Models with Ordinal and Continuous, or Ranking Variables
Digital Comprehensive Summaries of Uppsala Dissertations from the Faculty of Social Sciences 86 Composite Likelihood Estimation for Latent Variable Models with Ordinal and Continuous, or Ranking Variables
More informationTutorial. Myrsini Katsikatsou Department of Statistics London School of Economics and Political Science, UK
Tutorial The Pairwise Likelihood Method for Structural Equation Modelling with ordinal variables and data with missing values using the R package lavaan Myrsini Katsikatsou Department of Statistics London
More informationBayesian Analysis of Latent Variable Models using Mplus
Bayesian Analysis of Latent Variable Models using Mplus Tihomir Asparouhov and Bengt Muthén Version 2 June 29, 2010 1 1 Introduction In this paper we describe some of the modeling possibilities that are
More informationNesting and Equivalence Testing
Nesting and Equivalence Testing Tihomir Asparouhov and Bengt Muthén August 13, 2018 Abstract In this note, we discuss the nesting and equivalence testing (NET) methodology developed in Bentler and Satorra
More informationLatent variable models: a review of estimation methods
Latent variable models: a review of estimation methods Irini Moustaki London School of Economics Conference to honor the scientific contributions of Professor Michael Browne Outline Modeling approaches
More informationINTRODUCTION TO STRUCTURAL EQUATION MODELS
I. Description of the course. INTRODUCTION TO STRUCTURAL EQUATION MODELS A. Objectives and scope of the course. B. Logistics of enrollment, auditing, requirements, distribution of notes, access to programs.
More informationCentering Predictor and Mediator Variables in Multilevel and Time-Series Models
Centering Predictor and Mediator Variables in Multilevel and Time-Series Models Tihomir Asparouhov and Bengt Muthén Part 2 May 7, 2018 Tihomir Asparouhov and Bengt Muthén Part 2 Muthén & Muthén 1/ 42 Overview
More informationMaximum Likelihood Estimation; Robust Maximum Likelihood; Missing Data with Maximum Likelihood
Maximum Likelihood Estimation; Robust Maximum Likelihood; Missing Data with Maximum Likelihood PRE 906: Structural Equation Modeling Lecture #3 February 4, 2015 PRE 906, SEM: Estimation Today s Class An
More informationA Composite Likelihood Approach for Factor Analyzing Ordinal Data
A Composite Likelihood Approach for Factor Analyzing Ordinal Data Dissertation Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy in the Graduate School of The Ohio
More informationRidge Structural Equation Modeling with Correlation Matrices for Ordinal and Continuous Data. Ke-Hai Yuan University of Notre Dame
Ridge Structural Equation Modeling with Correlation Matrices for Ordinal and Continuous Data Ke-Hai Yuan University of Notre Dame Rui Wu Beihang University Peter M. Bentler University of California, Los
More informationCorrelations with Categorical Data
Maximum Likelihood Estimation of Multiple Correlations and Canonical Correlations with Categorical Data Sik-Yum Lee The Chinese University of Hong Kong Wal-Yin Poon University of California, Los Angeles
More informationOverview. Multidimensional Item Response Theory. Lecture #12 ICPSR Item Response Theory Workshop. Basics of MIRT Assumptions Models Applications
Multidimensional Item Response Theory Lecture #12 ICPSR Item Response Theory Workshop Lecture #12: 1of 33 Overview Basics of MIRT Assumptions Models Applications Guidance about estimating MIRT Lecture
More informationLesson 7: Item response theory models (part 2)
Lesson 7: Item response theory models (part 2) Patrícia Martinková Department of Statistical Modelling Institute of Computer Science, Czech Academy of Sciences Institute for Research and Development of
More informationComposite Likelihood
Composite Likelihood Nancy Reid January 30, 2012 with Cristiano Varin and thanks to Don Fraser, Grace Yi, Ximing Xu Background parametric model f (y; θ), y R m ; θ R d likelihood function L(θ; y) f (y;
More informationComposite Likelihood Estimation
Composite Likelihood Estimation With application to spatial clustered data Cristiano Varin Wirtschaftsuniversität Wien April 29, 2016 Credits CV, Nancy M Reid and David Firth (2011). An overview of composite
More informationGeneral structural model Part 2: Categorical variables and beyond. Psychology 588: Covariance structure and factor models
General structural model Part 2: Categorical variables and beyond Psychology 588: Covariance structure and factor models Categorical variables 2 Conventional (linear) SEM assumes continuous observed variables
More informationA Markov chain Monte Carlo approach to confirmatory item factor analysis. Michael C. Edwards The Ohio State University
A Markov chain Monte Carlo approach to confirmatory item factor analysis Michael C. Edwards The Ohio State University An MCMC approach to CIFA Overview Motivating examples Intro to Item Response Theory
More informationComposite likelihood methods
1 / 20 Composite likelihood methods Nancy Reid University of Warwick, April 15, 2008 Cristiano Varin Grace Yun Yi, Zi Jin, Jean-François Plante 2 / 20 Some questions (and answers) Is there a drinks table
More informationSEM for Categorical Outcomes
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
More informationFactor Analysis. Qian-Li Xue
Factor Analysis Qian-Li Xue Biostatistics Program Harvard Catalyst The Harvard Clinical & Translational Science Center Short course, October 7, 06 Well-used latent variable models Latent variable scale
More informationFitting Multidimensional Latent Variable Models using an Efficient Laplace Approximation
Fitting Multidimensional Latent Variable Models using an Efficient Laplace Approximation Dimitris Rizopoulos Department of Biostatistics, Erasmus University Medical Center, the Netherlands d.rizopoulos@erasmusmc.nl
More informationLatent Variable Centering of Predictors and Mediators in Multilevel and Time-Series Models
Latent Variable Centering of Predictors and Mediators in Multilevel and Time-Series Models Tihomir Asparouhov and Bengt Muthén August 5, 2018 Abstract We discuss different methods for centering a predictor
More informationNELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation
NELS 88 Table 2.3 Adjusted odds ratios of eighth-grade students in 988 performing below basic levels of reading and mathematics in 988 and dropping out of school, 988 to 990, by basic demographics Variable
More informationPlausible Values for Latent Variables Using Mplus
Plausible Values for Latent Variables Using Mplus Tihomir Asparouhov and Bengt Muthén August 21, 2010 1 1 Introduction Plausible values are imputed values for latent variables. All latent variables can
More informationIntroduction to Within-Person Analysis and RM ANOVA
Introduction to Within-Person Analysis and RM ANOVA Today s Class: From between-person to within-person ANOVAs for longitudinal data Variance model comparisons using 2 LL CLP 944: Lecture 3 1 The Two Sides
More informationFactor Analysis and Latent Structure of Categorical Data
Factor Analysis and Latent Structure of Categorical Data Irini Moustaki Athens University of Economics and Business Outline Objectives Factor analysis model Literature Approaches Item Response Theory Models
More informationCHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA
Examples: Multilevel Modeling With Complex Survey Data CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA Complex survey data refers to data obtained by stratification, cluster sampling and/or
More informationEmpirical Validation of the Critical Thinking Assessment Test: A Bayesian CFA Approach
Empirical Validation of the Critical Thinking Assessment Test: A Bayesian CFA Approach CHI HANG AU & ALLISON AMES, PH.D. 1 Acknowledgement Allison Ames, PhD Jeanne Horst, PhD 2 Overview Features of the
More informationGeneralized Linear Models for Non-Normal Data
Generalized Linear Models for Non-Normal Data Today s Class: 3 parts of a generalized model Models for binary outcomes Complications for generalized multivariate or multilevel models SPLH 861: Lecture
More informationCenter for Advanced Studies in Measurement and Assessment. CASMA Research Report
Center for Advanced Studies in Measurement and Assessment CASMA Research Report Number 41 A Comparative Study of Item Response Theory Item Calibration Methods for the Two Parameter Logistic Model Kyung
More informationStructural Equation Modeling and Confirmatory Factor Analysis. Types of Variables
/4/04 Structural Equation Modeling and Confirmatory Factor Analysis Advanced Statistics for Researchers Session 3 Dr. Chris Rakes Website: http://csrakes.yolasite.com Email: Rakes@umbc.edu Twitter: @RakesChris
More informationBasic IRT Concepts, Models, and Assumptions
Basic IRT Concepts, Models, and Assumptions Lecture #2 ICPSR Item Response Theory Workshop Lecture #2: 1of 64 Lecture #2 Overview Background of IRT and how it differs from CFA Creating a scale An introduction
More informationA Re-Introduction to General Linear Models (GLM)
A Re-Introduction to General Linear Models (GLM) Today s Class: You do know the GLM Estimation (where the numbers in the output come from): From least squares to restricted maximum likelihood (REML) Reviewing
More informationMeasurement Invariance (MI) in CFA and Differential Item Functioning (DIF) in IRT/IFA
Topics: Measurement Invariance (MI) in CFA and Differential Item Functioning (DIF) in IRT/IFA What are MI and DIF? Testing measurement invariance in CFA Testing differential item functioning in IRT/IFA
More informationVariable-Specific Entropy Contribution
Variable-Specific Entropy Contribution Tihomir Asparouhov and Bengt Muthén June 19, 2018 In latent class analysis it is useful to evaluate a measurement instrument in terms of how well it identifies the
More informationLatent variable interactions
Latent variable interactions Bengt Muthén & Tihomir Asparouhov Mplus www.statmodel.com November 2, 2015 1 1 Latent variable interactions Structural equation modeling with latent variable interactions has
More informationSTAT 730 Chapter 9: Factor analysis
STAT 730 Chapter 9: Factor analysis Timothy Hanson Department of Statistics, University of South Carolina Stat 730: Multivariate Data Analysis 1 / 15 Basic idea Factor analysis attempts to explain the
More informationGlobal Model Fit Test for Nonlinear SEM
Global Model Fit Test for Nonlinear SEM Rebecca Büchner, Andreas Klein, & Julien Irmer Goethe-University Frankfurt am Main Meeting of the SEM Working Group, 2018 Nonlinear SEM Measurement Models: Structural
More informationSRMR in Mplus. Tihomir Asparouhov and Bengt Muthén. May 2, 2018
SRMR in Mplus Tihomir Asparouhov and Bengt Muthén May 2, 2018 1 Introduction In this note we describe the Mplus implementation of the SRMR standardized root mean squared residual) fit index for the models
More informationCourse Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model
Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model PSYC 943 (930): Fundamentals of Multivariate Modeling Lecture 1: August 22, 2012
More informationGENERALIZED LATENT TRAIT MODELS. 1. Introduction
PSYCHOMETRIKA VOL. 65, NO. 3, 391 411 SEPTEMBER 2000 GENERALIZED LATENT TRAIT MODELS IRINI MOUSTAKI AND MARTIN KNOTT LONDON SCHOOL OF ECONOMICS AND POLITICAL SCIENCE In this paper we discuss a general
More informationIdentifying and accounting for outliers and extreme response patterns in latent variable modelling
Identifying and accounting for outliers and extreme response patterns in latent variable modelling Irini Moustaki Athens University of Economics and Business Outline 1. Define the problem of outliers and
More informationGeneralized Linear. Mixed Models. Methods and Applications. Modern Concepts, Walter W. Stroup. Texts in Statistical Science.
Texts in Statistical Science Generalized Linear Mixed Models Modern Concepts, Methods and Applications Walter W. Stroup CRC Press Taylor & Francis Croup Boca Raton London New York CRC Press is an imprint
More informationCourse Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model
Course Introduction and Overview Descriptive Statistics Conceptualizations of Variance Review of the General Linear Model EPSY 905: Multivariate Analysis Lecture 1 20 January 2016 EPSY 905: Lecture 1 -
More informationTime-Invariant Predictors in Longitudinal Models
Time-Invariant Predictors in Longitudinal Models Today s Class (or 3): Summary of steps in building unconditional models for time What happens to missing predictors Effects of time-invariant predictors
More informationFall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.
1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n
More informationMultilevel Statistical Models: 3 rd edition, 2003 Contents
Multilevel Statistical Models: 3 rd edition, 2003 Contents Preface Acknowledgements Notation Two and three level models. A general classification notation and diagram Glossary Chapter 1 An introduction
More informationANALYSIS OF TWO-LEVEL STRUCTURAL EQUATION MODELS VIA EM TYPE ALGORITHMS
Statistica Sinica 8(1998), 749-766 ANALYSIS OF TWO-LEVEL STRUCTURAL EQUATION MODELS VIA EM TYPE ALGORITHMS Sik-Yum Lee and Wai-Yin Poon Chinese University of Hong Kong Abstract: In this paper, the maximum
More informationComparing IRT with Other Models
Comparing IRT with Other Models Lecture #14 ICPSR Item Response Theory Workshop Lecture #14: 1of 45 Lecture Overview The final set of slides will describe a parallel between IRT and another commonly used
More informationComparison between conditional and marginal maximum likelihood for a class of item response models
(1/24) Comparison between conditional and marginal maximum likelihood for a class of item response models Francesco Bartolucci, University of Perugia (IT) Silvia Bacci, University of Perugia (IT) Claudia
More informationLikelihood-Based Methods
Likelihood-Based Methods Handbook of Spatial Statistics, Chapter 4 Susheela Singh September 22, 2016 OVERVIEW INTRODUCTION MAXIMUM LIKELIHOOD ESTIMATION (ML) RESTRICTED MAXIMUM LIKELIHOOD ESTIMATION (REML)
More informationReview. Timothy Hanson. Department of Statistics, University of South Carolina. Stat 770: Categorical Data Analysis
Review Timothy Hanson Department of Statistics, University of South Carolina Stat 770: Categorical Data Analysis 1 / 22 Chapter 1: background Nominal, ordinal, interval data. Distributions: Poisson, binomial,
More informationRecap. Vector observation: Y f (y; θ), Y Y R m, θ R d. sample of independent vectors y 1,..., y n. pairwise log-likelihood n m. weights are often 1
Recap Vector observation: Y f (y; θ), Y Y R m, θ R d sample of independent vectors y 1,..., y n pairwise log-likelihood n m i=1 r=1 s>r w rs log f 2 (y ir, y is ; θ) weights are often 1 more generally,
More informationRuth E. Mathiowetz. Chapel Hill 2010
Evaluating Latent Variable Interactions with Structural Equation Mixture Models Ruth E. Mathiowetz A thesis submitted to the faculty of the University of North Carolina at Chapel Hill in partial fulfillment
More informationEPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7
Introduction to Generalized Univariate Models: Models for Binary Outcomes EPSY 905: Fundamentals of Multivariate Modeling Online Lecture #7 EPSY 905: Intro to Generalized In This Lecture A short review
More informationThe Bayesian Approach to Multi-equation Econometric Model Estimation
Journal of Statistical and Econometric Methods, vol.3, no.1, 2014, 85-96 ISSN: 2241-0384 (print), 2241-0376 (online) Scienpress Ltd, 2014 The Bayesian Approach to Multi-equation Econometric Model Estimation
More information13. October p. 1
Lecture 8 STK3100/4100 Linear mixed models 13. October 2014 Plan for lecture: 1. The lme function in the nlme library 2. Induced correlation structure 3. Marginal models 4. Estimation - ML and REML 5.
More informationWHAT IS STRUCTURAL EQUATION MODELING (SEM)?
WHAT IS STRUCTURAL EQUATION MODELING (SEM)? 1 LINEAR STRUCTURAL RELATIONS 2 Terminología LINEAR LATENT VARIABLE MODELS T.W. Anderson (1989), Journal of Econometrics MULTIVARIATE LINEAR RELATIONS T.W. Anderson
More informationAssessing Factorial Invariance in Ordered-Categorical Measures
Multivariate Behavioral Research, 39 (3), 479-515 Copyright 2004, Lawrence Erlbaum Associates, Inc. Assessing Factorial Invariance in Ordered-Categorical Measures Roger E. Millsap and Jenn Yun-Tein Arizona
More informationThreshold models with fixed and random effects for ordered categorical data
Threshold models with fixed and random effects for ordered categorical data Hans-Peter Piepho Universität Hohenheim, Germany Edith Kalka Universität Kassel, Germany Contents 1. Introduction. Case studies
More informationMixed models in R using the lme4 package Part 7: Generalized linear mixed models
Mixed models in R using the lme4 package Part 7: Generalized linear mixed models Douglas Bates University of Wisconsin - Madison and R Development Core Team University of
More informationMISCELLANEOUS TOPICS RELATED TO LIKELIHOOD. Copyright c 2012 (Iowa State University) Statistics / 30
MISCELLANEOUS TOPICS RELATED TO LIKELIHOOD Copyright c 2012 (Iowa State University) Statistics 511 1 / 30 INFORMATION CRITERIA Akaike s Information criterion is given by AIC = 2l(ˆθ) + 2k, where l(ˆθ)
More informationGeneralized Linear Models
Generalized Linear Models Lecture 3. Hypothesis testing. Goodness of Fit. Model diagnostics GLM (Spring, 2018) Lecture 3 1 / 34 Models Let M(X r ) be a model with design matrix X r (with r columns) r n
More informationChapter 4: Factor Analysis
Chapter 4: Factor Analysis In many studies, we may not be able to measure directly the variables of interest. We can merely collect data on other variables which may be related to the variables of interest.
More informationA Very Brief Summary of Statistical Inference, and Examples
A Very Brief Summary of Statistical Inference, and Examples Trinity Term 2009 Prof. Gesine Reinert Our standard situation is that we have data x = x 1, x 2,..., x n, which we view as realisations of random
More informationInference using structural equations with latent variables
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this
More informationCitation for published version (APA): Jak, S. (2013). Cluster bias: Testing measurement invariance in multilevel data
UvA-DARE (Digital Academic Repository) Cluster bias: Testing measurement invariance in multilevel data Jak, S. Link to publication Citation for published version (APA): Jak, S. (2013). Cluster bias: Testing
More informationGreene, Econometric Analysis (6th ed, 2008)
EC771: Econometrics, Spring 2010 Greene, Econometric Analysis (6th ed, 2008) Chapter 17: Maximum Likelihood Estimation The preferred estimator in a wide variety of econometric settings is that derived
More informationOutline. Mixed models in R using the lme4 package Part 5: Generalized linear mixed models. Parts of LMMs carried over to GLMMs
Outline Mixed models in R using the lme4 package Part 5: Generalized linear mixed models Douglas Bates University of Wisconsin - Madison and R Development Core Team UseR!2009,
More informationHierarchical Generalized Linear Models. ERSH 8990 REMS Seminar on HLM Last Lecture!
Hierarchical Generalized Linear Models ERSH 8990 REMS Seminar on HLM Last Lecture! Hierarchical Generalized Linear Models Introduction to generalized models Models for binary outcomes Interpreting parameter
More informationLatent Class Analysis
Latent Class Analysis Karen Bandeen-Roche October 27, 2016 Objectives For you to leave here knowing When is latent class analysis (LCA) model useful? What is the LCA model its underlying assumptions? How
More informationDefault Priors and Effcient Posterior Computation in Bayesian
Default Priors and Effcient Posterior Computation in Bayesian Factor Analysis January 16, 2010 Presented by Eric Wang, Duke University Background and Motivation A Brief Review of Parameter Expansion Literature
More informationLinear models and their mathematical foundations: Simple linear regression
Linear models and their mathematical foundations: Simple linear regression Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/21 Introduction
More informationJournal of Multivariate Analysis. Use of prior information in the consistent estimation of regression coefficients in measurement error models
Journal of Multivariate Analysis 00 (2009) 498 520 Contents lists available at ScienceDirect Journal of Multivariate Analysis journal homepage: www.elsevier.com/locate/jmva Use of prior information in
More informationEndogenous Treatment Effects for Count Data Models with Endogenous Participation or Sample Selection
Endogenous Treatment Effects for Count Data Models with Endogenous Participation or Sample Selection Massimilano Bratti & Alfonso Miranda Institute of Education University of London c Bratti&Miranda (p.
More informationMisspecification in Nonrecursive SEMs 1. Nonrecursive Latent Variable Models under Misspecification
Misspecification in Nonrecursive SEMs 1 Nonrecursive Latent Variable Models under Misspecification Misspecification in Nonrecursive SEMs 2 Abstract A problem central to structural equation modeling is
More informationWalkthrough for Illustrations. Illustration 1
Tay, L., Meade, A. W., & Cao, M. (in press). An overview and practical guide to IRT measurement equivalence analysis. Organizational Research Methods. doi: 10.1177/1094428114553062 Walkthrough for Illustrations
More informationReview of CLDP 944: Multilevel Models for Longitudinal Data
Review of CLDP 944: Multilevel Models for Longitudinal Data Topics: Review of general MLM concepts and terminology Model comparisons and significance testing Fixed and random effects of time Significance
More informationPOLI 8501 Introduction to Maximum Likelihood Estimation
POLI 8501 Introduction to Maximum Likelihood Estimation Maximum Likelihood Intuition Consider a model that looks like this: Y i N(µ, σ 2 ) So: E(Y ) = µ V ar(y ) = σ 2 Suppose you have some data on Y,
More informationIntroduction to Generalized Models
Introduction to Generalized Models Today s topics: The big picture of generalized models Review of maximum likelihood estimation Models for binary outcomes Models for proportion outcomes Models for categorical
More informationMixed models in R using the lme4 package Part 5: Generalized linear mixed models
Mixed models in R using the lme4 package Part 5: Generalized linear mixed models Douglas Bates 2011-03-16 Contents 1 Generalized Linear Mixed Models Generalized Linear Mixed Models When using linear mixed
More informationStat 5101 Lecture Notes
Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random
More informationContinuous Time Survival in Latent Variable Models
Continuous Time Survival in Latent Variable Models Tihomir Asparouhov 1, Katherine Masyn 2, Bengt Muthen 3 Muthen & Muthen 1 University of California, Davis 2 University of California, Los Angeles 3 Abstract
More informationMixed models in R using the lme4 package Part 5: Generalized linear mixed models
Mixed models in R using the lme4 package Part 5: Generalized linear mixed models Douglas Bates Madison January 11, 2011 Contents 1 Definition 1 2 Links 2 3 Example 7 4 Model building 9 5 Conclusions 14
More informationTime-Invariant Predictors in Longitudinal Models
Time-Invariant Predictors in Longitudinal Models Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building strategies
More informationTime-Invariant Predictors in Longitudinal Models
Time-Invariant Predictors in Longitudinal Models Today s Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building
More informationStrati cation in Multivariate Modeling
Strati cation in Multivariate Modeling Tihomir Asparouhov Muthen & Muthen Mplus Web Notes: No. 9 Version 2, December 16, 2004 1 The author is thankful to Bengt Muthen for his guidance, to Linda Muthen
More informationGeneralized Models: Part 1
Generalized Models: Part 1 Topics: Introduction to generalized models Introduction to maximum likelihood estimation Models for binary outcomes Models for proportion outcomes Models for categorical outcomes
More informationPreface. List of examples
Contents Preface List of examples i xix 1 LISREL models and methods 1 1.1 The general LISREL model 1 Assumptions 2 The covariance matrix of the observations as implied by the LISREL model 3 Fixed, free,
More informationSTA 216, GLM, Lecture 16. October 29, 2007
STA 216, GLM, Lecture 16 October 29, 2007 Efficient Posterior Computation in Factor Models Underlying Normal Models Generalized Latent Trait Models Formulation Genetic Epidemiology Illustration Structural
More informationThe performance of estimation methods for generalized linear mixed models
University of Wollongong Research Online University of Wollongong Thesis Collection 1954-2016 University of Wollongong Thesis Collections 2008 The performance of estimation methods for generalized linear
More informationUppsala University and Norwegian School of Management, b Uppsala University, Online publication date: 08 July 2010
This article was downloaded by: [UAM University Autonoma de Madrid] On: 28 April 20 Access details: Access Details: [subscription number 93384845] Publisher Psychology Press Informa Ltd Registered in England
More informationNon-linear panel data modeling
Non-linear panel data modeling Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini May 2010 Laura Magazzini (@univr.it) Non-linear panel data modeling May 2010 1
More informationSEM with Non Normal Data: Robust Estimation and Generalized Models
SEM with Non Normal Data: Robust Estimation and Generalized Models Introduction to Structural Equation Modeling Lecture #10 April 4, 2012 ERSH 8750: Lecture 10 Today s Class Data assumptions of SEM Continuous
More informationFractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling
Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Jae-Kwang Kim 1 Iowa State University June 26, 2013 1 Joint work with Shu Yang Introduction 1 Introduction
More informationA Model for Correlated Paired Comparison Data
Working Paper Series, N. 15, December 2010 A Model for Correlated Paired Comparison Data Manuela Cattelan Department of Statistical Sciences University of Padua Italy Cristiano Varin Department of Statistics
More informationLasso Maximum Likelihood Estimation of Parametric Models with Singular Information Matrices
Article Lasso Maximum Likelihood Estimation of Parametric Models with Singular Information Matrices Fei Jin 1,2 and Lung-fei Lee 3, * 1 School of Economics, Shanghai University of Finance and Economics,
More informationVarious types of likelihood
Various types of likelihood 1. likelihood, marginal likelihood, conditional likelihood, profile likelihood, adjusted profile likelihood 2. semi-parametric likelihood, partial likelihood 3. empirical likelihood,
More information