A Markov chain Monte Carlo approach to confirmatory item factor analysis. Michael C. Edwards The Ohio State University

Similar documents
Overview. Multidimensional Item Response Theory. Lecture #12 ICPSR Item Response Theory Workshop. Basics of MIRT Assumptions Models Applications

Some Issues In Markov Chain Monte Carlo Estimation For Item Response Theory

Journal of Statistical Software

IRT Model Selection Methods for Polytomous Items

Lesson 7: Item response theory models (part 2)

Whats beyond Concerto: An introduction to the R package catr. Session 4: Overview of polytomous IRT models

A DISSERTATION SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY. Yu-Feng Chang

Preliminary Manual of the software program Multidimensional Item Response Theory (MIRT)

An Overview of Item Response Theory. Michael C. Edwards, PhD

Item Response Theory (IRT) Analysis of Item Sets

Making the Most of What We Have: A Practical Application of Multidimensional Item Response Theory in Test Scoring

Latent variable models: a review of estimation methods

Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation. EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016

Item Parameter Calibration of LSAT Items Using MCMC Approximation of Bayes Posterior Distributions

A model of skew item response theory

ABSTRACT. Yunyun Dai, Doctor of Philosophy, Mixtures of item response theory models have been proposed as a technique to explore

Journal of Statistical Software

JMASM27: An Algorithm for Implementing Gibbs Sampling for 2PNO IRT Models (Fortran)

Multidimensional Linking for Tests with Mixed Item Types

Application of Plausible Values of Latent Variables to Analyzing BSI-18 Factors. Jichuan Wang, Ph.D

IRT linking methods for the bifactor model: a special case of the two-tier item factor analysis model

Monte Carlo Simulations for Rasch Model Tests

A Marginal Maximum Likelihood Procedure for an IRT Model with Single-Peaked Response Functions

Introduction To Confirmatory Factor Analysis and Item Response Theory

BAYESIAN MODEL CHECKING STRATEGIES FOR DICHOTOMOUS ITEM RESPONSE THEORY MODELS. Sherwin G. Toribio. A Dissertation

Empirical Validation of the Critical Thinking Assessment Test: A Bayesian CFA Approach

Comparison between conditional and marginal maximum likelihood for a class of item response models

Item Response Theory for Scores on Tests Including Polytomous Items with Ordered Responses

Basic IRT Concepts, Models, and Assumptions

Equating Tests Under The Nominal Response Model Frank B. Baker

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report

COMPARISON OF CONCURRENT AND SEPARATE MULTIDIMENSIONAL IRT LINKING OF ITEM PARAMETERS

2 Bayesian Hierarchical Response Modeling

An Introduction to the DA-T Gibbs Sampler for the Two-Parameter Logistic (2PL) Model and Beyond

LSAC RESEARCH REPORT SERIES. Law School Admission Council Research Report March 2008

SENSITIVITY ANALYSIS OF PRIOR SPECIFICATION FOR THE PROBIT-NORMAL IRT MODEL: AN EMPIRICAL STUDY

BAYESIAN METHODS FOR VARIABLE SELECTION WITH APPLICATIONS TO HIGH-DIMENSIONAL DATA

Summer School in Applied Psychometric Principles. Peterhouse College 13 th to 17 th September 2010

Monte Carlo in Bayesian Statistics

BAYESIAN IRT MODELS INCORPORATING GENERAL AND SPECIFIC ABILITIES

Bayesian Networks in Educational Assessment Tutorial

References. Markov-Chain Monte Carlo. Recall: Sampling Motivation. Problem. Recall: Sampling Methods. CSE586 Computer Vision II

Multilevel Statistical Models: 3 rd edition, 2003 Contents

flexmirt R : Flexible Multilevel Multidimensional Item Analysis and Test Scoring

A new family of asymmetric models for item response theory: A Skew-Normal IRT family

Bayesian Nonparametric Rasch Modeling: Methods and Software

A Nonlinear Mixed Model Framework for Item Response Theory

Markov Chain Monte Carlo methods

Bayesian inference for multivariate extreme value distributions

Markov-Chain Monte Carlo

An Equivalency Test for Model Fit. Craig S. Wells. University of Massachusetts Amherst. James. A. Wollack. Ronald C. Serlin

Markov Chain Monte Carlo

Contents. Part I: Fundamentals of Bayesian Inference 1

Reminder of some Markov Chain properties:

JORGE L. BAZAN Department of Sciences, Pontifical Catholic University of Peru.

Bayesian Inference and MCMC

A Use of the Information Function in Tailored Testing

Assessing Factorial Invariance in Ordered-Categorical Measures

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models

Fitting Multidimensional Latent Variable Models using an Efficient Laplace Approximation

Computational statistics

A Note on the Equivalence Between Observed and Expected Information Functions With Polytomous IRT Models

A Bivariate Generalized Linear Item Response Theory Modeling Framework to the Analysis of Responses and Response Times

PIRLS 2016 Achievement Scaling Methodology 1

Application of Item Response Theory Models for Intensive Longitudinal Data

Bayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence

On the Use of Nonparametric ICC Estimation Techniques For Checking Parametric Model Fit

A Study of Statistical Power and Type I Errors in Testing a Factor Analytic. Model for Group Differences in Regression Intercepts

STAT 425: Introduction to Bayesian Analysis

Down by the Bayes, where the Watermelons Grow

Diagnostic Classification Models: Psychometric Issues and Statistical Challenges

Bayesian Methods for Machine Learning

NESTED LOGIT MODELS FOR MULTIPLE-CHOICE ITEM RESPONSE DATA UNIVERSITY OF TEXAS AT AUSTIN UNIVERSITY OF WISCONSIN-MADISON

SAMPLE SIZE IN EXPLORATORY FACTOR ANALYSIS WITH ORDINAL DATA

Comparing Multi-dimensional and Uni-dimensional Computer Adaptive Strategies in Psychological and Health Assessment. Jingyu Liu

Walkthrough for Illustrations. Illustration 1

Probabilistic Graphical Networks: Definitions and Basic Results

Ruth E. Mathiowetz. Chapel Hill 2010

Comparing IRT with Other Models

Educational and Psychological Measurement

Multilevel IRT using dichotomous and polytomous response data

Bayesian Analysis of Latent Variable Models using Mplus

GENERALIZED LATENT TRAIT MODELS. 1. Introduction

Markov Chain Monte Carlo, Numerical Integration

Multidimensional Computerized Adaptive Testing in Recovering Reading and Mathematics Abilities

A Review of Pseudo-Marginal Markov Chain Monte Carlo

Bayesian Estimation of Input Output Tables for Russia

Development and Calibration of an Item Response Model. that Incorporates Response Time

Multidimensional item response theory observed score equating methods for mixed-format tests

General structural model Part 2: Categorical variables and beyond. Psychology 588: Covariance structure and factor models

SCORING TESTS WITH DICHOTOMOUS AND POLYTOMOUS ITEMS CIGDEM ALAGOZ. (Under the Direction of Seock-Ho Kim) ABSTRACT

Bayesian philosophy Bayesian computation Bayesian software. Bayesian Statistics. Petter Mostad. Chalmers. April 6, 2017

Old and new approaches for the analysis of categorical data in a SEM framework

Tutorial on Probabilistic Programming with PyMC3

Stochastic Approximation Methods for Latent Regression Item Response Models

ST 740: Markov Chain Monte Carlo

STA 4273H: Statistical Machine Learning

Theory of Stochastic Processes 8. Markov chain Monte Carlo

Local Dependence Diagnostics in IRT Modeling of Binary Data

Applied Psychological Measurement 2001; 25; 283

Transcription:

A Markov chain Monte Carlo approach to confirmatory item factor analysis Michael C. Edwards The Ohio State University

An MCMC approach to CIFA

Overview Motivating examples Intro to Item Response Theory (IRT) Review of IRT estimation history Current estimation challenges Markov chain Monte Carlo (MCMC) MCMC applied to IRT Some Results

Motivating Examples SAT Diagnostic Scores Quality of Life Measurement Dimensionality Scoring

Item Response Theory IRT is a collection of latent variable models that explain the process by which people respond to items in terms of item and person parameters.

-Parameter Normal Ogive Model One of the most widely used Appropriate for dichotomous item responses when guessing is not present T(Item Response) 0.0 0. 0.4 0.6 0.8 1.0-3 - -1 0 1 3 Theta P( y = 1 θ ) = Φ[ a ( θ b j j j )] P( y = 1 θ ) = Φ[ a θ d j j j ]

Samejima s Graded Model Widely used in psychology Appropriate for ordered categorical data T(Item Response) 0.0 0. 0.4 0.6 0.8 1.0-3 - -1 0 1 3 Theta P( y = c θ ) = Φ[ a ( θ b )] Φ[ a ( θ b )] j j jc j jc+ 1 P( y j = c θ ) = Φ[ a jθ ( d j + o jc )] Φ[ a jθ ( d j + o jc+ 1)]

IRT Parameter Estimation Heuristic Estimation Lord (195), Lord & Novick (1968) Joint Maximum Likelihood Lord (1953), Rasch (1960), Birnbaum (1968) Maximum Marginal Likelihood Bock & Lieberman (1970) Maximum Marginal Likelihood with an EM algorithm Bock & Aitkin (1981) Bayes Modal Estimation with an EM algorithm Mislevy (1986)

Potential Uses Item Analysis Scale Development Scoring Linking and Equating Computerized Adaptive Testing (CAT)

Moving to Multiple Dimensions TESTFACT Exploratory item factor analysis for dichotomous models Bi-factor model Uses MML-EM and different methods of numerical integration SEM approaches WLS, DWLS, UBN, etc.

Old Problems, New Solutions Curse of Dimensionality

Old Problems, New Solutions Curse of Dimensionality How to handle high dimensional integration?

Markov Chain Monte Carlo MCMC estimation can be thought of as Monte Carlo integration using Markov chains Monte Carlo integration works by simulating samples from a target distribution and then computing averages to replace expectations Simulated values from target distribution - generated by constructing a Markov chain with the target as its stationary distribution

How to Simulate from Target Distribution? Metropolis Hastings Gibbs Sampling Data Augmented Gibbs Sampling (DAG) Metropolis Hastings within Gibbs (MHwG)

MCMC and IRT Albert (199) First published application of MCMC to IRT Patz & Junker (1996) MHwG for IRT models Early forays into MCMC for MIRT Béguin & Glas (001); Segall (00) Shi & Lee (1998); Arminger & Muthén (1998)

Dissertation Research An MCMC Approach to Confirmatory Item Factor Analysis

Comparing MCMC Approaches MCMC with GRM Authors a d o DA-Gibbs? MHwG? Combinations? A & C (93) DAG DAG DAG Cowles (96) DAG DAG MHwG J & A (99) DAG DAG MHwG Fox (04) DAG - MHwG W et al (0) MHwG MHwG MHwG MCMC with 3PNO Authors a d g P & J (99) MHwG MHwG MHwG W et al (00) MHwG MHwG MHwG B & G (01) DAG DAG DAG Segall (0) DAG DAG DAG Sahu (0) DAG DAG DAG

Performance Components Examine: Parameter recovery Autocorrelations and effective sample size Mixing of chains Time per cycle

Next Steps Make it work, make it fast, make it pretty R too slow, move to C++ Multiple correlated factors with independent clustering Cross loadings Mixed item types

Example 1 F1 F F3 F4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4 4

10000 0000 30000 40000 50000 60000 Cycle 1. 1.3 1.4 1.5 1.6 Item 1 Slope Item 1 Slope 1.3 1.5 Frequency 0 00 400 10000 0000 30000 40000 50000 60000 Cycle -1.7-1.6-1.5-1.4-1.3-1. Item 1 Intercept Item 1 Intercept -1.6-1.4 Frequency 0 00 400 10000 0000 30000 40000 50000 60000 Cycle 0.9 1.0 1.1 1. Item 1 Offset Item 1 Offset 0.90 1.10 Frequency 0 00 400 10000 0000 30000 40000 50000 60000 Cycle.8.9 3.0 3.1 3. 3.3 3.4 Item 1 Offset 3 Item 1 Offset 3.8 3.1 3.4 Frequency 0 00 400

Example 1 Results (RMSE) N = 000 MCMC Multilog WLS a 0.05 0.05 0.06 d 0.08 0.10 0.10 o 0.04 0.05 0.06 o 3 0.07 0.08 0.10 r 0.0-0.03

Example 1 Results (RMSE) N = 000 N = 500 MCMC Multilog WLS MCMC Multilog WLS a 0.05 0.05 0.06 0.09 0.11 0.7 d 0.08 0.10 0.10 0.10 0.1 0.18 o 0.04 0.05 0.06 0.10 0.10 0.18 o 3 0.07 0.08 0.10 0.15 0.16 0.3 r 0.0-0.03 0.03-0.09

Example S1 G S 3 3 3 3 3

Example 3 F1 F

Example 3 - Slopes 0.40 0.30 0.0 0.10 0.00 Difference -0.10-0.0-0.30-0.40-0.50-0.60-0.70 0.00 0.50 1.00 1.50.00.50 Generating Slope

Example 3 - Slopes 0.40 0.30 0.0 0.10 0.00 Difference -0.10-0.0-0.30-0.40-0.50 Item 3-0.60-0.70 0.00 0.50 1.00 1.50.00.50 Generating Slope

Example 3 Intercepts 1.00 0.50 Difference 0.00-0.50-1.00-1.50-6.00-4.00 -.00 0.00.00 4.00 6.00 Generating Intercept

Example 3 Intercepts 1.00 0.50 Difference 0.00-0.50-1.00 Item 3-1.50-6.00-4.00 -.00 0.00.00 4.00 6.00 Generating Intercept

Example 3 Lower Asymptotes 0.10 0.05 Difference 0.00-0.05-0.10-0.15 0.00 0.05 0.10 0.15 0.0 0.5 0.30 0.35 0.40 Generating Lower Asymptote

Example 4 F1 F S1 S S3 S4

Example 4 - Reflection 0 e+00 e+04 4 e+04 6 e+04 8 e+04 1 e+05 Cycle -1.5-1.0-0.5 0.0 0.5 1.0 1.5 Item 40 Specific Slope Item 40 Specific Slope -1.0 0.0 1.0 Frequency 0 100 00 300 0 e+00 e+04 4 e+04 6 e+04 8 e+04 1 e+05 Cycle - -1 0 1 Item 41 Specific Slope Item 41 Specific Slope -1.5-0.5 0.5 1.5 Frequency 0 100 300 0 e+00 e+04 4 e+04 6 e+04 8 e+04 1 e+05 Cycle - -1 0 1 Item 4 Specific Slope Item 4 Specific Slope -1.5 0.0 1.0 Frequency 0 00 400 600

Example 4 Correlation Troubles R(,1) 0.66 0.70 0.74 Frequency 0 100 00 300 0 e+00 e+04 4 e+04 6 e+04 8 e+04 1 e+05 0.64 0.66 0.68 0.70 0.7 0.74 Cycle R(,1) R(5,) -0.1 0.1 0.3 Frequency 0 100 00 300 0 e+00 e+04 4 e+04 6 e+04 8 e+04 1 e+05 Cycle -0.1 0.0 0.1 0. 0.3 0.4 0.5 R(5,) R(6,) -0. 0. 0 e+00 e+04 4 e+04 6 e+04 8 e+04 1 e+05 Cycle Frequency 0 100 00 300-0.4-0. 0.0 0. 0.4 0.6 R(6,)

Example 4 Slopes 0.60 0.40 0.0 Difference 0.00-0.0-0.40-0.60-0.80 0.00 0.50 1.00 1.50.00.50 Generating Slope

Example 4 Intercepts 1.0 1.00 0.80 0.60 Difference 0.40 0.0 0.00-0.0-0.40-0.60-5.00-4.00-3.00 -.00-1.00 0.00 1.00.00 3.00 4.00 Generating Intercept

Example 4 Lower Asymptotes 0.15 0.10 0.05 Difference 0.00-0.05-0.10-0.15 0.00 0.05 0.10 0.15 0.0 0.5 0.30 0.35 0.40 0.45 Generating Lower Asymptote

Example 5 F1 F F3 F4 F5 F6 F7 F8

Example 5 Slopes 0.30 0.0 0.10 Difference 0.00-0.10-0.0-0.30 0.60 0.70 0.80 0.90 1.00 1.10 1.0 1.30 1.40 1.50 Generating Slope

Example 5 Intercepts 0.30 0.0 0.10 Difference 0.00-0.10-0.0-0.30-0.40-3.00 -.00-1.00 0.00 1.00.00 3.00 Generating Intercept

Example 5 Lower Asymptotes 0.10 0.08 0.06 0.04 Difference 0.0 0.00-0.0-0.04-0.06-0.08 0.10 0.15 0.0 0.5 0.30 Generating Lower Asymptote

Beta Weight = 0 0.10 0.08 0.06 0.04 Difference 0.0 0.00-0.0-0.04-0.06-0.08-0.10 0.10 0.15 0.0 0.5 0.30 Generating Lower Asymptote

Example 5 Correlations 0.0 0.01 0.00-0.01 Difference -0.0-0.03-0.04-0.05-0.06-0.07 0.00 0.0 0.40 0.60 0.80 1.00 Generating Correlation

E5 Continued: More Uniform R 0.04 0.03 0.0 0.01 Difference 0.00-0.01-0.0-0.03-0.04-0.05 0.0 0.30 0.40 0.50 0.60 0.70 0.80 Generating

5 Most Poorly Recovered Items 1 0.9 0.8 0.7 Probability 0.6 0.5 0.4 0.3 0. 0.1 0-4 -3 - -1 0 1 3 4 Theta Generated Item 1 Generated Item 9 Generated Item 38 Generated Item 54 Generated Item 65 Estimated Item 1 Estimated Item 9 Estimated Item 38 Estimated Item 54 Estimated Item 65

Future Directions Software Dissemination Projects MCMC variant comparison Confirmatory item factor analysis and applications MCMC vs. MML-EM, WLS, etc. Extensions Structural Model Model Fit

Concluding Thoughts Item factor analysis is a useful method for the social sciences IRT framework provides an advantageous platform for item factor analysis MCMC can be used to estimate parameters for more complex IRT models

Acknowledgements Dave Thissen Dissertation Committee Ken Bollen, Patrick Curran, Andrea Hussong, & Bud MacCallum Jean-Paul Fox Psychometric Society (& Judges)

Thank You. edwards.134@osu.edu http://faculty.psy.ohio-state.edu/edwards/