Empirical Validation of the Critical Thinking Assessment Test: A Bayesian CFA Approach

Similar documents
Application of Plausible Values of Latent Variables to Analyzing BSI-18 Factors. Jichuan Wang, Ph.D

Plausible Values for Latent Variables Using Mplus

Bayesian Mixture Modeling

Bayesian Analysis of Latent Variable Models using Mplus

Bayesian SEM: A more flexible representation of substantive theory

Misspecification in Nonrecursive SEMs 1. Nonrecursive Latent Variable Models under Misspecification

Structural Equation Modeling and Confirmatory Factor Analysis. Types of Variables

Overview. Multidimensional Item Response Theory. Lecture #12 ICPSR Item Response Theory Workshop. Basics of MIRT Assumptions Models Applications

Running head: BSEM: SENSITIVITY TO THE PRIOR 1. Prior Sensitivity Analysis in Default Bayesian Structural Equation Modeling

Using Bayesian Priors for More Flexible Latent Class Analysis

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models

Measurement Invariance Testing with Many Groups: A Comparison of Five Approaches (Online Supplements)

Bayesian Networks in Educational Assessment

Bayesian inference for factor scores

Maximum Likelihood Estimation; Robust Maximum Likelihood; Missing Data with Maximum Likelihood

ST 740: Markov Chain Monte Carlo

SRMR in Mplus. Tihomir Asparouhov and Bengt Muthén. May 2, 2018

Web Appendix for The Dynamics of Reciprocity, Accountability, and Credibility

Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation. EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016

A Parameter Expansion Approach to Bayesian SEM Estimation

An Introduction to Mplus and Path Analysis

Bagging During Markov Chain Monte Carlo for Smoother Predictions

ABSTRACT. Roy Levy, Doctor of Philosophy, conditional independence assumptions of the hypothesized model do not hold. The

Multilevel Structural Equation Modeling

Markov Chain Monte Carlo methods

An Introduction to Path Analysis

Nesting and Equivalence Testing

Tools for Parameter Estimation and Propagation of Uncertainty

Bayesian Inference. Chapter 1. Introduction and basic concepts

A Markov chain Monte Carlo approach to confirmatory item factor analysis. Michael C. Edwards The Ohio State University

Assessing Factorial Invariance in Ordered-Categorical Measures

Centering Predictor and Mediator Variables in Multilevel and Time-Series Models

The Map of the Mplus Team. Advances in Latent Variable Modeling Using Mplus Version 7

Supplement to A Hierarchical Approach for Fitting Curves to Response Time Measurements

UvA-DARE (Digital Academic Repository)

Bayesian Inference for the Multivariate Normal

Introduction to Structural Equation Modeling

Multiple Group CFA Invariance Example (data from Brown Chapter 7) using MLR Mplus 7.4: Major Depression Criteria across Men and Women (n = 345 each)

Bayesian Estimation An Informal Introduction

Default Priors and Effcient Posterior Computation in Bayesian

Bayesian Methods for Machine Learning

Journal of Statistical Software

Penalized Loss functions for Bayesian Model Choice

Making the Most of What We Have: A Practical Application of Multidimensional Item Response Theory in Test Scoring

Comparing Priors in Bayesian Logistic Regression for Sensorial Classification of Rice

IRT Model Selection Methods for Polytomous Items

PARAMETER ESTIMATION: BAYESIAN APPROACH. These notes summarize the lectures on Bayesian parameter estimation.

Bayesian data analysis in practice: Three simple examples

Score-Based Tests of Measurement Invariance with Respect to Continuous and Ordinal Variables

Quantile POD for Hit-Miss Data

Journal of Statistical Software

MULTILEVEL IMPUTATION 1

Bayesian spatial hierarchical modeling for temperature extremes

CHAPTER 9 EXAMPLES: MULTILEVEL MODELING WITH COMPLEX SURVEY DATA

PACKAGE LMest FOR LATENT MARKOV ANALYSIS

BayesSummaryStatLM: An R package for Bayesian Linear Models for

Multilevel regression mixture analysis

Number of Subjects and Time Points Needed for Multilevel Time-Series Analysis: A Simulation Study of Dynamic Structural Equation Modeling

Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded

Ronald Christensen. University of New Mexico. Albuquerque, New Mexico. Wesley Johnson. University of California, Irvine. Irvine, California

eqr094: Hierarchical MCMC for Bayesian System Reliability

Dealing with rotational invariance in Bayesian confirmatory factor analysis

An Introduction to SEM in Mplus

BAYESIAN IRT MODELS INCORPORATING GENERAL AND SPECIFIC ABILITIES

Multilevel Statistical Models: 3 rd edition, 2003 Contents

Using Mplus individual residual plots for. diagnostics and model evaluation in SEM

Related Concepts: Lecture 9 SEM, Statistical Modeling, AI, and Data Mining. I. Terminology of SEM

Introduction to Confirmatory Factor Analysis

LCA Distal Stata function users guide (Version 1.0)

General structural model Part 2: Categorical variables and beyond. Psychology 588: Covariance structure and factor models

BAYESIAN INFERENCE FOR A COVARIANCE MATRIX

PRESENTATION TITLE. Is my survey biased? The importance of measurement invariance. Yusuke Kuroki Sunny Moon November 9 th, 2017

Introduction to Structural Equation Modeling Dominique Zephyr Applied Statistics Lab

How well do Fit Indices Distinguish Between the Two?

Statistical Methods in Particle Physics Lecture 1: Bayesian methods

Specifying Latent Curve and Other Growth Models Using Mplus. (Revised )

Item Parameter Calibration of LSAT Items Using MCMC Approximation of Bayes Posterior Distributions

Variable-Specific Entropy Contribution

LCA_Distal_LTB Stata function users guide (Version 1.1)

STAT 518 Intro Student Presentation

MEASUREMENT UNCERTAINTY AND SUMMARISING MONTE CARLO SAMPLES

arxiv: v2 [stat.co] 17 Nov 2016

Disk Diffusion Breakpoint Determination Using a Bayesian Nonparametric Variation of the Errors-in-Variables Model

CFA Loading Estimation and Comparison Example Joel S Steele, PhD

Basics of Modern Missing Data Analysis

Longitudinal Invariance CFA (using MLR) Example in Mplus v. 7.4 (N = 151; 6 items over 3 occasions)

Bayesian Methods in Multilevel Regression

Confirmatory Factor Analysis and Structural Equation Modeling Group Differences: Measurement Invariance.

The sbgcop Package. March 9, 2007

Session: Probabilistic methods in avalanche hazard assessment. Session moderator: Marco Uzielli Norwegian Geotechnical Institute

A Study of Statistical Power and Type I Errors in Testing a Factor Analytic. Model for Group Differences in Regression Intercepts

Bayesian Inference for Discretely Sampled Diffusion Processes: A New MCMC Based Approach to Inference

A general mixed model approach for spatio-temporal regression data

Bayesian Modeling of Conditional Distributions

Online Appendix for Sterba, S.K. (2013). Understanding linkages among mixture models. Multivariate Behavioral Research, 48,

STA 216, GLM, Lecture 16. October 29, 2007

A Threshold-Free Approach to the Study of the Structure of Binary Data

Bayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence

Assessment of the South African anchovy resource using data from : posterior distributions for the two base case hypotheses

Analysing geoadditive regression data: a mixed model approach

Transcription:

Empirical Validation of the Critical Thinking Assessment Test: A Bayesian CFA Approach CHI HANG AU & ALLISON AMES, PH.D. 1

Acknowledgement Allison Ames, PhD Jeanne Horst, PhD 2

Overview Features of the Critical Thinking Assessment Test (CAT) Estimation Framework Frequentist Bayesian Methods Data Collection MCMC in Mplus Results Implications and Future Research 3

Overview Features of the Critical Thinking Assessment Test (CAT) Estimation Framework Frequentist Bayesian Methods Data Collection MCMC in Mplus Results Implications and Future Research 4

Features of the CAT Four domains (factors), 15 components/items (CAT; Tennessee Technological University) Evaluation of Information Problem Solving Creative Thinking Communication Issues: Inconsistent rating scale (non-integer values) Multidimensionality of items 5

Item Q1 Evaluate and Interpret Information X Problem Solving Creative Thinking Effective Communication Q2 X X Q3 X X Q4 X X X Q5 X Q6 X X Q7 X X X Q8 X Q9 X X Q10 X X Q11 X X X Q12 Q13 X X Q14 X X X Q15 X X X X 6

Ideal simple structure 59 Unknown Parameters 7

But actually 75 Unknown Parameters 8

Overview Features of the Critical Thinking Assessment Test (CAT) Estimation Framework Frequentist Bayesian Methods Data Collection MCMC in Mplus Results Implications and Future Research 9

Frequentist WLSMV Relies on large sample theory (Li, 2016) Local independence Continuous or Categorical indicators (>5 response categories) Problems encountered Inconsistent scoring on the CAT Multidimensionality of most components 10

Frequentist Software Packages R package: lavaan WLSMV estimation Mplus WLSMV estimation 11

Non-convergence lavaan Mplus 12

Bayesian Markov Chain Monte Carlo (Levy & Mislevy, 2016) Use prior knowledge to guide estimation Flexible, can accommodate different data type and structure Does not require large sample theory or local independence 13

Research Questions 1. Can the CAT s factor structure be confirmed empirically using BSEM? 2. Does the BSEM approach overcome the shortcomings of other estimation methods? 14

Overview Features of the Critical Thinking Assessment Test (CAT) Estimation Framework Frequentist Bayesian Methods Data Collection MCMC in Mplus Results Implications and Future Research 15

Data University-wide assessment day Collected from Spring 2012 to Spring 2016 (n = 727) Missing data removed Non-integer responses (disagreement among raters) Final n = 671 Sophomore or Junior status After completing Gen Ed requirements 16

Bayesian Estimation Mplus Version 7.4 Two chains 200,000 total iterations 100,000 burn-in iterations ~15-20 minutes 17

MCMC in Mplus FACTOR LOADING DEFAULT: N(0,5) FACTOR COVARIANCE DEFAULT: IW(0,5) Default priors for parameter types (categorical indicators) *Scaled Inverse-Wishart Distribution 18

Overview Features of the Critical Thinking Assessment Test (CAT) Estimation Framework Frequentist Bayesian Methods Data Collection MCMC in Mplus Results Implications and Future Research 19

20

Convergence Potential Scale Reduction (PSR) Between chain variability by within chain variability Mplus returns highest PSR value of a parameter in an iteration PSR should be around 1 N(0,5): Highest PSR = 2.035 at the 200k th iteration N(0,10): Highest PSR = 1.979 at the 200k th iteration 21

Model Fit Three Methods (Gelman et al., 2003) 1. Prior Sensitivity: Sensitivity to different prior distributions Will changing the prior substantially change the posterior? 2. Posterior Predictive Checking: Discrepancy measures Are simulated data from the posterior data similar to the observed data? 3. Conceptual: Posterior inferences to substantive knowledge Are the estimates or patterns consistent with theory? 22

0, 10 Prior Estimates Model Fit: Priors Sensitivity Default: N(0,5) Diffuse: N(0,10) Range of % differences Q14 loads on to three factors Identify and explain the best solution for a real-world problem using relevant information Largest estimate on Problem Solving Prior Sensitivity Estimate Differences 15 13 11 9 7 5 3 1-1 -1 1 3 5 7 9 11 13 15 0,5 Prior Estimates 23

Model Fit: Posterior Predictive Checks Discrepancy value for Mplus: Chi-square Statistics Posterior Predictive p value (PPP) Acceptable range:.05< PPP <.95 Obtained PPP N(0,5) ~.213 N(0,10) ~.062 24

Problem Solving Creative Thinking Communication Q1 Model Fit: Conceptual Close to zero and negative interfactor correlations Q2 X Q3 X X Q4 X X X Q5 Q6 X X Estimated Latent Factor Correlation Matrix Q7 X X X 1. Evaluation 2. Problem Solving 3. Creative thinking 4. Communication Q8 Q9 X X 1. Evaluation 1 2. Problem Solving -0.226 1 3. Creative Thinking 0.593 0.052 1 4. Communication 0.291-0.036-0.125 1 Q10 X Q11 X X Q12 X Q13 X Q14 X X Q15 X X X 25

Problem Solving Creative Thinking Communication Q1 Model Fit: Conceptual Close to zero and negative interfactor correlations Q2 X Q3 X X Q4 X X X Q5 Q6 X X Estimated Latent Factor Correlation Matrix Q7 X X X 1. Evaluation 2. Problem Solving 3. Creative thinking 4. Communication Q8 Q9 X X 1. Evaluation 1 2. Problem Solving -0.226 1 3. Creative Thinking 0.593 0.052 1 4. Communication 0.291-0.036-0.125 1 Q10 X Q11 X X Q12 X Q13 X Q14 X X Q15 X X X 26

Problem Solving Creative Thinking Communication Q1 Model Fit: Conceptual Close to zero and negative interfactor correlations Q2 X Q3 X X Q4 X X X Q5 Q6 X X Estimated Latent Factor Correlation Matrix Q7 X X X 1. Evaluation 2. Problem Solving 3. Creative thinking 4. Communication Q8 Q9 X X 1. Evaluation 1 2. Problem Solving -0.226 1 3. Creative Thinking 0.593 0.052 1 4. Communication 0.291-0.036-0.125 1 Q10 X Q11 X X Q12 X Q13 X Q14 X X Q15 X X X 27

4-Factor Structure Prior Sensitivity: Sensitivity to different prior distributions % difference in parameters: -201.42% to 22,000% Indicates poor model-data fit Posterior predictive checking PPP value: ~.2 Indicates good model-data fit Conceptual: Posterior inferences to substantive knowledge Close to zero and negative interfactor correlations Indicates poor model-data fit 28

Alternative Model 1-Factor Structure 53 Unknown Parameters φ 1 ξ 1 χ 1 χ 2 χ 3 χ 4 χ 5 χ 6 χ 7 χ 8 χ 9 χ 10 χ 11 χ 12 χ 13 χ 14 χ 15 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 δ 1 δ 2 δ 3 δ 4 δ 5 δ 6 δ 7 δ 8 δ 9 δ 10 δ 11 δ 12 δ 13 δ 14 δ 15 Ψ 11 Ψ 22 Ψ 33 Ψ 44 Ψ 55 Ψ 66 Ψ 77 Ψ 88 Ψ 99 Ψ 1010 Ψ 1111 Ψ 1212 Ψ 1313 Ψ 29 1414 Ψ 1515 29

30

Convergence: 1-Factor Structure 2 chains, 200,000 iterations ~12 minutes N(0,5): Highest PSR = 1.001 at the 200k th iteration N(0,10): Highest PSR = 1.001 at the 200k th iteration 31

0,10 Prior Estimates Model Fit: Priors Sensitivity Default: N(0,5) Diffuse: N(0,10) Range of % differences < 1% 1 0.9 0.8 0.7 0.6 0.5 0.4 0.3 0.2 0.1 0 Prior Sensitivity Estimate Differences 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0,5 Prior Estimates 32

Model Fit: Posterior Predictive Checks Posterior Predictive p value (PPP) Acceptable range:.05< PPP <.95 Obtained PPP N(0,5) <.001 N(0,10) <.001 33

Critical Thinking Model Fit: Conceptual All factor loadings are non-zero and positive Q1 0.147 Q2 0.435 Q3 0.631 Q4 0.478 Q5 0.337 Q6 0.542 Q7 0.356 Q8 0.305 Q9 0.271 Q10 0.192 Q11 0.204 Q12 0.158 Q13 0.491 Q14 0.411 Q15 0.59 34

1-Factor Structure Prior Sensitivity: Sensitivity to different prior distributions % difference in parameters: 0% to 0.633% Indicates good model-data fit Posterior predictive checking PPP value: <.001 Indicates poor model-data fit Conceptual: Posterior inferences to substantive knowledge All factor loadings are positive (.147 to.631) Indicates decent model-data fit 35

Overview Features of the Critical Thinking Assessment Test (CAT) Estimation Framework Frequentist Bayesian Methods Data Collection MCMC in Mplus Results Implications and Future Research 36

Coming back to the Research Questions 1. Can the CAT s factor structure be confirmed empirically using BSEM? 2. Does the BSEM approach overcome the shortcomings of other estimation methods? 37

Implications and Future Research Inconsistent evidence Application of BSEM approach for instrument development Encourage growing body of validity evidence Compare different factor models for the CAT Informative priors from content experts Use different discrepancy measures to assess model fit 38

Thank you. Questions? aucb@dukes.jmu.edu ames2aj@jmu.edu 39

Selected References Au, C. B. & Ames, A. J. (2016, October). A Multitrait Multimethod Analysis of the Construct Validity of Ethical Reasoning. Paper presented at the annual conference of the Northeastern Educational Research Association, Trumbull, CT. Center for Assessment and Improvement of Learning, Tennessee Technological University. https://www.tntech.edu/cat/about/ Gelman, A., Carlin, J. B., Stern, H. S., Dunson, D. B., Vehtari, A., Rubin, D. B. (2004). Bayesian data analysis. Boca Raton : CRC Press, 2004. Levy, R. & Mislevy, R. J. (2016). Confirmatory Factor Analysis. In (Eds.)., Bayesian Psychometric Modeling (pp.187-230). Chapman and Hall/CRC.Center for Assessment and Improvement of Learning, Tennessee Technological University. https://www.tntech.edu/cat/about/ Muthén, B., & Asparouhov, T. (2012). Bayesian structural equation modeling: A more flexible representation of substantive theory. Psychological Methods, 17(3), 313-335. 40

0,10 Estimates Model Fit: Priors Sensitivity (Covariance) Default: N(0,5) Diffuse: N(0,10) Range of % differences -500% to 205% 0.7 0.6 0.5 0.4 0.3 0.2 Prior Sensitivity Covariance Estimate Differences 0.1 0-0.3-0.2-0.1 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7-0.1 0,5 Estimates 41

Factor pattern Coefficients Standard Error Significance Q1.143.059.015 1-Factor Results Fit Indices χ 2 p RMSEA CFI 262.654 <.001.054.826 Q2.386.049 <.001 Q3.518.042 <.001 Q4.415.044 <.001 Q5.318.062 <.001 Q6.466.044 <.001 Q7.323.052 <.001 Q8.296.056 <.001 Q9.260.051 <.001 Q10.186.051 <.001 Q11.198.053 <.001 Q12.164.072.023 Q13.490.044 <.001 Q14.430.043 <.001 Q15.508.043 <.001 42

Rating Method Rater effect cannot be examined Rater interpretation may influence multidimensionality of item-level scores 43

CAT Observed Correlation Matrix 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 1-2.181-3.045.244-4.018.145.310-5.075.181.231.094-6.012.187.281.209.318-7.111.177.186.167.182.165-8.120.152.166.061.179.210.129-9.072.092.164.115.088.112.164.304-10 -.094.084.177.073.032.112.042.013 -.032-11 -.001.044.089.069.089.090.148.076.081.088-12.083.020.036.073 -.010.089.029.213.115.037.100-13.093.158.130.153.101.184.054.050.059.070.014 -.032-14.014.072.110.117 -.007.050.062.013.051.118.018.030.492-15.062.188.244.218 -.007.225.105.053.053.048.171.146.303.345-44