Whats beyond Concerto: An introduction to the R package catr. Session 4: Overview of polytomous IRT models

Size: px
Start display at page:

Download "Whats beyond Concerto: An introduction to the R package catr. Session 4: Overview of polytomous IRT models"

Transcription

1 Whats beyond Concerto: An introduction to the R package catr Session 4: Overview of polytomous IRT models The Psychometrics Centre, Cambridge, June 10th, 2014

2 2 Outline: 1. Introduction 2. General notations 3. Difference models 4. Divide-by-total models 5. Summary

3 3 1. Introduction Current version of catr is version 3.0 (available online) Up to version 2.6, only dichotomous IRT models were embedded into catr Version 3.0 holds now most known polytomous IRT models Goal of this session: 1. To briefly describe the general framework for polytomous IRT models 2. To set up the notations and IRT models that are embedded in catr

4 4 2. General notations With dichotomous items, only two responses are possible With polytomous items, more than two responses are allowed For item j, set g j + 1 as the number of possible responses Responses are coded as k {0, 1,..., g j } Settiong g j = 1 yields a dichotomous item Item responses can be ordinal: e.g., ( never, sometimes, often, always ) nominal: e.g., color, political affiliation, etc. p j is the set of item parameters (depending on the model) P jk (θ) = P r(x j = k θ, p j ) is the probability of answering response k to item j, given proficiency θ and item parameters

5 5 2. General notations Two main categories of polytomous IRT models (Thissen & Steinberg, 1986): 1. Difference models: probabilities P jk (θ) are set as differences between cumulative probabilities P jk (θ) = P r(x j k θ, p j ) 2. Divide-by-total models: probabilities P jk (θ) are set as ratios of values divided by the sum of these values: t jk P jk (θ) = g j l=0 t jl Notations to be used come from Embretson and Reise (2000)

6 6 2. General notations Most-known difference models: Graded response model (GRM; Samejima, 1969) Modified graded response model (MGRM; Muraki, 1990) Most-known difference models: Partial credit model (PCM; Masters, 1982) Generalized partial credit model (GPCM; Muraki, 1992) Rating scale model (RSM; Andrich, 1978) Nominal response model (NRM; Bock, 1972) All these models are available in catr

7 7 3. Difference models With difference models, probabilities P jk (θ) are set as differences between cumulative probabilities P jk (θ) = P r(x j k θ, p j ) that is, the probability of selecting response in (k, k + 1,..., g j ) Graded response model (GRM; Samejima, 1969): P jk (θ) = exp [α j (θ β jk )] 1 + exp [α j (θ β jk )] with P j0 (θ) = 1 and P j,g j (θ) = P j,gj (θ) α j discrimination (slope) parameter and β jk threshold (intercept) parameters of cumulative probabilities Only g j thresholds β j1,..., β j,gj are necessary to distinguish the g j + 1 response categories

8 8 3. Difference models Response category probabilities P jk (θ) are found back as follows (with 0 < k < g j ): Illustration with P j,gj (θ) = P j,g j (θ) P jk (θ) = P jk (θ) P j,k+1 (θ) P j0 (θ) = 1 P j1 (θ) 4 response categories (i.e. g j = 3), α j = 1.5, β j1 = 0.5, β j2 = 0.5, and β j3 = 1.2

9 9 3. Difference models P * jk (θ) Cumulative probability k = 1 k = 2 k = θ

10 10 3. Difference models P jk (θ) Probability k = 0 k = 1 k = 2 k = θ

11 11 3. Difference models With GRM, threshold parameters β jk may vary across items items may not have the same number of response categories g j Modified graded response model (MGRM; Muraki, 1990): modification of GRM to allow for common thresholds across all items With MGRM, thresholds β jk are split in two components: b j (general intercept parameter for item j) and c k (threshold parameter between categories k and k + 1, common to all items) Cumulative probability under MGRM: P jk (θ) = exp [α j (θ b j + c k )] 1 + exp [α j (θ b j + c k )]

12 12 3. Difference models With GRM, all items have the same number of categories (since all c k parameters are equal across items) MGRM applicable for Likert scales with same response categories across items, such as e.g. Never - Rarely - Sometimes - Often - Always

13 13 4. Divide-by-total models With divide-by-total models, probabilities P jk (θ) are set directly as ratios of values divided by the sum of these values (across response categories) Partial credit model (PCM; Masters, 1982): with P jk (θ) = exp k t=0 (θ δ jt ) g j r=0 exp r t=0 (θ δ jt ) 0 (θ δ jt ) = 0 or exp t=0 0 (θ δ jt ) = 1 t=0

14 14 4. Divide-by-total models Detailed equations with three response categories (i.e. g j = 2 and k {0, 1, 2}): g j r=0 exp r (θ δ jt ) = t=0 2 r=0 exp r (θ δ jt ) t=0 = 1 + exp (θ δ j1 ) + exp (θ δ j1 + θ δ j2 ) Denominator is then equal to g j r exp (θ δ jt ) = 1 + exp (θ δ j1 ) + exp (2 θ δ j1 δ j2 ) r=0 t=0

15 15 4. Divide-by-total models Response category probabilities are then 1 P j0 (θ) = 1 + exp (θ δ j1 ) + exp (2 θ δ j1 δ j2 ), and P j1 (θ) = P j2 (θ) = exp (θ δ j1 ) 1 + exp (θ δ j1 ) + exp (2 θ δ j1 δ j2 ), exp (2 θ δ j1 δ j2 ) 1 + exp (θ δ j1 ) + exp (2 θ δ j1 δ j2 ), Idea: to allow for partial credit in the item response (e.g., false - incomplete - almost correct - correct responses)

16 16 4. Divide-by-total models Only g j thresholds (δ j1,..., δ j,gj ) are necessary Illustration with g j = 3 (i.e. four response categories) (δ j1, δ j2, δ j3 ) = (1, 1, 0.5)

17 17 4. Divide-by-total models P jk (θ) Probability k = 0 k = 1 k = 2 k = 3 β j1 = 1 β j2 = 1 β j3 = θ

18 18 4. Divide-by-total models Generalized partial credit model (GPCM; Muraki, 1992) extends the PCM by introducing a discrimination parameter α j for the item: exp k P jk (θ) = t=0 α j (θ δ jt ) g j r=0 exp r t=0 α j (θ δ jt ) with 0 α j (θ δ jt ) = 0 t=0 Illustration with same parameters as PCM, i.e. (δ j1, δ j2, δ j3 ) = (1, 1, 0.5), and with α j = 1.5

19 19 4. Divide-by-total models PCM Probability k = 0 k = 1 k = 2 k = θ

20 20 4. Divide-by-total models GPCM Probability k = 0 k = 1 k = 2 k = θ

21 21 4. Divide-by-total models PCM allows for different thresholds δ jt across the items, and also different response categories Rating scale model (RSM; Andrich, 1978) is a modification of PCM to allow equal thresholds across items With RSM, thresholds δ jt are split in two components, λ j (general intercept parameter) and δ t (threshold parameter common to all items): with P jk (θ) = exp k t=0 [θ (λ j + δ t )] g j r=0 exp r t=0 [θ (λ j + δ t )] 0 [θ (λ j + δ t )] = 0 t=0

22 22 4. Divide-by-total models Finally, Nominal response model (NRM; Bock, 1972) specifies response category probabilities as follows: with P jk (θ) = exp (α jk θ + c jk ) g j r=0 exp (α jr θ + c jr ) α j0 θ + c j0 = 0 Illustration with four categories (i.e.g j = 3) and (α j1, c j1 ) = (1, 1), (α j2, c j2 ) = (1.2, 0.5), (α j3, c j3 ) = (0.8, 0)

23 23 4. Divide-by-total models P jk (θ) Probability k = 0 k = 1 k = 2 k = θ

24 24 5. Summary Polytomous IRT models extend dichotomous IRT models by allowing more than two possible responses Response categories can be the same across all items or they may vary Polytomous IRT models: GRM, MGRM, PCM, GPCM, RSM, NRM,... MGRM and RSM are convenient only if the same response categories (as they share common threshold parameters for all items) Extensions of algorithms for dichotomous IRT model calibration to the polytomous framework exist

25 25 5. Summary Each model can be fully described with specific sets of parameters for each item j: (α j, β j1,..., β j,gj ) for GRM (α j, b j, c 1,..., c g ) for MGRM (δ j1,..., δ j,gj ) for PCM (α j, δ j1,..., δ j,gj ) for GPCM (λ j, δ 1,..., δ g ) for RSM (α j1, c j1,..., α j,gj, c j,gj ) for NRM These orders of parameters are used in catr...

26 26 References Andrich, D. (1978). A rating formulation for ordered response categories. Psychometrika, 43, Bock, R.D. (1972). Estimating item parameters and latent ability when responses are scored in two or more nominal categories. Psychometrika, 37, Embretson, S. E., & Reise, S. P. (2000). Item response theory for psychologists. Mahwah, NJ: Lawrence Erlbaum Associates. Masters, G.N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47, Muraki, E. (1990). Fitting a polytomous item response model to Likert-type data. Applied Psychological Measurement, 14,

27 27 References Muraki, E. (1992). A generalized partial credit model: application of an EM algorithm. Applied Psychological Measurement, 16, Samejima, F. (1969). Estimation of latent ability using a response pattern of graded scores. Psychometrika monograph (Vol. 17). Thissen, D., & Steinberg, L. (1986). A taxonomy of item response models. Psychometrika, 51,

A Note on the Equivalence Between Observed and Expected Information Functions With Polytomous IRT Models

A Note on the Equivalence Between Observed and Expected Information Functions With Polytomous IRT Models Journal of Educational and Behavioral Statistics 2015, Vol. 40, No. 1, pp. 96 105 DOI: 10.3102/1076998614558122 # 2014 AERA. http://jebs.aera.net A Note on the Equivalence Between Observed and Expected

More information

A Marginal Maximum Likelihood Procedure for an IRT Model with Single-Peaked Response Functions

A Marginal Maximum Likelihood Procedure for an IRT Model with Single-Peaked Response Functions A Marginal Maximum Likelihood Procedure for an IRT Model with Single-Peaked Response Functions Cees A.W. Glas Oksana B. Korobko University of Twente, the Netherlands OMD Progress Report 07-01. Cees A.W.

More information

On the Construction of Adjacent Categories Latent Trait Models from Binary Variables, Motivating Processes and the Interpretation of Parameters

On the Construction of Adjacent Categories Latent Trait Models from Binary Variables, Motivating Processes and the Interpretation of Parameters Gerhard Tutz On the Construction of Adjacent Categories Latent Trait Models from Binary Variables, Motivating Processes and the Interpretation of Parameters Technical Report Number 218, 2018 Department

More information

IRT Model Selection Methods for Polytomous Items

IRT Model Selection Methods for Polytomous Items IRT Model Selection Methods for Polytomous Items Taehoon Kang University of Wisconsin-Madison Allan S. Cohen University of Georgia Hyun Jung Sung University of Wisconsin-Madison March 11, 2005 Running

More information

Equating Tests Under The Nominal Response Model Frank B. Baker

Equating Tests Under The Nominal Response Model Frank B. Baker Equating Tests Under The Nominal Response Model Frank B. Baker University of Wisconsin Under item response theory, test equating involves finding the coefficients of a linear transformation of the metric

More information

What is an Ordinal Latent Trait Model?

What is an Ordinal Latent Trait Model? What is an Ordinal Latent Trait Model? Gerhard Tutz Ludwig-Maximilians-Universität München Akademiestraße 1, 80799 München February 19, 2019 arxiv:1902.06303v1 [stat.me] 17 Feb 2019 Abstract Although various

More information

Summer School in Applied Psychometric Principles. Peterhouse College 13 th to 17 th September 2010

Summer School in Applied Psychometric Principles. Peterhouse College 13 th to 17 th September 2010 Summer School in Applied Psychometric Principles Peterhouse College 13 th to 17 th September 2010 1 Two- and three-parameter IRT models. Introducing models for polytomous data. Test information in IRT

More information

PIRLS 2016 Achievement Scaling Methodology 1

PIRLS 2016 Achievement Scaling Methodology 1 CHAPTER 11 PIRLS 2016 Achievement Scaling Methodology 1 The PIRLS approach to scaling the achievement data, based on item response theory (IRT) scaling with marginal estimation, was developed originally

More information

Chapter 3 Models for polytomous data

Chapter 3 Models for polytomous data This is page 75 Printer: Opaque this Chapter 3 Models for polytomous data Francis Tuerlinckx Wen-Chung Wang 3.1 Introduction In the first two chapters of this volume, models for binary or dichotomous variables

More information

Comparison between conditional and marginal maximum likelihood for a class of item response models

Comparison between conditional and marginal maximum likelihood for a class of item response models (1/24) Comparison between conditional and marginal maximum likelihood for a class of item response models Francesco Bartolucci, University of Perugia (IT) Silvia Bacci, University of Perugia (IT) Claudia

More information

Polytomous IRT models and monotone likelihood ratio of the total score Hemker, B.T.; Sijtsma, K.; Molenaar, I.W.; Junker, B.W.

Polytomous IRT models and monotone likelihood ratio of the total score Hemker, B.T.; Sijtsma, K.; Molenaar, I.W.; Junker, B.W. Tilburg University Polytomous IRT models and monotone likelihood ratio of the total score Hemker, B.T.; Sijtsma, K.; Molenaar, I.W.; Junker, B.W. Published in: Psychometrika Publication date: 1996 Link

More information

Preliminary Manual of the software program Multidimensional Item Response Theory (MIRT)

Preliminary Manual of the software program Multidimensional Item Response Theory (MIRT) Preliminary Manual of the software program Multidimensional Item Response Theory (MIRT) July 7 th, 2010 Cees A. W. Glas Department of Research Methodology, Measurement, and Data Analysis Faculty of Behavioural

More information

Monte Carlo Simulations for Rasch Model Tests

Monte Carlo Simulations for Rasch Model Tests Monte Carlo Simulations for Rasch Model Tests Patrick Mair Vienna University of Economics Thomas Ledl University of Vienna Abstract: Sources of deviation from model fit in Rasch models can be lack of unidimensionality,

More information

A Markov chain Monte Carlo approach to confirmatory item factor analysis. Michael C. Edwards The Ohio State University

A Markov chain Monte Carlo approach to confirmatory item factor analysis. Michael C. Edwards The Ohio State University A Markov chain Monte Carlo approach to confirmatory item factor analysis Michael C. Edwards The Ohio State University An MCMC approach to CIFA Overview Motivating examples Intro to Item Response Theory

More information

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report Center for Advanced Studies in Measurement and Assessment CASMA Research Report Number 24 in Relation to Measurement Error for Mixed Format Tests Jae-Chun Ban Won-Chan Lee February 2007 The authors are

More information

Applied Psychological Measurement 2001; 25; 283

Applied Psychological Measurement 2001; 25; 283 Applied Psychological Measurement http://apm.sagepub.com The Use of Restricted Latent Class Models for Defining and Testing Nonparametric and Parametric Item Response Theory Models Jeroen K. Vermunt Applied

More information

Item Response Theory (IRT) Analysis of Item Sets

Item Response Theory (IRT) Analysis of Item Sets University of Connecticut DigitalCommons@UConn NERA Conference Proceedings 2011 Northeastern Educational Research Association (NERA) Annual Conference Fall 10-21-2011 Item Response Theory (IRT) Analysis

More information

SCORING TESTS WITH DICHOTOMOUS AND POLYTOMOUS ITEMS CIGDEM ALAGOZ. (Under the Direction of Seock-Ho Kim) ABSTRACT

SCORING TESTS WITH DICHOTOMOUS AND POLYTOMOUS ITEMS CIGDEM ALAGOZ. (Under the Direction of Seock-Ho Kim) ABSTRACT SCORING TESTS WITH DICHOTOMOUS AND POLYTOMOUS ITEMS by CIGDEM ALAGOZ (Under the Direction of Seock-Ho Kim) ABSTRACT This study applies item response theory methods to the tests combining multiple-choice

More information

Estimating Integer Parameters in IRT Models for Polytomous Items

Estimating Integer Parameters in IRT Models for Polytomous Items Measurement and Research Department Reports 96-1 Estimating Integer Parameters in IRT Models for Polytomous Items H.H.F.M. Verstralen Measurement and Research Department Reports 96-1 Estimating Integer

More information

Rater agreement - ordinal ratings. Karl Bang Christensen Dept. of Biostatistics, Univ. of Copenhagen NORDSTAT,

Rater agreement - ordinal ratings. Karl Bang Christensen Dept. of Biostatistics, Univ. of Copenhagen NORDSTAT, Rater agreement - ordinal ratings Karl Bang Christensen Dept. of Biostatistics, Univ. of Copenhagen NORDSTAT, 2012 http://biostat.ku.dk/~kach/ 1 Rater agreement - ordinal ratings Methods for analyzing

More information

Test Equating under the Multiple Choice Model

Test Equating under the Multiple Choice Model Test Equating under the Multiple Choice Model Jee Seon Kim University of Illinois at Urbana Champaign Bradley A. Hanson ACT,Inc. May 12,2000 Abstract This paper presents a characteristic curve procedure

More information

ANALYSIS OF ITEM DIFFICULTY AND CHANGE IN MATHEMATICAL AHCIEVEMENT FROM 6 TH TO 8 TH GRADE S LONGITUDINAL DATA

ANALYSIS OF ITEM DIFFICULTY AND CHANGE IN MATHEMATICAL AHCIEVEMENT FROM 6 TH TO 8 TH GRADE S LONGITUDINAL DATA ANALYSIS OF ITEM DIFFICULTY AND CHANGE IN MATHEMATICAL AHCIEVEMENT FROM 6 TH TO 8 TH GRADE S LONGITUDINAL DATA A Dissertation Presented to The Academic Faculty By Mee-Ae Kim-O (Mia) In Partial Fulfillment

More information

PREDICTING THE DISTRIBUTION OF A GOODNESS-OF-FIT STATISTIC APPROPRIATE FOR USE WITH PERFORMANCE-BASED ASSESSMENTS. Mary A. Hansen

PREDICTING THE DISTRIBUTION OF A GOODNESS-OF-FIT STATISTIC APPROPRIATE FOR USE WITH PERFORMANCE-BASED ASSESSMENTS. Mary A. Hansen PREDICTING THE DISTRIBUTION OF A GOODNESS-OF-FIT STATISTIC APPROPRIATE FOR USE WITH PERFORMANCE-BASED ASSESSMENTS by Mary A. Hansen B.S., Mathematics and Computer Science, California University of PA,

More information

A Nonlinear Mixed Model Framework for Item Response Theory

A Nonlinear Mixed Model Framework for Item Response Theory Psychological Methods Copyright 2003 by the American Psychological Association, Inc. 2003, Vol. 8, No. 2, 185 205 1082-989X/03/$12.00 DOI: 10.1037/1082-989X.8.2.185 A Nonlinear Mixed Model Framework for

More information

Comparing IRT with Other Models

Comparing IRT with Other Models Comparing IRT with Other Models Lecture #14 ICPSR Item Response Theory Workshop Lecture #14: 1of 45 Lecture Overview The final set of slides will describe a parallel between IRT and another commonly used

More information

Polytomous Item Explanatory IRT Models with Random Item Effects: An Application to Carbon Cycle Assessment Data

Polytomous Item Explanatory IRT Models with Random Item Effects: An Application to Carbon Cycle Assessment Data Polytomous Item Explanatory IRT Models with Random Item Effects: An Application to Carbon Cycle Assessment Data Jinho Kim and Mark Wilson University of California, Berkeley Presented on April 11, 2018

More information

Item-Focussed Trees for the Detection of Differential Item Functioning in Partial Credit Models

Item-Focussed Trees for the Detection of Differential Item Functioning in Partial Credit Models arxiv:1609.08970v1 [stat.me] 8 Sep 016 Item-Focussed Trees for the Detection of Differential Item Functioning in Partial Credit Models Stella Bollmann, Moritz Berger & Gerhard Tutz Ludwig-Maximilians-Universität

More information

The Factor Analytic Method for Item Calibration under Item Response Theory: A Comparison Study Using Simulated Data

The Factor Analytic Method for Item Calibration under Item Response Theory: A Comparison Study Using Simulated Data Int. Statistical Inst.: Proc. 58th World Statistical Congress, 20, Dublin (Session CPS008) p.6049 The Factor Analytic Method for Item Calibration under Item Response Theory: A Comparison Study Using Simulated

More information

Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood

Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood Jonathan Gruhl March 18, 2010 1 Introduction Researchers commonly apply item response theory (IRT) models to binary and ordinal

More information

NESTED LOGIT MODELS FOR MULTIPLE-CHOICE ITEM RESPONSE DATA UNIVERSITY OF TEXAS AT AUSTIN UNIVERSITY OF WISCONSIN-MADISON

NESTED LOGIT MODELS FOR MULTIPLE-CHOICE ITEM RESPONSE DATA UNIVERSITY OF TEXAS AT AUSTIN UNIVERSITY OF WISCONSIN-MADISON PSYCHOMETRIKA VOL. 75, NO. 3, 454 473 SEPTEMBER 2010 DOI: 10.1007/S11336-010-9163-7 NESTED LOGIT MODELS FOR MULTIPLE-CHOICE ITEM RESPONSE DATA YOUNGSUK SUH UNIVERSITY OF TEXAS AT AUSTIN DANIEL M. BOLT

More information

UCLA Department of Statistics Papers

UCLA Department of Statistics Papers UCLA Department of Statistics Papers Title Can Interval-level Scores be Obtained from Binary Responses? Permalink https://escholarship.org/uc/item/6vg0z0m0 Author Peter M. Bentler Publication Date 2011-10-25

More information

Score-Based Tests of Measurement Invariance with Respect to Continuous and Ordinal Variables

Score-Based Tests of Measurement Invariance with Respect to Continuous and Ordinal Variables Score-Based Tests of Measurement Invariance with Respect to Continuous and Ordinal Variables Achim Zeileis, Edgar C. Merkle, Ting Wang http://eeecon.uibk.ac.at/~zeileis/ Overview Motivation Framework Score-based

More information

Anders Skrondal. Norwegian Institute of Public Health London School of Hygiene and Tropical Medicine. Based on joint work with Sophia Rabe-Hesketh

Anders Skrondal. Norwegian Institute of Public Health London School of Hygiene and Tropical Medicine. Based on joint work with Sophia Rabe-Hesketh Constructing Latent Variable Models using Composite Links Anders Skrondal Norwegian Institute of Public Health London School of Hygiene and Tropical Medicine Based on joint work with Sophia Rabe-Hesketh

More information

A DISSERTATION SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY. Yu-Feng Chang

A DISSERTATION SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY. Yu-Feng Chang A Restricted Bi-factor Model of Subdomain Relative Strengths and Weaknesses A DISSERTATION SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY Yu-Feng Chang IN PARTIAL FULFILLMENT

More information

Latent Variable Modeling and Statistical Learning. Yunxiao Chen

Latent Variable Modeling and Statistical Learning. Yunxiao Chen Latent Variable Modeling and Statistical Learning Yunxiao Chen Submitted in partial fulfillment of the requirements for the degree of Doctor of Philosophy in the Graduate School of Arts and Sciences COLUMBIA

More information

Scaling Methodology and Procedures for the TIMSS Mathematics and Science Scales

Scaling Methodology and Procedures for the TIMSS Mathematics and Science Scales Scaling Methodology and Procedures for the TIMSS Mathematics and Science Scales Kentaro Yamamoto Edward Kulick 14 Scaling Methodology and Procedures for the TIMSS Mathematics and Science Scales Kentaro

More information

Comparing Multi-dimensional and Uni-dimensional Computer Adaptive Strategies in Psychological and Health Assessment. Jingyu Liu

Comparing Multi-dimensional and Uni-dimensional Computer Adaptive Strategies in Psychological and Health Assessment. Jingyu Liu Comparing Multi-dimensional and Uni-dimensional Computer Adaptive Strategies in Psychological and Health Assessment by Jingyu Liu BS, Beijing Institute of Technology, 1994 MS, University of Texas at San

More information

General structural model Part 2: Categorical variables and beyond. Psychology 588: Covariance structure and factor models

General structural model Part 2: Categorical variables and beyond. Psychology 588: Covariance structure and factor models General structural model Part 2: Categorical variables and beyond Psychology 588: Covariance structure and factor models Categorical variables 2 Conventional (linear) SEM assumes continuous observed variables

More information

An Overview of Item Response Theory. Michael C. Edwards, PhD

An Overview of Item Response Theory. Michael C. Edwards, PhD An Overview of Item Response Theory Michael C. Edwards, PhD Overview General overview of psychometrics Reliability and validity Different models and approaches Item response theory (IRT) Conceptual framework

More information

Raffaela Wolf, MS, MA. Bachelor of Science, University of Maine, Master of Science, Robert Morris University, 2008

Raffaela Wolf, MS, MA. Bachelor of Science, University of Maine, Master of Science, Robert Morris University, 2008 Assessing the Impact of Characteristics of the Test, Common-items, and Examinees on the Preservation of Equity Properties in Mixed-format Test Equating by Raffaela Wolf, MS, MA Bachelor of Science, University

More information

GENERALIZED LATENT TRAIT MODELS. 1. Introduction

GENERALIZED LATENT TRAIT MODELS. 1. Introduction PSYCHOMETRIKA VOL. 65, NO. 3, 391 411 SEPTEMBER 2000 GENERALIZED LATENT TRAIT MODELS IRINI MOUSTAKI AND MARTIN KNOTT LONDON SCHOOL OF ECONOMICS AND POLITICAL SCIENCE In this paper we discuss a general

More information

LSAC RESEARCH REPORT SERIES. Law School Admission Council Research Report March 2008

LSAC RESEARCH REPORT SERIES. Law School Admission Council Research Report March 2008 LSAC RESEARCH REPORT SERIES Structural Modeling Using Two-Step MML Procedures Cees A. W. Glas University of Twente, Enschede, The Netherlands Law School Admission Council Research Report 08-07 March 2008

More information

AC : GETTING MORE FROM YOUR DATA: APPLICATION OF ITEM RESPONSE THEORY TO THE STATISTICS CONCEPT INVENTORY

AC : GETTING MORE FROM YOUR DATA: APPLICATION OF ITEM RESPONSE THEORY TO THE STATISTICS CONCEPT INVENTORY AC 2007-1783: GETTING MORE FROM YOUR DATA: APPLICATION OF ITEM RESPONSE THEORY TO THE STATISTICS CONCEPT INVENTORY Kirk Allen, Purdue University Kirk Allen is a post-doctoral researcher in Purdue University's

More information

Multidimensional Linking for Tests with Mixed Item Types

Multidimensional Linking for Tests with Mixed Item Types Journal of Educational Measurement Summer 2009, Vol. 46, No. 2, pp. 177 197 Multidimensional Linking for Tests with Mixed Item Types Lihua Yao 1 Defense Manpower Data Center Keith Boughton CTB/McGraw-Hill

More information

Because it might not make a big DIF: Assessing differential test functioning

Because it might not make a big DIF: Assessing differential test functioning Because it might not make a big DIF: Assessing differential test functioning David B. Flora R. Philip Chalmers Alyssa Counsell Department of Psychology, Quantitative Methods Area Differential item functioning

More information

Application of Item Response Theory Models for Intensive Longitudinal Data

Application of Item Response Theory Models for Intensive Longitudinal Data Application of Item Response Theory Models for Intensive Longitudinal Data Don Hedeker, Robin Mermelstein, & Brian Flay University of Illinois at Chicago hedeker@uic.edu Models for Intensive Longitudinal

More information

Basic IRT Concepts, Models, and Assumptions

Basic IRT Concepts, Models, and Assumptions Basic IRT Concepts, Models, and Assumptions Lecture #2 ICPSR Item Response Theory Workshop Lecture #2: 1of 64 Lecture #2 Overview Background of IRT and how it differs from CFA Creating a scale An introduction

More information

An IRT-based Parameterization for Conditional Probability Tables

An IRT-based Parameterization for Conditional Probability Tables An IRT-based Parameterization for Conditional Probability Tables Russell G. Almond Educational Psychology and Learning Systems College of Education Florida State Univeristy Tallahassee, FL 32312 Abstract

More information

Logistic Regression and Item Response Theory: Estimation Item and Ability Parameters by Using Logistic Regression in IRT.

Logistic Regression and Item Response Theory: Estimation Item and Ability Parameters by Using Logistic Regression in IRT. Louisiana State University LSU Digital Commons LSU Historical Dissertations and Theses Graduate School 1998 Logistic Regression and Item Response Theory: Estimation Item and Ability Parameters by Using

More information

Links Between Binary and Multi-Category Logit Item Response Models and Quasi-Symmetric Loglinear Models

Links Between Binary and Multi-Category Logit Item Response Models and Quasi-Symmetric Loglinear Models Links Between Binary and Multi-Category Logit Item Response Models and Quasi-Symmetric Loglinear Models Alan Agresti Department of Statistics University of Florida Gainesville, Florida 32611-8545 July

More information

Statistical and psychometric methods for measurement: Scale development and validation

Statistical and psychometric methods for measurement: Scale development and validation Statistical and psychometric methods for measurement: Scale development and validation Andrew Ho, Harvard Graduate School of Education The World Bank, Psychometrics Mini Course Washington, DC. June 11,

More information

EVALUATION OF MATHEMATICAL MODELS FOR ORDERED POLYCHOTOMOUS RESPONSES. Fumiko Samejima*

EVALUATION OF MATHEMATICAL MODELS FOR ORDERED POLYCHOTOMOUS RESPONSES. Fumiko Samejima* EVALUATION OF MATHEMATICAL MODELS FOR ORDERED POLYCHOTOMOUS RESPONSES Fumiko Samejima* In this paper, mathematical modeling is treated as distinct from curve fitting. Considerations of psychological reality

More information

An Equivalency Test for Model Fit. Craig S. Wells. University of Massachusetts Amherst. James. A. Wollack. Ronald C. Serlin

An Equivalency Test for Model Fit. Craig S. Wells. University of Massachusetts Amherst. James. A. Wollack. Ronald C. Serlin Equivalency Test for Model Fit 1 Running head: EQUIVALENCY TEST FOR MODEL FIT An Equivalency Test for Model Fit Craig S. Wells University of Massachusetts Amherst James. A. Wollack Ronald C. Serlin University

More information

Introduction To Confirmatory Factor Analysis and Item Response Theory

Introduction To Confirmatory Factor Analysis and Item Response Theory Introduction To Confirmatory Factor Analysis and Item Response Theory Lecture 23 May 3, 2005 Applied Regression Analysis Lecture #23-5/3/2005 Slide 1 of 21 Today s Lecture Confirmatory Factor Analysis.

More information

Item Selection and Hypothesis Testing for the Adaptive Measurement of Change

Item Selection and Hypothesis Testing for the Adaptive Measurement of Change Item Selection and Hypothesis Testing for the Adaptive Measurement of Change Matthew Finkelman Tufts University School of Dental Medicine David J. Weiss University of Minnesota Gyenam Kim-Kang Korea Nazarene

More information

The application and empirical comparison of item. parameters of Classical Test Theory and Partial Credit. Model of Rasch in performance assessments

The application and empirical comparison of item. parameters of Classical Test Theory and Partial Credit. Model of Rasch in performance assessments The application and empirical comparison of item parameters of Classical Test Theory and Partial Credit Model of Rasch in performance assessments by Paul Moloantoa Mokilane Student no: 31388248 Dissertation

More information

Large-scale assessment programs are increasingly including complex

Large-scale assessment programs are increasingly including complex GOODMAN, JOSHUA, Ph.D. An Examination of the Residual Covariance Structures of Complex Performance Assessments Under Various Scaling and Scoring Methods. (2008) Directed by Dr. Richard M. Luecht. 190 pp.

More information

Score-Based Tests of Measurement Invariance with Respect to Continuous and Ordinal Variables

Score-Based Tests of Measurement Invariance with Respect to Continuous and Ordinal Variables Score-Based Tests of Measurement Invariance with Respect to Continuous and Ordinal Variables Achim Zeileis, Edgar C. Merkle, Ting Wang http://eeecon.uibk.ac.at/~zeileis/ Overview Motivation Framework Score-based

More information

Item Response Theory for Scores on Tests Including Polytomous Items with Ordered Responses

Item Response Theory for Scores on Tests Including Polytomous Items with Ordered Responses Item Response Theory for Scores on Tests Including Polytomous Items with Ordered Responses David Thissen, University of North Carolina at Chapel Hill Mary Pommerich, American College Testing Kathleen Billeaud,

More information

Editor Nicholas J. Cox Department of Geography Durham University South Road Durham City DH1 3LE UK

Editor Nicholas J. Cox Department of Geography Durham University South Road Durham City DH1 3LE UK The Stata Journal Editor H. Joseph Newton Department of Statistics Texas A & M University College Station, Texas 7784 979-845-14; FAX 979-845-144 jnewton@stata-journal.com Associate Editors Christopher

More information

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report Center for Advanced Studies in Measurement and Assessment CASMA Research Report Number 41 A Comparative Study of Item Response Theory Item Calibration Methods for the Two Parameter Logistic Model Kyung

More information

DETECTION OF DIFFERENTIAL ITEM FUNCTIONING USING LAGRANGE MULTIPLIER TESTS

DETECTION OF DIFFERENTIAL ITEM FUNCTIONING USING LAGRANGE MULTIPLIER TESTS Statistica Sinica 8(1998), 647-667 DETECTION OF DIFFERENTIAL ITEM FUNCTIONING USING LAGRANGE MULTIPLIER TESTS C. A. W. Glas University of Twente, Enschede, the Netherlands Abstract: In the present paper

More information

Extensions and Applications of Item Explanatory Models to Polytomous Data in Item Response Theory. Jinho Kim

Extensions and Applications of Item Explanatory Models to Polytomous Data in Item Response Theory. Jinho Kim Extensions and Applications of Item Explanatory Models to Polytomous Data in Item Response Theory by Jinho Kim A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor

More information

Assessment of fit of item response theory models used in large scale educational survey assessments

Assessment of fit of item response theory models used in large scale educational survey assessments DOI 10.1186/s40536-016-0025-3 RESEARCH Open Access Assessment of fit of item response theory models used in large scale educational survey assessments Peter W. van Rijn 1, Sandip Sinharay 2*, Shelby J.

More information

A Comparison of Item-Fit Statistics for the Three-Parameter Logistic Model

A Comparison of Item-Fit Statistics for the Three-Parameter Logistic Model A Comparison of Item-Fit Statistics for the Three-Parameter Logistic Model Cees A. W. Glas, University of Twente, the Netherlands Juan Carlos Suárez Falcón, Universidad Nacional de Educacion a Distancia,

More information

Paul Barrett

Paul Barrett Paul Barrett email: p.barrett@liv.ac.uk http://www.liv.ac.uk/~pbarrett/paulhome.htm Affiliations: The The State Hospital, Carstairs Dept. of of Clinical Psychology, Univ. Of Of Liverpool 20th 20th November,

More information

Presented by. Philip Holmes-Smith School Research Evaluation and Measurement Services

Presented by. Philip Holmes-Smith School Research Evaluation and Measurement Services The Statistical Analysis of Student Performance Data Presented by Philip Holmes-Smith School Research Evaluation and Measurement Services Outline of session This session will address four topics, namely:.

More information

LATENT VARIABLE SELECTION FOR MULTIDIMENSIONAL ITEM RESPONSE THEORY MODELS VIA L 1 REGULARIZATION

LATENT VARIABLE SELECTION FOR MULTIDIMENSIONAL ITEM RESPONSE THEORY MODELS VIA L 1 REGULARIZATION LATENT VARIABLE SELECTION FOR MULTIDIMENSIONAL ITEM RESPONSE THEORY MODELS VIA L 1 REGULARIZATION Jianan Sun beijing forestry university, china. Yunxiao Chen columbia university, usa. Jingchen Liu columbia

More information

Educational and Psychological Measurement

Educational and Psychological Measurement Target Rotations and Assessing the Impact of Model Violations on the Parameters of Unidimensional Item Response Theory Models Journal: Educational and Psychological Measurement Manuscript ID: Draft Manuscript

More information

Making the Most of What We Have: A Practical Application of Multidimensional Item Response Theory in Test Scoring

Making the Most of What We Have: A Practical Application of Multidimensional Item Response Theory in Test Scoring Journal of Educational and Behavioral Statistics Fall 2005, Vol. 30, No. 3, pp. 295 311 Making the Most of What We Have: A Practical Application of Multidimensional Item Response Theory in Test Scoring

More information

Chapter 11. Scaling the PIRLS 2006 Reading Assessment Data Overview

Chapter 11. Scaling the PIRLS 2006 Reading Assessment Data Overview Chapter 11 Scaling the PIRLS 2006 Reading Assessment Data Pierre Foy, Joseph Galia, and Isaac Li 11.1 Overview PIRLS 2006 had ambitious goals for broad coverage of the reading purposes and processes as

More information

Local response dependence and the Rasch factor model

Local response dependence and the Rasch factor model Local response dependence and the Rasch factor model Dept. of Biostatistics, Univ. of Copenhagen Rasch6 Cape Town Uni-dimensional latent variable model X 1 TREATMENT δ T X 2 AGE δ A Θ X 3 X 4 Latent variable

More information

University of Groningen. Mokken scale analysis van Schuur, Wijbrandt H. Published in: Political Analysis. DOI: /pan/mpg002

University of Groningen. Mokken scale analysis van Schuur, Wijbrandt H. Published in: Political Analysis. DOI: /pan/mpg002 University of Groningen Mokken scale analysis van Schuur, Wijbrandt H. Published in: Political Analysis DOI: 10.1093/pan/mpg002 IMPORTANT NOTE: You are advised to consult the publisher's version (publisher's

More information

Conditional maximum likelihood estimation in polytomous Rasch models using SAS

Conditional maximum likelihood estimation in polytomous Rasch models using SAS Conditional maximum likelihood estimation in polytomous Rasch models using SAS Karl Bang Christensen kach@sund.ku.dk Department of Biostatistics, University of Copenhagen November 29, 2012 Abstract IRT

More information

Using Graphical Methods in Assessing Measurement Invariance in Inventory Data

Using Graphical Methods in Assessing Measurement Invariance in Inventory Data Multivariate Behavioral Research, 34 (3), 397-420 Copyright 1998, Lawrence Erlbaum Associates, Inc. Using Graphical Methods in Assessing Measurement Invariance in Inventory Data Albert Maydeu-Olivares

More information

A Note on Item Restscore Association in Rasch Models

A Note on Item Restscore Association in Rasch Models Brief Report A Note on Item Restscore Association in Rasch Models Applied Psychological Measurement 35(7) 557 561 ª The Author(s) 2011 Reprints and permission: sagepub.com/journalspermissions.nav DOI:

More information

Modeling rater effects using a combination of Generalizability Theory and IRT

Modeling rater effects using a combination of Generalizability Theory and IRT Psychological Test and Assessment Modeling, Volume 60, 2018 (1), 53-80 Modeling rater effects using a combination of Generalizability Theory and IRT Jinnie Choi 1 & Mark R. Wilson 2 Abstract Motivated

More information

flexmirt R : Flexible Multilevel Multidimensional Item Analysis and Test Scoring

flexmirt R : Flexible Multilevel Multidimensional Item Analysis and Test Scoring flexmirt R : Flexible Multilevel Multidimensional Item Analysis and Test Scoring User s Manual Version 3.0RC Authored by: Carrie R. Houts, PhD Li Cai, PhD This manual accompanies a Release Candidate version

More information

Lesson 7: Item response theory models (part 2)

Lesson 7: Item response theory models (part 2) Lesson 7: Item response theory models (part 2) Patrícia Martinková Department of Statistical Modelling Institute of Computer Science, Czech Academy of Sciences Institute for Research and Development of

More information

Covariates of the Rating Process in Hierarchical Models for Multiple Ratings of Test Items

Covariates of the Rating Process in Hierarchical Models for Multiple Ratings of Test Items Journal of Educational and Behavioral Statistics March XXXX, Vol. XX, No. issue;, pp. 1 28 DOI: 10.3102/1076998606298033 Ó AERA and ASA. http://jebs.aera.net Covariates of the Rating Process in Hierarchical

More information

New Developments for Extended Rasch Modeling in R

New Developments for Extended Rasch Modeling in R New Developments for Extended Rasch Modeling in R Patrick Mair, Reinhold Hatzinger Institute for Statistics and Mathematics WU Vienna University of Economics and Business Content Rasch models: Theory,

More information

Meiser et. al.: Latent Change in Discrete Data 76 as new perspectives in the application of test models to social science issues (e.g., Fischer & Mole

Meiser et. al.: Latent Change in Discrete Data 76 as new perspectives in the application of test models to social science issues (e.g., Fischer & Mole Methods of Psychological Research Online 1998, Vol.3, No.2 Internet: http://www.pabst-publishers.de/mpr/ Latent Change in Discrete Data: Unidimensional, Multidimensional, and Mixture Distribution Rasch

More information

Psychology 282 Lecture #4 Outline Inferences in SLR

Psychology 282 Lecture #4 Outline Inferences in SLR Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations

More information

UCLA Department of Statistics Papers

UCLA Department of Statistics Papers UCLA Department of Statistics Papers Title IRT Goodness-of-Fit Using Approaches from Logistic Regression Permalink https://escholarship.org/uc/item/1m46j62q Authors Mair, Patrick Steven P. Reise Bentler,

More information

The Use of Quadratic Form Statistics of Residuals to Identify IRT Model Misfit in Marginal Subtables

The Use of Quadratic Form Statistics of Residuals to Identify IRT Model Misfit in Marginal Subtables The Use of Quadratic Form Statistics of Residuals to Identify IRT Model Misfit in Marginal Subtables 1 Yang Liu Alberto Maydeu-Olivares 1 Department of Psychology, The University of North Carolina at Chapel

More information

A Bivariate Generalized Linear Item Response Theory Modeling Framework to the Analysis of Responses and Response Times

A Bivariate Generalized Linear Item Response Theory Modeling Framework to the Analysis of Responses and Response Times Multivariate Behavioral Research ISSN: 0027-3171 (Print 1532-7906 (Online Journal homepage: http://www.tandfonline.com/loi/hmbr20 A Bivariate Generalized Linear Item Response Theory Modeling Framework

More information

Assessment, analysis and interpretation of Patient Reported Outcomes (PROs)

Assessment, analysis and interpretation of Patient Reported Outcomes (PROs) Assessment, analysis and interpretation of Patient Reported Outcomes (PROs) Day 2 Summer school in Applied Psychometrics Peterhouse College, Cambridge 12 th to 16 th September 2011 This course is prepared

More information

Multidimensional item response theory observed score equating methods for mixed-format tests

Multidimensional item response theory observed score equating methods for mixed-format tests University of Iowa Iowa Research Online Theses and Dissertations Summer 2014 Multidimensional item response theory observed score equating methods for mixed-format tests Jaime Leigh Peterson University

More information

Measurement precision test construction and best test design

Measurement precision test construction and best test design Edith Cowan University Research Online ECU Publications Pre. 2011 1993 Measurement precision test construction and best test design Pender J. Pedler Pedler, P. (1993). Measurement precision test construction

More information

RANDOM INTERCEPT ITEM FACTOR ANALYSIS. IE Working Paper MK8-102-I 02 / 04 / Alberto Maydeu Olivares

RANDOM INTERCEPT ITEM FACTOR ANALYSIS. IE Working Paper MK8-102-I 02 / 04 / Alberto Maydeu Olivares RANDOM INTERCEPT ITEM FACTOR ANALYSIS IE Working Paper MK8-102-I 02 / 04 / 2003 Alberto Maydeu Olivares Instituto de Empresa Marketing Dept. C / María de Molina 11-15, 28006 Madrid España Alberto.Maydeu@ie.edu

More information

A PSYCHOPHYSICAL INTERPRETATION OF RASCH S PSYCHOMETRIC PRINCIPLE OF SPECIFIC OBJECTIVITY

A PSYCHOPHYSICAL INTERPRETATION OF RASCH S PSYCHOMETRIC PRINCIPLE OF SPECIFIC OBJECTIVITY A PSYCHOPHYSICAL INTERPRETATION OF RASCH S PSYCHOMETRIC PRINCIPLE OF SPECIFIC OBJECTIVITY R. John Irwin Department of Psychology, The University of Auckland, Private Bag 92019, Auckland 1142, New Zealand

More information

Journal of Statistical Software

Journal of Statistical Software JSS Journal of Statistical Software November 2008, Volume 28, Issue 10. http://www.jstatsoft.org/ A MATLAB Package for Markov Chain Monte Carlo with a Multi-Unidimensional IRT Model Yanyan Sheng Southern

More information

A comparison of parameter covariance estimation methods for item response models in an expectation-maximization framework

A comparison of parameter covariance estimation methods for item response models in an expectation-maximization framework Virginia Commonwealth University VCU Scholars Compass Psychiatry Publications Dept. of Psychiatry 2017 A comparison of parameter covariance estimation methods for item response models in an expectation-maximization

More information

Studies on the effect of violations of local independence on scale in Rasch models: The Dichotomous Rasch model

Studies on the effect of violations of local independence on scale in Rasch models: The Dichotomous Rasch model Studies on the effect of violations of local independence on scale in Rasch models Studies on the effect of violations of local independence on scale in Rasch models: The Dichotomous Rasch model Ida Marais

More information

A Use of the Information Function in Tailored Testing

A Use of the Information Function in Tailored Testing A Use of the Information Function in Tailored Testing Fumiko Samejima University of Tennessee for indi- Several important and useful implications in latent trait theory, with direct implications vidualized

More information

Psychology 405: Psychometric Theory Validity

Psychology 405: Psychometric Theory Validity Preliminaries Psychology 405: Psychometric Theory Validity William Revelle Department of Psychology Northwestern University Evanston, Illinois USA May, 2016 1/17 Preliminaries Outline Preliminaries 2/17

More information

Walkthrough for Illustrations. Illustration 1

Walkthrough for Illustrations. Illustration 1 Tay, L., Meade, A. W., & Cao, M. (in press). An overview and practical guide to IRT measurement equivalence analysis. Organizational Research Methods. doi: 10.1177/1094428114553062 Walkthrough for Illustrations

More information

A Comparison of Linear and Nonlinear Factor Analysis in Examining the Effect of a Calculator Accommodation on Math Performance

A Comparison of Linear and Nonlinear Factor Analysis in Examining the Effect of a Calculator Accommodation on Math Performance University of Connecticut DigitalCommons@UConn NERA Conference Proceedings 2010 Northeastern Educational Research Association (NERA) Annual Conference Fall 10-20-2010 A Comparison of Linear and Nonlinear

More information

COMPARISON OF CONCURRENT AND SEPARATE MULTIDIMENSIONAL IRT LINKING OF ITEM PARAMETERS

COMPARISON OF CONCURRENT AND SEPARATE MULTIDIMENSIONAL IRT LINKING OF ITEM PARAMETERS COMPARISON OF CONCURRENT AND SEPARATE MULTIDIMENSIONAL IRT LINKING OF ITEM PARAMETERS A THESIS SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY Mayuko Kanada Simon IN PARTIAL

More information

Pairwise Parameter Estimation in Rasch Models

Pairwise Parameter Estimation in Rasch Models Pairwise Parameter Estimation in Rasch Models Aeilko H. Zwinderman University of Leiden Rasch model item parameters can be estimated consistently with a pseudo-likelihood method based on comparing responses

More information