LSAC RESEARCH REPORT SERIES. Law School Admission Council Research Report March 2008
|
|
- Brianna Hunter
- 6 years ago
- Views:
Transcription
1 LSAC RESEARCH REPORT SERIES Structural Modeling Using Two-Step MML Procedures Cees A. W. Glas University of Twente, Enschede, The Netherlands Law School Admission Council Research Report March 2008 A publication of the Law School Admission Council
2
3 The Law School Admission Council (LSAC) is a nonprofit corporation whose members are more than 200 law schools in the United States, Canada, and Australia. Headquartered in Newtown, PA, USA, the Council was founded in 1947 to facilitate the law school admission process. The Council has grown to provide numerous products and services to law schools and to more than 85,000 law school applicants each year. All law schools approved by the American Bar Association (ABA) are LSAC members. Canadian law schools recognized by a provincial or territorial law society or government agency are also members. Accredited law schools outside of the United States and Canada are eligible for membership at the discretion of the LSAC Board of Trustees by Law School Admission Council, Inc. All rights reserved. No part of this work, including information, data, or other portions of the work published in electronic form, may be reproduced or transmitted in any form or by any means, electronic or mechanical, including photocopying and recording, or by any information storage and retrieval system, without permission of the publisher. For information, write: Communications, Law School Admission Council, 662 Penn Street, Box 40, Newtown, PA This study is published and distributed by LSAC. The opinions and conclusions contained in this report are those of the author(s) and do not necessarily reflect the position or policy of LSAC.
4
5 i Table of Contents Executive Summary...1 Abstract...1 Introduction...1 The Model...2 MML1...3 The E-Step...3 The M-Step... 3 MML2...4 The E-Step...5 The M-Step...5 An Example...5 References...7
6
7 1 Executive Summary In a computerized adaptive test (CAT), test questions (items) are selected for administration to a test taker based on their performance on previous items, with the intent of tailoring the difficulty level of the test to the ability level of the test taker. Data from CATs are special in the sense that every student responds to a virtually unique set of items. Therefore, number-correct scores of different test takers can no longer be compared, and statistical methods traditionally used in the analysis of number-correct scores lose their relevance. The problem can be solved by applying these traditional methods in the analysis of proficiency scores typically produced by the application of an item response theory (IRT) model. IRT is a mathematical model used in the analysis of test data that results in comparable proficiency scores for each test taker even though the test takers may have responded to different items in the test administration. In this paper, we consider two methods for estimating test-taker proficiencies with traditional statistical methodology. We then compare these proficiency estimates with those obtained using an IRT model. We base our analyses on a dataset from school effectiveness research. The proficiency estimates were close to those obtained with the IRT model. The computing time needed for the traditional statistical methods was about 25% of the time needed for the IRT method. Abstract Two methods are considered for estimating the parameters of an item response (IRT) model with a multilevel regression model on the person parameters (the MLIRT model). In both methods it is assumed that the item parameters are known. The person parameter estimates are compared with the parameter estimates obtained by a fully Bayesian method where this assumption is not made. The first method is marginal maximum likelihood (MML) given the original item response patterns on the tests (labeled MML1). The second is MML given the estimates of the true scores on the tests (labeled MML2). In both cases, the parameters are estimated using the expectation-maximization (EM) algorithm. The methods are compared using a dataset from school effectiveness research. The parameter estimates were close to the estimates obtained using the fully Bayesian method. The MML estimation procedures took about 25% of the time needed for the fully Bayesian method. Introduction In educational and social research, there is a growing interest in the problems associated with describing the relationships between variables of different aggregation levels. For instance, one may be interested in the effects of the school budget on the educational achievement of the students. However, the former variable is defined on the level of schools while the latter variable is defined on the level of students. These problems can be addressed through the use of multilevel models (Bryk & Raudenbush, 1992; Goldstein, 1986; Longford, 1993). In the above example, students are nested in schools; in a multilevel model, the students would make up a first level and the schools a secondary level. Although most applications of the multilevel paradigm are found in regression and analysis of variance models (see, for instance, Bryk and Raudenbush, 1992), multilevel modeling does, in principle, apply to all statistical modeling of data where elementary units are nested within higher order units. Longford (1993), for instance, gives examples of multilevel factor analytical models and generalized linear models. Also, in the field of item response theory (IRT), some applications of the multilevel paradigm can be found (Adams, Wilson, & Wu, 1997; Mislevy & Bock, 1989). The present paper investigates the following problem related to multilevel modeling. In educational research, many variables are measured subject to error. This predominately concerns the dependent variables, but covariates on the student and school level can be subject to measurement error too. In practice, the multilevel models used belong to the framework of the usual linear multivariate normal model, and solutions to the problem of measurement error boil down to applications of classical test theory (Longford, 1993). One of the many drawbacks of classical test theory is that measurement error is supposed to be independent of the score level of the person. In modern test theory (i.e., IRT), measurement error is defined conditionally on the level of the latent ability variable. Therefore, it seems worthwhile to tackle the problem of measurement error in the framework of hierarchical IRT models. We consider two methods for estimating the parameters of an IRT model with a multilevel regression model on the person parameters. In both methods it is assumed that the item parameters are known. The parameter estimates are compared with the parameter estimates obtained by a method where this assumption is not made (i.e., the method by Fox and Glas, 2001). The first method is marginal maximum likelihood (MML) given the original item response patterns on the tests (labeled MML1). The second is MML given the estimates of the true scores on the tests (labeled MML2). In both cases the parameters are estimated using the expectation-maximization (EM) algorithm. The methods are compared using a dataset from school effectiveness research.
8 2 The Model Assume Level 1 units, say, students i = 1,, N j, are nested within Level 2 units j = 1,, J and respond to a number of tests labeled q = 1,..., Q. We impose a two-level regression model on the latent variables: q P =å b +e ijq jpq ijp ijq p= 1 x (1) and S b = å g w + u. (2) jpq pqs jspq jpq s= 1 x ij1 and w j1pq are equal to one. The other values of x ijp and w jspq are values of observed covariates at Level 1 and Level 2, respectively. In principle, latent predictors can also be present. In that case, it is assumed that the model is rewritten to the socalled reduced form, where all latent variables are on the left-hand side of the regression equations. The error terms have distributions where is a Q Q covariance matrix, and ij N(, 0 ), (3) j N(, 0T), where T is a PQ PQ covariance matrix. and T are not restricted to be diagonal. It is assumed that q ijq is a parameter in a unidimensional response model, either for discrete responses or continuous responses. The latter might, for instance, be the logarithms of response times (van der Linden, 2006, 2007, 2008). Discrete responses will be responses on a test with polytomously scored items labeled k = 1,..., Kq. Item responses will be stochastic variables Uijqk with realizations u ijqk. The realizations are integers between 0 and mqk. For educational tests, it is reasonable to assume that the probability of obtaining a score of 0 decreases as a function of q, the probability of obtaining the maximum score increases as a function of q, and the probabilities of the intermediate scores are single peaked. Such response probabilities are produced by the generalized partial credit model (GPCM; Muraki, 1992), where the probability of a response is given by exp( laqkqijq - bqkl ) PU ( = l q ) =. (4) 1 + exp( ha q - b ) ijqk ijq å m h= 1 qk ijq qkh The partial credit model (PCM; Masters, 1982) is a special case where a k = 1 for all items k. The two-parameter logistic model (2PL model; Birnbaum, 1968) is a special case that pertains to dichotomously scored responses (so m qk = 1), and the one-parameter logistic model (1PL model or Rasch model; Rasch, 1960) is a further specialization where aqk = 1 for all items qk.
9 3 ij Continuous responses can be modeled by a normal response model where it is assumed that the response of student on item qk is normally distributed, that is, æ 1 ( u ) 2 ijqk t ö - ijqk PU ( ijqk = uijqk qijq, akq, bqk ) = exp ps 2s, ç k çè k ø with t ijqk = aqkqijq - bqk. MML1 We use the EM algorithm (Dempster, Laird, & Rubin, 1977) to estimate,, and T given responses U Level 1 covariates X, Level 2 covariates W, and item parameters a and b. To apply the logic of the EM algorithm, the values of given by q ijq j are seen as missing data (Bock & Aitkin, 1981). The complete data specification for a person ij is then p( u,,, a, b, x, W,, T) µ ij ij j ij ij ij j p( u, a, b ) p(,, x ) p( W,, T), ij ij ij ij ij j ij j j where is a PQS -dimensional vector of the elements g pqs, and j is a PQ -dimensional vector of the elements b jpq. The E-step consists of estimating the posterior distribution p( y, a,,,, x, W,, T), and the M-step consists ij ij ij b ij j ij j of a maximization of E(log( L q (,,, T) Y,,, D, X, W,, T ) with respect to,,, and T. The E-Step, y is The E-step consists of computation of the posterior distribution of the missing data. For the present application, this p( u, a, b,,,, T) ij ij ij ij j p( uij ij, aij, bij ) p( ij j,, xij ) p( j Wj,, T) = [ ò.. ò p( uij ij, aij, bij ) p( ij j,, xij ) d ij ] p( j Wj,, T) p( uij ij, aij, bij ) p( ij j,, xij ) = ò.. ò p( u, a, b ) p(,, x ) d ij ij ij ij ij j ij ij. The M-Step In the M-step, we maximize with respect to,,, and T. The marginal likelihood equations needed in the M-step can be easily derived using Fisher s identity (Efron, 1977; Louis, 1982; see Glas, 1992, 1998 for applications to IRT). Fisher s identity sets the first-order derivatives with respect to the model parameters equal to the expectation with respect to the posterior distribution obtained in the E-step of the first-order derivatives of the complete data likelihood. However, the model for the complete data is just an ordinary multilevel regression model, so the first-order derivatives of the complete data likelihood are easily obtained. Since the parameters are regression coefficients in a linear regression model, we can apply the generalized least-squares (GLS) estimator, which is equivalent to the maximum likelihood estimator.
10 4 Define W jpq as an S -dimensional vector of the elements w jspq and Wj = { Wjpq} ÄI PQ. Note that W j is a PQ PQS matrix. Then the GLS estimate is obtained as å and the covariance matrix of the residuals estimated as j j j j ( WT W) WT j (5) j å 1 T ( j - j )( j - j J å W W ). (6) j Define X * j = { Xj} ÄI Q, with X j a matrix of the elements { x ijp}. Note that X j is an to j we then have NQ PQ j matrix. With respect where j = { jq} Ä1Q, and jq has elements h j = ( X * j X * j T - ) - é * j j - ù ë X T Wj û, (7) òò.. q p( u, a, b,,, x ) d, i = 1,..., N j. (8) ijq ijq ij ij ij ij j ij ij For, we have the estimate 1.. ( ij - j ij ) ( ij - j ij ) p( ij ij, ij, ij, ) d ij N åò ò BX BX u a b, (9) ij with ij a Q -vector of the elements qijq, Xij a P -dimensional vector of the elements x ijp, and B j a Q P matrix of the elements b jpq. This process is iterated until convergence is achieved. The multiple integrals that appear above can be evaluated using adaptive Gauss-Hermite quadrature (Schilling & Bock, 2005). A critical point related to using Gauss-Hermite quadrature is the dimensionality of the latent space (i.e., the number of latent variables that can be analyzed simultaneously). Wood et al. (2002) indicate that the maximum number of dimensions is 10 with adaptive quadrature, 5 with nonadaptive quadrature, and 15 with Monte Carlo integration. MML2 The second approach is a generalization of a method by Rubin and Thomas (2001). Assume that are estimated latent variables of persons i = 1,, N j within Level 2 units j = 1,..., J, on scales q = 1,..., Q. The estimates have unknown estimation errors e ijq, with estimated variance d ij q; that is, y ijq = qijq + eijq, where it is assumed that has a Q -variate normal distribution with expectation zero and ij Cov( e, e ) = D = diag( d,...,,..., d ). Thus, ij ij ij ij1 d ijq ijq y ijq e ij N( 0, D ), ij In the application below, maximum likelihood estimates and the squared estimated standard errors are used for d ijq, respectively. y ijq and
11 5 Again, we use the EM-algorithm to estimate,, and T given Y, X, W, and D. Analogous to MML1, consider q ijq as missing data. The complete-data specification for a person ij is then given by p( y,,, D, x, W,, T) µ ij ij j ij ij j ( y, D ) p(,, x ) p( W,, T), ij ij ij ij j ij j j where is a PQS -dimensional vector of the elements g pqs, and j is a PQ -dimensional vector of the elements b jpq. The E-step consists of estimating the posterior distribution p ( ij yij, Dij, j,, xij, Wj,, T), and the M-step consists of a maximization of E(log( L q, y (,,, T) Y,,, D, X, W,, T) with respect to,,, and T. Note that this is very similar to MML1. The E-Step Compute p( yij ij, Dij ) p( ij j,, xij ) p( j Wj,, T) p( ij yij, Dij, j,,, T) = [ ò.. ò p( y, D ) p(,, x ) d ] p( W,, T) ij ij ij ij j ij ij j j The M-step p( yij ij, Dij ) p( ij j,, xij ) = ò.. ò p( y, D ) p(,, x ) d ij ij ij ij j ij ij We have to maximize with respect to,,, and T. The equations for, the equations in MML2 given by (5), (6), and (7). Besides, (8) becomes h ò ò y D x.. q p(,,,, ) d, ijq ijq ij ij ij j ij ij for i = 1,..., N j. For the estimation of, we now have the equation with the same definitions as above.. T, and 1.. ( ij - j ij ) ( ij - j ij ) p( ij ij, ij, ) d ij N åò ò BX BX y D, ij An Example are completely analogous to The estimation procedures will be compared to each other and to a fully Bayesian procedure using an application reported by Shalabi (2002). The data were a cluster sample of 3,384 seventh grade students in 119 schools. At student level the variables were Gender (0 = male, 1 = female), SES (with two indicators: the father s and mother s education; score range, 0 8), and IQ (score range, 0 80). School-level variables were Leadership (measured by a scale consisting of 25 five-point Likert items, administered to the school teachers), School Climate (measured by a scale consisting of 23 five-point Likert items) and Mean-IQ (the IQ scores aggregated at school level). The items assessing Leadership and School Climate were dichotomous. The dependent variable was a mathematics achievement test consisting of 50 multiple-choice items. The 2PL model was used to model the responses to the Leadership and School Climate questionnaire and the mathematics test. The parameters were estimated with the Gibbs sampler using the method by Fox and Glas (2001). For a complete description of all analyses, see Shalabi (2002); here only the estimates of the final
12 6 model are given as an example. The model is given by and q = b + bses + b Gender + b IQ + e ij 0j 1 ij 2 ij 3 ij ij b = g + g Mean-IQ + g Leadership + g Climate +. 0j j 02 j 03 j u0 j The results reported by Shalabi (2002) are given in Table 1. The estimates of the MLIRT model are compared with a traditional multilevel analysis where all variables were manifest. The observed Mathematics, Leadership, and School Climate scores were transformed in such a way that their scale was comparable to the scale used in the MLIRT model. Further, the parameters of the multilevel model were also estimated with a Bayesian approach using the Gibbs sampler. The columns labeled CI give the 90% confidence intervals of the unknown parameters; they were derived from the posterior standard deviation. It can be seen that the magnitudes of the fixed effects in the MLIRT model were larger than the analogous estimates in the multilevel model. This finding is in line with the other findings (Fox & Glas, 2001; Shalabi, 2002), which indicates that the MLIRT model has more power to detect effects in hierarchical data when some variables are measured with error. TABLE 1 Bayesian estimates of multilevel IRT model and multilevel model on observed scores MLIRT Estimates ML Estimates Variable Estimates CI Estimates CI g b 1 SES b 2 Gender b IQ g 01 Mean-IQ g 02 Leadership g Climate Variance components 2 t s CI = confidence interval; SES = socioeconomic status. The estimates and standard errors obtained by using MML1 and MML2 are shown in Table 2. By comparing the entries in Table 1 and Table 2, it can be seen that the estimates are all very close. This also holds when the credible regions obtained by the fully Bayesian method are compared with the confidence intervals obtained by maximum likelihood. The latter regions are slightly smaller, but this has no effect on the outcomes of tests of the hypotheses that certain effects are equal to zero.
13 7 TABLE 2 Marginal maximum likelihood estimates of multilevel IRT model using methods MML1 and MML2, respectively MML1 Estimates MML2 Estimates Variable Estimates CI Estimates CI g b 1 SES b 2 Gender b 3 IQ g 01 Mean-IQ g 02 Leadership g Climate Variance components 2 t s CI = confidence interval; SES = socioeconomic status. An advantage of the MML methods is their relative speed. The fully Bayesian analysis took about 3 hours, while the MML analyses took about 45 minutes. The fact that the latter is also not very fast has to do with fact that MML involved integration over a three-dimensional latent space with 10 quadrature points for every dimension. It must also be mentioned that the programming was not optimized with respect to speed. Further, the ever-increasing speed of computers may also further support the application of these models in complex situations. References Adams, R. J., Wilson, M. R., & Wu, M. (1997). Multilevel item response theory models: An approach to errors in variables of regression. Journal of Educational and Behavioral Statistics, 22, Birnbaum, A. (1968). Some latent trait models. In F. M. Lord & M. R. Novick (Eds.), Statistical theories of mental test scores (pp ). Reading, MA: Addison-Wesley. Bock, R. D., & Aitkin, M. (1981). Marginal maximum likelihood estimation of item parameters: An application of an EM-algorithm. Psychometrika, 46, Bryk, A. S., & Raudenbush, S. W. (1992). Hierarchical linear models. Newbury Park: Sage. Dempster, A. P., Laird, N. M., & Rubin, D. B. (1977). Maximum likelihood from incomplete data via the EM algorithm (with discussion). Journal of the Royal Statistical Society, Series B, 39, Efron, B. (1977). Discussion on maximum likelihood from incomplete data via the EM algorithm (by A. P. Dempster, N. M. Laird, and D. B. Rubin). Journal of the Royal Statistical Society, Series B, 39, Fox, J.-P., & Glas, C. A. W. (2001). Bayesian estimation of a multilevel IRT model using Gibbs sampling. Psychometrika, 66, Glas, C. A. W. (1992). A Rasch model with a multivariate distribution of ability. In M. Wilson (Ed.), Objective measurement: Theory into practice (Vol. 1; pp ). Norwood, NJ: Ablex. Glas, C. A. W. (1998). Detection of differential item functioning using Lagrange multiplier tests. Statistica Sinica, 8, Goldstein, H. (1986). Multilevel mixed linear models analysis using iterative generalized least squares. Biometrika, 73, Longford, N. T. (1993). Random coefficient models. Oxford Statistical Science Series (Vol. 11). Oxford: Clarendon.
14 8 Louis, T. A. (1982). Finding the observed information matrix when using the EM algorithm. Journal of the Royal Statistical Society, Series B, 44, Masters, G. N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47, Mislevy, R. J., & Bock, R. D. (1989). A hierarchical item-response model for educational testing. In R. D. Bock (Ed.), Multilevel analysis of educational data. San Diego: Academic Press. Muraki, E. (1992). A generalized partial credit model: Application of an EM algorithm. Applied Psychological Measurement, 16, Rasch, G. (1960). Probabilistic models for some intelligence and attainment tests. Copenhagen: Danish Institute for Educational Research. Rubin, D. B., & Thomas, N. (2001). Using parameter expansion to improve the performance of the EM algorithm for multidimensional IRT population-survey models. In A. Boomsma, M. A. J. van Duijn, & T. A. B. Snijders (Eds.), Essays on item response theory (pp ). New York: Springer. Schilling, S., & Bock, R. D. (2005). High-dimensional maximum marginal likelihood item factor analysis by adaptive quadrature. Psychometrika, 70, Shalabi, F. (2002). Effective schooling in the West Bank. Unpublished doctoral thesis, University of Twente, the Netherlands. Wood, R., Wilson, D. T., Gibbons, R. D., Schilling, S. G., Muraki, E., & Bock, R. D. (2002). TESTFACT: Test scoring, item statistics, and item factor analysis. Chicago, IL: Scientific Software International, Inc. van der Linden, W. J. (2006). A lognormal model for response times on test items. Journal of Educational and Behavioral Statistics, 31, van der Linden, W. J. (2007). A hierarchical framework for modeling speed and accuracy on test items. Psychometrika, 72, van der Linden, W. J. (2008). Using response times for item selection in adaptive testing. Journal of Educational and Behavioral Statistics, 33, 5 20.
A Marginal Maximum Likelihood Procedure for an IRT Model with Single-Peaked Response Functions
A Marginal Maximum Likelihood Procedure for an IRT Model with Single-Peaked Response Functions Cees A.W. Glas Oksana B. Korobko University of Twente, the Netherlands OMD Progress Report 07-01. Cees A.W.
More informationMonte Carlo Simulations for Rasch Model Tests
Monte Carlo Simulations for Rasch Model Tests Patrick Mair Vienna University of Economics Thomas Ledl University of Vienna Abstract: Sources of deviation from model fit in Rasch models can be lack of unidimensionality,
More informationPIRLS 2016 Achievement Scaling Methodology 1
CHAPTER 11 PIRLS 2016 Achievement Scaling Methodology 1 The PIRLS approach to scaling the achievement data, based on item response theory (IRT) scaling with marginal estimation, was developed originally
More informationA Comparison of Item-Fit Statistics for the Three-Parameter Logistic Model
A Comparison of Item-Fit Statistics for the Three-Parameter Logistic Model Cees A. W. Glas, University of Twente, the Netherlands Juan Carlos Suárez Falcón, Universidad Nacional de Educacion a Distancia,
More informationPairwise Parameter Estimation in Rasch Models
Pairwise Parameter Estimation in Rasch Models Aeilko H. Zwinderman University of Leiden Rasch model item parameters can be estimated consistently with a pseudo-likelihood method based on comparing responses
More informationFitting Multidimensional Latent Variable Models using an Efficient Laplace Approximation
Fitting Multidimensional Latent Variable Models using an Efficient Laplace Approximation Dimitris Rizopoulos Department of Biostatistics, Erasmus University Medical Center, the Netherlands d.rizopoulos@erasmusmc.nl
More informationComparison between conditional and marginal maximum likelihood for a class of item response models
(1/24) Comparison between conditional and marginal maximum likelihood for a class of item response models Francesco Bartolucci, University of Perugia (IT) Silvia Bacci, University of Perugia (IT) Claudia
More informationLesson 7: Item response theory models (part 2)
Lesson 7: Item response theory models (part 2) Patrícia Martinková Department of Statistical Modelling Institute of Computer Science, Czech Academy of Sciences Institute for Research and Development of
More informationDetermining the number of components in mixture models for hierarchical data
Determining the number of components in mixture models for hierarchical data Olga Lukočienė 1 and Jeroen K. Vermunt 2 1 Department of Methodology and Statistics, Tilburg University, P.O. Box 90153, 5000
More informationDevelopment and Calibration of an Item Response Model. that Incorporates Response Time
Development and Calibration of an Item Response Model that Incorporates Response Time Tianyou Wang and Bradley A. Hanson ACT, Inc. Send correspondence to: Tianyou Wang ACT, Inc P.O. Box 168 Iowa City,
More informationA DISSERTATION SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY. Yu-Feng Chang
A Restricted Bi-factor Model of Subdomain Relative Strengths and Weaknesses A DISSERTATION SUBMITTED TO THE FACULTY OF THE GRADUATE SCHOOL OF THE UNIVERSITY OF MINNESOTA BY Yu-Feng Chang IN PARTIAL FULFILLMENT
More informationHierarchical Linear Models. Jeff Gill. University of Florida
Hierarchical Linear Models Jeff Gill University of Florida I. ESSENTIAL DESCRIPTION OF HIERARCHICAL LINEAR MODELS II. SPECIAL CASES OF THE HLM III. THE GENERAL STRUCTURE OF THE HLM IV. ESTIMATION OF THE
More informationPreliminary Manual of the software program Multidimensional Item Response Theory (MIRT)
Preliminary Manual of the software program Multidimensional Item Response Theory (MIRT) July 7 th, 2010 Cees A. W. Glas Department of Research Methodology, Measurement, and Data Analysis Faculty of Behavioural
More informationA Markov chain Monte Carlo approach to confirmatory item factor analysis. Michael C. Edwards The Ohio State University
A Markov chain Monte Carlo approach to confirmatory item factor analysis Michael C. Edwards The Ohio State University An MCMC approach to CIFA Overview Motivating examples Intro to Item Response Theory
More informationStochastic Approximation Methods for Latent Regression Item Response Models
Research Report Stochastic Approximation Methods for Latent Regression Item Response Models Matthias von Davier Sandip Sinharay March 2009 ETS RR-09-09 Listening. Learning. Leading. Stochastic Approximation
More informationA Nonlinear Mixed Model Framework for Item Response Theory
Psychological Methods Copyright 2003 by the American Psychological Association, Inc. 2003, Vol. 8, No. 2, 185 205 1082-989X/03/$12.00 DOI: 10.1037/1082-989X.8.2.185 A Nonlinear Mixed Model Framework for
More informationA Note on the Equivalence Between Observed and Expected Information Functions With Polytomous IRT Models
Journal of Educational and Behavioral Statistics 2015, Vol. 40, No. 1, pp. 96 105 DOI: 10.3102/1076998614558122 # 2014 AERA. http://jebs.aera.net A Note on the Equivalence Between Observed and Expected
More informationStat 542: Item Response Theory Modeling Using The Extended Rank Likelihood
Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood Jonathan Gruhl March 18, 2010 1 Introduction Researchers commonly apply item response theory (IRT) models to binary and ordinal
More informationScaling Methodology and Procedures for the TIMSS Mathematics and Science Scales
Scaling Methodology and Procedures for the TIMSS Mathematics and Science Scales Kentaro Yamamoto Edward Kulick 14 Scaling Methodology and Procedures for the TIMSS Mathematics and Science Scales Kentaro
More informationComputerized Adaptive Testing With Equated Number-Correct Scoring
Computerized Adaptive Testing With Equated Number-Correct Scoring Wim J. van der Linden University of Twente A constrained computerized adaptive testing (CAT) algorithm is presented that can be used to
More informationMixed-Effects Logistic Regression Models for Indirectly Observed Discrete Outcome Variables
MULTIVARIATE BEHAVIORAL RESEARCH, 40(3), 28 30 Copyright 2005, Lawrence Erlbaum Associates, Inc. Mixed-Effects Logistic Regression Models for Indirectly Observed Discrete Outcome Variables Jeroen K. Vermunt
More informationMultilevel IRT using dichotomous and polytomous response data
145 British Journal of Mathematical and Statistical Psychology (2005), 58, 145 172 q 2005 The British Psychological Society The British Psychological Society www.bpsjournals.co.uk Multilevel IRT using
More information2 Bayesian Hierarchical Response Modeling
2 Bayesian Hierarchical Response Modeling In the first chapter, an introduction to Bayesian item response modeling was given. The Bayesian methodology requires careful specification of priors since item
More informationMixtures of Rasch Models
Mixtures of Rasch Models Hannah Frick, Friedrich Leisch, Achim Zeileis, Carolin Strobl http://www.uibk.ac.at/statistics/ Introduction Rasch model for measuring latent traits Model assumption: Item parameters
More informationWhats beyond Concerto: An introduction to the R package catr. Session 4: Overview of polytomous IRT models
Whats beyond Concerto: An introduction to the R package catr Session 4: Overview of polytomous IRT models The Psychometrics Centre, Cambridge, June 10th, 2014 2 Outline: 1. Introduction 2. General notations
More informationApplied Psychological Measurement 2001; 25; 283
Applied Psychological Measurement http://apm.sagepub.com The Use of Restricted Latent Class Models for Defining and Testing Nonparametric and Parametric Item Response Theory Models Jeroen K. Vermunt Applied
More informationThe Rasch Poisson Counts Model for Incomplete Data: An Application of the EM Algorithm
The Rasch Poisson Counts Model for Incomplete Data: An Application of the EM Algorithm Margo G. H. Jansen University of Groningen Rasch s Poisson counts model is a latent trait model for the situation
More informationItem Parameter Calibration of LSAT Items Using MCMC Approximation of Bayes Posterior Distributions
R U T C O R R E S E A R C H R E P O R T Item Parameter Calibration of LSAT Items Using MCMC Approximation of Bayes Posterior Distributions Douglas H. Jones a Mikhail Nediak b RRR 7-2, February, 2! " ##$%#&
More informationItem Response Theory (IRT) Analysis of Item Sets
University of Connecticut DigitalCommons@UConn NERA Conference Proceedings 2011 Northeastern Educational Research Association (NERA) Annual Conference Fall 10-21-2011 Item Response Theory (IRT) Analysis
More informationA multivariate multilevel model for the analysis of TIMMS & PIRLS data
A multivariate multilevel model for the analysis of TIMMS & PIRLS data European Congress of Methodology July 23-25, 2014 - Utrecht Leonardo Grilli 1, Fulvia Pennoni 2, Carla Rampichini 1, Isabella Romeo
More informationBayesian Nonparametric Rasch Modeling: Methods and Software
Bayesian Nonparametric Rasch Modeling: Methods and Software George Karabatsos University of Illinois-Chicago Keynote talk Friday May 2, 2014 (9:15-10am) Ohio River Valley Objective Measurement Seminar
More informationCenter for Advanced Studies in Measurement and Assessment. CASMA Research Report
Center for Advanced Studies in Measurement and Assessment CASMA Research Report Number 41 A Comparative Study of Item Response Theory Item Calibration Methods for the Two Parameter Logistic Model Kyung
More informationComputationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models
Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models Tihomir Asparouhov 1, Bengt Muthen 2 Muthen & Muthen 1 UCLA 2 Abstract Multilevel analysis often leads to modeling
More informationExtension of the NAEP BGROUP Program to Higher Dimensions
Research Report Extension of the NAEP Program to Higher Dimensions Sandip Sinharay Matthias von Davier Research & Development December 2005 RR-05-27 Extension of the NAEP Program to Higher Dimensions Sandip
More informationDetermining Sample Sizes for Surveys with Data Analyzed by Hierarchical Linear Models
Journal of Of cial Statistics, Vol. 14, No. 3, 1998, pp. 267±275 Determining Sample Sizes for Surveys with Data Analyzed by Hierarchical Linear Models Michael P. ohen 1 Behavioral and social data commonly
More informationLog-linear multidimensional Rasch model for capture-recapture
Log-linear multidimensional Rasch model for capture-recapture Elvira Pelle, University of Milano-Bicocca, e.pelle@campus.unimib.it David J. Hessen, Utrecht University, D.J.Hessen@uu.nl Peter G.M. Van der
More informationMultilevel Modeling: When and Why 1. 1 Why multilevel data need multilevel models
Multilevel Modeling: When and Why 1 J. Hox University of Amsterdam & Utrecht University Amsterdam/Utrecht, the Netherlands Abstract: Multilevel models have become popular for the analysis of a variety
More informationOn the Construction of Adjacent Categories Latent Trait Models from Binary Variables, Motivating Processes and the Interpretation of Parameters
Gerhard Tutz On the Construction of Adjacent Categories Latent Trait Models from Binary Variables, Motivating Processes and the Interpretation of Parameters Technical Report Number 218, 2018 Department
More informationChapter 11. Scaling the PIRLS 2006 Reading Assessment Data Overview
Chapter 11 Scaling the PIRLS 2006 Reading Assessment Data Pierre Foy, Joseph Galia, and Isaac Li 11.1 Overview PIRLS 2006 had ambitious goals for broad coverage of the reading purposes and processes as
More informationTilburg University. Mixed-effects logistic regression models for indirectly observed outcome variables Vermunt, Jeroen
Tilburg University Mixed-effects logistic regression models for indirectly observed outcome variables Vermunt, Jeroen Published in: Multivariate Behavioral Research Document version: Peer reviewed version
More informationarxiv: v1 [stat.ap] 11 Aug 2014
Noname manuscript No. (will be inserted by the editor) A multilevel finite mixture item response model to cluster examinees and schools Michela Gnaldi Silvia Bacci Francesco Bartolucci arxiv:1408.2319v1
More informationAPPENDICES TO Protest Movements and Citizen Discontent. Appendix A: Question Wordings
APPENDICES TO Protest Movements and Citizen Discontent Appendix A: Question Wordings IDEOLOGY: How would you describe your views on most political matters? Generally do you think of yourself as liberal,
More information36-720: The Rasch Model
36-720: The Rasch Model Brian Junker October 15, 2007 Multivariate Binary Response Data Rasch Model Rasch Marginal Likelihood as a GLMM Rasch Marginal Likelihood as a Log-Linear Model Example For more
More informationAn Overview of Item Response Theory. Michael C. Edwards, PhD
An Overview of Item Response Theory Michael C. Edwards, PhD Overview General overview of psychometrics Reliability and validity Different models and approaches Item response theory (IRT) Conceptual framework
More informationSummer School in Applied Psychometric Principles. Peterhouse College 13 th to 17 th September 2010
Summer School in Applied Psychometric Principles Peterhouse College 13 th to 17 th September 2010 1 Two- and three-parameter IRT models. Introducing models for polytomous data. Test information in IRT
More informationSome Issues In Markov Chain Monte Carlo Estimation For Item Response Theory
University of South Carolina Scholar Commons Theses and Dissertations 2016 Some Issues In Markov Chain Monte Carlo Estimation For Item Response Theory Han Kil Lee University of South Carolina Follow this
More informationIRT Model Selection Methods for Polytomous Items
IRT Model Selection Methods for Polytomous Items Taehoon Kang University of Wisconsin-Madison Allan S. Cohen University of Georgia Hyun Jung Sung University of Wisconsin-Madison March 11, 2005 Running
More informationLSAC RESEARCH REPORT SERIES. Law School Admission Council Research Report October 2005
LSAC RESEARCH REPORT SERIES A Hierarchical Framework for Modeling Speed and Accuracy on Test Items Wim J. van der Linden University of Twente, Enschede, The Netherlands Law School Admission Council Research
More informationTest Design and Speededness
Journal of Educational Measurement Spring 2011, Vol. 48, No. 1, pp. 44 60 Test Design and Speededness Wim J. van der Linden CTB/McGraw-Hill A critical component of test speededness is the distribution
More informationExploiting TIMSS and PIRLS combined data: multivariate multilevel modelling of student achievement
Exploiting TIMSS and PIRLS combined data: multivariate multilevel modelling of student achievement Second meeting of the FIRB 2012 project Mixture and latent variable models for causal-inference and analysis
More informationEquating Tests Under The Nominal Response Model Frank B. Baker
Equating Tests Under The Nominal Response Model Frank B. Baker University of Wisconsin Under item response theory, test equating involves finding the coefficients of a linear transformation of the metric
More informationPolytomous Item Explanatory IRT Models with Random Item Effects: An Application to Carbon Cycle Assessment Data
Polytomous Item Explanatory IRT Models with Random Item Effects: An Application to Carbon Cycle Assessment Data Jinho Kim and Mark Wilson University of California, Berkeley Presented on April 11, 2018
More informationBecause it might not make a big DIF: Assessing differential test functioning
Because it might not make a big DIF: Assessing differential test functioning David B. Flora R. Philip Chalmers Alyssa Counsell Department of Psychology, Quantitative Methods Area Differential item functioning
More informationItem Response Theory for Scores on Tests Including Polytomous Items with Ordered Responses
Item Response Theory for Scores on Tests Including Polytomous Items with Ordered Responses David Thissen, University of North Carolina at Chapel Hill Mary Pommerich, American College Testing Kathleen Billeaud,
More informationLocal response dependence and the Rasch factor model
Local response dependence and the Rasch factor model Dept. of Biostatistics, Univ. of Copenhagen Rasch6 Cape Town Uni-dimensional latent variable model X 1 TREATMENT δ T X 2 AGE δ A Θ X 3 X 4 Latent variable
More informationWhat is an Ordinal Latent Trait Model?
What is an Ordinal Latent Trait Model? Gerhard Tutz Ludwig-Maximilians-Universität München Akademiestraße 1, 80799 München February 19, 2019 arxiv:1902.06303v1 [stat.me] 17 Feb 2019 Abstract Although various
More informationMaking the Most of What We Have: A Practical Application of Multidimensional Item Response Theory in Test Scoring
Journal of Educational and Behavioral Statistics Fall 2005, Vol. 30, No. 3, pp. 295 311 Making the Most of What We Have: A Practical Application of Multidimensional Item Response Theory in Test Scoring
More informationCenter for Advanced Studies in Measurement and Assessment. CASMA Research Report
Center for Advanced Studies in Measurement and Assessment CASMA Research Report Number 24 in Relation to Measurement Error for Mixed Format Tests Jae-Chun Ban Won-Chan Lee February 2007 The authors are
More informationNovember 2002 STA Random Effects Selection in Linear Mixed Models
November 2002 STA216 1 Random Effects Selection in Linear Mixed Models November 2002 STA216 2 Introduction It is common practice in many applications to collect multiple measurements on a subject. Linear
More informationIRT linking methods for the bifactor model: a special case of the two-tier item factor analysis model
University of Iowa Iowa Research Online Theses and Dissertations Summer 2017 IRT linking methods for the bifactor model: a special case of the two-tier item factor analysis model Kyung Yong Kim University
More informationItem Response Theory and Computerized Adaptive Testing
Item Response Theory and Computerized Adaptive Testing Richard C. Gershon, PhD Department of Medical Social Sciences Feinberg School of Medicine Northwestern University gershon@northwestern.edu May 20,
More informationLINEAR MULTILEVEL MODELS. Data are often hierarchical. By this we mean that data contain information
LINEAR MULTILEVEL MODELS JAN DE LEEUW ABSTRACT. This is an entry for The Encyclopedia of Statistics in Behavioral Science, to be published by Wiley in 2005. 1. HIERARCHICAL DATA Data are often hierarchical.
More informationOverview. Multidimensional Item Response Theory. Lecture #12 ICPSR Item Response Theory Workshop. Basics of MIRT Assumptions Models Applications
Multidimensional Item Response Theory Lecture #12 ICPSR Item Response Theory Workshop Lecture #12: 1of 33 Overview Basics of MIRT Assumptions Models Applications Guidance about estimating MIRT Lecture
More information1 if response to item i is in category j 0 otherwise (1)
7 Scaling Methodology and Procedures for the Mathematics and Science Literacy, Advanced Mathematics, and Physics Scales Greg Macaskill Raymond J. Adams Margaret L. Wu Australian Council for Educational
More informationJoint Modeling of Longitudinal Item Response Data and Survival
Joint Modeling of Longitudinal Item Response Data and Survival Jean-Paul Fox University of Twente Department of Research Methodology, Measurement and Data Analysis Faculty of Behavioural Sciences Enschede,
More informationMultilevel Statistical Models: 3 rd edition, 2003 Contents
Multilevel Statistical Models: 3 rd edition, 2003 Contents Preface Acknowledgements Notation Two and three level models. A general classification notation and diagram Glossary Chapter 1 An introduction
More informationIntroduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation. EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016
Introduction to Bayesian Statistics and Markov Chain Monte Carlo Estimation EPSY 905: Multivariate Analysis Spring 2016 Lecture #10: April 6, 2016 EPSY 905: Intro to Bayesian and MCMC Today s Class An
More informationAssessment of fit of item response theory models used in large scale educational survey assessments
DOI 10.1186/s40536-016-0025-3 RESEARCH Open Access Assessment of fit of item response theory models used in large scale educational survey assessments Peter W. van Rijn 1, Sandip Sinharay 2*, Shelby J.
More informationAppendix: Modeling Approach
AFFECTIVE PRIMACY IN INTRAORGANIZATIONAL TASK NETWORKS Appendix: Modeling Approach There is now a significant and developing literature on Bayesian methods in social network analysis. See, for instance,
More informationRandom Intercept Models
Random Intercept Models Edps/Psych/Soc 589 Carolyn J. Anderson Department of Educational Psychology c Board of Trustees, University of Illinois Spring 2019 Outline A very simple case of a random intercept
More informationPart 8: GLMs and Hierarchical LMs and GLMs
Part 8: GLMs and Hierarchical LMs and GLMs 1 Example: Song sparrow reproductive success Arcese et al., (1992) provide data on a sample from a population of 52 female song sparrows studied over the course
More informationHANNEKE GEERLINGS AND CEES A.W. GLAS WIM J. VAN DER LINDEN MODELING RULE-BASED ITEM GENERATION. 1. Introduction
PSYCHOMETRIKA VOL. 76, NO. 2, 337 359 APRIL 2011 DOI: 10.1007/S11336-011-9204-X MODELING RULE-BASED ITEM GENERATION HANNEKE GEERLINGS AND CEES A.W. GLAS UNIVERSITY OF TWENTE WIM J. VAN DER LINDEN CTB/MCGRAW-HILL
More informationML estimation: Random-intercepts logistic model. and z
ML estimation: Random-intercepts logistic model log p ij 1 p = x ijβ + υ i with υ i N(0, συ) 2 ij Standardizing the random effect, θ i = υ i /σ υ, yields log p ij 1 p = x ij β + σ υθ i with θ i N(0, 1)
More informationAnders Skrondal. Norwegian Institute of Public Health London School of Hygiene and Tropical Medicine. Based on joint work with Sophia Rabe-Hesketh
Constructing Latent Variable Models using Composite Links Anders Skrondal Norwegian Institute of Public Health London School of Hygiene and Tropical Medicine Based on joint work with Sophia Rabe-Hesketh
More informationA Simulation Study to Compare CAT Strategies for Cognitive Diagnosis
A Simulation Study to Compare CAT Strategies for Cognitive Diagnosis Xueli Xu Department of Statistics,University of Illinois Hua-Hua Chang Department of Educational Psychology,University of Texas Jeff
More informationIntroducing Generalized Linear Models: Logistic Regression
Ron Heck, Summer 2012 Seminars 1 Multilevel Regression Models and Their Applications Seminar Introducing Generalized Linear Models: Logistic Regression The generalized linear model (GLM) represents and
More informationEstimation of item location effects by means of the generalized logistic regression model: a simulation study and an application
Psychology Science Quarterly, Volume 50, 2008 (1), p. 64-74 Estimation of item location effects by means of the generalized logistic regression model: a simulation study and an application RAINER ALEXANDROWICZ
More informationLecture 10. Announcement. Mixture Models II. Topics of This Lecture. This Lecture: Advanced Machine Learning. Recap: GMMs as Latent Variable Models
Advanced Machine Learning Lecture 10 Mixture Models II 30.11.2015 Bastian Leibe RWTH Aachen http://www.vision.rwth-aachen.de/ Announcement Exercise sheet 2 online Sampling Rejection Sampling Importance
More informationRon Heck, Fall Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October 20, 2011)
Ron Heck, Fall 2011 1 EDEP 768E: Seminar in Multilevel Modeling rev. January 3, 2012 (see footnote) Week 8: Introducing Generalized Linear Models: Logistic Regression 1 (Replaces prior revision dated October
More informationModel Estimation Example
Ronald H. Heck 1 EDEP 606: Multivariate Methods (S2013) April 7, 2013 Model Estimation Example As we have moved through the course this semester, we have encountered the concept of model estimation. Discussions
More informationClass Notes: Week 8. Probit versus Logit Link Functions and Count Data
Ronald Heck Class Notes: Week 8 1 Class Notes: Week 8 Probit versus Logit Link Functions and Count Data This week we ll take up a couple of issues. The first is working with a probit link function. While
More informationConditional maximum likelihood estimation in polytomous Rasch models using SAS
Conditional maximum likelihood estimation in polytomous Rasch models using SAS Karl Bang Christensen kach@sund.ku.dk Department of Biostatistics, University of Copenhagen November 29, 2012 Abstract IRT
More informationIntroduction to lnmle: An R Package for Marginally Specified Logistic-Normal Models for Longitudinal Binary Data
Introduction to lnmle: An R Package for Marginally Specified Logistic-Normal Models for Longitudinal Binary Data Bryan A. Comstock and Patrick J. Heagerty Department of Biostatistics University of Washington
More informationLogistic regression analysis with multidimensional random effects: A comparison of three approaches
Quality & Quantity manuscript No. (will be inserted by the editor) Logistic regression analysis with multidimensional random effects: A comparison of three approaches Olga Lukočienė Jeroen K. Vermunt Received:
More informationRon Heck, Fall Week 3: Notes Building a Two-Level Model
Ron Heck, Fall 2011 1 EDEP 768E: Seminar on Multilevel Modeling rev. 9/6/2011@11:27pm Week 3: Notes Building a Two-Level Model We will build a model to explain student math achievement using student-level
More informationEfficient Analysis of Mixed Hierarchical and Cross-Classified Random Structures Using a Multilevel Model
Efficient Analysis of Mixed Hierarchical and Cross-Classified Random Structures Using a Multilevel Model Jon Rasbash; Harvey Goldstein Journal of Educational and Behavioral Statistics, Vol. 19, No. 4.
More informationMultidimensional Linking for Tests with Mixed Item Types
Journal of Educational Measurement Summer 2009, Vol. 46, No. 2, pp. 177 197 Multidimensional Linking for Tests with Mixed Item Types Lihua Yao 1 Defense Manpower Data Center Keith Boughton CTB/McGraw-Hill
More informationResearch Design: Topic 18 Hierarchical Linear Modeling (Measures within Persons) 2010 R.C. Gardner, Ph.d.
Research Design: Topic 8 Hierarchical Linear Modeling (Measures within Persons) R.C. Gardner, Ph.d. General Rationale, Purpose, and Applications Linear Growth Models HLM can also be used with repeated
More informationJournal of Statistical Software
JSS Journal of Statistical Software April 2008, Volume 25, Issue 8. http://www.jstatsoft.org/ Markov Chain Monte Carlo Estimation of Normal Ogive IRT Models in MATLAB Yanyan Sheng Southern Illinois University-Carbondale
More informationPerformance of Likelihood-Based Estimation Methods for Multilevel Binary Regression Models
Performance of Likelihood-Based Estimation Methods for Multilevel Binary Regression Models Marc Callens and Christophe Croux 1 K.U. Leuven Abstract: By means of a fractional factorial simulation experiment,
More informationPartitioning variation in multilevel models.
Partitioning variation in multilevel models. by Harvey Goldstein, William Browne and Jon Rasbash Institute of Education, London, UK. Summary. In multilevel modelling, the residual variation in a response
More informationA Use of the Information Function in Tailored Testing
A Use of the Information Function in Tailored Testing Fumiko Samejima University of Tennessee for indi- Several important and useful implications in latent trait theory, with direct implications vidualized
More informationUtilizing Hierarchical Linear Modeling in Evaluation: Concepts and Applications
Utilizing Hierarchical Linear Modeling in Evaluation: Concepts and Applications C.J. McKinney, M.A. Antonio Olmos, Ph.D. Kate DeRoche, M.A. Mental Health Center of Denver Evaluation 2007: Evaluation and
More informationApplication of Plausible Values of Latent Variables to Analyzing BSI-18 Factors. Jichuan Wang, Ph.D
Application of Plausible Values of Latent Variables to Analyzing BSI-18 Factors Jichuan Wang, Ph.D Children s National Health System The George Washington University School of Medicine Washington, DC 1
More informationJoint Assessment of the Differential Item Functioning. and Latent Trait Dimensionality. of Students National Tests
Joint Assessment of the Differential Item Functioning and Latent Trait Dimensionality of Students National Tests arxiv:1212.0378v1 [stat.ap] 3 Dec 2012 Michela Gnaldi, Francesco Bartolucci, Silvia Bacci
More informationEstimating the Hausman test for multilevel Rasch model. Kingsley E. Agho School of Public health Faculty of Medicine The University of Sydney
Estimating the Hausman test for multilevel Rasch model Kingsley E. Agho School of Public health Faculty of Medicine The University of Sydney James A. Athanasou Faculty of Education University of Technology,
More informationThe Application and Promise of Hierarchical Linear Modeling (HLM) in Studying First-Year Student Programs
The Application and Promise of Hierarchical Linear Modeling (HLM) in Studying First-Year Student Programs Chad S. Briggs, Kathie Lorentz & Eric Davis Education & Outreach University Housing Southern Illinois
More informationItem Selection and Hypothesis Testing for the Adaptive Measurement of Change
Item Selection and Hypothesis Testing for the Adaptive Measurement of Change Matthew Finkelman Tufts University School of Dental Medicine David J. Weiss University of Minnesota Gyenam Kim-Kang Korea Nazarene
More informationContents. 3 Evaluating Manifest Monotonicity Using Bayes Factors Introduction... 44
Contents 1 Introduction 4 1.1 Measuring Latent Attributes................. 4 1.2 Assumptions in Item Response Theory............ 6 1.2.1 Local Independence.................. 6 1.2.2 Unidimensionality...................
More informationStreamlining Missing Data Analysis by Aggregating Multiple Imputations at the Data Level
Streamlining Missing Data Analysis by Aggregating Multiple Imputations at the Data Level A Monte Carlo Simulation to Test the Tenability of the SuperMatrix Approach Kyle M Lang Quantitative Psychology
More information