Meiser et. al.: Latent Change in Discrete Data 76 as new perspectives in the application of test models to social science issues (e.g., Fischer & Mole

Size: px
Start display at page:

Download "Meiser et. al.: Latent Change in Discrete Data 76 as new perspectives in the application of test models to social science issues (e.g., Fischer & Mole"

Transcription

1 Methods of Psychological Research Online 1998, Vol.3, No.2 Internet: Latent Change in Discrete Data: Unidimensional, Multidimensional, and Mixture Distribution Rasch Models for the Analysis of Repeated Observations Thorsten Meiser Psychologisches Institut der Universität Bonn Elsbeth Stern Max-Planck-Institut für Bildungsforschung Berlin Rolf Langeheine Institut für die Pädagogik der Naturwissenschaften an der Universität Kiel Abstract A survey of unidimensional, multidimensional, and mixture distribution Rasch models is presented with a particular focus on model applications for the analysis of change in repeated measures designs. A mover-stayer mixed Rasch model is specified for modeling global change in one of two latent subpopulations and for modeling stability in the other latent subpopulation. The application of unidimensional, multidimensional, and mixture distribution Rasch models for the analysis of change is illustrated using data on the development of understanding and solving arithmetic word problems in elementary school children. Keywords: Rasch model, measurement of change, finite mixture distributions Zusammenfassung Eine Übersicht über eindimensionale, mehrdimensionale und Mischverteilungs- Rasch-Modelle wird vorgestellt unter besonderer Berücksichtigung der Modellanwendung zur Analyse von Veränderungen in Designs mit Meßwiederholungen. Ein Mover-Stayer-Mischverteilungs-Rasch-Modell wird formuliert zur Modellierung globaler Veränderung in einer von zwei latenten Subpopulationen und zur Modellierung von Stabilität in der anderen latenten Subpopulation. Die Anwendung von eindimensionalen, mehrdimensionalen und Mischverteilungs-Rasch-Modellen zur Analyse von Veränderungen wird anhand von Daten zur Entwicklung des Verstehens und Lösens mathematischer Textaufgaben bei Grundschulkindern illustriert. Schlüsselwörter: Rasch-Modell, Veränderungsmessung, finite Mischverteilungen The class of psychometric models presented by Georg Rasch (1960/1980, 1968) has gained considerable interest and has stimulated an impressive amount of research on statistical models in the social and behavioral sciences. The impact of Rasch's work on modern test theory is documented in several monographs which summarize current developments in the mathematical modeling of test data as well

2 Meiser et. al.: Latent Change in Discrete Data 76 as new perspectives in the application of test models to social science issues (e.g., Fischer & Molenaar, 1995; Langeheine & Rost, 1988; Rost & Langeheine, 1997; van der Linden & Hambleton, 1997). The present article is devoted to the application of Rasch models to longitudinal test data which comprise repeated observations of the same items and the same sample of individuals at different occasions. For this purpose, the unidimensional Rasch model and some recent extensions, such as multidimensional Rasch models, mixture distribution Rasch models, and submodels thereof, are outlined in the next three sections with a particular focus on the analysis of change. With respect to the loglinear representation of unidimensional and multidimensional Rasch models, a model hierarchy is pointed out which facilitates testing the assumption of homogeneity of change across individuals. In the context of mixture distribution Rasch models, a mover-stayer model is presented which allows an a-priori specification of different patterns of change for different latent subpopulations. Throughout the text, the models are presented in terms of polytomous item response models which include models for dichotomous items as special cases. In regard to polytomous Rasch models, two contradictory views concerning the ordering of threshold difficulties are described and a mediating view is suggested. In the fourth section, the analysis of change by means of unidimensional, multidimensional, and mixture distribution Rasch models is illustrated using longitudinal data on the development of understanding and solving arithmetic word problems in elementary school children. Thereby the fourth section extends a previous analysis of these data which was based on a latent class state-mastery model embedded in latent Markov chain models (Langeheine, Stern, & van de Pol, 1994). 1 Unidimensional Rasch Models Unidimensional Rasch models for polytomous items with ordered response categories x = 0; :::; m i can be derived by the appropriate parameterization of the threshold probabilities. The threshold probability of item i, i 2f1; :::; Ig, threshold x, x 2f1; :::; m i g, and person v, v 2f1; :::; Ng, is defined as the conditional probability of person v responding with response category x in item i, given that person v responds with either category x 1orx: fl vix = p(x vi = x) p(x vi = x 1) + p(x vi = x) : (1) For unidimensional Rasch models, the threshold probabilities fl vix are parameterized in terms of the logistic function, where the argument of the function is the difference between a person parameter v and a threshold parameter fi ix : fl vix = exp( v fi ix ) 1 + exp( v fi ix ) : (2) The parameter reflects the latent ability or attitude of person v, whereas the parameter fi ix reflects the difficulty of threshold x of item i. From Equation (2), the probability of response category x for item i and individual v can be derived by a recursive formula: = fl vix p(x vi = x) = p(x vi = x 1) (3) 1 fl vix = ::: P x exp (x v s=1 fi is) P mi y=0 exp (y v P y s=1 fi is)

3 Meiser et. al.: Latent Change in Discrete Data 77 (Andrich, 1978; Masters, 1982; Rost, 1988) with P 0 s=1 fi is := 0. The probability of response vector X containing the responses to a given set of I items, X= (X 1 ; :::; X I ), results from multiplying the probabilities of the single item responses, that is multiplying Equation (3) over i: p (X v =(x 1 ; :::; x I )) = exp Q I i=1 P mi t v P I i=1 P xi s=1 fi is P y=0 exp (y y v s=1 fi is) ; (4) where t = P I i=1 x i denotes the total score of the item responses. Equation (4) rests on the assumption of local independence, that is stochastic independence of responses conditional on the person and threshold parameters. Several special cases of the general unidimensional Rasch model for polytomous items, which is also called the partial credit model (PCM; Masters, 1982; Masters &Wright, 1997), can be specified by restrictions on the threshold parameters fi ix. Equality constraints on the differences between threshold parameters across items, fi ix = ff i + ψ x with mx s=1 ψ s =0; (5) result in the well-known rating scale model (RSM; Andrich, 1978). Equality constraints on the differences between adjacent threshold parameters within each item, result in the dispersion model (Andrich, 1982). fi ix = ff i +(x (m i +1)=2) ffi i ; (6) 1.1 Interpretation of Threshold Parameters Since the parameterization of the threshold probabilities in Equation (2) is equivalent to the unidimensional Rasch model for dichotomous items, the threshold parameters fi ix have the same interpretation as the item parameters in the dichotomous Rasch model. In particular, the following relations hold: ffl v = fi ix, fl vix = :5, ffl fi ix is the turning point ofthethreshold characteristic curve, and ffl v = fi ix, p(x vi = x 1) = p(x vi = x). A controversial issue concerns the question whether the claim of ordered response categories implies a corresponding order on the threshold parameters within each item, that is: does the assumption that response category x of item i reflects a higher amount of latent ability or attitude than response category x 1, for x =1; :::; m i, necessarily imply the order relation fi i1 <fi i2 < ::: <fi imi? In recent publications, Masters and Wright (1997) and Andrich, de Jong, and Sheridan (1997) elaborated their contradictory views on this issue. Masters and Wright (1997) pointed out that each threshold parameter refers to the comparison of a single pair of adjacent categories (see Equation (2)) and that the parameters can therefore have any order: Because each item parameter [i.e., threshold parameter in the present terminology] in the PCM is defined locally with respect to just two adjacent categories (rather than taking into account all categories simultaneously), the item parameters in the model can take any order. (Masters & Wright, 1997, p. 105)

4 Meiser et. al.: Latent Change in Discrete Data 78 An example for an achievement item with four different response categories, one for entirely incorrect responses, two for partial solutions, and one for the correct solution, is the math item p 5 1 =? 4 The solution of the item requires three consecutive steps: = 4, 2. p 4 = 2, 3. 2=4 = 0:5. If no step is taken successfully, the response is entirely incorrect (ignoring the possibility of guessing the solution) and scored as x = 0. Finishing only step 1 or finishing steps 1 and 2 leads to the interim results 4" or 2" respectively, scored as categories x = 1 and x =2. Finishing all steps successfully leads to the correct solution 0.5" which is scored as category x = 3. Since threshold 2 (i.e., scoring 2, rather than 1) requires taking a square root and therefore may beregarded as more difficult than threshold 3 (i.e., scoring 3, rather than 2) which involves solving a simple fraction, the natural order of threshold parameters for this task would be fi i1 <fi i3 <fi i2. Thus, the order of threshold parameters does not correspond to the order of response categories x. However, this does not violate the basic assumption that response category x = 3 reflects more effort or ability than does category x = 2, unless the division required in step 3 is considered to be trivial. In contrast to the view of Masters and Wright (1997), Andrich et al. (1997) emphasized that the effects of the threshold parameters are not confined to adjacent categories and that each category probability ofitem i depends on the entire set of threshold parameters of that item (see Equation (3)). In particular, the authors regard the ascending order of threshold parameters as an implication of the hypothesis of ordered response categories and, as a consequence, as a criterion for model evaluation: The ordering of the thresholds which divide the latent unidimensional continuum into categories itself is a hypothesis about the data which is embedded in the model. Although formal statistical tests of fit could be constructed to test that the empirical ordering is not consistent with the intended ordering, reversed threshold estimates is sufficient evidence to conclude that the empirical ordering is not consistent with the intended ordering. (Andrich et al., 1997, p. 68) The rationale behind this view is based on the presumed process of generating a response to an item in terms of integrating single responses to each of the thresholds (Andrich, 1978, 1985): considering the possible patterns of responses to the two thresholds of a trichotomous item, for instance, yields the sample space Ω = f(0; 0); (1; 0); (0; 1); (1; 1)g, where (0,0) means that neither of the thresholds is passed, (1,0) means that only the first threshold is passed, etc. The requirementtointegrate the threshold responses to an unique response on the item reduces the sample space of the item to Ω 0 = f0 =(0; 0); 1=(1; 0); 2=(1; 1)g so that the pattern (0; 1) in Ω has to be dropped from the set of available response pairs. As a consequence, the assumption of ordered response categories implies a Guttman structure on the threshold responses within each item reflecting the order fi i1 <fi i2 < ::: <fi imi. This brief reflection of the contradictory views reveals that the hypotheses about the response generating processes are crucial to the application of a psychometric

5 Meiser et. al.: Latent Change in Discrete Data 79 model and should be made explicit and underpinned by substantial theory. As a mediating view on the issue, we suggest that violations of the Guttman structure in the ordering of threshold parameters may be accepted for achievement items, if the cognitive processes involved in the single steps of a task are well specified and if an additional cognitive operation, resulting in a higher response category, indicates some degree of effort or ability on the latent continuum to be measured (cf. Masters & Wright, 1997). In contrast, applications of Rasch models to attitude items may well include the assumption of ascending threshold difficulties, because individuals have to select the appropriate category by taking into consideration the whole set of available categories at once (cf. Andrich et al., 1997). Hence, for attitude items the ordering of the response categories is a hypothesis which can be tested by means of the ordering of threshold difficulties. 1.2 Linear Logistic Test Models for Measuring Change Fischer and Parzer (1991) and Fischer and Ponocny (1994) introduced polytomous Rasch models with linear constraints on the threshold parameters which allow measuring change and (quasi-) experimental treatment effects in repeated observations (see Fischer& Ponocny, 1995, for an overview). These linearly restricted polytomous Rasch models are generalizations of the linear logistic test model (LLTM) for the assessment of change with dichotomous items (Fischer, 1983, 1995; Fischer & Formann, 1982). As before, only the polytomous case is considered here, because it encompasses the application to dichotomous items as a special case. If a set of I items is assessed at two measurement occasions T 1 and T 2, the resulting set of 2 I items can be divided into the subset of I items 1; :::; I observed at T 1 and a further subset of I virtually new items I +1; :::; 2I observed at T 2. In the linearly restricted PCM (Fischer& Ponocny, 1994), the thresholds of the first I items are parameterized in terms of Equation (2), while the threshold parameters of the virtually new items I + 1; :::; 2I are decomposed into the initial threshold parameters and a set of J treatment effects: fi (I+i)x = fi ix JX j=1 q j j (7) In Equation (7), q j denotes the dosage of treatment j, which may differ between experimental groups, and j denotes the effect of treatment j. By means of replacing the effect parameter j by an item-specific effect parameter ji, the assumption of constant treatment effects across items is dropped. Analogous linear decompositions have been proposed for the parameters of the RSM (Fischer & Parzer, 1991). Methods of conditional maximum-likelihood (CML) estimation of the parameters in the linearly restricted PCM and RSM are presented by Fischer and Parzer (1991) and Fischer and Ponocny (1994, 1995). If no experimental treatments are applied, development occuring in the interval from T 1 to T 2 can be measured by setting J = 1 and q 1 =1. Then 1 reflects the individuals' global amountofchange on the latent continuum (cf. Langeheine, 1993; Rost & Spada, 1983; Spada & McGaw, 1985). Generalizations to more than two measurement occasions are straightforward. The linear logistic test models for dichotomous and polytomous items maintain the assumption of unidimensionality across items. Another family of models, called linear logistic test models with relaxed assumptions (LLRA), abandons this condition (Fischer, 1983, 1995; Fischer & Formann, 1982). In LLRA models, each of the items 1; :::; I may measure a trait of its own, although all items are presumed to assess the same kind and amount of change. The relaxation is accomplished by replacing the difference of the person and threshold parameter, v fi ix, by a

6 Meiser et. al.: Latent Change in Discrete Data 80 joint parameter which denotes the interaction of both, vix Λ. For the virtual items I +1; :::; 2I, the joint term Λ v(i+i)x is decomposed into the initial parameter Λ vix plus a linear combination of change parameters as in the LLTM (see Equation (7)). While LLRA models are multidimensional in nature, they do not contain explicit hypotheses about the latent dimensions underlying the responses to any particular item. Alternative multidimensional Rasch models which allow specifying the latent structure of each threshold are presented in the next section. 2 Multidimensional Rasch Models If it is assumed that the observed responses are affected by more than one latent trait, the threshold probabilities can be parameterized in terms of the logistic function where the argument involves the sum over several latent dimensions d, d =1; :::; D: PD exp d=1 w ixd( vd fi ixd ) fl vix = PD : (8) 1 + exp d=1 w ixd( vd fi ixd ) The weights w ixd 2 f0; 1g specify whether passing threshold x of item i involves latent trait d or not (for details, see Meiser, 1996). The category probabilities can be derived from Equation (8) by the recursive formula used in Equation (3): p(x vi = x) = exp PD d=1 P x s=1 w isd vd P D d=1 P mi y=0 exp P D d=1 P y s=1 w isd vd P D d=1 P x s=1 w isdfi isd P y s=1 w isdfi isd (9) with fi isd = 0 for w isd = 0. As in the unidimensional case, the distribution of response vector X results from multiplication over i: p (X v =(x 1 ; :::; x I )) = PD exp P d=1 t D P I P xi d vd d=1 i=1 Q I i=1 P mi y=0 exp P D d=1 P y s=1 w isd vd P D d=1 s=1 w isdfi isd P y s=1 w isdfi isd ; (10) P I P where t d = xi i=1 s=1 w isd denotes the total score related to latent dimension d, that is the number of thresholds passed involving this dimension. Special cases can be derived from the general multidimensional Rasch model by appropriate specifications of the weights w ixd. The so-called multidimensional partial credit model in which each threshold x of items 1 to I refers to a latent dimension of its own (Kelderman, 1993, 1996; Kelderman & Rijkes, 1994) results from w ixd = ( 1 for x = d; 0 for x 6= d: (11) By this specification, Equation (9) is reduced to: p(x vi = x) = exp (P P x d=1 x vd P mi y=0 (P y d=1 vd P y d=1 fi id) d=1 fi id) : (12)

7 Meiser et. al.: Latent Change in Discrete Data 81 Recently it has been shown that the multidimensional partial credit model for ordered response categories is equivalent to Rasch's traditional multidimensional model for categorical responses with category probabilities p(x vi = x) = exp ( vx fi ix ) P mi y=0 ( vy fi iy ) (13) (cf. Andersen, 1995; Roskam, 1996). Actually, reparameterizing Model (12) in terms of Λ vx = xx d=1 yields Model (13) (Kelderman, 1997). vd and fi Λ ix = xx d=1 fi id (14) 2.1 Loglinear Representation of Multidimensional Rasch Models Loglinear representations of unidimensional Rasch models (e.g. Cressie & Holland, 1983; Kelderman, 1984; Thissen & Mooney, 1989; Tjur, 1982) and of multidimensional Rasch models (e. g. Kelderman & Rijkes, 1994; Meiser, 1996) provide a convenient framework for specifying and testing hypotheses about the latent space underlying the observed responses and about the structure of threshold parameters. The general multidimensional Rasch model of Equation (10) can be rewritten in terms of the loglinear model ln p (X =(x 1 ; :::; x I )) = u DX IX Xx i d=1 i=1 s=1 w isd fi isd + u (t1;:::;t D) ; (15) where (t 1 ; :::; t D ) is the vector of total scores referring to the latent dimensions 1 to D. To achieve identifiability, several constraints have to be imposed on the parameters of Equation (15) (cf. Kelderman & Rijkes, P 1994; P Meiser, 1996). A common set of constraints is fi isd =0for w isd =0, t 1 ::: t D u (t1;:::;t P D) =0,and the restriction of centered scales for the latent dimensions, that is I P mi i=1 s=1 fi isd =0 for d =1; :::; D. Thereby, the model for the distribution of the total score vector (t 1 ; :::; t D ) is saturated in Equation (15), and maximum-likelihood estimation of the parameters in the loglinear model yield CML estimates of the threshold parameters. The unidimensional Rasch model, the multidimensional partial credit model, and models with several unidimensional scales can be derived from Model (15) by appropriate selections of the design matrix in the nonstandard loglinear modeling approach (cf. Langeheine, 1983; Rindskopf, 1990, 1992). 2.2 Loglinear Rasch Models for the Analysis of Change Consider again the case of two measurement occasions T 1 and T 2 with a set of items 1; :::; I observed at T 1 and with the corresponding set of virtually new items I +1; :::; 2I observed at T 2. The above-mentioned special case of the linearly restricted PCM (Fischer& Ponocny, 1994), namely the unidimensional Rasch model of global change which is independent from the thresholds, the items, and the person parameters, results from Model (15) by the specifications D =1,w is1 = 1 for all thresholds, and fi (I+i)s = fi is : ln p (X =(x 1 ; :::; x I ;x I+1 ; :::; x 2I )) =

8 Meiser et. al.: Latent Change in Discrete Data 82 u IX Xx i i=1 s=1 fi is IX i=1 x X (I+i) s=1 fi is + IX i=1 x X (I+i) s=1 + u t (16) P P I with t = i=1 x I i + i=1 x (I+i). The statistical comparison of Model (16) to the model of perfect stability which results from the restriction = 0 allows testing for the occurrence of change from T 1 to T 2. Model (16) can easily be extended to measuring global change in several latent dimensions or to designs with more than two measurement occasions (see Meiser, 1996). While the unidimensional Rasch model for repeated observations is based on the assumption of homogeneity ofchange across individuals, the two-dimensional Rasch model with latent trait d = 1 affecting only responses at T 1, that is responses to the items 1; :::; I, and latent trait d = 2 affecting only responses at T 2, that is responses to the virtually new items I +1; :::; 2I, ln p (X =(x 1 ; :::; x I ;x I+1 ; :::; x 2I )) = u IX Xx i i=1 s=1 fi is IX i=1 x X (I+i) s=1 fi is + u (t1;t 2) (17) P P I with t 1 = i=1 x I i and t 2 = i=1 x (I+i), permits person-specific change (Meiser, 1996; see also Duncan, 1985a, 1985b). In Equation (17), the threshold parameters are specified to be invariantover time. This invariance mirrors the assumptions that latent change is confined to the individuals' total scores, which are the sufficient statistics of the latent person parameters vd, and that the structure of the items and response categories remains unchanged. Note that the unidimensional Rasch model of global change is a submodel of the two-dimensional Rasch model of person-specific change: Equation (16) results from Equation (17) by the restrictions u (t1;t 2) = u t + t 2 with t = t 1 + t 2. Hence, the two models can be compared by means of the conditional log-likelihood ratio statistic G 2 in order to test for homogeneity ofchange. In Equation (17), each of the item sets 1; :::; I and I +1; :::; 2I is specified to be unidimensional. Therefore, multidimensionality comes into play only by admitting person-specific change between the two measurement occasions which both refer to a unidimensional Rasch model. This contrasts with the LLRA which abandons the assumption of unidimensionality per measurement occasions as discussed above. 3 Mixture Distribution Rasch Models The essential assumption of the Rasch model that the threshold parameters are homogeneous across individuals can be relaxed by use of finite mixture distribution models (Everitt & Hand, 1981; Rost & Erdfelder, 1996; Titterington, Smith, & Makov, 1985). In finite mixture distribution models, the probability of response vector X is characterized by a set of component probability functions p(x j c) which are conditional on latent subpopulations c, c =1; :::; C, and by the distribution of the latent subpopulations: p(x) = CX c=1 ß c p(x j c); (18) where ß c denotes the probability of subpopulation c. If the component distributions of a mixture multinomial distribution are specified in terms of the unidimensional Rasch model (see Equation (4)), the mixed

9 Meiser et. al.: Latent Change in Discrete Data 83 Rasch model results: p(x v =(x 1 ; :::; x I )) = CX Q exp I P mi i=1 ß c 0 t vjc P I i=1 P xi s=1 fi isjc y=0 exp y vjc P y s=1 fi isjc 1 A (19) (Rost, 1990, 1991; von Davier & Rost, 1995). In the mixed Rasch model, homogeneity of threshold parameters is specified within each of the latent subpopulations, while the parameters may differ between latent subpopulations. Parameter estimation for mixture distribution Rasch models via the EM algorithm is described by Rost (1990, 1991) and von Davier and Rost (1995). 3.1 A Mover-Stayer Mixed Rasch Model for Repeated Observations Recently, mixed Rasch models were applied to longitudinal data in order to separate different patterns of change in an exploratory manner (Glück & Spiel, 1997; Meiser, Hein-Eggers, Rompe, & Rudinger, 1995; Meiser & Rudinger, 1997). Here we want to focus on a special case of mixed Rasch models for a more confirmatory analysis of longitudinal data, namely on a mover-stayer model encompassing a subpopulation c = 1 of global change and another subpopulation c = 2 in which nochange occurs. In the case of two measurement occasions with items i =1; :::; I observed at T 1 and virtually new items i = I +1; :::; 2I observed at T 2, the model of global change in subpopulation c = 1 and of no change in subpopulation c = 2 is specified by the restrictions fi (I+i)sj1 = fi isj1 and fi (I+i)sj2 = fi isj2 (20) on the threshold parameters of the component distributions in Model (19). By the additional restriction fi isj1 = fi isj2 = fi is (21) identical initial threshold parameters are specified for both subpopulations so that differences between the latent subpopulations are confined to potential differences in the distribution of the person parameter and to the a priori specified difference in the pattern of change, that is global change in subpopulation c =1versus no change in subpopulation c = 2. Formally, the mover-stayer mixed Rasch model with restriction (21) is related to the Saltus model (Wilson, 1989) for the analysis of discontinuous development by cross-sectional data. The generalization of the mover-stayer mixed Rasch model to more than two measurement occasions is straightforward. 4 Development of Understanding and Solving Arithmetic Word Problems in Elementary School Children The data of the present analysis were gathered in the longitudinal study SCHOLASTIK by the Max-Planck-Institute of Psychological Research in Munich, Germany (Weinert & Helmke, 1997). In this study a sample of 1,453 children from 54 elementary school classes were repeatedly presented with achievement measures on mathematics, science, and the mother tongue from first grade to fourth grade. In this paper we only focus on a small part of the collected data: certain arithmetic word problems presented in the second and third grade are discussed.

10 Meiser et. al.: Latent Change in Discrete Data 84 Solving particular arithmetic word problems can be considered as a good indicator of mathematical understanding. While children can easily solve simple problems that describe the exchange of sets (e.g., At the beginning, John had 5 marbles. Then he gave 2 of these marbles to Peter. How many marbles does John have now?"), word problems dealing with the quantitative comparison (e.g., John has 5 marbles. He has 3 marbles more than Peter has. How many marbles does Peter have?") or dealing with certain kinds of combination (e.g., Peter and John have 8 marbles altogether. Peter has 3 marbles. How many marbles does John have?") provide particular difficulties. Solving comparison and combination problems requires an advanced mathematical understanding which is based on abstract part-whole relations rather than on the counting function of numbers (Stern, 1993, 1998; Stern & Lehrndorfer, 1992). Handling part-whole relations allows flexibility in formulating equations and partitioning sets. Particularly complex word problems which require the inference of information can only be solved on the basis of part-whole representations. In this article we concentrate on three complex word problems dealing with the comparison and the combination of sets presented in the second and third grade of elementary school. 4.1 Items and Sample The arithmetic word problems selected for the present analysis were: 1. Jack and Beth have 6 apples altogether. Jack has 2 apples. Ken and Ina have 9 apples altogether. Ken has 5 apples. How many apples do Beth and Ina have altogether? 2. John has 7 rabbits. He has 4 rabbits more than Tom. How many rabbits do John and Tom have altogether? 3. Joyce has 7 marbles. She has 2 marbles more than Tom has. Oliver has 3 marbles more than Tom. How many marbles does Oliver has? Responses were scored 0 for incorrect responses and 1 for correct responses. A sample of N = 1030 children participated in the two measurement occasions considered here, that is in second and third grade. The empirical frequencies of the response vectors comprising the responses to the three arithmetic word problems at second and third grade are displayed in Table 1. Since the kind of word problems used for the present analysis is only rarely presented during elementary school, children cannot simply retrieve complete solution strategies from memory; they have to develop them on their own. Therefore, solving each of the three word problems requires what is called far-transfer: a deep reconstruction of the existing knowledge base in arithmetic is necessary. The mathematical requirements of the three problems go far beyond what is usually dealt with in the first two years of elementary school mathematics. Therefore, only second graders who have already developed outstanding mathematical competencies on their own can be expected to solve at least some of the problems at this age level. In the third grade, part-whole understanding of arithmetic equations is particularly emphasized in school, for instance by frequently presenting children with fill-in-theblank problems. Children who benefitted from the instruction at school should be able to develop solution strategies for the three word problems and thereby improve

11 Meiser et. al.: Latent Change in Discrete Data 85 Table 1: Observed and Expected Frequencies of Response Vectors for the Three Arithmetic Word Items at Second and Third Grade Grade 2 Grade 3 Frequencies Grade 2 Grade 3 Frequencies It.1 It.2 It.3 It.1 It.2 It.3 Obs. Exp. a It.1 It.2 It.3 It.1 It.2 It.3 Obs. Exp. a a Expected frequencies according to the mover-stayer mixed Rasch model.

12 Meiser et. al.: Latent Change in Discrete Data 86 performance. There may be, however, children who do not benefit much from the learning opportunities in the third grade and therefore will not be able to construct advanced solution strategies for the word problems. These children will remain on their initial performance level. In their previous analysis of the data 1 Langeheine et al. (1994), using twoclass state mastery models, found that a mover-stayer model including two latent subpopulations, one with transitions from a state of low competency to a state of high competency and one without transitions between states of competency, was superior compared to a set of other models which did not allow for heterogeneity of change. Although this result fits nicely into the theoretical expectation of differential gains in mathematical reasoning performance during third grade, none of the statemastery models considered in this earlier analysis showed a satisfactory goodness of fit at conventional levels of statistical significance (Langeheine et al., 1994, p. 285). The poor fit may bedue to the rather restrictive assumption that there are only two levels of competency, which are reflected by the class of nonmasters" and the class of masters". In the present analysis, we drop this restrictive assumption by using Rasch models, rather than two-class state-mastery models, thereby allowing for more than two values of the latent person variable. 4.2 Results Conditional maximum-likelihood estimates for the parameters of loglinear Rasch models were obtained using the program LEM (Vermunt, 1997) which enables specifying the design matrices of nonstandard loglinear models. The criterion of statistical significance for model rejection was set at ff = :05. A power analysis using the program GPOWER (Faul & Erdfelder, 1992; see also Erdfelder, Faul, & Buchner, 1996) with specifications ff = :05, N = 1030, and medium effect size w = :3 (Cohen, 1988) revealed that the statistical power of detecting effects of at least medium effect size is larger than 1 fi = :99 for all of the goodness of fit tests and model comparisons reported in the following. Therefore, models and model restrictions which do not yield a significant misfit can be accepted with a small second-type error probability. The most restrictive models of change discussed above are the unidimensional Rasch model of global change specified in Equation (16) and its submodel of perfect stability resulting from the parameter fixation = 0.Note that both of these models are special cases of the LLTM as described earlier. The estimates of the threshold parameters 2 fi i and of the change parameter in the model of global change are listed in Table 2 as is the log-likelihood ratio test statistic G 2. As can be seen in the table, the model of global change yields a significant G 2 statistic so that it has to be rejected. Accordingly, the even more restrictive model of perfect stability must also be rejected for the present data, G 2 = 184:87, 55 df, p<:05. Since the unidimensional Rasch model of global change does not fit the data, we turn to the two-dimensional Rasch model permitting person-specific change. The estimates of the threshold parameters and the log-likelihood ratio statistic of the two-dimensional model specified in Equation (17) are displayed in Table 2. In contrast to the unidimensional Rasch model of global change, the two-dimensional 1 Although Langeheine et al. (1994, pp. 284 ff., Model X) selected the same items, the data set used in the previous analysis differs from the data of the present analysis: Langeheine et al. considered responses from N = 965 children who participated in three adjacent measurement occasions of the longitudinal study, whereas the present analysis is based on a sample of N = 1030 children with complete data for two measurement occasions. 2 Since the responses to the arithmetic word problems were scored in dichotomous categories 0 and 1 and, as a consequence, the items have only one threshold, the index indicating the threshold is omitted.

13 Meiser et. al.: Latent Change in Discrete Data 87 Rasch model shows an acceptable goodness of fit. As shown above, the unidimensional Rasch Model (16) is a submodel of the two-dimensional Model (17) so that the two models can be compared statistically, thereby testing for homogeneity of change across individuals. The conditional test statistic of the model comparison is significant, G 2 =17:69, 8 df, p<:05, indicating that the assumption of homogeneity ofchange does not hold for the present data. Considering the result that individuals differ with respect to their course of development from second to third grade, it is interesting to see whether the observed heterogeneity of change can be traced back to two simple processes occuring in different latent subpopulations: global change in subpopulation c = 1 and perfect stability in subpopulation c = 2. This hypothesis is reflected by the mover-stayer mixed Rasch model, that is by the mixed Rasch model specified in Equation (19) with the additional restrictions displayed in Equations (20) and (21). Since the program LEM allows the specification of user-defined design matrices for log-linear models in finite mixture distributions, the parameters of the mover-stayer mixed Rasch model can be estimated by parameterizing the component distributions in terms of the unidimensional Rasch model of global change (Equation (16)), where the threshold parameters are constrained to be equal across subpopulations (Equation (21)) and where is restricted to zero for latent subpopulation c = 2 (Equation (20)). Note that the response vectors (0,0,0,0,0,0) and (1,1,1,1,1,1) do not contribute to the CML estimation of the threshold parameters and therefore cannot be assigned to latent subpopulations c = 1 and c = 2 (Rost, 1991). Therefore, the loglinear parameters u 0jc and u 6jc are not identifiable, unless additional restrictions are imposed, such as equality constraints on u 0jc and u 6jc across c or fixations at zero in one of the two subpopulations. In the present analysis, we fitted the cells (0,0,0,0,0,0) and (1,1,1,1,1,1) of the contingency table by estimating the parameters u 0j2 and u 6j2 in the subpopulation of stayers and at the same time restricting u 0j1 and u 6j1 to zero for the subpopulation of movers. These restrictions affect the distribution of subpopulations, that is ß 1 and ß 2, but do not influence the estimates of the threshold and change parameters of the component Rasch models. As shown in Table 2, the mover-stayer mixed Rasch model does not yield a significant misfit. Therefore we can maintain the hypothesis that the existence of a mover and a stayer subpopulation accounts for the heterogeneity ofchange revealed in the previous steps of analysis. The estimated size of the mover subpopulation amounts to ^ß 1 =0:43 and that of the stayer subpopulation to ^ß 2 =0:57. The mean expected probability of a correct response is depicted in Figure 1 as a function of item, measurement occasion, and latent subpopulation. The expected frequencies of response vectors are displayed in Table 1. A relaxation of the model by permitting different fi-parameters for the latent subpopulations, that is by dropping the restriction displayed in Equation (21), does not result in a significant improvement in goodness of fit, G 2 =2:46, 2 df, p>:05. Since a statistical comparison of the two-dimensional Rasch model and the finite-mixture model with two subpopulations is not possible, we cannot directly test whether the more parsimonious mover-stayer model differs from the model of person-specific change. Therefore, the final model selection is based on the information criterion CAIC which emphasizes the avoidance of overfitting a model (cf. Bozdogan, 1987). The mover-stayer mixed Rasch model shows a CAIC value of 7440:27 which is slightly lower than that of the two-dimensional Rasch model with a CAIC value of 7442:36. Therefore, the more parsimonious mover-stayer model may be preferred over the model of person-specific change.

14 Probability of a Correct Response Meiser et. al.: Latent Change in Discrete Data 88 Table 2: Parameter Estimates of the Threshold and Change Parameters and Goodness of Fit for the Unidimensional Rasch Model of Global Change, the Two-Dimensional Rasch Model of Person-Specific Change, and the Mover-Stayer Mixed Rasch Model Model of change Global Person-specific Mover-stayer fi 1 0:244 0:250 0:249 fi 2 0:310 0:317 0:317 fi 3 0:066 0:067 0:067 0:669 1:188 G 2 72:53 Λ 54:84 64:79 df Λ p<:05 Figure 1: Mean expected probabilities of correct responses to the arithmetic word items at second and third grade for the latent subpopulations of movers and stayers Movers Stayers Item 1 Item 2 Item 3 Item 1 Item 2 Item 3 Grade 2 Grade 3

15 Meiser et. al.: Latent Change in Discrete Data 89 5 Conclusions The Rasch model and its extensions form a class of psychometric models which can be used for modeling various aspects of change, such as global change, personspecific change, as well as different patterns of change in different subpopulations. The loglinear representation of unidimensional and multidimensional Rasch models and the loglinear parameterization of component distributions in mixture distribution Rasch models allow specifying and testing hypotheses about change in a flexible and straightforward way. In the present article, we pointed out that the unidimensional Rasch model of global change is a submodel of the multidimensional Rasch model of person-specific change and that a mover-stayer mixed Rasch model can be specified by appropriate restrictions on the threshold parameters of the mixed Rasch model. Thereby, we addressed two issues raised by Stelzl (1997) in her reply to Glück and Spiel (1997): first, Stelzl questioned the assumption of homogeneity of change underlying the LLTM and other item response models for measuring change in repeated measures designs. The above-mentioned hierarchical relation of the loglinear unidimensional Rasch model of global change and the loglinear multidimensional Rasch model of person-specific change allows explicitly testing this assumption by means of a conditional likelihood ratio test. Second, Stelzl emphasized the need to specify treatment or change parameters in mixed Rasch models, if these models are to be used for analyzing change as suggested by Glück and Spiel (1997). In the context of the mover-stayer mixed Rasch model, we showed how to specify a change parameter for one subpopulation and to fix the parameter at zero for another subpopulation. If a research design comprises several experimental or quasi-experimental groups, straightforward extensions of the mover-stayer model by the extraneous group variable are available. Those extended mover-stayer models may include, for instance, equality constraints on the threshold and change parameters across groups while investigating differences in the sizes of the mover and stayer subpopulations, or they may impose equality constraints on the threshold parameters across groups while testing for differences in the change parameters of movers. Thus, fitting a moverstayer mixed Rasch model may be the basis for further analyses to investigate which groups of individuals will be affected by certain interventions and which groups of individuals will not be affected or will be affected in a different way. As illustrated by the above analysis of mathematical reasoning performance in elementary school children, the restrictive assumption of homogeneity of change may well be inappropriate for a given data set. Furthermore, we agree with Stelzl (1997) that the assumption may even be implausible as far as natural (i.e., not experimentally induced) change is concerned: in the context of solving arithmetic word problems, children are expected to differ in both their level of competency at one measurement occasion as well as their growth in competency from one measurement occasion to another. The unidimensional Rasch model of global change reflects differences in initial competency, but precludes interindividual differences in growth. In contrast, the multidimensional Rasch model of person-specific change is a rather unrestrictive model, inasmuch as it incorporates differences in reasoning competency at a given occasion as well as unconstrained interindividual differences in the amount of change from one occasion to another. The mover-stayer mixed Rasch model falls in between these two extremes by allowing for interindividual differences in change in terms of two well-defined developmental processes: global change for one subpopulation and stability for another subpopulation. For the present data, the mover-stayer model turned out to be superior to the alternative Rasch models considered, because it is not rejected and provides a parsimonious description of the data generating processes. The two subpopulations separated by the mover-stayer mixed Rasch model differ

16 Meiser et. al.: Latent Change in Discrete Data 90 with respect to the impact school instruction has on the development of mathematical competencies. In the third grade, some children gained from practicing problems that require part-whole modeling by developing solution strategies for complex word problems, while others did not. The subpopulation of stayers is composed of children who did not profit from elementary school instruction either because they had already developed the competencies by themselves before they were taught at school, or because the cognitive or motivational preconditions for profiting from the instruction were not available. In the subpopulation of movers, the children extended their arithmetical competencies by exploiting what had been taught at school, thereby improving their performance in word problem solving. References [1] Andersen, E. B. (1995). Polytomous Rasch models and their estimation. In G. H. Fischer & I. W. Molenaar (Eds.), Rasch models. Foundations, recent developments, and applications (pp ). New York: Springer. [2] Andrich, D. (1978). A rating formulation for ordered response categories. Psychometrika, 43, [3] Andrich, D. (1982). An extension of the Rasch model for ratings providing both location and dispersion parameters. Psychometrika, 47, [4] Andrich, D. (1985). An elaboration of Guttman scaling with Rasch models for measurement. In N. B. Tuma (Ed.), Sociological Methodology 1985 (pp ). San Francisco: Jossey-Bass. [5] Andrich, D., de Jong, J. H. A. L. & Sheridan, B. E. (1997). Diagnostic opportunities with the Rasch model for ordered response categories. In J. Rost & R. Langeheine (Eds.), Applications of latent trait and latent class models in the social sciences (pp ). Münster: Waxmann. [6] Bozdogan, H. (1987). Model selection and Akaike's information criterion (AIC): The general theory and its analytical extensions. Psychometrika, 52, [7] Cohen, J. (1988). Statistical power analysis for the behavioral sciences. Hillsdale, NJ: Lawrence Erlbaum. [8] Cressie, N. & Holland, P. W. (1983). Characterizing the manifest probabilities of latent trait models. Psychometrika, 48, [9] Duncan, O. D. (1985a). New light on the 16-fold table. American Journal of Sociology, 91, [10] Duncan, O. D. (1985b). Some models of response uncertainty for panel analysis. Social Science Research, 14, [11] Erdfelder, E., Faul, F., & Buchner, A. (1996). GPOWER: A general power analysis program. Behavior Research Methods, Instruments, & Computers, 28, [12] Everitt, B. S. & Hand, D. J. (1981). Finite mixture distributions. London: Chapman and Hall. [13] Faul, F. & Erdfelder, E. (1992). GPOWER: A priori, post-hoc, and compromise power analyses for MS-DOS. University of Bonn: Department of Psychology. [14] Fischer, G. H. (1983). Logistic latent trait models with linear constraints. Psychometrika, 48, [15] Fischer, G. H. (1995). Linear logistic models for change. In G. H. Fischer& I. W. Molenaar (Eds.), Rasch models. Foundations, recent developments, and applications (pp ). New York: Springer. [16] Fischer, G. H.& Formann, A. K. (1982). Some applications of logistic latent trait models with linear constraints on the parameters. Applied Psychological Measurement, 6,

17 Meiser et. al.: Latent Change in Discrete Data 91 [17] Fischer, G. H. & Molenaar, I. W. (Eds.) (1995). Rasch models. Foundations, recent developments, and applications. New York: Springer. [18] Fischer, G. H. & Parzer, P. (1991). An extension of the rating scale model with an application to the measurement of change. Psychometrika, 4, [19] Fischer, G. H. & Ponocny, I. (1994). An extension of the partial credit model with an application to the measurement of change. Psychometrika, 59, [20] Fischer, G. H. & Ponocny, I. (1995). Extended rating scale and partial credit models for assessing change. In G. H. Fischer & I. W. Molenaar (Eds.), Rasch models. Foundations, recent developments, and applications (pp ). New York: Springer. [21] Glück, J. & Spiel, C. (1997). Item Response-Modelle für Meßwiederholungsdesigns: Anwendung und Grenzen verschiedener Ansätze. [Item response models for repeated measures designs: Application and limitations of different approaches]. Methods of Psychological Research Online, 2. Internet: [22] Kelderman, H. (1984). Loglinear Rasch model tests. Psychometrika, 49, [23] Kelderman, H. (1993). Estimating and testing a multidimensional Rasch model for partial credit scoring of children's application of size concepts. In R. Steyer, K. F. Wender & K. F. Widaman (Eds.), Psychometric methodology. Proceedings of the 7th European Meeting of the Psychometric Society in Trier (pp ). Stuttgart: Fischer. [24] Kelderman, H. (1996). Multidimensional Rasch models for partial-credit scoring. Applied Psychological Measurement, 20, [25] Kelderman, H. (1997). Loglinear multidimensional item response models for polytomously scored items. In W. J. van der Linden & R. K. Hambleton (Eds.), Handbook of modern item response theory (pp ). New York: Springer. [26] Kelderman, H. & Rijkes, C. P. M. (1994). Loglinear multidimensional IRT models for polytomously scored items. Psychometrika, 59, [27] Langeheine, R. (1983). Nonstandard log-lineare Modelle [Nonstandard log-linear models]. Zeitschrift für Sozialpsychologie, 14, [28] Langeheine, R. (1993). Diagnosing incremental learning: Some probabilistic models for measuring change and testing hypotheses about growth. Studies in Educational Evaluation, 19, [29] Langeheine, R. & Rost, J. (1988). Latent trait and latent class models. New York: Plenum Press. [30] Langeheine, R., Stern, E., & van de Pol, F. (1994). State mastery learning. Dynamic models for longitudinal data. Applied Psychological Measurement, 18, [31] Masters, G. N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47, [32] Masters, G. N. & Wright, B. D. (1997). The partial credit model. In W. J. van der Linden & R. K. Hambleton (Eds.), Handbook of modern item response theory (pp ). New York: Springer. [33] Meiser, T. (1996). Loglinear Rasch models for the analysis of stability and change. Psychometrika, 61, [34] Meiser, T., Hein-Eggers, M., Rompe, P. & Rudinger, G. (1995). Analyzing homogeneity and heterogeneity of change using Rasch and latent class models: A comparative and integrative approach. Applied Psychological Measurement, 19, [35] Meiser, T. & Rudinger, G. (1997). Modeling stability and regularity of change: Latent structure analysis of longitudinal discrete data. In J. Rost & R. Langeheine (Eds.), Applications of latent trait and latent class models in the social sciences (pp ). Münster: Waxmann. [36] Rasch, G. (1968). An individualistic approach to item analysis. In P. F. Lazarsfeld & N. W. Henry (Eds.), Readings in mathematical social science. Cambridge: MIT Press.

Monte Carlo Simulations for Rasch Model Tests

Monte Carlo Simulations for Rasch Model Tests Monte Carlo Simulations for Rasch Model Tests Patrick Mair Vienna University of Economics Thomas Ledl University of Vienna Abstract: Sources of deviation from model fit in Rasch models can be lack of unidimensionality,

More information

On the Construction of Adjacent Categories Latent Trait Models from Binary Variables, Motivating Processes and the Interpretation of Parameters

On the Construction of Adjacent Categories Latent Trait Models from Binary Variables, Motivating Processes and the Interpretation of Parameters Gerhard Tutz On the Construction of Adjacent Categories Latent Trait Models from Binary Variables, Motivating Processes and the Interpretation of Parameters Technical Report Number 218, 2018 Department

More information

What is an Ordinal Latent Trait Model?

What is an Ordinal Latent Trait Model? What is an Ordinal Latent Trait Model? Gerhard Tutz Ludwig-Maximilians-Universität München Akademiestraße 1, 80799 München February 19, 2019 arxiv:1902.06303v1 [stat.me] 17 Feb 2019 Abstract Although various

More information

PIRLS 2016 Achievement Scaling Methodology 1

PIRLS 2016 Achievement Scaling Methodology 1 CHAPTER 11 PIRLS 2016 Achievement Scaling Methodology 1 The PIRLS approach to scaling the achievement data, based on item response theory (IRT) scaling with marginal estimation, was developed originally

More information

Whats beyond Concerto: An introduction to the R package catr. Session 4: Overview of polytomous IRT models

Whats beyond Concerto: An introduction to the R package catr. Session 4: Overview of polytomous IRT models Whats beyond Concerto: An introduction to the R package catr Session 4: Overview of polytomous IRT models The Psychometrics Centre, Cambridge, June 10th, 2014 2 Outline: 1. Introduction 2. General notations

More information

Local response dependence and the Rasch factor model

Local response dependence and the Rasch factor model Local response dependence and the Rasch factor model Dept. of Biostatistics, Univ. of Copenhagen Rasch6 Cape Town Uni-dimensional latent variable model X 1 TREATMENT δ T X 2 AGE δ A Θ X 3 X 4 Latent variable

More information

Latent Class Analysis for Models with Error of Measurement Using Log-Linear Models and An Application to Women s Liberation Data

Latent Class Analysis for Models with Error of Measurement Using Log-Linear Models and An Application to Women s Liberation Data Journal of Data Science 9(2011), 43-54 Latent Class Analysis for Models with Error of Measurement Using Log-Linear Models and An Application to Women s Liberation Data Haydar Demirhan Hacettepe University

More information

Pairwise Parameter Estimation in Rasch Models

Pairwise Parameter Estimation in Rasch Models Pairwise Parameter Estimation in Rasch Models Aeilko H. Zwinderman University of Leiden Rasch model item parameters can be estimated consistently with a pseudo-likelihood method based on comparing responses

More information

Investigating Population Heterogeneity With Factor Mixture Models

Investigating Population Heterogeneity With Factor Mixture Models Psychological Methods 2005, Vol. 10, No. 1, 21 39 Copyright 2005 by the American Psychological Association 1082-989X/05/$12.00 DOI: 10.1037/1082-989X.10.1.21 Investigating Population Heterogeneity With

More information

Log-linear multidimensional Rasch model for capture-recapture

Log-linear multidimensional Rasch model for capture-recapture Log-linear multidimensional Rasch model for capture-recapture Elvira Pelle, University of Milano-Bicocca, e.pelle@campus.unimib.it David J. Hessen, Utrecht University, D.J.Hessen@uu.nl Peter G.M. Van der

More information

Polytomous Item Explanatory IRT Models with Random Item Effects: An Application to Carbon Cycle Assessment Data

Polytomous Item Explanatory IRT Models with Random Item Effects: An Application to Carbon Cycle Assessment Data Polytomous Item Explanatory IRT Models with Random Item Effects: An Application to Carbon Cycle Assessment Data Jinho Kim and Mark Wilson University of California, Berkeley Presented on April 11, 2018

More information

Applied Psychological Measurement 2001; 25; 283

Applied Psychological Measurement 2001; 25; 283 Applied Psychological Measurement http://apm.sagepub.com The Use of Restricted Latent Class Models for Defining and Testing Nonparametric and Parametric Item Response Theory Models Jeroen K. Vermunt Applied

More information

UCLA Department of Statistics Papers

UCLA Department of Statistics Papers UCLA Department of Statistics Papers Title Can Interval-level Scores be Obtained from Binary Responses? Permalink https://escholarship.org/uc/item/6vg0z0m0 Author Peter M. Bentler Publication Date 2011-10-25

More information

A PSYCHOPHYSICAL INTERPRETATION OF RASCH S PSYCHOMETRIC PRINCIPLE OF SPECIFIC OBJECTIVITY

A PSYCHOPHYSICAL INTERPRETATION OF RASCH S PSYCHOMETRIC PRINCIPLE OF SPECIFIC OBJECTIVITY A PSYCHOPHYSICAL INTERPRETATION OF RASCH S PSYCHOMETRIC PRINCIPLE OF SPECIFIC OBJECTIVITY R. John Irwin Department of Psychology, The University of Auckland, Private Bag 92019, Auckland 1142, New Zealand

More information

The application and empirical comparison of item. parameters of Classical Test Theory and Partial Credit. Model of Rasch in performance assessments

The application and empirical comparison of item. parameters of Classical Test Theory and Partial Credit. Model of Rasch in performance assessments The application and empirical comparison of item parameters of Classical Test Theory and Partial Credit Model of Rasch in performance assessments by Paul Moloantoa Mokilane Student no: 31388248 Dissertation

More information

Equivalency of the DINA Model and a Constrained General Diagnostic Model

Equivalency of the DINA Model and a Constrained General Diagnostic Model Research Report ETS RR 11-37 Equivalency of the DINA Model and a Constrained General Diagnostic Model Matthias von Davier September 2011 Equivalency of the DINA Model and a Constrained General Diagnostic

More information

New Developments for Extended Rasch Modeling in R

New Developments for Extended Rasch Modeling in R New Developments for Extended Rasch Modeling in R Patrick Mair, Reinhold Hatzinger Institute for Statistics and Mathematics WU Vienna University of Economics and Business Content Rasch models: Theory,

More information

22 MPR-Online 2002, Vol. 7, No. 1 1 Introduction Structural Rasch models for the measurement of change (see Fischer & Ponocny-Seliger, 1998) have been

22 MPR-Online 2002, Vol. 7, No. 1 1 Introduction Structural Rasch models for the measurement of change (see Fischer & Ponocny-Seliger, 1998) have been Methods of Psychological Research Online 2002, Vol. 7, No. 1 Internet: http://www.mpr-online.de Institute for Science Education cfl 2002 IPN Kiel On the Applicability of some IRT Models for Repeated Measurement

More information

The Covariate-Adjusted Frequency Plot for the Rasch Poisson Counts Model

The Covariate-Adjusted Frequency Plot for the Rasch Poisson Counts Model Thailand Statistician 2015; 13(1): 67-78 http://statassoc.or.th Contributed paper The Covariate-Adjusted Frequency Plot for the Rasch Poisson Counts Model Heinz Holling [a], Walailuck Böhning [a] and Dankmar

More information

Title: Testing for Measurement Invariance with Latent Class Analysis. Abstract

Title: Testing for Measurement Invariance with Latent Class Analysis. Abstract 1 Title: Testing for Measurement Invariance with Latent Class Analysis Authors: Miloš Kankaraš*, Guy Moors*, and Jeroen K. Vermunt Abstract Testing for measurement invariance can be done within the context

More information

Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood

Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood Jonathan Gruhl March 18, 2010 1 Introduction Researchers commonly apply item response theory (IRT) models to binary and ordinal

More information

Equating Tests Under The Nominal Response Model Frank B. Baker

Equating Tests Under The Nominal Response Model Frank B. Baker Equating Tests Under The Nominal Response Model Frank B. Baker University of Wisconsin Under item response theory, test equating involves finding the coefficients of a linear transformation of the metric

More information

Studies on the effect of violations of local independence on scale in Rasch models: The Dichotomous Rasch model

Studies on the effect of violations of local independence on scale in Rasch models: The Dichotomous Rasch model Studies on the effect of violations of local independence on scale in Rasch models Studies on the effect of violations of local independence on scale in Rasch models: The Dichotomous Rasch model Ida Marais

More information

ABSTRACT. Yunyun Dai, Doctor of Philosophy, Mixtures of item response theory models have been proposed as a technique to explore

ABSTRACT. Yunyun Dai, Doctor of Philosophy, Mixtures of item response theory models have been proposed as a technique to explore ABSTRACT Title of Document: A MIXTURE RASCH MODEL WITH A COVARIATE: A SIMULATION STUDY VIA BAYESIAN MARKOV CHAIN MONTE CARLO ESTIMATION Yunyun Dai, Doctor of Philosophy, 2009 Directed By: Professor, Robert

More information

Links Between Binary and Multi-Category Logit Item Response Models and Quasi-Symmetric Loglinear Models

Links Between Binary and Multi-Category Logit Item Response Models and Quasi-Symmetric Loglinear Models Links Between Binary and Multi-Category Logit Item Response Models and Quasi-Symmetric Loglinear Models Alan Agresti Department of Statistics University of Florida Gainesville, Florida 32611-8545 July

More information

36-720: The Rasch Model

36-720: The Rasch Model 36-720: The Rasch Model Brian Junker October 15, 2007 Multivariate Binary Response Data Rasch Model Rasch Marginal Likelihood as a GLMM Rasch Marginal Likelihood as a Log-Linear Model Example For more

More information

Lesson 7: Item response theory models (part 2)

Lesson 7: Item response theory models (part 2) Lesson 7: Item response theory models (part 2) Patrícia Martinková Department of Statistical Modelling Institute of Computer Science, Czech Academy of Sciences Institute for Research and Development of

More information

Correspondence Analysis of Longitudinal Data

Correspondence Analysis of Longitudinal Data Correspondence Analysis of Longitudinal Data Mark de Rooij* LEIDEN UNIVERSITY, LEIDEN, NETHERLANDS Peter van der G. M. Heijden UTRECHT UNIVERSITY, UTRECHT, NETHERLANDS *Corresponding author (rooijm@fsw.leidenuniv.nl)

More information

A Marginal Maximum Likelihood Procedure for an IRT Model with Single-Peaked Response Functions

A Marginal Maximum Likelihood Procedure for an IRT Model with Single-Peaked Response Functions A Marginal Maximum Likelihood Procedure for an IRT Model with Single-Peaked Response Functions Cees A.W. Glas Oksana B. Korobko University of Twente, the Netherlands OMD Progress Report 07-01. Cees A.W.

More information

CS281 Section 4: Factor Analysis and PCA

CS281 Section 4: Factor Analysis and PCA CS81 Section 4: Factor Analysis and PCA Scott Linderman At this point we have seen a variety of machine learning models, with a particular emphasis on models for supervised learning. In particular, we

More information

A Note on Item Restscore Association in Rasch Models

A Note on Item Restscore Association in Rasch Models Brief Report A Note on Item Restscore Association in Rasch Models Applied Psychological Measurement 35(7) 557 561 ª The Author(s) 2011 Reprints and permission: sagepub.com/journalspermissions.nav DOI:

More information

Anders Skrondal. Norwegian Institute of Public Health London School of Hygiene and Tropical Medicine. Based on joint work with Sophia Rabe-Hesketh

Anders Skrondal. Norwegian Institute of Public Health London School of Hygiene and Tropical Medicine. Based on joint work with Sophia Rabe-Hesketh Constructing Latent Variable Models using Composite Links Anders Skrondal Norwegian Institute of Public Health London School of Hygiene and Tropical Medicine Based on joint work with Sophia Rabe-Hesketh

More information

Psychometric Issues in Formative Assessment: Measuring Student Learning Throughout the Academic Year Using Interim Assessments

Psychometric Issues in Formative Assessment: Measuring Student Learning Throughout the Academic Year Using Interim Assessments Psychometric Issues in Formative Assessment: Measuring Student Learning Throughout the Academic Year Using Interim Assessments Jonathan Templin The University of Georgia Neal Kingston and Wenhao Wang University

More information

APPENDICES TO Protest Movements and Citizen Discontent. Appendix A: Question Wordings

APPENDICES TO Protest Movements and Citizen Discontent. Appendix A: Question Wordings APPENDICES TO Protest Movements and Citizen Discontent Appendix A: Question Wordings IDEOLOGY: How would you describe your views on most political matters? Generally do you think of yourself as liberal,

More information

Yu Xie, Institute for Social Research, 426 Thompson Street, University of Michigan, Ann

Yu Xie, Institute for Social Research, 426 Thompson Street, University of Michigan, Ann Association Model, Page 1 Yu Xie, Institute for Social Research, 426 Thompson Street, University of Michigan, Ann Arbor, MI 48106. Email: yuxie@umich.edu. Tel: (734)936-0039. Fax: (734)998-7415. Association

More information

Extensions and Applications of Item Explanatory Models to Polytomous Data in Item Response Theory. Jinho Kim

Extensions and Applications of Item Explanatory Models to Polytomous Data in Item Response Theory. Jinho Kim Extensions and Applications of Item Explanatory Models to Polytomous Data in Item Response Theory by Jinho Kim A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor

More information

Comparison between conditional and marginal maximum likelihood for a class of item response models

Comparison between conditional and marginal maximum likelihood for a class of item response models (1/24) Comparison between conditional and marginal maximum likelihood for a class of item response models Francesco Bartolucci, University of Perugia (IT) Silvia Bacci, University of Perugia (IT) Claudia

More information

LOG-MULTIPLICATIVE ASSOCIATION MODELS AS LATENT VARIABLE MODELS FOR NOMINAL AND0OR ORDINAL DATA. Carolyn J. Anderson* Jeroen K.

LOG-MULTIPLICATIVE ASSOCIATION MODELS AS LATENT VARIABLE MODELS FOR NOMINAL AND0OR ORDINAL DATA. Carolyn J. Anderson* Jeroen K. 3 LOG-MULTIPLICATIVE ASSOCIATION MODELS AS LATENT VARIABLE MODELS FOR NOMINAL AND0OR ORDINAL DATA Carolyn J. Anderson* Jeroen K. Vermunt Associations between multiple discrete measures are often due to

More information

Contents. 3 Evaluating Manifest Monotonicity Using Bayes Factors Introduction... 44

Contents. 3 Evaluating Manifest Monotonicity Using Bayes Factors Introduction... 44 Contents 1 Introduction 4 1.1 Measuring Latent Attributes................. 4 1.2 Assumptions in Item Response Theory............ 6 1.2.1 Local Independence.................. 6 1.2.2 Unidimensionality...................

More information

Modeling Joint and Marginal Distributions in the Analysis of Categorical Panel Data

Modeling Joint and Marginal Distributions in the Analysis of Categorical Panel Data SOCIOLOGICAL Vermunt et al. / JOINT METHODS AND MARGINAL & RESEARCH DISTRIBUTIONS This article presents a unifying approach to the analysis of repeated univariate categorical (ordered) responses based

More information

Extended Rasch Modeling: The R Package erm

Extended Rasch Modeling: The R Package erm Extended Rasch Modeling: The R Package erm Patrick Mair Wirtschaftsuniversität Wien Reinhold Hatzinger Wirtschaftsuniversität Wien Abstract This package vignette is an update of the erm papers by published

More information

Chapter 5. Introduction to Path Analysis. Overview. Correlation and causation. Specification of path models. Types of path models

Chapter 5. Introduction to Path Analysis. Overview. Correlation and causation. Specification of path models. Types of path models Chapter 5 Introduction to Path Analysis Put simply, the basic dilemma in all sciences is that of how much to oversimplify reality. Overview H. M. Blalock Correlation and causation Specification of path

More information

Chapter 8. Models with Structural and Measurement Components. Overview. Characteristics of SR models. Analysis of SR models. Estimation of SR models

Chapter 8. Models with Structural and Measurement Components. Overview. Characteristics of SR models. Analysis of SR models. Estimation of SR models Chapter 8 Models with Structural and Measurement Components Good people are good because they've come to wisdom through failure. Overview William Saroyan Characteristics of SR models Estimation of SR models

More information

Bayesian Nonparametric Rasch Modeling: Methods and Software

Bayesian Nonparametric Rasch Modeling: Methods and Software Bayesian Nonparametric Rasch Modeling: Methods and Software George Karabatsos University of Illinois-Chicago Keynote talk Friday May 2, 2014 (9:15-10am) Ohio River Valley Objective Measurement Seminar

More information

MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES

MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES REVSTAT Statistical Journal Volume 13, Number 3, November 2015, 233 243 MARGINAL HOMOGENEITY MODEL FOR ORDERED CATEGORIES WITH OPEN ENDS IN SQUARE CONTINGENCY TABLES Authors: Serpil Aktas Department of

More information

Statistical Tests for Parameterized Multinomial Models: Power Approximation and Optimization. Edgar Erdfelder University of Mannheim Germany

Statistical Tests for Parameterized Multinomial Models: Power Approximation and Optimization. Edgar Erdfelder University of Mannheim Germany Statistical Tests for Parameterized Multinomial Models: Power Approximation and Optimization Edgar Erdfelder University of Mannheim Germany The Problem Typically, model applications start with a global

More information

Item Response Theory (IRT) Analysis of Item Sets

Item Response Theory (IRT) Analysis of Item Sets University of Connecticut DigitalCommons@UConn NERA Conference Proceedings 2011 Northeastern Educational Research Association (NERA) Annual Conference Fall 10-21-2011 Item Response Theory (IRT) Analysis

More information

Logistic Regression and Item Response Theory: Estimation Item and Ability Parameters by Using Logistic Regression in IRT.

Logistic Regression and Item Response Theory: Estimation Item and Ability Parameters by Using Logistic Regression in IRT. Louisiana State University LSU Digital Commons LSU Historical Dissertations and Theses Graduate School 1998 Logistic Regression and Item Response Theory: Estimation Item and Ability Parameters by Using

More information

Can Variances of Latent Variables be Scaled in Such a Way That They Correspond to Eigenvalues?

Can Variances of Latent Variables be Scaled in Such a Way That They Correspond to Eigenvalues? International Journal of Statistics and Probability; Vol. 6, No. 6; November 07 ISSN 97-703 E-ISSN 97-7040 Published by Canadian Center of Science and Education Can Variances of Latent Variables be Scaled

More information

Basic IRT Concepts, Models, and Assumptions

Basic IRT Concepts, Models, and Assumptions Basic IRT Concepts, Models, and Assumptions Lecture #2 ICPSR Item Response Theory Workshop Lecture #2: 1of 64 Lecture #2 Overview Background of IRT and how it differs from CFA Creating a scale An introduction

More information

An Equivalency Test for Model Fit. Craig S. Wells. University of Massachusetts Amherst. James. A. Wollack. Ronald C. Serlin

An Equivalency Test for Model Fit. Craig S. Wells. University of Massachusetts Amherst. James. A. Wollack. Ronald C. Serlin Equivalency Test for Model Fit 1 Running head: EQUIVALENCY TEST FOR MODEL FIT An Equivalency Test for Model Fit Craig S. Wells University of Massachusetts Amherst James. A. Wollack Ronald C. Serlin University

More information

Alternatives to Difference Scores: Polynomial Regression and Response Surface Methodology. Jeffrey R. Edwards University of North Carolina

Alternatives to Difference Scores: Polynomial Regression and Response Surface Methodology. Jeffrey R. Edwards University of North Carolina Alternatives to Difference Scores: Polynomial Regression and Response Surface Methodology Jeffrey R. Edwards University of North Carolina 1 Outline I. Types of Difference Scores II. Questions Difference

More information

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models

Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models Computationally Efficient Estimation of Multilevel High-Dimensional Latent Variable Models Tihomir Asparouhov 1, Bengt Muthen 2 Muthen & Muthen 1 UCLA 2 Abstract Multilevel analysis often leads to modeling

More information

A class of latent marginal models for capture-recapture data with continuous covariates

A class of latent marginal models for capture-recapture data with continuous covariates A class of latent marginal models for capture-recapture data with continuous covariates F Bartolucci A Forcina Università di Urbino Università di Perugia FrancescoBartolucci@uniurbit forcina@statunipgit

More information

Optimal Design for the Rasch Poisson-Gamma Model

Optimal Design for the Rasch Poisson-Gamma Model Optimal Design for the Rasch Poisson-Gamma Model Ulrike Graßhoff, Heinz Holling and Rainer Schwabe Abstract The Rasch Poisson counts model is an important model for analyzing mental speed, an fundamental

More information

MICHAEL SCHREINER and KARL SCHWEIZER

MICHAEL SCHREINER and KARL SCHWEIZER Review of Psychology, 2011, Vol. 18, No. 1, 3-11 UDC 159.9 The hypothesis-based investigation of patterns of relatedness by means of confirmatory factor models: The treatment levels of the Exchange Test

More information

Ability Metric Transformations

Ability Metric Transformations Ability Metric Transformations Involved in Vertical Equating Under Item Response Theory Frank B. Baker University of Wisconsin Madison The metric transformations of the ability scales involved in three

More information

Dimensionality Assessment: Additional Methods

Dimensionality Assessment: Additional Methods Dimensionality Assessment: Additional Methods In Chapter 3 we use a nonlinear factor analytic model for assessing dimensionality. In this appendix two additional approaches are presented. The first strategy

More information

Comparing Change Scores with Lagged Dependent Variables in Models of the Effects of Parents Actions to Modify Children's Problem Behavior

Comparing Change Scores with Lagged Dependent Variables in Models of the Effects of Parents Actions to Modify Children's Problem Behavior Comparing Change Scores with Lagged Dependent Variables in Models of the Effects of Parents Actions to Modify Children's Problem Behavior David R. Johnson Department of Sociology and Haskell Sie Department

More information

Conditional maximum likelihood estimation in polytomous Rasch models using SAS

Conditional maximum likelihood estimation in polytomous Rasch models using SAS Conditional maximum likelihood estimation in polytomous Rasch models using SAS Karl Bang Christensen kach@sund.ku.dk Department of Biostatistics, University of Copenhagen November 29, 2012 Abstract IRT

More information

Specifying Latent Curve and Other Growth Models Using Mplus. (Revised )

Specifying Latent Curve and Other Growth Models Using Mplus. (Revised ) Ronald H. Heck 1 University of Hawai i at Mānoa Handout #20 Specifying Latent Curve and Other Growth Models Using Mplus (Revised 12-1-2014) The SEM approach offers a contrasting framework for use in analyzing

More information

CHAPTER 3. THE IMPERFECT CUMULATIVE SCALE

CHAPTER 3. THE IMPERFECT CUMULATIVE SCALE CHAPTER 3. THE IMPERFECT CUMULATIVE SCALE 3.1 Model Violations If a set of items does not form a perfect Guttman scale but contains a few wrong responses, we do not necessarily need to discard it. A wrong

More information

Making the Most of What We Have: A Practical Application of Multidimensional Item Response Theory in Test Scoring

Making the Most of What We Have: A Practical Application of Multidimensional Item Response Theory in Test Scoring Journal of Educational and Behavioral Statistics Fall 2005, Vol. 30, No. 3, pp. 295 311 Making the Most of What We Have: A Practical Application of Multidimensional Item Response Theory in Test Scoring

More information

Model Estimation Example

Model Estimation Example Ronald H. Heck 1 EDEP 606: Multivariate Methods (S2013) April 7, 2013 Model Estimation Example As we have moved through the course this semester, we have encountered the concept of model estimation. Discussions

More information

Chapter 1. Introduction. 1.1 Background

Chapter 1. Introduction. 1.1 Background Chapter 1 Introduction Science is facts; just as houses are made of stones, so is science made of facts; but a pile of stones is not a house and a collection of facts is not necessarily science. Henri

More information

COMPARATIVE STATICS ANALYSIS in ECONOMICS

COMPARATIVE STATICS ANALYSIS in ECONOMICS COMPARATIVE STATICS ANALYSIS in ECONOMICS This page is intentionally left blank COMPARATIVE STATICS ANALYSIS in ECONOMICS Kevin M. Currier Department of Economics Oklahoma State University \ > World Scientific

More information

A Goodness-of-Fit Measure for the Mokken Double Monotonicity Model that Takes into Account the Size of Deviations

A Goodness-of-Fit Measure for the Mokken Double Monotonicity Model that Takes into Account the Size of Deviations Methods of Psychological Research Online 2003, Vol.8, No.1, pp. 81-101 Department of Psychology Internet: http://www.mpr-online.de 2003 University of Koblenz-Landau A Goodness-of-Fit Measure for the Mokken

More information

Ensemble Rasch Models

Ensemble Rasch Models Ensemble Rasch Models Steven M. Lattanzio II Metamatrics Inc., Durham, NC 27713 email: slattanzio@lexile.com Donald S. Burdick Metamatrics Inc., Durham, NC 27713 email: dburdick@lexile.com A. Jackson Stenner

More information

On the Use of Nonparametric ICC Estimation Techniques For Checking Parametric Model Fit

On the Use of Nonparametric ICC Estimation Techniques For Checking Parametric Model Fit On the Use of Nonparametric ICC Estimation Techniques For Checking Parametric Model Fit March 27, 2004 Young-Sun Lee Teachers College, Columbia University James A.Wollack University of Wisconsin Madison

More information

Causal Regression Models II: Unconfoundedness and Causal Unbiasedness 1

Causal Regression Models II: Unconfoundedness and Causal Unbiasedness 1 Methods of Psychological Research Online 2000, Vol.5, No.3 Internet: http://www.mpr-online.de Institute for Science Education 2000 IPN Kiel Causal Regression Models II: Unconfoundedness and Causal Unbiasedness

More information

Development and Calibration of an Item Response Model. that Incorporates Response Time

Development and Calibration of an Item Response Model. that Incorporates Response Time Development and Calibration of an Item Response Model that Incorporates Response Time Tianyou Wang and Bradley A. Hanson ACT, Inc. Send correspondence to: Tianyou Wang ACT, Inc P.O. Box 168 Iowa City,

More information

Chapter 3 Models for polytomous data

Chapter 3 Models for polytomous data This is page 75 Printer: Opaque this Chapter 3 Models for polytomous data Francis Tuerlinckx Wen-Chung Wang 3.1 Introduction In the first two chapters of this volume, models for binary or dichotomous variables

More information

Nonparametric tests for the Rasch model: explanation, development, and application of quasi-exact tests for small samples

Nonparametric tests for the Rasch model: explanation, development, and application of quasi-exact tests for small samples Nonparametric tests for the Rasch model: explanation, development, and application of quasi-exact tests for small samples Ingrid Koller 1* & Reinhold Hatzinger 2 1 Department of Psychological Basic Research

More information

Summer School in Applied Psychometric Principles. Peterhouse College 13 th to 17 th September 2010

Summer School in Applied Psychometric Principles. Peterhouse College 13 th to 17 th September 2010 Summer School in Applied Psychometric Principles Peterhouse College 13 th to 17 th September 2010 1 Two- and three-parameter IRT models. Introducing models for polytomous data. Test information in IRT

More information

Investigation into the use of confidence indicators with calibration

Investigation into the use of confidence indicators with calibration WORKSHOP ON FRONTIERS IN BENCHMARKING TECHNIQUES AND THEIR APPLICATION TO OFFICIAL STATISTICS 7 8 APRIL 2005 Investigation into the use of confidence indicators with calibration Gerard Keogh and Dave Jennings

More information

A nonparametric test for path dependence in discrete panel data

A nonparametric test for path dependence in discrete panel data A nonparametric test for path dependence in discrete panel data Maximilian Kasy Department of Economics, University of California - Los Angeles, 8283 Bunche Hall, Mail Stop: 147703, Los Angeles, CA 90095,

More information

Doctor of Philosophy

Doctor of Philosophy MAINTAINING A COMMON ARBITRARY UNIT IN SOCIAL MEASUREMENT STEPHEN HUMPHRY 2005 Submitted in fulfillment of the requirements of the degree of Doctor of Philosophy School of Education, Murdoch University,

More information

Validation of Cognitive Structures: A Structural Equation Modeling Approach

Validation of Cognitive Structures: A Structural Equation Modeling Approach Multivariate Behavioral Research, 38 (1), 1-23 Copyright 2003, Lawrence Erlbaum Associates, Inc. Validation of Cognitive Structures: A Structural Equation Modeling Approach Dimiter M. Dimitrov Kent State

More information

Part 8: GLMs and Hierarchical LMs and GLMs

Part 8: GLMs and Hierarchical LMs and GLMs Part 8: GLMs and Hierarchical LMs and GLMs 1 Example: Song sparrow reproductive success Arcese et al., (1992) provide data on a sample from a population of 52 female song sparrows studied over the course

More information

Preliminary Manual of the software program Multidimensional Item Response Theory (MIRT)

Preliminary Manual of the software program Multidimensional Item Response Theory (MIRT) Preliminary Manual of the software program Multidimensional Item Response Theory (MIRT) July 7 th, 2010 Cees A. W. Glas Department of Research Methodology, Measurement, and Data Analysis Faculty of Behavioural

More information

Logistic Regression: Regression with a Binary Dependent Variable

Logistic Regression: Regression with a Binary Dependent Variable Logistic Regression: Regression with a Binary Dependent Variable LEARNING OBJECTIVES Upon completing this chapter, you should be able to do the following: State the circumstances under which logistic regression

More information

Bayes methods for categorical data. April 25, 2017

Bayes methods for categorical data. April 25, 2017 Bayes methods for categorical data April 25, 2017 Motivation for joint probability models Increasing interest in high-dimensional data in broad applications Focus may be on prediction, variable selection,

More information

Likelihood and Fairness in Multidimensional Item Response Theory

Likelihood and Fairness in Multidimensional Item Response Theory Likelihood and Fairness in Multidimensional Item Response Theory or What I Thought About On My Holidays Giles Hooker and Matthew Finkelman Cornell University, February 27, 2008 Item Response Theory Educational

More information

ADDITIVITY IN PSYCHOLOGICAL MEASUREMENT. Benjamin D. Wright MESA Psychometric Laboratory The Department of Education The University of Chicago

ADDITIVITY IN PSYCHOLOGICAL MEASUREMENT. Benjamin D. Wright MESA Psychometric Laboratory The Department of Education The University of Chicago Measurement and Personality Assessment Edw. E. Roskam (ed.) Elsevier Science Publishers B.V. (North-Holland), 198S 101 ADDITIVITY IN PSYCHOLOGICAL MEASUREMENT Benjamin D. Wright MESA Psychometric Laboratory

More information

Running head: ACTIVITIES, OBVIOUS SOCIAL STIMULI 1. On Probabilistic Causalities between Activities, Obvious Social Stimuli, Inferences, and

Running head: ACTIVITIES, OBVIOUS SOCIAL STIMULI 1. On Probabilistic Causalities between Activities, Obvious Social Stimuli, Inferences, and Running head: ACTIVITIES, OBVIOUS SOCIAL STIMULI 1 On Probabilistic Causalities between Activities, Obvious Social Stimuli, Inferences, and Behavioral Outcomes Raimo J Laasonen IARR/BBS ACTIVITIES, OBVIOUS

More information

Time Dependence of Growth Parameters in Latent Growth Curve Models with Time Invariant Covariates

Time Dependence of Growth Parameters in Latent Growth Curve Models with Time Invariant Covariates Methods of Psychological Research Online 003, Vol.8, No., pp. -4 Department of Psychology Internet: http://www.mpr-online.de 003 University of Koblenz-Landau Time Dependence of Growth Parameters in Latent

More information

Institut für Mathematik

Institut für Mathematik U n i v e r s i t ä t A u g s b u r g Institut für Mathematik Ali Ünlü, Silke Brandt, Dietrich Albert Test Surmise Relations, Test Knowledge Structures, and their Characterizations Preprint Nr. 035/2007

More information

Paradoxical Results in Multidimensional Item Response Theory

Paradoxical Results in Multidimensional Item Response Theory UNC, December 6, 2010 Paradoxical Results in Multidimensional Item Response Theory Giles Hooker and Matthew Finkelman UNC, December 6, 2010 1 / 49 Item Response Theory Educational Testing Traditional model

More information

Student Mathematical Connections in an Introductory Linear Algebra Course. Spencer Payton Washington State University

Student Mathematical Connections in an Introductory Linear Algebra Course. Spencer Payton Washington State University Student Mathematical Connections in an Introductory Linear Algebra Course Spencer Payton Washington State University In an introductory linear algebra course, students are expected to learn a plethora

More information

Model Complexity of Pseudo-independent Models

Model Complexity of Pseudo-independent Models Model Complexity of Pseudo-independent Models Jae-Hyuck Lee and Yang Xiang Department of Computing and Information Science University of Guelph, Guelph, Canada {jaehyuck, yxiang}@cis.uoguelph,ca Abstract

More information

A Comparison of Item-Fit Statistics for the Three-Parameter Logistic Model

A Comparison of Item-Fit Statistics for the Three-Parameter Logistic Model A Comparison of Item-Fit Statistics for the Three-Parameter Logistic Model Cees A. W. Glas, University of Twente, the Netherlands Juan Carlos Suárez Falcón, Universidad Nacional de Educacion a Distancia,

More information

Score-Based Tests of Measurement Invariance with Respect to Continuous and Ordinal Variables

Score-Based Tests of Measurement Invariance with Respect to Continuous and Ordinal Variables Score-Based Tests of Measurement Invariance with Respect to Continuous and Ordinal Variables Achim Zeileis, Edgar C. Merkle, Ting Wang http://eeecon.uibk.ac.at/~zeileis/ Overview Motivation Framework Score-based

More information

Augustin: Some Basic Results on the Extension of Quasi-Likelihood Based Measurement Error Correction to Multivariate and Flexible Structural Models

Augustin: Some Basic Results on the Extension of Quasi-Likelihood Based Measurement Error Correction to Multivariate and Flexible Structural Models Augustin: Some Basic Results on the Extension of Quasi-Likelihood Based Measurement Error Correction to Multivariate and Flexible Structural Models Sonderforschungsbereich 386, Paper 196 (2000) Online

More information

Lecture 21: Spectral Learning for Graphical Models

Lecture 21: Spectral Learning for Graphical Models 10-708: Probabilistic Graphical Models 10-708, Spring 2016 Lecture 21: Spectral Learning for Graphical Models Lecturer: Eric P. Xing Scribes: Maruan Al-Shedivat, Wei-Cheng Chang, Frederick Liu 1 Motivation

More information

Diagnostic Classification Models: Psychometric Issues and Statistical Challenges

Diagnostic Classification Models: Psychometric Issues and Statistical Challenges Diagnostic Classification Models: Psychometric Issues and Statistical Challenges Jonathan Templin Department of Educational Psychology The University of Georgia University of South Carolina Talk Talk Overview

More information

RANDOM INTERCEPT ITEM FACTOR ANALYSIS. IE Working Paper MK8-102-I 02 / 04 / Alberto Maydeu Olivares

RANDOM INTERCEPT ITEM FACTOR ANALYSIS. IE Working Paper MK8-102-I 02 / 04 / Alberto Maydeu Olivares RANDOM INTERCEPT ITEM FACTOR ANALYSIS IE Working Paper MK8-102-I 02 / 04 / 2003 Alberto Maydeu Olivares Instituto de Empresa Marketing Dept. C / María de Molina 11-15, 28006 Madrid España Alberto.Maydeu@ie.edu

More information

Assessing Factorial Invariance in Ordered-Categorical Measures

Assessing Factorial Invariance in Ordered-Categorical Measures Multivariate Behavioral Research, 39 (3), 479-515 Copyright 2004, Lawrence Erlbaum Associates, Inc. Assessing Factorial Invariance in Ordered-Categorical Measures Roger E. Millsap and Jenn Yun-Tein Arizona

More information

How to Measure the Objectivity of a Test

How to Measure the Objectivity of a Test How to Measure the Objectivity of a Test Mark H. Moulton, Ph.D. Director of Research, Evaluation, and Psychometric Development Educational Data Systems Specific Objectivity (Ben Wright, Georg Rasch) Rasch

More information

Heterogeneous mixture-of-experts for fusion of locally valid knowledge-based submodels

Heterogeneous mixture-of-experts for fusion of locally valid knowledge-based submodels ESANN'29 proceedings, European Symposium on Artificial Neural Networks - Advances in Computational Intelligence and Learning. Bruges Belgium), 22-24 April 29, d-side publi., ISBN 2-9337-9-9. Heterogeneous

More information

Donghoh Kim & Se-Kang Kim

Donghoh Kim & Se-Kang Kim Behav Res (202) 44:239 243 DOI 0.3758/s3428-02-093- Comparing patterns of component loadings: Principal Analysis (PCA) versus Independent Analysis (ICA) in analyzing multivariate non-normal data Donghoh

More information