Rasch , 40 (9) : ,,, ,,,, B A cta Psychologica S in ica DO I: /SP. J

Size: px
Start display at page:

Download "Rasch , 40 (9) : ,,, ,,,, B A cta Psychologica S in ica DO I: /SP. J"

Transcription

1 2008, 40 (9) : A cta Psychologica S in ica DO I: /SP. J Rasch ( 1,,100875) ( 2 Kennedy School of Government, Harvard University, MA 02138, USA) Rasch, 66,,,, Facets Facets,,,Facets, Facets, ; ;Rasch B ,,, [ 6 ] [ 1 ] [ 5 ],,,, [ 7 ] [ 2 ] 1. 2 :,,, [ 3 ], W agner : :, : [ 5 ],,,,,, [ 4 ] : :, E2mail: sunxiaom in@ bnu. edu. cn, :

2 9 :Rasch 1031,,,,,,,,, (Classical Test Theory, CTT) ( Generalizability Theory, GT) CTT,,, CTT,,, CTT, Kendall, CTT,, KendallCTT GT CTT,,, GTCTT, CTT,, GT, ( Item Response Theory, IRT), CTT GT, IRT, IRT, IRT, () CTT, IRT, IRT, IRT, IRT RaschRasch Rasch log P n i 1 - Pn i = B n - D i P n i n i 1 - P n i n i B n n ( n = 1, 2,, N ) D i i ( i = 1, 2,, L ) Rasch,,, Rasch,,,,, L inacre [ 28 ] Rasch(Many Facets Rasch Model, MFRM ) MFRM Rasch, Rasch MFRM : log P n ijk P n ij( k - 1) = B n - D i - C j - F k P n ijk n i j k P n ij( k - 1) n i j k - 1 B n n( n = 1, 2,, N ) D i i( i = 1, 2,, L ) C j j( j = 1, 2,, J ) F k ( Partial Credit Model) k - 1k ( step difficul2 ty), K ( k = 1, 2,, K) logits Logits,

3 , MFRM, [ 10 ],MFRM MFRM, MFRM [ 11 ] [ ] [ ] [ 20 ] [ 21 ] [ ], [ 25 ],,, MFRM,, CTT, MFRM, [ 26 ], John M. L inacre MFRM Facets Facets (Unconditional Maximum L ikelihood) MFRM Facets forw indows [ 27 ] 1. 3 Facets , ;,, MFRM,, ,, A B C D E F G 34, A E H I J K L 32, : IRT MFRM, Facets forw indows M FRM Facets , ,, 1 23,,, [ 29 ] logits, logits, ( S E = 112), 53, logits(se = 0. 22) ; 37, logits(s E = 0. 13) 4 infit ( Infit MnSq), MFRM, fit,, MFRM,,, fit fit,mfrm, fit

4 9 :Rasch 1033, fit fit [ 13 ] [ 14 ] infit 112,,, infit 0. 8, 1,3 infit 2. 28, 1. 20, 1 66 S. E. InfitMnSq S. E. InfitMnSq S. E. InfitMnSq : RM SE (Model) : 0. 18, Adj SD: Separation: 6. 75, Separation Reliability: RM SE Root Mean - Square Standard Error, ( 1 3 )RM SE M SE (Mean2Square Standard Error),M S E Adj. SD Adj. SD ( ) M SE Separation Adj. SD RM S E,, Separation , Separation 3. 0 [ 30 ], Separa2 tion6. 75 Separation Reliability, [ 26 ],, 0;,1 1, Separation Reliability 0. 98, 2,, 2 (65) = , p < 0. 01,

5 , :,, MFRM ( logits) 2, 2 Facets MFRM,, Facets2 MFRM ( logits) MFRM , 1, 66 ; 2 ; 3 ; 4 Facets; 5 ; 6 3 5,

6 9 :Rasch ,, 56,, 16, Facets, 31, 15 25, 56 56,Facets, 3. 3 Facets, 56 7, 56, logits ModelS. E. A E H I J K L ; 2 ; 3 ; 4 ; 5 3, 56 7, 5, 5,, 56, 66, 12 Facets 3. 4Facets Facets, 4 4, 1 ; 2,,, A E ; 4 ; , , , t = , p = , Facets for W indows ,, infit

7 Facets [ 26 ],, infit, Infit, MFRM, IRT CTT [ 31 ],, 56 56,,12 Facets :, Facets :1, 2,,, , 3: 567, 5, 5 :, 56 ;, 5, Facets,, 56, (1) 1 1, 1, 2 12 :, logits, , 4 2, 21. 6, 66,, 1 3,4

8 9 :Rasch 1037,,,,, , 2 1, 2,, :,,,,,,,, MFRM, 5 MFRM, 66,: (1)MFRM,, ( infit ) (2),, Facets Facets,, Facets, MFRM,,, MFRM 1 Maurer T J, Solamon J M. The science and p ractice of a structured emp loyment interview coaching p rogram. 2006, 59 (2) : Personnel Psychology, 2 W iesnerw H, Cronshaw S F. A meta - analytic investigation of the impact of interview format and degree of structure on the validity of the emp loyment interview. Journal of Occupational Psychology, 1988, 61 (4) : Schm idt F L, Zimmerman R D. A counterintuitive hypothesis about emp loyment interview validity and some supporting evidence. Journal of App lied Psychology, 2004, 89 (3) : W alters L C, M illerm R, Ree M J. Structured interviews for p ilot selection: No incremental validity. Psychology, 1993, 3 (1) : 25 International Journal of Aviation 5 A rvey R D. The emp loyment interview: A summary and review of recent research. Personnel Psychology, 1982, 35 (2) : Schm itt N. Social and situational determ inants of interview decisions: Implications for the emp loyment interview. Personnel Psychology, 1976, 29 (1) : Maurer S D, Fay C. Effect of situational interviews, conventional structured interviews, and training on interview rating agreement: an experimental analysis. Personnel Psychology, 1988, 41 (2) : Sun X M, Zhang H C. A comparative study on methods used in estimating reliability of performance assessment: Methods based on correlation, percent of inter - rater reliability and Generalizability theory. Chinese) Psychological Science, 2005, 28 (3), ( in (,. - -., 2005, 28 (03) : ) 9 Shavelson R J, W ebb E A. Generability theory: A p rimer. Newbury Park, CA: SAGE Publications, Inc., Myford C M, Wolfe E W. Detecting and measuring rater effects usingmany - Facet Rasch Measurement: Part II. Journal of App lied Measurement, 2004, 5 (2) : LunzM E. Variation among exam iners and p rotocols on oral

9 exam inations. Paper p resented at the AnnualMeeting of the American Educational Research A ssociation, San Francisco, CA: L inacre J M. The calibration of essay graders. Paper p resented at the M idwest Objective Measurement Sem inar, Chicago, IL: Du Y. Raters and single p romp t - to - p romp t equating using the FACETS model in a writing performance assessment. Paper presented at the N inth International Objective Measurement Conference, Chicago, IL: Kondo - B rown K A. FACETS analysis of rater bias in measuring Japanese second language writing performance. Language Testing, 2002: Yamauchi K. ComparingMany - facet Rasch Model and ANOVA model: Analysis of ratings of essays. Japanese Journal of Educational Psychology, 1999, 47 (3) : Myford C M. Looking for patterns in disagreements: A Facets analysis of human raters and e - rater s scores on essays written for the graduate management admission test ( GMAT). New O rleans, LA. : American Educational Research A ssociation, L inacre J M. An extension of the Rasch Model to multi - faceted situation. Chicago: University of Chicago, LunzM E, W right B D. Measuring the impact of judge severity on exam ination scores. App lied Measurement in Education, 1990, 3 (4) : A llen J M, Schumacker R E. Team assessment utilizing a Many - Facet Rasch Model. Journal of Outcome Measurement, 1998, 2 (2) : Looney M. A many - facet Rasch analysis of 1994 O lymp ic figure skating. Research Quarterly for Exercise & Sport, 1997, 68 (1) : Myford C M, M islevy R J. Monitoring and imp roving a portfolio assessment system, Center for Performance A ssessment Research Report. Princeton, NJ: Educational Testing Service, Chi E. Comparing holistic and analytic scoring for performance assessment with many - facet Rasch model. Journal of App lied Measurement, 2001, 2 (4) : Heller J I, Sheingold K, Myford C M. Reasoning about evidence in portfolios: Cognitive foundations for valid and reliable assessment. Educational A ssessment, 1998, 5 (1) : 5 24 LunzM E. Performance exam inations: Technology for analysis and standard setting. Paper presented at the AnnualMeeting of the National Council of Measurement in Education. Chicago, IL: Kline T L, Schm idt K M, Bowles R. U sing L inlog and FACETS to model item components in the LLTM. Journal of Applied Measurement, 2006, 7 (1) : Sun X M, Zhang H C. An IRT analysis of rater bias in structured interview of national civilian candidates. Acta Psychologica Sinica, 2006, 38 (4), ( in Chinese) (,. IRT., 2006, 38 (04) : ) 27 L inacre J M. Facets forw indows ed. Chicago, IL: MESA Press, L inacre J M, W right B D. Understand Rasch measurement: Construction of measures from Many - facet Data. Journal of App lied Measurement, 2002, 3 (4) : L inacre J M. Understanding Rasch measurement: Op tim izing rating scale category effectiveness. Journal of App lied Measurement, 2002, 3 (1) : LunzM E, W right B D, L inacre J M. Measuring the impact of judge severity on exam ination scores. App lied Measurement in Education, 1990, 3 (4) : L inacre J M, W right B D. Construction of measures from many - facet data. Journal of Applied Measurement, 2002, 3 (4) : A M any - faceted Ra sch M odel Ana lysis of Structured In terv iew SUN Xiao - M in 1, XUE Gang 2 ( 1 School of Psychology, B eijing Key Lab of Applied Experim ental Psychology, B eijing N orm al U niversity, B eijing China) ( 2 Kennedy School of Governm ent, Harvard U niversity, MA 02138, USA ) Abstract Being one of the most important techniques in personnel selection, structured interview has attracted more and more research interest in imp roving its reliability and validity. Some researches focus on the standardization of its content and dimensions, others try to decrease rater bias by intensive rater training. The third one, to handle the possible bias in statistic way, has attracted more and more attention. Many - faceted Rasch Model (MFRM ), an extension to Rasch model, served as such kind of techniques. By parameterizing not only interviewee s ability and item difficulty but also judge severity, MFRM offers an effective way to estimate interviewee s latent trait, that is, the ability, and p rovides detailed information of inter - rater reliability as far as a specific interviewee is concerned. This study used MFRM to analyze the result of a structured interview and demonstrated a creative way to locate the source of bias for a specific

10 9 :Rasch 1039 interviewee. Data came from a structured interview. There were 7 raters in each interview panel. Since the interview last two days, these 21 raters were random ized into 3 panels in the morning of each day in order to p revent cheating. Rating scores of two panels were used in this study. A B C D E F G were raters of one panel and interviewed interviewees numbered A E H I J K L were raters in the other panel that interviewees numbered were interviewed. rated each interviewee independently on five dimensions using a 10 points rating scale. Each rater U sing Facets , a computer p rogram based on MFRM, the abilities of 66 interviewees was estimated, accompanied with a Infit MnSq, which demonstrated the degree to which raters in the panel agreed with each other on the evaluation of a specific interviewee. The ranking order based on interview raw scores and Facets estimated logits values were compared. D ifference were found between those them. To track the source of error for interviewee numbered 56, bias analysis of Facets was also made. The ability of 66 interviewees were reported with infit MnSq, showing inter - rater reliability at individual level; The ranking order based on interview raw score and estimated ability score were quite different, especially for some interviews. Taking interviewee numbered 56 for examp le, the ranking difference was as large as 15. B ias analysis aimed at locating the source of error for interviewee number 56 show that not only rater consistency, but also rater severity contribute to the ranking difference. The results confirmed the utility of MFRM analysis. The app lication of MFRM in the analysis of structured interview was p roved to be not only an effective way in personnel selection, but also p rovided diagnostic information for sources of error locating at individual level. Key words structured interview; IRT; many2faceted Rasch Model

Beyond classical statistics: Different approaches to evaluating marking reliability

Beyond classical statistics: Different approaches to evaluating marking reliability Beyond classical statistics: Different approaches to evaluating marking reliability E. Harrison, W. Pointer, Y. Bimpeh & B. Smith What do we mean by marking reliability? Reliability = when something can

More information

, A insworth. , B rennan, Clark. ences in Close Relationship s Inventory, ECR), Bowlby. A insworth, : ( 1)

, A insworth. , B rennan, Clark. ences in Close Relationship s Inventory, ECR), Bowlby. A insworth, : ( 1) 2006, 38 (3) : 399 406 A cta Psychologica S in ica : ( ECR) 3 1 2 ( 1, 100871) ( 2, 8130016, ) ( ECR) 371,, 59 231,, ( ) ( ), ( ECR),,, B848 1 20,,,, A insworth,, B rennan, Clark Shaver [ 1 ] ( Experi2

More information

( cop ing) , Gentry (1983) : ( Science of Cop ing) [ 1 ] Van D ick W agner ( 2001) (mobbing), [ 9 ], Sirbaskn Cohen (2001) ,,,

( cop ing) , Gentry (1983) : ( Science of Cop ing) [ 1 ] Van D ick W agner ( 2001) (mobbing), [ 9 ], Sirbaskn Cohen (2001) ,,, 2005, 37 (3) : 413 419 A cta Psychologica S in ica : 3 (, 215021) 1806, 30 :, SARS, ; 0. 86, 0. 79, 63 0. 71, 30,,, B849 1 ( cop ing), Gentry (1983) : ( Science of Cop ing) [ 1 ],,, ;, [ 2, 3 ], [ 4 7

More information

Measurement Theory. Reliability. Error Sources. = XY r XX. r XY. r YY

Measurement Theory. Reliability. Error Sources. = XY r XX. r XY. r YY Y -3 - -1 0 1 3 X Y -10-5 0 5 10 X Measurement Theory t & X 1 X X 3 X k Reliability e 1 e e 3 e k 1 The Big Picture Measurement error makes it difficult to identify the true patterns of relationships between

More information

M itchelson R L , (Wolfson index) ( Tsui - W ang index) : ; : : ( ) :,, E - mail: edu.

M itchelson R L , (Wolfson index) ( Tsui - W ang index) : ; : : ( ) :,, E - mail: edu. 2 8 6 28 No. 6 Vol. 2 0 0 8 1 2 SC IENTIA GEOGRAPH ICA SIN ICA Dec., 2 0 0 8 1, 2, 3 (1., 100101; 2., 100049; 3.,, 130024) : ; 1990, ;, - ;, - : ; ; ; ; : F119. 9 : A : 1000-0690 (2008) 06-0722 - 07, M

More information

Item Response Theory and Computerized Adaptive Testing

Item Response Theory and Computerized Adaptive Testing Item Response Theory and Computerized Adaptive Testing Richard C. Gershon, PhD Department of Medical Social Sciences Feinberg School of Medicine Northwestern University gershon@northwestern.edu May 20,

More information

Grace College Department of Graduate Counseling Vital Statistics and Program Evaluation Outcomes Vital Statistics

Grace College Department of Graduate Counseling Vital Statistics and Program Evaluation Outcomes Vital Statistics Grace College Department of Graduate Counseling Vital Statistics and Program Evaluation Outcomes 2017-2018 Vital Statistics 2017-2018 Enrollment as of 4/1/2018 (full-time) 113 Graduates (Summer 2017-Spring

More information

Con struction and applica tion of m odeling tendency of land type tran sition ba sed on spa tia l adjacency

Con struction and applica tion of m odeling tendency of land type tran sition ba sed on spa tia l adjacency 29 1 2009 1 ACTA ECOLOGICA SIN ICA Vol. 29, No. 1 Jan., 2009 1, 2, 3, 1, 3 (1., 250014; 2. ( ), 100083; 3., 100101) :,, 2000 2005,,,,, : ; ; ; : 100020933 (2009) 0120337207 : F323. 1, Q147, S126 : A Con

More information

Polytomous Item Explanatory IRT Models with Random Item Effects: An Application to Carbon Cycle Assessment Data

Polytomous Item Explanatory IRT Models with Random Item Effects: An Application to Carbon Cycle Assessment Data Polytomous Item Explanatory IRT Models with Random Item Effects: An Application to Carbon Cycle Assessment Data Jinho Kim and Mark Wilson University of California, Berkeley Presented on April 11, 2018

More information

China Academic Journal Electronic Publishing House. All rights reserved JOURNAL OF NATURAL RESOURCES Aug, 2009

China Academic Journal Electronic Publishing House. All rights reserved JOURNAL OF NATURAL RESOURCES Aug, 2009 24 8 Vol124 No18 20098 JOURNAL OF NATURAL RESOURCES Aug, 2009, (, 710062) : 19962006,,, 15,:,,,,,,, : ; ; ; : F29111: A : 1000-3037 (2009) 08-1378 - 08 1 1978 1719%20064319%, 0193 [ 1 ] 20 90,, 19962002

More information

Statistical and psychometric methods for measurement: Scale development and validation

Statistical and psychometric methods for measurement: Scale development and validation Statistical and psychometric methods for measurement: Scale development and validation Andrew Ho, Harvard Graduate School of Education The World Bank, Psychometrics Mini Course Washington, DC. June 11,

More information

2006 38 (3) : 392 398 A cta Psychologica S in ica ( 510631) 1152 5 : 18 5 8 9 5 R395 1 [ 1 ] ; [ 2 ] 1995 Adrian W ells 1 ( type1 worry) II ( type2 worry) II (metaworry) [ 3 ] 1 II1 1 : 1 ( habituation

More information

Bayesian Nonparametric Rasch Modeling: Methods and Software

Bayesian Nonparametric Rasch Modeling: Methods and Software Bayesian Nonparametric Rasch Modeling: Methods and Software George Karabatsos University of Illinois-Chicago Keynote talk Friday May 2, 2014 (9:15-10am) Ohio River Valley Objective Measurement Seminar

More information

China Academic Journal Electronic Publishing House. All rights reserved.

China Academic Journal Electronic Publishing House. All rights reserved. 15 2 2009 4 Open Education Research Vol. 15, No. 2 Ap r12009 (, 100875),, ; ; ; ; G434 A 100722179 (2009) 0220027210 (Ubiquitous Learning),,,,,, (, 2008),, e2learning (, 2007),, ( ) e2learning, IEEE LOM

More information

MANY- FACET MESA RASCH MEASUREMENT. John M. Linacre

MANY- FACET MESA RASCH MEASUREMENT. John M. Linacre MANY- FACET RASCH MEASUREMENT John M. Linacre MESA MANY-FACET RASCH MEASUREMENT John M. Linacre The University of Chicago MESA PRESS Chicago 1989, 1994 Many-Facet Rasch Measurement John M. Linacre Second

More information

Inter-Rater Agreement

Inter-Rater Agreement Engineering Statistics (EGC 630) Dec., 008 http://core.ecu.edu/psyc/wuenschk/spss.htm Degree of agreement/disagreement among raters Inter-Rater Agreement Psychologists commonly measure various characteristics

More information

Ability Metric Transformations

Ability Metric Transformations Ability Metric Transformations Involved in Vertical Equating Under Item Response Theory Frank B. Baker University of Wisconsin Madison The metric transformations of the ability scales involved in three

More information

47 JOURNAL OF SUN YATSEN UN IVERSITY Vol. 47 ( 209 ) ( SOC IAL SC IENCE ED ITION) General No. 209

47 JOURNAL OF SUN YATSEN UN IVERSITY Vol. 47 ( 209 ) ( SOC IAL SC IENCE ED ITION) General No. 209 2007 5 ( ) No. 5 2007 47 JOURNAL OF SUN YATSEN UN IVERSITY Vol. 47 ( 209 ) ( SOC IAL SC IENCE ED ITION) General No. 209 : 3 (, 510275) :,,,,, : ; ; ; : D035 : A : 1000-9639 (2007) 05-0082 - 09,,,,,,,,

More information

Conditional Standard Errors of Measurement for Performance Ratings from Ordinary Least Squares Regression

Conditional Standard Errors of Measurement for Performance Ratings from Ordinary Least Squares Regression Conditional SEMs from OLS, 1 Conditional Standard Errors of Measurement for Performance Ratings from Ordinary Least Squares Regression Mark R. Raymond and Irina Grabovsky National Board of Medical Examiners

More information

Worked Examples for Nominal Intercoder Reliability. by Deen G. Freelon October 30,

Worked Examples for Nominal Intercoder Reliability. by Deen G. Freelon October 30, Worked Examples for Nominal Intercoder Reliability by Deen G. Freelon (deen@dfreelon.org) October 30, 2009 http://www.dfreelon.com/utils/recalfront/ This document is an excerpt from a paper currently under

More information

The Difficulty of Test Items That Measure More Than One Ability

The Difficulty of Test Items That Measure More Than One Ability The Difficulty of Test Items That Measure More Than One Ability Mark D. Reckase The American College Testing Program Many test items require more than one ability to obtain a correct response. This article

More information

Studies on the effect of violations of local independence on scale in Rasch models: The Dichotomous Rasch model

Studies on the effect of violations of local independence on scale in Rasch models: The Dichotomous Rasch model Studies on the effect of violations of local independence on scale in Rasch models Studies on the effect of violations of local independence on scale in Rasch models: The Dichotomous Rasch model Ida Marais

More information

An Overview of Item Response Theory. Michael C. Edwards, PhD

An Overview of Item Response Theory. Michael C. Edwards, PhD An Overview of Item Response Theory Michael C. Edwards, PhD Overview General overview of psychometrics Reliability and validity Different models and approaches Item response theory (IRT) Conceptual framework

More information

Paul Barrett

Paul Barrett Paul Barrett email: p.barrett@liv.ac.uk http://www.liv.ac.uk/~pbarrett/paulhome.htm Affiliations: The The State Hospital, Carstairs Dept. of of Clinical Psychology, Univ. Of Of Liverpool 20th 20th November,

More information

Utilizing Hierarchical Linear Modeling in Evaluation: Concepts and Applications

Utilizing Hierarchical Linear Modeling in Evaluation: Concepts and Applications Utilizing Hierarchical Linear Modeling in Evaluation: Concepts and Applications C.J. McKinney, M.A. Antonio Olmos, Ph.D. Kate DeRoche, M.A. Mental Health Center of Denver Evaluation 2007: Evaluation and

More information

( Stationary wavelet transform, SW T) [ 5 ]

( Stationary wavelet transform, SW T) [ 5 ] 123 20106 JOURAL OF GEO2IFORATIO SC IECE Vol112, o13 Jun1, 2010, 3 (, 350108;, 350108) :,, : allat (DW T) trous ( SW T) (SCT),IHS PCA IKOOS,,,,, DW T SW T SCT ; DW T SW T SCT IHS PCA,, IHS PCA,, SCT PCA,

More information

An extended summary of the NCGR/Berkeley Double-Blind Test of Astrology undertaken by Shawn Carlson and published in 1985

An extended summary of the NCGR/Berkeley Double-Blind Test of Astrology undertaken by Shawn Carlson and published in 1985 From: http://www.astrodivination.com/moa/ncgrberk.htm An extended summary of the NCGR/Berkeley Double-Blind Test of Astrology undertaken by Shawn Carlson and published in 1985 Introduction Under the heading

More information

The application and empirical comparison of item. parameters of Classical Test Theory and Partial Credit. Model of Rasch in performance assessments

The application and empirical comparison of item. parameters of Classical Test Theory and Partial Credit. Model of Rasch in performance assessments The application and empirical comparison of item parameters of Classical Test Theory and Partial Credit Model of Rasch in performance assessments by Paul Moloantoa Mokilane Student no: 31388248 Dissertation

More information

New Developments for Extended Rasch Modeling in R

New Developments for Extended Rasch Modeling in R New Developments for Extended Rasch Modeling in R Patrick Mair, Reinhold Hatzinger Institute for Statistics and Mathematics WU Vienna University of Economics and Business Content Rasch models: Theory,

More information

Equating Subscores Using Total Scaled Scores as an Anchor

Equating Subscores Using Total Scaled Scores as an Anchor Research Report ETS RR 11-07 Equating Subscores Using Total Scaled Scores as an Anchor Gautam Puhan Longjuan Liang March 2011 Equating Subscores Using Total Scaled Scores as an Anchor Gautam Puhan and

More information

The Alignment of the easycbm Middle School Mathematics. CCSS Measures to the Common Core State Standards

The Alignment of the easycbm Middle School Mathematics. CCSS Measures to the Common Core State Standards Technical Report #1208 The Alignment of the easycbm Middle School Mathematics CCSS Measures to the Common Core State Standards Daniel Anderson P. Shawn Irvin Julie Alonzo Gerald Tindal University of Oregon

More information

Basic IRT Concepts, Models, and Assumptions

Basic IRT Concepts, Models, and Assumptions Basic IRT Concepts, Models, and Assumptions Lecture #2 ICPSR Item Response Theory Workshop Lecture #2: 1of 64 Lecture #2 Overview Background of IRT and how it differs from CFA Creating a scale An introduction

More information

The Discriminating Power of Items That Measure More Than One Dimension

The Discriminating Power of Items That Measure More Than One Dimension The Discriminating Power of Items That Measure More Than One Dimension Mark D. Reckase, American College Testing Robert L. McKinley, Educational Testing Service Determining a correct response to many test

More information

Ensemble Rasch Models

Ensemble Rasch Models Ensemble Rasch Models Steven M. Lattanzio II Metamatrics Inc., Durham, NC 27713 email: slattanzio@lexile.com Donald S. Burdick Metamatrics Inc., Durham, NC 27713 email: dburdick@lexile.com A. Jackson Stenner

More information

Course title SD206. Introduction to Structural Equation Modelling

Course title SD206. Introduction to Structural Equation Modelling 10 th ECPR Summer School in Methods and Techniques, 23 July - 8 August University of Ljubljana, Slovenia Course Description Form 1-2 week course (30 hrs) Course title SD206. Introduction to Structural

More information

Validation of Cognitive Structures: A Structural Equation Modeling Approach

Validation of Cognitive Structures: A Structural Equation Modeling Approach Multivariate Behavioral Research, 38 (1), 1-23 Copyright 2003, Lawrence Erlbaum Associates, Inc. Validation of Cognitive Structures: A Structural Equation Modeling Approach Dimiter M. Dimitrov Kent State

More information

Classical Test Theory. Basics of Classical Test Theory. Cal State Northridge Psy 320 Andrew Ainsworth, PhD

Classical Test Theory. Basics of Classical Test Theory. Cal State Northridge Psy 320 Andrew Ainsworth, PhD Cal State Northridge Psy 30 Andrew Ainsworth, PhD Basics of Classical Test Theory Theory and Assumptions Types of Reliability Example Classical Test Theory Classical Test Theory (CTT) often called the

More information

Use of e-rater in Scoring of the TOEFL ibt Writing Test

Use of e-rater in Scoring of the TOEFL ibt Writing Test Research Report ETS RR 11-25 Use of e-rater in Scoring of the TOEFL ibt Writing Test Shelby J. Haberman June 2011 Use of e-rater in Scoring of the TOEFL ibt Writing Test Shelby J. Haberman ETS, Princeton,

More information

Weather Second Grade Virginia Standards of Learning 2.6 Assessment Creation Project. Amanda Eclipse

Weather Second Grade Virginia Standards of Learning 2.6 Assessment Creation Project. Amanda Eclipse Weather Second Grade Virginia Standards of Learning 2.6 Assessment Creation Project 1 Amanda Eclipse Overview and Description of Course The goal of Virginia s science standards is for the students to develop

More information

Extensions and Applications of Item Explanatory Models to Polytomous Data in Item Response Theory. Jinho Kim

Extensions and Applications of Item Explanatory Models to Polytomous Data in Item Response Theory. Jinho Kim Extensions and Applications of Item Explanatory Models to Polytomous Data in Item Response Theory by Jinho Kim A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor

More information

The Electron ic PSC Testing System

The Electron ic PSC Testing System 20 6 JO URAL O F CH IESE IFO RM AT IO PROCESS IG Vol120 o16 : 1003-0077 (2006) 06-0089 - 08,,, (, 230027) : 100,,, 500,, (2144)(2130) :; ;; ; ; : TP391: A The Electron ic PSC Testing System W E I Si, L

More information

A Study of Statistical Power and Type I Errors in Testing a Factor Analytic. Model for Group Differences in Regression Intercepts

A Study of Statistical Power and Type I Errors in Testing a Factor Analytic. Model for Group Differences in Regression Intercepts A Study of Statistical Power and Type I Errors in Testing a Factor Analytic Model for Group Differences in Regression Intercepts by Margarita Olivera Aguilar A Thesis Presented in Partial Fulfillment of

More information

Application of Item Response Theory Models for Intensive Longitudinal Data

Application of Item Response Theory Models for Intensive Longitudinal Data Application of Item Response Theory Models for Intensive Longitudinal Data Don Hedeker, Robin Mermelstein, & Brian Flay University of Illinois at Chicago hedeker@uic.edu Models for Intensive Longitudinal

More information

UCLA Department of Statistics Papers

UCLA Department of Statistics Papers UCLA Department of Statistics Papers Title Can Interval-level Scores be Obtained from Binary Responses? Permalink https://escholarship.org/uc/item/6vg0z0m0 Author Peter M. Bentler Publication Date 2011-10-25

More information

36-720: The Rasch Model

36-720: The Rasch Model 36-720: The Rasch Model Brian Junker October 15, 2007 Multivariate Binary Response Data Rasch Model Rasch Marginal Likelihood as a GLMM Rasch Marginal Likelihood as a Log-Linear Model Example For more

More information

UNIDIMENSIONALITY IN THE RASCH MODEL: HOW TO DETECT AND INTERPRET. E. Brentari, S. Golia 1. INTRODUCTION

UNIDIMENSIONALITY IN THE RASCH MODEL: HOW TO DETECT AND INTERPRET. E. Brentari, S. Golia 1. INTRODUCTION STATISTICA, anno LXVII, n. 3, 2007 UNIDIMENSIONALITY IN THE RASCH MODEL: HOW TO DETECT AND INTERPRET 1. INTRODUCTION The concept of udimensionality is frequently defined as a single latent trait being

More information

Covariates of the Rating Process in Hierarchical Models for Multiple Ratings of Test Items

Covariates of the Rating Process in Hierarchical Models for Multiple Ratings of Test Items Journal of Educational and Behavioral Statistics March XXXX, Vol. XX, No. issue;, pp. 1 28 DOI: 10.3102/1076998606298033 Ó AERA and ASA. http://jebs.aera.net Covariates of the Rating Process in Hierarchical

More information

ESTIMATION OF IRT PARAMETERS OVER A SMALL SAMPLE. BOOTSTRAPPING OF THE ITEM RESPONSES. Dimitar Atanasov

ESTIMATION OF IRT PARAMETERS OVER A SMALL SAMPLE. BOOTSTRAPPING OF THE ITEM RESPONSES. Dimitar Atanasov Pliska Stud. Math. Bulgar. 19 (2009), 59 68 STUDIA MATHEMATICA BULGARICA ESTIMATION OF IRT PARAMETERS OVER A SMALL SAMPLE. BOOTSTRAPPING OF THE ITEM RESPONSES Dimitar Atanasov Estimation of the parameters

More information

HSCCC, (H igh - Speed Countercurrent Chro2 matography, HSCCC) 2 ,, HSCCC, HPLC HSCCC

HSCCC, (H igh - Speed Countercurrent Chro2 matography, HSCCC) 2 ,, HSCCC, HPLC HSCCC 2007 Vol. 9 No. 1 * * * (/ 100037) :(HSCCC) : (H igh - Speed Countercurrent Chro2 matography, HSCCC)2,, : 2006210210 : 2006211230 * ( KM200410011001) ;,:; (2042006),: * * :,,,Tel: 010268983116, E2mail:

More information

CHEMTAX,. , CHEMTAX, g/m % % 4. 50% % 4. 83% %.

CHEMTAX,. , CHEMTAX, g/m % % 4. 50% % 4. 83% %. 29 2 Vol. 29, No. 2 2010 5 JOURNAL OF OCEANOGRAPHY IN TA IWAN STRA IT May, 2010 2001 2 6 1, 2, 2 (1., 510632; 2., 361005) : (RP2HPLC) 2001 2 6, CHEMTAX,. RP2HPLC, 1 /3,. RP2HPLC, 2001 2 6, ( fucoxanthin)

More information

18. SEPARATION STATISTICS

18. SEPARATION STATISTICS 18. SEPARATION STATISTICS The person and item separation statistics used in Rasch measurement are still unfamiliar to some practitioners (Wright, Mead and Bell, 1979 ; Wright and Stone, 1979 ; and Wrightand

More information

Raffaela Wolf, MS, MA. Bachelor of Science, University of Maine, Master of Science, Robert Morris University, 2008

Raffaela Wolf, MS, MA. Bachelor of Science, University of Maine, Master of Science, Robert Morris University, 2008 Assessing the Impact of Characteristics of the Test, Common-items, and Examinees on the Preservation of Equity Properties in Mixed-format Test Equating by Raffaela Wolf, MS, MA Bachelor of Science, University

More information

Research Methodology Statistics Comprehensive Exam Study Guide

Research Methodology Statistics Comprehensive Exam Study Guide Research Methodology Statistics Comprehensive Exam Study Guide References Glass, G. V., & Hopkins, K. D. (1996). Statistical methods in education and psychology (3rd ed.). Boston: Allyn and Bacon. Gravetter,

More information

FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE

FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE Course Title: Probability and Statistics (MATH 80) Recommended Textbook(s): Number & Type of Questions: Probability and Statistics for Engineers

More information

Lesson 6: Reliability

Lesson 6: Reliability Lesson 6: Reliability Patrícia Martinková Department of Statistical Modelling Institute of Computer Science, Czech Academy of Sciences NMST 570, December 12, 2017 Dec 19, 2017 1/35 Contents 1. Introduction

More information

Item Response Theory (IRT) Analysis of Item Sets

Item Response Theory (IRT) Analysis of Item Sets University of Connecticut DigitalCommons@UConn NERA Conference Proceedings 2011 Northeastern Educational Research Association (NERA) Annual Conference Fall 10-21-2011 Item Response Theory (IRT) Analysis

More information

(2009) Journal of Rem ote Sensing (, 2006) 2. 1 (, 1999), : ( : 2007CB714402) ;

(2009) Journal of Rem ote Sensing (, 2006) 2. 1 (, 1999), : ( : 2007CB714402) ; 100724619 (2009) 0220183207 Journal of Rem ote Sensing 1, 2, 3, 3, 3, 1 1., 100101; 2., 100049; 3., 100080 :,,, 3, ( ),1%,, :,,, : TP79 : A 1 20,,,,;, (, 1999),,,,, (, 2006),,,,, 2 2. 1 : 2007209217; :

More information

Comparing Change Scores with Lagged Dependent Variables in Models of the Effects of Parents Actions to Modify Children's Problem Behavior

Comparing Change Scores with Lagged Dependent Variables in Models of the Effects of Parents Actions to Modify Children's Problem Behavior Comparing Change Scores with Lagged Dependent Variables in Models of the Effects of Parents Actions to Modify Children's Problem Behavior David R. Johnson Department of Sociology and Haskell Sie Department

More information

A discussion on methodologies for research into complex system s

A discussion on methodologies for research into complex system s 4 1 Vol. 4. 1 2009 2 CAA I Transactions on Intelligent System s Feb. 2009 1, 2, 3, 2, 4 (1., 610074; 2., 610500; 3., 610500; 4., 618000) : 20 80,,,, 6 : ; ; Agent ; Swarm; StarLogo : N94, TP273 : A : 167324785

More information

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report Center for Advanced Studies in Measurement and Assessment CASMA Research Report Number 25 A Analysis of a Large-scale Reading Comprehension Test. Dongmei Li Robert L. Brennan August 2007 Robert L.Brennan

More information

Measurement Systems Analysis

Measurement Systems Analysis Measurement Systems Analysis Since measurement systems represent a subprocess within a process They are subject to Variation. What could be the source of this variation? Why do Measurements Vary? Possible

More information

Pairwise Parameter Estimation in Rasch Models

Pairwise Parameter Estimation in Rasch Models Pairwise Parameter Estimation in Rasch Models Aeilko H. Zwinderman University of Leiden Rasch model item parameters can be estimated consistently with a pseudo-likelihood method based on comparing responses

More information

Multi-level Models: Idea

Multi-level Models: Idea Review of 140.656 Review Introduction to multi-level models The two-stage normal-normal model Two-stage linear models with random effects Three-stage linear models Two-stage logistic regression with random

More information

PIRLS 2016 Achievement Scaling Methodology 1

PIRLS 2016 Achievement Scaling Methodology 1 CHAPTER 11 PIRLS 2016 Achievement Scaling Methodology 1 The PIRLS approach to scaling the achievement data, based on item response theory (IRT) scaling with marginal estimation, was developed originally

More information

Monte Carlo Simulations for Rasch Model Tests

Monte Carlo Simulations for Rasch Model Tests Monte Carlo Simulations for Rasch Model Tests Patrick Mair Vienna University of Economics Thomas Ledl University of Vienna Abstract: Sources of deviation from model fit in Rasch models can be lack of unidimensionality,

More information

Equating Tests Under The Nominal Response Model Frank B. Baker

Equating Tests Under The Nominal Response Model Frank B. Baker Equating Tests Under The Nominal Response Model Frank B. Baker University of Wisconsin Under item response theory, test equating involves finding the coefficients of a linear transformation of the metric

More information

Development and Calibration of an Item Response Model. that Incorporates Response Time

Development and Calibration of an Item Response Model. that Incorporates Response Time Development and Calibration of an Item Response Model that Incorporates Response Time Tianyou Wang and Bradley A. Hanson ACT, Inc. Send correspondence to: Tianyou Wang ACT, Inc P.O. Box 168 Iowa City,

More information

, kw, kw 3176%,, JOURNAL OF NATURAL RESOURCES Aug., , : F42612 : A : (2009)

, kw, kw 3176%,, JOURNAL OF NATURAL RESOURCES Aug., , : F42612 : A : (2009) 24 8 Vol124 No18 2009 8 JOURNAL OF NATURAL RESOURCES Aug., 2009 1, 2, 13 (11, 100101; 21, 100049) : 1997 2006 10,,,, 2006 GDP 527111 1 195120 5 745182, 3,,,,, 4 : : ; ; ; : F42612 : A: 1000-3037 (2009)

More information

The Impact of Item Position Change on Item Parameters and Common Equating Results under the 3PL Model

The Impact of Item Position Change on Item Parameters and Common Equating Results under the 3PL Model The Impact of Item Position Change on Item Parameters and Common Equating Results under the 3PL Model Annual Meeting of the National Council on Measurement in Education Vancouver, B.C. Jason L. Meyers

More information

Hypothesis Registration: Structural Predictions for the 2013 World Schools Debating Championships

Hypothesis Registration: Structural Predictions for the 2013 World Schools Debating Championships Hypothesis Registration: Structural Predictions for the 2013 World Schools Debating Championships Tom Gole * and Simon Quinn January 26, 2013 Abstract This document registers testable predictions about

More information

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report Center for Advanced Studies in Measurement and Assessment CASMA Research Report Number 24 in Relation to Measurement Error for Mixed Format Tests Jae-Chun Ban Won-Chan Lee February 2007 The authors are

More information

Logistic Regression and Item Response Theory: Estimation Item and Ability Parameters by Using Logistic Regression in IRT.

Logistic Regression and Item Response Theory: Estimation Item and Ability Parameters by Using Logistic Regression in IRT. Louisiana State University LSU Digital Commons LSU Historical Dissertations and Theses Graduate School 1998 Logistic Regression and Item Response Theory: Estimation Item and Ability Parameters by Using

More information

The Robustness of LOGIST and BILOG IRT Estimation Programs to Violations of Local Independence

The Robustness of LOGIST and BILOG IRT Estimation Programs to Violations of Local Independence A C T Research Report Series 87-14 The Robustness of LOGIST and BILOG IRT Estimation Programs to Violations of Local Independence Terry Ackerman September 1987 For additional copies write: ACT Research

More information

Robert Bianchi, PhD Former director DEA Special Testing Laboratory Donald Cooper Former opioid lead in DEA Special Testing Lab

Robert Bianchi, PhD Former director DEA Special Testing Laboratory Donald Cooper Former opioid lead in DEA Special Testing Lab Investigators Robert Bianchi, PhD Former director DEA Special Testing Laboratory Donald Cooper Former opioid lead in DEA Special Testing Lab Donald Jasinski, MD Center for Chemical Dependency, Johns Hopkins

More information

Statistical and psychometric methods for measurement: G Theory, DIF, & Linking

Statistical and psychometric methods for measurement: G Theory, DIF, & Linking Statistical and psychometric methods for measurement: G Theory, DIF, & Linking Andrew Ho, Harvard Graduate School of Education The World Bank, Psychometrics Mini Course 2 Washington, DC. June 27, 2018

More information

JOURNAL OF NATURAL RESOURCES Mar., 2008 : X24 : A : (2008) : ; : : ( )

JOURNAL OF NATURAL RESOURCES Mar., 2008 : X24 : A : (2008) : ; : : ( ) 23 2 Vol123 No12 2008 3 JOURNAL OF NATURAL RESOURCES Mar., 2008 : (, 100101) :,,,,,,,(),,, : ; ; ; : X24 : A: 1000-3037 (2008) 02-0177 - 08 1,,,,,,,,,,,,,,, 2,, [ 1, 2 ] : 2007-06- 09; : 2007-12- 26 :

More information

An Equivalency Test for Model Fit. Craig S. Wells. University of Massachusetts Amherst. James. A. Wollack. Ronald C. Serlin

An Equivalency Test for Model Fit. Craig S. Wells. University of Massachusetts Amherst. James. A. Wollack. Ronald C. Serlin Equivalency Test for Model Fit 1 Running head: EQUIVALENCY TEST FOR MODEL FIT An Equivalency Test for Model Fit Craig S. Wells University of Massachusetts Amherst James. A. Wollack Ronald C. Serlin University

More information

Estimating ability for two samples

Estimating ability for two samples Estimating ability for two samples William Revelle David M. Condon Northwestern University Abstract Using IRT to estimate ability is easy, but how accurate are the estimate and what about multiple samples?

More information

This Unit is suitable for candidates with no previous archaeological experience but who have an interest in history and/or archaeology.

This Unit is suitable for candidates with no previous archaeological experience but who have an interest in history and/or archaeology. National Unit specification: general information Unit code: H29B 12 Superclass: DC Publication date: August 2012 Source: Scottish Qualifications Authority Version: 01 Summary This Unit introduces candidates

More information

Diagnostic Classification Models: Psychometric Issues and Statistical Challenges

Diagnostic Classification Models: Psychometric Issues and Statistical Challenges Diagnostic Classification Models: Psychometric Issues and Statistical Challenges Jonathan Templin Department of Educational Psychology The University of Georgia University of South Carolina Talk Talk Overview

More information

GOVERNMENT MAPPING WORKSHOP RECOVER Edmonton s Urban Wellness Plan Mapping Workshop December 4, 2017

GOVERNMENT MAPPING WORKSHOP RECOVER Edmonton s Urban Wellness Plan Mapping Workshop December 4, 2017 GOVERNMENT MAPPING WORKSHOP 12.4.17 RECOVER Edmonton s Urban Wellness Plan Mapping Workshop December 4, 2017 In July of 2017, City Council directed administration to develop RECOVER, Edmonton s Urban Wellness

More information

Programme Specification (Undergraduate) Chemistry

Programme Specification (Undergraduate) Chemistry Programme Specification (Undergraduate) BSc Chemistry This document provides a definitive record of the main features of the programme and the learning outcomes that a typical student may reasonably be

More information

Vol112, No11 Feb1, 2010 JOURNAL OF GEO2INFORMATION SC IENCE , CBERS IRS - P5, ;, : ; : E2mail: lreis1ac1cn [ 6-13 ]

Vol112, No11 Feb1, 2010 JOURNAL OF GEO2INFORMATION SC IENCE , CBERS IRS - P5, ;, : ; : E2mail: lreis1ac1cn [ 6-13 ] 12 1 2010 2 JOURNAL OF GEO2INFORMATION SC IENCE Vol112, No11 Feb1, 2010, 3, (, 100101) :,,,,,, CBERS IRS - P5, ;,, 20m 1km ;,, CBERS, 200m, 76% ; P5, 100m, 84% : ; ; ; 1 [ 4-5 ], GIS, [ 6-13 ], ( 1995

More information

AC : GETTING MORE FROM YOUR DATA: APPLICATION OF ITEM RESPONSE THEORY TO THE STATISTICS CONCEPT INVENTORY

AC : GETTING MORE FROM YOUR DATA: APPLICATION OF ITEM RESPONSE THEORY TO THE STATISTICS CONCEPT INVENTORY AC 2007-1783: GETTING MORE FROM YOUR DATA: APPLICATION OF ITEM RESPONSE THEORY TO THE STATISTICS CONCEPT INVENTORY Kirk Allen, Purdue University Kirk Allen is a post-doctoral researcher in Purdue University's

More information

A Marginal Maximum Likelihood Procedure for an IRT Model with Single-Peaked Response Functions

A Marginal Maximum Likelihood Procedure for an IRT Model with Single-Peaked Response Functions A Marginal Maximum Likelihood Procedure for an IRT Model with Single-Peaked Response Functions Cees A.W. Glas Oksana B. Korobko University of Twente, the Netherlands OMD Progress Report 07-01. Cees A.W.

More information

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report Center for Advanced Studies in Measurement and Assessment CASMA Research Report Number 41 A Comparative Study of Item Response Theory Item Calibration Methods for the Two Parameter Logistic Model Kyung

More information

ORF 245 Fundamentals of Engineering Statistics. Midterm Exam 1

ORF 245 Fundamentals of Engineering Statistics. Midterm Exam 1 Princeton University Department of Operations Research and Financial Engineering ORF 45 Fundamentals of Engineering Statistics Midterm Exam March 06, 009 0:00am-0:50am PLEASE DO NOT TURN THIS PAGE AND

More information

Test Administration Technical Report

Test Administration Technical Report Test Administration Technical Report 2015 2016 Program Year May 2017 Prepared for: Center for the Teaching Profession Ohio Department of Education 25 South Front Street, MS 505 Columbus, OH 43215-4183

More information

Measurement Error in Nonparametric Item Response Curve Estimation

Measurement Error in Nonparametric Item Response Curve Estimation Research Report ETS RR 11-28 Measurement Error in Nonparametric Item Response Curve Estimation Hongwen Guo Sandip Sinharay June 2011 Measurement Error in Nonparametric Item Response Curve Estimation Hongwen

More information

On the Construction of Adjacent Categories Latent Trait Models from Binary Variables, Motivating Processes and the Interpretation of Parameters

On the Construction of Adjacent Categories Latent Trait Models from Binary Variables, Motivating Processes and the Interpretation of Parameters Gerhard Tutz On the Construction of Adjacent Categories Latent Trait Models from Binary Variables, Motivating Processes and the Interpretation of Parameters Technical Report Number 218, 2018 Department

More information

Test Administration Technical Report

Test Administration Technical Report Test Administration Technical Report 2013 2014 Program Year October 2014 Prepared for: Center for the Teaching Profession Ohio Department of Education 25 South Front Street, MS 505 Columbus, OH 43215-4183

More information

The Factor Analytic Method for Item Calibration under Item Response Theory: A Comparison Study Using Simulated Data

The Factor Analytic Method for Item Calibration under Item Response Theory: A Comparison Study Using Simulated Data Int. Statistical Inst.: Proc. 58th World Statistical Congress, 20, Dublin (Session CPS008) p.6049 The Factor Analytic Method for Item Calibration under Item Response Theory: A Comparison Study Using Simulated

More information

Research on Standard Errors of Equating Differences

Research on Standard Errors of Equating Differences Research Report Research on Standard Errors of Equating Differences Tim Moses Wenmin Zhang November 2010 ETS RR-10-25 Listening. Learning. Leading. Research on Standard Errors of Equating Differences Tim

More information

Summer School in Applied Psychometric Principles. Peterhouse College 13 th to 17 th September 2010

Summer School in Applied Psychometric Principles. Peterhouse College 13 th to 17 th September 2010 Summer School in Applied Psychometric Principles Peterhouse College 13 th to 17 th September 2010 1 Two- and three-parameter IRT models. Introducing models for polytomous data. Test information in IRT

More information

Centre for Education Research and Practice

Centre for Education Research and Practice A Rasch analysis of the quality of marking of GCSE Science William Pointer Summary The reintroduction of an examination that combines biology, chemistry and physics has created the potential for a reduction

More information

A Simulation Study to Compare CAT Strategies for Cognitive Diagnosis

A Simulation Study to Compare CAT Strategies for Cognitive Diagnosis A Simulation Study to Compare CAT Strategies for Cognitive Diagnosis Xueli Xu Department of Statistics,University of Illinois Hua-Hua Chang Department of Educational Psychology,University of Texas Jeff

More information

Instrumentation (cont.) Statistics vs. Parameters. Descriptive Statistics. Types of Numerical Data

Instrumentation (cont.) Statistics vs. Parameters. Descriptive Statistics. Types of Numerical Data Norm-Referenced vs. Criterion- Referenced Instruments Instrumentation (cont.) October 1, 2007 Note: Measurement Plan Due Next Week All derived scores give meaning to individual scores by comparing them

More information

Base Rates and Bayes Theorem

Base Rates and Bayes Theorem Slides to accompany Grove s handout March 8, 2016 Table of contents 1 Diagnostic and Prognostic Inference 2 Example inferences Does this patient have Alzheimer s disease, schizophrenia, depression, etc.?

More information

Validation of the TAFE SA Assessment of Basic Skills (TABS) Instrument Francisco Ben, Wilham Hailaya & Sivakumar Alagumalai

Validation of the TAFE SA Assessment of Basic Skills (TABS) Instrument Francisco Ben, Wilham Hailaya & Sivakumar Alagumalai 2012 Validation of the TAFE SA Assessment of Basic Skills (TABS) Instrument Francisco Ben, Wilham Hailaya & Sivakumar Alagumalai 0 Table of Contents List of Tables... 2 List of Figures... 2 Executive Summary...

More information

China Academic Journal Electronic Publishing House. All rights reserved.

China Academic Journal Electronic Publishing House. All rights reserved. 2006 09 28 5 Modern Law Science Vol128 No. 5 Sep t., 2006 : 1001-2397 (2006) 05-0059 - 09 (, 000115) : :,, ;,,,, ( ), ( ) ;,,,,,,,,, ;,,,, : ; ; : DF082 : A 2006,, ;, :,,, : 2006-07 - 07 : (1935 - ),,,,

More information