Rasch , 40 (9) : ,,, ,,,, B A cta Psychologica S in ica DO I: /SP. J
|
|
- Derek Norman
- 5 years ago
- Views:
Transcription
1 2008, 40 (9) : A cta Psychologica S in ica DO I: /SP. J Rasch ( 1,,100875) ( 2 Kennedy School of Government, Harvard University, MA 02138, USA) Rasch, 66,,,, Facets Facets,,,Facets, Facets, ; ;Rasch B ,,, [ 6 ] [ 1 ] [ 5 ],,,, [ 7 ] [ 2 ] 1. 2 :,,, [ 3 ], W agner : :, : [ 5 ],,,,,, [ 4 ] : :, E2mail: sunxiaom in@ bnu. edu. cn, :
2 9 :Rasch 1031,,,,,,,,, (Classical Test Theory, CTT) ( Generalizability Theory, GT) CTT,,, CTT,,, CTT, Kendall, CTT,, KendallCTT GT CTT,,, GTCTT, CTT,, GT, ( Item Response Theory, IRT), CTT GT, IRT, IRT, IRT, () CTT, IRT, IRT, IRT, IRT RaschRasch Rasch log P n i 1 - Pn i = B n - D i P n i n i 1 - P n i n i B n n ( n = 1, 2,, N ) D i i ( i = 1, 2,, L ) Rasch,,, Rasch,,,,, L inacre [ 28 ] Rasch(Many Facets Rasch Model, MFRM ) MFRM Rasch, Rasch MFRM : log P n ijk P n ij( k - 1) = B n - D i - C j - F k P n ijk n i j k P n ij( k - 1) n i j k - 1 B n n( n = 1, 2,, N ) D i i( i = 1, 2,, L ) C j j( j = 1, 2,, J ) F k ( Partial Credit Model) k - 1k ( step difficul2 ty), K ( k = 1, 2,, K) logits Logits,
3 , MFRM, [ 10 ],MFRM MFRM, MFRM [ 11 ] [ ] [ ] [ 20 ] [ 21 ] [ ], [ 25 ],,, MFRM,, CTT, MFRM, [ 26 ], John M. L inacre MFRM Facets Facets (Unconditional Maximum L ikelihood) MFRM Facets forw indows [ 27 ] 1. 3 Facets , ;,, MFRM,, ,, A B C D E F G 34, A E H I J K L 32, : IRT MFRM, Facets forw indows M FRM Facets , ,, 1 23,,, [ 29 ] logits, logits, ( S E = 112), 53, logits(se = 0. 22) ; 37, logits(s E = 0. 13) 4 infit ( Infit MnSq), MFRM, fit,, MFRM,,, fit fit,mfrm, fit
4 9 :Rasch 1033, fit fit [ 13 ] [ 14 ] infit 112,,, infit 0. 8, 1,3 infit 2. 28, 1. 20, 1 66 S. E. InfitMnSq S. E. InfitMnSq S. E. InfitMnSq : RM SE (Model) : 0. 18, Adj SD: Separation: 6. 75, Separation Reliability: RM SE Root Mean - Square Standard Error, ( 1 3 )RM SE M SE (Mean2Square Standard Error),M S E Adj. SD Adj. SD ( ) M SE Separation Adj. SD RM S E,, Separation , Separation 3. 0 [ 30 ], Separa2 tion6. 75 Separation Reliability, [ 26 ],, 0;,1 1, Separation Reliability 0. 98, 2,, 2 (65) = , p < 0. 01,
5 , :,, MFRM ( logits) 2, 2 Facets MFRM,, Facets2 MFRM ( logits) MFRM , 1, 66 ; 2 ; 3 ; 4 Facets; 5 ; 6 3 5,
6 9 :Rasch ,, 56,, 16, Facets, 31, 15 25, 56 56,Facets, 3. 3 Facets, 56 7, 56, logits ModelS. E. A E H I J K L ; 2 ; 3 ; 4 ; 5 3, 56 7, 5, 5,, 56, 66, 12 Facets 3. 4Facets Facets, 4 4, 1 ; 2,,, A E ; 4 ; , , , t = , p = , Facets for W indows ,, infit
7 Facets [ 26 ],, infit, Infit, MFRM, IRT CTT [ 31 ],, 56 56,,12 Facets :, Facets :1, 2,,, , 3: 567, 5, 5 :, 56 ;, 5, Facets,, 56, (1) 1 1, 1, 2 12 :, logits, , 4 2, 21. 6, 66,, 1 3,4
8 9 :Rasch 1037,,,,, , 2 1, 2,, :,,,,,,,, MFRM, 5 MFRM, 66,: (1)MFRM,, ( infit ) (2),, Facets Facets,, Facets, MFRM,,, MFRM 1 Maurer T J, Solamon J M. The science and p ractice of a structured emp loyment interview coaching p rogram. 2006, 59 (2) : Personnel Psychology, 2 W iesnerw H, Cronshaw S F. A meta - analytic investigation of the impact of interview format and degree of structure on the validity of the emp loyment interview. Journal of Occupational Psychology, 1988, 61 (4) : Schm idt F L, Zimmerman R D. A counterintuitive hypothesis about emp loyment interview validity and some supporting evidence. Journal of App lied Psychology, 2004, 89 (3) : W alters L C, M illerm R, Ree M J. Structured interviews for p ilot selection: No incremental validity. Psychology, 1993, 3 (1) : 25 International Journal of Aviation 5 A rvey R D. The emp loyment interview: A summary and review of recent research. Personnel Psychology, 1982, 35 (2) : Schm itt N. Social and situational determ inants of interview decisions: Implications for the emp loyment interview. Personnel Psychology, 1976, 29 (1) : Maurer S D, Fay C. Effect of situational interviews, conventional structured interviews, and training on interview rating agreement: an experimental analysis. Personnel Psychology, 1988, 41 (2) : Sun X M, Zhang H C. A comparative study on methods used in estimating reliability of performance assessment: Methods based on correlation, percent of inter - rater reliability and Generalizability theory. Chinese) Psychological Science, 2005, 28 (3), ( in (,. - -., 2005, 28 (03) : ) 9 Shavelson R J, W ebb E A. Generability theory: A p rimer. Newbury Park, CA: SAGE Publications, Inc., Myford C M, Wolfe E W. Detecting and measuring rater effects usingmany - Facet Rasch Measurement: Part II. Journal of App lied Measurement, 2004, 5 (2) : LunzM E. Variation among exam iners and p rotocols on oral
9 exam inations. Paper p resented at the AnnualMeeting of the American Educational Research A ssociation, San Francisco, CA: L inacre J M. The calibration of essay graders. Paper p resented at the M idwest Objective Measurement Sem inar, Chicago, IL: Du Y. Raters and single p romp t - to - p romp t equating using the FACETS model in a writing performance assessment. Paper presented at the N inth International Objective Measurement Conference, Chicago, IL: Kondo - B rown K A. FACETS analysis of rater bias in measuring Japanese second language writing performance. Language Testing, 2002: Yamauchi K. ComparingMany - facet Rasch Model and ANOVA model: Analysis of ratings of essays. Japanese Journal of Educational Psychology, 1999, 47 (3) : Myford C M. Looking for patterns in disagreements: A Facets analysis of human raters and e - rater s scores on essays written for the graduate management admission test ( GMAT). New O rleans, LA. : American Educational Research A ssociation, L inacre J M. An extension of the Rasch Model to multi - faceted situation. Chicago: University of Chicago, LunzM E, W right B D. Measuring the impact of judge severity on exam ination scores. App lied Measurement in Education, 1990, 3 (4) : A llen J M, Schumacker R E. Team assessment utilizing a Many - Facet Rasch Model. Journal of Outcome Measurement, 1998, 2 (2) : Looney M. A many - facet Rasch analysis of 1994 O lymp ic figure skating. Research Quarterly for Exercise & Sport, 1997, 68 (1) : Myford C M, M islevy R J. Monitoring and imp roving a portfolio assessment system, Center for Performance A ssessment Research Report. Princeton, NJ: Educational Testing Service, Chi E. Comparing holistic and analytic scoring for performance assessment with many - facet Rasch model. Journal of App lied Measurement, 2001, 2 (4) : Heller J I, Sheingold K, Myford C M. Reasoning about evidence in portfolios: Cognitive foundations for valid and reliable assessment. Educational A ssessment, 1998, 5 (1) : 5 24 LunzM E. Performance exam inations: Technology for analysis and standard setting. Paper presented at the AnnualMeeting of the National Council of Measurement in Education. Chicago, IL: Kline T L, Schm idt K M, Bowles R. U sing L inlog and FACETS to model item components in the LLTM. Journal of Applied Measurement, 2006, 7 (1) : Sun X M, Zhang H C. An IRT analysis of rater bias in structured interview of national civilian candidates. Acta Psychologica Sinica, 2006, 38 (4), ( in Chinese) (,. IRT., 2006, 38 (04) : ) 27 L inacre J M. Facets forw indows ed. Chicago, IL: MESA Press, L inacre J M, W right B D. Understand Rasch measurement: Construction of measures from Many - facet Data. Journal of App lied Measurement, 2002, 3 (4) : L inacre J M. Understanding Rasch measurement: Op tim izing rating scale category effectiveness. Journal of App lied Measurement, 2002, 3 (1) : LunzM E, W right B D, L inacre J M. Measuring the impact of judge severity on exam ination scores. App lied Measurement in Education, 1990, 3 (4) : L inacre J M, W right B D. Construction of measures from many - facet data. Journal of Applied Measurement, 2002, 3 (4) : A M any - faceted Ra sch M odel Ana lysis of Structured In terv iew SUN Xiao - M in 1, XUE Gang 2 ( 1 School of Psychology, B eijing Key Lab of Applied Experim ental Psychology, B eijing N orm al U niversity, B eijing China) ( 2 Kennedy School of Governm ent, Harvard U niversity, MA 02138, USA ) Abstract Being one of the most important techniques in personnel selection, structured interview has attracted more and more research interest in imp roving its reliability and validity. Some researches focus on the standardization of its content and dimensions, others try to decrease rater bias by intensive rater training. The third one, to handle the possible bias in statistic way, has attracted more and more attention. Many - faceted Rasch Model (MFRM ), an extension to Rasch model, served as such kind of techniques. By parameterizing not only interviewee s ability and item difficulty but also judge severity, MFRM offers an effective way to estimate interviewee s latent trait, that is, the ability, and p rovides detailed information of inter - rater reliability as far as a specific interviewee is concerned. This study used MFRM to analyze the result of a structured interview and demonstrated a creative way to locate the source of bias for a specific
10 9 :Rasch 1039 interviewee. Data came from a structured interview. There were 7 raters in each interview panel. Since the interview last two days, these 21 raters were random ized into 3 panels in the morning of each day in order to p revent cheating. Rating scores of two panels were used in this study. A B C D E F G were raters of one panel and interviewed interviewees numbered A E H I J K L were raters in the other panel that interviewees numbered were interviewed. rated each interviewee independently on five dimensions using a 10 points rating scale. Each rater U sing Facets , a computer p rogram based on MFRM, the abilities of 66 interviewees was estimated, accompanied with a Infit MnSq, which demonstrated the degree to which raters in the panel agreed with each other on the evaluation of a specific interviewee. The ranking order based on interview raw scores and Facets estimated logits values were compared. D ifference were found between those them. To track the source of error for interviewee numbered 56, bias analysis of Facets was also made. The ability of 66 interviewees were reported with infit MnSq, showing inter - rater reliability at individual level; The ranking order based on interview raw score and estimated ability score were quite different, especially for some interviews. Taking interviewee numbered 56 for examp le, the ranking difference was as large as 15. B ias analysis aimed at locating the source of error for interviewee number 56 show that not only rater consistency, but also rater severity contribute to the ranking difference. The results confirmed the utility of MFRM analysis. The app lication of MFRM in the analysis of structured interview was p roved to be not only an effective way in personnel selection, but also p rovided diagnostic information for sources of error locating at individual level. Key words structured interview; IRT; many2faceted Rasch Model
Beyond classical statistics: Different approaches to evaluating marking reliability
Beyond classical statistics: Different approaches to evaluating marking reliability E. Harrison, W. Pointer, Y. Bimpeh & B. Smith What do we mean by marking reliability? Reliability = when something can
More information, A insworth. , B rennan, Clark. ences in Close Relationship s Inventory, ECR), Bowlby. A insworth, : ( 1)
2006, 38 (3) : 399 406 A cta Psychologica S in ica : ( ECR) 3 1 2 ( 1, 100871) ( 2, 8130016, ) ( ECR) 371,, 59 231,, ( ) ( ), ( ECR),,, B848 1 20,,,, A insworth,, B rennan, Clark Shaver [ 1 ] ( Experi2
More information( cop ing) , Gentry (1983) : ( Science of Cop ing) [ 1 ] Van D ick W agner ( 2001) (mobbing), [ 9 ], Sirbaskn Cohen (2001) ,,,
2005, 37 (3) : 413 419 A cta Psychologica S in ica : 3 (, 215021) 1806, 30 :, SARS, ; 0. 86, 0. 79, 63 0. 71, 30,,, B849 1 ( cop ing), Gentry (1983) : ( Science of Cop ing) [ 1 ],,, ;, [ 2, 3 ], [ 4 7
More informationMeasurement Theory. Reliability. Error Sources. = XY r XX. r XY. r YY
Y -3 - -1 0 1 3 X Y -10-5 0 5 10 X Measurement Theory t & X 1 X X 3 X k Reliability e 1 e e 3 e k 1 The Big Picture Measurement error makes it difficult to identify the true patterns of relationships between
More informationM itchelson R L , (Wolfson index) ( Tsui - W ang index) : ; : : ( ) :,, E - mail: edu.
2 8 6 28 No. 6 Vol. 2 0 0 8 1 2 SC IENTIA GEOGRAPH ICA SIN ICA Dec., 2 0 0 8 1, 2, 3 (1., 100101; 2., 100049; 3.,, 130024) : ; 1990, ;, - ;, - : ; ; ; ; : F119. 9 : A : 1000-0690 (2008) 06-0722 - 07, M
More informationItem Response Theory and Computerized Adaptive Testing
Item Response Theory and Computerized Adaptive Testing Richard C. Gershon, PhD Department of Medical Social Sciences Feinberg School of Medicine Northwestern University gershon@northwestern.edu May 20,
More informationGrace College Department of Graduate Counseling Vital Statistics and Program Evaluation Outcomes Vital Statistics
Grace College Department of Graduate Counseling Vital Statistics and Program Evaluation Outcomes 2017-2018 Vital Statistics 2017-2018 Enrollment as of 4/1/2018 (full-time) 113 Graduates (Summer 2017-Spring
More informationCon struction and applica tion of m odeling tendency of land type tran sition ba sed on spa tia l adjacency
29 1 2009 1 ACTA ECOLOGICA SIN ICA Vol. 29, No. 1 Jan., 2009 1, 2, 3, 1, 3 (1., 250014; 2. ( ), 100083; 3., 100101) :,, 2000 2005,,,,, : ; ; ; : 100020933 (2009) 0120337207 : F323. 1, Q147, S126 : A Con
More informationPolytomous Item Explanatory IRT Models with Random Item Effects: An Application to Carbon Cycle Assessment Data
Polytomous Item Explanatory IRT Models with Random Item Effects: An Application to Carbon Cycle Assessment Data Jinho Kim and Mark Wilson University of California, Berkeley Presented on April 11, 2018
More informationChina Academic Journal Electronic Publishing House. All rights reserved JOURNAL OF NATURAL RESOURCES Aug, 2009
24 8 Vol124 No18 20098 JOURNAL OF NATURAL RESOURCES Aug, 2009, (, 710062) : 19962006,,, 15,:,,,,,,, : ; ; ; : F29111: A : 1000-3037 (2009) 08-1378 - 08 1 1978 1719%20064319%, 0193 [ 1 ] 20 90,, 19962002
More informationStatistical and psychometric methods for measurement: Scale development and validation
Statistical and psychometric methods for measurement: Scale development and validation Andrew Ho, Harvard Graduate School of Education The World Bank, Psychometrics Mini Course Washington, DC. June 11,
More information2006 38 (3) : 392 398 A cta Psychologica S in ica ( 510631) 1152 5 : 18 5 8 9 5 R395 1 [ 1 ] ; [ 2 ] 1995 Adrian W ells 1 ( type1 worry) II ( type2 worry) II (metaworry) [ 3 ] 1 II1 1 : 1 ( habituation
More informationBayesian Nonparametric Rasch Modeling: Methods and Software
Bayesian Nonparametric Rasch Modeling: Methods and Software George Karabatsos University of Illinois-Chicago Keynote talk Friday May 2, 2014 (9:15-10am) Ohio River Valley Objective Measurement Seminar
More informationChina Academic Journal Electronic Publishing House. All rights reserved.
15 2 2009 4 Open Education Research Vol. 15, No. 2 Ap r12009 (, 100875),, ; ; ; ; G434 A 100722179 (2009) 0220027210 (Ubiquitous Learning),,,,,, (, 2008),, e2learning (, 2007),, ( ) e2learning, IEEE LOM
More informationMANY- FACET MESA RASCH MEASUREMENT. John M. Linacre
MANY- FACET RASCH MEASUREMENT John M. Linacre MESA MANY-FACET RASCH MEASUREMENT John M. Linacre The University of Chicago MESA PRESS Chicago 1989, 1994 Many-Facet Rasch Measurement John M. Linacre Second
More informationInter-Rater Agreement
Engineering Statistics (EGC 630) Dec., 008 http://core.ecu.edu/psyc/wuenschk/spss.htm Degree of agreement/disagreement among raters Inter-Rater Agreement Psychologists commonly measure various characteristics
More informationAbility Metric Transformations
Ability Metric Transformations Involved in Vertical Equating Under Item Response Theory Frank B. Baker University of Wisconsin Madison The metric transformations of the ability scales involved in three
More information47 JOURNAL OF SUN YATSEN UN IVERSITY Vol. 47 ( 209 ) ( SOC IAL SC IENCE ED ITION) General No. 209
2007 5 ( ) No. 5 2007 47 JOURNAL OF SUN YATSEN UN IVERSITY Vol. 47 ( 209 ) ( SOC IAL SC IENCE ED ITION) General No. 209 : 3 (, 510275) :,,,,, : ; ; ; : D035 : A : 1000-9639 (2007) 05-0082 - 09,,,,,,,,
More informationConditional Standard Errors of Measurement for Performance Ratings from Ordinary Least Squares Regression
Conditional SEMs from OLS, 1 Conditional Standard Errors of Measurement for Performance Ratings from Ordinary Least Squares Regression Mark R. Raymond and Irina Grabovsky National Board of Medical Examiners
More informationWorked Examples for Nominal Intercoder Reliability. by Deen G. Freelon October 30,
Worked Examples for Nominal Intercoder Reliability by Deen G. Freelon (deen@dfreelon.org) October 30, 2009 http://www.dfreelon.com/utils/recalfront/ This document is an excerpt from a paper currently under
More informationThe Difficulty of Test Items That Measure More Than One Ability
The Difficulty of Test Items That Measure More Than One Ability Mark D. Reckase The American College Testing Program Many test items require more than one ability to obtain a correct response. This article
More informationStudies on the effect of violations of local independence on scale in Rasch models: The Dichotomous Rasch model
Studies on the effect of violations of local independence on scale in Rasch models Studies on the effect of violations of local independence on scale in Rasch models: The Dichotomous Rasch model Ida Marais
More informationAn Overview of Item Response Theory. Michael C. Edwards, PhD
An Overview of Item Response Theory Michael C. Edwards, PhD Overview General overview of psychometrics Reliability and validity Different models and approaches Item response theory (IRT) Conceptual framework
More informationPaul Barrett
Paul Barrett email: p.barrett@liv.ac.uk http://www.liv.ac.uk/~pbarrett/paulhome.htm Affiliations: The The State Hospital, Carstairs Dept. of of Clinical Psychology, Univ. Of Of Liverpool 20th 20th November,
More informationUtilizing Hierarchical Linear Modeling in Evaluation: Concepts and Applications
Utilizing Hierarchical Linear Modeling in Evaluation: Concepts and Applications C.J. McKinney, M.A. Antonio Olmos, Ph.D. Kate DeRoche, M.A. Mental Health Center of Denver Evaluation 2007: Evaluation and
More information( Stationary wavelet transform, SW T) [ 5 ]
123 20106 JOURAL OF GEO2IFORATIO SC IECE Vol112, o13 Jun1, 2010, 3 (, 350108;, 350108) :,, : allat (DW T) trous ( SW T) (SCT),IHS PCA IKOOS,,,,, DW T SW T SCT ; DW T SW T SCT IHS PCA,, IHS PCA,, SCT PCA,
More informationAn extended summary of the NCGR/Berkeley Double-Blind Test of Astrology undertaken by Shawn Carlson and published in 1985
From: http://www.astrodivination.com/moa/ncgrberk.htm An extended summary of the NCGR/Berkeley Double-Blind Test of Astrology undertaken by Shawn Carlson and published in 1985 Introduction Under the heading
More informationThe application and empirical comparison of item. parameters of Classical Test Theory and Partial Credit. Model of Rasch in performance assessments
The application and empirical comparison of item parameters of Classical Test Theory and Partial Credit Model of Rasch in performance assessments by Paul Moloantoa Mokilane Student no: 31388248 Dissertation
More informationNew Developments for Extended Rasch Modeling in R
New Developments for Extended Rasch Modeling in R Patrick Mair, Reinhold Hatzinger Institute for Statistics and Mathematics WU Vienna University of Economics and Business Content Rasch models: Theory,
More informationEquating Subscores Using Total Scaled Scores as an Anchor
Research Report ETS RR 11-07 Equating Subscores Using Total Scaled Scores as an Anchor Gautam Puhan Longjuan Liang March 2011 Equating Subscores Using Total Scaled Scores as an Anchor Gautam Puhan and
More informationThe Alignment of the easycbm Middle School Mathematics. CCSS Measures to the Common Core State Standards
Technical Report #1208 The Alignment of the easycbm Middle School Mathematics CCSS Measures to the Common Core State Standards Daniel Anderson P. Shawn Irvin Julie Alonzo Gerald Tindal University of Oregon
More informationBasic IRT Concepts, Models, and Assumptions
Basic IRT Concepts, Models, and Assumptions Lecture #2 ICPSR Item Response Theory Workshop Lecture #2: 1of 64 Lecture #2 Overview Background of IRT and how it differs from CFA Creating a scale An introduction
More informationThe Discriminating Power of Items That Measure More Than One Dimension
The Discriminating Power of Items That Measure More Than One Dimension Mark D. Reckase, American College Testing Robert L. McKinley, Educational Testing Service Determining a correct response to many test
More informationEnsemble Rasch Models
Ensemble Rasch Models Steven M. Lattanzio II Metamatrics Inc., Durham, NC 27713 email: slattanzio@lexile.com Donald S. Burdick Metamatrics Inc., Durham, NC 27713 email: dburdick@lexile.com A. Jackson Stenner
More informationCourse title SD206. Introduction to Structural Equation Modelling
10 th ECPR Summer School in Methods and Techniques, 23 July - 8 August University of Ljubljana, Slovenia Course Description Form 1-2 week course (30 hrs) Course title SD206. Introduction to Structural
More informationValidation of Cognitive Structures: A Structural Equation Modeling Approach
Multivariate Behavioral Research, 38 (1), 1-23 Copyright 2003, Lawrence Erlbaum Associates, Inc. Validation of Cognitive Structures: A Structural Equation Modeling Approach Dimiter M. Dimitrov Kent State
More informationClassical Test Theory. Basics of Classical Test Theory. Cal State Northridge Psy 320 Andrew Ainsworth, PhD
Cal State Northridge Psy 30 Andrew Ainsworth, PhD Basics of Classical Test Theory Theory and Assumptions Types of Reliability Example Classical Test Theory Classical Test Theory (CTT) often called the
More informationUse of e-rater in Scoring of the TOEFL ibt Writing Test
Research Report ETS RR 11-25 Use of e-rater in Scoring of the TOEFL ibt Writing Test Shelby J. Haberman June 2011 Use of e-rater in Scoring of the TOEFL ibt Writing Test Shelby J. Haberman ETS, Princeton,
More informationWeather Second Grade Virginia Standards of Learning 2.6 Assessment Creation Project. Amanda Eclipse
Weather Second Grade Virginia Standards of Learning 2.6 Assessment Creation Project 1 Amanda Eclipse Overview and Description of Course The goal of Virginia s science standards is for the students to develop
More informationExtensions and Applications of Item Explanatory Models to Polytomous Data in Item Response Theory. Jinho Kim
Extensions and Applications of Item Explanatory Models to Polytomous Data in Item Response Theory by Jinho Kim A dissertation submitted in partial satisfaction of the requirements for the degree of Doctor
More informationThe Electron ic PSC Testing System
20 6 JO URAL O F CH IESE IFO RM AT IO PROCESS IG Vol120 o16 : 1003-0077 (2006) 06-0089 - 08,,, (, 230027) : 100,,, 500,, (2144)(2130) :; ;; ; ; : TP391: A The Electron ic PSC Testing System W E I Si, L
More informationA Study of Statistical Power and Type I Errors in Testing a Factor Analytic. Model for Group Differences in Regression Intercepts
A Study of Statistical Power and Type I Errors in Testing a Factor Analytic Model for Group Differences in Regression Intercepts by Margarita Olivera Aguilar A Thesis Presented in Partial Fulfillment of
More informationApplication of Item Response Theory Models for Intensive Longitudinal Data
Application of Item Response Theory Models for Intensive Longitudinal Data Don Hedeker, Robin Mermelstein, & Brian Flay University of Illinois at Chicago hedeker@uic.edu Models for Intensive Longitudinal
More informationUCLA Department of Statistics Papers
UCLA Department of Statistics Papers Title Can Interval-level Scores be Obtained from Binary Responses? Permalink https://escholarship.org/uc/item/6vg0z0m0 Author Peter M. Bentler Publication Date 2011-10-25
More information36-720: The Rasch Model
36-720: The Rasch Model Brian Junker October 15, 2007 Multivariate Binary Response Data Rasch Model Rasch Marginal Likelihood as a GLMM Rasch Marginal Likelihood as a Log-Linear Model Example For more
More informationUNIDIMENSIONALITY IN THE RASCH MODEL: HOW TO DETECT AND INTERPRET. E. Brentari, S. Golia 1. INTRODUCTION
STATISTICA, anno LXVII, n. 3, 2007 UNIDIMENSIONALITY IN THE RASCH MODEL: HOW TO DETECT AND INTERPRET 1. INTRODUCTION The concept of udimensionality is frequently defined as a single latent trait being
More informationCovariates of the Rating Process in Hierarchical Models for Multiple Ratings of Test Items
Journal of Educational and Behavioral Statistics March XXXX, Vol. XX, No. issue;, pp. 1 28 DOI: 10.3102/1076998606298033 Ó AERA and ASA. http://jebs.aera.net Covariates of the Rating Process in Hierarchical
More informationESTIMATION OF IRT PARAMETERS OVER A SMALL SAMPLE. BOOTSTRAPPING OF THE ITEM RESPONSES. Dimitar Atanasov
Pliska Stud. Math. Bulgar. 19 (2009), 59 68 STUDIA MATHEMATICA BULGARICA ESTIMATION OF IRT PARAMETERS OVER A SMALL SAMPLE. BOOTSTRAPPING OF THE ITEM RESPONSES Dimitar Atanasov Estimation of the parameters
More informationHSCCC, (H igh - Speed Countercurrent Chro2 matography, HSCCC) 2 ,, HSCCC, HPLC HSCCC
2007 Vol. 9 No. 1 * * * (/ 100037) :(HSCCC) : (H igh - Speed Countercurrent Chro2 matography, HSCCC)2,, : 2006210210 : 2006211230 * ( KM200410011001) ;,:; (2042006),: * * :,,,Tel: 010268983116, E2mail:
More informationCHEMTAX,. , CHEMTAX, g/m % % 4. 50% % 4. 83% %.
29 2 Vol. 29, No. 2 2010 5 JOURNAL OF OCEANOGRAPHY IN TA IWAN STRA IT May, 2010 2001 2 6 1, 2, 2 (1., 510632; 2., 361005) : (RP2HPLC) 2001 2 6, CHEMTAX,. RP2HPLC, 1 /3,. RP2HPLC, 2001 2 6, ( fucoxanthin)
More information18. SEPARATION STATISTICS
18. SEPARATION STATISTICS The person and item separation statistics used in Rasch measurement are still unfamiliar to some practitioners (Wright, Mead and Bell, 1979 ; Wright and Stone, 1979 ; and Wrightand
More informationRaffaela Wolf, MS, MA. Bachelor of Science, University of Maine, Master of Science, Robert Morris University, 2008
Assessing the Impact of Characteristics of the Test, Common-items, and Examinees on the Preservation of Equity Properties in Mixed-format Test Equating by Raffaela Wolf, MS, MA Bachelor of Science, University
More informationResearch Methodology Statistics Comprehensive Exam Study Guide
Research Methodology Statistics Comprehensive Exam Study Guide References Glass, G. V., & Hopkins, K. D. (1996). Statistical methods in education and psychology (3rd ed.). Boston: Allyn and Bacon. Gravetter,
More informationFRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE
FRANKLIN UNIVERSITY PROFICIENCY EXAM (FUPE) STUDY GUIDE Course Title: Probability and Statistics (MATH 80) Recommended Textbook(s): Number & Type of Questions: Probability and Statistics for Engineers
More informationLesson 6: Reliability
Lesson 6: Reliability Patrícia Martinková Department of Statistical Modelling Institute of Computer Science, Czech Academy of Sciences NMST 570, December 12, 2017 Dec 19, 2017 1/35 Contents 1. Introduction
More informationItem Response Theory (IRT) Analysis of Item Sets
University of Connecticut DigitalCommons@UConn NERA Conference Proceedings 2011 Northeastern Educational Research Association (NERA) Annual Conference Fall 10-21-2011 Item Response Theory (IRT) Analysis
More information(2009) Journal of Rem ote Sensing (, 2006) 2. 1 (, 1999), : ( : 2007CB714402) ;
100724619 (2009) 0220183207 Journal of Rem ote Sensing 1, 2, 3, 3, 3, 1 1., 100101; 2., 100049; 3., 100080 :,,, 3, ( ),1%,, :,,, : TP79 : A 1 20,,,,;, (, 1999),,,,, (, 2006),,,,, 2 2. 1 : 2007209217; :
More informationComparing Change Scores with Lagged Dependent Variables in Models of the Effects of Parents Actions to Modify Children's Problem Behavior
Comparing Change Scores with Lagged Dependent Variables in Models of the Effects of Parents Actions to Modify Children's Problem Behavior David R. Johnson Department of Sociology and Haskell Sie Department
More informationA discussion on methodologies for research into complex system s
4 1 Vol. 4. 1 2009 2 CAA I Transactions on Intelligent System s Feb. 2009 1, 2, 3, 2, 4 (1., 610074; 2., 610500; 3., 610500; 4., 618000) : 20 80,,,, 6 : ; ; Agent ; Swarm; StarLogo : N94, TP273 : A : 167324785
More informationCenter for Advanced Studies in Measurement and Assessment. CASMA Research Report
Center for Advanced Studies in Measurement and Assessment CASMA Research Report Number 25 A Analysis of a Large-scale Reading Comprehension Test. Dongmei Li Robert L. Brennan August 2007 Robert L.Brennan
More informationMeasurement Systems Analysis
Measurement Systems Analysis Since measurement systems represent a subprocess within a process They are subject to Variation. What could be the source of this variation? Why do Measurements Vary? Possible
More informationPairwise Parameter Estimation in Rasch Models
Pairwise Parameter Estimation in Rasch Models Aeilko H. Zwinderman University of Leiden Rasch model item parameters can be estimated consistently with a pseudo-likelihood method based on comparing responses
More informationMulti-level Models: Idea
Review of 140.656 Review Introduction to multi-level models The two-stage normal-normal model Two-stage linear models with random effects Three-stage linear models Two-stage logistic regression with random
More informationPIRLS 2016 Achievement Scaling Methodology 1
CHAPTER 11 PIRLS 2016 Achievement Scaling Methodology 1 The PIRLS approach to scaling the achievement data, based on item response theory (IRT) scaling with marginal estimation, was developed originally
More informationMonte Carlo Simulations for Rasch Model Tests
Monte Carlo Simulations for Rasch Model Tests Patrick Mair Vienna University of Economics Thomas Ledl University of Vienna Abstract: Sources of deviation from model fit in Rasch models can be lack of unidimensionality,
More informationEquating Tests Under The Nominal Response Model Frank B. Baker
Equating Tests Under The Nominal Response Model Frank B. Baker University of Wisconsin Under item response theory, test equating involves finding the coefficients of a linear transformation of the metric
More informationDevelopment and Calibration of an Item Response Model. that Incorporates Response Time
Development and Calibration of an Item Response Model that Incorporates Response Time Tianyou Wang and Bradley A. Hanson ACT, Inc. Send correspondence to: Tianyou Wang ACT, Inc P.O. Box 168 Iowa City,
More information, kw, kw 3176%,, JOURNAL OF NATURAL RESOURCES Aug., , : F42612 : A : (2009)
24 8 Vol124 No18 2009 8 JOURNAL OF NATURAL RESOURCES Aug., 2009 1, 2, 13 (11, 100101; 21, 100049) : 1997 2006 10,,,, 2006 GDP 527111 1 195120 5 745182, 3,,,,, 4 : : ; ; ; : F42612 : A: 1000-3037 (2009)
More informationThe Impact of Item Position Change on Item Parameters and Common Equating Results under the 3PL Model
The Impact of Item Position Change on Item Parameters and Common Equating Results under the 3PL Model Annual Meeting of the National Council on Measurement in Education Vancouver, B.C. Jason L. Meyers
More informationHypothesis Registration: Structural Predictions for the 2013 World Schools Debating Championships
Hypothesis Registration: Structural Predictions for the 2013 World Schools Debating Championships Tom Gole * and Simon Quinn January 26, 2013 Abstract This document registers testable predictions about
More informationCenter for Advanced Studies in Measurement and Assessment. CASMA Research Report
Center for Advanced Studies in Measurement and Assessment CASMA Research Report Number 24 in Relation to Measurement Error for Mixed Format Tests Jae-Chun Ban Won-Chan Lee February 2007 The authors are
More informationLogistic Regression and Item Response Theory: Estimation Item and Ability Parameters by Using Logistic Regression in IRT.
Louisiana State University LSU Digital Commons LSU Historical Dissertations and Theses Graduate School 1998 Logistic Regression and Item Response Theory: Estimation Item and Ability Parameters by Using
More informationThe Robustness of LOGIST and BILOG IRT Estimation Programs to Violations of Local Independence
A C T Research Report Series 87-14 The Robustness of LOGIST and BILOG IRT Estimation Programs to Violations of Local Independence Terry Ackerman September 1987 For additional copies write: ACT Research
More informationRobert Bianchi, PhD Former director DEA Special Testing Laboratory Donald Cooper Former opioid lead in DEA Special Testing Lab
Investigators Robert Bianchi, PhD Former director DEA Special Testing Laboratory Donald Cooper Former opioid lead in DEA Special Testing Lab Donald Jasinski, MD Center for Chemical Dependency, Johns Hopkins
More informationStatistical and psychometric methods for measurement: G Theory, DIF, & Linking
Statistical and psychometric methods for measurement: G Theory, DIF, & Linking Andrew Ho, Harvard Graduate School of Education The World Bank, Psychometrics Mini Course 2 Washington, DC. June 27, 2018
More informationJOURNAL OF NATURAL RESOURCES Mar., 2008 : X24 : A : (2008) : ; : : ( )
23 2 Vol123 No12 2008 3 JOURNAL OF NATURAL RESOURCES Mar., 2008 : (, 100101) :,,,,,,,(),,, : ; ; ; : X24 : A: 1000-3037 (2008) 02-0177 - 08 1,,,,,,,,,,,,,,, 2,, [ 1, 2 ] : 2007-06- 09; : 2007-12- 26 :
More informationAn Equivalency Test for Model Fit. Craig S. Wells. University of Massachusetts Amherst. James. A. Wollack. Ronald C. Serlin
Equivalency Test for Model Fit 1 Running head: EQUIVALENCY TEST FOR MODEL FIT An Equivalency Test for Model Fit Craig S. Wells University of Massachusetts Amherst James. A. Wollack Ronald C. Serlin University
More informationEstimating ability for two samples
Estimating ability for two samples William Revelle David M. Condon Northwestern University Abstract Using IRT to estimate ability is easy, but how accurate are the estimate and what about multiple samples?
More informationThis Unit is suitable for candidates with no previous archaeological experience but who have an interest in history and/or archaeology.
National Unit specification: general information Unit code: H29B 12 Superclass: DC Publication date: August 2012 Source: Scottish Qualifications Authority Version: 01 Summary This Unit introduces candidates
More informationDiagnostic Classification Models: Psychometric Issues and Statistical Challenges
Diagnostic Classification Models: Psychometric Issues and Statistical Challenges Jonathan Templin Department of Educational Psychology The University of Georgia University of South Carolina Talk Talk Overview
More informationGOVERNMENT MAPPING WORKSHOP RECOVER Edmonton s Urban Wellness Plan Mapping Workshop December 4, 2017
GOVERNMENT MAPPING WORKSHOP 12.4.17 RECOVER Edmonton s Urban Wellness Plan Mapping Workshop December 4, 2017 In July of 2017, City Council directed administration to develop RECOVER, Edmonton s Urban Wellness
More informationProgramme Specification (Undergraduate) Chemistry
Programme Specification (Undergraduate) BSc Chemistry This document provides a definitive record of the main features of the programme and the learning outcomes that a typical student may reasonably be
More informationVol112, No11 Feb1, 2010 JOURNAL OF GEO2INFORMATION SC IENCE , CBERS IRS - P5, ;, : ; : E2mail: lreis1ac1cn [ 6-13 ]
12 1 2010 2 JOURNAL OF GEO2INFORMATION SC IENCE Vol112, No11 Feb1, 2010, 3, (, 100101) :,,,,,, CBERS IRS - P5, ;,, 20m 1km ;,, CBERS, 200m, 76% ; P5, 100m, 84% : ; ; ; 1 [ 4-5 ], GIS, [ 6-13 ], ( 1995
More informationAC : GETTING MORE FROM YOUR DATA: APPLICATION OF ITEM RESPONSE THEORY TO THE STATISTICS CONCEPT INVENTORY
AC 2007-1783: GETTING MORE FROM YOUR DATA: APPLICATION OF ITEM RESPONSE THEORY TO THE STATISTICS CONCEPT INVENTORY Kirk Allen, Purdue University Kirk Allen is a post-doctoral researcher in Purdue University's
More informationA Marginal Maximum Likelihood Procedure for an IRT Model with Single-Peaked Response Functions
A Marginal Maximum Likelihood Procedure for an IRT Model with Single-Peaked Response Functions Cees A.W. Glas Oksana B. Korobko University of Twente, the Netherlands OMD Progress Report 07-01. Cees A.W.
More informationCenter for Advanced Studies in Measurement and Assessment. CASMA Research Report
Center for Advanced Studies in Measurement and Assessment CASMA Research Report Number 41 A Comparative Study of Item Response Theory Item Calibration Methods for the Two Parameter Logistic Model Kyung
More informationORF 245 Fundamentals of Engineering Statistics. Midterm Exam 1
Princeton University Department of Operations Research and Financial Engineering ORF 45 Fundamentals of Engineering Statistics Midterm Exam March 06, 009 0:00am-0:50am PLEASE DO NOT TURN THIS PAGE AND
More informationTest Administration Technical Report
Test Administration Technical Report 2015 2016 Program Year May 2017 Prepared for: Center for the Teaching Profession Ohio Department of Education 25 South Front Street, MS 505 Columbus, OH 43215-4183
More informationMeasurement Error in Nonparametric Item Response Curve Estimation
Research Report ETS RR 11-28 Measurement Error in Nonparametric Item Response Curve Estimation Hongwen Guo Sandip Sinharay June 2011 Measurement Error in Nonparametric Item Response Curve Estimation Hongwen
More informationOn the Construction of Adjacent Categories Latent Trait Models from Binary Variables, Motivating Processes and the Interpretation of Parameters
Gerhard Tutz On the Construction of Adjacent Categories Latent Trait Models from Binary Variables, Motivating Processes and the Interpretation of Parameters Technical Report Number 218, 2018 Department
More informationTest Administration Technical Report
Test Administration Technical Report 2013 2014 Program Year October 2014 Prepared for: Center for the Teaching Profession Ohio Department of Education 25 South Front Street, MS 505 Columbus, OH 43215-4183
More informationThe Factor Analytic Method for Item Calibration under Item Response Theory: A Comparison Study Using Simulated Data
Int. Statistical Inst.: Proc. 58th World Statistical Congress, 20, Dublin (Session CPS008) p.6049 The Factor Analytic Method for Item Calibration under Item Response Theory: A Comparison Study Using Simulated
More informationResearch on Standard Errors of Equating Differences
Research Report Research on Standard Errors of Equating Differences Tim Moses Wenmin Zhang November 2010 ETS RR-10-25 Listening. Learning. Leading. Research on Standard Errors of Equating Differences Tim
More informationSummer School in Applied Psychometric Principles. Peterhouse College 13 th to 17 th September 2010
Summer School in Applied Psychometric Principles Peterhouse College 13 th to 17 th September 2010 1 Two- and three-parameter IRT models. Introducing models for polytomous data. Test information in IRT
More informationCentre for Education Research and Practice
A Rasch analysis of the quality of marking of GCSE Science William Pointer Summary The reintroduction of an examination that combines biology, chemistry and physics has created the potential for a reduction
More informationA Simulation Study to Compare CAT Strategies for Cognitive Diagnosis
A Simulation Study to Compare CAT Strategies for Cognitive Diagnosis Xueli Xu Department of Statistics,University of Illinois Hua-Hua Chang Department of Educational Psychology,University of Texas Jeff
More informationInstrumentation (cont.) Statistics vs. Parameters. Descriptive Statistics. Types of Numerical Data
Norm-Referenced vs. Criterion- Referenced Instruments Instrumentation (cont.) October 1, 2007 Note: Measurement Plan Due Next Week All derived scores give meaning to individual scores by comparing them
More informationBase Rates and Bayes Theorem
Slides to accompany Grove s handout March 8, 2016 Table of contents 1 Diagnostic and Prognostic Inference 2 Example inferences Does this patient have Alzheimer s disease, schizophrenia, depression, etc.?
More informationValidation of the TAFE SA Assessment of Basic Skills (TABS) Instrument Francisco Ben, Wilham Hailaya & Sivakumar Alagumalai
2012 Validation of the TAFE SA Assessment of Basic Skills (TABS) Instrument Francisco Ben, Wilham Hailaya & Sivakumar Alagumalai 0 Table of Contents List of Tables... 2 List of Figures... 2 Executive Summary...
More informationChina Academic Journal Electronic Publishing House. All rights reserved.
2006 09 28 5 Modern Law Science Vol128 No. 5 Sep t., 2006 : 1001-2397 (2006) 05-0059 - 09 (, 000115) : :,, ;,,,, ( ), ( ) ;,,,,,,,,, ;,,,, : ; ; : DF082 : A 2006,, ;, :,,, : 2006-07 - 07 : (1935 - ),,,,
More information