Beyond classical statistics: Different approaches to evaluating marking reliability
|
|
- Joel Williamson
- 5 years ago
- Views:
Transcription
1 Beyond classical statistics: Different approaches to evaluating marking reliability E. Harrison, W. Pointer, Y. Bimpeh & B. Smith
2 What do we mean by marking reliability? Reliability = when something can be repeated many times and produces the same results Marking reliability = when marking of a candidate response can be repeated many times and results in the same (or similar) mark(s) Intra-marker reliability (within markers) Inter-marker reliability (between markers)
3 Why does reliable marking matter? awarded score = true score + error Affects how certain we can be that a candidate has been awarded their true score Implications for (grade) classification accuracy Can be the difference between University/not, or job/not Grade changes undermine confidence in the system Ultimately, inaccurate grading upsets the rank order of candidates and is thus unfair
4 How can we measure marking reliability? Variety of methods for monitoring marker consistency across marking: Classical test theory Generalisability theory (Brennan, 2001) Many-facet Rasch model (Engelhard, 1996; Myford and Wolfe, 2003, 2004) Hierarchical rater modelling (Patz, 1996) Rater bundle model (Wilson and Hoskens, 2001) Signal detection rater model (DeCarlo, Kim, and Johnson, 2011) Latent trait model (Wolfe and McVay, 2012) Confirmatory factor analysis (Bollen, 1989; Green & Yang, 2009; Jöreskog, 1971)
5 Example of seeding data Candidate responses (seeds) markers x x x x x x 2 x x x x x x x x x 3 x x x x x x 4 x x 5 x x x x x x x x x 6 x x 7 x x x x x x x x x 8 x x x x 9 x x x x x 10 x x x x x
6 A G theory approach to marking reliability Liz Harrison
7 Background: Classical test theory (CTT) Observed score on this occasion X po = T p +e po Error on this occasion True score Reliability: the correlation between observed and true Cronbach s α often used as a proxy, ideally >0.8 Error undifferentiated
8 Generalisability theory Cronbach and others developed G theory by generalising some of the basic assumptions of CTT Sources of variation are referred to as facets e.g. person, marker, item, form, time, setting, school, etc. A test is viewed as a sample from a wider universe of admissible observations on each facet Brennan, R. L. (2001). Generalizability Theory. New York: Springer-Verlag
9 The G study and the D study G study: Researcher must decide which facets are sources of error Carefully designed to estimate as many sources of error as is reasonably and economically feasible D study: The results from the G study can be used to inform a decision: a D study
10 Component variance approach Person x marker for a single item: Person effect Marker effect Interaction effect + residual X pm = μ + (μ p μ) + (μ m μ) +(X pm μ p μ m + μ) σσ 2 (XX pppp ) = σσ 2 (pp) + σσ 2 (mm) + σσ 2 (pppp, ee)
11 Person x marker x item design X pmi = μ σσ 2 XXXXXXXX = σσ 2 pp + σσ 2 mm + σσ 2 ii + σσ 2 ppmm + σσ 2 ppii + σσ 2 mmii + σσ 2 (pppppp, ee) grand mean + μ p μ person effect + μ m μ marker effect + μ i μ item effect + μ pm μ p μ m + μ person x marker effect + μ pi μ p μ i + μ person x item effect + μ mi μ m μ i + μ marker x item effect + X pmi μ pm μ pi μ mi + μ p + μ m + μ i μ person x marker x item + residual
12 Two measurements of error (person x marker design) σ : absolute error (inter-marker agreement) σσ 2 = σσ2 mm nn mm + σσ2 pppp, ee nn mm σ δδ : relative error (inter-marker reliability) σσ 2 δδ = σσ2 pppp, ee nn mm nn mm = number of markers in the D study
13 Generalisability Coefficient: reliability Eρ 2 = σσ 2 (pp) σσ 2 (pp) + σσ 2 (δδ) Eρ 2 correlation(x p, X pm ) 2 Universe score
14 Design issues Crossed Nested Fixed effects Unbalanced Analogous ANOVA procedure. Henderson s Method 1 (Biometrics, 1953)
15 The D study for mark-remark data Use the simple person x marker design per item We are interested in the impact of error on an individual student; marked by one marker Absolute error: Reliability: σσ ii 2 = σσ ii 2 (mm) + σσ ii 2 (pppp, ee) EEρρ 2 (ii) = σσ ii 2 (pp) σσ ii 2 (pp) + σσ ii 2 mm CC pppp + σσ ii 2 (pppp, ee) See Brennan p.236
16 D study results at item level Item Tariff %σ(p) %σ(m) %σ(pm,e) σ i (Δ) 1a b c d e a b bE c cE
17 Other D studies Eρ i 2 Item absolute error Item Tariff n m =1 n m =2 n m =3 n m =5 n m =1 n m =2 n m =3 n m =5 1a b c d e a b bE c cE
18 Conclusion G theory powerful tool to look at marking reliability Can be used to design better assessments Use for continuous improvement of exam papers by flagging items that have been challenging to mark Brennan, R. L. (2001). Generalizability Theory. New York: Springer-Verlag
19 The Many-Facet Rasch Measurement Model William Pointer, AQA
20 The Many-Facets Rasch Measurement (MFRM) Model loooo PP nnnnnnnn PP nnnnnnnn 11 = BB nn DD ii CC jj FF kk Where: PP nnnnnnnn = probability of candidate n being rated k on item i by marker j PP nnnnnnnn 1 = probability of candidate n being rated k - 1 on item i by marker j BB nn = ability of candidate n DD ii = difficulty of item i CC jj = severity of marker j FF kk = difficulty of scale category k relative to scale category k - 1
21 Missing Data There are often gaps in the data: Not all candidates will answer every item Not all markers will mark every candidate Not a problem for Rasch models The only requirement is that the data is fully connected
22 Key Statistical Indicators A fixed chi-square test Separation Strata Separation reliability
23 Key Statistical Indicators Item Chisquare d.f. Significance Separation Strata Reliability 1a < b < c < d < e < a <
24 Types of Marker Effects Leniency / severity Randomness Central tendency Halo effect Differential leniency / severity
25 Leniency / Severity Are some examiners more severe or lenient than others? Item 1c Chi-square D.F. 19 Significance <0.01 Separation 2.76 Strata 4.01 Reliability 0.88
26 Leniency / Severity Fair average average mark awarded by a marker adjusted to take into count the ability of the candidates that they have marked
27 Randomness Are markers applying the scale in a consistent manner? Key statistical indicators: Infit/outfit SR/ROR correlations
28 Randomness Infit Outfit Correlation MnSq ZStd MnSq ZStd PtMea PtExp Examiner
29 Central Tendency Do markers tend to overuse the middle categories?
30 Summary The model performs well even with lots of missing data The model allows us to analyse candidates, items and markers in detail Person ability estimates are not dependent on markers Can diagnose various different types of marker effects
31 References Andrich, D. (1978). A rating formulation for ordered response categories. Psychometrika, 43, Linacre, M. (1989). Many-facet Rasch measurement. Chicago: MESA Press. Masters, G. (1982). A Rasch model for partial credit scoring. Psychometrika, 47, Myford, C. & Wolfe, E. (2003). Detecting and Measuring Rater Effects Using Many- Facet Rasch Measurement: Part I. Journal of applied measurement, Myford, C. & Wolfe, E. (2004). Detecting and Measuring Rater Effects Using Many- Facet Rasch Measurement: Part II. Journal of applied measurement, Rasch, G. (1960). Probabilistic models for some intelligence and attainment tests. Copyright AQA and its licensors. All rights reserved. AQA Education (AQA) is a registered charity (registered charity number ) and a company limited by guarantee registered in England and Wales (company number ). Registered address: AQA, Devas Street, Manchester M15 6EX.
32 Marking Reliability: using confirmatory factor analysis Yaw Bimpeh
33 Background Data from routine monitoring of on-screen marking has the potential to be used for regular measurement of marking reliability by the UK awarding bodies The objective of the current research is to look at the Confirmatory factor analytic (CFA)-based perspectives on marking reliability
34 Why CFA approach? Many facet Rash model provides micro level of marker analysis G-theory provides macro level of marker analysis CFA provides both micro and macro level of marker analysis
35 CFA approach CFA provides the researcher with a lens to see both the tree and forest of marking
36 Confirmatory factor analytic perspectives on reliability Joreskog (1971) originally formulated the congeneric test model upon which CFA-based estimates of reliability are grounded; it is based on a set of replicates define along a single facet of measurement (e.g., markers, items), and that facet is assumed to be fully crossed with the objects of measurement (e.g., persons)
37 Analysing marking data using congeneric measurement model Xi is marker i score λλ ii is the factor loading ξ is the true or latent score E is the measurement error Congeneric indicators are ones that measure the same latent score There are no restrictions on the factor loadings or error variances except that the error variances are independent
38 Estimation of reliability for general congeneric measures Wright (1918, 1934) developed a method of estimating causal path coefficients by decomposing the correlations among a set of variables Using Wright s tracing rules, one can obtain the reliability of X (i.e., the sum of X1,X2,...Xk) for congeneric indicators of markers. We do this by calculating the proportion of variance due to ξ and the proportion due to E. The estimation of reliability for the general case of congeneric measures, is addressed in the framework of covariance structure analysis (e.g., Feldt & Brenan, 1989; Gilmer & Feldt, 1983; Raykov, 2001b) A more efficiently approach is via latent trait modelling (e.g., Jöreskog, 1971; Raykov, 2009; Raykov, Dimitrov, & Asparouhov, 2010).
39 CFA-based perspective on marking reliability For each item i: Inter-marker reliability = KK ii kk=1 λλ kkkk KK ii kk=1 λλ kkkk 2 2 KK + ii kk=1 vvvvvv(eekkkk ) where λλ kkkk is the marker loading vvvvvv(ee kkkk ) is variance due the measurement error of the kth marker (k = 1,,KK ii ) of seed item i KK ii is the number of markers of scoring seed item i This measure of reliability is the so called omega (McDonald, 1999)
40 Handling missing data Missing data is inherent in the seed marking data, since seeds are evaluated by many markers, but each item gets a different candidate-marker combination Little and Rubin (2002) postulate three different types of missing data: Missing Completely at Random (MCAR) Missing at Random (MAR) Missing Not At Random (MNAR) The missing data in marking is either MCAR or MAR The missing data can be handled in CFA using methods such as: Multiple imputation Full information maximum likelihood Traditional methods like pairwise deletion or list-wise deletion
41 Empirical studies of 2016 A-level Dondology marking reliability using CFA method Item M-type Intermarker Reliability Std.err No. of seeds Markers Maximum mark 1c Seeded di Seeded dii Seeded e Seeded a Seeded
42 Results: item 2a Estimate Std.Err z-value P(> z ) factor loading LV =~ A68 (l1) A7485 (l2) A11550 (l3) A13285 (l4) A13826 (l5) A14599 (l6) A18565 (l7) A24085 (l8) A26725 (l9) A26893 (l10) A28889 (l11) A52195 (l12) A57466 (l13) A (l14) A (l15) A (l16) A (l17) A (l18) A (l19) A (l20) A (l21) A (l22) A (l23) A (l24) A (l25) A (l26) A (l27) A (l28) A (l29) A (l30) A (l31) A (l32)
43 Marker Reliability Marker item 1e Grade of item 2a examiner (tariff=6) (tariff=10) A S A S A S A N A N A S A N A N A N A N A N A N A N A S A N A N A N A N A N A N A N A S A N A N A N A N A N A N A N A N A N A N N=normal S=senior
44 Conclusions We provide an alternative method for estimating item level marking reliability The CFA approach provides marker performance information as well as and inter-marker reliability for seed data It provides confidence interval and standard error for the inter-marker reliability estimates It can handle missing data using methods such as: Mean imputation Full information maximum likelihood And traditional methods like pairwise deletion or list-wise deletion It also provides goodness of fit statistics Copyright AQA and its licensors. All rights reserved. AQA Education (AQA) is a registered charity (registered charity number ) and a company limited by guarantee registered in England and Wales (company number ). Registered address: AQA, Devas Street, Manchester M15 6EX.
45 Pros & cons of G-theory Allows delineation of variance components Index values can be compared across disparate items Can be aggregated to unit/subject level Missing data not an issue Not possible to generate detailed marker-level information Problematic with low sample sizes (negative variance issues)
46 Pros & cons of many-facet Rasch models Produces detailed information on markers, candidates and items Specific marker effects can be diagnosed Missing data not an issue Precision of examinees latent scores tends to infinite as number of raters per item increases Potentially biased standard errors Still needs connected data
47 Pros & cons of confirmatory factor analysis Produces model fit estimates Gives both marker performance and overall reliability statistics If poor fit, model cannot be used Computationally more intensive than other methods Not feasible with high missingness (<10% coverage) so double-marked data Requires relatively even # of markers and candidates
48 Discussion Philosophical differences between the models: Raters as rating machines Raters as independent experts G-theory = rating machines Many facet Rasch = independent experts Confirmatory factor analysis = expert machines? Shavelson, R. J., & Webb, N. M. (1991). Generalizability Theory: A Primer. Thousand Oaks, CA: Sage.
49 Any questions? Copyright AQA and its licensors. All rights reserved. AQA Education (AQA) is a registered charity (registered charity number ) and a company limited by guarantee registered in England and Wales (company number ). Registered address: AQA, Devas Street, Manchester M15 6EX.
Centre for Education Research and Practice
A Rasch analysis of the quality of marking of GCSE Science William Pointer Summary The reintroduction of an examination that combines biology, chemistry and physics has created the potential for a reduction
More informationMeasurement Theory. Reliability. Error Sources. = XY r XX. r XY. r YY
Y -3 - -1 0 1 3 X Y -10-5 0 5 10 X Measurement Theory t & X 1 X X 3 X k Reliability e 1 e e 3 e k 1 The Big Picture Measurement error makes it difficult to identify the true patterns of relationships between
More informationVersion : 1.0: General Certificate of Secondary Education November Higher Unit 2. Final. Mark Scheme
Version : 1.0: 11.11 General Certificate of Secondary Education November 2011 Mathematics Higher Unit 2 43602H Final Mark Scheme Mark schemes are prepared by the Principal Examiner and considered, together
More informationPolytomous Item Explanatory IRT Models with Random Item Effects: An Application to Carbon Cycle Assessment Data
Polytomous Item Explanatory IRT Models with Random Item Effects: An Application to Carbon Cycle Assessment Data Jinho Kim and Mark Wilson University of California, Berkeley Presented on April 11, 2018
More informationELSEVIER FIRST PROOFS CONTRIBUTORS PROOFCHECKING INSTRUCTIONS FOR ENCYCLOPEDIA OF SOCIAL MEASUREMENT
CONTRIBUTORS PROOFCHECKING INSTRUCTIONS FOR ENCYCLOPEDIA OF SOCIAL MEASUREMENT PROOFREADING The text content and layout of your article is not in final form when you receive proofs. Read proofs for accuracy
More informationVersion 1.0. Level 2 Certificate in Further Mathematics. Practice Paper Set 4. Paper /2. Mark Scheme
Version 1.0 Level Certificate in Further Mathematics Practice Paper Set 4 Paper 8360/ Mark Scheme Mark Schemes Principal Examiners have prepared these mark schemes for practice papers. These mark schemes
More informationVersion 1.0. General Certificate of Education (A-level) June Mathematics MS2B. (Specification 6360) Statistics 2B. Final.
Version.0 General Certificate of Education (A-level) June 20 Mathematics MS2B (Specification 660) Statistics 2B Final Mark Scheme Mark schemes are prepared by the Principal Examiner and considered, together
More informationVersion 1.0. Level 2 Certificate in Further Mathematics Practice Paper Set 2. Paper /2. Mark Scheme
Version 1.0 Level Certificate in Further Mathematics Practice Paper Set Paper 8360/ Mark Scheme Mark Schemes Principal Examiners have prepared these mark schemes for practice papers. These mark schemes
More informationConditional Standard Errors of Measurement for Performance Ratings from Ordinary Least Squares Regression
Conditional SEMs from OLS, 1 Conditional Standard Errors of Measurement for Performance Ratings from Ordinary Least Squares Regression Mark R. Raymond and Irina Grabovsky National Board of Medical Examiners
More informationRasch , 40 (9) : ,,, ,,,, B A cta Psychologica S in ica DO I: /SP. J
2008, 40 (9) : 1030 1040 A cta Psychologica S in ica DO I: 10. 3724 /SP. J. 1041. 2008. 01030 Rasch 3 1 2 ( 1,,100875) ( 2 Kennedy School of Government, Harvard University, MA 02138, USA) Rasch, 66,,,,
More informationPMT. Version. General Certificate of Education (A-level) January 2013 MS2B. Mathematics. (Specification 6360) Statistics 2B. Final.
Version General Certificate of Education (A-level) January 01 Mathematics MSB (Specification 660) Statistics B Final Mark Scheme Mark schemes are prepared by the Principal Examiner and considered, together
More informationVersion 1.0. Level 2 Certificate in Further Mathematics Practice Paper Set 1. Paper /2. Mark Scheme
Version 1.0 Level 2 Certificate in Further Mathematics Practice Paper Set 1 Paper 2 8360/2 Mark Scheme Mark Schemes Principal Examiners have prepared these mark schemes for practice papers. These mark
More informationVersion 1.0. General Certificate of Education (A-level) June Mathematics MPC2. (Specification 6360) Pure Core 2. Final.
Version.0 General Certificate of Education (A-level) June 0 Mathematics MPC (Specification 660) Pure Core Final Mark Scheme Mark schemes are prepared by the Principal Examiner and considered, together
More informationVersion 1.0. General Certificate of Education (A-level) June Mathematics MS04. (Specification 6360) Statistics 4. Final.
Version.0 General Certificate of Education (A-level) June 0 Mathematics MS04 (Specification 6360) Statistics 4 Final Mark Scheme Mark schemes are prepared by the Principal Examiner and considered, together
More informationT. Mark Beasley One-Way Repeated Measures ANOVA handout
T. Mark Beasley One-Way Repeated Measures ANOVA handout Profile Analysis Example In the One-Way Repeated Measures ANOVA, two factors represent separate sources of variance. Their interaction presents an
More informationVersion. General Certificate of Education (A-level) January Mathematics MPC1. (Specification 6360) Pure Core 1. Final.
Version General Certificate of Education (A-level) January 01 Mathematics MPC1 (Specification 660) Pure Core 1 Final Mark Scheme Mark schemes are prepared by the Principal Examiner and considered, together
More informationA MULTIVARIATE GENERALIZABILITY ANALYSIS OF STUDENT STYLE QUESTIONNAIRE
A MULTIVARIATE GENERALIZABILITY ANALYSIS OF STUDENT STYLE QUESTIONNAIRE By YOUZHEN ZUO A THESIS PRESENTED TO THE GRADUATE SCHOOL OF THE UNIVERSITY OF FLORIDA IN PARTIAL FULFILLMENT OF THE REQUIREMENTS
More informationMonte Carlo Simulations for Rasch Model Tests
Monte Carlo Simulations for Rasch Model Tests Patrick Mair Vienna University of Economics Thomas Ledl University of Vienna Abstract: Sources of deviation from model fit in Rasch models can be lack of unidimensionality,
More informationVersion1.1. General Certificate of Education (A-level) January 2011 SS04. Statistics. (Specification 6380) Statistics 4.
Version1.1 General Certificate of Education (A-level) January 2011 Statistics SS04 (Specification 6380) Statistics 4 Mark Scheme Mark schemes are prepared by the Principal Examiner and considered, together
More informationVersion 1.0. General Certificate of Secondary Education Practice Paper Set 3. Mathematics (Linear) B. Paper 2 Higher Tier 43652H.
Version 1.0 General Certificate of Secondary Education Practice Paper Set 3 Mathematics (Linear) B Paper 2 Higher Tier 43652H Mark Scheme Mark schemes are prepared by the Principal Examiner and considered,
More informationVersion 1.0. General Certificate of Education (A-level) June 2012 MPC1. Mathematics. (Specification 6360) Pure Core 1. Mark Scheme
Version 1.0 General Certificate of Education (A-level) June 01 Mathematics (Specification 660) Pure Core 1 Mark Scheme Mark schemes are prepared by the Principal Examiner and considered, together with
More informationVersion 1.0. General Certificate of Education (A-level) June Mathematics MM1B. (Specification 6360) Mechanics 1B. Final.
Version 1.0 General Certificate of Education (A-level) June 013 Mathematics MB (Specification 6360) Mechanics 1B Final Mark Scheme Mark schemes are prepared by the Principal Examiner and considered, together
More informationStandardisation Mark Scheme. Additional Science 4463 / Physics PHY2H Unit Physics 2. General Certificate of Secondary Education
Version : :0 09/02/09 General Certificate of Secondary Education Additional Science 4463 / Physics 445 Unit Physics 2 Standardisation Mark Scheme 2009 examination January series Mark schemes are prepared
More informationTeacher Support Materials Maths GCE. Paper Reference MPC4
Teacher Support Materials 2008 Maths GCE Paper Reference MPC4 Copyright 2008 AQA and its licensors. All rights reserved. Permission to reproduce all copyrighted material has been applied for. In some cases,
More informationPMT. Version 1.0. General Certificate of Education (A-level) June 2013 MPC1. Mathematics. (Specification 6360) Pure Core 1. Final.
Version 1.0 General Certificate of Education (A-level) June 01 Mathematics MPC1 (Specification 660) Pure Core 1 Final Mark Scheme Mark schemes are prepared by the Principal Eaminer and considered, together
More informationVersion 1.0. Level 2 Certificate in Further Mathematics Practice Paper Set 1. Paper /1. Mark Scheme
Version 1.0 Level Certificate in Further Mathematics Practice Paper Set 1 Paper 1 860/1 Mark Scheme Mark Schemes Principal Examiners have prepared these mark schemes for practice papers. These mark schemes
More informationPMT. Version 1.0: abc. General Certificate of Education. Mathematics MS03 Statistics 3. Mark Scheme examination - June series
Version 1.0: 0608 abc General Certificate of Education Mathematics 6360 MS03 Statistics 3 Mark Scheme 2008 examination - June series Mark schemes are prepared by the Principal Examiner and considered,
More informationStructural Equation Modeling and Confirmatory Factor Analysis. Types of Variables
/4/04 Structural Equation Modeling and Confirmatory Factor Analysis Advanced Statistics for Researchers Session 3 Dr. Chris Rakes Website: http://csrakes.yolasite.com Email: Rakes@umbc.edu Twitter: @RakesChris
More informationPMT. Version. General Certificate of Education (A-level) January 2013 MPC3. Mathematics. (Specification 6360) Pure Core 3. Final.
Version General Certificate of Education (A-level) January Mathematics MPC (Specification 66) Pure Core Final Mark Scheme Mark schemes are prepared by the Principal Eaminer and considered, together with
More informationPMT. Version 1.0. klm. General Certificate of Education June Mathematics. Pure Core 1. Mark Scheme
Version.0 klm General Certificate of Education June 00 Mathematics MPC Pure Core Mark Scheme Mark schemes are prepared by the Principal Eaminer and considered, together with the relevant questions, by
More informationVersion 1.0. General Certificate of Education (A-level) January 2011 MPC3. Mathematics. (Specification 6360) Pure Core 3.
Version.0 General Certificate of Education (A-level) January 0 Mathematics MPC (Specification 660) Pure Core Mark Scheme Mark schemes are prepared by the Principal Eaminer and considered, together with
More informationVersion 1.0. General Certificate of Education (A-level) June 2012 MPC4. Mathematics. (Specification 6360) Pure Core 4. Mark Scheme
Version.0 General Certificate of Education (A-level) June 0 Mathematics MPC (Specification 660) Pure Core Mark Scheme Mark schemes are prepared by the Principal Eaminer and considered, together with the
More informationGeneral Certificate of Secondary Education January Mathematics (Linear) B 4365 Paper 1 Foundation Tier. Final. Mark Scheme
General Certificate of Secondary Education January 2013 Mathematics (Linear) B 4365 Paper 1 Foundation Tier Final Mark Scheme Mark schemes are prepared by the Principal Examiner and considered, together
More informationabc Mark Scheme Additional Science 4463 / Chemistry 4421 General Certificate of Secondary Education Unit 2 Chemistry 2009 examination June series
Version.0: 2.07 abc General Certificate of Secondary Education Additional Science 4463 / Chemistry 442 Unit 2 Chemistry Mark Scheme 2009 examination June series Mark schemes are prepared by the Principal
More information1. How will an increase in the sample size affect the width of the confidence interval?
Study Guide Concept Questions 1. How will an increase in the sample size affect the width of the confidence interval? 2. How will an increase in the sample size affect the power of a statistical test?
More informationVersion : klm. General Certificate of Education. Mathematics MPC3 Pure Core 3. Mark Scheme examination - June series
Version :. 69 klm General Certificate of Education Mathematics 66 MPC Pure Core Mark Scheme 9 examination - June series Mark schemes are prepared by the Principal Examiner and considered, together with
More informationVersion 1.0. General Certificate of Education (A-level) June Statistics SS06. (Specification 6380) Statistics 6. Final.
Version 1.0 General Certificate of Education (A-level) June 013 Statistics SS06 (Specification 6380) Statistics 6 Final Mark Scheme Mark schemes are prepared by the Principal Examiner and considered, together
More informationVersion : abc. General Certificate of Education. Mathematics MPC4 Pure Core 4. Report on the Examination
Version :.0 0608 abc General Certificate of Education Mathematics 6360 MPC4 Pure Core 4 Report on the Examination 008 examination - June series Further copies of this Report are available to download from
More informationModeling differences in itemposition effects in the PISA 2009 reading assessment within and between schools
Modeling differences in itemposition effects in the PISA 2009 reading assessment within and between schools Dries Debeer & Rianne Janssen (University of Leuven) Johannes Hartig & Janine Buchholz (DIPF)
More informationVersion 1,0. General Certificate of Education (A-level) June 2012 MPC3. Mathematics. (Specification 6360) Pure Core 3. Mark Scheme
Version,0 General Certificate of Education (A-level) June 0 Mathematics MPC (Specification 660) Pure Core Mark Scheme Mark schemes are prepared by the Principal Eaminer and considered, together with the
More informationConfirmatory Factor Analysis. Psych 818 DeShon
Confirmatory Factor Analysis Psych 818 DeShon Purpose Takes factor analysis a few steps further. Impose theoretically interesting constraints on the model and examine the resulting fit of the model with
More informationVersion 1.0: abc. General Certificate of Education. Mathematics Mechanics 1B. Mark Scheme examination - June series
Version 1.0: 0608 abc General Certificate of Education Mathematics 6360 MB Mechanics 1B Mark Scheme 008 examination - June series Mark schemes are prepared by the Principal Examiner and considered, together
More informationVersion 1.0: hij. General Certificate of Education. Mathematics Mechanics 1B. Mark Scheme examination - January series.
Version 1.0: 0109 hij General Certificate of Education Mathematics 6360 MB Mechanics 1B Mark Scheme 009 examination - January series Final Mark schemes are prepared by the Principal Examiner and considered,
More informationRelated Concepts: Lecture 9 SEM, Statistical Modeling, AI, and Data Mining. I. Terminology of SEM
Lecture 9 SEM, Statistical Modeling, AI, and Data Mining I. Terminology of SEM Related Concepts: Causal Modeling Path Analysis Structural Equation Modeling Latent variables (Factors measurable, but thru
More informationVersion 1.0: klm. General Certificate of Education. Mathematics MFP4 Further Pure 4. Mark Scheme examination - January series
Version 1.0: 0110 klm General Certificate of Education Mathematics 660 MFP Further Pure Mark Scheme 010 examination - January series Mark schemes are prepared by the Principal Examiner and considered,
More informationVersion 1.0: abc. General Certificate of Education. Mathematics MDO2 Decision 02. Mark Scheme examination - January series
Version.0: 0.08 abc General Certificate of Education Mathematics 6360 MDO2 Decision 02 Mark Scheme 2008 examination - January series Mark schemes are prepared by the Principal Examiner and considered,
More informationAQA Qualifications GCSE MATHEMATICS. Topic tests Higher tier - Mark schemes. Version 1.1
AQA Qualifications GCSE MATHEMATICS Topic tests Higher tier - Mark schemes Version. Our specification is published on our website (www.aqa.org.uk). We will let centres know in writing about any changes
More informationPackage gtheory. October 30, 2016
Package gtheory October 30, 2016 Version 0.1.2 Date 2016-10-22 Title Apply Generalizability Theory with R Depends lme4 Estimates variance components, generalizability coefficients, universe scores, and
More informationVersion 1.0. General Certificate of Education (A-level) January Mathematics MS2B. (Specification 6360) Statistics 2B. Final.
Version.0 General Certificate of Education (A-level) January 0 Mathematics MSB (Specification 6360) Statistics B Final Mark Scheme Mark schemes are prepared by the Principal Examiner and considered, together
More informationTime-Invariant Predictors in Longitudinal Models
Time-Invariant Predictors in Longitudinal Models Topics: What happens to missing predictors Effects of time-invariant predictors Fixed vs. systematically varying vs. random effects Model building strategies
More informationModeling rater effects using a combination of Generalizability Theory and IRT
Psychological Test and Assessment Modeling, Volume 60, 2018 (1), 53-80 Modeling rater effects using a combination of Generalizability Theory and IRT Jinnie Choi 1 & Mark R. Wilson 2 Abstract Motivated
More informationVersion 1.0. General Certificate of Education (A-level) June Mathematics MPC3. (Specification 6360) Pure Core 3. Final.
Version.0 General Certificate of Education (A-level) June 0 Mathematics MPC3 (Specification 6360) Pure Core 3 Final Mark Scheme Mark schemes are prepared by the Principal Eaminer and considered, together
More informationFinal. Mark Scheme. Physics A PHYA1. (Specification 2450) Unit 1: Particles, quantum phenomena and electricity
Version.0 General Certificate of Education (A-level) June 0 Physics A PHYA (Specification 450) Unit : Particles, quantum phenomena and electricity Final Mark Scheme Mark schemes are prepared by the Principal
More informationVersion 1.0. General Certificate of Education (A-level) June Mathematics MM1B. (Specification 6360) Mechanics 1B. Final.
Version.0 General Certificate of Education (A-level) June 0 Mathematics MB (Specification 6360) Mechanics B Final Mark Scheme Mark schemes are prepared by the Principal Examiner and considered, together
More informationTeacher Support Materials. Maths GCE. Paper Reference MM03
klm Teacher Support Materials Maths GCE Paper Reference MM03 Copyright 2008 AQA and its licensors. All rights reserved. Permission to reproduce all copyrighted material has been applied for. In some cases,
More informationConfirmatory Factor Analysis: Model comparison, respecification, and more. Psychology 588: Covariance structure and factor models
Confirmatory Factor Analysis: Model comparison, respecification, and more Psychology 588: Covariance structure and factor models Model comparison 2 Essentially all goodness of fit indices are descriptive,
More informationVersion 1.0: abc. General Certificate of Education. Mathematics MFP2 Further Pure 2. Mark Scheme examination June series
Version.0: 0608 abc General Certificate of Education Mathematics 6360 MFP Further Pure Mark Scheme 008 examination June series Mark schemes are prepared by the Principal Examiner and considered, together
More informationVersion: abc. General Certificate of Education. Mathematics MPC4 Pure Core 4. Mark Scheme examination - June series
Version:.0 0608 abc General Certificate of Education Mathematics 660 MPC4 Pure Core 4 Mark Scheme 008 examination - June series Mark schemes are prepared by the Principal Examiner and considered, together
More informationGeneral Certificate of Education Advanced Level Examination June 2014
General Certificate of Education Advanced Level Examination June 2014 Biology BIO6T/Q14/task Unit 6T A2 Investigative Skills Assignment Task Sheet Introduction Investigating populations You will use leaves
More informationSpecifying Latent Curve and Other Growth Models Using Mplus. (Revised )
Ronald H. Heck 1 University of Hawai i at Mānoa Handout #20 Specifying Latent Curve and Other Growth Models Using Mplus (Revised 12-1-2014) The SEM approach offers a contrasting framework for use in analyzing
More informationPMT. Version 1.0. General Certificate of Education (A-level) June 2013 MPC3. Mathematics. (Specification 6360) Pure Core 3. Final.
Version.0 General Certificate of Education (A-level) June 0 Mathematics MPC (Specification 660) Pure Core Final Mark Scheme Mark schemes are prepared by the Principal Eaminer and considered, together with
More informationReport on the Examination
Version 1.0 General Certificate of Education (A-level) January 01 Mathematics MPC4 (Specification 660) Pure Core 4 Report on the Examination Further copies of this Report on the Examination are available
More informationVersion 1.0. klm. General Certificate of Education June Mathematics. Pure Core 4. Mark Scheme
Version.0 klm General Certificate of Education June 00 Mathematics MPC4 Pure Core 4 Mark Scheme Mark schemes are prepared by the Principal Examiner and considered, together with the relevant questions,
More informationPIRLS 2016 Achievement Scaling Methodology 1
CHAPTER 11 PIRLS 2016 Achievement Scaling Methodology 1 The PIRLS approach to scaling the achievement data, based on item response theory (IRT) scaling with marginal estimation, was developed originally
More informationabc Mark Scheme Mathematics 3301 Specification A General Certificate of Secondary Education Paper 2 Higher 2007 examination - June series
Version 1.0: 20/06/07 abc General Certificate of Secondary Education Mathematics 3301 Specification A Paper 2 Higher Mark Scheme 2007 examination - June series Mark schemes are prepared by the Principal
More informationVersion 1.0. General Certificate of Education (A-level) June 2012 MS/SS1A. Mathematics. (Specification 6360) Statistics 1A.
Version 1.0 General Certificate of Education (A-level) June 2012 Mathematics MS/SS1A (Specification 6360) Statistics 1A Mark Scheme Mark schemes are prepared by the Principal Examiner and considered, together
More informationGCE. Mark scheme January 2004
Mark scheme January 2004 GCE The Assessment and Qualifications Alliance (AQA) is a company limited by guarantee registered in England and Wales 3644723 and a registered charity number 1073334. Registered
More informationabc Mark Scheme Physics 5451 Specification A General Certificate of Education Mechanics and Molecular Kinetic Theory 2008 examination - June series
Version 1.0: 0708 abc General Certificate of Education Physics 551 Specification A PA0 Mechanics and Molecular Kinetic Theory Mark Scheme 008 examination - June series Mark schemes are prepared by the
More informationResearch on Standard Errors of Equating Differences
Research Report Research on Standard Errors of Equating Differences Tim Moses Wenmin Zhang November 2010 ETS RR-10-25 Listening. Learning. Leading. Research on Standard Errors of Equating Differences Tim
More informationPMT. Version 1.0: abc. General Certificate of Education. Mathematics MPC3 Pure Core 3. Mark Scheme examination - June series
Version.0: 0608 abc General Certificate of Education Mathematics 660 MPC Pure Core Mark Scheme 008 eamination - June series MPC - AQA GCE Mark Scheme 008 June series Mark schemes are prepared by the Principal
More informationMeasurement Invariance (MI) in CFA and Differential Item Functioning (DIF) in IRT/IFA
Topics: Measurement Invariance (MI) in CFA and Differential Item Functioning (DIF) in IRT/IFA What are MI and DIF? Testing measurement invariance in CFA Testing differential item functioning in IRT/IFA
More informationGCE AS and A Level. Physics A. AS exams 2009 onwards A2 exams 2010 onwards. Unit 5D: Approved specimen mark scheme. Version 1.1
GCE AS and A Level Physics A AS exams 2009 onwards A2 exams 2010 onwards Unit 5D: Approved specimen mark scheme Version 1.1 version : 1.1 abc General Certificate of Education Physics 2450 Specification
More informationVersion 1.0. General Certificate of Education (A-level) January Mathematics MPC1. (Specification 6360) Pure Core 1. Final.
Version 1.0 General Certificate of Education (A-level) January 01 Mathematics MPC1 (Specification 6360) Pure Core 1 Final Mark Scheme Mark schemes are prepared by the Principal Examiner and considered,
More informationMark Scheme. Science B 4462 / Physics General Certificate of Secondary Education. PHY1H Unit Physics Examination June Series
Version.0: 29/07/20 General Certificate of Secondary Education Science B 4462 / Physics 445 PHYH Unit Physics Mark Scheme 20 Examination June Series Mark schemes are prepared by the Principal Examiner
More informationVersion 1.0. General Certificate of Education (A-level) June 2012 MM1B. Mathematics. (Specification 6360) Mechanics 1B.
Version 1.0 General Certificate of Education (A-level) June 01 Mathematics (Specification 6360) Mechanics 1B Mark Scheme Mark schemes are prepared by the Principal Examiner and considered, together with
More informationVersion 1.0: abc. General Certificate of Education. Mathematics MPC1 Pure Core 1. Mark Scheme examination - January series
Version.0: 0.08 abc General Certificate of Education Mathematics 660 MPC Pure Core Mark Scheme 008 examination - January series Mark schemes are prepared by the Principal Examiner and considered, together
More informationChapter 8. Models with Structural and Measurement Components. Overview. Characteristics of SR models. Analysis of SR models. Estimation of SR models
Chapter 8 Models with Structural and Measurement Components Good people are good because they've come to wisdom through failure. Overview William Saroyan Characteristics of SR models Estimation of SR models
More informationVersion General Certificate of Education. Mathematics MPC1 Pure Core 1. Mark Scheme examination - January series
Version.0 00 General Certificate of Education Mathematics 660 MPC Pure Core Mark Scheme 00 examination - January series Mark schemes are prepared by the Principal Examiner and considered, together with
More informationVersion 1.0. General Certificate of Education (A-level) June 2012 MPC2. Mathematics. (Specification 6360) Pure Core 2. Mark Scheme
Version.0 General Certificate of Education (A-level) June 0 Mathematics MPC (Specification 660) Pure Core Mark Scheme Mark schemes are prepared by the Principal Eaminer and considered, together with the
More informationMixed- Model Analysis of Variance. Sohad Murrar & Markus Brauer. University of Wisconsin- Madison. Target Word Count: Actual Word Count: 2755
Mixed- Model Analysis of Variance Sohad Murrar & Markus Brauer University of Wisconsin- Madison The SAGE Encyclopedia of Educational Research, Measurement and Evaluation Target Word Count: 3000 - Actual
More informationVersion 1.0: abc. General Certificate of Education. Physics Specification B. Practical Examination. Mark Scheme
Version 1.0: 0807 abc General Certificate of Education Physics 46 Specification B PHB Practical Examination Mark Scheme 2007 examination - June series Mark schemes are prepared by the Principal Examiner
More informationabc GCE 2005 January Series Mark Scheme Physics Specification A Medical Physics
GCE 2005 January Series abc Mark Scheme Physics Specification A PHA6/W Medical Physics Mark schemes are prepared by the Principal Examiner and considered, together with the relevant questions, by a panel
More informationMultilevel Statistical Models: 3 rd edition, 2003 Contents
Multilevel Statistical Models: 3 rd edition, 2003 Contents Preface Acknowledgements Notation Two and three level models. A general classification notation and diagram Glossary Chapter 1 An introduction
More informationCenter for Advanced Studies in Measurement and Assessment. CASMA Research Report. Coefficients and Indices in Generalizability Theory
Center for Advanced Studies in Measurement and Assessment CASMA Research Report Number 1 Coefficients and Indices in Generalizability Theory Robert L. Brennan August 2003 A revised version of a paper presented
More informationRANDOM INTERCEPT ITEM FACTOR ANALYSIS. IE Working Paper MK8-102-I 02 / 04 / Alberto Maydeu Olivares
RANDOM INTERCEPT ITEM FACTOR ANALYSIS IE Working Paper MK8-102-I 02 / 04 / 2003 Alberto Maydeu Olivares Instituto de Empresa Marketing Dept. C / María de Molina 11-15, 28006 Madrid España Alberto.Maydeu@ie.edu
More informationItem Response Theory and Computerized Adaptive Testing
Item Response Theory and Computerized Adaptive Testing Richard C. Gershon, PhD Department of Medical Social Sciences Feinberg School of Medicine Northwestern University gershon@northwestern.edu May 20,
More informationVersion 1.0: abc. General Certificate of Education. Mathematics MPC3 Pure Core 3. Mark Scheme examination - January series
Version.0: 008 abc General Certificate of Education Mathematics 660 MPC Pure Core Mark Scheme 008 eamination - January series Mark schemes are prepared by the Principal Eaminer and considered, together
More informationVersion 1.0. General Certificate of Education (A-level) June Mathematics MPC4. (Specification 6360) Pure Core 4. Final.
Version.0 General Certificate of Education (A-level) June 0 Mathematics MPC (Specification 660) Pure Core Final Mark Scheme Mark schemes are prepared by the Principal Eaminer and considered, together with
More informationVersion 1.0. General Certificate of Education (A-level) January Mathematics MPC2. (Specification 6360) Pure Core 2. Final.
Version 1.0 General Certificate of Education (A-level) January 01 Mathematics MPC (Specification 660) Pure Core Final Mar Scheme Mar schemes are prepared by the Principal Eaminer and considered, together
More informationAn Introduction to Path Analysis
An Introduction to Path Analysis PRE 905: Multivariate Analysis Lecture 10: April 15, 2014 PRE 905: Lecture 10 Path Analysis Today s Lecture Path analysis starting with multivariate regression then arriving
More informationGeneral Certificate of Education (A-level) January 2011 Physics A PHYA4 (Specification 2450) Unit 4: Fields and further mechanics Final Mark Scheme
Version 1.0 General Certificate of Education (A-level) January 2011 Physics A PHYA4 (Specification 2450) Unit 4: Fields and further mechanics Final Mark Scheme Mark schemes are prepared by the Principal
More informationAn Overview of Item Response Theory. Michael C. Edwards, PhD
An Overview of Item Response Theory Michael C. Edwards, PhD Overview General overview of psychometrics Reliability and validity Different models and approaches Item response theory (IRT) Conceptual framework
More informationVersion1.0. General Certificate of Education (A-level) January 2011 MM1B. Mathematics. (Specification 6360) Mechanics 1B.
Version1.0 General Certificate of Education (A-level) January 011 Mathematics MB (Specification 6360) Mechanics 1B Mark Scheme Mark schemes are prepared by the Principal Examiner and considered, together
More informationVersion 1.0. General Certificate of Education (A-level) June 2012 MFP3. Mathematics. (Specification 6360) Further Pure 3.
Version.0 General Certificate of Education (A-level) June 0 Mathematics MFP (Specification 660) Further Pure Mark Scheme Mark schemes are prepared by the Principal Examiner and considered, together with
More informationA-level PHYSICS (7408/1) Paper 1
A-level PHYSICS (7408/) Paper Mark scheme MARK SCHEME A-LEVEL PHYSICS PAPER 7408/ SPECIMEN Mark schemes are prepared by the Lead Assessment Writer and considered, together with the relevant questions,
More informationVersion 1.0. abc. General Certificate of Secondary Education. Mathematics Specification A. Paper 2 Higher. Mark Scheme
Version 1.0 abc General Certificate of Secondary Education Mathematics 4306 Specification A Paper 2 Higher Mark Scheme 2009 examination - June series Mark schemes are prepared by the Principal Examiner
More informationMark Scheme. Additional Science 4463 / Physics General Certificate of Secondary Education. PHY2H Unit Physics Examination June Series
Version.0: 9/07/0 General Certificate of Secondary Education Additional Science 4463 / Physics 445 Unit Physics Mark Scheme 0 Examination June Series Mark schemes are prepared by the Principal Examiner
More informationMANY- FACET MESA RASCH MEASUREMENT. John M. Linacre
MANY- FACET RASCH MEASUREMENT John M. Linacre MESA MANY-FACET RASCH MEASUREMENT John M. Linacre The University of Chicago MESA PRESS Chicago 1989, 1994 Many-Facet Rasch Measurement John M. Linacre Second
More informationGCE AS and A Level. Chemistry. AS exams 2009 onwards A2 exams 2010 onwards. Unit 6T: ISA Specimen mark scheme. Version 1.3
GCE AS and A Level Chemistry AS exams 2009 onwards A2 exams 2010 onwards Unit 6T: ISA Specimen mark scheme Version 1.3 Version 1.2: 01/08 abc General Certificate of Education Chemistry CHM6T Investigative
More informationVersion 1.0. General Certificate of Education (A-level) June Mathematics MM03. (Specification 6360) Mechanics 3. Final.
Version 1.0 General Certificate of Education (A-level) June 011 Mathematics MM03 (Specification 6360) Mechanics 3 Final Mark Scheme Mark schemes are prepared by the Principal Examiner and considered, together
More information