Nonparametric Bounds Analysis for NEAT Equating

Similar documents
Methods for the Comparability of Scores

Testing the Untestable Assumptions of the Chain and Poststratification Equating Methods for the NEAT Design

Population Invariance of Score Linking: Theory and Applications to Advanced Placement Program Examinations

Research on Standard Errors of Equating Differences

Choice of Anchor Test in Equating

Equating of Subscores and Weighted Averages Under the NEAT Design

LOWELL WEEKLY JOURNAL

Reflected Brownian Motion

LOWELL WEEKLY JOURNAL. ^Jberxy and (Jmott Oao M d Ccmsparftble. %m >ai ruv GEEAT INDUSTRIES

A Unified Approach to Linear Equating for the Non-Equivalent Groups Design

Supplementary Information

Nonparametric Statistics Notes

Some Characterizations of Strongly Convex Functions in Inner Product Spaces

MEASUREMENT CENTER MIDE UC, CHILE CEPPE-UC, CHILE CENTER FOR OPERATIONS RESEARCH AND ECONOMETRICS CORE, UNIVERSIT É CATHOLIQUE DE LOUVAIN, BELGIUM

. ^e Traveler in taesnok. i the IHilty.-^ifStiiart. BbUaaoa aad WalL.""ras 'crossing a mountain»h ch w e are A«ply inteiwted. Add

Chained Versus Post-Stratification Equating in a Linear Context: An Evaluation Using Empirical Data

' '-'in.-i 1 'iritt in \ rrivfi pr' 1 p. ru

arxiv: v3 [math.ap] 28 Feb 2017

STAT 6385 Survey of Nonparametric Statistics. Order Statistics, EDF and Censoring

Chapter 3 - Solved Problems

Lecture Notes on Partial Dierential Equations (PDE)/ MaSc 221+MaSc 225

LOWELL WEEKLY JOURNAL.

On Monoids over which All Strongly Flat Right S-Acts Are Regular

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report

Package equate. February 15, 2013

IOWA EQUATING SUMMIT

Unconstrained minimization

An empirical distribution-based approximation of PROMETHEE II's net flow scores

Linking the Smarter Balanced Assessments to NWEA MAP Tests

Effect of Repeaters on Score Equating in a Large Scale Licensure Test. Sooyeon Kim Michael E. Walker ETS, Princeton, NJ

Derivatives in 2D. Outline. James K. Peterson. November 9, Derivatives in 2D! Chain Rule

DISCUSSION PAPER 2016/43

Educjatipnal. L a d ie s * COBNWALILI.S H IG H SCHOOL. I F O R G IR L S A n B k i n d e r g a r t e n.

A NEW CLASS OF SKEW-NORMAL DISTRIBUTIONS

Math 205b Homework 2 Solutions

T k b p M r will so ordered by Ike one who quits squuv. fe2m per year, or year, jo ad vaoce. Pleaie and THE ALTO SOLO

Q SON,' (ESTABLISHED 1879L

Probability Distribution And Density For Functional Random Variables

Plausible inference from the idea of estimation

Section 4. Test-Level Analyses

Equating Subscores Using Total Scaled Scores as an Anchor

Math 220a - Fall 2002 Homework 6 Solutions

WORLD MATH DAY ACTIVITY PACK. Ages worldmathsday.com UNICEF WORLD MATH DAY Lesson Plans Age 4 10 ACTIVITY RESOURCE

Haiwen (Henry) Chen and Paul Holland 1 ETS, Princeton, New Jersey

Data Repository of Paper: The role of subducted sediments in plate interface dynamics as constrained by Andean forearc (paleo)topography

A Comparison of Bivariate Smoothing Methods in Common-Item Equipercentile Equating

HEAGAN & CO., OPP. f>, L. & W. DEPOT, DOYER, N. J, OUR MOTTO! ould Iwv ia immediate vltlui. VEEY BEST NEW Creamery Butter 22c ib,

Statistical Equating Methods

Online Appendix to The Recovery of a Schema Mapping: Bringing Exchanged Data Back

A science-policy approach towards local adaptation planning: The case of Santiago de Chile

Effects of fishing, market price, and climate on two South American clam species

Alphonse Magnus, Université Catholique de Louvain.

Z score indicates how far a raw score deviates from the sample mean in SD units. score Mean % Lower Bound

II&Ij <Md Tmlaiiiiiit, aad once in Ihe y a w Teataa m i, the vmb thatalmta oot Uiaapirit world. into as abode or wotld by them- CooTBOtioa

ON THE NORMAL MOMENT DISTRIBUTIONS

Solutions for Math 225 Assignment #5 1

MA22S2 Lecture Notes on Fourier Series and Partial Differential Equations.

Markov operators acting on Polish spaces

UNIVERSITY OF MANITOBA

Structured Matrices and Solving Multivariate Polynomial Equations

Raquel Prado. Name: Department of Applied Mathematics and Statistics AMS-131. Spring 2010

GODFREY, KELLY ELIZABETH, Ph.D. A Comparison of Kernel Equating and IRT True Score Equating Methods. (2007) Directed by Dr. Terry A. Ackerman. 181 pp.

Math 311, Partial Differential Equations, Winter 2015, Midterm

SOLVABILITY OF A QUADRATIC INTEGRAL EQUATION OF FREDHOLM TYPE IN HÖLDER SPACES

Substitute Valuations with Divisible Goods

Sturm-Liouville Theory

ESTIMATING BIVARIATE TAIL

1 h 9 e $ s i n t h e o r y, a p p l i c a t i a n

Microeconomic Theory-I Washington State University Midterm Exam #1 - Answer key. Fall 2016

A filled function method for finding a global minimizer on global integer optimization

A Quadratic Curve Equating Method to Equate the First Three Moments in Equipercentile Equating

STA2603/205/1/2014 /2014. ry II. Tutorial letter 205/1/

CHAPTER 2 POLYNOMIALS KEY POINTS

There are two basic kinds of random variables continuous and discrete.

T E C H N I C A L R E P O R T IGNORABLE COMMON INFORMATION, NULL SETS AND BASUS FIRST THEOREM. SAN MARTIN, E., MOUCHART, M. and J.-M.

Convexity and Smoothness

Preliminary Statistics Lecture 2: Probability Theory (Outline) prelimsoas.webs.com

Simulation of point process using MCMC

Conic section. Ans: c. Ans: a. Ans: c. Episode:43 Faculty: Prof. A. NAGARAJ. 1. A circle

Yaglom-type limit theorems for branching Brownian motion with absorption. by Jason Schweinsberg University of California San Diego

Heisenberg Uncertainty Principle for the q-bessel Fourier transform

Use of Continuous Exponential Families to Link Forms via Anchor Tests

MS&E 321 Spring Stochastic Systems June 1, 2013 Prof. Peter W. Glynn Page 1 of 10

MATH 425, HOMEWORK 3 SOLUTIONS

Math 4263 Homework Set 1

March Algebra 2 Question 1. March Algebra 2 Question 1

Math 220A - Fall 2002 Homework 5 Solutions

Unbraced Column Verification Example. AISC Design Examples AISC 13 th Edition. ASDIP Steel is available for purchase online at

A Spatial Decomposition of Income Inequality in Chile

1) The line has a slope of ) The line passes through (2, 11) and. 6) r(x) = x + 4. From memory match each equation with its graph.

TUM 2016 Class 1 Statistical learning theory

Nonequivalent-Populations Design David J. Woodruff American College Testing Program

Estimating Bivariate Tail: a copula based approach

The Generalized Roy Model and Treatment Effects

The Multivariate Normal Distribution 1

PMR Learning as Inference

equate: An R Package for Observed-Score Linking and Equating

Tail Mutual Exclusivity and Tail- Var Lower Bounds

Formula Sheet for Optimal Control

Center for Advanced Studies in Measurement and Assessment. CASMA Research Report

Transcription:

Nonparametric Bounds Analysis for NEAT Equating Jorge González a, c a, b, c, Ernesto San Martín a Department of Statistics, P. Universidad Católica de Chile, Chile b The Economics School of Louvain, Université Catholique de Louvain, Belgium c LIES, Laboratorio Interdisciplinario de Estadística Social Seminario de Estadística Educacional Septiembre 1, 2017, PUC, Chile

Outline 1 The equating statistical problem 2 3 4

Model specification and general formulation Equating transformation General formulation of the equating problem Test forms X and Y are administrated to n x and n y test takers, respectively. The scores X and Y are assumed to be random variables. X and Y are defined on (score) sample spaces X and Y, respectively. x 1,..., x nx F X (x) and y 1,..., y ny F Y (y). Statistical problem Modeling the relationship between scores to make them comparable.

Model specification and general formulation Equating transformation General formulation of the equating problem Test forms X and Y are administrated to n x and n y test takers, respectively. The scores X and Y are assumed to be random variables. X and Y are defined on (score) sample spaces X and Y, respectively. x 1,..., x nx F X (x) and y 1,..., y ny F Y (y). Statistical problem Modeling the relationship between scores to make them comparable.

Model specification and general formulation Equating transformation General formulation: equating transformation Definition Let X and Y be two sample spaces. A function ϕ : X Y will be called an equating transformation. The equating transformation maps the scores on the scale of one test form into the scale of the other. The equating transformation is to be estimated by an estimator ϕ n based on samples x 1,..., x nx F X and y 1,..., y ny F Y, respectively.

Equipercentile transformation Model specification and general formulation Equating transformation Cumulative Distribution Function 0 1 p F(y) F(x) y x Scores y = ϕ(x) = F 1 Y (F X(x))

Equipercentile transformation Model specification and general formulation Equating transformation Cumulative Distribution Function 0 1 p F(y) F(x) y x Scores y = ϕ(x) = F 1 Y (F X(x))

design : the current view : an alternative view Population Sample X Y A P 1 Q 2 T = w P P + w Q Q f XT (x) =w P f XP (x) + w Q f XQ (x) f Y T (y) =w P f Y P (y) + w Q f Y Q (y). Additional assumptions are needed to estimate f XT (x) and f Y T (y)

design : the current view : an alternative view Population Sample X Y A P 1 Q 2 T = w P P + w Q Q f XT (x) =w P f XP (x) + w Q f XQ (x) f Y T (y) =w P f Y P (y) + w Q f Y Q (y). Additional assumptions are needed to estimate f XT (x) and f Y T (y)

design : the current view : an alternative view Population Sample X Y A P 1 Q 2 T = w P P + w Q Q f XT (x) =w P f XP (x) + w Q f XQ (x) f Y T (y) =w P f Y P (y) + w Q f Y Q (y). Additional assumptions are needed to estimate f XT (x) and f Y T (y)

design : the current view : an alternative view Population Sample X Y A P 1 Q 2 T = w P P + w Q Q f XT (x) =w P f XP (x) + w Q f XQ (x) f Y T (y) =w P f Y P (y) + w Q f Y Q (y). Additional assumptions are needed to estimate f XT (x) and f Y T (y)

design : the current view : an alternative view Most common assumption, f XP (x a) = f XQ (x a) and f Y P (y a) = f Y Q (y a) Using these assumptions we get f XT (x) =w P f XP (x) + w Q f XP (x)f AQ (a) f Y T (y) =w P f Y Q (y)f AP (a) + w Q f Y Q (y) a a and from here ϕ T (x) = F 1 Y T (F XT (x))

design : the current view : an alternative view Most common assumption, f XP (x a) = f XQ (x a) and f Y P (y a) = f Y Q (y a) Using these assumptions we get f XT (x) =w P f XP (x) + w Q f XP (x)f AQ (a) f Y T (y) =w P f Y Q (y)f AP (a) + w Q f Y Q (y) a a and from here ϕ T (x) = F 1 Y T (F XT (x))

: the current view : an alternative view Conditional score distributions with no assumptions Let Z denote the population group so that { 1, if test taker is administered X; Z = 0, if test taker is administered Y. Then, by the Law of Total Probability (LTP) P (X x A) =P (X x A, Z = 1)P (Z = 1 A)+ (1) P (X x A, Z = 0)P (Z = 0 A) P (X x A) is not identified.

: the current view : an alternative view Conditional score distributions with no assumptions Let Z denote the population group so that { 1, if test taker is administered X; Z = 0, if test taker is administered Y. Then, by the Law of Total Probability (LTP) P (X x A) =P (X x A, Z = 1)P (Z = 1 A)+ (1) P (X x A, Z = 0)P (Z = 0 A) P (X x A) is not identified.

: the current view : an alternative view Conditional score distributions with no assumptions Let Z denote the population group so that { 1, if test taker is administered X; Z = 0, if test taker is administered Y. Then, by the Law of Total Probability (LTP) P (X x A) =P (X x A, Z = 1)P (Z = 1 A)+ (1) P (X x A, Z = 0)P (Z = 0 A) P (X x A) is not identified.

: the current view : an alternative view Partially identified probability distributions From (1) and the fact that P (X x A, Z = 0) is bounded between 0 and 1, it follows that L x P (X x A) U x (2) L x = P (X x A, Z = 1)P (Z = 1 A) U x = P (X x A, Z = 1)P (Z = 1 A) + P (Z = 0 A) Analogously for Y we have L y P (Y y A) U y (3) L y = P (Y y A, Z = 0)P (Z = 0 A) U y = P (Y y A, Z = 0)P (Z = 0 A) + P (Z = 1 A)

: the current view : an alternative view Partially identified probability distributions From (1) and the fact that P (X x A, Z = 0) is bounded between 0 and 1, it follows that L x P (X x A) U x (2) L x = P (X x A, Z = 1)P (Z = 1 A) U x = P (X x A, Z = 1)P (Z = 1 A) + P (Z = 0 A) Analogously for Y we have L y P (Y y A) U y (3) L y = P (Y y A, Z = 0)P (Z = 0 A) U y = P (Y y A, Z = 0)P (Z = 0 A) + P (Z = 1 A)

: the current view : an alternative view Partially identified probability distributions From (1) and the fact that P (X x A, Z = 0) is bounded between 0 and 1, it follows that L x P (X x A) U x (2) L x = P (X x A, Z = 1)P (Z = 1 A) U x = P (X x A, Z = 1)P (Z = 1 A) + P (Z = 0 A) Analogously for Y we have L y P (Y y A) U y (3) L y = P (Y y A, Z = 0)P (Z = 0 A) U y = P (Y y A, Z = 0)P (Z = 0 A) + P (Z = 1 A)

Graphical illustration : the current view : an alternative view Bounds for P(X<x A=2) Pr 0.0 0.2 0.4 0.6 0.8 1.0 0 1 2 3 4 5 Score

Graphical illustration : the current view : an alternative view Bounds for P(X<x A=2) (black line) Bounds for P(Y<y A=2) (red line) Pr 0.0 0.2 0.4 0.6 0.8 1.0 0 1 2 3 4 5 Score

: the current view : an alternative view Target distributions with no assumptions Marginalizing over A, P (X x A) =P (X x A, Z = 1)P (Z = 1 A)+ (4) P (X x A, Z = 0)P (Z = 0 A) becomes P (X x) = P (X x Z = 1)P (Z = 1)+P (X x Z = 0)P (Z = 0) or F X (x) = w P F XP (x) + w Q F XQ (x) Problem: F XQ (x) = P (X x Z = 0) is still non identified!

: the current view : an alternative view Target distributions with no assumptions Marginalizing over A, P (X x A) =P (X x A, Z = 1)P (Z = 1 A)+ (4) P (X x A, Z = 0)P (Z = 0 A) becomes P (X x) = P (X x Z = 1)P (Z = 1)+P (X x Z = 0)P (Z = 0) or F X (x) = w P F XP (x) + w Q F XQ (x) Problem: F XQ (x) = P (X x Z = 0) is still non identified!

: the current view : an alternative view Target distributions with no assumptions Marginalizing over A, P (X x A) =P (X x A, Z = 1)P (Z = 1 A)+ (4) P (X x A, Z = 0)P (Z = 0 A) becomes P (X x) = P (X x Z = 1)P (Z = 1)+P (X x Z = 0)P (Z = 0) or F X (x) = w P F XP (x) + w Q F XQ (x) Problem: F XQ (x) = P (X x Z = 0) is still non identified!

Partially identified target distributions : the current view : an alternative view P (X x) can also be bounded. In fact, L x P (X x) U x, L x = P (X x Z = 1)P (Z = 1) U x = P (X x Z = 1)P (Z = 1) + P (Z = 0) How does F X (x) compare to F XT (x)?

Partially identified target distributions : the current view : an alternative view P (X x) can also be bounded. In fact, L x P (X x) U x, L x = P (X x Z = 1)P (Z = 1) U x = P (X x Z = 1)P (Z = 1) + P (Z = 0) How does F X (x) compare to F XT (x)?

Bounds illustrations : the current view : an alternative view Score F XT [L x, U x ] F Y T [L y, U y ] 0 0.100 [0.050 ; 0.550] 0.105 [0.040 ; 0.540] 1 0.250 [0.125 ; 0.625] 0.320 [0.140 ; 0.640] 2 0.500 [0.250 ; 0.750] 0.530 [0.250 ; 0.750] 3 0.750 [0.375 ; 0.875] 0.755 [0.375 ; 0.875] 4 0.900 [0.450 ; 0.950] 0.900 [0.450 ; 0.950] 5 1.000 [0.500 ; 1.000] 1.000 [0.500 ; 1.000]

Graphical illustration : the current view : an alternative view Pr 0.0 0.2 0.4 0.6 0.8 1.0 F_T(x) F_T(y) 0 1 2 3 4 5 Scores

Graphical illustration : the current view : an alternative view Pr 0.0 0.2 0.4 0.6 0.8 1.0 F_T(x) Bounds for F(x) F_T(y) Bounds for F(y) F_T(x) F_T(y) 0 1 2 3 4 5 Scores

Bounded quantiles : the current view : an alternative view Let α (0, 1) and q α (X). = inf{t : P (X t) > α}. Define the following quantiles: r α (X) =. inf{t : P (X t Z = 1)P (Z = 1) + P (Z = 0) > α} { } α P (Z = 0) = inf t : P (X t Z = 1) > P (Z = 1) and =q α (X Z = 1) s α (X) =. inf{t : P (X t Z = 1)P (Z = 1) > α} { } α = inf t : P (X t Z = 1) > P (Z = 1) =q α (X Z = 1)

Bounded quantiles : the current view : an alternative view In the NEAT design, the quantiles of the partially identified probabilities P (X t) and P (Y t) are also partially identified by the following intervals: (i) r α (X) q α (X) s α (X); (ii) r α (Y ) q α (Y ) s α (Y ). (5)

Bounded quantile equating : the current view : an alternative view Main idea α [r α (X), s α (X)] [r α (Y ), s α (Y )]... 0.1 [1 ; 3] [2 ; 4] 0.2 [2 ; 4] [3 ; 5]...

Informative bounded quantiles : the current view : an alternative view The lower and the upper bounds are not always informative: 0 α P (Z=0) P (Z=1) 1, for all α [P (Z = 0), 1]; 0 α P (Z=1) 1, for all α [0, P (Z = 1)].

Bounded equipercentile equating : the current view : an alternative view Given that then F 1 Y moreover and F XP (t) w P (X t) F XP (t) w + (1 w) (F X P (t) w) ϕ(t) F 1 (F X P (t) w + (1 w)) u Y (α) = inf{t : F Y (t) > F YQ (t) (1 w) > α} l Y (α) = inf{t : F YQ (t) (1 w) + w > F Y (t) > α} Y

Example Data from Kolen and Brennan (2014). Two 36-items test forms. Form X was administered to 1,655 examinees and form Y was administered to 1,638 examinees. Also, 12 out of the 36 items are common between both test forms (items 3, 6, 9, 12, 15, 18, 21, 24, 27, 30, 33, and 36). w P = 1 and w Q = 0

Graphical illustration Example Score distributions in P and Q 0.0 0.2 0.4 0.6 0.8 1.0 F_XP F_YQ 0 5 10 15 20 25 30 35 Scores

Graphical illustration Example Score distributions in P and Q 0.0 0.2 0.4 0.6 0.8 1.0 F_XP F_YQ F_XT (w1=1) F_YT (w2=0) 0 5 10 15 20 25 30 35 Scores

Graphical illustration Example Score distributions in P and Q 0.0 0.2 0.4 0.6 0.8 1.0 F_XP F_YQ F_XT (w1=0.7) F_YT (w2=0.3) 0 5 10 15 20 25 30 35 Scores

Graphical illustration Example Score distributions in P and Q 0.0 0.2 0.4 0.6 0.8 1.0 F_XP F_YQ F_XT (w1=0.5) F_YT (w2=0.5) 0 5 10 15 20 25 30 35 Scores

Graphical illustration Example Score distributions in P and Q 0.0 0.2 0.4 0.6 0.8 1.0 F_XP F_YQ F_XT (w1=0.3) F_YT (w2=0.7) 0 5 10 15 20 25 30 35 Scores

Graphical illustration Example Score distributions in P and Q 0.0 0.2 0.4 0.6 0.8 1.0 F_XP F_YQ F_XT (w1=0) F_YT (w2=1) 0 5 10 15 20 25 30 35 Scores

Graphical illustration Example Score distributions 0.0 0.2 0.4 0.6 0.8 1.0 F_XT (w1=1) F_YT (w2=0) 0 5 10 15 20 25 30 35 Scores

Graphical illustration Example Score distributions 0.0 0.2 0.4 0.6 0.8 1.0 F_XT (w1=1) F_YT (w2=0) Ux Lx Uy Ly 0 5 10 15 20 25 30 35 Scores

Graphical illustration Example Quantiles 5 10 15 20 25 30 35 r_x, s_x r_y, s_y 0.0 0.2 0.4 0.6 0.8 1.0 Alpha

Summary and discussion Future work The equality of conditional distributions assumption is actually an identification restriction. We offer an alternative to the conditional distributions restriction Partially identified probability distributions The derived bounds show that there is huge uncertainty about the probability distributions that are to be used for equating

Future work Future work Equiquantile vs equipercentile: which one? Incorporate additional information in order to obtain tighter bounds.

Future work Thank you for your attention! Jorge González Departament of Statistics, Faculty of Mathematics, Pontificia Universidad Católica de Chile Av. Vicuña Mackenna 4860, Macul, Santiago, Chile E-mail: jorge.gonzalez@mat.uc.cl www.mat.uc.cl/~jorge.gonzalez July 15-19 Santiago, Chile This research is supported by Grant Fondecyt 1150233