Methods for the Comparability of Scores

Similar documents
Nonparametric Bounds Analysis for NEAT Equating

A Nonparametric Bayesian Model for Multivariate Ordinal Data

A Fully Nonparametric Modeling Approach to. BNP Binary Regression

Nonparametric Bayesian modeling for dynamic ordinal regression relationships

Topic Models. Brandon Malone. February 20, Latent Dirichlet Allocation Success Stories Wrap-up

STAT 518 Intro Student Presentation

Gibbs Sampling in Latent Variable Models #1

Multilevel Statistical Models: 3 rd edition, 2003 Contents

Flexible Regression Modeling using Bayesian Nonparametric Mixtures

Nonparametric Bayesian Modeling for Multivariate Ordinal. Data

Incorporating unobserved heterogeneity in Weibull survival models: A Bayesian approach

Preliminaries The bootstrap Bias reduction Hypothesis tests Regression Confidence intervals Time series Final remark. Bootstrap inference

Bayesian Nonparametric Regression through Mixture Models

Testing Statistical Hypotheses

A NEW CLASS OF SKEW-NORMAL DISTRIBUTIONS

Contents. Part I: Fundamentals of Bayesian Inference 1

Stat 542: Item Response Theory Modeling Using The Extended Rank Likelihood

Bayesian Methods for Machine Learning

ECE521 week 3: 23/26 January 2017

Political Science 236 Hypothesis Testing: Review and Bootstrapping

ANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.

A Quadratic Curve Equating Method to Equate the First Three Moments in Equipercentile Equating

h=1 exp (X : J h=1 Even the direction of the e ect is not determined by jk. A simpler interpretation of j is given by the odds-ratio

Defining Predictive Probability Functions for Species Sampling Models

Nonparametric Bayesian Modeling for Multivariate Ordinal. Data

Preliminaries The bootstrap Bias reduction Hypothesis tests Regression Confidence intervals Time series Final remark. Bootstrap inference

CS839: Probabilistic Graphical Models. Lecture 7: Learning Fully Observed BNs. Theo Rekatsinas

Hybrid Copula Bayesian Networks

Stat 5101 Lecture Notes

Nonparametric Bayesian Methods (Gaussian Processes)

Appendix F. Computational Statistics Toolbox. The Computational Statistics Toolbox can be downloaded from:

Midwest Big Data Summer School: Introduction to Statistics. Kris De Brabanter

Bayesian Nonparametric Modeling for Multivariate Ordinal Regression

Gaussian Processes. Le Song. Machine Learning II: Advanced Topics CSE 8803ML, Spring 2012

Probabilistic Latent Semantic Analysis

Comparing latent inequality with ordinal health data

Research on Standard Errors of Equating Differences

A CONDITION TO OBTAIN THE SAME DECISION IN THE HOMOGENEITY TEST- ING PROBLEM FROM THE FREQUENTIST AND BAYESIAN POINT OF VIEW

Density Estimation: ML, MAP, Bayesian estimation

DEPARTMENT OF COMPUTER SCIENCE Autumn Semester MACHINE LEARNING AND ADAPTIVE INTELLIGENCE

Analysis of Multinomial Response Data: a Measure for Evaluating Knowledge Structures

Hypothesis testing: theory and methods

PMR Learning as Inference

COURSE INTRODUCTION. J. Elder CSE 6390/PSYC 6225 Computational Modeling of Visual Perception

Class Notes: Week 8. Probit versus Logit Link Functions and Count Data

Motivation Scale Mixutres of Normals Finite Gaussian Mixtures Skew-Normal Models. Mixture Models. Econ 690. Purdue University

Algorithmic Game Theory Introduction to Mechanism Design

STAT Section 2.1: Basic Inference. Basic Definitions

Clustering using Mixture Models

Math 494: Mathematical Statistics

Testing Statistical Hypotheses

BAYESIAN DECISION THEORY

Comparing latent inequality with ordinal health data

Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded

Non-Parametric Bayesian Inference for Controlled Branching Processes Through MCMC Methods

Multivariate Versus Multinomial Probit: When are Binary Decisions Made Separately also Jointly Optimal?

arxiv: v1 [stat.co] 26 May 2009

A model of skew item response theory

Causal Inference Basics

Bayesian Modeling of Conditional Distributions

Classification. Chapter Introduction. 6.2 The Bayes classifier

Bayesian inference for sample surveys. Roderick Little Module 2: Bayesian models for simple random samples

COMS 4771 Introduction to Machine Learning. Nakul Verma

Wageningen Summer School in Econometrics. The Bayesian Approach in Theory and Practice

Quantifying the Price of Uncertainty in Bayesian Models

19 : Bayesian Nonparametrics: The Indian Buffet Process. 1 Latent Variable Models and the Indian Buffet Process

Mathematical Formulation of Our Example

Preface Introduction to Statistics and Data Analysis Overview: Statistical Inference, Samples, Populations, and Experimental Design The Role of

MATHEMATICS AND STATISTICS

Bayesian estimation of the discrepancy with misspecified parametric models

Chapter 7. Confidence Sets Lecture 30: Pivotal quantities and confidence sets

Unsupervised machine learning

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

Order-q stochastic processes. Bayesian nonparametric applications

Hybrid Censoring; An Introduction 2

Outline. Binomial, Multinomial, Normal, Beta, Dirichlet. Posterior mean, MAP, credible interval, posterior distribution

Linear Classification

TABLE OF CONTENTS CHAPTER 1 COMBINATORIAL PROBABILITY 1

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Chapter 5. Chapter 5 sections

Logistic Regression and Generalized Linear Models

On A-distance and Relative A-distance

Log Gaussian Cox Processes. Chi Group Meeting February 23, 2016

Bayesian Approach 2. CSC412 Probabilistic Learning & Reasoning

A Comparison of Bivariate Smoothing Methods in Common-Item Equipercentile Equating

Part 2: One-parameter models

A Nonparametric Model-based Approach to Inference for Quantile Regression

Bayesian Nonparametric Regression for Diabetes Deaths

Bayesian GLMs and Metropolis-Hastings Algorithm

COMPOSITIONAL IDEAS IN THE BAYESIAN ANALYSIS OF CATEGORICAL DATA WITH APPLICATION TO DOSE FINDING CLINICAL TRIALS

Testing the Untestable Assumptions of the Chain and Poststratification Equating Methods for the NEAT Design

Algebra II. Algebra II Higher Mathematics Courses 77

9/12/17. Types of learning. Modeling data. Supervised learning: Classification. Supervised learning: Regression. Unsupervised learning: Clustering

A review of some semiparametric regression models with application to scoring

Simulation of Max Stable Processes

Computer Emulation With Density Estimation

PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 2: PROBABILITY DISTRIBUTIONS

Fast approximations for the Expected Value of Partial Perfect Information using R-INLA

Effectiveness of the hybrid Levine equipercentile and modified frequency estimation equating methods under the common-item nonequivalent groups design

Transcription:

Assessment and Development of new Statistical Methods for the Comparability of Scores 1 Pontificia Universidad Católica de Chile 2 Laboratorio Interdisciplinario de Estadística Social Advisor: Jorge González Co-Advisor: Fernando Quintana

Outline 1 2 New discrete-based equating methods Hybrid methods 3 Latent modelling approach 4

Importance of Equating It is common for measurement programs to produce di erent forms of a test that are intended to measure the same attribute.

Importance of Equating It is common for measurement programs to produce di erent forms of a test that are intended to measure the same attribute. Because test scores are used to make important decisions in various settings, it is needed to report scores in a fair and precise way.

Importance of Equating It is common for measurement programs to produce di erent forms of a test that are intended to measure the same attribute. Because test scores are used to make important decisions in various settings, it is needed to report scores in a fair and precise way. The main idea behind Equating is to treat scores as if they come from the same test.

Equating: statistical point of view Let consider two test forms X and Y. The scores X and Y are assumed to be random variables.

Equating: statistical point of view Let consider two test forms X and Y. The scores X and Y are assumed to be random variables. The scores of these two test forms are defined on score sample spaces X and Y respectively.

Equating: statistical point of view Let consider two test forms X and Y. The scores X and Y are assumed to be random variables. The scores of these two test forms are defined on score sample spaces X and Y respectively. In the context of educational measurement these spaces represent the scale of these scores.

Equating: statistical point of view Let consider two test forms X and Y. The scores X and Y are assumed to be random variables. The scores of these two test forms are defined on score sample spaces X and Y respectively. In the context of educational measurement these spaces represent the scale of these scores. Let X 1,...,X nx and Y 1,...,Y ny be the scores obtained on the test forms X and Y by n X and n Y examinees, respectively.

Equating: statistical point of view The statistical problem in equating is to establish the equating transformation.

Equating: statistical point of view The statistical problem in equating is to establish the equating transformation. González and Wiberg (2017) give a formal definition of the equating transformation: a mapping between these two score sample spaces i.e. ' : X! Y

Equating: statistical point of view The statistical problem in equating is to establish the equating transformation. González and Wiberg (2017) give a formal definition of the equating transformation: a mapping between these two score sample spaces i.e. ' : X! Y Thus, the equating transformation maps the scores on the scale of one test form into the scale of the other.

Some equating transformations Equipercentile Equating The most popular equating transformation was defined by Braun and Holland (1982). It is called the equipercentile function defined by: '(x) = F 1 Y (F X (x)) where F Y and F X are the CDF functions of Y and X, respectively.

Some equating transformations Equipercentile Equating It is important to highlight here that we are interested on equate scores which are usually integer i.e. X and Y are subsets of integer numbers.

Some equating transformations Equipercentile Equating It is important to highlight here that we are interested on equate scores which are usually integer i.e. X and Y are subsets of integer numbers. This represent and important drawback of this method: the inverse function F 1 Y is not well defined.

Some equating transformations Equipercentile Equating It is important to highlight here that we are interested on equate scores which are usually integer i.e. X and Y are subsets of integer numbers. This represent and important drawback of this method: the inverse function F 1 Y is not well defined. The continuization step is proposed in order to avoid this problem: linear interpolation, polynomial log-linear models and kernel equating.

Some equating transformations Linear Equating In this method, the equating transformation is defined by: '(x) = µ Y + Y (x µ X ) X where µ W and deviation of W. W are the expected value and the standard

Some equating transformations Linear Equating In this method, the equating transformation is defined by: '(x) = µ Y + Y (x µ X ) X where µ W and deviation of W. W are the expected value and the standard In practice, the parameters involved in this function are estimated as the mean and the standard deviation of each sample.

New discrete-based equating methods Hybrid methods Goals of this work To extend and propose new methods of test equating based on discrete distributions.

New discrete-based equating methods Hybrid methods Goals of this work To extend and propose new methods of test equating based on discrete distributions. Propose improved methods based on continuization tools.

New discrete-based equating methods Hybrid methods Goals of this work To extend and propose new methods of test equating based on discrete distributions. Propose improved methods based on continuization tools. To compare both approaches (discrete-based equating and continuization-based equating).

New discrete-based equating methods Hybrid methods Discrete-based equating methods Since the linear equating transformation does not need continuization, we propose to estimate it using both parametric and nonparametric models for the discrete score distributions.

New discrete-based equating methods Hybrid methods Discrete-based equating methods Since the linear equating transformation does not need continuization, we propose to estimate it using both parametric and nonparametric models for the discrete score distributions. We assume X F (x X ) (1) Y F (y Y ) (2)

New discrete-based equating methods Hybrid methods Discrete-based equating methods (a) Classical Parametric approach For F (x X )andf (y Y ), assume some parametric distributions that have been used to model discrete scores:

New discrete-based equating methods Hybrid methods Discrete-based equating methods (a) Classical Parametric approach For F (x X )andf (y Y ), assume some parametric distributions that have been used to model discrete scores: 1 Poisson-Binomial 2 Negative Binomial

New discrete-based equating methods Hybrid methods Discrete-based equating methods (a) Classical Parametric approach For F (x X )andf (y Y ), assume some parametric distributions that have been used to model discrete scores: 1 Poisson-Binomial 2 Negative Binomial For these models the mean and standard deviation are functions of X and Y.

New discrete-based equating methods Hybrid methods Discrete-based equating methods (a) Classical Parametric approach For F (x X )andf (y Y ), assume some parametric distributions that have been used to model discrete scores: 1 Poisson-Binomial 2 Negative Binomial For these models the mean and standard deviation are functions of X and Y. By getting estimations(mle) of X and Y,wecanplugthem in the linear equating transformation.

New discrete-based equating methods Hybrid methods Discrete-based equating methods (b) Bayesian Parametric approach Under the same model (1)-(2), assume some of the parametric distribution for discrete scores mentioned before.

New discrete-based equating methods Hybrid methods Discrete-based equating methods (b) Bayesian Parametric approach Under the same model (1)-(2), assume some of the parametric distribution for discrete scores mentioned before. Let consider a prior distribution for the parameters X and Y as follows: X p( X ) Y p( Y )

New discrete-based equating methods Hybrid methods Discrete-based equating methods (b) Bayesian Parametric approach Under the same model (1)-(2), assume some of the parametric distribution for discrete scores mentioned before. Let consider a prior distribution for the parameters X and Y as follows: X p( X ) Y p( Y ) Perform Bayesian inference to obtain estimates of the parameters and then compute the linear equating transformation.

New discrete-based equating methods Hybrid methods Discrete-based equating methods (c) Bayesian Nonparametric approach Assume the following model

New discrete-based equating methods Hybrid methods Discrete-based equating methods (c) Bayesian Nonparametric approach Assume the following model X F X F X F X P Y F Y F Y F Y P where P is a random probability measure.

New discrete-based equating methods Hybrid methods Discrete-based equating methods (c) Bayesian Nonparametric approach Assume the following model X F X F X F X P Y F Y F Y F Y P where P is a random probability measure. Use Bayesian nonparametric inference to estimate the score distributions. Using those estimations, obtain means and standard deviations to be plugged in the linear equating transformation.

New discrete-based equating methods Hybrid methods Improved continuization methods In order to preserve as much as possible the discrete nature of the score data, we propose some statistical tools that take advantages of some results on the continuous framework and relate them with the main characteristic of score data: discreteness.

New discrete-based equating methods Hybrid methods Improved continuization methods (a) Latent variable approach

New discrete-based equating methods Hybrid methods Improved continuization methods (a) Latent variable approach We assume that the scores are realizations of an underlying continuous latent process.

New discrete-based equating methods Hybrid methods Improved continuization methods (a) Latent variable approach We assume that the scores are realizations of an underlying continuous latent process. Based on a bayesian nonparametric model for the latent process, we can estimate the score distributions and thus the equipercentile equating transformation.

New discrete-based equating methods Hybrid methods Improved continuization methods (a) Latent variable approach We assume that the scores are realizations of an underlying continuous latent process. Based on a bayesian nonparametric model for the latent process, we can estimate the score distributions and thus the equipercentile equating transformation. Since there is a one-to-one relation between the discrete score values and the continuous latent process, we can obtain an equating in the discrete setting.

New discrete-based equating methods Hybrid methods Improved continuization methods (b) Testing continuization

New discrete-based equating methods Hybrid methods Improved continuization methods (b) Testing continuization We can consider the following result '(x) =Lin(x)+R(x) where Lin(x) does not need continuization, whereas R(x) does.

New discrete-based equating methods Hybrid methods Improved continuization methods (b) Testing continuization We can consider the following result '(x) =Lin(x)+R(x) where Lin(x) does not need continuization, whereas R(x) does. The idea is then look for ways to minimize the use of R(x) and/or to develop measures to decide to what extend is this function important in the estimation of '.

Latent modelling approach Ordinal random variables and latent variables Let W be an ordinal random variable with C categories named w 1,...,w C.

Latent modelling approach Ordinal random variables and latent variables Let W be an ordinal random variable with C categories named w 1,...,w C. One approach to model this kind of random variables is the latent modelling formulation.

Latent modelling approach Ordinal random variables and latent variables Let W be an ordinal random variable with C categories named w 1,...,w C. One approach to model this kind of random variables is the latent modelling formulation. Let Z be a continuous latent variable with distribution F Z.

Latent modelling approach Ordinal random variables and latent variables The relation between these variables is the following one:

Latent modelling approach Ordinal random variables and latent variables The relation between these variables is the following one: W = 8 >< >: w 1 if 0 < Z apple 1 w 2 if 1 < Z apple 2... w C if C 1 < Z apple C (3) where 1,..., C are called cuto s.

Latent modelling approach Ordinal random variables and latent variables From the previous relation it follows that for k =1,...,C:

Latent modelling approach Ordinal random variables and latent variables From the previous relation it follows that for k =1,...,C: W = w k, W = w k 1{Z 2 ( k 1, k])} (4)

Latent modelling approach Ordinal random variables and latent variables From the previous relation it follows that for k =1,...,C: W = w k, W = w k 1{Z 2 ( k 1, k])} (4) Then, the probability of W is specified by the probability of Z, foreachk =1...,C: P(W = w k ) = P(Z apple k ) P(Z apple k 1 ) (5)

Latent modelling approach Ordinal random variables and latent variables Then, in terms of Z, the distribution of W can be written as a multinomial distribution:

Latent modelling approach Ordinal random variables and latent variables Then, in terms of Z, the distribution of W can be written as a multinomial distribution: W Mult(1, C, p Z, ) (6) where p Z, = (p Z, 1,...,p Z, C ) (7) p Z, k = P(Z apple k ) P(Z apple k 1 ) k =1,...,C (8)

Latent modelling approach Bayesian nonparametric model approach Let W 1,...,W n be a random sample from W.

Latent modelling approach Bayesian nonparametric model approach Let W 1,...,W n be a random sample from W. For i =1,...,n, letz i be the latent variable associated to W i, from the relation (3)

Latent modelling approach Bayesian nonparametric model approach Let W 1,...,W n be a random sample from W. For i =1,...,n, letz i be the latent variable associated to W i, from the relation (3) Based on the proposal of Kottas et. al(2005), we propose a Bayesian nonparametric model for the latent model F Z defined by:

Latent modelling approach Bayesian nonparametric model approach Let W 1,...,W n be a random sample from W. For i =1,...,n, letz i be the latent variable associated to W i, from the relation (3) Based on the proposal of Kottas et. al(2005), we propose a Bayesian nonparametric model for the latent model F Z defined by: Z 1...,Z n F Z iid FZ (9) F Z G (10) where G is a random probability measure that leads on a continuous density: a DPM model.

Latent modelling approach Equating: latent modelling approach Let s consider the score X be an ordinal random variable with J categories defined by each value in the score sample space.

Latent modelling approach Equating: latent modelling approach Let s consider the score X be an ordinal random variable with J categories defined by each value in the score sample space. Let X 1,...,X nx a random sample of X.Usingthelatent modelling approach for ordinal variables, we will assume the following model:

Latent modelling approach Equating: latent modelling approach Let s consider the score X be an ordinal random variable with J categories defined by each value in the score sample space. Let X 1,...,X nx a random sample of X.Usingthelatent modelling approach for ordinal variables, we will assume the following model: X 1,...,X n Z 1,...,Z n Z 1...,Z n 1,..., n ind Mult(1, J, p z, ) (11) ind N(z i µ i, 2 i ) (12) 1,..., n G iid G (13) G PD(a, b, G 0 ( )) (14) p( ) (15)

Holland, P., & Rubin, D. (1982). Test equating. Academic Press, New York Gonzalez, J., & Wiberg, M. (2017). Applying Test Equating Methods Using R. Springer. Kottas, A., Müller, P., & Quintana, F. (2005). Nonparametric Bayesian modeling for multivariate ordinal data. Journal of Computational and Graphical Statistics, 14, 610 625.