RR R E E A H R E P DENOTING THE BASE FREE MEASURE OF CHANGE. Samuel Messick. Educational Testing Service Princeton, New Jersey December 1980

Size: px
Start display at page:

Download "RR R E E A H R E P DENOTING THE BASE FREE MEASURE OF CHANGE. Samuel Messick. Educational Testing Service Princeton, New Jersey December 1980"

Transcription

1 RR R E 5 E A RC H R E P o R T DENOTING THE BASE FREE MEASURE OF CHANGE Samuel Messick Educational Testing Service Princeton, New Jersey December 1980

2 DENOTING THE BASE-FREE MEASURE OF CHANGE Samuel Messick Educational Testing Service Princeton, New Jersey December 1980

3 Educational Testing Service. All rights reserved.

4 ABSTRACT Denoting the Base-Free Measure of Change Samuel Messick Educational Testing Service Bond criticized the base-free measure of change proposed by Tucker, Damarin, and Messick by pointing to an incorrect derivation which is here viewed instead as a correct derivation entailing an inadequately specified tacit assumption. Bond's revision leads to estimates of the correlation between initial position and change which are negatively biased by correlated errors, whereas the original approach, with the tacit assumption properly denoted, leads to unbiased values. Key words: base-free measure of change, difference scores, change scores, measuring change.

5 Denoting the Base-Free Measure of Change Samuel Messick Educational Testing Service Bond (1979) pointed out a presumably incorrect derivation in the Tucker, Damarin, and Messick (1966) presentation of a base-free measure of change. As a consequence, Bond argued that properties attributed to a (the coefficient for the regression of true scores from the second testing on true scores from the first testing) were instead properties of b, the corresponding regression coefficient for observed scores. At issue was the correlation between initial position () and change (d = XZ-XI), with Tucker et al. (1966) claiming that P d was positive when a > 1, was zero when a = 1, and was negative when a < I and Bond insisting that these cut-off points were determined by b rather than a. In reply, Tucker (1979) emphasized that the focus should have been on the correlation between true initial position (tl) and true change (6 = tz-tl) all along, whereby the interpretation returns to a comparison of a with unity since (1) Equation (1) is algebraically equivalent to the formula provided by Zieve (1940) for the correlation between true initial position and the true difference score, which also appears as equation (62) in Tucker et al. (1966), Zieve's formula explicitly corrects P d for correlated errors as well as for attenuation. The derivation error in question revolves around whether P xlxl in the formula for P d represents the reliability of, as Tucker et al. tacitly

6 -2- assumed, or the self-correlation of (i.e., unity) as Bond properly asserted. In the absence of further specifications, Bond was correct. But once the nature of the Tucker et al. tacit assumption is explicitly denoted, we see that a remains a proper benchmark regardless of whether the observed correlation or the true correlation is considered. Using the Tucker et al. (1966) notation which was also followed by Bond (1979), as well as standard relationships from classical test theory, the basefree measure of change (y) is defined as a true independent change score: (2) (3) a = P a xl x2 X2 p a xlxl xl b Formula (61) in Tucker et al. (1966) and formula (14) in Bond (1979) give the correlation between an observed difference score and any other variable k, (4) PX2kaX2 - Pka ad Now t instead of setting k = xl as Tucker et a1. did originally, leading Bond to question whether p xlxl should not properly be unity and hence (5) (b-d) o xl let k = xl, a parallel form of xl' The use of parallel forms of the pretest, one to determine the difference score and one to correlate with the difference score, eliminates the negative bias in P d produced by correlated errors (Thorndike, 1966), Equation (4) then becomes, with inserts from equation (3),

7 -3- (6) Equation (6) is algebraically equivalent to equation (63) in Tucker et al. (1966), which gives the correlation between observed initial position and the observed difference score corrected for correlated errors. It is clear from equation (6) that PX'd is positive when a > 1, is zero when a = 1, and is negative when a < 1. I Furthermore, if both sides of equation (6) are divided by (PxlxI)~(Pdd')~,where P dd, is the reliability of the observed difference score, then (7) = = which is consistent with Tucker's (1979) formulation. Thus, P t l o is Pxid corrected for attenuation in both xi and d. As previously noted, it is clear from equation (6) that P 'd--the correlaxl tion between observed initial position and observed change uncontaminated by correlated errors--is positive when a > 1, is zero when a = 1, and is negative when a < 1. It may be helpful in explicating these relationships to consider a as the product of two ratios (8) The first ratio is typically less than unity and will usually become smaller and smaller as the time interval between the first and second testing increases. Therefore, a > 1 and P, is positiv~ only when xld

8 -4- (9) (J x2 This ~ondition of (J becoming increasingly larger than (J as the testing inx2 xl terval increases corresponds to a growth model known as the fan-spread hypothesis (Bryk & Weisberg. 1977). Further. a = 1 and Px'd 1 o when (10) ---= This condition corresponds to a growth model known as the overlap hypothesis (Anderson. 1939). where XL consists of everything present at xl plus an independent increment uncorrelated with xl' In the overlap model of growth. the squared correlation coefficient (corrected for attenuation) between test scores at two points in time is sometimes interpreted as indicating the "percent of the elements" in the second measurement that had been attained by the time of the first measurement (Anderson p. 365). In this view of longitudinal data. the proportion of common elements or the amount of overlap in two measurements taken at different points in time is indexed not only by the squared unattenuated correlation between the two tests or by the ratio of the two test variances as in equation (10), but by the ratio of the two test means (Bloom. 1964). Such reasoning enabled Bloom (1964). after reviewing empirical evidence that P d averaged about zero for mea sures of intelligence (at least through age seven). to move from statements about variance to statements about amount--that is. instead of stating that 50 percent of the variance of adult intelligence is accounted for by age four. he claimed that "about 50% of the development takes place between conception and age 4" (P. 88).

9 -5- This is a strong conclusion, but the overlap model of growth undergirding it implies the very special condition of a = 1. Finally, a < land p 'd is negative when xl (11) a x2 which occurs so frequently in situations having test ceiling effects or in systems having natural boundary conditions, as in the measurement of psychophysiological responses, that its corresponding growth model has been called the law of initial values (Lacey & Lacey, 1962). These differences in possible conceptions of the nature of growth occurring in a particular instance hinge on the values of a or.p 'd--not, as Bond (1979) averred. on b or p d. These latter values incorporate correlated errors and hence are negatively biased. It can be seen from equation (3) that the difference between using a or b to specify analytically the conditions under which the correlation between initial position and change will be positive. negative, or zero becomes increasingly important to the extent that the initial measure is fallible. Since unreliability of measurement is a pervasive problem during periods of rapid growth as in early childhood, the distinction between a and b is especially germane there. but it is also important in any investigation of the correlates of change using fallible measures (Tucker et al., 1966).

10 REFERENCES Anderson, J. E. The limitation of infant and preschool tests in the measurement of intelligence. Journal of Psychology, 1939, ~, Bloom, B. S. Stability and change in human characteristics. New York: Wiley, Bond, L. On the base-free measure of change proposed by Tucker, Damarin, and Messick. Psychometrika, 1979, 44, Bryk, A. S. & Weisberg, H. I. Use of the nonequiva1ent control group design when subjects are growing. Psychological Bulletin, 1977, 84, Lacey, J. I. & Lacey, B. C. The law of initial value in the longitudinal study of autonomic constitution: Reproducibility of autonomic responses and response patterns over a four year interval. In W. M. Wolf (Ed.), Rythmic functions in the living system. Annals of the New York Academy of Sciences, Thorndike, R. L. Intellectual status and intellectual growth. Journal of Educational Psycho1o~, 1966,22, Tucker, L. R. Comment on a note on a base-free measure of change. Psychometrika, 1979, 44, 357. Tucker, L. R., Damarin, F., & Messick S. A base-free measure of change. Psychometrika, 1966, 1!, Zieve, L. Note on the correlation of initial scores with gains. Journal of Educational Psychology, 1940, 1!,

Estimating Operational Validity Under Incidental Range Restriction: Some Important but Neglected Issues

Estimating Operational Validity Under Incidental Range Restriction: Some Important but Neglected Issues A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to Practical Assessment, Research & Evaluation. Permission is granted to distribute

More information

PROBABILITIES OF MISCLASSIFICATION IN DISCRIMINATORY ANALYSIS. M. Clemens Johnson

PROBABILITIES OF MISCLASSIFICATION IN DISCRIMINATORY ANALYSIS. M. Clemens Johnson RB-55-22 ~ [ s [ B A U R L t L Ii [ T I N PROBABILITIES OF MISCLASSIFICATION IN DISCRIMINATORY ANALYSIS M. Clemens Johnson This Bulletin is a draft for interoffice circulation. Corrections and suggestions

More information

CORRELATIONS ~ PARTIAL REGRESSION COEFFICIENTS (GROWTH STUDY PAPER #29) and. Charles E. Werts

CORRELATIONS ~ PARTIAL REGRESSION COEFFICIENTS (GROWTH STUDY PAPER #29) and. Charles E. Werts RB-69-6 ASSUMPTIONS IN MAKING CAUSAL INFERENCES FROM PART CORRELATIONS ~ PARTIAL CORRELATIONS AND PARTIAL REGRESSION COEFFICIENTS (GROWTH STUDY PAPER #29) Robert L. Linn and Charles E. Werts This Bulletin

More information

VARIABILITY OF KUDER-RICHARDSON FOm~A 20 RELIABILITY ESTIMATES. T. Anne Cleary University of Wisconsin and Robert L. Linn Educational Testing Service

VARIABILITY OF KUDER-RICHARDSON FOm~A 20 RELIABILITY ESTIMATES. T. Anne Cleary University of Wisconsin and Robert L. Linn Educational Testing Service ~ E S [ B A U ~ L t L H E TI VARIABILITY OF KUDER-RICHARDSON FOm~A 20 RELIABILITY ESTIMATES RB-68-7 N T. Anne Cleary University of Wisconsin and Robert L. Linn Educational Testing Service This Bulletin

More information

A TEST OF SIGNIFICANCE OF DIFFERENCE BETWEEN CORRELATED PROPORTIONS. John A. Keats

A TEST OF SIGNIFICANCE OF DIFFERENCE BETWEEN CORRELATED PROPORTIONS. John A. Keats ~ E S E B A U ~ L t L I-i E TI RB-55-20 A TEST OF SIGNIFICANCE OF DIFFERENCE BETWEEN CORRELATED PROPORTIONS John A. Keats N This Bulletin is a draft for interoffice circulation. Corrections and suggestions

More information

Nonequivalent-Populations Design David J. Woodruff American College Testing Program

Nonequivalent-Populations Design David J. Woodruff American College Testing Program A Comparison of Three Linear Equating Methods for the Common-Item Nonequivalent-Populations Design David J. Woodruff American College Testing Program Three linear equating methods for the common-item nonequivalent-populations

More information

Reconciling factor-based and composite-based approaches to structural equation modeling

Reconciling factor-based and composite-based approaches to structural equation modeling Reconciling factor-based and composite-based approaches to structural equation modeling Edward E. Rigdon (erigdon@gsu.edu) Modern Modeling Methods Conference May 20, 2015 Thesis: Arguments for factor-based

More information

Instrumental Variables

Instrumental Variables Instrumental Variables James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Instrumental Variables 1 / 10 Instrumental Variables

More information

Chapter 4: Factor Analysis

Chapter 4: Factor Analysis Chapter 4: Factor Analysis In many studies, we may not be able to measure directly the variables of interest. We can merely collect data on other variables which may be related to the variables of interest.

More information

An extended summary of the NCGR/Berkeley Double-Blind Test of Astrology undertaken by Shawn Carlson and published in 1985

An extended summary of the NCGR/Berkeley Double-Blind Test of Astrology undertaken by Shawn Carlson and published in 1985 From: http://www.astrodivination.com/moa/ncgrberk.htm An extended summary of the NCGR/Berkeley Double-Blind Test of Astrology undertaken by Shawn Carlson and published in 1985 Introduction Under the heading

More information

Equating Subscores Using Total Scaled Scores as an Anchor

Equating Subscores Using Total Scaled Scores as an Anchor Research Report ETS RR 11-07 Equating Subscores Using Total Scaled Scores as an Anchor Gautam Puhan Longjuan Liang March 2011 Equating Subscores Using Total Scaled Scores as an Anchor Gautam Puhan and

More information

Instrumental Variables

Instrumental Variables James H. Steiger Department of Psychology and Human Development Vanderbilt University Regression Modeling, 2009 1 Introduction 2 3 4 Instrumental variables allow us to get a better estimate of a causal

More information

PROGRAM STATISTICS RESEARCH

PROGRAM STATISTICS RESEARCH An Alternate Definition of the ETS Delta Scale of Item Difficulty Paul W. Holland and Dorothy T. Thayer @) PROGRAM STATISTICS RESEARCH TECHNICAL REPORT NO. 85..64 EDUCATIONAL TESTING SERVICE PRINCETON,

More information

Design of Experiments

Design of Experiments Design of Experiments DOE Radu T. Trîmbiţaş April 20, 2016 1 Introduction The Elements Affecting the Information in a Sample Generally, the design of experiments (DOE) is a very broad subject concerned

More information

y response variable x 1, x 2,, x k -- a set of explanatory variables

y response variable x 1, x 2,, x k -- a set of explanatory variables 11. Multiple Regression and Correlation y response variable x 1, x 2,, x k -- a set of explanatory variables In this chapter, all variables are assumed to be quantitative. Chapters 12-14 show how to incorporate

More information

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Many economic models involve endogeneity: that is, a theoretical relationship does not fit

More information

Clarifying the concepts of reliability, validity and generalizability

Clarifying the concepts of reliability, validity and generalizability Clarifying the concepts of reliability, validity and generalizability Maria Valaste 1 and Lauri Tarkkonen 2 1 University of Helsinki, Finland e-mail: maria.valaste@helsinki.fi 2 University of Helsinki,

More information

Ignoring the matching variables in cohort studies - when is it valid, and why?

Ignoring the matching variables in cohort studies - when is it valid, and why? Ignoring the matching variables in cohort studies - when is it valid, and why? Arvid Sjölander Abstract In observational studies of the effect of an exposure on an outcome, the exposure-outcome association

More information

Myths about the Analysis of Change

Myths about the Analysis of Change James H. Steiger Department of Psychology and Human Development Vanderbilt University, 2010 1 Introduction 2 Limitations of the Two-Observation Study 3 Myth 2: The Difference Score is Intrinsically Unreliable

More information

A bias improved estimator of the concordance correlation coefficient

A bias improved estimator of the concordance correlation coefficient The 22 nd Annual Meeting in Mathematics (AMM 217) Department of Mathematics, Faculty of Science Chiang Mai University, Chiang Mai, Thailand A bias improved estimator of the concordance correlation coefficient

More information

IEOR 165 Lecture 7 1 Bias-Variance Tradeoff

IEOR 165 Lecture 7 1 Bias-Variance Tradeoff IEOR 165 Lecture 7 Bias-Variance Tradeoff 1 Bias-Variance Tradeoff Consider the case of parametric regression with β R, and suppose we would like to analyze the error of the estimate ˆβ in comparison to

More information

EXAMINATION: QUANTITATIVE EMPIRICAL METHODS. Yale University. Department of Political Science

EXAMINATION: QUANTITATIVE EMPIRICAL METHODS. Yale University. Department of Political Science EXAMINATION: QUANTITATIVE EMPIRICAL METHODS Yale University Department of Political Science January 2014 You have seven hours (and fifteen minutes) to complete the exam. You can use the points assigned

More information

Statistical Power and Autocorrelation for Short, Comparative Interrupted Time Series Designs with Aggregate Data

Statistical Power and Autocorrelation for Short, Comparative Interrupted Time Series Designs with Aggregate Data Statistical Power and Autocorrelation for Short, Comparative Interrupted Time Series Designs with Aggregate Data Andrew Swanlund, American Institutes for Research (aswanlund@air.org) Kelly Hallberg, University

More information

6.435, System Identification

6.435, System Identification System Identification 6.435 SET 3 Nonparametric Identification Munther A. Dahleh 1 Nonparametric Methods for System ID Time domain methods Impulse response Step response Correlation analysis / time Frequency

More information

UNIT 3 CONCEPT OF DISPERSION

UNIT 3 CONCEPT OF DISPERSION UNIT 3 CONCEPT OF DISPERSION Structure 3.0 Introduction 3.1 Objectives 3.2 Concept of Dispersion 3.2.1 Functions of Dispersion 3.2.2 Measures of Dispersion 3.2.3 Meaning of Dispersion 3.2.4 Absolute Dispersion

More information

2/26/2017. This is similar to canonical correlation in some ways. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2

2/26/2017. This is similar to canonical correlation in some ways. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 What is factor analysis? What are factors? Representing factors Graphs and equations Extracting factors Methods and criteria Interpreting

More information

Chapter 8. Models with Structural and Measurement Components. Overview. Characteristics of SR models. Analysis of SR models. Estimation of SR models

Chapter 8. Models with Structural and Measurement Components. Overview. Characteristics of SR models. Analysis of SR models. Estimation of SR models Chapter 8 Models with Structural and Measurement Components Good people are good because they've come to wisdom through failure. Overview William Saroyan Characteristics of SR models Estimation of SR models

More information

An Approximate Test for Homogeneity of Correlated Correlation Coefficients

An Approximate Test for Homogeneity of Correlated Correlation Coefficients Quality & Quantity 37: 99 110, 2003. 2003 Kluwer Academic Publishers. Printed in the Netherlands. 99 Research Note An Approximate Test for Homogeneity of Correlated Correlation Coefficients TRIVELLORE

More information

Point-to-point response to reviewers comments

Point-to-point response to reviewers comments Point-to-point response to reviewers comments Reviewer #1 1) The authors analyze only one millennial reconstruction (Jones, 1998) with the argument that it is the only one available. This is incorrect.

More information

EMERGING MARKETS - Lecture 2: Methodology refresher

EMERGING MARKETS - Lecture 2: Methodology refresher EMERGING MARKETS - Lecture 2: Methodology refresher Maria Perrotta April 4, 2013 SITE http://www.hhs.se/site/pages/default.aspx My contact: maria.perrotta@hhs.se Aim of this class There are many different

More information

Causal Inference with Big Data Sets

Causal Inference with Big Data Sets Causal Inference with Big Data Sets Marcelo Coca Perraillon University of Colorado AMC November 2016 1 / 1 Outlone Outline Big data Causal inference in economics and statistics Regression discontinuity

More information

The Use of Structural Equation Models in Interpreting Regression Equations Including Suppressor and Enhancer Variables

The Use of Structural Equation Models in Interpreting Regression Equations Including Suppressor and Enhancer Variables The Use of Structural Equation Models in Interpreting Regression Equations Including Suppressor and Enhancer Variables Robert M. McFatter University of Denver It is shown that the usual interpretation

More information

Linear Equating Models for the Common-item Nonequivalent-Populations Design Michael J. Kolen and Robert L. Brennan American College Testing Program

Linear Equating Models for the Common-item Nonequivalent-Populations Design Michael J. Kolen and Robert L. Brennan American College Testing Program Linear Equating Models for the Common-item Nonequivalent-Populations Design Michael J. Kolen Robert L. Brennan American College Testing Program The Tucker Levine equally reliable linear meth- in the common-item

More information

Lecture Notes on Measurement Error

Lecture Notes on Measurement Error Steve Pischke Spring 2000 Lecture Notes on Measurement Error These notes summarize a variety of simple results on measurement error which I nd useful. They also provide some references where more complete

More information

Scaling of Variance Space

Scaling of Variance Space ... it Dominance, Information, and Hierarchical Scaling of Variance Space David J. Krus and Robert W. Ceurvorst Arizona State University A method for computation of dominance relations and for construction

More information

Consequences of measurement error. Psychology 588: Covariance structure and factor models

Consequences of measurement error. Psychology 588: Covariance structure and factor models Consequences of measurement error Psychology 588: Covariance structure and factor models Scaling indeterminacy of latent variables Scale of a latent variable is arbitrary and determined by a convention

More information

A Rejoinder to Mackintosh and some Remarks on the. Concept of General Intelligence

A Rejoinder to Mackintosh and some Remarks on the. Concept of General Intelligence A Rejoinder to Mackintosh and some Remarks on the Concept of General Intelligence Moritz Heene Department of Psychology, Ludwig Maximilian University, Munich, Germany. 1 Abstract In 2000 Nicholas J. Mackintosh

More information

Of Nickell Bias, Cross-Sectional Dependence, and Their Cures: Reply

Of Nickell Bias, Cross-Sectional Dependence, and Their Cures: Reply Of Nickell Bias, Cross-Sectional Dependence, and Their Cures: Reply Khusrav Gaibulloev Department of Economics American University of Sharjah Sharjah, UAE kgaibulloev@aus.edu and Todd Sandler Donggyu Sul

More information

Observed-Score "Equatings"

Observed-Score Equatings Comparison of IRT True-Score and Equipercentile Observed-Score "Equatings" Frederic M. Lord and Marilyn S. Wingersky Educational Testing Service Two methods of equating tests are compared, one using true

More information

Math Review. for the Quantitative Reasoning measure of the GRE General Test

Math Review. for the Quantitative Reasoning measure of the GRE General Test Math Review for the Quantitative Reasoning measure of the GRE General Test www.ets.org Overview This Math Review will familiarize you with the mathematical skills and concepts that are important for solving

More information

FinQuiz Notes

FinQuiz Notes Reading 10 Multiple Regression and Issues in Regression Analysis 2. MULTIPLE LINEAR REGRESSION Multiple linear regression is a method used to model the linear relationship between a dependent variable

More information

Use of e-rater in Scoring of the TOEFL ibt Writing Test

Use of e-rater in Scoring of the TOEFL ibt Writing Test Research Report ETS RR 11-25 Use of e-rater in Scoring of the TOEFL ibt Writing Test Shelby J. Haberman June 2011 Use of e-rater in Scoring of the TOEFL ibt Writing Test Shelby J. Haberman ETS, Princeton,

More information

Formula for the t-test

Formula for the t-test Formula for the t-test: How the t-test Relates to the Distribution of the Data for the Groups Formula for the t-test: Formula for the Standard Error of the Difference Between the Means Formula for the

More information

Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems

Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems Functional form misspecification We may have a model that is correctly specified, in terms of including

More information

Analyzing Group by Time Effects in Longitudinal Two-Group Randomized Trial Designs With Missing Data

Analyzing Group by Time Effects in Longitudinal Two-Group Randomized Trial Designs With Missing Data Journal of Modern Applied Statistical Methods Volume Issue 1 Article 6 5-1-003 Analyzing Group by Time Effects in Longitudinal Two-Group Randomized Trial Designs With Missing Data James Algina University

More information

General structural model Part 1: Covariance structure and identification. Psychology 588: Covariance structure and factor models

General structural model Part 1: Covariance structure and identification. Psychology 588: Covariance structure and factor models General structural model Part 1: Covariance structure and identification Psychology 588: Covariance structure and factor models Latent variables 2 Interchangeably used: constructs --- substantively defined

More information

A Pedagogic Demonstration of Attenuation of Correlation Due to Measurement Error

A Pedagogic Demonstration of Attenuation of Correlation Due to Measurement Error Spreadsheets in Education (ejsie) Volume 7 Issue 1 Article 4 1-14-2014 A Pedagogic Demonstration of Attenuation of Correlation Due to Measurement Error Christopher R. Fisher Miami University, fisherc2@miamioh.edu

More information

Lesson 6: Reliability

Lesson 6: Reliability Lesson 6: Reliability Patrícia Martinková Department of Statistical Modelling Institute of Computer Science, Czech Academy of Sciences NMST 570, December 12, 2017 Dec 19, 2017 1/35 Contents 1. Introduction

More information

M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1

M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1 Math 66/566 - Midterm Solutions NOTE: These solutions are for both the 66 and 566 exam. The problems are the same until questions and 5. 1. The moment generating function of a random variable X is M(t)

More information

Advanced Quantitative Research Methodology Lecture Notes: January Ecological 28, 2012 Inference1 / 38

Advanced Quantitative Research Methodology Lecture Notes: January Ecological 28, 2012 Inference1 / 38 Advanced Quantitative Research Methodology Lecture Notes: Ecological Inference 1 Gary King http://gking.harvard.edu January 28, 2012 1 c Copyright 2008 Gary King, All Rights Reserved. Gary King http://gking.harvard.edu

More information

Regularized Common Factor Analysis

Regularized Common Factor Analysis New Trends in Psychometrics 1 Regularized Common Factor Analysis Sunho Jung 1 and Yoshio Takane 1 (1) Department of Psychology, McGill University, 1205 Dr. Penfield Avenue, Montreal, QC, H3A 1B1, Canada

More information

Model II (or random effects) one-way ANOVA:

Model II (or random effects) one-way ANOVA: Model II (or random effects) one-way ANOVA: As noted earlier, if we have a random effects model, the treatments are chosen from a larger population of treatments; we wish to generalize to this larger population.

More information

MATH 1150 Chapter 2 Notation and Terminology

MATH 1150 Chapter 2 Notation and Terminology MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the

More information

AN INDEX OF THE DISCRIMINATING POWER OF A TEST. Richard Levine and Frederic M. Lord

AN INDEX OF THE DISCRIMINATING POWER OF A TEST. Richard Levine and Frederic M. Lord RB-58-13 ~ [ s [ B A U ~ L t L I-t [ T I N AN INDEX OF THE DISCRIMINATING POWER OF A TEST AT DIFFERENT PARTS OF THE SCORE RANGE Richard Levine and Frederic M. Lord This Bulletin is a draft for interoffice

More information

1 The Classic Bivariate Least Squares Model

1 The Classic Bivariate Least Squares Model Review of Bivariate Linear Regression Contents 1 The Classic Bivariate Least Squares Model 1 1.1 The Setup............................... 1 1.2 An Example Predicting Kids IQ................. 1 2 Evaluating

More information

A Note on the Choice of an Anchor Test in Equating

A Note on the Choice of an Anchor Test in Equating Research Report ETS RR 12-14 A Note on the Choice of an Anchor Test in Equating Sandip Sinharay Shelby Haberman Paul Holland Charles Lewis September 2012 ETS Research Report Series EIGNOR EXECUTIVE EDITOR

More information

Shelby J. Haberman. Hongwen Guo. Jinghua Liu. Neil J. Dorans. ETS, Princeton, NJ

Shelby J. Haberman. Hongwen Guo. Jinghua Liu. Neil J. Dorans. ETS, Princeton, NJ Consistency of SAT I: Reasoning Test Score Conversions Shelby J. Haberman Hongwen Guo Jinghua Liu Neil J. Dorans ETS, Princeton, NJ Paper presented at the annual meeting of the American Educational Research

More information

Regression Models - Introduction

Regression Models - Introduction Regression Models - Introduction In regression models there are two types of variables that are studied: A dependent variable, Y, also called response variable. It is modeled as random. An independent

More information

Simple Linear Regression Model & Introduction to. OLS Estimation

Simple Linear Regression Model & Introduction to. OLS Estimation Inside ECOOMICS Introduction to Econometrics Simple Linear Regression Model & Introduction to Introduction OLS Estimation We are interested in a model that explains a variable y in terms of other variables

More information

Mixed- Model Analysis of Variance. Sohad Murrar & Markus Brauer. University of Wisconsin- Madison. Target Word Count: Actual Word Count: 2755

Mixed- Model Analysis of Variance. Sohad Murrar & Markus Brauer. University of Wisconsin- Madison. Target Word Count: Actual Word Count: 2755 Mixed- Model Analysis of Variance Sohad Murrar & Markus Brauer University of Wisconsin- Madison The SAGE Encyclopedia of Educational Research, Measurement and Evaluation Target Word Count: 3000 - Actual

More information

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data

Estimation of Conditional Kendall s Tau for Bivariate Interval Censored Data Communications for Statistical Applications and Methods 2015, Vol. 22, No. 6, 599 604 DOI: http://dx.doi.org/10.5351/csam.2015.22.6.599 Print ISSN 2287-7843 / Online ISSN 2383-4757 Estimation of Conditional

More information

A Better Way to Do R&R Studies

A Better Way to Do R&R Studies The Evaluating the Measurement Process Approach Last month s column looked at how to fix some of the Problems with Gauge R&R Studies. This month I will show you how to learn more from your gauge R&R data

More information

LINEAR COMBINATION OF ESTIMATORS IN PROBABILITY PROPORTIONAL TO SIZES SAMPLING TO ESTIMATE THE POPULATION MEAN AND ITS ROBUSTNESS TO OPTIMUM VALUE

LINEAR COMBINATION OF ESTIMATORS IN PROBABILITY PROPORTIONAL TO SIZES SAMPLING TO ESTIMATE THE POPULATION MEAN AND ITS ROBUSTNESS TO OPTIMUM VALUE STATISTICA, anno LXIX, n., 009 LINEAR COMBINATION OF ESTIMATORS IN PROBABILITY PROPORTIONAL TO SIZES SAMPLING TO ESTIMATE THE POPULATION MEAN AND ITS ROBUSTNESS TO OPTIMUM VALUE. INTRODUCTION To reduce

More information

UNIVERSITY OF ILLINOIS LIBRARY AT URBANA-CHAMPAIGN BOOKSTACKS

UNIVERSITY OF ILLINOIS LIBRARY AT URBANA-CHAMPAIGN BOOKSTACKS UNIVERSITY OF ILLINOIS LIBRARY AT URBANA-CHAMPAIGN BOOKSTACKS i CENTRAL CIRCULATION BOOKSTACKS The person charging this material is responsible for its renewal or its return to the library from which it

More information

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed

More information

A Cautionary Note on Estimating the Reliability of a Mastery Test with the Beta-Binomial Model

A Cautionary Note on Estimating the Reliability of a Mastery Test with the Beta-Binomial Model A Cautionary Note on Estimating the Reliability of a Mastery Test with the Beta-Binomial Model Rand R. Wilcox University of Southern California Based on recently published papers, it might be tempting

More information

Unit 2. Describing Data: Numerical

Unit 2. Describing Data: Numerical Unit 2 Describing Data: Numerical Describing Data Numerically Describing Data Numerically Central Tendency Arithmetic Mean Median Mode Variation Range Interquartile Range Variance Standard Deviation Coefficient

More information

On line resources Should be able to use for homework

On line resources Should be able to use for homework On line resources Should be able to use for homework http://www.amstat.org/publications/jse/v10n3/aberson/po wer_applet.html http://www.indiana.edu/~psyugrad/gradschool/apply.php http://onlinestatbook.com/stat_sim/conf_interval/index.ht

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 12: Frequentist properties of estimators (v4) Ramesh Johari ramesh.johari@stanford.edu 1 / 39 Frequentist inference 2 / 39 Thinking like a frequentist Suppose that for some

More information

Wooldridge, Introductory Econometrics, 2d ed. Chapter 8: Heteroskedasticity In laying out the standard regression model, we made the assumption of

Wooldridge, Introductory Econometrics, 2d ed. Chapter 8: Heteroskedasticity In laying out the standard regression model, we made the assumption of Wooldridge, Introductory Econometrics, d ed. Chapter 8: Heteroskedasticity In laying out the standard regression model, we made the assumption of homoskedasticity of the regression error term: that its

More information

Chained Versus Post-Stratification Equating in a Linear Context: An Evaluation Using Empirical Data

Chained Versus Post-Stratification Equating in a Linear Context: An Evaluation Using Empirical Data Research Report Chained Versus Post-Stratification Equating in a Linear Context: An Evaluation Using Empirical Data Gautam Puhan February 2 ETS RR--6 Listening. Learning. Leading. Chained Versus Post-Stratification

More information

OLD BUSINESS. Conditional versus Unconditional Analyses (Goldstein, Plewis...) [UK Reading example]

OLD BUSINESS. Conditional versus Unconditional Analyses (Goldstein, Plewis...) [UK Reading example] Rogosa, D. R. (1988). Myths about longitudinal research. In Methodological issues in aging research, K. W. Schaie, R. T. Campbell, W. M. Meredith, and S. C. Rawlings, Eds. New York, Springer Publishing

More information

ACTEX CAS EXAM 3 STUDY GUIDE FOR MATHEMATICAL STATISTICS

ACTEX CAS EXAM 3 STUDY GUIDE FOR MATHEMATICAL STATISTICS ACTEX CAS EXAM 3 STUDY GUIDE FOR MATHEMATICAL STATISTICS TABLE OF CONTENTS INTRODUCTORY NOTE NOTES AND PROBLEM SETS Section 1 - Point Estimation 1 Problem Set 1 15 Section 2 - Confidence Intervals and

More information

II. MATCHMAKER, MATCHMAKER

II. MATCHMAKER, MATCHMAKER II. MATCHMAKER, MATCHMAKER Josh Angrist MIT 14.387 Fall 2014 Agenda Matching. What could be simpler? We look for causal effects by comparing treatment and control within subgroups where everything... or

More information

Multicollinearity and A Ridge Parameter Estimation Approach

Multicollinearity and A Ridge Parameter Estimation Approach Journal of Modern Applied Statistical Methods Volume 15 Issue Article 5 11-1-016 Multicollinearity and A Ridge Parameter Estimation Approach Ghadban Khalaf King Khalid University, albadran50@yahoo.com

More information

Økonomisk Kandidateksamen 2004 (I) Econometrics 2. Rettevejledning

Økonomisk Kandidateksamen 2004 (I) Econometrics 2. Rettevejledning Økonomisk Kandidateksamen 2004 (I) Econometrics 2 Rettevejledning This is a closed-book exam (uden hjælpemidler). Answer all questions! The group of questions 1 to 4 have equal weight. Within each group,

More information

SAMPLE IS USED IN A NEW SAMPLE

SAMPLE IS USED IN A NEW SAMPLE EB 50-40 ~ E S E B A U R L t L Ii E TI EFFICIENCY OF PBEDICTION WHEN A REGRESSION EQUATION FROM ONE SAMPLE IS USED IN A NEW SAMPLE Frederic M. Lord (Prepublication draft) N ~----'-- This Bulletin is a

More information

FORMATIVE AND REFLECTIVE MODELS: STATE OF THE ART. Anna Simonetto *

FORMATIVE AND REFLECTIVE MODELS: STATE OF THE ART. Anna Simonetto * Electronic Journal of Applied Statistical Analysis EJASA (2012), Electron. J. App. Stat. Anal., Vol. 5, Issue 3, 452 457 e-issn 2070-5948, DOI 10.1285/i20705948v5n3p452 2012 Università del Salento http://siba-ese.unile.it/index.php/ejasa/index

More information

Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing

Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing 1. Purpose of statistical inference Statistical inference provides a means of generalizing

More information

1 Motivation for Instrumental Variable (IV) Regression

1 Motivation for Instrumental Variable (IV) Regression ECON 370: IV & 2SLS 1 Instrumental Variables Estimation and Two Stage Least Squares Econometric Methods, ECON 370 Let s get back to the thiking in terms of cross sectional (or pooled cross sectional) data

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple

More information

Test for Discontinuities in Nonparametric Regression

Test for Discontinuities in Nonparametric Regression Communications of the Korean Statistical Society Vol. 15, No. 5, 2008, pp. 709 717 Test for Discontinuities in Nonparametric Regression Dongryeon Park 1) Abstract The difference of two one-sided kernel

More information

THE EVALUATION OF LINEAR FORMS P. S. DWYER

THE EVALUATION OF LINEAR FORMS P. S. DWYER PSYCHOMETRIKA~VOL. 6, NO. 6 DECEMBER. 1941 THE EVALUATION OF LINEAR FORMS P. S. DWYER UNIVERSITY OF MICHIGAN This presentation deals with the evaluation and transformation of linear forms. Especial emphasis

More information

Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded

Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded 1 Background Latent confounder is common in social and behavioral science in which most of cases the selection mechanism

More information

Plausible Values for Latent Variables Using Mplus

Plausible Values for Latent Variables Using Mplus Plausible Values for Latent Variables Using Mplus Tihomir Asparouhov and Bengt Muthén August 21, 2010 1 1 Introduction Plausible values are imputed values for latent variables. All latent variables can

More information

Study of the Relationship between Dependent and Independent Variable Groups by Using Canonical Correlation Analysis with Application

Study of the Relationship between Dependent and Independent Variable Groups by Using Canonical Correlation Analysis with Application Modern Applied Science; Vol. 9, No. 8; 2015 ISSN 1913-1844 E-ISSN 1913-1852 Published by Canadian Center of Science and Education Study of the Relationship between Dependent and Independent Variable Groups

More information

Psychology 454: Latent Variable Modeling How do you know if a model works?

Psychology 454: Latent Variable Modeling How do you know if a model works? Psychology 454: Latent Variable Modeling How do you know if a model works? William Revelle Department of Psychology Northwestern University Evanston, Illinois USA November, 2012 1 / 18 Outline 1 Goodness

More information

1/11/2011. Chapter 4: Variability. Overview

1/11/2011. Chapter 4: Variability. Overview Chapter 4: Variability Overview In statistics, our goal is to measure the amount of variability for a particular set of scores, a distribution. In simple terms, if the scores in a distribution are all

More information

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.

Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. Preface p. xi Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. 6 The Scientific Method and the Design of

More information

Sequential Reliability Tests Mindert H. Eiting University of Amsterdam

Sequential Reliability Tests Mindert H. Eiting University of Amsterdam Sequential Reliability Tests Mindert H. Eiting University of Amsterdam Sequential tests for a stepped-up reliability estimator and coefficient alpha are developed. In a series of monte carlo experiments,

More information

Analysis of propensity score approaches in difference-in-differences designs

Analysis of propensity score approaches in difference-in-differences designs Author: Diego A. Luna Bazaldua Institution: Lynch School of Education, Boston College Contact email: diego.lunabazaldua@bc.edu Conference section: Research methods Analysis of propensity score approaches

More information

* * MATHEMATICS 4732 Probability & Statistics 1 ADVANCED SUBSIDIARY GCE. Wednesday 21 January 2009 Afternoon. Duration: 1 hour 30 minutes.

* * MATHEMATICS 4732 Probability & Statistics 1 ADVANCED SUBSIDIARY GCE. Wednesday 21 January 2009 Afternoon. Duration: 1 hour 30 minutes. ADVANCED SUBSIDIARY GCE MATHEMATICS 4732 Probability & Statistics 1 Candidates answer on the Answer Booklet OCR Supplied Materials: 8 page Answer Booklet List of Formulae (MF1) Other Materials Required:

More information

Paper Reference R. Statistics S4 Advanced/Advanced Subsidiary. Friday 21 June 2013 Morning Time: 1 hour 30 minutes

Paper Reference R. Statistics S4 Advanced/Advanced Subsidiary. Friday 21 June 2013 Morning Time: 1 hour 30 minutes Centre No. Candidate No. Paper Reference(s) 6686/01R Edexcel GCE Statistics S4 Advanced/Advanced Subsidiary Friday 21 June 2013 Morning Time: 1 hour 30 minutes Materials required for examination Mathematical

More information

Regression-Discontinuity Analysis

Regression-Discontinuity Analysis Page 1 of 11 Home» Analysis» Inferential Statistics» Regression-Discontinuity Analysis Analysis Requirements The basic RD Design is a two-group pretestposttest model as indicated in the design notation.

More information

Problems with Stepwise Procedures in. Discriminant Analysis. James M. Graham. Texas A&M University

Problems with Stepwise Procedures in. Discriminant Analysis. James M. Graham. Texas A&M University Running Head: PROBLEMS WITH STEPWISE IN DA Problems with Stepwise Procedures in Discriminant Analysis James M. Graham Texas A&M University 77842-4225 Graham, J. M. (2001, January). Problems with stepwise

More information

The variable θ is called the parameter of the model, and the set Ω is called the parameter space.

The variable θ is called the parameter of the model, and the set Ω is called the parameter space. Lecture 8 What is a statistical model? A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced the data. The variable θ is called

More information

Multiple Regression Analysis. Part III. Multiple Regression Analysis

Multiple Regression Analysis. Part III. Multiple Regression Analysis Part III Multiple Regression Analysis As of Sep 26, 2017 1 Multiple Regression Analysis Estimation Matrix form Goodness-of-Fit R-square Adjusted R-square Expected values of the OLS estimators Irrelevant

More information

1 Procedures robust to weak instruments

1 Procedures robust to weak instruments Comment on Weak instrument robust tests in GMM and the new Keynesian Phillips curve By Anna Mikusheva We are witnessing a growing awareness among applied researchers about the possibility of having weak

More information

STRUCTURAL EQUATION MODELING. Khaled Bedair Statistics Department Virginia Tech LISA, Summer 2013

STRUCTURAL EQUATION MODELING. Khaled Bedair Statistics Department Virginia Tech LISA, Summer 2013 STRUCTURAL EQUATION MODELING Khaled Bedair Statistics Department Virginia Tech LISA, Summer 2013 Introduction: Path analysis Path Analysis is used to estimate a system of equations in which all of the

More information

Comparative Bayesian Confirmation and the Quine Duhem Problem: A Rejoinder to Strevens Branden Fitelson and Andrew Waterman

Comparative Bayesian Confirmation and the Quine Duhem Problem: A Rejoinder to Strevens Branden Fitelson and Andrew Waterman Brit. J. Phil. Sci. 58 (2007), 333 338 Comparative Bayesian Confirmation and the Quine Duhem Problem: A Rejoinder to Strevens Branden Fitelson and Andrew Waterman ABSTRACT By and large, we think (Strevens

More information