RR R E E A H R E P DENOTING THE BASE FREE MEASURE OF CHANGE. Samuel Messick. Educational Testing Service Princeton, New Jersey December 1980
|
|
- Gwenda Tabitha McGee
- 5 years ago
- Views:
Transcription
1 RR R E 5 E A RC H R E P o R T DENOTING THE BASE FREE MEASURE OF CHANGE Samuel Messick Educational Testing Service Princeton, New Jersey December 1980
2 DENOTING THE BASE-FREE MEASURE OF CHANGE Samuel Messick Educational Testing Service Princeton, New Jersey December 1980
3 Educational Testing Service. All rights reserved.
4 ABSTRACT Denoting the Base-Free Measure of Change Samuel Messick Educational Testing Service Bond criticized the base-free measure of change proposed by Tucker, Damarin, and Messick by pointing to an incorrect derivation which is here viewed instead as a correct derivation entailing an inadequately specified tacit assumption. Bond's revision leads to estimates of the correlation between initial position and change which are negatively biased by correlated errors, whereas the original approach, with the tacit assumption properly denoted, leads to unbiased values. Key words: base-free measure of change, difference scores, change scores, measuring change.
5 Denoting the Base-Free Measure of Change Samuel Messick Educational Testing Service Bond (1979) pointed out a presumably incorrect derivation in the Tucker, Damarin, and Messick (1966) presentation of a base-free measure of change. As a consequence, Bond argued that properties attributed to a (the coefficient for the regression of true scores from the second testing on true scores from the first testing) were instead properties of b, the corresponding regression coefficient for observed scores. At issue was the correlation between initial position () and change (d = XZ-XI), with Tucker et al. (1966) claiming that P d was positive when a > 1, was zero when a = 1, and was negative when a < I and Bond insisting that these cut-off points were determined by b rather than a. In reply, Tucker (1979) emphasized that the focus should have been on the correlation between true initial position (tl) and true change (6 = tz-tl) all along, whereby the interpretation returns to a comparison of a with unity since (1) Equation (1) is algebraically equivalent to the formula provided by Zieve (1940) for the correlation between true initial position and the true difference score, which also appears as equation (62) in Tucker et al. (1966), Zieve's formula explicitly corrects P d for correlated errors as well as for attenuation. The derivation error in question revolves around whether P xlxl in the formula for P d represents the reliability of, as Tucker et al. tacitly
6 -2- assumed, or the self-correlation of (i.e., unity) as Bond properly asserted. In the absence of further specifications, Bond was correct. But once the nature of the Tucker et al. tacit assumption is explicitly denoted, we see that a remains a proper benchmark regardless of whether the observed correlation or the true correlation is considered. Using the Tucker et al. (1966) notation which was also followed by Bond (1979), as well as standard relationships from classical test theory, the basefree measure of change (y) is defined as a true independent change score: (2) (3) a = P a xl x2 X2 p a xlxl xl b Formula (61) in Tucker et al. (1966) and formula (14) in Bond (1979) give the correlation between an observed difference score and any other variable k, (4) PX2kaX2 - Pka ad Now t instead of setting k = xl as Tucker et a1. did originally, leading Bond to question whether p xlxl should not properly be unity and hence (5) (b-d) o xl let k = xl, a parallel form of xl' The use of parallel forms of the pretest, one to determine the difference score and one to correlate with the difference score, eliminates the negative bias in P d produced by correlated errors (Thorndike, 1966), Equation (4) then becomes, with inserts from equation (3),
7 -3- (6) Equation (6) is algebraically equivalent to equation (63) in Tucker et al. (1966), which gives the correlation between observed initial position and the observed difference score corrected for correlated errors. It is clear from equation (6) that PX'd is positive when a > 1, is zero when a = 1, and is negative when a < 1. I Furthermore, if both sides of equation (6) are divided by (PxlxI)~(Pdd')~,where P dd, is the reliability of the observed difference score, then (7) = = which is consistent with Tucker's (1979) formulation. Thus, P t l o is Pxid corrected for attenuation in both xi and d. As previously noted, it is clear from equation (6) that P 'd--the correlaxl tion between observed initial position and observed change uncontaminated by correlated errors--is positive when a > 1, is zero when a = 1, and is negative when a < 1. It may be helpful in explicating these relationships to consider a as the product of two ratios (8) The first ratio is typically less than unity and will usually become smaller and smaller as the time interval between the first and second testing increases. Therefore, a > 1 and P, is positiv~ only when xld
8 -4- (9) (J x2 This ~ondition of (J becoming increasingly larger than (J as the testing inx2 xl terval increases corresponds to a growth model known as the fan-spread hypothesis (Bryk & Weisberg. 1977). Further. a = 1 and Px'd 1 o when (10) ---= This condition corresponds to a growth model known as the overlap hypothesis (Anderson. 1939). where XL consists of everything present at xl plus an independent increment uncorrelated with xl' In the overlap model of growth. the squared correlation coefficient (corrected for attenuation) between test scores at two points in time is sometimes interpreted as indicating the "percent of the elements" in the second measurement that had been attained by the time of the first measurement (Anderson p. 365). In this view of longitudinal data. the proportion of common elements or the amount of overlap in two measurements taken at different points in time is indexed not only by the squared unattenuated correlation between the two tests or by the ratio of the two test variances as in equation (10), but by the ratio of the two test means (Bloom. 1964). Such reasoning enabled Bloom (1964). after reviewing empirical evidence that P d averaged about zero for mea sures of intelligence (at least through age seven). to move from statements about variance to statements about amount--that is. instead of stating that 50 percent of the variance of adult intelligence is accounted for by age four. he claimed that "about 50% of the development takes place between conception and age 4" (P. 88).
9 -5- This is a strong conclusion, but the overlap model of growth undergirding it implies the very special condition of a = 1. Finally, a < land p 'd is negative when xl (11) a x2 which occurs so frequently in situations having test ceiling effects or in systems having natural boundary conditions, as in the measurement of psychophysiological responses, that its corresponding growth model has been called the law of initial values (Lacey & Lacey, 1962). These differences in possible conceptions of the nature of growth occurring in a particular instance hinge on the values of a or.p 'd--not, as Bond (1979) averred. on b or p d. These latter values incorporate correlated errors and hence are negatively biased. It can be seen from equation (3) that the difference between using a or b to specify analytically the conditions under which the correlation between initial position and change will be positive. negative, or zero becomes increasingly important to the extent that the initial measure is fallible. Since unreliability of measurement is a pervasive problem during periods of rapid growth as in early childhood, the distinction between a and b is especially germane there. but it is also important in any investigation of the correlates of change using fallible measures (Tucker et al., 1966).
10 REFERENCES Anderson, J. E. The limitation of infant and preschool tests in the measurement of intelligence. Journal of Psychology, 1939, ~, Bloom, B. S. Stability and change in human characteristics. New York: Wiley, Bond, L. On the base-free measure of change proposed by Tucker, Damarin, and Messick. Psychometrika, 1979, 44, Bryk, A. S. & Weisberg, H. I. Use of the nonequiva1ent control group design when subjects are growing. Psychological Bulletin, 1977, 84, Lacey, J. I. & Lacey, B. C. The law of initial value in the longitudinal study of autonomic constitution: Reproducibility of autonomic responses and response patterns over a four year interval. In W. M. Wolf (Ed.), Rythmic functions in the living system. Annals of the New York Academy of Sciences, Thorndike, R. L. Intellectual status and intellectual growth. Journal of Educational Psycho1o~, 1966,22, Tucker, L. R. Comment on a note on a base-free measure of change. Psychometrika, 1979, 44, 357. Tucker, L. R., Damarin, F., & Messick S. A base-free measure of change. Psychometrika, 1966, 1!, Zieve, L. Note on the correlation of initial scores with gains. Journal of Educational Psychology, 1940, 1!,
Estimating Operational Validity Under Incidental Range Restriction: Some Important but Neglected Issues
A peer-reviewed electronic journal. Copyright is retained by the first or sole author, who grants right of first publication to Practical Assessment, Research & Evaluation. Permission is granted to distribute
More informationPROBABILITIES OF MISCLASSIFICATION IN DISCRIMINATORY ANALYSIS. M. Clemens Johnson
RB-55-22 ~ [ s [ B A U R L t L Ii [ T I N PROBABILITIES OF MISCLASSIFICATION IN DISCRIMINATORY ANALYSIS M. Clemens Johnson This Bulletin is a draft for interoffice circulation. Corrections and suggestions
More informationCORRELATIONS ~ PARTIAL REGRESSION COEFFICIENTS (GROWTH STUDY PAPER #29) and. Charles E. Werts
RB-69-6 ASSUMPTIONS IN MAKING CAUSAL INFERENCES FROM PART CORRELATIONS ~ PARTIAL CORRELATIONS AND PARTIAL REGRESSION COEFFICIENTS (GROWTH STUDY PAPER #29) Robert L. Linn and Charles E. Werts This Bulletin
More informationVARIABILITY OF KUDER-RICHARDSON FOm~A 20 RELIABILITY ESTIMATES. T. Anne Cleary University of Wisconsin and Robert L. Linn Educational Testing Service
~ E S [ B A U ~ L t L H E TI VARIABILITY OF KUDER-RICHARDSON FOm~A 20 RELIABILITY ESTIMATES RB-68-7 N T. Anne Cleary University of Wisconsin and Robert L. Linn Educational Testing Service This Bulletin
More informationA TEST OF SIGNIFICANCE OF DIFFERENCE BETWEEN CORRELATED PROPORTIONS. John A. Keats
~ E S E B A U ~ L t L I-i E TI RB-55-20 A TEST OF SIGNIFICANCE OF DIFFERENCE BETWEEN CORRELATED PROPORTIONS John A. Keats N This Bulletin is a draft for interoffice circulation. Corrections and suggestions
More informationNonequivalent-Populations Design David J. Woodruff American College Testing Program
A Comparison of Three Linear Equating Methods for the Common-Item Nonequivalent-Populations Design David J. Woodruff American College Testing Program Three linear equating methods for the common-item nonequivalent-populations
More informationReconciling factor-based and composite-based approaches to structural equation modeling
Reconciling factor-based and composite-based approaches to structural equation modeling Edward E. Rigdon (erigdon@gsu.edu) Modern Modeling Methods Conference May 20, 2015 Thesis: Arguments for factor-based
More informationInstrumental Variables
Instrumental Variables James H. Steiger Department of Psychology and Human Development Vanderbilt University James H. Steiger (Vanderbilt University) Instrumental Variables 1 / 10 Instrumental Variables
More informationChapter 4: Factor Analysis
Chapter 4: Factor Analysis In many studies, we may not be able to measure directly the variables of interest. We can merely collect data on other variables which may be related to the variables of interest.
More informationAn extended summary of the NCGR/Berkeley Double-Blind Test of Astrology undertaken by Shawn Carlson and published in 1985
From: http://www.astrodivination.com/moa/ncgrberk.htm An extended summary of the NCGR/Berkeley Double-Blind Test of Astrology undertaken by Shawn Carlson and published in 1985 Introduction Under the heading
More informationEquating Subscores Using Total Scaled Scores as an Anchor
Research Report ETS RR 11-07 Equating Subscores Using Total Scaled Scores as an Anchor Gautam Puhan Longjuan Liang March 2011 Equating Subscores Using Total Scaled Scores as an Anchor Gautam Puhan and
More informationInstrumental Variables
James H. Steiger Department of Psychology and Human Development Vanderbilt University Regression Modeling, 2009 1 Introduction 2 3 4 Instrumental variables allow us to get a better estimate of a causal
More informationPROGRAM STATISTICS RESEARCH
An Alternate Definition of the ETS Delta Scale of Item Difficulty Paul W. Holland and Dorothy T. Thayer @) PROGRAM STATISTICS RESEARCH TECHNICAL REPORT NO. 85..64 EDUCATIONAL TESTING SERVICE PRINCETON,
More informationDesign of Experiments
Design of Experiments DOE Radu T. Trîmbiţaş April 20, 2016 1 Introduction The Elements Affecting the Information in a Sample Generally, the design of experiments (DOE) is a very broad subject concerned
More informationy response variable x 1, x 2,, x k -- a set of explanatory variables
11. Multiple Regression and Correlation y response variable x 1, x 2,, x k -- a set of explanatory variables In this chapter, all variables are assumed to be quantitative. Chapters 12-14 show how to incorporate
More informationWooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares
Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Many economic models involve endogeneity: that is, a theoretical relationship does not fit
More informationClarifying the concepts of reliability, validity and generalizability
Clarifying the concepts of reliability, validity and generalizability Maria Valaste 1 and Lauri Tarkkonen 2 1 University of Helsinki, Finland e-mail: maria.valaste@helsinki.fi 2 University of Helsinki,
More informationIgnoring the matching variables in cohort studies - when is it valid, and why?
Ignoring the matching variables in cohort studies - when is it valid, and why? Arvid Sjölander Abstract In observational studies of the effect of an exposure on an outcome, the exposure-outcome association
More informationMyths about the Analysis of Change
James H. Steiger Department of Psychology and Human Development Vanderbilt University, 2010 1 Introduction 2 Limitations of the Two-Observation Study 3 Myth 2: The Difference Score is Intrinsically Unreliable
More informationA bias improved estimator of the concordance correlation coefficient
The 22 nd Annual Meeting in Mathematics (AMM 217) Department of Mathematics, Faculty of Science Chiang Mai University, Chiang Mai, Thailand A bias improved estimator of the concordance correlation coefficient
More informationIEOR 165 Lecture 7 1 Bias-Variance Tradeoff
IEOR 165 Lecture 7 Bias-Variance Tradeoff 1 Bias-Variance Tradeoff Consider the case of parametric regression with β R, and suppose we would like to analyze the error of the estimate ˆβ in comparison to
More informationEXAMINATION: QUANTITATIVE EMPIRICAL METHODS. Yale University. Department of Political Science
EXAMINATION: QUANTITATIVE EMPIRICAL METHODS Yale University Department of Political Science January 2014 You have seven hours (and fifteen minutes) to complete the exam. You can use the points assigned
More informationStatistical Power and Autocorrelation for Short, Comparative Interrupted Time Series Designs with Aggregate Data
Statistical Power and Autocorrelation for Short, Comparative Interrupted Time Series Designs with Aggregate Data Andrew Swanlund, American Institutes for Research (aswanlund@air.org) Kelly Hallberg, University
More information6.435, System Identification
System Identification 6.435 SET 3 Nonparametric Identification Munther A. Dahleh 1 Nonparametric Methods for System ID Time domain methods Impulse response Step response Correlation analysis / time Frequency
More informationUNIT 3 CONCEPT OF DISPERSION
UNIT 3 CONCEPT OF DISPERSION Structure 3.0 Introduction 3.1 Objectives 3.2 Concept of Dispersion 3.2.1 Functions of Dispersion 3.2.2 Measures of Dispersion 3.2.3 Meaning of Dispersion 3.2.4 Absolute Dispersion
More information2/26/2017. This is similar to canonical correlation in some ways. PSY 512: Advanced Statistics for Psychological and Behavioral Research 2
PSY 512: Advanced Statistics for Psychological and Behavioral Research 2 What is factor analysis? What are factors? Representing factors Graphs and equations Extracting factors Methods and criteria Interpreting
More informationChapter 8. Models with Structural and Measurement Components. Overview. Characteristics of SR models. Analysis of SR models. Estimation of SR models
Chapter 8 Models with Structural and Measurement Components Good people are good because they've come to wisdom through failure. Overview William Saroyan Characteristics of SR models Estimation of SR models
More informationAn Approximate Test for Homogeneity of Correlated Correlation Coefficients
Quality & Quantity 37: 99 110, 2003. 2003 Kluwer Academic Publishers. Printed in the Netherlands. 99 Research Note An Approximate Test for Homogeneity of Correlated Correlation Coefficients TRIVELLORE
More informationPoint-to-point response to reviewers comments
Point-to-point response to reviewers comments Reviewer #1 1) The authors analyze only one millennial reconstruction (Jones, 1998) with the argument that it is the only one available. This is incorrect.
More informationEMERGING MARKETS - Lecture 2: Methodology refresher
EMERGING MARKETS - Lecture 2: Methodology refresher Maria Perrotta April 4, 2013 SITE http://www.hhs.se/site/pages/default.aspx My contact: maria.perrotta@hhs.se Aim of this class There are many different
More informationCausal Inference with Big Data Sets
Causal Inference with Big Data Sets Marcelo Coca Perraillon University of Colorado AMC November 2016 1 / 1 Outlone Outline Big data Causal inference in economics and statistics Regression discontinuity
More informationThe Use of Structural Equation Models in Interpreting Regression Equations Including Suppressor and Enhancer Variables
The Use of Structural Equation Models in Interpreting Regression Equations Including Suppressor and Enhancer Variables Robert M. McFatter University of Denver It is shown that the usual interpretation
More informationLinear Equating Models for the Common-item Nonequivalent-Populations Design Michael J. Kolen and Robert L. Brennan American College Testing Program
Linear Equating Models for the Common-item Nonequivalent-Populations Design Michael J. Kolen Robert L. Brennan American College Testing Program The Tucker Levine equally reliable linear meth- in the common-item
More informationLecture Notes on Measurement Error
Steve Pischke Spring 2000 Lecture Notes on Measurement Error These notes summarize a variety of simple results on measurement error which I nd useful. They also provide some references where more complete
More informationScaling of Variance Space
... it Dominance, Information, and Hierarchical Scaling of Variance Space David J. Krus and Robert W. Ceurvorst Arizona State University A method for computation of dominance relations and for construction
More informationConsequences of measurement error. Psychology 588: Covariance structure and factor models
Consequences of measurement error Psychology 588: Covariance structure and factor models Scaling indeterminacy of latent variables Scale of a latent variable is arbitrary and determined by a convention
More informationA Rejoinder to Mackintosh and some Remarks on the. Concept of General Intelligence
A Rejoinder to Mackintosh and some Remarks on the Concept of General Intelligence Moritz Heene Department of Psychology, Ludwig Maximilian University, Munich, Germany. 1 Abstract In 2000 Nicholas J. Mackintosh
More informationOf Nickell Bias, Cross-Sectional Dependence, and Their Cures: Reply
Of Nickell Bias, Cross-Sectional Dependence, and Their Cures: Reply Khusrav Gaibulloev Department of Economics American University of Sharjah Sharjah, UAE kgaibulloev@aus.edu and Todd Sandler Donggyu Sul
More informationObserved-Score "Equatings"
Comparison of IRT True-Score and Equipercentile Observed-Score "Equatings" Frederic M. Lord and Marilyn S. Wingersky Educational Testing Service Two methods of equating tests are compared, one using true
More informationMath Review. for the Quantitative Reasoning measure of the GRE General Test
Math Review for the Quantitative Reasoning measure of the GRE General Test www.ets.org Overview This Math Review will familiarize you with the mathematical skills and concepts that are important for solving
More informationFinQuiz Notes
Reading 10 Multiple Regression and Issues in Regression Analysis 2. MULTIPLE LINEAR REGRESSION Multiple linear regression is a method used to model the linear relationship between a dependent variable
More informationUse of e-rater in Scoring of the TOEFL ibt Writing Test
Research Report ETS RR 11-25 Use of e-rater in Scoring of the TOEFL ibt Writing Test Shelby J. Haberman June 2011 Use of e-rater in Scoring of the TOEFL ibt Writing Test Shelby J. Haberman ETS, Princeton,
More informationFormula for the t-test
Formula for the t-test: How the t-test Relates to the Distribution of the Data for the Groups Formula for the t-test: Formula for the Standard Error of the Difference Between the Means Formula for the
More informationWooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems
Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems Functional form misspecification We may have a model that is correctly specified, in terms of including
More informationAnalyzing Group by Time Effects in Longitudinal Two-Group Randomized Trial Designs With Missing Data
Journal of Modern Applied Statistical Methods Volume Issue 1 Article 6 5-1-003 Analyzing Group by Time Effects in Longitudinal Two-Group Randomized Trial Designs With Missing Data James Algina University
More informationGeneral structural model Part 1: Covariance structure and identification. Psychology 588: Covariance structure and factor models
General structural model Part 1: Covariance structure and identification Psychology 588: Covariance structure and factor models Latent variables 2 Interchangeably used: constructs --- substantively defined
More informationA Pedagogic Demonstration of Attenuation of Correlation Due to Measurement Error
Spreadsheets in Education (ejsie) Volume 7 Issue 1 Article 4 1-14-2014 A Pedagogic Demonstration of Attenuation of Correlation Due to Measurement Error Christopher R. Fisher Miami University, fisherc2@miamioh.edu
More informationLesson 6: Reliability
Lesson 6: Reliability Patrícia Martinková Department of Statistical Modelling Institute of Computer Science, Czech Academy of Sciences NMST 570, December 12, 2017 Dec 19, 2017 1/35 Contents 1. Introduction
More informationM(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1
Math 66/566 - Midterm Solutions NOTE: These solutions are for both the 66 and 566 exam. The problems are the same until questions and 5. 1. The moment generating function of a random variable X is M(t)
More informationAdvanced Quantitative Research Methodology Lecture Notes: January Ecological 28, 2012 Inference1 / 38
Advanced Quantitative Research Methodology Lecture Notes: Ecological Inference 1 Gary King http://gking.harvard.edu January 28, 2012 1 c Copyright 2008 Gary King, All Rights Reserved. Gary King http://gking.harvard.edu
More informationRegularized Common Factor Analysis
New Trends in Psychometrics 1 Regularized Common Factor Analysis Sunho Jung 1 and Yoshio Takane 1 (1) Department of Psychology, McGill University, 1205 Dr. Penfield Avenue, Montreal, QC, H3A 1B1, Canada
More informationModel II (or random effects) one-way ANOVA:
Model II (or random effects) one-way ANOVA: As noted earlier, if we have a random effects model, the treatments are chosen from a larger population of treatments; we wish to generalize to this larger population.
More informationMATH 1150 Chapter 2 Notation and Terminology
MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the
More informationAN INDEX OF THE DISCRIMINATING POWER OF A TEST. Richard Levine and Frederic M. Lord
RB-58-13 ~ [ s [ B A U ~ L t L I-t [ T I N AN INDEX OF THE DISCRIMINATING POWER OF A TEST AT DIFFERENT PARTS OF THE SCORE RANGE Richard Levine and Frederic M. Lord This Bulletin is a draft for interoffice
More information1 The Classic Bivariate Least Squares Model
Review of Bivariate Linear Regression Contents 1 The Classic Bivariate Least Squares Model 1 1.1 The Setup............................... 1 1.2 An Example Predicting Kids IQ................. 1 2 Evaluating
More informationA Note on the Choice of an Anchor Test in Equating
Research Report ETS RR 12-14 A Note on the Choice of an Anchor Test in Equating Sandip Sinharay Shelby Haberman Paul Holland Charles Lewis September 2012 ETS Research Report Series EIGNOR EXECUTIVE EDITOR
More informationShelby J. Haberman. Hongwen Guo. Jinghua Liu. Neil J. Dorans. ETS, Princeton, NJ
Consistency of SAT I: Reasoning Test Score Conversions Shelby J. Haberman Hongwen Guo Jinghua Liu Neil J. Dorans ETS, Princeton, NJ Paper presented at the annual meeting of the American Educational Research
More informationRegression Models - Introduction
Regression Models - Introduction In regression models there are two types of variables that are studied: A dependent variable, Y, also called response variable. It is modeled as random. An independent
More informationSimple Linear Regression Model & Introduction to. OLS Estimation
Inside ECOOMICS Introduction to Econometrics Simple Linear Regression Model & Introduction to Introduction OLS Estimation We are interested in a model that explains a variable y in terms of other variables
More informationMixed- Model Analysis of Variance. Sohad Murrar & Markus Brauer. University of Wisconsin- Madison. Target Word Count: Actual Word Count: 2755
Mixed- Model Analysis of Variance Sohad Murrar & Markus Brauer University of Wisconsin- Madison The SAGE Encyclopedia of Educational Research, Measurement and Evaluation Target Word Count: 3000 - Actual
More informationEstimation of Conditional Kendall s Tau for Bivariate Interval Censored Data
Communications for Statistical Applications and Methods 2015, Vol. 22, No. 6, 599 604 DOI: http://dx.doi.org/10.5351/csam.2015.22.6.599 Print ISSN 2287-7843 / Online ISSN 2383-4757 Estimation of Conditional
More informationA Better Way to Do R&R Studies
The Evaluating the Measurement Process Approach Last month s column looked at how to fix some of the Problems with Gauge R&R Studies. This month I will show you how to learn more from your gauge R&R data
More informationLINEAR COMBINATION OF ESTIMATORS IN PROBABILITY PROPORTIONAL TO SIZES SAMPLING TO ESTIMATE THE POPULATION MEAN AND ITS ROBUSTNESS TO OPTIMUM VALUE
STATISTICA, anno LXIX, n., 009 LINEAR COMBINATION OF ESTIMATORS IN PROBABILITY PROPORTIONAL TO SIZES SAMPLING TO ESTIMATE THE POPULATION MEAN AND ITS ROBUSTNESS TO OPTIMUM VALUE. INTRODUCTION To reduce
More informationUNIVERSITY OF ILLINOIS LIBRARY AT URBANA-CHAMPAIGN BOOKSTACKS
UNIVERSITY OF ILLINOIS LIBRARY AT URBANA-CHAMPAIGN BOOKSTACKS i CENTRAL CIRCULATION BOOKSTACKS The person charging this material is responsible for its renewal or its return to the library from which it
More informationCausal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies
Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed
More informationA Cautionary Note on Estimating the Reliability of a Mastery Test with the Beta-Binomial Model
A Cautionary Note on Estimating the Reliability of a Mastery Test with the Beta-Binomial Model Rand R. Wilcox University of Southern California Based on recently published papers, it might be tempting
More informationUnit 2. Describing Data: Numerical
Unit 2 Describing Data: Numerical Describing Data Numerically Describing Data Numerically Central Tendency Arithmetic Mean Median Mode Variation Range Interquartile Range Variance Standard Deviation Coefficient
More informationOn line resources Should be able to use for homework
On line resources Should be able to use for homework http://www.amstat.org/publications/jse/v10n3/aberson/po wer_applet.html http://www.indiana.edu/~psyugrad/gradschool/apply.php http://onlinestatbook.com/stat_sim/conf_interval/index.ht
More informationMS&E 226: Small Data
MS&E 226: Small Data Lecture 12: Frequentist properties of estimators (v4) Ramesh Johari ramesh.johari@stanford.edu 1 / 39 Frequentist inference 2 / 39 Thinking like a frequentist Suppose that for some
More informationWooldridge, Introductory Econometrics, 2d ed. Chapter 8: Heteroskedasticity In laying out the standard regression model, we made the assumption of
Wooldridge, Introductory Econometrics, d ed. Chapter 8: Heteroskedasticity In laying out the standard regression model, we made the assumption of homoskedasticity of the regression error term: that its
More informationChained Versus Post-Stratification Equating in a Linear Context: An Evaluation Using Empirical Data
Research Report Chained Versus Post-Stratification Equating in a Linear Context: An Evaluation Using Empirical Data Gautam Puhan February 2 ETS RR--6 Listening. Learning. Leading. Chained Versus Post-Stratification
More informationOLD BUSINESS. Conditional versus Unconditional Analyses (Goldstein, Plewis...) [UK Reading example]
Rogosa, D. R. (1988). Myths about longitudinal research. In Methodological issues in aging research, K. W. Schaie, R. T. Campbell, W. M. Meredith, and S. C. Rawlings, Eds. New York, Springer Publishing
More informationACTEX CAS EXAM 3 STUDY GUIDE FOR MATHEMATICAL STATISTICS
ACTEX CAS EXAM 3 STUDY GUIDE FOR MATHEMATICAL STATISTICS TABLE OF CONTENTS INTRODUCTORY NOTE NOTES AND PROBLEM SETS Section 1 - Point Estimation 1 Problem Set 1 15 Section 2 - Confidence Intervals and
More informationII. MATCHMAKER, MATCHMAKER
II. MATCHMAKER, MATCHMAKER Josh Angrist MIT 14.387 Fall 2014 Agenda Matching. What could be simpler? We look for causal effects by comparing treatment and control within subgroups where everything... or
More informationMulticollinearity and A Ridge Parameter Estimation Approach
Journal of Modern Applied Statistical Methods Volume 15 Issue Article 5 11-1-016 Multicollinearity and A Ridge Parameter Estimation Approach Ghadban Khalaf King Khalid University, albadran50@yahoo.com
More informationØkonomisk Kandidateksamen 2004 (I) Econometrics 2. Rettevejledning
Økonomisk Kandidateksamen 2004 (I) Econometrics 2 Rettevejledning This is a closed-book exam (uden hjælpemidler). Answer all questions! The group of questions 1 to 4 have equal weight. Within each group,
More informationSAMPLE IS USED IN A NEW SAMPLE
EB 50-40 ~ E S E B A U R L t L Ii E TI EFFICIENCY OF PBEDICTION WHEN A REGRESSION EQUATION FROM ONE SAMPLE IS USED IN A NEW SAMPLE Frederic M. Lord (Prepublication draft) N ~----'-- This Bulletin is a
More informationFORMATIVE AND REFLECTIVE MODELS: STATE OF THE ART. Anna Simonetto *
Electronic Journal of Applied Statistical Analysis EJASA (2012), Electron. J. App. Stat. Anal., Vol. 5, Issue 3, 452 457 e-issn 2070-5948, DOI 10.1285/i20705948v5n3p452 2012 Università del Salento http://siba-ese.unile.it/index.php/ejasa/index
More informationNotes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing
Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing 1. Purpose of statistical inference Statistical inference provides a means of generalizing
More information1 Motivation for Instrumental Variable (IV) Regression
ECON 370: IV & 2SLS 1 Instrumental Variables Estimation and Two Stage Least Squares Econometric Methods, ECON 370 Let s get back to the thiking in terms of cross sectional (or pooled cross sectional) data
More informationApplied Statistics and Econometrics
Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple
More informationTest for Discontinuities in Nonparametric Regression
Communications of the Korean Statistical Society Vol. 15, No. 5, 2008, pp. 709 717 Test for Discontinuities in Nonparametric Regression Dongryeon Park 1) Abstract The difference of two one-sided kernel
More informationTHE EVALUATION OF LINEAR FORMS P. S. DWYER
PSYCHOMETRIKA~VOL. 6, NO. 6 DECEMBER. 1941 THE EVALUATION OF LINEAR FORMS P. S. DWYER UNIVERSITY OF MICHIGAN This presentation deals with the evaluation and transformation of linear forms. Especial emphasis
More informationControlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded
Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded 1 Background Latent confounder is common in social and behavioral science in which most of cases the selection mechanism
More informationPlausible Values for Latent Variables Using Mplus
Plausible Values for Latent Variables Using Mplus Tihomir Asparouhov and Bengt Muthén August 21, 2010 1 1 Introduction Plausible values are imputed values for latent variables. All latent variables can
More informationStudy of the Relationship between Dependent and Independent Variable Groups by Using Canonical Correlation Analysis with Application
Modern Applied Science; Vol. 9, No. 8; 2015 ISSN 1913-1844 E-ISSN 1913-1852 Published by Canadian Center of Science and Education Study of the Relationship between Dependent and Independent Variable Groups
More informationPsychology 454: Latent Variable Modeling How do you know if a model works?
Psychology 454: Latent Variable Modeling How do you know if a model works? William Revelle Department of Psychology Northwestern University Evanston, Illinois USA November, 2012 1 / 18 Outline 1 Goodness
More information1/11/2011. Chapter 4: Variability. Overview
Chapter 4: Variability Overview In statistics, our goal is to measure the amount of variability for a particular set of scores, a distribution. In simple terms, if the scores in a distribution are all
More informationIntroduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p.
Preface p. xi Introduction and Descriptive Statistics p. 1 Introduction to Statistics p. 3 Statistics, Science, and Observations p. 5 Populations and Samples p. 6 The Scientific Method and the Design of
More informationSequential Reliability Tests Mindert H. Eiting University of Amsterdam
Sequential Reliability Tests Mindert H. Eiting University of Amsterdam Sequential tests for a stepped-up reliability estimator and coefficient alpha are developed. In a series of monte carlo experiments,
More informationAnalysis of propensity score approaches in difference-in-differences designs
Author: Diego A. Luna Bazaldua Institution: Lynch School of Education, Boston College Contact email: diego.lunabazaldua@bc.edu Conference section: Research methods Analysis of propensity score approaches
More information* * MATHEMATICS 4732 Probability & Statistics 1 ADVANCED SUBSIDIARY GCE. Wednesday 21 January 2009 Afternoon. Duration: 1 hour 30 minutes.
ADVANCED SUBSIDIARY GCE MATHEMATICS 4732 Probability & Statistics 1 Candidates answer on the Answer Booklet OCR Supplied Materials: 8 page Answer Booklet List of Formulae (MF1) Other Materials Required:
More informationPaper Reference R. Statistics S4 Advanced/Advanced Subsidiary. Friday 21 June 2013 Morning Time: 1 hour 30 minutes
Centre No. Candidate No. Paper Reference(s) 6686/01R Edexcel GCE Statistics S4 Advanced/Advanced Subsidiary Friday 21 June 2013 Morning Time: 1 hour 30 minutes Materials required for examination Mathematical
More informationRegression-Discontinuity Analysis
Page 1 of 11 Home» Analysis» Inferential Statistics» Regression-Discontinuity Analysis Analysis Requirements The basic RD Design is a two-group pretestposttest model as indicated in the design notation.
More informationProblems with Stepwise Procedures in. Discriminant Analysis. James M. Graham. Texas A&M University
Running Head: PROBLEMS WITH STEPWISE IN DA Problems with Stepwise Procedures in Discriminant Analysis James M. Graham Texas A&M University 77842-4225 Graham, J. M. (2001, January). Problems with stepwise
More informationThe variable θ is called the parameter of the model, and the set Ω is called the parameter space.
Lecture 8 What is a statistical model? A statistical model for some data is a set of distributions, one of which corresponds to the true unknown distribution that produced the data. The variable θ is called
More informationMultiple Regression Analysis. Part III. Multiple Regression Analysis
Part III Multiple Regression Analysis As of Sep 26, 2017 1 Multiple Regression Analysis Estimation Matrix form Goodness-of-Fit R-square Adjusted R-square Expected values of the OLS estimators Irrelevant
More information1 Procedures robust to weak instruments
Comment on Weak instrument robust tests in GMM and the new Keynesian Phillips curve By Anna Mikusheva We are witnessing a growing awareness among applied researchers about the possibility of having weak
More informationSTRUCTURAL EQUATION MODELING. Khaled Bedair Statistics Department Virginia Tech LISA, Summer 2013
STRUCTURAL EQUATION MODELING Khaled Bedair Statistics Department Virginia Tech LISA, Summer 2013 Introduction: Path analysis Path Analysis is used to estimate a system of equations in which all of the
More informationComparative Bayesian Confirmation and the Quine Duhem Problem: A Rejoinder to Strevens Branden Fitelson and Andrew Waterman
Brit. J. Phil. Sci. 58 (2007), 333 338 Comparative Bayesian Confirmation and the Quine Duhem Problem: A Rejoinder to Strevens Branden Fitelson and Andrew Waterman ABSTRACT By and large, we think (Strevens
More information