Imprecise Measurement Error Models Towards a More Reliable Analysis of Messy Data

Similar documents
Measurement error modeling. Department of Statistical Sciences Università degli Studi Padova

Augustin: Some Basic Results on the Extension of Quasi-Likelihood Based Measurement Error Correction to Multivariate and Flexible Structural Models

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,

Measurement error as missing data: the case of epidemiologic assays. Roderick J. Little

Eric Shou Stat 598B / CSE 598D METHODS FOR MICRODATA PROTECTION

Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems

Analysing geoadditive regression data: a mixed model approach

Regression with Imprecise Data: A Robust Approach

Plausible Values for Latent Variables Using Mplus

Econometric Analysis of Cross Section and Panel Data

COMPARISON OF GMM WITH SECOND-ORDER LEAST SQUARES ESTIMATION IN NONLINEAR MODELS. Abstract

Christopher Dougherty London School of Economics and Political Science

A general mixed model approach for spatio-temporal regression data

Modelling geoadditive survival data

Chapter 3: Maximum Likelihood Theory

State-space Model. Eduardo Rossi University of Pavia. November Rossi State-space Model Fin. Econometrics / 53

Consequences of measurement error. Psychology 588: Covariance structure and factor models

Measurement Error and Linear Regression of Astronomical Data. Brandon Kelly Penn State Summer School in Astrostatistics, June 2007

Econometrics. Week 6. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

6. Assessing studies based on multiple regression

A Course on Advanced Econometrics

Least Squares Estimation-Finite-Sample Properties

Estimation for two-phase designs: semiparametric models and Z theorems

International Journal of Approximate Reasoning

the error term could vary over the observations, in ways that are related

Previous lecture. P-value based combination. Fixed vs random effects models. Meta vs. pooled- analysis. New random effects testing.

A weighted simulation-based estimator for incomplete longitudinal data models

Confirmatory Factor Analysis: Model comparison, respecification, and more. Psychology 588: Covariance structure and factor models

GENERALIZED LINEAR MIXED MODELS AND MEASUREMENT ERROR. Raymond J. Carroll: Texas A&M University

A better way to bootstrap pairs

Made available courtesy of Elsevier:

Lecture Notes 1: Decisions and Data. In these notes, I describe some basic ideas in decision theory. theory is constructed from

BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY

STATS 200: Introduction to Statistical Inference. Lecture 29: Course review

Econometrics of Panel Data

Statistics - Lecture One. Outline. Charlotte Wickham 1. Basic ideas about estimation

Linear Models 1. Isfahan University of Technology Fall Semester, 2014

Discussion of Papers on the Extensions of Propensity Score

"ZERO-POINT" IN THE EVALUATION OF MARTENS HARDNESS UNCERTAINTY

Non-Gaussian Berkson Errors in Bioassay

σ(a) = a N (x; 0, 1 2 ) dx. σ(a) = Φ(a) =

MA/ST 810 Mathematical-Statistical Modeling and Analysis of Complex Systems

Streamlining Missing Data Analysis by Aggregating Multiple Imputations at the Data Level

For more information about how to cite these materials visit

VCMC: Variational Consensus Monte Carlo

GWAS IV: Bayesian linear (variance component) models

Syllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools

A review of some semiparametric regression models with application to scoring

Problem Set 3: Bootstrap, Quantile Regression and MCMC Methods. MIT , Fall Due: Wednesday, 07 November 2007, 5:00 PM

Simple Sensitivity Analysis for Differential Measurement Error. By Tyler J. VanderWeele and Yige Li Harvard University, Cambridge, MA, U.S.A.

ESTIMATION OF NONLINEAR BERKSON-TYPE MEASUREMENT ERROR MODELS

Measurement Error in Covariates

Working Paper No Maximum score type estimators

More on Specification and Data Issues

Combining multiple observational data sources to estimate causal eects

Bayesian Statistical Methods. Jeff Gill. Department of Political Science, University of Florida

The Economics of European Regions: Theory, Empirics, and Policy

Nonparametric Identi cation of Regression Models Containing a Misclassi ed Dichotomous Regressor Without Instruments

Multilevel Statistical Models: 3 rd edition, 2003 Contents

Least Squares Estimation of a Panel Data Model with Multifactor Error Structure and Endogenous Covariates

Linear discriminant functions

FORECASTING STANDARDS CHECKLIST

Multistate Modeling and Applications

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares

Non-linear panel data modeling

ECON2228 Notes 8. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 35

Nonparametric Identi cation of Regression Models Containing a Misclassi ed Dichotomous Regressor Without Instruments

Robust Monte Carlo Methods for Sequential Planning and Decision Making

To appear in The American Statistician vol. 61 (2007) pp

Statistical Models with Uncertain Error Parameters (G. Cowan, arxiv: )

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables

What s New in Econometrics. Lecture 1

FAILURE-TIME WITH DELAYED ONSET

Linear Models in Machine Learning

Bayesian model selection: methodology, computation and applications

Theory of Maximum Likelihood Estimation. Konstantin Kashin

Introduction to Econometrics

1 Motivation for Instrumental Variable (IV) Regression

Internal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9.

Introduction to Econometrics. Assessing Studies Based on Multiple Regression

A note on multiple imputation for general purpose estimation

Review of Econometrics

Comparing latent inequality with ordinal health data

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

6 Pattern Mixture Models

POSTERIOR ANALYSIS OF THE MULTIPLICATIVE HETEROSCEDASTICITY MODEL

Model Selection for Semiparametric Bayesian Models with Application to Overdispersion

Specification Test for Instrumental Variables Regression with Many Instruments

Statistical Methods in Particle Physics Lecture 1: Bayesian methods

Flexible Estimation of Treatment Effect Parameters

Equivalence of random-effects and conditional likelihoods for matched case-control studies

Bootstrapping Sensitivity Analysis

Bayesian Interpretations of Heteroskedastic Consistent Covariance Estimators Using the Informed Bayesian Bootstrap

Issues on quantile autoregression

LECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity

G. S. Maddala Kajal Lahiri. WILEY A John Wiley and Sons, Ltd., Publication

Prequential Analysis

Imputation of rounded data

The regression model with one stochastic regressor (part II)

Probabilistic Machine Learning

Transcription:

Imprecise Measurement Error Models Towards a More Reliable Analysis of Messy Data Thomas Augustin Department of Statistics, Ludwig-Maximilians Universität (LMU) Munich, Germany Thomas Augustin, LMU Munich 22 June 2010 1

Sociology Empirical social research idealized data Statistics Thomas Augustin, LMU Munich 22 June 2010 2

Sociology Empirical social research idealized data Statistics Imprecise Probabilities imprecise models robust statistics decision theory Thomas Augustin, LMU Munich 22 June 2010 3

Sociology Empirical social research idealized data Measurement Error Modelling imprecise data correction under precise deficiency models Statistics Imprecise Probabilities imprecise models robust statistics decision theory Thomas Augustin, LMU Munich 22 June 2010 4

Sociology Empirical social research idealized data non-indealized data Measurement Error Modelling imprecise data correction under precise deficiency models Statistics Partial identification Imprecise Probabilities imprecise models imprecise deficiency models robust statistics decision theory Thomas Augustin, LMU Munich 22 June 2010 5

Sketch of the Argument Omnipresence of measurement error Severe bias in statistical analysis when neglecting it Powerful correction methods based on the classical model of testing theory construct unbiased estimating functions zero expectation consistency The underlying assumptions are very restrictive, and rarely satisfied in social surveys Relax assumptions: imprecise measurement error model construct unbiased sets of estimating functions zero expectation for one element of the credal set credal consistency Thomas Augustin, LMU Munich 22 June 2010 6

Outline 1. Measurement Error 2. Measurement Error Correction based on Precise Error Models 2.1 Classical Measurement Error Modelling 2.2 Unbiased Estimating Equations and Corrected Score Functions for Classical Measurement Error 3. Overcoming the Dogma of Precision in Deficiency Models 3.1 Credal Deficiency Models as Imprecise Measurement Error Models 3.2 Credal Consistency of Set-Valued Estimators 3.3 Minimal and Complete Sets of Unbiased Estimating Functions under Imprecise Measurement Error Models Thomas Augustin, LMU Munich 22 June 2010 7

1. Measurement Error Thomas Augustin, LMU Munich 22 June 2010 8

1. Measurement Error Applied Statistics (statistical modelling): Learning from data by sophisticated models Complex relationships between variables dependent independent effects variable Y i variable X i data inference data Thomas Augustin, LMU Munich 22 June 2010 9

Often the relationship between variables and data is complex, too: * Often variables of interest (gold standard) are not ascertainable. * Only proxy variables (surrogates) are available instead. Thomas Augustin, LMU Munich 22 June 2010 10

dependent independent effects variable Y i variable X i error model error model proxy variable Y i proxy variable X i data inference data * -Notation (here) X, Z : (unobservable) variable, gold standard X, Z : corresponding possibly incorrect measurements analogously: Y, Y and T, T Thomas Augustin, LMU Munich 22 June 2010 11

Typical examples: Measurement Error Error-prone measurements of true quantities * error in technical devices * indirect measurement * response effects * use of aggregated quantities, averaged values, imputation, rough estimates etc. * anonymization of data by deliberate contamination Measured indicators of complex constructs; latent variables * long term quantities: long term protein intake, long term blood pressure * permanent income * importance of a patent * extent of motivation, degree of costumer satisfaction * severeness of undernutrition Thomas Augustin, LMU Munich 22 June 2010 12

Note: Measurement error and misclassification are not just a matter of sloppiness. Latent variables are eo ipso not exactly measurable. Almost all economic variables are measured with error. [...] Unfortunately, the statistical consequences of errors in explanatory variables are severe. (Davidson and Mackinnon (1993), Estimation and Inference in Econometrics.) In nonlinear models, the later statement may apply (!?) to the dependent variable, too. (Dependence on the DGP: Torelli & Trivellato (1993, J. Econometrics)) Thomas Augustin, LMU Munich 22 June 2010 13

The triple whammy effect of measurement error Carroll, Ruppert, Stefanski, Crainiceanu (2006, Chap.H.) bias masking of features loss of power classical error: attenuation Thomas Augustin, LMU Munich 22 June 2010 14

2. Measurement Error Correction based on Precise Error Models Thomas Augustin, LMU Munich 22 June 2010 15

2.1. Classical Measurement Error Modelling Terminology continuous variables measurement error error U i classical Berkson Rounding Anonymization meas. error error Heaping Techniques X i U i Xi U i discrete variables misclassification Thomas Augustin, LMU Munich 22 June 2010 16

The Classical Model of Testing Theory Measurement = True Value + Error X i [j] = X i [j] + U i [j], i = 1,..., n, j = 1,..., p Assumptions on the distribution E(U i ) = 0 [A1.1] V ar(u i ) = σi 2 ( σ) [A1.2] U i N(0, σ 2 ) [A1.3] Independence Assumptions (Uncorrelatedness) U i [j] X i [j] [A2.1] U i1 [j] U i2 [j] i 1 i 2 [A2.2] U i [j 1 ] U i [j 2 ] j 1 j 2 [A2.3] U i1 [j 1 ] X i2 [j 2 ] i 1 i 2 ; j 1 j 2 [A2.4] Thomas Augustin, LMU Munich 22 June 2010 17

2.2 Unbiased Estimating Equations and Corrected Score Functions for Classical Measurement Error Thomas Augustin, LMU Munich 22 June 2010 18

Full Bayes-Inference in Flexible Models Sample latent variables in a hierarchial setting: MCMC; graphical modelling Berry, Carroll & Ruppert (2002, JASA). Kneib, Brezger & Crainiceanu (2010, Fahrmeir Fest.) Rummel, Augustin & Küechenhoff (2010, Biometrics) Thomas Augustin, LMU Munich 22 June 2010 19

The Basic Idea: Estimating Functions Idea: Do not investigate estimators directly but the equations producing estimators estimator = root(f unction(observeddata, P arameter)) Estimator is not systematically biased when * in the average this was the right decision, * i.e. when the true value is indeed the root of the expected value of the function Thomas Augustin, LMU Munich 22 June 2010 20

Frame the problem in terms of unbiased estimating functions (score functions) for the parameter ϑ s X (Y; X; ϑ) such that E ϑ (s X (Y; X; ϑ)) = 0 at the true parameter value ϑ Concept contains as special cases * maximum likelihood estimators * least squares estimators in linear regression * quasi-likelihood estimators (McCullagh, 1981, Ann. Stat.; 1990, Cox Fest.) * M-estimators (Huber, 1981, Wiley) ; * Godambe (1991, Oxford UP). * GMM-estimators (Wansbeek & Meijer, 2000, Elsevier) Under mild regularity conditions still * consistency * and asymptotic normality. Thomas Augustin, LMU Munich 22 June 2010 21

For the moment classical covariate measurement error only X i = X i + U i, X i U i. Note that typically, even if E(X ) = E(X) then E((X ) r ) E(X r ), r > 1. Therefore naive estimation by simply replacing X with X, leads in general to E ϑ ( s X (Y; X ; ϑ) ) a > 0, resulting in inconsistent estimators. For instance, ( n E (y i β 0 β 1 Xi ) i=1 ( 1 X i ) ) E ( n i=1 ( 1 (y i β 0 β 1 X i ) Xi ) ) =0 Measurement error correction: Find an estimating function s X (Y, X, ϑ) in the error prone data with E ϑ s X (Y; X ; ϑ) = 0. Thomas Augustin, LMU Munich 22 June 2010 22

The technical argument condensed: on the construction of unbiased estimating equations under measurement error ϑ true parameter value Ideal estimating function: ψ X,Y (X, Y, ϑ) Naive estimating function: ψ sic! X,Y (X, Y, ϑ) Find ψ X,Y (X, Y, ϑ) such that E ϑ (ψ X,Y (X, Y!, ϑ)) = 0 ( ) Idea: use the ideal score function as a building block! Try ψ X,Y (X, Y, ϑ) = f(ψ X,Y (X, Y, ϑ)) for some appropriate f( ) Thomas Augustin, LMU Munich 22 June 2010 23

In general, ψ X,Y ( ), i.e. f( ), can not be determined directly. Note that, since E ϑ ( ψ X,Y )(X, Y, ϑ) ) = 0, ( ) is equivalent to ) E ϑ (ψ X,Y (X, Y (, ϑ) = E ϑ ψ X,Y (X, Y, ϑ) ) Look at the expected difference between ψ X,Y ( ) and ψ X,Y ( ). Try to break ψ X,Y (X, Y, ϑ) into additive pieces, and handle it piece by piece Thomas Augustin, LMU Munich 22 June 2010 24

Typically, ψ( ) has the form ψ(x, Y, ϑ) = 1 n n i=1 ψ i(x i, Y i, ϑ), and there are representations such that, for i = 1,..., n, ψ i (X i, Y i, ϑ) = s g j (X i, Y i, ϑ). j=1 Then try to find f 1 ( ),..., f s ( ) such that E ϑ (f j (g j (X i, Y i, ϑ))) = E ϑ (g j (X i, Y i, ϑ)) ( ) (conditionally/locally) corrected score functions (for covariate measurement error: Nakamura (1990, Biometrika), Stefanski (1989, Comm. Stat. Theory Meth.)) Try to find f 1 ( ),..., f s ( ) such that Eϑ (f j (g j (X i, Y i, ϑ)) X i, Y i ) = g j (X i, Y i, ϑ) ( ), then the law of iterated expectation leads to (**). Thomas Augustin, LMU Munich 22 June 2010 25

(globally or locally) corrected log- Sometimes indirect proceeding: likelihood l X (Y, X, ϑ) with E(l X (Y, X, ϑ) X, Y) = l X (Y, X, ϑ). or E ( ) l X (Y, X, ϑ) = E ( l X (Y, X, ϑ) ). Same techniques as before * piece by piece * globally or locally Under mild regularity conditions unbiased estimating function by taking the derivative with respect to ϑ. Thomas Augustin, LMU Munich 22 June 2010 26

+ Functional method: no (unjustified!?) assumptions on the distr. of X + Successful for generalized linear models, polynomial regression, etc. (Survey: Schneeweiß & Augustin, 2006, ASTA; Hübler & Frohn (eds.); Cox model: Augustin (2004, Scand. J. Stat.)) Numerical difficulties for small samples Handling of transformations (e.g. ln X) complicated or impossible Non-existence of corrected score functions for some models + Extensions to misclassification (Akazawa, Kinukawa, Nakamura, 1998, J. Jap. Stat. Soc.; Zucker, Spiegelman, 2008, Stat. Med.) + Quite general error distribution can often be handled (only existing moment generating function needed); this includes deliberate contamination for privacy protection (Bleninger & Augustin ) + Extension to dependent variable unified understanding of censoring and measurement error (Pötter & Augustin) + Extension to Berkson Error (Wallner & Augustin) + Extension to rounding (Felderer, Müller, Schneeweiß, Wiencierz & Augustin) Thomas Augustin, LMU Munich 22 June 2010 27

3. Overcoming the Dogma of Precision in Deficiency Models Thomas Augustin, LMU Munich 22 June 2010 28

3.1 Credal Deficiency Model as Imprecise Measurement Error Models Thomas Augustin, LMU Munich 22 June 2010 29

Manski s Law of Decreasing Credibility Reliability!? Credibility? The credibility of inference decreases with the strength of the assumptions maintained. (Manski (2003, p. 1)) Thomas Augustin, LMU Munich 22 June 2010 30

Manski s Law of Decreasing Credibility Reliability!? Credibility? The credibility of inference decreases with the strength of the assumptions maintained. (Manski (2003, p. 1)) Identifying assumptions in measurement error models Very strong assumptions needed to ensure identifiability = precise solution Measurement error model must be known precisely - type of error, in particular assumptions on (conditional) independence - independence of true value - independence of other covariates - independence of other measurements - type of error distribution - moments of error distribution validation studies typically not available Thomas Augustin, LMU Munich 22 June 2010 31

Reliable Inference Instead of Overprecision! Make more realistic assumption and let the data speak for themselves! Consider the set of all models that maybe compatible with the data (and then add successively additional assumptions, if desirable) The results may be imprecise, but are more reliable for sure The extent of imprecision is related to the data quality! As a welcome by-product: clarification of the implication of certain assumptions parallel developments (missing data; transfer to measurement error context!) * economics: partial identification: e.g., Manski (2003, Springer) * biometrics: systematic sensitiviy analysis: e.g., Vansteelandt, Goetghebeur, Kenword, Molenberghs (2006, Stat. Sinica) current developments, e.g., * Cheng, Small (2006, JRSSB) * Henmi, Copas, Eguchi (2007, Biometrics) * Stoye (2009, Econometrica) * Gustafson & Greenland (2009, Stat. Science) Kleyer (2009, MSc.); Kunz, Augustin, Küchenhoff (2010, TR) Thomas Augustin, LMU Munich 22 June 2010 32

Credal Deficiency Models Different types of deficiency can be expressed Measurement error problems Misclassification If Y P(Y) {0, 1} : coarsening, rounding, censoring, missing data Outliers credal set: convex set of traditional probability distributions [Y X, ϑ] P Y X,ϑ [Y X, Y ] P Y X,Y or [Y X, Y ] P Y Y,X [X X, Y ] P X X,Y or [X X, Y ] P X X,Y Thomas Augustin, LMU Munich 22 June 2010 33

Credal Estimation Natural idea: sets of traditional models sets of traditional estimators Construct estimators Θ R p, i.e. set of plausible parameter values, appropriately reflecting the ambiguity (non-stochastic uncertainty, ignorance) in the credal set P. Θ small if and only if (!) P is small * Usual point estimator as the border case of precise probabilistic information * Connection to Manski s (2003) identification regions and Vansteelandt, Goetghebeur, Kenward & Molenberghs (Stat Sinica, 2006) ignorance regions. Construction of unbiased sets of estimating functions Credal consistency Thomas Augustin, LMU Munich 22 June 2010 34

3.2 Credal Consistency ( Θ(n)) n N set P ϑ ) if ϑ Θ : R p is called credally consistent (with respect to the credal p P ϑ ( ˆϑ(n) p ) n N ( Θ(n) ) n N : plim n ˆϑ (n) p = ϑ. Thomas Augustin, LMU Munich 22 June 2010 35

3.3 Construction of Credally Consistent Estimators Transfer the framework of unbiased estimating functions A set Ψ of estimating functions is called * unbiased (with respect to the credal set P ϑ ) if for all ϑ: ψ Ψ p ψ,ϑ P ϑ : E pψ,ϑ (Ψ) = 0 * complete (with respect to the credal set P ϑ ) if for all ϑ: p P ϑ ψ p,ϑ Ψ : E p (ψ p,ϑ ) = 0. A complete and unbiased set ψ of estimating functions is called minimal if there is no complete and unbiased set of estimating functions Ψ Ψ. Thomas Augustin, LMU Munich 22 June 2010 36

Construction of Minimal Consistent Estimators Define for some set Ψ of estimating functions { Θ Ψ = ˆϑ ˆϑ } is root of ψ, ψ Ψ. Under the usual regularity conditions (in particular unique root for every ψ) Ψ unbiased and complete Θ Ψ credally consistent Thomas Augustin, LMU Munich 22 June 2010 37

3.4 Examples Imprecise sampling model: neighborhood model P Y X,ϑ around some ideal central distribution p Y X,ϑ Let ψ be an unbiased estimation function for p Y X,ϑ. Then (if well defined) Ψ = { ψ ψ = ψ E p (ψ), p P Y X,ϑ } is unbiased and complete. Imprecise measurement error model, e.g. P X X,Y : Ψ = { ψ ψ is corrected score function for some p P X X,Y } is unbiased and complete. Construction of confidence regeions: * union of traditional confidence regions * can often be improved (Vansteelandt, Goetghebeur, Kenward & Molenberghs (Stat Sinica, 2006), Stoye (2009, Econometrica)). Thomas Augustin, LMU Munich 22 June 2010 38

Sketch of the Argument Omnipresence of measurement error Severe bias in statistical analysis when neglecting it Powerful correction methods based on the classical model of testing theory construct unbiased estimating functions zero expectation consistency The underlying assumptions are very restrictive, and rarely satisfied in social surveys Relax assumptions: imprecise measurement error model construct unbiased sets of estimating functions zero expectation for one element of the credal set credal consistency Thomas Augustin, LMU Munich 22 June 2010 39

dependent independent effects variable Y i variable X i error model error model proxy variable Y i proxy variable X i data inference data Thomas Augustin, LMU Munich 22 June 2010 40

[Y X; ϑ] Y Y ϑ = (β T, ν T ) T X X [Y X, Y ] [X X, Y ] Y X X X y 1,..., y n x 1,..., x n Thomas Augustin, LMU Munich 22 June 2010 41