Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded
Background

Latent confounders are common in the social and behavioral sciences, where in most cases the selection mechanism is neither fully known nor perfectly measured. Measuring a latent confounder requires multiple indicators to enhance both the reliability and the validity of the measurement. However, conditioning on many covariates may cause estimation problems or inefficiency. In this paper, we investigate how confirmatory factor analysis (CFA) can be used to control for selection bias caused by a latent confounding variable.

Methods for Latent Confounding Control

Confirmatory Factor Analysis (CFA)

Confirmatory factor analysis (CFA) incorporates subjective knowledge of the latent structure, the observed measurements, and the measurement errors to estimate latent variables (Bollen, 1989). CFA derives the latent variable as a linear combination of independent observed variables. The factor loadings indicate the relative importance of each indicator to the latent variable. Factor scores are composite scores that provide information about each unit's predicted placement on the latent factors. In this paper, we use Bartlett scores, which have the advantage of producing unbiased estimates of the true factor scores (Hershberger, 2005).

Data-Based Covariate Selection

We define covariate selection as the process by which a subset of the covariate set X is identified with the aim of satisfying ignorability. We adopt the target covariate subset from the theoretical framework of de Luna et al. (2011) that retains outcome-only predictors, which increases efficiency (Brookhart et al., 2006). To implement conditional independence testing, we used the max-min parents and children algorithm for Gaussian Bayesian networks (GBN) to select target covariates (Scutari, 2010).

Hybrid Approach and Kitchen Sink Approach

The hybrid approach is a combination of covariate selection and factor analysis insofar as it involves conditioning on the covariate set selected by covariate selection together with the estimated factor score.
The rationale for developing this approach is that it avoids the information loss of factor analysis and simultaneously prevents bias caused by omitting important covariates missed by covariate selection, while still allowing for dimension reduction of the covariate space. The kitchen sink approach includes all measured pre-treatment covariates without any pre-processing; it serves as the benchmark for comparison in our study.

Research Questions

The three motivating questions for this simulation study are as follows.
1. Is it possible for confirmatory factor analysis (CFA) to successfully control for selection bias caused by latent confounders? Furthermore, how do sample size, factor structure, and loadings contribute?
2. How does CFA compare to other methods with respect to bias and efficiency?
3. What are sufficient conditions to successfully implement CFA for latent confounding control?

Monte Carlo Simulation Design

The data-generating process was motivated by the model specification for CFA. Data were generated as follows:

\[
\begin{aligned}
F_1 &\sim N(0, 1) \\
X_i &= a_i F_1 + \varepsilon_i, \quad i = 1, 2, \dots, 10 \\
\log\left(\frac{PS}{1 - PS}\right) &= b_0 + b_1 F_1 \\
Z &\sim \mathrm{Bernoulli}(PS) \\
Y_1 &= \beta_I \; \beta_1 F_1 + \varepsilon_1 \\
Y_0 &= \beta_I + \beta_1 F_1 + \varepsilon_0
\end{aligned}
\]

where \(\varepsilon_i \sim N(0, \sqrt{1 - a_i^2})\) for \(i = 1, \dots, 10\) in the indicator equations and \(\varepsilon_1, \varepsilon_0 \sim N(0, 1)\) in the outcome equations. See Figure 1 for a graphical representation. The confounding coefficients are set as follows: \(b_0 = 0\), \(b_1 = 2\) and \(\beta_0 = 0\), \(\beta_1 = 2\).

The simulation design factors are:

Sample sizes: 100, 500, 1000.

Strengths of factor loadings:
(1) Strong vs. weak: [\(a_1 = 0.9, a_2 = 0.1, \dots, a_{10} = 0.1\)], [\(a_1 = 0.9, a_2 = 0.9, \dots, a_{10} = 0.1\)], ..., [\(a_1 = 0.9, a_2 = 0.9, \dots, a_{10} = 0.9\)].
(2) Medium vs. weak: [\(a_1 = 0.5, a_2 = 0.1, \dots, a_{10} = 0.1\)], [\(a_1 = 0.5, a_2 = 0.5, \dots, a_{10} = 0.1\)], ..., [\(a_1 = 0.5, a_2 = 0.5, \dots, a_{10} = 0.5\)].
(3) Small vs. weak: [\(a_1 = 0.3, a_2 = 0.1, \dots, a_{10} = 0.1\)], [\(a_1 = 0.3, a_2 = 0.3, \dots, a_{10} = 0.1\)], ..., [\(a_1 = 0.3, a_2 = 0.3, \dots, a_{10} = 0.3\)].

The analysis factor is based on the four different approaches to dealing with a latent confounder: covariate selection, confirmatory factor analysis, the hybrid method, and the kitchen sink approach. The primary simulation outcomes are the bias and MSE of the regression estimate of the treatment effect. We also quantify the number of covariates selected by the covariate selection method. For each cell of the study, 100 replications were run.

Simulation Results

Results of the simulation study are reported in Tables 1 to 9. The true value of the treatment effect was 2 units (cf. Figure 1). The overall sample size, which was varied from 100 to 500 to 1000, did not have an appreciable influence on bias reduction.

There is a clear interaction between the strength of the strong factor loadings and the number of strong factor loadings. When the strong loadings are all set to 0.3 (Table 4), the bias ranges from about 2.3 (or 115% of the treatment effect), when none of the indicators is strong, to about 1.5 (75%), when all of them are. It is worth noting that a standardized loading of 0.3 is a commonly used cutoff for retaining an indicator as important in exploratory factor analysis. When the strong loadings are set to 0.9 (Table 6), the bias ranges from about 2.3 (115%) to about 0.1 (5%). Thus, regardless of method, full (or nearly full) bias reduction is only possible when (a) all indicators are strongly related to the latent factor and (b) the strengths of those relationships are very high.

Dimension reduction results for covariate selection are summarized graphically in Figures 2 to 4. As expected, the covariate selection method tended to select fewer indicators when fewer had large loadings, and more indicators when more were generated with large loadings.
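To make the simulation design and its outcome measures concrete, the sketch below draws data from the generating process and summarizes the bias and MSE of an unadjusted difference in means over 100 replications. It is an illustration under stated assumptions, not the code used for the study. The function names, the seed, and the chosen cell (n = 500, all loadings 0.1) are hypothetical. The outcome is assumed to be Y = tau*Z + beta0 + beta1*F1 + error with a constant treatment effect tau = 2 (the true effect stated in the results), and the indicator errors are assumed to have variance 1 - a_i^2 so that each indicator has unit variance. The unadjusted estimator is not one of the four methods compared; it only shows the confounding bias that those methods try to remove.

```python
import math
import random
import statistics


def simulate(n, loadings, rng, tau=2.0, b0=0.0, b1=2.0, beta0=0.0, beta1=2.0):
    """Draw one data set of n units; returns a list of (x, z, y) tuples.

    Assumptions (not spelled out in the text): a constant additive treatment
    effect tau, and indicator errors with variance 1 - a_i**2.
    """
    data = []
    for _ in range(n):
        f = rng.gauss(0.0, 1.0)                       # latent confounder F1
        x = [a * f + rng.gauss(0.0, math.sqrt(1.0 - a * a)) for a in loadings]
        ps = 1.0 / (1.0 + math.exp(-(b0 + b1 * f)))   # propensity score
        z = 1 if rng.random() < ps else 0             # treatment indicator
        y = beta0 + tau * z + beta1 * f + rng.gauss(0.0, 1.0)
        data.append((x, z, y))
    return data


def unadjusted_estimate(data):
    """Difference in mean outcomes, treated minus control (no adjustment)."""
    treated = [y for _, z, y in data if z == 1]
    control = [y for _, z, y in data if z == 0]
    return statistics.fmean(treated) - statistics.fmean(control)


def bias_and_mse(estimates, truth):
    """Monte Carlo bias and mean squared error of a list of estimates."""
    errors = [est - truth for est in estimates]
    bias = statistics.fmean(errors)
    mse = statistics.fmean([e * e for e in errors])
    return bias, mse


rng = random.Random(2024)          # hypothetical seed
loadings = [0.1] * 10              # hypothetical cell: no strong indicators
estimates = [unadjusted_estimate(simulate(500, loadings, rng)) for _ in range(100)]
bias, mse = bias_and_mse(estimates, truth=2.0)
print(f"bias = {bias:.2f}, MSE = {mse:.2f}")
```

Replacing `unadjusted_estimate` with a regression that conditions on the indicators or on a factor score would give the per-method bias and MSE for one cell of the design.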
Any differences among the four methods were relatively minor in comparison to differences due to the magnitudes of the factor loadings. Finally, we note that the hybrid approach performed well across all conditions.

Conclusions

Our first conclusion is that factor scores from a confirmatory factor analysis may be used to reduce the dimension of a set of manifest indicators without a detrimental loss in capacity for bias reduction. Here we underscore the point that factor analysis reduced the dimension of the indicator space from ten down to one, whereas the other methods used either all ten indicators or some number selected by GBN, typically between five and ten. In practice, to reduce the dimension of indicators, researchers often take sum scores of inventories that are not meant to be summed. Our results show that it may be acceptable to use factor scores instead.
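As a rough illustration of the factor-score idea behind the first conclusion, the sketch below computes Bartlett scores for a one-factor model and checks by simulation that they are conditionally unbiased for the true factor value. It uses the standard Bartlett (weighted least squares) formula, which for one factor reduces to a ratio of two weighted sums. The loadings are treated as known for simplicity; in an application they would be estimated by fitting the CFA model. The function name, the loading pattern, the fixed factor value, and the seed are hypothetical, and unit-variance indicators are assumed.

```python
import math
import random
import statistics


def bartlett_score(x, loadings, uniquenesses):
    """Bartlett (weighted least squares) factor score for one unit.

    One-factor case of (L' Psi^-1 L)^-1 L' Psi^-1 x, with loadings L and
    diagonal unique variances Psi.
    """
    num = sum(a * xi / psi for a, xi, psi in zip(loadings, x, uniquenesses))
    den = sum(a * a / psi for a, psi in zip(loadings, uniquenesses))
    return num / den


rng = random.Random(7)                          # hypothetical seed
loadings = [0.9, 0.9, 0.9] + [0.1] * 7          # three strong, seven weak indicators
uniquenesses = [1.0 - a * a for a in loadings]  # unit-variance indicators (assumed)

true_f = 1.5  # hold the latent value fixed and redraw only the measurement errors
scores = []
for _ in range(2000):
    x = [a * true_f + rng.gauss(0.0, math.sqrt(psi))
         for a, psi in zip(loadings, uniquenesses)]
    scores.append(bartlett_score(x, loadings, uniquenesses))  # ten indicators -> one score

mean_score = statistics.fmean(scores)
print(f"mean Bartlett score for F = {true_f}: {mean_score:.3f}")
```

Because the weights are proportional to loading over unique variance, weakly loading indicators contribute little to the score, which is one way a factor score differs from an unweighted sum score.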
The second conclusion is the importance of working with valid indicators that truly measure the latent construct they purport to measure. These results may serve as a warning for researchers considering using untested and unvalidated items as proxies for latent constructs in observational study settings.
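The role of indicator quality in the second conclusion can be illustrated with a small simulation that adjusts for a Bartlett factor score when all ten indicators load at 0.9 versus 0.3. This is a minimal sketch under assumptions rather than a reproduction of the study: loadings are treated as known, the treatment effect is assumed constant at 2, the intercepts are set to zero as in the stated coefficient values, and the treatment coefficient comes from the closed-form OLS solution for a regression of Y on Z and the score. Function names, the seed, and the number of replications are hypothetical.

```python
import math
import random
import statistics


def cov(u, v):
    """Sample covariance (divisor n; the divisor cancels in the OLS ratio)."""
    mu, mv = statistics.fmean(u), statistics.fmean(v)
    return statistics.fmean([(a - mu) * (b - mv) for a, b in zip(u, v)])


def effect_adjusting_for_score(n, loading, rng, tau=2.0, b1=2.0, beta1=2.0, k=10):
    """Simulate one data set and return the OLS coefficient on Z from
    regressing Y on Z and the Bartlett score of k equally loaded indicators."""
    psi = 1.0 - loading ** 2                     # unique variance (assumed)
    z, s, y = [], [], []
    for _ in range(n):
        f = rng.gauss(0.0, 1.0)                  # latent confounder
        x = [loading * f + rng.gauss(0.0, math.sqrt(psi)) for _ in range(k)]
        # Bartlett score; with equal loadings it reduces to mean(x) / loading.
        s.append(sum(loading * xi / psi for xi in x) / (k * loading ** 2 / psi))
        zi = 1 if rng.random() < 1.0 / (1.0 + math.exp(-b1 * f)) else 0
        z.append(zi)
        y.append(tau * zi + beta1 * f + rng.gauss(0.0, 1.0))
    # Closed-form partial regression coefficient of Y on Z given the score S.
    num = cov(z, y) * cov(s, s) - cov(s, y) * cov(z, s)
    den = cov(z, z) * cov(s, s) - cov(z, s) ** 2
    return num / den


rng = random.Random(11)                          # hypothetical seed
results = {}
for loading in (0.9, 0.3):
    estimates = [effect_adjusting_for_score(500, loading, rng) for _ in range(50)]
    results[loading] = statistics.fmean(estimates) - 2.0   # Monte Carlo bias
    print(f"loading = {loading}: bias = {results[loading]:.2f}")
```

In this sketch the factor score from weakly loading indicators is a noisy proxy for the confounder, so conditioning on it removes only part of the bias.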
References

Angrist, J. D., & Pischke, J.-S. (2009). Mostly harmless econometrics: An empiricist's companion. Princeton: Princeton University Press.
Beck, A. T., Steer, R. A., & Carbin, M. G. (1988). Psychometric properties of the Beck Depression Inventory: Twenty-five years of evaluation. Clinical Psychology Review, 8(1),
Brookhart, M. A., Schneeweiss, S., Rothman, K. J., Glynn, R. J., Avorn, J., & Stürmer, T. (2006). Variable selection for propensity score models. American Journal of Epidemiology, 163(12),
Bollen, K. A. (1989). Structural equations with latent variables. New York: Wiley.
DiStefano, C., Zhu, M., & Mindrila, D. (2009). Understanding and using factor scores: Considerations for the applied researcher. Practical Assessment, Research & Evaluation, 14(20),
De Luna, X., Waernbaum, I., & Richardson, T. S. (2011). Covariate selection for the nonparametric estimation of an average treatment effect. Biometrika, 98(4),
Hershberger, S. L. (2005). Factor scores. In B. S. Everitt & D. C. Howell (Eds.), Encyclopedia of Statistics in Behavioral Science (pp ). New York: John Wiley.
Kaplan, D. (1999). An extension of the propensity score adjustment method for the analysis of group differences in MIMIC models. Multivariate Behavioral Research, 34(4),
Kupek, E. (2013). Detection of unknown confounders by Bayesian confirmatory factor analysis. Advanced Studies in Medical Sciences, 1(3),
Pearl, J., & Verma, T. (1991). A theory of inferred causation. KR, 91,
Rosenbaum, P. R., & Rubin, D. B. (1983). The central role of the propensity score in observational studies for causal effects. Biometrika, 70,
Rubin, D. B. (1978). Bayesian inference for causal effects: The role of randomization. Annals of Statistics, 6,
Rubin, D. B. (1980). Randomization analysis of experimental data: The Fisher randomization test comment. Journal of the American Statistical Association, 75,
Rubin, D. B. (1990). Formal models of statistical inference for causal effects. Journal of Statistical Planning and Inference, 25,
Rubin, D. B., & Thomas, N. (1996). Matching using estimated propensity scores: Relating theory to practice. Biometrics,
Steiner, P. M., Cook, T. D., & Shadish, W. R. (2011). On the importance of reliable covariate measurement in selection bias adjustments using propensity scores. Journal of Educational and Behavioral Statistics, 36(2),
Schafer, J., & Kang, J. (2008). Average causal effects from nonrandomized studies: A practical guide and simulated example. Psychological Methods, 13,
Scutari, M. (2010). bnlearn: Bayesian network structure learning. R package.
Schneeweiss, S., Rassen, J. A., Glynn, R. J., Avorn, J., Mogun, H., & Brookhart, M. A. (2009). High-dimensional propensity score adjustment in studies of treatment effects using health care claims data. Epidemiology (Cambridge, Mass.), 20(4), 512.
Shortreed, S. M., & Ertefaie, A. (2017). Outcome-adaptive lasso: Variable selection for causal inference. Biometrics, 73(4),
Figure 1. Data-generating process for the simulation study; F1 is the latent confounder; the a's are the factor loadings for each construct.

Figure 2. Median number of covariates selected when sample size n = 100; Good Index is the number of high-loading items included. (Axes: Good Index; median number of covariates selected.)

Figure 3. Median number of covariates selected by the covariate selection method when n = 500; Good Index is the number of high-loading items included. (Axes: Good Index; median number of covariates selected.)

Figure 4. Median number of covariates selected by the covariate selection method when n = 1000; Good Index is the number of high-loading items included. (Axes: Good Index; median number of covariates selected.)
Small Sample Size n=100

Table 1: Bias, standard deviation (S.D.), and MSE of the estimated treatment effect with sample size = 100; Good Index is the number of measures with factor loading = 0.3 included.

Table 2: Bias, standard deviation (S.D.), and MSE of the estimated treatment effect with sample size = 100; Good Index is the number of measures with factor loading = 0.5 included.

Table 3: Bias, standard deviation (S.D.), and MSE of the estimated treatment effect with sample size = 100; Good Index is the number of measures with factor loading=0. included.

Medium Sample Size n=500

Table 4: Bias, standard deviation (S.D.), and MSE of the estimated treatment effect with sample size = 500; Good Index is the number of measures with factor loading = 0.3 included.

Table 5: Bias, standard deviation (S.D.), and MSE of the estimated treatment effect with sample size = 500; Good Index is the number of measures with factor loading = 0.5 included.

Table 6: Bias, standard deviation (S.D.), and MSE of the estimated treatment effect with sample size = 500; Good Index is the number of measures with factor loading = 0.9 included.

Large Sample Size n=1000

Table 7: Bias, standard deviation (S.D.), and MSE of the estimated treatment effect with sample size = 1000; Good Index is the number of measures with factor loading = 0.3 included.

Table 8: Bias, standard deviation (S.D.), and MSE of the estimated treatment effect with sample size = 1000; Good Index is the number of measures with factor loading = 0.5 included.

Table 9: Bias, standard deviation (S.D.), and MSE of the estimated treatment effect with sample size = 1000; Good Index is the number of measures with factor loading = 0.9 included.

In all tables, the columns are Methods, Good Index, Bias, S.D., and MSE. Good Index = 0 means factor loading = 0.1. Methods are the different approaches to dealing with latent confounding: All is the kitchen sink approach, Hybrid is the hybrid approach, Select is the covariate selection approach, and Factor is the factor analysis approach.
More informationCausal Mechanisms Short Course Part II:
Causal Mechanisms Short Course Part II: Analyzing Mechanisms with Experimental and Observational Data Teppei Yamamoto Massachusetts Institute of Technology March 24, 2012 Frontiers in the Analysis of Causal
More informationAssess Assumptions and Sensitivity Analysis. Fan Li March 26, 2014
Assess Assumptions and Sensitivity Analysis Fan Li March 26, 2014 Two Key Assumptions 1. Overlap: 0
More informationControlling for overlap in matching
Working Papers No. 10/2013 (95) PAWEŁ STRAWIŃSKI Controlling for overlap in matching Warsaw 2013 Controlling for overlap in matching PAWEŁ STRAWIŃSKI Faculty of Economic Sciences, University of Warsaw
More informationVector-Based Kernel Weighting: A Simple Estimator for Improving Precision and Bias of Average Treatment Effects in Multiple Treatment Settings
Vector-Based Kernel Weighting: A Simple Estimator for Improving Precision and Bias of Average Treatment Effects in Multiple Treatment Settings Jessica Lum, MA 1 Steven Pizer, PhD 1, 2 Melissa Garrido,
More informationIntroduction To Confirmatory Factor Analysis and Item Response Theory
Introduction To Confirmatory Factor Analysis and Item Response Theory Lecture 23 May 3, 2005 Applied Regression Analysis Lecture #23-5/3/2005 Slide 1 of 21 Today s Lecture Confirmatory Factor Analysis.
More informationPath Analysis. PRE 906: Structural Equation Modeling Lecture #5 February 18, PRE 906, SEM: Lecture 5 - Path Analysis
Path Analysis PRE 906: Structural Equation Modeling Lecture #5 February 18, 2015 PRE 906, SEM: Lecture 5 - Path Analysis Key Questions for Today s Lecture What distinguishes path models from multivariate
More informationWhen Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?