Analysis of propensity score approaches in difference-in-differences designs

Size: px
Start display at page:

Download "Analysis of propensity score approaches in difference-in-differences designs"

Transcription

1 Author: Diego A. Luna Bazaldua Institution: Lynch School of Education, Boston College Contact Conference section: Research methods Analysis of propensity score approaches in difference-in-differences designs Background. Difference-in-differences designs (DD) focus on the comparison of the mean differences between treatment and control groups after the intervention is implemented with respect to the pre-intervention baseline mean (Somers, Zhu, Jacob, & Bloom, 2013). Former work presented in Abadie (2005) and in Stuart et al. (2014) expresses the DD using the Rubin Causal Model (RCM, 1974) framework to estimate treatment effects, adding the particularity that DD requires longitudinal outcome measures of each treatment and control case before and after the intervention is implemented. Because of the existence of pre- and post-implementation measures, the counterfactual assumption in DD is that, had the intervention not been implemented, both groups would show similar average changes over time (Somers et al., 2013). In this way, the design relies on the assumption of equivalence between treatment and control groups. Nevertheless, it is recognized in observational and quasi-experimental methodological research that when these groups differ with respect to relevant characteristics, the observed changes over time are likely to be explained by these preexisting differences rather than the intervention effect on the outcome (Shadish, Cook, & Campbell, 2002; Rubin, 2005). In the case of longitudinal quasi-experiments such as DD, if there is not guarantee of equivalence between treatment and control groups, alternative hypothesis e.g., maturation effects, differential attrition, or historical events impacting only to one group could explain the observed outcomes (Shadish, 2010; Shadish & Steiner, 2010). Former research has documented approaches to reduce bias in DD, focusing on the composition of the control group and its similarity with respect to the treatment group. For this purpose, the literature describes the use of different propensity scores (PS) approaches to improve the estimation of treatment effects and to establish equivalence between treatment and comparison groups with respect to relevant characteristics (Fortson, Verbitsky-Savitz, Kopa, & Gleason, 2012; Somers et al., 2013; Song et al., 2012; Stuart et al., 2014). The research shows that while some authors propose matching treatment and control cases based on PS estimates of the binary treatment status (e.g. Somers et al., 2013), others have analyzed the role of inverse weights using PS (IWPS) for multiple treatments when four groups are defined for those cases in treatment status or control status before and after the intervention is implemented (Stuart et al., 2014).

2 Purpose. Because some of the research on the use of PS in DD has only been exemplified using empirical data, there is not conclusive evidence about whether the use of IWPS or PS matching provides a more accurate estimation of treatment effects in DD or better balance among covariates. Thus, the present research project focuses on analyzing these PS techniques using different PS estimators in a Monte Carlo study. Methods. The study consisted of four conditions to simulate data resembling a long data arrangement. The common features among conditions were the number units i generated with i = 1,, 500 with four time observations t created for each unit, two within-unit pre-intervention observations and other two post-intervention observations. Standard normally distributed covariates were generated for each unit i, two time-varying covariates X ti with covariance equal to 0.30 and two time-constant covariates W i also with covariance equal to 0.30, the covariance between X ti and W i was set equal to zero. 500 data sets were generated for each condition. The four conditions were established based on the model used to generate the PS for the binary treatment assignment indicator Z ti and the model used to generate the outcome variable Y ti. For the PS, a model with only main effects for the probability of treatment assignment was defined as PZ ti ( 1) 1 { 1 (1 W W X X r )} 1 1i 2 2i 3 1ti 4 2ti ti 1 exp (1) where P(Z ti = 1) is the probability of being assigned to treatment. While four PS are available for each participant i because of the four time points t, only the value of the first pre-intervention PS P(Z 1i = 1) was used to determine treatment assignment by comparing it with a randomly generated number from a standard uniform distribution. The values for the four coefficients β 1, 0.5, 0.35, 0.40 were chosen so a larger proportion of units i were allocated in the control group. The residual term r ti is generated from a normal distribution N(0,2). A second model for the PS included squared terms for the W i and interactions between the X ti and the W i PZ ti ( 1) { 1 ( exp W i W i X ti X ti W i W i W ix ti W ix ti rti )} (2) β where the eight coefficients 1, 0.5, 0.35, 0.4, 0.5, 0.9, 0.8, 1 to have a larger proportion of units i in the control group. were also chosen The first model for the outcome Y ti was Y Z P Z P W W X X W W X X r ti 1 i 2 ti 3 i ti 4 1i 5 2i 6 1ti 7 2ti 8 1i 2i 1ti 2ti ti (3)

3 β 0.5, 0.5, 1, 0.67, 0.3, 0.45, 0.2, 0.02 where the eight coefficients indicate the effect of the assignment to the treatment group Z i, the change from pre- to post-implementation P ti, the DD treatment effect Z i P ti resulting from the outcome change over time for those assigned to the treatment, and the effect of the covariates W i and X ti. A second model for the outcome Y ti was Y Z P Z P W W X X ti 1 i 2 ti 3 i ti 4 1i 5 2i 6 1ti 7 2ti 2 2 8W1 i 9 W2i 10 W1 i X1ti 11W2 i X2ti 12 W1 i W2 i X1ti X2ti rti (3) β 0.5, 0.5, 1, 0.67, 0.3, 0.45, 0.2, 0.5, 0.37, 0.32, 1, 0.02 with coefficients. The combination of these models for the PS and outcome resulted in the four conditions to generate the data (see Table 1). Finally, each produced data set was analyzed using different models to estimate the PS for binary treatment assignment and multiple treatment assignment with four groups: logistic regression, multinomial regression, generalized boosted models, and neural networks (Stuart et al., 2014; Keller, Kim, & Steiner, 2015; McCaffrey et al., 2013; Shadish et al., 2002). The PS for binary treatment status were used to match treatment and control cases. The PS for multiple treatment assignment were used as inverse weights in DD models. Results. As shown in Table 2, the methods using inverse weights reach better covariate balance compared to the matching techniques. Additionally, Tables 3 to 5 show that more accurate temporal and treatment effects are also estimated using IWPS methods, especially when the generalized boosted models or the neural networks models are used to estimate the PS. References Abadie, A. (2005). Semiparametric difference-in-differences estimators. The Review of Economic Studies, 72(1), Fortson, K., Verbitsky-Savitz, N., Kopa, E., & Gleason, P. (2012). Using an Experimental Evaluation of Charter Schools to Test Whether Nonexperimental Comparison Group Methods can Replicate Experimental Impacts (NCEE Technical Methods Report ). Washington DC: National Center for Education Evaluation and Regional Assistance, Institute for Education Sciences, U.S. Department of Education. Keller, B., Kim, J.-S., & Steiner, P. M. (2015). Neural networks for propensity score estimation: Simulation results and recommendations. In L. A. van der Ark, D. M. Bolt, S.M. Chow, J.

4 A. Douglas, & W.C. Wang (Eds.), Quantitative psychology research. New York, NY: Springer. McCaffrey, D. F., Griffin, B. A., Almirall, D., Slaughter, M. E., Ramchand, R., & Burgette, L. F. (2013). A tutorial on propensity score estimation for multiple treatments using generalized boosted models. Statistics in medicine, 32(19), Rubin, D. B. (1974). Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology, 66(5), Rubin, D.B. (2005). Causal Inference Using Potential Outcomes. Journal of the American Statistical Association, 100(469), Shadish, W. R. (2010). Campbell and Rubin: A primer and comparison of their approaches to causal inference in field settings. Psychological methods, 15(1), Shadish, W. R., Cook, T. D., & Campbell, D. T. (2002). Experimental and quasi-experimental designs for generalized causal inference. Boston, MA: Houghton Mifflin. Shadish, W. R., & Steiner, P. M. (2010). A primer on propensity score analysis. Newborn and Infant Nursing Reviews, 10(1), Song, Z., Safran, D.G., Landon, B.E., Landrum, M.B., He, Y., Mechanic, R.E., Day, M.P., Chernew, M.E. (2012). The alternative quality contract, based on a global budget, lowered medical spending and improved quality. Health Affairs, 31(8), Somers, M., Zhu, P., Jacob, R., & Bloom, H., (2013). The validity and precision of the comparative interrupted time series design and the difference-in-difference design in educational evaluation (MDRC working paper in research methodology). Retrieved from Stuart, E. A., Huskamp, H. A., Duckworth, K., Simmons, J., Song, Z., Chernew, M. E., & Barry, C. L. (2014). Using propensity scores in difference-in-differences models to estimate the effects of a policy change. Health Services and Outcomes Research Methodology, 14(4),

5 Tables Table 1. Conditions in the simulation study. Condition PS model Outcome model 1 Equation (1) Equation (3) 2 Equation (2) Equation (3) 3 Equation (1) Equation (4) 4 Equation (2) Equation (4) Table 2. rage standardized absolute mean difference between treatment and comparison groups. Condition Naïve Regression Boosted Model Condition Boosted Model Note: The four conditions are specified in Table 1 in terms of the models used to generate the propensity scores and the outcome. The values included in the naïve category are calculated before the use of any PS technique.

6 Cond Naïve Table 3. Estimated effect for the treatment assignment Z i. Regression Boosting Model Cond Naïve with covariates Boosting Model Note: The four conditions are specified in Table 1. The values included in the naïve categories are estimated with the full sample without the use of any PS technique. refers to the average estimate among replications, to the average absolute bias, and to the mean standard error. Table 4. Estimated effect for the pre- and post-implementation P ti. Naïve Regression Boosting Model Cond Naïve with covariates

7 Boosting Model Cond Note: The four conditions are specified in Table 1. The values included in the naïve categories are estimated with the full sample without the use of any PS technique. refers to the average estimate among replications, to the average absolute bias, and to the mean standard error. Cond Naïve Table 5. Estimated DD effect Z i P ti. Regression Boosting Model Naïve with covariates Boosting Model Cond Note: The four conditions are specified in Table 1. The values included in the naïve categories are estimated with the full sample without the use of any PS technique. refers to the average estimate among replications, to the average absolute bias, and to the mean standard error.

The Impact of Measurement Error on Propensity Score Analysis: An Empirical Investigation of Fallible Covariates

The Impact of Measurement Error on Propensity Score Analysis: An Empirical Investigation of Fallible Covariates The Impact of Measurement Error on Propensity Score Analysis: An Empirical Investigation of Fallible Covariates Eun Sook Kim, Patricia Rodríguez de Gil, Jeffrey D. Kromrey, Rheta E. Lanehart, Aarti Bellara,

More information

Statistical Power and Autocorrelation for Short, Comparative Interrupted Time Series Designs with Aggregate Data

Statistical Power and Autocorrelation for Short, Comparative Interrupted Time Series Designs with Aggregate Data Statistical Power and Autocorrelation for Short, Comparative Interrupted Time Series Designs with Aggregate Data Andrew Swanlund, American Institutes for Research (aswanlund@air.org) Kelly Hallberg, University

More information

Bayesian regression tree models for causal inference: regularization, confounding and heterogeneity

Bayesian regression tree models for causal inference: regularization, confounding and heterogeneity Bayesian regression tree models for causal inference: regularization, confounding and heterogeneity P. Richard Hahn, Jared Murray, and Carlos Carvalho June 22, 2017 The problem setting We want to estimate

More information

A dynamic perspective to evaluate multiple treatments through a causal latent Markov model

A dynamic perspective to evaluate multiple treatments through a causal latent Markov model A dynamic perspective to evaluate multiple treatments through a causal latent Markov model Fulvia Pennoni Department of Statistics and Quantitative Methods University of Milano-Bicocca http://www.statistica.unimib.it/utenti/pennoni/

More information

An Introduction to Causal Analysis on Observational Data using Propensity Scores

An Introduction to Causal Analysis on Observational Data using Propensity Scores An Introduction to Causal Analysis on Observational Data using Propensity Scores Margie Rosenberg*, PhD, FSA Brian Hartman**, PhD, ASA Shannon Lane* *University of Wisconsin Madison **University of Connecticut

More information

Small-sample cluster-robust variance estimators for two-stage least squares models

Small-sample cluster-robust variance estimators for two-stage least squares models Small-sample cluster-robust variance estimators for two-stage least squares models ames E. Pustejovsky The University of Texas at Austin Context In randomized field trials of educational interventions,

More information

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016 An Introduction to Causal Mediation Analysis Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016 1 Causality In the applications of statistics, many central questions

More information

Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded

Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded 1 Background Latent confounder is common in social and behavioral science in which most of cases the selection mechanism

More information

Propensity Score Analysis with Hierarchical Data

Propensity Score Analysis with Hierarchical Data Propensity Score Analysis with Hierarchical Data Fan Li Alan Zaslavsky Mary Beth Landrum Department of Health Care Policy Harvard Medical School May 19, 2008 Introduction Population-based observational

More information

NISS. Technical Report Number 167 June 2007

NISS. Technical Report Number 167 June 2007 NISS Estimation of Propensity Scores Using Generalized Additive Models Mi-Ja Woo, Jerome Reiter and Alan F. Karr Technical Report Number 167 June 2007 National Institute of Statistical Sciences 19 T. W.

More information

Causal Inference with Big Data Sets

Causal Inference with Big Data Sets Causal Inference with Big Data Sets Marcelo Coca Perraillon University of Colorado AMC November 2016 1 / 1 Outlone Outline Big data Causal inference in economics and statistics Regression discontinuity

More information

Propensity scores for repeated treatments: A tutorial for the iptw function in the twang package

Propensity scores for repeated treatments: A tutorial for the iptw function in the twang package Propensity scores for repeated treatments: A tutorial for the iptw function in the twang package Lane Burgette, Beth Ann Griffin and Dan McCaffrey RAND Corporation July, 07 Introduction While standard

More information

Variable selection and machine learning methods in causal inference

Variable selection and machine learning methods in causal inference Variable selection and machine learning methods in causal inference Debashis Ghosh Department of Biostatistics and Informatics Colorado School of Public Health Joint work with Yeying Zhu, University of

More information

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed

More information

Causal Inference Basics

Causal Inference Basics Causal Inference Basics Sam Lendle October 09, 2013 Observed data, question, counterfactuals Observed data: n i.i.d copies of baseline covariates W, treatment A {0, 1}, and outcome Y. O i = (W i, A i,

More information

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers: Use of Matching Methods for Causal Inference in Experimental and Observational Studies Kosuke Imai Department of Politics Princeton University April 27, 2007 Kosuke Imai (Princeton University) Matching

More information

Propensity Score Matching and Analysis TEXAS EVALUATION NETWORK INSTITUTE AUSTIN, TX NOVEMBER 9, 2018

Propensity Score Matching and Analysis TEXAS EVALUATION NETWORK INSTITUTE AUSTIN, TX NOVEMBER 9, 2018 Propensity Score Matching and Analysis TEXAS EVALUATION NETWORK INSTITUTE AUSTIN, TX NOVEMBER 9, 2018 Schedule and outline 1:00 Introduction and overview 1:15 Quasi-experimental vs. experimental designs

More information

Estimating the Marginal Odds Ratio in Observational Studies

Estimating the Marginal Odds Ratio in Observational Studies Estimating the Marginal Odds Ratio in Observational Studies Travis Loux Christiana Drake Department of Statistics University of California, Davis June 20, 2011 Outline The Counterfactual Model Odds Ratios

More information

Since the seminal paper by Rosenbaum and Rubin (1983b) on propensity. Propensity Score Analysis. Concepts and Issues. Chapter 1. Wei Pan Haiyan Bai

Since the seminal paper by Rosenbaum and Rubin (1983b) on propensity. Propensity Score Analysis. Concepts and Issues. Chapter 1. Wei Pan Haiyan Bai Chapter 1 Propensity Score Analysis Concepts and Issues Wei Pan Haiyan Bai Since the seminal paper by Rosenbaum and Rubin (1983b) on propensity score analysis, research using propensity score analysis

More information

Selection on Observables: Propensity Score Matching.

Selection on Observables: Propensity Score Matching. Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017

More information

Propensity Scores for Repeated Treatments

Propensity Scores for Repeated Treatments C O R P O R A T I O N Propensity Scores for Repeated Treatments A Tutorial for the iptw Function in the TWANG Package Lane F. Burgette, Beth Ann Griffin, Daniel F. McCaffrey For more information on this

More information

The Effect of TenMarks Math on Student Achievement: Technical Supplement. Jordan Rickles, Ryan Williams, John Meakin, Dong Hoon Lee, and Kirk Walters

The Effect of TenMarks Math on Student Achievement: Technical Supplement. Jordan Rickles, Ryan Williams, John Meakin, Dong Hoon Lee, and Kirk Walters The Effect of TenMarks Math on Student Achievement: Technical Supplement Jordan Rickles, Ryan Williams, John Meakin, Dong Hoon Lee, and Kirk Walters AUGUST 2017 The Effect of TenMarks Math on Student

More information

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Overview In observational and experimental studies, the goal may be to estimate the effect

More information

Covariate Balancing Propensity Score for General Treatment Regimes

Covariate Balancing Propensity Score for General Treatment Regimes Covariate Balancing Propensity Score for General Treatment Regimes Kosuke Imai Princeton University October 14, 2014 Talk at the Department of Psychiatry, Columbia University Joint work with Christian

More information

Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design

Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design 1 / 32 Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design Changbao Wu Department of Statistics and Actuarial Science University of Waterloo (Joint work with Min Chen and Mary

More information

Methods for inferring short- and long-term effects of exposures on outcomes, using longitudinal data on both measures

Methods for inferring short- and long-term effects of exposures on outcomes, using longitudinal data on both measures Methods for inferring short- and long-term effects of exposures on outcomes, using longitudinal data on both measures Ruth Keogh, Stijn Vansteelandt, Rhian Daniel Department of Medical Statistics London

More information

Technical Track Session I: Causal Inference

Technical Track Session I: Causal Inference Impact Evaluation Technical Track Session I: Causal Inference Human Development Human Network Development Network Middle East and North Africa Region World Bank Institute Spanish Impact Evaluation Fund

More information

Empirical approaches in public economics

Empirical approaches in public economics Empirical approaches in public economics ECON4624 Empirical Public Economics Fall 2016 Gaute Torsvik Outline for today The canonical problem Basic concepts of causal inference Randomized experiments Non-experimental

More information

Propensity score matching for multiple treatment levels: A CODA-based contribution

Propensity score matching for multiple treatment levels: A CODA-based contribution Propensity score matching for multiple treatment levels: A CODA-based contribution Hajime Seya *1 Graduate School of Engineering Faculty of Engineering, Kobe University, 1-1 Rokkodai-cho, Nada-ku, Kobe

More information

PROPENSITY SCORE MATCHING. Walter Leite

PROPENSITY SCORE MATCHING. Walter Leite PROPENSITY SCORE MATCHING Walter Leite 1 EXAMPLE Question: Does having a job that provides or subsidizes child care increate the length that working mothers breastfeed their children? Treatment: Working

More information

Propensity Score Methods for Causal Inference

Propensity Score Methods for Causal Inference John Pura BIOS790 October 2, 2015 Causal inference Philosophical problem, statistical solution Important in various disciplines (e.g. Koch s postulates, Bradford Hill criteria, Granger causality) Good

More information

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing Primal-dual Covariate Balance and Minimal Double Robustness via (Joint work with Daniel Percival) Department of Statistics, Stanford University JSM, August 9, 2015 Outline 1 2 3 1/18 Setting Rubin s causal

More information

Propensity Score Weighting with Multilevel Data

Propensity Score Weighting with Multilevel Data Propensity Score Weighting with Multilevel Data Fan Li Department of Statistical Science Duke University October 25, 2012 Joint work with Alan Zaslavsky and Mary Beth Landrum Introduction In comparative

More information

Quantitative Economics for the Evaluation of the European Policy

Quantitative Economics for the Evaluation of the European Policy Quantitative Economics for the Evaluation of the European Policy Dipartimento di Economia e Management Irene Brunetti Davide Fiaschi Angela Parenti 1 25th of September, 2017 1 ireneb@ec.unipi.it, davide.fiaschi@unipi.it,

More information

Technical Track Session I:

Technical Track Session I: Impact Evaluation Technical Track Session I: Click to edit Master title style Causal Inference Damien de Walque Amman, Jordan March 8-12, 2009 Click to edit Master subtitle style Human Development Human

More information

DATA-ADAPTIVE VARIABLE SELECTION FOR

DATA-ADAPTIVE VARIABLE SELECTION FOR DATA-ADAPTIVE VARIABLE SELECTION FOR CAUSAL INFERENCE Group Health Research Institute Department of Biostatistics, University of Washington shortreed.s@ghc.org joint work with Ashkan Ertefaie Department

More information

Causal Analysis in Social Research

Causal Analysis in Social Research Causal Analysis in Social Research Walter R Davis National Institute of Applied Statistics Research Australia University of Wollongong Frontiers in Social Statistics Metholodogy 8 February 2017 Walter

More information

Authors and Affiliations: Nianbo Dong University of Missouri 14 Hill Hall, Columbia, MO Phone: (573)

Authors and Affiliations: Nianbo Dong University of Missouri 14 Hill Hall, Columbia, MO Phone: (573) Prognostic Propensity Scores: A Method Accounting for the Correlations of the Covariates with Both the Treatment and the Outcome Variables in Matching and Diagnostics Authors and Affiliations: Nianbo Dong

More information

Observational Studies and Propensity Scores

Observational Studies and Propensity Scores Observational Studies and s STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University Makeup Class Rather than making you come

More information

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3 University of California, Irvine 2017-2018 1 Statistics (STATS) Courses STATS 5. Seminar in Data Science. 1 Unit. An introduction to the field of Data Science; intended for entering freshman and transfers.

More information

Notes on causal effects

Notes on causal effects Notes on causal effects Johan A. Elkink March 4, 2013 1 Decomposing bias terms Deriving Eq. 2.12 in Morgan and Winship (2007: 46): { Y1i if T Potential outcome = i = 1 Y 0i if T i = 0 Using shortcut E

More information

Matching Techniques. Technical Session VI. Manila, December Jed Friedman. Spanish Impact Evaluation. Fund. Region

Matching Techniques. Technical Session VI. Manila, December Jed Friedman. Spanish Impact Evaluation. Fund. Region Impact Evaluation Technical Session VI Matching Techniques Jed Friedman Manila, December 2008 Human Development Network East Asia and the Pacific Region Spanish Impact Evaluation Fund The case of random

More information

Advanced Statistical Methods for Observational Studies L E C T U R E 0 1

Advanced Statistical Methods for Observational Studies L E C T U R E 0 1 Advanced Statistical Methods for Observational Studies L E C T U R E 0 1 introduction this class Website Expectations Questions observational studies The world of observational studies is kind of hard

More information

Campbell S And Rubin S Perspectives On Causal Inference

Campbell S And Rubin S Perspectives On Causal Inference Campbell S And Rubin S Perspectives On Causal Inference We have made it easy for you to find a PDF Ebooks without any digging. And by having access to our ebooks online or by storing it on your computer,

More information

A little (more) about me.

A little (more) about me. Comparative Effectiveness Research Methods Training Module 2: Research Designs J. Michael Oakes, PhD Associate Professor Division of Epidemiology University of Minnesota oakes007@umn.edu A little (more)

More information

Integrated approaches for analysis of cluster randomised trials

Integrated approaches for analysis of cluster randomised trials Integrated approaches for analysis of cluster randomised trials Invited Session 4.1 - Recent developments in CRTs Joint work with L. Turner, F. Li, J. Gallis and D. Murray Mélanie PRAGUE - SCT 2017 - Liverpool

More information

Estimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis.

Estimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis. Estimating the Mean Response of Treatment Duration Regimes in an Observational Study Anastasios A. Tsiatis http://www.stat.ncsu.edu/ tsiatis/ Introduction to Dynamic Treatment Regimes 1 Outline Description

More information

Small-Sample Methods for Cluster-Robust Inference in School-Based Experiments

Small-Sample Methods for Cluster-Robust Inference in School-Based Experiments Small-Sample Methods for Cluster-Robust Inference in School-Based Experiments James E. Pustejovsky UT Austin Educational Psychology Department Quantitative Methods Program pusto@austin.utexas.edu Elizabeth

More information

Comments on The Role of Large Scale Assessments in Research on Educational Effectiveness and School Development by Eckhard Klieme, Ph.D.

Comments on The Role of Large Scale Assessments in Research on Educational Effectiveness and School Development by Eckhard Klieme, Ph.D. Comments on The Role of Large Scale Assessments in Research on Educational Effectiveness and School Development by Eckhard Klieme, Ph.D. David Kaplan Department of Educational Psychology The General Theme

More information

Lecture 8. Roy Model, IV with essential heterogeneity, MTE

Lecture 8. Roy Model, IV with essential heterogeneity, MTE Lecture 8. Roy Model, IV with essential heterogeneity, MTE Economics 2123 George Washington University Instructor: Prof. Ben Williams Heterogeneity When we talk about heterogeneity, usually we mean heterogeneity

More information

Propensity score modelling in observational studies using dimension reduction methods

Propensity score modelling in observational studies using dimension reduction methods University of Colorado, Denver From the SelectedWorks of Debashis Ghosh 2011 Propensity score modelling in observational studies using dimension reduction methods Debashis Ghosh, Penn State University

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2015 Paper 334 Targeted Estimation and Inference for the Sample Average Treatment Effect Laura B. Balzer

More information

DOCUMENTS DE TRAVAIL CEMOI / CEMOI WORKING PAPERS. A SAS macro to estimate Average Treatment Effects with Matching Estimators

DOCUMENTS DE TRAVAIL CEMOI / CEMOI WORKING PAPERS. A SAS macro to estimate Average Treatment Effects with Matching Estimators DOCUMENTS DE TRAVAIL CEMOI / CEMOI WORKING PAPERS A SAS macro to estimate Average Treatment Effects with Matching Estimators Nicolas Moreau 1 http://cemoi.univ-reunion.fr/publications/ Centre d'economie

More information

OMITTED VARIABLES, R, AND BIAS REDUCTION IN MATCHING HIERARCHICAL DATA: A MONTE CARLO STUDY

OMITTED VARIABLES, R, AND BIAS REDUCTION IN MATCHING HIERARCHICAL DATA: A MONTE CARLO STUDY Journal of Statistics: Advances in Theory and Applications Volume 17, Number 1, 017, Pages 43-81 Available at http://scientificadvances.co.in DOI: http://dx.doi.org/10.1864/jsata_710011791 OMITTED VARIABLES,

More information

Vector-Based Kernel Weighting: A Simple Estimator for Improving Precision and Bias of Average Treatment Effects in Multiple Treatment Settings

Vector-Based Kernel Weighting: A Simple Estimator for Improving Precision and Bias of Average Treatment Effects in Multiple Treatment Settings Vector-Based Kernel Weighting: A Simple Estimator for Improving Precision and Bias of Average Treatment Effects in Multiple Treatment Settings Jessica Lum, MA 1 Steven Pizer, PhD 1, 2 Melissa Garrido,

More information

Multilevel Statistical Models: 3 rd edition, 2003 Contents

Multilevel Statistical Models: 3 rd edition, 2003 Contents Multilevel Statistical Models: 3 rd edition, 2003 Contents Preface Acknowledgements Notation Two and three level models. A general classification notation and diagram Glossary Chapter 1 An introduction

More information

Evaluating the performance of propensity score matching methods: A simulation study

Evaluating the performance of propensity score matching methods: A simulation study James Madison University JMU Scholarly Commons Dissertations The Graduate School Spring 2017 Evaluating the performance of propensity score matching methods: A simulation study Jessica N. Jacovidis James

More information

Modern Statistical Learning Methods for Observational Biomedical Data. Chapter 2: Basic identification and estimation of an average treatment effect

Modern Statistical Learning Methods for Observational Biomedical Data. Chapter 2: Basic identification and estimation of an average treatment effect Modern Statistical Learning Methods for Observational Biomedical Data Chapter 2: Basic identification and estimation of an average treatment effect David Benkeser Emory Univ. Marco Carone Univ. of Washington

More information

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes Kosuke Imai Department of Politics Princeton University July 31 2007 Kosuke Imai (Princeton University) Nonignorable

More information

The Economics of European Regions: Theory, Empirics, and Policy

The Economics of European Regions: Theory, Empirics, and Policy The Economics of European Regions: Theory, Empirics, and Policy Dipartimento di Economia e Management Davide Fiaschi Angela Parenti 1 1 davide.fiaschi@unipi.it, and aparenti@ec.unipi.it. Fiaschi-Parenti

More information

Dan Graham Professor of Statistical Modelling. Centre for Transport Studies

Dan Graham Professor of Statistical Modelling. Centre for Transport Studies Quantifying the effects of urban road capacity expansions on network performance and productivity via a mixed model generalised propensity score estimator Dan Graham Professor of Statistical Modelling

More information

Causal Sensitivity Analysis for Decision Trees

Causal Sensitivity Analysis for Decision Trees Causal Sensitivity Analysis for Decision Trees by Chengbo Li A thesis presented to the University of Waterloo in fulfillment of the thesis requirement for the degree of Master of Mathematics in Computer

More information

Targeted Maximum Likelihood Estimation in Safety Analysis

Targeted Maximum Likelihood Estimation in Safety Analysis Targeted Maximum Likelihood Estimation in Safety Analysis Sam Lendle 1 Bruce Fireman 2 Mark van der Laan 1 1 UC Berkeley 2 Kaiser Permanente ISPE Advanced Topics Session, Barcelona, August 2012 1 / 35

More information

Analysis of Panel Data: Introduction and Causal Inference with Panel Data

Analysis of Panel Data: Introduction and Causal Inference with Panel Data Analysis of Panel Data: Introduction and Causal Inference with Panel Data Session 1: 15 June 2015 Steven Finkel, PhD Daniel Wallace Professor of Political Science University of Pittsburgh USA Course presents

More information

Combining multiple observational data sources to estimate causal eects

Combining multiple observational data sources to estimate causal eects Department of Statistics, North Carolina State University Combining multiple observational data sources to estimate causal eects Shu Yang* syang24@ncsuedu Joint work with Peng Ding UC Berkeley May 23,

More information

What s New in Econometrics. Lecture 1

What s New in Econometrics. Lecture 1 What s New in Econometrics Lecture 1 Estimation of Average Treatment Effects Under Unconfoundedness Guido Imbens NBER Summer Institute, 2007 Outline 1. Introduction 2. Potential Outcomes 3. Estimands and

More information

Assess Assumptions and Sensitivity Analysis. Fan Li March 26, 2014

Assess Assumptions and Sensitivity Analysis. Fan Li March 26, 2014 Assess Assumptions and Sensitivity Analysis Fan Li March 26, 2014 Two Key Assumptions 1. Overlap: 0

More information

Statistical Models for Causal Analysis

Statistical Models for Causal Analysis Statistical Models for Causal Analysis Teppei Yamamoto Keio University Introduction to Causal Inference Spring 2016 Three Modes of Statistical Inference 1. Descriptive Inference: summarizing and exploring

More information

THE DESIGN (VERSUS THE ANALYSIS) OF EVALUATIONS FROM OBSERVATIONAL STUDIES: PARALLELS WITH THE DESIGN OF RANDOMIZED EXPERIMENTS DONALD B.

THE DESIGN (VERSUS THE ANALYSIS) OF EVALUATIONS FROM OBSERVATIONAL STUDIES: PARALLELS WITH THE DESIGN OF RANDOMIZED EXPERIMENTS DONALD B. THE DESIGN (VERSUS THE ANALYSIS) OF EVALUATIONS FROM OBSERVATIONAL STUDIES: PARALLELS WITH THE DESIGN OF RANDOMIZED EXPERIMENTS DONALD B. RUBIN My perspective on inference for causal effects: In randomized

More information

Application of Plausible Values of Latent Variables to Analyzing BSI-18 Factors. Jichuan Wang, Ph.D

Application of Plausible Values of Latent Variables to Analyzing BSI-18 Factors. Jichuan Wang, Ph.D Application of Plausible Values of Latent Variables to Analyzing BSI-18 Factors Jichuan Wang, Ph.D Children s National Health System The George Washington University School of Medicine Washington, DC 1

More information

New Developments in Nonresponse Adjustment Methods

New Developments in Nonresponse Adjustment Methods New Developments in Nonresponse Adjustment Methods Fannie Cobben January 23, 2009 1 Introduction In this paper, we describe two relatively new techniques to adjust for (unit) nonresponse bias: The sample

More information

Harvard University. Harvard University Biostatistics Working Paper Series

Harvard University. Harvard University Biostatistics Working Paper Series Harvard University Harvard University Biostatistics Working Paper Series Year 2015 Paper 192 Negative Outcome Control for Unobserved Confounding Under a Cox Proportional Hazards Model Eric J. Tchetgen

More information

Measuring Social Influence Without Bias

Measuring Social Influence Without Bias Measuring Social Influence Without Bias Annie Franco Bobbie NJ Macdonald December 9, 2015 The Problem CS224W: Final Paper How well can statistical models disentangle the effects of social influence from

More information

Comparing Change Scores with Lagged Dependent Variables in Models of the Effects of Parents Actions to Modify Children's Problem Behavior

Comparing Change Scores with Lagged Dependent Variables in Models of the Effects of Parents Actions to Modify Children's Problem Behavior Comparing Change Scores with Lagged Dependent Variables in Models of the Effects of Parents Actions to Modify Children's Problem Behavior David R. Johnson Department of Sociology and Haskell Sie Department

More information

Step-by-Step Guidelines for Propensity Score Weighting with Two Groups. Beth Ann Griffin Daniel McCaffrey

Step-by-Step Guidelines for Propensity Score Weighting with Two Groups. Beth Ann Griffin Daniel McCaffrey Step-by-Step Guidelines for Propensity Score Weighting with Two Groups Beth Ann Griffin Daniel McCaffrey 1 Four key steps 1) Choose the primary treatment effect of interest (ATE or ATT) 2) Estimate propensity

More information

Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference 1

Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference 1 Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference 1 Gary King GaryKing.org April 13, 2014 1 c Copyright 2014 Gary King, All Rights Reserved. Gary King ()

More information

Covariate selection and propensity score specification in causal inference

Covariate selection and propensity score specification in causal inference Covariate selection and propensity score specification in causal inference Ingeborg Waernbaum Doctoral Dissertation Department of Statistics Umeå University SE-901 87 Umeå, Sweden Copyright c 2008 by Ingeborg

More information

Econometric Analysis of Cross Section and Panel Data

Econometric Analysis of Cross Section and Panel Data Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND

More information

Data Integration for Big Data Analysis for finite population inference

Data Integration for Big Data Analysis for finite population inference for Big Data Analysis for finite population inference Jae-kwang Kim ISU January 23, 2018 1 / 36 What is big data? 2 / 36 Data do not speak for themselves Knowledge Reproducibility Information Intepretation

More information

Principles Underlying Evaluation Estimators

Principles Underlying Evaluation Estimators The Principles Underlying Evaluation Estimators James J. University of Chicago Econ 350, Winter 2019 The Basic Principles Underlying the Identification of the Main Econometric Evaluation Estimators Two

More information

CompSci Understanding Data: Theory and Applications

CompSci Understanding Data: Theory and Applications CompSci 590.6 Understanding Data: Theory and Applications Lecture 17 Causality in Statistics Instructor: Sudeepa Roy Email: sudeepa@cs.duke.edu Fall 2015 1 Today s Reading Rubin Journal of the American

More information

Calculating Effect-Sizes. David B. Wilson, PhD George Mason University

Calculating Effect-Sizes. David B. Wilson, PhD George Mason University Calculating Effect-Sizes David B. Wilson, PhD George Mason University The Heart and Soul of Meta-analysis: The Effect Size Meta-analysis shifts focus from statistical significance to the direction and

More information

Econometrics I. Professor William Greene Stern School of Business Department of Economics 1-1/40. Part 1: Introduction

Econometrics I. Professor William Greene Stern School of Business Department of Economics 1-1/40. Part 1: Introduction Econometrics I Professor William Greene Stern School of Business Department of Economics 1-1/40 http://people.stern.nyu.edu/wgreene/econometrics/econometrics.htm 1-2/40 Overview: This is an intermediate

More information

Streamlining Missing Data Analysis by Aggregating Multiple Imputations at the Data Level

Streamlining Missing Data Analysis by Aggregating Multiple Imputations at the Data Level Streamlining Missing Data Analysis by Aggregating Multiple Imputations at the Data Level A Monte Carlo Simulation to Test the Tenability of the SuperMatrix Approach Kyle M Lang Quantitative Psychology

More information

ECON Introductory Econometrics. Lecture 17: Experiments

ECON Introductory Econometrics. Lecture 17: Experiments ECON4150 - Introductory Econometrics Lecture 17: Experiments Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 13 Lecture outline 2 Why study experiments? The potential outcome framework.

More information

Propensity Score Analysis Using teffects in Stata. SOC 561 Programming for the Social Sciences Hyungjun Suh Apr

Propensity Score Analysis Using teffects in Stata. SOC 561 Programming for the Social Sciences Hyungjun Suh Apr Propensity Score Analysis Using teffects in Stata SOC 561 Programming for the Social Sciences Hyungjun Suh Apr. 25. 2016 Overview Motivation Propensity Score Weighting Propensity Score Matching with teffects

More information

Propensity Score Matching

Propensity Score Matching Methods James H. Steiger Department of Psychology and Human Development Vanderbilt University Regression Modeling, 2009 Methods 1 Introduction 2 3 4 Introduction Why Match? 5 Definition Methods and In

More information

On the Use of Linear Fixed Effects Regression Models for Causal Inference

On the Use of Linear Fixed Effects Regression Models for Causal Inference On the Use of Linear Fixed Effects Regression Models for ausal Inference Kosuke Imai Department of Politics Princeton University Joint work with In Song Kim Atlantic ausal Inference onference Johns Hopkins

More information

Differences-in- Differences. November 10 Clair

Differences-in- Differences. November 10 Clair Differences-in- Differences November 10 Clair The Big Picture What is this class really about, anyway? The Big Picture What is this class really about, anyway? Causality The Big Picture What is this class

More information

Relative Effect Sizes for Measures of Risk. Jake Olivier, Melanie Bell, Warren May

Relative Effect Sizes for Measures of Risk. Jake Olivier, Melanie Bell, Warren May Relative Effect Sizes for Measures of Risk Jake Olivier, Melanie Bell, Warren May MATHEMATICS & THE UNIVERSITY OF NEW STATISTICS SOUTH WALES November 2015 1 / 27 Motivating Examples Effect Size Phi and

More information

High Dimensional Propensity Score Estimation via Covariate Balancing

High Dimensional Propensity Score Estimation via Covariate Balancing High Dimensional Propensity Score Estimation via Covariate Balancing Kosuke Imai Princeton University Talk at Columbia University May 13, 2017 Joint work with Yang Ning and Sida Peng Kosuke Imai (Princeton)

More information

Ignoring the matching variables in cohort studies - when is it valid, and why?

Ignoring the matching variables in cohort studies - when is it valid, and why? Ignoring the matching variables in cohort studies - when is it valid, and why? Arvid Sjölander Abstract In observational studies of the effect of an exposure on an outcome, the exposure-outcome association

More information

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers: Use of Matching Methods for Causal Inference in Experimental and Observational Studies Kosuke Imai Department of Politics Princeton University April 13, 2009 Kosuke Imai (Princeton University) Matching

More information

Difference-in-Differences Estimation

Difference-in-Differences Estimation Difference-in-Differences Estimation Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. The Basic Methodology 2. How Should We

More information

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. The Basic Methodology 2. How Should We View Uncertainty in DD Settings?

More information

e author and the promoter give permission to consult this master dissertation and to copy it or parts of it for personal use. Each other use falls

e author and the promoter give permission to consult this master dissertation and to copy it or parts of it for personal use. Each other use falls e author and the promoter give permission to consult this master dissertation and to copy it or parts of it for personal use. Each other use falls under the restrictions of the copyright, in particular

More information

Bayesian nonparametric predictive approaches for causal inference: Regression Discontinuity Methods

Bayesian nonparametric predictive approaches for causal inference: Regression Discontinuity Methods Bayesian nonparametric predictive approaches for causal inference: Regression Discontinuity Methods George Karabatsos University of Illinois-Chicago ERCIM Conference, 14-16 December, 2013 Senate House,

More information

Introduction to Propensity Score Matching: A Review and Illustration

Introduction to Propensity Score Matching: A Review and Illustration Introduction to Propensity Score Matching: A Review and Illustration Shenyang Guo, Ph.D. School of Social Work University of North Carolina at Chapel Hill January 28, 2005 For Workshop Conducted at the

More information

Balancing Covariates via Propensity Score Weighting

Balancing Covariates via Propensity Score Weighting Balancing Covariates via Propensity Score Weighting Kari Lock Morgan Department of Statistics Penn State University klm47@psu.edu Stochastic Modeling and Computational Statistics Seminar October 17, 2014

More information

SREE WORKSHOP ON PRINCIPAL STRATIFICATION MARCH Avi Feller & Lindsay C. Page

SREE WORKSHOP ON PRINCIPAL STRATIFICATION MARCH Avi Feller & Lindsay C. Page SREE WORKSHOP ON PRINCIPAL STRATIFICATION MARCH 2017 Avi Feller & Lindsay C. Page Agenda 2 Conceptual framework (45 minutes) Small group exercise (30 minutes) Break (15 minutes) Estimation & bounds (1.5

More information