Causal inference in multilevel data structures:

Similar documents
Propensity Score Weighting with Multilevel Data

Propensity Score Methods for Causal Inference

Assess Assumptions and Sensitivity Analysis. Fan Li March 26, 2014

Propensity Score Analysis with Hierarchical Data

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Bayesian Inference for Sequential Treatments under Latent Sequential Ignorability. June 19, 2017

Statistical Models for Causal Analysis

Behavioral Data Mining. Lecture 19 Regression and Causal Effects

The Essential Role of Pair Matching in. Cluster-Randomized Experiments. with Application to the Mexican Universal Health Insurance Evaluation

Selection on Observables: Propensity Score Matching.

Inference for Average Treatment Effects

Potential Outcomes Model (POM)

What s New in Econometrics. Lecture 1

CompSci Understanding Data: Theory and Applications

CEPA Working Paper No

Front-Door Adjustment

Research Design: Causal inference and counterfactuals

causal inference at hulu

Matching. Quiz 2. Matching. Quiz 2. Exact Matching. Estimand 2/25/14

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case

Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference 1

1 What does the random effect η mean?

AGEC 661 Note Fourteen

Dynamics in Social Networks and Causality

Hypothesis Testing with Z and T

Bayesian causal forests: dealing with regularization induced confounding and shrinking towards homogeneous effects

Estimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing

Difference-in-Differences Methods

ANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Propensity Score Matching

Lecture 8. Roy Model, IV with essential heterogeneity, MTE

Conditional probabilities and graphical models

Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies

Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies

Eco517 Fall 2014 C. Sims FINAL EXAM

Causality II: How does causal inference fit into public health and what it is the role of statistics?

Vector-Based Kernel Weighting: A Simple Estimator for Improving Precision and Bias of Average Treatment Effects in Multiple Treatment Settings

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016

PSC 504: Instrumental Variables

multilevel modeling: concepts, applications and interpretations

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Gov 2002: 5. Matching

Weighting. Homework 2. Regression. Regression. Decisions Matching: Weighting (0) W i. (1) -å l i. )Y i. (1-W i 3/5/2014. (1) = Y i.

Discussion of Papers on the Extensions of Propensity Score

Bayesian regression tree models for causal inference: regularization, confounding and heterogeneity

Econometrics with Observational Data. Introduction and Identification Todd Wagner February 1, 2017

Experimental Designs for Identifying Causal Mechanisms

The Complexity of Forecast Testing

Variable selection and machine learning methods in causal inference

review session gov 2000 gov 2000 () review session 1 / 38

Stratified Randomized Experiments

PSC 504: Differences-in-differeces estimators

Advanced Statistical Methods for Observational Studies L E C T U R E 0 1

Logistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015

Unobserved heterogeneity in logistic regression

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

Quantitative Economics for the Evaluation of the European Policy

Principles Underlying Evaluation Estimators

University of California, Berkeley

Statistical Analysis of Causal Mechanisms

A Decision Theoretic Approach to Causality

Statistical Analysis of Causal Mechanisms

Previous lecture. P-value based combination. Fixed vs random effects models. Meta vs. pooled- analysis. New random effects testing.

Gov 2002: 13. Dynamic Causal Inference

Introduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015

Sample Space: Specify all possible outcomes from an experiment. Event: Specify a particular outcome or combination of outcomes.

An overview of applied econometrics

COMPARING SEVERAL MEANS: ANOVA

Section 10: Inverse propensity score weighting (IPSW)

Statistical Analysis of Causal Mechanisms

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Final Exam. Economics 835: Econometrics. Fall 2010

Flexible Estimation of Treatment Effect Parameters

OUTCOME REGRESSION AND PROPENSITY SCORES (CHAPTER 15) BIOS Outcome regressions and propensity scores

MATCHING FOR EE AND DR IMPACTS

Introductory Econometrics

Bootstrapping Sensitivity Analysis

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha

Causal Inference Basics

Controlling for Time Invariant Heterogeneity

Statistical Inference with Regression Analysis

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

Relationships between variables. Association Examples: Smoking is associated with heart disease. Weight is associated with height.

Randomized Experiments

Applied Microeconometrics (L5): Panel Data-Basics

Midterm 1 ECO Undergraduate Econometrics

Lecture Notes 1: Decisions and Data. In these notes, I describe some basic ideas in decision theory. theory is constructed from

Inference with few assumptions: Wasserman s example

Combining Difference-in-difference and Matching for Panel Data Analysis

Causal Inference. Prediction and causation are very different. Typical questions are:

Chapter 6 Stochastic Regressors

Econometric Analysis of Cross Section and Panel Data

Balancing Covariates via Propensity Score Weighting

From Causality, Second edition, Contents

The Simple Linear Regression Model

Wolpert, D. (1995). Off-training set error and a priori distinctions between learning algorithms.

Causality and Experiments

Transcription:

Causal inference in multilevel data structures: Discussion of papers by Li and Imai Jennifer Hill May 19 th, 2008

Li paper

Strengths Area that needs attention! With regard to propensity score strategies advice is often that will only affect s.e. s, not necessarily point estimates this shows not true Important to see the implications of ignoring clustering since that is often done Separates out the implications of mis-specifying the model for the assignment mechanism and the model for the response

Concerns What s being estimated/assumed Causal vs. not causal Individual vs cluster-level effects SUTVA violations?

Importance of formalizing as a causal estimand While Li et al. have defined a valid estimand and derived properties of their estimators with respect to it, it seems the most important decisions regarding modeling and assumptions arise once we make the jump to a causal estimand Li et al. write their response model as: y hk = δ h + γ h z hk + αd h + ε hk where, recall, d h = (1/n h )Σz hk What do these parameters mean? How do we translate this into a causal model..? It depends

Causal formalization #1 y hk = δ h + γ h z hk + αd h + ε hk This model seems to imply that the outcome for person k in cluster h depends not only on his own treatment assignment but also the treatment assignment of others in his cluster (so proportion treated could be interpreted causally as well). This is a SUTVA violation though a potentially tractable one (particularly if we resign ourselves to making inferences at the cluster level). So potential outcomes then would be defined with respect to both z and d y hk (z hk =0,d h ) = δ h + αd h + ε hk y hk (z hk =1,d h ) = δ h + γ h + αd h + ε hk Which leaves room for any number of different kinds of average causal comparisons However we have to make stronger assumptions to identify all of these effects.

Causal formalization #2 y hk = δ h + γ h z hk + αd h + ε hk However another interpretation is that d h is simply a proxy for an unobserved confounder (e.g. d h ~ N(βW,σ d2 )), but conditional on z, d doesn t affect y For instance if the treatment z denoted Head Start participation in Head Start and W reflected the overall spending on social services in community h (which is associated with both d h and subsequent outcomes). In this case we wouldn t interpret α causally, rather d h is probably something we want to control for in order to satisfy ignorability (so α is a nuisance parameter). No SUTVA violation now. Then, potential outcomes could be defined with respect to z alone y hk (z hk =0) = δ h + αd h + ε hk y hk (z hk =1) = δ h + γ h + αd h + ε hk

Other models Of course this is also a rather limited structure, should also consider, for instances, cases such as When α = 0 When d h only impact potential outcomes for the treated When clusters proxy for unobserved heterogeneity in a more general way (i.e. not proxied by measured variabls)

Also Choice of weights. Why use the basic HT weights rather than stabilized versions? Not convinced of the utility of the population overlap estimand Since TOT and TOC are popular estimands would be interesting to see implications in those scenarios as well.

Imai paper

Strengths Formalizes aspects of the MPCR that I haven t seen done before with regard to SUTVA at individual and cluster (both within pair and across pairs) levels Points out important misconceptions in previous discussions of MPCR Delineates when the commonly used harmonic mean estimators will fail Clarity about potential estimands

Throws down the gauntlet: pair matching should be used whenever feasible

Concerns SATE/CATE requirements for unbiasedness can be hard to control cluster sizes in field experiments and matching that is strong enough to match SATE across clusters within pairs seems quite unlikely (so even if better than harmonic mean or rb approaches, still a cause for concern) also doesn t indiv.level interference make this less likely? PATE/UATE need cluster sizes uncorrelated with cluster treatment effects (this made more plausible by above conditions similarity across clusters within pairs in terms of cluster size and covariate distributions) Variance not identified for SATE/CATE (is upper bound calculation small enough to be at all helpful?)

Variance estimation If you only care about SATE or CATE why can t you use randomization based inference (then constant treatment effect assumption shouldn t matter should it?)

Simulations Designed to see what happens when the assumptions necessary for the harmonic mean estimators are not met However there is no simulation condition that tests what happens if the assumptions necessary for the new estimators are not met.? If I can t balance cluster sizes, how similar do those clusters have to be?

Comparison to UMCR When comparing the efficiency gains of the matched pairs design over the unmatched cluster randomized design the authors assume that treatment effect estimation is performed without further covariance adjustment UMCR with additional covariance adjustment seems to me to be the more reasonable comparison

Alternative designs? Why not take a fully Bayesian approach to estimation? Should be able to account for the design and incorporate additional covariance adjustments. Could relax some of the assumptions that the harmonic mean estimator is implicitly built upon (normality, iid distributions, equal variances for the potential outcomes)