Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference 1

Similar documents
Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Discussion: Can We Get More Out of Experiments?

Propensity Score Matching

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing

An Introduction to Causal Analysis on Observational Data using Propensity Scores

Research Design: Causal inference and counterfactuals

Notes on causal effects

Propensity Score Methods for Causal Inference

Stratified Randomized Experiments

BIOS 6649: Handout Exercise Solution

Gov 2002: 4. Observational Studies and Confounding

The Essential Role of Pair Matching in. Cluster-Randomized Experiments. with Application to the Mexican Universal Health Insurance Evaluation

Advanced Quantitative Methods: Causal inference

Partially Identified Treatment Effects for Generalizability

On the Use of Linear Fixed Effects Regression Models for Causal Inference

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

Inference for Average Treatment Effects

Casual Mediation Analysis

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

EMERGING MARKETS - Lecture 2: Methodology refresher

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions

Causal Inference Lecture Notes: Selection Bias in Observational Studies

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

review session gov 2000 gov 2000 () review session 1 / 38

Double Robustness. Bang and Robins (2005) Kang and Schafer (2007)

Causal Mediation Analysis in R. Quantitative Methodology and Causal Mechanisms

Lecture: Difference-in-Difference (DID)

Dynamics in Social Networks and Causality

Gov 2002: 13. Dynamic Causal Inference

Unpacking the Black-Box: Learning about Causal Mechanisms from Experimental and Observational Studies

Statistical Analysis of Causal Mechanisms

Causal Mechanisms Short Course Part II:

Matching for Causal Inference Without Balance Checking

Measuring Social Influence Without Bias

Answers to Problem Set #4

Variable selection and machine learning methods in causal inference

Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies

Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies

Matching. Quiz 2. Matching. Quiz 2. Exact Matching. Estimand 2/25/14

Logistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015

Statistical Analysis of Causal Mechanisms

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score

Modification and Improvement of Empirical Likelihood for Missing Response Problem

PSC 504: Dynamic Causal Inference

Discussion of Papers on the Extensions of Propensity Score

Experimental Designs for Identifying Causal Mechanisms

ECON Introductory Econometrics. Lecture 17: Experiments

A Theory of Statistical Inference for Matching Methods in Causal Research

Addressing Analysis Issues REGRESSION-DISCONTINUITY (RD) DESIGN

Fractional Imputation in Survey Sampling: A Comparative Review

Front-Door Adjustment

Survival Analysis for Case-Cohort Studies

Selection on Observables: Propensity Score Matching.

Lecture Outline. Biost 518 Applied Biostatistics II. Choice of Model for Analysis. Choice of Model. Choice of Model. Lecture 10: Multiple Regression:

Gov 2002: 5. Matching

Causal Modeling in Environmental Epidemiology. Joel Schwartz Harvard University

The propensity score with continuous treatments

Combining multiple observational data sources to estimate causal eects

Statistical Analysis of Causal Mechanisms

Causal inference in multilevel data structures:

Covariate Balancing Propensity Score for General Treatment Regimes

ESTIMATION OF TREATMENT EFFECTS VIA MATCHING

Since the seminal paper by Rosenbaum and Rubin (1983b) on propensity. Propensity Score Analysis. Concepts and Issues. Chapter 1. Wei Pan Haiyan Bai

Identification and Inference in Causal Mediation Analysis

Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design

DEALING WITH MULTIVARIATE OUTCOMES IN STUDIES FOR CAUSAL EFFECTS

What s New in Econometrics. Lecture 1

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016

Noncompliance in Randomized Experiments

Bootstrapping Sensitivity Analysis

Balancing Covariates via Propensity Score Weighting: The Overlap Weights

6. Assessing studies based on multiple regression

Distribution-Free Procedures (Devore Chapter Fifteen)

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes

multilevel modeling: concepts, applications and interpretations

Mathematical statistics

Causal Inference Basics

Review of Econometrics

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Lecture 9. Matthew Osborne

ECO Class 6 Nonparametric Econometrics

A Theory of Statistical Inference for Matching Methods in Applied Causal Research

Estimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis.

Quantitative Economics for the Evaluation of the European Policy

Econometrics with Observational Data. Introduction and Identification Todd Wagner February 1, 2017

Balancing Covariates via Propensity Score Weighting


Chapter 4: Factor Analysis

MATCHING FOR EE AND DR IMPACTS

Hypothesis Testing with Z and T

Introduction to Causal Inference. Solutions to Quiz 4

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari

Unpacking the Black-Box: Learning about Causal Mechanisms from Experimental and Observational Studies

Data Integration for Big Data Analysis for finite population inference

Linear Regression with Multiple Regressors

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Lecture Discussion. Confounding, Non-Collapsibility, Precision, and Power Statistics Statistical Methods II. Presented February 27, 2018

Transcription:

Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference 1 Gary King GaryKing.org April 13, 2014 1 c Copyright 2014 Gary King, All Rights Reserved. Gary King () Research Designs April 13, 2014 1 / 23

Reference Kosuke Imai, Gary King, and Elizabeth Stuart. Misunderstandings among Experimentalists and Observationalists: Balance Test Fallacies in Causal Inference Journal of the Royal Statistical Society, Series A Vol. 171, Part 2 (2008): Pp. 1-22 http://gking.harvard.edu/files/abs/matchse-abs.shtml Gary King (Harvard) Research Designs for Causal Inference April 13, 2014 2 / 23

Notation Sample of n units from a finite population of N units (typically N >> n) Sample selection: I i is 1 for units selected, 0 otherwise Treatment assignment: T i is 1 for treated group, 0 control group (Assume: treated and control groups are each of size n/2) Observed outcome variable: Y i Potential outcomes: Y i (1) and Y i (0), Y i potential values when T i is 1 or 0 respectively. Fundamental problem of causal inference. Only one potential outcome is ever observed: If T i = 0, Y i (0) = Y i Y i (1) =? If T i = 1, Y i (0) =? Y i (1) = Y i (I i, T i, Y i ) are random; Y i (1) and Y i (0) are fixed. Gary King (Harvard) Research Designs for Causal Inference April 13, 2014 3 / 23

Quantities of Interest Treatment Effect (for unit i): TE i Y i (1) Y i (0) Population Average Treatment Effect: PATE 1 N N TE i i=1 Sample Average Treatment Effect: SATE 1 n TE i i {I i =1} Gary King (Harvard) Research Designs for Causal Inference April 13, 2014 4 / 23

Decomposition of Causal Effect Estimation Error Difference in means estimator: D 1 n/2 Estimation Error: i {I i =1,T i =1} Y i 1 n/2 PATE D i {I i =1,T i =0} Pretreatment confounders: X are observed and U are unobserved Decomposition: = S + T = ( SX + SU ) + ( TX + TU ) Error due to S (sample selection), T (treatment imbalance), and each due to observed (X i ) and unobserved (U i ) covariates Gary King (Harvard) Research Designs for Causal Inference April 13, 2014 5 / 23 Y i.

Selection Error Definition: S PATE SATE = N n (NATE SATE), N where NATE is the nonsample average treatment effect. S vanishes if: 1 The sample is a census (I i = 1 for all observations and n = N); 2 SATE = NATE; or 3 Switch quantity of interest from PATE to SATE (recommended!) Gary King (Harvard) Research Designs for Causal Inference April 13, 2014 6 / 23

Decomposing Selection Error Decomposition: S = SX + SU SX = 0 when empirical distribution of (observed) X is identical in population and sample: F (X I = 0) = F (X I = 1). SU = 0 when empirical distribution of (unobserved) U is identical in population and sample: F (U I = 0) = F (U I = 1). conditions are unverifiable: X is observed only in sample and U is not observed at all. SX SU vanishes if weighting on X cannot be corrected after the fact Gary King (Harvard) Research Designs for Causal Inference April 13, 2014 7 / 23

Decomposing Treatment Imbalance Decomposition: T = TX + TU TX = 0 when X is balanced between treateds and controls: F (X T = 1, I = 1) = F (X T = 0, I = 1). Verifiable from data; can be generated ex ante by blocking or enforced ex post via matching or parametric adjustment TU = 0 when U is balanced between treateds and controls: F (U T = 1, I = 1) = F (U T = 0, I = 1). Unverifiable. Achieved only by assumption or, on average, by random treatment assignment Gary King (Harvard) Research Designs for Causal Inference April 13, 2014 8 / 23

Alternative Quantities of Interest Sample average treatment effect on the treated: SATT 1 TE i n/2 i {I i =1,T i =1} Population average treatment effect on the treated: PATT 1 N TE i i {T i =1} (where N = N i=1 T i is the number of treated units in the population) When are these of more interest than PATE and SATE? Why never for randomized experiments? Why usually for matching? Analogous estimation error decomposition: = PATT D, holds: = ( S X + S U ) + ( T X + T U ) Gary King (Harvard) Research Designs for Causal Inference April 13, 2014 9 / 23

Effects of Design Components on Estimation Error Design Choice SX SU avg avg Random sampling = 0 = 0 TX TU Complete stratified random sampling = 0 avg = 0 Focus on SATE rather than PATE = 0 = 0 Weighting for nonrandom sampling = 0 =? Large sample size???? Random treatment assignment avg = 0 avg = 0 Complete blocking = 0 =? Exact matching = 0 =? Assumption No selection bias avg = 0 avg = 0 Ignorability avg = 0 No omitted variables = 0 Gary King (Harvard) Research Designs for Causal Inference April 13, 2014 10 / 23

Comparing Blocking and Matching Adding blocking to random assignment is always as or more efficient, and never biased (blocking on pretreatment variables related to the outcome is more efficient) Blocking is like regression adjustment, where the functional form and the parameter values are known Matching is like blocking, except: 1 to avoid selection error, must change quantity of interest from PATE to PATT or SATT, 2 random treatment assignment following matching is impossible, 3 Exact matching, unlike blocking, is dependent on the already-collected data happening to contain sufficiently good matches. 4 In the worst case scenerio, matching (just like regression adjustment) can increase bias (this cannot occur with blocking plus random assignment) Adding matching to a parametric model almost always reduces model dependence and bias, and sometimes variance too Gary King (Harvard) Research Designs for Causal Inference April 13, 2014 11 / 23

The Benefits of Major Research Designs: Overview SX SU TX TU Ideal experiment 0 0 = 0 0 Randomized clinicial trials (Limited or no blocking) 0 0 avg = 0 Randomized clinicial trials (Full blocking) 0 0 = 0 avg = 0 avg = 0 Social Science Field Experiment (Limited or no blocking) 0 0 0 0 Survey Experiment (Limited or no blocking) 0 0 0 0 Observational Study (Representative data set, Well-matched) 0 0 0 0 Observational Study (Unrepresentative but partially, correctable data, well-matched) 0 0 0 0 Observational Study (Unrepresentative data set, Well-matched) 0 0 0 0 For column Q, 0 denotes E(Q) = 0 and lim n Var(Q) = 0, whereas avg = 0 indicates zero on average, or E(Q) = 0, for a design with a small n. S can be set to zero if we switch from PATE to SATE. Gary King (Harvard) Research Designs for Causal Inference April 13, 2014 12 / 23

The Ideal Experiment (according to the paper) Random selection from well-defined population large n blocking on all known confounders random treatment assignment within blocks E( SX ) = 0, lim n V ( SX ) = 0 E( SU ) = 0, lim n V ( SU ) = 0 TX = 0 E( TU ) = 0, lim n V ( TU ) = 0 Gary King (Harvard) Research Designs for Causal Inference April 13, 2014 13 / 23

An Even More Ideal Experiment Begin with a well-defined population Define sampling strata based on cross-classification of all known confounders Random sampling within strata (if strata sample is proportional to population fraction, no weights are needed) large n blocking on all known confounders random treatment assignment within blocks SX = 0 E( SU ) = 0, lim n V ( SU ) = 0 TX = 0 E( TU ) = 0, lim n V ( TU ) = 0 Gary King (Harvard) Research Designs for Causal Inference April 13, 2014 14 / 23

Randomized Clinical Trials (Little or no Blocking) nonrandom selection small n little or no blocking random treatment assignment SX 0 SU 0 E( TX ) = 0 E( TU ) = 0 Gary King (Harvard) Research Designs for Causal Inference April 13, 2014 15 / 23

Randomized Clinical Trials (Full Blocking) nonrandom selection small n Full blocking random treatment assignment SX 0 SU 0 TX = 0 E( TU ) = 0 Gary King (Harvard) Research Designs for Causal Inference April 13, 2014 16 / 23

Social Science Field Experiment nonrandom selection large n limited or no blocking random treatment assignment SX 0 or change PATE to SATE and SX = 0 SU 0 or change PATE to SATE and SU = 0 E( TX ) = 0, lim n V ( TX ) = 0 E( TU ) = 0, lim n V ( TU ) = 0 Gary King (Harvard) Research Designs for Causal Inference April 13, 2014 17 / 23

Survey Experiment random selection large n limited or no blocking random treatment assignment (only question wording treatments) E( SX ) = 0, lim n V ( SX ) = 0 E( SU ) = 0, lim n V ( SU ) = 0 E( TX ) = 0, lim n V ( TX ) = 0 E( TU ) = 0, lim n V ( TU ) = 0 Gary King (Harvard) Research Designs for Causal Inference April 13, 2014 18 / 23

Observational Study, well-matched nonrandom selection large n no blocking nonrandom treatment assignment SX 0 if representative, corrected by weighting, or for estimating SATE; or 0 otherwise SU 0 TX TU 0 (due to matching well) 0 except by assumption Gary King (Harvard) Research Designs for Causal Inference April 13, 2014 19 / 23

What is the Best Design? The ideal design is rarely feasible Effort in experimental studies: random assignment Effort in observational studies: measuring and adjusting for X (via matching or modeling) the Achilles heal of experimental studies: S and small n the Achilles heal of observational studies: U Each design accomodates best to the applications for which it was designed Gary King (Harvard) Research Designs for Causal Inference April 13, 2014 20 / 23

Fallacies in Experimental Research Failure to block on all known covariates seen incorrectly as requiring fewer assumptions (about what to block on) In fact, blocking helps (except in strange situations) Blocking on relevant covariates is better, so choose carefully. Block what you can and randomize what you cannot (Box et al.) Conducting t-tests to check for balance after random treatment assignment variables blocked on balance exactly after treatment assignment; so if you re checking, you missed an opportunity to increase efficiency if variables became available after treatment assignment, a t-test only checks for whether randomization was done appropriately randomization balances on average; any one random assignment is not balanced exactly (which is why its better to block) Gary King (Harvard) Research Designs for Causal Inference April 13, 2014 21 / 23

The Balance Test Fallacy in Matching Research t statistic 0 1 2 3 4 "Statistical insignificance" region Math test score 0 20 40 60 80 100 QQ Plot Mean Deviation Difference in Means 0 100 200 300 400 0 100 200 300 400 Number of Controls Randomly Dropped Number of Controls Randomly Dropped Randomly dropping observations reduces imbalance??? Gary King (Harvard) Research Designs for Causal Inference April 13, 2014 22 / 23

The Balance Test Fallacy: Explanation hypothesis tests are a function of balance AND power; we only want to measure balance for SATE, PATE, or SPATE, balance is a characteristic of the sample, not some superpopulation; so no need to make an inference Simple linear model (for intution): Suppose E(Y T, X ) = θ + T β + X γ Bias in coefficient on T from regressing Y on T (without X ): E( ˆβ β T, X ) = Gγ (where G are coefficients from a regression X on a constant and T ) Imbalance is due to G only. If G = 0, bias=0 If G 0, bias can be any size due to γ If you cannot control G (and we don t usually try to minimize γ so we don t cook the books), then G must be reduced without limit. No threshold level is safe. Gary King (Harvard) Research Designs for Causal Inference April 13, 2014 23 / 23