When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

Similar documents
When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

On the Use of Linear Fixed Effects Regression Models for Causal Inference

When Should We Use Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

Matching and Weighting Methods for Causal Inference

Econometric Analysis of Cross Section and Panel Data

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Applied Microeconometrics (L5): Panel Data-Basics

Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies

Covariate Balancing Propensity Score for General Treatment Regimes

Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies

Causal Mechanisms Short Course Part II:

Unpacking the Black-Box: Learning about Causal Mechanisms from Experimental and Observational Studies

Statistical Models for Causal Analysis

Flexible Estimation of Treatment Effect Parameters

Instrumental Variables

Final Exam. Economics 835: Econometrics. Fall 2010

Gov 2002: 4. Observational Studies and Confounding

Statistical Analysis of Causal Mechanisms

Experimental Designs for Identifying Causal Mechanisms

ESTIMATION OF TREATMENT EFFECTS VIA MATCHING

Quantitative Economics for the Evaluation of the European Policy

Statistical Analysis of Causal Mechanisms

Econometrics of Panel Data

What s New in Econometrics? Lecture 14 Quantile Methods

Selection on Observables: Propensity Score Matching.

Unpacking the Black-Box: Learning about Causal Mechanisms from Experimental and Observational Studies

Missing dependent variables in panel data models

Difference-in-Differences Methods

Matching Methods for Causal Inference with Time-Series Cross-Section Data

Identification and Inference in Causal Mediation Analysis

Lecture 6: Dynamic panel models 1

Identification and Sensitivity Analysis for Multiple Causal Mechanisms: Revisiting Evidence from Framing Experiments

Applied Econometrics Lecture 1

Econometrics. Week 6. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

High Dimensional Propensity Score Estimation via Covariate Balancing

Discussion of Papers on the Extensions of Propensity Score

1 Motivation for Instrumental Variable (IV) Regression

Non-linear panel data modeling

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation

Propensity Score Weighting with Multilevel Data

Controlling for Time Invariant Heterogeneity

New Developments in Econometrics Lecture 16: Quantile Estimation

ANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS

Causal Mediation Analysis in R. Quantitative Methodology and Causal Mechanisms

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Regression Discontinuity Designs

EMERGING MARKETS - Lecture 2: Methodology refresher

Statistical Analysis of Causal Mechanisms

Lecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook)

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Clustering as a Design Problem

Statistical Analysis of Causal Mechanisms for Randomized Experiments

Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha

A Course in Applied Econometrics Lecture 4: Linear Panel Data Models, II. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Identification, Inference, and Sensitivity Analysis for Causal Mediation Effects

Fixed Effects Models for Panel Data. December 1, 2014

What s New in Econometrics. Lecture 1

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes

Causal Inference Lecture Notes: Selection Bias in Observational Studies

Econometrics of Panel Data

Front-Door Adjustment

Short T Panels - Review

Linear dynamic panel data models

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Least Squares Estimation of a Panel Data Model with Multifactor Error Structure and Endogenous Covariates

Review of Panel Data Model Types Next Steps. Panel GLMs. Department of Political Science and Government Aarhus University.

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Simple Linear Regression

Chapter 2: simple regression model

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE

The Economics of European Regions: Theory, Empirics, and Policy

Econometrics of Panel Data

Causal Interaction in Factorial Experiments: Application to Conjoint Analysis

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63

Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded

Econometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous

An overview of applied econometrics

Causal Inference Basics

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Unpacking the Black Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies

Applied Health Economics (for B.Sc.)

Help! Statistics! Mediation Analysis


IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

1 Estimation of Persistent Dynamic Panel Data. Motivation

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data

10 Panel Data. Andrius Buteikis,

Panel Data Seminar. Discrete Response Models. Crest-Insee. 11 April 2008


Sensitivity checks for the local average treatment effect

A nonparametric test for path dependence in discrete panel data

Applied Microeconometrics Chapter 8 Regression Discontinuity (RD)

A Course in Applied Econometrics Lecture 7: Cluster Sampling. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Difference-in-Differences Estimation

The Supervised Learning Approach To Estimating Heterogeneous Causal Regime Effects

Transcription:

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University Joint work with In Song Kim (MIT) Seminar at University of California, San Diego May 6, 2016 Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 1 / 33

Fixed Effects Regressions in Causal Inference Linear fixed effects regression models are the primary workhorse for causal inference with panel data Researchers use them to adjust for unobserved confounders (omitted variables, endogeneity, selection bias,...): Good instruments are hard to find..., so we d like to have other tools to deal with unobserved confounders. This chapter considers... strategies that use data with a time or cohort dimension to control for unobserved but fixed omitted variables (Angrist & Pischke, Mostly Harmless Econometrics) fixed effects regression can scarcely be faulted for being the bearer of bad tidings (Green et al., Dirty Pool) Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 2 / 33

Motivating Questions 1 What make it possible for fixed effects regression models to adjust for unobserved confounding? 2 Are there any trade-offs when compared to the selection-on-observables approaches such as matching? 3 What are the exact causal assumptions underlying fixed effects regression models? Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 3 / 33

Main Results of the Paper Identify causal assumptions of one-way fixed effects estimators: 1 Treatments do not directly affect future outcomes 2 Outcomes do not directly affect future treatments and future time-varying confounders can be relaxed under the selection-on-observables approach Develop within-unit matching estimators to relax the functional form assumptions of linear fixed effects regression estimators Identify the problem of two-way fixed effects regression models no other observations share the same unit and time Propose simple ways to improve fixed effects estimators using the new matching/weighted fixed effects regression framework Replace the assumptions with the design-based assumptions before-and-after and difference-in-differences designs Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 4 / 33

Linear Regression with Unit Fixed Effects Balanced panel data with N units and T time periods Y it : outcome variable X it : causal or treatment variable of interest Model: Estimator: de-meaning ˆβ FE = arg min β Y it = α i + βx it + ɛ it N i=1 t=1 T {(Y it Y i ) β(x it X i )} 2 where X i and Y i are unit-specific sample means Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 5 / 33

The Standard Assumption Assumption 1 (Strict Exogeneity) E(ɛ it X i, α i ) = 0 where X i is a T 1 vector of treatment variables for unit i U i : a vector of time-invariant unobserved confounders α i = h(u i ) for any function h( ) A flexible way to adjust for unobservables Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 6 / 33

Causal Assumption I Assumption 2 (No carryover effect) Treatments do not directly affect future outcomes Y it (X i1, X i2,..., X i,t 1, X it ) = Y it (X it ) Potential outcome model: Average treatment effect: Y it (x) = α i + βx + ɛ it for x = 0, 1 τ = E(Y it (1) Y it (0) C i = 1) = β where C i = 1{0 < T t=1 X it < T } Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 7 / 33

Causal Directed Acyclic Graph (DAG) Y i1 Y i2 Y i3 X i1 X i2 X i3 U i arrow = direct causal effect absence of arrows causal assumptions Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 8 / 33

Causal Directed Acyclic Graph (DAG) Y i1 Y i2 Y i3 X i1 X i2 X i3 Adding a red dashed arrow violates strict exogeneity U i Nonparametric SEM (Pearl) Y it = g 1 (X it, U i, ɛ it ) X it = g 2 (X i1,..., X i,t 1, U i, η it ) Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 9 / 33

Causal Assumption II What randomized experiment satisfies strict exogeneity? Assumption 3 (Sequential Ignorability with Unobservables) {Y it (1), Y it (0)} T t=1 X i1 U i. {Y it (1), Y it (0)} T t=1 X it X i1,..., X i,t 1, U i. {Y it (1), Y it (0)} T t=1 X it X i1,..., X i,t 1, U i The as-if random assumption without conditioning on the previous outcomes Outcomes can directly affect future outcomes but no need to adjust for past outcomes Nonparametric identification result Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 10 / 33

An Alternative Selection-on-Observables Approach Marginal structural models in epidemiology (Robins) Risk set matching (Rosenbaum) Trade-off: unobserved time-invariant confounders vs. direct effect of outcome on future treatment Y i1 Y i2 Y i3 X i1 X i2 X i3 Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 11 / 33

Within-Unit Matching Estimator Even if these assumptions are satisfied, the the unit fixed effects estimator is inconsistent for the ATE: { ( T t=1 E C X ) } it Y T it t=1 i T (1 X it )Y it p t=1 ˆβ FE X T S 2 it t=1 1 X i it E(C i Si 2 τ ) where S 2 i = T t=1 (X it X i ) 2 /(T 1) is the unit-specific variance The Within-unit matching estimator improves ˆβ FE by relaxing the linearity assumption: ˆτ match = 1 N i=1 C i N i=1 C i ( T t=1 X ity it T t=1 X it T t=1 (1 X it)y it T t=1 (1 X it) ) Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 12 / 33

Constructing a General Matching Estimator M it : matched set for observation (i, t) For the within-unit matching estimator, M(i, t) = {(i, t ) : i = i, X i t = 1 X it} A general matching estimator just introduced: ˆτ match = N i=1 1 T t=1 D it N T D it (Ŷit(1) Ŷit(0)) i=1 t=1 where D it = 1{#M(i, t) > 0} and { Y it if X it = x Ŷ it (x) = 1 #M(i,t) (i,t ) M(i,t) Y i t if X it = 1 x Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 13 / 33

Unit Fixed Effects Estimator as a Matching Estimator de-meaning match with all other observations within the same unit: M(i, t) = {(i, t ) : i = i, t t} mismatch: observations with the same treatment status Unit fixed effects estimator adjusts for mismatches: ˆβ FE = 1 K { N i=1 1 T t=1 D it N T i=1 t=1 where K is the proportion of proper matches ) } D it (Ŷit (1) Ŷit(0) The within-unit matching estimator eliminates all mismatches Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 14 / 33

Matching as a Weighted Unit Fixed Effects Estimator Any within-unit matching estimator can be written as a weighted unit fixed effects estimator with different regression weights The proposed within-matching estimator: ˆβ WFE = arg min β W it = N i=1 t=1 T D it W it {(Y it Y i ) β(x it X i )} 2 where X i and Y i are unit-specific weighted averages, and T T if X t =1 X it = 1, it T T t =1 (1 X it ) if X it = 0. Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 15 / 33

We show how to construct regression weights for different matching estimators (i.e., different matched sets) Idea: count the number of times each observation is used for matching Benefits: computational efficiency model-based standard errors double-robustness matching estimator is consistent even when linear fixed effects regression is the true model specification test (White 1980) null hypothesis: linear fixed effects regression is the true model Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 16 / 33

Before-and-After Design The assumption that outcomes do not directly affect future treatments may not be credible Replace it with the design-based assumption: E(Y it (x) X it = x ) = E(Y i,t 1 (x) X i,t 1 = 1 x ) Average Outcome 0.0 0.2 0.4 0.6 0.8 1.0 control group treatment group counterfactual time t time t+1 Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 17 / 33

This is a matching estimator with the following matched set: M(i, t) = {(i, t ) : i = i, t {t 1, t + 1}, X i t = 1 X it} It is also the first differencing estimator: ˆβ FD = arg min β N i=1 t=2 T {(Y it Y i,t 1 ) β(x it X i,t 1 )} 2 We emphasize that the model and the interpretation of β are exactly as in [the linear fixed effects model]. What differs is our method for estimating β (Wooldridge; italics original). The identification assumptions is very different! Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 18 / 33

Remarks on Other Important Issues 1 Adjusting for observed time-varying confounding Z it Proposes within-unit matching estimators that adjust for Z it Key assumption: outcomes neither directly affect future treatments nor future time-varying confounders 2 Adjusting for past treatments Impossible to adjust for all past treatments within the same unit Researchers must decide the number of past treatments to adjust 3 Adjusting for past outcomes No need to adjust for past outcomes if they do not directly affect future treatments If they do, the strict exogeneity assumption will be violated Past outcomes as instrumental variables (Arellano and Bond) often not credible No free lunch: adjustment for unobservables comes with costs Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 19 / 33

Linear Regression with Unit and Time Fixed Effects Model: Y it = α i + γ t + βx it + ɛ it where γ t flexibly adjusts for a vector of unobserved unit-invariant time effects V t, i.e., γ t = f (V t ) Estimator: ˆβ FE2 = arg min β N i=1 t=1 T {(Y it Y i Y t + Y ) β(x it X i X t + X)} 2 where Y t and X t are time-specific means, and Y and X are overall means Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 20 / 33

Understanding the Two-way Fixed Effects Estimator β FE : bias due to time effects β FEtime : bias due to unit effects β pool : bias due to both time and unit effects ˆβ FE2 = ω FE ˆβ FE + ω FEtime ˆβ FEtime ω pool ˆβ pool w FE + w FEtime w pool with sufficiently large N and T, the weights are given by, ω FE E(S 2 i ) = average unit-specific variance ω FEtime E(S 2 t ) = average time-specific variance ω pool S 2 = overall variance Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 21 / 33

Matching and Two-way Fixed Effects Estimators Problem: No other unit shares the same unit and time Units Time periods 4 3 2 C T C C T T C T T C C C T C C 1 Two kinds of mismatches 1 Same treatment status 2 Neither same unit nor same time T T T C T Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 22 / 33

We Can Never Eliminate Mismatches Units Time periods 4 3 2 1 C T C C T T C T T C C C T C C T T T C T To cancel time and unit effects, we must induce mismatches No weighted two-way fixed effects model eliminates mismatches Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 23 / 33

Difference-in-Differences Design Replace the model-based assumption with the design-based one Parallel trend assumption: E(Y it (0) Y i,t 1 (0) X it = 1, X i,t 1 = 0) = E(Y it (0) Y i,t 1 (0) X it = X i,t 1 = 0) Average Outcome 0.0 0.2 0.4 0.6 0.8 1.0 control group treatment group counterfactual time t time t+1 Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 24 / 33

General DiD = Weighted Two-Way FE Effects 2 2 standard two-way fixed effects estimator works General setting: Multiple time periods, repeated treatments Units Time periods 4 3 2 1 C T C C T T C T T C C C T C C T T T C T Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 25 / 33

Regression weights: Units Time periods 4 3 2 1 0 0 0 0 0 0 0 1 2 0 1 1 2 0 1 1 2 1 2 0 0 0 0 0 Weights can be negative = the method of moments estimator Fast computation is still available Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 26 / 33

Effects of GATT Membership on International Trade 1 Controversy Rose (2004): No effect of GATT membership on trade Tomz et al. (2007): Significant effect with non-member participants 2 The central role of fixed effects models: Rose (2004): one-way (year) fixed effects for dyadic data Tomz et al. (2007): two-way (year and dyad) fixed effects Rose (2005): I follow the profession in placing most confidence in the fixed effects estimators; I have no clear ranking between country-specific and country pair-specific effects. Tomz et al. (2007): We, too, prefer FE estimates over OLS on both theoretical and statistical ground Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 27 / 33

Data and Methods 1 Data Data set from Tomz et al. (2007) Effect of GATT: 1948 1994 162 countries, and 196,207 (dyad-year) observations 2 Year fixed effects model: ln Y it = α t + βx it + δ Z it + ɛ it Y it : trade volume X it : membership (formal/participants) Both vs. At most one Z it : 15 dyad-varying covariates (e.g., log product GDP) 3 Weighted one-way fixed effects model: arg min (α,β,δ) N i=1 t=1 T W it (ln Y it α t βx it δ Z it ) 2 Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 28 / 33

Empirical Results: Formal Membership Dyad with Both Members vs. One or None Member Estimated Effects (log of trade) 0.2 0.1 0.0 0.1 0.2 0.3 0.4 FE WFE FE WFE FD Year Fixed Effects Dyad Fixed Effects Year and Dyad Fixed Effects FE DID Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 29 / 33

Empirical Results Dyad with Both Members vs. One or None Member Estimated Effects (log of trade) 0.2 0.1 0.0 0.1 0.2 0.3 0.4 FE WFE FE WFE FE WFE FD FE WFE Year Fixed Effects Dyad Fixed Effects Year and Dyad Fixed Effects FD FE DID FE DID Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 30 / 33

Concluding Remarks Linear fixed effects models are attractive because they can adjust for unobserved confounders However, this advantage comes at costs Two key causal assumptions: 1 treatments do not directly affect future outcomes 2 outcomes do not directly affect future treatments and future time-varying covariates These assumptions can be relaxed under alternative selection-on-observables approaches Improve fixed effects estimators: 1 Within-unit matching estimator no linearity assumption 2 Design-based assumptions before-and-after, differnece-in-differences 3 All of these can be written as weighted fixed effects regression R package wfe is available at CRAN Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 31 / 33

Send comments and suggestions to: kimai@princeton.edu More information about this and other research: http://imai.princeton.edu Imai (Princeton) and Kim (MIT) Fixed Effects for Causal Inference UCSD (May 6, 2016) 32 / 33