Lecture: Difference-in-Difference (DID)

Similar documents
1 Impact Evaluation: Randomized Controlled Trial (RCT)

Chapter 2: simple regression model

Job Training Partnership Act (JTPA)

14.74 Lecture 10: The returns to human capital: education

Ec1123 Section 7 Instrumental Variables

Econ 1123: Section 2. Review. Binary Regressors. Bivariate. Regression. Omitted Variable Bias

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

EMERGING MARKETS - Lecture 2: Methodology refresher

Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference 1

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

EC402 - Problem Set 3

Treatment Effects. Christopher Taber. September 6, Department of Economics University of Wisconsin-Madison

Lecture 2: Constant Treatment Strategies. Donglin Zeng, Department of Biostatistics, University of North Carolina

Next week Professor Saez will discuss. This week John and I will

Logistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015

Empirical approaches in public economics

Question. Hypothesis testing. Example. Answer: hypothesis. Test: true or not? Question. Average is not the mean! μ average. Random deviation or not?

Empirical Application of Simple Regression (Chapter 2)

Rockefeller College University at Albany

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

Causality and Experiments

Simple Regression Model (Assumptions)

Applied Statistics and Econometrics

Lecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook)

Econometrics in a nutshell: Variation and Identification Linear Regression Model in STATA. Research Methods. Carlos Noton.

Mediation analyses. Advanced Psychometrics Methods in Cognitive Aging Research Workshop. June 6, 2016

Quantitative Economics for the Evaluation of the European Policy

ECON3150/4150 Spring 2015

Regression Analysis. Ordinary Least Squares. The Linear Model

Introduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015

Causality II: How does causal inference fit into public health and what it is the role of statistics?

Differences-in- Differences. November 10 Clair

ECO375 Tutorial 9 2SLS Applications and Endogeneity Tests

Research Design: Causal inference and counterfactuals

An overview of applied econometrics

Empirical Methods in Applied Microeconomics

The Simple Linear Regression Model

PIER Summer 2017 Mediation Basics

Education Production Functions. April 7, 2009

Classification. Classification is similar to regression in that the goal is to use covariates to predict on outcome.

Combining multiple observational data sources to estimate causal eects

Exam ECON5106/9106 Fall 2018

Recall from last time: Conditional probabilities. Lecture 2: Belief (Bayesian) networks. Bayes ball. Example (continued) Example: Inference problem

Math 243 Section 3.1 Introduction to Probability Lab

On the Use of Linear Fixed Effects Regression Models for Causal Inference

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

Review of Panel Data Model Types Next Steps. Panel GLMs. Department of Political Science and Government Aarhus University.

Econ Spring 2016 Section 9

Clustering as a Design Problem

Dynamics in Social Networks and Causality

ECNS 561 Multiple Regression Analysis

Using Instrumental Variables to Find Causal Effects in Public Health

Behavioral Data Mining. Lecture 19 Regression and Causal Effects

STA441: Spring Multiple Regression. More than one explanatory variable at the same time

Principles Underlying Evaluation Estimators

LECTURE 15: SIMPLE LINEAR REGRESSION I

Comparative Politics and the Synthetic Control Method Revisited: A Note on Abadie et al. (2015)

Econometrics of Policy Evaluation (Geneva summer school)

Internal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9.

Tools for causal analysis and philosophical theories of causation. Isabelle Drouet (IHPST)

Applied Econometrics Lecture 1

Now we will define some common sampling plans and discuss their strengths and limitations.

A PRIMER ON LINEAR REGRESSION

Instrumental Variables

Gravity Models and the Armington Assumption

Experiments and Quasi-Experiments

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47

Applied Quantitative Methods II

Describing Associations, Covariance, Correlation, and Causality. Interest Rates and Inflation. Data & Scatter Diagram. Lecture 4

Section 1 : Introduction to the Potential Outcomes Framework. Andrew Bertoli 4 September 2013

Regression and Stats Primer

Problem Set 10: Panel Data

QUADRATICS 3.2 Breaking Symmetry: Factoring

1 A Non-technical Introduction to Regression

Notes 11: OLS Theorems ECO 231W - Undergraduate Econometrics

Econometrics. Week 6. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Intermediate Econometrics

Data Analysis and Statistical Methods Statistics 651

Lecture#12. Instrumental variables regression Causal parameters III

Multiple Linear Regression CIVL 7012/8012

Lecture Module 8. Agenda. 1 Endogeneity. 2 Instrumental Variables. 3 Two-stage least squares. 4 Panel Data: First Differencing

Econometrics I Lecture 7: Dummy Variables

Mid-term exam Practice problems

WISE International Masters

Lecture 1 January 18

Analysing data: regression and correlation S6 and S7

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Contest Quiz 3. Question Sheet. In this quiz we will review concepts of linear regression covered in lecture 2.

Fixed Effects Models for Panel Data. December 1, 2014

A Meta-Analysis of the Urban Wage Premium

Causal Sensitivity Analysis for Decision Trees

Solutions to Odd-Numbered End-of-Chapter Exercises: Chapter 8

Revisiting the Synthetic Control Estimator

( ) P A B : Probability of A given B. Probability that A happens

Panel Data. STAT-S-301 Exercise session 5. November 10th, vary across entities but not over time. could cause omitted variable bias if omitted

Data Analysis and Statistical Methods Statistics 651

1 Motivation for Instrumental Variable (IV) Regression

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

Double slit questions

Transcription:

Lecture: Difference-in-Difference (DID) 1

2 Motivation Q: How to show a new medicine is effective? Naive answer: Give the new medicine to some patients (treatment group), and see what happens This before-and-after comparison is naive because something may happen even without the new medicine. Think about self-healing. Better answer: Give the new medicine to some patients, and compare them to other group of patients that do not get the medicine (control group) It is better because control group shows what would happen to the treatment group in the absence of treatment (counterfactual) This answer is not perfect because the two groups may differ in other ways (confounding factors)

3 Perfect Answers Perfect answer 1: Give the new medicine to a treatment group that is the same as the control group in all other aspects. Here, confounding factors are constant and ceteris paribus holds. Perfect answer 2: Randomly assign the new medicine to treatment and control groups, and compare them. Randomization ensures that the two groups are otherwise comparable. Put differently, there is no systematic difference in confounding factors; the only systematic difference is getting the new medicine.

4 Reality But the reality is we cannot find so many patients that are the same in all other aspects And sometimes it is impossible to implement the randomized controlled trial. Can we randomly assign students to selective schools like Harvard, and evaluate its effect on future earning?

5 DID Based on Observed Data Let D A be the after-treatment dummy (or time dummy), and D T be the treatment dummy. The dependent variable y measures the outcome. Consider the pooled regression that includes D A,D T and their interaction term y = β 0 + β 1 D A + β 2 D T + β 3 (D A D T ) + β 4 x + u (1) 1. β 0 measures the average before outcome for control group 2. β 1 measures (average after outcome - average before outcome ) for control group 3. β 2 measures (the average before outcome for treatment group - average before outcome for control group) 4. β 3 is DID, measuring (the average after outcome for treatment group - average after outcome for control group)-(the average before outcome for treatment group - average before outcome for control group)

Big Picture 1 6

Big Picture 2 7

8 Remarks We can allow for observable difference between the two groups by controlling x We need to assume the unobservable characteristics are the same for the two groups The validity of DID relies upon this crucial assumption!

Heuristic Proof 9 Denote the true causal effect (treatment effect) by β. For the treatment group, we compare the before and after outcomes by running the regression y treatment group = β 0 + β 1 D A + u (2) ˆβ 1 β + bias treatment group (3) where the bias is due to confounding factors. Similarly, we can do the same for the control group y control group = α 0 + α 1 D A + u (4) ˆα 1 0 + bias control group (5) Assuming It follows that bias treatment group = bias control group (6) DID ˆβ 1 ˆα 1 β (7)

10 Bottom Line In order for DID to work, the treatment and control groups need to be as similar as possible. Make sure you do not compare apple to orange They can differ in x for which we have data. The two groups cannot differ in characteristics that are unobservable DID fails if people choose to receive the treatment based on unobserved characteristic. For example, conscious parents may intentionally send their kids to small class (and receive treatment). This can cause self-selection bias and invalidate DID. It is nice to have multiple control groups. Significantly different DID estimates can raise a red flag. Even better, consider the synthetic control method proposed by Alberto Abadie.

(Optional) Synthetic Control Method 1 11 We may consider SCM when we have multiple control groups The basic idea of SCM is to construct just one non-existent combined or synthetic control group, which is a weighted average of the original control groups We want to find optimal weight so that the synthetic control group looks as similar as possible to the treatment group before treatment in terms of observable characteristics For instance, we want to investigate how the 2011 earthquake and tsunami affect Japan (treatment group), and we want to focus on the GDP (outcome). To do so we need to find control groups countries that do not suffer earthquake or tsunami and have similar GDP. Suppose UK and Germany satisfy both conditions, so we have two control groups. The idea of SCM is to combine them into an imaginary country called UG, a weighted average of UK and Germany that mimics Japan to the largest extent. We can construct synthetic control (SC) according to Cobb-Douglas production function Y = K β L α, where Y is GDP, K is capital, L is labor and β,α are input-shares.

(Optional) Synthetic Control Method 2 12 Mathematically, SC is a weighted average of UK and Germany SC = wuk + (1 w)germany We try to find optimal weights (w,1 w) that minimize the squared difference (why not absolute value?) between SC and treatment in terms of K and L (observable characteristics): min w β ( K Japan w K UK (1 w) K Germany) 2 + α ( L Japan w L UK (1 w) L Germany) 2 (8) where K, L denote average (before 2011).We can download a stata package called synth to solve (8). After w is obtained, the estimated treatment effect at period T (after 2011) is GDP Japan T wgdp UK T (1 w)gdp Germany T (9) Exercise: how to modify (8) if we add France (another control) and technology (another input)? What if α and β are unknown?