Empirical Methods in Applied Economics Lecture Notes

Similar documents
Regression Discontinuity

Applied Microeconometrics Chapter 8 Regression Discontinuity (RD)

Regression Discontinuity

Regression Discontinuity Designs

Regression Discontinuity

Regression Discontinuity Designs.

Regression Discontinuity

Regression Discontinuity Design Econometric Issues

Empirical Methods in Applied Microeconomics

Online Appendix to The Political Economy of the U.S. Mortgage Default Crisis Not For Publication

Econometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous

Michael Lechner Causal Analysis RDD 2014 page 1. Lecture 7. The Regression Discontinuity Design. RDD fuzzy and sharp

Supplemental Appendix to "Alternative Assumptions to Identify LATE in Fuzzy Regression Discontinuity Designs"

1 Fixed E ects and Random E ects

The Economics of European Regions: Theory, Empirics, and Policy

Empirical Methods in Applied Economics

An Alternative Assumption to Identify LATE in Regression Discontinuity Designs

Lecture 1: Introduction to Regression Discontinuity Designs in Economics

ECONOMETRICS II (ECO 2401) Victor Aguirregabiria. Spring 2018 TOPIC 4: INTRODUCTION TO THE EVALUATION OF TREATMENT EFFECTS

Estimating the Effect of Financial Aid Offers on College Enrollment: A Regression Discontinuity Approach. Wilbert van der Klaaw

On the level of public good provision in games of redistributive politics

Chapter 2 Regression Discontinuity Design: When Series Interrupt

NBER WORKING PAPER SERIES REGRESSION DISCONTINUITY DESIGNS: A GUIDE TO PRACTICE. Guido Imbens Thomas Lemieux

NOWCASTING THE OBAMA VOTE: PROXY MODELS FOR 2012

Quantitative Economics for the Evaluation of the European Policy

An Alternative Assumption to Identify LATE in Regression Discontinuity Design

Addressing Analysis Issues REGRESSION-DISCONTINUITY (RD) DESIGN

Instrumental Variables. Ethan Kaplan

Prediction and causal inference, in a nutshell

The returns to schooling, ability bias, and regression

The regression discontinuity (RD) design has become

Statistical Models for Causal Analysis

Lecture Notes Part 4: Nonlinear Models

Lecture 10 Regression Discontinuity (and Kink) Design

Finding Instrumental Variables: Identification Strategies. Amine Ouazad Ass. Professor of Economics

Regression Discontinuity Designs in Economics

Identifying the Effect of Changing the Policy Threshold in Regression Discontinuity Models

The Causal Inference Problem and the Rubin Causal Model

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

Solutions: Monday, October 22

CORRELATION AND REGRESSION

Why high-order polynomials should not be used in regression discontinuity designs

Lecture Notes Part 7: Systems of Equations

Regression discontinuity design with covariates

Imbens, Lecture Notes 4, Regression Discontinuity, IEN, Miami, Oct Regression Discontinuity Designs

Empirical approaches in public economics

Section 7: Local linear regression (loess) and regression discontinuity designs

ted: a Stata Command for Testing Stability of Regression Discontinuity Models

QUANTILE REGRESSION DISCONTINUITY: ESTIMATING THE EFFECT OF CLASS SIZE ON SCHOLASTIC ACHIEVEMENT

Economics 620, Lecture 19: Introduction to Nonparametric and Semiparametric Estimation

Education Production Functions. April 7, 2009

Imbens/Wooldridge, Lecture Notes 3, NBER, Summer 07 1

LECTURE 2: SIMPLE REGRESSION I

PSC 504: Differences-in-differeces estimators

29 Linear Programming

PSC 504: Instrumental Variables

Social Choice and Networks

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Continuity. MATH 161 Calculus I. J. Robert Buchanan. Fall Department of Mathematics

Gov 2002: 4. Observational Studies and Confounding

October 11, Keywords: Regression Discontinuity Design, Permutation Test, Induced Order Statistics, R.

Determine whether the formula determines y as a function of x. If not, explain. Is there a way to look at a graph and determine if it's a function?

Introduction to Statistical Inference

Inference in Regression Discontinuity Designs with a Discrete Running Variable

ECONOMET RICS P RELIM EXAM August 24, 2010 Department of Economics, Michigan State University

Econometrics II. Nonstandard Standard Error Issues: A Guide for the. Practitioner

Regression Discontinuity: Advanced Topics. NYU Wagner Rajeev Dehejia

Two-way fixed effects estimators with heterogeneous treatment effects

Four Parameters of Interest in the Evaluation. of Social Programs. James J. Heckman Justin L. Tobias Edward Vytlacil

Problem Set #6: OLS. Economics 835: Econometrics. Fall 2012

The Importance of the Median Voter

Variables and Variable De nitions

Probability Models of Information Exchange on Networks Lecture 1

Math 261 Exercise sheet 5

Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models

1.2 Functions and Their Properties Name:

Regression Discontinuity Design on Model Schools Value-Added Effects: Empirical Evidence from Rural Beijing

A Practical Introduction to Regression Discontinuity Designs: Volume I

Simple Estimators for Semiparametric Multinomial Choice Models

Regression Discontinuity Design

Recitation Notes 6. Konrad Menzel. October 22, 2006

Supporting Information. Controlling the Airwaves: Incumbency Advantage and. Community Radio in Brazil

Causal Inference Basics

Why High-Order Polynomials Should Not Be Used in Regression Discontinuity Designs

Class business PS is due Wed. Lecture 20 (QPM 2016) Multivariate Regression November 14, / 44

1 A Non-technical Introduction to Regression

Functional Analysis Exercise Class

Causal Inference with Big Data Sets

UNIVERSITY OF MARYLAND Department of Economics Economics 754 Topics in Political Economy Fall 2005 Allan Drazen. Exercise Set I

Behavioral Data Mining. Lecture 19 Regression and Causal Effects

Nonparametric Identi cation of Regression Models Containing a Misclassi ed Dichotomous Regressor Without Instruments

Function Approximation

Chapter (7) Continuous Probability Distributions Examples Normal probability distribution

Regression Discontinuity in Serial Dictatorship: Achievement Effects at Chicago s Exam Schools

Last week we looked at limits generally, and at finding limits using substitution.

Unit 2 Maths Methods (CAS) Exam

Chapter 12: Model Specification and Development

Introduction to Econometrics

Single-Equation GMM: Endogeneity Bias

dqd: A command for treatment effect estimation under alternative assumptions

Transcription:

Empirical Methods in Applied Economics Lecture Notes Jörn-Ste en Pischke LSE October 2005 1 Regression Discontinuity Design 1.1 Basics and the Sharp Design The basic idea of the regression discontinuity design is extremely simple. There is a variable x, which is correlated with both the treatment and the outcome. However, the relationship between the variable x and the treatment is very speci c. There is a point x 0, and 1 if x x0 D = : 0 if x < x 0 Hence, there is a cuto point for x, and for individuals with x above the cuto point, treatment takes place, while for individuals with x below the cuto point there is no treatment. Figure 1 illustrates the situation. Elections are a good example of a situation which creates a regression discontinuity design. In this case, D denotes the candidate who is elected, x is the vote share for the candidate, and y is an outcome in uenced by the candidate. A candidate wins the election by receiving more than 50 percent of the vote. The counterfactual outcome, on the other changes continuously with the vote share, so it is going to be very similar for a vote share of 49 percent or 51 percent, although a di erent candidate is elected. Hence, the identifying assumption is going to be again E(y 0 jd; x) = E(y 0 jx). The obvious idea is to use the change in the outcome for data points near the cuto point as an estimate of the treatment e ect. Let s formalize these ideas. The basic assumption is that E(y 0 jx) is a smooth function. 1

Figure 1: Regression Discontinuity Design This suggests two strategies. Let be a small postitive number. Then and hence E (yjx 0 < x < x 0 ) ' E(y 0 jx 0 ) E (yjx 0 x < x 0 + ) ' E(y 1 jx 0 ) E (yjx 0 x < x 0 + ) E (yjx 0 < x < x 0 ) ' : This says that if we take a small sample around the point of discontinuity, then comparing the mean outcome to the left and to the right of the point of discontinuity gives us an estimate of the treatment e ect. The second strategy to estimate the treatment e ect exploits a bigger sample, and models E(y 0 jx) parametrically. Let E(y 0 jx) = f(x), where f() might be a low order polynomial. Then y 0i = f(x i ) + " i y 1i = y 0i + y i = f(x i ) + D i + " i (1) = f(x i ) + 1(x i x 0 ) + " i (2) and running this regression will identify the treatment e ect. It is obvious that the function f() has to be continuous at x 0, to avoid collinearity in the model. If f() is being modelled exibly enough, the two approaches use very similar variation in the data. With a exible f(), data points far 2

from x 0 are not going to contribute to the estimation in either strategy. Of course, the two strategies can be combined: we might want to limit the sample but even allow some parametric control for x in this smaller sample. In addition, sometimes it is useful to allow for di erent functional forms to the left and to the right of the disconinuity, i.e. y i = f l (x i )1(x i < x 0 ) + f r (x i )1(x i x 0 ) + 1(x i x 0 ) + " i In this speci cation, it is important to specify the two functions f l () and f r () so that f l (x 0 ) = f r (x 0 ) holds, otherwise would not capture the jump at the discontinuity correctly. 1.2 The Fuzzy Regression Discontiunity Design The design described so far is called the sharp regression discontinuity design, because D changes discretely at the point x 0. This is, however, not necessary for the regression discontinuity design. It is enough that the probability of treatment assignment changes discretely at x 0, i.e. lim!0 P (D = 1jx 0 ) < P (D = 1jx 0 + ). Now, there will be both treated or untreated observations on either side of the point of discontinuity. Hence, E (yjx 0 x < x 0 + ) E (yjx 0 < x < x 0 ) ' [E (Djx 0 x < x 0 + ) E (Djx 0 < x < x 0 )] E (yjx 0 x < x 0 + ) E (yjx 0 < x < x 0 ) E (Djx 0 x < x 0 + ) E (Djx 0 < x < x 0 ) ' : It is easy to see that this is the Wald estimator, with the indicator 1(x i x 0 ) as the instrumental variable. Similarly, we can estimate equation (1) by instrumental variables with 1(x i x 0 ) as the instrument. (1) and (2) are now no longer identical. (1) is the structural equation, while (2) is the reduced form. 1.3 Examples An example of the sharp regression discontinuity design is the study by Lee (2003) of the advantage of incumbency on re-election probabilities. Incumbent politicians may have resources at their disposal, which help them in gaining re-election compared to challengers not holding the same political o ce. On the other hand, incumbent politicians have been elected in the 3

past, and hence may also simply be prefered by voters. The idea of the Lee study is that voters preferences should be well proxied by a politicians vote share in the past election. Hence, comparing politicians who just won a past election (and hence became incumbent politicians for a future election) to those who just lost a past election (and hence do not hold the political o ce) can give an estimate of the incumbancy e ect. This is a regression discontinuity setup: the vote share in the past election is the variable x, which captures the confounding variable on voters preferences. The treatment is victory in the past election (and hence incumbancy) and the outcome is the probability of winning a future election. Figure 2a in Lee (2003) illustrates the results of this exercise for elections to the US House of Representatives. The top gure plots the probability of winning the election in t + 1 against the vote share in election t. Future election probabilities are a function of the past vote share, but there is a large discrete jump at the point where the candidate wins the election in the past. The gure shows that incumbancy raises the re-election probability by about 35 percentage points. Figure 2b checks the identi cation strategy by looking at the number of past election victories before election t as the outcome. This should not be a ected by winning the election t, and this is indeed born out by the data. An example of a fuzzy regression discontinuity design with a continuous treatment variable is the study of the e ect of class size on student performance by Angrist and Lavy (1999). There study uses the fact that class size in Israeli schools is capped at 40. Once enrollment reaches 41, two classes will be formed, three classes at 81 and so on (this rule goes back to the medieval talmudic scholar Maimonides). Hence, class size is a function of enrollment, with discontinuties at multiples of 40. Of course, enrollment may correlate with student performance indepently of its e ect on class size. For example, bigger schools are more likely to be in urban areas with a better intake of students. Bigger schools may also o er more economies of scale, and hence make better use of resources. It is therefore important to be able to control for enrollment directly. Figure 1 in Angrist and Lavy (1999) plots actual class sizes against enrollment, and also shows predicted class sizes from Maimonides rule. The class size patterns roughly follow Maimonides rule, although the t is not exact because sometimes schools split classes before reaching the maximum size of 40. However, there are clearly visible declines in class sizes at enrollment levels of 40, 80, and 120. Hence, the regression discontinuity design here is fuzzy, rather than sharp, and the correct empirical implementation is to instrument class size with Maimonides rule. 4

Figure 2 plots predicted class size against average reading scores by enrollment. It is obvious that there is an inverse saw tooth pattern, suggesting that smaller classes may be good for performance. It also demonstrates that students in larger schools do better on average. Table 2 shows OLS estimates. It demonstrates that there is a positive correlation between class size and test scores in the raw data. This correlation vanishes when the fraction disadvantaged students is controlled for. Table 4 shows the IV results, explointing the regression discontinuities created by Maimonides rule. The table displays various speci cations with no, linear, quadratic, and piecewise linear controls for enrollment, as well as estimates in subsamples around the discontinuity points. Controlling for enrollment is important, particularly for the math test scores. The form of the control matters less. On the other hand, the discontinuity samples give larger e ects (in absolute values) than the full sample, which is less comforting. 5

Lee 2005: Figure 2a Lee 2005: Figure 2b

Angrist and Lavy 1999: Figure 1

Angrist and Lavy 1999: Figure 2

Angrist and Lavy 1999: Table 2 Angrist and Lavy 1999: Table 4