Regression Discontinuity Designs in Stata

Size: px
Start display at page:

Download "Regression Discontinuity Designs in Stata"

Transcription

1 Regression Discontinuity Designs in Stata Matias D. Cattaneo University of Michigan July 30, 2015

2 Overview Main goal: learn about treatment effect of policy or intervention. If treatment randomization available, easy to estimate treatment effects. If treatment randomization not available, turn to observational studies. Instrumental variables. Selection on observables. Regression discontinuity (RD) designs. Simple and objective. Requires little information, if design available. Might be viewed as a local randomized trial. Easy to falsify, easy to interpret. Careful: very local!

3 Overview of RD packages rdrobust package: estimation, inference and graphical presentation using local polynomials, partitioning, and spacings estimators. rdrobust: RD inference (point estimation and CI; classic, bias-corrected, robust). rdbwselect: bandwidth or window selection (IK, CV, CCT). rdplot: plots data (with optimal block length). rddensity package: discontinuity in density test at cutoff (a.k.a. manipulation testing) using novel local polynomial density estimator. rddensity: manipulation testing using local polynomial density estimation. rdbwdensity: bandwidth or window selection. rdlocrand package: covariate balance, binomial tests, randomization inference methods (window selection & inference). rdrandinf: inference using randomization inference methods. rdwinselect: falsification testing and window selection. rdsensitivity: treatment effect models over grid of windows, CI inversion. rdrbounds: Rosenbaum bounds.

4 Randomized Control Trials Notation: (Y i (0), Y i (1), X i ), i = 1, 2,..., n. Treatment: T i {0, 1}, T i independent of (Y i (0), Y i (1), X i ). Data: (Y i, T i, X i ), i = 1, 2,..., n, with Y i = { Yi (0) if T i = 0 Y i (1) if T i = 1 Average Treatment Effect: τ ATE = E[Y i (1) Y i (0)] = E[Y i T = 1] E[Y i T = 0] Experimental Design.

5 Sharp RD design Notation: (Y i (0), Y i (1), X i ), i = 1, 2,..., n, X i continuous Treatment: T i {0, 1}, T i = 1(X i x). Data: (Y i, T i, X i ), i = 1, 2,..., n, with Y i = { Yi (0) if T i = 0 Y i (1) if T i = 1 Average Treatment Effect at the cutoff: τ SRD = E[Y i (1) Y i (0) X i = x] = lim x x E[Y i X i = x] lim x x E[Y i X i = x] Quasi-Experimental Design: local randomization (more later)

6 Outcome variable (Y) τ Assignment variable (R)

7 Outcome variable (Y) τ Assignment variable (R)

8 Outcome variable (Y) τ Assignment variable (R)

9 Outcome variable (Y) τ 0 Local Random Assignment Assignment variable (R)

10 Empirical Illustration: Cattaneo, Frandsen & Titiunik (2015, JCI) Problem: incumbency advantage (U.S. senate). Data: Y i = election outcome. T i = whether incumbent. X i = vote share previous election ( x = 0). Z i = covariates (demvoteshlag1, demvoteshlag2, dopen, etc.). Potential outcomes: Y i (0) = election outcome if had not been incumbent. Y i (1) = election outcome if had been incumbent. Causal Inference: Y i (0) Y i T i = 0 and Y i (1) Y i T i = 1

11 Graphical and Falsification Methods Always plot data: main advantage of RD designs! Plot regression functions to assess treatment effect and validity. Plot density of X i for assessing validity; test for continuity at cutoff and elsewhere. Important: use also estimators that do not smooth-out data. RD Plots (Calonico, Cattaneo & Titiunik, JASA): Two ingredients: (i) Smoothed global polynomial fit & (ii) binned discontinuous local-means fit. Two goals: (i) detention of discontinuities, & (ii) representation of variability. Two tuning parameters: G lobal p olynom ial degree (k n). L o cation (E S or Q S) and num b er of bins (J n).

12 Manipulation Tests & Covariate Balance and Placebo Tests Density tests near cutoff: Idea: distribution of running variable should be similar at either side of cutoff. Method 1: Histograms & Binomial count test. Method 2: Density Estimator at boundary. Pre-binned local polynomial method M ccrary (2008). N ew tuning-param eter-free m etho d C attan eo, Jan sson an d M a (2015). Placebo tests on pre-determined/exogenous covariates. Idea: zero RD treatment effect for pre-determined/exogenous covariates. Methods: global polynomial, local polynomial, randomization-based. Placebo tests on outcomes. Idea: zero RD treatment effect for outcome at values other than cutoff. Methods: global polynomial, local polynomial, randomization-based.

13 Estimation and Inference Methods Global polynomial approach (not recommended). Robust local polynomial inference methods. Bandwidth selection. Bias-correction. Confidence intervals. Local randomization and randomization inference methods. Window selection. Estimation and Inference methods. Falsification, sensitivity and related methods

14 Conventional Local-polynomial Approach Idea: approximate regression functions for control and treatment units locally. Local-linear estimator (w/ weights K( )): h n X i < x : x X i h n : Y i = α + (X i x) β + ε,i Y i = α + + (X i x) β + + ε +,i Treatment effect (at the cutoff): ˆτ SRD = ˆα + ˆα Can be estimated using linear models (w/ weights K( )): Y i = α + τ SRD T i + (X i x) β 1 + T i (X i x) γ 1 + ε i, h n X i h n Once h n chosen, inference is standard : weighted linear models. Details coming up next.

15 Conventional Local-polynomial Approach How to choose h n? Imbens & Kalyanaraman (2012, ReStud): optimal plug-in, ĥ IK = ĈIK n 1/5 Calonico, Cattaneo & Titiunik (2014, ECMA): refinement of IK ĥ CCT = ĈCCT n 1/5 Ludwig & Miller (2007, QJE): cross-validation, ĥ CV = arg min h n w(x i ) (Y i ˆµ 1 (X i, h)) 2 i=1 Key idea: trade-off bias and variance of ˆτ SRD(h n). Heuristically: Bias(ˆτ SRD) = ĥ and Var(ˆτ SRD) = ĥ

16 Local-Polynomial Methods: Bandwidth Selection Two main methods: plug-in & cross-validation. Both MSE-optimal in some sense. Imbens & Kalyanaraman (2012, ReStud): propose MSE-optimal rule, h MSE = C 1/5 MSE Var(ˆτ SRD) n 1/5 C MSE = C(K) Bias(ˆτ SRD) 2 IK implementation: first-generation plug-in rule. CCT implementation: second-generation plug-in rule. They differ in the way Var(ˆτ SRD) and Bias(ˆτ SRD) are estimated. Imbens & Kalyanaraman (2012, ReStud): discuss cross-validation approach, ĥ CV = arg min h>0 CV δ (h), CV δ (h) = where n 1(X,[δ] X i X +,[δ] ) (Y i ˆµ(X i ; h)) 2, i=1 ˆµ +,p (x; h) and ˆµ,p (x; h) are local polynomials estimates. δ (0, 1), X,[δ] and X +,[δ] denote δ-th quantile of {X i : X i < x} and {X i : X i x}. Our implementation uses δ = 0.5; but this is a tuning parameter!

17 Conventional Approach to RD Local-linear estimator (w/ weights K( )): h n X i < x : x X i h n : Y i = α + (X i x) β + ε,i Y i = α + + (X i x) β + + ε +,i Treatment effect (at the cutoff): ˆτ SRD = ˆα + ˆα Construct usual t-test. For H 0 : τ SRD = 0, ˆT (h n) = ˆτ SRD ˆα + ˆα = ˆVn ˆV+,n + ˆV d N (0, 1),n 95% Confidence interval: [ ] Î(h n) = ˆτ SRD ± 1.96 ˆV n

18 Bias-Correction Approach to RD Note well: for usual t-test, ˆT (h MSE) = ˆτ SRD d N (B, 1) N (0, 1), B > 0 ˆVn Bias B in RD estimator captures curvature of regression functions. Undersmoothing/ Small Bias Approach: Choose smaller h n... Perhaps ĥn = 0.5 ĥik? = Not clear guidance & power loss! Bias-correction Approach: ˆT bc (h n, b n) = ˆτ SRD ˆB n ˆVn d N (0, 1) [ ) = 95% Confidence Interval: Î bc (h n, b n) = (ˆτ SRD ˆB n ] ± 1.96 ˆV n How to choose b n? Same ideas as before... ˆb n = Ĉ n 1/7

19 Robust Bias-Correction Approach to RD Recall: ˆT (h n) = ˆτ SRD bc d N (0, 1) and ˆT (hn, b n) = ˆV ˆτ SRD ˆB n d N (0, 1) n ˆV n ˆBn is constructed to estimate leading bias B. Robust approach: ˆT bc (h n, b n) = ˆτ SRD ˆB n ˆVn = ˆτ SRD Bn + ˆVn Bn ˆB n ˆVn }{{}}{{} d N (0,1) d N (0,γ) Robust bias-corrected t-test: ˆT rbc (h n, b n) = ˆτ SRD ˆB n = ˆV ˆτ SRD ˆB n d N (0, 1) n + Ŵ n ˆV n bc = 95% Confidence Interval: [ ) Î rbc (h n, b n) = (ˆτ SRD ˆB n ] ± 1.96 ˆV n bc, ˆV n bc = ˆV n + Ŵ n

20 Local-Polynomial Methods: Robust Inference Approach 1: Undersmoothing/ Small Bias. [ ] Î(h n) = ˆτ SRD ± 1.96 ˆV n Approach 2: Bias correction (not recommended). [ ) Î bc (h n, b n) = (ˆτ SRD ˆB n ] ± 1.96 ˆV n Approach 3: Robust Bias correction. [ Î rbc (h n, b n) = (ˆτ SRD ˆB ) n ] ± 1.96 ˆV n + Ŵn

21 Local-randomization approach and finite-sample inference Popular approach: local-polynomial methods. Approximates regression function and relies on continuity assumptions. Requires: choosing weights, bandwidth and polynomial order. Alternative approach: local-randomization + randomization-inference Gives an alternative that can be used as a robustness check. Key assumption: exists window W = [ h n, h n] around cutoff ( h n < x < h n) where T i independent of (Y i(0), Y i(1)) (for all X i W ) In words: treatment is randomly assigned within W. Good news: if plausible, then RCT ideas/methods apply. Not-so-good news: most plausible for very small windows (very few observations). One solution: employ small window but use randomization-inference methods. Requires: choosing randomization rule, window and statistic.

22 Local-randomization approach and finite-sample inference Recall key assumption: exists W = [ h n, h n] around cutoff ( h n < x < h n) where T i independent of (Y i (0), Y i (1)) (for all X i W ) How to choose window? Use balance tests on pre-determined/exogenous covariates. Very intuitive, easy to implement. How to conduct inference? Use randomization-inference methods. 1 Choose statistic of interest. E.g., t-stat for difference-in-means. 2 Choose randomization rule. E.g., number of treatments and controls given. 3 Compute finite-sample distribution of statistics by permuting treatment assignments.

23 Local-randomization approach and finite-sample inference Do not forget to validate & falsify the empirical strategy. 1 Plot data to make sure local-randomization is plausible. 2 Conduct placebo tests. (e.g., use pre-intervention outcomes or other covariates not used select W ) 3 Do sensitivity analysis. See Cattaneo, Frandsen and Titiunik (2015) for introduction. See Cattaneo, Titiunik and Vazquez-Bare (2015) for further results and implementation.

Robust Data-Driven Inference in the Regression-Discontinuity Design

Robust Data-Driven Inference in the Regression-Discontinuity Design The Stata Journal (2014) vv, Number ii, pp. 1 36 Robust Data-Driven Inference in the Regression-Discontinuity Design Sebastian Calonico University of Michigan Ann Arbor, MI calonico@umich.edu Matias D.

More information

Robust Data-Driven Inference in the Regression-Discontinuity Design

Robust Data-Driven Inference in the Regression-Discontinuity Design The Stata Journal (2013) vv, Number ii, pp. 1 32 Robust Data-Driven Inference in the Regression-Discontinuity Design Sebastian Calonico University of Michigan Ann Arbor, MI calonico@umich.edu Matias D.

More information

Section 7: Local linear regression (loess) and regression discontinuity designs

Section 7: Local linear regression (loess) and regression discontinuity designs Section 7: Local linear regression (loess) and regression discontinuity designs Yotam Shem-Tov Fall 2015 Yotam Shem-Tov STAT 239/ PS 236A October 26, 2015 1 / 57 Motivation We will focus on local linear

More information

Supplemental Appendix to "Alternative Assumptions to Identify LATE in Fuzzy Regression Discontinuity Designs"

Supplemental Appendix to Alternative Assumptions to Identify LATE in Fuzzy Regression Discontinuity Designs Supplemental Appendix to "Alternative Assumptions to Identify LATE in Fuzzy Regression Discontinuity Designs" Yingying Dong University of California Irvine February 2018 Abstract This document provides

More information

Multidimensional Regression Discontinuity and Regression Kink Designs with Difference-in-Differences

Multidimensional Regression Discontinuity and Regression Kink Designs with Difference-in-Differences Multidimensional Regression Discontinuity and Regression Kink Designs with Difference-in-Differences Rafael P. Ribas University of Amsterdam Stata Conference Chicago, July 28, 2016 Motivation Regression

More information

Regression Discontinuity Designs Using Covariates

Regression Discontinuity Designs Using Covariates Regression Discontinuity Designs Using Covariates Sebastian Calonico Matias D. Cattaneo Max H. Farrell Rocío Titiunik May 25, 2018 We thank the co-editor, Bryan Graham, and three reviewers for comments.

More information

Regression Discontinuity Designs

Regression Discontinuity Designs Regression Discontinuity Designs Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Regression Discontinuity Design Stat186/Gov2002 Fall 2018 1 / 1 Observational

More information

Optimal Bandwidth Choice for Robust Bias Corrected Inference in Regression Discontinuity Designs

Optimal Bandwidth Choice for Robust Bias Corrected Inference in Regression Discontinuity Designs Optimal Bandwidth Choice for Robust Bias Corrected Inference in Regression Discontinuity Designs Sebastian Calonico Matias D. Cattaneo Max H. Farrell September 14, 2018 Abstract Modern empirical work in

More information

Optimal bandwidth selection for the fuzzy regression discontinuity estimator

Optimal bandwidth selection for the fuzzy regression discontinuity estimator Optimal bandwidth selection for the fuzzy regression discontinuity estimator Yoichi Arai Hidehiko Ichimura The Institute for Fiscal Studies Department of Economics, UCL cemmap working paper CWP49/5 Optimal

More information

Michael Lechner Causal Analysis RDD 2014 page 1. Lecture 7. The Regression Discontinuity Design. RDD fuzzy and sharp

Michael Lechner Causal Analysis RDD 2014 page 1. Lecture 7. The Regression Discontinuity Design. RDD fuzzy and sharp page 1 Lecture 7 The Regression Discontinuity Design fuzzy and sharp page 2 Regression Discontinuity Design () Introduction (1) The design is a quasi-experimental design with the defining characteristic

More information

ted: a Stata Command for Testing Stability of Regression Discontinuity Models

ted: a Stata Command for Testing Stability of Regression Discontinuity Models ted: a Stata Command for Testing Stability of Regression Discontinuity Models Giovanni Cerulli IRCrES, Research Institute on Sustainable Economic Growth National Research Council of Italy 2016 Stata Conference

More information

A Practical Introduction to Regression Discontinuity Designs: Part I

A Practical Introduction to Regression Discontinuity Designs: Part I A Practical Introduction to Regression Discontinuity Designs: Part I Matias D. Cattaneo Nicolás Idrobo Rocío Titiunik December 23, 2017 Monograph prepared for Cambridge Elements: Quantitative and Computational

More information

A Practical Introduction to Regression Discontinuity Designs: Volume I

A Practical Introduction to Regression Discontinuity Designs: Volume I A Practical Introduction to Regression Discontinuity Designs: Volume I Matias D. Cattaneo Nicolás Idrobo Rocío Titiunik April 11, 2018 Monograph prepared for Cambridge Elements: Quantitative and Computational

More information

Regression Discontinuity Design

Regression Discontinuity Design Chapter 11 Regression Discontinuity Design 11.1 Introduction The idea in Regression Discontinuity Design (RDD) is to estimate a treatment effect where the treatment is determined by whether as observed

More information

Regression Discontinuity Designs Using Covariates

Regression Discontinuity Designs Using Covariates Regression Discontinuity Designs Using Covariates Sebastian Calonico Matias D. Cattaneo Max H. Farrell Rocío Titiunik March 31, 2016 Abstract We study identification, estimation, and inference in Regression

More information

Simultaneous selection of optimal bandwidths for the sharp regression discontinuity estimator

Simultaneous selection of optimal bandwidths for the sharp regression discontinuity estimator Quantitative Economics 9 (8), 44 48 759-733/844 Simultaneous selection of optimal bandwidths for the sharp regression discontinuity estimator Yoichi Arai School of Social Sciences, Waseda University Hidehiko

More information

Optimal Bandwidth Choice for the Regression Discontinuity Estimator

Optimal Bandwidth Choice for the Regression Discontinuity Estimator Optimal Bandwidth Choice for the Regression Discontinuity Estimator Guido Imbens and Karthik Kalyanaraman First Draft: June 8 This Draft: September Abstract We investigate the choice of the bandwidth for

More information

Bootstrap Confidence Intervals for Sharp Regression Discontinuity Designs with the Uniform Kernel

Bootstrap Confidence Intervals for Sharp Regression Discontinuity Designs with the Uniform Kernel Economics Working Papers 5-1-2016 Working Paper Number 16003 Bootstrap Confidence Intervals for Sharp Regression Discontinuity Designs with the Uniform Kernel Otávio C. Bartalotti Iowa State University,

More information

A Nonparametric Bayesian Methodology for Regression Discontinuity Designs

A Nonparametric Bayesian Methodology for Regression Discontinuity Designs A Nonparametric Bayesian Methodology for Regression Discontinuity Designs arxiv:1704.04858v4 [stat.me] 14 Mar 2018 Zach Branson Department of Statistics, Harvard University Maxime Rischard Department of

More information

Why high-order polynomials should not be used in regression discontinuity designs

Why high-order polynomials should not be used in regression discontinuity designs Why high-order polynomials should not be used in regression discontinuity designs Andrew Gelman Guido Imbens 6 Jul 217 Abstract It is common in regression discontinuity analysis to control for third, fourth,

More information

Optimal Data-Driven Regression Discontinuity Plots. Supplemental Appendix

Optimal Data-Driven Regression Discontinuity Plots. Supplemental Appendix Optimal Data-Driven Regression Discontinuity Plots Supplemental Appendix Sebastian Calonico Matias D. Cattaneo Rocio Titiunik November 25, 2015 Abstract This supplemental appendix contains the proofs of

More information

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. The Sharp RD Design 3.

More information

An Alternative Assumption to Identify LATE in Regression Discontinuity Design

An Alternative Assumption to Identify LATE in Regression Discontinuity Design An Alternative Assumption to Identify LATE in Regression Discontinuity Design Yingying Dong University of California Irvine May 2014 Abstract One key assumption Imbens and Angrist (1994) use to identify

More information

Regression Discontinuity Design Econometric Issues

Regression Discontinuity Design Econometric Issues Regression Discontinuity Design Econometric Issues Brian P. McCall University of Michigan Texas Schools Project, University of Texas, Dallas November 20, 2009 1 Regression Discontinuity Design Introduction

More information

Lecture 10 Regression Discontinuity (and Kink) Design

Lecture 10 Regression Discontinuity (and Kink) Design Lecture 10 Regression Discontinuity (and Kink) Design Economics 2123 George Washington University Instructor: Prof. Ben Williams Introduction Estimation in RDD Identification RDD implementation RDD example

More information

Approximate Permutation Tests and Induced Order Statistics in the Regression Discontinuity Design

Approximate Permutation Tests and Induced Order Statistics in the Regression Discontinuity Design Approximate Permutation Tests and Induced Order Statistics in the Regression Discontinuity Design Ivan A. Canay Department of Economics Northwestern University iacanay@northwestern.edu Vishal Kamat Department

More information

A Simple Adjustment for Bandwidth Snooping

A Simple Adjustment for Bandwidth Snooping A Simple Adjustment for Bandwidth Snooping Timothy B. Armstrong Yale University Michal Kolesár Princeton University June 28, 2017 Abstract Kernel-based estimators such as local polynomial estimators in

More information

Why High-Order Polynomials Should Not Be Used in Regression Discontinuity Designs

Why High-Order Polynomials Should Not Be Used in Regression Discontinuity Designs Why High-Order Polynomials Should Not Be Used in Regression Discontinuity Designs Andrew GELMAN Department of Statistics and Department of Political Science, Columbia University, New York, NY, 10027 (gelman@stat.columbia.edu)

More information

Regression III Regression Discontinuity Designs

Regression III Regression Discontinuity Designs Motivation Regression III Regression Discontinuity Designs Dave Armstrong University of Western Ontario Department of Political Science Department of Statistics and Actuarial Science (by courtesy) e: dave.armstrong@uwo.ca

More information

Regression Discontinuity Designs with a Continuous Treatment

Regression Discontinuity Designs with a Continuous Treatment Regression Discontinuity Designs with a Continuous Treatment Yingying Dong, Ying-Ying Lee, Michael Gou First version: April 17; this version: December 18 Abstract This paper provides identification and

More information

Imbens, Lecture Notes 4, Regression Discontinuity, IEN, Miami, Oct Regression Discontinuity Designs

Imbens, Lecture Notes 4, Regression Discontinuity, IEN, Miami, Oct Regression Discontinuity Designs Imbens, Lecture Notes 4, Regression Discontinuity, IEN, Miami, Oct 10 1 Lectures on Evaluation Methods Guido Imbens Impact Evaluation Network October 2010, Miami Regression Discontinuity Designs 1. Introduction

More information

Difference-in-Differences Methods

Difference-in-Differences Methods Difference-in-Differences Methods Teppei Yamamoto Keio University Introduction to Causal Inference Spring 2016 1 Introduction: A Motivating Example 2 Identification 3 Estimation and Inference 4 Diagnostics

More information

Regression Discontinuity Designs with a Continuous Treatment

Regression Discontinuity Designs with a Continuous Treatment Regression Discontinuity Designs with a Continuous Treatment Yingying Dong, Ying-Ying Lee, Michael Gou This version: March 18 Abstract This paper provides identification and inference theory for the class

More information

Applied Microeconometrics Chapter 8 Regression Discontinuity (RD)

Applied Microeconometrics Chapter 8 Regression Discontinuity (RD) 1 / 26 Applied Microeconometrics Chapter 8 Regression Discontinuity (RD) Romuald Méango and Michele Battisti LMU, SoSe 2016 Overview What is it about? What are its assumptions? What are the main applications?

More information

Regression III Regression Discontinuity Designs

Regression III Regression Discontinuity Designs Motivation Regression III Regression Discontinuity Designs Dave Armstrong University of Western Ontario Department of Political Science Department of Statistics and Actuarial Science (by courtesy) e: dave.armstrong@uwo.ca

More information

A Simple Adjustment for Bandwidth Snooping

A Simple Adjustment for Bandwidth Snooping A Simple Adjustment for Bandwidth Snooping Timothy B. Armstrong Yale University Michal Kolesár Princeton University October 18, 2016 Abstract Kernel-based estimators are often evaluated at multiple bandwidths

More information

What s New in Econometrics. Lecture 1

What s New in Econometrics. Lecture 1 What s New in Econometrics Lecture 1 Estimation of Average Treatment Effects Under Unconfoundedness Guido Imbens NBER Summer Institute, 2007 Outline 1. Introduction 2. Potential Outcomes 3. Estimands and

More information

Regression Discontinuity Designs with a Continuous Treatment

Regression Discontinuity Designs with a Continuous Treatment Regression Discontinuity Designs with a Continuous Treatment Yingying Dong, Ying-Ying Lee, Michael Gou This version: January 8 Abstract This paper provides identification and inference theory for the class

More information

Simple and Honest Confidence Intervals in Nonparametric Regression

Simple and Honest Confidence Intervals in Nonparametric Regression Simple and Honest Confidence Intervals in Nonparametric Regression Timothy B. Armstrong Yale University Michal Kolesár Princeton University June, 206 Abstract We consider the problem of constructing honest

More information

Simultaneous selection of optimal bandwidths for the sharp regression discontinuity estimator

Simultaneous selection of optimal bandwidths for the sharp regression discontinuity estimator Simultaneous selection of optimal bandwidths for the sharp regression discontinuity estimator Yoichi Arai Hidehiko Ichimura The Institute for Fiscal Studies Department of Economics, UCL cemmap working

More information

Local Polynomial Order in Regression Discontinuity Designs 1

Local Polynomial Order in Regression Discontinuity Designs 1 Local Polynomial Order in Regression Discontinuity Designs 1 Zhuan Pei 2 Cornell University and IZA David Card 4 UC Berkeley, NBER and IZA David S. Lee 3 Princeton University and NBER Andrea Weber 5 Central

More information

Imbens/Wooldridge, Lecture Notes 3, NBER, Summer 07 1

Imbens/Wooldridge, Lecture Notes 3, NBER, Summer 07 1 Imbens/Wooldridge, Lecture Notes 3, NBER, Summer 07 1 What s New in Econometrics NBER, Summer 2007 Lecture 3, Monday, July 30th, 2.00-3.00pm R e gr e ssi on D i sc ont i n ui ty D e si gns 1 1. Introduction

More information

Regression Discontinuity Designs with Clustered Data

Regression Discontinuity Designs with Clustered Data Regression Discontinuity Designs with Clustered Data Otávio Bartalotti and Quentin Brummet August 4, 206 Abstract Regression Discontinuity designs have become popular in empirical studies due to their

More information

Regression Discontinuity: Advanced Topics. NYU Wagner Rajeev Dehejia

Regression Discontinuity: Advanced Topics. NYU Wagner Rajeev Dehejia Regression Discontinuity: Advanced Topics NYU Wagner Rajeev Dehejia Summary of RD assumptions The treatment is determined at least in part by the assignment variable There is a discontinuity in the level

More information

Testing Continuity of a Density via g-order statistics in the Regression Discontinuity Design

Testing Continuity of a Density via g-order statistics in the Regression Discontinuity Design Testing Continuity of a Density via g-order statistics in the Regression Discontinuity Design Federico A. Bugni Department of Economics Duke University federico.bugni@duke.edu Ivan A. Canay Department

More information

Simultaneous Selection of Optimal Bandwidths for the Sharp Regression Discontinuity Estimator

Simultaneous Selection of Optimal Bandwidths for the Sharp Regression Discontinuity Estimator GRIPS Discussion Paper 14-03 Simultaneous Selection of Optimal Bandwidths for the Sharp Regression Discontinuity Estimator Yoichi Arai Hidehiko Ichimura April 014 National Graduate Institute for Policy

More information

Optimal Bandwidth Choice for the Regression Discontinuity Estimator

Optimal Bandwidth Choice for the Regression Discontinuity Estimator Optimal Bandwidth Choice for the Regression Discontinuity Estimator Guido Imbens and Karthik Kalyanaraman First Draft: June 8 This Draft: February 9 Abstract We investigate the problem of optimal choice

More information

SIMPLE AND HONEST CONFIDENCE INTERVALS IN NONPARAMETRIC REGRESSION. Timothy B. Armstrong and Michal Kolesár. June 2016 Revised March 2018

SIMPLE AND HONEST CONFIDENCE INTERVALS IN NONPARAMETRIC REGRESSION. Timothy B. Armstrong and Michal Kolesár. June 2016 Revised March 2018 SIMPLE AND HONEST CONFIDENCE INTERVALS IN NONPARAMETRIC REGRESSION By Timothy B. Armstrong and Michal Kolesár June 2016 Revised March 2018 COWLES FOUNDATION DISCUSSION PAPER NO. 2044R2 COWLES FOUNDATION

More information

Optimal bandwidth choice for the regression discontinuity estimator

Optimal bandwidth choice for the regression discontinuity estimator Optimal bandwidth choice for the regression discontinuity estimator Guido Imbens Karthik Kalyanaraman The Institute for Fiscal Studies Department of Economics, UCL cemmap working paper CWP5/ Optimal Bandwidth

More information

Regression Discontinuity

Regression Discontinuity Regression Discontinuity Christopher Taber Department of Economics University of Wisconsin-Madison October 16, 2018 I will describe the basic ideas of RD, but ignore many of the details Good references

More information

Inference in Regression Discontinuity Designs with a Discrete Running Variable

Inference in Regression Discontinuity Designs with a Discrete Running Variable Inference in Regression Discontinuity Designs with a Discrete Running Variable Michal Kolesár Christoph Rothe arxiv:1606.04086v4 [stat.ap] 18 Nov 2017 November 21, 2017 Abstract We consider inference in

More information

Causal Inference with Big Data Sets

Causal Inference with Big Data Sets Causal Inference with Big Data Sets Marcelo Coca Perraillon University of Colorado AMC November 2016 1 / 1 Outlone Outline Big data Causal inference in economics and statistics Regression discontinuity

More information

Testing continuity of a density via g-order statistics in the regression discontinuity design

Testing continuity of a density via g-order statistics in the regression discontinuity design Testing continuity of a density via g-order statistics in the regression discontinuity design Federico A. Bugni Ivan A. Canay The Institute for Fiscal Studies Department of Economics, UCL cemmap working

More information

OPTIMAL INFERENCE IN A CLASS OF REGRESSION MODELS. Timothy B. Armstrong and Michal Kolesár. May 2016 Revised May 2017

OPTIMAL INFERENCE IN A CLASS OF REGRESSION MODELS. Timothy B. Armstrong and Michal Kolesár. May 2016 Revised May 2017 OPTIMAL INFERENCE IN A CLASS OF REGRESSION MODELS By Timothy B. Armstrong and Michal Kolesár May 2016 Revised May 2017 COWLES FOUNDATION DISCUSSION PAPER NO. 2043R COWLES FOUNDATION FOR RESEARCH IN ECONOMICS

More information

Bayesian nonparametric predictive approaches for causal inference: Regression Discontinuity Methods

Bayesian nonparametric predictive approaches for causal inference: Regression Discontinuity Methods Bayesian nonparametric predictive approaches for causal inference: Regression Discontinuity Methods George Karabatsos University of Illinois-Chicago ERCIM Conference, 14-16 December, 2013 Senate House,

More information

The Stata Journal. Editors H. Joseph Newton Department of Statistics Texas A&M University College Station, Texas

The Stata Journal. Editors H. Joseph Newton Department of Statistics Texas A&M University College Station, Texas The Stata Journal Editors H. Joseph Newton Department of Statistics Texas A&M University College Station, Texas editors@stata-journal.com Nicholas J. Cox Department of Geography Durham University Durham,

More information

APermutationTestfortheRegressionKinkDesign. Peter Ganong and Simon Jäger. March 25, 2015

APermutationTestfortheRegressionKinkDesign. Peter Ganong and Simon Jäger. March 25, 2015 APermutationTestfortheRegressionKinkDesign Peter Ganong and Simon Jäger March 25, 2015 Abstract The Regression Kink (RK) design is an increasingly popular empirical method for causal inference. Analogous

More information

Finding Instrumental Variables: Identification Strategies. Amine Ouazad Ass. Professor of Economics

Finding Instrumental Variables: Identification Strategies. Amine Ouazad Ass. Professor of Economics Finding Instrumental Variables: Identification Strategies Amine Ouazad Ass. Professor of Economics Outline 1. Before/After 2. Difference-in-difference estimation 3. Regression Discontinuity Design BEFORE/AFTER

More information

41903: Introduction to Nonparametrics

41903: Introduction to Nonparametrics 41903: Notes 5 Introduction Nonparametrics fundamentally about fitting flexible models: want model that is flexible enough to accommodate important patterns but not so flexible it overspecializes to specific

More information

An Alternative Assumption to Identify LATE in Regression Discontinuity Designs

An Alternative Assumption to Identify LATE in Regression Discontinuity Designs An Alternative Assumption to Identify LATE in Regression Discontinuity Designs Yingying Dong University of California Irvine September 2014 Abstract One key assumption Imbens and Angrist (1994) use to

More information

The Economics of European Regions: Theory, Empirics, and Policy

The Economics of European Regions: Theory, Empirics, and Policy The Economics of European Regions: Theory, Empirics, and Policy Dipartimento di Economia e Management Davide Fiaschi Angela Parenti 1 1 davide.fiaschi@unipi.it, and aparenti@ec.unipi.it. Fiaschi-Parenti

More information

Optimal bandwidth selection for differences of nonparametric estimators with an application to the sharp regression discontinuity design

Optimal bandwidth selection for differences of nonparametric estimators with an application to the sharp regression discontinuity design Optimal bandwidth selection for differences of nonparametric estimators with an application to the sharp regression discontinuity design Yoichi Arai Hidehiko Ichimura The Institute for Fiscal Studies Department

More information

Regression Discontinuity Designs.

Regression Discontinuity Designs. Regression Discontinuity Designs. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 31/10/2017 I. Brunetti Labour Economics in an European Perspective 31/10/2017 1 / 36 Introduction

More information

Regression Discontinuity Designs Using Covariates: Supplemental Appendix

Regression Discontinuity Designs Using Covariates: Supplemental Appendix Regression Discontinuity Designs Using Covariates: Supplemental Appendix Sebastian Calonico Matias D. Cattaneo Max H. Farrell Rocio Titiunik May 25, 208 Abstract This supplemental appendix contains the

More information

Regression Discontinuity

Regression Discontinuity Regression Discontinuity Christopher Taber Department of Economics University of Wisconsin-Madison October 24, 2017 I will describe the basic ideas of RD, but ignore many of the details Good references

More information

Nonparametric Inference via Bootstrapping the Debiased Estimator

Nonparametric Inference via Bootstrapping the Debiased Estimator Nonparametric Inference via Bootstrapping the Debiased Estimator Yen-Chi Chen Department of Statistics, University of Washington ICSA-Canada Chapter Symposium 2017 1 / 21 Problem Setup Let X 1,, X n be

More information

Confidence intervals for kernel density estimation

Confidence intervals for kernel density estimation Stata User Group - 9th UK meeting - 19/20 May 2003 Confidence intervals for kernel density estimation Carlo Fiorio c.fiorio@lse.ac.uk London School of Economics and STICERD Stata User Group - 9th UK meeting

More information

On the Effect of Bias Estimation on Coverage Accuracy in Nonparametric Inference

On the Effect of Bias Estimation on Coverage Accuracy in Nonparametric Inference On the Effect of Bias Estimation on Coverage Accuracy in Nonparametric Inference Sebastian Calonico Matias D. Cattaneo Max H. Farrell November 4, 014 PRELIMINARY AND INCOMPLETE COMMENTS WELCOME Abstract

More information

A Permutation Test for the Regression Kink. Design

A Permutation Test for the Regression Kink. Design A Permutation Test for the Regression Kink Design Peter Ganong and Simon Jäger March 24, 2017 The Regression Kink (RK) design is an increasingly popular empirical method for estimating causal e ects of

More information

Clustering as a Design Problem

Clustering as a Design Problem Clustering as a Design Problem Alberto Abadie, Susan Athey, Guido Imbens, & Jeffrey Wooldridge Harvard-MIT Econometrics Seminar Cambridge, February 4, 2016 Adjusting standard errors for clustering is common

More information

Kernel-based density. Nuno Vasconcelos ECE Department, UCSD

Kernel-based density. Nuno Vasconcelos ECE Department, UCSD Kernel-based density estimation Nuno Vasconcelos ECE Department, UCSD Announcement last week of classes we will have Cheetah Day (exact day TBA) what: 4 teams of 6 people each team will write a report

More information

Selection on Observables: Propensity Score Matching.

Selection on Observables: Propensity Score Matching. Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017

More information

APermutationTestfortheRegression. Kink Design

APermutationTestfortheRegression. Kink Design APermutationTestfortheRegression Kink Design Peter Ganong and Simon Jäger The Regression Kink (RK) design is an increasingly popular empirical method for estimating causal effects of policies, such as

More information

Applied Statistics Lecture Notes

Applied Statistics Lecture Notes Applied Statistics Lecture Notes Kosuke Imai Department of Politics Princeton University February 2, 2008 Making statistical inferences means to learn about what you do not observe, which is called parameters,

More information

Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design

Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design 1 / 32 Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design Changbao Wu Department of Statistics and Actuarial Science University of Waterloo (Joint work with Min Chen and Mary

More information

A Practitioner s Guide to Cluster-Robust Inference

A Practitioner s Guide to Cluster-Robust Inference A Practitioner s Guide to Cluster-Robust Inference A. C. Cameron and D. L. Miller presented by Federico Curci March 4, 2015 Cameron Miller Cluster Clinic II March 4, 2015 1 / 20 In the previous episode

More information

SIMPLE AND HONEST CONFIDENCE INTERVALS IN NONPARAMETRIC REGRESSION. Timothy B. Armstrong and Michal Kolesár. June 2016 Revised October 2016

SIMPLE AND HONEST CONFIDENCE INTERVALS IN NONPARAMETRIC REGRESSION. Timothy B. Armstrong and Michal Kolesár. June 2016 Revised October 2016 SIMPLE AND HONEST CONFIDENCE INTERVALS IN NONPARAMETRIC REGRESSION By Timothy B. Armstrong and Michal Kolesár June 2016 Revised October 2016 COWLES FOUNDATION DISCUSSION PAPER NO. 2044R COWLES FOUNDATION

More information

Lecture 10 Multiple Linear Regression

Lecture 10 Multiple Linear Regression Lecture 10 Multiple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: 6.1-6.5 10-1 Topic Overview Multiple Linear Regression Model 10-2 Data for Multiple Regression Y i is the response variable

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

Propensity Score Methods for Causal Inference

Propensity Score Methods for Causal Inference John Pura BIOS790 October 2, 2015 Causal inference Philosophical problem, statistical solution Important in various disciplines (e.g. Koch s postulates, Bradford Hill criteria, Granger causality) Good

More information

Gov 2002: 5. Matching

Gov 2002: 5. Matching Gov 2002: 5. Matching Matthew Blackwell October 1, 2015 Where are we? Where are we going? Discussed randomized experiments, started talking about observational data. Last week: no unmeasured confounders

More information

Regression Discontinuity Design

Regression Discontinuity Design Regression Discontinuity Design Marcelo Coca Perraillon University of Chicago May 13 & 18, 2015 1 / 51 Introduction Plan Overview of RDD Meaning and validity of RDD Several examples from the literature

More information

A Simple Adjustment for Bandwidth Snooping

A Simple Adjustment for Bandwidth Snooping Review of Economic Studies (207) 0, 35 0034-6527/7/0000000$02.00 c 207 The Review of Economic Studies Limited A Simple Adjustment for Bandwidth Snooping TIMOTHY B. ARMSTRONG Yale University E-mail: timothy.armstrong@yale.edu

More information

Statistical Models for Causal Analysis

Statistical Models for Causal Analysis Statistical Models for Causal Analysis Teppei Yamamoto Keio University Introduction to Causal Inference Spring 2016 Three Modes of Statistical Inference 1. Descriptive Inference: summarizing and exploring

More information

Implementing Matching Estimators for. Average Treatment Effects in STATA. Guido W. Imbens - Harvard University Stata User Group Meeting, Boston

Implementing Matching Estimators for. Average Treatment Effects in STATA. Guido W. Imbens - Harvard University Stata User Group Meeting, Boston Implementing Matching Estimators for Average Treatment Effects in STATA Guido W. Imbens - Harvard University Stata User Group Meeting, Boston July 26th, 2006 General Motivation Estimation of average effect

More information

Inference for Average Treatment Effects

Inference for Average Treatment Effects Inference for Average Treatment Effects Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Average Treatment Effects Stat186/Gov2002 Fall 2018 1 / 15 Social

More information

Minimum Contrast Empirical Likelihood Manipulation. Testing for Regression Discontinuity Design

Minimum Contrast Empirical Likelihood Manipulation. Testing for Regression Discontinuity Design Minimum Contrast Empirical Likelihood Manipulation Testing for Regression Discontinuity Design Jun Ma School of Economics Renmin University of China Hugo Jales Department of Economics Syracuse University

More information

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai Weighting Methods Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Weighting Methods Stat186/Gov2002 Fall 2018 1 / 13 Motivation Matching methods for improving

More information

Robust Inference in Fuzzy Regression Discontinuity Designs

Robust Inference in Fuzzy Regression Discontinuity Designs Robust Inference in Fuzzy Regression Discontinuity Designs Yang He November 2, 2017 Abstract Fuzzy regression discontinuity (RD) design and instrumental variable(s) (IV) regression share similar identification

More information

Optimal inference in a class of regression models

Optimal inference in a class of regression models Optimal inference in a class of regression models arxiv:1511.06028v4 [math.st] 22 Nov 2017 Timothy B. Armstrong Michal Kolesár Yale University Princeton University November 23, 2017 Abstract We consider

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University

More information

Is My Matched Dataset As-If Randomized, More, Or Less? Unifying the Design and. Analysis of Observational Studies

Is My Matched Dataset As-If Randomized, More, Or Less? Unifying the Design and. Analysis of Observational Studies Is My Matched Dataset As-If Randomized, More, Or Less? Unifying the Design and arxiv:1804.08760v1 [stat.me] 23 Apr 2018 Analysis of Observational Studies Zach Branson Department of Statistics, Harvard

More information

ESTIMATION OF TREATMENT EFFECTS VIA MATCHING

ESTIMATION OF TREATMENT EFFECTS VIA MATCHING ESTIMATION OF TREATMENT EFFECTS VIA MATCHING AAEC 56 INSTRUCTOR: KLAUS MOELTNER Textbooks: R scripts: Wooldridge (00), Ch.; Greene (0), Ch.9; Angrist and Pischke (00), Ch. 3 mod5s3 General Approach The

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University Joint

More information

Gov 2002: 3. Randomization Inference

Gov 2002: 3. Randomization Inference Gov 2002: 3. Randomization Inference Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last week: This week: What can we identify using randomization? Estimators were justified via

More information

EXAMINATION: QUANTITATIVE EMPIRICAL METHODS. Yale University. Department of Political Science

EXAMINATION: QUANTITATIVE EMPIRICAL METHODS. Yale University. Department of Political Science EXAMINATION: QUANTITATIVE EMPIRICAL METHODS Yale University Department of Political Science January 2014 You have seven hours (and fifteen minutes) to complete the exam. You can use the points assigned

More information

The regression discontinuity (RD) design has become

The regression discontinuity (RD) design has become Interpreting Regression Discontinuity Designs with Multiple Cutoffs Matias D. Cattaneo, University of Michigan Luke Keele, Penn State University Rocío Titiunik, University of Michigan Gonzalo Vazquez-Bare,

More information

Unpacking the Black-Box: Learning about Causal Mechanisms from Experimental and Observational Studies

Unpacking the Black-Box: Learning about Causal Mechanisms from Experimental and Observational Studies Unpacking the Black-Box: Learning about Causal Mechanisms from Experimental and Observational Studies Kosuke Imai Princeton University Joint work with Keele (Ohio State), Tingley (Harvard), Yamamoto (Princeton)

More information

Density estimation Nonparametric conditional mean estimation Semiparametric conditional mean estimation. Nonparametrics. Gabriel Montes-Rojas

Density estimation Nonparametric conditional mean estimation Semiparametric conditional mean estimation. Nonparametrics. Gabriel Montes-Rojas 0 0 5 Motivation: Regression discontinuity (Angrist&Pischke) Outcome.5 1 1.5 A. Linear E[Y 0i X i] 0.2.4.6.8 1 X Outcome.5 1 1.5 B. Nonlinear E[Y 0i X i] i 0.2.4.6.8 1 X utcome.5 1 1.5 C. Nonlinearity

More information

Estimation of the Conditional Variance in Paired Experiments

Estimation of the Conditional Variance in Paired Experiments Estimation of the Conditional Variance in Paired Experiments Alberto Abadie & Guido W. Imbens Harvard University and BER June 008 Abstract In paired randomized experiments units are grouped in pairs, often

More information