Two-way fixed effects estimators with heterogeneous treatment effects

Size: px
Start display at page:

Download "Two-way fixed effects estimators with heterogeneous treatment effects"

Transcription

1 Two-way fixed effects estimators with heterogeneous treatment effects Clément de Chaisemartin UC Santa Barbara Xavier D Haultfœuille CREST USC-INET workshop on Evidence to Inform Economic Policy

2 are very pervasive 20% of empirical papers published by AER in estimate treatment effects (TE) using regs with period and group fixed effects. If common trends assumption holds, and if TE of all units is equal to a constant, then β, coeff. of treatment in those reg., is equal to. Often implausible that TE constant: TE may vary over time and across individuals. E.g.: returns to schooling presumably heterogeneous across people, and some evidence that are changing over time. What do those regressions identify if TE heterogeneous?

3 Two-way FE reg. may be misleading if TE heterogeneous Main result: without assuming constant TE, β = g,t w g,t g,t, where w g,t: weights summing to 1, and g,t: TE in group g at period t. w g,t to proportion of units in (g, t), so β ATE. But even worse, often times, many weights w g,t are < 0. In 2 articles estimating two-way FE, we find that 50% of weights are < 0. Then, β could be < 0 even if all the g,t are > 0. Estimating weights = diagnostic of β s robustness to heterogeneous TE. Stata progs. estimating weights available from my website.

4 We propose an alternative to two-way FE reg. We propose an alternative estimand, Wald-TC estimand. Does not rely on constant TE, but on conditional CT assumption. Like the standard CT assumption, conditional CT can be partly tested by looking at pre-trends. Identifies an easy to interpret Local Average Treatment Effect (LATE). In applications, we find that our estimator and the two-way FE estimators are often very different, and sometimes even have opposite signs. Our Wald-TC estimator is computed by fuzzydid Stata package, available from SSC repository.

5 Literature de Chaisemartin and D Haultfœuille (2018). Imai et al. (2018). Several papers study special case with staggered adoption: Borusyak and Jaravel (2017), Abraham and Sun (2018), Athey and Imbens (2018), Goodman-Bacon (2018), Callaway and Sant Anna (2018).

6 Outline 1 Set-up 2 Intuition: 2 groups & 2 periods 3 Main results 4 Alternative estimand 5 Applications 6 Conclusion

7 Set-up and notations Binary treatment D (for simplicity, all results extend to non-binary D). Potential outcomes (Y (0), Y (1)) and Y = Y (D). t + 1 time periods T {0,..., t}, and g + 1 groups G {0,..., g}. Shortcut: for any var. R, E(R.,t) = E(R T = t), E(R g,t) = E(R G = g, T = t).

8 The FE regression We are going to study two commonly used two-way FE regs. FE regression: OLS of E(Y G, T ) on group and time FE and E(D G, T ). β fe = coeff. of E(D G, T ). Example, Enikolopov, Petrova, and Zhuravskaya (2011): Study effect of NTV, independent TV channel introduced in 1996 in Russia, on the share of votes for opposition parties. Signal quality heterogeneous: in some regions many people get access to NTV post 1996, in other regions few people. E(Y G, T ): % of votes for opposition in region G & year T. E(D G, T ): % of pop. having access to NTV in region G & year T.

9 The FD regression FD regression: OLS of E(Y G, T ) E(Y G, T 1) on time FE and E(D G, T ) E(D G, T 1). β fd = coeff. of E(D G, T ) E(D G, T 1). Example, Gentzkow, Shapiro, and Sinkinson (2011): Study whether newspapers affect electoral participation in the USA between 1868 and Regress FD of turnout in county g between presidential elections t and t 1 on FD of the number of newspapers and state-year FE.

10 Outline 1 Set-up 2 Intuition: 2 groups & 2 periods 3 Main results 4 Alternative estimand 5 Applications 6 Conclusion

11 To gain intuition, let s start with 2 groups & 2 periods Assume that 2 groups and periods. Then, β fe = β fd = E(Y1,1) E(Y1,0) E(Y0,1) + E(Y0,0) E(D 1,1) E(D 1,0) E(D 0,1) + E(D 0,0). Rhs of previous display: W DID, Wald-DID estimand studied in C&D (2018).

12 The common trends assumption A1 (Common trends) E(Y (0) G, T = 1) E(Y (0) G, T = 0) does not depend on G. Usual CT condition: mean Y (0) follows the same trend in all groups.

13 No negative weights in sharp DID Assume that sharp design (D = G T ). Then, if CT holds, well-known result: E(Y 1,1) E(Y 1,0) E(Y 0,1) + E(Y 0,0) = TR 1,1, (1) where TR g,t = E(Y (1) Y (0) D = 1, G = g, T = t): ATT in (g, t). CT: if nobody treated, mean of Y follows same trend in 2 groups. Sharp design: 1 departure from no treated scenario: G = 1, T = 1 treated. Diverging trends between 2 groups must come from TE in G = 1, T = 1.

14 Negative weights in fuzzy DID Now fuzzy design: there can be treated units in both groups and periods: E(Y 1,1) E(Y 1,0) E(Y 0,1) + E(Y 0,0) =E(D 1,1) TR 1,1 E(D 1,0) TR 1,0 E(D 0,1) TR 0,1 + E(D 0,0) TR 0,0. (2) Potentially, four departures from scenario where no one treated. Discrepancy between trends of 2 groups can come from TE in any (g, t). Let DID D = E(D 1,1) E(D 1,0) E(D 0,1) + E(D 0,0), then β fe = E(D 1,1) TR 1,1 DID E(D 1,0) TR 1,0 D DID E(D 0,1) TR 0,1 D DID + E(D 0,0) TR 0,0 D DID. (3) D β fe identifies weighted sum of 4 ATTs, where 2 ATTs get < 0 weight.

15 Couldn t we impose more assumptions? Does β fe identify something more meaningful under supplementary, plausible assumptions?

16 The treatment monotonicity assumption D(0) and D(1): unit s treatment at period 0 and 1. A2 (Treatment monotonicity) P(D(1) D(0) G) = 1 or P(D(1) D(0) G) = 1. Within a group, there can t be units switching from non treatment to treatment and units making the opposite switch from T = 0 to T = 1. Automatically holds when treatment at group period level. With individual-level treatment, similar to monotonicity in Imbens and Angrist (1994). S = {D(0) D(1), T = 1}: units switching treatment from T = 0 to 1.

17 The stable treatment effect assumption A3 (Stable treatment effect) For all g, E(Y (1) Y (0) D(0) = 1, G = g, T = 1) =E(Y (1) Y (0) D(0) = 1, G = g, T = 0). In each group, ATT of treated at T = 0 does not change from T = 0 to T = 1. Restriction on TE heterogeneity over time. May or may not be plausible.

18 We may have no negative weights under CT+TM+STE Remember, under CT we have: β fe = E(D 1,1) TR 1,1 DID E(D 1,0) TR 1,0 D DID E(D 0,1) TR 0,1 D DID + E(D 0,0) TR 0,0 D DID. (4) D Under CT, TM, and STE: β fe = E(D1,1) E(D1,0) S E(D0,1) E(D0,0) 1,1 S 0,1. (5) DID D DID D where S g,t = E(Y (1) Y (0) S, G = g, T = t): switch. LATE in (g, t). Intuition, TE of units treated at T = 0 cancels in E(D 1,1) TR 1,1 E(D 1,0) TR 1,0, under STE. β fe identifies a weighted sum of 2 LATEs: If % treated in both groups, 1 LATE gets < 0 weight. if % treated in group 1 & in group 0, 0 LATE gets < 0 weight.

19 Outline 1 Set-up 2 Intuition: 2 groups & 2 periods 3 Main results 4 Alternative estimand 5 Applications 6 Conclusion

20 Results with 2 G & 2 T generalize to many G & many T Theorem 1 1 If A1 holds, β fe = E [ w fe,g,t TR G,T D = 1 ]. 2 If A1-A3 hold, β fe = E [ ω fe,g,t S G,T S ]. Under CT, FE = weighted sums of ATTs. β fe = g,t P(G = g, T = t D = 1)w fe,g,t TR g,t Under CT+TM+STE, FE=weighted sums of LATEs. β fe = g,t P(G = g, T = t S)ω fe,g,t S g,t Similar results hold for FD estimand, with different weights.

21 The weights attached to FE and FD Let ε fe,g,t =residual of units in group g and at period t in reg. of E(D G, T ) on group & time FE. Weights attached to FE under CT: w fe,g,t = ε fe,g,t E(ε fe,g,t D=1). (g, t) receiving < 0 weight = those with a < 0 residual. w fe,g,t can be estimated. Similarly, weights attached to FD under CT are a function of ε fd,g,t, residual of units in (g, t) in reg. of E(D G, T ) E(D G, T 1) on time FE. Can also be estimated. Weights attached to FE (resp. FD) under CT+TM+STE are also a function of ε fe,g,t (resp. ε fd,g,t ).

22 When are all the weights attached to FE positive? 1 Under CT: Weights attached to FE all 0 only in very special cases (e.g.: sharp DID, 2 groups & periods). Even in staggered adoption designs, one of the many designs where two-way FE regressions have been used, and where groups go from fully untreated to fully treated at heterogeneous dates, some weights are < 0, while this design is very close to a sharp DID. 2 Under CT+TM+STE: Weights all 0 in staggered adoption design. in this design, FE relies on STE, but not on homogeneous treatment effect across groups. Outside of this special case, very likely that some weights < 0. When are all weights attached to FD positive?

23 A measure of robustness of FE and FD to heterogenous TE Let σ TR = V ( TR G,T D = 1 ) 1/2. Std. dev. of ATTs across (g, t): amount of TE heterogeneity. Corollary 1 If A1 holds, lowest value of σ TR such that β fe and TR can have signs is σ TR β fe fe = V (w fe,g,t D = 1). 1/2 The larger σ TR fe, the more robust to heterogeneous TE FE is. Similar lower bound on TE heterogeneity to have sign reversal can be obtained for FD under CT, and for FE and FD under CT+TM+STE.

24 Outline 1 Set-up 2 Intuition: 2 groups & 2 periods 3 Main results 4 Alternative estimand 5 Applications 6 Conclusion

25 The two assumptions underlying our estimand A1 For d {0, 1}, E(Y g,t(d) D(t 1) = d) E(Y g,t 1(d) D(t 1) = d) does not depend on g. Among units untreated in t 1, mean of Y (0) follows same evolution from t 1 to t in all groups. Among units treated in t 1, mean of Y (1) follows same evolution from t 1 to t in all groups. Common trends conditional on treatment in t 1. A6 For all t 1, there exists g such that E(D g,t) = E(D g,t 1). For each t, there are groups whose exposure to treatment does not change between t 1 and t. Often satisfied: Gentzkow et al., staggered adoption design...

26 S identified without any assumption on TE heterogeneity Theorem 2 Suppose that A1, A2, and A6 hold. Then W TC = S. W TC estimand defined in paper, but intuition on what it does given next slide.

27 What does W TC do? Assume all units in same (g, t) have same D (e.g. Gentzkow et al.). Then W TC is weighted average of: 1 DIDs comparing the groups that went from non treatment in t 1 to treatment in t to groups untreated at both dates. Under A1, identifies TE in period t in the switching-in groups. 2 DIDs comparing the groups treated at both dates to those going from treatment in t 1 to non treatment in t. Under A1, identifies TE in period t in the switching-out groups. When treatment not constant in (g, t), what W TC does is a bit more complicated, but idea of comparing trends of units with same treatment in t 1 remains.

28 Outline 1 Set-up 2 Intuition: 2 groups & 2 periods 3 Main results 4 Alternative estimand 5 Applications 6 Conclusion

29 Enikolopov et al. (2011) Regress share of vote for opposition in region g and year t on region FE, year FE, and share of people having access to NTV in region g and year t. β fe = 6.65 (s.e.=1.40). We estimate weights attached to FE under CT: 918 > 0, 1,020 < 0. under CT, FE= weight. sum of 1,938 ATTs, 1,020 ATTs get < 0 weight. Negative weights sum to (s.e.=0.06). = 0.91 (CI=[0.53, 1.28]): β fe and ATT could be of opposite signs even if s.e. of effect of NTV across subregions is one percentage point. σ TR fe 2 periods (1995 & 1999), no one treated in 1995, so switchers = treated, and weights under CT+TM+STE same as under CT. In every region, % of pop having access to NTV > 0 in 1999, so we cannot compute Wald-TC.

30 Gentzkow et al. (2011): two-way FE estimators Did newspapers affect electoral turnout in the US between 1868 and 1928? Reg change in turnout from presidential election t 1 to t in county g on change in number of newspapers and state-year FE. Alternatively, one could estimate FE regression. β fd = (0.0009), β fe = (0.0011), t-stat of diff: About half of the weights attached to FD and FE under CT < 0. All weights attached to FD under CT+TM+STE > 0, while 25% of weights attached to FE still < 0. FD estimates weighted average of TEs under CT+TM+STE.

31 Gentzkow et al. (2011): estimating Wald-TC Still, FD relies on STE, not warranted (e.g. radio developed over period). A6 holds here: at each t, there are counties whose number of newspapers does not change between t 1 and t. Ŵ TC = % than and significantly from β fd, opposite sign than β fe Does not rely on any homogeneous TE assumption, just conditional CT. Test of pre-trends indicates conditional CT plausible.

32 Outline 1 Set-up 2 Intuition: 2 groups & 2 periods 3 Main results 4 Alternative estimand 5 Applications 6 Conclusion

33 Take-away Under CT, FE and FD regs identify weighted sums of ATTs. Weights can be estimated. Most often, many are < 0. Under CT+TM+STE, FE and FD regs often still identify uninterpretable weighted sums of TEs. Exception: staggered adoption design. We propose alternative estimator, that does not rely on TE homogeneity, and identifies an easily interpretable parameter, the LATE of all switchers. Can be used when for each pair of periods there are groups whose treatment does not change. Our applications show that using FE, FD, or W TC can make large.

34 Thank you!

35 When are all the weights attached to FD positive? 1 Under CT, weights attached to FD all 0 only in very special cases. 2 Under CT+TM+STE, weights attached to FD all 0 iif: a) in groups where % treated from t 1 to t, is E(D.,t) E(D.,t 1). b) in groups where % treated from t 1 to t, is E(D.,t) E(D.,t 1). Holds if E(D.,t) E(D.,t 1) = 0 or close to 0. E.g.: Gentzkow et al. Also holds in staggered adoption designs. Fails if e.g., % treated in all groups from t 1 to t, for some t. Back

36 The random weights assumptions Under CT, FE and FD often identify uninterpretable weighted sums of ATTs with many < 0 weights. Adding TM+STE does not solve the pblm. Let s add more assumptions: A4 cov ( TR G,T, w fe,g,t D = 1 ) = 0. A4: Weights attached to FE under CT uncorrelated with ATTs. Corollary 2 If A1 and A4 hold, β fe = TR. FD also identifies TR under CT and similar random weights assumption. Similar result can be obtained under CT+TM+STE.

37 Random weights assumptions often implausible A4 equivalent to cov ( ε fe,g,t, TR G,T D = 1 ) = 0. fails if groups where a large proportion of units adopt the treatment (ε fe,g,t > 0) are those with the largest treatment effect ( TR G,T > TR ). A4 incompatible with Roy model of selection into treatment.

38 The random weights assumptions are partly testable Under CT, if A4 and corresponding assumption for FD hold, β fe = β fd. Sometimes, the 2 are significantly different (e.g. Gentzkow et al.) the two random weights assumptions cannot jointly hold.

39 Extensions We extend our results to non-binary treatments: the weights remain unchanged. We extend our results to FD and FE regressions with controls X : the weights remain unchanged. Our results are widely applicable.

40 The weights attached to FE and FD under CT+TM+STE Let s g,t = sgn (E(D g,t) E(D g,t 1)). s g,t = 1, 0, and -1 in groups where % treated, =, from t 1 to t. Weights attached to FD under CT+TM+STE: ω fd,g,t = ω fd,g,t can be estimated. sg,t ε fd,g,t E[s G,T ε fd,g,t S]. Similarly, weights attached to FE under CT+TM+STE are a function of ε fe,g,t and s g,t. Can also be estimated.

Two-way fixed effects estimators with heterogeneous treatment effects

Two-way fixed effects estimators with heterogeneous treatment effects Two-way fixed effects estimators with heterogeneous treatment effects Clément de Chaisemartin Xavier D Haultfœuille April 26, 2018 Abstract Around 20% of all empirical articles published by the American

More information

Two-way fixed effects estimators with heterogeneous treatment effects

Two-way fixed effects estimators with heterogeneous treatment effects Two-way fixed effects estimators with heterogeneous treatment effects arxiv:1803.08807v4 [econ.em] 2 Aug 2018 Clément de Chaisemartin Xavier D Haultfœuille August 3, 2018 Abstract Linear regressions with

More information

Fuzzy Differences-in-Differences

Fuzzy Differences-in-Differences Review of Economic Studies (2017) 01, 1 30 0034-6527/17/00000001$02.00 c 2017 The Review of Economic Studies Limited Fuzzy Differences-in-Differences C. DE CHAISEMARTIN University of California at Santa

More information

Supplement to Fuzzy Differences-in-Differences

Supplement to Fuzzy Differences-in-Differences Supplement to Fuzzy Differences-in-Differences Clément de Chaisemartin Xavier D Haultfœuille June 30, 2017 Abstract This paper gathers the supplementary material to de Chaisemartin and D Haultfœuille (2017).

More information

Difference-in-Differences with Multiple Time Periods

Difference-in-Differences with Multiple Time Periods Difference-in-Differences with Multiple Time Periods Brantly Callaway Pedro H. C. Sant Anna March 1, 2019 Abstract In this article, we consider identification, estimation, and inference procedures for

More information

Supplementary material to: Tolerating deance? Local average treatment eects without monotonicity.

Supplementary material to: Tolerating deance? Local average treatment eects without monotonicity. Supplementary material to: Tolerating deance? Local average treatment eects without monotonicity. Clément de Chaisemartin September 1, 2016 Abstract This paper gathers the supplementary material to de

More information

Gov 2002: 9. Differences in Differences

Gov 2002: 9. Differences in Differences Gov 2002: 9. Differences in Differences Matthew Blackwell October 30, 2015 1 / 40 1. Basic differences-in-differences model 2. Conditional DID 3. Standard error issues 4. Other DID approaches 2 / 40 Where

More information

Estimating Dynamic Treatment Effects in Event Studies with Heterogeneous Treatment Effects

Estimating Dynamic Treatment Effects in Event Studies with Heterogeneous Treatment Effects Estimating Dynamic Treatment Effects in Event Studies with Heterogeneous Treatment Effects Sarah Abraham Department of Economics, MIT and Liyang Sun Department of Economics, MIT (lsun20@mitedu) September

More information

Difference-in-Differences with Multiple Time Periods and. an Application on the Minimum Wage and Employment

Difference-in-Differences with Multiple Time Periods and. an Application on the Minimum Wage and Employment Difference-in-Differences with Multiple Time Periods and an Application on the Minimum Wage and Employment arxiv:1803.09015v2 [econ.em] 31 Aug 2018 Brantly Callaway Pedro H. C. Sant Anna September 5, 2018

More information

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed

More information

Identifying the effect of persuasion

Identifying the effect of persuasion Identifying the effect of persuasion Sung Jae Jun Sokbae Lee The Institute for Fiscal Studies Department of Economics, UCL cemmap working paper CWP19/18 IDENTIFYING THE EFFECT OF PERSUASION SUNG JAE JUN

More information

Noncompliance in Randomized Experiments

Noncompliance in Randomized Experiments Noncompliance in Randomized Experiments Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Noncompliance in Experiments Stat186/Gov2002 Fall 2018 1 / 15 Encouragement

More information

Recitation Notes 6. Konrad Menzel. October 22, 2006

Recitation Notes 6. Konrad Menzel. October 22, 2006 Recitation Notes 6 Konrad Menzel October, 006 Random Coefficient Models. Motivation In the empirical literature on education and earnings, the main object of interest is the human capital earnings function

More information

PSC 504: Instrumental Variables

PSC 504: Instrumental Variables PSC 504: Instrumental Variables Matthew Blackwell 3/28/2013 Instrumental Variables and Structural Equation Modeling Setup e basic idea behind instrumental variables is that we have a treatment with unmeasured

More information

Lecture 8. Roy Model, IV with essential heterogeneity, MTE

Lecture 8. Roy Model, IV with essential heterogeneity, MTE Lecture 8. Roy Model, IV with essential heterogeneity, MTE Economics 2123 George Washington University Instructor: Prof. Ben Williams Heterogeneity When we talk about heterogeneity, usually we mean heterogeneity

More information

Difference-in-Differences with Multiple Time Periods and an Application on the Minimum Wage and Employment

Difference-in-Differences with Multiple Time Periods and an Application on the Minimum Wage and Employment Difference-in-Differences with Multiple Time Periods and an Application on the Minimum Wage and Employment Brantly Callaway Department of Economics Temple University Pedro H. C. Sant Anna Department of

More information

150C Causal Inference

150C Causal Inference 150C Causal Inference Instrumental Variables: Modern Perspective with Heterogeneous Treatment Effects Jonathan Mummolo May 22, 2017 Jonathan Mummolo 150C Causal Inference May 22, 2017 1 / 26 Two Views

More information

Econometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous

Econometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous Econometrics of causal inference Throughout, we consider the simplest case of a linear outcome equation, and homogeneous effects: y = βx + ɛ (1) where y is some outcome, x is an explanatory variable, and

More information

Regression Discontinuity

Regression Discontinuity Regression Discontinuity Christopher Taber Department of Economics University of Wisconsin-Madison October 16, 2018 I will describe the basic ideas of RD, but ignore many of the details Good references

More information

Regression Discontinuity

Regression Discontinuity Regression Discontinuity Christopher Taber Department of Economics University of Wisconsin-Madison October 24, 2017 I will describe the basic ideas of RD, but ignore many of the details Good references

More information

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case Maximilian Kasy Department of Economics, Harvard University 1 / 40 Agenda instrumental variables part I Origins of instrumental

More information

Econometrics for PhDs

Econometrics for PhDs Econometrics for PhDs Amine Ouazad April 2012, Final Assessment - Answer Key 1 Questions with a require some Stata in the answer. Other questions do not. 1 Ordinary Least Squares: Equality of Estimates

More information

Causality and Experiments

Causality and Experiments Causality and Experiments Michael R. Roberts Department of Finance The Wharton School University of Pennsylvania April 13, 2009 Michael R. Roberts Causality and Experiments 1/15 Motivation Introduction

More information

Difference-in-Differences Methods

Difference-in-Differences Methods Difference-in-Differences Methods Teppei Yamamoto Keio University Introduction to Causal Inference Spring 2016 1 Introduction: A Motivating Example 2 Identification 3 Estimation and Inference 4 Diagnostics

More information

Exam ECON5106/9106 Fall 2018

Exam ECON5106/9106 Fall 2018 Exam ECO506/906 Fall 208. Suppose you observe (y i,x i ) for i,2,, and you assume f (y i x i ;α,β) γ i exp( γ i y i ) where γ i exp(α + βx i ). ote that in this case, the conditional mean of E(y i X x

More information

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. The Basic Methodology 2. How Should We View Uncertainty in DD Settings?

More information

Selection on Observables

Selection on Observables Selection on Observables Hasin Yousaf (UC3M) 9th November Hasin Yousaf (UC3M) Selection on Observables 9th November 1 / 22 Summary Altonji, Elder and Taber, JPE, 2005 Bellows and Miguel, JPubE, 2009 Oster,

More information

IsoLATEing: Identifying Heterogeneous Effects of Multiple Treatments

IsoLATEing: Identifying Heterogeneous Effects of Multiple Treatments IsoLATEing: Identifying Heterogeneous Effects of Multiple Treatments Peter Hull December 2014 PRELIMINARY: Please do not cite or distribute without permission. Please see www.mit.edu/~hull/research.html

More information

Introduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015

Introduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015 Introduction to causal identification Nidhiya Menon IGC Summer School, New Delhi, July 2015 Outline 1. Micro-empirical methods 2. Rubin causal model 3. More on Instrumental Variables (IV) Estimating causal

More information

Principles Underlying Evaluation Estimators

Principles Underlying Evaluation Estimators The Principles Underlying Evaluation Estimators James J. University of Chicago Econ 350, Winter 2019 The Basic Principles Underlying the Identification of the Main Econometric Evaluation Estimators Two

More information

Instrumental Variables

Instrumental Variables Instrumental Variables Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Noncompliance in Experiments Stat186/Gov2002 Fall 2018 1 / 18 Instrumental Variables

More information

An Alternative Assumption to Identify LATE in Regression Discontinuity Designs

An Alternative Assumption to Identify LATE in Regression Discontinuity Designs An Alternative Assumption to Identify LATE in Regression Discontinuity Designs Yingying Dong University of California Irvine September 2014 Abstract One key assumption Imbens and Angrist (1994) use to

More information

Applied Microeconometrics Chapter 8 Regression Discontinuity (RD)

Applied Microeconometrics Chapter 8 Regression Discontinuity (RD) 1 / 26 Applied Microeconometrics Chapter 8 Regression Discontinuity (RD) Romuald Méango and Michele Battisti LMU, SoSe 2016 Overview What is it about? What are its assumptions? What are the main applications?

More information

An Alternative Assumption to Identify LATE in Regression Discontinuity Design

An Alternative Assumption to Identify LATE in Regression Discontinuity Design An Alternative Assumption to Identify LATE in Regression Discontinuity Design Yingying Dong University of California Irvine May 2014 Abstract One key assumption Imbens and Angrist (1994) use to identify

More information

Pre-test with Caution: Event-study Estimates After Testing for Parallel Trends

Pre-test with Caution: Event-study Estimates After Testing for Parallel Trends Pre-test with Caution: Event-study Estimates After Testing for Parallel Trends Jonathan Roth November 29, 2018 Abstract Researchers often test for pre-trends when using a research design that relies on

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)

More information

Exploring Marginal Treatment Effects

Exploring Marginal Treatment Effects Exploring Marginal Treatment Effects Flexible estimation using Stata Martin Eckhoff Andresen Statistics Norway Oslo, September 12th 2018 Martin Andresen (SSB) Exploring MTEs Oslo, 2018 1 / 25 Introduction

More information

Lecture 11 Roy model, MTE, PRTE

Lecture 11 Roy model, MTE, PRTE Lecture 11 Roy model, MTE, PRTE Economics 2123 George Washington University Instructor: Prof. Ben Williams Roy Model Motivation The standard textbook example of simultaneity is a supply and demand system

More information

ted: a Stata Command for Testing Stability of Regression Discontinuity Models

ted: a Stata Command for Testing Stability of Regression Discontinuity Models ted: a Stata Command for Testing Stability of Regression Discontinuity Models Giovanni Cerulli IRCrES, Research Institute on Sustainable Economic Growth National Research Council of Italy 2016 Stata Conference

More information

Applied Econometrics (QEM)

Applied Econometrics (QEM) Applied Econometrics (QEM) The Simple Linear Regression Model based on Prinicples of Econometrics Jakub Mućk Department of Quantitative Economics Jakub Mućk Applied Econometrics (QEM) Meeting #2 The Simple

More information

Recitation Notes 5. Konrad Menzel. October 13, 2006

Recitation Notes 5. Konrad Menzel. October 13, 2006 ecitation otes 5 Konrad Menzel October 13, 2006 1 Instrumental Variables (continued) 11 Omitted Variables and the Wald Estimator Consider a Wald estimator for the Angrist (1991) approach to estimating

More information

AGEC 661 Note Fourteen

AGEC 661 Note Fourteen AGEC 661 Note Fourteen Ximing Wu 1 Selection bias 1.1 Heckman s two-step model Consider the model in Heckman (1979) Y i = X iβ + ε i, D i = I {Z iγ + η i > 0}. For a random sample from the population,

More information

Statistical Models for Causal Analysis

Statistical Models for Causal Analysis Statistical Models for Causal Analysis Teppei Yamamoto Keio University Introduction to Causal Inference Spring 2016 Three Modes of Statistical Inference 1. Descriptive Inference: summarizing and exploring

More information

Flexible Estimation of Treatment Effect Parameters

Flexible Estimation of Treatment Effect Parameters Flexible Estimation of Treatment Effect Parameters Thomas MaCurdy a and Xiaohong Chen b and Han Hong c Introduction Many empirical studies of program evaluations are complicated by the presence of both

More information

Supplemental Appendix to "Alternative Assumptions to Identify LATE in Fuzzy Regression Discontinuity Designs"

Supplemental Appendix to Alternative Assumptions to Identify LATE in Fuzzy Regression Discontinuity Designs Supplemental Appendix to "Alternative Assumptions to Identify LATE in Fuzzy Regression Discontinuity Designs" Yingying Dong University of California Irvine February 2018 Abstract This document provides

More information

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes Kosuke Imai Department of Politics Princeton University July 31 2007 Kosuke Imai (Princeton University) Nonignorable

More information

Introduction to Econometrics

Introduction to Econometrics Introduction to Econometrics STAT-S-301 Experiments and Quasi-Experiments (2016/2017) Lecturer: Yves Dominicy Teaching Assistant: Elise Petit 1 Why study experiments? Ideal randomized controlled experiments

More information

Controlling for Time Invariant Heterogeneity

Controlling for Time Invariant Heterogeneity Controlling for Time Invariant Heterogeneity Yona Rubinstein July 2016 Yona Rubinstein (LSE) Controlling for Time Invariant Heterogeneity 07/16 1 / 19 Observables and Unobservables Confounding Factors

More information

PSC 504: Differences-in-differeces estimators

PSC 504: Differences-in-differeces estimators PSC 504: Differences-in-differeces estimators Matthew Blackwell 3/22/2013 Basic differences-in-differences model Setup e basic idea behind a differences-in-differences model (shorthand: diff-in-diff, DID,

More information

Lecture 4: Linear panel models

Lecture 4: Linear panel models Lecture 4: Linear panel models Luc Behaghel PSE February 2009 Luc Behaghel (PSE) Lecture 4 February 2009 1 / 47 Introduction Panel = repeated observations of the same individuals (e.g., rms, workers, countries)

More information

Experiments and Quasi-Experiments

Experiments and Quasi-Experiments Experiments and Quasi-Experiments (SW Chapter 13) Outline 1. Potential Outcomes, Causal Effects, and Idealized Experiments 2. Threats to Validity of Experiments 3. Application: The Tennessee STAR Experiment

More information

Multiple Regression Analysis

Multiple Regression Analysis Multiple Regression Analysis y = β 0 + β 1 x 1 + β 2 x 2 +... β k x k + u 2. Inference 0 Assumptions of the Classical Linear Model (CLM)! So far, we know: 1. The mean and variance of the OLS estimators

More information

Difference-in-Differences Estimation

Difference-in-Differences Estimation Difference-in-Differences Estimation Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. The Basic Methodology 2. How Should We

More information

IV Estimation WS 2014/15 SS Alexander Spermann. IV Estimation

IV Estimation WS 2014/15 SS Alexander Spermann. IV Estimation SS 2010 WS 2014/15 Alexander Spermann Evaluation With Non-Experimental Approaches Selection on Unobservables Natural Experiment (exogenous variation in a variable) DiD Example: Card/Krueger (1994) Minimum

More information

The problem of causality in microeconometrics.

The problem of causality in microeconometrics. The problem of causality in microeconometrics. Andrea Ichino University of Bologna and Cepr June 11, 2007 Contents 1 The Problem of Causality 1 1.1 A formal framework to think about causality....................................

More information

Regression Discontinuity Designs in Stata

Regression Discontinuity Designs in Stata Regression Discontinuity Designs in Stata Matias D. Cattaneo University of Michigan July 30, 2015 Overview Main goal: learn about treatment effect of policy or intervention. If treatment randomization

More information

Empirical Methods in Applied Economics Lecture Notes

Empirical Methods in Applied Economics Lecture Notes Empirical Methods in Applied Economics Lecture Notes Jörn-Ste en Pischke LSE October 2005 1 Regression Discontinuity Design 1.1 Basics and the Sharp Design The basic idea of the regression discontinuity

More information

Regression Discontinuity

Regression Discontinuity Regression Discontinuity Christopher Taber Department of Economics University of Wisconsin-Madison October 9, 2016 I will describe the basic ideas of RD, but ignore many of the details Good references

More information

The Economics of European Regions: Theory, Empirics, and Policy

The Economics of European Regions: Theory, Empirics, and Policy The Economics of European Regions: Theory, Empirics, and Policy Dipartimento di Economia e Management Davide Fiaschi Angela Parenti 1 1 davide.fiaschi@unipi.it, and aparenti@ec.unipi.it. Fiaschi-Parenti

More information

Identification with Latent Choice Sets: The Case of the Head Start Impact Study

Identification with Latent Choice Sets: The Case of the Head Start Impact Study Identification with Latent Choice Sets: The Case of the Head Start Impact Study Vishal Kamat University of Chicago 22 September 2018 Quantity of interest Common goal is to analyze ATE of program participation:

More information

Notes on causal effects

Notes on causal effects Notes on causal effects Johan A. Elkink March 4, 2013 1 Decomposing bias terms Deriving Eq. 2.12 in Morgan and Winship (2007: 46): { Y1i if T Potential outcome = i = 1 Y 0i if T i = 0 Using shortcut E

More information

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43 Panel Data March 2, 212 () Applied Economoetrics: Topic March 2, 212 1 / 43 Overview Many economic applications involve panel data. Panel data has both cross-sectional and time series aspects. Regression

More information

Two-Variable Regression Model: The Problem of Estimation

Two-Variable Regression Model: The Problem of Estimation Two-Variable Regression Model: The Problem of Estimation Introducing the Ordinary Least Squares Estimator Jamie Monogan University of Georgia Intermediate Political Methodology Jamie Monogan (UGA) Two-Variable

More information

The Distribution of Treatment Effects in Experimental Settings: Applications of Nonlinear Difference-in-Difference Approaches. Susan Athey - Stanford

The Distribution of Treatment Effects in Experimental Settings: Applications of Nonlinear Difference-in-Difference Approaches. Susan Athey - Stanford The Distribution of Treatment Effects in Experimental Settings: Applications of Nonlinear Difference-in-Difference Approaches Susan Athey - Stanford Guido W. Imbens - UC Berkeley 1 Introduction Based on:

More information

Quantitative Economics for the Evaluation of the European Policy

Quantitative Economics for the Evaluation of the European Policy Quantitative Economics for the Evaluation of the European Policy Dipartimento di Economia e Management Irene Brunetti Davide Fiaschi Angela Parenti 1 25th of September, 2017 1 ireneb@ec.unipi.it, davide.fiaschi@unipi.it,

More information

Methods to Estimate Causal Effects Theory and Applications. Prof. Dr. Sascha O. Becker U Stirling, Ifo, CESifo and IZA

Methods to Estimate Causal Effects Theory and Applications. Prof. Dr. Sascha O. Becker U Stirling, Ifo, CESifo and IZA Methods to Estimate Causal Effects Theory and Applications Prof. Dr. Sascha O. Becker U Stirling, Ifo, CESifo and IZA last update: 21 August 2009 Preliminaries Address Prof. Dr. Sascha O. Becker Stirling

More information

Recitation 7. and Kirkebøen, Leuven, and Mogstad (2014) Spring Peter Hull

Recitation 7. and Kirkebøen, Leuven, and Mogstad (2014) Spring Peter Hull 14.662 Recitation 7 The Roy Model, isolateing, and Kirkebøen, Leuven, and Mogstad (2014) Peter Hull Spring 2015 The Roy Model Motivation 1/14 Selection: an Applied Microeconomist s Best Friend Life (and

More information

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. The Sharp RD Design 3.

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple

More information

Unpacking the Black-Box: Learning about Causal Mechanisms from Experimental and Observational Studies

Unpacking the Black-Box: Learning about Causal Mechanisms from Experimental and Observational Studies Unpacking the Black-Box: Learning about Causal Mechanisms from Experimental and Observational Studies Kosuke Imai Princeton University Joint work with Keele (Ohio State), Tingley (Harvard), Yamamoto (Princeton)

More information

Sociology Exam 2 Answer Key March 30, 2012

Sociology Exam 2 Answer Key March 30, 2012 Sociology 63993 Exam 2 Answer Key March 30, 2012 I. True-False. (20 points) Indicate whether the following statements are true or false. If false, briefly explain why. 1. A researcher has constructed scales

More information

Empirical approaches in public economics

Empirical approaches in public economics Empirical approaches in public economics ECON4624 Empirical Public Economics Fall 2016 Gaute Torsvik Outline for today The canonical problem Basic concepts of causal inference Randomized experiments Non-experimental

More information

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Arthur Lewbel Boston College Original December 2016, revised July 2017 Abstract Lewbel (2012)

More information

What s New in Econometrics? Lecture 14 Quantile Methods

What s New in Econometrics? Lecture 14 Quantile Methods What s New in Econometrics? Lecture 14 Quantile Methods Jeff Wooldridge NBER Summer Institute, 2007 1. Reminders About Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile Regression

More information

Instrumental Variables

Instrumental Variables Instrumental Variables Yona Rubinstein July 2016 Yona Rubinstein (LSE) Instrumental Variables 07/16 1 / 31 The Limitation of Panel Data So far we learned how to account for selection on time invariant

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University

More information

The Generalized Roy Model and Treatment Effects

The Generalized Roy Model and Treatment Effects The Generalized Roy Model and Treatment Effects Christopher Taber University of Wisconsin November 10, 2016 Introduction From Imbens and Angrist we showed that if one runs IV, we get estimates of the Local

More information

General Linear Model (Chapter 4)

General Linear Model (Chapter 4) General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients

More information

Comparison of Three Approaches to Causal Mediation Analysis. Donna L. Coffman David P. MacKinnon Yeying Zhu Debashis Ghosh

Comparison of Three Approaches to Causal Mediation Analysis. Donna L. Coffman David P. MacKinnon Yeying Zhu Debashis Ghosh Comparison of Three Approaches to Causal Mediation Analysis Donna L. Coffman David P. MacKinnon Yeying Zhu Debashis Ghosh Introduction Mediation defined using the potential outcomes framework natural effects

More information

Estimation of pre and post treatment Average Treatment Effects (ATEs) with binary time-varying treatment using Stata

Estimation of pre and post treatment Average Treatment Effects (ATEs) with binary time-varying treatment using Stata Estimation of pre and post treatment Average Treatment Effects (ATEs) with binary time-varying treatment using Stata Giovanni Cerulli CNR IRCrES Marco Ventura ISTAT Methodological and Data Quality Division,

More information

Labor Economics with STATA. Introduction to Regression Diagnostics

Labor Economics with STATA. Introduction to Regression Diagnostics Labor Economics with STATA Liyousew G. Borga November 4, 2015 Introduction to Regression Diagnostics Liyou Borga Labor Economics with STATA November 4, 2015 64 / 85 Outline 1 Violations of Basic Assumptions

More information

Applied Quantitative Methods II

Applied Quantitative Methods II Applied Quantitative Methods II Lecture 10: Panel Data Klára Kaĺıšková Klára Kaĺıšková AQM II - Lecture 10 VŠE, SS 2016/17 1 / 38 Outline 1 Introduction 2 Pooled OLS 3 First differences 4 Fixed effects

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Simple Linear Regression Stat186/Gov2002 Fall 2018 1 / 18 Linear Regression and

More information

Problem set - Selection and Diff-in-Diff

Problem set - Selection and Diff-in-Diff Problem set - Selection and Diff-in-Diff 1. You want to model the wage equation for women You consider estimating the model: ln wage = α + β 1 educ + β 2 exper + β 3 exper 2 + ɛ (1) Read the data into

More information

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Ninth ARTNeT Capacity Building Workshop for Trade Research Trade Flows and Trade Policy Analysis Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric

More information

Fuzzy Changes-in-Changes

Fuzzy Changes-in-Changes Fuzzy Changes-in-Changes Clément de Chaisemartin Xavier D'Haultf uille June 23, 2014 Abstract The changes-in-changes model extends the widely used dierence-in-dierences to situations where outcomes may

More information

Econometrics. 8) Instrumental variables

Econometrics. 8) Instrumental variables 30C00200 Econometrics 8) Instrumental variables Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Thery of IV regression Overidentification Two-stage least squates

More information

The Econometric Evaluation of Policy Design: Part I: Heterogeneity in Program Impacts, Modeling Self-Selection, and Parameters of Interest

The Econometric Evaluation of Policy Design: Part I: Heterogeneity in Program Impacts, Modeling Self-Selection, and Parameters of Interest The Econometric Evaluation of Policy Design: Part I: Heterogeneity in Program Impacts, Modeling Self-Selection, and Parameters of Interest Edward Vytlacil, Yale University Renmin University, Department

More information

Rewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35

Rewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35 Rewrap ECON 4135 November 18, 2011 () Rewrap ECON 4135 November 18, 2011 1 / 35 What should you now know? 1 What is econometrics? 2 Fundamental regression analysis 1 Bivariate regression 2 Multivariate

More information

Research Methods in Political Science I

Research Methods in Political Science I Research Methods in Political Science I 6. Linear Regression (1) Yuki Yanai School of Law and Graduate School of Law November 11, 2015 1 / 25 Today s Menu 1 Introduction What Is Linear Regression? Some

More information

Supplementary Appendix: Difference-in-Differences with. Multiple Time Periods and an Application on the Minimum. Wage and Employment

Supplementary Appendix: Difference-in-Differences with. Multiple Time Periods and an Application on the Minimum. Wage and Employment Supplementary Appendix: Difference-in-Differences with Multiple Time Periods and an Application on the Minimum Wage and mployment Brantly Callaway Pedro H. C. Sant Anna August 3, 208 This supplementary

More information

Lecture 11/12. Roy Model, MTE, Structural Estimation

Lecture 11/12. Roy Model, MTE, Structural Estimation Lecture 11/12. Roy Model, MTE, Structural Estimation Economics 2123 George Washington University Instructor: Prof. Ben Williams Roy model The Roy model is a model of comparative advantage: Potential earnings

More information

Instrumental Variables: Then and Now

Instrumental Variables: Then and Now Instrumental Variables: Then and Now James Heckman University of Chicago and University College Dublin Econometric Policy Evaluation, Lecture II Koopmans Memorial Lectures Cowles Foundation Yale University

More information

Michael Lechner Causal Analysis RDD 2014 page 1. Lecture 7. The Regression Discontinuity Design. RDD fuzzy and sharp

Michael Lechner Causal Analysis RDD 2014 page 1. Lecture 7. The Regression Discontinuity Design. RDD fuzzy and sharp page 1 Lecture 7 The Regression Discontinuity Design fuzzy and sharp page 2 Regression Discontinuity Design () Introduction (1) The design is a quasi-experimental design with the defining characteristic

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Censoring and truncation b)

More information

150C Causal Inference

150C Causal Inference 150C Causal Inference Difference in Differences Jonathan Mummolo Stanford Selection on Unobservables Problem Often there are reasons to believe that treated and untreated units differ in unobservable characteristics

More information

Regression Discontinuity Designs.

Regression Discontinuity Designs. Regression Discontinuity Designs. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 31/10/2017 I. Brunetti Labour Economics in an European Perspective 31/10/2017 1 / 36 Introduction

More information

Clustering as a Design Problem

Clustering as a Design Problem Clustering as a Design Problem Alberto Abadie, Susan Athey, Guido Imbens, & Jeffrey Wooldridge Harvard-MIT Econometrics Seminar Cambridge, February 4, 2016 Adjusting standard errors for clustering is common

More information

Job Training Partnership Act (JTPA)

Job Training Partnership Act (JTPA) Causal inference Part I.b: randomized experiments, matching and regression (this lecture starts with other slides on randomized experiments) Frank Venmans Example of a randomized experiment: Job Training

More information

Chapter 1 Linear Regression with One Predictor

Chapter 1 Linear Regression with One Predictor STAT 525 FALL 2018 Chapter 1 Linear Regression with One Predictor Professor Min Zhang Goals of Regression Analysis Serve three purposes Describes an association between X and Y In some applications, the

More information