Advanced Quantitative Methods: Causal inference

Size: px
Start display at page:

Download "Advanced Quantitative Methods: Causal inference"

Transcription

1 Advanced Quantitative Methods: Johan A. Elkink University College Dublin 2 March 2017

2 1 2 3

3 1 2 3

4 Inference In regression analysis we look at the relationship between (a set of) independent variable(s) and a dependent variable. Statistical is concerned with the question how likely it is to observe this relationship given the null hypothesis of no relationship (frequentist) or how much we should update our beliefs concerning this relationship given our new evidence (Bayesian). A different question is whether or not we can deduce that the independent variable is a cause of the dependent one, that is, drawing a causal.

5 correlation causation

6 Many possible patterns

7 Many possible patterns

8 If healthier patients take a drug and sicker patients do not, we can find an association between drug and recovery even when the drug does not work. If sicker patients take a drug and healthier patients do not, we might not find an association between drug and recovery even when the drug works. association causation

9 Marijuana Imagine you have conducted a study on college students and have found an inverse correlation between marijuana smoking and grade point average that is, those who smoke tend to have lower GPAs than those who do not, and the more smoked, the lower the GPA.

10 Marijuana Possible conclusion: marijuana use must be affecting the mental abilities, and hence grades.

11 Marijuana But perhaps the frustration of poor grades leads to escapist behaviour, e.g., marijuana use.

12 Marijuana Or, a difficult emotional state, or a lot of stress, causes one to use marijuana, and also affects grades.

13 Marijuana Or, perhaps the group of people you studied just happened to have lower GPAs among the marijuana users, purely by chance.

14 Fundamental problem Imagine, there are two kinds of people, one group, T = 1, that has a college degree, and another group, T = 0, that does not. We want to measure where a college degree leads to a higher salary, Y. What we would like to know is the difference for any individual i whether they have a college degree or not: Y T i =1 i Y T i =0 i. However, for every individual i, we either observe Y T i =1 i, or we observe Y T i =0 i they either have the degree or they don t.

15 We wish... Respondent Degree Y T i =0 i effect 1 Yes Yes No No Yes Yes No Yes i Y T i =1

16 We wish... we have... Respondent Degree Y T i =0 i 1 Yes Yes No 90 4 No 87 5 Yes Yes No 92 8 Yes 109 i Y T i =1 effect

17 outcome = { Y1i if T i = 1 Y 0i if T i = 0 E.g., Y 1i is the salary of individual i had (s)he a college degree, irrespective of whether (s)he actually does. Y i = Y 0i + (Y 1i Y 0i )T i = Y 0i + δt i, where δ = Y 1i Y 0i is the causal effect. (Angrist and Pischke, 2009, 13 14)

18 Average treatment effect Because it is impossible to observe individual treatment effect, we usually turn to average treatment effect: E[δ] = E[Y 1i Y 0i ] = E[Y 1i ] E[Y 0i ], which we could naively estimate with ˆδ = E[Y 1i T i = 1] E[Y 0i T i = 0]. This assumes that E[Y 1i ] reflects the salary for people with a college degree, irrespective of whether they got one or not, and that E[Y 0i ] reflects the salary without a college degree, irrespective of whether they got one or not.

19 Counterfactual causality By making such assumptions by looking at the ATE we are making a counterfactual argument. We are making assumptions of what Y 1i would have been, had i had a college degree. This implies that we cannot measure a causal effect, only estimate it. To understand when the ATE assumptions are reasonable, we need to look at the effect of covariates other that relate to Y, which we will denote by X.

20 Treatment effect: abbreviations ATE Average Treatment Effect E[δ] = E[Y 1i Y 0i ] ATT ATE for the Treated E[Y 1i Y 0i T i = 1] ATC ATE for the Control (untreated) E[Y 1i Y 0i T i = 0] PATE Population ATE E[Y 1i Y 0i ] SATE Sample ATE E n [Y 1i Y 0i ] LATE Local ATE E[Y 1i Y 0i X i = x] CATE Conditional ATE E[Y 1i Y 0i X i = x] and analogously we have LATT, PATT, SATC, etceta. Note that ATE, ATT, ATC implicitly refer to population values. E n [ ] is the sample mean, i.e. E n [x] = x = 1 n n i=1, where the E n allows for the formulation of conditional and counterfactual means.

21 Bias in causal Using shorthand E 01 = E[Y 0i T i = 1], etc., and taking π as the population proportion that received the treatment, E[δ] = πe[δ T i = 1] + (1 π)e[δ T i = 0] = π(e 11 E 01 ) + (1 π)(e 10 E 00 )

22 Bias in causal Using shorthand E 01 = E[Y 0i T i = 1], etc., and taking π as the population proportion that received the treatment, E[δ] = πe[δ T i = 1] + (1 π)e[δ T i = 0] = π(e 11 E 01 ) + (1 π)(e 10 E 00 ) can be decomposed into (E 11 E 00 ) = E[δ] + (E 01 E 00 ) + (1 π){(e 11 E 01 ) (E 10 E 00 )}.

23 Bias in causal Using shorthand E 01 = E[Y 0i T i = 1], etc., and taking π as the population proportion that received the treatment, can be decomposed into E[δ] = πe[δ T i = 1] + (1 π)e[δ T i = 0] = π(e 11 E 01 ) + (1 π)(e 10 E 00 ) (E 11 E 00 ) = E[δ] + (E 01 E 00 ) + (1 π){(e 11 E 01 ) (E 10 E 00 )}. (E 11 E 00 ) observed difference in effect E[δ] average treatment effect (E 01 E 00 ) selection bias (1 π){(e 11 E 01 ) (E 10 E 00 )} differential treatment effect bias (Morgan and Winship, 2007)

24 SUTVA The stable unit treatment value assumption SUTVA is simply the a priori assumption that the value of Y for unit i when exposed to treatment t will be the same no matter what mechanism is used to assign treatment t to unit i and no matter what treatments the other units receive. (Rubin 1986, 961, as cited in Morgan and Winship (2007, 37))

25 Propensity score The propensity score is the conditional probability of receiving the treatment, given control in X: π(x) P(T = 1 X = x) = E[T X = x]. (Imbens, 2004, 6)

26 When studying effect of, say, T on Y, by examining the statistical association between the two, we need to ascertain that the observed effect is not caused by a third variable, say, X. We can say that T and Y are confounded when there is a third variable X that influences both T and Y ; such a variable is then called a confounder of T and Y. (Pearl, 2000, )

27 Another way of saying this is that if E(Y T, X ) E(Y T ) and E(T X ) E(T ), X is a confounder of the effect of T on Y. Note that confounding is a causal concept, not an associational one! X has to have a causal effect on T and X has to have a causal effect on Y for there to be an issue. (Pearl, 2000; Lee, 2005, 44)

28 1 2 3

29 X affects both T and Y = control (Lee, 2005, 43 48)

30 Do control This is the typical case of a confounding factor, and hence should be eliminated through controlling.

31 X affects both T and Y = control T affects Y, which in turn affects X = do not control (Lee, 2005, 43 48)

32 Don t control In this case, X is an effect of Y. By controlling for X, you can severily underestimate the effect of T on Y. Imagine that a college degree leads to a better income leads to a nicer car. Controlling for the price of the car in estimating the effect of having a college degree on income might cancel the effect.

33 X affects both T and Y = control T affects Y, which in turn affects X = do not control T affects X, which in turn affects Y = do not control... (Lee, 2005, 43 48)

34 Don t control To get the overall effect of T on Y, you want to include the effect through X. E.g. if you want to know the effect of changing the policy regarding smoking in pubs on the amount of smoking in general, you do not care through what mechanism this happened (through peer pressure, laziness, etc.), but only about the overall effect.

35 X affects both T and Y = control T affects Y, which in turn affects X = do not control T affects X, which in turn affects Y = do not control unless you explicitly want only the direct effect (Lee, 2005, 43 48)

36 Maybe control A scholarship for poorer students might help them to get a college degree, which in turn might help them to earn more money later in life. Having a scholarship on your CV, however, might also further your career, independent of the effect of having a college degree. To see the overall effect of the scholarship, don t control on having a college degree. To see the effect of having a scholarship, independent of the effect of getting a college degree, do control for college degree.

37 X affects both T and Y = control T affects Y, which in turn affects X = do not control T affects X, which in turn affects Y = do not control unless you explicitly want only the direct effect X affects Y, but not T, nor the effect of T on Y (Lee, 2005, 43 48)

38 Maybe control When X affects Y, but not T, there is no confounding issue and the estimates for the effect of T on Y should not be affected by inclusion of X. However, including X in the model can still help for efficiency. (Gelman and Hill, 2007, 177)

39 Kitchen sink A typical approach in the quantitative social sciences is to collect a number of different theories / hypotheses, add them all as to a regression, and see who wins. This is the kitchen sink approach (or garbage can approach). If anything, the above discussion should have made clear that to draw causal s, a clear distinction of treatment from covariates is crucial. In other words: focus your research! (Note that the garbage can phrase has also been used to argue against ignoring nonlinearities (Achen, 2005), as opposed to careless specification of the causal effect.)

40 Causal diagrams The preceding examples underline how it is important to always draw out the causal diagram and consider carefully how you select cases and select controls when making causal s.

41 X- and Y-centered research Another way of putting the issue is that the above is all about trying to study the effect of a cause (treatment), rather than the cause of an effect. The latter is perhaps ill-defined and runs into the infinite regress of causation (Gelman and Hill, 2007, 187). (See Gerring (2001, 2012) for an extensive discussion of Y -centered and X -centered research.)

42 1 2 3

43 The ideal experiment To avoid any effect of covariates the ideal is to randomly select participants for your research from the overal population (enables to the population) and to randomly assign the treatment to these participants (enables causal ).

44 Controls Randomized assignment Blocking Multiple regression (Imai, King and Stuart, 2008)

45 The ideal experiment To avoid any effect of covariates the ideal is to randomly select participants for your research from the overal population (enables to the population) and to randomly assign the treatment to these participants (enables causal ).

46 Randomized selection Random selection, or random sampling, enables s from the sample to the overall population. It does not affect whether someone is treated or not, so it does not affect the causal. It helps drawing s from sample average treatment effect (SATE) to population average treatment effect (PATE). (Imai, King and Stuart, 2008)

47 Randomized assignment When the treatment is entirely randomly assigned, any effect of covariates, observed or unobserved, on the treatment, is removed - the covariates are now ignorable. P(T X, ε) = P(T ) Hence, randomized assignment is useful for causal s, either to the overall population (random selection) or the sample (non-random selection).

48 Randomized assignment In this case, the estimate of the effect is simply the difference in means ȳ T i =1 i ȳ T i =0 i, or ˆβ in the simple regression Y i = α + βt i + ε i. Adding pre-treatment covariates can still increase the efficiency of this estimator, however.

49 Blocking Blocking is the random assignment of units to treatment and control groups within strata (blocks) that are defined by a set of observed pretreatment covariates. Blocking removes the effect of covariates for the covariates we use to block. So it does not assist in dealing with unobserved confounders. P(T X, ε) = P(T ε) (Imai, King and Stuart, 2008, 486)

50 Unconfoundedness The assumption of unconfoundedness states that, conditional on X, the treatment assignment is ignorable: (Y 0, Y 1 ) T X. Alternative names: ignorable treatment assignment, conditional independence assumption, selection on observables, exogeneity assumption. When only interested in ATT instead of ATE, unconfoundedness for controls suffices: Y 0 T X. (Imbens, 2004, 7 8)

51 Overlap The assumption of overlap states that we have both treated and untreated observations across the values of X : 0 < P(T = 1 X ) < 1. When only interested in ATT instead of ATE, weak overlap suffices: P(T = 1 X ) < 1. (Imbens, 2004, 7 8)

52 A causal effect can be estimated using regression y = Xβ + δt + ε, with ˆδ OLS the estimate of the causal effect, controlling for X.

53 Sampling strategy: match control cases to treatment cases strategy: nonparametric use of controls (Similar to blocking, but post-experiment.) (Imai, King and Stuart, 2008; Morgan and Harding, 2006, 5 6)

54 With matching, we match each observation where T i = 1 with a set of M corresponding observations where T j = 0. and vice versa for Ŷ 1i. { Yi if T i = 0 Ŷ 0i = j Y j if T i = 1 1 M Our simple matching estimator of ATE is then ˆδ ATE SM = 1 n i (Ŷ 1i Ŷ 0i ). (Imbens, 2004, 14)

55 An alternative formulation to the same effect is ˆδ ATE M = δ x P(X i = x) x ˆδ ATT M = δ x P(X i = x T i = 1), x where δ x = E n [y i t i = 1, x i = x] E n [y i t i = 0, x i = x]. Here ˆδ is a weighted mean of the means of substrata, weighted by the probability of being in that substratum. This is comparable to the previous, if each treated case is matched with all control cases in the same substratum. (Angrist and Pischke, 2009, 71 72)

56 : example Imagine we have a treatment variable T, a control variable X with two categories, and a dependent variable Y. We will require the estimated proportions (left) and mean of y (right) for each combination of T, X : ˆδ M ATE = T = 0 T = 1 T = 0 T = 1 X = X = δ x P(X i = x) =.44 (40 20) +.56 (120 80) = 31.2 ˆδ ATT M = δ x P(X i = x T i = 1) x = (40 20) + (120 80) = ˆδ OLS = 33.5 x

57 vs regression Note that the regression coefficient can also be obtained using: ˆδ OLS = x δ xp(t i = 1 X i = x)(1 P(T i = 1 X i = x))p(x i = x) x P(T i = 1 X i = x)(1 P(T i = 1 X i = x))p(x i = x) (40 20).44(1.44).44 + (120 80).56(1.56 = ) (1.44) (1.56 ).56 = 33.5

58 vs regression ˆδ OLS = x δ xp(t i = 1 X = x)(1 P(T i = 1 X = x))p(x = x) x P(T i = 1 X = x)(1 P(T i = 1 X = x))p(x = x) ˆδ ATT M = x δ xp(t i = 1 X = x)p(x = x) x P(T i = 1 X = x)p(x = x) ˆδ ATE M = δ x P(X = x), x so while ˆδ R is weighted by the variance of the treatment, conditional on X, ˆδ ATT M is weighted by the probability of treatment, conditional on X, and ˆδ ATE M can be seen as the unweighted estimator. (See handout for derivation and references.)

59 Distance metrics The term corresponding here is not specified and there are many options: Use Euclidean distances in X ;... weighted by the (diagonal of the) inverse of their covariances;... weighted by the coefficient on estimating the propensity score;... weighted by the coefficients in a linear regression explaining Y ; etc. (Imbens, 2004, 14 15)

60 Using propensity scores As alternative to using distances in X, one can use the propensity score: By weighting the sample on the reciprocal of π(x); Divide the sample in subsamples with approximately the same π(x) ( blocking ); By adding π(x) as a covariate in a regression model; By matching cases based on closeness in π(x). π(x) typically needs to be estimated, e.g. using a logit model. Propensity scores help with many-valued control and they address the issue of lack of overlap. (Imbens 2004, 16; Morgan and Harding 2006, 22 28

61 Exact matching Here all cases where T i = 1 are matched with M cases where T j = 0 for which X i = X j. E.g. one can randomly select one control for each treated from all cases where X i = X j (so M = 1). For the ATT we can then use ˆδ ATT EM = 1 (y i t i = 1) n 1 i j w ij (y j t j = 0), where n 1 is the number of treated cases and w ij is a weight calculated as 1/M ij for the matched control and 0 otherwise. (Note that this is a reformulation of the estimator we used in the above example.) (Morgan and Harding, 2006, 31)

62 Nearest neighbour matching Same as exact matching, except instead of looking for controls where X i = X j, we match with the nearest control on some dimension. The most common would be the propensity score, matching each treated case with one or more controls that are closest in π(x). The weights are then calculated as in exact matching. Interval matching is very similar, bundling cases within a particular range and then selecting controls from within the same group. (Morgan and Winship, 2007, )

63 Kernel matching Same as exact matching, but all treated are matched against all controls, with weights such that those closest to the treatment case are given the greatest weight: [ ] π(xj ) π(x G i ) a n w ij = [ j G π(xj ], ) π(x i ) a n where a n is a bandwidth parameter that scales the difference in the estimated propensity scores based on the sample size and G[ ] is a kernel function. (A kernel function needs to have a mean of zero and integrate to 1, for example, Smith and Todd (2005) use G(s) = (s2 1) 2 for s 1, G(s) = 0 otherwise.) (Morgan and Winship, 2007, 109)

64 : final points, blocking, matching all control for observed confounders. are similar, but differ in their weighting of cases. can be combined, using regression on matched data ( doubly robust ). Choice of matching algorithm significantly affects estimates extensive research is still ongoing into comparing these estimators. (Imbens, 2004; Morgan and Winship, 2007)

65 Sharp RD is applicable in situations where a continuous variable determines treatment, as in: { 1 if xi x T i = 0 0 if x i < x 0, with x 0 a known cutoff point. Note that this is the opposite of the overlap assumption in matching. (Angrist and Pischke, 2009, )

66 RD estimation Estimation of Sharp RD can be very straightforward, e.g. by running the regression y = Xβ + ρt + ε, where X contains the variable determining T. Nonlinear extensions are straightforward, e.g. X can include quadratic terms. (Angrist and Pischke, 2009, 253)

67 RD example: incumbent advantage (Lee, 2008, Fig. 4)

68 RD extensions: local regression A nonlinear relationship between X and Y can occassionally be mistaken for a regression.focusing on the data close to x 0, or weighting this data more heavily in the regression, can address this issue. Typically, nonparametric RD designs are used in this context. The risk is inefficiency and model sensitivity due to lack of data near the cutoff point. (Angrist and Pischke, 2009, 256)

69 Mistaken RD Nonlinearity mistaken for RD: (Angrist and Pischke, 2009, 254)

70 RD extensions: local regression A nonlinear relationship between X and Y can occassionally be mistaken for a regression.focusing on the data close to x 0, or weighting this data more heavily in the regression, can address this issue. Typically, nonparametric RD designs are used in this context. The risk is inefficiency and model sensitivity due to lack of data near the cutoff point. (Angrist and Pischke, 2009, 256)

71 RD extensions: fuzzy RD Fuzzy RD refers to a model where where g 1 (x i ) g 0 (x i ). { g1 (x P(T i = 1 x i ) = i ) if x i x 0 g 0 (x i ) if x i < x 0 This type of model can be estimated using a polynomial of X and/or an indicator for the cutoff point as instruments for T. (Angrist and Pischke, 2009, )

72 1 2 3

73 The situation where cor(x, ε) 0 is called endogeneity. has three main causes: Measurement error in X Simultaneity or reverse causation Omitted

74 The idea of instrumental variable (IV) estimation is to find a variable which is correlated with x, but not with ε, and that has no causal relationship with y other than through x, such that any effect it has on y must be attributed to x. y i = β 0 + β 1 x i + ε i x i = γ 0 + γ 1 z i + v i, assuming cov(x i, ε i ) 0, x is a cause of y, but z is not, and cov(z i, ε i x i ) = 0, then z is a good instrument for x.

75 : 2SLS A simple procedure to estimate IV is to use two-stage least (2SLS): Regress x on z plus all other independent ; Regress y on ˆx plus all other independent. The ˆβ on ˆx is the estimate for β.

76 : 2SLS ˆβ OLS IV = (ˆX ˆX) 1 ˆX y = (X Z(Z Z) 1 Z X) 1 X Z(Z Z) 1 Z y If X and Z have the same dimensions, this reduces to: ˆβ OLS IV = (Z X) 1 Z y

77 : 2SLS Because only the information in x which overlaps with z is used, standard errors will be larger. Estimation: V ( ˆβ IV OLS ) = ˆσ2 (ˆX ˆX) 1 = ˆσ 2 (X Z(Z Z) 1 Z X) 1 e e ˆσ 2 = n k = (y X ˆβ OLS IV ) (y X ˆβ OLS IV ) n k Note the use of X, not ˆX, to determine ˆσ 2.

78 Weak instruments x and z will correlate, but not perfectly. If they only weakly correlate, we call z a weak instrument. If instruments are weak: bias of IV is large (but IV is still consistent); variances are underestimated; estimation becomes unreliable; even very weak endogeneity in instrument can make IV worse than OLS.

79 H 0 : ˆβ OLS = ˆβ OLS IV H 1 : ˆβ OLS ˆβ OLS IV If H 0 is rejected, only IV estimator is consistent. H = ( ˆβ IV ˆβ OLS ) (V ( ˆβ IV ) V ( ˆβ OLS )) 1 ( ˆβ IV ˆβ OLS ) H χ 2 (rank(v ( ˆβ IV ) V ( ˆβ OLS ))

80 ATE vs LATE One concern with IV estimation is the distinction between average treatment effect (ATE) and local average treatment effect (LATE). We assume that the effect of x on y is constant across subgroups in the population. With IV, we also assume that this does not change when only looking at the variation in x explained by z. If for some groups in the population x depends on z, but for other groups it does not, then our estimate of β is only valid for the former group (i.e. LATE). If there is no such correlation, the estimate applies to the entire population (ATE). One solution to reduce the difference between ATE and LATE would be to add more instruments for the same x. More instruments increases the risks of introducing endogeneity in the instruments (Kennedy, 2008).

81 Achen, Christopher H Let s put garbage-can regressions and garbage-can probits where they belong. Conflict Management and Peace Science 22: Angrist, Joshua D. and Jörn-Steffen Pischke Mostly harmless econometrics: An empiricist s companion. Princeton: Princeton University Press. Gelman, Andrew and Jennifer Hill Data analysis using regression and multilevel/hierarchical models. Analytical Methods for Social Research Cambridge: Cambridge University Press. Gerring, John Social science methodology: A critical framework. Cambridge: Cambridge University Press. Gerring, John Social science methodology: A unified framework. Cambridge: Cambridge University Press. Imai, Kosuke, Gary King and Elizabeth A. Stuart Misunderstandings between experimentalists and observationalists about causal. Journal of the Royal Statistical Society A 171: Imbens, Guido W Nonparametric estimation of average treatment effects under exogeneity: A review. Review of Economics and Statistics 86(1):4 29. Kennedy, Peter A guide to econometrics. 6th ed. Malden, MA: Blackwell. Lee, David S Randomized experiments from non-random selection in U.S. House elections. Journal of Econometrics 142(2). Lee, Myoung Jae Micro-econometrics for policy, program, and treatment effects. Oxford: Oxford University Press. Morgan, Stephen L. and Christopher Winship Counterfactuals and causal. Methods and principles for social research. New York: Cambridge University Press. Morgan, Stephen L. and David J. Harding estimators of causal effects: Prospects and pitfalls in theory and practice. Sociological Methods & Research 35(1):3 60. Pearl, Judea Causality: Models, reasoning, and. Cambridge: Cambridge University Press. Rubin, Donald B Which ifs have causal answers (Comment on Statistics and causal by Paul W. Holland). Journal of the American Statistical Association 81: Smith, Jeffrey A. and Petra E. Todd Does matching overcome Lalonde s critique of nonexperimental estimators? Journal of Econometrics 125:

Research Design: Causal inference and counterfactuals

Research Design: Causal inference and counterfactuals Research Design: Causal inference and counterfactuals University College Dublin 8 March 2013 1 2 3 4 Outline 1 2 3 4 Inference In regression analysis we look at the relationship between (a set of) independent

More information

Notes on causal effects

Notes on causal effects Notes on causal effects Johan A. Elkink March 4, 2013 1 Decomposing bias terms Deriving Eq. 2.12 in Morgan and Winship (2007: 46): { Y1i if T Potential outcome = i = 1 Y 0i if T i = 0 Using shortcut E

More information

Selection on Observables: Propensity Score Matching.

Selection on Observables: Propensity Score Matching. Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017

More information

Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference 1

Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference 1 Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference 1 Gary King GaryKing.org April 13, 2014 1 c Copyright 2014 Gary King, All Rights Reserved. Gary King ()

More information

Statistical Models for Causal Analysis

Statistical Models for Causal Analysis Statistical Models for Causal Analysis Teppei Yamamoto Keio University Introduction to Causal Inference Spring 2016 Three Modes of Statistical Inference 1. Descriptive Inference: summarizing and exploring

More information

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers: Use of Matching Methods for Causal Inference in Experimental and Observational Studies Kosuke Imai Department of Politics Princeton University April 27, 2007 Kosuke Imai (Princeton University) Matching

More information

ESTIMATION OF TREATMENT EFFECTS VIA MATCHING

ESTIMATION OF TREATMENT EFFECTS VIA MATCHING ESTIMATION OF TREATMENT EFFECTS VIA MATCHING AAEC 56 INSTRUCTOR: KLAUS MOELTNER Textbooks: R scripts: Wooldridge (00), Ch.; Greene (0), Ch.9; Angrist and Pischke (00), Ch. 3 mod5s3 General Approach The

More information

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed

More information

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016 An Introduction to Causal Mediation Analysis Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016 1 Causality In the applications of statistics, many central questions

More information

The Economics of European Regions: Theory, Empirics, and Policy

The Economics of European Regions: Theory, Empirics, and Policy The Economics of European Regions: Theory, Empirics, and Policy Dipartimento di Economia e Management Davide Fiaschi Angela Parenti 1 1 davide.fiaschi@unipi.it, and aparenti@ec.unipi.it. Fiaschi-Parenti

More information

Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha

Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha January 18, 2010 A2 This appendix has six parts: 1. Proof that ab = c d

More information

What s New in Econometrics. Lecture 1

What s New in Econometrics. Lecture 1 What s New in Econometrics Lecture 1 Estimation of Average Treatment Effects Under Unconfoundedness Guido Imbens NBER Summer Institute, 2007 Outline 1. Introduction 2. Potential Outcomes 3. Estimands and

More information

Quantitative Economics for the Evaluation of the European Policy

Quantitative Economics for the Evaluation of the European Policy Quantitative Economics for the Evaluation of the European Policy Dipartimento di Economia e Management Irene Brunetti Davide Fiaschi Angela Parenti 1 25th of September, 2017 1 ireneb@ec.unipi.it, davide.fiaschi@unipi.it,

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University Joint

More information

On the Use of Linear Fixed Effects Regression Models for Causal Inference

On the Use of Linear Fixed Effects Regression Models for Causal Inference On the Use of Linear Fixed Effects Regression Models for ausal Inference Kosuke Imai Department of Politics Princeton University Joint work with In Song Kim Atlantic ausal Inference onference Johns Hopkins

More information

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers: Use of Matching Methods for Causal Inference in Experimental and Observational Studies Kosuke Imai Department of Politics Princeton University April 13, 2009 Kosuke Imai (Princeton University) Matching

More information

Dynamics in Social Networks and Causality

Dynamics in Social Networks and Causality Web Science & Technologies University of Koblenz Landau, Germany Dynamics in Social Networks and Causality JProf. Dr. University Koblenz Landau GESIS Leibniz Institute for the Social Sciences Last Time:

More information

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. The Basic Methodology 2. How Should We View Uncertainty in DD Settings?

More information

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors Laura Mayoral IAE, Barcelona GSE and University of Gothenburg Gothenburg, May 2015 Roadmap Deviations from the standard

More information

Implementing Matching Estimators for. Average Treatment Effects in STATA

Implementing Matching Estimators for. Average Treatment Effects in STATA Implementing Matching Estimators for Average Treatment Effects in STATA Guido W. Imbens - Harvard University West Coast Stata Users Group meeting, Los Angeles October 26th, 2007 General Motivation Estimation

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Princeton University Asian Political Methodology Conference University of Sydney Joint

More information

The returns to schooling, ability bias, and regression

The returns to schooling, ability bias, and regression The returns to schooling, ability bias, and regression Jörn-Steffen Pischke LSE October 4, 2016 Pischke (LSE) Griliches 1977 October 4, 2016 1 / 44 Counterfactual outcomes Scholing for individual i is

More information

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. The Sharp RD Design 3.

More information

Empirical approaches in public economics

Empirical approaches in public economics Empirical approaches in public economics ECON4624 Empirical Public Economics Fall 2016 Gaute Torsvik Outline for today The canonical problem Basic concepts of causal inference Randomized experiments Non-experimental

More information

Implementing Matching Estimators for. Average Treatment Effects in STATA. Guido W. Imbens - Harvard University Stata User Group Meeting, Boston

Implementing Matching Estimators for. Average Treatment Effects in STATA. Guido W. Imbens - Harvard University Stata User Group Meeting, Boston Implementing Matching Estimators for Average Treatment Effects in STATA Guido W. Imbens - Harvard University Stata User Group Meeting, Boston July 26th, 2006 General Motivation Estimation of average effect

More information

Econometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous

Econometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous Econometrics of causal inference Throughout, we consider the simplest case of a linear outcome equation, and homogeneous effects: y = βx + ɛ (1) where y is some outcome, x is an explanatory variable, and

More information

Regression Discontinuity

Regression Discontinuity Regression Discontinuity Christopher Taber Department of Economics University of Wisconsin-Madison October 16, 2018 I will describe the basic ideas of RD, but ignore many of the details Good references

More information

Regression Discontinuity

Regression Discontinuity Regression Discontinuity Christopher Taber Department of Economics University of Wisconsin-Madison October 24, 2017 I will describe the basic ideas of RD, but ignore many of the details Good references

More information

Flexible Estimation of Treatment Effect Parameters

Flexible Estimation of Treatment Effect Parameters Flexible Estimation of Treatment Effect Parameters Thomas MaCurdy a and Xiaohong Chen b and Han Hong c Introduction Many empirical studies of program evaluations are complicated by the presence of both

More information

The problem of causality in microeconometrics.

The problem of causality in microeconometrics. The problem of causality in microeconometrics. Andrea Ichino University of Bologna and Cepr June 11, 2007 Contents 1 The Problem of Causality 1 1.1 A formal framework to think about causality....................................

More information

Teaching Causal Inference in Undergraduate Econometrics

Teaching Causal Inference in Undergraduate Econometrics Teaching Causal Inference in Undergraduate Econometrics October 24, 2012 Abstract This paper argues that the current way in which the undergraduate introductory econometrics course is taught is neither

More information

Michael Lechner Causal Analysis RDD 2014 page 1. Lecture 7. The Regression Discontinuity Design. RDD fuzzy and sharp

Michael Lechner Causal Analysis RDD 2014 page 1. Lecture 7. The Regression Discontinuity Design. RDD fuzzy and sharp page 1 Lecture 7 The Regression Discontinuity Design fuzzy and sharp page 2 Regression Discontinuity Design () Introduction (1) The design is a quasi-experimental design with the defining characteristic

More information

Discussion: Can We Get More Out of Experiments?

Discussion: Can We Get More Out of Experiments? Discussion: Can We Get More Out of Experiments? Kosuke Imai Princeton University September 4, 2010 Kosuke Imai (Princeton) Discussion APSA 2010 (Washington D.C.) 1 / 9 Keele, McConnaughy, and White Question:

More information

Econometrics with Observational Data. Introduction and Identification Todd Wagner February 1, 2017

Econometrics with Observational Data. Introduction and Identification Todd Wagner February 1, 2017 Econometrics with Observational Data Introduction and Identification Todd Wagner February 1, 2017 Goals for Course To enable researchers to conduct careful quantitative analyses with existing VA (and non-va)

More information

An Alternative Assumption to Identify LATE in Regression Discontinuity Design

An Alternative Assumption to Identify LATE in Regression Discontinuity Design An Alternative Assumption to Identify LATE in Regression Discontinuity Design Yingying Dong University of California Irvine May 2014 Abstract One key assumption Imbens and Angrist (1994) use to identify

More information

Instrumental Variables

Instrumental Variables Instrumental Variables Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Noncompliance in Experiments Stat186/Gov2002 Fall 2018 1 / 18 Instrumental Variables

More information

ted: a Stata Command for Testing Stability of Regression Discontinuity Models

ted: a Stata Command for Testing Stability of Regression Discontinuity Models ted: a Stata Command for Testing Stability of Regression Discontinuity Models Giovanni Cerulli IRCrES, Research Institute on Sustainable Economic Growth National Research Council of Italy 2016 Stata Conference

More information

Gov 2002: 4. Observational Studies and Confounding

Gov 2002: 4. Observational Studies and Confounding Gov 2002: 4. Observational Studies and Confounding Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last two weeks: randomized experiments. From here on: observational studies. What

More information

Regression Discontinuity

Regression Discontinuity Regression Discontinuity STP Advanced Econometrics Lecture Douglas G. Steigerwald UC Santa Barbara March 2006 D. Steigerwald (Institute) Regression Discontinuity 03/06 1 / 11 Intuition Reference: Mostly

More information

Propensity Score Methods for Causal Inference

Propensity Score Methods for Causal Inference John Pura BIOS790 October 2, 2015 Causal inference Philosophical problem, statistical solution Important in various disciplines (e.g. Koch s postulates, Bradford Hill criteria, Granger causality) Good

More information

Propensity Score Matching

Propensity Score Matching Methods James H. Steiger Department of Psychology and Human Development Vanderbilt University Regression Modeling, 2009 Methods 1 Introduction 2 3 4 Introduction Why Match? 5 Definition Methods and In

More information

Methods to Estimate Causal Effects Theory and Applications. Prof. Dr. Sascha O. Becker U Stirling, Ifo, CESifo and IZA

Methods to Estimate Causal Effects Theory and Applications. Prof. Dr. Sascha O. Becker U Stirling, Ifo, CESifo and IZA Methods to Estimate Causal Effects Theory and Applications Prof. Dr. Sascha O. Becker U Stirling, Ifo, CESifo and IZA last update: 21 August 2009 Preliminaries Address Prof. Dr. Sascha O. Becker Stirling

More information

ECO Class 6 Nonparametric Econometrics

ECO Class 6 Nonparametric Econometrics ECO 523 - Class 6 Nonparametric Econometrics Carolina Caetano Contents 1 Nonparametric instrumental variable regression 1 2 Nonparametric Estimation of Average Treatment Effects 3 2.1 Asymptotic results................................

More information

Job Training Partnership Act (JTPA)

Job Training Partnership Act (JTPA) Causal inference Part I.b: randomized experiments, matching and regression (this lecture starts with other slides on randomized experiments) Frank Venmans Example of a randomized experiment: Job Training

More information

Applied Microeconometrics Chapter 8 Regression Discontinuity (RD)

Applied Microeconometrics Chapter 8 Regression Discontinuity (RD) 1 / 26 Applied Microeconometrics Chapter 8 Regression Discontinuity (RD) Romuald Méango and Michele Battisti LMU, SoSe 2016 Overview What is it about? What are its assumptions? What are the main applications?

More information

An Introduction to Causal Analysis on Observational Data using Propensity Scores

An Introduction to Causal Analysis on Observational Data using Propensity Scores An Introduction to Causal Analysis on Observational Data using Propensity Scores Margie Rosenberg*, PhD, FSA Brian Hartman**, PhD, ASA Shannon Lane* *University of Wisconsin Madison **University of Connecticut

More information

CompSci Understanding Data: Theory and Applications

CompSci Understanding Data: Theory and Applications CompSci 590.6 Understanding Data: Theory and Applications Lecture 17 Causality in Statistics Instructor: Sudeepa Roy Email: sudeepa@cs.duke.edu Fall 2015 1 Today s Reading Rubin Journal of the American

More information

The Simple Linear Regression Model

The Simple Linear Regression Model The Simple Linear Regression Model Lesson 3 Ryan Safner 1 1 Department of Economics Hood College ECON 480 - Econometrics Fall 2017 Ryan Safner (Hood College) ECON 480 - Lesson 3 Fall 2017 1 / 77 Bivariate

More information

Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded

Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded Controlling for latent confounding by confirmatory factor analysis (CFA) Blinded Blinded 1 Background Latent confounder is common in social and behavioral science in which most of cases the selection mechanism

More information

causal inference at hulu

causal inference at hulu causal inference at hulu Allen Tran July 17, 2016 Hulu Introduction Most interesting business questions at Hulu are causal Business: what would happen if we did x instead of y? dropped prices for risky

More information

Regression Discontinuity Designs

Regression Discontinuity Designs Regression Discontinuity Designs Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Regression Discontinuity Design Stat186/Gov2002 Fall 2018 1 / 1 Observational

More information

Technical Track Session I:

Technical Track Session I: Impact Evaluation Technical Track Session I: Click to edit Master title style Causal Inference Damien de Walque Amman, Jordan March 8-12, 2009 Click to edit Master subtitle style Human Development Human

More information

Applied Econometrics Lecture 1

Applied Econometrics Lecture 1 Lecture 1 1 1 Università di Urbino Università di Urbino PhD Programme in Global Studies Spring 2018 Outline of this module Beyond OLS (very brief sketch) Regression and causality: sources of endogeneity

More information

Causal Analysis in Social Research

Causal Analysis in Social Research Causal Analysis in Social Research Walter R Davis National Institute of Applied Statistics Research Australia University of Wollongong Frontiers in Social Statistics Metholodogy 8 February 2017 Walter

More information

EMERGING MARKETS - Lecture 2: Methodology refresher

EMERGING MARKETS - Lecture 2: Methodology refresher EMERGING MARKETS - Lecture 2: Methodology refresher Maria Perrotta April 4, 2013 SITE http://www.hhs.se/site/pages/default.aspx My contact: maria.perrotta@hhs.se Aim of this class There are many different

More information

Matching Techniques. Technical Session VI. Manila, December Jed Friedman. Spanish Impact Evaluation. Fund. Region

Matching Techniques. Technical Session VI. Manila, December Jed Friedman. Spanish Impact Evaluation. Fund. Region Impact Evaluation Technical Session VI Matching Techniques Jed Friedman Manila, December 2008 Human Development Network East Asia and the Pacific Region Spanish Impact Evaluation Fund The case of random

More information

External validity, causal interaction and randomised trials

External validity, causal interaction and randomised trials External validity, causal interaction and randomised trials Seán M. Muller University of Cape Town Evidence and Causality in the Sciences Conference University of Kent (Canterbury) 5 September 2012 Overview

More information

Front-Door Adjustment

Front-Door Adjustment Front-Door Adjustment Ethan Fosse Princeton University Fall 2016 Ethan Fosse Princeton University Front-Door Adjustment Fall 2016 1 / 38 1 Preliminaries 2 Examples of Mechanisms in Sociology 3 Bias Formulas

More information

PSC 504: Instrumental Variables

PSC 504: Instrumental Variables PSC 504: Instrumental Variables Matthew Blackwell 3/28/2013 Instrumental Variables and Structural Equation Modeling Setup e basic idea behind instrumental variables is that we have a treatment with unmeasured

More information

Experimental Designs for Identifying Causal Mechanisms

Experimental Designs for Identifying Causal Mechanisms Experimental Designs for Identifying Causal Mechanisms Kosuke Imai Princeton University October 18, 2011 PSMG Meeting Kosuke Imai (Princeton) Causal Mechanisms October 18, 2011 1 / 25 Project Reference

More information

Introduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015

Introduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015 Introduction to causal identification Nidhiya Menon IGC Summer School, New Delhi, July 2015 Outline 1. Micro-empirical methods 2. Rubin causal model 3. More on Instrumental Variables (IV) Estimating causal

More information

12E016. Econometric Methods II 6 ECTS. Overview and Objectives

12E016. Econometric Methods II 6 ECTS. Overview and Objectives Overview and Objectives This course builds on and further extends the econometric and statistical content studied in the first quarter, with a special focus on techniques relevant to the specific field

More information

An Alternative Assumption to Identify LATE in Regression Discontinuity Designs

An Alternative Assumption to Identify LATE in Regression Discontinuity Designs An Alternative Assumption to Identify LATE in Regression Discontinuity Designs Yingying Dong University of California Irvine September 2014 Abstract One key assumption Imbens and Angrist (1994) use to

More information

Empirical Methods in Applied Microeconomics

Empirical Methods in Applied Microeconomics Empirical Methods in Applied Microeconomics Jörn-Ste en Pischke LSE November 2007 1 Nonlinearity and Heterogeneity We have so far concentrated on the estimation of treatment e ects when the treatment e

More information

Difference-in-Differences Estimation

Difference-in-Differences Estimation Difference-in-Differences Estimation Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. The Basic Methodology 2. How Should We

More information

Potential Outcomes Model (POM)

Potential Outcomes Model (POM) Potential Outcomes Model (POM) Relationship Between Counterfactual States Causality Empirical Strategies in Labor Economics, Angrist Krueger (1999): The most challenging empirical questions in economics

More information

Difference-in-Differences Methods

Difference-in-Differences Methods Difference-in-Differences Methods Teppei Yamamoto Keio University Introduction to Causal Inference Spring 2016 1 Introduction: A Motivating Example 2 Identification 3 Estimation and Inference 4 Diagnostics

More information

Chapter 2 Regression Discontinuity Design: When Series Interrupt

Chapter 2 Regression Discontinuity Design: When Series Interrupt Chapter 2 Regression Discontinuity Design: When Series Interrupt Abstract This chapter introduces the identification and estimation of policy effects when outcome variables can be ordered according to

More information

Econ 673: Microeconometrics Chapter 12: Estimating Treatment Effects. The Problem

Econ 673: Microeconometrics Chapter 12: Estimating Treatment Effects. The Problem Econ 673: Microeconometrics Chapter 12: Estimating Treatment Effects The Problem Analysts are frequently interested in measuring the impact of a treatment on individual behavior; e.g., the impact of job

More information

Since the seminal paper by Rosenbaum and Rubin (1983b) on propensity. Propensity Score Analysis. Concepts and Issues. Chapter 1. Wei Pan Haiyan Bai

Since the seminal paper by Rosenbaum and Rubin (1983b) on propensity. Propensity Score Analysis. Concepts and Issues. Chapter 1. Wei Pan Haiyan Bai Chapter 1 Propensity Score Analysis Concepts and Issues Wei Pan Haiyan Bai Since the seminal paper by Rosenbaum and Rubin (1983b) on propensity score analysis, research using propensity score analysis

More information

Causal Inference Lecture Notes: Selection Bias in Observational Studies

Causal Inference Lecture Notes: Selection Bias in Observational Studies Causal Inference Lecture Notes: Selection Bias in Observational Studies Kosuke Imai Department of Politics Princeton University April 7, 2008 So far, we have studied how to analyze randomized experiments.

More information

The propensity score with continuous treatments

The propensity score with continuous treatments 7 The propensity score with continuous treatments Keisuke Hirano and Guido W. Imbens 1 7.1 Introduction Much of the work on propensity score analysis has focused on the case in which the treatment is binary.

More information

AN EVALUATION OF PARAMETRIC AND NONPARAMETRIC VARIANCE ESTIMATORS IN COMPLETELY RANDOMIZED EXPERIMENTS. Stanley A. Lubanski. and. Peter M.

AN EVALUATION OF PARAMETRIC AND NONPARAMETRIC VARIANCE ESTIMATORS IN COMPLETELY RANDOMIZED EXPERIMENTS. Stanley A. Lubanski. and. Peter M. AN EVALUATION OF PARAMETRIC AND NONPARAMETRIC VARIANCE ESTIMATORS IN COMPLETELY RANDOMIZED EXPERIMENTS by Stanley A. Lubanski and Peter M. Steiner UNIVERSITY OF WISCONSIN-MADISON 018 Background To make

More information

Statistical Methods. Missing Data snijders/sm.htm. Tom A.B. Snijders. November, University of Oxford 1 / 23

Statistical Methods. Missing Data  snijders/sm.htm. Tom A.B. Snijders. November, University of Oxford 1 / 23 1 / 23 Statistical Methods Missing Data http://www.stats.ox.ac.uk/ snijders/sm.htm Tom A.B. Snijders University of Oxford November, 2011 2 / 23 Literature: Joseph L. Schafer and John W. Graham, Missing

More information

Lecture 8. Roy Model, IV with essential heterogeneity, MTE

Lecture 8. Roy Model, IV with essential heterogeneity, MTE Lecture 8. Roy Model, IV with essential heterogeneity, MTE Economics 2123 George Washington University Instructor: Prof. Ben Williams Heterogeneity When we talk about heterogeneity, usually we mean heterogeneity

More information

For more information about how to cite these materials visit

For more information about how to cite these materials visit Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/

More information

19 Effect Heterogeneity and Bias in Main-Effects-Only Regression Models

19 Effect Heterogeneity and Bias in Main-Effects-Only Regression Models 19 Effect Heterogeneity and Bias in Main-Effects-Only Regression Models FELIX ELWERT AND CHRISTOPHER WINSHIP 1 Introduction The overwhelming majority of OLS regression models estimated in the social sciences,

More information

Causal Inference with Big Data Sets

Causal Inference with Big Data Sets Causal Inference with Big Data Sets Marcelo Coca Perraillon University of Colorado AMC November 2016 1 / 1 Outlone Outline Big data Causal inference in economics and statistics Regression discontinuity

More information

Effect Heterogeneity and Bias in Main-Effects- Only Regression Models

Effect Heterogeneity and Bias in Main-Effects- Only Regression Models Effect Heterogeneity and Bias in Regression 1 Effect Heterogeneity and Bias in Main-Effects- Only Regression Models FELIX ELWERT AND CHRISTOPHER WINSHIP Introduction The overwhelming majority of OLS regression

More information

Principles Underlying Evaluation Estimators

Principles Underlying Evaluation Estimators The Principles Underlying Evaluation Estimators James J. University of Chicago Econ 350, Winter 2019 The Basic Principles Underlying the Identification of the Main Econometric Evaluation Estimators Two

More information

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017 Introduction to Regression Analysis Dr. Devlina Chatterjee 11 th August, 2017 What is regression analysis? Regression analysis is a statistical technique for studying linear relationships. One dependent

More information

Gov 2002: 5. Matching

Gov 2002: 5. Matching Gov 2002: 5. Matching Matthew Blackwell October 1, 2015 Where are we? Where are we going? Discussed randomized experiments, started talking about observational data. Last week: no unmeasured confounders

More information

Covariate Balancing Propensity Score for General Treatment Regimes

Covariate Balancing Propensity Score for General Treatment Regimes Covariate Balancing Propensity Score for General Treatment Regimes Kosuke Imai Princeton University October 14, 2014 Talk at the Department of Psychiatry, Columbia University Joint work with Christian

More information

Controlling for overlap in matching

Controlling for overlap in matching Working Papers No. 10/2013 (95) PAWEŁ STRAWIŃSKI Controlling for overlap in matching Warsaw 2013 Controlling for overlap in matching PAWEŁ STRAWIŃSKI Faculty of Economic Sciences, University of Warsaw

More information

Regression Discontinuity

Regression Discontinuity Regression Discontinuity Christopher Taber Department of Economics University of Wisconsin-Madison October 9, 2016 I will describe the basic ideas of RD, but ignore many of the details Good references

More information

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai Weighting Methods Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Weighting Methods Stat186/Gov2002 Fall 2018 1 / 13 Motivation Matching methods for improving

More information

Matching. James J. Heckman Econ 312. This draft, May 15, Intro Match Further MTE Impl Comp Gen. Roy Req Info Info Add Proxies Disc Modal Summ

Matching. James J. Heckman Econ 312. This draft, May 15, Intro Match Further MTE Impl Comp Gen. Roy Req Info Info Add Proxies Disc Modal Summ Matching James J. Heckman Econ 312 This draft, May 15, 2007 1 / 169 Introduction The assumption most commonly made to circumvent problems with randomization is that even though D is not random with respect

More information

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors Laura Mayoral IAE, Barcelona GSE and University of Gothenburg Gothenburg, May 2015 Roadmap of the course Introduction.

More information

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score Causal Inference with General Treatment Regimes: Generalizing the Propensity Score David van Dyk Department of Statistics, University of California, Irvine vandyk@stat.harvard.edu Joint work with Kosuke

More information

Lecture 11 Roy model, MTE, PRTE

Lecture 11 Roy model, MTE, PRTE Lecture 11 Roy model, MTE, PRTE Economics 2123 George Washington University Instructor: Prof. Ben Williams Roy Model Motivation The standard textbook example of simultaneity is a supply and demand system

More information

II. MATCHMAKER, MATCHMAKER

II. MATCHMAKER, MATCHMAKER II. MATCHMAKER, MATCHMAKER Josh Angrist MIT 14.387 Fall 2014 Agenda Matching. What could be simpler? We look for causal effects by comparing treatment and control within subgroups where everything... or

More information

1 Correlation and Inference from Regression

1 Correlation and Inference from Regression 1 Correlation and Inference from Regression Reading: Kennedy (1998) A Guide to Econometrics, Chapters 4 and 6 Maddala, G.S. (1992) Introduction to Econometrics p. 170-177 Moore and McCabe, chapter 12 is

More information

Sociology 593 Exam 2 Answer Key March 28, 2002

Sociology 593 Exam 2 Answer Key March 28, 2002 Sociology 59 Exam Answer Key March 8, 00 I. True-False. (0 points) Indicate whether the following statements are true or false. If false, briefly explain why.. A variable is called CATHOLIC. This probably

More information

Imbens, Lecture Notes 1, Unconfounded Treatment Assignment, IEN, Miami, Oct 10 1

Imbens, Lecture Notes 1, Unconfounded Treatment Assignment, IEN, Miami, Oct 10 1 Imbens, Lecture Notes 1, Unconfounded Treatment Assignment, IEN, Miami, Oct 10 1 Lectures on Evaluation Methods Guido Imbens Impact Evaluation Network October 2010, Miami Methods for Estimating Treatment

More information

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Econometrics Week 8 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 25 Recommended Reading For the today Instrumental Variables Estimation and Two Stage

More information

Ec1123 Section 7 Instrumental Variables

Ec1123 Section 7 Instrumental Variables Ec1123 Section 7 Instrumental Variables Andrea Passalacqua Harvard University andreapassalacqua@g.harvard.edu November 16th, 2017 Andrea Passalacqua (Harvard) Ec1123 Section 7 Instrumental Variables November

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression Simple linear regression tries to fit a simple line between two variables Y and X. If X is linearly related to Y this explains some of the variability in Y. In most cases, there

More information

Interpreting Tests of School VAM Validity

Interpreting Tests of School VAM Validity Interpreting Tests of School VAM Validity By Joshua Angrist, Peter Hull, Parag Pathak, and Christopher Walters Public school districts increasingly look to the estimates generated by value-added models

More information

Controlling for Time Invariant Heterogeneity

Controlling for Time Invariant Heterogeneity Controlling for Time Invariant Heterogeneity Yona Rubinstein July 2016 Yona Rubinstein (LSE) Controlling for Time Invariant Heterogeneity 07/16 1 / 19 Observables and Unobservables Confounding Factors

More information