Instrumental Variables: Then and Now

Size: px
Start display at page:

Download "Instrumental Variables: Then and Now"

Transcription

1 Instrumental Variables: Then and Now James Heckman University of Chicago and University College Dublin Econometric Policy Evaluation, Lecture II Koopmans Memorial Lectures Cowles Foundation Yale University September 27, / 188

2 This lecture based on Heckman, Urzua and Vytlacil (2006) Understanding Instrumental Variables in Models With Essential Heterogeneity 2 / 188

3 Policy adoption problem Suppose a policy is proposed for adoption in a country. What can we conclude about the likely effectiveness of the policy in countries? Build a model of counterfactuals. Y 1 = µ 1 (X ) + U 1 (1.1) Y 0 = µ 0 (X ) + U 0. 3 / 188

4 Consider the basic generalized Roy model Two potential outcomes (Y 0, Y 1 ). A choice equation Observed outcomes are D = 1[µ D (Z, V ) > 0]. }{{} net utility Y = DY 1 + (1 D)Y 0 Assume µ D (Z, V ) = µ D (Z) V. 4 / 188

5 Switching Regression Notation Y = Y 0 + (Y 1 Y 0 )D (1.2) = µ 0 + (µ 1 µ 0 + U 1 U 0 )D + U 0. (Quandt, 1958, 1972) In Conventional Regression Notation Y = α + βd + ε (1.3) α = µ 0, β = (Y 1 Y 0 ) = µ 1 µ 0 + U 1 U 0, ε = U 0. β is the treatment effect. 5 / 188

6 Figure 1: distribution of gains, a Roy economy T T Return to Marginal Agent AT E T U T C=1.5 5 =Y 1 -Y 0 β = Y 1 Y 0 TT= 2.666, TUT= Return to Marginal Agent = C = 1.5, ATE = µ 1 µ 0 = β = / 188

7 The model Outcomes Choice Model Y 1 = µ 1 + U 1 = α + β + U 1 D = Y 0 = µ 0 + U 0 = α + U 0 { 1 if D > 0 0 if D 0 General Case (U 1 U 0 ) D ATE TT TUT 7 / 188

8 The model The Researcher Observes (Y, D, C) Y = α + βd + U 0 where β = Y 1 Y 0 Parameterization α = 0.67 (U 1, U [ 0 ) N (0, Σ) ] D = Y 1 Y 0 C β = 0.2 Σ = C = / 188

9 In the case when U 1 = U 0 = ε 0, simple least squares regression of Y on D subject to a selection bias. This is a form of endogeneity bias considered by the Cowles analysts. Upward biased for β if Cov(D, ε) > 0. 9 / 188

10 Three main approaches have been adopted to solve this problem: 1 Selection models 2 Instrumental variable models 3 Matching: assumes that ε D X. Matching is just nonparametric least squares and assumes access to rich data which happens to guarantee this condition. 10 / 188

11 Case I, the traditional case: β is a constant If there is an instrument Z, with the property that then Cov(Z, D) 0 (1.4) Cov(Z, ε) = 0, (1.5) plim ˆβ IV = Cov(Z, Y ) Cov(Z, D) = β. If other instruments exist, each identifies the same β. 11 / 188

12 Case II, heterogeneous response case: β is a random variable even conditioning on X Sorting bias or sorting on the gain which is distinct from sorting on the level. Essential heterogeneity Cov(β, D) 0. Suppose (1.4), (1.5) and Cov(Z, β) = 0. (1.6) Can we identify the mean of (Y 1 Y 0 ) using IV? 12 / 188

13 In general we cannot (Heckman and Robb, 1985). Let β = (µ 1 µ 0 ) β = β + η U 1 U 0 = η Y = α + βd + [ε + ηd]. Need Z to be uncorrelated with [ε + ηd] to use IV to identify β. This condition will be satisfied if policy adoption is made without knowledge of η (= U 1 U 0 ). If decisions about D are made with partial or full knowledge of η, IV does not identify β. 13 / 188

14 The IV condition is E [ε + ηd Z] = 0. E (ε Z) = 0, E (η Z) = 0. Even if η Z, η Z D = 1. E(ηD Z) = E(η D = 1, Z) Pr(D = 1 Z). But E (η Z, D = 1) 0, in general, if agents have some information about the gains. 14 / 188

15 Draft Lottery example (Heckman, 1997). Linear IV does not identify ATE or any standard treatment parameters. 15 / 188

16 Imbens Angrist conditions (1994) Imbens and Angrist (1994) establish that IV can identify an interpretable parameter in the model with essential heterogeneity. Their parameter is a discrete approximation to the marginal gain parameter of Björklund and Moffitt (1987). This parameter can be interpreted as the marginal gain to outcomes induced from a marginal change in the costs of participating in treatment (Björklund-Moffitt). 16 / 188

17 Imbens Angrist conditions (1994) Imbens and Angrist assume the existence of an instrument Z that takes two or more distinct values. Keep conditioning on X implicit. Let D i (z) be the indicator (= 1 if adopted; = 0 if not) It is a random variable for choice when we set Z = z. 17 / 188

18 Imbens Angrist conditions (1994) IV-1 (Independence) Z ( Y 1, Y 0, {D (z)} z Z ). IV-2 (Rank) Pr(D = 1 Z) depends on Z. They supplement the standard IV assumption with a monotonicity assumption. IV-3 (Monotonicity or Uniformity) D i (z) D i (z ) or D i (z) D i (z ) i = 1,..., I. 18 / 188

19 Imbens Angrist conditions (1994) Uniformity of responses across persons. Uniformity is satisfied when, for z < z, D i (z) D i (z ) for all i, while for z > z, D i (z ) D i (z ) for all i. 19 / 188

20 Imbens Angrist conditions (1994) These conditions imply the LATE parameter. E (Y Z = z) E (Y Z = z ) = E ((D(z) D(z )) (Y 1 Y 0 )) (Independence) 20 / 188

21 Imbens Angrist conditions (1994) Using iterated expectations, E (Y Z = z) E (Y Z = z ) (1.7) ( ) E (Y1 Y = 0 D (z) D (z ) = 1) Pr (D (z) D (z ) = 1) ( ) E (Y1 Y 0 D (z) D (z ) = 1) Pr (D (z) D (z. ) = 1) Monotonicity allows us to drop out one term. 21 / 188

22 Imbens Angrist conditions (1994) Suppose, for example, that Pr(D(z) D(z ) = 1) = 0. Thus, E (Y Z = z) E (Y Z = z ) = E (Y 1 Y 0 D(z) D(z ) = 1) Pr (D(z) D(z ) = 1). E (Y Z = z) E (Y Z = z ) LATE = Pr(D = 1 Z = z) Pr(D = 1 Z = z ) = E (Y 1 Y 0 D(z) D(z ) = 1) (1.8) The mean gain to those induced to switch from 0 to 1 by a change in Z from z to z. 22 / 188

23 Imbens Angrist conditions (1994) Observe LATE = ATE if Pr (D = 1 Z = z) = 1 while Pr (D = 1 Z = z ) = 0. Identification at infinity plays a crucial role throughout the entire literature on policy evaluation. 23 / 188

24 Imbens Angrist conditions (1994) In general, LATE E(Y 1 Y 0 ) = E(β). Not treatment on the treated: E(β D = 1). Different instruments define different parameters. Having a wealth of different strong instruments does not improve the precision of the estimate of any particular parameter (Heckman and Robb, 1986). When there are more than two distinct values of Z, Imbens and Angrist use Yitzhaki (1989) weights. 24 / 188

25 Imbens Angrist conditions (1994) Goal of our work: unify literature with a common set of underlying parameters interpretable across studies. To understand how to connect the results of various disparate IV estimands within a unified framework. 25 / 188

26 IV in choice models D = 1 [D > 0] (2.1) 1[ ] is an indicator (1[A] = 1 if A true; 0 otherwise). Example: µ D (Z) = γz D = µ D (Z) V (2.2) D = γz V 26 / 188

27 Examples (V Z) X. The propensity score: P(z) = Pr(D = 1 Z = z) = Pr(γz > V ) = F V (γz) F V is the distribution of V. 27 / 188

28 Examples Generalized Roy model D = 1[Y 1 Y 0 C > 0] Costs C = µ C (W ) + U C Z = (X, W ) µ D (Z) = µ 1 (X ) µ 0 (X ) µ C (W ) V = (U 1 U 0 U C ). 28 / 188

29 Heterogeneous response model In a general model with heterogenous responses, specification of P(Z) and its relationship with the instrument play a crucial role. Cov (Z, ηd) = E (( Z Z ) ηd ) = E (( Z Z ) η D = 1 ) Pr (D = 1) = E( ( Z Z ) η γz > V }{{} F V (γz) > F V (V ) P(Z) > U D )Pr (γz > V ). }{{} P(Z) Probability of selection enters the covariance even though we use only one component of Z as an instrument. 29 / 188

30 Selection models control for this dependence induced by choice. 30 / 188

31 Selection models Assume (U 1, U 0, V ) Z (2.3) [Alternatively (ε, η, V ) Z]. η = (U 1 U 0 ), ε = U 0 (2.4) E (Y D = 0, Z = z) = E(Y 0 D = 0, Z = z) = α + E(U 0 γz < V ) E(Y D = 0, Z = z) = α + K 0 (P(z) }{{} ) control function 31 / 188

32 Selection models E (Y D = 1, Z = z) = E (Y 1 D = 1, Z = z) = α + β + E (U 1 γz > V ) = α + β + K 1 (P(z)) }{{} control function K 0 (P(z)) and K 1 (P(z)) are control functions in the sense of Heckman and Robb (1985, 1986). P(z) is an essential ingredient. Matching: K 1 (P(z)) = K 0 (P(z)). 32 / 188

33 In a model where β is variable and not independent of V, misspecification of Z affects the interpretation of what IV estimates analogous to its role in selection models. Misspecification of Z affects both approaches to identification. This is a new phenomenon in models with heterogenous β. 33 / 188

34 Model for outcomes Y 1 = µ 1 (X, U 1 ) (3.1) Y 0 = µ 0 (X, U 0 ). X are observed and (U 1, U 0 ) are unobserved by the analyst. The X may be dependent on U 0 and U 1. Generalize choice model (2.1) and (2.2) for D, a latent utility. 34 / 188

35 Model for outcomes D = µ D (Z) V and D = 1 (D 0) (3.2) µ D (Z) V can be interpreted as a net utility for a person with characteristics (Z, V ). β = Y 1 Y 0 = µ 1 (X, U 1 ) µ 0 (X, U 0 ) (Treatment Effect) 35 / 188

36 Model for outcomes A special case that links our analysis to standard models in econometrics: Y 1 = X β 1 + U 1 and Y 0 = X β 0 + U 0 ; so β = X (β 1 β 0 ) + (U 1 U 0 ). In the case of separable outcomes, heterogeneity in β arises because in general U 1 U 0 and people differ in their X. Heckman-Vytlacil conditions (1999,2001, 2005) 36 / 188

37 Assumptions A-1 The distribution of µ D (Z) conditional on X is nondegenerate (Rank Condition for IV). This says that we can vary Z (excluded from outcome equations) given X. Key property of an instrument. A-2 (U 0, U 1, V ) are independent of Z conditional on X (Independence Condition for IV). Z is not affecting potential outcomes or affecting the unobservables affecting choices. 37 / 188

38 Assumptions A-3 The distribution of V is continuous (not essential). A-4 E Y 1 <, and E Y 0 < (Finite Means). 38 / 188

39 Assumptions A-5 1 > Pr (D = 1 X ) > 0 (For each X there is a treatment group and a comparison group). A-6 Let X 0 denote the counterfactual value of X that would have been observed if D is set to 0. X 1 is defined analogously. Thus X d = X, for d = 0, 1 (The X d are invariant to counterfactual manipulations). 39 / 188

40 Separability between V and µ D (Z) in choice equation is conventional. Plays an important role in the properties of instrumental variable estimators in models with essential heterogeneity. It implies monotonicity (uniformity) condition (IV-3) from choice equation (3.2). Vytlacil (2002) shows that independence and monotonicity (IV-3) imply the existence of a V and representation (3.2) given some regularity conditions. 40 / 188

41 Use probability integral transform to write D = 1 [F V (µ D (Z)) > F V (V )] = 1 [P (Z) > U D ] (3.3) U D = F V (V ) and P (Z) = F V (µ D (Z)) = Pr[D = 1 Z] P(Z) is transformation of mean scale utility in a discrete choice model. 41 / 188

42 LATE, the marginal treatment effect and instrumental variables A basic parameter that can be used to unify the treatment effect literature: MTE (x, u D ) = E(Y 1 Y 0 X = x, U D = u D ). = E (β X = x, V = v) MTE and the local average treatment effect (LATE) parameter are closely related. For (z, z ) Z(x) Z(x) so that P(z) > P(z ), under (IV-3) and independence (A-2), LATE is: LATE (z, z) = E (Y 1 Y 0 D (z) = 1, D (z ) = 0) (3.4) 42 / 188

43 LATE, the marginal treatment effect and instrumental variables LATE can be written in a fashion free of any instrument: E (Y 1 Y 0 D(z) = 1, D(z ) = 0) (3.5) = E (Y 1 Y 0 u D < U D < u D ) = LATE (u D, u D ) u D = Pr(D (z) = 1) = Pr (D (z) = 1 Z = z) = Pr(D (z) = 1) = P(z) u D = Pr (D (z ) = 1 Z = z ) = Pr(D (z ) = 1) = P(z ) The z just help us define evaluation points for the u D. 43 / 188

44 LATE, the marginal treatment effect and instrumental variables Under (A-1) (A-5), all standard treatment parameters are weighted averages of MTE with weights that can be estimated. 44 / 188

45 LATE, the marginal treatment effect and instrumental variables Table 1A: treatment effects and estimands as weighted averages of the marginal treatment effect ATE(x) = E (Y 1 Y 0 X = x) = 1 0 MTE (x, u D ) du D TT(x) = E (Y 1 Y 0 X = x, D = 1) = 1 0 MTE (x, u D ) ω TT (x, u D ) du D TUT(x) = E (Y 1 Y 0 X = x, D = 0) = 1 0 MTE (x, u D ) ω TUT (x, u D ) du D Policy Relevant Treatment Effect (x) = E (Y a X = x) E (Y a X = x) = 1 0 MTE (x, u D ) ω PRTE (x, u D ) du D for two policies a and a that affect the Z but not the X IV J (x) = 1 0 MTE (x, u D ) ω J IV (x, u D) du D, given instrument J OLS(x) = 1 0 MTE (x, u D ) ω OLS (x, u D ) du D 45 / 188

46 LATE, the marginal treatment effect and instrumental variables Table 1B: weights ω ATE (x, u D ) = 1 ω TT (x, u D ) = [ ] 1 1 u D f (p X = x)dp E(P X = x) ω TUT (x, u D ) = [ u D 0 f (p X = x) dp ] 1 E ((1 P) X = x) [ ] FPa,X (u D ) F Pa,X (u D ) ω PRTE (x, u D ) = P 46 / 188

47 LATE, the marginal treatment effect and instrumental variables Table 1B: weights ωiv J (x, u D) 1 u = D (J(Z) E(J(Z) X = x)) f J,P X (j, t X = x) dt dj Cov(J(Z), D X = x) ω OLS (x, u { D ) } E(U1 X = x, U D = u D ) ω 1 (x, u D ) E(U 0 X = x, U D = u D ) ω 0 (x, u D ) = 1 + MTE (x, u D ) 47 / 188

48 LATE, the marginal treatment effect and instrumental variables Table 1B: weights ω 1 (x, u D ) = [ ] [ ] 1 1 u D f (p X = x) dp E(P X = x) ω 0 (x, u D ) = [ u D 0 f (p X = x) dp ] 1 E((1 P) X = x) Source: Heckman and Vytlacil (2005) 48 / 188

49 LATE, the marginal treatment effect and instrumental variables Figure 2: weights for the marginal treatment effect for different parameters 2 D P h(u D ) MTE.... U D 49 / 188

50 LATE, the marginal treatment effect and instrumental variables E(β U D = u D ) does not vary with u D. Standard case. ATE = TT = LATE = policy counterfactuals = plim IV. 50 / 188

51 LATE, the marginal treatment effect and instrumental variables When will E(β U D = u D ) not vary with u D? 1 If U 1 = U 0 β a Constant. 2 More Generally, if U 1 U 0 is mean independent of U D, so treatment effect heterogeneity is allowed but individuals do not act upon their own idiosyncratic effect. 51 / 188

52 LATE, the marginal treatment effect and instrumental variables Consider standard analysis. plim of OLS: ln Y = α + ( β + U 1 U 0 )D + U 0 E (ln Y D = 1) E (ln Y D = 0) { = β E (U0 D = 1) + E(U 1 U 0 D = 1) + E (U 0 D = 0) = ATE + Sorting Gain + Ability Bias }{{} } = TT + Ability Bias 52 / 188

53 LATE, the marginal treatment effect and instrumental variables If ATE is a parameter of interest, OLS suffers from both sorting bias and ability bias. If TT is parameter of interest, OLS suffers from ability bias. Using IV removes ability bias, but changes the parameter being estimated (neither ATE nor TT in general). Different IV Weight MTE differently. We derive IV weights below. 53 / 188

54 LATE, the marginal treatment effect and instrumental variables IV Instrument Dependent (which Z used and which values of Z used). Hence studies using different Z are not comparable. How to make studies comparable? We can test to see if these complications are required in any particular empirical analysis. 54 / 188

55 Identifying MTE Testing for essential heterogeneity E (Y Z = z) = E(Y P(Z) = p) (index sufficiency) = E (DY 1 + (1 D)Y 0 P (Z) = p) = E(Y 0 ) + E (D (Y 1 Y 0 ) P (Z) = p) [ E(Y1 Y = E(Y 0 ) + 0 D = 1, P (Z) = p) Pr (D = 1 Z = z) = E(Y 0 ) + p 0 E(Y 1 Y 0 U D = u D ) du D. ] 55 / 188

56 Identifying MTE Testing for essential heterogeneity As a consequence, we get LIV (Local Instrumental Variables), which identifies MTE E (Y Z = z) P(z) = E(Y 1 Y 0 U D = u D ). (3.6) P(Z)=uD }{{}}{{} MTE LIV When β D, Y is linear in P (Z): where b = MTE = ATE = TT. E (Y Z) = a + bp (Z) (3.7) These results are valid whether or not Y 1 and Y 0 are separable in U 1 and U 0. Therefore we can identify the treatment parameters using estimated weights and estimated MTE. 56 / 188

57 Identifying MTE Example: college attendance on wages for high school graduates E(Y X, P) as a function of P for average X E(Y P) P Source: Carneiro, Heckman and Vytlacil (2006) 57 / 188

58 Identifying MTE Example: college attendance on wages for high school graduates E(Y 1 Y ) X, U S ) estimated using locally quadratic regression (averaged over X ) E(Y 1 - Y 0 X,U S ) U S Source: Carneiro, Heckman and Vytlacil (2006) 58 / 188

59 Identifying MTE Example: costs of breast cancer treatments using different instruments in P(Z) MTE(UD) ( in $) MTE(UD) Profile and 95% CI with IV = FEEDIF UD Source: Basu, Heckman and Urzua 59 / 188

60 Identifying MTE Example: costs of breast cancer treatments using different instruments in P(Z) MTE(UD) ( in $) MTE(UD) Profile and 95% CI with IV = NORTH, FEEDIF UD Source: Basu, Heckman and Urzua 60 / 188

61 Identifying MTE Example: costs of breast cancer treatments using different instruments in P(Z) Estimated propensity score for BCSRT and MST MTE(η q, u D ) E(P(Z) X) BCSRT MST Using IV = NORTH Source: Basu, Heckman and Urzua 61 / 188

62 Identifying MTE Example: costs of breast cancer treatments using different instruments in P(Z) ω ATE (η q, u D ) MTE(u D ) MTE(UD) ( in $) MTE(UD) Profile and 95% CI with IV = NORTH UD Source: Basu, Heckman and Urzua 62 / 188

63 Identifying MTE Example: costs of breast cancer treatments using different instruments in P(Z) ω TT (η q, u D ) ω IV (η q, u D ) Source: Basu, Heckman and Urzua 63 / 188

64 Identifying MTE Example: costs of breast cancer treatments using different instruments in P(Z) Source: Basu, Heckman and Urzua 64 / 188

65 Identifying MTE Example: costs of breast cancer treatments using different instruments in P(Z) Source: Basu, Heckman and Urzua 65 / 188

66 Identifying MTE Example: unionism on wages Cross Section: 1989 E(Y X, P) Source: Heckman, Schmierer and Urzua (2006) 66 / 188

67 Identifying MTE Example: unionism on wages Bootstrap Mean Estimates of MTE Note: Source: The top Heckman, panel plots Schmierer the estimated and Urzua outcomes (2006) using the whole sample over the support of P with X fixed at its mean value. The bottom panel plots the estimated outcomes using the bootstrap mean coefficients. In each panel, the dotted line represents the estimates using only a linear term in P and the solid line has up to a quartic term in P. 67 / 188

68 Understanding what linear IV estimates Consider J(Z) as an instrument, a scalar function of Z. IV J = Cov(Y, J(Z)) Cov(D, J(Z)) Express it as a weighted average of MTE. Z can be a vector of instruments. 68 / 188

69 Understanding what linear IV estimates Under (A-1) (A-5) and separable choice model IV J = 1 0 MTE (u D ) ω J IV (u D ) du D (3.8) ωiv J (u D ) = E ( ) J (Z) J(Z) P (Z) > u D Pr (P (Z) > ud ). Cov (J (Z), D) (3.9) J(Z) and P(Z) do not have to be continuous random variables. Functional forms of P(Z) and J(Z) are general. 69 / 188

70 Understanding what linear IV estimates Dependence between J(Z) and P(Z) gives shape and sign to the weights. If J(Z) = P(Z), then weights obviously non-negative. If E(J(Z) J(Z) P(Z) u D ) not monotonic in u D, weights can be negative. 70 / 188

71 Understanding what linear IV estimates When J(Z) = P(Z), weight (3.9) follows from Yitzhaki (1989). He considers a regression function E (Y P (Z) = p). Linear regression of Y on P identifies β Y,P = 1 0 ω (p) = [ E (Y P (Z) = p) 1 p p (t E (P)) df P (t) Var (P) ] ω (p) dp, This is the weight (3.9) when P is the instrument. This expression does not require uniformity or monotonicity for the model; consistent with 2-way flows.. 71 / 188

72 Understanding what linear IV estimates Yitzhaki s theorem Theorem Assume (Y, X ) i.i.d. E( Y ) < E( X ) < µ Y = E(Y ) µ X = E(X ) E(Y X ) = g(x ) Assume g (X ) exists and E ( g (X ) ) <. 72 / 188

73 Understanding what linear IV estimates Yitzhaki s theorem Theorem (cont.) Then, where ω(t) = = Cov(Y, X ) Var(X ) = g (t) ω(t) dt, 1 (x µ X ) f X (x) dx Var(X ) t 1 Var(X ) E (X µ X X > t) Pr (X > t). Y = πx + η, π = Cov(Y, X ). Var(X ) 73 / 188

74 Understanding what linear IV estimates Proof of Yitzhaki s theorem Proof. Cov(Y, X ) = Cov (E(Y X ), X ) = Cov (g(x ), X ) = where t is an argument of integration. g(t)(t µ X ) f X (t) dt 74 / 188

75 Understanding what linear IV estimates Proof of Yitzhaki s theorem cont. Integration by parts: Cov(Y, X ) = g(t) = t (x µ X ) f X (x) dx t g (t) g (t) t since E (X µ X ) = 0. (x µ X ) f X (x) dx dt (x µ X ) f X (x) dx dt, 75 / 188

76 Understanding what linear IV estimates Proof of Yitzhaki s theorem cont. Therefore, Cov(Y, X ) = Result follows with ω(t) = g (t) E (X µ X X > t) Pr (X > t) dt. 1 Var(X ) E (X µ X X > t) Pr (X > t) 76 / 188

77 Understanding what linear IV estimates Weights positive. Integrate to one (use integration by parts formula). = 0 when t and t. We get claimed results on weights for TSLS when in place of X we use P(Z). Weight reaches its peak at t = µ X, if f X has density at x = µ X : d dt t (x µ X ) f X (x) dx dt = (t µ X )f X (t) = 0 at t = µ X. 77 / 188

78 Yitzhaki, s Weights Understanding what linear IV estimates Yitzhaki s weights for X BetaPDF(x, α, β) d[g(x)]/dx; b =0, c =0.5 2 w(x); =1, / = X,Y X 2 w(x); =2, / = X,Y X 2 w(x); =5, / = X,Y X 2 w(x); =10, / = X,Y X x, g(x) = b*x + c*log(x), X ~ BetaPDF(x,, ); =5 78 / 188

79 Adoption model 1.5 IV General model Comparing models Examples GED Separability Conclusion Understanding what linear IV estimates 1 Yitzhaki s weights for X BetaPDF(x, α, β) x, g(x) = b*x + c*log(x), X ~ BetaPDF(x,, ); =5 79 / 188

80 Understanding what linear IV estimates Can apply Yitzhaki s analysis to the treatment effect model Y = α + βd + ε P(Z), the propensity score is the instrument: E (Y Z = z) = E (Y P(Z) = p) 80 / 188

81 Understanding what linear IV estimates E (Y P(Z) = p) = α + E (βd P(Z) = p) = α + E (β D = 1, P(Z) = p) p = α + E (β P(Z) > U D, P(Z) = p) p = α + E (β p > U D ) p p = α + β f (β, u D ) du D } 0 {{ } Derivative with respect to p is MTE. g (p) =MTE and weights as before. g(p) 81 / 188

82 Understanding what linear IV estimates Under uniformity, E (Y P (Z) = p) p = E (Y 1 Y 0 U D = u D ) = MTE (u D ). More generally, it is LIV = E(Y P(Z)=p) p. Yitzhaki s result does not rely on uniformity; true of any regression of Y on P. Estimates a weighted net effect. The expression can be generalized. It produces Heckman-Vytlacil weights. 82 / 188

83 Understanding what linear IV estimates The Heckman-Vytlacil weight as a Yitzhaki weight Proof. ( Cov (J (Z), Y ) = E Y J ) ( = E E (Y Z) J ) (Z) ( = E E (Y P (Z)) J ) (Z) ( = E g (P (Z)) J ) (Z). J = J (Z) E (J (Z) P (Z) ud ), E (Y P (Z)) = g (P (Z)). 83 / 188

84 Understanding what linear IV estimates The Heckman-Vytlacil weight as a Yitzhaki weight cont. Cov (J (Z), Y ) = = 1 J 0 1 J g (u D ) 0 J g (u D ) jf P,J (u D, j) djdu D J jf P,J (u D, j) djdu D. 84 / 188

85 Understanding what linear IV estimates The Heckman-Vytlacil weight as a Yitzhaki weight cont. Use integration by parts: Cov (J (Z), Y ) = g (u D ) = = ud J g (u D ) 0 J 1 g (u D ) E 0 1 J jf P,J (p, j) djdp g (u D ) u D J ud J 0 J jf P,J (p, j) djdpdu D 1 0 jf P,J (p, j) djdpdu D ( J (Z) P (Z) ud ) Pr (P (Z) u D ) du D. 85 / 188

86 Understanding what linear IV estimates The Heckman-Vytlacil weight as a Yitzhaki weight cont. Thus: g (u D ) = E (Y P (Z) = p) P (Z) = MTE (u D ). p=ud 86 / 188

87 Understanding the structure of the IV weights Recapitulate: ω J IV (u D ) = J IV = MTE (u D ) ω J IV(u D ) du D 1 (j E(J (Z))) f J,P (j, t) dt dj u D Cov (J (Z), D) (3.10) The weights are always positive if J (Z) is monotonic in the scalar Z. In this case J (Z) and P (Z) have the same distribution and f J,P (j, t) collapses to a single distribution. 87 / 188

88 Understanding the structure of the IV weights The possibility of negative weights arises when J(Z) is not a monotonic function of P(Z). It can also arise when there are two or more instruments, and the analyst computes estimates with only one instrument or a combination of the Z instruments that is not a monotonic fuction of P(Z) so that J(Z) and P(Z) are not perfectly dependent. 88 / 188

89 Understanding the structure of the IV weights The weights can be constructed from data on (J, P, D). Data on (J (Z), P (Z)) pairs and (J (Z), D) pairs (for each X value) are all that is required. 89 / 188

90 Understanding the structure of the IV weights Discrete instruments J (Z) Discrete Case Support of the distribution of P(Z) contains a finite number of values p 1 < p 2 < < p K. Support of the instrument J (Z) is also discrete, taking I distinct values. E(J(Z) P(Z) u D ) is constant in u D for u D within any (p l, p l+1 ) interval, and Pr(P(Z) u D ) is constant in u D for u D within any (p l, p l+1 ) interval. Let λ l denote the weight on the LATE for the interval (p l, p l+1 ). 90 / 188

91 Understanding the structure of the IV weights Discrete instruments J (Z) Under monotonicity, or uniformity IV J = E(Y 1 Y 0 U D = u D )ωiv J (u D ) du D (3.11) = = K 1 pl+1 1 λ l E(Y 1 Y 0 U D = u D ) p l (p l+1 p l ) du D l=1 K 1 LATE (p l, p l+1 )λ l. l=1 91 / 188

92 Understanding the structure of the IV weights Discrete instruments J (Z) Let j i be the i th smallest value of the support of J(Z). λ l = I (j i E(J(Z))) K (f (j i, p t )) t>l Cov (J (Z), D) (p l+1 p l ) (3.12) i=1 92 / 188

93 Understanding the structure of the IV weights Discrete instruments J (Z) In general, this formula is true, under index sufficiency even if monotonicity is violated. It s certainly true under (A-1) (A-5). True where LATE (p l, p l+1 ) is replaced by the Wald estimator, based on P(z l ), l = 1,..., L, instruments. Observe, LATE here defined in terms of P(Z), the natural instrument. 93 / 188

94 Understanding the structure of the IV weights Discrete instruments J (Z) Generalizes the expression presented by Imbens and Angrist (1994) and Yitzhaki (1989, 1996) Their analysis of the case of vector Z only considers the case where J(Z) and P(Z) are perfectly dependent because J(Z) is a monotonic function of P (Z). More generally, the weights can be positive or negative for any l but they must sum to 1 over the l. 94 / 188

95 The central role of the propensity score For the IV weight to be correctly constructed and interpreted, we need to know the correct model for P (Z). IV depends on: 1 the choice of the instrument J (Z), 2 its dependence with P (Z), 3 the specification of the propensity score (i.e., what variables go into Z). Structural LATE or MTE identified by P(Z). Can derive all other instrumental variable estimators in terms of weighted averages of MTE or LATE. 95 / 188

96 Monotonicity, uniformity and conditional instruments Monotonicity or uniformity condition (IV-3) rules out general heterogeneous responses to treatment choices in response to changes in Z. The recent literature on instrumental variables with heterogeneous responses is asymmetric. The uniformity condition can be violated even when all components of γ are of the same sign if Z is a vector and γ is a nondegenerate random variable. D = 1 [γz > γ] 96 / 188

97 Monotonicity, uniformity and conditional instruments Uniformity is a condition on a vector. Changing one coordinate of Z, holding the other coordinates at different values across people, will not necessarily produce uniformity. Let µ D (z) = γ 0 + γ 1 z 1 + γ 2 z 2 + γ 3 z 1 z 2, where γ 0, γ 1, γ 2 and γ 3 are constants. Consider changing z 1 from a common base state while holding z 2 fixed at different values across people. If γ 3 < 0 then µ D (z) does not necessarily satisfy the uniformity condition. 97 / 188

98 Monotonicity, uniformity and conditional instruments Positive weights and uniformity are distinct issues. Under uniformity, and assumptions (A-1) (A-5), the weights on MTE or LIV for any particular instrument may be positive or negative. 98 / 188

99 Monotonicity, uniformity and conditional instruments If we condition on Z 2 = z 2,..., Z K = z K using Z 1 as an instrument, then uniformity is satisfied. Effectively convert the problem back to that of a scalar instrument where the weights must be positive. The concept of conditioning on other instruments to produce positive weights for the selected instrument is a new idea. 99 / 188

100 Monotonicity and weights Monotonicity is a property needed to get treatment effects with just two values of Z, Z = z 1 and Z = z 2, to guarantee that IV estimates a treatment effect. With multiple values of Z we need to weight to produce linear IV. If our IV shifts P(Z) in same way for everyone, it shifts D in the same way for everyone, D = 1 [P(Z) U D ]. 100 / 188

101 Monotonicity and weights If P(Z) is instrument, monotonicity is obviously satisfied. If J(Z) is an instrument and not a monotonic function of P(Z), may not shift P(Z) in same way for all people. We can get two-way flows if, e.g., we use only one Z or else have a random coefficient model, D = 1 [γz V ]. Negative weights are a tip off of two-way flows. 101 / 188

102 Monotonicity and weights If we do not want a treatment effect, who cares? We do not always want a treatment effect. Go back to ask What economic question am I trying to answer? 102 / 188

103 Treatment effects vs. policy effects Even if uniformity condition (IV-3) fails, IV may answer relevant policy questions. IV or TSLS estimates a weighted average of marginal responses which may be pointwise positive or negative. Policies may induce some people to switch into and others to switch out of choices. Net effects are sometimes of interest in many policy analyses. 103 / 188

104 Treatment effects vs. policy effects Thus, subsidized housing in a region supported by higher taxes may attract some to migrate to the region and cause others to leave. The net effect on earnings from the policy is all that is required to perform cost benefit calculations of the policy on outcomes. If the housing subsidy is the instrument, the issue of monotonicity is a red herring. If the subsidy is exogenously imposed, IV estimates the net effect of the policy on mean outcomes. Only if the effect of migration induced by the subsidy on outcomes is the question of interest, and not the effect of the subsidy, does uniformity emerge as an interesting question. 104 / 188

105 Comparing selection and IV models Angrist and Krueger (1999) compare IV with selection models and view the former with favor. Useful to understand this comparison in a model with essential heterogeneity. IV is estimating the derivative (or finite changes) of the parameters of a selection model. IV only conditions on Z (and X ). 105 / 188

106 Comparing selection and IV models The control function approach conditions on Z and D (and X ). From index sufficiency, equivalent to conditioning on P (Z) and D: E (Y X, D, Z) (4.1) = µ 0 (X ) + [µ 1 (X ) µ 0 (X )] D + K 1 (P (Z), X ) D + K 0 (P (Z), X ) (1 D) K 1 (P(Z), X ) = E(U 1 D = 1, X, P (Z)) and K 0 (P(Z), X ) = E (U 0 D = 0, X, P (Z)). 106 / 188

107 Comparing selection and IV models IV approach does not condition on D. It works with the integral (over D) of (4.1). E (Y X, P(Z)) (4.2) = µ 0 (X ) + [µ 1 (X ) µ 0 (X )] P(Z) + K 1 (P (Z), X ) P(Z) + K 0 (P (Z), X ) (1 P(Z)) Under monotonicity and (A-1) (A-5) E(Y X, P(Z)) P(Z) = LIV (X, p) = MTE (X, p). P(Z)=p Control function builds up MTE from components. IV gets it in one fell swoop. 107 / 188

108 Comparing selection and IV models With rank and limit conditions (Heckman and Robb, 1985; Heckman, 1990), using control functions, one can identify µ 1 (X ), µ 0 (X ), K 1 (P (Z), X ), and K 0 (P (Z), X ). The selection (control function) estimator identifies the conditional means E (Y 1 X, P(Z), D = 1) = µ 1 (X ) + K 1 (X, P(Z)) (4.3a) and E (Y 0 X, P(Z), D = 0) = µ 0 (X ) + K 0 (X, P(Z)). (4.3b) 108 / 188

109 Comparing selection and IV models To decompose these means and separate µ 1 (X ) from K 1 (X, P(Z)) without invoking functional form assumptions, it is necessary to have an exclusion (a Z not in X ). This allows µ 1 (X ) and K 1 (X, P (Z)) to be independently varied with respect to each other. We can also invoke curvature conditions without exclusion of variables. (Heckman and Navarro, 2006) In addition there must exist a limit set for Z given X such that K 1 (X, P(Z)) = 0 for Z in that limit set. 109 / 188

110 Comparing selection and IV models Limit set not required for selection model if we are interested only in MTE or LATE. Not required in IV either if we only seek MTE or LATE. 110 / 188

111 Comparing selection and IV models Without functional form assumptions, it is not possible to disentangle µ 1 (X ) from K 1 (X, P(Z)) which may contain constants and functions of X that do not interact with P(Z) (see Heckman (1990)). These limit set arguments are needed for ATE or TT, not LATE or LIV. 111 / 188

112 IV method IV method works with derivatives of (4.2) and not levels. Cannot directly recover the constant terms in (4.3a) and (4.3b). 112 / 188

113 IV method In summary, the control function method directly identifies levels while the LIV approach works with slopes. Constants that do not depend on P(Z) disappear from the LIV estimates of the model. 113 / 188

114 IV method The distributions of U 1, U 0 and V do not need to be specified to estimate control function models (see Powell, 1994). In particular, there is no reliance on normality. 114 / 188

115 Support problems for IV Support conditions with control function models have their counterparts in IV models. One common criticism of selection models is that without invoking functional form assumptions, identification of µ 1 (X ) and µ 0 (X ) requires that P(Z) 1 and P(Z) 0 in limit sets. Identification in limit sets is sometimes called identification at infinity. In order to identify ATE = E(Y 1 Y 0 X ), IV methods also require that P(Z) 1 and P(Z) 0 in limit sets, so an identification at infinity argument is implicit when IV is used to identify this parameter. 115 / 188

116 Support problems for IV The LATE parameter avoids this problem by moving the goal posts and redefining the parameter of interest from a level parameter like ATE or TT to a slope parameter like LATE which differences out the unidentified constants. We can identify this parameter by selection models or IV models without invoking identification at infinity. 116 / 188

117 Support problems for IV The IV estimator is model dependent, just like the selection estimator, but in application, the model does not have to be fully specified to obtain IV using Z (or J(Z)). However the distribution of P (Z) and the relationship between P (Z) and J (Z) generates the weights on MTE (or LIV). The interpretation placed on IV in terms of weights on MTE depends crucially on the specification of P (Z). In both control function and IV approaches for the general model of heterogeneous responses, P (Z) plays a central role. 117 / 188

118 Support problems for IV Two economists using the same instrument will obtain the same point estimate using the same data. Their interpretation of that estimate will differ depending on how they specify the arguments in P(Z), even if neither uses P(Z) as an instrument. By conditioning on P (Z), the control function approach makes the dependence of estimates on the specification of P (Z) explicit. The IV approach is less explicit and masks the assumptions required to economically interpret the empirical output of an IV estimation. 118 / 188

119 Examples based on choice theory Suppose cost of adopting the policy C is the same across all countries. Countries choose to adopt the policy if D > 0 where D is the net benefit: D = (Y 1 Y 0 C) and ATE = E (β) = E (Y 1 Y 0 ) = µ 1 µ 0 Treatment on the treated is E (β D = 1) = E (Y 1 Y 0 D = 1) = µ 1 µ 0 + E (U 1 U 0 D = 1). 119 / 188

120 Figure 1: distribution of gains The Roy Economy U 1 U 0 D T T Return to Marginal Agent AT E T U T C=1.5 5 β = Y=Y 1 1 -YY 0 0 TT= 2.666, TUT= Return to Marginal Agent = C = 1.5, ATE = µ 1 µ 0 = β = / 188

121 The model Outcomes Choice Model Y 1 = µ 1 + U 1 = α + β + U 1 D = Y 0 = µ 0 + U 0 = α + U 0 { 1 if D > 0 0 if D 0 General Case (U 1 U 0 ) D ATE TT TUT 121 / 188

122 The model The Researcher Observes (Y, D, C) Y = α + βd + U 0 where β = Y 1 Y 0 Parameterization α = 0.67 (U 1, U [ 0 ) N (0, Σ) ] D = Y 1 Y 0 C β = 0.2 Σ = C = Let C = γz, γ / 188

123 The Discrete instruments and the weights for LATE A. Standard Case B. Ch Figure 4A: monotonicity, the extended Roy economy Standard case z =(0,1) z' =(1,1) γz γz' β = Y 1 - Y / 188

124 Figure 2. Monotonicity The Extended Roy Economy Adoption model IV General model Comparing models Examples GED Separability Conclusion Discrete instruments and the weights for LATE Figure 4B: monotonicity, the extended Roy economy Changing Z 1 without controlling for Z 2 B. Changing Z 1 without Controlling for Z γz z =(0,1) z' =(1,1) z'' =(1,-1) γz'' γz' β = Y 1 - Y / 188

125 Discrete instruments and the weights for LATE Figure 4C: monotonicity, the extended Roy economy Random coefficient case or Z C. Random Coefficient Case ~ γ =( 0.5,0.5) ~ γ =(-0.5,0.5) z =(0,1) z' =(1,1) ~ ~ γ z= γ z ~ γ z' 0.05 ~ γz' β = Y 1 - Y / 188

126 Discrete instruments and the weights for LATE Figure 4: monotonicity, the extended Roy economy A. Standard Case B. Changing Z 1 without Controlling for Z 2 C. Random Coefficient Case z z 0 z z 0 or z z 00 z z 0 z =(0, 1) and z 0 =(1, 1) z =(0, 1), z 0 =(1, 1) and z 00 =(1, 1) z =(0, 1) and z 0 =(1, 1) γ is a random vector eγ =(0.5, 0.5) and eγ =( 0.5, 0.5) where eγ and eγ are two realizations of γ D(γz) D(γz 0 ) D(γz) D(γz 0 ) or D(γz) <D(γz 00 ) D ³eγz D ³eγz 0 and D (eγz) <D(eγz 0 ) For all individuals Depending on the value of z 0 or z 00 Depending on value of γ 126 / 188

127 Discrete instruments and the weights for LATE Figure 4: monotonicity, the extended Roy economy model Outcomes Choice Model Y 1 = α + β + U 1 j 1 if Y1 Y D = 0 γz > 0 0 if Y 1 Y 0 γz 0 Y 0 = α + U 0 with γz = γ 1 Z 1 + γ 2 Z 2 (U 1, U 0 ) N (0, Σ), Σ = Parameterization» 1 0.9, Z 1 = { 1, 0, 1} and Z 2 = { 1, 0, 1} α = 0.67, β = 0.2, γ = (0.5, 0.5) (except in Case C) 127 / 188

128 Discrete instruments and the weights for LATE Figure 5: IV weights and its components under discrete instruments when P(Z) is the instrument LATE (p l, p l+1 ) = E (Y P(Z) = p l+1) E (Y P(Z) = p l ) p l+1 p l = β (p l+1 p l ) + σ U1 U 0 ( φ ( Φ 1 (1 p l+1 ) ) φ ( Φ 1 (1 p l ) )) p l+1 p l λ l = (p l+1 p l ) = (p l+1 p l ) K K (p i E (P (Z))) f (p i, p t ) i=1 t>l Cov (Z 1, D) K (p t E (P (Z))) f (p t ) t>l Cov (Z 1, D) 128 / 188

129 Discrete instruments and the weights for LATE Joint probability distribution of (Z 1, Z 2 ) and the propensity score Z 1 \Z Cov(Z 1, Z 2 ) = (joint probabilities in ordinary type (Pr(Z 1 = z 1, Z 2 = z 2 )); propensity score in italics (Pr (D = 1 Z 1 = z 1, Z 2 = z 2 ))) 129 / 188

130 Discrete instruments and the weights for LATE Figure 5: IV weights and its components under discrete instruments when P(Z) is the instrument ATE = 0.2, TT = , TUT = and K 1 IV P(Z) = LATE (p l, p l+1 ) λ l = 0.09 l=1 130 / 188

131 Discrete instruments and the weights for LATE Figure 5A: IV weights and its components under discrete instruments when Figure 3. IV Weight and Its Components under Discrete Instrum P(Z) is the instrument (IV Weights) The Extended Roy Economy A. IV Weights B. E E(P(Z λ 1 λ 2 λ 3 λ E(P C. Local Average Treatment Eff 131 / 188

132 Discrete instruments and the weights for LATE Figure 5B: IV weights and its components under discrete instruments when under Discrete Instruments when P (Z) is the Instrument xtended P(Z) isroy the Economy instrument (E(P(Z) P(Z) > p l ) and E(P(Z))) B. E(P (Z) P (Z) > p l ) and E(P (Z)) E(P(Z))= E(P(Z) P(Z)>p 1 )E(P(Z) P(Z)>p 2 )E(P(Z) P(Z)>p 3 )E(P(Z) P(Z)>p 4 ) -0.2 verage Treatment Effects 132 / 188

133 Discrete instruments and the weights for LATE λ 2 λ λ 3 4 Figure 5C: IV weights and its components under discrete instruments when -0.2 P(Z) is the instrument (Local average treatment effects) C. Local Average Treatment Effects 0 E(P(Z) P(Z)>p 1 )E(P(Z) P(Z)>p 2 )E(P(Z) P(Z)>p 3 )E(P(Z LATE 0.340,0.438 LATE 0.438,0.540 LATE 0.540,0.640 LATE 0.640, The model is the same as the one presented below Figure / 188

134 Discrete instruments and the weights for LATE Consider using Z 1 as instrument If Z 1 and Z 2 are negatively dependent and E(Z 1 P(Z) > u D ) is not monotonic in u D, weights negative. This nonmonotonicity is evident in Figure 6B. This produces the pattern of negative weights shown in Figure 6A. Associated with two way flows. Two way flows are induced by uncontrolled variation in Z / 188

135 Figure 2. Monotonicity The Extended Roy Economy Adoption model IV General model Comparing models Examples GED Separability Conclusion Discrete instruments and the weights for LATE Figure 4B: monotonicity, the extended Roy economy Changing Z 1 without controlling for Z 2 B. Changing Z 1 without Controlling for Z 2 C γz z =(0,1) z' =(1,1) z'' =(1,-1) γz'' ~ γ = ~ γ = z = z' = 0.05 γz' β = Y 1 - Y / 188

136 Discrete instruments and the weights for LATE Figure 6: IV weights and its components under discrete instruments when Z 1 is the instrument Figure 4. IV Weight and Its Components under Discrete Instruments when Z 1 is the Instrument The Extended Roy Economy A. IV Weights B. E(Z 1 P (Z) >p ) and E(Z 1) E(Z 1 P(Z)>p 1 ) E(Z 1 P(Z)>p 2 ) E(Z 1 P(Z)>p 3 ) E(Z 1 P(Z)>p 4 ) 1 0 E(Z 1 )= λ λ 1 λ 2 λ ThemodelisthesameastheonepresentedbelowFigure2.Thevaluesofthetreatmentparametersarethesameasthe The model is the same as the one presented after figure 4. ones presented below Figure / 188

137 Discrete instruments and the weights for LATE λ 2 λ λ 3 4 Figure 5C: IV weights and its components under discrete instruments when -0.2 P(Z) is the instrument (local average treatment effects) C. Local Average Treatment Effects 0 E(P(Z) P(Z)>p 1 )E(P(Z) P(Z)>p 2 )E(P(Z) P(Z)>p 3 )E(P(Z LATE 0.340,0.438 LATE 0.438,0.540 LATE 0.540,0.640 LATE 0.640, The model is the same as the one presented below Figure / 188

138 Discrete instruments and the weights for LATE K 1 IV Z 1 = LATE (p l, p l+1 ) λ l = l=1 λ l = (p l+1 p l ) I K (z 1,i E (Z 1 )) f (z 1,i, p t ) i=1 t>l Cov (Z 1, D) 138 / 188

139 Discrete instruments and the weights for LATE Joint probability distribution of (Z 1, Z 2 ) and the propensity score Z 1 \Z Cov(Z 1, Z 2 ) = (joint probabilities in ordinary type (Pr(Z 1 = z 1, Z 2 = z 2 )); propensity score in italics (Pr (D = 1 Z 1 = z 1, Z 2 = z 2 ))) 139 / 188

140 Discrete instruments and the weights for LATE Conditional variable estimator and conditional local average treatment effect when Z 1 is the instrument (given Z 2 = z 2 ) Z 2 = 1 Z 2 = 0 Z 2 = 1 P ( 1, Z 2 ) = p P (0, Z 2 ) = p P (1, Z 2 ) = p λ λ LATE (p 1, p 2 ) LATE (p 2, p 3 ) IV Z 1 Z 2 =z / 188

141 Discrete instruments and the weights for LATE Conditional instrumental variable estimator XI 1 XI 1 IV Z 1 Z 2 =z 2 = LATE (p l, p l+1 Z 2 = z 2 ) λ l Z2 =z 2 = LATE (p l, p l+1 Z 2 = z 2 ) λ l Z2 =z 2 l=1 l=1 LATE (p l, p l+1 Z 2 = z 2 ) = E (Y P(Z) = p l+1, Z 2 = z 2 ) E (Y P(Z) = p l, Z 2 = z 2 ) p l+1 p l IX `z1,i E (Z 1 Z 2 = z 2 ) IX f `z 1,i, p t Z 2 = z 2 i=1 t>l λ l Z2 =z 2 = (p l+1 p l ) Cov (Z 1, D) IX (z 1,t E (Z 1 Z 2 = z 2 )) f (z 1,t, p t Z 2 = z 2 ) t>l = (p l+1 p l ) Cov (Z 1, D) 141 / 188

142 Discrete instruments and the weights for LATE Conditional instrumental variable estimator Probability Distribution of Z 1 Conditional on Z 2 (Pr(Z 1 = z 1 Z 2 = z 2 )) z 1 Pr(Z 1 = z 1 Z 2 = 1) Pr(Z 1 = z 1 Z 2 = 0) Pr(Z 1 = z 1 Z 2 = 1) / 188

143 Continuous instruments Figure 7 plots E(Y P(Z)) and MTE for the models displayed at the base of the figure. In cases I and II, β D. In case I, this is trivial since β is a constant. In case II, β is random but selection into D does not depend on β. Case III is the model with essential heterogeneity (β D). Figure 7A depicts E(Y P(Z)) in the three cases. 143 / 188

144 Continuous instruments Figure 7: conditional expectation of Y on P(Z) and the marginal treatment effect (MTE) Figure 5. Conditional Expectation of Y on P (Z) and the Marginal Treatment Effect (MTE) The Extended Roy Economy A. E(Y P (Z) =p) B. MTE (u D) Cases I and II Case III 0.2 Cases I and II Case III p p=u D 144 / 188

145 Continuous instruments Outcomes Choice Model j Y 1 = α + β 1 if D + U 1 D = > 0 0 if D 0 Y 0 = α + U 0 Case I Case II Case III U 1 = U 0 U 1 U 0 D U 1 U 0 D β =ATE=TT=TUT=IV β =ATE=TT=TUT=IV β =ATE TT TUT IV Parameterization Cases I, II and III Cases II and III Case III α = 0.67 (U 1, U 0» ) N (0, Σ) D = Y 1 Y 0 γz β = 0.2 with Σ = Z N (µ Z, Σ Z )» 9 2 µ Z = (2, 2) and Σ Z = 2 9 γ = (0.5, 0.5) 145 / 188

146 Continuous instruments Cases I and II make E(Y P(Z)) linear in P(Z) (see equation 3.7). Case III is nonlinear in P(Z) which arises when β D. The derivative of E(Y P(Z)) is presented in the right panel (Figure 7B). It is a constant in cases I and II (flat MTE) but declining in U D = P(Z) for the case with selection on the gain. 146 / 188

147 Continuous instruments MTE gives the mean marginal return for persons who have utility P(Z) = u D (P(Z) = u D is the margin of indifference). Figure 7 highlights that MTE (and LATE) identify average returns for persons at the margin of indifference at different levels of the mean utility function P(Z). Figure 8 plots MTE and LATE for different intervals of u D using the model plotted in Figure 7. LATE is the chord of E(Y P(Z)) evaluated at different points. The relationship between LATE and MTE is presented in the right panel of Figure / 188

148 Continuous instruments Figure 8: the local average treatment effect Figure 6. The Local Average Treatment Effect The Extended Roy Economy A. E(Y P (Z) =p) and LATE (p,p +1) B. MTE (u D) and LATE (p,p +1) A LATE(0.1,0.35)= LATE(0.6,0.9)= C B D Area(E,F,G,H)/0.3=LATE(0.6,0.9)= 1.17 E F G Area(A,B,C,D)/0.25=LATE(0.1,0.35)= H p u D =p 148 / 188

149 Continuous instruments Figure 8: the local average treatment effect LATE (p l, p l+1 ) = E (Y P(Z) = p l+1) E (Y P(Z) = p l ) p l+1 p l p l+1 MTE (u D )du D = p l p l+1 p l LATE (0.1, 0.35) = LATE (0.6, 0.9) = / 188

150 Continuous instruments Figure 8: the local average treatment effect Outcomes Choice Model Y 1 = α + β + U 1 { 1 if D D = > 0 0 if D 0 Y 0 = α + U 0 with D = Y 1 Y 0 γz Σ = Parameterization (U 1, U 0 ) N (0, Σ) and Z N (µ Z, Σ Z ) [ ] [ ] , µ Z = (2, 2) and Σ Z = 2 9 α = 0.67, β = 0.2, γ = (0.5, 0.5) 150 / 188

151 Continuous instruments The treatment parameters as a function of p associated with case III are plotted in Figure 9. MTE is the same as that reported in Figure 7. ATE is the same for all p. TT (p) = E(Y 1 Y 0 D = 1, P(Z) = p) declines in p (equivalently, it declines in u D ). LATE(p, p ) = TT (p )p TT (p)p, p p p p MTE = [ TT (p)p]. p 151 / 188

152 Continuous instruments Parameter Definition Under Assumptions (*) Marginal Treatment Effect E [Y 1 Y 0 D =0,P(Z) =p] β + σu1 U0Φ 1 (1 p) Average Treatment Effect E [Y 1 Y 0 P (Z) =p] β Treatment on the Treated E [Y 1 Y 0 D φ(φ > 0,P(Z) =p] β + 1 (1 p)) σu1 U0 Treatment on the Untreated E [Y 1 Y 0 D 0,P(Z) =p] β σu1 U0 µ OLS/Matching on P (Z) E [Y 1 D > 0,P(Z) =p] E [Y 0 D 0,P(Z) =p] β + σ 2 U 1 σu 1,U 0 σu1 U 0 p φ(φ 1 (1 p)) 1 p ³ 1 2p p(1 p) φ Φ 1 (1 p) Note: Φ ( ) and φ ( ) represent the cdf and pdf of a standard normal distribution, respectively. Φ 1 ( ) represents the inverse of Φ ( ). (*):ThemodelinthiscaseisthesameastheonepresentedbelowFigure / 188

153 Continuous instruments Figure 9: treatment parameters and OLS matching as a function of P(Z) = p Figure 7. Treatment Parameters and OLS/Matching as a function of P (Z) =p 5 4 T T (p) A T E ( p ) M T E (p) T U T (p) M a t c h i n g ( p ) p 153 / 188

154 Continuous instruments Another nonmonotonicity example A mixture of two normals: Z P 1 N(µ 1, Σ 1 ) + P 2 N(µ 2, Σ 2 ) P 1 is the proportion in population 1, P 2 is the proportion in population 2 and P 1 + P 2 = / 188

155 Continuous instruments Another nonmonotonicity example Conventional normal outcome selection model generated by the parameters at the base of Figure 11. The discrete choice equation ( is a conventional probit: γz Pr (D = 1 Z = z) = Φ σ V ). The MTE (v), E(Y 1 Y 0 V = v) = µ 1 µ 0 + Cov (U 1 U 0, V ) v. Var (V ) We show results for models with vector Z that satisfies (IV-1) and (IV-2) and with γ > 0 componentwise. 155 / 188

156 Continuous instruments Outcomes Choice Model Y 1 = α + β + U 1 { 1 if D D = > 0 0 if D 0 Y 0 = α + U 0 D = Y 1 Y 0 γz and V = (U 1 U 0 ) (U 1, U 0 ) N (0, Σ), Σ = Parameterization [ ] 1 0.9, α = 0.67, β = / 188

157 Continuous instruments Z = (Z 1, Z 2 ) p 1 N(κ 1, Σ 1 ) + p 2 N(κ 2, Σ 2 ) p 1 = 0.45, p 2 = 0.55 ; Σ 1 = [ Cov(Z 1, γz) = γσ 1 1 = 0.98 ; γ = (0.2, 1.4) ] 157 / 188

158 Continuous instruments Figure 11: marginal treatment effect and IV weights using Z 1 as the instrument when Z = (Z 1, Z 2 ) p 1 N(µ 1, Σ 1 ) + p 2 N(µ 2, Σ 2 ) for different values of Σ 2 Figure 8. Marginal Treatment Effect and IV Weights using Z 1 as the Instrument when Z =(Z 1,Z 2) p 1N(κ 1, Σ 1)+p 2N(κ 2, Σ 2) for different values of Σ A. IV Weights B. MTE (v) Weights MTE 4 ω 1 ω2 3 ω v v Outcomes Choice Model Y 1 = α + β + U 1 D = { 1 if D > 0 0 if D / 188

Lecture 11 Roy model, MTE, PRTE

Lecture 11 Roy model, MTE, PRTE Lecture 11 Roy model, MTE, PRTE Economics 2123 George Washington University Instructor: Prof. Ben Williams Roy Model Motivation The standard textbook example of simultaneity is a supply and demand system

More information

SUPPOSE a policy is proposed for adoption in a country.

SUPPOSE a policy is proposed for adoption in a country. The Review of Economics and Statistics VOL. LXXXVIII AUGUST 2006 NUMBER 3 UNDERSTANDING INSTRUMENTAL VARIABLES IN MODELS WITH ESSENTIAL HETEROGENEITY James J. Heckman, Sergio Urzua, and Edward Vytlacil*

More information

Lecture 8. Roy Model, IV with essential heterogeneity, MTE

Lecture 8. Roy Model, IV with essential heterogeneity, MTE Lecture 8. Roy Model, IV with essential heterogeneity, MTE Economics 2123 George Washington University Instructor: Prof. Ben Williams Heterogeneity When we talk about heterogeneity, usually we mean heterogeneity

More information

Understanding Instrumental Variables in Models with Essential Heterogeneity

Understanding Instrumental Variables in Models with Essential Heterogeneity DISCUSSION PAPER SERIES IZA DP No. 2320 Understanding Instrumental Variables in Models with Essential Heterogeneity James J. Heckman Sergio Urzua Edward Vytlacil September 2006 Forschungsinstitut zur Zukunft

More information

Principles Underlying Evaluation Estimators

Principles Underlying Evaluation Estimators The Principles Underlying Evaluation Estimators James J. University of Chicago Econ 350, Winter 2019 The Basic Principles Underlying the Identification of the Main Econometric Evaluation Estimators Two

More information

Estimating Marginal and Average Returns to Education

Estimating Marginal and Average Returns to Education Estimating Marginal and Average Returns to Education Pedro Carneiro, James Heckman and Edward Vytlacil Econ 345 This draft, February 11, 2007 1 / 167 Abstract This paper estimates marginal and average

More information

Lecture 11/12. Roy Model, MTE, Structural Estimation

Lecture 11/12. Roy Model, MTE, Structural Estimation Lecture 11/12. Roy Model, MTE, Structural Estimation Economics 2123 George Washington University Instructor: Prof. Ben Williams Roy model The Roy model is a model of comparative advantage: Potential earnings

More information

Estimation of Treatment Effects under Essential Heterogeneity

Estimation of Treatment Effects under Essential Heterogeneity Estimation of Treatment Effects under Essential Heterogeneity James Heckman University of Chicago and American Bar Foundation Sergio Urzua University of Chicago Edward Vytlacil Columbia University March

More information

5.5 Yitzhaki Weights as a Version of Theil Weights (1950)

5.5 Yitzhaki Weights as a Version of Theil Weights (1950) 5.5 Yitzhaki Weights as a Version of Theil Weights (195) Another intuition behind the Yitzhaki weights (Yitzhaki, 1989). Sample Size (finite sample) Take Case () =() (propensity score is the instrument).

More information

Policy-Relevant Treatment Effects

Policy-Relevant Treatment Effects Policy-Relevant Treatment Effects By JAMES J. HECKMAN AND EDWARD VYTLACIL* Accounting for individual-level heterogeneity in the response to treatment is a major development in the econometric literature

More information

Haavelmo, Marschak, and Structural Econometrics

Haavelmo, Marschak, and Structural Econometrics Haavelmo, Marschak, and Structural Econometrics James J. Heckman University of Chicago Trygve Haavelmo Centennial Symposium December 13 14, 2011 University of Oslo This draft, October 13, 2011 James J.

More information

The Econometric Evaluation of Policy Design: Part I: Heterogeneity in Program Impacts, Modeling Self-Selection, and Parameters of Interest

The Econometric Evaluation of Policy Design: Part I: Heterogeneity in Program Impacts, Modeling Self-Selection, and Parameters of Interest The Econometric Evaluation of Policy Design: Part I: Heterogeneity in Program Impacts, Modeling Self-Selection, and Parameters of Interest Edward Vytlacil, Yale University Renmin University, Department

More information

NBER WORKING PAPER SERIES STRUCTURAL EQUATIONS, TREATMENT EFFECTS AND ECONOMETRIC POLICY EVALUATION. James J. Heckman Edward Vytlacil

NBER WORKING PAPER SERIES STRUCTURAL EQUATIONS, TREATMENT EFFECTS AND ECONOMETRIC POLICY EVALUATION. James J. Heckman Edward Vytlacil NBER WORKING PAPER SERIES STRUCTURAL EQUATIONS, TREATMENT EFFECTS AND ECONOMETRIC POLICY EVALUATION James J. Heckman Edward Vytlacil Working Paper 11259 http://www.nber.org/papers/w11259 NATIONAL BUREAU

More information

Matching. James J. Heckman Econ 312. This draft, May 15, Intro Match Further MTE Impl Comp Gen. Roy Req Info Info Add Proxies Disc Modal Summ

Matching. James J. Heckman Econ 312. This draft, May 15, Intro Match Further MTE Impl Comp Gen. Roy Req Info Info Add Proxies Disc Modal Summ Matching James J. Heckman Econ 312 This draft, May 15, 2007 1 / 169 Introduction The assumption most commonly made to circumvent problems with randomization is that even though D is not random with respect

More information

STRUCTURAL EQUATIONS, TREATMENT EFFECTS AND ECONOMETRIC POLICY EVALUATION 1. By James J. Heckman and Edward Vytlacil

STRUCTURAL EQUATIONS, TREATMENT EFFECTS AND ECONOMETRIC POLICY EVALUATION 1. By James J. Heckman and Edward Vytlacil STRUCTURAL EQUATIONS, TREATMENT EFFECTS AND ECONOMETRIC POLICY EVALUATION 1 By James J. Heckman and Edward Vytlacil First Draft, August 2; Revised, June 21 Final Version, September 25, 23. 1 This paper

More information

AGEC 661 Note Fourteen

AGEC 661 Note Fourteen AGEC 661 Note Fourteen Ximing Wu 1 Selection bias 1.1 Heckman s two-step model Consider the model in Heckman (1979) Y i = X iβ + ε i, D i = I {Z iγ + η i > 0}. For a random sample from the population,

More information

Econometric Causality

Econometric Causality Econometric (2008) International Statistical Review, 76(1):1-27 James J. Heckman Spencer/INET Conference University of Chicago Econometric The econometric approach to causality develops explicit models

More information

Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models

Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models James J. Heckman and Salvador Navarro The University of Chicago Review of Economics and Statistics 86(1)

More information

Exploring Marginal Treatment Effects

Exploring Marginal Treatment Effects Exploring Marginal Treatment Effects Flexible estimation using Stata Martin Eckhoff Andresen Statistics Norway Oslo, September 12th 2018 Martin Andresen (SSB) Exploring MTEs Oslo, 2018 1 / 25 Introduction

More information

The Econometric Evaluation of Policy Design: Part III: Selection Models and the MTE

The Econometric Evaluation of Policy Design: Part III: Selection Models and the MTE The Econometric Evaluation of Policy Design: Part III: Selection Models and the MTE Edward Vytlacil, Yale University Renmin University June 2017 1 / 96 Lectures primarily drawing upon: Heckman and Vytlacil

More information

Instrumental Variables in Models with Multiple Outcomes: The General Unordered Case

Instrumental Variables in Models with Multiple Outcomes: The General Unordered Case DISCUSSION PAPER SERIES IZA DP No. 3565 Instrumental Variables in Models with Multiple Outcomes: The General Unordered Case James J. Heckman Sergio Urzua Edward Vytlacil June 2008 Forschungsinstitut zur

More information

STRUCTURAL EQUATIONS, TREATMENT EFFECTS, AND ECONOMETRIC POLICY EVALUATION 1

STRUCTURAL EQUATIONS, TREATMENT EFFECTS, AND ECONOMETRIC POLICY EVALUATION 1 Econometrica, Vol. 73, No. 3 (May, 2005), 669 738 STRUCTURAL EQUATIONS, TREATMENT EFFECTS, AND ECONOMETRIC POLICY EVALUATION 1 BY JAMES J. HECKMAN ANDEDWARDVYTLACIL This paper uses the marginal treatment

More information

The relationship between treatment parameters within a latent variable framework

The relationship between treatment parameters within a latent variable framework Economics Letters 66 (2000) 33 39 www.elsevier.com/ locate/ econbase The relationship between treatment parameters within a latent variable framework James J. Heckman *,1, Edward J. Vytlacil 2 Department

More information

Causal Analysis After Haavelmo: Definitions and a Unified Analysis of Identification of Recursive Causal Models

Causal Analysis After Haavelmo: Definitions and a Unified Analysis of Identification of Recursive Causal Models Causal Inference in the Social Sciences University of Michigan December 12, 2012 This draft, December 15, 2012 James Heckman and Rodrigo Pinto Causal Analysis After Haavelmo, December 15, 2012 1 / 157

More information

Generalized Roy Model and Cost-Benefit Analysis of Social Programs 1

Generalized Roy Model and Cost-Benefit Analysis of Social Programs 1 Generalized Roy Model and Cost-Benefit Analysis of Social Programs 1 James J. Heckman The University of Chicago University College Dublin Philipp Eisenhauer University of Mannheim Edward Vytlacil Columbia

More information

Four Parameters of Interest in the Evaluation. of Social Programs. James J. Heckman Justin L. Tobias Edward Vytlacil

Four Parameters of Interest in the Evaluation. of Social Programs. James J. Heckman Justin L. Tobias Edward Vytlacil Four Parameters of Interest in the Evaluation of Social Programs James J. Heckman Justin L. Tobias Edward Vytlacil Nueld College, Oxford, August, 2005 1 1 Introduction This paper uses a latent variable

More information

Testing for Essential Heterogeneity

Testing for Essential Heterogeneity Testing for Essential Heterogeneity James J. Heckman University of Chicago, University College Dublin and the American Bar Foundation Daniel Schmierer University of Chicago Sergio Urzua University of Chicago

More information

Estimating Marginal and Average Returns to Education

Estimating Marginal and Average Returns to Education Estimating Marginal and Average Returns to Education (Author list removed for review process) November 3, 2006 Abstract This paper estimates marginal and average returns to college when returns vary in

More information

Comparing IV with Structural Models: What Simple IV Can and Cannot Identify

Comparing IV with Structural Models: What Simple IV Can and Cannot Identify DISCUSSION PAPER SERIES IZA DP No. 3980 Comparing IV with Structural Models: What Simple IV Can and Cannot Identify James J. Heckman Sergio Urzúa January 2009 Forschungsinstitut zur Zukunft der Arbeit

More information

Estimating Marginal Returns to Education

Estimating Marginal Returns to Education DISCUSSION PAPER SERIES IZA DP No. 5275 Estimating Marginal Returns to Education Pedro Carneiro James J. Heckman Edward Vytlacil October 2010 Forschungsinstitut zur Zukunft der Arbeit Institute for the

More information

Estimating Heterogeneous Treatment Effects in the Presence of Self-Selection: A Propensity Score Perspective*

Estimating Heterogeneous Treatment Effects in the Presence of Self-Selection: A Propensity Score Perspective* Estimating Heterogeneous Treatment Effects in the Presence of Self-Selection: A Propensity Score Perspective* Xiang Zhou and Yu Xie Princeton University 05/15/2016 *Direct all correspondence to Xiang Zhou

More information

Estimating marginal returns to education

Estimating marginal returns to education Estimating marginal returns to education Pedro Carneiro James J. Heckman Edward Vytlacil The Institute for Fiscal Studies Department of Economics, UCL cemmap working paper CWP29/10 Estimating Marginal

More information

Flexible Estimation of Treatment Effect Parameters

Flexible Estimation of Treatment Effect Parameters Flexible Estimation of Treatment Effect Parameters Thomas MaCurdy a and Xiaohong Chen b and Han Hong c Introduction Many empirical studies of program evaluations are complicated by the presence of both

More information

Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models

Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models DISCUSSION PAPER SERIES IZA DP No. 768 Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models James J. Heckman Salvador Navarro-Lozano April 2003 Forschungsinstitut

More information

Heterogeneous Treatment Effects in the Presence of. Self-Selection: A Propensity Score Perspective

Heterogeneous Treatment Effects in the Presence of. Self-Selection: A Propensity Score Perspective Heterogeneous Treatment Effects in the Presence of Self-Selection: A Propensity Score Perspective Xiang Zhou Harvard University Yu Xie Princeton University University of Michigan Population Studies Center,

More information

Part VII. Accounting for the Endogeneity of Schooling. Endogeneity of schooling Mean growth rate of earnings Mean growth rate Selection bias Summary

Part VII. Accounting for the Endogeneity of Schooling. Endogeneity of schooling Mean growth rate of earnings Mean growth rate Selection bias Summary Part VII Accounting for the Endogeneity of Schooling 327 / 785 Much of the CPS-Census literature on the returns to schooling ignores the choice of schooling and its consequences for estimating the rate

More information

Using matching, instrumental variables and control functions to estimate economic choice models

Using matching, instrumental variables and control functions to estimate economic choice models Using matching, instrumental variables and control functions to estimate economic choice models James Heckman Salvador Navarro-Lozano WORKING PAPER 2003:4 The Institute for Labour Market Policy Evaluation

More information

What s New in Econometrics? Lecture 14 Quantile Methods

What s New in Econometrics? Lecture 14 Quantile Methods What s New in Econometrics? Lecture 14 Quantile Methods Jeff Wooldridge NBER Summer Institute, 2007 1. Reminders About Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile Regression

More information

The Problem of Causality in the Analysis of Educational Choices and Labor Market Outcomes Slides for Lectures

The Problem of Causality in the Analysis of Educational Choices and Labor Market Outcomes Slides for Lectures The Problem of Causality in the Analysis of Educational Choices and Labor Market Outcomes Slides for Lectures Andrea Ichino (European University Institute and CEPR) February 28, 2006 Abstract This course

More information

Comparative Advantage and Schooling

Comparative Advantage and Schooling Comparative Advantage and Schooling Pedro Carneiro University College London, Institute for Fiscal Studies and IZA Sokbae Lee University College London and Institute for Fiscal Studies June 7, 2004 Abstract

More information

New Developments in Econometrics Lecture 16: Quantile Estimation

New Developments in Econometrics Lecture 16: Quantile Estimation New Developments in Econometrics Lecture 16: Quantile Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. Review of Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile

More information

Adding Uncertainty to a Roy Economy with Two Sectors

Adding Uncertainty to a Roy Economy with Two Sectors Adding Uncertainty to a Roy Economy with Two Sectors James J. Heckman The University of Chicago Nueld College, Oxford University This draft, August 7, 2005 1 denotes dierent sectors. =0denotes choice of

More information

Marginal Treatment Effects from a Propensity Score Perspective

Marginal Treatment Effects from a Propensity Score Perspective Marginal Treatment Effects from a Propensity Score Perspective Xiang Zhou 1 and Yu Xie 2 1 Harvard University 2 Princeton University January 26, 2018 Abstract We offer a propensity score perspective to

More information

Identification and Extrapolation with Instrumental Variables

Identification and Extrapolation with Instrumental Variables Identification and Extrapolation with Instrumental Variables Magne Mogstad Alexander Torgovitsky September 19, 217 Abstract Instrumental variables (IV) are widely used in economics to address selection

More information

The Generalized Roy Model and Treatment Effects

The Generalized Roy Model and Treatment Effects The Generalized Roy Model and Treatment Effects Christopher Taber University of Wisconsin November 10, 2016 Introduction From Imbens and Angrist we showed that if one runs IV, we get estimates of the Local

More information

Marginal treatment effects

Marginal treatment effects 11 Marginal treatment effects In this chapter, we review policy evaluation and Heckman and Vytlacil s [25, 27a] (HV) strategy for linking marginal treatment effects to other average treatment effects including

More information

Understanding What Instrumental Variables Estimate: Estimating Marginal and Average Returns to Education

Understanding What Instrumental Variables Estimate: Estimating Marginal and Average Returns to Education Understanding What Instrumental Variables Estimate: Estimating Marginal and Average Returns to Education Pedro Carneiro University of Chicago JamesJ.Heckman University of Chicago and The American Bar Foundation

More information

Chilean and High School Dropout Calculations to Testing the Correlated Random Coefficient Model

Chilean and High School Dropout Calculations to Testing the Correlated Random Coefficient Model Chilean and High School Dropout Calculations to Testing the Correlated Random Coefficient Model James J. Heckman University of Chicago, University College Dublin Cowles Foundation, Yale University and

More information

( ) : : (CUHIES2000), Heckman Vytlacil (1999, 2000, 2001),Carneiro, Heckman Vytlacil (2001) Carneiro (2002) Carneiro, Heckman Vytlacil (2001) :

( ) : : (CUHIES2000), Heckman Vytlacil (1999, 2000, 2001),Carneiro, Heckman Vytlacil (2001) Carneiro (2002) Carneiro, Heckman Vytlacil (2001) : 2004 4 ( 100732) ( ) : 2000,,,20 : ;,, 2000 6 43 %( 11 %), 80 90,, : :,,, (), (),, (CUHIES2000), Heckman Vytlacil (1999, 2000, 2001),Carneiro, Heckman Vytlacil (2001) Carneiro (2002) (MTE), Bjorklund Moffitt

More information

Recitation Notes 5. Konrad Menzel. October 13, 2006

Recitation Notes 5. Konrad Menzel. October 13, 2006 ecitation otes 5 Konrad Menzel October 13, 2006 1 Instrumental Variables (continued) 11 Omitted Variables and the Wald Estimator Consider a Wald estimator for the Angrist (1991) approach to estimating

More information

Final Exam. Economics 835: Econometrics. Fall 2010

Final Exam. Economics 835: Econometrics. Fall 2010 Final Exam Economics 835: Econometrics Fall 2010 Please answer the question I ask - no more and no less - and remember that the correct answer is often short and simple. 1 Some short questions a) For each

More information

Econometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous

Econometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous Econometrics of causal inference Throughout, we consider the simplest case of a linear outcome equation, and homogeneous effects: y = βx + ɛ (1) where y is some outcome, x is an explanatory variable, and

More information

A Course in Applied Econometrics. Lecture 5. Instrumental Variables with Treatment Effect. Heterogeneity: Local Average Treatment Effects.

A Course in Applied Econometrics. Lecture 5. Instrumental Variables with Treatment Effect. Heterogeneity: Local Average Treatment Effects. A Course in Applied Econometrics Lecture 5 Outline. Introduction 2. Basics Instrumental Variables with Treatment Effect Heterogeneity: Local Average Treatment Effects 3. Local Average Treatment Effects

More information

Rubin s Potential Outcome Framework. Motivation. Example: Crossfit versus boot camp 2/20/2012. Paul L. Hebert, PhD

Rubin s Potential Outcome Framework. Motivation. Example: Crossfit versus boot camp 2/20/2012. Paul L. Hebert, PhD Rubin s Potential Outcome Framework Paul L. Hebert, PhD Motivation Rubin s potential outcomes model has been the theoretical framework that statisticians/econometricians use to demonstrate under what conditions/assumptions

More information

Additional Material for Estimating the Technology of Cognitive and Noncognitive Skill Formation (Cuttings from the Web Appendix)

Additional Material for Estimating the Technology of Cognitive and Noncognitive Skill Formation (Cuttings from the Web Appendix) Additional Material for Estimating the Technology of Cognitive and Noncognitive Skill Formation (Cuttings from the Web Appendix Flavio Cunha The University of Pennsylvania James Heckman The University

More information

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed

More information

Using Instrumental Variables for Inference about Policy Relevant Treatment Parameters

Using Instrumental Variables for Inference about Policy Relevant Treatment Parameters Using Instrumental Variables for Inference about Policy Relevant Treatment Parameters Magne Mogstad Andres Santos Alexander Torgovitsky May 25, 218 Abstract We propose a method for using instrumental variables

More information

More on Roy Model of Self-Selection

More on Roy Model of Self-Selection V. J. Hotz Rev. May 26, 2007 More on Roy Model of Self-Selection Results drawn on Heckman and Sedlacek JPE, 1985 and Heckman and Honoré, Econometrica, 1986. Two-sector model in which: Agents are income

More information

Propensity-Score-Based Methods versus MTE-Based Methods. in Causal Inference

Propensity-Score-Based Methods versus MTE-Based Methods. in Causal Inference Propensity-Score-Based Methods versus MTE-Based Methods in Causal Inference Xiang Zhou University of Michigan Yu Xie University of Michigan Population Studies Center Research Report 11-747 December 2011

More information

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case Maximilian Kasy Department of Economics, Harvard University 1 / 40 Agenda instrumental variables part I Origins of instrumental

More information

Identifying Effects of Multivalued Treatments

Identifying Effects of Multivalued Treatments Identifying Effects of Multivalued Treatments Sokbae Lee Bernard Salanie The Institute for Fiscal Studies Department of Economics, UCL cemmap working paper CWP72/15 Identifying Effects of Multivalued Treatments

More information

Empirical approaches in public economics

Empirical approaches in public economics Empirical approaches in public economics ECON4624 Empirical Public Economics Fall 2016 Gaute Torsvik Outline for today The canonical problem Basic concepts of causal inference Randomized experiments Non-experimental

More information

4.8 Instrumental Variables

4.8 Instrumental Variables 4.8. INSTRUMENTAL VARIABLES 35 4.8 Instrumental Variables A major complication that is emphasized in microeconometrics is the possibility of inconsistent parameter estimation due to endogenous regressors.

More information

Identifying and Estimating the Distributions of Ex Post. and Ex Ante Returns to Schooling

Identifying and Estimating the Distributions of Ex Post. and Ex Ante Returns to Schooling Identifying and Estimating the Distributions of Ex Post and Ex Ante Returns to Schooling Flavio Cunha University of Chicago James J. Heckman University of Chicago and This draft, May 18, 2007 University

More information

TECHNICAL WORKING PAPER SERIES LOCAL INSTRUMENTAL VARIABLES. James J. Heckman Edward J. Vytlacil

TECHNICAL WORKING PAPER SERIES LOCAL INSTRUMENTAL VARIABLES. James J. Heckman Edward J. Vytlacil TECHNICAL WORKING PAPER SERIES LOCAL INSTRUMENTAL VARIABLES James J. Heckman Edward J. Vytlacil Technical Working Paper 252 http://www.nber.org/papers/t252 NATIONAL BUREAU OF ECONOMIC RESEARCH 15 Massachusetts

More information

The Generalized Roy Model and the Cost-Benefit Analysis of Social Programs

The Generalized Roy Model and the Cost-Benefit Analysis of Social Programs Discussion Paper No. 14-082 The Generalized Roy Model and the Cost-Benefit Analysis of Social Programs Philipp Eisenhauer, James J. Heckman, and Edward Vytlacil Discussion Paper No. 14-082 The Generalized

More information

ECO Class 6 Nonparametric Econometrics

ECO Class 6 Nonparametric Econometrics ECO 523 - Class 6 Nonparametric Econometrics Carolina Caetano Contents 1 Nonparametric instrumental variable regression 1 2 Nonparametric Estimation of Average Treatment Effects 3 2.1 Asymptotic results................................

More information

150C Causal Inference

150C Causal Inference 150C Causal Inference Instrumental Variables: Modern Perspective with Heterogeneous Treatment Effects Jonathan Mummolo May 22, 2017 Jonathan Mummolo 150C Causal Inference May 22, 2017 1 / 26 Two Views

More information

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008 A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. Linear-in-Parameters Models: IV versus Control Functions 2. Correlated

More information

EVALUATING EDUCATION POLICIES WHEN THE RATE OF RETURN VARIES ACROSS INDIVIDUALS PEDRO CARNEIRO * Take the standard model of potential outcomes:

EVALUATING EDUCATION POLICIES WHEN THE RATE OF RETURN VARIES ACROSS INDIVIDUALS PEDRO CARNEIRO * Take the standard model of potential outcomes: 516 Cuadernos de Economía, CUADERNO Año 40, Nº DE 121, ECONOMIA pp. 516-529 (Vol. (diciembre 40, Nº 121, 2003) diciembre 2003) EVALUATING EDUCATION POLICIE WHEN THE RATE OF RETURN VARIE ACRO INDIVIDUAL

More information

Generated Covariates in Nonparametric Estimation: A Short Review.

Generated Covariates in Nonparametric Estimation: A Short Review. Generated Covariates in Nonparametric Estimation: A Short Review. Enno Mammen, Christoph Rothe, and Melanie Schienle Abstract In many applications, covariates are not observed but have to be estimated

More information

Imbens, Lecture Notes 2, Local Average Treatment Effects, IEN, Miami, Oct 10 1

Imbens, Lecture Notes 2, Local Average Treatment Effects, IEN, Miami, Oct 10 1 Imbens, Lecture Notes 2, Local Average Treatment Effects, IEN, Miami, Oct 10 1 Lectures on Evaluation Methods Guido Imbens Impact Evaluation Network October 2010, Miami Methods for Estimating Treatment

More information

ECONOMETRICS II (ECO 2401) Victor Aguirregabiria. Spring 2018 TOPIC 4: INTRODUCTION TO THE EVALUATION OF TREATMENT EFFECTS

ECONOMETRICS II (ECO 2401) Victor Aguirregabiria. Spring 2018 TOPIC 4: INTRODUCTION TO THE EVALUATION OF TREATMENT EFFECTS ECONOMETRICS II (ECO 2401) Victor Aguirregabiria Spring 2018 TOPIC 4: INTRODUCTION TO THE EVALUATION OF TREATMENT EFFECTS 1. Introduction and Notation 2. Randomized treatment 3. Conditional independence

More information

NBER WORKING PAPER SERIES USING INSTRUMENTAL VARIABLES FOR INFERENCE ABOUT POLICY RELEVANT TREATMENT EFFECTS

NBER WORKING PAPER SERIES USING INSTRUMENTAL VARIABLES FOR INFERENCE ABOUT POLICY RELEVANT TREATMENT EFFECTS NBER WORKING PAPER SERIES USING INSTRUMENTAL VARIABLES FOR INFERENCE ABOUT POLICY RELEVANT TREATMENT EFFECTS Magne Mogstad Andres Santos Alexander Torgovitsky Working Paper 23568 http://www.nber.org/papers/w23568

More information

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) 1 EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) Taisuke Otsu London School of Economics Summer 2018 A.1. Summation operator (Wooldridge, App. A.1) 2 3 Summation operator For

More information

Using Instrumental Variables for Inference about Policy Relevant Treatment Parameters

Using Instrumental Variables for Inference about Policy Relevant Treatment Parameters Using Instrumental Variables for Inference about Policy Relevant Treatment Parameters Magne Mogstad Andres Santos Alexander Torgovitsky July 22, 2016 Abstract We propose a method for using instrumental

More information

Applied Health Economics (for B.Sc.)

Applied Health Economics (for B.Sc.) Applied Health Economics (for B.Sc.) Helmut Farbmacher Department of Economics University of Mannheim Autumn Semester 2017 Outlook 1 Linear models (OLS, Omitted variables, 2SLS) 2 Limited and qualitative

More information

A Note on Adapting Propensity Score Matching and Selection Models to Choice Based Samples

A Note on Adapting Propensity Score Matching and Selection Models to Choice Based Samples DISCUSSION PAPER SERIES IZA DP No. 4304 A Note on Adapting Propensity Score Matching and Selection Models to Choice Based Samples James J. Heckman Petra E. Todd July 2009 Forschungsinstitut zur Zukunft

More information

Instrumental Variables

Instrumental Variables Instrumental Variables Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Noncompliance in Experiments Stat186/Gov2002 Fall 2018 1 / 18 Instrumental Variables

More information

Tables and Figures. This draft, July 2, 2007

Tables and Figures. This draft, July 2, 2007 and Figures This draft, July 2, 2007 1 / 16 Figures 2 / 16 Figure 1: Density of Estimated Propensity Score Pr(D=1) % 50 40 Treated Group Untreated Group 30 f (P) 20 10 0.01~.10.11~.20.21~.30.31~.40.41~.50.51~.60.61~.70.71~.80.81~.90.91~.99

More information

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. The Sharp RD Design 3.

More information

Dealing With Endogeneity

Dealing With Endogeneity Dealing With Endogeneity Junhui Qian December 22, 2014 Outline Introduction Instrumental Variable Instrumental Variable Estimation Two-Stage Least Square Estimation Panel Data Endogeneity in Econometrics

More information

Market and Nonmarket Benefits

Market and Nonmarket Benefits of Education John Eric Humphries The University of Chicago James Heckman The University of Chicago Greg Veramendi Arizona State University A Celebration of the Life and Work of Gary S. Becker University

More information

Fuzzy Differences-in-Differences

Fuzzy Differences-in-Differences Review of Economic Studies (2017) 01, 1 30 0034-6527/17/00000001$02.00 c 2017 The Review of Economic Studies Limited Fuzzy Differences-in-Differences C. DE CHAISEMARTIN University of California at Santa

More information

Binary Models with Endogenous Explanatory Variables

Binary Models with Endogenous Explanatory Variables Binary Models with Endogenous Explanatory Variables Class otes Manuel Arellano ovember 7, 2007 Revised: January 21, 2008 1 Introduction In Part I we considered linear and non-linear models with additive

More information

Controlling for Time Invariant Heterogeneity

Controlling for Time Invariant Heterogeneity Controlling for Time Invariant Heterogeneity Yona Rubinstein July 2016 Yona Rubinstein (LSE) Controlling for Time Invariant Heterogeneity 07/16 1 / 19 Observables and Unobservables Confounding Factors

More information

Methods to Estimate Causal Effects Theory and Applications. Prof. Dr. Sascha O. Becker U Stirling, Ifo, CESifo and IZA

Methods to Estimate Causal Effects Theory and Applications. Prof. Dr. Sascha O. Becker U Stirling, Ifo, CESifo and IZA Methods to Estimate Causal Effects Theory and Applications Prof. Dr. Sascha O. Becker U Stirling, Ifo, CESifo and IZA last update: 21 August 2009 Preliminaries Address Prof. Dr. Sascha O. Becker Stirling

More information

The problem of causality in microeconometrics.

The problem of causality in microeconometrics. The problem of causality in microeconometrics. Andrea Ichino European University Institute April 15, 2014 Contents 1 The Problem of Causality 1 1.1 A formal framework to think about causality....................................

More information

Econometric Causality

Econometric Causality Econometric Causality James J. Heckman The Institute for Fiscal Studies Department of Economics, UCL cemmap working paper CWP1/08 Econometric Causality James J. Heckman February 4, 2008 Summary This paper

More information

Recitation Notes 6. Konrad Menzel. October 22, 2006

Recitation Notes 6. Konrad Menzel. October 22, 2006 Recitation Notes 6 Konrad Menzel October, 006 Random Coefficient Models. Motivation In the empirical literature on education and earnings, the main object of interest is the human capital earnings function

More information

Introduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015

Introduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015 Introduction to causal identification Nidhiya Menon IGC Summer School, New Delhi, July 2015 Outline 1. Micro-empirical methods 2. Rubin causal model 3. More on Instrumental Variables (IV) Estimating causal

More information

Two-way fixed effects estimators with heterogeneous treatment effects

Two-way fixed effects estimators with heterogeneous treatment effects Two-way fixed effects estimators with heterogeneous treatment effects Clément de Chaisemartin Xavier D Haultfœuille April 26, 2018 Abstract Around 20% of all empirical articles published by the American

More information

Prediction and causal inference, in a nutshell

Prediction and causal inference, in a nutshell Prediction and causal inference, in a nutshell 1 Prediction (Source: Amemiya, ch. 4) Best Linear Predictor: a motivation for linear univariate regression Consider two random variables X and Y. What is

More information

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16) Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16) 1 2 Model Consider a system of two regressions y 1 = β 1 y 2 + u 1 (1) y 2 = β 2 y 1 + u 2 (2) This is a simultaneous equation model

More information

Econ 2148, fall 2017 Instrumental variables II, continuous treatment

Econ 2148, fall 2017 Instrumental variables II, continuous treatment Econ 2148, fall 2017 Instrumental variables II, continuous treatment Maximilian Kasy Department of Economics, Harvard University 1 / 35 Recall instrumental variables part I Origins of instrumental variables:

More information

Potential Outcomes Model (POM)

Potential Outcomes Model (POM) Potential Outcomes Model (POM) Relationship Between Counterfactual States Causality Empirical Strategies in Labor Economics, Angrist Krueger (1999): The most challenging empirical questions in economics

More information

Chapter 60 Evaluating Social Programs with Endogenous Program Placement and Selection of the Treated

Chapter 60 Evaluating Social Programs with Endogenous Program Placement and Selection of the Treated See discussions, stats, and author profiles for this publication at: http://www.researchgate.net/publication/222400893 Chapter 60 Evaluating Social Programs with Endogenous Program Placement and Selection

More information

Dynamic Treatment Effects

Dynamic Treatment Effects Dynamic Treatment Effects James J. Heckman University of Chicago and the American Bar Foundation John Eric Humphries University of Chicago Gregory Veramendi Arizona State University July 21, 2015 James

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University Joint

More information

1 Motivation for Instrumental Variable (IV) Regression

1 Motivation for Instrumental Variable (IV) Regression ECON 370: IV & 2SLS 1 Instrumental Variables Estimation and Two Stage Least Squares Econometric Methods, ECON 370 Let s get back to the thiking in terms of cross sectional (or pooled cross sectional) data

More information