Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models

Size: px
Start display at page:

Download "Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models"

Transcription

1 Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models James J. Heckman and Salvador Navarro The University of Chicago Review of Economics and Statistics 86(1) (2004) This draft for Nu eld College, Oxford August 7,

2 1 Introduction 1. We orient the discussion of the selection of alternative estimators around the economic theory of choice. We compare the di erent roles that the propensity score plays in three widely used econometric methods, and the implicit economic assumptions that underlie applications of these methods. 2

3 2. Conventional matching methods do not distinguish between excluded and included variables.we show that matching breaks down when there are variables that predict the choice of treatment perfectly whereas control function methods take advantage of exclusion restrictions and use the information available from perfect prediction to obtain identification. Matching assumes away the possibility of perfect prediction while selection models rely on this property in limit sets. 3

4 3. We define the concepts of relevant information and minimal relevant information, and distinguish agent and analyst information sets. We state clearly what information is required to identify di erent treatment parameters. In particular we show that when the analyst does not have access to the minimal relevant information, matching estimators of di erent treatment parameters are biased. 4

5 Having more information, but not all minimal relevant information, can increase bias compared to having less information. Enlarging the analyst s information set with variables that do not belong in the relevant information set may either increase or decrease bias from matching. Because the method of control functions explicitly models omitted relevant variables, rather than assuming that there are none, it is more robust to omitted conditioning variables. 5

6 4. The method of matching o ers no guidance as to which variables to include or exclude in conditioning sets. Such choices can greatly a ect inference. There is no support for the commonly used rules of selecting matching variables by choosing the set of variables that maximizes the probability of successful prediction into treatment or by including variables in conditioning sets that are statistically significant in choice equations. This weakness is shared by many econometric procedures but is not fully appreciated in recent applications of matching which apply these selection rules when choosing conditioning sets. 6

7 2 A Prototypical Model of Economic Choice Two potential outcomes ( 0 1 ). =1if 1 is selected. =0if 0 is selected. Let be utility. = ( ) =1( 0) (1) : observed factors determining choices, unobserved 1 = 1 ( 1 ) 0 = 0 ( 0 ) (2a) (2b) 0 1 are (absolutely) continuous 7

8 = 1 0 Additively Separable Case: For Familiarity. Not essential. = ( )+ ( )=0 (1 0 ) 1 = 1 ( )+ 1 ( 1 )=0 (2 0 ) 0 = 0 ( )+ 0 ( 0 )=0 (2 0 ) 8

9 3 Parameters Discussed Today : ( 1 0 ) (Average Treatment E ect) : ( 1 0 =1)(Treatment on the Treated) : ( 1 0 =0)(Marginal Treatment E ect) These are familiar but by no means the only parameters we could consider From MTE, can identify many other parameters (Recall IV Lectures) 9

10 4 The Selection Problem Samples generated by choices: Data: ( =1)= ( 1 =1) ( =0)= ( 0 =0) Pr ( =1 ) ( 1 =1) and ( 0 =0) From raw means, we get biases. Can form ( 1 =1) ( 0 =0) 10

11 In General This Produces BIASES : Bias = [ ( =1) ( =0)] ( 1 0 =1) = [ ( 0 =1) ( 0 =0)] Under Additive Separability Bias = ( 0 =1) ( 0 =0) : Bias = ( =1) ( =0) ( 1 0 ) 11

12 Under Additive Separability Bias = [ ( 1 =1) ( 1 )] [ ( 0 =0) ( 0 )] : Bias = ( =1) ( =0) ( 1 0 =0) Under Additive Separability Bias = ( 1 =1) ( 1 =0) [ ( 0 =0) ( 0 = 0)] 12

13 5 How Di erent Methods Solve the Bias Problem 5.1 Matching =( ) ( 1 0 ) " denotes independence given (M-1) 0 Pr( =1 )= ( ) 1 (M-2) Rosenbaum and Rubin (1983) show (M-1) and (M-2) imply ( 1 0 ) ( ) (M-3) 13

14 ( 1 =0 ( )) = ( 1 =1 ( )) = ( 1 ( )) ( 0 =1 ( )) = ( 0 =0 ( )) = ( 0 ( )) Dependence between and ( 1 0 ) is eliminated by conditioning on : ( 1 0 ) Selection on Observables If ( )=1or ( )=0 method breaks down for those values. 14

15 Extensions (Heckman, Ichimura, Smith and Todd) Distinction between and Introducing allows one to solve the breakdown problem arising from ( ) =1or ( ) =0 Thus if outcomes are defined in terms of and ( ) = ( ) If we can find another value 0 such that Pr( 0 ) 6= 1 can match using this (IV assumption) 15

16 Require only weaker mean independence assumptions ( 1 =1) = ( 1 ) ( 0 =0) = ( 0 ) CanbeusedforMeans. 16

17 Matching is for free (Gill and Robins (2001)): ( 0 =1 ) is not observed. Can just as well replace it by ( 0 =1 )= ( 0 =0 ) However, the implied economic restrictions are not for free. Imposes that, conditional on and, the marginal person is thesameastheaverageperson. ThisisthesameasaflatMTE( ) in (MTE does not depend on ) 17

18 5.2 Control Functions Additively Separable Case We observe left-hand sides of ( 1 =1)= 1 ( )+ ( 1 =1) ( 0 =0)= 0 ( )+ ( 0 =0) If ( 1 ) ( 1 =1)= ( 1 ( ) )= 1 ( ( )) If ( 1 ) ( 0 =0)= ( 0 ( ) )= 0 ( ( )) 18

19 So, key assumption ( 1 0 ) ( ) Under this condition ( 1 =1)= 1 ( )+ 1 ( ( )) ( 0 =0)= 0 ( )+ 0 ( ( )) Need Limit Set Results lim 1 1 ( )=0and lim 0 0 ( )=0 19

20 If there are limit sets Z 0 and Z 1 such that lim ( ) = Z 0 0 and lim ( ) =1, then we can identify the constants. Z 1 There are semiparametric versions of these estimators. Use polynomials in ; Local Linear Regression in 20

21 From this model can obviously identify = 1 ( ) 0 ( ) (Aswehaveseen) Plus, = 1 ( ) 0 ( )+ ( 1 0 =1) = 1 ( ) 0 ( )+ 1 ( ( )) μ ( ( )) 21

22 = 1 ( ) 0 ( ) + [ ( 1 0 =1) ( )] ( ) = 1 ( ) 0 ( ) + ( ) 1 ( ( )+ 1 0 ( ( ) ª ( ) Marginal and Average are allowed to be di erent. 22

23 Both Matching and Control functions are defined only over Support ( =1) Support ( =0) Method of control functions does not require ( 0 1 ) ( ) But Matching does. 23

24 Matching is a special case of control functions in the additively separable case. Additive separability and control functions assumptions are central to this claim. ( 1 =1) = ( 1 ) = ( 1 ( )) ( 0 =0) = ( 0 ) = ( 0 ( )) If then 1 ( )= ( 1 ) and 0 ( )= ( 0 ) ( 1 ( )) = 0 and ( 0 ( )) = 0 24

25 In the method of control functions, If ( ) ( 0 1 ) ( ) = ( 1 =1) + ( 0 =0)(1 ) = 0 ( )+( 1 ( ) 0 ( )) + ( 1 =1) + ( 0 ( ) =0)(1 ) = 0 ( )+( 1 ( ) 0 ( )) + ( 1 ( ) =1) + ( 0 ( ) =0)(1 ) = 0 ( ) +[ 1 ( ) 0 ( )+ 1 ( ( )) 0 ( ( ))] + 0 ( ( )) 25

26 To identify must isolate it from and 1 ( ) 0 ( ) 1 ( ( ) 0 ( ( ) 26

27 Under matching assumptions, ( ( ) ) = 0 ( ( )) + ( 1 ( ( )) 0 ( ( ))) +[ ( 1 ( )) ( 0 ( ))] + ( 0 ( )) ( ( ) )= 0 ( ( )) + ( 1 ( ( )) 0 ( ( ))) since ( 1 ( )) = ( 0 ( )) = 0 27

28 We can get identification without exclusion, conditional on ( 1 0 ) 28

29 Tables 1 and 2 present sensitivity analysis for the case of so ( 1 0 ) 0 (0 ) ( ) = ( ) = 2 ; = {0 1} Bias ( ( ) = ) = 0 0 ( ) Bias ( ( ) = ) = ( )[ 1 1 (1 )+ 0 0 ] where ( ) = ( 1 (1 )) (1 ) 29

30 Even if the correlation between the observables and the unobservables is small, so that one might think that selection on unobservables is relatively unimportant, we still get substantial biases if we do not control for relevant omitted conditioning variables. These examples also demonstrate that sensitivity analyses can be conducted for control function models even when they are not fully identified. 30

31 31

32 32

33 5.3 Instrumental Variables Matching and Method of control functions work with ( ) and Pr ( =1 ). = 1 +(1 ) 0 = 0 ( )+( 1 ( ) 0 ( )+ 1 0 ) + 0 = 0 ( )+ ( ) + 0 If 1 = 0 ( 0 ( ) )= ( 0 ) (IV-1) Pr ( =1 ) is a nontrivial function of for each (IV-2) 33

34 When 1 6= 0 but ( 1 0 ) or alternatively ( 1 0 ) we have = ( 1 0 ) = ( ( ) ) = ( 1 0 =1)= ( 1 0 ) = 34

35 Analytically More Interesting Case 1 6= 0 and 6 ( 1 0 ) For : ( 0 + ( 1 0 ) ( ) )= ( 0 + ( 1 0 ) ) (IV-3) For : ( 0 + ( 1 0 ) ( 0 + ( 1 0 ) ) ( ) ) = ( 0 + ( 1 0 ) ( 0 + ( 1 0 ) ) ) For we can rewrite: ( 0 ( ) )+ ( 1 0 =1 ( ) ) ( ) = ( 0 )+ ( 1 0 =1 ) ( ) Conditions Satisfied If 1 = 0,or( 1 0 ) ( ) 35

36 1. Method of Control Functions Models Dependence between ( 1 0 ) and 2. Matching assumes ( 1 0 ) 3. Linear IV requires that 0 + ( 1 0 ) be mean independent of (or ( )) givenx. 36

37 Local Instrumental Variables (LIV) require that ( ) be a non-degenerate random variable given (existence of an exclusion restriction) (LIV-1) ( 0 1 ) (LIV-2) 0 Pr ( ) 1 (LIV-3) Support ( ( )) = [0 1] (LIV-4) Under these conditions, ( ( )) ( ( )) = ( ( ) =0) 37

38 Proof: ( ( )) = ( 1 =1 ( )) ( ) = + ( 0 =0 ( )) (1 ( )) Z + where = ( ) Thus ( ( )) ( ) Z ( ) Z Z ( ) 1 ( 1 ) 1 0 ( 0 ) 0 = ( 1 0 = ( )) = 38

39 39

40 6 The bias from Matching: Information requirements Fundamental Problem: Information of the Analyst often less than that of the Agent. Definition 1 We say that ( ) is a relevant information set if its associated random variable,, satisfies (M-1) so ( 1 0 ) 40

41 Definition 2 We say that ( ) is a minimal relevant information set if it is the intersection of all sets ( ) and ( 1 0 ). The associated random variable is the minimum amount of information that guarantees that (M-1) is satisfied. Intersection may be empty. May not be a unique minimal information set. Definition 3 The agent s information set, ( ),isdefined by the information used by the agent when choosing among treatments. Accordingly, we call the agent s information. 41

42 Definition 4 The econometrician s full information set, ( ), is defined by all the information available to the econometrician, Definition 5 The econometrician s information set, ( ) is defined by the information used by the econometrician when analyzing the agent s choice of treatment,. 42

43 Obvious Inclusions: ( ) ( ) ( ) ( ) ( ) ( ) and This assumes exists. Matching implies ( ) ( ) 43

44 Generalized Roy Examples (Assume Factor Structure for error terms) = + = = 1 if 0 =0otherwise 1 = = = = ( ) mean zero random variables, mutually independent of each other and 44

45 The minimal relevant information set when factor loadings are not zero: = { 1 2 } Agent information sets may include di erent variables. If shocks to the outcomes not known, but other terms are: = { 1 2 } Under perfect certainty, = { }. (ThisistheoriginalRoymodel) Construct examples using: ( ) (0 ) ( ) =

46 6.1 The economist uses the minimal relevant information: ( ) ( ) Suppose = { 1 2 } ( 1 =1 ) ( 0 =0 ) = 1 0 +( ) 1 +( ) 2 Knowledge of ( 1 2 ) and knowledge of ( 1 2 ) equivalent μ ( ) = Pr μ = 1 = 46

47 ( 1 =1 ( )= ) ( 0 =0 ( )= ) = ( 1 =1 ( )= ) ( 0 =0 ( )= ) μ = (1 ) μ 0 1 (1 ) =

48 This is equal to all the treatment parameters (MTE=ATE=LATE=TT) μ 1 1 (1 ) μ 0 1 (1 ) = ( 1 ) 1 ( ) = ( 0 ) 0 ( ) where 1 ( )= ( 1 (1 )) because and 0 ( )= ( 1 (1 )) 1 ( )= ( )=0 48 =0 1

49 6.2 The Economist does not use all of the Minimal Relevant Information: = { } q S 1 (1 ) Bias ( ( ) = ) = 0 ( ) ( )= 1 ( ) 0 ( ) Bias ( ( ) = ) = ( )[ 1 (1 )+ 0 ] 49

50 where Bias ( ( ) = ) = ( )[ 1 (1 )+ 0 ] ( ) = ( 1 (1 )) (1 ) 1 (1 )[ 1 0 ] 1 = q = q

51 6.3 Adding information to the econometrician s information set : using Some, but not all, Information from the Minimal Relevant Information Set 0 = { 2 } May raise or lower the bias. Comparable to 1 and 0 above, we can define 0 1 = 0 0 = q q

52 Condition 1 The bias produced by using matching to estimate is smaller in absolute value for any given when the new information set ( 0 ) is used if Condition 2 The bias produced by using matching to estimate is smaller in absolute value for any given when the new information set ( 0 ) is used if 1 (1 ) (1 )

53 Condition 3 The bias produced by using matching to estimate is smaller in absolute value for any given when the new information set ( ) 0 is used if ( )[ 1 (1 )+ 0 ] 1 (1 )[ 1 0 ] ( )[ 0 1 (1 )+ 0 0 ] 1 (1 )[ ] 53

54 Proof of Condition 1 Suppose ³ ( 2 2) 0 = When q {z } when 2 is in information set ³ 02 2 = q {z } when 1 is not = 0 0 ³ 0 = q

55 Thereissomecriticalvalue 02 beyond which Assume 01 = 1 = 2 =1 02 = 12 =1 11 = 2 55

56 Figure 1.--Bias for Treatment on the Treated Special case:adding relevant information f2 increases the bias Matching on P(Z) Matching on P(Z,f2) Bias Average Bias = Average Bias = P 56

57 Figure 2.--Bias for Average Treatment Effect Special case: Adding relevant information f2 increases the bias 4 Matching on P(Z) Matching on P(Z,f2) Bias Average Bias = Average Bias = P 57

58 Figure 3.--Bias for Marginal Treatment Effect Special case: Adding relevant information f2 increases the bias 4 Matching on P(Z) Matching on P(Z,f2) Bias Average Bias = Average Bias = P 58

59 Control Function Method Models the Bias In control function method, adding 2 we simply change the control function. We go from to 1 ( ( ) = ) = 1 1 ( ) 0 ( ( ) = ) = 0 0 ( ) 1 0 ( ( 2 )= ) = ( ) 0 0 ( ( 2 )= ) = ( ) where 1 ( ) = ( 1 (1 )) and 0 ( ) = ( 1 (1 )) 1 This protects us against misspecification. 59

60 6.4 Adding information to the econometrician s information set: using proxies for the relevant information Suppose we do not know 2,justproxyitby e e = n o e Suppose = e Suppose further that e 0 2 ³ e 2 = and e ( ) 60

61 Expressions corresponding to 0 and 1 : e 1 = (1 2 ) 2 2 q (1 2 )+ 2 e 0 = (1 2 ) 2 2 q (1 2 )+ 2 61

62 By substituting 0 for e and 0 for e ( =0 1) into Conditions (1), (2) and (3) we obtain equivalent results for this case. Whether e will be bias reducing depends on how well it spans and the signs of the terms in the absolute values. In this case, there is another parameter ( =0). 62

63 The bias generated when the econometrician s information is e can also be smaller than when it is 0.Itcan be the case that knowing variable e is than knowing the actual variable 2. Take treatment on the treated as the parameter. The bias is reduced when e is used instead of 2 if (1 2 ) 2 q (1 2 )+ 2 q

64 Figure 4.--Bias for Treatment on the Treated ~ ~ Special case: Adding irrelevant information Z increases the bias correlation(z,f2)= Matching on P(Z) ~ Matching on P(Z,Z) Bias Average Bias = Average Bias = P 64

65 Figure 5.--Bias for Average Treatment Effect ~ ~ Special case: Adding irrelevant information Z increases the bias correlation(z,f2)= Matching on P(Z) ~ Matching on P(Z,Z) Bias Average Bias = Average Bias = P 65

66 3.2 3 Figure 6.--Bias for Marginal Treatment Effect ~ ~ Special case: Adding irrelevant information Z increases the bias correlation(z,f2)=0.5 Matching on P(Z) ~ Matching on P(Z,Z) Bias Average Bias = Average Bias = P 66

67 Figure 7.--Bias for Treatment on the Treated ~ Using proxy Z for f2 increases the bias ~ correlation (Z,f2)= Matching on P(Z,f2) ~ Matching on P(Z,Z) Bias Average Bias = Average Bias = P 67

68 4 Figure 8.--Bias for the Average Treatment Effect ~ Using proxy Z for f2 increases the bias ~ correlation (Z,f2)=0.5 Matching on P(Z,f2) ~ Matching on P(Z,Z) Bias Average Bias = Average Bias = P 68

69 Figure 9.--Bias for the Marginal Treatment Effect ~ Using proxy Z for f2 increases the bias ~ correlation (Z,f2)=0.5 Matching on P(Z,f2) ~ Matching on P(Z,Z) Bias Average Bias = Average Bias = P 69

70 6.5 The case of a discrete treatment = + = , =0 1 = 1 if 0 =0otherwise, People receive treatment according to the rule = + = = 1 if 0 =0otherwise; ( 1 2 ) ( 0 1 ) 70

71 The e ect of treatment is given by: 1 ( )= Pr ( 1 =1 =1 ) Pr ( 0 =1 =1 ) A second definition works with odds ratios: 2 ( )= Pr( 1 =1 =1 ) Pr( 1 =0 =1 ) Pr( 0 =1 =1 ) Pr( 0 =0 =1 ) 71

72 Onecouldalsoworkwithlog Under the null hypothesis of no e ect of treatment 1 = 2 =1 b 1 ( )= Pr ( 1 =1 =1 ) Pr ( 0 =1 =0 ) The denominator replaces the desired probability Pr ( 0 =1 =1 ) by Pr ( 0 =1 =0 ) 72

73 Under the Null Hypothesis of no real e ect of treatment 1 = 0 = 1 = 0 = can be generated by 11 = 01 = 1 12 = 02 = 2 1 = 0 = 73

74 Assume initially that = { 1 2 } b 1 ( )= Pr ( 1 =1 =1 1 2 ) Pr ( 0 =1 =0 1 2 ) In general: 1 ( )= Pr ( 1 =1 =1 ) Pr ( 0 =1 =1 ) = Pr ( 1 =1 1 2 ) Pr ( 0 =1 1 2 ) = 1( ) 6= Pr ( 1 =1 =1 ) Pr ( 0 =1 =0 ) = b 1 ( ) 74

75 16 14 Figure 10.--Estimated Effect of Treatment under Different Information Sets No Effect of Treatment and v2 =1 ^ 1 I E ={f 1,f 2 } I E ={f 2 } < f 2 =1(V>0) 75

76 6 Figure 11.--Estimated Effect of Treatment under Different Information Sets No Effect of Treatment and v2 =-1 ^ 1 I E ={f 1,f 2 } I E ={f 2 } 5 4 < f2 76

77 14 Figure 12.--Estimated Effect of Treatment under Different Information Sets No Effect of Treatment and v2 =1 ^ 2 I E ={f 1,f 2 } I E ={f 2 } < f 2 77

78 25 Figure 13.--Estimated Effect of Treatment under Different Information Sets No Effect of Treatment and v2 =-1 ^ 2 I E ={f 1,f 2 } I E ={f 2 } < f 2 78

79 3 2.5 Figure 14.--Estimated Effects of Treatment under Different Information Sets No Effect of Treatment and v2 =1 ^ 3 IE={f1,f2} I E ={f 2 } < f 2 79

80 2 Figure 15.--Estimated Effects of Treatment under Different Information Sets No Effect of Treatment and v2 =-1 ^ 3 I E ={f 1,f 2 } I E ={f 2 } 1 0 < f 2 80

81 3 Figure 16.--Estimated Effect of Treatment under Different Infomation Sets No Effect of Treatment and v2 =1 4 ^ I E ={f 1,f 2 } I E ={f 2 } < f

82 3.5 3 Figure 17.--Estimated Effects of Treatment under Different Information Sets No Effect of Treatment and v2 =-1 ^ 4 I E ={f 1,f 2 } I E ={f 2 } < f 2 82

83 6.6 On the use of model selection criteria to choose matching variables Adding more variables to the Information Set may increase Bias. How to choose the relevant variables? Standard methods on model selection criteria fail. An implicit assumption underlying such procedures is that the added conditioning variables are exogenous in the following sense ( 0 1 ) (M-4) ( is the list of initial variables used as conditioning variables.) 83

84 Sometimes procedures suggested Add variables when ratios big in propensity score Improve Fit Such procedures can raise the bias. Consider the following example: where e e = { } = ( ) might be an elicitation from a questionaire. 84

85 Same expressions for the biases using e ( =0 1) instead of where: e 1 = q e 0 = q = q Bias can be in- In general, these expressions are not zero. creased or decreased. 85

86 When 2 =0we can perfectly predict. (Will pass a goodnessof-fit criterion) Thus for It follows that lim Pr ( =1 + ) 0 = 1 for Pr ( =1 + ) = 0 for lim 0 Assumption (M-2) is violated and matching breaks down Making 2 arbitrarily small, we can predict arbitrarily well. Can improve over the fit with ( 1 2 ) in the set which produces no bias. 86

87 87

88 A More General Example Considers use of a proxy regressor = ( 1 2 ); ( 1 2 ) has mean zero 1 2, and ( 1 2 ) ( ); in the latent variable gener- possibly dependent on ating the treatment choice is measurement error. 88

89 For di erent levels of dependence between and,and di erent weights on 1 2 and on the scale of measurement error, can be a better predictor of than 1 2 or even 1 2. However, in general, ( 1 0 ) Á because is an imperfect proxy for the combinations of 1 and 2 entering 1 and 0. Thus conditioning on can produce a better fit for but greater bias for the treatment parameters. 89

90 Consider the following example where is an outcome and is an index = + = + where ( ) Obviously. 90

91 Suppose instead that we have a candidate conditioning variable = + +. Suppose that all variables are normal with zero mean and are mutually independent. Then we may write where = = It is assumed that is independent of all other error components on the right-hand sides of the equations for and. 91

92 From normal regression theory we know that conditioning is equivalent to residualizing. Constructing the residuals we obtain = (1 )+ ( + ) By a parallel argument = (1 )+ ( + ) requires that and be uncorrelated, which in general does not happen. Letting the dependence between and get large, and setting to suitable values, we can predict better (in the sense of 2 )with than with. 92

93 Letting =1( 0) produces a simple version of the example because better prediction of produces better prediction of. 93

94 Appendix Consider a general model of the form: 1 = = = ( )+ = 1 if 0 =0otherwise = 1 +(1 ) 0 94

95 where ( 1 0 ) 0 (0 ) ( ) = 2 ( ) = =0; =1 ( 1 ) = 1 ( 0 ) = 0 Let ( ) and ( ) be the pdf and the cdf of a standard normal random variable. Then, the propensity score for this model is given by: Pr ( 0 ( )) = ( ( )) = Pr ( ( )) = μ ( ) = 1 = 95

96 so Since the event ( ) = 1 (1 ) ³ S 0 ( ( )) = can be written as S ( ) S 1 (1 ) we can write the conditional expectations required to get the biases as a function of. 96

97 For 1 : ( 1 0 ( ( )) = ) = μ 1 = 1 ( ) μ ( ( )) = 1 (1 ) = 1 1 ( ) 97

98 ( 1 =0 ( ( )) = ) = 1 = 1 μ μ = ( ) ( ( )) = = 1 (1 ) ( ( )) = = 1 1 (1 ) where Similarly for 0 : 1 = 1 ( 0 0 ( )= ) = 0 1 ( ) ( 0 0 ( )= ) = 0 0 ( ) ( 0 =0 ( )= ) = 0 1 (1 ) 98

99 where and 0 = 0 1 ( ) = ( 1 (1 )) 0 ( ) = ( 1 (1 )) (1 ) are inverse Mills ratio terms. Substituting these into the expressions for the biases Bias ( ) = 0 1 ( ) 0 0 ( ) = 0 ( ) 99

100 Bias ( ) = 1 1 ( ) 0 0 ( ) = ( )( 1 (1 )+ 0 ) Bias = 1 1 ( ) 0 0 ( ) 1 1 (1 )+ 0 1 (1 ) = ( )( 1 (1 )+ 0 ) 1 (1 )[ 1 0 ] where ( ) = 1 ( ) 0 ( ) = ( 1 (1 )) (1 ) 100

Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models

Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models DISCUSSION PAPER SERIES IZA DP No. 768 Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models James J. Heckman Salvador Navarro-Lozano April 2003 Forschungsinstitut

More information

Matching. James J. Heckman Econ 312. This draft, May 15, Intro Match Further MTE Impl Comp Gen. Roy Req Info Info Add Proxies Disc Modal Summ

Matching. James J. Heckman Econ 312. This draft, May 15, Intro Match Further MTE Impl Comp Gen. Roy Req Info Info Add Proxies Disc Modal Summ Matching James J. Heckman Econ 312 This draft, May 15, 2007 1 / 169 Introduction The assumption most commonly made to circumvent problems with randomization is that even though D is not random with respect

More information

Principles Underlying Evaluation Estimators

Principles Underlying Evaluation Estimators The Principles Underlying Evaluation Estimators James J. University of Chicago Econ 350, Winter 2019 The Basic Principles Underlying the Identification of the Main Econometric Evaluation Estimators Two

More information

Using matching, instrumental variables and control functions to estimate economic choice models

Using matching, instrumental variables and control functions to estimate economic choice models Using matching, instrumental variables and control functions to estimate economic choice models James Heckman Salvador Navarro-Lozano WORKING PAPER 2003:4 The Institute for Labour Market Policy Evaluation

More information

Ability Bias, Errors in Variables and Sibling Methods. James J. Heckman University of Chicago Econ 312 This draft, May 26, 2006

Ability Bias, Errors in Variables and Sibling Methods. James J. Heckman University of Chicago Econ 312 This draft, May 26, 2006 Ability Bias, Errors in Variables and Sibling Methods James J. Heckman University of Chicago Econ 312 This draft, May 26, 2006 1 1 Ability Bias Consider the model: log = 0 + 1 + where =income, = schooling,

More information

Estimation of Treatment Effects under Essential Heterogeneity

Estimation of Treatment Effects under Essential Heterogeneity Estimation of Treatment Effects under Essential Heterogeneity James Heckman University of Chicago and American Bar Foundation Sergio Urzua University of Chicago Edward Vytlacil Columbia University March

More information

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Econometrics Week 8 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 25 Recommended Reading For the today Instrumental Variables Estimation and Two Stage

More information

Four Parameters of Interest in the Evaluation. of Social Programs. James J. Heckman Justin L. Tobias Edward Vytlacil

Four Parameters of Interest in the Evaluation. of Social Programs. James J. Heckman Justin L. Tobias Edward Vytlacil Four Parameters of Interest in the Evaluation of Social Programs James J. Heckman Justin L. Tobias Edward Vytlacil Nueld College, Oxford, August, 2005 1 1 Introduction This paper uses a latent variable

More information

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed

More information

Adding Uncertainty to a Roy Economy with Two Sectors

Adding Uncertainty to a Roy Economy with Two Sectors Adding Uncertainty to a Roy Economy with Two Sectors James J. Heckman The University of Chicago Nueld College, Oxford University This draft, August 7, 2005 1 denotes dierent sectors. =0denotes choice of

More information

1 Motivation for Instrumental Variable (IV) Regression

1 Motivation for Instrumental Variable (IV) Regression ECON 370: IV & 2SLS 1 Instrumental Variables Estimation and Two Stage Least Squares Econometric Methods, ECON 370 Let s get back to the thiking in terms of cross sectional (or pooled cross sectional) data

More information

Propensity Score Methods for Causal Inference

Propensity Score Methods for Causal Inference John Pura BIOS790 October 2, 2015 Causal inference Philosophical problem, statistical solution Important in various disciplines (e.g. Koch s postulates, Bradford Hill criteria, Granger causality) Good

More information

Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity

Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity R.G. Pierse 1 Omitted Variables Suppose that the true model is Y i β 1 + β X i + β 3 X 3i + u i, i 1,, n (1.1) where β 3 0 but that the

More information

CHAPTER 6: SPECIFICATION VARIABLES

CHAPTER 6: SPECIFICATION VARIABLES Recall, we had the following six assumptions required for the Gauss-Markov Theorem: 1. The regression model is linear, correctly specified, and has an additive error term. 2. The error term has a zero

More information

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables Applied Econometrics (MSc.) Lecture 3 Instrumental Variables Estimation - Theory Department of Economics University of Gothenburg December 4, 2014 1/28 Why IV estimation? So far, in OLS, we assumed independence.

More information

The relationship between treatment parameters within a latent variable framework

The relationship between treatment parameters within a latent variable framework Economics Letters 66 (2000) 33 39 www.elsevier.com/ locate/ econbase The relationship between treatment parameters within a latent variable framework James J. Heckman *,1, Edward J. Vytlacil 2 Department

More information

More on Roy Model of Self-Selection

More on Roy Model of Self-Selection V. J. Hotz Rev. May 26, 2007 More on Roy Model of Self-Selection Results drawn on Heckman and Sedlacek JPE, 1985 and Heckman and Honoré, Econometrica, 1986. Two-sector model in which: Agents are income

More information

Lecture 8. Roy Model, IV with essential heterogeneity, MTE

Lecture 8. Roy Model, IV with essential heterogeneity, MTE Lecture 8. Roy Model, IV with essential heterogeneity, MTE Economics 2123 George Washington University Instructor: Prof. Ben Williams Heterogeneity When we talk about heterogeneity, usually we mean heterogeneity

More information

Multiple Linear Regression CIVL 7012/8012

Multiple Linear Regression CIVL 7012/8012 Multiple Linear Regression CIVL 7012/8012 2 Multiple Regression Analysis (MLR) Allows us to explicitly control for many factors those simultaneously affect the dependent variable This is important for

More information

ECNS 561 Multiple Regression Analysis

ECNS 561 Multiple Regression Analysis ECNS 561 Multiple Regression Analysis Model with Two Independent Variables Consider the following model Crime i = β 0 + β 1 Educ i + β 2 [what else would we like to control for?] + ε i Here, we are taking

More information

Exploring Marginal Treatment Effects

Exploring Marginal Treatment Effects Exploring Marginal Treatment Effects Flexible estimation using Stata Martin Eckhoff Andresen Statistics Norway Oslo, September 12th 2018 Martin Andresen (SSB) Exploring MTEs Oslo, 2018 1 / 25 Introduction

More information

EMERGING MARKETS - Lecture 2: Methodology refresher

EMERGING MARKETS - Lecture 2: Methodology refresher EMERGING MARKETS - Lecture 2: Methodology refresher Maria Perrotta April 4, 2013 SITE http://www.hhs.se/site/pages/default.aspx My contact: maria.perrotta@hhs.se Aim of this class There are many different

More information

Causal Analysis After Haavelmo: Definitions and a Unified Analysis of Identification of Recursive Causal Models

Causal Analysis After Haavelmo: Definitions and a Unified Analysis of Identification of Recursive Causal Models Causal Inference in the Social Sciences University of Michigan December 12, 2012 This draft, December 15, 2012 James Heckman and Rodrigo Pinto Causal Analysis After Haavelmo, December 15, 2012 1 / 157

More information

Internal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9.

Internal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9. Section 7 Model Assessment This section is based on Stock and Watson s Chapter 9. Internal vs. external validity Internal validity refers to whether the analysis is valid for the population and sample

More information

Lecture 11 Roy model, MTE, PRTE

Lecture 11 Roy model, MTE, PRTE Lecture 11 Roy model, MTE, PRTE Economics 2123 George Washington University Instructor: Prof. Ben Williams Roy Model Motivation The standard textbook example of simultaneity is a supply and demand system

More information

Econometric Causality

Econometric Causality Econometric (2008) International Statistical Review, 76(1):1-27 James J. Heckman Spencer/INET Conference University of Chicago Econometric The econometric approach to causality develops explicit models

More information

The Generalized Roy Model and Treatment Effects

The Generalized Roy Model and Treatment Effects The Generalized Roy Model and Treatment Effects Christopher Taber University of Wisconsin November 10, 2016 Introduction From Imbens and Angrist we showed that if one runs IV, we get estimates of the Local

More information

AGEC 661 Note Fourteen

AGEC 661 Note Fourteen AGEC 661 Note Fourteen Ximing Wu 1 Selection bias 1.1 Heckman s two-step model Consider the model in Heckman (1979) Y i = X iβ + ε i, D i = I {Z iγ + η i > 0}. For a random sample from the population,

More information

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Many economic models involve endogeneity: that is, a theoretical relationship does not fit

More information

Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems

Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems Functional form misspecification We may have a model that is correctly specified, in terms of including

More information

Methods to Estimate Causal Effects Theory and Applications. Prof. Dr. Sascha O. Becker U Stirling, Ifo, CESifo and IZA

Methods to Estimate Causal Effects Theory and Applications. Prof. Dr. Sascha O. Becker U Stirling, Ifo, CESifo and IZA Methods to Estimate Causal Effects Theory and Applications Prof. Dr. Sascha O. Becker U Stirling, Ifo, CESifo and IZA last update: 21 August 2009 Preliminaries Address Prof. Dr. Sascha O. Becker Stirling

More information

LECTURE 11. Introduction to Econometrics. Autocorrelation

LECTURE 11. Introduction to Econometrics. Autocorrelation LECTURE 11 Introduction to Econometrics Autocorrelation November 29, 2016 1 / 24 ON PREVIOUS LECTURES We discussed the specification of a regression equation Specification consists of choosing: 1. correct

More information

Probabilistic Choice Models

Probabilistic Choice Models Probabilistic Choice Models James J. Heckman University of Chicago Econ 312 This draft, March 29, 2006 This chapter examines dierent models commonly used to model probabilistic choice, such as eg the choice

More information

Instrumental Variables in Models with Multiple Outcomes: The General Unordered Case

Instrumental Variables in Models with Multiple Outcomes: The General Unordered Case DISCUSSION PAPER SERIES IZA DP No. 3565 Instrumental Variables in Models with Multiple Outcomes: The General Unordered Case James J. Heckman Sergio Urzua Edward Vytlacil June 2008 Forschungsinstitut zur

More information

ECO Class 6 Nonparametric Econometrics

ECO Class 6 Nonparametric Econometrics ECO 523 - Class 6 Nonparametric Econometrics Carolina Caetano Contents 1 Nonparametric instrumental variable regression 1 2 Nonparametric Estimation of Average Treatment Effects 3 2.1 Asymptotic results................................

More information

Testing Restrictions and Comparing Models

Testing Restrictions and Comparing Models Econ. 513, Time Series Econometrics Fall 00 Chris Sims Testing Restrictions and Comparing Models 1. THE PROBLEM We consider here the problem of comparing two parametric models for the data X, defined by

More information

Applied Health Economics (for B.Sc.)

Applied Health Economics (for B.Sc.) Applied Health Economics (for B.Sc.) Helmut Farbmacher Department of Economics University of Mannheim Autumn Semester 2017 Outlook 1 Linear models (OLS, Omitted variables, 2SLS) 2 Limited and qualitative

More information

Controlling for Time Invariant Heterogeneity

Controlling for Time Invariant Heterogeneity Controlling for Time Invariant Heterogeneity Yona Rubinstein July 2016 Yona Rubinstein (LSE) Controlling for Time Invariant Heterogeneity 07/16 1 / 19 Observables and Unobservables Confounding Factors

More information

Dealing With Endogeneity

Dealing With Endogeneity Dealing With Endogeneity Junhui Qian December 22, 2014 Outline Introduction Instrumental Variable Instrumental Variable Estimation Two-Stage Least Square Estimation Panel Data Endogeneity in Econometrics

More information

Logistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015

Logistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015 Logistic regression: Why we often can do what we think we can do Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015 1 Introduction Introduction - In 2010 Carina Mood published an overview article

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple

More information

ECONOMETRICS II (ECO 2401) Victor Aguirregabiria. Spring 2018 TOPIC 4: INTRODUCTION TO THE EVALUATION OF TREATMENT EFFECTS

ECONOMETRICS II (ECO 2401) Victor Aguirregabiria. Spring 2018 TOPIC 4: INTRODUCTION TO THE EVALUATION OF TREATMENT EFFECTS ECONOMETRICS II (ECO 2401) Victor Aguirregabiria Spring 2018 TOPIC 4: INTRODUCTION TO THE EVALUATION OF TREATMENT EFFECTS 1. Introduction and Notation 2. Randomized treatment 3. Conditional independence

More information

SUPPOSE a policy is proposed for adoption in a country.

SUPPOSE a policy is proposed for adoption in a country. The Review of Economics and Statistics VOL. LXXXVIII AUGUST 2006 NUMBER 3 UNDERSTANDING INSTRUMENTAL VARIABLES IN MODELS WITH ESSENTIAL HETEROGENEITY James J. Heckman, Sergio Urzua, and Edward Vytlacil*

More information

Multiple Regression. Midterm results: AVG = 26.5 (88%) A = 27+ B = C =

Multiple Regression. Midterm results: AVG = 26.5 (88%) A = 27+ B = C = Economics 130 Lecture 6 Midterm Review Next Steps for the Class Multiple Regression Review & Issues Model Specification Issues Launching the Projects!!!!! Midterm results: AVG = 26.5 (88%) A = 27+ B =

More information

Policy-Relevant Treatment Effects

Policy-Relevant Treatment Effects Policy-Relevant Treatment Effects By JAMES J. HECKMAN AND EDWARD VYTLACIL* Accounting for individual-level heterogeneity in the response to treatment is a major development in the econometric literature

More information

Discrete Dependent Variable Models

Discrete Dependent Variable Models Discrete Dependent Variable Models James J. Heckman University of Chicago This draft, April 10, 2006 Here s the general approach of this lecture: Economic model Decision rule (e.g. utility maximization)

More information

Comparing IV with Structural Models: What Simple IV Can and Cannot Identify

Comparing IV with Structural Models: What Simple IV Can and Cannot Identify DISCUSSION PAPER SERIES IZA DP No. 3980 Comparing IV with Structural Models: What Simple IV Can and Cannot Identify James J. Heckman Sergio Urzúa January 2009 Forschungsinstitut zur Zukunft der Arbeit

More information

Part VII. Accounting for the Endogeneity of Schooling. Endogeneity of schooling Mean growth rate of earnings Mean growth rate Selection bias Summary

Part VII. Accounting for the Endogeneity of Schooling. Endogeneity of schooling Mean growth rate of earnings Mean growth rate Selection bias Summary Part VII Accounting for the Endogeneity of Schooling 327 / 785 Much of the CPS-Census literature on the returns to schooling ignores the choice of schooling and its consequences for estimating the rate

More information

IDENTIFICATION OF TREATMENT EFFECTS WITH SELECTIVE PARTICIPATION IN A RANDOMIZED TRIAL

IDENTIFICATION OF TREATMENT EFFECTS WITH SELECTIVE PARTICIPATION IN A RANDOMIZED TRIAL IDENTIFICATION OF TREATMENT EFFECTS WITH SELECTIVE PARTICIPATION IN A RANDOMIZED TRIAL BRENDAN KLINE AND ELIE TAMER Abstract. Randomized trials (RTs) are used to learn about treatment effects. This paper

More information

Lecture 11/12. Roy Model, MTE, Structural Estimation

Lecture 11/12. Roy Model, MTE, Structural Estimation Lecture 11/12. Roy Model, MTE, Structural Estimation Economics 2123 George Washington University Instructor: Prof. Ben Williams Roy model The Roy model is a model of comparative advantage: Potential earnings

More information

The Problem of Causality in the Analysis of Educational Choices and Labor Market Outcomes Slides for Lectures

The Problem of Causality in the Analysis of Educational Choices and Labor Market Outcomes Slides for Lectures The Problem of Causality in the Analysis of Educational Choices and Labor Market Outcomes Slides for Lectures Andrea Ichino (European University Institute and CEPR) February 28, 2006 Abstract This course

More information

Applied Statistics and Econometrics. Giuseppe Ragusa Lecture 15: Instrumental Variables

Applied Statistics and Econometrics. Giuseppe Ragusa Lecture 15: Instrumental Variables Applied Statistics and Econometrics Giuseppe Ragusa Lecture 15: Instrumental Variables Outline Introduction Endogeneity and Exogeneity Valid Instruments TSLS Testing Validity 2 Instrumental Variables Regression

More information

Econometrics II. Nonstandard Standard Error Issues: A Guide for the. Practitioner

Econometrics II. Nonstandard Standard Error Issues: A Guide for the. Practitioner Econometrics II Nonstandard Standard Error Issues: A Guide for the Practitioner Måns Söderbom 10 May 2011 Department of Economics, University of Gothenburg. Email: mans.soderbom@economics.gu.se. Web: www.economics.gu.se/soderbom,

More information

Understanding Instrumental Variables in Models with Essential Heterogeneity

Understanding Instrumental Variables in Models with Essential Heterogeneity DISCUSSION PAPER SERIES IZA DP No. 2320 Understanding Instrumental Variables in Models with Essential Heterogeneity James J. Heckman Sergio Urzua Edward Vytlacil September 2006 Forschungsinstitut zur Zukunft

More information

statistical sense, from the distributions of the xs. The model may now be generalized to the case of k regressors:

statistical sense, from the distributions of the xs. The model may now be generalized to the case of k regressors: Wooldridge, Introductory Econometrics, d ed. Chapter 3: Multiple regression analysis: Estimation In multiple regression analysis, we extend the simple (two-variable) regression model to consider the possibility

More information

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Arthur Lewbel Boston College Original December 2016, revised July 2017 Abstract Lewbel (2012)

More information

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008 A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. Linear-in-Parameters Models: IV versus Control Functions 2. Correlated

More information

Föreläsning /31

Föreläsning /31 1/31 Föreläsning 10 090420 Chapter 13 Econometric Modeling: Model Speci cation and Diagnostic testing 2/31 Types of speci cation errors Consider the following models: Y i = β 1 + β 2 X i + β 3 X 2 i +

More information

STOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Per Pettersson-Lidbom Number of creds: 7,5 creds Date of exam: Thursday, January 15, 009 Examination

More information

Using Instrumental Variables to Find Causal Effects in Public Health

Using Instrumental Variables to Find Causal Effects in Public Health 1 Using Instrumental Variables to Find Causal Effects in Public Health Antonio Trujillo, PhD John Hopkins Bloomberg School of Public Health Department of International Health Health Systems Program October

More information

Generalized Roy Model and Cost-Benefit Analysis of Social Programs 1

Generalized Roy Model and Cost-Benefit Analysis of Social Programs 1 Generalized Roy Model and Cost-Benefit Analysis of Social Programs 1 James J. Heckman The University of Chicago University College Dublin Philipp Eisenhauer University of Mannheim Edward Vytlacil Columbia

More information

An Introduction to Mplus and Path Analysis

An Introduction to Mplus and Path Analysis An Introduction to Mplus and Path Analysis PSYC 943: Fundamentals of Multivariate Modeling Lecture 10: October 30, 2013 PSYC 943: Lecture 10 Today s Lecture Path analysis starting with multivariate regression

More information

Topic 4: Model Specifications

Topic 4: Model Specifications Topic 4: Model Specifications Advanced Econometrics (I) Dong Chen School of Economics, Peking University 1 Functional Forms 1.1 Redefining Variables Change the unit of measurement of the variables will

More information

Comparative Advantage and Schooling

Comparative Advantage and Schooling Comparative Advantage and Schooling Pedro Carneiro University College London, Institute for Fiscal Studies and IZA Sokbae Lee University College London and Institute for Fiscal Studies June 7, 2004 Abstract

More information

6.3 How the Associational Criterion Fails

6.3 How the Associational Criterion Fails 6.3. HOW THE ASSOCIATIONAL CRITERION FAILS 271 is randomized. We recall that this probability can be calculated from a causal model M either directly, by simulating the intervention do( = x), or (if P

More information

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Arthur Lewbel Boston College December 2016 Abstract Lewbel (2012) provides an estimator

More information

The problem of causality in microeconometrics.

The problem of causality in microeconometrics. The problem of causality in microeconometrics. Andrea Ichino European University Institute April 15, 2014 Contents 1 The Problem of Causality 1 1.1 A formal framework to think about causality....................................

More information

2 Prediction and Analysis of Variance

2 Prediction and Analysis of Variance 2 Prediction and Analysis of Variance Reading: Chapters and 2 of Kennedy A Guide to Econometrics Achen, Christopher H. Interpreting and Using Regression (London: Sage, 982). Chapter 4 of Andy Field, Discovering

More information

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors Laura Mayoral IAE, Barcelona GSE and University of Gothenburg Gothenburg, May 2015 Roadmap of the course Introduction.

More information

An Introduction to Path Analysis

An Introduction to Path Analysis An Introduction to Path Analysis PRE 905: Multivariate Analysis Lecture 10: April 15, 2014 PRE 905: Lecture 10 Path Analysis Today s Lecture Path analysis starting with multivariate regression then arriving

More information

NBER WORKING PAPER SERIES A NOTE ON ADAPTING PROPENSITY SCORE MATCHING AND SELECTION MODELS TO CHOICE BASED SAMPLES. James J. Heckman Petra E.

NBER WORKING PAPER SERIES A NOTE ON ADAPTING PROPENSITY SCORE MATCHING AND SELECTION MODELS TO CHOICE BASED SAMPLES. James J. Heckman Petra E. NBER WORKING PAPER SERIES A NOTE ON ADAPTING PROPENSITY SCORE MATCHING AND SELECTION MODELS TO CHOICE BASED SAMPLES James J. Heckman Petra E. Todd Working Paper 15179 http://www.nber.org/papers/w15179

More information

The Econometric Evaluation of Policy Design: Part I: Heterogeneity in Program Impacts, Modeling Self-Selection, and Parameters of Interest

The Econometric Evaluation of Policy Design: Part I: Heterogeneity in Program Impacts, Modeling Self-Selection, and Parameters of Interest The Econometric Evaluation of Policy Design: Part I: Heterogeneity in Program Impacts, Modeling Self-Selection, and Parameters of Interest Edward Vytlacil, Yale University Renmin University, Department

More information

TECHNICAL WORKING PAPER SERIES LOCAL INSTRUMENTAL VARIABLES. James J. Heckman Edward J. Vytlacil

TECHNICAL WORKING PAPER SERIES LOCAL INSTRUMENTAL VARIABLES. James J. Heckman Edward J. Vytlacil TECHNICAL WORKING PAPER SERIES LOCAL INSTRUMENTAL VARIABLES James J. Heckman Edward J. Vytlacil Technical Working Paper 252 http://www.nber.org/papers/t252 NATIONAL BUREAU OF ECONOMIC RESEARCH 15 Massachusetts

More information

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors Laura Mayoral IAE, Barcelona GSE and University of Gothenburg Gothenburg, May 2015 Roadmap Deviations from the standard

More information

Simultaneous Equation Models Learning Objectives Introduction Introduction (2) Introduction (3) Solving the Model structural equations

Simultaneous Equation Models Learning Objectives Introduction Introduction (2) Introduction (3) Solving the Model structural equations Simultaneous Equation Models. Introduction: basic definitions 2. Consequences of ignoring simultaneity 3. The identification problem 4. Estimation of simultaneous equation models 5. Example: IS LM model

More information

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. The Sharp RD Design 3.

More information

Additional Material for Estimating the Technology of Cognitive and Noncognitive Skill Formation (Cuttings from the Web Appendix)

Additional Material for Estimating the Technology of Cognitive and Noncognitive Skill Formation (Cuttings from the Web Appendix) Additional Material for Estimating the Technology of Cognitive and Noncognitive Skill Formation (Cuttings from the Web Appendix Flavio Cunha The University of Pennsylvania James Heckman The University

More information

More on Specification and Data Issues

More on Specification and Data Issues More on Specification and Data Issues Ping Yu School of Economics and Finance The University of Hong Kong Ping Yu (HKU) Specification and Data Issues 1 / 35 Functional Form Misspecification Functional

More information

LECTURE 5. Introduction to Econometrics. Hypothesis testing

LECTURE 5. Introduction to Econometrics. Hypothesis testing LECTURE 5 Introduction to Econometrics Hypothesis testing October 18, 2016 1 / 26 ON TODAY S LECTURE We are going to discuss how hypotheses about coefficients can be tested in regression models We will

More information

Instrumental Variables: Then and Now

Instrumental Variables: Then and Now Instrumental Variables: Then and Now James Heckman University of Chicago and University College Dublin Econometric Policy Evaluation, Lecture II Koopmans Memorial Lectures Cowles Foundation Yale University

More information

Econ 673: Microeconometrics Chapter 12: Estimating Treatment Effects. The Problem

Econ 673: Microeconometrics Chapter 12: Estimating Treatment Effects. The Problem Econ 673: Microeconometrics Chapter 12: Estimating Treatment Effects The Problem Analysts are frequently interested in measuring the impact of a treatment on individual behavior; e.g., the impact of job

More information

Gov 2002: 4. Observational Studies and Confounding

Gov 2002: 4. Observational Studies and Confounding Gov 2002: 4. Observational Studies and Confounding Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last two weeks: randomized experiments. From here on: observational studies. What

More information

The returns to schooling, ability bias, and regression

The returns to schooling, ability bias, and regression The returns to schooling, ability bias, and regression Jörn-Steffen Pischke LSE October 4, 2016 Pischke (LSE) Griliches 1977 October 4, 2016 1 / 44 Counterfactual outcomes Scholing for individual i is

More information

Statistical Inference with Regression Analysis

Statistical Inference with Regression Analysis Introductory Applied Econometrics EEP/IAS 118 Spring 2015 Steven Buck Lecture #13 Statistical Inference with Regression Analysis Next we turn to calculating confidence intervals and hypothesis testing

More information

A Note on Adapting Propensity Score Matching and Selection Models to Choice Based Samples

A Note on Adapting Propensity Score Matching and Selection Models to Choice Based Samples DISCUSSION PAPER SERIES IZA DP No. 4304 A Note on Adapting Propensity Score Matching and Selection Models to Choice Based Samples James J. Heckman Petra E. Todd July 2009 Forschungsinstitut zur Zukunft

More information

Identifying and Estimating the Distributions of Ex Post. and Ex Ante Returns to Schooling

Identifying and Estimating the Distributions of Ex Post. and Ex Ante Returns to Schooling Identifying and Estimating the Distributions of Ex Post and Ex Ante Returns to Schooling Flavio Cunha University of Chicago James J. Heckman University of Chicago and This draft, May 18, 2007 University

More information

Contest Quiz 3. Question Sheet. In this quiz we will review concepts of linear regression covered in lecture 2.

Contest Quiz 3. Question Sheet. In this quiz we will review concepts of linear regression covered in lecture 2. Updated: November 17, 2011 Lecturer: Thilo Klein Contact: tk375@cam.ac.uk Contest Quiz 3 Question Sheet In this quiz we will review concepts of linear regression covered in lecture 2. NOTE: Please round

More information

Analogy Principle. Asymptotic Theory Part II. James J. Heckman University of Chicago. Econ 312 This draft, April 5, 2006

Analogy Principle. Asymptotic Theory Part II. James J. Heckman University of Chicago. Econ 312 This draft, April 5, 2006 Analogy Principle Asymptotic Theory Part II James J. Heckman University of Chicago Econ 312 This draft, April 5, 2006 Consider four methods: 1. Maximum Likelihood Estimation (MLE) 2. (Nonlinear) Least

More information

Lecture 2: Constant Treatment Strategies. Donglin Zeng, Department of Biostatistics, University of North Carolina

Lecture 2: Constant Treatment Strategies. Donglin Zeng, Department of Biostatistics, University of North Carolina Lecture 2: Constant Treatment Strategies Introduction Motivation We will focus on evaluating constant treatment strategies in this lecture. We will discuss using randomized or observational study for these

More information

Linear Regression with Multiple Regressors

Linear Regression with Multiple Regressors Linear Regression with Multiple Regressors (SW Chapter 6) Outline 1. Omitted variable bias 2. Causality and regression analysis 3. Multiple regression and OLS 4. Measures of fit 5. Sampling distribution

More information

1/34 3/ Omission of a relevant variable(s) Y i = α 1 + α 2 X 1i + α 3 X 2i + u 2i

1/34 3/ Omission of a relevant variable(s) Y i = α 1 + α 2 X 1i + α 3 X 2i + u 2i 1/34 Outline Basic Econometrics in Transportation Model Specification How does one go about finding the correct model? What are the consequences of specification errors? How does one detect specification

More information

1 A Non-technical Introduction to Regression

1 A Non-technical Introduction to Regression 1 A Non-technical Introduction to Regression Chapters 1 and Chapter 2 of the textbook are reviews of material you should know from your previous study (e.g. in your second year course). They cover, in

More information

2. Linear regression with multiple regressors

2. Linear regression with multiple regressors 2. Linear regression with multiple regressors Aim of this section: Introduction of the multiple regression model OLS estimation in multiple regression Measures-of-fit in multiple regression Assumptions

More information

Chapter 7. Testing Linear Restrictions on Regression Coefficients

Chapter 7. Testing Linear Restrictions on Regression Coefficients Chapter 7 Testing Linear Restrictions on Regression Coefficients 1.F-tests versus t-tests In the previous chapter we discussed several applications of the t-distribution to testing hypotheses in the linear

More information

Econometrics with Observational Data. Introduction and Identification Todd Wagner February 1, 2017

Econometrics with Observational Data. Introduction and Identification Todd Wagner February 1, 2017 Econometrics with Observational Data Introduction and Identification Todd Wagner February 1, 2017 Goals for Course To enable researchers to conduct careful quantitative analyses with existing VA (and non-va)

More information

Linear Regression with Multiple Regressors

Linear Regression with Multiple Regressors Linear Regression with Multiple Regressors (SW Chapter 6) Outline 1. Omitted variable bias 2. Causality and regression analysis 3. Multiple regression and OLS 4. Measures of fit 5. Sampling distribution

More information

Economics 300. Econometrics Multiple Regression: Extensions and Issues

Economics 300. Econometrics Multiple Regression: Extensions and Issues Economics 300 Econometrics Multiple : Extensions and Dennis C. Plott University of Illinois at Chicago Department of Economics www.dennisplott.com Fall 2014 Dennis C. Plott (UIC) ECON 300 Fall 2014 1 /

More information

Economics 326 Methods of Empirical Research in Economics. Lecture 14: Hypothesis testing in the multiple regression model, Part 2

Economics 326 Methods of Empirical Research in Economics. Lecture 14: Hypothesis testing in the multiple regression model, Part 2 Economics 326 Methods of Empirical Research in Economics Lecture 14: Hypothesis testing in the multiple regression model, Part 2 Vadim Marmer University of British Columbia May 5, 2010 Multiple restrictions

More information

Econometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous

Econometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous Econometrics of causal inference Throughout, we consider the simplest case of a linear outcome equation, and homogeneous effects: y = βx + ɛ (1) where y is some outcome, x is an explanatory variable, and

More information

THE MULTIVARIATE LINEAR REGRESSION MODEL

THE MULTIVARIATE LINEAR REGRESSION MODEL THE MULTIVARIATE LINEAR REGRESSION MODEL Why multiple regression analysis? Model with more than 1 independent variable: y 0 1x1 2x2 u It allows : -Controlling for other factors, and get a ceteris paribus

More information