David M. Drukker Italian Stata Users Group meeting 15 November Executive Director of Econometrics Stata

Size: px
Start display at page:

Download "David M. Drukker Italian Stata Users Group meeting 15 November Executive Director of Econometrics Stata"

Transcription

1 Estimating the ATE of an endogenously assigned treatment from a sample with endogenous selection by regression adjustment using an extended regression models David M. Drukker Executive Director of Econometrics Stata 2018 Italian Stata Users Group meeting 15 November 2018

2 Fictional data on wellness ram from large company. use wram2. describe Contains data from wram2.dta obs: 3,000 vars: 8 28 Jul :13 size: 96,000 storage display value variable name type format label variable label wchange float %9.0g changel Weight change level age float %9.0g Years over 50 over float %9.0g Overweight (tens of pounds) phealth float %9.0g Prior health score float %9.0g yesno Participate in wellness ram wt float %9.0g yesno Offered work time to participate in ram wtsamp float %9.0g Offered work time to participate in sample insamp float %9.0g In sample: attended initial and final weigh in Sorted by: 1 / 41

3 Three levels of wchange. tabulate wchange Weight Participate in change wellness ram level No Yes Total Loss ,114 No change Gain Total 589 1,295 1,884 2 / 41

4 Three levels of wchange. tabulate wchange Weight Participate in change wellness ram level No Yes Total Loss ,114 No change Gain Total 589 1,295 1,884 Data are observational 2 / 41

5 Dealing with observational data. tabulate wchange Weight Participate in change wellness ram level No Yes Total Loss ,114 No change Gain / 41 Total 589 1,295 1,884 Table does not account for how observed covariates that affect ram participation also affect the potential outcome variables Assume the treatment is as good as random after conditioning on covariates Conditional mean independence Exogenous treatment assignment teffects

6 Dealing with observational data. tabulate wchange Weight Participate in change wellness ram level No Yes Total Loss ,114 No change Gain Total 589 1,295 1,884 Table does not account for how observed unobserved error that affect ram participation also affect the potential outcome variables Endogenous treatment assignment etefffects and etregress for continuous outcomes etpoisson for count outcomes Need Stata command for ordinal outcome 4 / 41

7 Dealing with observational data. tabulate wchange Weight Participate in change wellness ram level No Yes Total Loss ,114 No change Gain Total 589 1,295 1,884 Table does not account for the possibility that unobserved errors in the process that caused some of 3,000 individuals not to show for the final weigh in may also affect the potential outcome variables Endogenous loss to follow up Endogenous sample selection 5 / 41

8 Ordinal Potential outcomes Because the outcome wchange is ordinal, there are really three binary outcomes wchange== Loss, wchange== No Change, and wchange== Gain 6 / 41

9 Ordinal Potential outcomes In the potential outcome framework, there is an outcome for each person when they participate and when the do not participate 7 / 41

10 Ordinal Potential outcomes In the potential outcome framework, there is an outcome for each person when they participate and when the do not participate Thus, there are really three binary outcomes for each potential outcome Participate Not participate wchange p == Loss wchange np == Loss wchange p == No change wchange np == No change wchange p == Gain wchange np == Gain 7 / 41

11 Potential outcome framework For each outcome (Loss, No change, and Gain), we only observe one of these two potential outcomes for each individual 8 / 41

12 Potential outcome framework For each outcome (Loss, No change, and Gain), we only observe one of these two potential outcomes for each individual We estimate the parameters of a model and use the estimated parameters to predict what each person does in the unobserved potential outcome Regression adjustment 8 / 41

13 Average treatment effects In the case of one outcome, the average treatment effect (ATE) is E[y p y np ] 9 / 41

14 Average treatment effects In the case of one outcome, the average treatment effect (ATE) is E[y p y np ] As there are three outcomes, there are three ATEs one for Loss, one for No Change, and one for Gain 9 / 41

15 Average treatment effects In the case of one outcome, the average treatment effect (ATE) is E[y p y np ] As there are three outcomes, there are three ATEs one for Loss, one for No Change, and one for Gain ATE Loss = E[(wchange p == Loss ) (wchange np == Loss )] ATE Nochange = E[(wchange p == No change ) (wchange np == No change )] ATE Gain = E[(wchange p == Gain ) (wchange np == Gain )] 9 / 41

16 Average treatment effects I will provide some details about the average treatment effect for Loss The details for the outcomes of No change and Gain are analogous 10 / 41

17 the average treatment effect (ATE) of the ram on the Loss outcome ATE Loss ATE Loss = E[(wchange p == Loss ) (wchange np == Loss )] The first line says that ATE Loss is the mean diffence in the outcomes when everyone participates instead of no one participates 11 / 41

18 the average treatment effect (ATE) of the ram on the Loss outcome ATE Loss ATE Loss = E[(wchange p == Loss ) (wchange np == Loss )] = E[wchange p == Loss ] E[wchange np == Loss ] The second line says that the mean of the differences is the difference in the means 12 / 41

19 the average treatment effect (ATE) of the ram on the Loss outcome ATE Loss ATE Loss = E[(wchange p == Loss ) (wchange np == Loss )] = E[wchange p == Loss ] E[wchange np == Loss ] = Pr[wchange p == Loss ] Pr[wchange np == Loss ] The third line says that because the mean of binary outcome is the probability that the event is true, the ATE Loss is the difference in the probability an individual is in the state of Loss when everyone participates instead of no one participates 13 / 41

20 14 / 41 I am going to use the ERM comand eoprobit to estimate the parameters of Pr[wchange p == Loss x] and Pr[wchange np == Loss x] and

21 I am going to use the ERM comand eoprobit to estimate the parameters of Pr[wchange p == Loss x] and Pr[wchange np == Loss x] and Then I use margins or estat teffects to estimate E[Pr[wchange p == Loss x]] E[Pr[wchange np == Loss x]] = Pr[wchange p == Loss ] Pr[wchange np == Loss ] = ATE Loss 14 / 41

22 I am going to use the ERM comand eoprobit to estimate the parameters of Pr[wchange p == Loss x] and Pr[wchange np == Loss x] and Then I use margins or estat teffects to estimate E[Pr[wchange p == Loss x]] E[Pr[wchange np == Loss x]] = Pr[wchange p == Loss ] Pr[wchange np == Loss ] = ATE Loss The ATE Loss is the mean difference in the probability an individual is in the state of Loss when everyone participates instead of no one participates 14 / 41

23 Models for the ordinal outcome For exogenous treatment, we do a one-step equivalent to fitting two separate ordinal probit models One fit to partipants Another fit to non partipants 15 / 41

24 Model for partipants Loss if xβ 0 + ɛ 0 cut1 0 wchange = No change if cut1 0 < xβ 0 + ɛ 0 cut2 0 Gain if cut2 0 < xβ 0 + ɛ 0 xβ 0 = β 1,0 age + β 2,0 over + β 3,0 phealth for the observations at which =0, and ɛ 0, is standard normal 16 / 41

25 Model for nonparticipants Loss if xβ 1 + ɛ 1 cut1 1 wchange = No change if cut1 1 < xβ 1 + ɛ 1 cut2 1 Gain if cut2 1 < xβ 1 + ɛ 1 xβ 1 = β 1,1 age + β 2,1 over + β 3,1 phealth for the observations at which =1 ɛ 1 is standard normal 17 / 41

26 Loss if xβ 0 + ɛ 0 cut1 0 wchange = No change if cut1 0 < xβ 0 + ɛ 0 cut2 0 Gain if cut2 0 < xβ 0 + ɛ 0 xβ 0 = β 1,0 age + β 2,0 over + β 3,0 phealth for the observations at which =0, and Loss if xβ 1 + ɛ 1 cut1 1 wchange = No change if cut1 1 < xβ 1 + ɛ 1 cut2 1 Gain if cut2 1 < xβ 1 + ɛ 1 xβ 1 = β 1,1 age + β 2,1 over + β 3,1 phealth for the observations at which =1 ɛ 0, and ɛ 1 are normal corr(ɛ 0, ɛ 1 ) is not identified or estimated 18 / 41

27 . eoprobit wchange age over phealth, extreat() vsquish nolog Extended ordered probit regression Number of obs = 1,884 Wald chi2(6) = Log likelihood = Prob > chi2 = wchange Coef. Std. Err. z P> z [95% Conf. Interval] #c.age No Yes #c.over No Yes # c.phealth No Yes /wchange #c.cut1 No Yes #c.cut2 No Yes estimates store oprobit 19 / 41

28 . estat teffects Predictive margins Number of obs = 3,000 ATE_Pr0 : Pr(wchange=0=Loss) ATE_Pr1 : Pr(wchange=1=No change) ATE_Pr2 : Pr(wchange=2=Gain) Unconditional Margin Std. Err. z P> z [95% Conf. Interval] ATE_Pr0 (Yes vs No) ATE_Pr1 (Yes vs No) ATE_Pr2 (Yes vs No) When everyone joins the ram instead of when no one participants in the ram, 20 / 41

29 . estat teffects Predictive margins Number of obs = 3,000 ATE_Pr0 : Pr(wchange=0=Loss) ATE_Pr1 : Pr(wchange=1=No change) ATE_Pr2 : Pr(wchange=2=Gain) Unconditional Margin Std. Err. z P> z [95% Conf. Interval] ATE_Pr0 (Yes vs No) ATE_Pr1 (Yes vs No) ATE_Pr2 (Yes vs No) When everyone joins the ram instead of when no one participants in the ram, On average, the probablity of Loss goes up by / 41

30 . estat teffects Predictive margins Number of obs = 3,000 ATE_Pr0 : Pr(wchange=0=Loss) ATE_Pr1 : Pr(wchange=1=No change) ATE_Pr2 : Pr(wchange=2=Gain) Unconditional Margin Std. Err. z P> z [95% Conf. Interval] ATE_Pr0 (Yes vs No) ATE_Pr1 (Yes vs No) ATE_Pr2 (Yes vs No) / 41 When everyone joins the ram instead of when no one participants in the ram, On average, the probablity of Loss goes up by.44 On average, the probablity of No change goes down by.17

31 When everyone joins the ram instead of when no one participants in the ram, On average, the probablity of Loss goes up by.44 On average, the probablity of No change goes down by.17 On average, the probablity of Gain goes down / 41. estat teffects Predictive margins Number of obs = 3,000 ATE_Pr0 : Pr(wchange=0=Loss) ATE_Pr1 : Pr(wchange=1=No change) ATE_Pr2 : Pr(wchange=2=Gain) Unconditional Margin Std. Err. z P> z [95% Conf. Interval] ATE_Pr0 (Yes vs No) ATE_Pr1 (Yes vs No) ATE_Pr2 (Yes vs No)

32 None lost to follow up Some observations on wchange are missing No observations on covariates are missing Can do predictions for all cases 21 / 41

33 ATE: How (1). generate _original =. replace = 0 (1,700 real changes made). predict double pr_loss_0, outlevel("loss") (option pr assumed; predicted probabilities). replace = 1 (3,000 real changes made). predict double pr_loss_1, outlevel("loss") (option pr assumed; predicted probabilities). replace = _original (1,300 real changes made). drop _original. mean pr_loss_0 pr_loss_1 Mean estimation Number of obs = 3,000 Mean Std. Err. [95% Conf. Interval] pr_loss_ pr_loss_ / 41

34 ATE: How (2). estimates restore oprobit (results oprobit are active now). margins, /// > predict(outlevel("loss")) /// > predict(outlevel("no change")) /// > predict(outlevel("gain")) noesample Predictive margins Number of obs = 3,000 Model VCE : OIM 1._predict : Pr(wchange==Loss), predict(outlevel("loss")) 2._predict : Pr(wchange==No change), predict(outlevel("no change")) 3._predict : Pr(wchange==Gain), predict(outlevel("gain")) Delta-method Margin Std. Err. z P> z [95% Conf. Interval] _predict# 1#No #Yes #No #Yes #No #Yes / 41

35 ATE: How (3). margins r., /// > predict(outlevel("loss")) /// > predict(outlevel("no change")) /// > predict(outlevel("gain")) /// > contrast(nowald) /// > noesample Contrasts of predictive margins Model VCE : OIM 1._predict : Pr(wchange==Loss), predict(outlevel("loss")) 2._predict : Pr(wchange==No change), predict(outlevel("no change")) 3._predict : Pr(wchange==Gain), predict(outlevel("gain")) Delta-method Contrast Std. Err. [95% Conf. (Yes vs No) (Yes vs No) (Yes vs No) / 41

36 Endogenous Treatment model The potential-outcome model for an endogenous treatment Allows the coefficients to differ for the treated and not-treated state Allows the cut offs to differ for the treated and not-treated state Allows for distinct (nonzero) correlations between the errors driving treatment assignment and the errors driving the ordinal outcomes for the treated and not-treated states 25 / 41

37 26 / 41 = (xγ + γ 1 wt + η > 0)

38 = (xγ + γ 1 wt + η > 0) Loss if xβ 0 + ɛ 0 cut1 0 wchange = No change if cut1 0 < xβ 0 + ɛ 0 cut2 0 Gain if cut2 0 < xβ 0 + ɛ 0 xβ 0 = β 1,0 age + β 2,0 over + β 3,0 phealth for the observations at which =0, and 26 / 41

39 = (xγ + γ 1 wt + η > 0) Loss if xβ 0 + ɛ 0 cut1 0 wchange = No change if cut1 0 < xβ 0 + ɛ 0 cut2 0 Gain if cut2 0 < xβ 0 + ɛ 0 xβ 0 = β 1,0 age + β 2,0 over + β 3,0 phealth for the observations at which =0, and Loss if xβ 1 + ɛ 1 cut1 1 wchange = No change if cut1 1 < xβ 1 + ɛ 1 cut2 1 Gain if cut2 1 < xβ 1 + ɛ 1 xβ 1 = β 1,1 age + β 2,1 over + β 3,1 phealth for the observations at which =1 26 / 41

40 26 / 41 = (xγ + γ 1 wt + η > 0) Loss if xβ 0 + ɛ 0 cut1 0 wchange = No change if cut1 0 < xβ 0 + ɛ 0 cut2 0 Gain if cut2 0 < xβ 0 + ɛ 0 xβ 0 = β 1,0 age + β 2,0 over + β 3,0 phealth for the observations at which =0, and Loss if xβ 1 + ɛ 1 cut1 1 wchange = No change if cut1 1 < xβ 1 + ɛ 1 cut2 1 Gain if cut2 1 < xβ 1 + ɛ 1 xβ 1 = β 1,1 age + β 2,1 over + β 3,1 phealth for the observations at which =1 ɛ 0, ɛ 1, and η are correlated and joint normal ρ 0 correlation between ɛ 0 and η ρ 1 correlation between ɛ 1 and η

41 Endogenous treatment model. eoprobit wchange age over phealth, /// > entreat( = age over phealth wt, pocorr ) /// > vce(robust) vsquish nolog Extended ordered probit regression Number of obs = 1,884 Wald chi2(6) = Log pseudolikelihood = Prob > chi2 = Robust Coef. Std. Err. z P> z [95% Conf. Interval] wchange #c.age No Yes #c.over No Yes # c.phealth No Yes age over phealth / 41 wt

42 Robust Coef. Std. Err. z P> z [95% Conf. Interval] wchange #c.age No Yes #c.over No Yes # c.phealth No Yes age over phealth wt _cons /wchange #c.cut1 No Yes #c.cut2 No Yes corr(e., e.wchange) 28 / 41

43 No Yes #c.over No Yes # c.phealth No Yes age over phealth wt _cons /wchange #c.cut1 No Yes #c.cut2 No Yes corr(e., e.wchange) No Yes / 41

44 . estat teffects Predictive margins Number of obs = 3,000 ATE_Pr0 : Pr(wchange=0=Loss) ATE_Pr1 : Pr(wchange=1=No change) ATE_Pr2 : Pr(wchange=2=Gain) Unconditional Margin Std. Err. z P> z [95% Conf. Interval] ATE_Pr0 (Yes vs No) ATE_Pr1 (Yes vs No) ATE_Pr2 (Yes vs No) When everyone joins the ram instead of when no one participants in the ram, 30 / 41

45 . estat teffects Predictive margins Number of obs = 3,000 ATE_Pr0 : Pr(wchange=0=Loss) ATE_Pr1 : Pr(wchange=1=No change) ATE_Pr2 : Pr(wchange=2=Gain) Unconditional Margin Std. Err. z P> z [95% Conf. Interval] ATE_Pr0 (Yes vs No) ATE_Pr1 (Yes vs No) ATE_Pr2 (Yes vs No) When everyone joins the ram instead of when no one participants in the ram, On average, the probablity of Loss goes up by.1 30 / 41

46 . estat teffects Predictive margins Number of obs = 3,000 ATE_Pr0 : Pr(wchange=0=Loss) ATE_Pr1 : Pr(wchange=1=No change) ATE_Pr2 : Pr(wchange=2=Gain) Unconditional Margin Std. Err. z P> z [95% Conf. Interval] ATE_Pr0 (Yes vs No) ATE_Pr1 (Yes vs No) ATE_Pr2 (Yes vs No) / 41 When everyone joins the ram instead of when no one participants in the ram, On average, the probablity of Loss goes up by.1 On average, the probablity of No change does not change by much

47 When everyone joins the ram instead of when no one participants in the ram, On average, the probablity of Loss goes up by.1 On average, the probablity of No change does not change by much 30 / 41 On average, the probablity of Gain goes down.09. estat teffects Predictive margins Number of obs = 3,000 ATE_Pr0 : Pr(wchange=0=Loss) ATE_Pr1 : Pr(wchange=1=No change) ATE_Pr2 : Pr(wchange=2=Gain) Unconditional Margin Std. Err. z P> z [95% Conf. Interval] ATE_Pr0 (Yes vs No) ATE_Pr1 (Yes vs No) ATE_Pr2 (Yes vs No)

48 . margins r., /// > predict(fix() outlevel("loss")) /// > predict(fix() outlevel("no change")) /// > predict(fix() outlevel("gain")) /// > contrast(nowald) vce(unconditional) noesample Contrasts of predictive margins 1._predict : Pr(wchange==Loss), predict(fix() outlevel("loss")) 2._predict : Pr(wchange==No change), predict(fix() outlevel("no change")) 3._predict : Pr(wchange==Gain), predict(fix() outlevel("gain")) Unconditional Contrast Std. Err. [95% Conf. (Yes vs No) (Yes vs No) (Yes vs No) / 41

49 32 / 41 fix() gets us the effect of the ram that is not contaminated by the selection effect/correlation between ɛ and η that increases the participation among people more likely to lose weight

50 fix() gets us the effect of the ram that is not contaminated by the selection effect/correlation between ɛ and η that increases the participation among people more likely to lose weight predict(fix()) tells margins to specify fix() to predict when computing each predicted probability 32 / 41

51 fix() gets us the effect of the ram that is not contaminated by the selection effect/correlation between ɛ and η that increases the participation among people more likely to lose weight predict(fix()) tells margins to specify fix() to predict when computing each predicted probability fix() causes the value of not to affect ɛ, even though they are correlated 32 / 41

52 fix() gets us the effect of the ram that is not contaminated by the selection effect/correlation between ɛ and η that increases the participation among people more likely to lose weight predict(fix()) tells margins to specify fix() to predict when computing each predicted probability fix() causes the value of not to affect ɛ, even though they are correlated fix() specifies that the part of ɛ that is correlated with be integrated out 32 / 41

53 This type of prediction is sometimes called the structural prediction or an average structural function; see Blundell and Powell (2003), Blundell and Powell (2004), Wooldridge (2005), Wooldridge (2010), and Wooldridge (2014), The difference between the mean of the average of the structural predictions when =1 and the mean of the average of the structural predictions when =0 is an average treatment effect (Blundell and Powell (2003) and Wooldridge (2014)) 33 / 41

54 Endogenous sample selection Reconsider our fictional weight-loss ram 34 / 41

55 Endogenous sample selection Reconsider our fictional weight-loss ram Some ram participants and some nonparticipants will not show up for the final weigh in This is commonly known as lost to follow up 34 / 41

56 Endogenous sample selection Reconsider our fictional weight-loss ram Some ram participants and some nonparticipants will not show up for the final weigh in This is commonly known as lost to follow up If unobservables that affect whether someone is lost to follow up 34 / 41

57 Endogenous sample selection Reconsider our fictional weight-loss ram Some ram participants and some nonparticipants will not show up for the final weigh in This is commonly known as lost to follow up If unobservables that affect whether someone is lost to follow up are independent of the unobservables that affect ram participantion 34 / 41

58 Endogenous sample selection Reconsider our fictional weight-loss ram Some ram participants and some nonparticipants will not show up for the final weigh in This is commonly known as lost to follow up If unobservables that affect whether someone is lost to follow up are independent of the unobservables that affect ram participantion and they are independent of the unobservables that affect the outcomes with and without the ram, 34 / 41

59 Endogenous sample selection Reconsider our fictional weight-loss ram Some ram participants and some nonparticipants will not show up for the final weigh in This is commonly known as lost to follow up If unobservables that affect whether someone is lost to follow up are independent of the unobservables that affect ram participantion and they are independent of the unobservables that affect the outcomes with and without the ram, the previously discussed estimator consistently estimates the effects 34 / 41

60 Endogenous sample selection Reconsider our fictional weight-loss ram Some ram participants and some nonparticipants will not show up for the final weigh in This is commonly known as lost to follow up If unobservables that affect whether someone is lost to follow up are independent of the unobservables that affect ram participantion and they are independent of the unobservables that affect the outcomes with and without the ram, the previously discussed estimator consistently estimates the effects Any dependence among the unobservables must be modeled 34 / 41

61 insamp = (xα + α 1 wtsamp + ξ > 0) 35 / 41

62 insamp = (xα + α 1 wtsamp + ξ > 0) = (xγ + γ 1 wt + η > 0) 35 / 41

63 insamp = (xα + α 1 wtsamp + ξ > 0) = (xγ + γ 1 wt + η > 0) Loss if xβ 0 + ɛ 0 cut1 0 wchange = No change if cut1 0 < xβ 0 + ɛ 0 cut2 0 Gain if cut2 0 < xβ 0 + ɛ 0 xβ 0 = β 1,0 age + β 2,0 over + β 3,0 phealth for the observations at which =0, and 35 / 41

64 insamp = (xα + α 1 wtsamp + ξ > 0) = (xγ + γ 1 wt + η > 0) Loss if xβ 0 + ɛ 0 cut1 0 wchange = No change if cut1 0 < xβ 0 + ɛ 0 cut2 0 Gain if cut2 0 < xβ 0 + ɛ 0 xβ 0 = β 1,0 age + β 2,0 over + β 3,0 phealth for the observations at which =0, and Loss if xβ 1 + ɛ 1 cut1 1 wchange = No change if cut1 1 < xβ 1 + ɛ 1 cut2 1 Gain if cut2 1 < xβ 1 + ɛ 1 xβ 1 = β 1,1 age + β 2,1 over + β 3,1 phealth for the observations at which =1 35 / 41

65 insamp = (xα + α 1 wtsamp + ξ > 0) = (xγ + γ 1 wt + η > 0) Loss if xβ 0 + ɛ 0 cut1 0 wchange = No change if cut1 0 < xβ 0 + ɛ 0 cut2 0 Gain if cut2 0 < xβ 0 + ɛ 0 xβ 0 = β 1,0 age + β 2,0 over + β 3,0 phealth for the observations at which =0, and Loss if xβ 1 + ɛ 1 cut1 1 wchange = No change if cut1 1 < xβ 1 + ɛ 1 cut2 1 Gain if cut2 1 < xβ 1 + ɛ 1 xβ 1 = β 1,1 age + β 2,1 over + β 3,1 phealth for the observations at which =1 ξ, ɛ 0, ɛ 1, and η are correlated and joint normal distinct correlations between each treatment error and others 35 / 41

66 . eoprobit wchange age over phealth, /// > entreat( = age over phealth wt, pocorr ) /// > select(insamp = age over phealth wtsamp ) /// > vce(robust) vsquish nolog Extended ordered probit regression Number of obs = 3,000 Selected = 1,884 Nonselected = 1,116 Wald chi2(6) = Log pseudolikelihood = Prob > chi2 = Robust Coef. Std. Err. z P> z [95% Conf. Interval] wchange #c.age No Yes #c.over No Yes # c.phealth No Yes insamp age over phealth wtsamp 36 /

67 c.phealth No Yes insamp age over phealth wtsamp _cons age over phealth wt _cons /wchange #c.cut1 No Yes #c.cut2 No Yes corr(e.ins ~ p, e.wchange) No Yes corr(e., e.wchange) 37 / 41

68 #c.cut1 No Yes #c.cut2 No Yes corr(e.ins ~ p, e.wchange) No Yes corr(e., e.wchange) No Yes corr(e., e.insamp) Nonzero correlations between e.insamp and e.wchange imply endogenous sample selection for outcomes Nonzero correlations between e. and e.wchange imply endogenous treatment assignment 38 / 41

69 . estat teffects Predictive margins Number of obs = 3,000 ATE_Pr0 : Pr(wchange=0=Loss) ATE_Pr1 : Pr(wchange=1=No change) ATE_Pr2 : Pr(wchange=2=Gain) Unconditional Margin Std. Err. z P> z [95% Conf. Interval] ATE_Pr0 (Yes vs No) ATE_Pr1 (Yes vs No) ATE_Pr2 (Yes vs No) When everyone joins the ram instead of when no one participants in the ram, 39 / 41

70 . estat teffects Predictive margins Number of obs = 3,000 ATE_Pr0 : Pr(wchange=0=Loss) ATE_Pr1 : Pr(wchange=1=No change) ATE_Pr2 : Pr(wchange=2=Gain) Unconditional Margin Std. Err. z P> z [95% Conf. Interval] ATE_Pr0 (Yes vs No) ATE_Pr1 (Yes vs No) ATE_Pr2 (Yes vs No) When everyone joins the ram instead of when no one participants in the ram, On average, the probablity of Loss goes up by / 41

71 . estat teffects Predictive margins Number of obs = 3,000 ATE_Pr0 : Pr(wchange=0=Loss) ATE_Pr1 : Pr(wchange=1=No change) ATE_Pr2 : Pr(wchange=2=Gain) Unconditional Margin Std. Err. z P> z [95% Conf. Interval] ATE_Pr0 (Yes vs No) ATE_Pr1 (Yes vs No) ATE_Pr2 (Yes vs No) When everyone joins the ram instead of when no one participants in the ram, On average, the probablity of Loss goes up by.14 On average, the probablity of No change does not change 39 / 41

72 . estat teffects Predictive margins Number of obs = 3,000 ATE_Pr0 : Pr(wchange=0=Loss) ATE_Pr1 : Pr(wchange=1=No change) ATE_Pr2 : Pr(wchange=2=Gain) Unconditional Margin Std. Err. z P> z [95% Conf. Interval] ATE_Pr0 (Yes vs No) ATE_Pr1 (Yes vs No) ATE_Pr2 (Yes vs No) / 41 When everyone joins the ram instead of when no one participants in the ram, On average, the probablity of Loss goes up by.14 On average, the probablity of No change does not change On average, the probablity of Gain goes down.16

73 . margins r., /// > predict(fix() outlevel("loss")) /// > predict(fix() outlevel("no change")) /// > predict(fix() outlevel("gain")) /// > contrast(nowald) vce(unconditional) noesample Contrasts of predictive margins 1._predict : Pr(wchange==Loss), predict(fix() outlevel("loss")) 2._predict : Pr(wchange==No change), predict(fix() outlevel("no change")) 3._predict : Pr(wchange==Gain), predict(fix() outlevel("gain")) Unconditional Contrast Std. Err. [95% Conf. (Yes vs No) (Yes vs No) (Yes vs No) When everyone joins the ram instead of when no one participants in the ram, 40 / 41

74 . margins r., /// > predict(fix() outlevel("loss")) /// > predict(fix() outlevel("no change")) /// > predict(fix() outlevel("gain")) /// > contrast(nowald) vce(unconditional) noesample Contrasts of predictive margins 1._predict : Pr(wchange==Loss), predict(fix() outlevel("loss")) 2._predict : Pr(wchange==No change), predict(fix() outlevel("no change")) 3._predict : Pr(wchange==Gain), predict(fix() outlevel("gain")) Unconditional Contrast Std. Err. [95% Conf. (Yes vs No) (Yes vs No) (Yes vs No) When everyone joins the ram instead of when no one participants in the ram, On average, the probablity of Loss goes up by / 41

75 . margins r., /// > predict(fix() outlevel("loss")) /// > predict(fix() outlevel("no change")) /// > predict(fix() outlevel("gain")) /// > contrast(nowald) vce(unconditional) noesample Contrasts of predictive margins 1._predict : Pr(wchange==Loss), predict(fix() outlevel("loss")) 2._predict : Pr(wchange==No change), predict(fix() outlevel("no change")) 3._predict : Pr(wchange==Gain), predict(fix() outlevel("gain")) Unconditional Contrast Std. Err. [95% Conf. (Yes vs No) (Yes vs No) (Yes vs No) When everyone joins the ram instead of when no one participants in the ram, On average, the probablity of Loss goes up by.14 On average, the probablity of No change does not change 40 / 41

76 . margins r., /// > predict(fix() outlevel("loss")) /// > predict(fix() outlevel("no change")) /// > predict(fix() outlevel("gain")) /// > contrast(nowald) vce(unconditional) noesample Contrasts of predictive margins 1._predict : Pr(wchange==Loss), predict(fix() outlevel("loss")) 2._predict : Pr(wchange==No change), predict(fix() outlevel("no change")) 3._predict : Pr(wchange==Gain), predict(fix() outlevel("gain")) Unconditional Contrast Std. Err. [95% Conf. (Yes vs No) (Yes vs No) (Yes vs No) / 41 When everyone joins the ram instead of when no one participants in the ram, On average, the probablity of Loss goes up by.14 On average, the probablity of No change does not change On average, the probablity of Gain goes down.16

77 More about ERM commands The commands eregress, eprobit, and eintreg fit ERMs handle continuous-and-unbounded, binary, and censored/corner outcomes Look at for more examples and a wealth of details 41 / 41

78 Bibliography Blundell, R. W., and J. L. Powell Endogeity in nonparametric and semiparametric regression models. In Advances in Economics and Econometrics: Theory and Applications, Eighth World Congress, ed. M. Dewatripont, L. P. Hansen, and S. J. Turnovsky, vol. 2, Cambridge: Cambridge University Press Endogeneity in semiparametric binary response models. Review of Economic Studies 71: Wooldridge, J. M Unobserved heterogeneity and estimation of average partial effects. In Identification and Inference for Econometric Models: Essays in Honor of Thomas Rothenberg, ed. D. K. Andrews and J. H. Stock, Cambridge, UK: Cambridge: Cambridge University Press Econometric Analysis of Cross Section and Panel Data. 2nd ed. Cambridge, Massachusetts: MIT Press Quasi-maximum likelihood estimation and testing for nonlinear models with endogenous explanatory variables. Journal of Econometrics 182: / 41

Extended regression models using Stata 15

Extended regression models using Stata 15 Extended regression models using Stata 15 Charles Lindsey Senior Statistician and Software Developer Stata July 19, 2018 Lindsey (Stata) ERM July 19, 2018 1 / 103 Introduction Common problems in observational

More information

Description Remarks and examples Also see

Description Remarks and examples Also see Title stata.com intro 6 Model interpretation Description Description Remarks and examples Also see After you fit a model using one of the ERM commands, you can generally interpret the coefficients in the

More information

Description Remarks and examples Also see

Description Remarks and examples Also see Title stata.com intro 6 Model interpretation Description Remarks and examples Also see Description After you fit a model using one of the ERM commands, you can interpret the coefficients in the usual way.

More information

Generalized method of moments estimation in Stata 11

Generalized method of moments estimation in Stata 11 Generalized method of moments estimation in Stata 11 David M. Drukker StataCorp Stata Conference Washington, DC 2009 1 / 27 Outline 1 A quick introduction to GMM 2 gmm examples Ordinary least squares Two-stage

More information

ECON 594: Lecture #6

ECON 594: Lecture #6 ECON 594: Lecture #6 Thomas Lemieux Vancouver School of Economics, UBC May 2018 1 Limited dependent variables: introduction Up to now, we have been implicitly assuming that the dependent variable, y, was

More information

Jeffrey M. Wooldridge Michigan State University

Jeffrey M. Wooldridge Michigan State University Fractional Response Models with Endogenous Explanatory Variables and Heterogeneity Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. Fractional Probit with Heteroskedasticity 3. Fractional

More information

Generalized linear models

Generalized linear models Generalized linear models Christopher F Baum ECON 8823: Applied Econometrics Boston College, Spring 2016 Christopher F Baum (BC / DIW) Generalized linear models Boston College, Spring 2016 1 / 1 Introduction

More information

Dealing With and Understanding Endogeneity

Dealing With and Understanding Endogeneity Dealing With and Understanding Endogeneity Enrique Pinzón StataCorp LP October 20, 2016 Barcelona (StataCorp LP) October 20, 2016 Barcelona 1 / 59 Importance of Endogeneity Endogeneity occurs when a variable,

More information

fhetprob: A fast QMLE Stata routine for fractional probit models with multiplicative heteroskedasticity

fhetprob: A fast QMLE Stata routine for fractional probit models with multiplicative heteroskedasticity fhetprob: A fast QMLE Stata routine for fractional probit models with multiplicative heteroskedasticity Richard Bluhm May 26, 2013 Introduction Stata can easily estimate a binary response probit models

More information

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. The Sharp RD Design 3.

More information

Estimating and Interpreting Effects for Nonlinear and Nonparametric Models

Estimating and Interpreting Effects for Nonlinear and Nonparametric Models Estimating and Interpreting Effects for Nonlinear and Nonparametric Models Enrique Pinzón September 18, 2018 September 18, 2018 1 / 112 Objective Build a unified framework to ask questions about model

More information

Control Function and Related Methods: Nonlinear Models

Control Function and Related Methods: Nonlinear Models Control Function and Related Methods: Nonlinear Models Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. General Approach 2. Nonlinear

More information

A Journey to Latent Class Analysis (LCA)

A Journey to Latent Class Analysis (LCA) A Journey to Latent Class Analysis (LCA) Jeff Pitblado StataCorp LLC 2017 Nordic and Baltic Stata Users Group Meeting Stockholm, Sweden Outline Motivation by: prefix if clause suest command Factor variables

More information

Exploring Marginal Treatment Effects

Exploring Marginal Treatment Effects Exploring Marginal Treatment Effects Flexible estimation using Stata Martin Eckhoff Andresen Statistics Norway Oslo, September 12th 2018 Martin Andresen (SSB) Exploring MTEs Oslo, 2018 1 / 25 Introduction

More information

CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M.

CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M. CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. Linear

More information

Latent class analysis and finite mixture models with Stata

Latent class analysis and finite mixture models with Stata Latent class analysis and finite mixture models with Stata Isabel Canette Principal Mathematician and Statistician StataCorp LLC 2017 Stata Users Group Meeting Madrid, October 19th, 2017 Introduction Latent

More information

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008 A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. Linear-in-Parameters Models: IV versus Control Functions 2. Correlated

More information

Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 2017, Chicago, Illinois

Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 2017, Chicago, Illinois Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 217, Chicago, Illinois Outline 1. Opportunities and challenges of panel data. a. Data requirements b. Control

More information

A simple alternative to the linear probability model for binary choice models with endogenous regressors

A simple alternative to the linear probability model for binary choice models with endogenous regressors A simple alternative to the linear probability model for binary choice models with endogenous regressors Christopher F Baum, Yingying Dong, Arthur Lewbel, Tao Yang Boston College/DIW Berlin, U.Cal Irvine,

More information

leebounds: Lee s (2009) treatment effects bounds for non-random sample selection for Stata

leebounds: Lee s (2009) treatment effects bounds for non-random sample selection for Stata leebounds: Lee s (2009) treatment effects bounds for non-random sample selection for Stata Harald Tauchmann (RWI & CINCH) Rheinisch-Westfälisches Institut für Wirtschaftsforschung (RWI) & CINCH Health

More information

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Michele Aquaro University of Warwick This version: July 21, 2016 1 / 31 Reading material Textbook: Introductory

More information

Practice 2SLS with Artificial Data Part 1

Practice 2SLS with Artificial Data Part 1 Practice 2SLS with Artificial Data Part 1 Yona Rubinstein July 2016 Yona Rubinstein (LSE) Practice 2SLS with Artificial Data Part 1 07/16 1 / 16 Practice with Artificial Data In this note we use artificial

More information

Parametric and semiparametric estimation of ordered response models with sample selection and individual-specific thresholds

Parametric and semiparametric estimation of ordered response models with sample selection and individual-specific thresholds Parametric and semiparametric estimation of ordered response models with sample selection and individual-specific thresholds Giuseppe De Luca ISFOL Valeria Perotti ISFOL Claudio Rossetti Tor Vergata University

More information

Binary Dependent Variables

Binary Dependent Variables Binary Dependent Variables In some cases the outcome of interest rather than one of the right hand side variables - is discrete rather than continuous Binary Dependent Variables In some cases the outcome

More information

Case of single exogenous (iv) variable (with single or multiple mediators) iv à med à dv. = β 0. iv i. med i + α 1

Case of single exogenous (iv) variable (with single or multiple mediators) iv à med à dv. = β 0. iv i. med i + α 1 Mediation Analysis: OLS vs. SUR vs. ISUR vs. 3SLS vs. SEM Note by Hubert Gatignon July 7, 2013, updated November 15, 2013, April 11, 2014, May 21, 2016 and August 10, 2016 In Chap. 11 of Statistical Analysis

More information

Using generalized structural equation models to fit customized models without programming, and taking advantage of new features of -margins-

Using generalized structural equation models to fit customized models without programming, and taking advantage of new features of -margins- Using generalized structural equation models to fit customized models without programming, and taking advantage of new features of -margins- Isabel Canette Principal Mathematician and Statistician StataCorp

More information

Lab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p )

Lab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p ) Lab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p. 376-390) BIO656 2009 Goal: To see if a major health-care reform which took place in 1997 in Germany was

More information

-redprob- A Stata program for the Heckman estimator of the random effects dynamic probit model

-redprob- A Stata program for the Heckman estimator of the random effects dynamic probit model -redprob- A Stata program for the Heckman estimator of the random effects dynamic probit model Mark B. Stewart University of Warwick January 2006 1 The model The latent equation for the random effects

More information

2. We care about proportion for categorical variable, but average for numerical one.

2. We care about proportion for categorical variable, but average for numerical one. Probit Model 1. We apply Probit model to Bank data. The dependent variable is deny, a dummy variable equaling one if a mortgage application is denied, and equaling zero if accepted. The key regressor is

More information

GMM Estimation in Stata

GMM Estimation in Stata GMM Estimation in Stata Econometrics I Department of Economics Universidad Carlos III de Madrid Master in Industrial Economics and Markets 1 Outline Motivation 1 Motivation 2 3 4 2 Motivation 3 Stata and

More information

Selection endogenous dummy ordered probit, and selection endogenous dummy dynamic ordered probit models

Selection endogenous dummy ordered probit, and selection endogenous dummy dynamic ordered probit models Selection endogenous dummy ordered probit, and selection endogenous dummy dynamic ordered probit models Massimiliano Bratti & Alfonso Miranda In many fields of applied work researchers need to model an

More information

Nonlinear Econometric Analysis (ECO 722) : Homework 2 Answers. (1 θ) if y i = 0. which can be written in an analytically more convenient way as

Nonlinear Econometric Analysis (ECO 722) : Homework 2 Answers. (1 θ) if y i = 0. which can be written in an analytically more convenient way as Nonlinear Econometric Analysis (ECO 722) : Homework 2 Answers 1. Consider a binary random variable y i that describes a Bernoulli trial in which the probability of observing y i = 1 in any draw is given

More information

Dynamic Panels. Chapter Introduction Autoregressive Model

Dynamic Panels. Chapter Introduction Autoregressive Model Chapter 11 Dynamic Panels This chapter covers the econometrics methods to estimate dynamic panel data models, and presents examples in Stata to illustrate the use of these procedures. The topics in this

More information

Some Stata commands for endogeneity in nonlinear panel-data models

Some Stata commands for endogeneity in nonlinear panel-data models Some Stata commands for endogeneity in nonlinear panel-data models David M. Drukker Director of Econometrics Stata 2014 German Stata Users Group meeting June 13, 2014 1 / 51 Overview Two approaches to

More information

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. The Basic Methodology 2. How Should We View Uncertainty in DD Settings?

More information

Treatment interactions with nonexperimental data in Stata

Treatment interactions with nonexperimental data in Stata The Stata Journal (2011) 11, umber4,pp.1 11 Treatment interactions with nonexperimental data in Stata Graham K. Brown Centre for Development Studies University of Bath Bath, UK g.k.brown@bath.ac.uk Thanos

More information

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

4 Instrumental Variables Single endogenous variable One continuous instrument. 2 Econ 495 - Econometric Review 1 Contents 4 Instrumental Variables 2 4.1 Single endogenous variable One continuous instrument. 2 4.2 Single endogenous variable more than one continuous instrument..........................

More information

ECON2228 Notes 10. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 48

ECON2228 Notes 10. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 48 ECON2228 Notes 10 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 10 2014 2015 1 / 48 Serial correlation and heteroskedasticity in time series regressions Chapter 12:

More information

Chapter 11. Regression with a Binary Dependent Variable

Chapter 11. Regression with a Binary Dependent Variable Chapter 11 Regression with a Binary Dependent Variable 2 Regression with a Binary Dependent Variable (SW Chapter 11) So far the dependent variable (Y) has been continuous: district-wide average test score

More information

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser Simultaneous Equations with Error Components Mike Bronner Marko Ledic Anja Breitwieser PRESENTATION OUTLINE Part I: - Simultaneous equation models: overview - Empirical example Part II: - Hausman and Taylor

More information

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

4 Instrumental Variables Single endogenous variable One continuous instrument. 2 Econ 495 - Econometric Review 1 Contents 4 Instrumental Variables 2 4.1 Single endogenous variable One continuous instrument. 2 4.2 Single endogenous variable more than one continuous instrument..........................

More information

Spatial Regression Models: Identification strategy using STATA TATIANE MENEZES PIMES/UFPE

Spatial Regression Models: Identification strategy using STATA TATIANE MENEZES PIMES/UFPE Spatial Regression Models: Identification strategy using STATA TATIANE MENEZES PIMES/UFPE Intruduction Spatial regression models are usually intended to estimate parameters related to the interaction of

More information

ECON2228 Notes 10. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 54

ECON2228 Notes 10. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 54 ECON2228 Notes 10 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 10 2014 2015 1 / 54 erial correlation and heteroskedasticity in time series regressions Chapter 12:

More information

Estimating the Fractional Response Model with an Endogenous Count Variable

Estimating the Fractional Response Model with an Endogenous Count Variable Estimating the Fractional Response Model with an Endogenous Count Variable Estimating FRM with CEEV Hoa Bao Nguyen Minh Cong Nguyen Michigan State Universtiy American University July 2009 Nguyen and Nguyen

More information

Lecture#12. Instrumental variables regression Causal parameters III

Lecture#12. Instrumental variables regression Causal parameters III Lecture#12 Instrumental variables regression Causal parameters III 1 Demand experiment, market data analysis & simultaneous causality 2 Simultaneous causality Your task is to estimate the demand function

More information

Econometric Analysis of Cross Section and Panel Data

Econometric Analysis of Cross Section and Panel Data Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND

More information

Sociology 63993, Exam 2 Answer Key [DRAFT] March 27, 2015 Richard Williams, University of Notre Dame,

Sociology 63993, Exam 2 Answer Key [DRAFT] March 27, 2015 Richard Williams, University of Notre Dame, Sociology 63993, Exam 2 Answer Key [DRAFT] March 27, 2015 Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ I. True-False. (20 points) Indicate whether the following statements

More information

xtdpdqml: Quasi-maximum likelihood estimation of linear dynamic short-t panel data models

xtdpdqml: Quasi-maximum likelihood estimation of linear dynamic short-t panel data models xtdpdqml: Quasi-maximum likelihood estimation of linear dynamic short-t panel data models Sebastian Kripfganz University of Exeter Business School, Department of Economics, Exeter, UK UK Stata Users Group

More information

Mediation Analysis: OLS vs. SUR vs. 3SLS Note by Hubert Gatignon July 7, 2013, updated November 15, 2013

Mediation Analysis: OLS vs. SUR vs. 3SLS Note by Hubert Gatignon July 7, 2013, updated November 15, 2013 Mediation Analysis: OLS vs. SUR vs. 3SLS Note by Hubert Gatignon July 7, 2013, updated November 15, 2013 In Chap. 11 of Statistical Analysis of Management Data (Gatignon, 2014), tests of mediation are

More information

point estimates, standard errors, testing, and inference for nonlinear combinations

point estimates, standard errors, testing, and inference for nonlinear combinations Title xtreg postestimation Postestimation tools for xtreg Description The following postestimation commands are of special interest after xtreg: command description xttest0 Breusch and Pagan LM test for

More information

Soc 63993, Homework #7 Answer Key: Nonlinear effects/ Intro to path analysis

Soc 63993, Homework #7 Answer Key: Nonlinear effects/ Intro to path analysis Soc 63993, Homework #7 Answer Key: Nonlinear effects/ Intro to path analysis Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 20, 2015 Problem 1. The files

More information

Instrumental Variables, Simultaneous and Systems of Equations

Instrumental Variables, Simultaneous and Systems of Equations Chapter 6 Instrumental Variables, Simultaneous and Systems of Equations 61 Instrumental variables In the linear regression model y i = x iβ + ε i (61) we have been assuming that bf x i and ε i are uncorrelated

More information

Recent Developments in Multilevel Modeling

Recent Developments in Multilevel Modeling Recent Developments in Multilevel Modeling Roberto G. Gutierrez Director of Statistics StataCorp LP 2007 North American Stata Users Group Meeting, Boston R. Gutierrez (StataCorp) Multilevel Modeling August

More information

Thoughts on Heterogeneity in Econometric Models

Thoughts on Heterogeneity in Econometric Models Thoughts on Heterogeneity in Econometric Models Presidential Address Midwest Economics Association March 19, 2011 Jeffrey M. Wooldridge Michigan State University 1 1. Introduction Much of current econometric

More information

Model and Working Correlation Structure Selection in GEE Analyses of Longitudinal Data

Model and Working Correlation Structure Selection in GEE Analyses of Longitudinal Data The 3rd Australian and New Zealand Stata Users Group Meeting, Sydney, 5 November 2009 1 Model and Working Correlation Structure Selection in GEE Analyses of Longitudinal Data Dr Jisheng Cui Public Health

More information

Analyzing spatial autoregressive models using Stata

Analyzing spatial autoregressive models using Stata Analyzing spatial autoregressive models using Stata David M. Drukker StataCorp Summer North American Stata Users Group meeting July 24-25, 2008 Part of joint work with Ingmar Prucha and Harry Kelejian

More information

Multivariate probit regression using simulated maximum likelihood

Multivariate probit regression using simulated maximum likelihood Multivariate probit regression using simulated maximum likelihood Lorenzo Cappellari & Stephen P. Jenkins ISER, University of Essex stephenj@essex.ac.uk 1 Overview Introduction and motivation The model

More information

Marginal Effects for Continuous Variables Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 20, 2018

Marginal Effects for Continuous Variables Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 20, 2018 Marginal Effects for Continuous Variables Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 20, 2018 References: Long 1997, Long and Freese 2003 & 2006 & 2014,

More information

STATA EXTENDED REGRESSION MODELS REFERENCE MANUAL RELEASE 15

STATA EXTENDED REGRESSION MODELS REFERENCE MANUAL RELEASE 15 STATA EXTENDED REGRESSION MODELS REFERENCE MANUAL RELEASE 15 A Stata Press Publication StataCorp LLC College Station, Texas Copyright c 1985 2017 StataCorp LLC All rights reserved Version 15 Published

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Princeton University Asian Political Methodology Conference University of Sydney Joint

More information

Homework Solutions Applied Logistic Regression

Homework Solutions Applied Logistic Regression Homework Solutions Applied Logistic Regression WEEK 6 Exercise 1 From the ICU data, use as the outcome variable vital status (STA) and CPR prior to ICU admission (CPR) as a covariate. (a) Demonstrate that

More information

Day 2A Instrumental Variables, Two-stage Least Squares and Generalized Method of Moments

Day 2A Instrumental Variables, Two-stage Least Squares and Generalized Method of Moments Day 2A nstrumental Variables, Two-stage Least Squares and Generalized Method of Moments c A. Colin Cameron Univ. of Calif.- Davis Frontiers in Econometrics Bavarian Graduate Program in Economics. Based

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University

More information

Analyzing Proportions

Analyzing Proportions Institut für Soziologie Eberhard Karls Universität Tübingen www.maartenbuis.nl The problem A proportion is bounded between 0 and 1, this means that: the effect of explanatory variables tends to be non-linear,

More information

Stata tip 63: Modeling proportions

Stata tip 63: Modeling proportions The Stata Journal (2008) 8, Number 2, pp. 299 303 Stata tip 63: Modeling proportions Christopher F. Baum Department of Economics Boston College Chestnut Hill, MA baum@bc.edu You may often want to model

More information

Problem set - Selection and Diff-in-Diff

Problem set - Selection and Diff-in-Diff Problem set - Selection and Diff-in-Diff 1. You want to model the wage equation for women You consider estimating the model: ln wage = α + β 1 educ + β 2 exper + β 3 exper 2 + ɛ (1) Read the data into

More information

Flexible Estimation of Treatment Effect Parameters

Flexible Estimation of Treatment Effect Parameters Flexible Estimation of Treatment Effect Parameters Thomas MaCurdy a and Xiaohong Chen b and Han Hong c Introduction Many empirical studies of program evaluations are complicated by the presence of both

More information

Practice exam questions

Practice exam questions Practice exam questions Nathaniel Higgins nhiggins@jhu.edu, nhiggins@ers.usda.gov 1. The following question is based on the model y = β 0 + β 1 x 1 + β 2 x 2 + β 3 x 3 + u. Discuss the following two hypotheses.

More information

Binary Models with Endogenous Explanatory Variables

Binary Models with Endogenous Explanatory Variables Binary Models with Endogenous Explanatory Variables Class otes Manuel Arellano ovember 7, 2007 Revised: January 21, 2008 1 Introduction In Part I we considered linear and non-linear models with additive

More information

Estimation of Treatment Effects under Essential Heterogeneity

Estimation of Treatment Effects under Essential Heterogeneity Estimation of Treatment Effects under Essential Heterogeneity James Heckman University of Chicago and American Bar Foundation Sergio Urzua University of Chicago Edward Vytlacil Columbia University March

More information

Handout 12. Endogeneity & Simultaneous Equation Models

Handout 12. Endogeneity & Simultaneous Equation Models Handout 12. Endogeneity & Simultaneous Equation Models In which you learn about another potential source of endogeneity caused by the simultaneous determination of economic variables, and learn how to

More information

raise Coef. Std. Err. z P> z [95% Conf. Interval]

raise Coef. Std. Err. z P> z [95% Conf. Interval] 1 We will use real-world data, but a very simple and naive model to keep the example easy to understand. What is interesting about the example is that the outcome of interest, perhaps the probability or

More information

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Ninth ARTNeT Capacity Building Workshop for Trade Research Trade Flows and Trade Policy Analysis Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric

More information

Lab 11 - Heteroskedasticity

Lab 11 - Heteroskedasticity Lab 11 - Heteroskedasticity Spring 2017 Contents 1 Introduction 2 2 Heteroskedasticity 2 3 Addressing heteroskedasticity in Stata 3 4 Testing for heteroskedasticity 4 5 A simple example 5 1 1 Introduction

More information

Econometrics II Censoring & Truncation. May 5, 2011

Econometrics II Censoring & Truncation. May 5, 2011 Econometrics II Censoring & Truncation Måns Söderbom May 5, 2011 1 Censored and Truncated Models Recall that a corner solution is an actual economic outcome, e.g. zero expenditure on health by a household

More information

Introduction to Markov-switching regression models using the mswitch command

Introduction to Markov-switching regression models using the mswitch command Introduction to Markov-switching regression models using the mswitch command Gustavo Sánchez StataCorp May 18, 2016 Aguascalientes, México (StataCorp) Markov-switching regression in Stata May 18 1 / 1

More information

Problem Set 10: Panel Data

Problem Set 10: Panel Data Problem Set 10: Panel Data 1. Read in the data set, e11panel1.dta from the course website. This contains data on a sample or 1252 men and women who were asked about their hourly wage in two years, 2005

More information

Non-linear panel data modeling

Non-linear panel data modeling Non-linear panel data modeling Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini May 2010 Laura Magazzini (@univr.it) Non-linear panel data modeling May 2010 1

More information

Estimating Markov-switching regression models in Stata

Estimating Markov-switching regression models in Stata Estimating Markov-switching regression models in Stata Ashish Rajbhandari Senior Econometrician StataCorp LP Stata Conference 2015 Ashish Rajbhandari (StataCorp LP) Markov-switching regression Stata Conference

More information

Applied Health Economics (for B.Sc.)

Applied Health Economics (for B.Sc.) Applied Health Economics (for B.Sc.) Helmut Farbmacher Department of Economics University of Mannheim Autumn Semester 2017 Outlook 1 Linear models (OLS, Omitted variables, 2SLS) 2 Limited and qualitative

More information

Эконометрика, , 4 модуль Семинар Для Группы Э_Б2015_Э_3 Семинарист О.А.Демидова

Эконометрика, , 4 модуль Семинар Для Группы Э_Б2015_Э_3 Семинарист О.А.Демидова Эконометрика, 2017-2018, 4 модуль Семинар 3 160418 Для Группы Э_Б2015_Э_3 Семинарист ОАДемидова * Stata program * copyright C 2010 by A Colin Cameron and Pravin K Trivedi * used for "Microeconometrics

More information

Lecture 8: Instrumental Variables Estimation

Lecture 8: Instrumental Variables Estimation Lecture Notes on Advanced Econometrics Lecture 8: Instrumental Variables Estimation Endogenous Variables Consider a population model: y α y + β + β x + β x +... + β x + u i i i i k ik i Takashi Yamano

More information

Difference-in-Differences Estimation

Difference-in-Differences Estimation Difference-in-Differences Estimation Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. The Basic Methodology 2. How Should We

More information

Partial effects in fixed effects models

Partial effects in fixed effects models 1 Partial effects in fixed effects models J.M.C. Santos Silva School of Economics, University of Surrey Gordon C.R. Kemp Department of Economics, University of Essex 22 nd London Stata Users Group Meeting

More information

Reply to Manovskii s Discussion on The Limited Macroeconomic Effects of Unemployment Benefit Extensions

Reply to Manovskii s Discussion on The Limited Macroeconomic Effects of Unemployment Benefit Extensions Reply to Manovskii s Discussion on The Limited Macroeconomic Effects of Unemployment Benefit Extensions Gabriel Chodorow-Reich Harvard University and NBER Loukas Karabarbounis University of Minnesota and

More information

ECON Introductory Econometrics. Lecture 16: Instrumental variables

ECON Introductory Econometrics. Lecture 16: Instrumental variables ECON4150 - Introductory Econometrics Lecture 16: Instrumental variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 12 Lecture outline 2 OLS assumptions and when they are violated Instrumental

More information

Estimating dynamic stochastic general equilibrium models in Stata

Estimating dynamic stochastic general equilibrium models in Stata Estimating dynamic stochastic general equilibrium models in Stata David Schenck Senior Econometrician Stata 2017 Canadian Stata User Group Meeting June 9, 2017 Schenck (Stata) DSGE June 9, 2017 1 / 21

More information

Econometrics. 8) Instrumental variables

Econometrics. 8) Instrumental variables 30C00200 Econometrics 8) Instrumental variables Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Thery of IV regression Overidentification Two-stage least squates

More information

5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is

5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is Practice Final Exam Last Name:, First Name:. Please write LEGIBLY. Answer all questions on this exam in the space provided (you may use the back of any page if you need more space). Show all work but do

More information

Propensity Score Matching and Analysis TEXAS EVALUATION NETWORK INSTITUTE AUSTIN, TX NOVEMBER 9, 2018

Propensity Score Matching and Analysis TEXAS EVALUATION NETWORK INSTITUTE AUSTIN, TX NOVEMBER 9, 2018 Propensity Score Matching and Analysis TEXAS EVALUATION NETWORK INSTITUTE AUSTIN, TX NOVEMBER 9, 2018 Schedule and outline 1:00 Introduction and overview 1:15 Quasi-experimental vs. experimental designs

More information

(a) Briefly discuss the advantage of using panel data in this situation rather than pure crosssections

(a) Briefly discuss the advantage of using panel data in this situation rather than pure crosssections Answer Key Fixed Effect and First Difference Models 1. See discussion in class.. David Neumark and William Wascher published a study in 199 of the effect of minimum wages on teenage employment using a

More information

APEC 8212: Econometric Analysis II

APEC 8212: Econometric Analysis II APEC 8212: Econometric Analysis II Instructor: Paul Glewwe Spring, 2014 Office: 337a Ruttan Hall (formerly Classroom Office Building) Phone: 612-625-0225 E-Mail: pglewwe@umn.edu Class Website: http://faculty.apec.umn.edu/pglewwe/apec8212.html

More information

Models of Qualitative Binary Response

Models of Qualitative Binary Response Models of Qualitative Binary Response Probit and Logit Models October 6, 2015 Dependent Variable as a Binary Outcome Suppose we observe an economic choice that is a binary signal. The focus on the course

More information

ECON Introductory Econometrics. Lecture 11: Binary dependent variables

ECON Introductory Econometrics. Lecture 11: Binary dependent variables ECON4150 - Introductory Econometrics Lecture 11: Binary dependent variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 11 Lecture Outline 2 The linear probability model Nonlinear probability

More information

Dynamic stochastic general equilibrium models in Stata 15

Dynamic stochastic general equilibrium models in Stata 15 Dynamic stochastic general equilibrium models in Stata 15 David Schenck Senior Econometrician Stata October 11, 2017 Schenck (Stata) DSGE October 11, 2017 1 / 25 Motivation Models used in macroeconomics

More information

Econ 371 Problem Set #6 Answer Sheet In this first question, you are asked to consider the following equation:

Econ 371 Problem Set #6 Answer Sheet In this first question, you are asked to consider the following equation: Econ 37 Problem Set #6 Answer Sheet 0. In this first question, you are asked to consider the following equation: Y it = β 0 + β X it + β 3 S t + u it. () You are asked how you might time-demean the data

More information

Quantitative Methods Final Exam (2017/1)

Quantitative Methods Final Exam (2017/1) Quantitative Methods Final Exam (2017/1) 1. Please write down your name and student ID number. 2. Calculator is allowed during the exam, but DO NOT use a smartphone. 3. List your answers (together with

More information

i (x i x) 2 1 N i x i(y i y) Var(x) = P (x 1 x) Var(x)

i (x i x) 2 1 N i x i(y i y) Var(x) = P (x 1 x) Var(x) ECO 6375 Prof Millimet Problem Set #2: Answer Key Stata problem 2 Q 3 Q (a) The sample average of the individual-specific marginal effects is 0039 for educw and -0054 for white Thus, on average, an extra

More information

ivporbit:an R package to estimate the probit model with continuous endogenous regressors

ivporbit:an R package to estimate the probit model with continuous endogenous regressors MPRA Munich Personal RePEc Archive ivporbit:an R package to estimate the probit model with continuous endogenous regressors Taha Zaghdoudi University of Jendouba, Faculty of Law Economics and Management

More information

Week 3: Simple Linear Regression

Week 3: Simple Linear Regression Week 3: Simple Linear Regression Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ALL RIGHTS RESERVED 1 Outline

More information