Now what do I do with this function?

Size: px
Start display at page:

Download "Now what do I do with this function?"

Transcription

1 Now what do I do with this function? Enrique Pinzón StataCorp LP December 08, 2017 Sao Paulo (StataCorp LP) December 08, 2017 Sao Paulo 1 / 42

2 Initial thoughts Nonparametric regression and about effects/questions npregress Mean relation between an outcome and covariates Model birtweight : age, education level, smoked, number of prenatal visits,... Model wages: age, education level, profession, tenure,... E (y X), conditional mean Parametric models have a known functional form Linear regression: E (y X) = X β Binary: E (y X) = F(Xβ) Poisson: E (y X) = exp(x β) Nonparametric E (y X). The result of using predict (StataCorp LP) December 08, 2017 Sao Paulo 2 / 42

3 Initial thoughts Nonparametric regression and about effects/questions npregress Mean relation between an outcome and covariates Model birtweight : age, education level, smoked, number of prenatal visits,... Model wages: age, education level, profession, tenure,... E (y X), conditional mean Parametric models have a known functional form Linear regression: E (y X) = X β Binary: E (y X) = F(Xβ) Poisson: E (y X) = exp(x β) Nonparametric E (y X). The result of using predict (StataCorp LP) December 08, 2017 Sao Paulo 2 / 42

4 Initial thoughts Nonparametric regression and about effects/questions npregress Mean relation between an outcome and covariates Model birtweight : age, education level, smoked, number of prenatal visits,... Model wages: age, education level, profession, tenure,... E (y X), conditional mean Parametric models have a known functional form Linear regression: E (y X) = X β Binary: E (y X) = F(Xβ) Poisson: E (y X) = exp(x β) Nonparametric E (y X). The result of using predict (StataCorp LP) December 08, 2017 Sao Paulo 2 / 42

5 Initial thoughts Nonparametric regression and about effects/questions npregress Mean relation between an outcome and covariates Model birtweight : age, education level, smoked, number of prenatal visits,... Model wages: age, education level, profession, tenure,... E (y X), conditional mean Parametric models have a known functional form Linear regression: E (y X) = X β Binary: E (y X) = F(Xβ) Poisson: E (y X) = exp(x β) Nonparametric E (y X). The result of using predict (StataCorp LP) December 08, 2017 Sao Paulo 2 / 42

6 Initial thoughts Nonparametric regression and about effects/questions npregress Mean relation between an outcome and covariates Model birtweight : age, education level, smoked, number of prenatal visits,... Model wages: age, education level, profession, tenure,... E (y X), conditional mean Parametric models have a known functional form Linear regression: E (y X) = X β Binary: E (y X) = F(Xβ) Poisson: E (y X) = exp(x β) Nonparametric E (y X). The result of using predict (StataCorp LP) December 08, 2017 Sao Paulo 2 / 42

7 Initial thoughts Nonparametric regression and about effects/questions npregress Mean relation between an outcome and covariates Model birtweight : age, education level, smoked, number of prenatal visits,... Model wages: age, education level, profession, tenure,... E (y X), conditional mean Parametric models have a known functional form Linear regression: E (y X) = X β Binary: E (y X) = F(Xβ) Poisson: E (y X) = exp(x β) Nonparametric E (y X). The result of using predict (StataCorp LP) December 08, 2017 Sao Paulo 2 / 42

8 (StataCorp LP) December 08, 2017 Sao Paulo 3 / 42

9 But... We had nonparametric regression tools lpoly lowess (StataCorp LP) December 08, 2017 Sao Paulo 4 / 42

10 But... We had nonparametric regression tools lpoly lowess (StataCorp LP) December 08, 2017 Sao Paulo 4 / 42

11 What happened in the past lpoly bweight mage if (msmoke==0 & medu>12 & fedu>12), /// mcolor(%30) lineopts(lwidth(thick)) (StataCorp LP) December 08, 2017 Sao Paulo 5 / 42

12 Effects: A thought experiment I give you the true function. list y x a gx in 1/10, noobs y x a gx (StataCorp LP) December 08, 2017 Sao Paulo 6 / 42

13 Effects: A thought experiment I give you the true function. list y x a gx in 1/10, noobs y x a gx (StataCorp LP) December 08, 2017 Sao Paulo 6 / 42

14 Effects: A thought experiment I give you the true function Do we know what are the marginal effects Do we know causal/treatment effects Do we know counterfactuals It seems cosmetic We cannot use margins (StataCorp LP) December 08, 2017 Sao Paulo 7 / 42

15 Effects: A thought experiment I give you the true function Do we know what are the marginal effects Do we know causal/treatment effects Do we know counterfactuals It seems cosmetic We cannot use margins (StataCorp LP) December 08, 2017 Sao Paulo 7 / 42

16 Effects: A thought experiment I give you the true function Do we know what are the marginal effects Do we know causal/treatment effects Do we know counterfactuals It seems cosmetic We cannot use margins (StataCorp LP) December 08, 2017 Sao Paulo 7 / 42

17 Effects: A thought experiment I give you the true function Do we know what are the marginal effects Do we know causal/treatment effects Do we know counterfactuals It seems cosmetic We cannot use margins (StataCorp LP) December 08, 2017 Sao Paulo 7 / 42

18 A detour margins (StataCorp LP) December 08, 2017 Sao Paulo 8 / 42

19 Effects: outcome of interest (StataCorp LP) December 08, 2017 Sao Paulo 9 / 42

20 Data crash 1 if crash traffic Measure of vehicular traffic tickets Number of traffic tickets male 1 if male (StataCorp LP) December 08, 2017 Sao Paulo 10 / 42

21 Probit model and average marginal effects probit crash tickets traffic i.male. margins Predictive margins Number of obs = 948 Model VCE : OIM Expression : Pr(crash), predict() Delta-method Margin Std. Err. z P> z [95% Conf. Interval] _cons margins, dydx(traffic tickets) Average marginal effects Number of obs = 948 Model VCE : OIM Expression : Pr(crash), predict() dy/dx w.r.t. : tickets traffic Delta-method dy/dx Std. Err. z P> z [95% Conf. Interval] tickets traffic (StataCorp LP) December 08, 2017 Sao Paulo 11 / 42

22 Probit model and average marginal effects probit crash tickets traffic i.male. margins Predictive margins Number of obs = 948 Model VCE : OIM Expression : Pr(crash), predict() Delta-method Margin Std. Err. z P> z [95% Conf. Interval] _cons margins, dydx(traffic tickets) Average marginal effects Number of obs = 948 Model VCE : OIM Expression : Pr(crash), predict() dy/dx w.r.t. : tickets traffic Delta-method dy/dx Std. Err. z P> z [95% Conf. Interval] tickets traffic (StataCorp LP) December 08, 2017 Sao Paulo 11 / 42

23 Probit model and average marginal effects probit crash tickets traffic i.male. margins Predictive margins Number of obs = 948 Model VCE : OIM Expression : Pr(crash), predict() Delta-method Margin Std. Err. z P> z [95% Conf. Interval] _cons margins, dydx(traffic tickets) Average marginal effects Number of obs = 948 Model VCE : OIM Expression : Pr(crash), predict() dy/dx w.r.t. : tickets traffic Delta-method dy/dx Std. Err. z P> z [95% Conf. Interval] tickets traffic (StataCorp LP) December 08, 2017 Sao Paulo 11 / 42

24 Not calculus. margins, at(traffic=generate(traffic*1.10)) at(traffic=generate(traffic)) /// > contrast(atcontrast(r) nowald) Contrasts of predictive margins Model VCE : OIM Expression : Pr(crash), predict() 1._at : traffic = traffic* _at : traffic = traffic Delta-method Contrast Std. Err. [95% Conf. Interval] _at (2 vs 1) (StataCorp LP) December 08, 2017 Sao Paulo 12 / 42

25 Probit model and counterfactuals. margins male Predictive margins Number of obs = 948 Model VCE : OIM Expression : Pr(crash), predict() Delta-method Margin Std. Err. z P> z [95% Conf. Interval] male margins, dydx(male) Average marginal effects Number of obs = 948 Model VCE : OIM Expression : Pr(crash), predict() dy/dx w.r.t. : 1.male Delta-method dy/dx Std. Err. z P> z [95% Conf. Interval] 1.male Note: dy/dx for factor levels is the discrete change from the base level. (StataCorp LP) December 08, 2017 Sao Paulo 13 / 42

26 Probit model and counterfactuals. margins male Predictive margins Number of obs = 948 Model VCE : OIM Expression : Pr(crash), predict() Delta-method Margin Std. Err. z P> z [95% Conf. Interval] male margins, dydx(male) Average marginal effects Number of obs = 948 Model VCE : OIM Expression : Pr(crash), predict() dy/dx w.r.t. : 1.male Delta-method dy/dx Std. Err. z P> z [95% Conf. Interval] 1.male Note: dy/dx for factor levels is the discrete change from the base level. (StataCorp LP) December 08, 2017 Sao Paulo 13 / 42

27 More counterfactuals. margins, dydx(tickets) Average marginal effects Number of obs = 948 Model VCE : OIM Expression : Pr(crash), predict() dy/dx w.r.t. : tickets Delta-method dy/dx Std. Err. z P> z [95% Conf. Interval] tickets margins, at(tickets=(0(1)5)) contrast(atcontrast(ar) nowald) Contrasts of predictive margins Model VCE : OIM Expression : Pr(crash), predict() 1._at : tickets = 0 2._at : tickets = 1 3._at : tickets = 2 4._at : tickets = 3 5._at : tickets = 4 6._at : tickets = 5 Delta-method Contrast Std. Err. [95% Conf. Interval] _at (2 vs 1) (3 vs 2) (4 vs 3) (5 vs 4) (6 vs 5) (StataCorp LP) December 08, 2017 Sao Paulo 14 / 42

28 More counterfactuals. margins, dydx(tickets) Average marginal effects Number of obs = 948 Model VCE : OIM Expression : Pr(crash), predict() dy/dx w.r.t. : tickets Delta-method dy/dx Std. Err. z P> z [95% Conf. Interval] tickets margins, at(tickets=(0(1)5)) contrast(atcontrast(ar) nowald) Contrasts of predictive margins Model VCE : OIM Expression : Pr(crash), predict() 1._at : tickets = 0 2._at : tickets = 1 3._at : tickets = 2 4._at : tickets = 3 5._at : tickets = 4 6._at : tickets = 5 Delta-method Contrast Std. Err. [95% Conf. Interval] _at (2 vs 1) (3 vs 2) (4 vs 3) (5 vs 4) (6 vs 5) (StataCorp LP) December 08, 2017 Sao Paulo 14 / 42

29 marginsplot margins, at(tickets=(0(1)5)) marginsplot, ciopts(recast(rarea)) (StataCorp LP) December 08, 2017 Sao Paulo 15 / 42

30 Back to nonparametric regression npregress and nonparametric regression (StataCorp LP) December 08, 2017 Sao Paulo 16 / 42

31 Nonparametric regression: discrete covariates Mean function for a discrete covariate Mean wage conditional on having a college degree. mean wage if collgrad==1 Mean estimation Number of obs = 4,795 Mean Std. Err. [95% Conf. Interval] wage regress wage collgrad, noconstant E(wage collgrad = 1), nonparametric estimate (StataCorp LP) December 08, 2017 Sao Paulo 17 / 42

32 Nonparametric regression: discrete covariates Mean function for a discrete covariate Mean wage conditional on having a college degree. mean wage if collgrad==1 Mean estimation Number of obs = 4,795 Mean Std. Err. [95% Conf. Interval] wage regress wage collgrad, noconstant E(wage collgrad = 1), nonparametric estimate (StataCorp LP) December 08, 2017 Sao Paulo 17 / 42

33 Nonparametric regression: discrete covariates Mean function for a discrete covariate Mean wage conditional on having a college degree. mean wage if collgrad==1 Mean estimation Number of obs = 4,795 Mean Std. Err. [95% Conf. Interval] wage regress wage collgrad, noconstant E(wage collgrad = 1), nonparametric estimate (StataCorp LP) December 08, 2017 Sao Paulo 17 / 42

34 Nonparametric regression: continuous covariates Conditional mean for a continuous covariate Mean wage conditional on tenure, measured in years E(wage tenure = ) Take observations near the value of and then take an average tenure i h h is a small number referred to as the bandwidth (StataCorp LP) December 08, 2017 Sao Paulo 18 / 42

35 Nonparametric regression: continuous covariates Conditional mean for a continuous covariate Mean wage conditional on tenure, measured in years E(wage tenure = ) Take observations near the value of and then take an average tenure i h h is a small number referred to as the bandwidth (StataCorp LP) December 08, 2017 Sao Paulo 18 / 42

36 Nonparametric regression: continuous covariates Conditional mean for a continuous covariate Mean wage conditional on tenure, measured in years E(wage tenure = ) Take observations near the value of and then take an average tenure i h h is a small number referred to as the bandwidth (StataCorp LP) December 08, 2017 Sao Paulo 18 / 42

37 Nonparametric regression: continuous covariates Conditional mean for a continuous covariate Mean wage conditional on tenure, measured in years E(wage tenure = ) Take observations near the value of and then take an average tenure i h h is a small number referred to as the bandwidth (StataCorp LP) December 08, 2017 Sao Paulo 18 / 42

38 Nonparametric regression: continuous covariates Conditional mean for a continuous covariate Mean wage conditional on tenure, measured in years E(wage tenure = ) Take observations near the value of and then take an average tenure i h h is a small number referred to as the bandwidth (StataCorp LP) December 08, 2017 Sao Paulo 18 / 42

39 Nonparametric regression: continuous covariates Conditional mean for a continuous covariate Mean wage conditional on tenure, measured in years E(wage tenure = ) Take observations near the value of and then take an average tenure i h h is a small number referred to as the bandwidth (StataCorp LP) December 08, 2017 Sao Paulo 18 / 42

40 Graphical example (StataCorp LP) December 08, 2017 Sao Paulo 19 / 42

41 Graphical example (StataCorp LP) December 08, 2017 Sao Paulo 20 / 42

42 Graphical example continued (StataCorp LP) December 08, 2017 Sao Paulo 21 / 42

43 Two concepts 1 h 2 Definition of distance between points, x i x h 1 (StataCorp LP) December 08, 2017 Sao Paulo 22 / 42

44 Kernel weights u x i x h Kernel K (u) ( ) 1 Gaussian exp u2 ( 2π ) 2 3 Epanechnikov 4 1 u2 5 5 I ( u 5 ) ( 3 Epanechnikov2 4 1 u 2 ) I ( u 1) 1 Rectangular(Uniform) 2I ( u 1) Triangular (1 u ) I ( u 1) ( 15 Biweight 16 1 u 2 ) 2 I ( u 1) ( 35 Triweight 32 1 u 2 ) 3 I ( u 1) Cosine (1 + cos (2πu)) I ( u 1 ) ( 2 Parzen 4 3 8u2 + 8 u 3) I ( u 1 2) (1 u )3 I ( 1 2 < u 1) (StataCorp LP) December 08, 2017 Sao Paulo 23 / 42

45 Kernel weights u x i x h Kernel K (u) ( ) 1 Gaussian exp u2 ( 2π ) 2 3 Epanechnikov 4 1 u2 5 5 I ( u 5 ) ( 3 Epanechnikov2 4 1 u 2 ) I ( u 1) 1 Rectangular(Uniform) 2I ( u 1) Triangular (1 u ) I ( u 1) ( 15 Biweight 16 1 u 2 ) 2 I ( u 1) ( 35 Triweight 32 1 u 2 ) 3 I ( u 1) Cosine (1 + cos (2πu)) I ( u 1 ) ( 2 Parzen 4 3 8u2 + 8 u 3) I ( u 1 2) (1 u )3 I ( 1 2 < u 1) (StataCorp LP) December 08, 2017 Sao Paulo 23 / 42

46 Discrete bandwidths Default Cell mean { 1 if xi = x k (.) = h otherwise k (.) = { 1 if xi = x 0 otherwise (StataCorp LP) December 08, 2017 Sao Paulo 24 / 42

47 Bandwidth (bias) (StataCorp LP) December 08, 2017 Sao Paulo 25 / 42

48 Bandwidth (variance) (StataCorp LP) December 08, 2017 Sao Paulo 26 / 42

49 Estimation Choose bandwidth optimally. Minimize bias variance trade off Cross-validation (default) Improved AIC (IMAIC) Compute a regression for every point in data (local linear) Computes derivatives and derivative bandwidths Compute a mean for every point in data (local-constant) (StataCorp LP) December 08, 2017 Sao Paulo 27 / 42

50 Example citations monthly drunk driving citations taxes 1 if alcoholic beverages are taxed fines drunk driving fines in thousands csize city size (small, medium, large) college 1 if college town (StataCorp LP) December 08, 2017 Sao Paulo 28 / 42

51 Example citations monthly drunk driving citations taxes 1 if alcoholic beverages are taxed fines drunk driving fines in thousands csize city size (small, medium, large) college 1 if college town (StataCorp LP) December 08, 2017 Sao Paulo 28 / 42

52 npregress bandwidth. npregress kernel citations fines Computing mean function Minimizing cross-validation function: Iteration 0: Cross-validation criterion = Iteration 1: Cross-validation criterion = Iteration 2: Cross-validation criterion = Iteration 3: Cross-validation criterion = Iteration 4: Cross-validation criterion = Iteration 5: Cross-validation criterion = Iteration 6: Cross-validation criterion = Computing optimal derivative bandwidth Iteration 0: Cross-validation criterion = Iteration 1: Cross-validation criterion = Iteration 2: Cross-validation criterion = Iteration 3: Cross-validation criterion = Iteration 4: Cross-validation criterion = Iteration 5: Cross-validation criterion = Iteration 6: Cross-validation criterion = Iteration 7: Cross-validation criterion = Iteration 8: Cross-validation criterion = (StataCorp LP) December 08, 2017 Sao Paulo 29 / 42

53 npregress output. npregress kernel citations fines, nolog Bandwidth Mean Mean Effect fines Local-linear regression Number of obs = 500 Kernel : epanechnikov E(Kernel obs) = 282 Bandwidth: cross validation R-squared = citations Estimate Mean citations Effect fines Note: Effect estimates are averages of derivatives. Note: You may compute standard errors using vce(bootstrap) or reps(). (StataCorp LP) December 08, 2017 Sao Paulo 30 / 42

54 npregress predicted values. describe _* storage display value variable name type format label variable label _Mean_citations double %10.0g mean function _d_mean_citat ~ s double %10.0g derivative of mean function w.r.t fines (StataCorp LP) December 08, 2017 Sao Paulo 31 / 42

55 npgraph. npgraph (StataCorp LP) December 08, 2017 Sao Paulo 32 / 42

56 npregress standard errors I. quietly npregress kernel citations fines, reps(3) seed(111). estimates store A. quietly npregress kernel citations fines, vce(bootstrap, reps(3) seed(111)). estimates store B. estimates table A B, se Variable A B Mean citations Effect fines legend: b/se (StataCorp LP) December 08, 2017 Sao Paulo 33 / 42

57 npregress standard errors II (percentile C.I.). npregress Bandwidth Mean Mean Effect fines Local-linear regression Number of obs = 500 Kernel : epanechnikov E(Kernel obs) = 282 Bandwidth: cross validation R-squared = Observed Bootstrap Percentile citations Estimate Std. Err. z P> z [95% Conf. Interval] Mean citations Effect fines Note: Effect estimates are averages of derivatives. (StataCorp LP) December 08, 2017 Sao Paulo 34 / 42

58 A more interesting model. npregress kernel citations fines i.taxes i.csize i.college, reps(200) seed(10) Bandwidth Mean Mean Effect fines taxes csize college Local-linear regression Number of obs = 500 Continuous kernel : epanechnikov E(Kernel obs) = 224 Discrete kernel : liracine R-squared = Bandwidth : cross validation Observed Bootstrap Percentile citations Estimate Std. Err. z P> z [95% Conf. Interval] Mean citations Effect fines taxes (tax vs no tax) csize (medium vs small) (large vs small) college (college vs not coll..) Note: Effect estimates are averages of derivatives for continuous covariates and averages of contrasts for factor covariates. (StataCorp LP) December 08, 2017 Sao Paulo 35 / 42

59 margins. margins, at(fines=generate(fines)) at(fines=generate(fines*1.15)) /// > contrast(atcontrast(r) nowald) reps(200) seed(12) (running margins on estimation sample) Bootstrap replications (200) Contrasts of predictive margins Number of obs = 500 Replications = 200 Expression : mean function, predict() 1._at : fines = fines 2._at : fines = fines*1.15 Observed Bootstrap Percentile Contrast Std. Err. [95% Conf. Interval] _at (2 vs 1) (StataCorp LP) December 08, 2017 Sao Paulo 36 / 42

60 Another example with margins 10 + x 3 + ε if a = 0 y = 10 + x 3 10x + ε if a = x 3 + 3x + ε if a = 2 (StataCorp LP) December 08, 2017 Sao Paulo 37 / 42

61 Mean and marginal effects. quietly regress y (c.x#c.x#c.x)#i.a c.x#i.a. margins Predictive margins Number of obs = 1,000 Model VCE : OLS Expression : Linear prediction, predict() Delta-method Margin Std. Err. t P> t [95% Conf. Interval] _cons margins, dydx(*) Average marginal effects Number of obs = 1,000 Model VCE : OLS Expression : Linear prediction, predict() dy/dx w.r.t. : 1.a 2.a x Delta-method dy/dx Std. Err. t P> t [95% Conf. Interval] a x Note: dy/dx for factor levels is the discrete change from the base level. (StataCorp LP) December 08, 2017 Sao Paulo 38 / 42

62 Mean and marginal effects. quietly regress y (c.x#c.x#c.x)#i.a c.x#i.a. margins Predictive margins Number of obs = 1,000 Model VCE : OLS Expression : Linear prediction, predict() Delta-method Margin Std. Err. t P> t [95% Conf. Interval] _cons margins, dydx(*) Average marginal effects Number of obs = 1,000 Model VCE : OLS Expression : Linear prediction, predict() dy/dx w.r.t. : 1.a 2.a x Delta-method dy/dx Std. Err. t P> t [95% Conf. Interval] a x Note: dy/dx for factor levels is the discrete change from the base level. (StataCorp LP) December 08, 2017 Sao Paulo 38 / 42

63 npregress estimates. npregress kernel y x i.a, vce(bootstrap, reps(100) seed(111)) (running npregress on estimation sample) Bootstrap replications (100) Bandwidth Mean Mean Effect x a 3.05e e-06 Local-linear regression Number of obs = 1,000 Continuous kernel : epanechnikov E(Kernel obs) = 363 Discrete kernel : liracine R-squared = Bandwidth : cross validation Mean Effect Observed Bootstrap Percentile y Estimate Std. Err. z P> z [95% Conf. Interval] y x a (1 vs 0) (2 vs 0) Note: Effect estimates are averages of derivatives for continuous covariates and averages of contrasts for factor covariates. (StataCorp LP) December 08, 2017 Sao Paulo 39 / 42

64 Function for different values of x. margins, at(x=(1(.5)3)) reps(100) seed(111) (StataCorp LP) December 08, 2017 Sao Paulo 40 / 42

65 Funtion at different values of x for all a. margins a, at(x=(-1(1)3)) reps(100) seed(111) (StataCorp LP) December 08, 2017 Sao Paulo 41 / 42

66 Conclusion Intuition about nonparametric regression Details about how npregress Importance of being able to ask questions to your model (StataCorp LP) December 08, 2017 Sao Paulo 42 / 42

Estimating and Interpreting Effects for Nonlinear and Nonparametric Models

Estimating and Interpreting Effects for Nonlinear and Nonparametric Models Estimating and Interpreting Effects for Nonlinear and Nonparametric Models Enrique Pinzón September 18, 2018 September 18, 2018 1 / 112 Objective Build a unified framework to ask questions about model

More information

Day 4A Nonparametrics

Day 4A Nonparametrics Day 4A Nonparametrics A. Colin Cameron Univ. of Calif. - Davis... for Center of Labor Economics Norwegian School of Economics Advanced Microeconometrics Aug 28 - Sep 2, 2017. Colin Cameron Univ. of Calif.

More information

Day 3B Nonparametrics and Bootstrap

Day 3B Nonparametrics and Bootstrap Day 3B Nonparametrics and Bootstrap c A. Colin Cameron Univ. of Calif.- Davis Frontiers in Econometrics Bavarian Graduate Program in Economics. Based on A. Colin Cameron and Pravin K. Trivedi (2009,2010),

More information

Dealing With and Understanding Endogeneity

Dealing With and Understanding Endogeneity Dealing With and Understanding Endogeneity Enrique Pinzón StataCorp LP October 20, 2016 Barcelona (StataCorp LP) October 20, 2016 Barcelona 1 / 59 Importance of Endogeneity Endogeneity occurs when a variable,

More information

ECON 594: Lecture #6

ECON 594: Lecture #6 ECON 594: Lecture #6 Thomas Lemieux Vancouver School of Economics, UBC May 2018 1 Limited dependent variables: introduction Up to now, we have been implicitly assuming that the dependent variable, y, was

More information

Generalized method of moments estimation in Stata 11

Generalized method of moments estimation in Stata 11 Generalized method of moments estimation in Stata 11 David M. Drukker StataCorp Stata Conference Washington, DC 2009 1 / 27 Outline 1 A quick introduction to GMM 2 gmm examples Ordinary least squares Two-stage

More information

A Journey to Latent Class Analysis (LCA)

A Journey to Latent Class Analysis (LCA) A Journey to Latent Class Analysis (LCA) Jeff Pitblado StataCorp LLC 2017 Nordic and Baltic Stata Users Group Meeting Stockholm, Sweden Outline Motivation by: prefix if clause suest command Factor variables

More information

i (x i x) 2 1 N i x i(y i y) Var(x) = P (x 1 x) Var(x)

i (x i x) 2 1 N i x i(y i y) Var(x) = P (x 1 x) Var(x) ECO 6375 Prof Millimet Problem Set #2: Answer Key Stata problem 2 Q 3 Q (a) The sample average of the individual-specific marginal effects is 0039 for educw and -0054 for white Thus, on average, an extra

More information

Econ 582 Nonparametric Regression

Econ 582 Nonparametric Regression Econ 582 Nonparametric Regression Eric Zivot May 28, 2013 Nonparametric Regression Sofarwehaveonlyconsideredlinearregressionmodels = x 0 β + [ x ]=0 [ x = x] =x 0 β = [ x = x] [ x = x] x = β The assume

More information

Latent class analysis and finite mixture models with Stata

Latent class analysis and finite mixture models with Stata Latent class analysis and finite mixture models with Stata Isabel Canette Principal Mathematician and Statistician StataCorp LLC 2017 Stata Users Group Meeting Madrid, October 19th, 2017 Introduction Latent

More information

Generalized linear models

Generalized linear models Generalized linear models Christopher F Baum ECON 8823: Applied Econometrics Boston College, Spring 2016 Christopher F Baum (BC / DIW) Generalized linear models Boston College, Spring 2016 1 / 1 Introduction

More information

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. The Sharp RD Design 3.

More information

Marginal Effects for Continuous Variables Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 20, 2018

Marginal Effects for Continuous Variables Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 20, 2018 Marginal Effects for Continuous Variables Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 20, 2018 References: Long 1997, Long and Freese 2003 & 2006 & 2014,

More information

raise Coef. Std. Err. z P> z [95% Conf. Interval]

raise Coef. Std. Err. z P> z [95% Conf. Interval] 1 We will use real-world data, but a very simple and naive model to keep the example easy to understand. What is interesting about the example is that the outcome of interest, perhaps the probability or

More information

CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M.

CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M. CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. Linear

More information

Tables and Figures. This draft, July 2, 2007

Tables and Figures. This draft, July 2, 2007 and Figures This draft, July 2, 2007 1 / 16 Figures 2 / 16 Figure 1: Density of Estimated Propensity Score Pr(D=1) % 50 40 Treated Group Untreated Group 30 f (P) 20 10 0.01~.10.11~.20.21~.30.31~.40.41~.50.51~.60.61~.70.71~.80.81~.90.91~.99

More information

Description Remarks and examples Also see

Description Remarks and examples Also see Title stata.com intro 6 Model interpretation Description Description Remarks and examples Also see After you fit a model using one of the ERM commands, you can generally interpret the coefficients in the

More information

Lab 10 - Binary Variables

Lab 10 - Binary Variables Lab 10 - Binary Variables Spring 2017 Contents 1 Introduction 1 2 SLR on a Dummy 2 3 MLR with binary independent variables 3 3.1 MLR with a Dummy: different intercepts, same slope................. 4 3.2

More information

Binary Dependent Variables

Binary Dependent Variables Binary Dependent Variables In some cases the outcome of interest rather than one of the right hand side variables - is discrete rather than continuous Binary Dependent Variables In some cases the outcome

More information

Using generalized structural equation models to fit customized models without programming, and taking advantage of new features of -margins-

Using generalized structural equation models to fit customized models without programming, and taking advantage of new features of -margins- Using generalized structural equation models to fit customized models without programming, and taking advantage of new features of -margins- Isabel Canette Principal Mathematician and Statistician StataCorp

More information

Multivariate probit regression using simulated maximum likelihood

Multivariate probit regression using simulated maximum likelihood Multivariate probit regression using simulated maximum likelihood Lorenzo Cappellari & Stephen P. Jenkins ISER, University of Essex stephenj@essex.ac.uk 1 Overview Introduction and motivation The model

More information

University of California at Berkeley Fall Introductory Applied Econometrics Final examination. Scores add up to 125 points

University of California at Berkeley Fall Introductory Applied Econometrics Final examination. Scores add up to 125 points EEP 118 / IAS 118 Elisabeth Sadoulet and Kelly Jones University of California at Berkeley Fall 2008 Introductory Applied Econometrics Final examination Scores add up to 125 points Your name: SID: 1 1.

More information

41903: Introduction to Nonparametrics

41903: Introduction to Nonparametrics 41903: Notes 5 Introduction Nonparametrics fundamentally about fitting flexible models: want model that is flexible enough to accommodate important patterns but not so flexible it overspecializes to specific

More information

ECON Introductory Econometrics. Lecture 11: Binary dependent variables

ECON Introductory Econometrics. Lecture 11: Binary dependent variables ECON4150 - Introductory Econometrics Lecture 11: Binary dependent variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 11 Lecture Outline 2 The linear probability model Nonlinear probability

More information

Description Remarks and examples Also see

Description Remarks and examples Also see Title stata.com intro 6 Model interpretation Description Remarks and examples Also see Description After you fit a model using one of the ERM commands, you can interpret the coefficients in the usual way.

More information

Binary Outcomes. Objectives. Demonstrate the limitations of the Linear Probability Model (LPM) for binary outcomes

Binary Outcomes. Objectives. Demonstrate the limitations of the Linear Probability Model (LPM) for binary outcomes Binary Outcomes Objectives Demonstrate the limitations of the Linear Probability Model (LPM) for binary outcomes Develop latent variable & transformational approach for binary outcomes Present several

More information

Model and Working Correlation Structure Selection in GEE Analyses of Longitudinal Data

Model and Working Correlation Structure Selection in GEE Analyses of Longitudinal Data The 3rd Australian and New Zealand Stata Users Group Meeting, Sydney, 5 November 2009 1 Model and Working Correlation Structure Selection in GEE Analyses of Longitudinal Data Dr Jisheng Cui Public Health

More information

Estimation of multivalued treatment effects under conditional independence

Estimation of multivalued treatment effects under conditional independence The Stata Journal (2013) 13, Number 3, pp. 407 450 Estimation of multivalued treatment effects under conditional independence Matias D. Cattaneo Department of Economics University of Michigan Ann Arbor,

More information

Nonparametric Methods

Nonparametric Methods Nonparametric Methods Michael R. Roberts Department of Finance The Wharton School University of Pennsylvania July 28, 2009 Michael R. Roberts Nonparametric Methods 1/42 Overview Great for data analysis

More information

Nonlinear Econometric Analysis (ECO 722) : Homework 2 Answers. (1 θ) if y i = 0. which can be written in an analytically more convenient way as

Nonlinear Econometric Analysis (ECO 722) : Homework 2 Answers. (1 θ) if y i = 0. which can be written in an analytically more convenient way as Nonlinear Econometric Analysis (ECO 722) : Homework 2 Answers 1. Consider a binary random variable y i that describes a Bernoulli trial in which the probability of observing y i = 1 in any draw is given

More information

Case of single exogenous (iv) variable (with single or multiple mediators) iv à med à dv. = β 0. iv i. med i + α 1

Case of single exogenous (iv) variable (with single or multiple mediators) iv à med à dv. = β 0. iv i. med i + α 1 Mediation Analysis: OLS vs. SUR vs. ISUR vs. 3SLS vs. SEM Note by Hubert Gatignon July 7, 2013, updated November 15, 2013, April 11, 2014, May 21, 2016 and August 10, 2016 In Chap. 11 of Statistical Analysis

More information

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Ninth ARTNeT Capacity Building Workshop for Trade Research Trade Flows and Trade Policy Analysis Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric

More information

12 - Nonparametric Density Estimation

12 - Nonparametric Density Estimation ST 697 Fall 2017 1/49 12 - Nonparametric Density Estimation ST 697 Fall 2017 University of Alabama Density Review ST 697 Fall 2017 2/49 Continuous Random Variables ST 697 Fall 2017 3/49 1.0 0.8 F(x) 0.6

More information

Homework Solutions Applied Logistic Regression

Homework Solutions Applied Logistic Regression Homework Solutions Applied Logistic Regression WEEK 6 Exercise 1 From the ICU data, use as the outcome variable vital status (STA) and CPR prior to ICU admission (CPR) as a covariate. (a) Demonstrate that

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)

More information

David M. Drukker Italian Stata Users Group meeting 15 November Executive Director of Econometrics Stata

David M. Drukker Italian Stata Users Group meeting 15 November Executive Director of Econometrics Stata Estimating the ATE of an endogenously assigned treatment from a sample with endogenous selection by regression adjustment using an extended regression models David M. Drukker Executive Director of Econometrics

More information

Suggested Answers Problem set 4 ECON 60303

Suggested Answers Problem set 4 ECON 60303 Suggested Answers Problem set 4 ECON 60303 Bill Evans Spring 04. A program that answers part A) is on the web page and is named psid_iv_comparison.do. Below are some key results and a summary table is

More information

Firm-sponsored Training and Poaching Externalities in Regional Labor Markets

Firm-sponsored Training and Poaching Externalities in Regional Labor Markets Firm-sponsored Training and Poaching Externalities in Regional Labor Markets Samuel Muehlemann University of Berne & IZA Bonn Intl. Conference on Economics of Education, Firm Behaviour and Training Policies

More information

Chapter 11. Regression with a Binary Dependent Variable

Chapter 11. Regression with a Binary Dependent Variable Chapter 11 Regression with a Binary Dependent Variable 2 Regression with a Binary Dependent Variable (SW Chapter 11) So far the dependent variable (Y) has been continuous: district-wide average test score

More information

Consider Table 1 (Note connection to start-stop process).

Consider Table 1 (Note connection to start-stop process). Discrete-Time Data and Models Discretized duration data are still duration data! Consider Table 1 (Note connection to start-stop process). Table 1: Example of Discrete-Time Event History Data Case Event

More information

Adding Uncertainty to a Roy Economy with Two Sectors

Adding Uncertainty to a Roy Economy with Two Sectors Adding Uncertainty to a Roy Economy with Two Sectors James J. Heckman The University of Chicago Nueld College, Oxford University This draft, August 7, 2005 1 denotes dierent sectors. =0denotes choice of

More information

Introduction to Econometrics

Introduction to Econometrics Introduction to Econometrics T H I R D E D I T I O N Global Edition James H. Stock Harvard University Mark W. Watson Princeton University Boston Columbus Indianapolis New York San Francisco Upper Saddle

More information

fhetprob: A fast QMLE Stata routine for fractional probit models with multiplicative heteroskedasticity

fhetprob: A fast QMLE Stata routine for fractional probit models with multiplicative heteroskedasticity fhetprob: A fast QMLE Stata routine for fractional probit models with multiplicative heteroskedasticity Richard Bluhm May 26, 2013 Introduction Stata can easily estimate a binary response probit models

More information

Marginal Effects in Multiply Imputed Datasets

Marginal Effects in Multiply Imputed Datasets Marginal Effects in Multiply Imputed Datasets Daniel Klein daniel.klein@uni-kassel.de University of Kassel 14th German Stata Users Group meeting GESIS Cologne June 10, 2016 1 / 25 1 Motivation 2 Marginal

More information

Description Remarks and examples Reference Also see

Description Remarks and examples Reference Also see Title stata.com example 38g Random-intercept and random-slope models (multilevel) Description Remarks and examples Reference Also see Description Below we discuss random-intercept and random-slope models

More information

Problem Set 10: Panel Data

Problem Set 10: Panel Data Problem Set 10: Panel Data 1. Read in the data set, e11panel1.dta from the course website. This contains data on a sample or 1252 men and women who were asked about their hourly wage in two years, 2005

More information

ECO220Y Simple Regression: Testing the Slope

ECO220Y Simple Regression: Testing the Slope ECO220Y Simple Regression: Testing the Slope Readings: Chapter 18 (Sections 18.3-18.5) Winter 2012 Lecture 19 (Winter 2012) Simple Regression Lecture 19 1 / 32 Simple Regression Model y i = β 0 + β 1 x

More information

GMM Estimation in Stata

GMM Estimation in Stata GMM Estimation in Stata Econometrics I Department of Economics Universidad Carlos III de Madrid Master in Industrial Economics and Markets 1 Outline Motivation 1 Motivation 2 3 4 2 Motivation 3 Stata and

More information

Right-censored Poisson regression model

Right-censored Poisson regression model The Stata Journal (2011) 11, Number 1, pp. 95 105 Right-censored Poisson regression model Rafal Raciborski StataCorp College Station, TX rraciborski@stata.com Abstract. I present the rcpoisson command

More information

Section 7: Local linear regression (loess) and regression discontinuity designs

Section 7: Local linear regression (loess) and regression discontinuity designs Section 7: Local linear regression (loess) and regression discontinuity designs Yotam Shem-Tov Fall 2015 Yotam Shem-Tov STAT 239/ PS 236A October 26, 2015 1 / 57 Motivation We will focus on local linear

More information

Lecture 2: Poisson and logistic regression

Lecture 2: Poisson and logistic regression Dankmar Böhning Southampton Statistical Sciences Research Institute University of Southampton, UK S 3 RI, 11-12 December 2014 introduction to Poisson regression application to the BELCAP study introduction

More information

Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 2017, Chicago, Illinois

Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 2017, Chicago, Illinois Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 217, Chicago, Illinois Outline 1. Opportunities and challenges of panel data. a. Data requirements b. Control

More information

The Regression Tool. Yona Rubinstein. July Yona Rubinstein (LSE) The Regression Tool 07/16 1 / 35

The Regression Tool. Yona Rubinstein. July Yona Rubinstein (LSE) The Regression Tool 07/16 1 / 35 The Regression Tool Yona Rubinstein July 2016 Yona Rubinstein (LSE) The Regression Tool 07/16 1 / 35 Regressions Regression analysis is one of the most commonly used statistical techniques in social and

More information

Lecture Data Science

Lecture Data Science Web Science & Technologies University of Koblenz Landau, Germany Lecture Data Science Regression Analysis JProf. Dr. Last Time How to find parameter of a regression model Normal Equation Gradient Decent

More information

The Simulation Extrapolation Method for Fitting Generalized Linear Models with Additive Measurement Error

The Simulation Extrapolation Method for Fitting Generalized Linear Models with Additive Measurement Error The Stata Journal (), Number, pp. 1 12 The Simulation Extrapolation Method for Fitting Generalized Linear Models with Additive Measurement Error James W. Hardin Norman J. Arnold School of Public Health

More information

Problem Set #3-Key. wage Coef. Std. Err. t P> t [95% Conf. Interval]

Problem Set #3-Key. wage Coef. Std. Err. t P> t [95% Conf. Interval] Problem Set #3-Key Sonoma State University Economics 317- Introduction to Econometrics Dr. Cuellar 1. Use the data set Wage1.dta to answer the following questions. a. For the regression model Wage i =

More information

Remedial Measures for Multiple Linear Regression Models

Remedial Measures for Multiple Linear Regression Models Remedial Measures for Multiple Linear Regression Models Yang Feng http://www.stat.columbia.edu/~yangfeng Yang Feng (Columbia University) Remedial Measures for Multiple Linear Regression Models 1 / 25 Outline

More information

2. We care about proportion for categorical variable, but average for numerical one.

2. We care about proportion for categorical variable, but average for numerical one. Probit Model 1. We apply Probit model to Bank data. The dependent variable is deny, a dummy variable equaling one if a mortgage application is denied, and equaling zero if accepted. The key regressor is

More information

The simulation extrapolation method for fitting generalized linear models with additive measurement error

The simulation extrapolation method for fitting generalized linear models with additive measurement error The Stata Journal (2003) 3, Number 4, pp. 373 385 The simulation extrapolation method for fitting generalized linear models with additive measurement error James W. Hardin Arnold School of Public Health

More information

Lab 07 Introduction to Econometrics

Lab 07 Introduction to Econometrics Lab 07 Introduction to Econometrics Learning outcomes for this lab: Introduce the different typologies of data and the econometric models that can be used Understand the rationale behind econometrics Understand

More information

Nonparametric Density Estimation

Nonparametric Density Estimation Nonparametric Density Estimation Econ 690 Purdue University Justin L. Tobias (Purdue) Nonparametric Density Estimation 1 / 29 Density Estimation Suppose that you had some data, say on wages, and you wanted

More information

Week 2: Review of probability and statistics

Week 2: Review of probability and statistics Week 2: Review of probability and statistics Marcelo Coca Perraillon University of Colorado Anschutz Medical Campus Health Services Research Methods I HSMP 7607 2017 c 2017 PERRAILLON ALL RIGHTS RESERVED

More information

LOGISTIC REGRESSION Joseph M. Hilbe

LOGISTIC REGRESSION Joseph M. Hilbe LOGISTIC REGRESSION Joseph M. Hilbe Arizona State University Logistic regression is the most common method used to model binary response data. When the response is binary, it typically takes the form of

More information

Business Statistics. Lecture 10: Course Review

Business Statistics. Lecture 10: Course Review Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,

More information

General Linear Model (Chapter 4)

General Linear Model (Chapter 4) General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients

More information

Exploring Marginal Treatment Effects

Exploring Marginal Treatment Effects Exploring Marginal Treatment Effects Flexible estimation using Stata Martin Eckhoff Andresen Statistics Norway Oslo, September 12th 2018 Martin Andresen (SSB) Exploring MTEs Oslo, 2018 1 / 25 Introduction

More information

Time Series and Forecasting Lecture 4 NonLinear Time Series

Time Series and Forecasting Lecture 4 NonLinear Time Series Time Series and Forecasting Lecture 4 NonLinear Time Series Bruce E. Hansen Summer School in Economics and Econometrics University of Crete July 23-27, 2012 Bruce Hansen (University of Wisconsin) Foundations

More information

ECON3150/4150 Spring 2016

ECON3150/4150 Spring 2016 ECON3150/4150 Spring 2016 Lecture 4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo Last updated: January 26, 2016 1 / 49 Overview These lecture slides covers: The linear regression

More information

Propensity Score Matching and Analysis TEXAS EVALUATION NETWORK INSTITUTE AUSTIN, TX NOVEMBER 9, 2018

Propensity Score Matching and Analysis TEXAS EVALUATION NETWORK INSTITUTE AUSTIN, TX NOVEMBER 9, 2018 Propensity Score Matching and Analysis TEXAS EVALUATION NETWORK INSTITUTE AUSTIN, TX NOVEMBER 9, 2018 Schedule and outline 1:00 Introduction and overview 1:15 Quasi-experimental vs. experimental designs

More information

Regression #8: Loose Ends

Regression #8: Loose Ends Regression #8: Loose Ends Econ 671 Purdue University Justin L. Tobias (Purdue) Regression #8 1 / 30 In this lecture we investigate a variety of topics that you are probably familiar with, but need to touch

More information

Econ 371 Problem Set #6 Answer Sheet. deaths per 10,000. The 90% confidence interval for the change in death rate is 1.81 ±

Econ 371 Problem Set #6 Answer Sheet. deaths per 10,000. The 90% confidence interval for the change in death rate is 1.81 ± Econ 371 Problem Set #6 Answer Sheet 10.1 This question focuses on the regression model results in Table 10.1. a. The first part of this question asks you to predict the number of lives that would be saved

More information

Econometrics I. Lecture 10: Nonparametric Estimation with Kernels. Paul T. Scott NYU Stern. Fall 2018

Econometrics I. Lecture 10: Nonparametric Estimation with Kernels. Paul T. Scott NYU Stern. Fall 2018 Econometrics I Lecture 10: Nonparametric Estimation with Kernels Paul T. Scott NYU Stern Fall 2018 Paul T. Scott NYU Stern Econometrics I Fall 2018 1 / 12 Nonparametric Regression: Intuition Let s get

More information

User s Guide for interflex

User s Guide for interflex User s Guide for interflex A STATA Package for Producing Flexible Marginal Effect Estimates Yiqing Xu (Maintainer) Jens Hainmueller Jonathan Mummolo Licheng Liu Description: interflex performs diagnostics

More information

Empirical Asset Pricing

Empirical Asset Pricing Department of Mathematics and Statistics, University of Vaasa, Finland Texas A&M University, May June, 2013 As of May 24, 2013 Part III Stata Regression 1 Stata regression Regression Factor variables Postestimation:

More information

The Stata Journal. Editors H. Joseph Newton Department of Statistics Texas A&M University College Station, Texas

The Stata Journal. Editors H. Joseph Newton Department of Statistics Texas A&M University College Station, Texas The Stata Journal Editors H. Joseph Newton Department of Statistics Texas A&M University College Station, Texas editors@stata-journal.com Nicholas J. Cox Department of Geography Durham University Durham,

More information

An explanation of Two Stage Least Squares

An explanation of Two Stage Least Squares Introduction Introduction to Econometrics An explanation of Two Stage Least Squares When we get an endogenous variable we know that OLS estimator will be inconsistent. In addition OLS regressors will also

More information

Introduction to machine learning and pattern recognition Lecture 2 Coryn Bailer-Jones

Introduction to machine learning and pattern recognition Lecture 2 Coryn Bailer-Jones Introduction to machine learning and pattern recognition Lecture 2 Coryn Bailer-Jones http://www.mpia.de/homes/calj/mlpr_mpia2008.html 1 1 Last week... supervised and unsupervised methods need adaptive

More information

QIC program and model selection in GEE analyses

QIC program and model selection in GEE analyses The Stata Journal (2007) 7, Number 2, pp. 209 220 QIC program and model selection in GEE analyses James Cui Department of Epidemiology and Preventive Medicine Monash University Melbourne, Australia james.cui@med.monash.edu.au

More information

A simple alternative to the linear probability model for binary choice models with endogenous regressors

A simple alternative to the linear probability model for binary choice models with endogenous regressors A simple alternative to the linear probability model for binary choice models with endogenous regressors Christopher F Baum, Yingying Dong, Arthur Lewbel, Tao Yang Boston College/DIW Berlin, U.Cal Irvine,

More information

Selection on Observables: Propensity Score Matching.

Selection on Observables: Propensity Score Matching. Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017

More information

Practice exam questions

Practice exam questions Practice exam questions Nathaniel Higgins nhiggins@jhu.edu, nhiggins@ers.usda.gov 1. The following question is based on the model y = β 0 + β 1 x 1 + β 2 x 2 + β 3 x 3 + u. Discuss the following two hypotheses.

More information

Logit estimates Number of obs = 5054 Wald chi2(1) = 2.70 Prob > chi2 = Log pseudolikelihood = Pseudo R2 =

Logit estimates Number of obs = 5054 Wald chi2(1) = 2.70 Prob > chi2 = Log pseudolikelihood = Pseudo R2 = August 2005 Stata Application Tutorial 4: Discrete Models Data Note: Code makes use of career.dta, and icpsr_discrete1.dta. All three data sets are available on the Event History website. Code is based

More information

Estimation of Treatment Effects under Essential Heterogeneity

Estimation of Treatment Effects under Essential Heterogeneity Estimation of Treatment Effects under Essential Heterogeneity James Heckman University of Chicago and American Bar Foundation Sergio Urzua University of Chicago Edward Vytlacil Columbia University March

More information

Multiple Regression: Inference

Multiple Regression: Inference Multiple Regression: Inference The t-test: is ˆ j big and precise enough? We test the null hypothesis: H 0 : β j =0; i.e. test that x j has no effect on y once the other explanatory variables are controlled

More information

Estimation of ordered response models with sample selection

Estimation of ordered response models with sample selection The Stata Journal (2011) 11, Number 2, pp. 213 239 Estimation of ordered response models with sample selection Giuseppe De Luca Istituto per lo Sviluppo della Formazione Professionale dei Lavoratori Rome,

More information

Lecture 5: Poisson and logistic regression

Lecture 5: Poisson and logistic regression Dankmar Böhning Southampton Statistical Sciences Research Institute University of Southampton, UK S 3 RI, 3-5 March 2014 introduction to Poisson regression application to the BELCAP study introduction

More information

From the help desk: Comparing areas under receiver operating characteristic curves from two or more probit or logit models

From the help desk: Comparing areas under receiver operating characteristic curves from two or more probit or logit models The Stata Journal (2002) 2, Number 3, pp. 301 313 From the help desk: Comparing areas under receiver operating characteristic curves from two or more probit or logit models Mario A. Cleves, Ph.D. Department

More information

Density estimation Nonparametric conditional mean estimation Semiparametric conditional mean estimation. Nonparametrics. Gabriel Montes-Rojas

Density estimation Nonparametric conditional mean estimation Semiparametric conditional mean estimation. Nonparametrics. Gabriel Montes-Rojas 0 0 5 Motivation: Regression discontinuity (Angrist&Pischke) Outcome.5 1 1.5 A. Linear E[Y 0i X i] 0.2.4.6.8 1 X Outcome.5 1 1.5 B. Nonlinear E[Y 0i X i] i 0.2.4.6.8 1 X utcome.5 1 1.5 C. Nonlinearity

More information

Regression Discontinuity Design

Regression Discontinuity Design Chapter 11 Regression Discontinuity Design 11.1 Introduction The idea in Regression Discontinuity Design (RDD) is to estimate a treatment effect where the treatment is determined by whether as observed

More information

Estimation of quantile treatment effects with Stata

Estimation of quantile treatment effects with Stata The Stata Journal (2010) 10, Number 3, pp. 423 457 Estimation of quantile treatment effects with Stata Markus Frölich Universität Mannheim and Institute for the Study of Labor Bonn, Germany froelich@uni-mannheim.de

More information

Understanding the multinomial-poisson transformation

Understanding the multinomial-poisson transformation The Stata Journal (2004) 4, Number 3, pp. 265 273 Understanding the multinomial-poisson transformation Paulo Guimarães Medical University of South Carolina Abstract. There is a known connection between

More information

PSC 8185: Multilevel Modeling Fitting Random Coefficient Binary Response Models in Stata

PSC 8185: Multilevel Modeling Fitting Random Coefficient Binary Response Models in Stata PSC 8185: Multilevel Modeling Fitting Random Coefficient Binary Response Models in Stata Consider the following two-level model random coefficient logit model. This is a Supreme Court decision making model,

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple

More information

ECON Introductory Econometrics. Lecture 6: OLS with Multiple Regressors

ECON Introductory Econometrics. Lecture 6: OLS with Multiple Regressors ECON4150 - Introductory Econometrics Lecture 6: OLS with Multiple Regressors Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 6 Lecture outline 2 Violation of first Least Squares assumption

More information

Econometrics Review questions for exam

Econometrics Review questions for exam Econometrics Review questions for exam Nathaniel Higgins nhiggins@jhu.edu, 1. Suppose you have a model: y = β 0 x 1 + u You propose the model above and then estimate the model using OLS to obtain: ŷ =

More information

Problem Set 4 ANSWERS

Problem Set 4 ANSWERS Economics 20 Problem Set 4 ANSWERS Prof. Patricia M. Anderson 1. Suppose that our variable for consumption is measured with error, so cons = consumption + e 0, where e 0 is uncorrelated with inc, educ

More information

Statistics 203: Introduction to Regression and Analysis of Variance Course review

Statistics 203: Introduction to Regression and Analysis of Variance Course review Statistics 203: Introduction to Regression and Analysis of Variance Course review Jonathan Taylor - p. 1/?? Today Review / overview of what we learned. - p. 2/?? General themes in regression models Specifying

More information

Stat 5101 Lecture Notes

Stat 5101 Lecture Notes Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random

More information

Lecture 3.1 Basic Logistic LDA

Lecture 3.1 Basic Logistic LDA y Lecture.1 Basic Logistic LDA 0.2.4.6.8 1 Outline Quick Refresher on Ordinary Logistic Regression and Stata Women s employment example Cross-Over Trial LDA Example -100-50 0 50 100 -- Longitudinal Data

More information

The Stata Journal. Editors H. Joseph Newton Department of Statistics Texas A&M University College Station, Texas

The Stata Journal. Editors H. Joseph Newton Department of Statistics Texas A&M University College Station, Texas The Stata Journal Editors H. Joseph Newton Department of Statistics Texas A&M University College Station, Texas editors@stata-journal.com Nicholas J. Cox Department of Geography Durham University Durham,

More information