Problem set - Selection and Diff-in-Diff

Size: px
Start display at page:

Download "Problem set - Selection and Diff-in-Diff"

Transcription

1 Problem set - Selection and Diff-in-Diff 1. You want to model the wage equation for women You consider estimating the model: ln wage = α + β 1 educ + β 2 exper + β 3 exper 2 + ɛ (1) Read the data into Stata:. use http :// fmwww.bc.edu /ec -p/ data / wooldridge / mroz. dta. label var nwifeinc " Income not from wife ". label var kidslt6 " Kids 1-6 years old ". label var kidsge6 " Kids > 6 years old " (a) Estimate the model in equation 1 using OLS. Assuming that the RHS-variables are all uncorrelated with the error term in the population, why may the estimates still be biased?. reg lwage educ exper expersq Source SS df MS Number of obs = F( 3, 424) = Model Prob > F = Residual R- squared = Adj R- squared = Total Root MSE = lwage Coef. Std. Err. t P > t [95% Conf. Interval ] educ exper expersq _cons est sto ols 1

2 . /* > Estimates may still be biased because we only observe wages for > individuals who choose to work. Since choosing to work depends > on the wage, we would expect low - earning women ( conditional on > covariates ) to opt out and appear censored in our data. > */ (b) Estimate the Heckman selection model in two individual steps without exclusion restrictions, by predicting the inverse mills ratios and including this as a control variable in the wage equation.. gen d = lwage!=.. probit d educ exper expersq Iteration 0: log likelihood = Iteration 1: log likelihood = Iteration 2: log likelihood = Iteration 3: log likelihood = Probit regression Number of obs = 753 LR chi2 (3) = Prob > chi2 = Log likelihood = Pseudo R2 = d Coef. Std. Err. z P > z [95% Conf. Interval ] educ exper expersq _cons est sto select. predict xb, xb. gen invmills = normalden (-xb )/(1 - normal (-xb )) 2

3 . * Note that this is equivalent to:. * gen invmills = normalden ( xb )/ normal ( xb). reg lwage educ exper expersq invmills, Source SS df MS Number of obs = F( 4, 423) = Model Prob > F = Residual R- squared = Adj R- squared = Total Root MSE = lwage Coef. Std. Err. t P > t [95% Conf. Interval ] educ exper expersq invmills _cons est sto noexcl (c) Estimate the Heckman selection model using the command -heckman-, and include the variables -nwifeinc-, -age-, -kidslt6-, and -kidsge6- in the selection model. Do you think the exclusion restriction is plausible?. heckman lwage educ exper expersq, /// > select ( educ exper expersq nwifeinc age kidslt6 kidsge6 ) Iteration 0: log likelihood = Iteration 1: log likelihood = Iteration 2: log likelihood =

4 Heckman selection model Number of obs = 753 ( regression model with sample selection ) Censored obs = 325 Uncensored obs = 428 Wald chi2 (3) = Log likelihood = Prob > chi2 = lwage Coef. Std. Err. z P > z [95% Conf. Interval ] lwage educ exper expersq _cons select educ exper expersq nwifeinc age kidslt kidsge _cons / athrho / lnsigma rho

5 sigma lambda LR test of indep. eqns. ( rho = 0): chi2 (1) = 0.03 Prob > chi2 = est sto mle (d) Reestimate the model in 1c using the twostep estimator (-heckman, twostep-) and in individual steps (as in 1b above), and compare the estimates.. heckman lwage educ exper expersq, twostep /// > select ( educ exper expersq nwifeinc age kidslt6 kidsge6 ) Heckman selection model -- two - step estimates Number of obs = 753 ( regression model with sample selection ) Censored obs = 325 Uncensored obs = 428 = = Wald chi2 (3) Prob > chi2 lwage Coef. Std. Err. z P > z [95% Conf. Interval ] lwage educ exper expersq e -06 _cons select educ exper

6 expersq nwifeinc age kidslt kidsge _cons mills lambda rho sigma est sto twostep. probit d educ exper expersq nwifeinc age kidslt6 kidsge6 Iteration 0: log likelihood = Iteration 1: log likelihood = Iteration 2: log likelihood = Iteration 3: log likelihood = Iteration 4: log likelihood = Probit regression Number of obs = 753 LR chi2 (7) = Prob > chi2 = Log likelihood = Pseudo R2 = d Coef. Std. Err. z P > z [95% Conf. Interval ] educ exper expersq nwifeinc

7 age kidslt kidsge _cons est sto select2. drop xb invmills. predict xb, xb. gen invmills = normalden (-xb )/(1 - normal (-xb )). reg lwage educ exper expersq invmills Source SS df MS Number of obs = F( 4, 423) = Model Prob > F = Residual R- squared = Adj R- squared = Total Root MSE = lwage Coef. Std. Err. t P > t [95% Conf. Interval ] educ exper expersq e -06 invmills _cons est sto excl 7

8 2. This exercise uses data from Feb Mar and Nov Dec 1992 on employment at fast food restaurants in the US States of New Jersey and Pennsylvania taken from Card and Krueger in The American Economic Review, Vol. 84(4). The data are described in the output below. We are interested in how the minimum wage affects employment decisions in these restaurants. In April 1992, New Jersey increased the minimum wage from $4.25 to $5.05. In Pennsylvania, the minimum wage was unchanged, and we assume it to be $3.80 for this exercise. Read the data into Stata:. use [ COURSE URL ]/ seminars / data / econ4136seminar09_ did. dta (a) Describe the data using -codebook-.. codebook, c Variable Obs Unique Mean Min Max Label sheet store id post if after the law ; 0 if before the law chain = Burger King ; 2= KFC ; 3= Roy Rogers ; 4= Wendys state if New Jersey ; 0 if Pennsylvania empft # full - time employees hrsopen number hrs open per day nregs number of cash registers in store minwage minimum wage rate ( USD ) temp More than 75% part - time employees d if Burger King ; 0 otherwise d if KFC ; 0 otherwise d if Roy Rogers ; 0 otherwise d if Wendy s; 0 otherwise (b) Estimate the following regression on the sample of fast food restaurants in Feb Mar empft ikt = α + γminwage kt + β 1 nregs ikt + β 2 hrsopen ikt + η j dj + ɛ ikt (2) j=2 where i denotes restaurant, k denotes state, and t = 0 if the observation is from Feb Mar and t = 1 if the observation is from Nov Dec.. reg empft minwage hrsopen nregs d2 -d4 if post == 0 Source SS df MS Number of obs = F( 6, 390) = Model Prob > F = Residual R- squared = Adj R- squared = Total Root MSE = empft Coef. Std. Err. t P > t [95% Conf. Interval ] minwage

9 hrsopen nregs d d d _cons i. Interpret the coefficient γ, and calculate a 90% confidence interval. γ is supposed to reflect the impact on employment per dollar change in the minimum wage rate. A very good student might note that ˆγ is simply the difference in employment across states (conditional on covariates), divided by the difference in the minimum wage rate. CI α (ˆγ) = ˆγ ± t 1 α/2 ŝe (ˆγ) = 5.18 ± = [ 8.94, 1.43] ii. Use the Sum of squares table from the regression output to calculate the R 2 and the standard error of the regression (Root MSE). R 2 = SER = T SS RSS = MSS T SS T SS = = RSS n k 1 = = iii. Give an economic interpretation of the coefficients η 2 η 4. What might explain the relatively large coefficient on -d4-? η 2 η 4 estimates how many employees there are at KFC, Roy Rogers and Wendys, compared to at Burger King. Our estimates suggest that controlling for opening hours and the number of registers, Roy Rogers employs about one person less than Burger King, KFC one person more and Wendys four persons more. An explanation for the latter could be that Wendys sells higher quality food that is more labor intensive. iv. Test H 0 : η 2 = η 3 = 0.. test d2 = d3 = 0 ( 1) d2 - d3 = 0 ( 2) d2 = 0 F( 2, 390) = 1.00 Prob > F =

10 . estimates store unrest. reg empft minwage hrsopen nregs d4 if post == 0 Source SS df MS Number of obs = F( 4, 392) = Model Prob > F = Residual R- squared = Adj R- squared = Total Root MSE = empft Coef. Std. Err. t P > t [95% Conf. Interval ] minwage hrsopen nregs d _cons lrtest unrest Likelihood - ratio test LR chi2 (2) = 2.04 ( Assumption :. nested in unrest ) Prob > chi2 = Alternatively, we can do this by hand, e.g. F = = ( R 2 u Rr 2 ) / (ku k r ) (1 Ru) 2 / (n k u 1) ( ) /2 ( ) /390 So we cannot reject at reasonable significance levels. v. Test the hypothesis H 0 : η 2 = η 3 using the estimated covariance matrix of the coefficients (-mat list e(v)-). Verify your answer by running the test in Stata using -test-, and/or by performing an F-test. Use the covariance matrix of the coefficients to calculate the standard error of ˆη 2 ˆη 3, and test whether this is significantly different from zero, var (ˆη 2 ˆη 3 ) = var (ˆη 2 ) + var (ˆη 3 ) + 2cov (ˆη 2, ˆη 3 ). estimates restore unrest ( results unrest are active now ). mat list e(v) symmetric e(v )[7,7] minwage hrsopen nregs d2 d3 d4 _cons minwage hrsopen nregs d d d _cons

11 This gives var (ˆη 2 ˆη 3 ) = (0.5866) = ˆη 2 ˆη = = ŝe (ˆη 2 ˆη 3 ) Using 2-sided test and a t-table, p 20%. Alternatively, one could run the test directly in STATA, or estimate the model empft ikt = α+γminwage kt +β 1 nregs ikt +β 2 hrsopen ikt +η 2 (d2 + d3)+η 4 d4+ɛ ikt and do an F-test compared to the unrestricted model above.. test d2 = d3 ( 1) d2 - d3 = 0 F( 1, 390) = 1.75 Prob > F = gen d23 = d2 + d3. reg empft minwage hrsopen nregs d23 d4 if post == 0 Source SS df MS Number of obs = F( 5, 391) = Model Prob > F = Residual R- squared = Adj R- squared = Total Root MSE = empft Coef. Std. Err. t P > t [95% Conf. Interval ] minwage hrsopen nregs d d _cons lrtest unrest Likelihood - ratio test LR chi2 (1) = 1.78 ( Assumption :. nested in unrest ) Prob > chi2 = (c) We now want to control for potential selection issues by using the panel structure of our data. i. Explain why the previous estimate of γ is likely to suffer from omitted variable bias. State-specific factors will probably impact both employment and the minimum wage. For instance, a high minimum wage could mean a higher wage level in the state, which might cause lower employment at a given employer. Or a high minimum wage could mean that many employees are at the lower bound, which could imply the opposite. ii. Assume that ɛ ikt = µ k + ζ t + u ikt and that E [u ikt x ikt ] = 0 (where x ikt is the 11

12 vector of RHS-variables in (2) except -minwage-). Explain how you can then use the increase in the minimum wage in New Jersey and a difference-in-differences (DD) model to identify the effect of the minimum wage on employment. Give an example where the necessary assumption(s) are violated. You can use DD to control for (i) unobservables that change similarly across states and (ii) unobservables that are constant in a state. Specifically, if x kt is the vector of all control variables, -post-, and -state-, then we assume E (ɛ kt x kt ) = µ k + ν t + E (u kt x kt ) E (u kt x kt ) = 0 which implies that E [Y state = 1, post = 1] E [Y state = 1, post = 0] = γ minwage + ν t E [Y state = 0, post = 1] E [Y state = 0, post = 0] = ν t and hence the DD estimator is a consistent estimator of γ minwage: DD = E [Y state = 1, post = 1] E [Y state = 1, post = 0] E [Y state = 0, post = 1] E [Y state = 0, post = 0] This assumption is not satisfied if there are time-varying factors affecting employment, that differ across states. For instance, if the macroeconomic environment improves in New Jersey relative to Pennsylvania, then the DD estimator is biased upwards. iii. Generate a table of means, a table of standard errors and a table of frequencies for -empft- in each state and each time period (post = 1 and post = 0).. table state post, c( mean empft sd empft n empft ) if New Jersey ; 0 if 1 if after the law ; Pennsylva 0 if before the law nia iv. Using these statistics, calculate a DD estimate of the impact of the minimum wage law on employment. 12

13 ˆ DD = [Ȳ (state = 1, post = 1) Ȳ (state = 1, post = 0)] ] [Ȳ (state = 0, post = 1) Ȳ (state = 0, post = 0) = ( ) ( ) = 3.38 v. Specify and estimate the corresponding regression.. gen pt = post * state. reg empft state post pt, Source SS df MS Number of obs = F( 3, 794) = 2.20 Model Prob > F = Residual R- squared = Adj R- squared = Total Root MSE = empft Coef. Std. Err. t P > t [95% Conf. Interval ] state post pt _cons vi. How much does this suggest that the minimum wage affects full time employment in fast food restaurants? To interpret the estimate as the impact of the minimum wage hike, we should divide the reduced form by the first stage ( minwage): ˆγ = DD/ ( ) = 4.22 vii. Explain why the t-test from the regression above may understate the uncertainty in the effect of the minimum wage on full time employment. How could you correct the standard error? Compare the t-values with and without this correction. To get the standard error correct, we should take into account the estimation error on the first stage. An easy way to do this is to estimate the IV-model directly.. ivreg empft state post ( minwage = pt) Instrumental variables (2 SLS ) regression Source SS df MS Number of obs = F( 3, 794) = 2.20 Model Prob > F = Residual R- squared = Adj R- squared = Total Root MSE =

14 empft Coef. Std. Err. t P > t [95% Conf. Interval ] minwage state post _cons Instrumented : minwage Instruments : state post pt viii. What regression would you run to estimate the DD model including control variables? Run the regression using robust standard errors. A regression equivalent with our data would be (suppressing the i subscript) empft kt = α + ξpost t + ηstate k + DD post t state k + x ktβ + ɛ kt or equivalently empft kt = α k + η t + γ kt + x ktβ + ɛ kt. reg empft state post pt hrsopen nregs, robust Linear regression Number of obs = 775 F( 5, 769) = Prob > F = R- squared = Root MSE = Robust empft Coef. Std. Err. t P > t [95% Conf. Interval ] state post pt hrsopen nregs _cons ix. How might you test the key identifying assumptions underlying your DiD-estimation in this application, and in general? Essentially, you need a common trend, and you need to argue that the treatment is likely to be random wrt trends. It is useful to decompose the common trend assumption into Stable composition: test by A. comparing characteristics across groups in different time periods B. estimating effects on outcomes that should NOT be affected but should be determined by similar background variables Common time shocks: test by A. comparing trend in outcomes across groups in different time periods B. estimating placebo treatments in periods before (sometimes also after) actual treatment 14

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Michele Aquaro University of Warwick This version: July 21, 2016 1 / 31 Reading material Textbook: Introductory

More information

Ecmt 675: Econometrics I

Ecmt 675: Econometrics I Ecmt 675: Econometrics I Assignment 7 Problem 1 a. reg hours lwage educ age kidslt6 kidsge6 nwifeinc, r Linear regression Number of obs = 428 F( 6, 421) = 3.93 Prob > F = 0.0008 R-squared = 0.0670 Root

More information

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

4 Instrumental Variables Single endogenous variable One continuous instrument. 2 Econ 495 - Econometric Review 1 Contents 4 Instrumental Variables 2 4.1 Single endogenous variable One continuous instrument. 2 4.2 Single endogenous variable more than one continuous instrument..........................

More information

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

4 Instrumental Variables Single endogenous variable One continuous instrument. 2 Econ 495 - Econometric Review 1 Contents 4 Instrumental Variables 2 4.1 Single endogenous variable One continuous instrument. 2 4.2 Single endogenous variable more than one continuous instrument..........................

More information

Dealing With and Understanding Endogeneity

Dealing With and Understanding Endogeneity Dealing With and Understanding Endogeneity Enrique Pinzón StataCorp LP October 20, 2016 Barcelona (StataCorp LP) October 20, 2016 Barcelona 1 / 59 Importance of Endogeneity Endogeneity occurs when a variable,

More information

Control Function and Related Methods: Nonlinear Models

Control Function and Related Methods: Nonlinear Models Control Function and Related Methods: Nonlinear Models Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. General Approach 2. Nonlinear

More information

Problem Set #5-Key Sonoma State University Dr. Cuellar Economics 317- Introduction to Econometrics

Problem Set #5-Key Sonoma State University Dr. Cuellar Economics 317- Introduction to Econometrics Problem Set #5-Key Sonoma State University Dr. Cuellar Economics 317- Introduction to Econometrics C1.1 Use the data set Wage1.dta to answer the following questions. Estimate regression equation wage =

More information

Problem Set 10: Panel Data

Problem Set 10: Panel Data Problem Set 10: Panel Data 1. Read in the data set, e11panel1.dta from the course website. This contains data on a sample or 1252 men and women who were asked about their hourly wage in two years, 2005

More information

Binary Dependent Variables

Binary Dependent Variables Binary Dependent Variables In some cases the outcome of interest rather than one of the right hand side variables - is discrete rather than continuous Binary Dependent Variables In some cases the outcome

More information

ECO220Y Simple Regression: Testing the Slope

ECO220Y Simple Regression: Testing the Slope ECO220Y Simple Regression: Testing the Slope Readings: Chapter 18 (Sections 18.3-18.5) Winter 2012 Lecture 19 (Winter 2012) Simple Regression Lecture 19 1 / 32 Simple Regression Model y i = β 0 + β 1 x

More information

5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is

5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is Practice Final Exam Last Name:, First Name:. Please write LEGIBLY. Answer all questions on this exam in the space provided (you may use the back of any page if you need more space). Show all work but do

More information

Lecture 8: Instrumental Variables Estimation

Lecture 8: Instrumental Variables Estimation Lecture Notes on Advanced Econometrics Lecture 8: Instrumental Variables Estimation Endogenous Variables Consider a population model: y α y + β + β x + β x +... + β x + u i i i i k ik i Takashi Yamano

More information

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser Simultaneous Equations with Error Components Mike Bronner Marko Ledic Anja Breitwieser PRESENTATION OUTLINE Part I: - Simultaneous equation models: overview - Empirical example Part II: - Hausman and Taylor

More information

Lecture 9. Matthew Osborne

Lecture 9. Matthew Osborne Lecture 9 Matthew Osborne 22 September 2006 Potential Outcome Model Try to replicate experimental data. Social Experiment: controlled experiment. Caveat: usually very expensive. Natural Experiment: observe

More information

Course Econometrics I

Course Econometrics I Course Econometrics I 4. Heteroskedasticity Martin Halla Johannes Kepler University of Linz Department of Economics Last update: May 6, 2014 Martin Halla CS Econometrics I 4 1/31 Our agenda for today Consequences

More information

Quantitative Methods Final Exam (2017/1)

Quantitative Methods Final Exam (2017/1) Quantitative Methods Final Exam (2017/1) 1. Please write down your name and student ID number. 2. Calculator is allowed during the exam, but DO NOT use a smartphone. 3. List your answers (together with

More information

Final Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10)

Final Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10) Name Economics 170 Spring 2004 Honor pledge: I have neither given nor received aid on this exam including the preparation of my one page formula list and the preparation of the Stata assignment for the

More information

Lecture#12. Instrumental variables regression Causal parameters III

Lecture#12. Instrumental variables regression Causal parameters III Lecture#12 Instrumental variables regression Causal parameters III 1 Demand experiment, market data analysis & simultaneous causality 2 Simultaneous causality Your task is to estimate the demand function

More information

Practice exam questions

Practice exam questions Practice exam questions Nathaniel Higgins nhiggins@jhu.edu, nhiggins@ers.usda.gov 1. The following question is based on the model y = β 0 + β 1 x 1 + β 2 x 2 + β 3 x 3 + u. Discuss the following two hypotheses.

More information

FTE Employment before FTE Employment after

FTE Employment before FTE Employment after 1. (25 points) In 1992, there was an increase in the (state) minimum wage in one U.S. state (New Jersey) but not in a neighboring location (eastern Pennsylvania). The study provides you with the following

More information

Empirical Application of Simple Regression (Chapter 2)

Empirical Application of Simple Regression (Chapter 2) Empirical Application of Simple Regression (Chapter 2) 1. The data file is House Data, which can be downloaded from my webpage. 2. Use stata menu File Import Excel Spreadsheet to read the data. Don t forget

More information

University of California at Berkeley Fall Introductory Applied Econometrics Final examination. Scores add up to 125 points

University of California at Berkeley Fall Introductory Applied Econometrics Final examination. Scores add up to 125 points EEP 118 / IAS 118 Elisabeth Sadoulet and Kelly Jones University of California at Berkeley Fall 2008 Introductory Applied Econometrics Final examination Scores add up to 125 points Your name: SID: 1 1.

More information

Measurement Error. Often a data set will contain imperfect measures of the data we would ideally like.

Measurement Error. Often a data set will contain imperfect measures of the data we would ideally like. Measurement Error Often a data set will contain imperfect measures of the data we would ideally like. Aggregate Data: (GDP, Consumption, Investment are only best guesses of theoretical counterparts and

More information

Estimating the return to education for married women mroz.csv: 753 observations and 22 variables

Estimating the return to education for married women mroz.csv: 753 observations and 22 variables Return to education Estimating the return to education for married women mroz.csv: 753 observations and 22 variables 1. inlf =1 if in labor force, 1975 2. hours hours worked, 1975 3. kidslt6 # kids < 6

More information

Practice 2SLS with Artificial Data Part 1

Practice 2SLS with Artificial Data Part 1 Practice 2SLS with Artificial Data Part 1 Yona Rubinstein July 2016 Yona Rubinstein (LSE) Practice 2SLS with Artificial Data Part 1 07/16 1 / 16 Practice with Artificial Data In this note we use artificial

More information

1 Independent Practice: Hypothesis tests for one parameter:

1 Independent Practice: Hypothesis tests for one parameter: 1 Independent Practice: Hypothesis tests for one parameter: Data from the Indian DHS survey from 2006 includes a measure of autonomy of the women surveyed (a scale from 0-10, 10 being the most autonomous)

More information

(a) Briefly discuss the advantage of using panel data in this situation rather than pure crosssections

(a) Briefly discuss the advantage of using panel data in this situation rather than pure crosssections Answer Key Fixed Effect and First Difference Models 1. See discussion in class.. David Neumark and William Wascher published a study in 199 of the effect of minimum wages on teenage employment using a

More information

Handout 12. Endogeneity & Simultaneous Equation Models

Handout 12. Endogeneity & Simultaneous Equation Models Handout 12. Endogeneity & Simultaneous Equation Models In which you learn about another potential source of endogeneity caused by the simultaneous determination of economic variables, and learn how to

More information

Econometrics. 8) Instrumental variables

Econometrics. 8) Instrumental variables 30C00200 Econometrics 8) Instrumental variables Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Thery of IV regression Overidentification Two-stage least squates

More information

Lab 07 Introduction to Econometrics

Lab 07 Introduction to Econometrics Lab 07 Introduction to Econometrics Learning outcomes for this lab: Introduce the different typologies of data and the econometric models that can be used Understand the rationale behind econometrics Understand

More information

Homework Solutions Applied Logistic Regression

Homework Solutions Applied Logistic Regression Homework Solutions Applied Logistic Regression WEEK 6 Exercise 1 From the ICU data, use as the outcome variable vital status (STA) and CPR prior to ICU admission (CPR) as a covariate. (a) Demonstrate that

More information

General Linear Model (Chapter 4)

General Linear Model (Chapter 4) General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients

More information

Introductory Econometrics. Lecture 13: Hypothesis testing in the multiple regression model, Part 1

Introductory Econometrics. Lecture 13: Hypothesis testing in the multiple regression model, Part 1 Introductory Econometrics Lecture 13: Hypothesis testing in the multiple regression model, Part 1 Jun Ma School of Economics Renmin University of China October 19, 2016 The model I We consider the classical

More information

Econometrics I Lecture 7: Dummy Variables

Econometrics I Lecture 7: Dummy Variables Econometrics I Lecture 7: Dummy Variables Mohammad Vesal Graduate School of Management and Economics Sharif University of Technology 44716 Fall 1397 1 / 27 Introduction Dummy variable: d i is a dummy variable

More information

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018 Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate

More information

Statistical Inference with Regression Analysis

Statistical Inference with Regression Analysis Introductory Applied Econometrics EEP/IAS 118 Spring 2015 Steven Buck Lecture #13 Statistical Inference with Regression Analysis Next we turn to calculating confidence intervals and hypothesis testing

More information

Handout 11: Measurement Error

Handout 11: Measurement Error Handout 11: Measurement Error In which you learn to recognise the consequences for OLS estimation whenever some of the variables you use are not measured as accurately as you might expect. A (potential)

More information

Specification Error: Omitted and Extraneous Variables

Specification Error: Omitted and Extraneous Variables Specification Error: Omitted and Extraneous Variables Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 5, 05 Omitted variable bias. Suppose that the correct

More information

Treatment interactions with nonexperimental data in Stata

Treatment interactions with nonexperimental data in Stata The Stata Journal (2011) 11, umber4,pp.1 11 Treatment interactions with nonexperimental data in Stata Graham K. Brown Centre for Development Studies University of Bath Bath, UK g.k.brown@bath.ac.uk Thanos

More information

ECON Introductory Econometrics. Lecture 16: Instrumental variables

ECON Introductory Econometrics. Lecture 16: Instrumental variables ECON4150 - Introductory Econometrics Lecture 16: Instrumental variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 12 Lecture outline 2 OLS assumptions and when they are violated Instrumental

More information

Economics 326 Methods of Empirical Research in Economics. Lecture 14: Hypothesis testing in the multiple regression model, Part 2

Economics 326 Methods of Empirical Research in Economics. Lecture 14: Hypothesis testing in the multiple regression model, Part 2 Economics 326 Methods of Empirical Research in Economics Lecture 14: Hypothesis testing in the multiple regression model, Part 2 Vadim Marmer University of British Columbia May 5, 2010 Multiple restrictions

More information

An explanation of Two Stage Least Squares

An explanation of Two Stage Least Squares Introduction Introduction to Econometrics An explanation of Two Stage Least Squares When we get an endogenous variable we know that OLS estimator will be inconsistent. In addition OLS regressors will also

More information

Econometrics II Censoring & Truncation. May 5, 2011

Econometrics II Censoring & Truncation. May 5, 2011 Econometrics II Censoring & Truncation Måns Söderbom May 5, 2011 1 Censored and Truncated Models Recall that a corner solution is an actual economic outcome, e.g. zero expenditure on health by a household

More information

Econometrics Homework 1

Econometrics Homework 1 Econometrics Homework Due Date: March, 24. by This problem set includes questions for Lecture -4 covered before midterm exam. Question Let z be a random column vector of size 3 : z = @ (a) Write out z

More information

Lab 11 - Heteroskedasticity

Lab 11 - Heteroskedasticity Lab 11 - Heteroskedasticity Spring 2017 Contents 1 Introduction 2 2 Heteroskedasticity 2 3 Addressing heteroskedasticity in Stata 3 4 Testing for heteroskedasticity 4 5 A simple example 5 1 1 Introduction

More information

Answer all questions from part I. Answer two question from part II.a, and one question from part II.b.

Answer all questions from part I. Answer two question from part II.a, and one question from part II.b. B203: Quantitative Methods Answer all questions from part I. Answer two question from part II.a, and one question from part II.b. Part I: Compulsory Questions. Answer all questions. Each question carries

More information

Problem Set #3-Key. wage Coef. Std. Err. t P> t [95% Conf. Interval]

Problem Set #3-Key. wage Coef. Std. Err. t P> t [95% Conf. Interval] Problem Set #3-Key Sonoma State University Economics 317- Introduction to Econometrics Dr. Cuellar 1. Use the data set Wage1.dta to answer the following questions. a. For the regression model Wage i =

More information

Essential of Simple regression

Essential of Simple regression Essential of Simple regression We use simple regression when we are interested in the relationship between two variables (e.g., x is class size, and y is student s GPA). For simplicity we assume the relationship

More information

Implementing weak-instrument robust tests for a general class of instrumental-variables models

Implementing weak-instrument robust tests for a general class of instrumental-variables models The Stata Journal (2009) 9, Number 3, pp. 398 421 Implementing weak-instrument robust tests for a general class of instrumental-variables models Keith Finlay Tulane University New Orleans, LA kfinlay@tulane.edu

More information

ECON3150/4150 Spring 2016

ECON3150/4150 Spring 2016 ECON3150/4150 Spring 2016 Lecture 4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo Last updated: January 26, 2016 1 / 49 Overview These lecture slides covers: The linear regression

More information

ECON Introductory Econometrics. Lecture 17: Experiments

ECON Introductory Econometrics. Lecture 17: Experiments ECON4150 - Introductory Econometrics Lecture 17: Experiments Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 13 Lecture outline 2 Why study experiments? The potential outcome framework.

More information

Please discuss each of the 3 problems on a separate sheet of paper, not just on a separate page!

Please discuss each of the 3 problems on a separate sheet of paper, not just on a separate page! Econometrics - Exam May 11, 2011 1 Exam Please discuss each of the 3 problems on a separate sheet of paper, not just on a separate page! Problem 1: (15 points) A researcher has data for the year 2000 from

More information

Lecture 3.1 Basic Logistic LDA

Lecture 3.1 Basic Logistic LDA y Lecture.1 Basic Logistic LDA 0.2.4.6.8 1 Outline Quick Refresher on Ordinary Logistic Regression and Stata Women s employment example Cross-Over Trial LDA Example -100-50 0 50 100 -- Longitudinal Data

More information

2.1. Consider the following production function, known in the literature as the transcendental production function (TPF).

2.1. Consider the following production function, known in the literature as the transcendental production function (TPF). CHAPTER Functional Forms of Regression Models.1. Consider the following production function, known in the literature as the transcendental production function (TPF). Q i B 1 L B i K i B 3 e B L B K 4 i

More information

2. We care about proportion for categorical variable, but average for numerical one.

2. We care about proportion for categorical variable, but average for numerical one. Probit Model 1. We apply Probit model to Bank data. The dependent variable is deny, a dummy variable equaling one if a mortgage application is denied, and equaling zero if accepted. The key regressor is

More information

Fixed and Random Effects Models: Vartanian, SW 683

Fixed and Random Effects Models: Vartanian, SW 683 : Vartanian, SW 683 Fixed and random effects models See: http://teaching.sociology.ul.ie/dcw/confront/node45.html When you have repeated observations per individual this is a problem and an advantage:

More information

Fortin Econ Econometric Review 1. 1 Panel Data Methods Fixed Effects Dummy Variables Regression... 7

Fortin Econ Econometric Review 1. 1 Panel Data Methods Fixed Effects Dummy Variables Regression... 7 Fortin Econ 495 - Econometric Review 1 Contents 1 Panel Data Methods 2 1.1 Fixed Effects......................... 2 1.1.1 Dummy Variables Regression............ 7 1.1.2 First Differencing Methods.............

More information

Multiple Regression: Inference

Multiple Regression: Inference Multiple Regression: Inference The t-test: is ˆ j big and precise enough? We test the null hypothesis: H 0 : β j =0; i.e. test that x j has no effect on y once the other explanatory variables are controlled

More information

ECON Introductory Econometrics. Lecture 13: Internal and external validity

ECON Introductory Econometrics. Lecture 13: Internal and external validity ECON4150 - Introductory Econometrics Lecture 13: Internal and external validity Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 9 Lecture outline 2 Definitions of internal and external

More information

Lecture 3 Linear random intercept models

Lecture 3 Linear random intercept models Lecture 3 Linear random intercept models Example: Weight of Guinea Pigs Body weights of 48 pigs in 9 successive weeks of follow-up (Table 3.1 DLZ) The response is measures at n different times, or under

More information

Group Comparisons: Differences in Composition Versus Differences in Models and Effects

Group Comparisons: Differences in Composition Versus Differences in Models and Effects Group Comparisons: Differences in Composition Versus Differences in Models and Effects Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised February 15, 2015 Overview.

More information

Exam ECON3150/4150: Introductory Econometrics. 18 May 2016; 09:00h-12.00h.

Exam ECON3150/4150: Introductory Econometrics. 18 May 2016; 09:00h-12.00h. Exam ECON3150/4150: Introductory Econometrics. 18 May 2016; 09:00h-12.00h. This is an open book examination where all printed and written resources, in addition to a calculator, are allowed. If you are

More information

Lab 6 - Simple Regression

Lab 6 - Simple Regression Lab 6 - Simple Regression Spring 2017 Contents 1 Thinking About Regression 2 2 Regression Output 3 3 Fitted Values 5 4 Residuals 6 5 Functional Forms 8 Updated from Stata tutorials provided by Prof. Cichello

More information

leebounds: Lee s (2009) treatment effects bounds for non-random sample selection for Stata

leebounds: Lee s (2009) treatment effects bounds for non-random sample selection for Stata leebounds: Lee s (2009) treatment effects bounds for non-random sample selection for Stata Harald Tauchmann (RWI & CINCH) Rheinisch-Westfälisches Institut für Wirtschaftsforschung (RWI) & CINCH Health

More information

ECON2228 Notes 7. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 41

ECON2228 Notes 7. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 41 ECON2228 Notes 7 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 6 2014 2015 1 / 41 Chapter 8: Heteroskedasticity In laying out the standard regression model, we made

More information

Exercices for Applied Econometrics A

Exercices for Applied Econometrics A QEM F. Gardes-C. Starzec-M.A. Diaye Exercices for Applied Econometrics A I. Exercice: The panel of households expenditures in Poland, for years 1997 to 2000, gives the following statistics for the whole

More information

Spatial Regression Models: Identification strategy using STATA TATIANE MENEZES PIMES/UFPE

Spatial Regression Models: Identification strategy using STATA TATIANE MENEZES PIMES/UFPE Spatial Regression Models: Identification strategy using STATA TATIANE MENEZES PIMES/UFPE Intruduction Spatial regression models are usually intended to estimate parameters related to the interaction of

More information

Econometrics I KS. Module 1: Bivariate Linear Regression. Alexander Ahammer. This version: March 12, 2018

Econometrics I KS. Module 1: Bivariate Linear Regression. Alexander Ahammer. This version: March 12, 2018 Econometrics I KS Module 1: Bivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: March 12, 2018 Alexander Ahammer (JKU) Module 1: Bivariate

More information

ECON3150/4150 Spring 2016

ECON3150/4150 Spring 2016 ECON3150/4150 Spring 2016 Lecture 6 Multiple regression model Siv-Elisabeth Skjelbred University of Oslo February 5th Last updated: February 3, 2016 1 / 49 Outline Multiple linear regression model and

More information

1 Linear Regression Analysis The Mincer Wage Equation Data Econometric Model Estimation... 11

1 Linear Regression Analysis The Mincer Wage Equation Data Econometric Model Estimation... 11 Econ 495 - Econometric Review 1 Contents 1 Linear Regression Analysis 4 1.1 The Mincer Wage Equation................. 4 1.2 Data............................. 6 1.3 Econometric Model.....................

More information

Nonrecursive Models Highlights Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised April 6, 2015

Nonrecursive Models Highlights Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised April 6, 2015 Nonrecursive Models Highlights Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised April 6, 2015 This lecture borrows heavily from Duncan s Introduction to Structural

More information

Sociology Exam 2 Answer Key March 30, 2012

Sociology Exam 2 Answer Key March 30, 2012 Sociology 63993 Exam 2 Answer Key March 30, 2012 I. True-False. (20 points) Indicate whether the following statements are true or false. If false, briefly explain why. 1. A researcher has constructed scales

More information

Jeffrey M. Wooldridge Michigan State University

Jeffrey M. Wooldridge Michigan State University Fractional Response Models with Endogenous Explanatory Variables and Heterogeneity Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. Fractional Probit with Heteroskedasticity 3. Fractional

More information

THE MULTIVARIATE LINEAR REGRESSION MODEL

THE MULTIVARIATE LINEAR REGRESSION MODEL THE MULTIVARIATE LINEAR REGRESSION MODEL Why multiple regression analysis? Model with more than 1 independent variable: y 0 1x1 2x2 u It allows : -Controlling for other factors, and get a ceteris paribus

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple

More information

Mediation Analysis: OLS vs. SUR vs. 3SLS Note by Hubert Gatignon July 7, 2013, updated November 15, 2013

Mediation Analysis: OLS vs. SUR vs. 3SLS Note by Hubert Gatignon July 7, 2013, updated November 15, 2013 Mediation Analysis: OLS vs. SUR vs. 3SLS Note by Hubert Gatignon July 7, 2013, updated November 15, 2013 In Chap. 11 of Statistical Analysis of Management Data (Gatignon, 2014), tests of mediation are

More information

Econometrics II. Lecture 4: Instrumental Variables Part I

Econometrics II. Lecture 4: Instrumental Variables Part I Econometrics II Lecture 4: Instrumental Variables Part I Måns Söderbom 12 April 2011 mans.soderbom@economics.gu.se. www.economics.gu.se/soderbom. www.soderbom.net 1. Introduction Recall from lecture 3

More information

Marginal Effects for Continuous Variables Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 20, 2018

Marginal Effects for Continuous Variables Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 20, 2018 Marginal Effects for Continuous Variables Richard Williams, University of Notre Dame, https://www3.nd.edu/~rwilliam/ Last revised January 20, 2018 References: Long 1997, Long and Freese 2003 & 2006 & 2014,

More information

1 The basics of panel data

1 The basics of panel data Introductory Applied Econometrics EEP/IAS 118 Spring 2015 Related materials: Steven Buck Notes to accompany fixed effects material 4-16-14 ˆ Wooldridge 5e, Ch. 1.3: The Structure of Economic Data ˆ Wooldridge

More information

Lab 10 - Binary Variables

Lab 10 - Binary Variables Lab 10 - Binary Variables Spring 2017 Contents 1 Introduction 1 2 SLR on a Dummy 2 3 MLR with binary independent variables 3 3.1 MLR with a Dummy: different intercepts, same slope................. 4 3.2

More information

. *DEFINITIONS OF ARTIFICIAL DATA SET. mat m=(12,20,0) /*matrix of means of RHS vars: edu, exp, error*/

. *DEFINITIONS OF ARTIFICIAL DATA SET. mat m=(12,20,0) /*matrix of means of RHS vars: edu, exp, error*/ . DEFINITIONS OF ARTIFICIAL DATA SET. mat m=(,,) /matrix of means of RHS vars: edu, exp, error/. mat c=(5,-.6, \ -.6,9, \,,.) /covariance matrix of RHS vars /. mat l m /displays matrix of means / c c c3

More information

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47 ECON2228 Notes 2 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 2 2014 2015 1 / 47 Chapter 2: The simple regression model Most of this course will be concerned with

More information

Sociology 63993, Exam 2 Answer Key [DRAFT] March 27, 2015 Richard Williams, University of Notre Dame,

Sociology 63993, Exam 2 Answer Key [DRAFT] March 27, 2015 Richard Williams, University of Notre Dame, Sociology 63993, Exam 2 Answer Key [DRAFT] March 27, 2015 Richard Williams, University of Notre Dame, http://www3.nd.edu/~rwilliam/ I. True-False. (20 points) Indicate whether the following statements

More information

Lecture 4: Multivariate Regression, Part 2

Lecture 4: Multivariate Regression, Part 2 Lecture 4: Multivariate Regression, Part 2 Gauss-Markov Assumptions 1) Linear in Parameters: Y X X X i 0 1 1 2 2 k k 2) Random Sampling: we have a random sample from the population that follows the above

More information

****Lab 4, Feb 4: EDA and OLS and WLS

****Lab 4, Feb 4: EDA and OLS and WLS ****Lab 4, Feb 4: EDA and OLS and WLS ------- log: C:\Documents and Settings\Default\Desktop\LDA\Data\cows_Lab4.log log type: text opened on: 4 Feb 2004, 09:26:19. use use "Z:\LDA\DataLDA\cowsP.dta", clear.

More information

Case of single exogenous (iv) variable (with single or multiple mediators) iv à med à dv. = β 0. iv i. med i + α 1

Case of single exogenous (iv) variable (with single or multiple mediators) iv à med à dv. = β 0. iv i. med i + α 1 Mediation Analysis: OLS vs. SUR vs. ISUR vs. 3SLS vs. SEM Note by Hubert Gatignon July 7, 2013, updated November 15, 2013, April 11, 2014, May 21, 2016 and August 10, 2016 In Chap. 11 of Statistical Analysis

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 7 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 68 Outline of Lecture 7 1 Empirical example: Italian labor force

More information

Question 1 [17 points]: (ch 11)

Question 1 [17 points]: (ch 11) Question 1 [17 points]: (ch 11) A study analyzed the probability that Major League Baseball (MLB) players "survive" for another season, or, in other words, play one more season. They studied a model of

More information

ECON Introductory Econometrics. Lecture 7: OLS with Multiple Regressors Hypotheses tests

ECON Introductory Econometrics. Lecture 7: OLS with Multiple Regressors Hypotheses tests ECON4150 - Introductory Econometrics Lecture 7: OLS with Multiple Regressors Hypotheses tests Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 7 Lecture outline 2 Hypothesis test for single

More information

Econometrics Midterm Examination Answers

Econometrics Midterm Examination Answers Econometrics Midterm Examination Answers March 4, 204. Question (35 points) Answer the following short questions. (i) De ne what is an unbiased estimator. Show that X is an unbiased estimator for E(X i

More information

Section I. Define or explain the following terms (3 points each) 1. centered vs. uncentered 2 R - 2. Frisch theorem -

Section I. Define or explain the following terms (3 points each) 1. centered vs. uncentered 2 R - 2. Frisch theorem - First Exam: Economics 388, Econometrics Spring 006 in R. Butler s class YOUR NAME: Section I (30 points) Questions 1-10 (3 points each) Section II (40 points) Questions 11-15 (10 points each) Section III

More information

Basic econometrics. Tutorial 3. Dipl.Kfm. Johannes Metzler

Basic econometrics. Tutorial 3. Dipl.Kfm. Johannes Metzler Basic econometrics Tutorial 3 Dipl.Kfm. Introduction Some of you were asking about material to revise/prepare econometrics fundamentals. First of all, be aware that I will not be too technical, only as

More information

REGRESSION RECAP. Josh Angrist. MIT (Fall 2014)

REGRESSION RECAP. Josh Angrist. MIT (Fall 2014) REGRESSION RECAP Josh Angrist MIT 14.387 (Fall 2014) Regression: What You Need to Know We spend our lives running regressions (I should say: "regressions run me"). And yet this basic empirical tool is

More information

Rockefeller College University at Albany

Rockefeller College University at Albany Rockefeller College University at Albany PAD 705 Handout: Polynomial Distributed Lags In the Handouts section of the web site you will find the data sets (GrangerPoly.dta) I constructed for the example

More information

Econ 371 Problem Set #6 Answer Sheet In this first question, you are asked to consider the following equation:

Econ 371 Problem Set #6 Answer Sheet In this first question, you are asked to consider the following equation: Econ 37 Problem Set #6 Answer Sheet 0. In this first question, you are asked to consider the following equation: Y it = β 0 + β X it + β 3 S t + u it. () You are asked how you might time-demean the data

More information

ECON Introductory Econometrics. Lecture 11: Binary dependent variables

ECON Introductory Econometrics. Lecture 11: Binary dependent variables ECON4150 - Introductory Econometrics Lecture 11: Binary dependent variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 11 Lecture Outline 2 The linear probability model Nonlinear probability

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)

More information

Question 1 carries a weight of 25%; Question 2 carries 20%; Question 3 carries 20%; Question 4 carries 35%.

Question 1 carries a weight of 25%; Question 2 carries 20%; Question 3 carries 20%; Question 4 carries 35%. UNIVERSITY OF EAST ANGLIA School of Economics Main Series PGT Examination 017-18 ECONOMETRIC METHODS ECO-7000A Time allowed: hours Answer ALL FOUR Questions. Question 1 carries a weight of 5%; Question

More information

8. Nonstandard standard error issues 8.1. The bias of robust standard errors

8. Nonstandard standard error issues 8.1. The bias of robust standard errors 8.1. The bias of robust standard errors Bias Robust standard errors are now easily obtained using e.g. Stata option robust Robust standard errors are preferable to normal standard errors when residuals

More information

ECON Introductory Econometrics. Lecture 6: OLS with Multiple Regressors

ECON Introductory Econometrics. Lecture 6: OLS with Multiple Regressors ECON4150 - Introductory Econometrics Lecture 6: OLS with Multiple Regressors Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 6 Lecture outline 2 Violation of first Least Squares assumption

More information