Problem Set # 1. Master in Business and Quantitative Methods

Similar documents
ECO375 Tutorial 8 Instrumental Variables

Exercise Sheet 4 Instrumental Variables and Two Stage Least Squares Estimation

Topics in Applied Econometrics and Development - Spring 2014

5.2. a. Unobserved factors that tend to make an individual healthier also tend

Exercise sheet 6 Models with endogenous explanatory variables

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation

Solutions to Problem Set 6 (Due December 8) Maximum number of points for Problem set 8 is: 220. Problem 10.6

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables

Econometrics Master in Business and Quantitative Methods

Dealing With Endogeneity

An explanation of Two Stage Least Squares

1 Motivation for Instrumental Variable (IV) Regression

Economics 345: Applied Econometrics Section A01 University of Victoria Midterm Examination #2 Version 1 SOLUTIONS Fall 2016 Instructor: Martin Farnham

Instrumental Variables

Estimating the return to education for married women mroz.csv: 753 observations and 22 variables

Economics 345: Applied Econometrics Section A01 University of Victoria Midterm Examination #2 Version 2 Fall 2016 Instructor: Martin Farnham

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares

Economics 241B Estimation with Instruments

ECO375 Tutorial 4 Wooldridge: Chapter 6 and 7

Answer Key: Problem Set 5

ECO375 Tutorial 9 2SLS Applications and Endogeneity Tests

Rockefeller College University at Albany

UNIVERSIDAD CARLOS III DE MADRID ECONOMETRICS FINAL EXAM (Type B) 2. This document is self contained. Your are not allowed to use any other material.

Problemsets for Applied Econometrics

14.32 Final : Spring 2001

CHAPTER 7. + ˆ δ. (1 nopc) + ˆ β1. =.157, so the new intercept is = The coefficient on nopc is.157.

Applied Health Economics (for B.Sc.)

Single-Equation GMM: Endogeneity Bias

Lecture 8: Instrumental Variables Estimation

ECNS 561 Topics in Multiple Regression Analysis

Multiple Regression Analysis: Inference MULTIPLE REGRESSION ANALYSIS: INFERENCE. Sampling Distributions of OLS Estimators

UNIVERSIDAD CARLOS III DE MADRID ECONOMETRICS Academic year 2009/10 FINAL EXAM (2nd Call) June, 25, 2010

ECONOMET RICS P RELIM EXAM August 24, 2010 Department of Economics, Michigan State University

Econometrics Problem Set 4

Econometrics Review questions for exam

Final Exam. Economics 835: Econometrics. Fall 2010

Measurement Error. Often a data set will contain imperfect measures of the data we would ideally like.

IV Estimation WS 2014/15 SS Alexander Spermann. IV Estimation

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

Econ 300/QAC 201: Quantitative Methods in Economics/Applied Data Analysis. 18th Class 7/2/10

Handout 11: Measurement Error

University of California at Berkeley Fall Introductory Applied Econometrics Final examination. Scores add up to 125 points

We begin by thinking about population relationships.

Sample Problems. Note: If you find the following statements true, you should briefly prove them. If you find them false, you should correct them.

Problem Set - Instrumental Variables


Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser

Endogeneity. Tom Smith

Økonomisk Kandidateksamen 2004 (I) Econometrics 2. Rettevejledning

EC402 - Problem Set 3

4.8 Instrumental Variables

Statistical Inference. Part IV. Statistical Inference

Econ 510 B. Brown Spring 2014 Final Exam Answers

Wooldridge, Introductory Econometrics, 3d ed. Chapter 16: Simultaneous equations models. An obvious reason for the endogeneity of explanatory

Lecture 14. More on using dummy variables (deal with seasonality)

Applied Microeconometrics (L5): Panel Data-Basics

SIMULTANEOUS EQUATION MODEL

Economics Introduction to Econometrics - Fall 2007 Final Exam - Answers


Final Exam - Solutions

Brief Suggested Solutions

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Econ 583 Final Exam Fall 2008

Ability Bias, Errors in Variables and Sibling Methods. James J. Heckman University of Chicago Econ 312 This draft, May 26, 2006

Answer Key: Problem Set 6

ESTIMATE THE REGRESSION COEFFICIENTS OF VARIABLES SPL. REFERENCE TO FERTILITY

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Econometrics I Lecture 7: Dummy Variables

Ch 7: Dummy (binary, indicator) variables

Econometrics Problem Set 3

Making sense of Econometrics: Basics

Instrumental Variables and the Problem of Endogeneity

Linear Regression. Junhui Qian. October 27, 2014

Instrumental Variables

Economics 308: Econometrics Professor Moody

x i = 1 yi 2 = 55 with N = 30. Use the above sample information to answer all the following questions. Show explicitly all formulas and calculations.

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

Problem Set 10: Panel Data

ECON3150/4150 Spring 2016

More on Specification and Data Issues

Multiple Regression Analysis. Part III. Multiple Regression Analysis

Write your identification number on each paper and cover sheet (the number stated in the upper right hand corner on your exam cover).

Recitation Notes 5. Konrad Menzel. October 13, 2006

Ecmt 675: Econometrics I

Applied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid

Instrumental Variables, Simultaneous and Systems of Equations

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

1. You have data on years of work experience, EXPER, its square, EXPER2, years of education, EDUC, and the log of hourly wages, LWAGE

Advanced Econometrics

Econometrics. Week 6. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

PhD/MA Econometrics Examination January 2012 PART A

Unless provided with information to the contrary, assume for each question below that the Classical Linear Model assumptions hold.

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

Answers to Problem Set #4

Instrumental Variables

Answer all questions from part I. Answer two question from part II.a, and one question from part II.b.


Transcription:

Problem Set # 1 Master in Business and Quantitative Methods

Contents 0.1 Problems on endogeneity of the regressors........... 2 0.2 Lab exercises on endogeneity of the regressors......... 4 1

0.1 Problems on endogeneity of the regressors Problem 1 For the simple regression model, y i = µ + u i, u i N(0,σ 2 ), prove that the sample mean is consistent and asymptotically normally distributed. Now consider the alternative estimator ˆµ = i w iy i w i = i (n(n+1)/2) = i i i. Note that i w i = 1 and i i2 = n(n + 1)(2n + 1)/6. Prove that this is a consistent estimator of µ and obtain its asymptotic variance. Problem 2 The following equation explains weekly hours of television viewing by a child in terms of the childs age, mothers education, fathers education, and number of siblings: tvhours i = β 0+β 1 age i +β 2 age 2 i +β 3motheduc i +β 4 fatheduc i +β 5 sibs i +u i We are worried that tvhours is measured with error in the survey. Let tvhours denote the reported hours of television viewing per week. (1) What do the classical errors-in-variables (CEV) assumptions require in this application? (2) Do you think the CEV assumptions are likely to hold? Explain. Problem 3 In the context of the instrumental variables we have shown that the least squares estimator ˆβ is biased and inconsistent. Nonetheless, ˆβ does estimate something. Derive the asymptotic covariance matrix of ˆβ, and show that ˆβ is asymptotically normally distributed. Problem 4 Prove that the limiting distribution of ˆβ IV is n(ˆβiv β) d N ( 0,σ 2 rq 1 ZX Q ZZQ 1 XZ). and present the asymptotic distribution of ˆβ IV.

Problem 5 Suppose that we want to estimate the marginal effect of income on the average grade obtained in the university (colgpa). Beside the family income, average grade in high school (hsgpa) and grade obtained in the exam to enter the university (SAT) also explain colgpa. Consider the model for colgpa: colgpa t = β 0 + β 1 faminc t + β 2hsGPA t + β 3 SAT t + u t, where faminc is the true value of the family income. Nevertheless, we observe that the data on the family income provided by the students can contain measurement errors, ie, faminc = faminc +e, where e N(0,σ 2 e). 1. Which are the consequences of this information to the properties of the OLS estimators? Justify your answer with a formal proof. 2. Propose an alternative method to estimate the previous model and explain in detail its assumptions and properties. Problem 6 Consider a model for the health of an individual: health i = β 0 + β 1age i + β 2weight i + β 3height i + β 4male i + β 5work i + β 6exercise i + u i, where health is some quantitative measure of the persons health, age, weight, height, and male are self-explanatory, work is weekly hours worked, and exercise is the hours of exercise per week. a. Why might you be concerned about exercise being correlated with the error term u? b. Suppose you can collect data on two additional variables, disthome and distwork, the distances from home and from work to the nearest health club or gym. Discuss whether these are likely to be uncorrelated with u. Problem 7 Consider the model: y = βx + ε Prove that when only x is measured with error, the squared correlation between y and x is less than that between y and x. (Note the assumption that y = y). Does the same hold true if y is also measured with error?

0.2 Lab exercises on endogeneity of the regressors Problem 8 Consider a consumption function of the form c t = α + βy t + u t. It is estimated using 204 observations on aggregate U.S. consumption and disposable personal income. The data set is called consumption.txt. Suppose that there is a possibility of bias due to correlation between y t and u t. Consider instrumental variables estimation using y t 1 as the instrument for y t, and, of course, the constant term is its own instrument. One observation is lost because of the lagged values, so the results are based on 203 quarterly observations. Estimate the model using the instrumental variables method and compare it with the OLS estimation Test for the possible endogeneity using the Hausman test (version Wu). Problem 9 Regarding the previous exercise that specifies a very simple consumption model, one might wonder if the meager specification of the model could help explain the finding in the Hausman test.use the data to carry out the test in a more elaborate specification c t = β 1 + β 2 y t + β 3 i t + β 4 c t 1 + ε t where c t is the log of real consumption, y t is the log of real disposable income, and i t is the interest rate (90-day T bill rate).(hint: use as instrument of y t, y t 1 ). Compare these results to those obtained previously. Problem 10 Use the data in WAGE2.xls for this exercise: (i) We want to explain log(wage) based on educ, but educ is an endogenous variable. Use IV to estimate this model, using sibs as an instrumental variable. Moreover, run the regression of educ on sibs to test the correlation between educ and sibs. (ii) The variable brthord is birth order (brthord is one for a first-born child, two for a second-born child, and so on). Explain why educ and brthord might be negatively correlated. Regress educ on brthord to determine whether there is a statistically significant negative correlation. (iii) Now, suppose that we include number of siblings as an explanatory variable in the wage equation; this controls for family background, to some extent log(wage) i = β 0 + β 1 educ i + β 2 sibs i + u i.

Suppose that we want to use brthord as an IV for educ, assuming that sibs is exogenous. The reduced form for educ is: educ i = π 0 + π 1 sibs i + π 2 brthord i + v i. State and test the identification assumption. (iv) Estimate the equation from part (iii) using brthord as an IV for educ (and sibs as its own IV). Comment on the standard errors for β 1 y de β 2. (v) Using the fitted values from part (iii), educ, ˆ compute the correlation between educ ˆ and sibs. Use this result to explain your findings from part (iv). Problem 11 The data in FERTIL2.xls includes, for women in Botswana during 1988, information on number of children, years of education, age, and religious and economic status variables. (i) Estimate this model by OLS children i = β 0 + β 1 educ i + β 2 age i + β 3 age 2 i + u i and interpret the estimates. In particular, holding age fixed, what is the estimated effect of another year of education on fertility? If 100 women receive another year of education, how many fewer children are they expected to have? (ii) frsthalf is a dummy variable equal to one if the woman was born during the first six months of the year. Assuming that frsthalf is uncorrelated with the error term from part (i), show that frsthalf is a reasonable IV candidate for educ. (Hint: You need to do a regression.) (iii) Estimate the model from part (i) by using frsthalf as an IV for educ. Compare the estimated effect of education with the OLS estimate from part (i). (iv) Add the binary variables electric, tv, and bicycle to the model and assume these are exogenous. Estimate the equation by OLS and 2SLS and compare the estimated coefficients on educ. Interpret the coefficient on tv and explain why television ownership has a negative effect on fertility.

Problem 12 The aim of this exercise is to compare the estimations obtained with 2SLS with others obtained by incorrect methods. The data is WAGE2.xls (i) Use 2SLS to estimate: log(wage) i = β 0 + β 1 educ i + β 2 exper i + β 3 tenure i + β 4 black i + u i, using sibs as instrumental variable for educ. (ii) Estimate now the regression of educ on sibs, exper, tenure and black. Then plug in educ ˆ in the equation of part (i) and estimate it by OLS. Compare these ˆβ j with the ones obtained in part (i) and their standard errors. Comment these standard errors.