Unless provided with information to the contrary, assume for each question below that the Classical Linear Model assumptions hold.

Similar documents
Economics 345: Applied Econometrics Section A01 University of Victoria Midterm Examination #2 Version 1 SOLUTIONS Fall 2016 Instructor: Martin Farnham

Economics 345: Applied Econometrics Section A01 University of Victoria Midterm Examination #2 Version 2 Fall 2016 Instructor: Martin Farnham

ECON3150/4150 Spring 2016

Brief Suggested Solutions

CHAPTER 6: SPECIFICATION VARIABLES

Final Exam - Solutions

Lecture 4: Multivariate Regression, Part 2

Notes 6: Multivariate regression ECO 231W - Undergraduate Econometrics

Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals

2. (3.5) (iii) Simply drop one of the independent variables, say leisure: GP A = β 0 + β 1 study + β 2 sleep + β 3 work + u.

Multiple Regression Analysis: Inference MULTIPLE REGRESSION ANALYSIS: INFERENCE. Sampling Distributions of OLS Estimators

Multiple Linear Regression CIVL 7012/8012

Lecture 4: Multivariate Regression, Part 2

The Simple Linear Regression Model

Multiple Regression. Midterm results: AVG = 26.5 (88%) A = 27+ B = C =

ECON3150/4150 Spring 2016

Linear Regression with one Regressor

Final Exam - Solutions

Lectures 5 & 6: Hypothesis Testing

Business Statistics. Lecture 9: Simple Regression

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Sociology 593 Exam 2 Answer Key March 28, 2002

ECON 497 Midterm Spring

Econometrics Review questions for exam

STAT 512 MidTerm I (2/21/2013) Spring 2013 INSTRUCTIONS


Statistical Inference with Regression Analysis

Ecn Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman. Midterm 2. Name: ID Number: Section:

Midterm 2 - Solutions

coefficients n 2 are the residuals obtained when we estimate the regression on y equals the (simple regression) estimated effect of the part of x 1

Problem Set #3-Key. wage Coef. Std. Err. t P> t [95% Conf. Interval]

OSU Economics 444: Elementary Econometrics. Ch.10 Heteroskedasticity

Problem Set #6: OLS. Economics 835: Econometrics. Fall 2012

Second Midterm Exam Economics 410 Thurs., April 2, 2009

statistical sense, from the distributions of the xs. The model may now be generalized to the case of k regressors:

Economics 113. Simple Regression Assumptions. Simple Regression Derivation. Changing Units of Measurement. Nonlinear effects

Multiple Regression Analysis. Part III. Multiple Regression Analysis

Correlation in Linear Regression

STAT 350: Summer Semester Midterm 1: Solutions

Problem Set 1 ANSWERS

Sociology 593 Exam 2 March 28, 2002

STAT 285: Fall Semester Final Examination Solutions

Econ 1123: Section 2. Review. Binary Regressors. Bivariate. Regression. Omitted Variable Bias

LECTURE 15: SIMPLE LINEAR REGRESSION I

Eco 391, J. Sandford, spring 2013 April 5, Midterm 3 4/5/2013

2 Prediction and Analysis of Variance

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

Multiple Regression Analysis

MORE ON SIMPLE REGRESSION: OVERVIEW

Regression Analysis Tutorial 34 LECTURE / DISCUSSION. Statistical Properties of OLS

ECON3150/4150 Spring 2015

REED TUTORIALS (Pty) LTD ECS3706 EXAM PACK

UNIVERSIDAD CARLOS III DE MADRID ECONOMETRICS Academic year 2009/10 FINAL EXAM (2nd Call) June, 25, 2010

9. Linear Regression and Correlation

Statistics and Quantitative Analysis U4320. Segment 10 Prof. Sharyn O Halloran

2. Linear regression with multiple regressors

Basic Linear Model. Chapters 4 and 4: Part II. Basic Linear Model

1: a b c d e 2: a b c d e 3: a b c d e 4: a b c d e 5: a b c d e. 6: a b c d e 7: a b c d e 8: a b c d e 9: a b c d e 10: a b c d e

MS&E 226. In-Class Midterm Examination Solutions Small Data October 20, 2015

Econometrics Midterm Examination Answers

Business Economics BUSINESS ECONOMICS. PAPER No. : 8, FUNDAMENTALS OF ECONOMETRICS MODULE No. : 3, GAUSS MARKOV THEOREM

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

Steps in Regression Analysis

Linear Regression with 1 Regressor. Introduction to Econometrics Spring 2012 Ken Simons

ECNS 561 Multiple Regression Analysis

M(t) = 1 t. (1 t), 6 M (0) = 20 P (95. X i 110) i=1

An overview of applied econometrics

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Ordinary Least Squares Regression Explained: Vartanian

Quantitative Methods Final Exam (2017/1)

Rockefeller College University at Albany

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.

Introduction to Econometrics. Heteroskedasticity

AP Final Review II Exploring Data (20% 30%)

review session gov 2000 gov 2000 () review session 1 / 38

CIVL 7012/8012. Simple Linear Regression. Lecture 3

ECO220Y Simple Regression: Testing the Slope

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)

Midterm 2 - Solutions

AMS 7 Correlation and Regression Lecture 8

Economics 308: Econometrics Professor Moody

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

Multiple Linear Regression

ECO375 Tutorial 4 Wooldridge: Chapter 6 and 7

Stat 135, Fall 2006 A. Adhikari HOMEWORK 10 SOLUTIONS

WISE International Masters

LECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity

Econometrics Homework 1

Making sense of Econometrics: Basics

1 Warm-Up: 2 Adjusted R 2. Introductory Applied Econometrics EEP/IAS 118 Spring Sylvan Herskowitz Section #

- a value calculated or derived from the data.

Final Exam. Name: Solution:

Applied Statistics and Econometrics

Recitation 1: Regression Review. Christina Patterson

1 Motivation for Instrumental Variable (IV) Regression

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables

MGEC11H3Y L01 Introduction to Regression Analysis Term Test Friday July 5, PM Instructor: Victor Yu

Handout 12. Endogeneity & Simultaneous Equation Models

Simple Linear Regression

Assessing Studies Based on Multiple Regression

Transcription:

Economics 345: Applied Econometrics Section A01 University of Victoria Midterm Examination #2 Version 1 SOLUTIONS Spring 2015 Instructor: Martin Farnham Unless provided with information to the contrary, assume for each question below that the Classical Linear Model assumptions hold. Section 1: Multiple Choice (3 points each) Select the most appropriate answer, and carefully bubble in the letter of the answer on your bubble form. 1) Suppose you obtain the following fitted model: bwght = ˆβ0 + ˆβ 1 cigs + ˆβ 2 faminc, where bwght is child birth weight in ounces, cigs is the average daily number of cigarettes smoked per day by the mother during pregnancy, and faminc is family income measured in dollars. ˆβ 0 is an estimate of A) how many cigarettes a day it takes to lower birth weight by 1 ounce, on average B) how many ounces an extra cigarette a day lowers birth weight, on average. C) how many ounces the average baby weighs, when cigs=0 and faminc=0. D) the standard error of cigs. E) none of the above. 2) Suppose that you estimate the model y = β 0 x + u. You calculate residuals and find that the explained sum of squares is 400 and the total sum of squares is 1200. The R-squared is A) 1/4 B) 1/3 C) 1/2 D) 2/3 E) 3/4 1

3) In linear regression, the assumption of homoscedasticity is needed for I. unbiasedness II. simple calculation of variance and standard errors of coefficient estimates. III. the claim that the OLS estimator is BLUE. D) II and III only. 4) Which of the following is an advantage of multivariate regression over univariate regression? A) One can control for other determinants of y that may be correlated with the primary x variable of interest. B) The zero conditional mean assumption is more likely to hold with multivariate regression. C) One can often reduce bias by moving to multivariate regression. D) One can often obtain a model that better fits the data with more right-hand-side variables in the model. E) All of the above are advantages of multivariate regression over univariate regression. 5) Suppose the true population model of y is given by y = β 0 x 1 x 2 + β 3 x 3 + u. Which of the following will lead to a higher variance of the OLS estimator, ˆβ 3? A) A smaller sample size. B) Less variation in x 3. C) Greater variation in u. D) Higher correlation between x 1 and x 3. E) All of the above will lead to a higher variance of the OLS estimator, ˆβ 3. 2

6) Which of the following is/are consequences of overspecifying a model (including irrelevant variables on the right-hand-side)? I. The variance of the estimators may increase. II. The variance of the estimators may stay the same. III. Bias of the estimators may increase. D) I and II only. Questions 7-8 share the following setup: Suppose the true population model of y is given by (1) y = β 0 x 1 x 2 + u, but you estimate the following model instead: (2) y = γ 0 +γ 2 x 2 + v. [The gammas (γ ) are just constant parameters like the betas ( β )] 7) Suppose x 1 and x 2 are uncorrelated, and β 1 0. Which of the following statements is/are DEFINITELY true? I. ˆγ 2 will be biased. II. Var( ˆγ 2 )<Var( ˆβ 2 ) III. The error variance of model (1) will be smaller than that of model (2). D) I and III only. 3

8) Suppose x 1 and x 2 are negatively correlated, and β 1 < 0. Which of the following statements is/are DEFINITELY true? I. ˆγ 2 will be biased. II. Var( ˆγ 2 )<Var( ˆβ 2 ) III. E[ ˆγ 2 ]>E[ ˆβ 2 ]. D) I and III only. END SECTION 1. Section 2: Short Answers Answer each question as clearly and concisely as possible on the exam paper. 1) 26 points total Note: This question has multiple parts, but each can be answered on its own, so if you get stuck on one part, move onto another. Suppose you are interested in the effect of classroom size (number of students) on student achievement an important question for education policymakers. You have classroom-level data for 4 th grades across Canada that includes average test scores for each classroom (from a standardized test), and the number of students in each classroom. For example, a 4 th grade class at one school may have 80 students divided into three classrooms: one with 24 students, one with 26 students, and another with 30 students. In order to determine the effect of classroom size, you estimate the following regression by OLS: Score i = β 0 Size i + u i where i denotes classroom, Score is the standardized test score for classroom i, and Size is the number of students in classroom i. 4

a) (4 points) In this real world setting, is the OLS estimate, ˆ β 1, likely to be unbiased? Explain. Explanation required for credit. No, it s likely to be biased. This is because there are likely to be many things that affect Score, and some of them are likely to be correlated with Size. If they are omitted from the model, they are present in the error term, and those that are correlated with Size will cause problems. In particular, their presence in the error term will cause the error term to be correlated with Size. This represents a violation of the zero conditional mean assumption. Suppose you learn that the true model of test scores is Score i = β 0 Size i hhinc i + v i (note: this is almost certainly not the true model, but just assume that it is). hhinci is the average household income of a child in classroom i. b) (4 points) Assume that all schools are financed out of local property taxes, and that some school districts are rich and some are poor. Are Size and hhinc likely to be correlated? How? Explain. Explanation required for credit. Yes, Size and hhinc are likely to be negatively correlated. This is because parents in high income areas can afford to pay high taxes to support good schools. Well-funded schools are likely to have lower class sizes. Thus areas with high income will likely have low values of Size. c) (4 points) Now just assume that Size and hhinc are negatively correlated (one can tell stories for the correlation to go either way, so just assume this) and that hhinc affects Score. What effect does including hhinc in the regression have on bias of ˆβ 1? Explain clearly what happens to the estimator when you include hhinc. Explanation required for credit. This question threw a lot of people. Remember the basic omitted variables bias story. If the true model should include hhinc, but we leave it out, that will cause the estimated coefficient on Size to be biased. The bias will be negative, because hhinc and Size are negatively correlated and because beta2 is positive (kids in areas with higher income tend to perform better in school on average). 5

If omitting hhinc causes negative omitted variables bias, then including hhinc should cause the negative omitted variables bias to go away. Another way of putting this is that ˆβ 1 should become larger (in expected value) once we include hhinc. Many people said that adding hhinc would cause bias. But adding variables doesn t cause bias. If anything, it reduces bias. d) (4 points) Can you see any potential disadvantage of adding hhinc to the model? Explain. Yes. Consider the formula for variance of the OLS slope estimators: Var( β ˆ 1 ) = n i =1 σ 2 (x 1i x 1 ) 2 (1 R 1 2 ) If you include hhinc, the error variance in the model (the numerator of the above expression) will fall. Holding the denomimator constant, this would cause variance to fall, which is not a problem. However, the denominator isn t likely to stay constant. This is because the R-squared term will likely go up. This is the R-squared obtained by regressing Size on hhinc, and that s likely to be greater than zero. Without hhinc, the R-squared is zero (because Size regressed on a constant term has an R-squared of zero). But with hhinc, the R- squared goes up. A higher R-squared means a small denominator which makes the Variance go up (holding the numerator constant). If the effect of the higher R-squared term dominates the effect of the lower sigmasquared term, then variance will go up overall. Higher variance would be a disadvantage of adding hhinc to the model. e) (4 points) Suppose that Size is number of students in a classroom and Score is a number between 1 and 100. If you obtain ˆβ 1 = 1.2, what does that mean? Interpret this estimate clearly (as if trying to explain the effect of class size on test scores to a friend who doesn t know econometrics). Use proper units in explaining what that coefficient estimate means. A short (1 sentence) answer will suffice. This means that if you increase the number of students in a class by 1, the test score will fall by 1.2 points, on average. 6

Suppose you estimate Score i = β 0 Size i hhinc i + v i. You obtain the following standard errors and parameter estimates: ˆβ 0 = 95; se( ˆβ 0 ) = 30 ˆβ 1 = 1.2; se( ˆβ 1 ) = 0.4 ˆβ 2 = 0.003; se( ˆβ 2 ) = 0.0015 n = 25 f) 6 points Conduct the following hypothesis test at the 1% significance level: H 0 : β 1 = 0 H 1 : β 1 < 0 In your answer draw the distribution of the test statistic that you use, and clearly define the rejection region(s), indicating the critical value(s). t = ˆ β 1 0 se( β ˆ 1 ) = 1.2 0.4 = 3 degrees of freedom are n-k-1=25-2-1=22 We need to find the critical value for a one-sided alternative at the 1% level, from a t- distribution with 22 degrees of freedom. The t-critical value is -2.508. Since our t-statistic is more negative than t-critical, we reject the null hypothesis in favour of the alternative. (picture to follow) END SECTION 2. END OF EXAM. 7