Homework Set 2, ECO 311, Spring 2014

Similar documents
Homework Set 2, ECO 311, Fall 2014

Homework Set 3, ECO 311, Spring 2014

Motivation for multiple regression

Empirical Application of Simple Regression (Chapter 2)

Essential of Simple regression

Chapter 2: simple regression model

Econometrics Problem Set 6

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Lecture 2 Multiple Regression and Tests

Sample Problems. Note: If you find the following statements true, you should briefly prove them. If you find them false, you should correct them.

Econometrics Problem Set 3

ECON3150/4150 Spring 2016

Econometrics Multiple Regression Analysis: Heteroskedasticity

Econ 2120: Section 2

ECONOMET RICS P RELIM EXAM August 24, 2010 Department of Economics, Michigan State University

Econometrics - 30C00200

Econometrics Problem Set 6

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

Econometrics Homework 1

Econometrics Review questions for exam

Statistical Inference with Regression Analysis

Homoskedasticity. Var (u X) = σ 2. (23)

Lecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook)

Inference in Regression Model

Multiple Regression Analysis: Heteroskedasticity

Section 3: Simple Linear Regression

Introductory Econometrics

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit

ECON3150/4150 Spring 2015

ECONOMET RICS P RELIM EXAM August 19, 2014 Department of Economics, Michigan State University

Intro to Applied Econometrics: Basic theory and Stata examples

ECON The Simple Regression Model

Reference: Davidson and MacKinnon Ch 2. In particular page

1. The shoe size of five randomly selected men in the class is 7, 7.5, 6, 6.5 the shoe size of 4 randomly selected women is 6, 5.

5. Let W follow a normal distribution with mean of μ and the variance of 1. Then, the pdf of W is

Multiple Regression: Inference

Economics 326 Methods of Empirical Research in Economics. Lecture 14: Hypothesis testing in the multiple regression model, Part 2

ECO375 Tutorial 7 Heteroscedasticity

Multiple Regression Analysis

Linear Regression with 1 Regressor. Introduction to Econometrics Spring 2012 Ken Simons

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.

Multiple Regression Analysis. Part III. Multiple Regression Analysis

THE MULTIVARIATE LINEAR REGRESSION MODEL

Lab 11 - Heteroskedasticity

Concordia University (5+5)Q 1.

STAT 3A03 Applied Regression With SAS Fall 2017

Statistics II Exercises Chapter 5

Two-Variable Regression Model: The Problem of Estimation

coefficients n 2 are the residuals obtained when we estimate the regression on y equals the (simple regression) estimated effect of the part of x 1

The Simple Regression Model. Part II. The Simple Regression Model

1 Warm-Up: 2 Adjusted R 2. Introductory Applied Econometrics EEP/IAS 118 Spring Sylvan Herskowitz Section #

Universidad Carlos III de Madrid Econometría Nonlinear Regression Functions Problem Set 8

Contest Quiz 3. Question Sheet. In this quiz we will review concepts of linear regression covered in lecture 2.

Inference in Regression Analysis

Heteroskedasticity. Part VII. Heteroskedasticity

Section I. Define or explain the following terms (3 points each) 1. centered vs. uncentered 2 R - 2. Frisch theorem -

Statistical Inference. Part IV. Statistical Inference

Applied Econometrics (QEM)

F3: Classical normal linear rgression model distribution, interval estimation and hypothesis testing

Simple Linear Regression

ECON2228 Notes 2. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 47

2. Linear regression with multiple regressors

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

Making sense of Econometrics: Basics

SIMPLE REGRESSION ANALYSIS. Business Statistics

Final Exam. Economics 835: Econometrics. Fall 2010

Econometrics I Lecture 3: The Simple Linear Regression Model

Measuring the fit of the model - SSR

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Hypothesis Tests and Confidence Intervals in Multiple Regression

The Simple Regression Model. Simple Regression Model 1

Multiple Regression Analysis: Inference MULTIPLE REGRESSION ANALYSIS: INFERENCE. Sampling Distributions of OLS Estimators

The Simple Linear Regression Model

ECO220Y Simple Regression: Testing the Slope

Econometrics. Final Exam. 27thofJune,2008. Timeforcompletion: 2h30min

Problem Set #6: OLS. Economics 835: Econometrics. Fall 2012

Applied Regression. Applied Regression. Chapter 2 Simple Linear Regression. Hongcheng Li. April, 6, 2013

ECON Introductory Econometrics. Lecture 13: Internal and external validity

ECO375 Tutorial 8 Instrumental Variables

WISE International Masters

Diagnostics of Linear Regression

Correlation Analysis

(4) 1. Create dummy variables for Town. Name these dummy variables A and B. These 0,1 variables now indicate the location of the house.

Chapter 15 Panel Data Models. Pooling Time-Series and Cross-Section Data

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

The multiple regression model; Indicator variables as regressors

Ma 3/103: Lecture 24 Linear Regression I: Estimation

Wooldridge, Introductory Econometrics, 4th ed. Chapter 2: The simple regression model

1 Linear Regression Analysis The Mincer Wage Equation Data Econometric Model Estimation... 11

Regression Models. Chapter 4. Introduction. Introduction. Introduction

Regression #8: Loose Ends

Linear Regression. Junhui Qian. October 27, 2014

1 A Non-technical Introduction to Regression

STAT420 Midterm Exam. University of Illinois Urbana-Champaign October 19 (Friday), :00 4:15p. SOLUTIONS (Yellow)

Ch 7: Dummy (binary, indicator) variables

Ordinary Least Squares Regression

1 Independent Practice: Hypothesis tests for one parameter:

Answer Key: Problem Set 6

Lecture 4: Regression Analysis

Transcription:

Homework Set 2, ECO 311, Spring 2014 Due Date: At the beginning of class on March 31, 2014 Instruction: There are twelve questions. Each question is worth 2 points. You need to submit the answers of only Six questions which you choose. The maximum point you can get is 12 points. For the purpose of preparing exam, you need to understand all questions. I will discuss the homework on the due date. Please do not ask me to go through the homework before the due date. However, you can discuss the homework with your classmates. You need to submit the homework individually though. Q1: t test and p value Please use the data file 311 house.dta and provide the stata commands and results. Consider the simple regression and the null hypothesis is H 0 : β 1 = 30 Find (a) the t value for the above null hypothesis (1 point); and (b) the p value for this t test using the two-tailed alternative H 1 : β 1 30. What is your conclusion using the significance level 0.05 (1 point). Q2: confidence interval Please use the data file 311 house.dta and provide the stata commands and results. Consider the simple regression baths = β 0 + β 1 area + u Find (a) the 95% confidence intervals for β 1 (0.5 point); (b) the 90% confidence intervals for β 1 (0.5 point); (c) interpret the 90% confidence intervals (1 point). Q3: fitted value and residual Regression is used a lot for the purpose of investment. For example, we may want to buy an underpriced house (a bargain) if its actual price is below the price predicted by the regression. On the other hand, we may sell a house if it is overpriced. Please use the data 1

file 311 house.dta for this problem. Consider the simple regression Find (a) the predicted real price for a house with 2000 square feet (1 point); and (b) the residual for the first house in the sample. How to interpret this residual? Do you want to buy or sell this house? Why? (1 point). Q4: goodness of fit The overall performance of a regression can be evaluated by the R-squared, which is also called the coefficient of determination. Eco 311 emphasizes causality, so we downplay the importance of R-squared. In contrast, R-squared is the focus for the course of applied regression analysis ISA 291. The formula for R-squared is R 2 = 1 SSR SST where SSR = û 2 i is the sum of squared residual, and SST = (y i ȳ) 2 is the total sum of squares. Basically, R-squared can be interpreted as the fraction of the sample variation in y that is explained by x or explained by the regression. Everything else equal, we want to use a regression that produces greater R-squared. Please use the data file 311 house.dta for this problem. Consider two simple regressions: rprice = β 0 + β 1 baths + u Notice that the two regressions share the same dependent variable; otherwise the R-squared is not comparable. Please (a) interpret the R-squared in each regression (1 point), and (b) explain which model has better fit (1 point). Q5: standard error and t value Most often, we run a regression in the hope that some key regressor is statistically significant. For example, we may hope to find that working out regularly has significant effect on the body weight. In terms of statistics, that means we hope to obtain a big t value, or equivalently, small standard error of ˆβ1, the coefficient of the key regressor. Please use the data file 2

311 house.dta for this problem. Consider running two simple regressions: rprice i = β 0 + β 1 baths i + u i, (i = 1, 2,... 100) rprice i = β 0 + β 1 baths i + u i, (i = 1, 2,... 321) So the first regression uses only the first 100 houses (observations), while the second regression uses all 321 houses in the sample. Please explain in detail why the second regression has smaller standard error and bigger t value for ˆβ 1 than the first regression. Be specific! (Hint: consider the stata command reg y x in 1/100) Q6: Orthogonal Regressors One variable is orthogonal to the other if their covariance is zero. Suppose the true model is a multiple regression Y = β 0 + β 1 X 1 + β 2 X 2 + u, E(u X 1, X 2 ) = 0. Assume cov(x 1, X 2 ) = 0 (so the two regressors are orthogonal), and we run a simple regression of Y = β 0 + β 1 X 1 + e where e is the simple-regression error term 1. (1 point) Explain whether ˆβ 1 in the simple regression is an unbiased estimate for β 1. You may read formula 3.45 and Table 3.2 of the textbook. 2. (1 point) Use math and compare var(e) to var(u). Which regression has more precise estimate for β 1? Why? Hint var(a + B) = var(a) + var(b) + 2cov(A, B) Q7: Frisch-Waugh Theorem This exercise intends to show how to apply the Frisch-Waugh Theorem. Read page 76-79 (5th edition) of the textbook 1. (1 point) Use the data file 311 wage1.dta, and report the regression wage = β 0 + β 1 educ + β 2 exper + u. Interpret ˆβ 1 and ˆβ 2 3

2. (1 point) Suppose the key regressor is educ. Please show the stata code and result that implement the two-step procedure of the Frisch-Waugh Theorem to get ˆβ 1 Q8: F Test for Overall Significance of a Regression Use the data file 311 wage1.dta. You may read page 152-153 (5th edition) of the textbook. 1. (1 point) Report the unrestricted regression wage = β 0 + β 1 educ + β 2 exper + u, and find the F test for overall significance of this regression. Specify the null hypothesis for this F test. 2. (1 point) Report the restricted regression that imposes the restriction in the null hypothesis. Find the residual sum squares of both restricted and unrestricted regressions and construct the F test explicitly using formula 4.37 of textbook. Q9: Testing Hypothesis about a Single Linear Combination of Coefficient Use the data file 311 wage1.dta. 1. (1 point) Suppose the model is wage = β 0 + β 1 female + β 2 married + u. Report the regression result, and report the F statistic for the hypothesis H 0 : β 1 = β 2 Hint: stata command is test female = married 2. (1 point) Run a transformed regression and get a t statistic for H 0. You need to verify that F statistic = t 2. You may read Section 4.4 of the textbook. Q10: Variance of OLS Estimator This exercise shows how to apply formula 3.51 of the textbook. Use the data file 311 wage1.dta. 1. (1 point) Find the variance of ˆβ 1 in the regression wage = β 0 + β 1 educ + β 2 exper + u. Find the estimate for σ 2. 2. (1 point) Please show the stata code and result that obtain SST j and Rj 2 in formula 3.51, where j = 1 for this problem (X 1 =educ is the key regressor). 4

Q11: Quadratic and Interaction Terms Use the data file 311 house.dta. You may read example 6.2 and 6.3 in the textbook. 1. (1 point) Report the regression with a quadratic term rprice = β 0 +β 1 age+β 2 age 2 +u. Explain whether age 2 should be included. When a house gets older, does the house price fall at increasing or decreasing rate? Hint: stata command to get the quadratic term is gen agesq = age 2 2. (1 point) Report the regression with an interaction term rprice = β 0 +β 1 age+β 2 age bath + u. Compare two houses, one with one bathroom and one with two bathrooms. Which one s value falls faster when it gets older? Why? Q12: Log and Adjusted R Squared Use the data file 311 house.dta. You may read Table 2.3, page 191-193, 202-205 (5th edition) of the textbook. 1. (1 point) Report regression log(rprice) = β 0 +β 1 age+u, and regression log(rprice) = β 0 + β 1 log(age) + u. Interpret ˆβ 1 in each regression 2. (1 point) How to tell which regression fits data better? (Hint: consider adjusted R 2 ) 5