Solution to the take home exam for ECON 3150/4150

Size: px
Start display at page:

Download "Solution to the take home exam for ECON 3150/4150"

Transcription

1 Solution to the tae home exam for ECO 350/450 Jia Zhiyan and Jo Thori Lind April 2004 General comments Most of the copies we ot were quite ood, and it seems most of you have done a real effort on the problem set. We will first ive some eneral comments on common mistaes and misunderstandins, and then present solutions to problems 2 and 3. Particularly problem 3 is very much a discussion question, so there is no such thin as a correct answer. But the suested solution attempts to include most required elements. There are also other ways of taclin the problem that are equally satisfactory. We have not included a solution to problem as it is very much explained in the boo. Some of you don t discuss your results. You should always try to add a line or two of comments after a test, and when you et a reression try to discuss the main results a bit. There is some confusion about the requirements for OLS to be unbiased. When the reressors are non-stochastic, all that is needed is that the true model is linear, and that the residual has expectation zero. Assumptions on the variance and covariance are only necessary to derive variances of the estimators and for BLUE to hold. Some of you hardly define what BLUE is. Remember to answer the question! In problem 2, it is sometimes hard to follow what you do. Try to add some words to motivate what you re tryin to do. In problem 3, when ased to set up an econometric model, this includes statin assumptions on the residual.

2 Some of you are tempted to use the central limit theorem as we have a lot of observations. This is nice. However, the CLT does not say that each residual becomes normal when is lare. However, even if each residual is non-normal but with identical distribution, the estimators for the parameters, which in a sense are averaes of the residuals, become normally distributed. When testin whether two reression coefficients are different, there are three approaches. One is to find the covariance between the estimates (B not between the variables) and construct a standard t-test. The second is to rephrase the reression as we do in the solution of problem 3 below. The last is to use an F-test and comparin RRS R and RSS U.

3 Problem 2 (answers in bold) Suppose we are interested in estimatin the followin reression model: () Y = α0 + αx +ε 2 σ if i=j and = var( ε, ε j ) = ε 0 otherwise and E( ε ) = 0 for all i and Unfortunately, the variables Yij and on roup averaes, which are defined by * i= has identical size.) So instead, we can only estimate the followin model: X ij are not directly observable. We have only access to data * Y = Y / and X = X / (Each roup i= (2) Y = β + β X + e * * 0 Denote the OLS estimates of β obtained from reression (2) by ˆβ Denote the OLS estimator of α obtained from reression () by ˆ α. Question. Clarify the connections between the reressions () and (2). Is ˆβ an unbiased estimator of α? Answer: Sum up () for each roup, we have Y = α + α X + ε 0 i= i= i= Divide by at both sides Denote e = i= It is easy to verify that Y / = α + α X / + ε / 0 i= i= i= ε /, and use definition of i= * Y and Ee ( ) = E( ε )/ = 0 cov( e, e) = cov( ε /, εi / ) = i= i= * X, we have (2) 2 σ ε / if = 0 otherwise

4 So (2) also satisfy the assumption for OLS. Usin the OLS formula, we have * * * * * ( X X) Y ( X X)( α0 + αx + e) ( X X) e ˆ β = = = α * 2 * 2 + * 2 ( X X) ( X X) ( X X) so we have * ( X X) E( e) * 2 ( X X) E( ˆ β ) = α + = α * ( X X) e 2 var( e) σ = = = ε * 2 * 2 * 2 ( X X) ( X X) ( X X) var( ˆ β ) var( ) Question 2. Calculate the variances of ˆ α and ˆβ. Which estimator of α do you prefer. Explain the reason for your choice Similar to above: we have ˆ α ( X X) ε i == α+ 2 ( X X) i One thin to note here is the double summation X. The outer ( subscript ) i= summation is over different roups, the inner summation (subscript i) is over the observations within the same roup. You can thin it as summin over the followin table row by row: First individual of the roup (i=) Group (=) X Group 2 (=2) X2 Last individual of the roup (i=) X X X X X 2 X Last roup (=G) XG X X G X Easy to show that ˆ α is unbiased estimator of α. And the variance of ˆ α is

5 2 σ ε ˆ 2 ( X X ) i var( α ) == Since both ˆ α and ˆβ is unbiased, we will choose the one with smaller variance. Compare the formulae for variance of ˆ α and ˆβ, we see that ˆ α ˆ β iff X X X X Usin the hints iven, we see that 2 * 2 var( ) var( ) ( ) ( ) i ( X X) = ( X X + X X) 2 * * 2 i i = (( X X ) + ( X X)) i i * * 2 = + + * 2 * 2 * * (( X X ) ( X X) 2( X X )( X X)) = ( X X ) + ( X X) + (( X X) ( X X * 2 * 2 * * i i = ( X X ) + ( X X) * 2 * 2 i * ( X X ) =0 for all. i ote that in the last step we use that fact that So which proves that var( ˆ α ˆ ) var( β). X X X X = X X 0 2 * 2 * 2 ( ) ) ( ) ( ) i i )) Question 3. Will you chane your conclusion if you are informed that the reressor X is constant within each roup but varies between roups i.e. X =X j. Answer: if X =X j then X = X * for all, so ( X X) ) ( X X) 2 * i 2 =0 which means that var( ˆ α ˆ ) = var( β). And there is no loss of efficiency for usin roup averae instead of individual data.

6 Problem 3 a) Consider the model w i = α + β lo y i + ε i, i =,..., where w is the expenditure share on rice, y total consumer expenditure, and ε i a random error term. I will assume that. ε i (0, σ 2 ) 2. cov (ε i, ε j ) = 0, i j. 3. y i is non-stochastic This will ive me an econometric model where OLS is unbiased and BLUE, and where the usual formulae ive correct expressions for the variances. otice that assumption ) implies that ε i both has expectation zero and a constant variance. Usin the supplied data, I estimated this model. The results are as follows: Constant Ltotexp sma RSS R^ F(,2245) = [0.000]** lo-lielihood DW.5 no. of observations 2247 no. of parameters 2 mean(w_rice) var(w_rice) This shows that there is a neative relationship between lo of total expenditure and the budet share on rice. A one percentae increase in income ives a increase in the budet share on rice. The t-value of -7.5 shows that this is a stronly snificant result that and we would reject the null hypothesis of no relationship between total expenditure and the budet share on rice at all usual levels of snificance. The R 2 of 0.2 is relatively low, so there

7 is a reat deal of variation that is not explained by expenditure. As this is household micro data, it is not unusual to et relative low values of R 2 thouh. b) An Enel curve ives the relationship between total expenditure and expenditure on a iven ood. If we let x denote expenditure on rice, the Worin-Leser specification ives rise to an Enle curve of the form x = αy + βy lo y Pluin in the estimates from the estimation, we et a curve as shown on the next pae. Each data point is also shown to ive an impression of the fit of the model. This ives an increasin but concave curve. At very hh incomes it starts to decrease, but almost no households are on this part of the curve. This shows that rice is a normal non-luxury ood (for the very rich, it is an inferior ood). c) We first construct a variable hhsize that ives the number of person livin in the household. Let us assume that the same assumptions that under a) applies. We include this new variable in the reression, which yield the followin output: Constant Ltotexp hhsize sma RSS R^ F(2,2244) = [0.000]** lo-lielihood DW.05 no. of observations 2247 no. of parameters 3 mean(w_rice) var(w_rice) We notice that household size has a positive and snificant impact on the budet share for food. An additional person in the household is estimated to imply a increase in the budet share on rice. This is to some extent evidence that lare households are more costly

8 Enel curve showin the relationship between total consumer expenditure and expenditure on rice.

9 to run than smaller households. Also, the coefficient on expenditure is now larer. This is probably due to multicollinearity as it is natural to assume that larer households also have larer expenditures. Finally, the fit of the model as measured by the R 2 increases substantially. It seems plausible that children and adults have different needs and hence different impacts on rice consumption. To see whether this is the case, we separate hhsize into adults and children. The new estimates are as follows: Constant Ltotexp adults children sma RSS R^ F(3,2243) = 6 [0.000]** lo-lielihood DW.05 no. of observations 2247 no. of parameters 4 mean(w_rice) var(w_rice) We notice that the budet share on rice is affected twice as much but an additional child as by an additional adult. To test whether this difference is snificant, notice that we can write the relationship as w = α + β lo y + γ children + γ 2 adults The null hypothesis is γ =γ 2, which we test versus the alternative of γ γ 2. Under the null, we can write the model as w = α + β lo y + γ (children+adults) oticin that children+adults=hhsize, a convenient way to test the hypothesis is to include hhsize and children (or adults) as explanatory variables. Under the null, the coefficient on children should be zero, so the test of γ =γ 2 reduces to a simple test of one coefficient bein snificantly different from zero. The model we then estimate is w = α + β lo y + κhhsize + δchildren

10 We want to test H 0 : δ = 0 vs. H A : δ 0. As we have assumed the residual to be normally distributed, we now that ˆδ (δ, seδ ). As we have to estimate the variance of the residual, it follows that under the null hypothesis, T = ˆδ se δ t (2247 4) We chose a 5% level of snificance. We then reject the null hypothesis if T >,96 (usin 8 derees of freedom to approximate 2243). The reression results are shown below: Constant Ltotexp children hhsize sma RSS R^ F(3,2243) = 6 [0.000]** lo-lielihood DW.05 no. of observations 2247 no. of parameters 4 mean(w_rice) var(w_rice) We find a value of the test statistic of T=3,4, which permits us to reject the null hypothesis at the 5% level of snificance. If the null were true, it would be a probability of 5% or less of ettin data that yields a test statistic of this manitude or above. Hence we can conclude that adults and children have snificantly different impacts on the demand behaviour. d) We want to test whether there is discrimination amon boys and irls. To do this, I will first study the effect of females and males in the different ae roups in the supplied data set. I then have a model of the form w i = α + β lo y i + γ females in + δ males in + ε i

11 where there are ae roups and γ and δ are the coefficients on the number of females and males in the roup. This reression ives the followin result: Constant Ltotexp f ft f5t f0t f5t f20t f35t f50t m mt m5t m0t m5t m20t m35t m50t sma RSS R^ F(7,2229) = [0.000]** lo-lielihood DW.07 no. of observations 2247 no. of parameters 8 mean(w_rice) var(w_rice) To test whether there are snificant differences, notice that we can rewrite the model as w i = α + β lo y i + θ females in + κ persons in + ε i.

12 Under the null hypothesis of γ = δ, this model would ive θ = 0, which we wish to test aainst an alternative hypothesis of γ δ. We can then run a reression with number of females and total number of persons in each roup, and the test of discrimination within this roup reduces to testin whether the coefficient on females is snificantly different from zero. This reression is iven below: Constant Ltotexp f ft f5t f0t f5t f20t f35t f50t c ct c5t c0t c5t c20t c35t c50t sma RSS R^ F(7,2229) = [0.000]** lo-lielihood DW.07 no. of observations 2247 no. of parameters 8 mean(w_rice) var(w_rice)

13 The variables startin with c denotes the number of people in the actual roup. We notice that except for the roups 5 to 9 and 25 to 49, we cannot reject the null hypothesis of no discrimination. Within these two roups, the coefficient on women is hher, so it seems they et more food. To ive an overall test of discrimination, we want to test a null hypothesis H 0 : γ = δ for all vs. the alternative H A : γ δ for at least one. To test this hypothesis, we consider a reression where the null is imposed, This is one where one the number of persons in each roup is included and the decomposition into sexes is excluded. Such a reression is reported below: Constant Ltotexp c ct c5t c0t c5t c20t c35t c50t sma RSS R^ F(9,2237) = [0.000]** lo-lielihood DW.04 no. of observations 2247 no. of parameters 0 mean(w_rice) var(w_rice) We now that iven the assumptions of the model, under the null hypothesis, the test statistic F = (RSS R RSS U ) /q F (q, ) RSS U / where q is the number of restrictions, here 8, the number of explanatory variables in the unconstrained reression, here 8, and RSS R and RSS U the RSSs from the restricted and

14 unrestricted reressions. At the 5% level, we reject the null hypothesis if we observe F above,94 (usin df in the denominator). Calculation yields F = ( ) / / (2247 8) = which is below the critical value. Hence we cannot reject the null hypothesis of no overall discrimination. e) Instead of looin at the demand for rice, we now want to study the demand for adult oods as this may ive a cleaner view of the total expenses on ids. Otherwise, the analysis is analoue to the one above. First, we study the effect of the number of females and males in each ae roup on demand: Constant Ltotexp f ft f5t f0t f5t f20t f35t f50t m mt m5t m0t m5t m20t m35t m50t

15 sma RSS R^ F(7,2229) = [0.000]** lo-lielihood DW.7 no. of observations 2247 no. of parameters 8 mean(w_adult) var(w_adult) o clear pattern emeres. To test whether there are snificant differences, aain we replace males by total number of persons in each roup and loo at whether the coefficient on females are snificant: Constant Ltotexp f ft f5t f0t f5t f20t f35t f50t c ct c5t c0t c5t c20t c35t c50t sma RSS

16 R^ F(7,2229) = [0.000]** lo-lielihood DW.7 no. of observations 2247 no. of parameters 8 mean(w_adult) var(w_adult) Aain, there are no snificant differences for the children as the t-values on females are below the critical value of,96 for all ae roups except 20 to 34. However, there are some sns of discrimination for adults, which may indicate that adult oods are mostly consumed by men. To test the hypothesis of overall discrimination, I aain run the constrained reression: Constant Ltotexp c ct c5t e c0t c5t c20t c35t e c50t sma RSS R^ F(9,2237) = [0.000]** lo-lielihood 3974 DW.7 no. of observations 2247 no. of parameters 0 mean(w_adult) var(w_adult) The F-test is exactly as above, and aain I reject the null if I observe a statistic above,94. ow I et F = ( ) / / (2247 8) =. 07 7,

17 which aain is under the critical value. Hence we cannot reject the null hypothesis of no discrimination. h) Finally, we want to study the effect of villae size. I now o bac to a simpler demoraphic specification where I only have adults and children, and I also study the demand for rice. I et: Constant Ltotexp children adults vilsize e e sma RSS R^ F(4,2242) = 24.3 [0.000]** lo-lielihood DW.06 no. of observations 2247 no. of parameters 5 mean(w_rice) var(w_rice) Villae size is seen to have a numerically small impact, but the coefficient is snificantly different from zero as the t-value is Hence people in larer villaes tend to have a smaller budet share on rice then people in smaller villaes, other thins equal. It seems strane that villae size in itself should have an impact on the demand for rice, so the most plausible explanation is that villae size is correlated with an omitted variable. We could imaine several, for instance that in larer villaes, the maret structure could be different, so prices differ or households demand other oods. It could also be that the linear specification of lo expenditure of adults and children is wron. If villae size is correlated with say the square of number of children, and number of children has a non-linear effect on the budet share for rice, this could explain the findin.

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 15: Examples of hypothesis tests (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 32 The recipe 2 / 32 The hypothesis testing recipe In this lecture we repeatedly apply the

More information

Statistical Inference with Regression Analysis

Statistical Inference with Regression Analysis Introductory Applied Econometrics EEP/IAS 118 Spring 2015 Steven Buck Lecture #13 Statistical Inference with Regression Analysis Next we turn to calculating confidence intervals and hypothesis testing

More information

Rockefeller College University at Albany

Rockefeller College University at Albany Rockefeller College University at Albany PAD 705 Handout: Suggested Review Problems from Pindyck & Rubinfeld Original prepared by Professor Suzanne Cooper John F. Kennedy School of Government, Harvard

More information

Unless provided with information to the contrary, assume for each question below that the Classical Linear Model assumptions hold.

Unless provided with information to the contrary, assume for each question below that the Classical Linear Model assumptions hold. Economics 345: Applied Econometrics Section A01 University of Victoria Midterm Examination #2 Version 1 SOLUTIONS Spring 2015 Instructor: Martin Farnham Unless provided with information to the contrary,

More information

An overview of applied econometrics

An overview of applied econometrics An overview of applied econometrics Jo Thori Lind September 4, 2011 1 Introduction This note is intended as a brief overview of what is necessary to read and understand journal articles with empirical

More information

ECON 497: Lecture Notes 10 Page 1 of 1

ECON 497: Lecture Notes 10 Page 1 of 1 ECON 497: Lecture Notes 10 Page 1 of 1 Metropolitan State University ECON 497: Research and Forecasting Lecture Notes 10 Heteroskedasticity Studenmund Chapter 10 We'll start with a quote from Studenmund:

More information

STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression

STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression STAT 135 Lab 11 Tests for Categorical Data (Fisher s Exact test, χ 2 tests for Homogeneity and Independence) and Linear Regression Rebecca Barter April 20, 2015 Fisher s Exact Test Fisher s Exact Test

More information

Using regression to study economic relationships is called econometrics. econo = of or pertaining to the economy. metrics = measurement

Using regression to study economic relationships is called econometrics. econo = of or pertaining to the economy. metrics = measurement EconS 450 Forecasting part 3 Forecasting with Regression Using regression to study economic relationships is called econometrics econo = of or pertaining to the economy metrics = measurement Econometrics

More information

Making sense of Econometrics: Basics

Making sense of Econometrics: Basics Making sense of Econometrics: Basics Lecture 4: Qualitative influences and Heteroskedasticity Egypt Scholars Economic Society November 1, 2014 Assignment & feedback enter classroom at http://b.socrative.com/login/student/

More information

Answers to Problem Set #4

Answers to Problem Set #4 Answers to Problem Set #4 Problems. Suppose that, from a sample of 63 observations, the least squares estimates and the corresponding estimated variance covariance matrix are given by: bβ bβ 2 bβ 3 = 2

More information

Lecture 8. Using the CLR Model. Relation between patent applications and R&D spending. Variables

Lecture 8. Using the CLR Model. Relation between patent applications and R&D spending. Variables Lecture 8. Using the CLR Model Relation between patent applications and R&D spending Variables PATENTS = No. of patents (in 000) filed RDEP = Expenditure on research&development (in billions of 99 $) The

More information

2.1 Linear regression with matrices

2.1 Linear regression with matrices 21 Linear regression with matrices The values of the independent variables are united into the matrix X (design matrix), the values of the outcome and the coefficient are represented by the vectors Y and

More information

ECON 4230 Intermediate Econometric Theory Exam

ECON 4230 Intermediate Econometric Theory Exam ECON 4230 Intermediate Econometric Theory Exam Multiple Choice (20 pts). Circle the best answer. 1. The Classical assumption of mean zero errors is satisfied if the regression model a) is linear in the

More information

Lab 07 Introduction to Econometrics

Lab 07 Introduction to Econometrics Lab 07 Introduction to Econometrics Learning outcomes for this lab: Introduce the different typologies of data and the econometric models that can be used Understand the rationale behind econometrics Understand

More information

CHAPTER 7. + ˆ δ. (1 nopc) + ˆ β1. =.157, so the new intercept is = The coefficient on nopc is.157.

CHAPTER 7. + ˆ δ. (1 nopc) + ˆ β1. =.157, so the new intercept is = The coefficient on nopc is.157. CHAPTER 7 SOLUTIONS TO PROBLEMS 7. (i) The coefficient on male is 87.75, so a man is estimated to sleep almost one and one-half hours more per week than a comparable woman. Further, t male = 87.75/34.33

More information

WISE International Masters

WISE International Masters WISE International Masters ECONOMETRICS Instructor: Brett Graham INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This examination paper contains 32 questions. You are

More information

Multiple Regression Analysis

Multiple Regression Analysis Multiple Regression Analysis y = β 0 + β 1 x 1 + β 2 x 2 +... β k x k + u 2. Inference 0 Assumptions of the Classical Linear Model (CLM)! So far, we know: 1. The mean and variance of the OLS estimators

More information

Blinder-Oaxaca Decomposition for Tobit Models

Blinder-Oaxaca Decomposition for Tobit Models Blinder-Oaxaca Decomposition for Tobit Models Thomas K. Bauer, Mathias Sinnin To cite this version: Thomas K. Bauer, Mathias Sinnin. Blinder-Oaxaca Decomposition for Tobit Models. Applied Economics, Taylor

More information

UNIVERSIDAD CARLOS III DE MADRID ECONOMETRICS Academic year 2009/10 FINAL EXAM (2nd Call) June, 25, 2010

UNIVERSIDAD CARLOS III DE MADRID ECONOMETRICS Academic year 2009/10 FINAL EXAM (2nd Call) June, 25, 2010 UNIVERSIDAD CARLOS III DE MADRID ECONOMETRICS Academic year 2009/10 FINAL EXAM (2nd Call) June, 25, 2010 Very important: Take into account that: 1. Each question, unless otherwise stated, requires a complete

More information

Lectures 5 & 6: Hypothesis Testing

Lectures 5 & 6: Hypothesis Testing Lectures 5 & 6: Hypothesis Testing in which you learn to apply the concept of statistical significance to OLS estimates, learn the concept of t values, how to use them in regression work and come across

More information

Introduction to Econometrics. Review of Probability & Statistics

Introduction to Econometrics. Review of Probability & Statistics 1 Introduction to Econometrics Review of Probability & Statistics Peerapat Wongchaiwat, Ph.D. wongchaiwat@hotmail.com Introduction 2 What is Econometrics? Econometrics consists of the application of mathematical

More information

1. You have data on years of work experience, EXPER, its square, EXPER2, years of education, EDUC, and the log of hourly wages, LWAGE

1. You have data on years of work experience, EXPER, its square, EXPER2, years of education, EDUC, and the log of hourly wages, LWAGE 1. You have data on years of work experience, EXPER, its square, EXPER, years of education, EDUC, and the log of hourly wages, LWAGE You estimate the following regressions: (1) LWAGE =.00 + 0.05*EDUC +

More information

Lecture 5 Processing microarray data

Lecture 5 Processing microarray data Lecture 5 Processin microarray data (1)Transform the data into a scale suitable for analysis ()Remove the effects of systematic and obfuscatin sources of variation (3)Identify discrepant observations Preprocessin

More information

PHY 133 Lab 1 - The Pendulum

PHY 133 Lab 1 - The Pendulum 3/20/2017 PHY 133 Lab 1 The Pendulum [Stony Brook Physics Laboratory Manuals] Stony Brook Physics Laboratory Manuals PHY 133 Lab 1 - The Pendulum The purpose of this lab is to measure the period of a simple

More information

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies

The t-test: A z-score for a sample mean tells us where in the distribution the particular mean lies The t-test: So Far: Sampling distribution benefit is that even if the original population is not normal, a sampling distribution based on this population will be normal (for sample size > 30). Benefit

More information

2 Prediction and Analysis of Variance

2 Prediction and Analysis of Variance 2 Prediction and Analysis of Variance Reading: Chapters and 2 of Kennedy A Guide to Econometrics Achen, Christopher H. Interpreting and Using Regression (London: Sage, 982). Chapter 4 of Andy Field, Discovering

More information

LECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity

LECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity LECTURE 10 Introduction to Econometrics Multicollinearity & Heteroskedasticity November 22, 2016 1 / 23 ON PREVIOUS LECTURES We discussed the specification of a regression equation Specification consists

More information

1 Regression with Time Series Variables

1 Regression with Time Series Variables 1 Regression with Time Series Variables With time series regression, Y might not only depend on X, but also lags of Y and lags of X Autoregressive Distributed lag (or ADL(p; q)) model has these features:

More information

OSU Economics 444: Elementary Econometrics. Ch.10 Heteroskedasticity

OSU Economics 444: Elementary Econometrics. Ch.10 Heteroskedasticity OSU Economics 444: Elementary Econometrics Ch.0 Heteroskedasticity (Pure) heteroskedasticity is caused by the error term of a correctly speciþed equation: Var(² i )=σ 2 i, i =, 2,,n, i.e., the variance

More information

1/34 3/ Omission of a relevant variable(s) Y i = α 1 + α 2 X 1i + α 3 X 2i + u 2i

1/34 3/ Omission of a relevant variable(s) Y i = α 1 + α 2 X 1i + α 3 X 2i + u 2i 1/34 Outline Basic Econometrics in Transportation Model Specification How does one go about finding the correct model? What are the consequences of specification errors? How does one detect specification

More information

Chapter 13. Multiple Regression and Model Building

Chapter 13. Multiple Regression and Model Building Chapter 13 Multiple Regression and Model Building Multiple Regression Models The General Multiple Regression Model y x x x 0 1 1 2 2... k k y is the dependent variable x, x,..., x 1 2 k the model are the

More information

Multiple Regression Analysis

Multiple Regression Analysis Multiple Regression Analysis y = 0 + 1 x 1 + x +... k x k + u 6. Heteroskedasticity What is Heteroskedasticity?! Recall the assumption of homoskedasticity implied that conditional on the explanatory variables,

More information

AVERAGE AND STANDARD DEVIATION

AVERAGE AND STANDARD DEVIATION AVERAGE AND STANDARD DEVIATION Link to: physicspaes home pae. To leave a comment or report an error, please use the auxiliary blo. Post date: 2 Jun 25. Reference: Griffiths, David J. (25), Introduction

More information

Introduction to Econometrics. Heteroskedasticity

Introduction to Econometrics. Heteroskedasticity Introduction to Econometrics Introduction Heteroskedasticity When the variance of the errors changes across segments of the population, where the segments are determined by different values for the explanatory

More information

Lecture 4: Heteroskedasticity

Lecture 4: Heteroskedasticity Lecture 4: Heteroskedasticity Econometric Methods Warsaw School of Economics (4) Heteroskedasticity 1 / 24 Outline 1 What is heteroskedasticity? 2 Testing for heteroskedasticity White Goldfeld-Quandt Breusch-Pagan

More information

Multi-sample structural equation models with mean structures, with special emphasis on assessing measurement invariance in cross-national research

Multi-sample structural equation models with mean structures, with special emphasis on assessing measurement invariance in cross-national research 1 Multi-sample structural equation models with mean structures, with special emphasis on assessin measurement invariance in cross-national research Measurement invariance measurement invariance: whether

More information

Chapter 11 Optimization with Equality Constraints

Chapter 11 Optimization with Equality Constraints Ch. - Optimization with Equalit Constraints Chapter Optimization with Equalit Constraints Albert William Tucker 95-995 arold William Kuhn 95 oseph-ouis Giuseppe odovico comte de arane 76-. General roblem

More information

Homework Set 2, ECO 311, Fall 2014

Homework Set 2, ECO 311, Fall 2014 Homework Set 2, ECO 311, Fall 2014 Due Date: At the beginning of class on October 21, 2014 Instruction: There are twelve questions. Each question is worth 2 points. You need to submit the answers of only

More information

CHAPTER 6: SPECIFICATION VARIABLES

CHAPTER 6: SPECIFICATION VARIABLES Recall, we had the following six assumptions required for the Gauss-Markov Theorem: 1. The regression model is linear, correctly specified, and has an additive error term. 2. The error term has a zero

More information

2) For a normal distribution, the skewness and kurtosis measures are as follows: A) 1.96 and 4 B) 1 and 2 C) 0 and 3 D) 0 and 0

2) For a normal distribution, the skewness and kurtosis measures are as follows: A) 1.96 and 4 B) 1 and 2 C) 0 and 3 D) 0 and 0 Introduction to Econometrics Midterm April 26, 2011 Name Student ID MULTIPLE CHOICE. Choose the one alternative that best completes the statement or answers the question. (5,000 credit for each correct

More information

Statistical Inference. Part IV. Statistical Inference

Statistical Inference. Part IV. Statistical Inference Part IV Statistical Inference As of Oct 5, 2017 Sampling Distributions of the OLS Estimator 1 Statistical Inference Sampling Distributions of the OLS Estimator Testing Against One-Sided Alternatives Two-Sided

More information

Multiple Regression. Peerapat Wongchaiwat, Ph.D.

Multiple Regression. Peerapat Wongchaiwat, Ph.D. Peerapat Wongchaiwat, Ph.D. wongchaiwat@hotmail.com The Multiple Regression Model Examine the linear relationship between 1 dependent (Y) & 2 or more independent variables (X i ) Multiple Regression Model

More information

The Standard Linear Model: Hypothesis Testing

The Standard Linear Model: Hypothesis Testing Department of Mathematics Ma 3/103 KC Border Introduction to Probability and Statistics Winter 2017 Lecture 25: The Standard Linear Model: Hypothesis Testing Relevant textbook passages: Larsen Marx [4]:

More information

AGEC 621 Lecture 16 David Bessler

AGEC 621 Lecture 16 David Bessler AGEC 621 Lecture 16 David Bessler This is a RATS output for the dummy variable problem given in GHJ page 422; the beer expenditure lecture (last time). I do not expect you to know RATS but this will give

More information

Empirical Economic Research, Part II

Empirical Economic Research, Part II Based on the text book by Ramanathan: Introductory Econometrics Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna December 7, 2011 Outline Introduction

More information

1 Independent Practice: Hypothesis tests for one parameter:

1 Independent Practice: Hypothesis tests for one parameter: 1 Independent Practice: Hypothesis tests for one parameter: Data from the Indian DHS survey from 2006 includes a measure of autonomy of the women surveyed (a scale from 0-10, 10 being the most autonomous)

More information

Next is material on matrix rank. Please see the handout

Next is material on matrix rank. Please see the handout B90.330 / C.005 NOTES for Wednesday 0.APR.7 Suppose that the model is β + ε, but ε does not have the desired variance matrix. Say that ε is normal, but Var(ε) σ W. The form of W is W w 0 0 0 0 0 0 w 0

More information

Experiment 3 The Simple Pendulum

Experiment 3 The Simple Pendulum PHY191 Fall003 Experiment 3: The Simple Pendulum 10/7/004 Pae 1 Suested Readin for this lab Experiment 3 The Simple Pendulum Read Taylor chapter 5. (You can skip section 5.6.IV if you aren't comfortable

More information

Inferences Based on Two Samples

Inferences Based on Two Samples Chapter 6 Inferences Based on Two Samples Frequently we want to use statistical techniques to compare two populations. For example, one might wish to compare the proportions of families with incomes below

More information

Applied Microeconometrics (L5): Panel Data-Basics

Applied Microeconometrics (L5): Panel Data-Basics Applied Microeconometrics (L5): Panel Data-Basics Nicholas Giannakopoulos University of Patras Department of Economics ngias@upatras.gr November 10, 2015 Nicholas Giannakopoulos (UPatras) MSc Applied Economics

More information

Final Exam - Solutions

Final Exam - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your

More information

2 Regression Analysis

2 Regression Analysis FORK 1002 Preparatory Course in Statistics: 2 Regression Analysis Genaro Sucarrat (BI) http://www.sucarrat.net/ Contents: 1 Bivariate Correlation Analysis 2 Simple Regression 3 Estimation and Fit 4 T -Test:

More information

ECONOMETRIC THEORY. MODULE VI Lecture 19 Regression Analysis Under Linear Restrictions

ECONOMETRIC THEORY. MODULE VI Lecture 19 Regression Analysis Under Linear Restrictions ECONOMETRIC THEORY MODULE VI Lecture 9 Regression Analysis Under Linear Restrictions Dr Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur One of the basic objectives

More information

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018 Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate

More information

41903: Group-Based Inference

41903: Group-Based Inference 41903: Clusterin and Fama-MacBeth Notes 4 Dependent Data in Economics Many real-world data sets plausibly exhibit complicated dependence structures Consider areate panel data model: y it = x itβ + α i

More information

ECON 497 Midterm Spring

ECON 497 Midterm Spring ECON 497 Midterm Spring 2009 1 ECON 497: Economic Research and Forecasting Name: Spring 2009 Bellas Midterm You have three hours and twenty minutes to complete this exam. Answer all questions and explain

More information

Lecture 1: OLS derivations and inference

Lecture 1: OLS derivations and inference Lecture 1: OLS derivations and inference Econometric Methods Warsaw School of Economics (1) OLS 1 / 43 Outline 1 Introduction Course information Econometrics: a reminder Preliminary data exploration 2

More information

Asymptotic Behavior of a t Test Robust to Cluster Heterogeneity

Asymptotic Behavior of a t Test Robust to Cluster Heterogeneity Asymptotic Behavior of a t est Robust to Cluster Heteroeneity Andrew V. Carter Department of Statistics University of California, Santa Barbara Kevin. Schnepel and Doulas G. Steierwald Department of Economics

More information

Homoskedasticity. Var (u X) = σ 2. (23)

Homoskedasticity. Var (u X) = σ 2. (23) Homoskedasticity How big is the difference between the OLS estimator and the true parameter? To answer this question, we make an additional assumption called homoskedasticity: Var (u X) = σ 2. (23) This

More information

FinQuiz Notes

FinQuiz Notes Reading 10 Multiple Regression and Issues in Regression Analysis 2. MULTIPLE LINEAR REGRESSION Multiple linear regression is a method used to model the linear relationship between a dependent variable

More information

Disclaimer: This lab write-up is not

Disclaimer: This lab write-up is not Disclaimer: This lab write-up is not to be copied, in whole or in part, unless a proper reference is made as to the source. (It is stronly recommended that you use this document only to enerate ideas,

More information

Sociology 593 Exam 1 Answer Key February 17, 1995

Sociology 593 Exam 1 Answer Key February 17, 1995 Sociology 593 Exam 1 Answer Key February 17, 1995 I. True-False. (5 points) Indicate whether the following statements are true or false. If false, briefly explain why. 1. A researcher regressed Y on. When

More information

Motivation for multiple regression

Motivation for multiple regression Motivation for multiple regression 1. Simple regression puts all factors other than X in u, and treats them as unobserved. Effectively the simple regression does not account for other factors. 2. The slope

More information

Demand analysis is one of the rst topics come to in economics. Very important especially in the Keynesian paradigm.

Demand analysis is one of the rst topics come to in economics. Very important especially in the Keynesian paradigm. 1 Demand Analysis Demand analysis is one of the rst topics come to in economics. Very important especially in the Keynesian paradigm. Very important for companies: mainstay of consultancies As have seen

More information

Multiple Linear Regression

Multiple Linear Regression Multiple Linear Regression Simple linear regression tries to fit a simple line between two variables Y and X. If X is linearly related to Y this explains some of the variability in Y. In most cases, there

More information

Econ107 Applied Econometrics

Econ107 Applied Econometrics Econ107 Applied Econometrics Topics 2-4: discussed under the classical Assumptions 1-6 (or 1-7 when normality is needed for finite-sample inference) Question: what if some of the classical assumptions

More information

The regression model with one fixed regressor cont d

The regression model with one fixed regressor cont d The regression model with one fixed regressor cont d 3150/4150 Lecture 4 Ragnar Nymoen 27 January 2012 The model with transformed variables Regression with transformed variables I References HGL Ch 2.8

More information

Inference in Regression Analysis

Inference in Regression Analysis ECNS 561 Inference Inference in Regression Analysis Up to this point 1.) OLS is unbiased 2.) OLS is BLUE (best linear unbiased estimator i.e., the variance is smallest among linear unbiased estimators)

More information

UNIVERSIDAD CARLOS III DE MADRID ECONOMETRICS FINAL EXAM (Type B) 2. This document is self contained. Your are not allowed to use any other material.

UNIVERSIDAD CARLOS III DE MADRID ECONOMETRICS FINAL EXAM (Type B) 2. This document is self contained. Your are not allowed to use any other material. DURATION: 125 MINUTES Directions: UNIVERSIDAD CARLOS III DE MADRID ECONOMETRICS FINAL EXAM (Type B) 1. This is an example of a exam that you can use to self-evaluate about the contents of the course Econometrics

More information

y ˆ i = ˆ " T u i ( i th fitted value or i th fit)

y ˆ i = ˆ  T u i ( i th fitted value or i th fit) 1 2 INFERENCE FOR MULTIPLE LINEAR REGRESSION Recall Terminology: p predictors x 1, x 2,, x p Some might be indicator variables for categorical variables) k-1 non-constant terms u 1, u 2,, u k-1 Each u

More information

Inference in Regression Model

Inference in Regression Model Inference in Regression Model Christopher Taber Department of Economics University of Wisconsin-Madison March 25, 2009 Outline 1 Final Step of Classical Linear Regression Model 2 Confidence Intervals 3

More information

ACE 564 Spring Lecture 8. Violations of Basic Assumptions I: Multicollinearity and Non-Sample Information. by Professor Scott H.

ACE 564 Spring Lecture 8. Violations of Basic Assumptions I: Multicollinearity and Non-Sample Information. by Professor Scott H. ACE 564 Spring 2006 Lecture 8 Violations of Basic Assumptions I: Multicollinearity and Non-Sample Information by Professor Scott H. Irwin Readings: Griffiths, Hill and Judge. "Collinear Economic Variables,

More information

Second Midterm Exam Economics 410 Thurs., April 2, 2009

Second Midterm Exam Economics 410 Thurs., April 2, 2009 Second Midterm Exam Economics 410 Thurs., April 2, 2009 Show All Work. Only partial credit will be given for correct answers if we can not figure out how they were derived. Note that we have not put equal

More information

STA 431s17 Assignment Eight 1

STA 431s17 Assignment Eight 1 STA 43s7 Assignment Eight The first three questions of this assignment are about how instrumental variables can help with measurement error and omitted variables at the same time; see Lecture slide set

More information

Applied Quantitative Methods II

Applied Quantitative Methods II Applied Quantitative Methods II Lecture 4: OLS and Statistics revision Klára Kaĺıšková Klára Kaĺıšková AQM II - Lecture 4 VŠE, SS 2016/17 1 / 68 Outline 1 Econometric analysis Properties of an estimator

More information

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables Applied Econometrics (MSc.) Lecture 3 Instrumental Variables Estimation - Theory Department of Economics University of Gothenburg December 4, 2014 1/28 Why IV estimation? So far, in OLS, we assumed independence.

More information

Interpreting Regression Results

Interpreting Regression Results Interpreting Regression Results Carlo Favero Favero () Interpreting Regression Results 1 / 42 Interpreting Regression Results Interpreting regression results is not a simple exercise. We propose to split

More information

Problem #1 #2 #3 #4 #5 #6 Total Points /6 /8 /14 /10 /8 /10 /56

Problem #1 #2 #3 #4 #5 #6 Total Points /6 /8 /14 /10 /8 /10 /56 STAT 391 - Spring Quarter 2017 - Midterm 1 - April 27, 2017 Name: Student ID Number: Problem #1 #2 #3 #4 #5 #6 Total Points /6 /8 /14 /10 /8 /10 /56 Directions. Read directions carefully and show all your

More information

Business Statistics. Lecture 10: Correlation and Linear Regression

Business Statistics. Lecture 10: Correlation and Linear Regression Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form

More information

Incentives and Nutrition for Rotten Kids: Intrahousehold Food Allocation in the Philippines

Incentives and Nutrition for Rotten Kids: Intrahousehold Food Allocation in the Philippines Incentives and Nutrition for Rotten Kids: Intrahousehold Food Allocation in the Philippines Pierre Dubois and Ethan Ligon presented by Rachel Heath November 3, 2006 Introduction Outline Introduction Modification

More information

Contest Quiz 3. Question Sheet. In this quiz we will review concepts of linear regression covered in lecture 2.

Contest Quiz 3. Question Sheet. In this quiz we will review concepts of linear regression covered in lecture 2. Updated: November 17, 2011 Lecturer: Thilo Klein Contact: tk375@cam.ac.uk Contest Quiz 3 Question Sheet In this quiz we will review concepts of linear regression covered in lecture 2. NOTE: Please round

More information

Phase Diagrams: construction and comparative statics

Phase Diagrams: construction and comparative statics 1 / 11 Phase Diarams: construction and comparative statics November 13, 215 Alecos Papadopoulos PhD Candidate Department of Economics, Athens University of Economics and Business papadopalex@aueb.r, https://alecospapadopoulos.wordpress.com

More information

Day 4: Shrinkage Estimators

Day 4: Shrinkage Estimators Day 4: Shrinkage Estimators Kenneth Benoit Data Mining and Statistical Learning March 9, 2015 n versus p (aka k) Classical regression framework: n > p. Without this inequality, the OLS coefficients have

More information

Correlation 1. December 4, HMS, 2017, v1.1

Correlation 1. December 4, HMS, 2017, v1.1 Correlation 1 December 4, 2017 1 HMS, 2017, v1.1 Chapter References Diez: Chapter 7 Navidi, Chapter 7 I don t expect you to learn the proofs what will follow. Chapter References 2 Correlation The sample

More information

1 Multiple Regression

1 Multiple Regression 1 Multiple Regression In this section, we extend the linear model to the case of several quantitative explanatory variables. There are many issues involved in this problem and this section serves only

More information

Computer Exercise 3 Answers Hypothesis Testing

Computer Exercise 3 Answers Hypothesis Testing Computer Exercise 3 Answers Hypothesis Testing. reg lnhpay xper yearsed tenure ---------+------------------------------ F( 3, 6221) = 512.58 Model 457.732594 3 152.577531 Residual 1851.79026 6221.297667619

More information

Econometrics Problem Set 6

Econometrics Problem Set 6 Econometrics Problem Set 6 WISE, Xiamen University Spring 2016-17 Conceptual Questions 1. This question refers to the estimated regressions shown in Table 1 computed using data for 1988 from the CPS. The

More information

Brief Suggested Solutions

Brief Suggested Solutions DEPARTMENT OF ECONOMICS UNIVERSITY OF VICTORIA ECONOMICS 366: ECONOMETRICS II SPRING TERM 5: ASSIGNMENT TWO Brief Suggested Solutions Question One: Consider the classical T-observation, K-regressor linear

More information

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS Postponed exam: ECON4160 Econometrics Modeling and systems estimation Date of exam: Wednesday, January 8, 2014 Time for exam: 09:00 a.m. 12:00 noon The problem

More information

Föreläsning /31

Föreläsning /31 1/31 Föreläsning 10 090420 Chapter 13 Econometric Modeling: Model Speci cation and Diagnostic testing 2/31 Types of speci cation errors Consider the following models: Y i = β 1 + β 2 X i + β 3 X 2 i +

More information

Linearized optimal power flow

Linearized optimal power flow Linearized optimal power flow. Some introductory comments The advantae of the economic dispatch formulation to obtain minimum cost allocation of demand to the eneration units is that it is computationally

More information

Graduate Econometrics Lecture 4: Heteroskedasticity

Graduate Econometrics Lecture 4: Heteroskedasticity Graduate Econometrics Lecture 4: Heteroskedasticity Department of Economics University of Gothenburg November 30, 2014 1/43 and Autocorrelation Consequences for OLS Estimator Begin from the linear model

More information

The Linear Regression Model

The Linear Regression Model The Linear Regression Model Carlo Favero Favero () The Linear Regression Model 1 / 67 OLS To illustrate how estimation can be performed to derive conditional expectations, consider the following general

More information

Habits and Multiple Equilibria

Habits and Multiple Equilibria Habits and Multiple Equilibria Lorenz Kuen and Eveny Yakovlev February 5, 2018 Abstract Abstract TBA. A Introduction Brief introduction TBA B A Structural Model of Taste Chanes Several structural models

More information

Multiple Regression Analysis: Inference MULTIPLE REGRESSION ANALYSIS: INFERENCE. Sampling Distributions of OLS Estimators

Multiple Regression Analysis: Inference MULTIPLE REGRESSION ANALYSIS: INFERENCE. Sampling Distributions of OLS Estimators 1 2 Multiple Regression Analysis: Inference MULTIPLE REGRESSION ANALYSIS: INFERENCE Hüseyin Taştan 1 1 Yıldız Technical University Department of Economics These presentation notes are based on Introductory

More information

Notes on Heterogeneity, Aggregation, and Market Wage Functions: An Empirical Model of Self-Selection in the Labor Market

Notes on Heterogeneity, Aggregation, and Market Wage Functions: An Empirical Model of Self-Selection in the Labor Market Notes on Heterogeneity, Aggregation, and Market Wage Functions: An Empirical Model of Self-Selection in the Labor Market Heckman and Sedlacek, JPE 1985, 93(6), 1077-1125 James Heckman University of Chicago

More information

Econometrics Part Three

Econometrics Part Three !1 I. Heteroskedasticity A. Definition 1. The variance of the error term is correlated with one of the explanatory variables 2. Example -- the variance of actual spending around the consumption line increases

More information

Intermediate Econometrics

Intermediate Econometrics Intermediate Econometrics Heteroskedasticity Text: Wooldridge, 8 July 17, 2011 Heteroskedasticity Assumption of homoskedasticity, Var(u i x i1,..., x ik ) = E(u 2 i x i1,..., x ik ) = σ 2. That is, the

More information

Econometrics Homework 1

Econometrics Homework 1 Econometrics Homework Due Date: March, 24. by This problem set includes questions for Lecture -4 covered before midterm exam. Question Let z be a random column vector of size 3 : z = @ (a) Write out z

More information

One-Way ANOVA. Some examples of when ANOVA would be appropriate include:

One-Way ANOVA. Some examples of when ANOVA would be appropriate include: One-Way ANOVA 1. Purpose Analysis of variance (ANOVA) is used when one wishes to determine whether two or more groups (e.g., classes A, B, and C) differ on some outcome of interest (e.g., an achievement

More information