Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Similar documents
Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10)

e i is a random error

x i1 =1 for all i (the constant ).

Chapter 11: Simple Linear Regression and Correlation

Properties of Least Squares

Outline. Zero Conditional mean. I. Motivation. 3. Multiple Regression Analysis: Estimation. Read Wooldridge (2013), Chapter 3.

Chapter 3. Two-Variable Regression Model: The Problem of Estimation

Lecture 4 Hypothesis Testing

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

Statistics for Economics & Business

Basically, if you have a dummy dependent variable you will be estimating a probability.

Economics 130. Lecture 4 Simple Linear Regression Continued

The Ordinary Least Squares (OLS) Estimator

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

Econometrics of Panel Data

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

A Comparative Study for Estimation Parameters in Panel Data Model

β0 + β1xi and want to estimate the unknown

Now we relax this assumption and allow that the error variance depends on the independent variables, i.e., heteroskedasticity

Systems of Equations (SUR, GMM, and 3SLS)

Comparison of Regression Lines

Primer on High-Order Moment Estimators

Linear Regression Analysis: Terminology and Notation

Chapter 7 Generalized and Weighted Least Squares Estimation. In this method, the deviation between the observed and expected values of

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Chapter 8 Indicator Variables

Chapter 5: Hypothesis Tests, Confidence Intervals & Gauss-Markov Result

III. Econometric Methodology Regression Analysis

Exam. Econometrics - Exam 1

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

January Examinations 2015

Introduction to Regression

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

STAT 3008 Applied Regression Analysis

Limited Dependent Variables

Chapter 14 Simple Linear Regression

Lecture 3 Stat102, Spring 2007

Statistics Chapter 4

j) = 1 (note sigma notation) ii. Continuous random variable (e.g. Normal distribution) 1. density function: f ( x) 0 and f ( x) dx = 1

Chapter 15 Student Lecture Notes 15-1

Chapter 13: Multiple Regression

Basic Business Statistics, 10/e

Durban Watson for Testing the Lack-of-Fit of Polynomial Regression Models without Replications

Lecture 16 Statistical Analysis in Biomaterials Research (Part II)

Maximum Likelihood Estimation of Binary Dependent Variables Models: Probit and Logit. 1. General Formulation of Binary Dependent Variables Models

Statistics for Business and Economics

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes

Lecture 3 Specification

Empirical Methods for Corporate Finance. Identification

However, since P is a symmetric idempotent matrix, of P are either 0 or 1 [Eigen-values

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

x = , so that calculated

a. (All your answers should be in the letter!

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Linear Approximation with Regularization and Moving Least Squares

THE ROYAL STATISTICAL SOCIETY 2006 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE

Continuous vs. Discrete Goods

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

Chapter 4: Regression With One Regressor

Asymptotic Properties of the Jarque-Bera Test for Normality in General Autoregressions with a Deterministic Term

β0 + β1xi. You are interested in estimating the unknown parameters β

β0 + β1xi. You are interested in estimating the unknown parameters β

Correlation and Regression

[ ] λ λ λ. Multicollinearity. multicollinearity Ragnar Frisch (1934) perfect exact. collinearity. multicollinearity. exact

Problem of Estimation. Ordinary Least Squares (OLS) Ordinary Least Squares Method. Basic Econometrics in Transportation. Bivariate Regression Analysis

Introduction to Dummy Variable Regressors. 1. An Example of Dummy Variable Regressors

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

princeton univ. F 13 cos 521: Advanced Algorithm Design Lecture 3: Large deviations bounds and applications Lecturer: Sanjeev Arora

Notes on Frequency Estimation in Data Streams

/ n ) are compared. The logic is: if the two

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

28. SIMPLE LINEAR REGRESSION III

Negative Binomial Regression

STAT 511 FINAL EXAM NAME Spring 2001

Professor Chris Murray. Midterm Exam

A Monte Carlo Study for Swamy s Estimate of Random Coefficient Panel Data Model

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X).

Kernel Methods and SVMs Extension

Testing for seasonal unit roots in heterogeneous panels

Computing MLE Bias Empirically

This column is a continuation of our previous column

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

Inner Product. Euclidean Space. Orthonormal Basis. Orthogonal

LECTURE 9 CANONICAL CORRELATION ANALYSIS

Tests of Single Linear Coefficient Restrictions: t-tests and F-tests. 1. Basic Rules. 2. Testing Single Linear Coefficient Restrictions

18. SIMPLE LINEAR REGRESSION III

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE)

Estimation: Part 2. Chapter GREG estimation

Fall 2012 Analysis of Experimental Measurements B. Eisenstein/rev. S. Errede

RELIABILITY ASSESSMENT

Transcription:

I. Classcal Assumptons Econ7 Appled Econometrcs Topc 3: Classcal Model (Studenmund, Chapter 4) We have defned OLS and studed some algebrac propertes of OLS. In ths topc we wll study statstcal propertes of OLS estmators, whch help justfy the use of OLS and also help to make statstcal nference. To study statstcal propertes of OLS estmators, we need to mpose a set of assumptons, most of them on the error term. When the frst 5 assumptons are held, we call the error term a classcal error term. If Assumpton 7 s added, t s called a classcal normal error term. Assumpton : No specfcaton error n the model. That s, the regresson model s lnear n the coeffcents, s correctly specfed (has the correct varables and functonal forms), has no measurement error, and has an addtve error term. Y = K K ε Assumpton : Dsturbances have zero mean. The expected value of the dsturbance term s zero. E( ε ) = Assumpton 3: All the ndependent varables are uncorrelated wth the error term (we say ndependent varables are exogenous). That s, for all =,,n, j=,,k, Cov( ε, j )= E[( ε - E( ε ))( j - E( j ))] = E[ ε ( j - E( j ))] = E( ε ) - E( ε )E( )= E( ε )= j j j Assumpton 4: No autocorrelaton between the errors. Gven any values of and j (where j), the correlaton between ε and ε j s zero. Cov( ε, ε j, )= E( ε ε, j) =, j j, If ths s true for tme seres data, we say errors are serally uncorrelated. Cross-sectonal data have less problem of correlaton, but there are exceptons. j

Assumpton 5: Errors are homoskedastc. Gven the value of, the varance of the dsturbance s the same for all observatons. Var( ε )= E[ ε - E( ε ) ] = E( ε )= σ When ths assumpton does not hold, the error s sad to be heteroskedastc. Heteroskedastcty s tradtonally beleved to be an ssue for cross-sectonal data. However, t may well be a problem n the tme seres context. Assumpton 6: No 'perfect multcollnearty' between ndependent varables. That s, no explanatory varable can be wrtten as a lnear functon of other explanatory varables, e.g., the followng equaton cannot hold = α α K K If the above equaton holds, there s perfect multcollnearty n explanatory varables. When K=, we say and are perfectly collnear. Suppose we have a 3-varable regresson model, Y = ε If there s an exact lnear relatonshp between the ndependent varables: = α for some parameter α Substtute t nto orgnal regresson: Y = = α ( α ) = γ ε ε ε Ths reduces to -varable regresson. Sngle slope coeffcent. Can't dentfy separate effects of the ndependent varables. Assumpton 7: Errors are normally dstrbuted. Combned wth Assumptons, 4 & 5, we have ε ~ N(, σ ). Ths assumpton s mportant f we have a small sample, otherwse t s not mportant.

Page 3 II. Further Detals about Assumptons Assumpton : No specfcaton error.. Lnearty may be n dsguse. Consder the followng model Y = e e ε where e s the exponental. Ths model looks nonlnear, but can be transformed nto a lnear form by takng a log both sde ln( Y ) = ln( ) ε whch s lnear n s wth ln(y) and ln() as dependent and ndependent varables, respectvely. Note that the followng models are all lnear: ln( Y ) = ε Y = ln( ) ε Y = ε Y = ε. Correct specfcaton also requres all relevant explanatory varables to be taken n account. If the true model s Y = ε But you estmate the model Y = ε Then your model s msspecfed. We call ths mssng relevant varable problem. However, f you estmate the model Y = 3 3 ε

Page 4 Then your model s also msspecfed. We call ths ncludng rrelevant varable problem. 3. In the case where you estmate the model wth a dfferent functonal form, eg, Y = ε you also commt a specfcaton error. Assumpton : Error has zero mean. Ths s a rather weak assumpton. What would happen f you estmate the followng model Y = ε, E( ε ) = δ * We can ntroduce another error term ε so that * ε = ε δ As a result, the new error term has zero mean and the model becomes Y = ( δ ε ) * It mples that the estmated parameters are δ and. In other word, f the parameter of nterest s, and a constant ntercept s ncluded n the model, Assumpton s automatcally satsfed. Assumpton 3: Explanatory varables are exogenous. What mght cause the volaton of the assumpton of E( ε k ) = for some k? We call ths problem of the volaton of exogenety endogenety or smultanety. Exogenety occurs when the explanatory varable s determned ndependently of the error term, that s, outsde of the model. Ths assumpton s automatcally satsfed f s non-stochastc. However, f both ndependent and dependent varables are smultaneously determned n the model, we have the endogenety problem. Let s use the followng example to llustrate how the exogenety assumpton s volated. (note that the example used n the textbook s not very clear.)

Q d Page 5 = α P ε where Qd s the quantty of demand for a good and P s the prce. Ths model s a demand functon and n ths model P cannot be exogenous. Ths s because P cannot be determned outsde of the model, e, Q d and P are smultaneously determned wthn the model. To see ths, we have to examne how the prce s determned: Q Q s d = P ε = Q = Q s The frst of these two equatons s the supply functon. The second s the equlbrum. Solvng all three equatons for P, we have ε ε P = α Snce P s a functon ofε, P must be correlated wth ε. Hence the exogenety assumpton s volated. Assumptons, 4, 5, and 6 wll be dscussed n subsequent Topcs. Assumpton 7: Normalty. Ths assumpton s often justfed accordng to the Central Lmt Theorem. Central Lmt Theorem: The mean (or sum) of a large number of ndependent and dentcally dstrbuted (d) random wll tend to be a normal dstrbuton, regardless f ther dstrbuton, f the number of such varables s large enough. III. Samplng Dstrbuton of ˆ Need to know somethng about the precson of our least-squares estmators. The dea s that these coeffcent estmates are a functon of the sample data. They wll vary from sample to sample. We care about the relablty of our estmates of the coeffcents. For the SLR model, under Assumptons -6, OLS estmators are unbased, that s,

Page 6 E ( ˆ ˆ =. ) =, E( ) Also we have the formulae for the varance and standard devatons of the OLS estmators (proofs are avalable n more advanced econometrc textbooks). For the estmated ntercept: Var( ˆ )= Σ n Σ x σ σ( ˆ )= Σ n Σ x σ For the estmated slope coeffcent: ˆ σ Var( )= Σ x σ( ˆ )= σ Σ x The varance of the error term can be estmated wth: Σ where n- s the number of degrees of freedom (df). The square root s known as the Standard Error of the Estmate (or Regresson). It s a common summary statstc from regresson analyss. e σˆ = n - σˆ = Σ e n - Fnally, note the non-zero covarance between the two coeffcent estmates. Snce the varance s always postve, the nature of the covarance depends on the mean of.

Page 7 Cov( ˆ, ˆ )= - Var( ˆ )= - σ Σ x If, n addton, Assumpton 7 s also satsfed, we have the samplng dstrbuton for : ˆ ˆ ~ N(, Var( ˆ ) ) ~ N(, Var( ˆ ) ) The same deas apply to MLR models. However, the expressons are more complcated usng non-matrx notaton. IV. Statstcal Propertes of OLS Estmators: Gauss-Markov Theorem Gauss-Markov Theorem: Under Assumptons -6, the OLS estmators, n the class of unbased lnear estmators, have mnmum varance (.e., they re the best lnear unbased estmator (BLUE)). Won t prove ths proposton. The estmated coeffcents and varance of the dsturbances are unbased: E( ˆ )= E( ˆ )= E( ˆ σ )= σ On average, we ll get the estmated coeffcent and varance of the dsturbances rght n repeated samplng. The varance of the estmators of ˆ and ˆ are smaller than those from any other lnear unbased estmator. They have mnmum varance. Remarks: ) The Gauss-Markov theorem does not say that OLS estmates are normally dstrbuted (and t does not depend on Assumpton 7). ) But f Assumpton 7 s met, the OLS estmates are normally dstrbuted. 3) If the errors are not normal, OLS s approxmately normal provded the sample sze s large enough. Ths s due to the Central Lmt Theorem. Consstency: OLS estmator s also consstent. That s as n goes to nfnty, ˆ and ˆ n probablty. Why?

Page 8 V. Dscusson Questons: Q4.3, Q4.4, Q4.7 VI. Learnng the Notatons n Table 4.. VII. Computng Exercses: Monte Carlo