Problem of Estimation. Ordinary Least Squares (OLS) Ordinary Least Squares Method. Basic Econometrics in Transportation. Bivariate Regression Analysis

Similar documents
Chapter 3. Two-Variable Regression Model: The Problem of Estimation

Statistics for Economics & Business

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

Chapter 11: Simple Linear Regression and Correlation

Properties of Least Squares

Economics 130. Lecture 4 Simple Linear Regression Continued

Basic Business Statistics, 10/e

Linear Regression Analysis: Terminology and Notation

Lecture 4 Hypothesis Testing

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

x i1 =1 for all i (the constant ).

STAT 3008 Applied Regression Analysis

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 14 Simple Linear Regression

Comparison of Regression Lines

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

The Ordinary Least Squares (OLS) Estimator

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Chapter 13: Multiple Regression

Statistics for Business and Economics

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Lecture 6: Introduction to Linear Regression

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

e i is a random error

18. SIMPLE LINEAR REGRESSION III

28. SIMPLE LINEAR REGRESSION III

a. (All your answers should be in the letter!

/ n ) are compared. The logic is: if the two

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10)

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

Introduction to Regression

III. Econometric Methodology Regression Analysis

First Year Examination Department of Statistics, University of Florida

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

Outline. Zero Conditional mean. I. Motivation. 3. Multiple Regression Analysis: Estimation. Read Wooldridge (2013), Chapter 3.

Negative Binomial Regression

x = , so that calculated

Correlation and Regression

Chapter 9: Statistical Inference and the Relationship between Two Variables

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Psychology 282 Lecture #24 Outline Regression Diagnostics: Outliers

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the

Professor Chris Murray. Midterm Exam

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε

Answers Problem Set 2 Chem 314A Williamsen Spring 2000

January Examinations 2015

Statistics Chapter 4

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

Diagnostics in Poisson Regression. Models - Residual Analysis

CHAPER 11: HETEROSCEDASTICITY: WHAT HAPPENS WHEN ERROR VARIANCE IS NONCONSTANT?

Chapter 5: Hypothesis Tests, Confidence Intervals & Gauss-Markov Result

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

Statistics MINITAB - Lab 2

Lecture 3 Stat102, Spring 2007

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes

Learning Objectives for Chapter 11

Statistics II Final Exam 26/6/18

See Book Chapter 11 2 nd Edition (Chapter 10 1 st Edition)

Exam. Econometrics - Exam 1

Continuous vs. Discrete Goods

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

Biostatistics 360 F&t Tests and Intervals in Regression 1

LOGIT ANALYSIS. A.K. VASISHT Indian Agricultural Statistics Research Institute, Library Avenue, New Delhi

Chapter 15 Student Lecture Notes 15-1

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management

ANSWERS. Problem 1. and the moment generating function (mgf) by. defined for any real t. Use this to show that E( U) var( U)

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

x yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting.

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

STAT 511 FINAL EXAM NAME Spring 2001

F8: Heteroscedasticity

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours

17 - LINEAR REGRESSION II

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

7.1. Single classification analysis of variance (ANOVA) Why not use multiple 2-sample 2. When to use ANOVA

ANSWERS CHAPTER 9. TIO 9.2: If the values are the same, the difference is 0, therefore the null hypothesis cannot be rejected.

Now we relax this assumption and allow that the error variance depends on the independent variables, i.e., heteroskedasticity

Chapter 4: Regression With One Regressor

Basically, if you have a dummy dependent variable you will be estimating a probability.

β0 + β1xi and want to estimate the unknown

Unit 10: Simple Linear Regression and Correlation

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

SIMPLE LINEAR REGRESSION

STAT 405 BIOSTATISTICS (Fall 2016) Handout 15 Introduction to Logistic Regression

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

CHAPTER 8. Exercise Solutions

Midterm Examination. Regression and Forecasting Models

A Comparative Study for Estimation Parameters in Panel Data Model

The SAS program I used to obtain the analyses for my answers is given below.

Chapter 7 Generalized and Weighted Least Squares Estimation. In this method, the deviation between the observed and expected values of

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE)

STATISTICS QUESTIONS. Step by Step Solutions.

Module Contact: Dr Susan Long, ECO Copyright of the University of East Anglia Version 1

Transcription:

1/60 Problem of Estmaton Basc Econometrcs n Transportaton Bvarate Regresson Analyss Amr Samm Cvl Engneerng Department Sharf Unversty of Technology Ordnary Least Squares (OLS) Maxmum Lkelhood (ML) Generally, OLS s used extensvely n regresson analyss: It s ntutvely appealng Mathematcally much smpler than the MLE. Both methods generally gve smlar results n the lnear regresson context. Prmary Source: Basc Econometrcs (Gujarat) 2/60 3/60 Ordnary Least Squares Method To odetermne e PRF: Y 1 2 Populaton Regresson Functon We estmate t from the SRF: Sample Regresson Functon X u We would lke to determne the SRF n a way that t s as close as possble to the actual Y. Y ˆ ˆ 1 2 X.e. sum of the resduals û s as small as possble. uˆ Ordnary Least Squares Method 2 uˆ f ( ˆ, ˆ 1 2) OLS fnds unque estmates of β 1 and β 2 that gve the smallest possble value of the functon above. ˆ 2 ˆ Y ˆ 2 X 1 ( X X )( Y Y ) 2 ( X X ) x y 2 x Why square? More weght to large resduals. Sgn of the resduals. Devaton form: y Y Y x X X

4/60 5/60 OLS Propertes: Sample Mean OLS Propertes: Lnearty Regresson lne passes through the sample mean Lnearty It s a lnear functon (a weghted average) of Y. X and thus k are nonstochastc and: Note: 6/60 7/60 OLS Propertes: Unbasedness OLS Propertes: Mean of Estmated Y Unbasedness ess Mean value of the estmated Y s equal to the mean value of the actual Y: Sum both sdes over the sample values and dvde by sample sze

8/60 9/60 OLS Propertes: Mean of Resduals OLS Propertes: Uncorrelated Resduals Y The mean value of the resduals s zero: The resduals are uncorrelated wth the predcted Y : 10/60 11/60 OLS Propertes: Uncorrelated Resduals X OLS Assumptons The resduals s are uncorrelated wth X : Our objectve s not only to estmate some coeffcents c e but also to draw nferences about the true coeffcents. Thus certan assumptons are made about the manner n whch Y are generated. Y = β 1 + β 2 X + u Unless we are specfc about how X and u are created, there s no way we can make any statstcal nference about the Y, β 1 and β 2. The Gaussan standard, or classcal lnear regresson model (CLRM), makes 10 assumptons that are extremely crtcal to the vald nterpretaton of the regresson estmates.

12/60 13/60 Assumpton 1 Assumpton 2 The regresson model s lnear n the parameters. E(Y X ) = β 1 + β 2 X 2 s a lnear model. E(Y X ) = β 1 + β 22 X s not a lnear model. X values are fxed n repeated samplng. X s assumed to be non-stochastc. Our regresson analyss s condtonal regresson analyss, that s, condtonal on the gven values of the regressor(s) X. 14/60 15/60 Assumpton 3 Assumpton 4 Zero mean value of dsturbance u E(u X ) = 0 Homoscedastcty oscedast c ty or equal varance a of u Factors not explctly ncluded n the model, and therefore subsumed n u, do not systematcally affect the mean value of Y.

16/60 17/60 Assumpton 4 Assumpton 5 Heteroscedastcty No autocorrelaton (seral correlaton) between the dsturbances. 18/60 19/60 Assumpton 6 Assumpton 7 Dsturbance u and dexplanatory ato varable ab X are uncorrelated. n must be greater than the number of explanatory ato varables. ab Obvously, we need at least two pars of observatons to estmate the two unknowns! If X and u are correlated, ther ndvdual effects on Y may not be assessed.

20/60 21/60 Assumpton 8 Assumpton 9 Varablty n X values. Mathematcally, f all the X values are dentcal, t mpossble to estmate β 2 (the denomnator wll be zero) and therefore β 1. Intutvely, t s obvous as well. The regresson model s correctly specfed. Important questons that arse n the specfcaton of a model: What varables should be ncluded n the model? What s the functonal form of the model? Is t lnear n the parameters, the varables, or both? What are the probablstc assumptons made about the Y, the X, and the u enterng the model? 22/60 23/60 Assumpton 10 Assumptons There e s no perfect ect multcollnearty. ty. No perfect lnear relatonshps among the explanatory varables. Wll be further dscussed n multple regresson models. How realstc are all these assumptons? s? We make certan assumptons because they facltate the study, not because they are realstc. Consequences of volaton of CLRM assumptons wll be examned later. We wll look nto: Precson of OLS estmates, and Statstcal propertes of OLS.

24/60 25/60 Precson of OLS Estmates Standard Error of Precson of an estmate s measured by ts standard error. Precson of an estmate s measured by ts standard error. 26/60 27/60 Homoscedastc Varance of u Features of the Varances How to estmate 2 The varance a of s drectly proportonal o to but nversely proportonal to sum of. As n ncreases, the precson wth whch β 2 can be estmated also ncreases. If there s substantal varaton n X, β 2 can be measured more accurately. The varance of s drectly proportonal to σ 2 and X 2, but nversely proportonal to x 2 2 and the sample sze n. Note:. If the slope coeffcent s overestmated, the ntercept wll be underestmated.

28/60 29/60 Gauss Markov Theorem Goodness of Ft Gven the assumptons of the classcal lnear regresson model, the OLS estmators, n the class of unbased lnear estmators, have mnmum varance, that s, they are BLUE. The coeffcent of determnaton R 2 s a summary measure that tells how well the sample regresson lne fts the data. The theorem makes no assumptons about the probablty dstrbuton of u, and therefore of Y. Total Sum of Squares (TSS) = Explaned Sum of Squares (ESS) + Resdual Sum of Squares (RSS) 30/60 31/60 Consstency Before Hypothess Testng An asymptotc property. An estmator s consstent f t s unbased and ts varance tends to zero as the sample sze n tends to nfnty. Unbasedness s already proved. Usng the method of OLS we can estmate β 2 1, β 2, and σ. Estmators ( ) are random varables. To draw nferences about PRF, we must fnd out how close s to the true. We need to fnd out PDF of the estmators.

32/60 33/60 Probablty Dstrbuton of Dsturbances Why the Normalty Assumpton? s ultmately a lnear functon of the random varable u, whch s random by assumpton. The nature of the probablty dstrbuton of u plays an extremely mportant role n hypothess testng. It s usually assumed that u NID(0, σ 2 ) NID: Normally and Independently Dstrbuted. We hope that the nfluence of these omtted or neglected varables s small and at best random. We show by the CLT, that f there are a large number of IID random varables, the dstrbuton of ther sum tends to a normal dstrbuton as the number of such varables ncrease ndefntely. Central lmt theorem (CLT): Let X 1, X 2,..., X n denote n ndependent random varables, all of whch have the same PDF wth mean = μ and varance = σ 2. Let : 34/60 35/60 Why the Normalty Assumpton? Normalty Test A varant a of the CLT states that, even f the number of varables ab s not very large or f these varables are not strctly ndependent, ther sum may stll be normally dstrbuted. Wth ths assumpton, PDF of OLS estmators can be easly derved, as any lnear functon of normally dstrbuted varables s tself normally dstrbuted. The normal dstrbuton s a comparatvely smple dstrbuton nvolvng only two parameters. It enables us to use the t, F, and χ 2 tests for regresson models. The normalty assumpton plays a crtcal role for small sample sze data. In reasonably large sample sze, we may relax the normalty assumpton. Snce we are mposng the normalty assumpton, t behooves us to fnd out n practcal applcatons nvolvng small sample sze data whether the normalty assumpton s approprate. Later, we wll ntroduces some tests to do just that. We wll come across stuatons where the normalty assumpton may be napproprate. Untl then we wll contnue wth the normalty assumpton.

36/60 37/60 Estmators Propertes wth Normalty Assumpton Method of Maxmum Lkelhood They are unbased, wth mnmum varance (effcent), and consstent.,, and s dstrbuted as the χ 2 wth (n 2) df. Wll help us to draw nferences about the true σ 2 from the estmated σ 2.. The mportance of ths wll be explaned later. Estmators have mnmum varance n the entre class of unbased estmators, whether lnear or not. Best Unbased Estmators (BUE) If u are assumed to be normally dstrbuted, the ML and OLS estmators of the regresson coeffcents, are dentcal. The ML estmator of σ 2 s based for small sample sze. Asymptotcally, the ML estmator of σ 2 s unbased. ML method can be appled to regresson models that are nonlnear n the parameters, n whch OLS s generally not used. 38/60 39/60 ML Estmaton ML Estmaton Assume ethe etwova two-varable ab model Y = β 1 + β 2 X + u n whch Havng ndependent Y s, jont PDF of Y 1,...,Y n, can be wrtten as: The method of maxmum lkelhood conssts sts n estmatng the unknown parameters n such a manner that the probablty of observng the gven Y s s as hgh as possble. Where: β 1, β 2, and σ 2 are unknowns n lkelhood functon:

40/60 41/60 ML Estmaton Interval Estmaton From the frst-order condton for optmzaton: How relable are the pont estmates? We try to fnd out two postve numbers δ and α, such that: Note how ML underestmates the true σ 2 n small samples. Probablty of constructng an nterval that contans β 2 s 1 α. Such an nterval s known as a confdence nterval. α (0 < α < 1) s known as the level of sgnfcance. How are the confdence ntervals constructed? If the probablty dstrbutons of the estmators are known, the task of constructng confdence ntervals s a smple one. 42/60 43/60 Confdence Intervals for Confdence Intervals for It tcan beshown that atthe ett varable ab follows ows the t dstrbuton wth n 2 df. It can be shown that under the normalty assumpton, followng ow varable follows χ 2 dstrbuton wth n 2 df. Wdth of the confdence nterval s proportonal to the standard error of the estmator. Same for 1 Interpretaton of ths nterval: If we establsh 95% confdence lmts on σ 2 and f we mantan a pror that these lmts wll nclude true σ 2,we shall be rght n the long run 95 percent of the tme.

44/60 45/60 Hypothess Testng Confdence Interval Approach Is a gven observaton compatble wth some stated hypothess? In statstcs, the stated hypothess s known as the null hypothess or H 0 (versus an alternatve hypothess or H 1 ). Hypothess testng s developng rules for rejectng or acceptng the null hypothess. Confdence nterval approach, Test of sgnfcance approach. Most of the statstcal hypotheses of our nterest make statements about one or more values of the parameters of some assumed probablty dstrbuton such as the normal, F, t, or χ 2. Decson Rule: Construct a 100(1 α)% confdence nterval for β 2. If the β 2 under H 0 falls wthn ths nterval, do not reject H 0, but f t falls outsde ths nterval, reject H 0. Note: There s a 100α percent chance of commttng a Type I error. If α = 0.05, there s a 5 percent chance that we could reject the null hypothess even though t s true. When we reject the null hypothess, we say that our fndng s statstcally sgnfcant. One-tal or two-tal test: Sometmes we have a strong expectaton that the alternatve hypothess s onesded rather than two-sded. 46/60 47/60 Test of Sgnfcance Approach Practcal Aspects In the econfdence confdence-nterval nterval procedure e we try to establsh s a range that has a probablty of ncludng the true but unknown β 2. In the test-of-sgnfcance approach we hypothesze some value for β 2 and try to see whether the estmated β 2 les wthn confdence lmts around the hypotheszed value. A large t value wll be evdence aganst the null hypothess. Acceptng null hypothess: s: All we can say: based on the sample evdence we have no reason to reject t. Another null hypothess may be equally compatble wth the data. 2-t rule of thumb If df >20 and α = 0.05, then the null hypothess β 2 = 0 can be rejected f t > 2. In these cases we do not even have to refer to the t table to assess the sgnfcance of the estmated slope coeffcent. Formng the null hypotheses Theoretcal expectatons or pror emprcal work can be reled upon to formulate hypotheses. p value The lowest sgnfcance level at whch a null hypothess can be rejected.

48/60 49/60 Analyss of Varance Analyss of Varance A study of two components of TSS (= ESS + RSS) s known as ANOVA from the regresson vewpont. If we assume that the dsturbances u are normally dstrbuted, whch we do under the CNLRM, and f the null hypothess (H 0 ) s that β 2 = 0, then t can be shown that the F follows the F dstrbuton wth 1 df n the numerator and (n 2) df n the denomnator. What use can be made of the precedng F rato? 50/60 51/60 F-rato Example It tcan beshown that Compute F rato and obtan p value of the computed F statstc. Note that β 2 2 and σ are the true parameters. If β 2 s zero, both equatons provde us wth dentcal estmates of true σ 2. Thus, X has no lnear nfluence on Y. F rato provdes a test of the null hypothess H 0 : β 2 = 0. p value of ths F statstc wth 1 and 8 df s 0.0000001. Therefore, f we reject the null hypothess, the probablty of commttng a Type I error s very small.

52/60 53/60 Applcaton Of Regresson Analyss Mean Predcton One use s to predct or forecast the future consumpton expendture Y correspondng to some gven level of ncome X. Now there are two knds of predctons: Predcton of the condtonal mean value of Y correspondng to a chosen X. Predcton of an ndvdual Y value correspondng to a chosen X. Estmator of E(Y X 0 ): It can be shown that: Ths statstc follows the t dstrbuton wth n 2 df and may be used to derve confdence ntervals 54/60 55/60 Indvdual Predcton Confdence Bands Estmator of E(Y ( X 0 0): It can be shown that: Ths statstc follows the t dstrbuton wth n 2 df and may be used to derve confdence ntervals

56/60 57/60 Indvdual Versus Mean Predcton Reportng the Results Confdence nterval for ndvdual Y 0 s wder than that for the mean value of Y 0. The wdth of confdence bands s smallest when X 0 = X. 58/60 59/60 Evaluatng the Results Normalty Tests How good s the ftted model? Any standard? d? Are the sgns of estmated coeffcents n accordance wth theoretcal or pror expectatons? How well does the model explan varaton n Y? One can use r 2 Does the model satsfes the assumptons of CNLRM? For now, we would lke to check the normalty of the dsturbance term. Recall that the t and F tests requre that the error term follow the normal dstrbuton. Several tests n the lterature. We look at: Hstogram of resduals: A smple graphc devce to learn about the shape of the PDF. Horzontal axs: the values of OLS resduals are dvded nto sutable ntervals. Vertcal axs: erect rectangles equal n heght to the frequency n that nterval. From a normal populaton we wll get a bell-shape PDF. Normal probablty yp plot (NPP): A smple graphc devce. Horzontal axs: plot values of OLS resduals, Vertcal axs: show expected value of varable f t were normally dstrbuted. From a normal populaton we wll get a straght lne. The Jarque Bera test. An asymptotc test, wth ch-squared dstrbuton and 2 df:

60/60 Homework 2 Basc Econometrcs (Gujarat, 2003) 1. Chapter 3, Problem 21 [10 ponts] 2. Chapter 3, Problem 23 [30 ponts] 3. Chapter 5, Problem 9 [30 ponts] 4. Chapter 5, Problem 19 [30 ponts] Assgnment weght factor = 1