Heteroskedasticity. y i = β 0 + β 1 x 1i + β 2 x 2i β k x ki + e i. where E(e i. ) σ 2, non-constant variance.

Similar documents
Reading Assignment. Serial Correlation and Heteroskedasticity. Chapters 12 and 11. Kennedy: Chapter 8. AREC-ECON 535 Lec F1 1

Lecture 4: Heteroskedasticity

Heteroskedasticity and Autocorrelation

Topic 7: Heteroskedasticity

Outline. Possible Reasons. Nature of Heteroscedasticity. Basic Econometrics in Transportation. Heteroscedasticity

Heteroskedasticity. We now consider the implications of relaxing the assumption that the conditional

Intermediate Econometrics

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Heteroscedasticity. Jamie Monogan. Intermediate Political Methodology. University of Georgia. Jamie Monogan (UGA) Heteroscedasticity POLS / 11

the error term could vary over the observations, in ways that are related

Reading Assignment. Distributed Lag and Autoregressive Models. Chapter 17. Kennedy: Chapters 10 and 13. AREC-ECON 535 Lec G 1

Semester 2, 2015/2016

Questions and Answers on Heteroskedasticity, Autocorrelation and Generalized Least Squares

Multiple Regression Analysis

Week 11 Heteroskedasticity and Autocorrelation

Chapter 8 Heteroskedasticity

Introductory Econometrics

Econometrics - 30C00200

Applied Econometrics. Applied Econometrics. Applied Econometrics. Applied Econometrics. What is Autocorrelation. Applied Econometrics

Graduate Econometrics Lecture 4: Heteroskedasticity

Econometrics Multiple Regression Analysis: Heteroskedasticity

1 The Multiple Regression Model: Freeing Up the Classical Assumptions

Chapter 15 Panel Data Models. Pooling Time-Series and Cross-Section Data

Review of Econometrics

Multiple Regression Analysis

AUTOCORRELATION. Phung Thanh Binh

Heteroskedasticity. Occurs when the Gauss Markov assumption that the residual variance is constant across all observations in the data set

Autocorrelation. Think of autocorrelation as signifying a systematic relationship between the residuals measured at different points in time

Iris Wang.

Outline. Nature of the Problem. Nature of the Problem. Basic Econometrics in Transportation. Autocorrelation

GLS. Miguel Sarzosa. Econ626: Empirical Microeconomics, Department of Economics University of Maryland

1 Introduction to Generalized Least Squares

Freeing up the Classical Assumptions. () Introductory Econometrics: Topic 5 1 / 94

ECON 366: ECONOMETRICS II. SPRING TERM 2005: LAB EXERCISE #10 Nonspherical Errors Continued. Brief Suggested Solutions

Linear Model Under General Variance

Quick Review on Linear Multiple Regression

Economics 308: Econometrics Professor Moody

Econ 510 B. Brown Spring 2014 Final Exam Answers

Economics 582 Random Effects Estimation

ECON 497: Lecture Notes 10 Page 1 of 1

Heteroskedasticity. (In practice this means the spread of observations around any given value of X will not now be constant)

Making sense of Econometrics: Basics

1. You have data on years of work experience, EXPER, its square, EXPER2, years of education, EDUC, and the log of hourly wages, LWAGE

1 Correlation between an independent variable and the error

Econometrics Honor s Exam Review Session. Spring 2012 Eunice Han

Introduction to Econometrics. Heteroskedasticity

Econometrics. 9) Heteroscedasticity and autocorrelation

Ordinary Least Squares Regression

Lecture 3: Multiple Regression

7. GENERALIZED LEAST SQUARES (GLS)

Rockefeller College University at Albany

ECONOMETRICS HONOR S EXAM REVIEW SESSION

ECONOMICS 210C / ECONOMICS 236A MONETARY HISTORY

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

11.1 Gujarati(2003): Chapter 12

Formulary Applied Econometrics

Econometrics. Week 6. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

9. AUTOCORRELATION. [1] Definition of Autocorrelation (AUTO) 1) Model: y t = x t β + ε t. We say that AUTO exists if cov(ε t,ε s ) 0, t s.

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data

Econometrics Summary Algebraic and Statistical Preliminaries

CHAPTER 6: SPECIFICATION VARIABLES

Econ 582 Fixed Effects Estimation of Panel Data

Agricultural and Applied Economics 637 Applied Econometrics II

ECON 4230 Intermediate Econometric Theory Exam

Economics 536 Lecture 7. Introduction to Specification Testing in Dynamic Econometric Models

1 Motivation for Instrumental Variable (IV) Regression

Spatial Regression. 3. Review - OLS and 2SLS. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

Model Mis-specification

ECON2228 Notes 10. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 48

Auto correlation 2. Note: In general we can have AR(p) errors which implies p lagged terms in the error structure, i.e.,

Variable Selection and Model Building

Using EViews Vox Principles of Econometrics, Third Edition

Linear Regression with Time Series Data

Econometrics Part Three

Environmental Econometrics

2. Linear regression with multiple regressors

Topic 6: Non-Spherical Disturbances

Økonomisk Kandidateksamen 2004 (I) Econometrics 2. Rettevejledning

Time Series Methods. Sanjaya Desilva

Introductory Econometrics

ECON2228 Notes 10. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 54

Contents. Part I Statistical Background and Basic Data Handling 5. List of Figures List of Tables xix

Likely causes: The Problem. E u t 0. E u s u p 0

Economics 620, Lecture 7: Still More, But Last, on the K-Varable Linear Model

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Internal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9.

08 Endogenous Right-Hand-Side Variables. Andrius Buteikis,

Statistics 910, #5 1. Regression Methods

Fixed Effects Models for Panel Data. December 1, 2014

Heteroskedasticity. Part VII. Heteroskedasticity

statistical sense, from the distributions of the xs. The model may now be generalized to the case of k regressors:

Brief Suggested Solutions

Homoskedasticity. Var (u X) = σ 2. (23)

Non-Spherical Errors

Simple Linear Regression for the Advertising Data

Peter Hoff Linear and multilinear models April 3, GLS for multivariate regression 5. 3 Covariance estimation for the GLM 8

Multiple Regression Analysis: Heteroskedasticity

ECON The Simple Regression Model

PBAF 528 Week 8. B. Regression Residuals These properties have implications for the residuals of the regression.

Models, Testing, and Correction of Heteroskedasticity. James L. Powell Department of Economics University of California, Berkeley

Transcription:

Heteroskedasticity y i = β + β x i + β x i +... + β k x ki + e i where E(e i ) σ, non-constant variance. Common problem with samples over individuals. ê i e ˆi x k x k AREC-ECON 535 Lec F

Suppose y i = β + β x i + β x i +... + β k x ki + e i where E(e i ) = E(e i ) = σ i E(e i e j ) =. Assuming the error variance changes over the sample and that we can explain the heteroskedasticity. AREC-ECON 535 Lec F

Why does heteroskedasticity occur? Variance of dependent variable increases with increases in the level of the dependent variable. Pattern in random variables. Variance of dependent variable increases or decreases with changes in independent variable. error-learning, discretionary income, data collection Outliers in data. Small number of outliers result in a large variance. Specification Bias: missing variable or incorrect functional form. Systematic process remains in error term. Consequences of Heteroskedasticity ˆ are unbiased but inefficient. Formulas for variances of OLS estimators are biased and inconsistent. Variances are generally too small and hypothesis test statistics are too large. However, opposite can happen. AREC-ECON 535 Lec F 3

Detection of Heteroskedasticity ) Graph the residuals against all X's and y. (Not as simple as autocorrelation. Temporal processes are much simpler than multi-dimensional ones.) ) Breusch-Pagan (LM) test e i = α + α z i + α z i +... + α p z pi + u i H : α = α =... = α p = Steps of test: a) Run OLS regression and obtain ê i b) Calculate ~ = Σ e ˆi / N. c) Construct v i = e ˆi / ~ d) Regress v i on Z s (usually we use the X's so p = k but we don t have to). e) Obtain Estimated SS AREC-ECON 535 Lec F 4

If sample size is large ½ ESS ~ χ p ex) ½ (.788) = 5.394 > 3.845 = χ (5%) f) Econometric research shows an F-test on the auxiliary regression has better small sample properties. H : α = α =... = α p = AREC-ECON 535 Lec F 5

3) White (LM) test e i = α + α x i + α x i +... + α k x ki + α k+ x i + α k+ x i +... + α k+k x ki + α k+ x i x i + α k+ x i x 3i +... + α m x k-i x ki + u i m = ((k - k)/) + k H : α = α =... = α m = (N) R ~ χ m Steps of test are same as Breusch-Pagan test (except use e ˆi instead of v i ). White test examines for more complex heteroskedasticity. But be careful of sample size and looking for too much... (Including irrelevant variables does what?) AREC-ECON 535 Lec F 6

4) ARCH test - time series data (Autoregressive Conditional Heteroskedasticity) Suppose y t = β + β x t +... + β k x kt + e t and σ t = α + α e t- +... + α p e t-p + u t Model has strong intuitive appeal. e t e t t t AREC-ECON 535 Lec F 7

e t = α + α e t- +... + α p e t-p + u t H : α = α =... = α p = Steps of test: a) Run OLS regression and obtain ê t. b) Regress e ˆt on X's in model and ˆ t e,..., ˆ t p e. c) Obtain R. d) (T - p) R ~ χ p under H. e) F-test on the auxiliary regression. 5) Other tests: Park test, Glejser test, Goldfeld-Quandt test... These can be special cases or at least thought of as of Breusch-Pagan test. However, small sample properties may be better and look for specific forms of heteroskedasticity. Like with serial correlation, use caution. Do not want result to be due to assumption made by researcher. AREC-ECON 535 Lec F 8

What to do about heteroskedasticity? Heteroskedastic Consistent Variance-Covariance Matrix OLS: V( ˆ ) = ˆ (X X) - White's: V( ˆ ) = (X X) - (Σ eˆi x i x i ) (X X) - Compare the two. Correct if problem in interpretation is present. Many regression packages will generate White standard errors or covariance matrix. Likewise, Serial Correlation Consistent Variance-Covariance Matrix Newey-West: V( ˆ ) = (X X) - (X Ω( ˆ )X)(X X) - Many regression packages will do this but I don t think you should use it unless AREC-ECON 535 Lec F 9

Practical Note: If: y i = β + β x i +... + β k x ki + e i is heteroskedastic, conduct test on ln(y i ) = β + β x i +... + β k x ki + e i or ln(y i ) = β + β ln(x i ) +... + β k ln(x ki ) + e i. Use the model without heteroskedasticity unless theory suggests a functional form. Correcting for Heteroskedasticity Procedure: transform the error term into a random variable that meets OLS assumption. Results in Generalized Least Squares. i.e., E(e i * ) = σ. Be careful. Interpretation of slope coefficients does not remain necessarily the same as it does in models corrected for serial correlation. AREC-ECON 535 Lec F

Generalized Least Squares: when σ i is known. also called Weighted Least Squares (WLS) y i = β + β x i + e i (y i /σ i ) = (β + β x i + e i )/σ i (y i /σ i ) = (β /σ i ) + β (x i / σ i ) + (e i / σ i ) * * * * y i = β + β x i + e i * * V(e i ) = E(e i ) = E(e i /σ i ) = E(e i )/σ i = Run OLS on this model. ˆ are BLUE. If we knew σ i we could fix the problem. AREC-ECON 535 Lec F

Generalized Least Squares: estimation Assume a particular form of heteroskedasticity. ) Error variance proportional to independent variable squared. (Standard error model heteroskedasticity.) y i = β + β x i + e i and E(e i ) = σ i = σ x i or σ i = σx i y i / x i = β / x i + β + e i / x i y i * = β ( / x i ) + β + e i * E(e i * ) = E(e i / x i ) = E(e i ) /x i = σ ) Error variance proportional to independent variable. (Variance model heteroskedasticity.) y i = β + β x i + e i and E(e i ) = σ i = σ x i y i / x i = β / x i + β x i + e i / x i y i * = β (/ x i ) + β ( x i ) + e i * E(e i * ) = E(e i / x i ) = E(e i )/x i = σ AREC-ECON 535 Lec F

So the weight function in EViews is: e ˆi x k Dark: w i = /x i Dashed: w i = / x i AREC-ECON 535 Lec F 3

3) Error variance proportional to dependent variable. (Dependent variable model heteroskedasticity.) y i = β + β x i + e i and E(e i ) = σ i = σ [E(y i )] = σ [ ŷ i ] (y i / ŷ i ) = (β / ŷ i ) + β (x i / ŷ i ) + (e i / ŷ i ) * y i = β (/ ŷ i ) + β (x i / ŷ * * i ) + e i E(e i ) = E(e i / ŷ i ) = E(e i )/[E(y i )] = σ AREC-ECON 535 Lec F 4

AREC-ECON 535 Lec F 5 GLS in Matrix Notation β = (X Ω - X) - X Ω - y V(β) = (X Ω - X) - (under OLS: Ω = σ I ) where with heteroskedasticity 3 N 3 N

AREC-ECON 535 Lec F 6 and with serial correlation 3 3 T T T T T T

AREC-ECON 535 Lec F 7 Ω can be rewritten as Ω = H H so that EGLS is OLS on the transformed model (Hy) = (HX)β + (He) or y* = X*β + e* where with heteroskedasticity & with serial correlation N H 3 ) ( ) ( H these transformations make e* have good properties.

Estimated GLS (EGLS) or Feasible GLS (FGLS) Replace ρ and σ i with consistent estimates of ρ and σ i and iterate... (Careful about the number of parameters. The problem is iterating until convergence. So just twice ) Maximum Likelihood Alternative to GLS and EGLS lnl = -(T/)ln(π) - (/)Σln(σ t ) - (/)Σ((y t - μ t ) /σ t ) The common model is: μ t = β + β x t +... + β k x kt σ t = σ However, this can also be a common model: μ t = β + β x t +... + β k x kt σ t = α + α z t +... + α p z pt Think about, specify, estimate, test models of the conditional variance... AREC-ECON 535 Lec F 8

Several computer regression packages will do these procedures (LIMDEP: HREG, SAS: AUTOREG, STATA:...). LIMDEP: y t = x t β + e t σ t = exp{α + α z t } σ t = σ [x t β] SAS & SHAZAM: y t = x t β + e t σ t = exp{α + α z t } σ t = σ [x t β] σ t = σ [α + α z t ] σ t = σ [α + α z t ] AREC-ECON 535 Lec F 9

Example application mean and variance equation: Transaction Price = f(..., Market Institution,...) The introduction of Mandatory Price Reporting (the market institution) had a negative effect as a variable in the mean equation the conditional mean of transaction prices was reduced under the institution was introduced. And the market institution had a negative coefficient in the variance equation, i.e., decreased the variance of transaction prices. MPR decreased the mean price and decreased price risk. So heteroskedasticity is not just something to fix. It can have an economic interpretation. And if that is the case then it should be explored. AREC-ECON 535 Lec F