Economics 582 Random Effects Estimation

Similar documents
Multiple Equation GMM with Common Coefficients: Panel Data

Econ 582 Fixed Effects Estimation of Panel Data

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE

Panel Data Models. James L. Powell Department of Economics University of California, Berkeley

Econ 583 Final Exam Fall 2008

Intermediate Econometrics

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data

1 Introduction to Generalized Least Squares

Econometrics of Panel Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Lecture 4: Heteroskedasticity

Econ 583 Homework 7 Suggested Solutions: Wald, LM and LR based on GMM and MLE

Advanced Econometrics

y it = α i + β 0 ix it + ε it (0.1) The panel data estimators for the linear model are all standard, either the application of OLS or GLS.

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Max. Likelihood Estimation. Outline. Econometrics II. Ricardo Mora. Notes. Notes

MS&E 226: Small Data. Lecture 11: Maximum likelihood (v2) Ramesh Johari

Lecture 4: Linear panel models

Nonlinear GMM. Eric Zivot. Winter, 2013

Econometric Analysis of Cross Section and Panel Data

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

Econometrics of Panel Data

Nonstationary Panels

Introductory Econometrics

Economics 536 Lecture 7. Introduction to Specification Testing in Dynamic Econometric Models

Specification testing in panel data models estimated by fixed effects with instrumental variables

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables

Econometrics of Panel Data

A Course on Advanced Econometrics

Økonomisk Kandidateksamen 2004 (I) Econometrics 2. Rettevejledning

Zellner s Seemingly Unrelated Regressions Model. James L. Powell Department of Economics University of California, Berkeley

Increasing the Power of Specification Tests. November 18, 2018

7. GENERALIZED LEAST SQUARES (GLS)

Lecture 10: Panel Data

GLS. Miguel Sarzosa. Econ626: Empirical Microeconomics, Department of Economics University of Maryland

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Applied Economics. Panel Data. Department of Economics Universidad Carlos III de Madrid

Topic 7: Heteroskedasticity

Lecture 8 Panel Data

Dynamic Panel Data Ch 1. Reminder on Linear Non Dynamic Models

Economics 583: Econometric Theory I A Primer on Asymptotics

Testing Random Effects in Two-Way Spatial Panel Data Models

Testing Error Correction in Panel data

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63

Heteroskedasticity. y i = β 0 + β 1 x 1i + β 2 x 2i β k x ki + e i. where E(e i. ) σ 2, non-constant variance.

8. Hypothesis Testing

MS&E 226: Small Data

Statistics and econometrics

Casuality and Programme Evaluation

Generalized Method of Moment

Graduate Econometrics Lecture 4: Heteroskedasticity

Non-Spherical Errors

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley

Test of hypotheses with panel data

Heteroskedasticity-Robust Inference in Finite Samples

Non-linear panel data modeling

Econometrics - 30C00200

1 Outline. 1. Motivation. 2. SUR model. 3. Simultaneous equations. 4. Estimation

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Panel Data Econometrics

the error term could vary over the observations, in ways that are related

Heteroskedasticity. Part VII. Heteroskedasticity

Ordinary Least Squares Regression

mrw.dat is used in Section 14.2 to illustrate heteroskedasticity-robust tests of linear restrictions.

Bootstrapping Heteroskedasticity Consistent Covariance Matrix Estimator

The outline for Unit 3

Dealing With Endogeneity

Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part III)

GLS and FGLS. Econ 671. Purdue University. Justin L. Tobias (Purdue) GLS and FGLS 1 / 22

Econometrics of Panel Data

Heteroscedasticity. Jamie Monogan. Intermediate Political Methodology. University of Georgia. Jamie Monogan (UGA) Heteroscedasticity POLS / 11

Instrumental Variables, Simultaneous and Systems of Equations

Heteroskedasticity and Autocorrelation

Fixed Effects Models for Panel Data. December 1, 2014

Applied Quantitative Methods II

Short Questions (Do two out of three) 15 points each

Sample Problems. Note: If you find the following statements true, you should briefly prove them. If you find them false, you should correct them.

Exogeneity tests and weak identification

Dynamic Panel Data Workshop. Yongcheol Shin, University of York University of Melbourne

Economics 671: Applied Econometrics Department of Economics, Finance and Legal Studies University of Alabama

A Practitioner s Guide to Cluster-Robust Inference

The BLP Method of Demand Curve Estimation in Industrial Organization

Econ 510 B. Brown Spring 2014 Final Exam Answers

Panel Data: Linear Models

OSU Economics 444: Elementary Econometrics. Ch.10 Heteroskedasticity

Statistics II. Management Degree Management Statistics IIDegree. Statistics II. 2 nd Sem. 2013/2014. Management Degree. Simple Linear Regression

Cross Sectional Time Series: The Normal Model and Panel Corrected Standard Errors

EC821: Time Series Econometrics, Spring 2003 Notes Section 9 Panel Unit Root Tests Avariety of procedures for the analysis of unit roots in a panel

A Primer on Asymptotics

Analysis of Cross-Sectional Data

Econometrics. Week 6. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser

Linear Regression. Junhui Qian. October 27, 2014

ECON 4160, Autumn term Lecture 1

Questions and Answers on Heteroskedasticity, Autocorrelation and Generalized Least Squares

Single Equation Linear GMM

Cluster-Robust Inference

Econometrics II - EXAM Answer each question in separate sheets in three hours

Topic 10: Panel Data Analysis

Transcription:

Economics 582 Random Effects Estimation Eric Zivot May 29, 2013

Random Effects Model Hence, the model can be re-written as = x 0 β + + [x ] = 0 (no endogeneity) [ x ] = = + x 0 β + + [x ] = 0 [ x ] = 0 RE framework is a SUR model with common coefficients - efficient estimation depends on the covariance structure of +

The Homoskedastic Equi-correlation Framework A common covariance structure in the RE model is based on the assumptions Let = + Then and ( )= iid (0 2 ) iid (0 2 ) ( 2 2 + 2 for 6= for = Note, ( )= 2 2 + 2 for 6= ( )=0for 6=

In matrix notation we have Then y 1 = W δ + ε W = [1 X ] δ =( β 0 ) 0 ε = 1 + η [ε ε 0 ] = Ω [ε ε 0 ] = [( 1 + η )( 1 + η ) 0 ] = [ 2 ]1 1 0 + [η η 0 ] = 2 1 1 0 + [η η 0 ]

Now [η η 0 ]= 2... = 2 I 2 Hence, [ε ε 0 ] = 2 1 1 0 + 2 I = Ω 2 + 2 2 2 = 2 2 + 2 2...... 2 2 2 + 2

Remarks Because Ω 6= 2 I pooled OLS is not an efficient estimator. The efficient estimator is pooled GLS, which is also the SUR estimator

Efficient Estimation of the RE model Using Ω 1 = Ω 1 2 Ω 1 20 the transformed model is or Ω 1 2 y = Ω 1 2 W δ + Ω 1 2 ε ỹ = W δ + ε [ ε ε 0 ] = I Hence, the GLS (SUR) estimator is the pooled OLS estimator X ˆδ = W 0 W X = =1 X =1 W 0 Ω 1 W W ỹ 0 =1 X =1 W 0 Ω 1 y

Feasible GLE estimation Because 2 and 2 are typically unknown, GLS is not feasible. The feasible GLS estimator is where ˆδ = X =1 ˆ 2 W 0 ˆΩ 1 W X =1 ˆΩ =ˆ 2 1 1 0 +ˆ 2 I 2 and ˆ 2 2 W 0 ˆΩ 1 y Q: How to consistently estimate 2 and 2?

Consistent Estimation of 2 and 2 There are several ways to consistently estimate 2 and 2 The most common estimators are as follows. Estimation of 2 is based on the FE Within estimator from the regression = x 0 β + = x = x = Then ˆ 2 = 1 ( 1) X X =1 =1 b 2 2

Estimation of 2 is bsed on the Between regression Then = w 0 δ + + =1 ( + ) = 2 + 2 = 1 ˆ 2 = 1 ( +1) X =1 X =1 ³ w 0 ˆδ 2 1 ˆ 2 2

Alternative Form of RE Estimator Result: The FGLS estimator ˆδ = X =1 W 0 ˆΩ 1 W X =1 W 0 ˆΩ 1 y can be computed as the pooled OLS estimator from the transformed model and is a consistent estimator of ˆ = (w ˆ w ) 0 δ + = (1 ˆ ) +( ˆ ) =1 q 2 + 2

Remarks: ˆ =0 ˆδ = pooled OLS ˆ =1 ˆδ = ˆβ = ˆβ ˆ 1 as

Linear Panel Example: Hours and Wages (Cameron and Trivedi, ch. 21) Does labor supply respond to wages? For people working, result is ambiguous due to offsetting substitution and income effects Data on = 532 males for each of the =10yearsfrom1979to1988 (5320 total obvs) Simple linear panel regression ln = ln + + is indep over but not

POLS Between Within First Diff RE 7.442 7.483 7.346.083.067.168.109.119 Panel Robust SE( ) (.030) (.024) (.085) (.084) (.051) Default (iid) SE( ) [.009] [.020] [.019] [.021] [.014] 2.015.021.016.008.014.181.161.283.232.233 0 1.585 5320 532 5320 4788 5320 Comments Table 1: Comparison of Panel Data Estimators TheWithinFEestimateof.168ismuchhigherthanPOLSestimateof.083

TheFDFEestimateof.109isalsohigherthanPOLS RE estimate of.119 is in between Between and Within FE POLS default SE of.009 is much smaller than panel robust SE of.030 Apparent efficiency loss (higher SEs) from using FE estimators

Autocorrelations of Pooled OLS Residuals 79 80 81 82 83 84 85 86 87 88 79 1 80 33 1 81 44 4 1 82 3 31 57 1 83 21 23 37 47 1 84 2 23 32 34 64 1 85 24 32 41 35 39 58 1 86 2 19 28 25 31 35 4 1 87 2 32 33 29 31 34 39 35 1 88 16 25 3 26 21 25 34 55 53 1 Autocorrelations ( ) resemble pattern of equi-correlation RE model

Hausman Specification Test for FE vs. RE = x 0 β + + The hypotheses to be tested are 0 : [x ]=0(RE estimation) 1 : [x ] 6= 0(FE estimation) Hausman and Taylor (1981, Ecta) considered a test statistic based on ˆq = ˆβ ˆβ in the context of maximum likelihood estimation under the equi-correlation RE framework.

Intuition: Under 0 both ˆβ and ˆβ are consistent (but ˆβ is efficient) so that ˆq 0 Under 1 ˆβ is not consistent but ˆβ is consistent so that Therefore, consider the test statistic ˆq 9 0 = ˆq ( avar(ˆq)) d 1 ˆq If is big then reject 0 ; otherwise do not reject 0 Q1: What is avar(ˆq)? d Q2: What is the asymptotic distribution of?

Hausman Test Principle In general, consider two estimators ˆθ and θ of θ R such that 0 : ˆθ θ 0 and ³ˆθ θ (0 V) : ˆθ θ 90 The Hausman statistic is ³ˆθ = θ 0 ³ 1 ˆV 1 ³ˆθ θ and under 0 2 ( )

What is? 1 = (ˆθ θ) = (ˆθ)+ ( θ) 2 (ˆθ θ) Result (Hausman, 1978). If ˆθ is efficient under 0 then (ˆθ θ) = (ˆθ) and (ˆθ θ) = ( θ) (ˆθ) Hence, = ³ˆθ θ 0 ³ d ( θ) d (ˆθ) 1 ³ˆθ θ

Comments Hausman test only requires d ( θ) and d (ˆθ) and not (ˆθ θ) It is possible for d ( θ) d (ˆθ) to be negative definite in finite samples It is possible for d ( θ) d (ˆθ) to be less than full rank in finite samples Hausman test can often be computed using an auxiliary regression

Hausman Test for FE vs. RE Under the RE model based on the assumptions iid (0 2 ) iid (0 2 ) The GLS estimator is asymptotically equivalent to the MLE under normality and is efficient. Hence, for ˆq = ˆβ ˆβ = ˆq ³ d (ˆβ ) d (ˆβ ) 1 ˆq 2 ( ) Comment: The statistic is not robust to serial correlation and heteroskedasticity

Hausman Statistic Based on Auxiliary Regression An asymptotically equivalent version of the Hausman test can be computed as the Wald test of γ = 0 in the auxiliary regression ˆ = (w ˆ w ) 0 δ + γ 0 (x x )+ w = (1 x 0 )0 ˆ 2 ˆ = 1 q ˆ 2 +ˆ 2 Why? Because the RE estimator is obtained when γ = 0 is imposed on the auxiliary regression.

Comment Auxiliary regression approach avoids problems of ˆq having non-full rank variance or negative definite variance. Wald test of γ = 0 can be made robust to serial correlation and heteroskedasticity using panel robust covariance estimates

Testing FE vs. RE in Hours and Wages Panel Regression ln = ln + + Using the original form of the Hausman test, we have = (ˆ ˆ ) 2 d (ˆ ) d (ˆ ) ( 168 119)2 = ( 019 2 014 2 ) =14 2 95 (1) = 3 84 so we reject 0 : [ln ]=0at the 5% level. Comment: Test may not be reliable because of serial correlation and heteroskedasticity

The auxiliary regression approach based on the regression ˆ = (w ˆ w ) 0 δ + ( )+ = ln w =(1 ln ) 0 =ln with panel robust standard errors gives the statistic 2 =0 =(1 28)2 =1 65 2 95 (1) = 3 84 so we cannot reject 0 : [ln ]=0at the 5% level.