Econometrics of Panel Data

Similar documents
Econometrics of Panel Data

Econometrics of Panel Data

Econometrics of Panel Data

Econometrics of Panel Data

Econometrics of Panel Data

Advanced Econometrics

Test of hypotheses with panel data

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data

Statistics and econometrics

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Økonomisk Kandidateksamen 2004 (I) Econometrics 2. Rettevejledning

Review of Econometrics

Topic 10: Panel Data Analysis

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63

Empirical Economic Research, Part II

Specification testing in panel data models estimated by fixed effects with instrumental variables

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Applied Microeconometrics (L5): Panel Data-Basics

Practical Econometrics. for. Finance and Economics. (Econometrics 2)

Economics 582 Random Effects Estimation

Econ 510 B. Brown Spring 2014 Final Exam Answers

Applied Econometrics (QEM)

ECON 4551 Econometrics II Memorial University of Newfoundland. Panel Data Models. Adapted from Vera Tabakova s notes

Exogeneity tests and weak identification

Lecture 3: Multiple Regression

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43

The outline for Unit 3

INTRODUCTION TO BASIC LINEAR REGRESSION MODEL

PANEL DATA RANDOM AND FIXED EFFECTS MODEL. Professor Menelaos Karanasos. December Panel Data (Institute) PANEL DATA December / 1

Applied Econometrics (QEM)

Maximum Likelihood (ML) Estimation

Non-linear panel data modeling

Increasing the Power of Specification Tests. November 18, 2018

Heteroskedasticity. Part VII. Heteroskedasticity

1. The OLS Estimator. 1.1 Population model and notation

Ma 3/103: Lecture 24 Linear Regression I: Estimation

Fixed Effects Models for Panel Data. December 1, 2014

Nonstationary Panels

Answers to Problem Set #4

An overview of applied econometrics

Introductory Econometrics

Hypothesis testing Goodness of fit Multicollinearity Prediction. Applied Statistics. Lecturer: Serena Arima

Simple Linear Regression: The Model

Linear Panel Data Models

Problem Set #6: OLS. Economics 835: Econometrics. Fall 2012

Econometric Analysis of Cross Section and Panel Data

F9 F10: Autocorrelation

the error term could vary over the observations, in ways that are related

Final Exam. Economics 835: Econometrics. Fall 2010

Short T Panels - Review

G. S. Maddala Kajal Lahiri. WILEY A John Wiley and Sons, Ltd., Publication

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Heteroskedasticity. We now consider the implications of relaxing the assumption that the conditional

Lecture 4: Heteroskedasticity

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE

Applied Economics. Panel Data. Department of Economics Universidad Carlos III de Madrid

Econ 582 Fixed Effects Estimation of Panel Data

Empirical Market Microstructure Analysis (EMMA)

So far our focus has been on estimation of the parameter vector β in the. y = Xβ + u

Introduction to Estimation Methods for Time Series models. Lecture 1

10 Panel Data. Andrius Buteikis,

ECONOMETRICS FIELD EXAM Michigan State University August 21, 2009

Ma 3/103: Lecture 25 Linear Regression II: Hypothesis Testing and ANOVA

ECON 4160, Lecture 11 and 12

Spatial Regression. 9. Specification Tests (1) Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

Multiple Regression Analysis: Heteroskedasticity

The regression model with one fixed regressor cont d

Dealing With Endogeneity

Panel Data: Linear Models

Lecture 6: Dynamic panel models 1

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

8. Hypothesis Testing

OSU Economics 444: Elementary Econometrics. Ch.10 Heteroskedasticity

Econometrics - 30C00200

The Standard Linear Model: Hypothesis Testing

Public Sector Management I

Topic 7: Heteroskedasticity

Chapter 15 Panel Data Models. Pooling Time-Series and Cross-Section Data

Multiple Equation GMM with Common Coefficients: Panel Data

Homoskedasticity. Var (u X) = σ 2. (23)

Lectures 5 & 6: Hypothesis Testing

Regression with time series

Econometrics. Week 6. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Panel Data Model (January 9, 2018)

ECON 5350 Class Notes Functional Form and Structural Change

Sensitivity of GLS estimators in random effects models

Instrumental Variables, Simultaneous and Systems of Equations

Intermediate Econometrics

Least Squares Estimation-Finite-Sample Properties

Interpreting Regression Results

EC327: Advanced Econometrics, Spring 2007

Analisi Statistica per le Imprese

Topic 3: Inference and Prediction

Christopher Dougherty London School of Economics and Political Science

Lecture 1: OLS derivations and inference

Multivariate Regression Analysis

Chapter 4: Constrained estimators and tests in the multiple linear regression model (Part III)

Panel Data Models. James L. Powell Department of Economics University of California, Berkeley

Size and Power of the RESET Test as Applied to Systems of Equations: A Bootstrap Approach

Transcription:

Econometrics of Panel Data Jakub Mućk Meeting # 3 Jakub Mućk Econometrics of Panel Data Meeting # 3 1 / 21

Outline 1 Fixed or Random Hausman Test 2 Between Estimator 3 Coefficient of determination (R 2 ) 4 Testing linear hypotheses Jakub Mućk Econometrics of Panel Data Fixed or Random Meeting # 3 2 / 21

Fixed or Random Random effects Fixed effects Individual µ i N ( ) 0, σµ 2 α i effects drawn from the random sample α i are assumed to be constant = we can estimate the parameter over time of distribution, i.e., σµ 2 Assumptions: (i) E (µ i ε it) = 0 (i) E (α i ε it) = 0 (ii) E (µ i x it) = 0 individual effects are independent of the explanatory variable x it Estimation GLS OLS (within or LSDV) Efficiency higher lower Additional: impossible to use time invariant regressors (collinearity with α i) Jakub Mućk Econometrics of Panel Data Fixed or Random Meeting # 3 3 / 21

Fixed or Random Random and Fixed effects models: RE: y it = α + β 1 x 1it +... + β 1 x kit + µ i + ε it, FE: y it = α i + β 1 x 1it +... + β 1 x kit + u it where µ i N (0, σµ), 2 ε i,t N (0, σε) 2 and u i,t N (0, σu). 2 We can test significance of individual effects: The random effect model: the Langrange multiplier statistic: H 0 : σµ 2 = 0 The fixed effect model: test for poolability: H 0 : α = α 1 =... = α N. Can we compare the FE and RE models? Jakub Mućk Econometrics of Panel Data Fixed or Random Meeting # 3 4 / 21

Classical example Cobb-Douglas production function. Consider the following model: y it = βx it + u it + η i (1) yit the log output. xit the log of a variable input; ηi an firm input that is constant over time, e.g., managerial skills. uit a stochastic input that is outside framer s control. β - the technological parameter. η i is the unobserved individual effect. Let s assume that the input (x it ) depends on the quality of managerial skills (individuals effects η i ) = E (η i x it ) 0 = the RE estimator is not consistent. Jakub Mućk Econometrics of Panel Data Fixed or Random Meeting # 3 5 / 21

Hausman Test The null H 0 in the Hausman test is that both the random and fixed effect estimates are consistent. If the alternative hypothesis H 1 holds then the random effect estimates are inconsistent. Test statistics: H = [ ˆβFE ˆβ RE] ( Var ˆβ FE Var ˆβ RE) 1 [ ˆβFE ˆβ RE] (2) The statistics H is distributed χ 2 with degrees of freedom determined by K, i.e., the dimension of the coefficient vector β. Jakub Mućk Econometrics of Panel Data Fixed or Random Meeting # 3 6 / 21

Hausman Test- some remarks In general, the Hausman test asks whether the fixed effects and random effects estimates of β are significantly different. We can test only models with the same set of explanatory variables: We cannot compare the random effects estimates corresponding to time-invariant regressors due to their collinearity with individual intercept. The rejection of the null hypothesis indicates that the random effect estimates of β are not consistent or that the model is wrongly specified (misspecification error). It is assumed that the fixed effect model is consistent under both null and alternative. What if regressors are not strictly exogenous? The Hausman may be used in more general context. Jakub Mućk Econometrics of Panel Data Fixed or Random Meeting # 3 7 / 21

Empirical example Pooled OLS FE RE α 42.714 58.744 57.834 (9.512) (12.454) (28.899) [0.000] [0.000] [0.045] β 1 0.231 0.310 0.308 (0.025) (0.017) (0.012) [0.000] [0.000] [0.000] β 2 0.116 0.110 0.110 (0.006) (0.012) (0.010) [0.000] [0.000] [0.000] Note: The expressions in round and squared brackets stand for standard errors and probability values corresponding to the null about parameter s insignificance, respectively. I it = α + β 1 K it + β 2 F it + u it Jakub Mućk Econometrics of Panel Data Fixed or Random Meeting # 3 8 / 21

Empirical example The RE and FE estimates: [ ˆβ FE 0.3101 = 0.1101 ], ˆβRE = [ 0.3081 0.1098 ] = ˆβ FE ˆβ RE = [ 0.0019 0.0003 ] The estimates of the variance-covariance matrix: ( ) [ 0.00030 0.00008 Var ˆβFE = 0.00008 0.00014 ], Var ( ˆβRE ) = [ 0.00029 0.00006 0.00006 0.00011 ], but diag The test statistics: 2.33. The probability value: 0.3119. ( ( ) ( )) [ 0.0024 Var ˆβFE Var ˆβRE = 0.0055 ] Jakub Mućk Econometrics of Panel Data Fixed or Random Meeting # 3 9 / 21

Outline 1 Fixed or Random Hausman Test 2 Between Estimator 3 Coefficient of determination (R 2 ) 4 Testing linear hypotheses Jakub Mućk Econometrics of Panel Data Between Estimator Meeting # 3 10 / 21

Between Estimator The between estimator uses just cross-sectional variation. Averaging over time yields: ȳ i = α Between + β Between 1 x 1i +... + β Between k x ki + ū i, (3) where ȳ i = T 1 t y it, x 1i = T 1 t x 1it,..., x ki = T 1 t x kit, ū i = T 1 t u it. or in the matrix form: ȳ = α Between + xβ Between + u. (4) The parameters α Between, β1 Between,..., βk Between OLS estimator. can be estimated with the Jakub Mućk Econometrics of Panel Data Between Estimator Meeting # 3 11 / 21

Empirical example Pooled OLS FE RE BE α 42.714 58.744 57.834 8.527 (9.512) (12.454) (28.899) (47.515) [0.000] [0.000] [0.045] [0.863] β 1 0.231 0.310 0.308 0.032 (0.025) (0.017) (0.012) (0.191) [0.000] [0.000] [0.000] [0.872] β 2 0.116 0.110 0.110 0.135 (0.006) (0.012) (0.010) (0.029) [0.000] [0.000] [0.000] [0.002] Note: The expressions in round and squared brackets stand for standard errors and probability values corresponding to the null about parameter s insignificance, respectively. I it = α + β 1 K it + β 2 F it + u it Jakub Mućk Econometrics of Panel Data Between Estimator Meeting # 3 12 / 21

Outline 1 Fixed or Random Hausman Test 2 Between Estimator 3 Coefficient of determination (R 2 ) 4 Testing linear hypotheses Jakub Mućk Econometrics of Panel Data Coefficient of determination (R 2 ) Meeting # 3 13 / 21

Coefficient of determination (R 2 ) Coefficient of determination (R 2 ): is an indicator showing how well data fit a statistical model. General definition: R 2 = 1 ESS TSS, (5) where ESS - is the residuals sum of squares: ESS = (ŷ i y) 2, (6) i TSS - is the total (observed) sum of square, TSS = (y i ȳ) 2. (7) i R 2 is between 0 and 1. In the context of panel data, the conventional coefficient of determination is maximized by the pooled estimator. Jakub Mućk Econometrics of Panel Data Coefficient of determination (R 2 ) Meeting # 3 14 / 21

The within R 2 The within R 2 : where ESSWithin - is the residuals sum of squares: R 2 Within = 1 ESS Within TSS Within, (8) ESS Within = N i=1 T t=1 ( yˆ it y it) 2, (9) TSSWithin - is the total (observed) sum of square, TSS Within = N i=1 T (y it ȳ i) 2. (10) t=1 The within R 2 is maximized by the within estimator. Jakub Mućk Econometrics of Panel Data Coefficient of determination (R 2 ) Meeting # 3 15 / 21

The between R 2 The between R 2 : where R 2 Between = 1 ESS Between TSS Between, (11) ESSBetween - is the residuals sum of squares: ESS Between = N ( ) 2 ˆȳ i ȳ i, (12) i=1 TSSBetween - is the total (observed) sum of square, TSS Between = N (ȳ i ȳ) 2. (13) i=1 The between R 2 is maximized by the between estimator. Jakub Mućk Econometrics of Panel Data Coefficient of determination (R 2 ) Meeting # 3 16 / 21

Empirical example Table: The comparison of the coefficients of determination R 2 Pooled FE RE BE the overall R 2 0.812 0.806 0.806 0.755 the within R 2 0.767 0.767 0.478 the between R 2 0.819 0.820 0.858 Jakub Mućk Econometrics of Panel Data Coefficient of determination (R 2 ) Meeting # 3 17 / 21

Outline 1 Fixed or Random Hausman Test 2 Between Estimator 3 Coefficient of determination (R 2 ) 4 Testing linear hypotheses Jakub Mućk Econometrics of Panel Data Testing linear hypotheses Meeting # 3 18 / 21

Testing linear hypotheses Statistical models are usually formulated in order to verify testable implications of the economic theory. A set of linear hypotheses can be described as follows: Rβ = q, (14) where β is the vector of parameters, the matrix R and vector q describe the linear constraints. R is m (K + 1) and q is m 1 when m is the number of linear constraints. Example. Consider the case when k = 4 and the following three hypotheses: β 1 + β 3 = 1 β 2 β 4 = 0 β 1 + 0.5β 4 = 2 then the matrix R and vector q: 0 1 0 1 0 R = 0 0 1 0 1 and q = 0 1 0 0 0.5 Jakub Mućk Econometrics of Panel Data Testing linear hypotheses Meeting # 3 19 / 21 1 0 2.

Testing linear hypotheses Restricted Least Squares (RLS) estimator: ˆβ RLS = ˆβ OLS + ( X X ) ( 1 R R ( X X ) 1 R ) 1 ( ) q R ˆβOLS, (15) where ˆβ OLS is the OLS estimator. Jakub Mućk Econometrics of Panel Data Testing linear hypotheses Meeting # 3 20 / 21

Testing linear hypotheses Restricted Least Squares (RLS) estimator: ˆβ RLS = ˆβ OLS + ( X X ) ( 1 R R ( X X ) 1 R ) 1 ( ) q R ˆβOLS, (15) where ˆβ OLS is the OLS estimator. Wald test: Test statistics for m linear hypotheses: H 0 : Rβ = q H 1 : Rβ q F = (Rβ q) ( R (X X) 1 R ) 1 (Rβ q) m if true then F F(m, NT K) where K is the number of regressors and NT refers to the number of observations. Alternatively: W = (Rβ q) ( R ( X X ) 1 R ) 1 (Rβ q) = mf (17) The W is χ 2 distributed with m degree of freedoms. Jakub Mućk Econometrics of Panel Data Testing linear hypotheses Meeting # 3 20 / 21 (16)

Testing linear hypotheses general remarks The rank of the matrix R should equal the number of restrictions (m). The test statistics F is preferable in smaller samples. The test statistics, both F and W, are sometimes very sensitive to the estimates of the variance-covariance of the error terms. Viewed from the empirical perspective, it s essentially to investigate whether the obtained results are not sensitive to a choice of the variance-covariance estimator. Alternative procedures can be used here, e.g. the Lagrange multiplier test (score test) or the likelihood-ratio test. Engle (1983) shows that these tests (Wald, score & LR) are asymptotically equivalent but in finite samples they can lead to different conclusions. Jakub Mućk Econometrics of Panel Data Testing linear hypotheses Meeting # 3 21 / 21