Dynamic Panel Data Models

Similar documents
Dynamic Panel Data Models

Dynamic Panel Data estimators

Econometrics of Panel Data

Dynamic Panels. Chapter Introduction Autoregressive Model

xtdpdqml: Quasi-maximum likelihood estimation of linear dynamic short-t panel data models

EC327: Advanced Econometrics, Spring 2007

1 Estimation of Persistent Dynamic Panel Data. Motivation

Fortin Econ Econometric Review 1. 1 Panel Data Methods Fixed Effects Dummy Variables Regression... 7

Lecture 7: Dynamic panel models 2

Dealing With Endogeneity

System GMM estimation of Empirical Growth Models

Linear dynamic panel data models

Working Paper Number 103 December 2006

xtseqreg: Sequential (two-stage) estimation of linear panel data models

Dynamic panel data methods

Advanced Econometrics

Lecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook)

Econometrics. Week 6. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

A Course in Applied Econometrics Lecture 4: Linear Panel Data Models, II. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE

Lecture 8 Panel Data

Bias Correction Methods for Dynamic Panel Data Models with Fixed Effects

Lecture 6: Dynamic panel models 1

Anova-type consistent estimators of variance components in unbalanced multi-way error components models

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data

Econometrics of Panel Data

Corporate Finance Data & The Role of Dynamic Panels. Mark Flannery, University of Florida Kristine W. Hankins, University of Kentucky

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63

Package panelvar. April 3, 2018

A Transformed System GMM Estimator for Dynamic Panel Data Models. February 26, 2014

Econometrics of Panel Data

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Topic 10: Panel Data Analysis

Econometric Analysis of Cross Section and Panel Data

Applied Quantitative Methods II

A forward demeaning transformation for a dynamic count panel data model *

Linear Panel Data Models

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

Panel data methods for policy analysis

1. Overview of the Basic Model

Applied Economics. Panel Data. Department of Economics Universidad Carlos III de Madrid

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

Applied Econometrics. Lecture 3: Introduction to Linear Panel Data Models

Econometrics Homework 4 Solutions

Instrumental Variables, Simultaneous and Systems of Equations

Econ 582 Fixed Effects Estimation of Panel Data

Econometrics. 8) Instrumental variables

Improving GMM efficiency in dynamic models for panel data with mean stationarity

Dynamic Panel Data Ch 1. Reminder on Linear Non Dynamic Models

Consistent estimation of dynamic panel data models with time-varying individual effects

08 Endogenous Right-Hand-Side Variables. Andrius Buteikis,

Econ 510 B. Brown Spring 2014 Final Exam Answers

Applied Microeconometrics (L5): Panel Data-Basics

Advanced Quantitative Methods: panel data

10 Panel Data. Andrius Buteikis,

1 Introduction The time series properties of economic series (orders of integration and cointegration) are often of considerable interest. In micro pa

Endogeneity. Tom Smith

Instrumental variables estimation using heteroskedasticity-based instruments

CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M.

ADVANCED ECONOMETRICS I. Course Description. Contents - Theory 18/10/2017. Theory (1/3)

G. S. Maddala Kajal Lahiri. WILEY A John Wiley and Sons, Ltd., Publication

A Robust Approach to Estimating Production Functions: Replication of the ACF procedure

Graduate Econometrics Lecture 4: Heteroskedasticity

Notes on Panel Data and Fixed Effects models

Nonstationary Panels

The Stata module felsdvreg to fit a linear model with two high-dimensional fixed effects

1 The basics of panel data

Microeconometrics (PhD) Problem set 2: Dynamic Panel Data Solutions

Economics 308: Econometrics Professor Moody

DEPARTMENT OF ECONOMICS AND FINANCE COLLEGE OF BUSINESS AND ECONOMICS UNIVERSITY OF CANTERBURY CHRISTCHURCH, NEW ZEALAND

Fixed and Random Effects Models: Vartanian, SW 683

Econ 1123: Section 5. Review. Internal Validity. Panel Data. Clustered SE. STATA help for Problem Set 5. Econ 1123: Section 5.

ECON 4551 Econometrics II Memorial University of Newfoundland. Panel Data Models. Adapted from Vera Tabakova s notes

Lecture Module 8. Agenda. 1 Endogeneity. 2 Instrumental Variables. 3 Two-stage least squares. 4 Panel Data: First Differencing

Basic econometrics. Tutorial 3. Dipl.Kfm. Johannes Metzler

INTRODUCTION TO BASIC LINEAR REGRESSION MODEL

A Course in Applied Econometrics Lecture 7: Cluster Sampling. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Re-estimating Euler Equations

Least Squares Estimation of a Panel Data Model with Multifactor Error Structure and Endogenous Covariates

6/10/15. Session 2: The two- way error component model. Session 2: The two- way error component model. Two- way error component regression model.

Answer all questions from part I. Answer two question from part II.a, and one question from part II.b.

Introduction to Eco n o m et rics

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation

ECO 2901 EMPIRICAL INDUSTRIAL ORGANIZATION

Please discuss each of the 3 problems on a separate sheet of paper, not just on a separate page!

Simultaneous Equation Models

Short T Panels - Review

ECO375 Tutorial 8 Instrumental Variables

Linear Regression. Junhui Qian. October 27, 2014

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

WISE International Masters

Problem Set 10: Panel Data

Solutions to Problem Set 5 (Due December 4) Maximum number of points for Problem set 5 is: 62. Problem 9.C3

Panel Threshold Regression Models with Endogenous Threshold Variables

PS 271B: Quantitative Methods II Lecture Notes

Internal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9.

Consistent OLS Estimation of AR(1) Dynamic Panel Data Models with Short Time Series

Beyond the Target Customer: Social Effects of CRM Campaigns

Transcription:

June 23, 2010

Contents Motivation 1 Motivation 2 Basic set-up Problem Solution 3 4 5 Literature

Motivation Many economic issues are dynamic by nature and use the panel data structure to understand adjustment. Examples: Demand (i.e. present demand depends on past demand) Dynamic wage equation Employment models Investment of firms; etc.

One way error component model Basic set-up Problem Solution where y it = δy i,t 1 +x itβ +u it i = 1,...,N t = 1,...,T u it = µ i +ν it µ i usual individual effects, when necessary iid(0,σ 2 µ ) ν i usual error term iid(0,σ 2 ν) independent of each other and among themselves

OLS Motivation Basic set-up Problem Solution y it is correlated with µ i y i,t 1 is also correlated with µ i = OLS is biased an inconsistent even if ν it are not serially correlated

Fixed effects Motivation Basic set-up Problem Solution Within transformation sweeps out µ i (y i,t 1 ȳ i, 1 ), where ȳ i, 1 = T t=2 (ν it ν i. ) y i,t 1 T 1 is correlated with FE estimator is biased, BUT consistent for T (not for N )

Random effects Motivation Basic set-up Problem Solution Quasi-demeaning transforms the data to (y i,t 1 θȳ i, 1 ) and accordingly for the other terms (y i,t 1 θȳ i, 1 ) is correlated with (u it θu i. ) because contains u i,t 1 which is correlated with y i,t 1 RE GLS estimator is biased u i.

How to deal with this problem Basic set-up Problem Solution There are several ways in the literature, e.g. correcting for the bias, system GMM estimation techniques, etc. Idea of one approach: IV-estimation Take first differences to get rid of the individual effects Use all the past information of y it for instruments and the structure of the error term to get consistent estimates

Differencing I Motivation First difference y it = δy i,t 1 +u it y it y i,t 1 = δ(y i,t 1 y i,t 2 )+(ν it ν i,t 1 ) First period we have observation on this model is period t = 3, we have y i3 y i,2 = δ(y i,2 y i,1 )+(ν i3 ν i,2 ) y i1 is not correlated with the error and a valid instrument

Differencing II Motivation One period forward we have y i4 y i3 = δ(y i3 y i2 )+(ν i4 ν i3 ) y i1 and y i2 are not correlated with the error and a valid instrument One more instrument for each of the following periods Define a matrix the contains all instruments of individual i [y i1 ] 0 0 0 [y i1,y i2 ] 0 0 W i =.... 0 0 [y i1,...,y i,t 2 ] All instruments in the model: W = [W 1,W 2,...,W N ]

Error term Motivation Variance-covariance matrix of the error where E[ v i v i ] = σ2 ν (I N G) 2 1 0 0 1 2 1 0 0 G =..... 0 0 0 1 2 1 0 0 0 1 2 Since the instruments are orthogonal to the error we have the moment condition (used later for GMM) E[W i v i] = 0

Consistent estimates Pre-multiplying the model with the matrix of all instruments gives W y = W ( y 1 )δ +W ν Performing GLS on this model gives the consistent one step estimator (Arellano and Bond, 1991) ˆδ 1 =[( y 1 ) W(W (I N G)W) 1 W ( y 1 )] 1 x[( y 1 ) W(W (I N G)W) 1 W ( y)]

Optimal GMM estimates It can be shown that the the optimal GMM estimator ( la Hansen) for this model is the same formula except replacing by V N = (W (I N G)W) N i=1 W i( v i )( v i ) W i where the v are obtain from the residuals form the above explained estimation Two step Arellano and Bond (1991) estimator is then ˆδ 1 =[( y 1 ) W(ˆV N ) 1 W ( y 1 )] 1 x[( y 1 ) W(ˆV N ) 1 W ( y)]

The data Motivation Illustration with Arellano-Bonds dataset (can be freely downloaded from the web) firm level employment (Arellano-Bond 1991:Some tests of specification for panel data: Monte Carlo evidence and an application to employment equations, Review of Economic Studies) 140 UK firms annual data 1976-1984 unbalanced

Idea behind the estimations Hiring and firing workers is costly Employment should adjust with delay to changes in factors such as capital stock, wages, and output demand and on the difference between equilibrium employment level and the last years actual level Dynamic model where lags of the dependent variable are also regressors

Summary Statistics Employment is at the firm level and output is at the industry level as a proxy for demand Estimations are done with the logarithm of the variables Table: Summary Statistics Statistics Employment Wage Capital Output Mean 7.89 23.92 2.51 103.80 Standard Deviation 15.94 5.65 6.25 9.94 Minimum.10 8.02.02 86.9 Maximum 108.56 45.23 47.11 128.37 Source: http://www.stata-press.com/data/r10/abdata.dta

First naive approach - OLS Stata comand: reg n nl1 nl2 w wl1 k kl1 kl2 ys ysl1 ysl2 yr* Variable Coefficient (Std. Err.) nl1 1.045 (0.034) nl2-0.077 (0.033) w -0.524 (0.049) wl1 0.477 (0.049) k 0.343 (0.026) kl1-0.202 (0.040) kl2-0.116 (0.028) ys 0.433 (0.123) ysl1-0.768 (0.166) ysl2 0.312 (0.111) year dummies are not reported

Problems with OLS Lagged dependent variable is endogenous to fixed effects in the error term Estimates are inconsistent One can show that there is a positive correlation between regressor and error term Thus it inflates the coefficient for lagged employment by attributing predictive power to it that belongs to the fixed effect One should expect the true estimate to be lower

Fixed effects: disregarding the dynamic structure Stata comand: xtreg n nl1 nl2 w wl1 k kl1 kl2 ys ysl1 ysl2 yr*, fe Variable Coefficient (Std. Err.) L1 employment 0.733 (0.039) L2 employment -0.139 (0.040) Wage -0.560 (0.057) L1 wage 0.315 (0.061) Capital 0.388 (0.031) L1 capital -0.081 (0.038) L2 capital -0.028 (0.033) Output 0.469 (0.123) L1 Output -0.629 (0.158) L2 Output 0.058 (0.135) year dummies are not reported

Problems with Fixed Effects Purging out the individual effects does not eliminate dynamic panel bias, it essentially makes every observation of transformed y* endogenous to the error One cannot use previous lags as instruments Estimates are now biased downwards Reasonable estimates should therefore lie between these FE-and OLS estimates; i.e. between 1.045 and 0.733

Arellano-Bond (difference GMM) Stata comand: xtabond2 n L.n L2.n w L1.w L(0.2).(k,ys) yr*, gmm(l.n) iv(l2.n w L.w L(0.2).(k,ys) yr*) nolevel robust Variable Coefficient (Std. Err.) L1 employment 0.686 (0.145) L2 employment -0.085 (0.056) Wage -0.608 (0.178) L1 wage 0.393 (0.168) Capital 0.357 (0.059) L1 capital -0.058 (0.073) L2 capital -0.020 (0.033) Output 0.609 (0.173) L1 Output -0.711 (0.232) L2 Output 0.106 (0.141) year dummies are not reported

Problems with Arellano-Bond I - Application Estimate of lagged dependent variable is NOT in credible range between OLS and fixed effect estimator Problem? Blundell and Bond 1998 do not expect wages and capital to be strictly exogenous in our employment application Therefore one can instrument them too

Arellano-Bond with more instruments Stata comand: xtabond2 n L.n L2.n w L1.w L(0.2).(k ys) yr*, gmm(l.(n w k)) iv(l(0.2).ys yr*) nolevel robust small Variable Coefficient (Std. Err.) L1 employment 0.818 (0.086) L2 employment -0.112 (0.050) Wage -0.682 (0.143) L1 wage 0.656 (0.202) Capital 0.353 (0.122) L2 capital -0.154 (0.086) L2 capital -0.030 (0.032) Output 0.651 (0.190) L1 Output -0.916 (0.264) L2 Output 0.279 (0.186) year dummies are not reported

Problems with Arellano-Bond II - General Now the estimate is between the previously showed FE and OLS estimate Blundell and Bond (1998) show that difference GMM performs bad when y is close to a random walk because untransformed lags are weak instruments for transformed variables. Instead of transforming the regressors it transforms the instruments to make them exogenous to the fixed effect Additional assumption that first differences of instruments are uncorrelated with fixed effects is necessary.

Arelano-Bond (system GMM) - one step Stata comand: xtabond2 n L.n L(0.1).(w k) yr*, gmmstyle(l.(n w k)) ivstyle(yr*, equation(level)) robust small Variable Coefficient (Std. Err.) L1 employment 1.030 (0.057) L2 employment -0.089 (0.434) Wage -0.641 (0.123) L wage 0.534 (0.148) Capital 0.428 (0.062) L capital -0.374 (0.066) year dummies are not reported Intercept 0.552 (0.197)

Conclusions Motivation Literature How can we estimate a dynamic model with panel data It is relatively complicated in theory but easy with stata One has to carefully check the results from stata, because it always gives estimates.

Literature Motivation Literature Econometric Analysis of Panel Data Baltagi, Badi H., John Wiley & Sons, Ltd (2005); 3rd edition; especially Chapter 8 How to Do xtabond2: An Introduction to Difference and System GMM in Stata Roodman David, Center for Global Development, Working Paper no. 103, revised version 2001