xtseqreg: Sequential (two-stage) estimation of linear panel data models

Similar documents
xtdpdqml: Quasi-maximum likelihood estimation of linear dynamic short-t panel data models

Lecture 8 Panel Data

Dynamic Panels. Chapter Introduction Autoregressive Model

Dynamic Panel Data Models

Improving GMM efficiency in dynamic models for panel data with mean stationarity

Instrumental variables estimation using heteroskedasticity-based instruments

Econometric Analysis of Cross Section and Panel Data

Dynamic Panel Data estimators

EC327: Advanced Econometrics, Spring 2007

Linear dynamic panel data models

1. GENERAL DESCRIPTION

Econometrics of Panel Data

1 Estimation of Persistent Dynamic Panel Data. Motivation

Lecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook)

Syllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools

xtdpdml for Estimating Dynamic Panel Models

Lecture 12 Panel Data

Non-linear panel data modeling

Econometrics of Panel Data

A forward demeaning transformation for a dynamic count panel data model *

Instrumental variables estimation using heteroskedasticity-based instruments

ADVANCED ECONOMETRICS I. Course Description. Contents - Theory 18/10/2017. Theory (1/3)

Instrumental Variables Estimation in Stata

A Course in Applied Econometrics Lecture 4: Linear Panel Data Models, II. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

INTRODUCTION TO BASIC LINEAR REGRESSION MODEL

Applied Microeconometrics (L5): Panel Data-Basics

Dynamic panel data methods

Maximum Likelihood for Cross-Lagged Panel Models with Fixed Effects

Multiple Equation GMM with Common Coefficients: Panel Data

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser

Partial effects in fixed effects models

1. Overview of the Basic Model

Advanced Econometrics

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE

Dealing With Endogeneity

Linear Panel Data Models

Instrumental Variables and GMM: Estimation and Testing. Steven Stillman, New Zealand Department of Labour

Specification testing in panel data models estimated by fixed effects with instrumental variables

Dynamic Panel Data Modeling using Maximum Likelihood: An Alternative to Arellano-Bond

On the Problem of Endogenous Unobserved Effects in the Estimation of Gravity Models

Short T Panels - Review

Bias Correction Methods for Dynamic Panel Data Models with Fixed Effects

MULTILEVEL MODELS WHERE THE RANDOM EFFECTS ARE CORRELATED WITH THE FIXED PREDICTORS

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

1 Motivation for Instrumental Variable (IV) Regression

Economics 671: Applied Econometrics Department of Economics, Finance and Legal Studies University of Alabama

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data

GMM Estimation in Stata

Econometrics. Week 6. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

SUR Estimation of Error Components Models With AR(1) Disturbances and Unobserved Endogenous Effects

Topic 10: Panel Data Analysis

GMM ESTIMATION OF SHORT DYNAMIC PANEL DATA MODELS WITH INTERACTIVE FIXED EFFECTS

Online-Only Appendix to Blunt Instruments: Avoiding Common Pitfalls in Identifying the Causes of Economic Growth

Applied Economics. Panel Data. Department of Economics Universidad Carlos III de Madrid

Fixed Effects Models for Panel Data. December 1, 2014

Estimation of Dynamic Panel Data Models with Sample Selection

Dynamic Panel Data Models

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43

Econometrics for PhDs

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Departamento de Economía Universidad de Chile

Econometrics. 8) Instrumental variables

Basic Regressions and Panel Data in Stata

Session 3-4: Estimating the gravity models

Working Paper Number 103 December 2006

Efficient Estimation of Dynamic Panel Data Models: Alternative Assumptions and Simplified Estimation

Lecture 10: Panel Data

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

Econometrics of Panel Data

Econ 582 Fixed Effects Estimation of Panel Data

System GMM estimation of Empirical Growth Models

Re-estimating Euler Equations

A Robust Test for Weak Instruments in Stata

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

Panel Data: Fixed and Random Effects

Least Squares Estimation of a Panel Data Model with Multifactor Error Structure and Endogenous Covariates

Testing for time-invariant unobserved heterogeneity in nonlinear panel-data models

GMM Estimation of Empirical Growth Models

Instrumental Variables and the Problem of Endogeneity

Missing dependent variables in panel data models

Lecture 6: Dynamic panel models 1

Panel data methods for policy analysis

Microeconometrics. C. Hsiao (2014), Analysis of Panel Data, 3rd edition. Cambridge, University Press.

10 Panel Data. Andrius Buteikis,

Specification Tests in Unbalanced Panels with Endogeneity.

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

Birkbeck Working Papers in Economics & Finance

Capital humain, développement et migrations: approche macroéconomique (Empirical Analysis - Static Part)

1 Introduction The time series properties of economic series (orders of integration and cointegration) are often of considerable interest. In micro pa

GMM-based inference in the AR(1) panel data model for parameter values where local identi cation fails

Day 2A Instrumental Variables, Two-stage Least Squares and Generalized Method of Moments

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

A Transformed System GMM Estimator for Dynamic Panel Data Models. February 26, 2014

17/003. Alternative moment conditions and an efficient GMM estimator for dynamic panel data models. January, 2017

The aggregation problem in its hystorical perspective: a summary overview

Introduction to Panel Data Analysis

Instrumental variables: Overview and advances

Transcription:

xtseqreg: Sequential (two-stage) estimation of linear panel data models and some pitfalls in the estimation of dynamic panel models Sebastian Kripfganz University of Exeter Business School, Department of Economics, Exeter, UK 23 rd UK Stata Users Group Meeting London, September 7, 2017 net install xtseqreg, from(http://www.kripfganz.de/stata/) or ssc install xtseqreg Sebastian Kripfganz (2017) xtseqreg: Sequential (two-stage) estimation of linear panel data models 1/22

Time-invariant regressors in linear panel models In many applications, important determinants of the outcome variable can be time invariant. Education, gender, nationality, ethnic and religious background, and other individual-specific characteristics play important roles in the determination of labor market or health outcomes. Institutional, socio-economic, and geographic factors matter in convergence models of economic growth, and they are key variables in gravity models of international trade and investment flows. A researcher might be particularly interested in their effects. Yet, traditional fixed-effects procedures (xtreg, fe) wipe out all time-invariant variables from the model. Sebastian Kripfganz (2017) xtseqreg: Sequential (two-stage) estimation of linear panel data models 2/22

Time-invariant regressors in linear panel models To identify the coefficients of time-invariant regressors, the assumption that a sufficient number of regressors (or excluded instrumental variables) is uncorrelated with the unit-specific error component cannot be avoided. Identification strategies for static panel models include: Classical random-effects model: xtreg, re, Correlated random-effects (Mundlak, 1978; Chamberlain, 1982) or hybrid models (Allison, 2009; Schunck, 2013): xthybrid (Schunck and Perales, 2017), Hausman and Taylor (1981) model: xthtaylor, Other instrumental variables strategies: xtivreg. Sebastian Kripfganz (2017) xtseqreg: Sequential (two-stage) estimation of linear panel data models 3/22

Time-invariant regressors in linear panel models In the context of dynamic panel models, generalized method of moments (GMM) estimators (Arellano and Bover, 1995; Blundell and Bond, 1998) are frequently employed: xtdpd, xtdpdsys, and xtabond2 (Roodman, 2009). Incorrect assumptions about the exogeneity of some variables may cause inconsistency of all coefficient estimates. A sequential procedure can provide partial robustness to such misspecification. In a first stage, only the coefficients of time-varying regressors are estimated. In a second stage, the coefficients of time-invariant regressors are recovered. New Stata command: xtseqreg Sebastian Kripfganz (2017) xtseqreg: Sequential (two-stage) estimation of linear panel data models 4/22

Two-stage estimation Linear panel data model with time-invariant regressors and error-components structure: y it = x itβ + f i γ + u i + e it Sequential estimation procedure: 1 Estimation of the coefficients of time-varying regressors: y it = x itβ + ũ i + e it, ũ i = f i γ + u i 2 Estimation of the coefficients of time-invariant regressors: y it x it ˆβ = f i γ + u i + ẽ it, ẽ it = e it x it( ˆβ β) Conventional standard errors at the second stage are incorrect and often far too small. xtseqreg computes proper standard errors with the analytical correction term derived by Kripfganz and Schwarz (2015). Sebastian Kripfganz (2017) xtseqreg: Sequential (two-stage) estimation of linear panel data models 5/22

Stata syntax of the xtseqreg command Sebastian Kripfganz (2017) xtseqreg: Sequential (two-stage) estimation of linear panel data models 6/22

Stata syntax of xtseqreg postestimation commands Sebastian Kripfganz (2017) xtseqreg: Sequential (two-stage) estimation of linear panel data models 7/22

Empirical example: distance and FDI Estimation of a gravity model for U.S. outward FDI. Annual data, 1989 1999, for 341 bilateral industry-level relationships, compiled by Egger and Pfaffermayr (2004). Sebastian Kripfganz (2017) xtseqreg: Sequential (two-stage) estimation of linear panel data models 8/22

First-stage system GMM estimation Sebastian Kripfganz (2017) xtseqreg: Sequential (two-stage) estimation of linear panel data models 9/22

First-stage system GMM estimation Replication with xtabond2: Sebastian Kripfganz (2017) xtseqreg: Sequential (two-stage) estimation of linear panel data models 10/22

How (not) to do xtabond2: Always double check! The first two specifications yield identical estimation results. The results from the last specification differ (but should not): Sebastian Kripfganz (2017) xtseqreg: Sequential (two-stage) estimation of linear panel data models 11/22

Second-stage 2SLS estimation Sebastian Kripfganz (2017) xtseqreg: Sequential (two-stage) estimation of linear panel data models 12/22

Second-stage 2SLS estimation Replication with ivregress (incorrect standard errors): Sebastian Kripfganz (2017) xtseqreg: Sequential (two-stage) estimation of linear panel data models 13/22

One-stage GMM estimation Sebastian Kripfganz (2017) xtseqreg: Sequential (two-stage) estimation of linear panel data models 14/22

How (not) to do xtabond2: Remember the assumptions! Instruments for the first-differenced equation are uncorrelated with time-invariant variables by construction, first-differenced instruments for the level equation by assumption. Difference-in-Hansen tests might be based on asymptotically incorrect (or at least debatable) degrees of freedom: Sebastian Kripfganz (2017) xtseqreg: Sequential (two-stage) estimation of linear panel data models 15/22

Alternative first-stage QML estimator First-stage QML estimator of Hsiao et al. (2002): Sebastian Kripfganz (2017) xtseqreg: Sequential (two-stage) estimation of linear panel data models 16/22

Alternative first-stage GMM estimator First-stage GMM estimator of Ahn and Schmidt (1995): Sebastian Kripfganz (2017) xtseqreg: Sequential (two-stage) estimation of linear panel data models 17/22

Time effects Sebastian Kripfganz (2017) xtseqreg: Sequential (two-stage) estimation of linear panel data models 18/22

How (not) to do xtabond2: Beware of the dummy trap! Sebastian Kripfganz (2017) xtseqreg: Sequential (two-stage) estimation of linear panel data models 19/22

How (not) to do xtabond2: Always specify equation()! Instruments for the time dummies should only be included for the level equation. Asymptotically, the additional instruments for the first-differenced equation are redundant. Hansen s J-test is based on incorrect degrees of freedom: Never use the iv() option without suboption equation()! It is not equivalent to the joint specification of iv(, equation(diff)) and iv(, equation(level)): Sebastian Kripfganz (2017) xtseqreg: Sequential (two-stage) estimation of linear panel data models 20/22

Summary: the new xtseqreg package for Stata Sequential estimation can provide partial robustness to model misspecification. Is is important to compute corrected standard errors at the second stage that account for the first-stage estimation error. The new xtseqreg Stata command implements this standard error correction for two-stage linear panel data models. The two-stage procedure is particularly relevant in the presence of time-invariant regressors, but it can be easily applied to more general settings. Kripfganz, S., and C. Schwarz (2015). Estimation of linear dynamic panel data models with time-invariant regressors. ECB Working Paper 1838. European Central Bank. net install xtseqreg, from(http://www.kripfganz.de/stata/) or ssc install xtseqreg help xtseqreg help xtseqreg postestimation Sebastian Kripfganz (2017) xtseqreg: Sequential (two-stage) estimation of linear panel data models 21/22

References Ahn, S. C., and P. Schmidt (1995). Efficient estimation of models for dynamic panel data. Journal of Econometrics 68(1): 5 27. Allison, P. D. (2009). Fixed effects regression models. Quantitative applications in the social sciences 160. Thousand Oaks: SAGE Publications. Arellano, M., and O. Bover (1995). Another look at the instrumental variable estimation of error-components models. Journal of Econometrics 68(1): 29 51. Blundell, R., and S. R. Bond (1991). Initial conditions and moment restrictions in dynamic panel data models. Journal of Econometrics 87(1): 115 143. Chamberlain, G. (1982). Multivariate regression models for panel data. Journal of Econometrics 18(1), 5 46. Egger, P., and M. Pfaffermayr (2004). Distance, trade and FDI: A Hausman-Taylor SUR approach. Journal of Applied Econometrics 19(2), 227 246; data set available from the JAE data archive: http://qed.econ.queensu.ca/jae/2004-v19.2/egger-pfaffermayr/. Hausman, J. A., and W. E. Taylor (1981). Panel data and unobservable individual effects. Econometrica 49(6), 1377 1398. Hsiao, C., M. H. Pesaran, and A. K. Tahmiscioglu (2002). Maximum likelihood estimation of fixed effects dynamic panel data models covering short time periods. Journal of Econometrics 109(1): 107 150. Kripfganz, S., and C. Schwarz (2015). Estimation of linear dynamic panel data models with time-invariant regressors. ECB Working Paper 1838. European Central Bank. Mundlak, Y. (1978). On the pooling of time series and cross section data. Econometrica 46(1), 69 85. Roodman, D. (2009). How to do xtabond2: An introduction to difference and system GMM in Stata. Stata Journal 9(1): 86 136. Schunck, D. (2013). Within and between estimates in random-effects models: Advantages and drawbacks of correlated random effects and hybrid models. Stata Journal 13(1): 65 76. Schunck, D., and F. Perales (2017). Within- and between-cluster effects in generalized linear mixed models: A discussion of approaches and the xthybrid command. Stata Journal 17(1): 89 115. Sebastian Kripfganz (2017) xtseqreg: Sequential (two-stage) estimation of linear panel data models 22/22