Moment Condition Selection for Dynamic Panel Data Models

Similar documents
Statistical inference in dynamic panel data models

A TWO-STAGE MODEL AND MOMENT SELECTION CRITERION FOR MOMENT RESTRICTION MODELS

Econometrics of Panel Data

High Dimensional Empirical Likelihood for Generalized Estimating Equations with Dependent Data

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE

Bias Correction Methods for Dynamic Panel Data Models with Fixed Effects

1 Procedures robust to weak instruments

1 Glivenko-Cantelli type theorems

Non-Stationary Time Series and Unit Root Testing

GMM-based inference in the AR(1) panel data model for parameter values where local identi cation fails

Improving GMM efficiency in dynamic models for panel data with mean stationarity

Lecture 7: Dynamic panel models 2

Lasso Maximum Likelihood Estimation of Parametric Models with Singular Information Matrices

Efficient Estimation of Dynamic Panel Data Models: Alternative Assumptions and Simplified Estimation

Tail bound inequalities and empirical likelihood for the mean

Non-Stationary Time Series and Unit Root Testing

1 Estimation of Persistent Dynamic Panel Data. Motivation

Lecture 2: Univariate Time Series

Non-Stationary Time Series and Unit Root Testing

Prof. Dr. Roland Füss Lecture Series in Applied Econometrics Summer Term Introduction to Time Series Analysis

A Course on Advanced Econometrics

Dynamic Panel Data Models

Missing dependent variables in panel data models

Generalized Neyman Pearson optimality of empirical likelihood for testing parameter hypotheses

Moment and IV Selection Approaches: A Comparative Simulation Study

Testing Restrictions and Comparing Models

Lecture 13: Subsampling vs Bootstrap. Dimitris N. Politis, Joseph P. Romano, Michael Wolf

Model Specification Testing in Nonparametric and Semiparametric Time Series Econometrics. Jiti Gao

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria

Econ 583 Homework 7 Suggested Solutions: Wald, LM and LR based on GMM and MLE

Estimation of Dynamic Panel Data Models with Sample Selection

Introduction to Self-normalized Limit Theory

ECON3327: Financial Econometrics, Spring 2016

Max. Likelihood Estimation. Outline. Econometrics II. Ricardo Mora. Notes. Notes

What s New in Econometrics. Lecture 15

Likelihood-Based Methods

Next, we discuss econometric methods that can be used to estimate panel data models.

Maximum Likelihood Tests and Quasi-Maximum-Likelihood

Final Exam. 1. (6 points) True/False. Please read the statements carefully, as no partial credit will be given.

MLE and GMM. Li Zhao, SJTU. Spring, Li Zhao MLE and GMM 1 / 22

Generalized Method of Moments Estimation

Testing Algebraic Hypotheses

Using Invalid Instruments on Purpose: Focused Moment Selection and Averaging for GMM

The Time Series and Cross-Section Asymptotics of Empirical Likelihood Estimator in Dynamic Panel Data Models

SEQUENTIAL ESTIMATION OF DYNAMIC DISCRETE GAMES. Victor Aguirregabiria (Boston University) and. Pedro Mira (CEMFI) Applied Micro Workshop at Minnesota

A forward demeaning transformation for a dynamic count panel data model *

Goodness-of-Fit Tests for Time Series Models: A Score-Marked Empirical Process Approach

17/003. Alternative moment conditions and an efficient GMM estimator for dynamic panel data models. January, 2017

The Bootstrap: Theory and Applications. Biing-Shen Kuo National Chengchi University

xtdpdml for Estimating Dynamic Panel Models

The loss function and estimating equations

Semiparametric posterior limits

Nonparametric Tests for Multi-parameter M-estimators

Linear dynamic panel data models

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

large number of i.i.d. observations from P. For concreteness, suppose

7 Introduction to Time Series

On detection of unit roots generalizing the classic Dickey-Fuller approach

Dynamic Panel Data Models

Marginal Regression Analysis of Longitudinal Data With Time-Dependent Covariates: A Generalised Method of Moments Approach

An Introduction to Statistical Machine Learning - Theoretical Aspects -

STAT 461/561- Assignments, Year 2015

Econometric Analysis of Cross Section and Panel Data

Panel Data Exercises Manuel Arellano. Using panel data, a researcher considers the estimation of the following system:

Empirical Market Microstructure Analysis (EMMA)

13. Parameter Estimation. ECE 830, Spring 2014

Least Squares Model Averaging. Bruce E. Hansen University of Wisconsin. January 2006 Revised: August 2006

7 Introduction to Time Series Time Series vs. Cross-Sectional Data Detrending Time Series... 15

Lecture 6: Discrete Choice: Qualitative Response

Model Selection Tutorial 2: Problems With Using AIC to Select a Subset of Exposures in a Regression Model

DYNAMIC PANEL GMM WITH NEAR UNITY. Peter C. B. Phillips. December 2014 COWLES FOUNDATION DISCUSSION PAPER NO. 1962

ECE531 Lecture 10b: Maximum Likelihood Estimation

Iterative Bias Correction Procedures Revisited: A Small Scale Monte Carlo Study

11. Further Issues in Using OLS with TS Data

Asymptotic Properties of Empirical Likelihood Estimator in Dynamic Panel Data Models

A Factor Analytical Method to Interactive Effects Dynamic Panel Models with or without Unit Root

Nonnested Model Selection Criteria

An Information Criteria for Order-restricted Inference

Statistical Inference

Dynamic Panels. Chapter Introduction Autoregressive Model

Ultra High Dimensional Variable Selection with Endogenous Variables

Empirical Evaluation and Estimation of Large-Scale, Nonlinear Economic Models

ECON 3150/4150, Spring term Lecture 6

Persistence Heterogeneity Testing in Panels with Interactive Fixed Effects. Yunus Emre Ergemen and Carlos Velasco. CREATES Research Paper

Inference for Identifiable Parameters in Partially Identified Econometric Models

BIO5312 Biostatistics Lecture 13: Maximum Likelihood Estimation

Bayesian Indirect Inference and the ABC of GMM

Final Examination Statistics 200C. T. Ferguson June 11, 2009

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random

EMPIRICAL LIKELIHOOD METHODS IN ECONOMETRICS: THEORY AND PRACTICE

Dynamic panel data methods

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates

Introduction to Economic Time Series

INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION. 1. Introduction

Minimum Message Length Analysis of the Behrens Fisher Problem

AR, MA and ARMA models

MEI Exam Review. June 7, 2002

Some Non-Parametric Identification Results using Timing and Information Set Assumptions

Munich Lecture Series 2 Non-linear panel data models: Binary response and ordered choice models and bias-corrected fixed effects models

Econ 423 Lecture Notes: Additional Topics in Time Series 1

Transcription:

Moment Condition Selection for Dynamic Panel Data Models Dylan Small Department of Statistics The Wharton School, University of Pennsylvania Joint work with Tze Leung Lai and Jia Liu

Dynamic Panel Data Panel (longitudinal) data set: Follows a sample of units over time, providing a time series for each unit in the sample. In many economic settings, the number of units N is relatively large but the time series for each unit is relatively short (let length =T ). Example: Panel of 738 Spanish manufacturing firms, observed yearly from 1983-1990. We are interested in understanding the dynamics of the firms employment levels. We remove a time trend and transform employment to log employment. Source: Alonso-Borrego and Arellano (1999).

Log Employment 2 3 4 5 6 7 2 3 4 5 6 7 1983 1985 1987 1989 Year 1983 1985 1987 1989 Year Log Employment 2 3 4 5 6 7 Log Employment 2 3 4 5 6 7 Log Employment 1983 1985 1987 1989 Year 1983 1985 1987 1989 Year

Log Employment 3.4 3.6 3.8 4.0 4.2 6.84 6.88 6.92 1983 1985 1987 1989 1983 1985 1987 1989 Year Year Log Employment 3.8 4.0 4.2 Log Employment 3.40 3.50 3.60 Log Employment 1983 1985 1987 1989 Year 1983 1985 1987 1989 Year

Dynamic Panel Data Model y it = θy i,t 1 +α i +u it,i=1,...,n,t=1,...,t E(α i )=0, E(u it )=0. u it iid. α i iid. E(α i u it )=0

Role of Initial Conditions y it = θy i,t 1 + α i + u it. The initial conditions are not specified in the base model. Different assumptions about the initial conditions affect the interpretation of how y it is generated (Anderson and Hsiao, 1981, Estimation of Dynamic Models with Error Components, JASA; 1982, Formulation and Estimation of Dynamic Models Using Panel Data, J. Econ.). 1. y i0 = τ i, τ i and α i are independent. A cross-sectional unit starts from a random initial position and gradually drifts towards its equilibrium level. 2. y i0 = α i. The individual effect represents the initial endowment and the effect of the initial endowment cumulates over time. 3. y i0 = α i 1 θ + v i0, Var(v i0 )=σ 2 v. Each individual s time series is stationary.

Dependence of MLE on Initial Conditions y it = θy i,t 1 + α i + u it. Suppose u it and α i are normally distributed. Anderson and Hsiao (1981): The properties of the MLE depend crucially on the assumption of the initial conditions. Different assumptions about the initial conditions call for different methods to obtain the MLE. Mistaking one case for the other will generally not lead to asymptotically equivalent formulas. Consequently, the misused estimator may be inconsistent. Unfortunately, we usually have little information to rely on in making a correct choice of the initial conditions.

Instrumental Variables Estimators y it = θy i,t 1 + α i + u it,i=1,...,n,t= 1,...,T, u it iid. y it y i,t 1 = θ(y i,t 1 y i,t 2 )+u it u i,t 1. Δy it = θδy i,t 1 +Δu it Anderson and Hsiao proposed two instrumental variables estimators of θ that are consistent regardless of the initial conditions. 1. E[y i,t 2 {Δy it θδy i,t 1 }]= E[y i,t 2 Δu it ]=0. P N i=1 ˆθ IV,1 = P T P t=2 (Δy it)y i,t 2 N P T i=1 t=2 (Δy. i,t 1)y i,t 2 2. E[Δy i,t 2 {Δy it θ(δy i,t 1 )}] = E[(Δy i,t 2 )(Δu it )] = 0. P N i=1 ˆθ IV,2 = P T t=3 (Δy it)(δy i,t 2 ) P Ni=1 P Tt=3 (Δy i,t 1 )(Δy i,t 2 ) Anderson and Hsiao compared the asymptotic variances of ˆθ IV,1 and ˆθ IV,2 and gave guidelines for how to choose between the estimators.

Moment Conditions for Dynamic Panel Data Model A semiparametric version of dynamic panel data model: y it = θy i,t 1 + α i + u it,t=1,...,t (1) E (u it y i0,...,y i,t 1 )=0 (2) (1) is equivalent to system of equations y i1 = θy i0 + α i + u it (3) Δy it = θδy i,t 1 +Δu it (3) is not informative about θ in absence of restrictions on joint distribution of (α i,y i0 ). Set of distributions F for i.i.d. random vectors (y i0,...,y it ) satisfying (1) and (2) for θ = θ 0 is equivalent to the set of distributions F satisfying moment restrictions E[y i,t s (Δy it θ 0 Δy i,t 1 )] = 0, s =0,...,t 2,t=2,...,T (4) All the moment conditions in (4) cannot be used directly in the method of moments because it leads to an overdetermined system of estimating equations.

Generalized Method of Moments Let z i =(y i0,...,y it, x i0,...,x it ), g(z,θ)=y i,t s (Δy it θδy i,t 1 )] = 0, s =0,...,t 2,t=2,...,T. Note that E F [g(z,θ)] = 0 for the true F, θ. In analogy with the method of moments, we can obtain a consistent estimate of θ from minimizing a weighted sum of the sample moments: ˆθ = arg min θ g(z,θ) Wg(z,θ) Equivalent to solving the estimating equation ( g(z,θ) ) Wg(z,θ)=0 θ

Properties of GMM First order asymptotic theory of GMM Hansen (1982,Econ.): The asymptotically optimal weight matrix W is S =[E(gg )] 1. The following two step procedure is asymptotically as efficient as using the weight matrix S: (i) Use a consistent estimate of θ to estimates; (ii) Estimate θ by using the weight matrix Ŝ. Chamberlain (1987,J.Econ.): The two-step GMM procedure is semiparametrically efficient. The first order asymptotic theory does not reflect the sampling fluctuations in estimating the weight matrix. This is particularly problematic when The number of estimating equations is large. Some of the estimating equations are weak in that they provide little information about θ.

Estimation Using Additional Moment Conditions We can construct a more efficient GMM estimator than that based on E[y i,t s (Δy it θ 0 Δy i,t 1 )] = 0,s=0,...,t 2,t=2,...,T if we are willing to make additional assumptions. Example: The stationarity condition. E[α i y i1 ]=E[α i y i0 ] E[( T t=2 y it θy i,t 1 )Δy i1 ]=0

Gains and Losses from Additional Assumptions T =5, n = 250, normal disturbances, 100 iterations. Setting I: θ =.5, initial conditions stationary. Setting II: θ =.5, y i, 2 = α i. Setting III: θ =.9, initial conditions stationary. Setting IV: θ =.9, y i, 2 = α i. Setting V: θ =.5, initial conditions stationary, double exponential disturbances. RMSE of different estimators Estimator I II III IV V BM.09.10.39.04.49 S.35.23.17.19.58 BM+S.09.15.25.17.56 BM=basic model, S=stationarity, BM+S=basic model + stationary

Moment Condition Selection Description of problem How to select which moment conditions to use? Two considerations: 1. Some moment conditions may be based on assumptions about which the researcher is unsure but the moment conditions would provide substantial information about θ if the assumptions are approximately true. 2. Using some true moment conditions may degrade estimation if the moment conditions are only weakly informative about θ.

Review of Literature Pretesting: The estimator is chosen by stepwise hypothesis tests. Overidentification tests for selecting moment restrictions. Two problems with this approach: 1. The selection of significance levels of the tests is subjective and their interpretation is unclear. 2. The tests are based on an asymptotic theory which is unreliable for typical sample sizes being considered. Andrews and Lu (2001, J.Econ.): Variable selection and moment selection is based on penalizing the GMM objective function for the number of parameters and number of moments. For example, Andrews BIC criterion for a model with b variables and c moment conditions is J n (b, c) (c b)lnn J n = g(z, ˆθ) Ŝg(z, ˆθ) Hong, Preston and Shum (2003) developed an approach based on empirical likelihood that is similar in spirit to Andrews and Lu s.

Motivation for New Approach Problem with Andrews and Lu approach: Seeks to choose all possible correct moment restrictions. Does not work well in problems where a subset of the moment restrictions have weak information and it is better not to use them. Our two stage approach: 1. Eliminate all inconsistent estimators using empirical likelihood test statistic. 2. Choose estimator with best variance for parameter(s) of interest using the bootstrap.

Empirical Likelihood Let z i =(y i0,...,y it, x i0,...,x it ), g(z,θ)=y i,t s (Δy it θδy i,t 1 )] = 0, s =0,...,t 2,t=2,...,T. Note that E F [g(z,θ)] = 0 for the true F, θ. The empirical likelihood L E (θ) of θ is the following: n sup{ p i p i 0, i=1 n p i =1, i=1 n p i g(z i,θ)=0} i=1 Empirical likelihood and two step GMM are asymptotically equivalent to the first order. Empirical likelihood tests of moment restrictions are asymptotically optimal in the large deviations sense of Bahadur and Hoeffding (Kitamura, Econ., 2000).

Setup for Two Stage Approach We observe i.i.d. d dimensional random vectors z i from F 0 M where M is the space of probability measures on (R d,b d ). We are interested in a p-dimensional parameter θ Θ. Basic model: We know that F 0 belongs to a family of probability measures P, P(θ) ={F M : E F [g P (z,θ)] = 0}, e.g., P = θ Θ P(θ) Let Q m = θ Θ m Q m (θ) be subsets of P, where the members of Q m have an r m dimensional vector of moment restrictions E[g m (z,θ)] = 0. g(z,θ)=y i,t s (Δy it θδy i,t 1 )] = 0, s =0,...,t 2,t=2,...,T. Q 1 = P, Q 2 : E[{g(z,θ),E[( T t=2 y it θy i,t 1 )Δy i1 ]}].

First Stage: Eliminate Wrong Models Let ˆθ m be the maximum empirical likelihood estimator associated with Q m. Compute the empirical log-likelihood ratio l m =2{n log n log L Qm (ˆθ m )} where L Qm (θ) is the empirical likelihood of θ under Q m. Eliminate Q m from consideration if l m > max{(r m p + q m )nɛ n,χ 2 r m p+q m ;1 α} where nɛ n and ɛ n 0. This ensures that wrong models will consistently eliminated and correct models will consistently not be eliminated. Rationale is related to both asymptotic χ 2 -distribution and the theory of moderate deviations for empirical likelihood ratio statistics. A typical choice of ɛ n is log n 2n in analogy with BIC.

Second Stage Among the models which have not been eliminated from consideration in Step 1, choose the associated estimator ˆθ m which minimizes an estimate of a measure of the size of Cov(ˆθ). Measuring the size of Cov(ˆθ m ): The size of Cov(ˆθ i ) is measured in a way that is suited to the goals of the estimation problem. For example, if we are interested in one particular component of θ, then we use the variance of the estimator of that component. In the presence of moment conditions which only weakly identify the parameters of interest, the bootstrap provides far better estimates of Cov(ˆθ m ) than the asymptotic variance.

Estimates of Variance y it = θy i,t 1 + α i + u it Each time series is stationary, unit normal disturbances, θ =.9. Actual Bootstrap Asymptotic BM 0.17 0.14 0.02 BM+S 0.08 0.09 0.01 S 0.02 0.02 0.01 BM=base model, S=stationarity moment restriction, BM+S=base model plus stationarity moment restriction

Simulation Study y it = θy i,t 1 + α i + u it (α i,u it ) iid N(0,I) Three models: basic model, basic model plus stationarity moment restriction and stationarity moment restriction alone. Three designs: (I) The individual time series are stationary and θ =.9 so that each individual time series is highly persistent. (II) The individual time series are stationary and θ =.5. (III) The individual time series are nonstationary and θ =.9. Each individual time series began two periods before the initial observation, y i, 2 = α i.

Results of Simulation Study Table 1: Design I: Stationary, highly persistent time series RMSE N = 100 N = 250 N = 500 A& L BIC 0.530 0.501 0.369 Two stage 0.389 0.273 0.209 Table 2: Design II: Stationary, moderately persistent time series RMSE N = 100 N = 250 N = 500 A& L BIC 0.158 0.096 0.062 Two stage 0.176 0.107 0.065

Table 3: Design III: Nonstationary, persistent time series RMSE N = 100 N = 250 N = 500 A& L BIC 0.257 0.059 0.029 Two stage 0.254 0.078 0.034

Conclusion Summary Our methodology addresses problem of how to decide whether to use additional plausible, but uncertain, assumptions in parameter estimation for dynamic panel data model. Our two stage approach provides a general methodology for selecting moment conditions when some proposed moment conditions may not be true and other proposed moment conditions may be true but too weak to be useful.