Comments on: Panel Data Analysis Advantages and Challenges. Manuel Arellano CEMFI, Madrid November 2006

Similar documents
Binary Models with Endogenous Explanatory Variables

Microeconometrics. C. Hsiao (2014), Analysis of Panel Data, 3rd edition. Cambridge, University Press.

Flexible Estimation of Treatment Effect Parameters

Non-linear panel data modeling

On IV estimation of the dynamic binary panel data model with fixed effects

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case

Panel Data Exercises Manuel Arellano. Using panel data, a researcher considers the estimation of the following system:

1 Estimation of Persistent Dynamic Panel Data. Motivation

AGEC 661 Note Fourteen

Final Exam. Economics 835: Econometrics. Fall 2010

APEC 8212: Econometric Analysis II

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2016 Instructor: Victor Aguirregabiria

Introduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015

Sensitivity checks for the local average treatment effect

What s New in Econometrics? Lecture 14 Quantile Methods

Limited Dependent Variables and Panel Data

Econometric Analysis of Cross Section and Panel Data

Econometrics of Panel Data

Identification and Estimation of Marginal Effects in Nonlinear Panel Models 1

Introduction to Econometrics

Syllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools

A dynamic model for binary panel data with unobserved heterogeneity admitting a n-consistent conditional estimator

ECO Class 6 Nonparametric Econometrics

UNIVERSITY OF CALIFORNIA Spring Economics 241A Econometrics

Dynamic Panels. Chapter Introduction Autoregressive Model

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes

Quantile methods. Class Notes Manuel Arellano December 1, Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be

Introduction to Causal Inference. Solutions to Quiz 4

Michael Lechner Causal Analysis RDD 2014 page 1. Lecture 7. The Regression Discontinuity Design. RDD fuzzy and sharp

Jinyong Hahn. Department of Economics Tel: (310) Bunche Hall Fax: (310) Professional Positions

Nonlinear Panel Data Analysis

A Course in Applied Econometrics. Lecture 5. Instrumental Variables with Treatment Effect. Heterogeneity: Local Average Treatment Effects.

Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha

New Developments in Econometrics Lecture 16: Quantile Estimation

Efficient Estimation of Non-Linear Dynamic Panel Data Models with Application to Smooth Transition Models

Selection endogenous dummy ordered probit, and selection endogenous dummy dynamic ordered probit models

Panel Data Seminar. Discrete Response Models. Crest-Insee. 11 April 2008

Iterative Bias Correction Procedures Revisited: A Small Scale Monte Carlo Study

Identification in Triangular Systems using Control Functions

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

The relationship between treatment parameters within a latent variable framework

Estimation of Dynamic Nonlinear Random E ects Models with Unbalanced Panels.

Econ 273B Advanced Econometrics Spring

Course Description. Course Requirements

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

Identification of Regression Models with Misclassified and Endogenous Binary Regressor

The problem of causality in microeconometrics.

Munich Lecture Series 2 Non-linear panel data models: Binary response and ordered choice models and bias-corrected fixed effects models

CAUSALITY AND EXOGENEITY IN ECONOMETRICS

Semiparametric Identification in Panel Data Discrete Response Models

The problem of causality in microeconometrics.

GMM-based inference in the AR(1) panel data model for parameter values where local identi cation fails

12E016. Econometric Methods II 6 ECTS. Overview and Objectives

Econometric Analysis of Games 1

Parametric identification of multiplicative exponential heteroskedasticity ALYSSA CARLSON

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Harvard University. A Note on the Control Function Approach with an Instrumental Variable and a Binary Outcome. Eric Tchetgen Tchetgen

NBER WORKING PAPER SERIES A NOTE ON ADAPTING PROPENSITY SCORE MATCHING AND SELECTION MODELS TO CHOICE BASED SAMPLES. James J. Heckman Petra E.

ECO 2403 TOPICS IN ECONOMETRICS

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria

Industrial Organization II (ECO 2901) Winter Victor Aguirregabiria. Problem Set #1 Due of Friday, March 22, 2013

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case

Efficient Estimation of Dynamic Panel Data Models: Alternative Assumptions and Simplified Estimation

Estimation of Treatment Effects under Essential Heterogeneity

Informational Content in Static and Dynamic Discrete Response Panel Data Models

Econometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous

Conditions for the Existence of Control Functions in Nonseparable Simultaneous Equations Models 1

Four Parameters of Interest in the Evaluation. of Social Programs. James J. Heckman Justin L. Tobias Edward Vytlacil

Groupe de lecture. Instrumental Variables Estimates of the Effect of Subsidized Training on the Quantiles of Trainee Earnings. Abadie, Angrist, Imbens

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

ECONOMETRICS HONOR S EXAM REVIEW SESSION

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Applied Microeconometrics Chapter 8 Regression Discontinuity (RD)

Program Evaluation with High-Dimensional Data

On Variance Estimation for 2SLS When Instruments Identify Different LATEs

Chapter 2. Dynamic panel data models

Quantitative Economics for the Evaluation of the European Policy

The Problem of Causality in the Analysis of Educational Choices and Labor Market Outcomes Slides for Lectures

Random Coefficients on Endogenous Variables in Simultaneous Equations Models

Imbens, Lecture Notes 2, Local Average Treatment Effects, IEN, Miami, Oct 10 1

Parametric Identification of Multiplicative Exponential Heteroskedasticity

Supplementary material to: Tolerating deance? Local average treatment eects without monotonicity.

An Alternative Assumption to Identify LATE in Regression Discontinuity Design

A Course in Applied Econometrics Lecture 4: Linear Panel Data Models, II. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Methods to Estimate Causal Effects Theory and Applications. Prof. Dr. Sascha O. Becker U Stirling, Ifo, CESifo and IZA

Principles Underlying Evaluation Estimators

The Generalized Roy Model and Treatment Effects

A nonparametric test for path dependence in discrete panel data

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case

Short Questions (Do two out of three) 15 points each

Applied Health Economics (for B.Sc.)

Discussion of Bootstrap prediction intervals for linear, nonlinear, and nonparametric autoregressions, by Li Pan and Dimitris Politis

Next, we discuss econometric methods that can be used to estimate panel data models.

Identification and Estimation of Nonlinear Dynamic Panel Data. Models with Unobserved Covariates

Estimation of Structural Parameters and Marginal Effects in Binary Choice Panel Data Models with Fixed Effects

Lecture 8. Roy Model, IV with essential heterogeneity, MTE

An Economic Analysis of Exclusion Restrictions for Instrumental Variable Estimation

Transcription:

Comments on: Panel Data Analysis Advantages and Challenges Manuel Arellano CEMFI, Madrid November 2006 This paper provides an impressive, yet compact and easily accessible review of the econometric literature on panel data analysis. Professor Cheng Hsiao has succeeded in surveying, in a coherent manner, classic results as well as more challenging recent developments on nonlinear models, crosssectional dependence, and long time-series panels. The coverage of topics in the article reflects the breadth of Professor Hsiao s important contributions to panel data econometrics over many years. In my comments I will focus on two aspects of recent work on nonlinear models. Firstly, I will discuss state dependence and dynamics from a treatment effect perspective. Secondly, I will consider the issue of choice of population framework and its implications for identifiability. Both comments are closely interconnected. 1. State dependence and treatment effects Take a random sample of binary sequences (y i1,..., y it ). Unit i chooses 1 or 0 in period t. This choice may depend on the choice in t 1. The purpose is to measure this dependence. The problem can be cast into the framework of potential outcomes: y it = ( yit (1) if y it 1 =1 y it (0) if y it 1 =0. The treatment is y it 1, and the potential outcomes are y it (1),y it (0). The causal effect for person i is y it (1) y it (0). A measure of population state dependence is provided by the average treatment effect E [y it (1) y it (0)]. We may also consider a conditional average given some exogenous variables or covariates. A discussion on identified bounds in this setting is in Manski (2006). Because (y i1,..., y it ) is a sequence of outcomes, it is difficult to imagine a conceptual experiment that would justify a nonstructural treatment-effects formulation. One could assign initial conditions randomly and regard the rest of the time series as a vector of outcomes, but this is not typically the intention when seeking to measure the extent of state dependence. Thus, it is natural to regard the potential outcome representation as describing a structural decision rule. 1.1 Exclusion restrictions I wish to discuss an aspect of the identifying content of time-varying covariates, which is standard in the context of linear models but has not received attention from a potential outcome perspective. I consider a nonparametric partial adjustment structural model that exploits exclusion restrictions in a time-varying strictly exogenous covariate x T i =(x i1,..., x it ). 1

The idea is that Pr [y it (s)] (s =0, 1) is conditional on x T i, but we would expect Pr [y it (s)] to be more sensitive to x it than to x 0 s from other periods. A drastic but convenient implementation of this notion is: Pr [y it (s) x i1,..., x it ]=Pr[y it (s) x it ]. So, using x it 1, we have the instrumental-variable (IV) assumption {y it (0),y it (1)} x it 1 x it. As an example, think of y it as smoking status and suppose that cigarette prices x it 1 and x it are set exogenously. The IV assumption says that, given current prices, (past smoking-induced) potential smoking outcomes are independent of past prices. This is the type of situation discussed in the local average treatment effect (LATE) literature (c.f. Imbens and Angrist, 1994). Using a potential outcome formulation for y it 1 and a binary x it 1 : y it 1 = ( [1] y it 1 if x it 1 =1 y [0] it 1 if x it 1 =0, we can distinguish between compliers (those induced to quit smoking by changing x it 1 from 0 to 1: y [0] it 1 y[1] it 1 =1), stayers, and defiers (those with y[0] it 1 y[1] it 1 = 1). If we rule out defiers, the distributions of y it (0) and y it (1) for compliers are point identified: ³ Pr y it (s) y [0] it 1 y[1] it 1 =1,x it (s =0, 1). Given this, we can get measures of state dependence (addiction) and price effects on smoking. Note that we have defined two different sequences of potential outcomes, y [s] it and y it (s). 1.2 Exogeneity An alternative conditional exogeneity assumption is {y it (0),y it (1)} y it 1 x it. This is a strong assumption because y it 1 is not randomly assigned. A linear version of this is the standard partial adjustment model without serial correlation. 1.3 Fixed effects The previous discussion can be thought of as being conditional on time-invariant observable covariates. The panel literature has emphasized parametric situations where the results hold conditional on a time-invariant unobserved effect α i : {y it (0),y it (1)} y it 1 x it, α i (1) 2

or {y it (0),y it (1)} x it 1 x it, α i, thus allowing for fixed-effects endogeneity of y it 1 or x it 1. In situations of this kind, we only have fixed-t point identification for particular objects in certain models. An example of (1) is the binary autoregressive formulation y it (s) =1(γs + α i + v it 0) (s =0, 1) (2) where v it are iid across i and t, independent of α i, with logit or probit cdf F. The average treatment effect in this case is φ E [y it (1) y it (0)] = E αi [F (γ + α i ) F (α i )]. There is point identification of γ for logit if T 4, but not for probit, although the identified set for γ seems to be small (Honoré and Tamer, 2006). There is set identification for φ for both logit and probit. 1.4 Unobserved heterogeneity and identification Take just one individual time series and think of it as the realization of a well defined, suitably stable, but individual-specific, stochastic process. A descriptive measure of unit s i persistence is the first-order autocorrelation: ρ i = P i (y it =1 y it 1 =1) P i (y it =1 y it 1 =0) = plim 1 X y it 1 X y it T 1 T 0 T y it 1 =1 y it 1 =0 where T 1 = P T t=2 y it 1 and T 0 = P T t=2 (1 y it 1). On the other hand, a time-series average of causal effects is: 1 r i = plim T T TX [y it (1) y it (0)]. t=1 In general, ρ i and r i are different concepts. Note that y it =[y it (1) y it (0)] y it 1 + y it (0), so that we have r i = ρ i if [y it (0),y it (1)] are independent of y it 1 over time. For example, this is true for the binary fixed-effect autoregressive model (2). A cross-sectional measure of persistence in a two-period panel is π t = Pr(y it =1 y it 1 =1) Pr (y it =1 y it 1 =0) = plim 1 X y it 1 X y it N 1 N 0 N y it 1 =1 y it 1 =0 3

where N 1 = P N i=1 y it 1 and N 0 = P N i=1 (1 y it 1). Ifρ i and π t are constant for all i and t, theywill coincide, but not otherwise. The microeconometric literature on genuine versus spurious state dependence has been concerned with approximating summary measures of ρ i from short panels. This may still be a descriptive pursuit, although E (ρ i ) is arguably more informative than π t because it distinguishes between cross-sectional unobserved heterogeneity and unit-specific time-series persistence. Even for some of these descriptive objects, we lack point identification under fixed T. However, the fact that ρ i or cross-sectional functionals of it are not point identified from a fixed-t perspective, reflects a limitation of this perspective when T is statistically informative. I now turn to discuss this problem. 2. Population framework and identification Fixed T identification may be problematic because it rules out statistical learning from individual time series data. For micro panels of moderate time dimension, approximate solutions to the incidental parameter problem from a time-series perspective (reviewed in Arellano and Hahn, 2007) are a promising avenue for progress. In this literature three different approaches can be distinguished. One approach is to construct and analytical or numerical bias correction of a fixed effects estimator. A second approach is to consider estimators from bias corrected moment equations. The third one is to consider estimation from a bias corrected objective function relative to some target criterion. The latter is particularly attractive for its simplicity, specially in models with multiple fixed effects. By way of illustration, suppose a likelihood model for independent data with common parameter θ and a potentially vector-valued individual effect α i, where the log likelihood for individual i is P T t=1 `it (θ, α i ).Amodifiedconcentrated likelihood that produces estimates of the common parameters with bias of order 1/T 2 or less is given by L M (θ) = " NX X T # `it (θ, bα i (θ)) + 1 2 ln det H i (θ) 1 2 ln det Υ i (θ) i=1 t=1 where bα i (θ) is the maximum likelihood estimate of α i for given θ, H i (θ) = P T t=1 2`it (θ, bα i (θ)) / α i α 0 i, and Υ i (θ) = P T t=1 q it (θ) q it (θ) 0,whereq it (θ) = `it (θ, bα i (θ)) / α i. Thus, as discussed in Arellano and Hahn (2007), the adjustment depends exclusively on the sample Hessian H i (θ) and the sample outer product of score term Υ i (θ). 3. Final remarks In panel data analysis, there is a choice of population framework, which may lead to conflicting identification arrangements. In situations of this kind, there is much to be learned from research on both partial identification and estimability issues. 4

References Arellano, M. and J. Hahn (2007): Understanding Bias in Nonlinear Panel Models: Some Recent Developments. In: R. Blundell, W. Newey, and T. Persson (eds.): Advances in Economics and Econometrics, Ninth World Congress, Cambridge University Press. Honoré, B. E. and E. Tamer (2006): Bounds on Parameters in Panel Dynamic Discrete Choice Models, Econometrica, 74, 611 629. Imbens,G.W.andJ.Angrist(1994): Identification and Estimation of Local Average Treatment Effects, Econometrica, 62, 467 475. Manski,C.F.(2006): Two Problems of Partial Identification with Panel Data, 13th Conference on Panel Data, Cambridge. 5