Specification testing in panel data models estimated by fixed effects with instrumental variables

Similar documents
Deriving Some Estimators of Panel Data Regression Models with Individual Effects

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Estimation of Panel Data Models with Binary Indicators when Treatment Effects are not Constant over Time. Audrey Laporte a,*, Frank Windmeijer b

Econometrics of Panel Data

Econometric Analysis of Cross Section and Panel Data

Specification Tests in Unbalanced Panels with Endogeneity.

Applied Econometrics. Lecture 3: Introduction to Linear Panel Data Models

Econometrics. Week 6. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE

Dealing With Endogeneity

Chapter 15 Panel Data Models. Pooling Time-Series and Cross-Section Data

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data

Final Exam. Economics 835: Econometrics. Fall 2010

1 Motivation for Instrumental Variable (IV) Regression

Econometrics of Panel Data

Advanced Econometrics

Specification Test for Instrumental Variables Regression with Many Instruments

Econometrics of Panel Data

Economics 582 Random Effects Estimation

8. Instrumental variables regression

Introduction to Regression Models for Panel Data Analysis. Indiana University Workshop in Methods February 13, Professor Patricia A.

Modeling GARCH processes in Panel Data: Theory, Simulations and Examples

Multiple Equation GMM with Common Coefficients: Panel Data

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63

Heteroskedasticity-Robust Standard Errors for Fixed Effects Panel Data Regression

Instrumental Variables and the Problem of Endogeneity

Econometrics II - EXAM Outline Solutions All questions have 25pts Answer each question in separate sheets

applications to the cases of investment and inflation January, 2001 Abstract

Missing dependent variables in panel data models

Econ 582 Fixed Effects Estimation of Panel Data

Econometrics II - EXAM Answer each question in separate sheets in three hours

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser

ECON 4551 Econometrics II Memorial University of Newfoundland. Panel Data Models. Adapted from Vera Tabakova s notes

Non-linear panel data modeling

A Course in Applied Econometrics Lecture 7: Cluster Sampling. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Lecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook)

Instrumental Variables and GMM: Estimation and Testing. Steven Stillman, New Zealand Department of Labour


What s New in Econometrics? Lecture 14 Quantile Methods

Short T Panels - Review

Multiple Regression Analysis: Heteroskedasticity

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

TECHNICAL WORKING PAPER SERIES HETEROSKEDASTICITY-ROBUST STANDARD ERRORS FOR FIXED EFFECTS PANEL DATA REGRESSION. James H. Stock Mark W.

EC327: Advanced Econometrics, Spring 2007

DEPARTMENT OF STATISTICS

Fixed Effects Models for Panel Data. December 1, 2014

Linear Panel Data Models

Applied Economics. Panel Data. Department of Economics Universidad Carlos III de Madrid

Heteroskedasticity. We now consider the implications of relaxing the assumption that the conditional

Instrumental Variables

Panel Data Model (January 9, 2018)

Dynamic Panels. Chapter Introduction Autoregressive Model

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,

Økonomisk Kandidateksamen 2004 (I) Econometrics 2. Rettevejledning

Econometrics of Panel Data

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

The outline for Unit 3

Econometrics of Panel Data

New Developments in Econometrics Lecture 16: Quantile Estimation

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables

Applied Statistics and Econometrics. Giuseppe Ragusa Lecture 15: Instrumental Variables

A Practical Test for Strict Exogeneity in Linear Panel Data Models with Fixed Effects

Birkbeck Working Papers in Economics & Finance

A Course in Applied Econometrics Lecture 4: Linear Panel Data Models, II. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Panel data methods for policy analysis

1. You have data on years of work experience, EXPER, its square, EXPER2, years of education, EDUC, and the log of hourly wages, LWAGE

10 Panel Data. Andrius Buteikis,

Jeffrey M. Wooldridge Michigan State University

Lecture 10: Panel Data

α version (only brief introduction so far)

Selection Correction in Panel data Models: An Application to Labour Supply and Wages

1 Estimation of Persistent Dynamic Panel Data. Motivation

Instrumental Variables, Simultaneous and Systems of Equations

Increasing the Power of Specification Tests. November 18, 2018

Multiple Linear Regression

ECON Introductory Econometrics. Lecture 16: Instrumental variables

Econometrics of Panel Data

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley

Linear Regression with Time Series Data

INTRODUCTION TO BASIC LINEAR REGRESSION MODEL

Econometrics Summary Algebraic and Statistical Preliminaries

Linear dynamic panel data models

Generalized Method of Moment

Heteroskedasticity. Part VII. Heteroskedasticity

xtseqreg: Sequential (two-stage) estimation of linear panel data models

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares

Economics 308: Econometrics Professor Moody

CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M.

Nonstationary Panels

Test of hypotheses with panel data

PANEL DATA RANDOM AND FIXED EFFECTS MODEL. Professor Menelaos Karanasos. December Panel Data (Institute) PANEL DATA December / 1

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation

Applied Microeconometrics (L5): Panel Data-Basics

Introductory Econometrics

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation

ECO375 Tutorial 8 Instrumental Variables

Economics 583: Econometric Theory I A Primer on Asymptotics

Introductory Econometrics

Transcription:

Specification testing in panel data models estimated by fixed effects wh instrumental variables Carrie Falls Department of Economics Michigan State Universy Abstract I show that a handful of the regressions based tests tradional to cross-sectional or time series models can be extended to panel data models wh correlated fixed effects. Specifically I extend the tests for endogeney, overindentification, and nonlinearies developed by Wooldridge (1994). This results in regression based tests that can be easily made robust to arbrary heteroskedasticy and/or cluster serial correlation. I would like to thank Jeff Wooldridge for his comments, assistance, and overwhelming patience. Any contribution of my work is truly a result of his teachings.

1. Introduction Estimation methods have become increasing complex as the availabily of longudinal data sets have grown. While there are clearly many benefs to these more sophisticated models, specification testing whin such models can be arduous. Tradional estimation methods for static panel data models are the whin-group and the generalized least squares (GLS) estimators. The key distinction between the two methods is in the treatment of the unobserved component. The GLS estimator assumes the unobserved component and the observed explanatory variables are uncorrelated where as the whin-group estimator allows for correlation between the unobserved component and the time averages of the explanatory variables. The focus of this paper is specification testing in the context of panel data models estimated using the whin-group estimator. Throughout the paper we allow for arbrary correlation between the time averages of the regressors and the unobserved component. We extend robust regression based methods, for testing hypotheses about the condional mean, that have been tradionally applied to eher purely cross-sectional or time-series models. Specifically, we derive test for endogeney, over-identification, and non-linearies for the whingroup estimator. In deriving the above test we assume that at least some of the observed explanatory variables are correlated wh the idiosyncratic errors. This necessates an estimation method involving instrumental variables along wh the whin transformation. The test statistics are derived under minimal assumptions pertaining to the distribution of the unobserved component. We assume the unobserved component is a random variable that is invariant through time for each un of the cross-section. The time 1

invariance allows us to apply standard fixed effects estimation techniques using instruments. One could, of course, allow the unobserved component to trend, assuming we had enough time periods to difference the data before applying the whin transformation. The remainder of the paper is organized as follows: section 2 reviews the existing testing lerature pertaining to panel data models. Section 3 develops the model and assumptions necessary for consistent estimation of the population parameters. Section 4 derives the test statistic and s asymptotic distribution. Section 5 lays out each of the three tests specifically and section 6 presents a brief conclusion. 2. Lerature Review Panel data is unique in that has both a cross-section and time-series dimension. This allows us to construct specification tests by applying eher T or asymptotics. The decision of which dimension you assume fixed verses growing infinely is imperative. Assuming the time-series dimension tending off to infiny changes the whole setting of the model; while the unobserved component is no longer a problem the time-series nature of the observed explanatory variables is. Tradionally the approach in panel data models has been to assume the time dimension is fixed while the cross-sectional dimension tends to infiny. However, Baltagi discusses panel data models containing correlated unobserved components using T asymototics. In deriving the test statistic we assume the cross-section to grow while the timeseries dimension remains fixed. We discuss the implications of allowing the timedimension to grow in the conclusion. Therefore, the focus of the lerature review is on 2

panel data models where the cross-sectional dimension is thought to be significantly large relative to the time-dimension. Much of the testing lerature pertaining to panel data models builds on the work of Hausman (1978). In general, the Hausman test can be applied anytime an econometric model can be consistently estimated under the alternative hypothesis as well as under the null. The test is based on comparing the two estimates. Since, under the null hypothesis both estimation procedures are consistent, therefore, observing a statistical difference between the two provides evidence against the null. The standard, and most widely applied, version of the test assumes that the estimation procedure under the null hypothesis is more efficient. This assumption simplifies the calculation of the variance covariance matrix. While is possible to compute the variance covariance matrix whout maintaining such efficiency assumptions, most statistical packages do not allow for computation of the robust version directly. Hausman and Taylor (1981) derive estimation and testing methods in the context of linear panel data models containing correlated fixed effects where interest lies in the parameters associated wh observed time invariant explanatory variables. More specifically they develop an estimation procedure using instruments to estimate the parameters of the observed time-invariant variables. Where the instruments are the whin transformations of the time-varying explanatory variables that are assumed to have no relationship wh the unobserved component. Therefore, Hausman and Taylor do not rely on instrument from outside the model. They also extend the work of Hausman in the context of testing for correlation between the unobserved component and the included explanatory variables. 3

Metcalf (1996) extends above procedures of Hausman and Taylor to models containing endogenous variables in addion to the correlated fixed effects. That is, Metcalf requires instruments outside of the model. The test statistic developed is then pertaining to possible correlation between the instruments and the unobserved component. Ahn and Low (1996) further extend the testing lerature regarding panel data models through reformulating the Hausman test in the context of GMM estimation. They note that the Hausman test statistic for testing correlation between the unobserved component and the regressors implies that the individual means or time averages of the regressors are exogenous. Their alternative GMM statistic incorporates a much broader set of moment condions signifying that each of the time-varying explanatory variables is exogenous. The key feature of the alternative GMM test statistic is that unlike the Hausman test has power in detecting nonstationary coefficients of the regressors. The tests derived in this paper differ in two distinct ways: first most of the existing tests focus on testing for correlation between the unobserved effect and observed explanatory variables where as here, we assume the unobserved effects are correlated wh the included explanatory variables. The focus here is on testing for correlation between the observed explanatory variables and the idiosyncratic error. Second, the tests derived in this paper focus on computational ease for both the non-robust and robust test statistic. Where, the robust version is robust to arbrary forms of heteroskedasticy as well as whin-group serial correlation. While many statistical packages will compute the 4

non-robust version of the Hausman statistic following a regression; in general is much more tedious obtaining the robust version. In addion, 's not clear how to apply the Hausman test to panel data models containing both correlated unobserved components and endogenous explanatory variables. The test statistics derived in this paper extend the regression based tests derived by Wooldridge (1994). Wooldridge develops a score type test statistic in the context of time series models estimated by two-stage least squares. The appeal of the tests is both computational ease and flexibily. The flexibily of the test results from a partialling out method. This allows for misspecification indicators that are non-linear functions of the instruments, for instruments that are not strictly exogenous, and for errors that that are not assumed to be independent or even uncorrelated. The resulting test statistic can be applied to a variety of hypotheses including endogeney, serial correlation, overindentification, and nonlinearies, all of which can be easily made robust to arbrary forms of heteroskedasticy and serial correlation. 3. Model and Assumptions Let {(z, x, y ): i = 1, ; t = 1,..T }be a sequence of observations, where z is a 1 L vector of instruments, x is a 1 K vector of time-varying explanatory variables, and y is the dependent variable. We assume random sampling in the crosssection and fixed T. Generally, the set of instruments and the set of explanatory variables will overlap. For example, is often the case that time dummies are included in the set of explanatory variables. We are interested in the standard unobserved effects model (2.1) y = x β + c i + u ( i = 1,... ; t = 1,.T). 5

Where, c i is the time-invariant unobserved heterogeney and u is the idiosyncratic error. We are interested in consistently estimating the K 1 vector β. Throughout the paper we allow for arbrary correlation between c i and x, thus we eliminate c i using the fixed effects or whin transformation: (2.2) ÿ = x&& β + ü ( i = 1,... ; t = 1,.T). The time demeaning necessates strict exogeney for the consistency of the estimation procedures. That is (2.3) E( x isu ) = 0, ( s, t = 1,... T). Maintaining we know the fixed effects estimation of β is consistent and asymptotically normal. However, if we believe that at least some of the elements of x are correlated wh the idiosyncratic error then the standard fixed effects estimation procedure is inconsistent. In such cases the existence of instruments allows for estimation of β by pooled two-stage least squares (2SLS) using time-demeaned instruments. Under the appropriate assumptions, this estimation procedure is consistent and asymptotically normal. We then analyze the model (2.1) under the following assumptions. Assumption A.1 E( z is u ) = 0 ( s, t = 1,... T). assumption A sufficient condion for assumption A.1 is the zero condional mean (2.4) E(u z i ) = 0. Where, z i is a T L matrix of instrumental variables. The general analysis assumes A.1 since is a weaker condion. However, is necessary to assume 2.4 in the test for 6

nonlinearies as shown in the examples. The next assumption imposed is the identification condion. * -1 Assumption A.2 Let && x L( && x && z ) = && z Π, where, Π [ E( && zz && )] E( && zx && ) and L( x&& z&& ) denotes the linear projection of x&& onto z&&. Then let T t= 1 (2.5) E( XX && ) * i i A Where A is posive define, that is, the rank of A = K. Thus a necessary condion is the order condion, meaning L K. The pooled 2SLS estimator of β can be expressed as (2.6) ˆ ˆ ˆ -1 ˆ ˆ ˆ -1 ˆ β = ( XX && && ) ( X&& y ) = β + ( XX && && ) ( X&& u ). i i i i i i i i i=1 i=1 i=1 i=1 Where, ˆx&& = z&& ˆΠ are the fted values from the first-stage regression of (2.6) x&& on z&& ; i = 1,... ; t = 1,.T. Under assumptions A.1 and A.2 and fine second moment assumptions p ˆβ β. * This follows from the law of large numbers (LL); since E( X && u )=0 under A.1, i i * * E( X&& X&& ) is nonsingular undera.2, and i i ˆΠ Π p. By the central lim theorem (CLT) (2.7) -1/2 * X && d iui normal (0,B). Where * (2.8) Β E( X && uux&& * ). i=1 i i i i Then the first order asymptotic representation is 7

(2.9) ( ˆ β β) = A -1-1/2 * X && i ui + o p (1). Equation (2.9) shows that the first-stage regression does not affect the liming distribution of the pooled 2SLS estimator. That is, the same liming distribution is i=1 * obtained when ˆx&& replaces x&&, or when ˆΠ replaces Π. Whout imposing further assumptions (2.10) ( ˆ d β β ) normal (0, A -1 BA -1 ). Thus the asymptotic variance of ˆβ is (2.11) avar( ˆβ )= A -1 BA -1 /. We could of course simplify (2.11) by imposing homoskedasticy and no serial correlation assumptions. However, we are interested in the fully robust statistic. Later as a special case we consider the case of no serial correlation and/or homoskedasticy. 3. Specification Tests The specification tests developed here focus on when and if the estimation procedure outline in section two is consistent and necessary. That is we want to test if a particular set of variables are related to the idiosyncratic error. For example, if interest lies in testing when the two stage method is necessary we would want to test whether or not x is endogenous. In deriving the test statistic and s properties we generalize by defining a set of misspecification indicators that are to be tested for correlation wh the idiosyncratic error. This allows us to model the three tests regarding the condional mean of equation 2.2 in one setup. 8

Let v (γ) be a 1 Q vector of misspecification indicators that may possibly depend on some G 1 vector of parameters (γ) and elements of {z, x, y }. Where we assume a consistent estimator of γ, that is, ( ˆ γ - γ) = O p (1). The tests take the null hypothesis to be (3.1) H o : E( v u is) = 0, s,t = 1, T. That is, we are interested in testing whether v is strictly exogenous in (2.1). This is equivalent to (3.2) H o : E( && v && u ) = 0. In deriving a test of (3.1) we note that E( && v && u ) = E( && v u ). Thus the test statistics derived take the null hypothesis to be (3.3) H o : E( && v u ). While the equivalence between (3.1) and (3.3) simplifies the derivation of the test statistics is not whout cost, we have to restrict the analysis to tests about the condional mean. For example we cannot take v = ( u -1,.u -q ), since under the null hypothesis of no serial correlation, (3.3) is naturally violated. Tests of (3.1) are derived under assumptions that are extensions of those made in section 2; when each is used is assumed to hold under the null. However, the robust forms of the tests are most easily estimated under the alternative; we therefore analyze the behavior of the tests under local alternatives. 9

The first assumption is Assumption B.1 Assumption A.1 holds and, in addion (i) E[ && v ( γ )u ] = 0 s,t = 1,.T (ii) E [ && v ( γ )u ] = 0. Where v&& (γ) denotes the gradient of v&& wh respect to γ. Part (ii) of B.1 ensures that the estimation of γ does not effect the liming distribution of the test statistic so long as ( ˆ γ - γ) = O p (1). Requiring B.1 (ii) is innocuous since is trivially satisfied in the test for endogeneney and overidentification under A.1 and for the test of nonlinearies is satisfied under the zero condional mean assumption (2.4). The sample analog of (3.3) is (3.4) T 1 && v i=1 t=1 ˆ uˆ Where ^ denotes that each function is evaluated at ( ˆ β, ˆ γ ). To use (3.4) as a test of (3.3), entails deriving the liming distribution of (3.5) T 1/2 && v i=1 t=1 ˆ uˆ under the null hypothesis. However, is often the case that the liming distribution of (3.5) is different from the liming distribution of T 1/2 && v u. i=1 t=1 That is, we need to know when the estimation of β does not effect the liming distribution. Since B.1 (ii) ensures that the estimation of v&& does not affect the liming distribution so long as ( ˆ γ - γ) = O p (1); deriving a quadratic form of (3.5) that has an asymptotic chi-squared distribution is straightforward, although nothing ensures that the 10

resulting statistic will be easy to compute via regressions. A simpler approach is to transform (3.5) so that the first stage estimation does not affect the liming distribution. This results in robust test statistics that are easy to compute using a series of regressions, and are asymptotically equivalent so long as a consistent estimation procedure is used in the first-stage estimation as shown by Wooldridge (1990). This brings us to our second assumption. Assumption B.2 Assumption A.2 holds and, in addion, let x&& * L( x&& z&&, v&& ). Then rank T T 1 E( r r ) = Q. t= 1 Where r is a 1 Q vector of time-demeaned population residuals from the linear * projection of v&& onto x&& r v&& L( v&& x&& * ). We adjust (3.5) by replacing ˆv&& wh ˆr. This partialling out of v&& essentially allows us to ignore the first-stage estimation when deriving the liming distribution of (3.6) T -1/2 r ˆuˆ. i=1 t=1 Where ˆr is obtained from the regression (3.7) ˆv&& on ˆx&& * ; i = 1,... ; t = 1,.T. Where ˆx&& * are the fted values obtained from the regression (3.8) x&& on ˆv&&, z&& ; i = 1,... ; t = 1,.T. 11

However, in the test for endogentiety and overidentification ˆv&& can be omted from (3.8) since in both tests v&& is a strict subset of z&& under the null hypothesis. We discus when ˆv&& can be omted from (3.8) in the test for nonlinearies in the examples. In the derivation of the first-order asymptotic representation of (3.6) we assume B.1 and B.2 hold under the null hypothesis. The first step shows that estimation of r does not affect the first-order asymptotic distribution. That is, -1/2 T r ˆ ˆ u i=1 t=1 = T -1/2 r ˆ u + o p (1) i=1 t=1 under H o. This can be verified by a mean-value expansion, since ˆr is a linear function of ( ˆv&& ( ˆ γ ), z&& ) B.1(ii) ensures that T -1 * && ˆ i=1 t=1 x uˆ p 0. ext we show that estimation of u does not affect the first-order asymptotic distribution by a mean-value expansion of T T T T -1/2-1/2-1 ˆ -1/2 r ˆ r r && x β β r p i=1 t=1 i=1 t=1 i=1 t=1 i=1 t=1 u = u + ( - ) = u + o (1) By the CLT, ( ˆ β β ) = O p (1) and by definion of x&& *, x&& = x&& * + g where * E( z&& g ) = 0, E( && v g ) = 0. Therefore, E( r g ) = 0, thus E( rx && ) = E( rx && ) = 0; since r is the population residual from the linear projection of v&& onto x&& *, then by the LL -1 T p rx && 0. This asymptotic equivalence simplifies finding the liming i=1 t=1 distribution of (3.6) since we can ignore the fact that r and u have been estimated. 12

Define (3.9) C var ( r u ). T t=1 The test statistic proposed is (3.10) T T -1/2-1 -1/2 rˆ ˆ Cˆ rˆ ˆ i=1 t=1 i=1 t=1 δ ˆ ( u ) ( u ) T T -1 ˆ ˆ ˆ ˆ ˆ i=1 t=1 i=1 t=1. = ( r u )( C) ( r u ) Provided a consistent estimator of C is used ˆ δ d χ 2 wh Q degrees of freedom under B.1 and B.2. Of course the actual form of the test statistic depends on the estimator used for C. We first derive the fully robust version of the test. That is, a test statistic that is robust to heteroskedasticy of unknown form and to cluster serial correlation. A consistent estimator of (3.9) is then T T r -1 (3.11) Cˆ = r ˆˆˆ u uˆ. i=1 s=1 t=1 is is Therefore the proposed test statistic in (3.10) takes the form T T T T (3.12) ˆ 1 δ = ( rˆ u ˆ ) ( rˆ rˆ uˆ u ˆ ) ( rˆ u ˆ ). is is ˆ δ i=1 t=1 i=1 s=1 t=1 i=1 t=1 The above test statistic, in general, cannot be computed via regressions. However, there exist a test statistic that is asymptotically equivalent to (3.12) under H o and local alternatives that can be easily computed using simple regression. The alternative statistic is the fully robust Wald statistic of ˆλ from the regression (3.13) û on ˆr ; i = 1,... ; t = 1,.T. 13

Where ˆλ is the Q 1 vector of coefficients on ˆr. This results in a test statistic of the form T T T T (3.14) ˆ 1 ( rˆ u ˆ )( rˆ rˆ eˆ e ˆ ) ( r ˆ. Γ ˆ u ) is is i=1 t=1 i=1 s=1 t=1 i=1 t=1 Where, ê is the residual from the regression in (3.13). Thus the regression form of the test replaces ( û is û ) wh ( ê is ê ) in the estimation of C, that is (3.14) is just (3.12) estimated under the alternative that λ 0. It can be shown that for sequence of local alternatives λ = λ -1/2, where u = r λ + e ; the test statistic in (3.14) has a liming non-central chi-squared distribution wh non-centraly parameter equal to λ. This facilates interpreting a rejection since λ = λ -1/2 0 as sequence of local alternatives, as well as under H o where ˆλ p 0. under the The more restrictive form of (3.10) adds the following assumptions. Assumption B.3 The homoskedasticy assumption E(u 2 z i,v i ) = σ 2 u. Assumption B.4 The no serial correlation assumption E(u u +j z i,v i ) = 0 j 1. Under B.3 and B.4 a consistent estimator of C is (3.15) Cˆ = σˆ T 2-1 rˆˆ r. i=1 t=1 Wh T 2-1 uˆ 2 i=1 t=1 σ ˆ = ((T-1) This gives the following form for the test statistic in (3.10) T T T T -1 2 1 2 2 u χ Q ˆ δ i=1 t=1 i=1 t=1 i=1 t=1 i=1 t=1 (3.16) ˆ δ = ( rˆ u ˆ ) (((T-1) u ˆ rˆ rˆ ) ( rˆ u ˆ ) = (T-1)R 14

where R u 2 is the R-squared from the regression in (3.13). 4. Examples Example 4.1 (testing for endogeney) ). Partion the model as (4.1) y = x 1 β 1 + x 2 β 2 + c i + u. Where, x 1 is maintained to be exogenous. The null hypothesis of interest is then H o : E( && x 2u ) =0. The model is estimated by Fixed Effects under the null, that is, z = x. The test statistic is based on the sample covariance between u and the fted values from the regression (4.2) x&& 2 on w&& ; i = 1,... ; t = 1,.T where w&& is a set of instruments that contains x&& 1 but not x&& 2. A necessary condion is that w&& contain at least as many variables as x&&. We then define r x&& * 2 - x&& θ, where x&& * 2 is the population fted value from the regression in (4.2). Thus v&& (γ) x&& * 2 = w&& γ. Therefore, the test is really testing if a particular linear combination of the instruments w&& are correlated wh u. Procedure (4.1) (1) Obtain û form the estimation of (2.1) by fixed effects. (2) Obtain the predicted values of x&& * 2 as the fted values from the regression of x&& 2 on w&&. (3) Regress the fted values from step two on x&& and obtain the residuals ˆr. (4) Regress û on ˆr. Then the robust version of the test is attained from computing the fully robust version of the Wald test and the non-robust test is just (T-1) times the R-squared. 15

Example 4.2 (Testing for overidentifying restrictions) There û denotes the pooled 2SLS residuals using instruments z&&. To test for overidentifying restrictions we must have L > K where Q L-K. Let v&& be the 1 Q vector of possible overidentifying restrictions, thus the null hypothesis is H o : E( && v u ) =0. We define r v&& L( v&& ˆx&& ), where are the fted values from the ˆx&& first-stage regression of x&& on z&&. Procedure 4.2 (1) Obtain û from pooled 2SLS on the time-demeaned data using the full set of instruments z&&. (2) Define v&& as a strict subset of z&&, regress v&& on, the fted values from the ˆx&& first-stage regression in step one; save the residuals as ˆr. (3) Same as step four in procedure (4.1) Example 4.3 (Testing for nonlinearies). As in test for endogeney, partion (2.1) as (4.3) y = x 1 β 1 + x 2 β 2 + c i + u ( i = 1,... ; t = 1,.T) where x 1 exogenous and x 2 is endogenous. Therefore, x 1 is contained in z the set of instruments. ote that the test for nonlinearies assumes the more restrictive zero condional mean assumption (2.4) under the null hypothesis. To obtain a RESET-type (Ramsey 1969) we augment the (2.1) (4.4) y = x 1 β 1 + x 2 β 2 + c i + α 1 ( x 1 β 1 + x 2 β 2 + c i ) 2 + α 2 (x 1 β 1 + x 2 β 2 + c i ) 3 + u ( i = 1,... ; t = 1,.T). 16

The null hypothesis is H o : α 1 = α 2 = 0. ote that the time-demeaning of (4.4) eliminates any interaction between the timeinvariant unobservables and time-varying explanatory variables since the correlation between them is usually thought to be though the time average on the explanatory variables. That is, in general we think that the time-demeaned explanatory variables are unrelated to the time-invariant unobservables. This results in the following augmentation of the estimating equation (2.2) (4.5) ÿ = x&& 1 β 1 + x&& 2 β 2 + α 1 ( x&& 1 β 1 + x&& 2 β 2 ) 2 + α 2 ( x&& 1 β 1 + x&& 2 β 2 ) 3 + ü ( i = 1,... ; t = 1,.T). However, if E( && x ci ) 0 then the following test can be viewed as a test for possible interactions between the time-varying explanatory variables and the time-invariant unobservables. Procedure 4.3 (1) Obtain û from the pooled 2SLS regression on the time-demeaned data using instruments z&&. (2) Define the 1 2 vector ĥ {( x&& ˆβ 1 1 + x&& ˆβ 2 2 ) 2, ( x&& ˆβ 1 1 + x&& ˆβ 2 2 ) 3 } where we use the coefficient estimates obtained in step (1). (3) Augment the instruments z wh some non-linear functions of z, that is, define g g(z ). Generally you would include the squares, cubes and cross products of z in g. Obtain ˆx&& * 2 as the fted values from the pooled OLS regression x&& 2 on g&&, z&& ; i = 1,... ; t = 1,.T. Define ˆx&& * ( x&& 1, ˆx&& * 2 ). 17

ote that step three can be omted if E( x&& 2 z&& ) is linear in z&&. Then ˆx&& * ( x&& 1, ˆx&& 2 ), where ˆx&& 2 are the fted values from the first-stage regression of x&& 2 on z&& from step one. (4) Obtain ˆv&& as the 1 2 vector of fted values from the regression ĥ on g&&, z&& ; i = 1,... ; t = 1,.T. (5) Obtain ˆr as the residuals from the regression ˆv&& on ˆx&& * ; i = 1,... ; t = 1,.T. (6) Same as step four in procedure (4.1) 5. Conclusion The above procedures offer simple methods for testing hypotheses about the condional mean in linear panel data models estimated by pooled 2SLS using time-demeaned data. We derived test statistics that are robust to heteroskedasticy of unknown form as well as cluster serial correlation. However, we have not address the presence of Arch errors. That is even the robust form the tests assumes the variance is constant across time. If the time dimension is relatively small such an assumption is minor. However, as the time dimension grows one would need to consider the potential problems associated wh time-series models. Since we assume a fixed T we have not addressed such issues. 18

19