ON THE PRACTICE OF LAGGING VARIABLES TO AVOID SIMULTANEITY

Similar documents
DEPARTMENT OF ECONOMICS AND FINANCE COLLEGE OF BUSINESS AND ECONOMICS UNIVERSITY OF CANTERBURY CHRISTCHURCH, NEW ZEALAND

DEPARTMENT OF ECONOMICS AND FINANCE COLLEGE OF BUSINESS AND ECONOMICS UNIVERSITY OF CANTERBURY CHRISTCHURCH, NEW ZEALAND

Testing for Unit Roots with Cointegrated Data

Econometrics of Panel Data

Endogeneity. Tom Smith

System GMM estimation of Empirical Growth Models

LECTURE 11. Introduction to Econometrics. Autocorrelation

Corporate Finance Data & The Role of Dynamic Panels. Mark Flannery, University of Florida Kristine W. Hankins, University of Kentucky

DEPARTMENT OF ECONOMICS AND FINANCE COLLEGE OF BUSINESS AND ECONOMICS UNIVERSITY OF CANTERBURY CHRISTCHURCH, NEW ZEALAND

P E R S P E C T I V E S

DEPARTMENT OF ECONOMICS COLLEGE OF BUSINESS AND ECONOMICS UNIVERSITY OF CANTERBURY CHRISTCHURCH, NEW ZEALAND

Department of Economics, UCSB UC Santa Barbara

Economics 270c Graduate Development Economics. Professor Ted Miguel Department of Economics University of California, Berkeley

Dynamic Panel Data Models

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Motivation Non-linear Rational Expectations The Permanent Income Hypothesis The Log of Gravity Non-linear IV Estimation Summary.

1 Estimation of Persistent Dynamic Panel Data. Motivation

A Robust Approach to Estimating Production Functions: Replication of the ACF procedure

Chapter 2. Dynamic panel data models

Linear dynamic panel data models

A TIME SERIES PARADOX: UNIT ROOT TESTS PERFORM POORLY WHEN DATA ARE COINTEGRATED

PhD/MA Econometrics Examination January 2012 PART A

Exogenous Treatment and Endogenous Factors: Vanishing of Omitted Variable Bias on the Interaction Term

The TransPacific agreement A good thing for VietNam?

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data

Inflation Revisited: New Evidence from Modified Unit Root Tests

Dynamic Panels. Chapter Introduction Autoregressive Model

Lecture 7: Dynamic panel models 2

Improving GMM efficiency in dynamic models for panel data with mean stationarity

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE

More on Specification and Data Issues

Lecture 5: Unit Roots, Cointegration and Error Correction Models The Spurious Regression Problem

Panel Data Exercises Manuel Arellano. Using panel data, a researcher considers the estimation of the following system:

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Relationship between Trade Openness and Economic Growth in. Sri Lanka: a Time Series Analysis

Predicting bond returns using the output gap in expansions and recessions

Lecture 6: Dynamic panel models 1

Next, we discuss econometric methods that can be used to estimate panel data models.

EC327: Advanced Econometrics, Spring 2007

A forward demeaning transformation for a dynamic count panel data model *

Markov-Switching Models with Endogenous Explanatory Variables. Chang-Jin Kim 1

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Applied Microeconometrics (L5): Panel Data-Basics

Modified Variance Ratio Test for Autocorrelation in the Presence of Heteroskedasticity

Short T Panels - Review

Nearly Unbiased Estimation in Dynamic Panel Data Models

Topic 4 Forecasting Exchange Rate

Research Division Federal Reserve Bank of St. Louis Working Paper Series

An overview of applied econometrics

Instrumental Variables, Simultaneous and Systems of Equations

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Lecture 6: Dynamic Models

Testing for Regime Switching in Singaporean Business Cycles

EMERGING MARKETS - Lecture 2: Methodology refresher

Capital, Institutions and Urban Growth Systems

A Non-Parametric Approach of Heteroskedasticity Robust Estimation of Vector-Autoregressive (VAR) Models

Limited Dependent Variables and Panel Data

REED TUTORIALS (Pty) LTD ECS3706 EXAM PACK

Bias Correction Methods for Dynamic Panel Data Models with Fixed Effects

1 Introduction The time series properties of economic series (orders of integration and cointegration) are often of considerable interest. In micro pa

Identifying the Monetary Policy Shock Christiano et al. (1999)

Testing Random Effects in Two-Way Spatial Panel Data Models

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Regression with time series

DISCUSSION PAPERS IN ECONOMICS

University of Pretoria Department of Economics Working Paper Series

Defence Spending and Economic Growth: Re-examining the Issue of Causality for Pakistan and India

Lecture 8 Panel Data

xtdpdml for Estimating Dynamic Panel Models

DEPARTMENT OF ECONOMICS COLLEGE OF BUSINESS AND ECONOMICS UNIVERSITY OF CANTERBURY CHRISTCHURCH, NEW ZEALAND

Vector Autoregressions as a Guide to Constructing Dynamic General Equilibrium Models

Government Expenditure and Economic Growth in Iran

Global Value Chain Participation and Current Account Imbalances

The challenge of globalization for Finland and its regions: The new economic geography perspective

Journal of Economic Cooperation and Development, 35, 2 (2014), 71-90

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

Econometrics - 30C00200

Least Squares Estimation of a Panel Data Model with Multifactor Error Structure and Endogenous Covariates

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables

Club Convergence: Some Empirical Issues

INTRODUCTION TO BASIC LINEAR REGRESSION MODEL

Econometrics Summary Algebraic and Statistical Preliminaries

A Note on the Correlated Random Coefficient Model. Christophe Kolodziejczyk

Econometrics 2, Class 1

Does the Use of Imported Intermediates Increase Productivity? Plant-Level Evidence

Econometrics. Week 6. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

A Course in Applied Econometrics Lecture 4: Linear Panel Data Models, II. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Linear Panel Data Models

The Predictability of Aggregate Consumption Growth in OECD Countries: A Panel Data Analysis

Are Forecast Updates Progressive?

Consistent OLS Estimation of AR(1) Dynamic Panel Data Models with Short Time Series

Econometrics. 9) Heteroscedasticity and autocorrelation

Introduction to Econometrics

ECON3327: Financial Econometrics, Spring 2016

PRELIMINARY ANALYSIS OF SPATIAL REGIONAL GROWTH ELASTICITY OF POVERTY IN SUMATRA

Answer all questions from part I. Answer two question from part II.a, and one question from part II.b.

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

xtseqreg: Sequential (two-stage) estimation of linear panel data models

Transcription:

ON THE PRACTICE OF LAGGING VARIABLES TO AVOID SIMULTANEITY by W. Robert Reed Department of Economics and Finance University of Canterbury Christchurch, New Zealand 28 August 2013 Contact Information: W. Robert Reed, Private Bag 4800, Christchurch 8140, New Zealand. Phone: +64-3-364-2846; Email: bob.reed@canterbury.ac.nz.

Abstract A common practice in applied econometrics work consists of replacing a suspected endogenous variable with its lagged values. This note demonstrates that lagging an endogenous variable does not enable one to escape simultaneity bias. The associated estimates are still distorted by simultaneity bias, and hypothesis testing is invalid. Further, I show that these problems are exacerbated when the suspected endogenous variable is characterised by serial correlation, as is likely often to be the case. Keywords: Simultaneity, Endogeneity, Lagged variables JEL classification: C1, C15, C2, C3

I. INTRODUCTION Simultaneity is a concern in much of empirical economic analysis. One approach that has been employed to avoid the problems associated with simultaneity is to replace a suspected endogenous variable with its lagged values. The practice is widespread, as can be confirmed by searching for variations of avoid simultaneity lagged variables on Google Scholar. Recent examples are Aschoff and Schmidt (2008); Bania, Gray, and Stone (2007); Bansak, Morin and Starr (2007); Brinks and Coppedge (2006); Buch, Koch, and Koetter (2013); Clemens, Radelet, Bhavnani and Bazzi (2012); Cornett, Marcus, Saunders, and Tehranian (2007); Green Malpezzi, and Mayo (2005); Gupta (2005); Hayo, Kutan, and Neuenkirch (2010); MacKay and Phillips (2005); Spilimbergo (2009); Stiebala (2011); and Vergara (2010). Many of these papers are heavily cited. The rationale for the practice is explicitly identified in statements such as the following: We avoid poor-quality instrumental variables and instead address potential biases from reverse and simultaneous causation by the more transparent methods of lagging and differencing 1 (Clemens, Radelet, Bhavnani and Bazzi, 2012); The vector of controls contains lagged returns Contemporaneous U.S. returns are excluded to avoid simultaneity problems (Hayo, Kutan, and Neuenkirch, 2010); and The variable is expressed as a percentage of GDP. The lagged variable was used in both cases to avoid possible simultaneity problems (Vergara, 2010). 2 1 Lagging is the authors solution for reverse causation; differencing is designed to eliminate endogeneity from unobserved fixed effects. 2 Other examples are: Innovation intensity is included as a lagged variable in order to mitigate simultaneity problems (Aschhoff and Schmidt, 2008, p. 48); The variables IP, I/K, and STDEV are intended to capture effects on utilization of output growth, investment level, and output volatility, respectively; they are included in lagged form to avoid problems with simultaneity (Bansak et al., 2007, p. 636f.); so long as we avoid the simultaneity problem and incorporate the main sources of common shocks to the countries at issue (as we do by lagging the diffusion variable and including the most important domestic variables), our estimates should be relatively unbiased and consistent (Brinks and Coppedge, 2006, p. 476); We lag the explanatory variables X i,t- 1 by one period to avoid simultaneity (Buch et al., 2013, p. 1415); We lag all measures of institutional ownership and institutional board membership by one year. This lag allows for the effect of any change in governance structure to show up in firm performance. This also mitigates simultaneity issues (Cornett et al., 2007, p. 1781); We therefore also perform regressions with lagged changes to avoid simultaneity problems 1

The purpose of this note is to draw attention to the fact that replacing a contemporaneous explanatory variable with its lagged value does not avoid the inconsistency problems associated with simultaneity. Furthermore, the problem is exacerbated by the presence of serial correlation in the endogenous variable. II. THEORY Let Y be a function of either, or both, contemporaneous and lagged X and let us assume that the effect of X on Y is represented by the following relationship: (1) Y t = a + bx t + cx t 1 + ε t, where ε t ~ NID(0, σ Y ), and b and/or c may be zero. A researcher suspects that Y and X are simultaneously determined. In an effort to avoid simultaneously bias, the researcher estimates (1 ) Y t = α + βx t 1 + error term. To determine the relationship between β and c, one needs to know how X is affected by contemporaneous Y and (possibly) lagged X. Let us assume this relationship is represented by (2) X t = d + ey t + fx t 1 + ν t, where ν t ~ NID(0, σ X ), Then (3) Y t = (a + bd) (c + bf) + (1 be) (1 be) X t 1 + b (1 be) ν t + 2 1 (1 be) ε t. 3 It follows that OLS estimation of Equation (1 ) produces a consistent estimate of the reduced form coefficient on lagged X, (Green et al., 2005, p. 335f.); First, to minimize the possibility of simultaneity between privatization and performance, we investigate the impact of the lagged share of private ownership on current performance (Gupta, 2005, p. 989); We lag the industry medians to avoid endogeneity problems (MacKay and Phillips, 2005, p. 1450); To avoid simultaneity bias, this specification has explanatory variables lagged five years in the five-year specifications as well (Spilimbergo, 2009, p. 536); In all specifications lagged values of the financial indicators are used. This is to allow for a time lag between financial development and the export decision, since planning and realisation of foreign market entry and expansion might take time. The use of lagged values also reduces simultaneity problems (Stiebale, 2011, p. 130). 3 The corresponding value of X t is X t = (d+ae) + (f+ce) X (1 be) (1 be) t 1 + 1 ν (1 be) t + e ε (1 be) t.

(4) plim β = (c + bf) (1 be), where simultaneity is represented by the parameter e, and serial correlation in the explanatory variable by f. In general, plim β c, where c is the true effect of X t-1 on Y. 4 (5) Y t = A similar problem arises if we regress the change in Y on lagged X. In this case, OLS estimation of (a + bd) (1-be) (c + bf) + (1 be) X t 1 Y t 1 + (5 ) Y t = α + βx t 1 + γy t 1 + error term, b (1 be) ν t + 1 (1 be) ε t. produces a consistent estimate of the reduced form coefficient on lagged X, so that again plim β = (c + bf) (1 be) c. The preceding demonstrates that lagging X does not enable one to escape simultaneity bias. Furthermore, serial correlation in the explanatory variable further distorts parameter estimation and hypothesis testing. The next section uses Monte Carlo analysis to highlight these points. III. SIMULATION RESULTS TABLE 1 presents the first set of simulation results. In the first two simulations, the DGP is: 6a) Y t = 0.5 X t + 0 X t-1 + ε t X t = 0.5 Y t + f X t-1 + ν t, ε t,, ν t ~NID(0,1); where f is alternatively set equal to 0 and 0.5. Note that Y t and X t are simultaneously determined, and that the true effect of X t-1 on Y t is zero. When f = 0, there is no serial correlation in X (SIMULATION 1). However, when f = 0.5, X t and X t-1 are correlated, with a mean sample correlation of approximately 0.63 (SIMULATION 2). For each case I use OLS to estimate the equation 4 It is sometimes stated that the use of lagged variables reduces or mitigates simultaneity bias. However, these statements have no meaning because the coefficients on X t and X t-1 measure inherently different things. 3

6b) Y t = α + β X t-1 + error term. The simulated datasets vary in size from T=10 to T=1000. For each value of T, 1000 data sets are generated, producing 1000 estimates of β, the coefficient on X t-1 in Equation (6b). If X t affects Y t, and X is serially correlated, then regression of Y t on X t-1 will incorrectly estimate the effect of X t-1 on Y t. The table reports the mean estimate of β for each set of replications, along with the rate at which the null hypothesis, H 0 : β = 0, is rejected. Rejection rates larger than 0.05 would cause the researcher to too-frequently conclude that X t-1 has a direct impact on Y t. As noted above, the only difference between SIMULATION 1 and SIMULATION 2 is serial correlation in X. In SIMULATION 1, the mean estimated value of β suffers from finite sample bias, but converges to zero as the sample sizes increase in size. The rejection rates for H 0 : β = 0 are all close to 0.05. In contrast, in the presence of serial correlation (SIMULATION 2), the estimates of β converge to their reduced form value of (c+bf) (1 be) = 0.33. This causes the researcher to incorrectly conclude that X t-1 has a direct impact on Y t in more than 5 percent of the estimated equations. In fact, the rejection rate of the null converges to 1.000 as the sample size increases. Note that there are two sources contributing to this error: (i) serial correlation in X, and (ii) the researcher s misguided attempt to attach a structural interpretation to a reduced-form expression. SIMULATION 3 repeats the analysis of SIMULATION 2 except that the dependent variable in the DGP is Y t rather than Y t, and the estimated equation is now Y t = α + β X t-1 + γ Y t-1 + error term. Comparison of Equation (5) with Equation (3) suggests that the results should be similar to SIMULATION 2, and indeed this is the case. The estimates of β converge to their reduced form value of (c+bf) = 0.33, and the rejection rate of the null (1 be) converges to 1.000 as the sample size increases. 4

The next set of simulations investigates the same type of relationships, except that I now work with panel data. In particular, I set the number of cross-sectional units (N) equal to 50, and let the sample sizes range from T=10 to T=100. The DGP is now: 7a) Y it = 0.5 X it + 0 X i,t-1 + λ i + ε it X it = 0.5 Y it + f X i,t-1 + μ i + ν it ; λ i = μ i = 0, i = 1,2,,50; ε t,, ν t ~NID(0,1) where f, again, is alternatively set equal to 0 and 0.5. The estimated equation is now 7b) Y it = α + β X i,t-1 + fixed effects + error term, which is estimated using OLS with fixed effects. The corresponding results are reported in TABLE 2. SIMULATION 4 is the case with no serial correlation (f = 0). It is noteworthy that the Mean β values are similar to the time series results of SIMULATION 1 -- another no serial correlation case -- for similar T values. However, the standard errors are smaller given the overall larger sample sizes (NT), so that the rejection rates are substantially greater than 0.05 at small T. As T gets larger, the rejection rates converge to their true value of 0.05. When serial correlation is present (SIMULATION 5), the values for Mean β are again similar to their non-panel matches for like T (cf. SIMULATION 2), but the rejection rates converge to 1.000 much faster. By T=20, the null hypothesis of H 0 : β = 0, is rejected 100 percent of the time. SIMULATIONS 6 and 7 repeat the panel data analysis of SIMULATIONS 4 and 5, except the dependent variable is now Y t rather than Y t. The corresponding estimated equation is Y it = α + β X i,t-1 + γ Y i,t-1 + fixed effects + error term. The combination of a lagged dependent variable and fixed effects raises concerns of dynamic panel bias (Nickell, 1981), which is problematic for large N, small T data. Given N = 50, I investigate the consequences of serial correlation for the cases T = 10 and T = 20. I estimate the model 5

using difference GMM (Arellano and Bond, 1991). 5 Note that difference GMM transforms Y it = α + β X i,t-1 + γ Y i,t-1 + fixed effects to Y it = β X i,t-1 + γ Y i,t-1, where X i,t-1 and Y i,t-1 are instrumented by past levels of X and Y (we assume the researcher would consider both to be predetermined, but not endogenous). This is an important difference with previous estimation, because X t-1 is effectively replaced by its instrumented predicted value. TABLE 3 reports the corresponding Mean β and rejection rate results for the no serial correlation and serial correlation cases, respectively. While there are some interesting differences, the main conclusions from the preceding analysis continues to hold. 6 IV. CONCLUSION A common practice in applied econometrics work consists of replacing a suspected endogenous variable with its lagged values. This note demonstrates that lagging an endogenous variable does not enable one to escape simultaneity bias. The associated estimates are still distorted by simultaneity bias, and hypothesis testing is invalid. Further, I show that these problems are exacerbated when the suspected endogenous variable is characterised by serial correlation, as is likely often to be the case. 5 Estimates were obtained using the xtabond2 procedure using the following commands: xtabond2 D.y L.x L.y, gmm(l.(x y)) nolevel twostep robust small. 6 While not reported, we also used the system GMM procedure. For the serial correlation case, the coefficient estimates were closer to their reduced-form, plim values; and the rejection rates were somewhat higher. 6

REFERENCES Arellano, M., and S. Bond. 1991. Some tests of specification for panel data: Monte Carlo evidence and an application to employment equations. Review of Economic Studies 58: 277 297. Aschhoff, B., and T. Schmidt. 2008. Empirical evidence on the success of R&D cooperation Happy together? Review of Industrial Organization 33:41 62 Bania, N., Gray, J.A., and J.A. Stone. 2007. Growth, taxes, and government expenditures: Growth hills for U.S. states. National Tax Journal LX: 193-204. Bansak, C., Morin, N., and M. Starr. 2007. Technology, capital spending, and capacity utilization. Economic Inquiry 45:631 645 Brinks, D. and M. Coppedge. 2006. Diffusion is no illusion: Neighbor emulation in the third wave of democracy. Comparative Political Studies 39:463-489 Buch, C., Koch, C.T., and M. Koetter. 2013. Do banks benefit from internationalization? Revisiting the market power risk nexus. Review of Finance 17:1401 1435. Clemens, M.A., Radelet, S., Bhavnani, R.R., and S. Bazzi. 2012. Counting chickens when they hatch: Timing and the effects of aid on growth. The Economic Journal 122:590 617. Cornett, M.M., Marcus, A.J., Saunders, A., and H. Tehranian. 2007. The impact of institutional ownership on corporate operating performance. Journal of Banking & Finance 31:1771 1794. Green, R.K., Malpezzi, S., and S.K. Mayo. 2005. Metropolitan-Specific Estimates of the Price Elasticity of Supply of Housing, and Their Sources. American Economic Review 95:334-339. Gupta. N. 2005. Partial privatization and firm performance. Journal of Finance 60:987-1015. Hayo, B., Kutan, M.K., and M. Neuenkirch. 2010. The impact of U.S. central bank communication on European and pacific equity markets. Economics Letters 108:172 174 MacKay, P. and G.M. Phillips. 2005. How does industry affect firm financial structure? Review of Financial Studies 18:1433-1466. Nickell, S. J. 1981. Biases in dynamic models with fixed effects. Econometrica 49: 1417 1426. Spilimbergo, A. 2009. Democracy and Foreign Education. American Economic Review 99:528-543. Stiebale, J. 2011. Do financial constraints matter for foreign market entry? A firm-level examination. World Economy 34:123 153. 7

Vergara, R. 2010. Taxation and private investment: Evidence for Chile. Applied Economics, 42:717-725. 8

TABLE 1 Simulation Results: Time Series Data A) SIMULATION 1: Simultaneity but no Serial Correlation in the Endogenous Explanatory Variable (Dep. Var. in Level Form) 1) Y t = 0.5 X t + 0 X t-1 + ε t 2) X t = 0.5 Y t + 0 X t-1 + ν t, ε t,, ν t ~NID(0,1) Estimated Equation: Y t = α + β X t-1 + error term T=10 T=20 T=50 T=100 T=1000 Mean β -0.0728-0.0309-0.0093-0.0039 0.0004 Rejection Rate for H 0 : β = 0 0.039 0.047 0.051 0.051 0.038 B) SIMULATION 2: Simultaneity with Serial Correlation in the Endogenous Explanatory Variable (Dep. Var. in Level Form) 1) Y t = 0.5 X t + 0 X t-1 + ε t, ε 2) X t = 0.5 Y t + 0.5 X t-1 + ν t,, ν t ~NID(0,1) t Estimated Equation: Y t = α + β X t-1 + error term T=10 T=20 T=50 T=100 T=1000 Mean β 0.1198 0.2202 0.2925 0.3123 0.3318 Rejection Rate for H 0 : β = 0 0.071 0.234 0.701 0.951 1.000 C) SIMULATION 3: Simultaneity with Serial Correlation in the Endogenous Explanatory Variable (Dep. Var. in Difference Form) 1) Y t = 0.5 X t + 0 X t-1 Y t-1 + ε t 2) X t = 0.5 Y t + 0.5 X t-1 + ν t, ε t,, ν t ~NID(0,1) Estimated Equation: Y t = α + β X t-1 + γ Y t-1 + error term T=10 T=20 T=50 T=100 T=1000 Mean β 0.1634 0.2478 0.3087 0.3185 0.3323 Rejection Rate for H 0 : β = 0 0.067 0.113 0.348 0.671 1.000 9

TABLE 2 Simulation Results: Panel Data (Dependent Variable in Level Form) A) SIMULATION 4: Simultaneity but no Serial Correlation in the Endogenous Explanatory Variable 1) Y it = 0.5 X it + 0 X i,t-1 + λ i + ε it 2) X it = 0.5 Y it + 0 X i,t-1 + μ i + ν it ; λ i = μ i = 0, i = 1,2,,50; ε t,, ν t ~NID(0,1) Estimated Equation (OLS): Y it = α + β X i,t-1 + fixed effects + error term T=10 T=20 T=50 T=100 Mean β -0.0869-0.0410-0.0161-0.0080 Rejection Rate for H 0 : β = 0 0.404 0.231 0.127 0.091 B) SIMULATION 5: Simultaneity with Serial Correlation in the Endogenous Explanatory Variable 1) Y it = 0.5 X it + 0 X i,t-1 + λ i + ε it 2) X it = 0.5 Y it + 0.5 X i,t-1 + μ i + ν it ; λ i = μ i = 0, i = 1,2,,50; ε t,, ν t ~NID(0,1) Estimated Equation (OLS): Y it = α + β X i,t-1 + fixed effects + error term T=10 T=20 T=50 T=100 Mean β 0.1626 0.2564 0.3043 0.3191 Rejection Rate for H 0 : β = 0 0.934 1.000 1.000 1.000 10

TABLE 3 Simulation Results: Panel Data (Dependent Variable in Difference Form) A) SIMULATION 6: Simultaneity but no Serial Correlation in the Endogenous Explanatory Variable 1) Y it = 0.5 X it + 0 X i,t-1 Y i,t-1 + λ i + ε it 2) X it = 0.5 Y it + 0 X i,t-1 + μ i + ν it ; λ i = μ i = 0, i = 1,2,,50; ε t,, ν t ~NID(0,1) Estimated Equation (Difference GMM): Y it = α + β X i,t-1 + γ Y i,t-1 + fixed effects + error term T=10 T=20 T=50 T=100 Mean β 0.0013 0.0256 ---- ---- Rejection Rate for H 0 : β = 0 0.044 0.001 ---- ---- B) SIMULATION 7: Simultaneity with Serial Correlation in the Endogenous Explanatory Variable 1) Y it = 0.5 X it + 0 X i,t-1 Y i,t-1 + λ i + ε it 2) X it = 0.5 Y it + 0.5 X i,t-1 + μ i + ν it ; λ i = μ i = 0, i = 1,2,,50; ε t,, ν t ~NID(0,1) Estimated Equation (Difference GMM): Y it = α + β X i,t-1 + γ Y i,t-1 + fixed effects + error term T=10 T=20 T=50 T=100 Mean β 0.2792 0.3008 ---- ---- Rejection Rate for H 0 : β = 0 0.745 0.615 ---- ---- 11