Gravity Models, PPML Estimation and the Bias of the Robust Standard Errors

Similar documents
Constrained Poisson Pseudo Maximum Likelihood Estimation of Structural Gravity Models

Workshop for empirical trade analysis. December 2015 Bangkok, Thailand

A COMPARISON OF HETEROSCEDASTICITY ROBUST STANDARD ERRORS AND NONPARAMETRIC GENERALIZED LEAST SQUARES

The Poisson Quasi-Maximum Likelihood Estimator: A Solution to the Adding Up Problem in Gravity Models

Bootstrapping Heteroskedasticity Consistent Covariance Matrix Estimator

Further simulation evidence on the performance of the Poisson pseudo-maximum likelihood estimator

Trade costs in bilateral trade flows: Heterogeneity and zeroes in structural gravity models

1 Estimation of Persistent Dynamic Panel Data. Motivation

On the Problem of Endogenous Unobserved Effects in the Estimation of Gravity Models

Topic 7: Heteroskedasticity

Supplement to Quantile-Based Nonparametric Inference for First-Price Auctions

Reliability of inference (1 of 2 lectures)

Heteroskedasticity-Robust Inference in Finite Samples

Nonlinear factor models for network and panel data

Elasticity of Trade Flow to Trade Barriers

LIML Estimation of Import Demand and Export Supply Elasticities. Vahagn Galstyan. TEP Working Paper No March 2016

SUR Estimation of Error Components Models With AR(1) Disturbances and Unobserved Endogenous Effects

Heteroskedasticity. Part VII. Heteroskedasticity

Trading partners and trading volumes: Implementing the Helpman-Melitz-Rubinstein model empirically

Estimating the Gravity Model When Zero Trade Flows Are Frequent. Will Martin and Cong S. Pham * February 2008

Quantifying the effects of NTMs. Xinyi Li Trade Policies Review Division, WTO Secretariat 12 th ARTNeT Capacity Building Workshop December 2016

The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors

Lecture 4: Heteroskedasticity

The OLS Estimation of a basic gravity model. Dr. Selim Raihan Executive Director, SANEM Professor, Department of Economics, University of Dhaka

WISE International Masters

Lecture 7: Dynamic panel models 2

Topic 10: Panel Data Analysis

Heteroskedasticity. in the Error Component Model

A Practitioner s Guide to Cluster-Robust Inference

Bootstrapping heteroskedastic regression models: wild bootstrap vs. pairs bootstrap

Gravity Models: Theoretical Foundations and related estimation issues

IV Quantile Regression for Group-level Treatments, with an Application to the Distributional Effects of Trade

More efficient tests robust to heteroskedasticity of unknown form

Quantile regression and heteroskedasticity

Modelling Methods for Trade Policy II: Introduction to OLS Regression Analysis

the error term could vary over the observations, in ways that are related

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Estimating Gravity Equation Models in the Presence of. Sample Selection and Heteroskedasticity

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

Andreas Steinhauer, University of Zurich Tobias Wuergler, University of Zurich. September 24, 2010

ARTNeT Interactive Gravity Modeling Tool

Determining Changes in Welfare Distributions at the Micro-level: Updating Poverty Maps By Chris Elbers, Jean O. Lanjouw, and Peter Lanjouw 1

The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors

Elasticity of trade flow to trade barriers: A comparison among emerging estimation techniques

Modified Variance Ratio Test for Autocorrelation in the Presence of Heteroskedasticity

Migration Gravity Revisited

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data

GLM estimation of trade gravity models with fixed effects

Testing Random Effects in Two-Way Spatial Panel Data Models

LESLIE GODFREY LIST OF PUBLICATIONS

Econometrics of Panel Data

New heteroskedasticity-robust standard errors for the linear regression model

Testing non-nested models for non-negative data with many zeros

Alternative HAC Covariance Matrix Estimators with Improved Finite Sample Properties

Finite Sample Performance of A Minimum Distance Estimator Under Weak Instruments

Spatial Autocorrelation and Interactions between Surface Temperature Trends and Socioeconomic Changes

Session 3-4: Estimating the gravity models

Gravity, log of gravity and the distance puzzle

Econometrics II. Nonstandard Standard Error Issues: A Guide for the. Practitioner

DEPARTMENT OF ECONOMICS

Some Monte Carlo Evidence for Adaptive Estimation of Unit-Time Varying Heteroscedastic Panel Data Models

Models for Heterogeneous Choices

NBER WORKING PAPER SERIES ROBUST STANDARD ERRORS IN SMALL SAMPLES: SOME PRACTICAL ADVICE. Guido W. Imbens Michal Kolesar

DEPARTMENT OF ECONOMICS AND FINANCE COLLEGE OF BUSINESS AND ECONOMICS UNIVERSITY OF CANTERBURY CHRISTCHURCH, NEW ZEALAND

DEPARTMENT OF ECONOMICS

Bayesian Interpretations of Heteroskedastic Consistent Covariance. Estimators Using the Informed Bayesian Bootstrap

Tests of the Present-Value Model of the Current Account: A Note

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

The Case Against JIVE

EMERGING MARKETS - Lecture 2: Methodology refresher

Chapter 1 Statistical Inference

Casuality and Programme Evaluation

OSU Economics 444: Elementary Econometrics. Ch.10 Heteroskedasticity

The gravity models for trade research

The Simple Linear Regression Model

Econometrics of Panel Data

Motivation Non-linear Rational Expectations The Permanent Income Hypothesis The Log of Gravity Non-linear IV Estimation Summary.

Nonlinear Factor Models for Network and Panel Data

A Course in Applied Econometrics Lecture 7: Cluster Sampling. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

LECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity

Robust covariance estimation for quantile regression

HETEROSKEDASTICITY, TEMPORAL AND SPATIAL CORRELATION MATTER

The Wild Bootstrap, Tamed at Last. Russell Davidson. Emmanuel Flachaire

Short T Panels - Review

TECHNICAL APPENDIX. 7. Agents Decisions in the Cobb-Douglas Case

The Bivariate Probit Model, Maximum Likelihood Estimation, Pseudo True Parameters and Partial Identification

Bias-Correction in Vector Autoregressive Models: A Simulation Study

mrw.dat is used in Section 14.2 to illustrate heteroskedasticity-robust tests of linear restrictions.

On the Power of Tests for Regime Switching

GMM estimation of spatial panels

GARCH Models Estimation and Inference

Least Absolute Value vs. Least Squares Estimation and Inference Procedures in Regression Models with Asymmetric Error Distributions

Heteroskedasticity-Consistent Covariance Matrix Estimators in Small Samples with High Leverage Points

The Bootstrap: Theory and Applications. Biing-Shen Kuo National Chengchi University

Obtaining Critical Values for Test of Markov Regime Switching

Robust Standard Errors in Small Samples: Some Practical Advice

Bayesian Heteroskedasticity-Robust Regression. Richard Startz * revised February Abstract

Ordinary Least Squares Regression

SMOOTHED BLOCK EMPIRICAL LIKELIHOOD FOR QUANTILES OF WEAKLY DEPENDENT PROCESSES

Vector Autoregressive Model. Vector Autoregressions II. Estimation of Vector Autoregressions II. Estimation of Vector Autoregressions I.

Transcription:

Gravity Models, PPML Estimation and the Bias of the Robust Standard Errors Michael Pfaffermayr August 23, 2018 Abstract In gravity models with exporter and importer dummies the robust standard errors of the slope parameters tend to be severely downward biased when estimated by PPML. The coverage rate of confidence intervals of the estimated slope parameters may thus be much too small in cross-sections of the size typically used in empirical research. Keywords: Poisson Pseudo Maximum Likelihood Estimation; Heteroskedasticity-robust inference; Gravity Equation JEL: F10, F15, C13, C50 Department of Economic Theory, -Policy and -History, University of Innsbruck, Universitaetsstr. 15, A-6020 Innsbruck, Austria and Austrian Institute of Economic Research, E-mail: michael.pfaffermayr@uibk.ac.at.

1 Introduction In cross-sections the Poisson pseudo maximum likelihood estimator (PPML, Santos Silva and Trenreyro, 2006) is routinely used to estimate gravity models with exporter and importer dummies. Fernandez-Val and Weidner (2017) show that the PPML estimator for two-dimensional panels with large n and large t asymptotics is among the few, which allow to estimate the slope parameters (but not the coefficients of the dummies) consistently and without asymptotic bias. This note demonstrates that under PPML the heteroskedasticity robust standard errors of the slope parameters tend to be severely downward biased in crosssections of the size typically used in empirical research. The reason is twofold. On the one hand, the distribution of trade flows is highly skewed and the PPMLprojection matrix used to estimate the residuals typically exhibits excessive leverage. On the other hand, due the inclusion of the exporter and importer dummies the convergence rate of the leverage to zero is of the order of the number of the countries, rather than its square. 2 Robust standard errors in a cross-section gravity model and their bias Formally, in a cross-section of C countries the DGP of the gravity model in levels generates bilateral trade flows, y ij, as y ij = e z ij α+β i+γ j η ij. (1) Bilateral trade flows depend on trade frictions collected in the the K 1 vector z ij with corresponding vector of slope parameter vector α. Exporter and importer specific effects are denoted by β i and γ j, respectively. The disturbances η ij are assumed to be independently distributed with E[η ij z ij ] = 1, but possibly heteroskedastic. For estimation, the gravity model can be reformulated with additive disturbances y ij = e z ij α+β i+γ j + ε ij, ε ij = e z ij α+β i+γ j (η ij 1), (2) Observations on the explanatory variables are collected in the C 2 K matrix Z and exporter and importer dummies (depending on the dummy design possibly also the constant) in the C 2 2C 1 design matrix D. W = [Z, D] contains all right hand side variables including exporter and importer dummies. For missing values one may define the selection matrix V that is derived from the identity matrix by setting all ones in the main diagonal to zero if the corresponding observation 1

is missing. Following Fernandez-Val and Weidner (2017) the PPML estimator for the slope parameters can be written as 1 ( Z) 1 α = Z V Q V D V Z V Q V Dỹ, (3) with M = diag(e z ij α+ β i + γ j j ), ỹ = M 1 2 y, Z = M 1 2 Z, D = M 1 2 D and Q V D = I V D ( D V D) 1 D V. Under a set of standard regularity conditions, the limit distribution of α can be derived as C ( α α 0 ) d N (0, V α ), where V α = B0 1 A 0 Ω ε A 0B0 1 1 with B 0 = p lim Z C V Q C D V Z 2 V is assumed to be invertible, A 0 Ω ε A 1 0 = p lim Z C V Q C D M 1 2 2 V εε M 1 2 Q D V Z V. 2 and Ω ε = E[εε ]. Plugging in the estimated residuals ε, one can use 1 V C 2 α = C2 1B( α) 1 C 4 A( α) diag( ε ε )A( α) B( α) 1 for inference in finite samples. Following Chesher and Jewitt (1987) the bias of V α is given by ] E [ Vα V α = B0 1 A 0 M 0 diag(p W,ij (Ω η 2ω ij I C 2) p W,ij )M 0 A 0B0 1 + o(1). (4) p W,ij denotes the ij-th column of P W = M 1/2 0 V W ( W V W ) 1 W V M 1/2 0 and ω ij the ij, ij-th main diagonal element of Ω η. Under homoskedasticity of η the term diag(p W,ij (Ω η 2ω ij I C 2) p W,ij ) reverts to σ 2 diag(p W,ij,ij ), where p W,ij,ij denotes the leverage of observation ij (the diagonal element of P W referring to observation ij). In this case, V α is always downward biased of order O(C 1 ) rather than O(C 2 ), since the leverage is bounded in [0, 1] and rank(p W [ ) = ] K + 2C 1. v The proportionate bias (pb) can be defined as E ( V α V α)v for some vector v V αv ν and, as Chesher and Jewitt (1987) show, may be bounded by [ ] [( ) ] σ 2 sup pb( V α ) max η 1 v ij σ 2 η p W,ij,ij (1 p W,ij,ij ) p W,ij,ij = O(C 1 ) (5) ( ) ( ) ( ) inf pb( V α ) max v ij p W,ij,ij max ij (p W,ij,ij ) 2 = O(C 1 ) (6) 1 An Appendix provided as a supplement provides details on all derivations of the paper. https://eeecon.uibk.ac.at/~pfaffermayr/pfaffermayr_gravity_econometrics_ PPML_standard_errors_Appendix.pdf 2 The star indicates that these matrices are evaluated at parameter values lying in between the estimated and true ones element by element. 2

If heteroskedastiy of η is not too severe ( σ2 η < 2), the proportionate bias of the σ 2 η ] ] V α is always negative, since in this case sup v [pb( V α ) max ij [ p 2 W,ij,ij. More importantly, the lower bound does not depend on the nature of heteroscedasticity, but only on the leverage. Using the cross-section data from WIOD (Timmer et al. 2015) covering 42 economies and estimating a standard gravity model specified as Model 3 in the next section for a random sample of 21 economies (441 country pairs) exhibits a maximum leverage of 0.995 and the minimum leverage amounts to 0.004. This translates in a lower bound of the proportionate bias of 99.997%. The maximum and minimum leverage changes only marginally to 0.983 and 0.001, respectively, when using all 42 countries (1764 country pairs), as does the lower bound of the proportionate bias. Similar numbers are observed using trade data from OECD- STAN or GTAP. The estimated robust standard errors of the slope parameters are thus expected to be severely downward biased. 3 Monte Carlo simulations In each Monte Carlo run a simplified structural gravity model (Model 1) is generated as s ij = e 1,66z ij,1 0.90z ij,2 +β i +γ j η ij (7) C κ i = e 1,66z ij,1 0.90z ij,2 +β i +γ j (8) θ j = j=1 C e 1,66z ij,1 0.90z ij,2 +β i +γ j, (9) i=1 where the disturbances η ij are independent with E[η ij ] = 1 and enter in multiplicative form. Aggregate sales κ i,c and expenditure θ j,c (both measured as share of GDP in world GDP ) come from GTAP. β i and γ j, are derived as equilibrium solutions to the system of multilateral resistances (8) and (9) at true parameter values α 0. Data are sorted such that the sample always includes the C largest ones. z ij,1 is a border dummy taking the value 1 for i = j and zero else. z ij,2 refers to log weighted distance and is taken from CEPII s database (Model 1). To illustrate the impact of the leverage on the estimated robust standard errors, in Model 2 z i1 and z i2 are generated from a uniform distribution. Model 3 mimics a more realistic standard gravity model using dummies for contiguity, common language, colonial relationships and regional trade agreements as explanatory variables in addition to border and ln distance, all entering with coefficients as estimated by 3

PPML ( 1.66, 0.90, 0.35, 0.49,.04, 0.41). Models 2 and 3 keep the exporter and importer effects as in Model 1. Lastly, Model 4 uses the explanatory variables of Model 2, but additionally generates the exporter and importer effects from a uniform distribution as well. In all four models the explanatory variables including the exporter and importer effects remain fixed in repeated samples. The Monte Carlo experiments consider C {20, 60} and generate independent disturbances from three different DGPs. DGP 1 generates the homoskedastic disturbances from a truncated normal that guarantees positive trade flows. DGP exp(0.1z 2 multiplies the truncated normal disturbances of DGP 1 by ij,2 ) C C i=1 j=1 exp(0.1z ij,2) to induce heteroskedasticity in addition to that stemming from the additive disturbances. The third DGP is based on a skewed distribution of the disturbances assuming that η is generated as χ 2 (10). In all three cases the disturbances are transformed to obtain E[η] = 1 and (an average) variance is 0.2. Under these assumptions the standard errors of the estimated parameters are similar to those found in the literature. Figure 1 delivers a clear message. The estimated standard errors of ˆα 1 under Models 1 and 3 are heavily downward biased and exhibit considerable variation. In Model 2 with uniformly distributed explanatory variables the bias is substantially reduced. However, comparing the results of Model 2 and Model 4 shows that it is mainly the skewness of the fixed exporter and importer effects that leads to the large variation of the estimated standard errors. Moreover, the bias disappears in Model 4 as one would expect. Table 1 exhibits the average of the simulated standard errors of α 1 along with its estimated counterpart, the simulated coverage rates of a 95%-confidence intervals and the simulated proportionate bias. 3 The simulated standard errors are calculated as the standard deviation of α 1 in 10000 Monte Carlo runs, while the estimated ones are based on the means of the estimated standard errors. In addition, the table reports the asymptotic lower and upper bound of the proportionate bias as derived in (5) and (6). 3 Results for α 2 are rather similar and available upon request. 4

5

Table 1: Monte Carlo simulation results: Leverage, simulated standard errors, estimated standard errors and 95% coverage rates of the slope parameters under dummy PPML for α 1 Model Leverage in % σ 2 η/σ 2 η sim. sd est. sd sim. bias in % cr lower b. upper b. C=20 Truncated normal 1 99.86 0.608 1.00 0.084 0.046 70.43 0.705 2 91.34 0.380 1.00 0.049 0.042 27.48 0.907 3 99.98 0.318 1.00 0.179 0.088 75.99 0.657 4 50.95 1.675 1.00 0.028 0.027 5.13 0.944 Truncated normal, heteroskedastic 1 99.86 13.679 3.06 0.072 0.040 69.03 0.719 2 91.34 0.297 1.22 0.048 0.041 26.03 0.909 3 99.98 13.679 3.06 0.160 0.079 75.33 0.662 4 50.95 1.311 1.22 0.012 0.012 4.78 0.944 χ(10) 1 99.86 0.608 1.00 0.082 0.044 70.74 0.695 2 91.34 0.380 1.00 0.049 0.041 28.29 0.903 3 99.98 0.318 1.00 0.173 0.084 76.40 0.641 4 50.95 1.675 1.00 0.028 0.027 6.42 0.941 C=60 Truncated normal 1 99.80 0.022 1.00 0.078 0.040 74.07 0.681 2 78.41 0.002 1.00 0.043 0.038 20.09 0.918 3 99.96 0.021 1.00 0.132 0.062 78.14 0.638 4 18.54 0.360 1.00 0.009 0.009 0.66 0.949 Truncated normal, heteroskedastic 1 99.80 33.440 4.00 0.066 0.034 72.72 0.692 2 78.41 0.001 1.22 0.042 0.038 19.14 0.918 3 99.96 33.440 4.00 0.118 0.055 77.86 0.642 4 18.54 0.281 1.22 0.004 0.004 0.55 0.950 χ(10) 1 99.80 0.022 1.00 0.074 0.038 73.45 0.677 2 78.41 0.002 1.00 0.043 0.038 23.86 0.913 3 99.96 0.021 1.00 0.125 0.059 77.51 0.633 4 18.54 0.360 1.00 0.009 0.009 2.00 0.953 Notes: 10000 Monte Carlo runs. cr refers to the coverage rate of a nominal 95%-confidence interval using the normal distribution. 6

In the sample of the 20 biggest countries, Model 1 implies a simulated proportionate bias of around minus 70% and a coverage rate of a 95%-confidence interval between 69.5% and 71.9% under the three considered DPGs. It turns out that the leverage is excessively high in bilateral trade data. Under DGP 1 and 3 the lower bound of the proportionate bias amounts to 99.86% at C = 20 and to 99.80% at C = 60, while its upper bound are 0.61% and 0.02%, respectively. Under DGP 2 the ratio of maximum and minimum variances of η ij amounts to σ 2 /σ 2 = 3.06 at C = 20 and 4.00 at C = 60. In Model 1 the upper bound is now at 13.68 at C = 20 (33.40 at C = 60). The proportionate bias of the estimated standard error of α 1 decreases just marginally. Only very pronounced heteroskedasticty of η would reduce it substantially. Results for Model 2 generating the explanatory variables from a uniform distribution, indicate a considerable lower proportionate bias of the estimated standard error of α 1 and, at C = 20, a coverage ratio of the 95%-confidence interval between 0.903 and 0.909. Model 3 looks at the richer gravity model. With a simulated downward bias of as much as 77.86% it exhibits an even worse performance. In Model 4 all variables and the exporter and importer fixed effects come from the uniform distribution. The proportionate bias is negligible and lies between 6.48% and 4.79% at C = 20 and the coverage ratio is correct. Overall, these results suggest that the downward bias of the estimated standard error of α 1 is mainly determined by the large leverage induced by the trade friction indicators. Increasing the sample size to the 60 largest countries does not reduce the bias in Models 1 and 3, while for Models 2 and 4 the bias becomes smaller. The nature of the DGP seems not to make a big difference. Using a degrees of freedom correction reduces the bias and improves the coverage rate only marginally. Simulation results for this case are available upon request. 4 Conclusions PPML with dummies is now widely used for estimation of gravity models in levels. However, for calculating standard errors and confidence intervals of the estimated slope parameters it is of limited use. Typically, bilateral trade flows data are characterized by excessively high leverage with a slow convergence rate due to the increase of the number of parameters in sample size. As a result, the coverage rates of confidence intervals of the estimated slope parameters are far to small and parameter tests severely oversized. The present available statistical software such as stata and R does not account for this problem and there is an urgent need for improving the finite sample properties of PPML estimation in this respect. Approaches similar to those available for linear models and surveyed by MacKinnon (2013) could possibly bring improve- 7

ments. References Chesher A. and I. Jewitt (1987), The Bias of a Heteroskedasticity Consistent Covariance Matrix Estimator, Econometrica 55(5), 1217 1222. Fernandez-Val, I. and M. Weidner (2016), Individual and Time Effects in Nonlinear Panel Models with large N, T, Journal of Econometrics 192(1), 291 312. MacKinnon J.G. (2013), Thirty Years of Heteroskedasticity-Robust Inference. In: Chen X., Swanson N. (eds) Recent Advances and Future Directions in Causality, Prediction, and Specification Analysis. Springer, New York, NY. Santos Silva, J.M.C. and S. Tenreyro (2006), The Log of Gravity, Review of Economics and Statistics 88(4), 641 658. Timmer, M. P., Dietzenbacher, E., Los, B., Stehrer, R. and de Vries, G. J. (2015), An Illustrated User Guide to the World Input Output Database: the Case of Global Automotive Production, Review of International Economics 23(3), 575 605. 8