Trading partners and trading volumes: Implementing the Helpman-Melitz-Rubinstein model empirically

Size: px
Start display at page:

Download "Trading partners and trading volumes: Implementing the Helpman-Melitz-Rubinstein model empirically"

Transcription

1 Trading partners and trading volumes: Implementing the Helpman-Melitz-Rubinstein model empirically J.M.C. SANTOS SILVA and SILVANA TENREYRO University of Essex and CEMAPRE. Wivenhoe Park, Colchester CO4 3SQ, UK ( London School of Economics, CFM, CEP, and CEPR. St. Clement s Building, Houghton St., London WC2A 2AE, UK ( s.tenreyro@lse.ac.uk) Abstract: Helpman, Melitz, and Rubinstein (2008) HMR present a rich theoretical model to study the determinants of bilateral trade flows across countries. The model is then empirically implemented through a two-stage estimation procedure. We argue that this estimation procedure is only valid under the strong distributional assumptions maintained in the paper. Statistical tests using the HMR sample, however, clearly reject such assumptions. Moreover, we perform numerical experiments which show that the HMR two-stage estimator is very sensitive to departures from the assumption of homoskedasticity. These findings cast serious doubts on any inference drawn from the empirical implementation of the HMR model. JEL Classification codes: C13; C50; F10. Key words: Gravity equation, Heteroskedasticity, Sample-selection, Weak instruments. We are most grateful to three anonymous referees and to the Editors for many helpful suggestions. We are also grateful to Elhanan Helpman, Marc Melitz, and Yona Rubinstein for generously sharing their data. We thank especially Marc Melitz for useful conversations. Santos Silva gratefully acknowledges the partial financial support from Fundação para a Ciência e Tecnologia, (Programme PEst-OE/EGE/UI0491/2013). Tenreyro gratefully acknowledges financial support from the European Research Council under the European Community s ERC starting grant agreement

2 I. Introduction In a highly insightful and stimulating paper, Helpman, Melitz and Rubinstein (2008), hereinafter HMR, present a theoretical framework to study bilateral trade flows across countries. The model is especially appealing because it can potentially explain three prevalent regularities in trade data: The asymmetry in bilateral trade flows between country pairs; the high prevalence of zeroes (in either one or both directions of bilateral trade flows); and the remarkably good fit of the gravity equation. HMR use their conceptual framework to develop a two-stage estimation procedure that generalizes the empirical gravity equation by taking into account the extensive margin (the decision to export from j to i), and the intensive margin (the volume of exports from j to i, conditional on exporting). 1 Although HMR s model makes a significant step towards a better understanding of the determinants of bilateral trade flows, the proposed two-stage estimation procedure has some limitations. In this paper we analyze the estimation method proposed by HMR and emphasize the two following results. First, the approach used by HMR to deal with the selectivity bias caused by dropping the observations with zero trade is only approximately correct. We discuss the conditions under which the approximation tends to work better. Second, and more importantly, the HMR model and associated estimator depend critically on untested distributional assumptions. As we show in this paper, such assumptions are strongly rejected by the HMR data. Moreover, we show that the results of the two-stage estimation method proposed by HMR are very sensitive to the presence of 1 An alternative two-stage procedure to estimate the extensive and intensive margins is proposed by Egger, Larch, Staub, and Winkelmann (2011). 1

3 heteroskedasticity. 2 These findings cast serious doubts on any inference drawn from the empirical implementation of the HMR model using their proposed two-stage estimator. II. The HMR model HMR specify a trade equation which can be written as (see equations (6) and (8) in HMR): M ij = B 0 Λ j X i τ 1 ε ij max { (aij a L ) k ε+1 1, 0}, (1) where M ij denotes the trade flow from j to i, Λ j denotes a fixed effect for exporter j, X i is a fixed effect for importer i, τ ij represents the usual melting iceberg transport cost, a ij is a measure of the minimum productivity needed for it to be profitable for a firm to export from j to i, a L is a measure of the productivity of the most productive firm, and B 0, k, and ε are parameters. Furthermore, the authors assume that τ ε 1 ij = D γ ij exp ( u ij), (2) where γ is a parameter, D ij is the distance (and other factors creating trade resistance) between countries i and j, and u ij N (0, σ u ). Therefore, M ij = B 0 Λ j X i D γ ij max { (aij a L ) k ε+1 1, 0} exp (u ij ). (3) Direct estimation of this equation would require information about a ij and a L, which is typically not available. To overcome this problem HMR define the latent variable (see equations (10) and (11) in HMR) Z ij = Γ 0 Ξ j Υ i D γ ij Ψ κ ij exp (v ij + u ij ), (4) 2 Throughout we understand heteroskedasticity to mean that the skedastic function is not constant; that is, that the conditional variances of the errors of the model are functions of the regressors. 2

4 where Γ 0 and κ are parameters, Ξ j denotes a fixed effect for exporter j, Υ i is a fixed effect for importer i, Ψ ij denotes additional country-pair specific fixed trade costs, and v ij N (0, σ v ). The new variable Z ij can be interpreted as the ratio of the variable export profits for the most productive firm to the fixed cost of exporting from j to i, and is not observable. However, positive trade is observed only when Z ij > 1, which leads HMR to propose the following two-stage estimation strategy. Let T ij be a binary variable defined as T ij = 1 [M ij > 0], where 1 [A] is the indicator function of the event A. Then, defining z ij = ln (Z ij ), γ 0 = ln (Γ 0 ), ξ i = ln (Ξ i ), y j = ln (Υ j ), d ij = ln (D ij ), and ψ ij = ln (Ψ ij ), we have that the conditional probability that j exports to i is 3 ρ ij = Pr (T ij = 1) = Pr ( γ 0 + ξ i + y j γd ij κψ ij > (v ij + u ij ) ). (5) Under the maintained assumptions of normality and homoskedasticity, the unknown parameters can be consistently estimated up to scale using a probit. Indeed, under these assumptions we have that ( γ0 + ξ i + y j γd ij κψ ij Pr (T ij = 1) = Φ σ u+v ), (6) where σ u+v denotes the standard deviation of (v ij + u ij ) and Φ ( ) is the CDF of the standard normal distribution. Therefore, under the distributional assumptions made by HMR it is possible to consistently estimate Zij = ( ) Γ 0 Ξ i Υ j D γ 1 ij Ψ κ σ u+v ij, (7) 3 To simplify the notation, throughout we do not make explicit that this probability is conditional on the regressors. 3

5 by taking the exponential of the linear index estimated by the probit model. Using ( ) ε 1 aij this result and the fact that Z ij = a L for Tij = 1, it is possible to rewrite (3) as M ij = T ij B 0 Λ j X i D γ ij { [Z ij exp (ς ij ) ] δ 1 } exp (u ij ), (8) where ς ij = (v ij + u ij ) /σ u+v and δ = σ u+v (k ε + 1) / (ε 1). The second stage in the HMR procedure is the estimation of the trade equation for the positive observations of M ij. To do this, the authors take logs of both sides of (8), leading to m ij = β 0 + λ j + χ i γd ij + ln { exp [ δ ( z ij + ς ij )] 1 } + uij, (9) where, as usual, lower-case letters represent the log of the quantity corresponding to the same upper case letter. Distributional assumptions The definition of δ shows that it is proportional to the standard deviation of (v ij + u ij ). Therefore, δ is a parameter if v ij and u ij are homoskedastic but otherwise it is a function of the regressors. Hence, the homoskedasticity of the errors is critical to establish how the regressors enter both (8) and (9). Indeed, under the assumptions in HMR, the regressors enter the equation both directly and through Zij, which is estimated in the first stage. However, under heteroskedasticity, δ will also be a function of the regressors. Therefore, under heteroskedasticity, the regressors will enter the model in a much more complex form than what is assumed by HMR. Ignoring that δ is a function of the regressors has the potential to introduce severe misspecification in (8) and (9), making consistent estimation of the parameters of interest generally impossible. Moreover, heteroskedasticity will also make the estimation of the first stage inconsistent, which 4

6 will bring an additional source of misspecification into the model. 4 Of course, one may be tempted to tackle this problem by specifying σ u+v as a function of the regressors, but this approach is foiled by the fact that economic theory provides no guidance on the possible heteroskedasticity patterns. Most of the results in HMR are also obtained under the assumption of normality. Although HMR partially relax it, normality is always assumed in the estimation of the first stage. Therefore, the twin assumptions of homoskedasticity and normality are critical for the correct specification of (8) and (9). Moreover, these assumptions are also critical for the construction of the selectivity corrections used by HMR. Cosslett (1991), Chen and Khan (2003), and Das, Newey, and Vella (2003), among others, have studied semi-parametric estimators for linear sample selection models which are robust to non-normality and heteroskedasticity. However, the validity of these estimators depends on conditions such as particular forms of heteroskedasticity or the existence of valid exclusion restrictions in the second stage, which are unlikely to be valid in the context of trade data. 5 More importantly, (9) has two random components affected by sample selection and one of them enters the model in a nonlinear form; none of the currently available semi-parametric estimators can deal with sample selection models of this form. The selectivity correction Estimation of (9) is performed using only observations with positive values of M ij, which originates a sample-selection issue. To account for the fact that E [u ij M ij > 0] 4 As noted by Santos Silva and Tenreyro (2006), homoskedasticity is also critical for the validity of the log-linearization leading from (8) to (9). However, in this paper we focus on a very different issue: the additional implications that heteroskedasticity has in the context of the two-stage estimation proposed by HMR. 5 Additionally, due to the large number of regressors typically used, implementation of some of these methods with trade data is far from trivial. 5

7 0, HMR include in the regression equation the inverse Mills ratio from the first stage, which (under normality and homoskedasticity) is proportional to E [u ij M ij > 0]. This is the correct procedure to account for selectivity in an additive error (see, e.g., Wooldridge, 2010). However, the equation of interest has a second random component, ς ij, which enters the equation within a non-linear function. HMR deal with the effect of the sampleselection on ς ij in a way that is akin to the ad-hoc method used by Greene (1994). In particular, HMR replace ς ij with its expectation conditional on M ij > 0, which is the inverse Mills ratio from the first stage. That is, using zij to denote the linear index in (6) and denoting the inverse Mills ratio by η ij = φ ( ( ) zij) /Φ z ij, the second stage of HMR s procedure is the estimation of m ij = β 0 + λ j + χ i γd ij + ln { exp [ δ ( z ij + ˆη ij )] 1 } + βuηˆη ij + e ij, (10) where ˆη ij denotes the fitted value of η ij and β uη is a parameter (see equation (14) in HMR). As noted for example by Terza (1998), this approach to correct the effect of the sample selection on ς ij is generally inappropriate. Indeed, for any non-linear function f ( ), Jensen s inequality implies that f[e(ς ij M ij > 0)] E [f (ς ij ) M ij > 0]. Therefore, f (ˆη ) ij is not a consistent estimator of E [f (ςij ) M ij > 0] and, consequently, the proposed estimation method will generally be inconsistent for all the parameters of interest. Nevertheless, it is interesting to notice that the approximation used by HMR is likely to be reasonably accurate in many practical situations. To see this, notice that ln { exp [ δ ( z ij + ς ij )] 1 } = ln { exp [ δ ( z ij + η ij + ω ij )] 1 }, (11) 6

8 where ω ij = ς ij η ij. The approximation used by HMR consists of ignoring ω ij, which would be innocuous if the function was linear in this random term because in that case ω ij would just be added to the error of the equation. However, it is clear that for a wide range of values of zij and reasonable values of δ, (11) is approximately linear in ω ij. Therefore, under their assumptions, the approximation used by HMR is likely to be reasonable, especially because positive values of M ij tend to be associated with large values of zij, which are the ones for which the approximation is better. 6 III. A reappraisal of the HMR study In this section we reconsider the empirical study presented in HMR. 7 We start by testing whether there is evidence of violations of the distributional assumptions required for the validity of the HMR estimator and then study the sensitivity of the results to the presence of heteroskedasticity. Testing the distributional assumptions Since δ is proportional to the standard deviation of the error in the first stage, the assumption that δ is independent of the regressors can be tested by testing for heteroskedasticity in the probit defined by (6). Tests for this purpose were introduced by Davidson and MacKinnon (1984) and are now described in textbooks such as Wooldridge s (2010, pp ). Although these tests are well known, it is perhaps useful to briefly present them here. 6 Santos Silva and Tenreyro (2009) show that, under the distributional assumptions maintained by HMR, it is possible to obtain an exact selectivity correction. 7 In order to maintain comparability with the results in HMR, we use exactly the same data, the same set of regressors, and the same estimation methods used in the original study. HMR provide details on the data and data sources. 7

9 Consider the following generalization of (6) ( γ0 + ξ i + y j γd ij κψ ij Pr (T ij = 1) = Φ σ u+v (w ij θ) ), (12) where w ij is a vector of functions of ξ i, y j, d ij, and ψ ij, θ is a vector of parameters, and σ u+v (w ij θ) denotes the standard deviation of u + v. Moreover, let σ u+v (w ij θ) be such that σ u+v (0) = σ u+v, which is constant. In this setup the null of homoskedasticity can tested by testing H 0 : θ = 0 and, as Davidson and MacKinnon (1984) show, under mild regularity conditions the test can be derived without specifying the form of σ u+v (w ij θ). Indeed, by expanding the argument of Φ ( ) in a Taylor series around θ = 0 we obtain Pr (T ij = 1) Φ ( z ij θσ u+vz ijw ij /σ u+v ), (13) where zij is defined as before and σ u+v denotes the derivative of σ u+v ( ) evaluated at zero. Therefore, if σ u+v 0, (12) is locally equivalent to a homoskedastic probit where the linear index zij is augmented by the inclusion of variables of the form zijw ij and H 0 : θ = 0 can be tested by testing the significance of the parameters associated with the additional regressors. The test can be implemented in three simple steps: first, we estimate the model under the null, i.e., (6); then we construct variables of the form ẑijw ij, where ẑij is the fitted value of zij obtained under the null; finally, we estimate the augmented probit model defined by (13) and check the significance of the parameters associated with the additional regressors. That is, in the context of a probit model, homoskedasticity can be tested by testing the exclusion of a particular type of regressors. To implement the test it is necessary to choose the variables in w ij. Although w ij can contain any function of the regressors, in what follows we will consider only the 8

10 case where w ij = ( ) ẑij, ẑij 2 ; 8 that is, the test is performed by checking for the joint significance of the parameters associated with the additional regressors ẑ 2 ij and ẑ 3 ij. This form of the heteroskedasticity test, which is analogous to a two-degrees-offreedom RESET test (Ramsey, 1969), is particularly interesting in the case of a probit model because it can also be interpreted as a normality test. Heuristically, the intuition for this can be provided as follows (see Cramer and Ridder, 1988, pp ). Suppose that Pr (T ij = 1) = F ( z ij), where F ( ) is a cumulative distribution function, and rewrite F ( ( zij) ( as Φ Φ 1 F ( zij))) (. Then, approximating Φ 1 F ( zij)) with a third order polynomial we obtain F ( ( ) zij) = Φ υ1 zij + υ 2 zij 2 + υ 3 zij 3. If F ( ) = Φ ( ), then Φ 1 ( F ( z ij)) = z ij and therefore υ 2 = υ 3 = 0. That is, in a probit the assumption of normality can be tested using a RESET test; see Newey (1985) for an alternative derivation and more details. Ramalho and Ramalho (2012) perform an extensive simulation study on the properties of RESET tests in the context of binary choice models and conclude that the particular version of the RESET we use has very good performance under the null and has good power against a range of alternatives, including heteroskedasticity and non-normality. Therefore, this simple test provides a direct check for the validity of the main distributional assumptions required for consistent estimation of the model developed by HMR. Additionally, because heteroskedasticity also impacts on the functional form of (10) it is important to check whether the specification of this model is reasonably adequate. In the spirit of Cosslett (1991), HMR partially relax the distributional assumptions used to obtain (10) by estimating models of the form Q m ij = λ j + χ i γd ij + α s 1 [ ] q s 1 < ρ ij q s + e ij, (14) s=1 8 This choice is motivated by analogy with the popular two-degrees-of-freedom special case of White s test for heteroskedasticity (see Wooldridge, 2010, p. 140). 9

11 where, as before, ρ ij = Pr (T ij = 1) and 1 [A] is the indicator function of the event A, α 1,..., α Q are parameters, q 0,..., q Q are constants defining quantiles of ρ ij, and q 0 = and q Q =. Although this model is more flexible than (10) it still assumes that the selectivity correction depends on the regressors only through zij; that is, like (8), (9), and (10), (14) assumes that δ is constant. To check for departures from this assumption one can test the significance of interactions between the indicator variables and functions of the other regressors. A simple way of doing this is again to perform a RESET test for the significance of additional variables constructed as powers of the estimated linear indexes. By analogy with what is done for the probit, in what follows we will also use two-degrees-of-freedom RESET tests to check the validity of (14). The performance of the RESET in linear models has been studied, among others, by Godfrey and Orme (1994), who conclude that the test has good behaviour under the null and good power against alternatives of the type considered here. Table 1 presents the test statistics and p-values for the two-degrees-of-freedom RE- SET tests for some of the models estimated by HMR. In particular, for the two sets of exclusion restrictions considered by HMR, we test the specification of the probit used in the first stage and the two most flexible specifications of the second stage, which are defined by (14) with Q {50, 100}. Although the simulation studies referred above suggest that the RESET tests have good size properties, it is important to make sure that in this particular application the tests do not lead to spurious rejections of the null. Therefore, we computed the p-values of the test in 3 different ways: 1) using the usual clustered standard errors, 2) using standard errors obtained with a bootstrap by clusters, and 3) bootstrap p-values. Notice that because the test statistic is asymptotically pivotal the bootstrap p-values benefit from the well-know asymptotic refinements and therefore lead to much more reliable inference than the p-values obtained by the first two methods (see, e.g., Godfrey, 2009, pp ). In the three cases the results are the same. 10

12 TABLE 1: Specification test results Costs excluded Religion excluded Probit 50 Bins 100 Bins Probit 50 Bins 100 Bins RESET statistic p-value Sample size 12, 198 6, 602 6, , , , 146 The p-values of the RESET tests for the probit models reveal clear signs of misspecification. As noted before, this particular version of the RESET test can be interpreted as a test for the assumptions of homoskedasticity and normality of u i and v i and the results in Table 1 strongly suggest that these assumptions are not valid in this context. This impression is reinforced by the results for the second stage models, which also clearly fail the RESET tests. Therefore, there are reasons to suspect that all the models considered by HMR are misspecified; we next evaluate how sensitive the results are to possible misspecification. Gauging the consequences of heteroskedasticity One way to assess the sensitivity of the HMR estimation procedure to the presence of heteroskedasticity is to estimate (14) using fitted values of ρ ij obtained from an heteroskedastic probit as in (12) rather than from a standard probit. To do that we need to specify the functional form of σ u+v (w ij θ) and to define the set of variables to include in w ij. Here we follow Harvey (1976) and specify σ u+v (w ij θ) = exp(w ij θ). As for the choice of w ij we consider two cases: a) w ij = ( ) d ij, ψ ij, that is, wij contains all the regressors in the original probit regression except the importer and exporter dummies; and b) w ij is a set of indicator variables obtained by partitioning the fitted values of (6) into bins as done in (14); in this case we use 35 bins. 9 It is important 9 Estimation of the heteroskedastic probit is relatively diffi cult and therefore it is necessary to be somewhat parsimonious in the definition of w ij. 11

13 to notice that because economic theory provides no guidance on the choice of the functional form of σ u+v (w ij θ) or of the variables in w ij our decisions on these are somewhat arbitrary and we do not in any way presume that they lead to adequate specifications. Consequently, the results presented below should be interpreted merely as illustrating the sensitivity of the HMR estimation procedure to alternative forms of accounting for heteroskedasticity in the first stage. Table 2 presents the results obtained by estimating (14) using three different specifications for the probit model in the fist stage: 10 columns (A) and (D) correspond to the results reported by HMR using the standard probit that ignores the possible presence of heteroskedasticity, columns (B) and (E) correspond to the results obtained using the heteroskedastic probit with w ij = ( ) d ij, ψ ij, and finally columns (C) and (F) are obtained again by using an heteroskedastic probit where now w ij is a set of indicator variables obtained as described above. The results presented in Table 2 show that the way heteroskedasticity is accounted for in the first stage has dramatic consequences for the estimated elasticities of the firm s trade with respect to the different trade barriers. Indeed, changing the specification of σ u+v (w ij θ) leads to important changes in the magnitude of the estimated elasticities, sometimes changing their statistical significance (e.g., Colonial ties and FTA). For example, the estimated distance elasticity varies between and 1.073, and the estimate for the coeffi cient on the currency union dummy varies between and 1.376, implying that the effect of this variable varies by a factor of almost 2.5. Additionally, we note that in (C) the coeffi cient on Religion has a p-value of 0.029, which calls into question the use of this variable as the excluded instrument in the second set of estimates. Finally, we note that although the models based on the heteroskedastic probit are more general than the ones used by HMR, the results of the RESET test 10 We only present results for the case where the indicators are obtained as percentiles of ρ ij, results with fewer indicator dummies are essentially the same. 12

14 TABLE 2: Estimation Results Costs excluded Religion excluded (A) (B) (C) (D) (E) (F) w ij Const. d ij, ψ ij Bins Const. d ij, ψ ij Bins Log distance (0.088) (0.048) (0.041) (0.076) (0.044) (0.034) Land border (0.170) (0.177) (0.165) (0.150) (0.153) (0.146) Island (0.258) (0.279) (0.266) (0.121) (0.119) (0.118) Landlock (0.187) (0.188) (0.190) (0.186) (0.192) (0.187) Legal (0.065) (0.068) (0.063) (0.050) (0.051) (0.049) Language (0.083) (0.076) (0.075) (0.068) (0.062) (0.060) Colonial ties (0.153) (0.264) (0.152) (0.119) (0.339) (0.118) Currency union (0.346) (0.339) (0.334) (0.270) (0.267) (0.262) FTA (0.348) (0.257) (0.224) (0.210) (0.311) (0.189) Religion (0.128) (0.146) (0.119) RESET statistic p-value R Sample size 6, 602 6, 602 6, , , , 146 Note: Clustered standard errors in parentheses. All models include importer and exporter dummies. 13

15 suggest that they no not alleviate the misspecification of the second stage. This is not surprising because, due to the log transformation, the consistency of the second stage is also heavily dependent on the assumption of homoskedastic errors (see Santos Silva and Tenreyro, 2006) and that is not at all taken into account in the estimation procedure suggested by HMR. The results in Table 2 clearly illustrate the sensitivity of the HMR estimation procedure to the presence of heteroskedasticity. However, because all of the models are potentially severely misspecified, these results are not informative about the magnitude of the biases caused by heteroskedasticity in the first stage. To investigate this issue we performed a small simulation study in which data are generated as follows. First, using (4), Z ij is generated as Z ij = exp ( γ 0 + ξ i + y j γd ij κψ ij + v ij + u ij ). (15) Then, for Z ij > 1, m ij is generated according to (9), that is { } m ij = β 0 + λ j + χ i γd ij + ln Z k ε+1 ε 1 ij 1 + u ij. (16) For the instruments ψ ij, we used either Costs or Religion as in HMR. We also performed a set of experiments similar to those where Religion is the instrument but replacing this variable by random draws from a normal distribution with mean 0 and standard deviation equal to 3. When Costs and Religion are the instruments the sample sizes are as in the HMR paper and when the random instrument is used the sample size is the same as when Religion is used as the instrument. The variables in d ij are a subset of the trade barriers included in the original study. Specifically, to speed-up the simulation experiments we considered only Log distance, 14

16 Land border, Landlock, and Legal. As in HMR, when Costs are the excluded instrument Religion is also included in d ij. 11 We performed experiments with homoskedastic and heteroskedastic errors. In the homoskedastic case v ij and u ij are obtained as random draws from a normal distribution with mean 0 and standard deviation equal to 0.5; therefore σ u+v = 1. In the heteroskedastic case u ij is obtained as random draws from a normal distribution with mean 0 and standard deviation equal to 0.25 and v ij is obtained as random draws from a normal distribution with mean 0 and standard deviation equal to exp ( θ 1 d ij + θ 2 ψ ij ) , implying σ u+v (w ij θ) = exp ( θ 1 d ij + θ 2 ψ ij ). 12 To complete the specification of the data generation process it is necessary to set the parameters of the model. 13 In the homoskedastic case, the parameters in (15) are set to match the estimates obtained when the first stage is a probit with the relevant regressors. In the heteroskedastic case, the parameters in (15) and θ 1 and θ 2 are set to match the estimates obtained when the first stage is a heteroskedastic probit. In both cases the coeffi cients on the regressors in (16) were set to match the estimates obtained from (14) and constructing the indicator functions as the percentiles of the probabilities estimated in the first stage. 14 Finally, we set k = 1.85 and ε = 2 (see Broda and Weinstein, 2006). After generating the data, the model was estimated using a probit in the first stage and then estimating the second stage using (14) with indicator functions constructed as the percentiles of the probabilities estimated in the first stage; i.e., estimation is per- 11 We preformed some limited experiments with the full set of regressors and the results are qualitatively similar. 12 Notice that in both cases u ij, the error of the second stage, is homoskedastic. 13 Values for the main parameters used in the simulations are given in Appendix A. 14 Notice, however, that the biases to be reported below are invariant to the value of these parameters. 15

17 formed exactly as in HMR and therefore ignores the possible presence of heteroskedasticity. Table 3 reports the biases of the estimates of the coeffi cients of d ij obtained with 10, 000 replicas of the procedure described above. These results show that when Costs or Religion are used as instruments there are substantial biases even if the errors are homoskedastic; the bias on the coeffi cient of Log distance is particularly noteworthy. This suggests that, at least with samples of this size, the instruments used by HMR do not have enough influence in the first stage to lead to estimates of the parameters of interest with reasonably small biases. However, when ψ ij N (0, 3) and the errors are homoskedastic the biases are generally reasonably small. 15 With heteroskedastic errors the biases again depend on how the data are generated and on the instrument used, and can be very substantial. Moreover, the results show that the sign of the biases depends on how the data is generated. 16 TABLE 3: Simulation Results (estimated biases) Homoskedastic errors Heteroskedastic errors Instrument Costs Religion N (0, 3) Costs Religion N (0, 3) Log distance Land border Landlock Legal Religion Sample size 6, , , 146 6, , , 146 The results in Table 3 confirm that the two-stage estimation procedure suggested by HMR is heavily reliant on the assumption that the errors of the model are homoskedas- 15 Under homoskedasticity, smaller biases are also obtained with the other instruments if κ, the coeffi cient of the instrument in the generation of Z ij, is increased substantially. 16 The results of additional experiments show that, as expected, the magnitude of the biases increase with the value of (k ε + 1) / (ε 1). 16

18 tic. Moreover, these results show that even under homoskedasticity the ability of the estimator to identify the parameters of interest depends on the availability of an instrument that not only can be validly excluded form the second stage regression but also has a determinant role in the first stage. The results obtained using Costs and Religion as instruments suggest that, even if they are valid, their role in the first stage is not significant enough to allow the estimation of the parameters of interest with reasonably small biases. IV. Concluding remarks In this paper we discuss some econometric aspects of the implementation of the model for bilateral trade flows between countries proposed by Helpman, Melitz and Rubinstein (2008). In particular, we have emphasized that consistent estimation of the structural parameters in the model proposed by HMR is only possible under the assumption that all random components of the model are homoskedastic. This dependence on the homoskedasticity assumption is the most important drawback of the HMR model and contrasts with more standard models for trade (e.g., Anderson and van Wincoop, 2003), which can be made robust to the presence of heteroskedasticity. Additionally, our simulation results reveal that estimation of the HMR model using Costs or Religion as instruments may lead to substantial biases, even if the errors are homeskedastic and the instruments valid. This happens because Costs or Religion play only a minor role in the first stage and in that sense they are weak instruments. Finally, we notice that while it is likely to be reasonably accurate in many empirical studies, the selectivity correction used by HMR is only approximately valid. To gauge the severity of these problems we revisited the empirical illustration presented by HMR and found overwhelming evidence that all models used in their study 17

19 are misspecified. Additionally, we illustrated that their proposed estimator is very sensitive to the presence of heteroskedasticity. These findings cast doubts on the validity of any inference drawn upon the results obtained using the two-stage estimator of the model for bilateral trade flows proposed by HMR. References Anderson, J. and E. van Wincoop (2003), Gravity with gravitas: A solution to the border puzzle, American Economic Review, 93, Broda, C. and Weinstein, D.E. (2006), Globalization and the gains from variety, Quarterly Journal of Economics, 121, Chen, S. and Khan, S. (2003), Semiparametric estimation of a heteroskedastic sample selection model, Econometric Theory, 19, Cosslett, S.R. (1991), Semiparametric estimation of a regression model with sample selectivity, in Nonparametric and Semiparametric Methods in Econometrics and Statistics, ed. Barnett, W.A., Powell, J. and Tauchen, G., , Cambridge: Cambridge University Press. Cramer, J.S. and Ridder, G. (1988). The logit model in economics, Statistica Neerlandica, 42, Das, M., Newey, W.K., and Vella, F. (2003). Nonparametric estimation of sample selection models, Review of Economic Studies, 70, Davidson, R. and MacKinnon, J.G. (1984). Convenient specification tests for logit and probit models, Journal of Econometrics, 25, Egger, P., Larch, M., Staub, K.E., and Winkelmann, R. (2011). The trade effects of endogenous preferential trade agreements, American Economic Journal: Economic Policy, 3,

20 Godfrey, L.G. and Orme, C.D. (1994). The sensitivity of some general checks to omitted variables in the linear model, International Economic Review, 35, Godfrey, L.G. (2009). Bootstrap tests for regression models, Houndmills: Palgrave Macmillan. Greene, W. (1994), Accounting for excess zeros and sample selection in Poisson and negative binomial regression models, Working Paper No. EC-94-10, Department of Economics, Stern School of Business, New York University. Harvey, A. C. (1976). Estimating regression models with multiplicative heteroscedasticity, Econometrica 44, Helpman, E., Melitz, M. and Rubinstein, Y. (2008), Estimating trade flows: Trading partners and trading volumes, Quarterly Journal of Economics, 123, Newey, W. (1985), Maximum Likelihood Specification Testing and Conditional Moments Tests, Econometrica, 53, Ramalho, E.A. and Ramalho, J.J.S. (2012), Alternative versions of the RESET test for binary response index models: a comparative study, Oxford Bulletin of Economics and Statistics, 74, Ramsey, J.B. (1969), Tests for Specification Errors in Classical Linear Least Squares Regression Analysis, Journal of the Royal Statistical Society B, 31, Santos Silva, J.M.C. and Tenreyro, Silvana (2006), The log of gravity, The Review of Economics and Statistics, 88, Santos Silva, J.M.C. and Tenreyro, Silvana (2009), Trading Partners and Trading Volumes: Implementing the Helpman-Melitz-Rubinstein Model Empirically, CEP Discussion Papers dp0935, Centre for Economic Performance, LSE. 19

21 Terza, J. (1998), Estimating count data models with endogenous switching: Sample selection and endogenous treatment effects, Journal of Econometrics, 84, Wooldridge, J.M. (2010). Econometric Analysis of Cross Section and Panel Data, 2nd ed, Cambridge, MA: MIT Press. 20

22 Appendix A TABLE A1: Main simulation parameters Homoskedastic case Heteroskedastic case Instrument Costs Religion Costs Religion First Stage Log distance Land border Landlock Legal Religion Reg. costs Days & proc σ u+v (w ij θ) Log distance Land border Landlock Legal Religion Reg. costs Days & proc Second Stage Log distance Land border Landlock Legal Religion All models include importer and exporter dummies. The results are invariant to the choice of the second stage parameters. 21

Further simulation evidence on the performance of the Poisson pseudo-maximum likelihood estimator

Further simulation evidence on the performance of the Poisson pseudo-maximum likelihood estimator Further simulation evidence on the performance of the Poisson pseudo-maximum likelihood estimator J.M.C. Santos Silva Silvana Tenreyro January 27, 2009 Abstract We extend the simulation results given in

More information

Quantile regression and heteroskedasticity

Quantile regression and heteroskedasticity Quantile regression and heteroskedasticity José A. F. Machado J.M.C. Santos Silva June 18, 2013 Abstract This note introduces a wrapper for qreg which reports standard errors and t statistics that are

More information

Gravity Models, PPML Estimation and the Bias of the Robust Standard Errors

Gravity Models, PPML Estimation and the Bias of the Robust Standard Errors Gravity Models, PPML Estimation and the Bias of the Robust Standard Errors Michael Pfaffermayr August 23, 2018 Abstract In gravity models with exporter and importer dummies the robust standard errors of

More information

Testing non-nested models for non-negative data with many zeros

Testing non-nested models for non-negative data with many zeros Testing non-nested models for non-negative data with many zeros J.M.C. Santos Silva, Silvana Tenreyro, Frank Windmeijer 16 February 2014 Abstract In economic applications it is often the case that the

More information

poisson: Some convergence issues

poisson: Some convergence issues The Stata Journal (2011) 11, Number 2, pp. 207 212 poisson: Some convergence issues J. M. C. Santos Silva University of Essex and Centre for Applied Mathematics and Economics Colchester, United Kingdom

More information

Parametric Identification of Multiplicative Exponential Heteroskedasticity

Parametric Identification of Multiplicative Exponential Heteroskedasticity Parametric Identification of Multiplicative Exponential Heteroskedasticity Alyssa Carlson Department of Economics, Michigan State University East Lansing, MI 48824-1038, United States Dated: October 5,

More information

Parametric identification of multiplicative exponential heteroskedasticity ALYSSA CARLSON

Parametric identification of multiplicative exponential heteroskedasticity ALYSSA CARLSON Parametric identification of multiplicative exponential heteroskedasticity ALYSSA CARLSON Department of Economics, Michigan State University East Lansing, MI 48824-1038, United States (email: carls405@msu.edu)

More information

LESLIE GODFREY LIST OF PUBLICATIONS

LESLIE GODFREY LIST OF PUBLICATIONS LESLIE GODFREY LIST OF PUBLICATIONS This list is in two parts. First, there is a set of selected publications for the period 1971-1996. Second, there are details of more recent outputs. SELECTED PUBLICATIONS,

More information

Gravity Models: Theoretical Foundations and related estimation issues

Gravity Models: Theoretical Foundations and related estimation issues Gravity Models: Theoretical Foundations and related estimation issues ARTNet Capacity Building Workshop for Trade Research Phnom Penh, Cambodia 2-6 June 2008 Outline 1. Theoretical foundations From Tinbergen

More information

Elasticity of Trade Flow to Trade Barriers

Elasticity of Trade Flow to Trade Barriers Alessandro Olper Valentina Raimondi Elasticity of Trade Flow to Trade Barriers A Comparison among Emerging Estimation Techniques ARACNE Copyright MMVIII ARACNE editrice S.r.l. www.aracneeditrice.it info@aracneeditrice.it

More information

Finite Sample Performance of A Minimum Distance Estimator Under Weak Instruments

Finite Sample Performance of A Minimum Distance Estimator Under Weak Instruments Finite Sample Performance of A Minimum Distance Estimator Under Weak Instruments Tak Wai Chau February 20, 2014 Abstract This paper investigates the nite sample performance of a minimum distance estimator

More information

Econometric Analysis of Cross Section and Panel Data

Econometric Analysis of Cross Section and Panel Data Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND

More information

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008 A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. Linear-in-Parameters Models: IV versus Control Functions 2. Correlated

More information

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Ninth ARTNeT Capacity Building Workshop for Trade Research Trade Flows and Trade Policy Analysis Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric

More information

Alternative versions of the RESET test for binary response index models: a comparative study

Alternative versions of the RESET test for binary response index models: a comparative study CEFAGE-UE Working Paper 2010/09 Alternative versions of the RESET test for binary response index models: a comparative study Esmeralda A. Ramalho and Joaquim J.S. Ramalho Departament of Economics, Universidade

More information

Workshop for empirical trade analysis. December 2015 Bangkok, Thailand

Workshop for empirical trade analysis. December 2015 Bangkok, Thailand Workshop for empirical trade analysis December 2015 Bangkok, Thailand Cosimo Beverelli (WTO) Rainer Lanz (WTO) Content a. What is the gravity equation? b. Naïve gravity estimation c. Theoretical foundations

More information

Birkbeck Working Papers in Economics & Finance

Birkbeck Working Papers in Economics & Finance ISSN 1745-8587 Birkbeck Working Papers in Economics & Finance Department of Economics, Mathematics and Statistics BWPEF 1809 A Note on Specification Testing in Some Structural Regression Models Walter

More information

Partial effects in fixed effects models

Partial effects in fixed effects models 1 Partial effects in fixed effects models J.M.C. Santos Silva School of Economics, University of Surrey Gordon C.R. Kemp Department of Economics, University of Essex 22 nd London Stata Users Group Meeting

More information

Estimating the Gravity Model When Zero Trade Flows Are Frequent. Will Martin and Cong S. Pham * February 2008

Estimating the Gravity Model When Zero Trade Flows Are Frequent. Will Martin and Cong S. Pham * February 2008 Estimating the Gravity Model When Zero Trade Flows Are Frequent Will Martin and Cong S. Pham * February 2008 Abstract In this paper we estimate the gravity model allowing for the pervasive issues of heteroscedasticity

More information

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS Page 1 MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level

More information

Beyond the Target Customer: Social Effects of CRM Campaigns

Beyond the Target Customer: Social Effects of CRM Campaigns Beyond the Target Customer: Social Effects of CRM Campaigns Eva Ascarza, Peter Ebbes, Oded Netzer, Matthew Danielson Link to article: http://journals.ama.org/doi/abs/10.1509/jmr.15.0442 WEB APPENDICES

More information

A Robust Approach to Estimating Production Functions: Replication of the ACF procedure

A Robust Approach to Estimating Production Functions: Replication of the ACF procedure A Robust Approach to Estimating Production Functions: Replication of the ACF procedure Kyoo il Kim Michigan State University Yao Luo University of Toronto Yingjun Su IESR, Jinan University August 2018

More information

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Many economic models involve endogeneity: that is, a theoretical relationship does not fit

More information

ARTNeT Interactive Gravity Modeling Tool

ARTNeT Interactive Gravity Modeling Tool Evidence-Based Trade Policymaking Capacity Building Programme ARTNeT Interactive Gravity Modeling Tool Witada Anukoonwattaka (PhD) UNESCAP 26 July 2011 Outline Background on gravity model of trade and

More information

Estimating Gravity Equation Models in the Presence of. Sample Selection and Heteroskedasticity

Estimating Gravity Equation Models in the Presence of. Sample Selection and Heteroskedasticity Estimating Gravity Equation Models in the Presence of Sample Selection and Heteroskedasticity Bo Xiong Sixia Chen Abstract: Gravity models are widely used to explain patterns of trade. However, two stylized

More information

INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION. 1. Introduction

INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION. 1. Introduction INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION VICTOR CHERNOZHUKOV CHRISTIAN HANSEN MICHAEL JANSSON Abstract. We consider asymptotic and finite-sample confidence bounds in instrumental

More information

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i, A Course in Applied Econometrics Lecture 18: Missing Data Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. When Can Missing Data be Ignored? 2. Inverse Probability Weighting 3. Imputation 4. Heckman-Type

More information

Esmeralda Ramalho 1, Joaquim Ramalho 1, José Murteira 2

Esmeralda Ramalho 1, Joaquim Ramalho 1, José Murteira 2 CEFAGE-UE Working Paper 2013/09 A generalized goodness-of-functional form test for binary and fractional regression models Esmeralda Ramalho 1, Joaquim Ramalho 1, José Murteira 2 1 Departamento de Economia

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)

More information

Non-linear panel data modeling

Non-linear panel data modeling Non-linear panel data modeling Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini May 2010 Laura Magazzini (@univr.it) Non-linear panel data modeling May 2010 1

More information

Flexible Estimation of Treatment Effect Parameters

Flexible Estimation of Treatment Effect Parameters Flexible Estimation of Treatment Effect Parameters Thomas MaCurdy a and Xiaohong Chen b and Han Hong c Introduction Many empirical studies of program evaluations are complicated by the presence of both

More information

Trade costs in bilateral trade flows: Heterogeneity and zeroes in structural gravity models

Trade costs in bilateral trade flows: Heterogeneity and zeroes in structural gravity models Società Italiana degli Economisti 52.ma Riunione Scientifica Annuale Università degli Studi di Roma Tre, Roma, 14-15 ottobre 2011 Trade costs in bilateral trade flows: Heterogeneity and zeroes in structural

More information

A Note on Demand Estimation with Supply Information. in Non-Linear Models

A Note on Demand Estimation with Supply Information. in Non-Linear Models A Note on Demand Estimation with Supply Information in Non-Linear Models Tongil TI Kim Emory University J. Miguel Villas-Boas University of California, Berkeley May, 2018 Keywords: demand estimation, limited

More information

Testing for Regime Switching in Singaporean Business Cycles

Testing for Regime Switching in Singaporean Business Cycles Testing for Regime Switching in Singaporean Business Cycles Robert Breunig School of Economics Faculty of Economics and Commerce Australian National University and Alison Stegman Research School of Pacific

More information

The Poisson Quasi-Maximum Likelihood Estimator: A Solution to the Adding Up Problem in Gravity Models

The Poisson Quasi-Maximum Likelihood Estimator: A Solution to the Adding Up Problem in Gravity Models Working Paper DTC-2011-3 The Poisson Quasi-Maximum Likelihood Estimator: A Solution to the Adding Up Problem in Gravity Models Jean-François Arvis, Senior Economist, the World Bank. Ben Shepherd, Principal,

More information

A Practitioner s Guide to Cluster-Robust Inference

A Practitioner s Guide to Cluster-Robust Inference A Practitioner s Guide to Cluster-Robust Inference A. C. Cameron and D. L. Miller presented by Federico Curci March 4, 2015 Cameron Miller Cluster Clinic II March 4, 2015 1 / 20 In the previous episode

More information

Testing Overidentifying Restrictions with Many Instruments and Heteroskedasticity

Testing Overidentifying Restrictions with Many Instruments and Heteroskedasticity Testing Overidentifying Restrictions with Many Instruments and Heteroskedasticity John C. Chao, Department of Economics, University of Maryland, chao@econ.umd.edu. Jerry A. Hausman, Department of Economics,

More information

LIML Estimation of Import Demand and Export Supply Elasticities. Vahagn Galstyan. TEP Working Paper No March 2016

LIML Estimation of Import Demand and Export Supply Elasticities. Vahagn Galstyan. TEP Working Paper No March 2016 LIML Estimation of Import Demand and Export Supply Elasticities Vahagn Galstyan TEP Working Paper No. 0316 March 2016 Trinity Economics Papers Department of Economics Trinity College Dublin LIML Estimation

More information

CEP Discussion Paper No 701 July The Log of Gravity Joao Santos Silva and Silvana Tenreyro

CEP Discussion Paper No 701 July The Log of Gravity Joao Santos Silva and Silvana Tenreyro CEP Discussion Paper No 701 July 2005 The Log of Gravity Joao Santos Silva and Silvana Tenreyro Abstract Although economists have long been aware of Jensen's inequality, many econometric applications have

More information

Reliability of inference (1 of 2 lectures)

Reliability of inference (1 of 2 lectures) Reliability of inference (1 of 2 lectures) Ragnar Nymoen University of Oslo 5 March 2013 1 / 19 This lecture (#13 and 14): I The optimality of the OLS estimators and tests depend on the assumptions of

More information

UNIVERSITY OF CALIFORNIA Spring Economics 241A Econometrics

UNIVERSITY OF CALIFORNIA Spring Economics 241A Econometrics DEPARTMENT OF ECONOMICS R. Smith, J. Powell UNIVERSITY OF CALIFORNIA Spring 2006 Economics 241A Econometrics This course will cover nonlinear statistical models for the analysis of cross-sectional and

More information

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Arthur Lewbel Boston College Original December 2016, revised July 2017 Abstract Lewbel (2012)

More information

A better way to bootstrap pairs

A better way to bootstrap pairs A better way to bootstrap pairs Emmanuel Flachaire GREQAM - Université de la Méditerranée CORE - Université Catholique de Louvain April 999 Abstract In this paper we are interested in heteroskedastic regression

More information

An Encompassing Test for Non-Nested Quantile Regression Models

An Encompassing Test for Non-Nested Quantile Regression Models An Encompassing Test for Non-Nested Quantile Regression Models Chung-Ming Kuan Department of Finance National Taiwan University Hsin-Yi Lin Department of Economics National Chengchi University Abstract

More information

Harvard University. A Note on the Control Function Approach with an Instrumental Variable and a Binary Outcome. Eric Tchetgen Tchetgen

Harvard University. A Note on the Control Function Approach with an Instrumental Variable and a Binary Outcome. Eric Tchetgen Tchetgen Harvard University Harvard University Biostatistics Working Paper Series Year 2014 Paper 175 A Note on the Control Function Approach with an Instrumental Variable and a Binary Outcome Eric Tchetgen Tchetgen

More information

Estimation of Treatment Effects under Essential Heterogeneity

Estimation of Treatment Effects under Essential Heterogeneity Estimation of Treatment Effects under Essential Heterogeneity James Heckman University of Chicago and American Bar Foundation Sergio Urzua University of Chicago Edward Vytlacil Columbia University March

More information

Robustness of Logit Analysis: Unobserved Heterogeneity and Misspecified Disturbances

Robustness of Logit Analysis: Unobserved Heterogeneity and Misspecified Disturbances Discussion Paper: 2006/07 Robustness of Logit Analysis: Unobserved Heterogeneity and Misspecified Disturbances J.S. Cramer www.fee.uva.nl/ke/uva-econometrics Amsterdam School of Economics Department of

More information

A Bootstrap Test for Conditional Symmetry

A Bootstrap Test for Conditional Symmetry ANNALS OF ECONOMICS AND FINANCE 6, 51 61 005) A Bootstrap Test for Conditional Symmetry Liangjun Su Guanghua School of Management, Peking University E-mail: lsu@gsm.pku.edu.cn and Sainan Jin Guanghua School

More information

On the Problem of Endogenous Unobserved Effects in the Estimation of Gravity Models

On the Problem of Endogenous Unobserved Effects in the Estimation of Gravity Models Journal of Economic Integration 19(1), March 2004; 182-191 On the Problem of Endogenous Unobserved Effects in the Estimation of Gravity Models Peter Egger University of Innsbruck Abstract We propose to

More information

Binary Choice Models Probit & Logit. = 0 with Pr = 0 = 1. decision-making purchase of durable consumer products unemployment

Binary Choice Models Probit & Logit. = 0 with Pr = 0 = 1. decision-making purchase of durable consumer products unemployment BINARY CHOICE MODELS Y ( Y ) ( Y ) 1 with Pr = 1 = P = 0 with Pr = 0 = 1 P Examples: decision-making purchase of durable consumer products unemployment Estimation with OLS? Yi = Xiβ + εi Problems: nonsense

More information

Estimation of some nonlinear panel data models with both. time-varying and time-invariant explanatory variables

Estimation of some nonlinear panel data models with both. time-varying and time-invariant explanatory variables Estimation of some nonlinear panel data models with both time-varying and time-invariant explanatory variables Bo E. Honoré Michaela Kesina July 2015 Abstract The so-called fixed effects approach to the

More information

Lecture 4: Heteroskedasticity

Lecture 4: Heteroskedasticity Lecture 4: Heteroskedasticity Econometric Methods Warsaw School of Economics (4) Heteroskedasticity 1 / 24 Outline 1 What is heteroskedasticity? 2 Testing for heteroskedasticity White Goldfeld-Quandt Breusch-Pagan

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 6 Jakub Mućk Econometrics of Panel Data Meeting # 6 1 / 36 Outline 1 The First-Difference (FD) estimator 2 Dynamic panel data models 3 The Anderson and Hsiao

More information

Stat 5101 Lecture Notes

Stat 5101 Lecture Notes Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random

More information

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case Arthur Lewbel Boston College December 2016 Abstract Lewbel (2012) provides an estimator

More information

11. Bootstrap Methods

11. Bootstrap Methods 11. Bootstrap Methods c A. Colin Cameron & Pravin K. Trivedi 2006 These transparencies were prepared in 20043. They can be used as an adjunct to Chapter 11 of our subsequent book Microeconometrics: Methods

More information

Efficient Estimation of Dynamic Panel Data Models: Alternative Assumptions and Simplified Estimation

Efficient Estimation of Dynamic Panel Data Models: Alternative Assumptions and Simplified Estimation Efficient Estimation of Dynamic Panel Data Models: Alternative Assumptions and Simplified Estimation Seung C. Ahn Arizona State University, Tempe, AZ 85187, USA Peter Schmidt * Michigan State University,

More information

Missing dependent variables in panel data models

Missing dependent variables in panel data models Missing dependent variables in panel data models Jason Abrevaya Abstract This paper considers estimation of a fixed-effects model in which the dependent variable may be missing. For cross-sectional units

More information

Nonparametric Identication of a Binary Random Factor in Cross Section Data and

Nonparametric Identication of a Binary Random Factor in Cross Section Data and . Nonparametric Identication of a Binary Random Factor in Cross Section Data and Returns to Lying? Identifying the Effects of Misreporting When the Truth is Unobserved Arthur Lewbel Boston College This

More information

Elasticity of trade flow to trade barriers: A comparison among emerging estimation techniques

Elasticity of trade flow to trade barriers: A comparison among emerging estimation techniques 1 Elasticity of trade flow to trade barriers: A comparison among emerging estimation techniques Olper A. 1, Raimondi V. 1 1 University of Milano, Department of Agricultural Economics Abstract The objective

More information

A Monte Carlo Comparison of Various Semiparametric Type-3 Tobit Estimators

A Monte Carlo Comparison of Various Semiparametric Type-3 Tobit Estimators ANNALS OF ECONOMICS AND FINANCE 4, 125 136 (2003) A Monte Carlo Comparison of Various Semiparametric Type-3 Tobit Estimators Insik Min Department of Economics, Texas A&M University E-mail: i0m5376@neo.tamu.edu

More information

Once an Enemy, Forever an Enemy? The Long-run Impact of Japanese Invasion of China on Trade and Investment

Once an Enemy, Forever an Enemy? The Long-run Impact of Japanese Invasion of China on Trade and Investment Once an Enemy, Forever an Enemy? The Long-run Impact of Japanese Invasion of China on Trade and Investment Yi Che a, Julan Du b, Yi Lu c, and Zhigang Tao a a University of Hong Kong, b Chinese University

More information

Testing Homogeneity Of A Large Data Set By Bootstrapping

Testing Homogeneity Of A Large Data Set By Bootstrapping Testing Homogeneity Of A Large Data Set By Bootstrapping 1 Morimune, K and 2 Hoshino, Y 1 Graduate School of Economics, Kyoto University Yoshida Honcho Sakyo Kyoto 606-8501, Japan. E-Mail: morimune@econ.kyoto-u.ac.jp

More information

CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M.

CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M. CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. Linear

More information

Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems

Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems Wooldridge, Introductory Econometrics, 3d ed. Chapter 9: More on specification and data problems Functional form misspecification We may have a model that is correctly specified, in terms of including

More information

Least Squares Estimation of a Panel Data Model with Multifactor Error Structure and Endogenous Covariates

Least Squares Estimation of a Panel Data Model with Multifactor Error Structure and Endogenous Covariates Least Squares Estimation of a Panel Data Model with Multifactor Error Structure and Endogenous Covariates Matthew Harding and Carlos Lamarche January 12, 2011 Abstract We propose a method for estimating

More information

Marginal Specifications and a Gaussian Copula Estimation

Marginal Specifications and a Gaussian Copula Estimation Marginal Specifications and a Gaussian Copula Estimation Kazim Azam Abstract Multivariate analysis involving random variables of different type like count, continuous or mixture of both is frequently required

More information

Session 4-5: The benchmark of theoretical gravity models

Session 4-5: The benchmark of theoretical gravity models ARTNeT- GIZ Capacity Building Workshop on Introduction to Gravity Modelling: 19-21 April 2016, Ulaanbaatar Session 4-5: The benchmark of theoretical gravity models Dr. Witada Anukoonwattaka Trade and Investment

More information

the error term could vary over the observations, in ways that are related

the error term could vary over the observations, in ways that are related Heteroskedasticity We now consider the implications of relaxing the assumption that the conditional variance Var(u i x i ) = σ 2 is common to all observations i = 1,..., n In many applications, we may

More information

Christopher Dougherty London School of Economics and Political Science

Christopher Dougherty London School of Economics and Political Science Introduction to Econometrics FIFTH EDITION Christopher Dougherty London School of Economics and Political Science OXFORD UNIVERSITY PRESS Contents INTRODU CTION 1 Why study econometrics? 1 Aim of this

More information

Do Markov-Switching Models Capture Nonlinearities in the Data? Tests using Nonparametric Methods

Do Markov-Switching Models Capture Nonlinearities in the Data? Tests using Nonparametric Methods Do Markov-Switching Models Capture Nonlinearities in the Data? Tests using Nonparametric Methods Robert V. Breunig Centre for Economic Policy Research, Research School of Social Sciences and School of

More information

The Log of Gravity. J.M.C. Santos Silva y. Silvana Tenreyro z. October 13, 2004

The Log of Gravity. J.M.C. Santos Silva y. Silvana Tenreyro z. October 13, 2004 The Log of Gravity J.M.C. Santos Silva y Silvana Tenreyro z October 13, 2004 The authors are grateful to Francesco Caselli, Juan Carlos Hallak, Daniel Mota, Paulo Parente, Manuela Simarro and Kim Underhill

More information

On IV estimation of the dynamic binary panel data model with fixed effects

On IV estimation of the dynamic binary panel data model with fixed effects On IV estimation of the dynamic binary panel data model with fixed effects Andrew Adrian Yu Pua March 30, 2015 Abstract A big part of applied research still uses IV to estimate a dynamic linear probability

More information

What can we learn about correlations from multinomial probit estimates?

What can we learn about correlations from multinomial probit estimates? What can we learn about correlations from multinomial probit estimates? Chiara Monfardini J.M.C. Santos Silva February 2006 Abstract It is well known that, in a multinomial probit, only the covariance

More information

Robust covariance estimation for quantile regression

Robust covariance estimation for quantile regression 1 Robust covariance estimation for quantile regression J. M.C. Santos Silva School of Economics, University of Surrey UK STATA USERS GROUP, 21st MEETING, 10 Septeber 2015 2 Quantile regression (Koenker

More information

Modified Variance Ratio Test for Autocorrelation in the Presence of Heteroskedasticity

Modified Variance Ratio Test for Autocorrelation in the Presence of Heteroskedasticity The Lahore Journal of Economics 23 : 1 (Summer 2018): pp. 1 19 Modified Variance Ratio Test for Autocorrelation in the Presence of Heteroskedasticity Sohail Chand * and Nuzhat Aftab ** Abstract Given that

More information

Økonomisk Kandidateksamen 2004 (I) Econometrics 2. Rettevejledning

Økonomisk Kandidateksamen 2004 (I) Econometrics 2. Rettevejledning Økonomisk Kandidateksamen 2004 (I) Econometrics 2 Rettevejledning This is a closed-book exam (uden hjælpemidler). Answer all questions! The group of questions 1 to 4 have equal weight. Within each group,

More information

Estimating the Gravity Model When Zero Trade Flows Are Frequent and Economically Determined

Estimating the Gravity Model When Zero Trade Flows Are Frequent and Economically Determined Policy Research Working Paper 7308 WPS7308 Estimating the Gravity Model When Zero Trade Flows Are Frequent and Economically Determined Will Martin Cong S. Pham Public Disclosure Authorized Public Disclosure

More information

The OLS Estimation of a basic gravity model. Dr. Selim Raihan Executive Director, SANEM Professor, Department of Economics, University of Dhaka

The OLS Estimation of a basic gravity model. Dr. Selim Raihan Executive Director, SANEM Professor, Department of Economics, University of Dhaka The OLS Estimation of a basic gravity model Dr. Selim Raihan Executive Director, SANEM Professor, Department of Economics, University of Dhaka Contents I. Regression Analysis II. Ordinary Least Square

More information

Quantifying the effects of NTMs. Xinyi Li Trade Policies Review Division, WTO Secretariat 12 th ARTNeT Capacity Building Workshop December 2016

Quantifying the effects of NTMs. Xinyi Li Trade Policies Review Division, WTO Secretariat 12 th ARTNeT Capacity Building Workshop December 2016 Quantifying the effects of NTMs Xinyi Li Trade Policies Review Division, WTO Secretariat 12 th ARTNeT Capacity Building Workshop December 2016 1 Approaches to quantifying NTMs Chen and Novy (2012) described

More information

Increasing the Power of Specification Tests. November 18, 2018

Increasing the Power of Specification Tests. November 18, 2018 Increasing the Power of Specification Tests T W J A. H U A MIT November 18, 2018 A. This paper shows how to increase the power of Hausman s (1978) specification test as well as the difference test in a

More information

Panel Threshold Regression Models with Endogenous Threshold Variables

Panel Threshold Regression Models with Endogenous Threshold Variables Panel Threshold Regression Models with Endogenous Threshold Variables Chien-Ho Wang National Taipei University Eric S. Lin National Tsing Hua University This Version: June 29, 2010 Abstract This paper

More information

Structural Estimation of Gravity Models with Market Entry Dynamics

Structural Estimation of Gravity Models with Market Entry Dynamics Structural Estimation of Gravity Models with Market Entry Dynamics Peter Egger, Andrea Leiter and Michael Pfaffermayr 13th August 2010 Abstract This paper develops a structural empirical general equilibrium

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Endogeneity b) Instrumental

More information

The Border E ect Through the Rearview Mirror: Would the puzzle have existed if today s tools had been used?

The Border E ect Through the Rearview Mirror: Would the puzzle have existed if today s tools had been used? The Border E ect Through the Rearview Mirror: Would the puzzle have existed if today s tools had been used? Daniel A. Dias y December 4, 20 Abstract In this paper I re-evaluate the McCallum (995) border

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Princeton University Asian Political Methodology Conference University of Sydney Joint

More information

Specification Test on Mixed Logit Models

Specification Test on Mixed Logit Models Specification est on Mixed Logit Models Jinyong Hahn UCLA Jerry Hausman MI December 1, 217 Josh Lustig CRA Abstract his paper proposes a specification test of the mixed logit models, by generalizing Hausman

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University Joint

More information

DEPARTMENT OF ECONOMICS

DEPARTMENT OF ECONOMICS ISSN 0819-64 ISBN 0 7340 616 1 THE UNIVERSITY OF MELBOURNE DEPARTMENT OF ECONOMICS RESEARCH PAPER NUMBER 959 FEBRUARY 006 TESTING FOR RATE-DEPENDENCE AND ASYMMETRY IN INFLATION UNCERTAINTY: EVIDENCE FROM

More information

Session 3-4: Estimating the gravity models

Session 3-4: Estimating the gravity models ARTNeT- KRI Capacity Building Workshop on Trade Policy Analysis: Evidence-based Policy Making and Gravity Modelling for Trade Analysis 18-20 August 2015, Kuala Lumpur Session 3-4: Estimating the gravity

More information

The regression model with one fixed regressor cont d

The regression model with one fixed regressor cont d The regression model with one fixed regressor cont d 3150/4150 Lecture 4 Ragnar Nymoen 27 January 2012 The model with transformed variables Regression with transformed variables I References HGL Ch 2.8

More information

Economics Division University of Southampton Southampton SO17 1BJ, UK. Title Overlapping Sub-sampling and invariance to initial conditions

Economics Division University of Southampton Southampton SO17 1BJ, UK. Title Overlapping Sub-sampling and invariance to initial conditions Economics Division University of Southampton Southampton SO17 1BJ, UK Discussion Papers in Economics and Econometrics Title Overlapping Sub-sampling and invariance to initial conditions By Maria Kyriacou

More information

Testing an Autoregressive Structure in Binary Time Series Models

Testing an Autoregressive Structure in Binary Time Series Models ömmföäflsäafaäsflassflassflas ffffffffffffffffffffffffffffffffffff Discussion Papers Testing an Autoregressive Structure in Binary Time Series Models Henri Nyberg University of Helsinki and HECER Discussion

More information

The Size and Power of Four Tests for Detecting Autoregressive Conditional Heteroskedasticity in the Presence of Serial Correlation

The Size and Power of Four Tests for Detecting Autoregressive Conditional Heteroskedasticity in the Presence of Serial Correlation The Size and Power of Four s for Detecting Conditional Heteroskedasticity in the Presence of Serial Correlation A. Stan Hurn Department of Economics Unversity of Melbourne Australia and A. David McDonald

More information

Econometrics - 30C00200

Econometrics - 30C00200 Econometrics - 30C00200 Lecture 11: Heteroskedasticity Antti Saastamoinen VATT Institute for Economic Research Fall 2015 30C00200 Lecture 11: Heteroskedasticity 12.10.2015 Aalto University School of Business

More information

Darmstadt Discussion Papers in Economics

Darmstadt Discussion Papers in Economics Darmstadt Discussion Papers in Economics The Effect of Linear Time Trends on Cointegration Testing in Single Equations Uwe Hassler Nr. 111 Arbeitspapiere des Instituts für Volkswirtschaftslehre Technische

More information

Rewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35

Rewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35 Rewrap ECON 4135 November 18, 2011 () Rewrap ECON 4135 November 18, 2011 1 / 35 What should you now know? 1 What is econometrics? 2 Fundamental regression analysis 1 Bivariate regression 2 Multivariate

More information

Bootstrapping Heteroskedasticity Consistent Covariance Matrix Estimator

Bootstrapping Heteroskedasticity Consistent Covariance Matrix Estimator Bootstrapping Heteroskedasticity Consistent Covariance Matrix Estimator by Emmanuel Flachaire Eurequa, University Paris I Panthéon-Sorbonne December 2001 Abstract Recent results of Cribari-Neto and Zarkos

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Censoring and truncation b)

More information