Robust Inference with Multi-way Clustering

Size: px
Start display at page:

Download "Robust Inference with Multi-way Clustering"

Transcription

1 Robust Inference with Multi-way Clustering Colin Cameron, Jonah Gelbach, Doug Miller U.C. - Davis, U. Arizona, U.C. - Davis February 2010 February / 44

2 1.1 Introduction Moulton (1986, 1990) and Bertrand, Duo, and Mullainathan (2004) showed the importance of controlling for clustering. Failure to control underestimates OLS standard errors and overstates t statistics. Initially use one-way random eects model. Now use cluster-robust standard errors (White (1984), Arellano (1987), Liang and Zeger (1986), Rogers (1993)). February / 44

3 This paper extends to cluster-robust in two (nonnested) dimensions Example 1: More than one grouped regressor. Example 2: Cross-section unit and time for panel data. Outline of presentation Lengthy discussion of one-way cluster-robust. Present method for two-way cluster-robust. Simulation and application. Conclusion. February / 44

4 1.2 OLS with Cluster Errors Model for G clusters with N g individuals per cluster: OLS estimator y ig = x 0 ig β + u ig, i = 1,..., N g, g = 1,..., G, y g = X g β + u g, g = 1,..., G, y = Xβ + u, bβ = ( G g=1 N g i=1 x ig x 0 ig ) 1 ( G g=1 N g i=1 x ig y ig ) = ( G g=1 X0 g X g ) 1 ( G g=1 X g y g ) = (X 0 X) 1 X 0 y. February / 44

5 OLS with Clustered Errors (continued) As usual bβ = β + (X 0 X) 1 X 0 u = β + (X 0 X) 1 ( G g=1 X g u g ). Assume independence over g with u g [0, Σ g = E[u g ug 0 ]]. Then β b a N [β, V[ β]] b with V[ b β] = (X 0 X) 1 ( G g=1 X g Σ g X 0 g )(X 0 X) 1. February / 44

6 OLS with Clustered Errors (continued) If ignore clustering the default OLS variance estimate should be inated by approximately where τ j ' 1 + ρ xj ρ u ( N g 1), ρ xj is the within cluster correlation of x j ρ u is the within cluster error correlation N g is the average cluster size. Kloek (1981), Scott and Holt (1982). Moulton (1986, 1990) showed that could be large even if ρ u small. e.g. N G = 81, ρ x = 1 and ρ u = 0.1 then τ j = 9! So should correct for clustering - but Σ g is unknown. February / 44

7 1.3 Random eects approach The original way to obtain consistent variance matrix estimate. Assume a random eects (RE) model u ig = α g + ε ig α g iid[0, σ 2 α] ε ig iid[0, σ 2 ε ]. Then Σ g = σ 2 ui Ng + σ 2 αe Ng e 0 N g and we use bv RE [ b β] = (X 0 X) 1 ( G g=1 X g b Σ g X 0 g )(X 0 X) 1, where bσ g = bσ 2 ui Ng + bσ 2 αe Ng e 0 N g, and bσ 2 ε and bσ 2 α are consistent. Weakness is strong distributional assumptions. February / 44

8 1.4 Cluster-Robust Variance Estimates The current method to obtain variance estimates. The cluster-robust variance estimate (CRVE) bv CR [ b β] = (X 0 X) 1 ( G g=1 X g eu g eu 0 g X 0 g )(X 0 X) 1, provides a consistent estimator of V CR [ b β] if plim 1 G G g=1 X g eu g eu g 0 X 0 g = plim 1 G G g=1 X g Σ g X 0 g. bv CR [ b β] yields cluster-robust standard errors. If OLS residuals are used then eu g = bu g = y g X g b β. Then bv CR [ b β] is downwards-biased for V[ b β]. February / 44

9 Cluster-Robust Variance Estimates (continued) The CRVE was proposed by Liang and Zeger (1986) for grouped data proposed by Arellano (1987) for FE estimator for short panels (where the grouping is on the individual) popularized by incorporation in Stata as the cluster option (Rogers (1993)). Bertrand, Duo and Mullainathan (2004) in a much-cited paper noted that with state-year data many studies were erroneously clustering on state-year (implying independence over time) rather than clustering on just state. February / 44

10 1.5 CRVE with few clusters CRVE assumes G!. What if G is small? Stata uses eu g = p cbu g where c = critical values. Reasonable starting point. G G 1 N 1 N k ' G G 1 and T (G 1) Donald and Lang (2007) do asymptotic theory when G is small and N g!. Then a two-step FGLS RE estimator yields t-test that is T (G 2) under some assumptions. Ibragimov and Muller (2010) do asymptotic theory when G is small and N g! and only within-group variation is relevant. Then separately estimate b β g s and average. Cameron, Gelbach and Miller (2007) propose cluster bootstraps with asymptotic renement. February / 44

11 1.6 Wild Cluster Bootstrap-T Procedure Test H 0 : β 1 = β 0 1 against H a : β 1 6= β 0 1 using w = ( bβ 1 β 0 1 )/s bβ 1. 1 Obtain the OLS estimator b β and OLS residuals bu g, g = 1,..., G. [Best to use residuals that impose H 0 ]. 2 Do B iterations of this step. On the b th iteration: 1 For each cluster g = 1,..., G, form bu g = bu g or bu g = bu g each with probability 0.5 and hence form by g = X 0 g b β + bu g. This yields wild cluster bootstrap resample f(by 1, X 1),..., (by G, X G )g. 2 Calculate the OLS estimate bβ 1,b and its standard error s bβ 1,b and given these form the Wald test statistic w b = ( bβ 1,b 3 Reject H 0 at level α if and only if bβ 1 )/s b β 1,b. w < w [α/2] or w > w [1 α/2], where w [q] denotes the qth quantile of w 1,..., w B. February / 44

12 1.7 Other work on clustering Wooldridge (2003) survey. Bhattacharya (2005) combines stratication and clustering, and in a GMM setting. Christian Hansen (2007a) shows that even for long panels it is okay to use CRVE with FE estimator (as shown in simulations by Krezde (2002)). Christian Hansen (2007b) shows that for short panels with AR(p) errors bias-corrected FGLS does well. Hausman and Kuersteiner (2005) do qualitatively similar small-t correction for Kiefer (1980) FGLS estimator in panel. Conley (1998) analyzes GMM with spatial correlation. Cameron and Miller (2010) show that degrees-of-freedom corrections can lead to cluster-robust variance estimates diering substantially in LSDV versus mean-dierenced model if T is small. February / 44

13 2.0 Motivating example How do job injury rates eect wages? Hersch (1998). CPS individual data on male wages N = But there is no individual data on job injury rate. Instead aggregated data: data on industry injury rates for 211 industries data on occupations injury rates for 387 occupations. Model estimated is What should we do? y igh = α + x 0 igh β + γ rind ig + δ rocc ih + u igh. GLS two-way random eects: u igh = ε g + ε h + ε igh ; ε g, ε h, ε igh i.i.d. Strong assumptions. OLS then cluster on industry for bγ and cluster on occupation for bδ. Ad hoc. February / 44

14 2.1 One-way clustering Return to one-way clustering before extending to two-way. Clustering on group g (for g = 1,..., G ) with N g individuals in group g : y ig = xig 0 β + u ig E[u ig u jg 0jx ig, x jg 0] = 0, for i 6= j unless g = g 0. Then informally the estimated asymptotic variance matrix of b β OLS, denoted Avar[ b β], is where Avar[ b β] = (X 0 X) 1 bb(x 0 X) 1 February / 44

15 One-way Clustering bb = G g=1 X 0 g bu g bu 0 g X g 2 bu 1 bu = X 0 0 bu 2 bu X 0 bu G bu G E = X 0 bubu0. 0 E C 5A X 0 E G = X 0 (bubu 0. S G )X, February / 44

16 2.2 Two-way Clustering With g denoting rst group and h denoting second group y igh = x 0 igh β+u igh, E[u igh u jg 0 h 0jx igh, x jg 0 h 0] = 0, for i 6= j unless g = g 0 or h = h 0. Then use bb = X 0 (bubu 0. S GH )X, (1) where S GH is an N N indicator matrix with ij th entry equal to one if the i th and j th observation share any cluster and equal to zero otherwise. February / 44

17 Two-way Clustering Dene three N N indicator matrices: S G with ij th entry equal to one if the i th and j th observation belong to the same cluster g 2 f1, 2,..., G g S H with ij th entry equal to one if the i th and j th observation belong to the same cluster h 2 f1, 2,..., Hg S G \H with ij th entry equal to one if the i th and j th observation belong to both the same cluster g 2 f1, 2,..., G g and the same cluster h 2 f1, 2,..., Hg. Then so S GH = S G + S H S G \H, bb = X 0 (bubu 0. S G )X + X 0 (bubu 0. S H )X X 0 (bubu 0. S G \H. X), February / 44

18 Two-way Clustering Hence Avar[ b β] = (X 0 X) 1 X 0 (bubu 0. S G )X(X 0 X) 1 (2) +(X 0 X) 1 X 0 (bubu 0. S H )X(X 0 X) 1 (X 0 X) 1 X 0 (bubu 0. S G \H )X(X 0 X) 1. The three components can be separately computed by (1) OLS regression of y on X with standard errors computed using clustering on g 2 f1, 2,..., G g; (2) OLS regression of y on X with standard errors computed using clustering on h 2 f1, 2,..., Hg; and (3) OLS regression of y on X with standard errors computed using clustering on (g, h) 2 f(1, 1),..., (G, H)g Stata add-on cgmreg.ado does this. February / 44

19 Two-way Clustering: other research This method has been independently proposed by others (we began in 2004). Applied: Acemoglu and Pischke (2003): cluster on individual and region-by-time. Applied: Fafchamps and Gubert (2006): cluster on person 1 and person 2 in person-pair data (dyadic) for networks. Theory: Thompson (2006): cluster on rm on time. Theory: Miglioretti and Heagarty (2006): cluster on person, radiologist, and facility. Theory: Conley (1998): cluster on space and time (dierent asymptotic theory). February / 44

20 2.3 Practical Considerations If bv[ b β] is not positive-denite (small G, H) then Decompose bv[ β] b = UΛU 0 ; U contains eigenvectors of bv, and Λ = Diag[λ 1,..., λ d ] contains eigenvalues. Create Λ + = Diag[λ 1 +,..., λ+ d ], with λ+ j = max 0, λ j, and use bv + [ b β] = UΛ + U 0. Fixed eects in one or both dimensions We do not formally address this complication Intuitively if G! and H! then each xed eect is estimated using many observations. In practice the main consequence of including xed eects is a reduction in within-cluster correlation. February / 44

21 2.4 Multi-way clustering Extends to multi-way. e.g. 7 variance estimates for three-way clustering and bb may be written as B e (1,0,0) + eb (0,1,0) + eb (0,0,1) B e (1,1,0) + eb (1,0,1) + eb (0,1,1) + eb (1,1,1 Extends to m-estimators and GMM e.g. For probit three components can be separately computed by (1) probit regression with standard errors computed using clustering on g 2 f1, 2,..., G g; (2) probit regression with standard errors computed using clustering on h 2 f1, 2,..., Hg; and (3) probit regression with standard errors computed using clustering on (g, h) 2 f(1, 1),..., (G, H)g Then add (1) and (2) and subtract (3). February / 44

22 2.5 GMM M-estimator solves N i=1 h i (bθ) = 0. Under standard assumptions, bθ is asymptotically normal with bv[bθ] = ba 1 bbba 1, h where ba = i i, and bb is an estimate of b V[ i h i ]. θ θ For independence over i For one-way clustering bb = bb = N N i=1 j=1 N bh i bh i. 0 i=1 bh i bh 0 ji (i, j) where I (i, j) = 1 if i and j are in the same cluster. February / 44

23 GMM Two-way clustering For two-way clustering bb = N N i=1 j=1 bh i bh 0 ji 1 (i, j) + N N i=1 j=1 bh i bh 0 ji 2 (i, j) N N i=1 j=1 bh i bh 0 ji 12 (i, j) where I 1 (i, j) = 1 if i and j are in the same cluster in dimension 1 and I 2 (i, j) = 1 if i and j are in the same cluster in dimension 2 and I 12 (i, j) = 1 if i and j are in the same cluster in dimensions 1 and 2. February / 44

24 3.1.1 Dgp with no clustering Simulations are for one observation per (h, g) combination. So HG observations and can drop the subscript i. Balanced and equal size groups H = G = 10, 20,..., 100 plus some cases H 6= G. Dgp is y gh = β 0 + β 1 x 1gh + β 2 x 2gh + α g + δ h + ε gh where β 0 = β 1 = β 2 = 1 and x ig, x ih, α g, δ h and ε igh are all iid N[0, 1]. February / 44

25 Dgp with no clustering Compute actual rejection rates for 5% two-sided Wald tests of β 1 = 1 and β 2 = 1. Dierent standard error estimates lead to dierent Wald tests. Methods used are 1. Assume iid errors 2. One-way cluster on g 3. Two-way random eects 4. Two-way cluster robust with T critical values. All methods should work with no clustering. February / 44

26 Dgp with no clustering All methods work ne with G = H = 100 Small sample over-rejection greatest for two-way cluster-robust, though diminished if use T (min(g, H) 1) degrees of freedom.. February / 44

27 Table 1 Rejection probabilities for a true null hypothesis True model: iid errors Number of Group Number of Group 1 Clusters 2 Clusters Assume iid errors Assumption about errors in construction of Variance One-way cluster robust (cluster on group1) Two-way random effects Two-way clusterrobust Two-way clusterrobust, T critical values % 6.7% 9.0% 9.7% 4.4% 5.8% 13.3% 14.9% 9.9% 11.5% % 4.8% 7.9% 5.7% 5.9% 4.9% 9.0% 7.6% 7.2% 5.9% % 4.8% 5.9% 5.9% 5.1% 5.0% 6.8% 7.1% 6.1% 6.1% % 4.8% 5.8% 5.5% 5.3% 4.7% 6.6% 6.1% 5.5% 5.4% % 5.3% 5.7% 6.1% 5.2% 5.8% 5.7% 6.5% 4.9% 5.6% % 5.6% 5.7% 5.8% 4.9% 5.4% 6.5% 6.0% 6.0% 5.2% % 5.3% 4.3% 5.6% 4.9% 5.5% 5.9% 6.0% 5.7% 5.7% % 5.3% 5.2% 6.0% 4.7% 5.2% 6.0% 6.4% 5.6% 5.9% % 4.8% 6.2% 5.0% 5.7% 4.8% 6.1% 4.6% 5.6% 4.3% % 5.1% 4.9% 5.3% 4.7% 5.2% 5.1% 6.1% 4.9% 5.7% % 4.0% 8.6% 7.3% 5.5% 4.1% 8.9% 9.7% 5.9% 6.8% % 5.1% 6.2% 7.3% 5.3% 5.1% 6.7% 7.4% 5.2% 5.8% % 5.1% 7.0% 7.7% 4.7% 5.0% 8.9% 8.9% 5.9% 5.8% % 3.8% 5.8% 5.0% 3.9% 3.7% 6.7% 7.7% 5.4% 5.9% % 5.0% 5.5% 5.4% 5.0% 4.9% 5.9% 6.5% 5.2% 5.7% Note: The null hypothesis should be rejected 5% of the time. Number of monte carlo simulations is 2000, except that for methods 1-3 it is 1000 when G*H > February / 44

28 3.1.2 Dgp with two-way clustered homoskedastic errors Dgp is y gh = β 0 + β 1 x 1gh + β 2 x 2gh + α g + δ h + u gh where now u gh = ε g + ε h + ε gh and each error is iid N [0, 1] and x 1gh = z g + z gh where each is iid N [0, 1] and x 2gh is similar. Then only the two-way methods work. Again two-way random eects does best as this is situation it is intended for. February / 44

29 Table 2 Rejection probabilities for a true null hypothesis True model: random effects on both Group1 and Group 2 Assumption about errors in construction of Variance Number of Group Number of Group 1 Clusters 2 Clusters Assume iid errors One-way cluster robust (cluster on group1) Two-way random effects Two-way clusterrobust Two-way clusterrobust, T critical values % 23.7% 13.7% 33.2% 6.2% 6.6% 17.4% 17.4% 12.6% 13.4% % 32.3% 8.6% 42.7% 5.7% 5.3% 10.3% 9.6% 8.7% 7.6% % 41.7% 7.4% 50.6% 5.2% 5.6% 8.6% 9.1% 7.8% 7.7% % 47.6% 8.7% 55.2% 6.5% 5.4% 9.0% 9.3% 7.9% 8.1% % 50.4% 6.0% 58.8% 4.9% 4.8% 7.0% 6.7% 6.3% 6.2% % 56.3% 6.4% 64.1% 5.6% 6.5% 6.7% 6.1% 5.9% 5.6% % 57.6% 5.5% 64.8% 4.8% 6.0% 6.4% 6.5% 5.9% 6.0% % 60.9% 6.5% 67.0% 4.9% 4.7% 6.3% 7.0% 5.7% 6.5% % 60.7% 5.6% 67.9% 5.0% 5.4% 6.0% 6.7% 5.8% 6.3% % 60.4% 5.8% 67.9% 5.3% 3.6% 6.4% 5.3% 6.1% 5.1% % 21.3% 13.3% 33.4% 8.9% 4.0% 15.0% 9.3% 10.2% 5.8% % 33.1% 9.8% 44.5% 6.7% 4.5% 9.3% 8.1% 8.2% 6.2% % 21.0% 14.1% 31.7% 10.4% 3.3% 14.2% 8.1% 9.2% 4.6% % 33.9% 10.0% 43.7% 6.2% 3.7% 9.2% 6.3% 7.7% 4.6% % 54.0% 6.6% 60.5% 5.3% 3.6% 6.9% 6.9% 6.4% 6.1% Note: See Table 1. February / 44

30 3.1.3 Dgp with two-way clustered heteroskedastic errors Dgp is y gh = β 0 + β 1 x 1gh + β 2 x 2gh + α g + δ h + u gh where u gh = ε g + ε h + ε gh and each error is iid N [0, 1] EXCEPT now ε gh is N [0, jx 1gh x 2gh j] and x 1gh = z g + z gh where each is iid N [0, 1] and x 2gh is similar. Then only the two-way cluster-robust method works. February / 44

31 Table 1 Rejection probabilities for a true null hypothesis True model: a random effect common to each group, and a heteroscedastic component. Assumption about errors in construction of Variance Number of Group 1 Clusters Number of Group 2 Clusters Assume independent errors One-way cluster robust (cluster on group1) Two-way random effects Two-way clusterrobust % 7.9% 15.7% 9.8% 15.9% 14.3% 18.4% 16.5% 14.5% 12.9% 8.6% 8.9% % 5.4% 9.5% 7.1% 13.0% 10.8% 11.9% 10.9% 10.3% 8.8% 7.1% 5.9% % 6.9% 7.0% 8.1% 9.7% 10.8% 8.2% 9.2% 7.1% 8.0% 6.2% 6.0% % 8.2% 6.0% 8.5% 8.8% 9.4% 7.1% 7.0% 6.3% 6.5% 5.9% 5.7% % 10.5% 6.1% 11.2% 9.6% 9.0% 6.4% 6.9% 6.0% 6.4% 4.4% 5.4% % 5.6% 12.9% 8.8% 12.9% 12.3% 13.7% 9.8% 9.6% 5.9% 6.1% 6.0% % 7.5% 7.9% 8.1% 10.5% 11.5% 9.2% 8.6% 7.6% 6.6% 5.2% 6.7% % 6.4% 10.4% 9.4% 10.1% 13.0% 11.3% 10.0% 7.3% 6.8% 7.5% 6.2% % 5.3% 9.2% 6.7% 10.8% 10.1% 9.4% 6.4% 7.7% 4.5% 5.1% 6.2% % 8.1% 6.7% 8.7% 9.9% 10.0% 6.9% 6.8% 6.1% 6.2% 6.1% 5.2% Note: The null hypothesis should be rejected 5% of the time. Number of monte carlo simulations is Two-way clusterrobust, T critical values Group fixed effects, Two-way cluster-robust February / 44

32 3.2 Dgp with Time-State Correlation Extension of BDM (2004). Data on 1,358,623 women from CPS. Model is y ist = αd st + xist 0 β + δ s + γ t + u ist, where d st is placebo policy created to be correlated over s and t : Specically d st = dst s + 2dst t ; s = 1,..., 50 ordered states where dst s = 0.6dst s 1 + v st s, with v st s iid N [0, 1] and dst t = 0.6ds t 1,t + v st t with v st t iid N [0, 1], and also independent from other variables. Here the index s ranges Two-way cluster-robust is best when no xed eects included. No gain when xed eects included. So here xed eects are enough. February / 44

33 Table 2 Rejection probabilities for a true null hypothesis Monte Carlos with micro (CPS) data quartic in age, 4 education dummies quartic in age, 4 education dummies, state fixed effects RHS control variables quartic in age, 4 education dummies, year fixed effects quartic in age, 4 education dummies, state and year fixed effects Standard error estimator: Heteroscedasticity robust 91.6% 92.1% 82.2% 79.0% One-way cluster robust (cluster on state-by-year cell) 19.8% 22.4% 13.1% 13.9% One-way cluster robust (cluster on state) 16.2% 17.0% 12.0% 15.0% One-way cluster robust (cluster on year) 10.2% 8.9% 8.7% 7.6% Two-way cluster-robust (cluster on state and year) 7.2% 6.9% 7.6% 7.1% Note: Data come from 1.3 million employed women from the March CPS. Table reports rejection rates for testing a (true) null hypothesis of zero on the coefficient of fake treatments. The "treatments" are generated as (t = e_s + 2 e_t), with e_s a state-specific autoregressive component and e_t a year-specific "spatial" autoregressive component. The outcome is also modified by an independent yearspecific autoregressive component. See text for details Monte Carlo replications February / 44

34 4.1 Hersch - Cross-Section with Two-Way Clustering CPS data on male wages N = Separate data on industry and occupation injury rates. 211 injuries and 387 occupations. Model estimates is y igh = α + x 0 igh β + γ rind ig + δ rocc ih + u igh. Two-way clustering is best. C February / 44

35 Table 3 Replication of Hersch (1998) Industry Injury Rate Variable Occupation Injury Rate Estimated slope coefficient: Estimated standard errors Default (iid) (0.415) {0.0000} (0.235) {0.0478} and p-values: Heteroscedastic robust (0.397) {0.0000} (0.260) {0.0737} One-way cluster on Industry (0.643) {0.0032} (0.251) {0.0639} One-way cluster on Occupation (0.486) {0.0001} (0.363) {0.2002} Two-way clustering (0.702) {0.0070} (0.357) {0.1927} Note: Replication of Hersch (1998), pg 604, Table 3, Panel B, Column 4. Standard errors in parentheses. P- values from a test of each coefficient equal to zero in brackets. Data are 5960 observations on working men from the Current Population Survey. Both columns come from the same regression. There are 211 industries and 387 occupations in the data set. February / 44

36 4.3 Rose and Engel - Bilateral Trade Dyadic data: trade ows between pairs of countries. First column of Table 3 of Rose and Engel (2002). gravity model is tted for the natural logarithm of bilateral trade single cross-section on trade ows between 98 countries with 3262 unique country pairs. Cameron and Golotvina (2005) control for two-way clustering, using FGLS estimation based on iid country random eects. Here instead apply the more robust method to their paper. Focus on coecient of the log product of real GDP (= 0.867) heteroskedastic-robust standard error of average one-way clustered standard error of two-way robust standard error is if country specic eects are included as alternative way to control for the clustering, then the coecient of the log product of real GDP is no longer identied. February / 44

37 4.3 Gruber and Madrian - Rotating Panel March CPS data on 39,063 men. Model of retirement (y ist ) and health insurance coverage (d st ) Pr[y ist = 1] = Φ(αd st + xist 0 β + δ s + γ t ). Rotating design - many individuals appear in sample twice. February / 44

38 Table 6 Replication of Gruber and Madrian (1995) Model Probit OLS Estimated slope coefficient (* 1000): Estimated standard errors (* 1000): Default (iid) (5.709) (0.675) Heteroscedastic robust (5.732) (0.684) One-way cluster on state-year (6.265) (0.759) One-way cluster on household id (5.866) (0.702) One-way cluster on hhid-by-state-year (5.732) (0.685) Two-way clustering (6.389) (0.775) One-way cluster on State (6.030) (0.718) Note: Replication of Gruber and Madrian (1995), pg 943, Table 3, Model 1, Column 1. Standard errors in parentheses. Data are 39,063 observations on year-old men from the Current Population Surveys. February / 44

39 Conclusion Method is straightforward to implement. Works well except in smallest designs with G = H = 10. Many potential applications. Sometimes has big impact and sometimes modest. But researcher will not know this a priori. February / 44

40 Asymptotic theory Key is h B 0 = lim E G 1 H 2 N i=1 N j=1 z i (θ)z j (θ) 0i. where N i=1 z i (θ) = G g=1 H h=1 z gh (θ) and z gh (θ) = i2cgh z igh (θ) = i2cgh (y igh xigh 0 β)x igh. For two-way clustering since i z i (θ) = g h z gh (θ) we have h E G 1 H 2 N i=1 N j=1 z i (θ)z j (θ) 0i = E hg 1 H 2 Gg=1 Hh=1 Gg 0=1 Hh 0=1 z gh(θ)z g 0 h 0(θ)0i = G 1 H 2 g h h 0 E[z gh z 0 gh 0] g = g 0 +G 1 H 2 h g g 0 E[z gh z 0 g 0 h ] h = h0 G 1 H 2 g h E[z gh z 0 gh]. February / 44

41 Assume the triple sums are of order GH 2 = G. The triple sums are instead of order GH, rather than GH 2, if the dependence of observations in common cluster g goes to zero as clusters h and h 0 become further apart. case with declining time series dependence or spatial dependence. then in B 0 normalize by (GH) rate of convergence of the estimator becomes a faster p GH rather than p G. Still obtain the same asymptotic variance matrix for b β. February / 44

42 Some References Acemoglu, D., and J.-S. Pischke (2003), \Minimum Wages and On-the-job Training," Research in Labor Economics, 22, Arellano, M. (1987), \Computing Robust Standard Errors for Within-Group Estimators," Oxford Bulletin of Economics and Statistics, 49, Bertrand, M., E. Duo, and S. Mullainathan (2004), \How Much Should we Trust Dierences-in-Dierences Estimates," Quarterly Journal of Economics, Bhattacharya, D. (2005), \Asymptotic inference from multi-stage samples,"' Hournal of Econometrics, 126: Cameron, A.C., and N. Golotvina (2005), \Estimation of Country-Pair Data Models Controlling for Clustered Errors: with International Trade Applications," U.C.-Davis Economics Department Working Paper No Conley, T. G. (1999), \GMM with cross sectional dependence," Journal of Econometrics, 92: Donald, S. and K. Lang (2004), \Inferences with Dierences in Dierences and Other Panel Data," October 22, February / 44

43 Some References Fafchamps, M., and F. Gubert (2006), \The Formation of Risk Sharing Networks," mimeo, April Hansen, C. B. (2007a), \Generalized least squares inference in panel and multilevel models with serial correlation and xed eects," Journal of Econometrics, 140, Hansen, C. B. (2007b), \Asymptotic properties of a robust variance matrix estimator for panel data when T is large,"journal of Econometrics, 141: Ibragimov, R. and U.K. Muller (2007), \T-Statistic Based Correlation and Heterogeneity Robust Inference," Harvard Institute of Economic Research Discussion Paper No Kuersteiner, G. and J. Hausman (2007), \Dierence in Dierence meets Generalized Least Squares: Higher Order Properties of Hypotheses Tests". Liang, K.-Y., and S.L. Zeger (1986), \Longitudinal Data Analysis Using Generalized Linear Models," Biometrika, 73, Miglioretti, D.L., and P.J. Heagerty (2006), \Marginal Modeling of Nonnested Multilevel Data using Standard Software," American Journal of Epidemiology, 165(4), February / 44

44 Some References Moulton, B.R. (1986), \Random Group Eects and the Precision of Regression Estimates," Journal of Econometrics, 32, Moulton, B.R. (1990), \An Illustration of a Pitfall in Estimating the Eects of Aggregate Variables on Micro Units," Review of Economics and Statistics, 72, Petersen, M. (2006), \Estimating Standard Errors in Finance Panel Data Sets: Comparing Approaches," unpublished manuscript, Northwestern University. Rogers, W.H. (1993), \Regression Standard Errors in Clustered Samples," Stata Technical Bulletin, 13, Thompson, S. (2005), \A Simple Formula for Standard Errors that Cluster by Both Firm and Time," unpublished manuscript. White, H. (1984), Asymptotic Theory for Econometricians, San Diego, Academic Press. Wooldridge, J.M. (2003), \Cluster-Sample Methods in Applied Econometrics," American Economic Review, 93, February / 44

Bootstrap-Based Improvements for Inference with Clustered Errors

Bootstrap-Based Improvements for Inference with Clustered Errors Bootstrap-Based Improvements for Inference with Clustered Errors Colin Cameron, Jonah Gelbach, Doug Miller U.C. - Davis, U. Maryland, U.C. - Davis May, 2008 May, 2008 1 / 41 1. Introduction OLS regression

More information

Wild Bootstrap Inference for Wildly Dierent Cluster Sizes

Wild Bootstrap Inference for Wildly Dierent Cluster Sizes Wild Bootstrap Inference for Wildly Dierent Cluster Sizes Matthew D. Webb October 9, 2013 Introduction This paper is joint with: Contributions: James G. MacKinnon Department of Economics Queen's University

More information

TECHNICAL WORKING PAPER SERIES ROBUST INFERENCE WITH MULTI-WAY CLUSTERING. A. Colin Cameron Jonah B. Gelbach Douglas L. Miller

TECHNICAL WORKING PAPER SERIES ROBUST INFERENCE WITH MULTI-WAY CLUSTERING. A. Colin Cameron Jonah B. Gelbach Douglas L. Miller TECHNICAL WORKING PAPER SERIES ROBUST INFERENCE WITH MULTI-WAY CLUSTERING A. Colin Cameron Jonah B. Gelbach Douglas L. Miller Technical Working Paper 327 http://www.nber.org/papers/t0327 NATIONAL BUREAU

More information

Robust Inference with Clustered Data

Robust Inference with Clustered Data Robust Inference with Clustered Data A. Colin Cameron and Douglas L. Miller Department of Economics, University of California - Davis. This version: Feb 10, 2010 Abstract In this paper we survey methods

More information

Casuality and Programme Evaluation

Casuality and Programme Evaluation Casuality and Programme Evaluation Lecture V: Difference-in-Differences II Dr Martin Karlsson University of Duisburg-Essen Summer Semester 2017 M Karlsson (University of Duisburg-Essen) Casuality and Programme

More information

Robust Inference for Regression with Clustered Data

Robust Inference for Regression with Clustered Data Robust nference for Regression with Clustered Data Colin Cameron Univ. of California - Davis Joint work with Douglas L. Miller (UC - Davis). Based on A Practitioners Guide to Cluster-Robust nference, Journal

More information

Robust Inference with Clustered Errors

Robust Inference with Clustered Errors Robust nference with Clustered Errors Colin Cameron Univ. of California - Davis Keynote address at The 8th Annual Health Econometrics Workshop, University of Colorado Denver, Anschutz Medical Campus..

More information

Cluster-Robust Inference

Cluster-Robust Inference Cluster-Robust Inference David Sovich Washington University in St. Louis Modern Empirical Corporate Finance Modern day ECF mainly focuses on obtaining unbiased or consistent point estimates (e.g. indentification!)

More information

A Practitioner s Guide to Cluster-Robust Inference

A Practitioner s Guide to Cluster-Robust Inference A Practitioner s Guide to Cluster-Robust Inference A. C. Cameron and D. L. Miller presented by Federico Curci March 4, 2015 Cameron Miller Cluster Clinic II March 4, 2015 1 / 20 In the previous episode

More information

A Practitioner's Guide to Cluster-Robust Inference

A Practitioner's Guide to Cluster-Robust Inference A Practitioner's Guide to Cluster-Robust Inference A. Colin Cameron and Douglas L. Miller Department of Economics, University of California - Davis. This version: September 30, 2013 Abstract Note: Sections

More information

A Practitioner s Guide to Cluster-Robust Inference

A Practitioner s Guide to Cluster-Robust Inference A Practitioner s Guide to Cluster-Robust Inference A. Colin Cameron Douglas L. Miller Cameron and Miller abstract We consider statistical inference for regression when data are grouped into clusters, with

More information

A Course in Applied Econometrics Lecture 7: Cluster Sampling. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

A Course in Applied Econometrics Lecture 7: Cluster Sampling. Jeff Wooldridge IRP Lectures, UW Madison, August 2008 A Course in Applied Econometrics Lecture 7: Cluster Sampling Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. The Linear Model with Cluster Effects 2. Estimation with a Small Number of roups and

More information

Lecture 4: Linear panel models

Lecture 4: Linear panel models Lecture 4: Linear panel models Luc Behaghel PSE February 2009 Luc Behaghel (PSE) Lecture 4 February 2009 1 / 47 Introduction Panel = repeated observations of the same individuals (e.g., rms, workers, countries)

More information

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE Chapter 6. Panel Data Joan Llull Quantitative Statistical Methods II Barcelona GSE Introduction Chapter 6. Panel Data 2 Panel data The term panel data refers to data sets with repeated observations over

More information

Inference in difference-in-differences approaches

Inference in difference-in-differences approaches Inference in difference-in-differences approaches Mike Brewer (University of Essex & IFS) and Robert Joyce (IFS) PEPA is based at the IFS and CEMMAP Introduction Often want to estimate effects of policy/programme

More information

The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors

The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors by Bruce E. Hansen Department of Economics University of Wisconsin October 2018 Bruce Hansen (University of Wisconsin) Exact

More information

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43 Panel Data March 2, 212 () Applied Economoetrics: Topic March 2, 212 1 / 43 Overview Many economic applications involve panel data. Panel data has both cross-sectional and time series aspects. Regression

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 1 Jakub Mućk Econometrics of Panel Data Meeting # 1 1 / 31 Outline 1 Course outline 2 Panel data Advantages of Panel Data Limitations of Panel Data 3 Pooled

More information

11. Bootstrap Methods

11. Bootstrap Methods 11. Bootstrap Methods c A. Colin Cameron & Pravin K. Trivedi 2006 These transparencies were prepared in 20043. They can be used as an adjunct to Chapter 11 of our subsequent book Microeconometrics: Methods

More information

Finite Sample Performance of A Minimum Distance Estimator Under Weak Instruments

Finite Sample Performance of A Minimum Distance Estimator Under Weak Instruments Finite Sample Performance of A Minimum Distance Estimator Under Weak Instruments Tak Wai Chau February 20, 2014 Abstract This paper investigates the nite sample performance of a minimum distance estimator

More information

The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors

The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors by Bruce E. Hansen Department of Economics University of Wisconsin June 2017 Bruce Hansen (University of Wisconsin) Exact

More information

Empirical Methods in Applied Microeconomics

Empirical Methods in Applied Microeconomics Empirical Methods in Applied Microeconomics Jörn-Ste en Pischke LSE October 007 1 Nonstandard Standard Error Issues The discussion so far has concentrated on identi cation of the e ect of interest. Obviously,

More information

Clustering as a Design Problem

Clustering as a Design Problem Clustering as a Design Problem Alberto Abadie, Susan Athey, Guido Imbens, & Jeffrey Wooldridge Harvard-MIT Econometrics Seminar Cambridge, February 4, 2016 Adjusting standard errors for clustering is common

More information

Economics 582 Random Effects Estimation

Economics 582 Random Effects Estimation Economics 582 Random Effects Estimation Eric Zivot May 29, 2013 Random Effects Model Hence, the model can be re-written as = x 0 β + + [x ] = 0 (no endogeneity) [ x ] = = + x 0 β + + [x ] = 0 [ x ] = 0

More information

Day 2B Inference for Clustered Data: Part 2 With a Cross-section Data Example (Revised August 25 to add 14. Dyadic Clustering)

Day 2B Inference for Clustered Data: Part 2 With a Cross-section Data Example (Revised August 25 to add 14. Dyadic Clustering) Day 2B nference for Clustered Data: Part 2 With a Cross-section Data Example (Revised August 25 to add 14. Dyadic Clustering) A. Colin Cameron Univ. of Calif. - Davis... for Center of Labor Economics Norwegian

More information

Lecture 7: Dynamic panel models 2

Lecture 7: Dynamic panel models 2 Lecture 7: Dynamic panel models 2 Ragnar Nymoen Department of Economics, UiO 25 February 2010 Main issues and references The Arellano and Bond method for GMM estimation of dynamic panel data models A stepwise

More information

WILD CLUSTER BOOTSTRAP CONFIDENCE INTERVALS*

WILD CLUSTER BOOTSTRAP CONFIDENCE INTERVALS* L Actualité économique, Revue d analyse économique, vol 91, n os 1-2, mars-juin 2015 WILD CLUSTER BOOTSTRAP CONFIDENCE INTERVALS* James G MacKinnon Department of Economics Queen s University Abstract Confidence

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)

More information

dqd: A command for treatment effect estimation under alternative assumptions

dqd: A command for treatment effect estimation under alternative assumptions UC3M Working Papers Economics 14-07 April 2014 ISSN 2340-5031 Departamento de Economía Universidad Carlos III de Madrid Calle Madrid, 126 28903 Getafe (Spain) Fax (34) 916249875 dqd: A command for treatment

More information

IV Quantile Regression for Group-level Treatments, with an Application to the Distributional Effects of Trade

IV Quantile Regression for Group-level Treatments, with an Application to the Distributional Effects of Trade IV Quantile Regression for Group-level Treatments, with an Application to the Distributional Effects of Trade Denis Chetverikov Brad Larsen Christopher Palmer UCLA, Stanford and NBER, UC Berkeley September

More information

Econometric Analysis of Cross Section and Panel Data

Econometric Analysis of Cross Section and Panel Data Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND

More information

Inference about Clustering and Parametric. Assumptions in Covariance Matrix Estimation

Inference about Clustering and Parametric. Assumptions in Covariance Matrix Estimation Inference about Clustering and Parametric Assumptions in Covariance Matrix Estimation Mikko Packalen y Tony Wirjanto z 26 November 2010 Abstract Selecting an estimator for the variance covariance matrix

More information

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data Panel data Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data - possible to control for some unobserved heterogeneity - possible

More information

Econometrics II. Nonstandard Standard Error Issues: A Guide for the. Practitioner

Econometrics II. Nonstandard Standard Error Issues: A Guide for the. Practitioner Econometrics II Nonstandard Standard Error Issues: A Guide for the Practitioner Måns Söderbom 10 May 2011 Department of Economics, University of Gothenburg. Email: mans.soderbom@economics.gu.se. Web: www.economics.gu.se/soderbom,

More information

Motivation Non-linear Rational Expectations The Permanent Income Hypothesis The Log of Gravity Non-linear IV Estimation Summary.

Motivation Non-linear Rational Expectations The Permanent Income Hypothesis The Log of Gravity Non-linear IV Estimation Summary. Econometrics I Department of Economics Universidad Carlos III de Madrid Master in Industrial Economics and Markets Outline Motivation 1 Motivation 2 3 4 5 Motivation Hansen's contributions GMM was developed

More information

Chapter 2. Dynamic panel data models

Chapter 2. Dynamic panel data models Chapter 2. Dynamic panel data models School of Economics and Management - University of Geneva Christophe Hurlin, Université of Orléans University of Orléans April 2018 C. Hurlin (University of Orléans)

More information

Non-linear panel data modeling

Non-linear panel data modeling Non-linear panel data modeling Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini May 2010 Laura Magazzini (@univr.it) Non-linear panel data modeling May 2010 1

More information

Applied Microeconometrics (L5): Panel Data-Basics

Applied Microeconometrics (L5): Panel Data-Basics Applied Microeconometrics (L5): Panel Data-Basics Nicholas Giannakopoulos University of Patras Department of Economics ngias@upatras.gr November 10, 2015 Nicholas Giannakopoulos (UPatras) MSc Applied Economics

More information

INFERENCE WITH LARGE CLUSTERED DATASETS*

INFERENCE WITH LARGE CLUSTERED DATASETS* L Actualité économique, Revue d analyse économique, vol. 92, n o 4, décembre 2016 INFERENCE WITH LARGE CLUSTERED DATASETS* James G. MACKINNON Queen s University jgm@econ.queensu.ca Abstract Inference using

More information

Rewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35

Rewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35 Rewrap ECON 4135 November 18, 2011 () Rewrap ECON 4135 November 18, 2011 1 / 35 What should you now know? 1 What is econometrics? 2 Fundamental regression analysis 1 Bivariate regression 2 Multivariate

More information

Making sense of Econometrics: Basics

Making sense of Econometrics: Basics Making sense of Econometrics: Basics Lecture 4: Qualitative influences and Heteroskedasticity Egypt Scholars Economic Society November 1, 2014 Assignment & feedback enter classroom at http://b.socrative.com/login/student/

More information

FNCE 926 Empirical Methods in CF

FNCE 926 Empirical Methods in CF FNCE 926 Empirical Methods in CF Lecture 11 Standard Errors & Misc. Professor Todd Gormley Announcements Exercise #4 is due Final exam will be in-class on April 26 q After today, only two more classes

More information

y it = α i + β 0 ix it + ε it (0.1) The panel data estimators for the linear model are all standard, either the application of OLS or GLS.

y it = α i + β 0 ix it + ε it (0.1) The panel data estimators for the linear model are all standard, either the application of OLS or GLS. 0.1. Panel Data. Suppose we have a panel of data for groups (e.g. people, countries or regions) i =1, 2,..., N over time periods t =1, 2,..., T on a dependent variable y it and a kx1 vector of independent

More information

QED. Queen s Economics Department Working Paper No. 1355

QED. Queen s Economics Department Working Paper No. 1355 QED Queen s Economics Department Working Paper No 1355 Randomization Inference for Difference-in-Differences with Few Treated Clusters James G MacKinnon Queen s University Matthew D Webb Carleton University

More information

Least Squares Estimation-Finite-Sample Properties

Least Squares Estimation-Finite-Sample Properties Least Squares Estimation-Finite-Sample Properties Ping Yu School of Economics and Finance The University of Hong Kong Ping Yu (HKU) Finite-Sample 1 / 29 Terminology and Assumptions 1 Terminology and Assumptions

More information

Economics 671: Applied Econometrics Department of Economics, Finance and Legal Studies University of Alabama

Economics 671: Applied Econometrics Department of Economics, Finance and Legal Studies University of Alabama Problem Set #1 (Random Data Generation) 1. Generate =500random numbers from both the uniform 1 ( [0 1], uniformbetween zero and one) and exponential exp ( ) (set =2and let [0 1]) distributions. Plot the

More information

Wild Cluster Bootstrap Confidence Intervals

Wild Cluster Bootstrap Confidence Intervals Wild Cluster Bootstrap Confidence Intervals James G MacKinnon Department of Economics Queen s University Kingston, Ontario, Canada K7L 3N6 jgm@econqueensuca http://wwweconqueensuca/faculty/macinnon/ Abstract

More information

1 Estimation of Persistent Dynamic Panel Data. Motivation

1 Estimation of Persistent Dynamic Panel Data. Motivation 1 Estimation of Persistent Dynamic Panel Data. Motivation Consider the following Dynamic Panel Data (DPD) model y it = y it 1 ρ + x it β + µ i + v it (1.1) with i = {1, 2,..., N} denoting the individual

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 6 Jakub Mućk Econometrics of Panel Data Meeting # 6 1 / 36 Outline 1 The First-Difference (FD) estimator 2 Dynamic panel data models 3 The Anderson and Hsiao

More information

Modified Variance Ratio Test for Autocorrelation in the Presence of Heteroskedasticity

Modified Variance Ratio Test for Autocorrelation in the Presence of Heteroskedasticity The Lahore Journal of Economics 23 : 1 (Summer 2018): pp. 1 19 Modified Variance Ratio Test for Autocorrelation in the Presence of Heteroskedasticity Sohail Chand * and Nuzhat Aftab ** Abstract Given that

More information

Exam D0M61A Advanced econometrics

Exam D0M61A Advanced econometrics Exam D0M61A Advanced econometrics 19 January 2009, 9 12am Question 1 (5 pts.) Consider the wage function w i = β 0 + β 1 S i + β 2 E i + β 0 3h i + ε i, where w i is the log-wage of individual i, S i is

More information

Heteroskedasticity-Robust Inference in Finite Samples

Heteroskedasticity-Robust Inference in Finite Samples Heteroskedasticity-Robust Inference in Finite Samples Jerry Hausman and Christopher Palmer Massachusetts Institute of Technology December 011 Abstract Since the advent of heteroskedasticity-robust standard

More information

Intermediate Econometrics

Intermediate Econometrics Intermediate Econometrics Heteroskedasticity Text: Wooldridge, 8 July 17, 2011 Heteroskedasticity Assumption of homoskedasticity, Var(u i x i1,..., x ik ) = E(u 2 i x i1,..., x ik ) = σ 2. That is, the

More information

NBER WORKING PAPER SERIES ROBUST STANDARD ERRORS IN SMALL SAMPLES: SOME PRACTICAL ADVICE. Guido W. Imbens Michal Kolesar

NBER WORKING PAPER SERIES ROBUST STANDARD ERRORS IN SMALL SAMPLES: SOME PRACTICAL ADVICE. Guido W. Imbens Michal Kolesar NBER WORKING PAPER SERIES ROBUST STANDARD ERRORS IN SMALL SAMPLES: SOME PRACTICAL ADVICE Guido W. Imbens Michal Kolesar Working Paper 18478 http://www.nber.org/papers/w18478 NATIONAL BUREAU OF ECONOMIC

More information

Topic 7: Heteroskedasticity

Topic 7: Heteroskedasticity Topic 7: Heteroskedasticity Advanced Econometrics (I Dong Chen School of Economics, Peking University Introduction If the disturbance variance is not constant across observations, the regression is heteroskedastic

More information

Inference in Differences-in-Differences with Few Treated Groups and Heteroskedasticity

Inference in Differences-in-Differences with Few Treated Groups and Heteroskedasticity Inference in Differences-in-Differences with Few Treated Groups and Heteroskedasticity Bruno Ferman Cristine Pinto Sao Paulo School of Economics - FGV First Draft: October, 205 This Draft: October, 207

More information

Heteroskedasticity. We now consider the implications of relaxing the assumption that the conditional

Heteroskedasticity. We now consider the implications of relaxing the assumption that the conditional Heteroskedasticity We now consider the implications of relaxing the assumption that the conditional variance V (u i x i ) = σ 2 is common to all observations i = 1,..., In many applications, we may suspect

More information

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. The Basic Methodology 2. How Should We View Uncertainty in DD Settings?

More information

Gravity Models, PPML Estimation and the Bias of the Robust Standard Errors

Gravity Models, PPML Estimation and the Bias of the Robust Standard Errors Gravity Models, PPML Estimation and the Bias of the Robust Standard Errors Michael Pfaffermayr August 23, 2018 Abstract In gravity models with exporter and importer dummies the robust standard errors of

More information

BOOTSTRAPPING DIFFERENCES-IN-DIFFERENCES ESTIMATES

BOOTSTRAPPING DIFFERENCES-IN-DIFFERENCES ESTIMATES BOOTSTRAPPING DIFFERENCES-IN-DIFFERENCES ESTIMATES Bertrand Hounkannounon Université de Montréal, CIREQ December 2011 Abstract This paper re-examines the analysis of differences-in-differences estimators

More information

Small sample methods for cluster-robust variance estimation and hypothesis testing in fixed effects models

Small sample methods for cluster-robust variance estimation and hypothesis testing in fixed effects models Small sample methods for cluster-robust variance estimation and hypothesis testing in fixed effects models arxiv:1601.01981v1 [stat.me] 8 Jan 2016 James E. Pustejovsky Department of Educational Psychology

More information

Estimating the Variance of Decomposition Effects

Estimating the Variance of Decomposition Effects City University of New York (CUNY) CUNY Academic Works Economics Working Papers CUNY Academic Works 2015 Estimating the Variance of Decomposition Effects Takuya Hasebe Sophia University Follow this and

More information

Econ 836 Final Exam. 2 w N 2 u N 2. 2 v N

Econ 836 Final Exam. 2 w N 2 u N 2. 2 v N 1) [4 points] Let Econ 836 Final Exam Y Xβ+ ε, X w+ u, w N w~ N(, σi ), u N u~ N(, σi ), ε N ε~ Nu ( γσ, I ), where X is a just one column. Let denote the OLS estimator, and define residuals e as e Y X.

More information

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63 1 / 63 Panel Data Models Chapter 5 Financial Econometrics Michael Hauser WS17/18 2 / 63 Content Data structures: Times series, cross sectional, panel data, pooled data Static linear panel data models:

More information

Multiple Equation GMM with Common Coefficients: Panel Data

Multiple Equation GMM with Common Coefficients: Panel Data Multiple Equation GMM with Common Coefficients: Panel Data Eric Zivot Winter 2013 Multi-equation GMM with common coefficients Example (panel wage equation) 69 = + 69 + + 69 + 1 80 = + 80 + + 80 + 2 Note:

More information

A Direct Test for Consistency of Random Effects Models that Outperforms the Hausman Test

A Direct Test for Consistency of Random Effects Models that Outperforms the Hausman Test A Direct Test for Consistency of Random Effects Models that Outperforms the Hausman Test Preliminary Version: This paper is under active development. Results and conclusions may change as research progresses.

More information

Environmental Econometrics

Environmental Econometrics Environmental Econometrics Syngjoo Choi Fall 2008 Environmental Econometrics (GR03) Fall 2008 1 / 37 Syllabus I This is an introductory econometrics course which assumes no prior knowledge on econometrics;

More information

Small sample methods for cluster-robust variance estimation and hypothesis testing in fixed effects models

Small sample methods for cluster-robust variance estimation and hypothesis testing in fixed effects models Small sample methods for cluster-robust variance estimation and hypothesis testing in fixed effects models James E. Pustejovsky and Elizabeth Tipton July 18, 2016 Abstract In panel data models and other

More information

ECONOMETRICS FIELD EXAM Michigan State University May 9, 2008

ECONOMETRICS FIELD EXAM Michigan State University May 9, 2008 ECONOMETRICS FIELD EXAM Michigan State University May 9, 2008 Instructions: Answer all four (4) questions. Point totals for each question are given in parenthesis; there are 00 points possible. Within

More information

Difference-in-Differences Estimation

Difference-in-Differences Estimation Difference-in-Differences Estimation Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. The Basic Methodology 2. How Should We

More information

Econometrics - 30C00200

Econometrics - 30C00200 Econometrics - 30C00200 Lecture 11: Heteroskedasticity Antti Saastamoinen VATT Institute for Economic Research Fall 2015 30C00200 Lecture 11: Heteroskedasticity 12.10.2015 Aalto University School of Business

More information

4.8 Instrumental Variables

4.8 Instrumental Variables 4.8. INSTRUMENTAL VARIABLES 35 4.8 Instrumental Variables A major complication that is emphasized in microeconometrics is the possibility of inconsistent parameter estimation due to endogenous regressors.

More information

Short T Panels - Review

Short T Panels - Review Short T Panels - Review We have looked at methods for estimating parameters on time-varying explanatory variables consistently in panels with many cross-section observation units but a small number of

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Endogeneity b) Instrumental

More information

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Ninth ARTNeT Capacity Building Workshop for Trade Research Trade Flows and Trade Policy Analysis Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric

More information

Efficient Estimation of Dynamic Panel Data Models: Alternative Assumptions and Simplified Estimation

Efficient Estimation of Dynamic Panel Data Models: Alternative Assumptions and Simplified Estimation Efficient Estimation of Dynamic Panel Data Models: Alternative Assumptions and Simplified Estimation Seung C. Ahn Arizona State University, Tempe, AZ 85187, USA Peter Schmidt * Michigan State University,

More information

Chapter 1. GMM: Basic Concepts

Chapter 1. GMM: Basic Concepts Chapter 1. GMM: Basic Concepts Contents 1 Motivating Examples 1 1.1 Instrumental variable estimator....................... 1 1.2 Estimating parameters in monetary policy rules.............. 2 1.3 Estimating

More information

Inference for Clustered Data

Inference for Clustered Data Inference for Clustered Data Chang Hyung Lee and Douglas G. Steigerwald Department of Economics University of California, Santa Barbara March 23, 2017 Abstract This article introduces eclust, a new Stata

More information

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018 Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate

More information

PANEL DATA RANDOM AND FIXED EFFECTS MODEL. Professor Menelaos Karanasos. December Panel Data (Institute) PANEL DATA December / 1

PANEL DATA RANDOM AND FIXED EFFECTS MODEL. Professor Menelaos Karanasos. December Panel Data (Institute) PANEL DATA December / 1 PANEL DATA RANDOM AND FIXED EFFECTS MODEL Professor Menelaos Karanasos December 2011 PANEL DATA Notation y it is the value of the dependent variable for cross-section unit i at time t where i = 1,...,

More information

the error term could vary over the observations, in ways that are related

the error term could vary over the observations, in ways that are related Heteroskedasticity We now consider the implications of relaxing the assumption that the conditional variance Var(u i x i ) = σ 2 is common to all observations i = 1,..., n In many applications, we may

More information

E cient Estimation and Inference for Di erence-in-di erence Regressions with Persistent Errors

E cient Estimation and Inference for Di erence-in-di erence Regressions with Persistent Errors E cient Estimation and Inference for Di erence-in-di erence Regressions with Persistent Errors Ryan Greenaway-McGrevy Bureau of Economic Analysis Chirok Han Korea University January 2014 Donggyu Sul Univeristy

More information

G. S. Maddala Kajal Lahiri. WILEY A John Wiley and Sons, Ltd., Publication

G. S. Maddala Kajal Lahiri. WILEY A John Wiley and Sons, Ltd., Publication G. S. Maddala Kajal Lahiri WILEY A John Wiley and Sons, Ltd., Publication TEMT Foreword Preface to the Fourth Edition xvii xix Part I Introduction and the Linear Regression Model 1 CHAPTER 1 What is Econometrics?

More information

x i = 1 yi 2 = 55 with N = 30. Use the above sample information to answer all the following questions. Show explicitly all formulas and calculations.

x i = 1 yi 2 = 55 with N = 30. Use the above sample information to answer all the following questions. Show explicitly all formulas and calculations. Exercises for the course of Econometrics Introduction 1. () A researcher is using data for a sample of 30 observations to investigate the relationship between some dependent variable y i and independent

More information

Introduction to Linear Regression Analysis

Introduction to Linear Regression Analysis Introduction to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018 Econometrics: What is it? Interaction of economic theory, observed data and statistical methods. The science of testing

More information

Robust Standard Errors in Small Samples: Some Practical Advice

Robust Standard Errors in Small Samples: Some Practical Advice Robust Standard Errors in Small Samples: Some Practical Advice Guido W. Imbens Michal Kolesár First Draft: October 2012 This Draft: December 2014 Abstract In this paper we discuss the properties of confidence

More information

Economics 241B Estimation with Instruments

Economics 241B Estimation with Instruments Economics 241B Estimation with Instruments Measurement Error Measurement error is de ned as the error resulting from the measurement of a variable. At some level, every variable is measured with error.

More information

12E016. Econometric Methods II 6 ECTS. Overview and Objectives

12E016. Econometric Methods II 6 ECTS. Overview and Objectives Overview and Objectives This course builds on and further extends the econometric and statistical content studied in the first quarter, with a special focus on techniques relevant to the specific field

More information

EC327: Advanced Econometrics, Spring 2007

EC327: Advanced Econometrics, Spring 2007 EC327: Advanced Econometrics, Spring 2007 Wooldridge, Introductory Econometrics (3rd ed, 2006) Chapter 14: Advanced panel data methods Fixed effects estimators We discussed the first difference (FD) model

More information

Discrete Dependent Variable Models

Discrete Dependent Variable Models Discrete Dependent Variable Models James J. Heckman University of Chicago This draft, April 10, 2006 Here s the general approach of this lecture: Economic model Decision rule (e.g. utility maximization)

More information

ADJUSTED POWER ESTIMATES IN. Ji Zhang. Biostatistics and Research Data Systems. Merck Research Laboratories. Rahway, NJ

ADJUSTED POWER ESTIMATES IN. Ji Zhang. Biostatistics and Research Data Systems. Merck Research Laboratories. Rahway, NJ ADJUSTED POWER ESTIMATES IN MONTE CARLO EXPERIMENTS Ji Zhang Biostatistics and Research Data Systems Merck Research Laboratories Rahway, NJ 07065-0914 and Dennis D. Boos Department of Statistics, North

More information

Advanced Econometrics

Advanced Econometrics Based on the textbook by Verbeek: A Guide to Modern Econometrics Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna May 16, 2013 Outline Univariate

More information

Basic Regressions and Panel Data in Stata

Basic Regressions and Panel Data in Stata Developing Trade Consultants Policy Research Capacity Building Basic Regressions and Panel Data in Stata Ben Shepherd Principal, Developing Trade Consultants 1 Basic regressions } Stata s regress command

More information

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i, A Course in Applied Econometrics Lecture 18: Missing Data Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. When Can Missing Data be Ignored? 2. Inverse Probability Weighting 3. Imputation 4. Heckman-Type

More information

Day 3B Nonparametrics and Bootstrap

Day 3B Nonparametrics and Bootstrap Day 3B Nonparametrics and Bootstrap c A. Colin Cameron Univ. of Calif.- Davis Frontiers in Econometrics Bavarian Graduate Program in Economics. Based on A. Colin Cameron and Pravin K. Trivedi (2009,2010),

More information

Econ 582 Fixed Effects Estimation of Panel Data

Econ 582 Fixed Effects Estimation of Panel Data Econ 582 Fixed Effects Estimation of Panel Data Eric Zivot May 28, 2012 Panel Data Framework = x 0 β + = 1 (individuals); =1 (time periods) y 1 = X β ( ) ( 1) + ε Main question: Is x uncorrelated with?

More information

Bootstrapping the Grainger Causality Test With Integrated Data

Bootstrapping the Grainger Causality Test With Integrated Data Bootstrapping the Grainger Causality Test With Integrated Data Richard Ti n University of Reading July 26, 2006 Abstract A Monte-carlo experiment is conducted to investigate the small sample performance

More information

A Course on Advanced Econometrics

A Course on Advanced Econometrics A Course on Advanced Econometrics Yongmiao Hong The Ernest S. Liu Professor of Economics & International Studies Cornell University Course Introduction: Modern economies are full of uncertainties and risk.

More information

An overview of applied econometrics

An overview of applied econometrics An overview of applied econometrics Jo Thori Lind September 4, 2011 1 Introduction This note is intended as a brief overview of what is necessary to read and understand journal articles with empirical

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 2 Jakub Mućk Econometrics of Panel Data Meeting # 2 1 / 26 Outline 1 Fixed effects model The Least Squares Dummy Variable Estimator The Fixed Effect (Within

More information