Robust Inference with Multi-way Clustering
|
|
- Annabel McCoy
- 6 years ago
- Views:
Transcription
1 Robust Inference with Multi-way Clustering Colin Cameron, Jonah Gelbach, Doug Miller U.C. - Davis, U. Arizona, U.C. - Davis February 2010 February / 44
2 1.1 Introduction Moulton (1986, 1990) and Bertrand, Duo, and Mullainathan (2004) showed the importance of controlling for clustering. Failure to control underestimates OLS standard errors and overstates t statistics. Initially use one-way random eects model. Now use cluster-robust standard errors (White (1984), Arellano (1987), Liang and Zeger (1986), Rogers (1993)). February / 44
3 This paper extends to cluster-robust in two (nonnested) dimensions Example 1: More than one grouped regressor. Example 2: Cross-section unit and time for panel data. Outline of presentation Lengthy discussion of one-way cluster-robust. Present method for two-way cluster-robust. Simulation and application. Conclusion. February / 44
4 1.2 OLS with Cluster Errors Model for G clusters with N g individuals per cluster: OLS estimator y ig = x 0 ig β + u ig, i = 1,..., N g, g = 1,..., G, y g = X g β + u g, g = 1,..., G, y = Xβ + u, bβ = ( G g=1 N g i=1 x ig x 0 ig ) 1 ( G g=1 N g i=1 x ig y ig ) = ( G g=1 X0 g X g ) 1 ( G g=1 X g y g ) = (X 0 X) 1 X 0 y. February / 44
5 OLS with Clustered Errors (continued) As usual bβ = β + (X 0 X) 1 X 0 u = β + (X 0 X) 1 ( G g=1 X g u g ). Assume independence over g with u g [0, Σ g = E[u g ug 0 ]]. Then β b a N [β, V[ β]] b with V[ b β] = (X 0 X) 1 ( G g=1 X g Σ g X 0 g )(X 0 X) 1. February / 44
6 OLS with Clustered Errors (continued) If ignore clustering the default OLS variance estimate should be inated by approximately where τ j ' 1 + ρ xj ρ u ( N g 1), ρ xj is the within cluster correlation of x j ρ u is the within cluster error correlation N g is the average cluster size. Kloek (1981), Scott and Holt (1982). Moulton (1986, 1990) showed that could be large even if ρ u small. e.g. N G = 81, ρ x = 1 and ρ u = 0.1 then τ j = 9! So should correct for clustering - but Σ g is unknown. February / 44
7 1.3 Random eects approach The original way to obtain consistent variance matrix estimate. Assume a random eects (RE) model u ig = α g + ε ig α g iid[0, σ 2 α] ε ig iid[0, σ 2 ε ]. Then Σ g = σ 2 ui Ng + σ 2 αe Ng e 0 N g and we use bv RE [ b β] = (X 0 X) 1 ( G g=1 X g b Σ g X 0 g )(X 0 X) 1, where bσ g = bσ 2 ui Ng + bσ 2 αe Ng e 0 N g, and bσ 2 ε and bσ 2 α are consistent. Weakness is strong distributional assumptions. February / 44
8 1.4 Cluster-Robust Variance Estimates The current method to obtain variance estimates. The cluster-robust variance estimate (CRVE) bv CR [ b β] = (X 0 X) 1 ( G g=1 X g eu g eu 0 g X 0 g )(X 0 X) 1, provides a consistent estimator of V CR [ b β] if plim 1 G G g=1 X g eu g eu g 0 X 0 g = plim 1 G G g=1 X g Σ g X 0 g. bv CR [ b β] yields cluster-robust standard errors. If OLS residuals are used then eu g = bu g = y g X g b β. Then bv CR [ b β] is downwards-biased for V[ b β]. February / 44
9 Cluster-Robust Variance Estimates (continued) The CRVE was proposed by Liang and Zeger (1986) for grouped data proposed by Arellano (1987) for FE estimator for short panels (where the grouping is on the individual) popularized by incorporation in Stata as the cluster option (Rogers (1993)). Bertrand, Duo and Mullainathan (2004) in a much-cited paper noted that with state-year data many studies were erroneously clustering on state-year (implying independence over time) rather than clustering on just state. February / 44
10 1.5 CRVE with few clusters CRVE assumes G!. What if G is small? Stata uses eu g = p cbu g where c = critical values. Reasonable starting point. G G 1 N 1 N k ' G G 1 and T (G 1) Donald and Lang (2007) do asymptotic theory when G is small and N g!. Then a two-step FGLS RE estimator yields t-test that is T (G 2) under some assumptions. Ibragimov and Muller (2010) do asymptotic theory when G is small and N g! and only within-group variation is relevant. Then separately estimate b β g s and average. Cameron, Gelbach and Miller (2007) propose cluster bootstraps with asymptotic renement. February / 44
11 1.6 Wild Cluster Bootstrap-T Procedure Test H 0 : β 1 = β 0 1 against H a : β 1 6= β 0 1 using w = ( bβ 1 β 0 1 )/s bβ 1. 1 Obtain the OLS estimator b β and OLS residuals bu g, g = 1,..., G. [Best to use residuals that impose H 0 ]. 2 Do B iterations of this step. On the b th iteration: 1 For each cluster g = 1,..., G, form bu g = bu g or bu g = bu g each with probability 0.5 and hence form by g = X 0 g b β + bu g. This yields wild cluster bootstrap resample f(by 1, X 1),..., (by G, X G )g. 2 Calculate the OLS estimate bβ 1,b and its standard error s bβ 1,b and given these form the Wald test statistic w b = ( bβ 1,b 3 Reject H 0 at level α if and only if bβ 1 )/s b β 1,b. w < w [α/2] or w > w [1 α/2], where w [q] denotes the qth quantile of w 1,..., w B. February / 44
12 1.7 Other work on clustering Wooldridge (2003) survey. Bhattacharya (2005) combines stratication and clustering, and in a GMM setting. Christian Hansen (2007a) shows that even for long panels it is okay to use CRVE with FE estimator (as shown in simulations by Krezde (2002)). Christian Hansen (2007b) shows that for short panels with AR(p) errors bias-corrected FGLS does well. Hausman and Kuersteiner (2005) do qualitatively similar small-t correction for Kiefer (1980) FGLS estimator in panel. Conley (1998) analyzes GMM with spatial correlation. Cameron and Miller (2010) show that degrees-of-freedom corrections can lead to cluster-robust variance estimates diering substantially in LSDV versus mean-dierenced model if T is small. February / 44
13 2.0 Motivating example How do job injury rates eect wages? Hersch (1998). CPS individual data on male wages N = But there is no individual data on job injury rate. Instead aggregated data: data on industry injury rates for 211 industries data on occupations injury rates for 387 occupations. Model estimated is What should we do? y igh = α + x 0 igh β + γ rind ig + δ rocc ih + u igh. GLS two-way random eects: u igh = ε g + ε h + ε igh ; ε g, ε h, ε igh i.i.d. Strong assumptions. OLS then cluster on industry for bγ and cluster on occupation for bδ. Ad hoc. February / 44
14 2.1 One-way clustering Return to one-way clustering before extending to two-way. Clustering on group g (for g = 1,..., G ) with N g individuals in group g : y ig = xig 0 β + u ig E[u ig u jg 0jx ig, x jg 0] = 0, for i 6= j unless g = g 0. Then informally the estimated asymptotic variance matrix of b β OLS, denoted Avar[ b β], is where Avar[ b β] = (X 0 X) 1 bb(x 0 X) 1 February / 44
15 One-way Clustering bb = G g=1 X 0 g bu g bu 0 g X g 2 bu 1 bu = X 0 0 bu 2 bu X 0 bu G bu G E = X 0 bubu0. 0 E C 5A X 0 E G = X 0 (bubu 0. S G )X, February / 44
16 2.2 Two-way Clustering With g denoting rst group and h denoting second group y igh = x 0 igh β+u igh, E[u igh u jg 0 h 0jx igh, x jg 0 h 0] = 0, for i 6= j unless g = g 0 or h = h 0. Then use bb = X 0 (bubu 0. S GH )X, (1) where S GH is an N N indicator matrix with ij th entry equal to one if the i th and j th observation share any cluster and equal to zero otherwise. February / 44
17 Two-way Clustering Dene three N N indicator matrices: S G with ij th entry equal to one if the i th and j th observation belong to the same cluster g 2 f1, 2,..., G g S H with ij th entry equal to one if the i th and j th observation belong to the same cluster h 2 f1, 2,..., Hg S G \H with ij th entry equal to one if the i th and j th observation belong to both the same cluster g 2 f1, 2,..., G g and the same cluster h 2 f1, 2,..., Hg. Then so S GH = S G + S H S G \H, bb = X 0 (bubu 0. S G )X + X 0 (bubu 0. S H )X X 0 (bubu 0. S G \H. X), February / 44
18 Two-way Clustering Hence Avar[ b β] = (X 0 X) 1 X 0 (bubu 0. S G )X(X 0 X) 1 (2) +(X 0 X) 1 X 0 (bubu 0. S H )X(X 0 X) 1 (X 0 X) 1 X 0 (bubu 0. S G \H )X(X 0 X) 1. The three components can be separately computed by (1) OLS regression of y on X with standard errors computed using clustering on g 2 f1, 2,..., G g; (2) OLS regression of y on X with standard errors computed using clustering on h 2 f1, 2,..., Hg; and (3) OLS regression of y on X with standard errors computed using clustering on (g, h) 2 f(1, 1),..., (G, H)g Stata add-on cgmreg.ado does this. February / 44
19 Two-way Clustering: other research This method has been independently proposed by others (we began in 2004). Applied: Acemoglu and Pischke (2003): cluster on individual and region-by-time. Applied: Fafchamps and Gubert (2006): cluster on person 1 and person 2 in person-pair data (dyadic) for networks. Theory: Thompson (2006): cluster on rm on time. Theory: Miglioretti and Heagarty (2006): cluster on person, radiologist, and facility. Theory: Conley (1998): cluster on space and time (dierent asymptotic theory). February / 44
20 2.3 Practical Considerations If bv[ b β] is not positive-denite (small G, H) then Decompose bv[ β] b = UΛU 0 ; U contains eigenvectors of bv, and Λ = Diag[λ 1,..., λ d ] contains eigenvalues. Create Λ + = Diag[λ 1 +,..., λ+ d ], with λ+ j = max 0, λ j, and use bv + [ b β] = UΛ + U 0. Fixed eects in one or both dimensions We do not formally address this complication Intuitively if G! and H! then each xed eect is estimated using many observations. In practice the main consequence of including xed eects is a reduction in within-cluster correlation. February / 44
21 2.4 Multi-way clustering Extends to multi-way. e.g. 7 variance estimates for three-way clustering and bb may be written as B e (1,0,0) + eb (0,1,0) + eb (0,0,1) B e (1,1,0) + eb (1,0,1) + eb (0,1,1) + eb (1,1,1 Extends to m-estimators and GMM e.g. For probit three components can be separately computed by (1) probit regression with standard errors computed using clustering on g 2 f1, 2,..., G g; (2) probit regression with standard errors computed using clustering on h 2 f1, 2,..., Hg; and (3) probit regression with standard errors computed using clustering on (g, h) 2 f(1, 1),..., (G, H)g Then add (1) and (2) and subtract (3). February / 44
22 2.5 GMM M-estimator solves N i=1 h i (bθ) = 0. Under standard assumptions, bθ is asymptotically normal with bv[bθ] = ba 1 bbba 1, h where ba = i i, and bb is an estimate of b V[ i h i ]. θ θ For independence over i For one-way clustering bb = bb = N N i=1 j=1 N bh i bh i. 0 i=1 bh i bh 0 ji (i, j) where I (i, j) = 1 if i and j are in the same cluster. February / 44
23 GMM Two-way clustering For two-way clustering bb = N N i=1 j=1 bh i bh 0 ji 1 (i, j) + N N i=1 j=1 bh i bh 0 ji 2 (i, j) N N i=1 j=1 bh i bh 0 ji 12 (i, j) where I 1 (i, j) = 1 if i and j are in the same cluster in dimension 1 and I 2 (i, j) = 1 if i and j are in the same cluster in dimension 2 and I 12 (i, j) = 1 if i and j are in the same cluster in dimensions 1 and 2. February / 44
24 3.1.1 Dgp with no clustering Simulations are for one observation per (h, g) combination. So HG observations and can drop the subscript i. Balanced and equal size groups H = G = 10, 20,..., 100 plus some cases H 6= G. Dgp is y gh = β 0 + β 1 x 1gh + β 2 x 2gh + α g + δ h + ε gh where β 0 = β 1 = β 2 = 1 and x ig, x ih, α g, δ h and ε igh are all iid N[0, 1]. February / 44
25 Dgp with no clustering Compute actual rejection rates for 5% two-sided Wald tests of β 1 = 1 and β 2 = 1. Dierent standard error estimates lead to dierent Wald tests. Methods used are 1. Assume iid errors 2. One-way cluster on g 3. Two-way random eects 4. Two-way cluster robust with T critical values. All methods should work with no clustering. February / 44
26 Dgp with no clustering All methods work ne with G = H = 100 Small sample over-rejection greatest for two-way cluster-robust, though diminished if use T (min(g, H) 1) degrees of freedom.. February / 44
27 Table 1 Rejection probabilities for a true null hypothesis True model: iid errors Number of Group Number of Group 1 Clusters 2 Clusters Assume iid errors Assumption about errors in construction of Variance One-way cluster robust (cluster on group1) Two-way random effects Two-way clusterrobust Two-way clusterrobust, T critical values % 6.7% 9.0% 9.7% 4.4% 5.8% 13.3% 14.9% 9.9% 11.5% % 4.8% 7.9% 5.7% 5.9% 4.9% 9.0% 7.6% 7.2% 5.9% % 4.8% 5.9% 5.9% 5.1% 5.0% 6.8% 7.1% 6.1% 6.1% % 4.8% 5.8% 5.5% 5.3% 4.7% 6.6% 6.1% 5.5% 5.4% % 5.3% 5.7% 6.1% 5.2% 5.8% 5.7% 6.5% 4.9% 5.6% % 5.6% 5.7% 5.8% 4.9% 5.4% 6.5% 6.0% 6.0% 5.2% % 5.3% 4.3% 5.6% 4.9% 5.5% 5.9% 6.0% 5.7% 5.7% % 5.3% 5.2% 6.0% 4.7% 5.2% 6.0% 6.4% 5.6% 5.9% % 4.8% 6.2% 5.0% 5.7% 4.8% 6.1% 4.6% 5.6% 4.3% % 5.1% 4.9% 5.3% 4.7% 5.2% 5.1% 6.1% 4.9% 5.7% % 4.0% 8.6% 7.3% 5.5% 4.1% 8.9% 9.7% 5.9% 6.8% % 5.1% 6.2% 7.3% 5.3% 5.1% 6.7% 7.4% 5.2% 5.8% % 5.1% 7.0% 7.7% 4.7% 5.0% 8.9% 8.9% 5.9% 5.8% % 3.8% 5.8% 5.0% 3.9% 3.7% 6.7% 7.7% 5.4% 5.9% % 5.0% 5.5% 5.4% 5.0% 4.9% 5.9% 6.5% 5.2% 5.7% Note: The null hypothesis should be rejected 5% of the time. Number of monte carlo simulations is 2000, except that for methods 1-3 it is 1000 when G*H > February / 44
28 3.1.2 Dgp with two-way clustered homoskedastic errors Dgp is y gh = β 0 + β 1 x 1gh + β 2 x 2gh + α g + δ h + u gh where now u gh = ε g + ε h + ε gh and each error is iid N [0, 1] and x 1gh = z g + z gh where each is iid N [0, 1] and x 2gh is similar. Then only the two-way methods work. Again two-way random eects does best as this is situation it is intended for. February / 44
29 Table 2 Rejection probabilities for a true null hypothesis True model: random effects on both Group1 and Group 2 Assumption about errors in construction of Variance Number of Group Number of Group 1 Clusters 2 Clusters Assume iid errors One-way cluster robust (cluster on group1) Two-way random effects Two-way clusterrobust Two-way clusterrobust, T critical values % 23.7% 13.7% 33.2% 6.2% 6.6% 17.4% 17.4% 12.6% 13.4% % 32.3% 8.6% 42.7% 5.7% 5.3% 10.3% 9.6% 8.7% 7.6% % 41.7% 7.4% 50.6% 5.2% 5.6% 8.6% 9.1% 7.8% 7.7% % 47.6% 8.7% 55.2% 6.5% 5.4% 9.0% 9.3% 7.9% 8.1% % 50.4% 6.0% 58.8% 4.9% 4.8% 7.0% 6.7% 6.3% 6.2% % 56.3% 6.4% 64.1% 5.6% 6.5% 6.7% 6.1% 5.9% 5.6% % 57.6% 5.5% 64.8% 4.8% 6.0% 6.4% 6.5% 5.9% 6.0% % 60.9% 6.5% 67.0% 4.9% 4.7% 6.3% 7.0% 5.7% 6.5% % 60.7% 5.6% 67.9% 5.0% 5.4% 6.0% 6.7% 5.8% 6.3% % 60.4% 5.8% 67.9% 5.3% 3.6% 6.4% 5.3% 6.1% 5.1% % 21.3% 13.3% 33.4% 8.9% 4.0% 15.0% 9.3% 10.2% 5.8% % 33.1% 9.8% 44.5% 6.7% 4.5% 9.3% 8.1% 8.2% 6.2% % 21.0% 14.1% 31.7% 10.4% 3.3% 14.2% 8.1% 9.2% 4.6% % 33.9% 10.0% 43.7% 6.2% 3.7% 9.2% 6.3% 7.7% 4.6% % 54.0% 6.6% 60.5% 5.3% 3.6% 6.9% 6.9% 6.4% 6.1% Note: See Table 1. February / 44
30 3.1.3 Dgp with two-way clustered heteroskedastic errors Dgp is y gh = β 0 + β 1 x 1gh + β 2 x 2gh + α g + δ h + u gh where u gh = ε g + ε h + ε gh and each error is iid N [0, 1] EXCEPT now ε gh is N [0, jx 1gh x 2gh j] and x 1gh = z g + z gh where each is iid N [0, 1] and x 2gh is similar. Then only the two-way cluster-robust method works. February / 44
31 Table 1 Rejection probabilities for a true null hypothesis True model: a random effect common to each group, and a heteroscedastic component. Assumption about errors in construction of Variance Number of Group 1 Clusters Number of Group 2 Clusters Assume independent errors One-way cluster robust (cluster on group1) Two-way random effects Two-way clusterrobust % 7.9% 15.7% 9.8% 15.9% 14.3% 18.4% 16.5% 14.5% 12.9% 8.6% 8.9% % 5.4% 9.5% 7.1% 13.0% 10.8% 11.9% 10.9% 10.3% 8.8% 7.1% 5.9% % 6.9% 7.0% 8.1% 9.7% 10.8% 8.2% 9.2% 7.1% 8.0% 6.2% 6.0% % 8.2% 6.0% 8.5% 8.8% 9.4% 7.1% 7.0% 6.3% 6.5% 5.9% 5.7% % 10.5% 6.1% 11.2% 9.6% 9.0% 6.4% 6.9% 6.0% 6.4% 4.4% 5.4% % 5.6% 12.9% 8.8% 12.9% 12.3% 13.7% 9.8% 9.6% 5.9% 6.1% 6.0% % 7.5% 7.9% 8.1% 10.5% 11.5% 9.2% 8.6% 7.6% 6.6% 5.2% 6.7% % 6.4% 10.4% 9.4% 10.1% 13.0% 11.3% 10.0% 7.3% 6.8% 7.5% 6.2% % 5.3% 9.2% 6.7% 10.8% 10.1% 9.4% 6.4% 7.7% 4.5% 5.1% 6.2% % 8.1% 6.7% 8.7% 9.9% 10.0% 6.9% 6.8% 6.1% 6.2% 6.1% 5.2% Note: The null hypothesis should be rejected 5% of the time. Number of monte carlo simulations is Two-way clusterrobust, T critical values Group fixed effects, Two-way cluster-robust February / 44
32 3.2 Dgp with Time-State Correlation Extension of BDM (2004). Data on 1,358,623 women from CPS. Model is y ist = αd st + xist 0 β + δ s + γ t + u ist, where d st is placebo policy created to be correlated over s and t : Specically d st = dst s + 2dst t ; s = 1,..., 50 ordered states where dst s = 0.6dst s 1 + v st s, with v st s iid N [0, 1] and dst t = 0.6ds t 1,t + v st t with v st t iid N [0, 1], and also independent from other variables. Here the index s ranges Two-way cluster-robust is best when no xed eects included. No gain when xed eects included. So here xed eects are enough. February / 44
33 Table 2 Rejection probabilities for a true null hypothesis Monte Carlos with micro (CPS) data quartic in age, 4 education dummies quartic in age, 4 education dummies, state fixed effects RHS control variables quartic in age, 4 education dummies, year fixed effects quartic in age, 4 education dummies, state and year fixed effects Standard error estimator: Heteroscedasticity robust 91.6% 92.1% 82.2% 79.0% One-way cluster robust (cluster on state-by-year cell) 19.8% 22.4% 13.1% 13.9% One-way cluster robust (cluster on state) 16.2% 17.0% 12.0% 15.0% One-way cluster robust (cluster on year) 10.2% 8.9% 8.7% 7.6% Two-way cluster-robust (cluster on state and year) 7.2% 6.9% 7.6% 7.1% Note: Data come from 1.3 million employed women from the March CPS. Table reports rejection rates for testing a (true) null hypothesis of zero on the coefficient of fake treatments. The "treatments" are generated as (t = e_s + 2 e_t), with e_s a state-specific autoregressive component and e_t a year-specific "spatial" autoregressive component. The outcome is also modified by an independent yearspecific autoregressive component. See text for details Monte Carlo replications February / 44
34 4.1 Hersch - Cross-Section with Two-Way Clustering CPS data on male wages N = Separate data on industry and occupation injury rates. 211 injuries and 387 occupations. Model estimates is y igh = α + x 0 igh β + γ rind ig + δ rocc ih + u igh. Two-way clustering is best. C February / 44
35 Table 3 Replication of Hersch (1998) Industry Injury Rate Variable Occupation Injury Rate Estimated slope coefficient: Estimated standard errors Default (iid) (0.415) {0.0000} (0.235) {0.0478} and p-values: Heteroscedastic robust (0.397) {0.0000} (0.260) {0.0737} One-way cluster on Industry (0.643) {0.0032} (0.251) {0.0639} One-way cluster on Occupation (0.486) {0.0001} (0.363) {0.2002} Two-way clustering (0.702) {0.0070} (0.357) {0.1927} Note: Replication of Hersch (1998), pg 604, Table 3, Panel B, Column 4. Standard errors in parentheses. P- values from a test of each coefficient equal to zero in brackets. Data are 5960 observations on working men from the Current Population Survey. Both columns come from the same regression. There are 211 industries and 387 occupations in the data set. February / 44
36 4.3 Rose and Engel - Bilateral Trade Dyadic data: trade ows between pairs of countries. First column of Table 3 of Rose and Engel (2002). gravity model is tted for the natural logarithm of bilateral trade single cross-section on trade ows between 98 countries with 3262 unique country pairs. Cameron and Golotvina (2005) control for two-way clustering, using FGLS estimation based on iid country random eects. Here instead apply the more robust method to their paper. Focus on coecient of the log product of real GDP (= 0.867) heteroskedastic-robust standard error of average one-way clustered standard error of two-way robust standard error is if country specic eects are included as alternative way to control for the clustering, then the coecient of the log product of real GDP is no longer identied. February / 44
37 4.3 Gruber and Madrian - Rotating Panel March CPS data on 39,063 men. Model of retirement (y ist ) and health insurance coverage (d st ) Pr[y ist = 1] = Φ(αd st + xist 0 β + δ s + γ t ). Rotating design - many individuals appear in sample twice. February / 44
38 Table 6 Replication of Gruber and Madrian (1995) Model Probit OLS Estimated slope coefficient (* 1000): Estimated standard errors (* 1000): Default (iid) (5.709) (0.675) Heteroscedastic robust (5.732) (0.684) One-way cluster on state-year (6.265) (0.759) One-way cluster on household id (5.866) (0.702) One-way cluster on hhid-by-state-year (5.732) (0.685) Two-way clustering (6.389) (0.775) One-way cluster on State (6.030) (0.718) Note: Replication of Gruber and Madrian (1995), pg 943, Table 3, Model 1, Column 1. Standard errors in parentheses. Data are 39,063 observations on year-old men from the Current Population Surveys. February / 44
39 Conclusion Method is straightforward to implement. Works well except in smallest designs with G = H = 10. Many potential applications. Sometimes has big impact and sometimes modest. But researcher will not know this a priori. February / 44
40 Asymptotic theory Key is h B 0 = lim E G 1 H 2 N i=1 N j=1 z i (θ)z j (θ) 0i. where N i=1 z i (θ) = G g=1 H h=1 z gh (θ) and z gh (θ) = i2cgh z igh (θ) = i2cgh (y igh xigh 0 β)x igh. For two-way clustering since i z i (θ) = g h z gh (θ) we have h E G 1 H 2 N i=1 N j=1 z i (θ)z j (θ) 0i = E hg 1 H 2 Gg=1 Hh=1 Gg 0=1 Hh 0=1 z gh(θ)z g 0 h 0(θ)0i = G 1 H 2 g h h 0 E[z gh z 0 gh 0] g = g 0 +G 1 H 2 h g g 0 E[z gh z 0 g 0 h ] h = h0 G 1 H 2 g h E[z gh z 0 gh]. February / 44
41 Assume the triple sums are of order GH 2 = G. The triple sums are instead of order GH, rather than GH 2, if the dependence of observations in common cluster g goes to zero as clusters h and h 0 become further apart. case with declining time series dependence or spatial dependence. then in B 0 normalize by (GH) rate of convergence of the estimator becomes a faster p GH rather than p G. Still obtain the same asymptotic variance matrix for b β. February / 44
42 Some References Acemoglu, D., and J.-S. Pischke (2003), \Minimum Wages and On-the-job Training," Research in Labor Economics, 22, Arellano, M. (1987), \Computing Robust Standard Errors for Within-Group Estimators," Oxford Bulletin of Economics and Statistics, 49, Bertrand, M., E. Duo, and S. Mullainathan (2004), \How Much Should we Trust Dierences-in-Dierences Estimates," Quarterly Journal of Economics, Bhattacharya, D. (2005), \Asymptotic inference from multi-stage samples,"' Hournal of Econometrics, 126: Cameron, A.C., and N. Golotvina (2005), \Estimation of Country-Pair Data Models Controlling for Clustered Errors: with International Trade Applications," U.C.-Davis Economics Department Working Paper No Conley, T. G. (1999), \GMM with cross sectional dependence," Journal of Econometrics, 92: Donald, S. and K. Lang (2004), \Inferences with Dierences in Dierences and Other Panel Data," October 22, February / 44
43 Some References Fafchamps, M., and F. Gubert (2006), \The Formation of Risk Sharing Networks," mimeo, April Hansen, C. B. (2007a), \Generalized least squares inference in panel and multilevel models with serial correlation and xed eects," Journal of Econometrics, 140, Hansen, C. B. (2007b), \Asymptotic properties of a robust variance matrix estimator for panel data when T is large,"journal of Econometrics, 141: Ibragimov, R. and U.K. Muller (2007), \T-Statistic Based Correlation and Heterogeneity Robust Inference," Harvard Institute of Economic Research Discussion Paper No Kuersteiner, G. and J. Hausman (2007), \Dierence in Dierence meets Generalized Least Squares: Higher Order Properties of Hypotheses Tests". Liang, K.-Y., and S.L. Zeger (1986), \Longitudinal Data Analysis Using Generalized Linear Models," Biometrika, 73, Miglioretti, D.L., and P.J. Heagerty (2006), \Marginal Modeling of Nonnested Multilevel Data using Standard Software," American Journal of Epidemiology, 165(4), February / 44
44 Some References Moulton, B.R. (1986), \Random Group Eects and the Precision of Regression Estimates," Journal of Econometrics, 32, Moulton, B.R. (1990), \An Illustration of a Pitfall in Estimating the Eects of Aggregate Variables on Micro Units," Review of Economics and Statistics, 72, Petersen, M. (2006), \Estimating Standard Errors in Finance Panel Data Sets: Comparing Approaches," unpublished manuscript, Northwestern University. Rogers, W.H. (1993), \Regression Standard Errors in Clustered Samples," Stata Technical Bulletin, 13, Thompson, S. (2005), \A Simple Formula for Standard Errors that Cluster by Both Firm and Time," unpublished manuscript. White, H. (1984), Asymptotic Theory for Econometricians, San Diego, Academic Press. Wooldridge, J.M. (2003), \Cluster-Sample Methods in Applied Econometrics," American Economic Review, 93, February / 44
Bootstrap-Based Improvements for Inference with Clustered Errors
Bootstrap-Based Improvements for Inference with Clustered Errors Colin Cameron, Jonah Gelbach, Doug Miller U.C. - Davis, U. Maryland, U.C. - Davis May, 2008 May, 2008 1 / 41 1. Introduction OLS regression
More informationWild Bootstrap Inference for Wildly Dierent Cluster Sizes
Wild Bootstrap Inference for Wildly Dierent Cluster Sizes Matthew D. Webb October 9, 2013 Introduction This paper is joint with: Contributions: James G. MacKinnon Department of Economics Queen's University
More informationTECHNICAL WORKING PAPER SERIES ROBUST INFERENCE WITH MULTI-WAY CLUSTERING. A. Colin Cameron Jonah B. Gelbach Douglas L. Miller
TECHNICAL WORKING PAPER SERIES ROBUST INFERENCE WITH MULTI-WAY CLUSTERING A. Colin Cameron Jonah B. Gelbach Douglas L. Miller Technical Working Paper 327 http://www.nber.org/papers/t0327 NATIONAL BUREAU
More informationRobust Inference with Clustered Data
Robust Inference with Clustered Data A. Colin Cameron and Douglas L. Miller Department of Economics, University of California - Davis. This version: Feb 10, 2010 Abstract In this paper we survey methods
More informationCasuality and Programme Evaluation
Casuality and Programme Evaluation Lecture V: Difference-in-Differences II Dr Martin Karlsson University of Duisburg-Essen Summer Semester 2017 M Karlsson (University of Duisburg-Essen) Casuality and Programme
More informationRobust Inference for Regression with Clustered Data
Robust nference for Regression with Clustered Data Colin Cameron Univ. of California - Davis Joint work with Douglas L. Miller (UC - Davis). Based on A Practitioners Guide to Cluster-Robust nference, Journal
More informationRobust Inference with Clustered Errors
Robust nference with Clustered Errors Colin Cameron Univ. of California - Davis Keynote address at The 8th Annual Health Econometrics Workshop, University of Colorado Denver, Anschutz Medical Campus..
More informationCluster-Robust Inference
Cluster-Robust Inference David Sovich Washington University in St. Louis Modern Empirical Corporate Finance Modern day ECF mainly focuses on obtaining unbiased or consistent point estimates (e.g. indentification!)
More informationA Practitioner s Guide to Cluster-Robust Inference
A Practitioner s Guide to Cluster-Robust Inference A. C. Cameron and D. L. Miller presented by Federico Curci March 4, 2015 Cameron Miller Cluster Clinic II March 4, 2015 1 / 20 In the previous episode
More informationA Practitioner's Guide to Cluster-Robust Inference
A Practitioner's Guide to Cluster-Robust Inference A. Colin Cameron and Douglas L. Miller Department of Economics, University of California - Davis. This version: September 30, 2013 Abstract Note: Sections
More informationA Practitioner s Guide to Cluster-Robust Inference
A Practitioner s Guide to Cluster-Robust Inference A. Colin Cameron Douglas L. Miller Cameron and Miller abstract We consider statistical inference for regression when data are grouped into clusters, with
More informationA Course in Applied Econometrics Lecture 7: Cluster Sampling. Jeff Wooldridge IRP Lectures, UW Madison, August 2008
A Course in Applied Econometrics Lecture 7: Cluster Sampling Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. The Linear Model with Cluster Effects 2. Estimation with a Small Number of roups and
More informationLecture 4: Linear panel models
Lecture 4: Linear panel models Luc Behaghel PSE February 2009 Luc Behaghel (PSE) Lecture 4 February 2009 1 / 47 Introduction Panel = repeated observations of the same individuals (e.g., rms, workers, countries)
More informationChapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE
Chapter 6. Panel Data Joan Llull Quantitative Statistical Methods II Barcelona GSE Introduction Chapter 6. Panel Data 2 Panel data The term panel data refers to data sets with repeated observations over
More informationInference in difference-in-differences approaches
Inference in difference-in-differences approaches Mike Brewer (University of Essex & IFS) and Robert Joyce (IFS) PEPA is based at the IFS and CEMMAP Introduction Often want to estimate effects of policy/programme
More informationThe Exact Distribution of the t-ratio with Robust and Clustered Standard Errors
The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors by Bruce E. Hansen Department of Economics University of Wisconsin October 2018 Bruce Hansen (University of Wisconsin) Exact
More informationPanel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43
Panel Data March 2, 212 () Applied Economoetrics: Topic March 2, 212 1 / 43 Overview Many economic applications involve panel data. Panel data has both cross-sectional and time series aspects. Regression
More informationEconometrics of Panel Data
Econometrics of Panel Data Jakub Mućk Meeting # 1 Jakub Mućk Econometrics of Panel Data Meeting # 1 1 / 31 Outline 1 Course outline 2 Panel data Advantages of Panel Data Limitations of Panel Data 3 Pooled
More information11. Bootstrap Methods
11. Bootstrap Methods c A. Colin Cameron & Pravin K. Trivedi 2006 These transparencies were prepared in 20043. They can be used as an adjunct to Chapter 11 of our subsequent book Microeconometrics: Methods
More informationFinite Sample Performance of A Minimum Distance Estimator Under Weak Instruments
Finite Sample Performance of A Minimum Distance Estimator Under Weak Instruments Tak Wai Chau February 20, 2014 Abstract This paper investigates the nite sample performance of a minimum distance estimator
More informationThe Exact Distribution of the t-ratio with Robust and Clustered Standard Errors
The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors by Bruce E. Hansen Department of Economics University of Wisconsin June 2017 Bruce Hansen (University of Wisconsin) Exact
More informationEmpirical Methods in Applied Microeconomics
Empirical Methods in Applied Microeconomics Jörn-Ste en Pischke LSE October 007 1 Nonstandard Standard Error Issues The discussion so far has concentrated on identi cation of the e ect of interest. Obviously,
More informationClustering as a Design Problem
Clustering as a Design Problem Alberto Abadie, Susan Athey, Guido Imbens, & Jeffrey Wooldridge Harvard-MIT Econometrics Seminar Cambridge, February 4, 2016 Adjusting standard errors for clustering is common
More informationEconomics 582 Random Effects Estimation
Economics 582 Random Effects Estimation Eric Zivot May 29, 2013 Random Effects Model Hence, the model can be re-written as = x 0 β + + [x ] = 0 (no endogeneity) [ x ] = = + x 0 β + + [x ] = 0 [ x ] = 0
More informationDay 2B Inference for Clustered Data: Part 2 With a Cross-section Data Example (Revised August 25 to add 14. Dyadic Clustering)
Day 2B nference for Clustered Data: Part 2 With a Cross-section Data Example (Revised August 25 to add 14. Dyadic Clustering) A. Colin Cameron Univ. of Calif. - Davis... for Center of Labor Economics Norwegian
More informationLecture 7: Dynamic panel models 2
Lecture 7: Dynamic panel models 2 Ragnar Nymoen Department of Economics, UiO 25 February 2010 Main issues and references The Arellano and Bond method for GMM estimation of dynamic panel data models A stepwise
More informationWILD CLUSTER BOOTSTRAP CONFIDENCE INTERVALS*
L Actualité économique, Revue d analyse économique, vol 91, n os 1-2, mars-juin 2015 WILD CLUSTER BOOTSTRAP CONFIDENCE INTERVALS* James G MacKinnon Department of Economics Queen s University Abstract Confidence
More informationRecent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data
Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)
More informationdqd: A command for treatment effect estimation under alternative assumptions
UC3M Working Papers Economics 14-07 April 2014 ISSN 2340-5031 Departamento de Economía Universidad Carlos III de Madrid Calle Madrid, 126 28903 Getafe (Spain) Fax (34) 916249875 dqd: A command for treatment
More informationIV Quantile Regression for Group-level Treatments, with an Application to the Distributional Effects of Trade
IV Quantile Regression for Group-level Treatments, with an Application to the Distributional Effects of Trade Denis Chetverikov Brad Larsen Christopher Palmer UCLA, Stanford and NBER, UC Berkeley September
More informationEconometric Analysis of Cross Section and Panel Data
Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND
More informationInference about Clustering and Parametric. Assumptions in Covariance Matrix Estimation
Inference about Clustering and Parametric Assumptions in Covariance Matrix Estimation Mikko Packalen y Tony Wirjanto z 26 November 2010 Abstract Selecting an estimator for the variance covariance matrix
More informationRepeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data
Panel data Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data - possible to control for some unobserved heterogeneity - possible
More informationEconometrics II. Nonstandard Standard Error Issues: A Guide for the. Practitioner
Econometrics II Nonstandard Standard Error Issues: A Guide for the Practitioner Måns Söderbom 10 May 2011 Department of Economics, University of Gothenburg. Email: mans.soderbom@economics.gu.se. Web: www.economics.gu.se/soderbom,
More informationMotivation Non-linear Rational Expectations The Permanent Income Hypothesis The Log of Gravity Non-linear IV Estimation Summary.
Econometrics I Department of Economics Universidad Carlos III de Madrid Master in Industrial Economics and Markets Outline Motivation 1 Motivation 2 3 4 5 Motivation Hansen's contributions GMM was developed
More informationChapter 2. Dynamic panel data models
Chapter 2. Dynamic panel data models School of Economics and Management - University of Geneva Christophe Hurlin, Université of Orléans University of Orléans April 2018 C. Hurlin (University of Orléans)
More informationNon-linear panel data modeling
Non-linear panel data modeling Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini May 2010 Laura Magazzini (@univr.it) Non-linear panel data modeling May 2010 1
More informationApplied Microeconometrics (L5): Panel Data-Basics
Applied Microeconometrics (L5): Panel Data-Basics Nicholas Giannakopoulos University of Patras Department of Economics ngias@upatras.gr November 10, 2015 Nicholas Giannakopoulos (UPatras) MSc Applied Economics
More informationINFERENCE WITH LARGE CLUSTERED DATASETS*
L Actualité économique, Revue d analyse économique, vol. 92, n o 4, décembre 2016 INFERENCE WITH LARGE CLUSTERED DATASETS* James G. MACKINNON Queen s University jgm@econ.queensu.ca Abstract Inference using
More informationRewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35
Rewrap ECON 4135 November 18, 2011 () Rewrap ECON 4135 November 18, 2011 1 / 35 What should you now know? 1 What is econometrics? 2 Fundamental regression analysis 1 Bivariate regression 2 Multivariate
More informationMaking sense of Econometrics: Basics
Making sense of Econometrics: Basics Lecture 4: Qualitative influences and Heteroskedasticity Egypt Scholars Economic Society November 1, 2014 Assignment & feedback enter classroom at http://b.socrative.com/login/student/
More informationFNCE 926 Empirical Methods in CF
FNCE 926 Empirical Methods in CF Lecture 11 Standard Errors & Misc. Professor Todd Gormley Announcements Exercise #4 is due Final exam will be in-class on April 26 q After today, only two more classes
More informationy it = α i + β 0 ix it + ε it (0.1) The panel data estimators for the linear model are all standard, either the application of OLS or GLS.
0.1. Panel Data. Suppose we have a panel of data for groups (e.g. people, countries or regions) i =1, 2,..., N over time periods t =1, 2,..., T on a dependent variable y it and a kx1 vector of independent
More informationQED. Queen s Economics Department Working Paper No. 1355
QED Queen s Economics Department Working Paper No 1355 Randomization Inference for Difference-in-Differences with Few Treated Clusters James G MacKinnon Queen s University Matthew D Webb Carleton University
More informationLeast Squares Estimation-Finite-Sample Properties
Least Squares Estimation-Finite-Sample Properties Ping Yu School of Economics and Finance The University of Hong Kong Ping Yu (HKU) Finite-Sample 1 / 29 Terminology and Assumptions 1 Terminology and Assumptions
More informationEconomics 671: Applied Econometrics Department of Economics, Finance and Legal Studies University of Alabama
Problem Set #1 (Random Data Generation) 1. Generate =500random numbers from both the uniform 1 ( [0 1], uniformbetween zero and one) and exponential exp ( ) (set =2and let [0 1]) distributions. Plot the
More informationWild Cluster Bootstrap Confidence Intervals
Wild Cluster Bootstrap Confidence Intervals James G MacKinnon Department of Economics Queen s University Kingston, Ontario, Canada K7L 3N6 jgm@econqueensuca http://wwweconqueensuca/faculty/macinnon/ Abstract
More information1 Estimation of Persistent Dynamic Panel Data. Motivation
1 Estimation of Persistent Dynamic Panel Data. Motivation Consider the following Dynamic Panel Data (DPD) model y it = y it 1 ρ + x it β + µ i + v it (1.1) with i = {1, 2,..., N} denoting the individual
More informationEconometrics of Panel Data
Econometrics of Panel Data Jakub Mućk Meeting # 6 Jakub Mućk Econometrics of Panel Data Meeting # 6 1 / 36 Outline 1 The First-Difference (FD) estimator 2 Dynamic panel data models 3 The Anderson and Hsiao
More informationModified Variance Ratio Test for Autocorrelation in the Presence of Heteroskedasticity
The Lahore Journal of Economics 23 : 1 (Summer 2018): pp. 1 19 Modified Variance Ratio Test for Autocorrelation in the Presence of Heteroskedasticity Sohail Chand * and Nuzhat Aftab ** Abstract Given that
More informationExam D0M61A Advanced econometrics
Exam D0M61A Advanced econometrics 19 January 2009, 9 12am Question 1 (5 pts.) Consider the wage function w i = β 0 + β 1 S i + β 2 E i + β 0 3h i + ε i, where w i is the log-wage of individual i, S i is
More informationHeteroskedasticity-Robust Inference in Finite Samples
Heteroskedasticity-Robust Inference in Finite Samples Jerry Hausman and Christopher Palmer Massachusetts Institute of Technology December 011 Abstract Since the advent of heteroskedasticity-robust standard
More informationIntermediate Econometrics
Intermediate Econometrics Heteroskedasticity Text: Wooldridge, 8 July 17, 2011 Heteroskedasticity Assumption of homoskedasticity, Var(u i x i1,..., x ik ) = E(u 2 i x i1,..., x ik ) = σ 2. That is, the
More informationNBER WORKING PAPER SERIES ROBUST STANDARD ERRORS IN SMALL SAMPLES: SOME PRACTICAL ADVICE. Guido W. Imbens Michal Kolesar
NBER WORKING PAPER SERIES ROBUST STANDARD ERRORS IN SMALL SAMPLES: SOME PRACTICAL ADVICE Guido W. Imbens Michal Kolesar Working Paper 18478 http://www.nber.org/papers/w18478 NATIONAL BUREAU OF ECONOMIC
More informationTopic 7: Heteroskedasticity
Topic 7: Heteroskedasticity Advanced Econometrics (I Dong Chen School of Economics, Peking University Introduction If the disturbance variance is not constant across observations, the regression is heteroskedastic
More informationInference in Differences-in-Differences with Few Treated Groups and Heteroskedasticity
Inference in Differences-in-Differences with Few Treated Groups and Heteroskedasticity Bruno Ferman Cristine Pinto Sao Paulo School of Economics - FGV First Draft: October, 205 This Draft: October, 207
More informationHeteroskedasticity. We now consider the implications of relaxing the assumption that the conditional
Heteroskedasticity We now consider the implications of relaxing the assumption that the conditional variance V (u i x i ) = σ 2 is common to all observations i = 1,..., In many applications, we may suspect
More informationNew Developments in Econometrics Lecture 11: Difference-in-Differences Estimation
New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. The Basic Methodology 2. How Should We View Uncertainty in DD Settings?
More informationGravity Models, PPML Estimation and the Bias of the Robust Standard Errors
Gravity Models, PPML Estimation and the Bias of the Robust Standard Errors Michael Pfaffermayr August 23, 2018 Abstract In gravity models with exporter and importer dummies the robust standard errors of
More informationBOOTSTRAPPING DIFFERENCES-IN-DIFFERENCES ESTIMATES
BOOTSTRAPPING DIFFERENCES-IN-DIFFERENCES ESTIMATES Bertrand Hounkannounon Université de Montréal, CIREQ December 2011 Abstract This paper re-examines the analysis of differences-in-differences estimators
More informationSmall sample methods for cluster-robust variance estimation and hypothesis testing in fixed effects models
Small sample methods for cluster-robust variance estimation and hypothesis testing in fixed effects models arxiv:1601.01981v1 [stat.me] 8 Jan 2016 James E. Pustejovsky Department of Educational Psychology
More informationEstimating the Variance of Decomposition Effects
City University of New York (CUNY) CUNY Academic Works Economics Working Papers CUNY Academic Works 2015 Estimating the Variance of Decomposition Effects Takuya Hasebe Sophia University Follow this and
More informationEcon 836 Final Exam. 2 w N 2 u N 2. 2 v N
1) [4 points] Let Econ 836 Final Exam Y Xβ+ ε, X w+ u, w N w~ N(, σi ), u N u~ N(, σi ), ε N ε~ Nu ( γσ, I ), where X is a just one column. Let denote the OLS estimator, and define residuals e as e Y X.
More informationPanel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63
1 / 63 Panel Data Models Chapter 5 Financial Econometrics Michael Hauser WS17/18 2 / 63 Content Data structures: Times series, cross sectional, panel data, pooled data Static linear panel data models:
More informationMultiple Equation GMM with Common Coefficients: Panel Data
Multiple Equation GMM with Common Coefficients: Panel Data Eric Zivot Winter 2013 Multi-equation GMM with common coefficients Example (panel wage equation) 69 = + 69 + + 69 + 1 80 = + 80 + + 80 + 2 Note:
More informationA Direct Test for Consistency of Random Effects Models that Outperforms the Hausman Test
A Direct Test for Consistency of Random Effects Models that Outperforms the Hausman Test Preliminary Version: This paper is under active development. Results and conclusions may change as research progresses.
More informationEnvironmental Econometrics
Environmental Econometrics Syngjoo Choi Fall 2008 Environmental Econometrics (GR03) Fall 2008 1 / 37 Syllabus I This is an introductory econometrics course which assumes no prior knowledge on econometrics;
More informationSmall sample methods for cluster-robust variance estimation and hypothesis testing in fixed effects models
Small sample methods for cluster-robust variance estimation and hypothesis testing in fixed effects models James E. Pustejovsky and Elizabeth Tipton July 18, 2016 Abstract In panel data models and other
More informationECONOMETRICS FIELD EXAM Michigan State University May 9, 2008
ECONOMETRICS FIELD EXAM Michigan State University May 9, 2008 Instructions: Answer all four (4) questions. Point totals for each question are given in parenthesis; there are 00 points possible. Within
More informationDifference-in-Differences Estimation
Difference-in-Differences Estimation Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. The Basic Methodology 2. How Should We
More informationEconometrics - 30C00200
Econometrics - 30C00200 Lecture 11: Heteroskedasticity Antti Saastamoinen VATT Institute for Economic Research Fall 2015 30C00200 Lecture 11: Heteroskedasticity 12.10.2015 Aalto University School of Business
More information4.8 Instrumental Variables
4.8. INSTRUMENTAL VARIABLES 35 4.8 Instrumental Variables A major complication that is emphasized in microeconometrics is the possibility of inconsistent parameter estimation due to endogenous regressors.
More informationShort T Panels - Review
Short T Panels - Review We have looked at methods for estimating parameters on time-varying explanatory variables consistently in panels with many cross-section observation units but a small number of
More informationRecent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data
Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Endogeneity b) Instrumental
More informationNinth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"
Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric
More informationEfficient Estimation of Dynamic Panel Data Models: Alternative Assumptions and Simplified Estimation
Efficient Estimation of Dynamic Panel Data Models: Alternative Assumptions and Simplified Estimation Seung C. Ahn Arizona State University, Tempe, AZ 85187, USA Peter Schmidt * Michigan State University,
More informationChapter 1. GMM: Basic Concepts
Chapter 1. GMM: Basic Concepts Contents 1 Motivating Examples 1 1.1 Instrumental variable estimator....................... 1 1.2 Estimating parameters in monetary policy rules.............. 2 1.3 Estimating
More informationInference for Clustered Data
Inference for Clustered Data Chang Hyung Lee and Douglas G. Steigerwald Department of Economics University of California, Santa Barbara March 23, 2017 Abstract This article introduces eclust, a new Stata
More informationEconometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018
Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate
More informationPANEL DATA RANDOM AND FIXED EFFECTS MODEL. Professor Menelaos Karanasos. December Panel Data (Institute) PANEL DATA December / 1
PANEL DATA RANDOM AND FIXED EFFECTS MODEL Professor Menelaos Karanasos December 2011 PANEL DATA Notation y it is the value of the dependent variable for cross-section unit i at time t where i = 1,...,
More informationthe error term could vary over the observations, in ways that are related
Heteroskedasticity We now consider the implications of relaxing the assumption that the conditional variance Var(u i x i ) = σ 2 is common to all observations i = 1,..., n In many applications, we may
More informationE cient Estimation and Inference for Di erence-in-di erence Regressions with Persistent Errors
E cient Estimation and Inference for Di erence-in-di erence Regressions with Persistent Errors Ryan Greenaway-McGrevy Bureau of Economic Analysis Chirok Han Korea University January 2014 Donggyu Sul Univeristy
More informationG. S. Maddala Kajal Lahiri. WILEY A John Wiley and Sons, Ltd., Publication
G. S. Maddala Kajal Lahiri WILEY A John Wiley and Sons, Ltd., Publication TEMT Foreword Preface to the Fourth Edition xvii xix Part I Introduction and the Linear Regression Model 1 CHAPTER 1 What is Econometrics?
More informationx i = 1 yi 2 = 55 with N = 30. Use the above sample information to answer all the following questions. Show explicitly all formulas and calculations.
Exercises for the course of Econometrics Introduction 1. () A researcher is using data for a sample of 30 observations to investigate the relationship between some dependent variable y i and independent
More informationIntroduction to Linear Regression Analysis
Introduction to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018 Econometrics: What is it? Interaction of economic theory, observed data and statistical methods. The science of testing
More informationRobust Standard Errors in Small Samples: Some Practical Advice
Robust Standard Errors in Small Samples: Some Practical Advice Guido W. Imbens Michal Kolesár First Draft: October 2012 This Draft: December 2014 Abstract In this paper we discuss the properties of confidence
More informationEconomics 241B Estimation with Instruments
Economics 241B Estimation with Instruments Measurement Error Measurement error is de ned as the error resulting from the measurement of a variable. At some level, every variable is measured with error.
More information12E016. Econometric Methods II 6 ECTS. Overview and Objectives
Overview and Objectives This course builds on and further extends the econometric and statistical content studied in the first quarter, with a special focus on techniques relevant to the specific field
More informationEC327: Advanced Econometrics, Spring 2007
EC327: Advanced Econometrics, Spring 2007 Wooldridge, Introductory Econometrics (3rd ed, 2006) Chapter 14: Advanced panel data methods Fixed effects estimators We discussed the first difference (FD) model
More informationDiscrete Dependent Variable Models
Discrete Dependent Variable Models James J. Heckman University of Chicago This draft, April 10, 2006 Here s the general approach of this lecture: Economic model Decision rule (e.g. utility maximization)
More informationADJUSTED POWER ESTIMATES IN. Ji Zhang. Biostatistics and Research Data Systems. Merck Research Laboratories. Rahway, NJ
ADJUSTED POWER ESTIMATES IN MONTE CARLO EXPERIMENTS Ji Zhang Biostatistics and Research Data Systems Merck Research Laboratories Rahway, NJ 07065-0914 and Dennis D. Boos Department of Statistics, North
More informationAdvanced Econometrics
Based on the textbook by Verbeek: A Guide to Modern Econometrics Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna May 16, 2013 Outline Univariate
More informationBasic Regressions and Panel Data in Stata
Developing Trade Consultants Policy Research Capacity Building Basic Regressions and Panel Data in Stata Ben Shepherd Principal, Developing Trade Consultants 1 Basic regressions } Stata s regress command
More informationA Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,
A Course in Applied Econometrics Lecture 18: Missing Data Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. When Can Missing Data be Ignored? 2. Inverse Probability Weighting 3. Imputation 4. Heckman-Type
More informationDay 3B Nonparametrics and Bootstrap
Day 3B Nonparametrics and Bootstrap c A. Colin Cameron Univ. of Calif.- Davis Frontiers in Econometrics Bavarian Graduate Program in Economics. Based on A. Colin Cameron and Pravin K. Trivedi (2009,2010),
More informationEcon 582 Fixed Effects Estimation of Panel Data
Econ 582 Fixed Effects Estimation of Panel Data Eric Zivot May 28, 2012 Panel Data Framework = x 0 β + = 1 (individuals); =1 (time periods) y 1 = X β ( ) ( 1) + ε Main question: Is x uncorrelated with?
More informationBootstrapping the Grainger Causality Test With Integrated Data
Bootstrapping the Grainger Causality Test With Integrated Data Richard Ti n University of Reading July 26, 2006 Abstract A Monte-carlo experiment is conducted to investigate the small sample performance
More informationA Course on Advanced Econometrics
A Course on Advanced Econometrics Yongmiao Hong The Ernest S. Liu Professor of Economics & International Studies Cornell University Course Introduction: Modern economies are full of uncertainties and risk.
More informationAn overview of applied econometrics
An overview of applied econometrics Jo Thori Lind September 4, 2011 1 Introduction This note is intended as a brief overview of what is necessary to read and understand journal articles with empirical
More informationEconometrics of Panel Data
Econometrics of Panel Data Jakub Mućk Meeting # 2 Jakub Mućk Econometrics of Panel Data Meeting # 2 1 / 26 Outline 1 Fixed effects model The Least Squares Dummy Variable Estimator The Fixed Effect (Within
More information