The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors

Similar documents
The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors

Bootstrapping Heteroskedasticity Consistent Covariance Matrix Estimator

Wild Bootstrap Inference for Wildly Dierent Cluster Sizes

NBER WORKING PAPER SERIES ROBUST STANDARD ERRORS IN SMALL SAMPLES: SOME PRACTICAL ADVICE. Guido W. Imbens Michal Kolesar

WILD CLUSTER BOOTSTRAP CONFIDENCE INTERVALS*

Robust Standard Errors in Small Samples: Some Practical Advice

A Course in Applied Econometrics Lecture 7: Cluster Sampling. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Small sample methods for cluster-robust variance estimation and hypothesis testing in fixed effects models

Bootstrap-Based Improvements for Inference with Clustered Errors

Supplement to Randomization Tests under an Approximate Symmetry Assumption

Small sample methods for cluster-robust variance estimation and hypothesis testing in fixed effects models

Bayesian Interpretations of Heteroskedastic Consistent Covariance Estimators Using the Informed Bayesian Bootstrap

Bayesian Interpretations of Heteroskedastic Consistent Covariance. Estimators Using the Informed Bayesian Bootstrap

Wild Cluster Bootstrap Confidence Intervals

INFERENCE WITH LARGE CLUSTERED DATASETS*

A Practitioner s Guide to Cluster-Robust Inference

Heteroskedasticity-Robust Inference in Finite Samples

The Wild Bootstrap, Tamed at Last. Russell Davidson. Emmanuel Flachaire

Central Bank of Chile October 29-31, 2013 Bruce Hansen (University of Wisconsin) Structural Breaks October 29-31, / 91. Bruce E.

Empirical Methods in Applied Microeconomics

New heteroskedasticity-robust standard errors for the linear regression model

Bootstrapping heteroskedastic regression models: wild bootstrap vs. pairs bootstrap

Inference in Differences-in-Differences with Few Treated Groups and Heteroskedasticity

Time Series Econometrics For the 21st Century

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

Small-Sample Methods for Cluster-Robust Inference in School-Based Experiments

Clustering as a Design Problem

A better way to bootstrap pairs

Finite Sample Performance of A Minimum Distance Estimator Under Weak Instruments

11. Bootstrap Methods

QED. Queen s Economics Department Working Paper No. 1355

Econometrics II. Nonstandard Standard Error Issues: A Guide for the. Practitioner

Small-sample cluster-robust variance estimators for two-stage least squares models

Inference in difference-in-differences approaches

Topic 7: Heteroskedasticity

Casuality and Programme Evaluation

Heteroskedasticity ECONOMETRICS (ECON 360) BEN VAN KAMMEN, PHD

Simple and Trustworthy Cluster-Robust GMM Inference

Increasing the Power of Specification Tests. November 18, 2018

Robust Inference with Multi-way Clustering

Inference for Clustered Data

Reliability of inference (1 of 2 lectures)

A Robust Test for Weak Instruments in Stata

More efficient tests robust to heteroskedasticity of unknown form

Economics 582 Random Effects Estimation

Inference Based on the Wild Bootstrap

the error term could vary over the observations, in ways that are related

Robust Inference with Clustered Data

Inference with Dependent Data Using Cluster Covariance Estimators

Inference about Clustering and Parametric. Assumptions in Covariance Matrix Estimation

Microeconometrics: Clustering. Ethan Kaplan

Andreas Steinhauer, University of Zurich Tobias Wuergler, University of Zurich. September 24, 2010

QED. Queen s Economics Department Working Paper No Bootstrap and Asymptotic Inference with Multiway Clustering

GLS. Miguel Sarzosa. Econ626: Empirical Microeconomics, Department of Economics University of Maryland

Discussion of Bootstrap prediction intervals for linear, nonlinear, and nonparametric autoregressions, by Li Pan and Dimitris Politis

A COMPARISON OF HETEROSCEDASTICITY ROBUST STANDARD ERRORS AND NONPARAMETRIC GENERALIZED LEAST SQUARES

econstor Make Your Publications Visible.

Testing Overidentifying Restrictions with Many Instruments and Heteroskedasticity

Instrumental Variables Estimation in Stata

New Developments in Econometrics Lecture 16: Quantile Estimation

Imbens/Wooldridge, Lecture Notes 13, Summer 07 1

APEC 8212: Econometric Analysis II

Week 11 Heteroskedasticity and Autocorrelation

Applied Statistics and Econometrics

HETEROSKEDASTICITY, TEMPORAL AND SPATIAL CORRELATION MATTER

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Alternative HAC Covariance Matrix Estimators with Improved Finite Sample Properties

BOOTSTRAPPING DIFFERENCES-IN-DIFFERENCES ESTIMATES

Location Properties of Point Estimators in Linear Instrumental Variables and Related Models

A New Approach to Robust Inference in Cointegration

Economics 308: Econometrics Professor Moody

The Case Against JIVE

G. S. Maddala Kajal Lahiri. WILEY A John Wiley and Sons, Ltd., Publication

Robust Two Step Confidence Sets, and the Trouble with the First Stage F Statistic

On block bootstrapping areal data Introduction

Lecture 7: Dynamic panel models 2

FNCE 926 Empirical Methods in CF

Gravity Models, PPML Estimation and the Bias of the Robust Standard Errors

Heteroskedasticity-Consistent Covariance Matrix Estimators in Small Samples with High Leverage Points

IV Quantile Regression for Group-level Treatments, with an Application to the Distributional Effects of Trade

Bootstrap prediction intervals for factor models

Inference with Dependent Data Using Cluster Covariance Estimators

Instrumental Variable Estimation with Heteroskedasticity and Many Instruments

Simple and Trustworthy Cluster-Robust GMM Inference

POLSCI 702 Non-Normality and Heteroskedasticity

On Variance Estimation for 2SLS When Instruments Identify Different LATEs

Bootstrap Quasi-Likelihood Ratio Tests for Nested Models

What s New in Econometrics? Lecture 14 Quantile Methods

Robust Inference for Regression with Clustered Data

Econ 836 Final Exam. 2 w N 2 u N 2. 2 v N

ECON Introductory Econometrics. Lecture 5: OLS with One Regressor: Hypothesis Tests

A Simple, Graphical Procedure for Comparing. Multiple Treatment Effects

41903: Group-Based Inference

E cient Estimation and Inference for Di erence-in-di erence Regressions with Persistent Errors

Consistency without Inference: Instrumental Variables in Practical Application *

Models, Testing, and Correction of Heteroskedasticity. James L. Powell Department of Economics University of California, Berkeley

Introduction to Econometrics

Darmstadt Discussion Papers in Economics

Econometrics of Panel Data

Econometrics Summary Algebraic and Statistical Preliminaries

Transcription:

The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors by Bruce E. Hansen Department of Economics University of Wisconsin October 2018 Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 1 / 38

Peter Phillips Contributions to Exact Distribution Theory Before unit roots, Peter was the master of exact distribution theory Econometrica 1977 (asymptotic expansions) Journal of Econometrics 1977 (SUR) Econometrica 1980 (IV) Biometrika 1982 (F) Handbook of Econometrics 1983 (simultaneous equations) International Economic Review 1984 (LIML) Journal of Econometrics 1984 (Stein Rule) Journal of Econometrics 1984 (exogenous case) International Economic Review 1985 (LIML) Econometrica 1985 (SUR) International Economic Review 1986 (FIML) Econometrica 1986 (Wald statistic) Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 2 / 38

t-ratio Gosset (1908) The t-ratio of the sample mean has the exact t n 1 distribution A fundamental intellectual achievement Linear regression Gosset s result extends to classical t-ratios (classical standard errors) Classical t-ratios have t n k distribution Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 3 / 38

But... Classical standard errors are no longer used in economic research Papers use either Heteroskedasticity-consistent (HC) Cluster-robust (CR) Heteroskedasticity-and-autocorrelation-consistent (HAC) Justification is asymptotic Most assess significance (testing and confidence intervals) using finite sample distribution: t n k distribution (HC) t G 1 distribution (CR) THIS IS WRONG!!! Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 4 / 38

Most applied regressions use Stata with Regression: reg y x, r Uses HC1 variance estimator White estimator scaled by n/(n k) Uses t n k distribution for p-values and confidence intervals UNJUSTIFIED! Clustered: reg y x, cluster(id ) Uses CR1 variance estimator Described later, ad hoc Uses t G 1 distribution for p-values and confidence intervals No finite sample justification Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 5 / 38

This paper Provides an exact theory of inference Linear regression with robust standard errors Linear regression with clustered standard errors Exact distribution of HC and CR t-ratios under i.i.d. normality Computable Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 6 / 38

Linear Regression with Heteroskedasticity y i = x i β + e i E (e i x i ) = 0 E ( e 2 i x i ) = σ 2 i n observations k regressors Core model in applied econometrics Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 7 / 38

Heteroskedastic (HC) Variance Estimation: Some History Eicker (1963): HC0 Horn, Horn and Duncan (1975): HC2 Hinkley (1977): HC1 White (1980): HC0 for econometrics MacKinnon and White (1985): HC3 Chesher and Jewitt (1987): Bias can be large Bera, Suprayitno and Premaratne (2002): Unbiased estimator Bell-McCaffrey (2002): Distributional approximation Cribari-Neto (2004): HC4 Cribari-Neto, Souza and Vasconcellos (2007): HC5 Cattaneo, Jansson and Newey (2017): Many regressors Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 8 / 38

HC Variance Estimation OLS: Residuals: HC0 HC1 V 1 = V 0 = ( X X ) 1 β = ( X X ) 1 X Y ê i = y i x i β ( n x i x i êi 2 i=1 ) (X X ) 1 n ( X X ) ( 1 n ) (X x i x i êi 2 X ) 1 n k i=1 robust covariance matrix in Stata Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 9 / 38

HC2: V 2 = ( X X ) 1 ( n ) (X x i x i êi 2 (1 h i ) 1 X ) 1 i=1 h i = x i (X X ) 1 x i Unbiased under homoskedasticity HC3: V 3 = ( X X ) 1 ( n ) (X x i x i êi 2 (1 h i ) 2 X ) 1 i=1 jackknife Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 10 / 38

HC3 (jackknife) is a conservative estimator Theorem. In the linear regression model, ( ) E V 3 X ( ) ) V = E ( β β) ( β β X (However, inference using HC3 is not necessarily conservative.) Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 11 / 38

HC t-ratios t-ratio for R β: ( β T = R β) R V R Distribution theory Asymptotic: T d N(0, 1) This is what we (typically) teach Distribution used in practical applications Finite Sample: T t n k This is what most applied papers use Incorrect Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 12 / 38

Clustered Samples Observations are (y ig, x ig ) g = 1,..., G indexes cluster (group) i = 1,..., n n indexes observation within g th cluster Clusters are mutually independent Observations within a cluster have unknown dependence In panels, (y ig, x ig ) could be demeaned observations Assumptions fully allow for this Number of observations n g per cluster may vary across cluster Total number of observations n = G g =1 n g Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 13 / 38

Cluster Regression y g = (y 1g,..., y ng g ) is n g 1 vector of dependent variables X g = (x 1g,..., x ng g ) is n g K regressor matrix for g th cluster. Linear regression model y g = X g β + e g E (e g X g ) = 0 E ( e g e g ) X g = Sg Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 14 / 38

Cluster-Robust (CR) Variance Estimation OLS: β = Residual: Variance estimator V 0 = ( G ) 1 ( G X g X g g =1 g =1 ê g = y g X g β X g y g ) ( G ) 1 ( G ) ( G X g X g X g ê g ê g X g g =1 g =1 g =1 X g X g ) 1 Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 15 / 38

Adjustments Chris Hansen (2007) adjustment ( ) G V = V 0 G 1 Justified in Large homogenous clusters framework Stata adjusment No justification V 1 = ( ) ( ) n 1 G V 0 n k G 1 Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 16 / 38

Cluster-Robust (CR) Variance Estimation: Some History Methods: Moulton (1986, 1990), Arellano (1987) Popularization: Rogers (1993), Bertrand, Duflo and Mullainathan (2004) Large G asymptotics: White (1984), C. Hansen (2007), Carter, Schnepel and Steigerwald (2017) Fixed G asymptotics: C. Hansen (2007), Bester, Conley and C. Hansen (2011), Conley and Taber (2011), Ibragimov and Mueller (2010, 2016) Small Sample: Donald and Lang (2007), Imbens and Kolesar (2016), Young (2017), Canay, Romano, and Shaikh (2017) Bootstrap: Cameron, Gelbach and Miller (2008), MacKinnon and Webb (2017abc, 2018), MacKinnon, Nielsen, and Webb (2017), Djogbenou, MacKinnon, and Nielsen (2018), Canay, Santos, and Shaikh (2018) Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 17 / 38

Illustration: Heteroskedastic Dummy Variable Regression Dummy variable model Angrist and Pinchke (2009) Imbens and Kolesar (2016) y i = β 0 + β 1 x i + e i n i=1 x i = 3 e i N(0, 1) Coeffi cient of interest: β 1 Simulation with 100,000 replications Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 18 / 38

Large Size Distortion with HC Standard Errors Rejection Probability of Nominal 5% Tests Using t n k Critical Values n = 30 HC0 0.18 HC1 0.17 HC2 0.14 HC3 0.10 Notice that even conservative HC3 t-ratio over-rejects. That is because the t n k distribution is incorrect. Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 19 / 38

Distortion increases with Sample size! Rejection Probability of Nominal 5% Tests Using t n k Critical Values n = 30 n = 100 n = 500 HC0 0.18 0.23 0.24 HC1 0.17 0.22 0.24 HC2 0.14 0.17 0.18 HC3 0.10 0.13 0.14 Reason: Highly Leveraged Design Matrix Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 20 / 38

Simulation Results All procedures over-reject HC1 correction doesn t help Unbiased estimator HC2 over-rejects Conservative estimator HC3 over-rejects t n k vs N(0, 1) ineffective Conclusion: Distributional approximation needs improvement Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 21 / 38

Exact Distribution of White t-ratio Assumption: Observations are i.i.d., e i x i N ( 0, σ 2) Goal: Calculate exact density and distribution of White t-ratio T Parameter of interest: θ = R β The distribution is unknown Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 22 / 38

Notation M = I X (X X ) 1 X D = diag { d1 2,..., d n 2 } d i = R (X X ) 1 x i λ 1,..., λ K are the non-zero eigenvalues of D 1/2 MD 1/2 w i = λ i /R (X X ) 1 R Distribution depends on normalized eigenvalues w i Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 23 / 38

Theorem: Distribution of White Variance Estimator where G r (u) is the χ 2 r distribution, R V R ( u ) R VR b m G K +2m m=0 δ δ = min w m m N ( δ b 0 = i=1 b m = 1 m a m = w i m l=1 N 1 i=1 2 ) ki /2 b m l a l, m 1 (1 δwi ) m Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 24 / 38

Comments The Theorem shows that the distribution of the variance estimator can be written as an infinite mixture of chi-square distributions The weights b m are non-negative, sum to one Weights are determined by a simple recursion in known parameters Weights determined by eigenvalues of matrix D 1/2 MD 1/2 Specializes to conventional chi-square when eigenvalues are all equal This theorem is a refinement of Castano and Lopez (2005) Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 25 / 38

Theorem: Distribution of White t-ratio T ) b m F K +2m (u (K + 2m) δ m=0 where F r (u) is the student t distribution, and weights b m are from the previous theorem. Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 26 / 38

Comments The Theorem shows that the distribution of T can be written as an infinite mixture of student t distributions The weights b m are non-negative, sum to one Weights are determined by a simple recursion in known parameters Weights determined by eigenvalues of matrix D 1/2 MD 1/2 Specializes to conventional student t when eigenvalues are all equal Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 27 / 38

Finite Sample Distribution This is the exact finite sample distribution of the White HC t-ratio under normality. The distribution is determined by the design matrix X X This result is entirely new The exact distribution is not student t. It is a mixture of student t distributions. The difference can be large when the design matrix is highly leveraged. Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 28 / 38

Exact Distribution Advantages Computatable exact distribution under normality Improved accuracy when regressor matrix is highly leveraged Disadvantages Increased computation cost relative to classical methods Reliable algorithm in development Limitations Assumes homoskedasticity Assumes normality Linear parameters Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 29 / 38

Alternative Bell-McCaffrey (2002) Satterthwaite (1946) approximation for Q is αχ 2 K match first two moments of Q Approximate distribution of t by t K Endorsed by Imbens-Kolesar (2016) An approximation but no formal theory where α and K Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 30 / 38

Simulation Experiement Dummy variable model Angrist and Pinchke (2009) Imbens and Kolesar (2016) y i = β 0 + β 1 x i + e i n i=1 x i = 3 Coeffi cient of interest: β 1 n = 50, 100, 500 Compare: HC1, HC2, HC3 t n k, Bell-McCaffrey, and T distributions Size and median length of confidence regions e i N(0, 1), Heteroskedastic, and student-t errors 100,000 replications Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 31 / 38

Rejection Probability of Nominal 5% Tests Median Length of 95% Confidence Intervals Normal Homoskedastic Errors t n k Bell-McCaffrey Exact T size Length size Length n = 50 HC1 0.174 0.032 3.5 0.053 3.0 HC2 0.139 0.033 3.7 0.052 3.2 HC3 0.101 0.035 3.9 0.052 3.3 n = 100 HC1 0.224 0.036 3.9 0.052 3.4 HC2 0.173 0.040 4.0 0.051 3.6 HC3 0.126 0.042 4.0 0.051 3.7 n = 500 HC1 0.240 0.046 4.1 0.051 3.9 HC2 0.183 0.047 4.1 0.051 3.9 HC3 0.137 0.049 4.1 0.051 4.0 Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 32 / 38

Rejection Probability of Nominal 5% Tests Median Length of 95% Confidence Intervals Normal Heteroskedastic Errors σ 2 (x) = 1(x = 1) + 0.5(x = 0) t n k Bell-McCaffrey T size Length size Length n = 50 HC1 0.201 0.053 3.3 0.079 2.8 HC2 0.158 0.049 3.6 0.072 3.0 HC3 0.115 0.046 3.8 0.065 3.3 n = 100 HC1 0.228 0.051 3.8 0.065 3.4 HC2 0.175 0.050 4.0 0.061 3.6 HC3 0.128 0.049 4.0 0.058 3.7 n = 500 HC1 0.259 0.052 4.0 0.057 3.8 HC2 0.197 0.052 4.0 0.055 3.9 HC3 0.144 0.052 4.0 0.054 3.9 Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 33 / 38

Rejection Probability of Nominal 5% Tests Median Length of 95% Confidence Intervals Normal Heteroskedastic Errors σ 2 (x) = 1(x = 1) + 2(x = 0) t n k Bell-McCaffrey T size Length size Length n = 50 HC1 0.112 0.003 4.5 0.017 3.8 HC2 0.093 0.009 4.5 0.021 3.8 HC3 0.068 0.013 4.5 0.024 3.8 n = 100 HC1 0.182 0.012 4.3 0.021 3.8 HC2 0.140 0.017 4.3 0.027 3.9 HC3 0.106 0.025 4.3 0.033 3.9 n = 500 HC1 0.231 0.034 4.2 0.039 4.0 HC2 0.177 0.039 4.2 0.042 4.0 HC3 0.132 0.042 4.2 0.044 4.1 Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 34 / 38

Rejection Probability of Nominal 5% Tests Median Length of 95% Confidence Intervals t 5 Errors t n k Bell-McCaffrey T size Length size Length n = 50 HC1 0.153 0.022 4.2 0.039 3.6 HC2 0.122 0.023 4.4 0.039 3.7 HC3 0.086 0.024 4.5 0.039 3.9 n = 100 HC1 0.182 0.012 4.6 0.038 4.0 HC2 0.140 0.017 4.6 0.039 4.2 HC3 0.106 0.025 4.7 0.040 4.3 n = 500 HC1 0.226 0.035 4.7 0.038 4.5 HC2 0.166 0.036 4.7 0.039 4.5 HC3 0.119 0.037 4.7 0.040 4.6 Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 35 / 38

Simulation Summary t n k criticals inappropriate Bell-McCaffrey can be quite conservative T is precise under homoskedastic normality (as expected) Both Bell-McCaffrey and T sensitive to heteroskedasticity and non-normality HC3 appears least sensitive HC3 with T distribution reasonably reliable Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 36 / 38

Clustered Samples Same analysis applies to clustered regression and CR standard errors Under i.i.d. normality, clustered t-ratios have exact T distributions Weights are determined by regressor matrix Distortions from normality when design matrix is highly leveraged When clusters are heterogeneous When only a few clusters are treated Accuracy of conventional distribution theory depends on the number of clusters G and degree of leverage Conventional asymptotics requires a large G, not large n Many applied papers don t even report G G should be reported, along with sample size! Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 37 / 38

Conclusion In 1908, Gosset revolutionized statistical inference by providing the exact distribution of the classical t-ratio Peter Phillips extended Gosset s theory to a broad set of econometric estimators and statistics Applied econometrics relies on heteroskedasticity-robust and cluster-robust standard errors There is no finite sample theory for HC and CR t-ratios This paper provides the first exact distribution theory Bruce Hansen (University of Wisconsin) Exact Inference for Robust t ratio October 2018 38 / 38