Robust covariance estimation for quantile regression

Similar documents
Quantile regression and heteroskedasticity

Partial effects in fixed effects models

UNIVERSITY OF CALIFORNIA Spring Economics 241A Econometrics

Censored quantile instrumental variable estimation with Stata

What s New in Econometrics? Lecture 14 Quantile Methods

the error term could vary over the observations, in ways that are related

LESLIE GODFREY LIST OF PUBLICATIONS

Instrumental Variables Estimation in Stata

Basic Regressions and Panel Data in Stata

Instrumental Variables and GMM: Estimation and Testing. Steven Stillman, New Zealand Department of Labour

Econometrics - 30C00200

PEP-PMMA Technical Note Series (PTN 01) Percentile Weighed Regression

New Developments in Econometrics Lecture 16: Quantile Estimation

Topic 7: Heteroskedasticity

Heteroskedasticity. We now consider the implications of relaxing the assumption that the conditional

Modified Variance Ratio Test for Autocorrelation in the Presence of Heteroskedasticity

INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION. 1. Introduction

The Glejser Test and the Median Regression

Economics 582 Random Effects Estimation

lqmm: Estimating Quantile Regression Models for Independent and Hierarchical Data with R

Problem Set 3: Bootstrap, Quantile Regression and MCMC Methods. MIT , Fall Due: Wednesday, 07 November 2007, 5:00 PM

A Robust Test for Weak Instruments in Stata

References. Regression standard errors in clustered samples. diagonal matrix with elements

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Spatial Regression. 3. Review - OLS and 2SLS. Luc Anselin. Copyright 2017 by Luc Anselin, All Rights Reserved

Models, Testing, and Correction of Heteroskedasticity. James L. Powell Department of Economics University of California, Berkeley

Quantile methods. Class Notes Manuel Arellano December 1, Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be

Discussion of Bootstrap prediction intervals for linear, nonlinear, and nonparametric autoregressions, by Li Pan and Dimitris Politis

Nonparametric Econometrics

GARCH Models Estimation and Inference

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Short T Panels - Review

New Developments in Econometrics Lecture 9: Stratified Sampling

Graduate Econometrics Lecture 4: Heteroskedasticity

11. Bootstrap Methods

Heteroskedasticity and Autocorrelation

Multiple Regression Analysis: Heteroskedasticity

Instrumental variables estimation using heteroskedasticity-based instruments

IV and IV-GMM. Christopher F Baum. EC 823: Applied Econometrics. Boston College, Spring 2014

Jeffrey M. Wooldridge Michigan State University

fhetprob: A fast QMLE Stata routine for fractional probit models with multiplicative heteroskedasticity

Instrumental variables and GMM: Estimation and testing

IV Quantile Regression for Group-level Treatments, with an Application to the Distributional Effects of Trade

Implementing Conditional Tests with Correct Size in the Simultaneous Equations Model

Non-linear panel data modeling

Gravity Models, PPML Estimation and the Bias of the Robust Standard Errors

Identification and Estimation of Partially Linear Censored Regression Models with Unknown Heteroscedasticity

A Note on Implementing Box-Cox Quantile Regression

Truncated Regression Model and Nonparametric Estimation for Gifted and Talented Education Program

Birkbeck Working Papers in Economics & Finance

AFT Models and Empirical Likelihood

Exploring Marginal Treatment Effects

Improving linear quantile regression for

Applied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid

Impact Evaluation Technical Workshop:

WEIGHTED QUANTILE REGRESSION THEORY AND ITS APPLICATION. Abstract

BOOTSTRAP INFERENCES IN HETEROSCEDASTIC SAMPLE SELECTION MODELS: A MONTE CARLO INVESTIGATION

GLS. Miguel Sarzosa. Econ626: Empirical Microeconomics, Department of Economics University of Maryland

Practical Bayesian Quantile Regression. Keming Yu University of Plymouth, UK

Heteroskedasticity. Part VII. Heteroskedasticity

Lecture 4: Heteroskedasticity

Econometric Analysis of Cross Section and Panel Data

xtdpdqml: Quasi-maximum likelihood estimation of linear dynamic short-t panel data models

Stata module for decomposing goodness of fit according to Shapley and Owen values

Spatial groupwise heteroskedasticity and the SCAN approach

Heteroskedasticity-Robust Inference in Finite Samples

Quantile Impulse Response Functions

Consistency without Inference: Instrumental Variables in Practical Application *

Tobit and Interval Censored Regression Model

Density estimation Nonparametric conditional mean estimation Semiparametric conditional mean estimation. Nonparametrics. Gabriel Montes-Rojas

-redprob- A Stata program for the Heckman estimator of the random effects dynamic probit model

Microeconometrics and Stata over the Past Thirty Years

1. GENERAL DESCRIPTION

Recitation 5. Inference and Power Calculations. Yiqing Xu. March 7, 2014 MIT

Lab 11 - Heteroskedasticity

Introduction to Econometrics. Heteroskedasticity

Lecture 11/12. Roy Model, MTE, Structural Estimation

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Chapter 8. Quantile Regression and Quantile Treatment Effects

The Case Against JIVE

Week 11 Heteroskedasticity and Autocorrelation

Binary choice 3.3 Maximum likelihood estimation

Case of single exogenous (iv) variable (with single or multiple mediators) iv à med à dv. = β 0. iv i. med i + α 1

xtseqreg: Sequential (two-stage) estimation of linear panel data models

Outline. Possible Reasons. Nature of Heteroscedasticity. Basic Econometrics in Transportation. Heteroscedasticity

1 Motivation for Instrumental Variable (IV) Regression

Intermediate Econometrics

Christopher Dougherty London School of Economics and Political Science

Econ 582 Fixed Effects Estimation of Panel Data

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation

Ordinary Least Squares Regression

A Resampling Method on Pivotal Estimating Functions

For right censored data with Y i = T i C i and censoring indicator, δ i = I(T i < C i ), arising from such a parametric model we have the likelihood,

Economics 308: Econometrics Professor Moody

α version (only brief introduction so far)

Bootstrap-Based Improvements for Inference with Clustered Errors

Instrumental variables: Overview and advances

The Exact Distribution of the t-ratio with Robust and Clustered Standard Errors

Wild Bootstrap Inference for Wildly Dierent Cluster Sizes

Transcription:

1 Robust covariance estimation for quantile regression J. M.C. Santos Silva School of Economics, University of Surrey UK STATA USERS GROUP, 21st MEETING, 10 Septeber 2015

2 Quantile regression (Koenker and Bassett, 1978) is increasingly used by practitioners, but there are still some misconceptions about how diffi cult it is to obtain valid standard errors in this context. 1. Summary In this presentation I discuss the estimation of the covariance matrix of the quantile regression estimator, focusing special attention on the case where the regression errors may be heteroskedastic and/or clustered. Specification tests to detect heteroskedasticity and intra-cluster correlation are also discussed. The presentation concludes with a brief description of qreg2, which is a wrapper for qreg that implements all the methods discussed in the presentation.

3 2. Basics of quantile regression For 0 < α < 1, the α-th quantile of y given x is defined by Q y (α x) = min{η P(y η x) α}. Assume that Q y (α x) is linear, so that Q y (α x) = x β (α), which is equivalent to y = x β (α) + u (α) ; Q u(α) (α x) = 0.

4 0 2 4 6 8 10 0 1 2 3 4 5 x

5 3. Estimation with clustered data Let the data be {(y gi, x gi ), g = 1,..., G, i = 1,..., n g }, where g indexes a set of G clusters, each with n g elements (for simplicity, we set n g = n). It is assumed that the disturbances are conditionally independent across clusters (but can be correlated within clusters). Note that for n g 1 we have the usual (heteroskedastic) case. So the model to be estimated is: y gi = x gi β (α) + u (α) gi. Examples include: Cross-sectional regression with clustered data (by regions, industry, etc.), Pooled quantile regression, Quantiles with correlated random effects.

6 β (α) can be estimated as 1 ˆβ (α) = arg min b G G g =1 α y gi x gi b + (1 α) y gi x gi b, y gi x gi b y gi <x gi b ˆβ (α) is usually estimated by linear programming methods. Asymptotic theory is non-standard because the objective function is not differentiable.

7 It is possible to show that (Parente and Santos Silva, 2016): G ( ˆβ (α) β (α) ) D N ( 0, B 1 AB 1). where [ ] n A = E i=1 n j=1 x gi x gj (α I [u gi < 0]) (α I [u gj < 0]), B = n i=1 E[x gi x gi f (0 x gi )] Notice that the asymptotic results depend on G.

8 4. Robust covariance matrix estimation One way to perform robust inference is to use bootstrap. This, however, can be quite expensive especially for large models for large datasets. Parente and Santos Silva (2016) show that it is possible to obtain consistent estimators of A and B : Â = 1 G G g =1 n i=1 n j=1 x gi x gj ψ α (i)ψ α (j), B = 1 2δ G G G g =1 n i=1 1 ( δ G ( y gi x gi ˆβ (α) ) δ G ) xgi x gi, ψ α (i) = α I [( y gi x gi ˆβ (α) ) < 0 ], where δ G is a bandwidth parameter.

9 As in Koenker (2005, p. 81), we can define δ G = κ [ Φ 1 (α + h G ) Φ 1 (α h G ) ], where h G is (see Koenker, 2005, p. 140) ( ( h G = (ng ) 1/3 Φ 1 1 0.05 )) ( 2/3 ( ( 1.5 φ Φ 1 (α) )) 2 2 2 (Φ 1 (α)) 2 + 1 ) 1/3, and κ is a robust estimate of scale. For example, κ can be defined as the MAD (median absolute deviation) of the α-th quantile regression residuals.

10 5. Specification tests When there is no intra-cluster correlation the proposed covariance estimator is equivalent to a standard heteroskedasticity robust estimator (see Powell, 1984, Chamberlain, 1994, and Kim and White, 2003). This is also the case when n g 1. When the errors are i.i.d., the estimator is equivalent to the one originally proposed by Koenker and Bassett, 1978). Specification tests can be used to detect intra-cluster correlation and heteroskedasticity.

11 Parente and Santos Silva (2016) proposed a test to check for intra-cluster correlation. Jeff Wooldridge proposed a similar test based on the OLS residuals. These are robust versions of Breusch and Pagan s (1980) error components test Machado and Santos Silva (2000) proposed a test to check for heteroskedasticity in quantile regression. For α = 0.5, this is the well-known Glejser (1969) test for heteroskedasticity. Simulation results suggest the tests have good performance both under the null and under the alternative.

12 6. The qreg2 command qreg2 is a wrapper for qreg which estimates quantile regression and reports robust standard errors and t-statistics. By default the standard errors are asymptotically valid under heteroskedasticity and misspecification. Standard errors that are also robust to intra-cluster correlation can be obtained with the option cluster. By default, the Machado-Santos Silva (2000) test for heteroskedasticity is reported. When the option cluster is used the Parente-Santos Silva (2016) test for intra-cluster correlation is reported.

13 qreg2 depvar [indepvars] [if] [in] [weight] [, options] quantile(#): estimates # quantile; default is quantile(.5) cluster(clustvar): standard errors are computed allowing for intra-cluster correlation mss(varlist): use varlist in the MSS heteroskedasticity test silverman: uses Silverman s rule-of-thumb as a scaling factor for the bandwidth epsilon(#): controls the number of residuals set to zero; default is epsilon(1e-7)

14 7. Final notes qreg has an option to compute robust standard errors and t-statistics (but not clustered-robust). However, it is not clear to me how this is implemented. Simulations suggest that our estimator performs much better. The discussion of quantile (median) regression in the Stata manual could be much improved. The comparison with regress and rreg is very misleading.

15 Further reading Chamberlain, G. (1994). Quantile Regression, Censoring and the Structure of Wages, in C.A. Sims, ed., Advances in Econometrics, 171 209. Cambridge: CUP. Kim, T.H. and White, H. (2003). Estimation, Inference, and Specification Testing for Possibly Misspecified Quantile Regressions, in T. Fomby and R.C. Hill, eds., Maximum Likelihood Estimation of Misspecified Models: Twenty Years Later, 107-132. New York (NY): Elsevier. Koenker, R. (2005). Quantile Regression, Cambridge: Cambridge University Press. Koenker, R. and Bassett Jr., G.S. (1978). Regression Quantiles, Econometrica, 46, 33-50.

16 Machado, J.A.F., Parente, P.M.D.C., and Santos Silva, J.M.C. (2013). qreg2: Stata module to perform quantile regression with robust and clustered standard errors, Statistical Software Components S457369, Boston College Department of Economics. Machado, J.A.F. and Santos Silva, J.M.C. (2000), Glejser s Test Revisited, Journal of Econometrics, 97, 189-202. Parente, P.M.D.C. and Santos Silva, J.M.C. (2016). Quantile Regression with Clustered Data, Journal of Econometric Methods, forthcoming. Powell, J.L. (1984). Least Absolute Deviation Estimation for Censored Regression Model, Journal of Econometrics, 25, 303-325.