What s New in Econometrics? Lecture 14 Quantile Methods

Similar documents
New Developments in Econometrics Lecture 16: Quantile Estimation

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,

Quantile methods. Class Notes Manuel Arellano December 1, Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

CENSORED DATA AND CENSORED NORMAL REGRESSION

Flexible Estimation of Treatment Effect Parameters

Econometric Analysis of Cross Section and Panel Data

Jeffrey M. Wooldridge Michigan State University

CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M.

A Course in Applied Econometrics Lecture 7: Cluster Sampling. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

CORRELATED RANDOM EFFECTS MODELS WITH UNBALANCED PANELS

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation

UNIVERSITY OF CALIFORNIA Spring Economics 241A Econometrics

A Course in Applied Econometrics Lecture 4: Linear Panel Data Models, II. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Non-linear panel data modeling

Thoughts on Heterogeneity in Econometric Models

Control Function and Related Methods: Nonlinear Models

INVERSE PROBABILITY WEIGHTED ESTIMATION FOR GENERAL MISSING DATA PROBLEMS

Econometrics. Week 6. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics

Short T Panels - Review

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

Lecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook)

Final Exam. Economics 835: Econometrics. Fall 2010

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

1. Overview of the Basic Model

Panel Data Seminar. Discrete Response Models. Crest-Insee. 11 April 2008

A Practitioner s Guide to Cluster-Robust Inference

Review of Panel Data Model Types Next Steps. Panel GLMs. Department of Political Science and Government Aarhus University.

Informational Content in Static and Dynamic Discrete Response Panel Data Models

Quantile regression and heteroskedasticity

Comments on: Panel Data Analysis Advantages and Challenges. Manuel Arellano CEMFI, Madrid November 2006

ECONOMETFUCS FIELD EXAM Michigan State University May 11, 2007

EMERGING MARKETS - Lecture 2: Methodology refresher

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

Specification testing in panel data models estimated by fixed effects with instrumental variables

Robust covariance estimation for quantile regression

Estimating Panel Data Models in the Presence of Endogeneity and Selection

Binary Models with Endogenous Explanatory Variables

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data

Panel data methods for policy analysis

QUANTILE REGRESSION WITH CENSORING AND ENDOGENEITY

Difference-in-Differences Estimation

Impact Evaluation Technical Workshop:

Panel Data Models. James L. Powell Department of Economics University of California, Berkeley

ECONOMETRICS FIELD EXAM Michigan State University May 9, 2008

INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION. 1. Introduction

ECO Class 6 Nonparametric Econometrics

Program Evaluation with High-Dimensional Data

Applied Microeconometrics (L5): Panel Data-Basics

ESTIMATION OF TREATMENT EFFECTS VIA MATCHING

Lecture 9: Quantile Methods 2

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley

α version (only brief introduction so far)

Regression with a Single Regressor: Hypothesis Tests and Confidence Intervals

INTRODUCTION TO MULTILEVEL MODELLING FOR REPEATED MEASURES DATA. Belfast 9 th June to 10 th June, 2011

A Guide to Modern Econometric:

Intermediate Econometrics

What s New in Econometrics. Lecture 13

Identification of Panel Data Models with Endogenous Censoring

A Test for Rank Similarity and Partial Identification of the Distribution of Treatment Effects Preliminary and incomplete

Quantile Regression for Panel Data Models with Fixed Effects and Small T : Identification and Estimation

Review of Econometrics

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares

WORKING P A P E R. Unconditional Quantile Treatment Effects in the Presence of Covariates DAVID POWELL WR-816. December 2010

Biost 518 Applied Biostatistics II. Purpose of Statistics. First Stage of Scientific Investigation. Further Stages of Scientific Investigation

Specification Tests in Unbalanced Panels with Endogeneity.

1 Motivation for Instrumental Variable (IV) Regression

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case

On the Use of Linear Fixed Effects Regression Models for Causal Inference

WISE International Masters

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

K. Model Diagnostics. residuals ˆɛ ij = Y ij ˆµ i N = Y ij Ȳ i semi-studentized residuals ω ij = ˆɛ ij. studentized deleted residuals ɛ ij =

Econ 273B Advanced Econometrics Spring

Quantitative Analysis of Financial Markets. Summary of Part II. Key Concepts & Formulas. Christopher Ting. November 11, 2017

A simple alternative to the linear probability model for binary choice models with endogenous regressors

Subject CS1 Actuarial Statistics 1 Core Principles

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case

ECNS 561 Multiple Regression Analysis

A Test for Rank Similarity and Partial Identification of the Distribution of Treatment Effects Preliminary and incomplete

Michael Lechner Causal Analysis RDD 2014 page 1. Lecture 7. The Regression Discontinuity Design. RDD fuzzy and sharp

Exam D0M61A Advanced econometrics

Econometrics Summary Algebraic and Statistical Preliminaries

Parametric identification of multiplicative exponential heteroskedasticity ALYSSA CARLSON

Discussion of Bootstrap prediction intervals for linear, nonlinear, and nonparametric autoregressions, by Li Pan and Dimitris Politis

IV Quantile Regression for Group-level Treatments, with an Application to the Distributional Effects of Trade

Independent and conditionally independent counterfactual distributions

Data Models. Dalia A. Ghanem. May 8, Abstract. Recent work on nonparametric identification of average partial effects (APEs) from panel

Censored quantile instrumental variable estimation with Stata

Testing for Rank Invariance or Similarity in Program Evaluation: The Effect of Training on Earnings Revisited

Increasing the Power of Specification Tests. November 18, 2018

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

Jinyong Hahn. Department of Economics Tel: (310) Bunche Hall Fax: (310) Professional Positions

Chapter 27 Summary Inferences for Regression

On IV estimation of the dynamic binary panel data model with fixed effects

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

A Course in Applied Econometrics. Lecture 5. Instrumental Variables with Treatment Effect. Heterogeneity: Local Average Treatment Effects.

Transcription:

What s New in Econometrics? Lecture 14 Quantile Methods Jeff Wooldridge NBER Summer Institute, 2007 1. Reminders About Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile Regression with Endogenous Explanatory Variables 4. Quantile Regression for Panel Data 1

5. Quantile Methods for Censored Data 1. Reminders About Means, Medians, and Quantiles Consider the standard linear model in a population, with intercept and K 1 slopes : y x u. (1) Assume E u 2, so that the distribution of u is not too spread out. Given a large random sample, when should we expect ordinary least squares, which solves 2

min a,b N i 1 y i a x i b 2, (2) and least absolute deviations (LAD), which solves min a,b N i 1 y i a x i b, (3) to provide similar parameter estimates? There are two important cases. If D u x is symmetric about zero (4) then OLS and LAD both consistently estimate and. If u is independent of x with E u 0, (5) 3

where E u 0 is the normalization that identifies, then OLS and LAD both consistently estimate the slopes,. Ifu has an asymmetric distribution, then Med u 0, and LAD converges to because Med y x x Med u x x. In many applications, neither (4) nor (5) is likely to be true. For example, y may be a measure of wealth, in which case the error distribution is probably asymmetric and Var u x not constant. Therefore, it is important to remember that if D u x is asymmetric and changes with x, then we should not expect OLS and LAD to deliver similar estimates of, even for thin-tailed distributions. It is important to separate discussions of resiliency to outliers from the 4

different quantities identified by least squares (E y x ) and least absolution deviations Med y x. Of course, LAD is much more resilient to changes in extreme values because, as a measure of central tendency, the median is much less sensitive than the mean to changes in extreme values. But it does not follow that a large difference in OLS and LAD estimates means something is wrong with OLS. Big advantage for median over mean: the median passes through monotonic functions. For example, if log y x u and Med u x 0, then Med y x exp Med log y x exp x. By contrast, we cannot generally find E y x exp x E exp u x. 5

But the expectation operator has useful properties that the median does not: linearity and the law of iterated expectations. Suppose we begin with a random coefficient model y i a i x i b i, (6) If a i, b i is independent of x i, then E y i x i E a i x i x i E b i x i x i, (7) where E a i and E b i. So OLS consistently estimates and. By contrast, no way to derive Med y i x i without imposing more restrictions. What can we add so that LAD estimates something of interest in (7)? If u i is a vector, then its distribution conditional on x i is centrally 6

symmetric if D u i x i D u i x i, which implies that, if g i is any vector function of x i,d g i u i x i has a univariate distribution that is symmetric about zero. This implies E u i x i 0. Apply central symmetry to random coefficient model by writing c i a i, b i with E c i, and let d i c i. Then y i x i a i x i b i (8) with g i 1, x i.ifc i given x i is centrally symmetric about, then Med g i c i x i 0, and LAD applied to the usual model y i x i u i consistently estimates and. For 0 1, q is the th quantile of y i if P y i q P y i q 1. 7 and

Usually, we are interested in how covariates affect quantiles (of which the median is the special case with 1/2. Under linearity, Quant y i x i x i. (9) Under (9), consistent estimators of and are obtained by minimizing the check function: min, K N i 1 c y i x i, (10) where c u 1 u 0 1 1 u 0 u 1 u 0 u and 1 is the indicator function. Consistency is relatively easy to establish because, are known to minimize E c y i x i (for example, Manski (1988)). Asymptotic 8

normality is more difficult because any sensible definition of the Hessian of the objective function, away from the nondifferentiable kink, is identically zero. But it has been worked out under a variety of conditions; see Koenker (2005) for a recent treatment. 2. Some Useful Asymptotic Results What Happens if the Quantile Function is Misspecified? Property of OLS: if and are the plims from the OLS regression y i on 1, x i then these provide the smallest mean squared error approximation to E y x x in that, solve min, E x x 2. (11) Under restrictive assumptions on distribution of x, j can be equal to or 9

proportionl to average partial effects. Linear quantile formulation has been viewed by several authors as an approximation (Buchinsky (1991), Chamberlain (1991), Abadie, Angrist, Imbens (2002)). Recently, Angrist, Chernozhukov, and Fernandez-Val (2006) characterized the probability limit of the quantile regression estimator. Absorb the intercept into x and let be the solution to the population quantile regression problem. ACF show that solves min E w x, q x x 2, (12) where the weight function w x, is 10

w x, 0 1 1 u fy x ux 1 u q x x du. (13) In other words, is the best weighted mean square approximation to the true quantile function, where the weights depend on average of the conditional density of y i over a line from x, to the true quantile function, q x. Computing Standard Errors For given, write y i x i u i, Quant u i x i 0, (14) and let be the quantile estimator. Define quantile residuals û i y i x i. Under weak conditions (see, for example, Koenker 11

(2005)), N is asymptotically normal with asymptotic variance A 1 BA 1, where A E f u 0 x i x i x i (15) and B 1 E x i x i. (16) When we assume the quantile function is actually linear, a consistent estimator of B is B 1 N N 1 i 1 x i x i. (17) Generally, a consistent estimator of A is (Powell (1986, 1991)) 12

N Â 2Nh N 1 1 û i h N x i x i, i 1 (18) where h N 0 is a nonrandom sequence shrinking to zero as N with N h N. For example, h N an 1/3 for any a 0. Might use a smoothed version so that all residuals contribute. Works for reasons similar to heteroskedasticity-robust standard errors. If u i and x i are independent, Avar N and Avar is estimated as 1 f u 0 2 E x i x i 1, 13 (19)

Avar 1 f u 0 2 N N 1 i 1 x i x i 1, (20) where, say, f u 0 is the histogram estimator N f u 0 2Nh N 1 i 1 1 û i h N. (21) Estimate in (20) is commonly reported (by, say, Stata). If the quantile function is misspecified, even the robust form of the variance matrix, based on the estimate in (20), is not valid. In the generalized linear models literature, the distinction is sometimes made between a fully robust variance estimator and a semi-robust 14

variance estimator. If mean is correctly specified and estimator allows unspecified variance, it is semi-robust. If the mean is allowed to be misspecified, fully robust. For quantile regression, a fully robust variance requires a different estimator of B. Kim and White (2002) and Angrist, Chernozhukov, and Fernández-Val (2006) show B N N 1 1 û i 0 2 x i x i i 1 (22) is generally consistent, and then Avar  1 B  1 with  given by (18). Hahn (1995, 1997) shows that the nonparametric bootstrap and the 15

Bayesian bootstrap generally provide consistent estimates of the fully robust variance without claims about the conditional quantile being correct. Bootstrap does not provide asymptotic refinements for testing and confidence intervals. ACF provide the covariance function for the process : 1 for some 0, which can be used to test hypotheses jointly across multiple quantiles (including all quantiles at once). Example using Abadie (2003). These are nonrobust standard errors. nettfa is net total financial assets. 16

Dependent Variable: nettfa Explanatory Variable Mean (OLS).25 Quantile Median (LAD).75 Quantile inc. 783. 0713. 324. 798. 104. 0072. 012. 025 age 1. 568. 0336. 244 1. 386 1. 076. 0955. 146. 287 age 2. 0284. 0004. 0048. 0242. 0138. 0011. 0017. 0034 e401k 6. 837 1. 281 2. 598 4. 460 2. 173. 263. 404. 801 N 2, 017 2, 017 2, 017 2, 017 17

3. Quantile Regression with Endogenous Explanatory Variables Suppose y 1 z 1 1 1 y 2 u 1, (23) where z is exogenous and y 2 is endogenous whatever that means in the context of quantile regression. First, LAD. Amemiya s (1982) two-stage LAD estimator adds a reduced form for y 2,say y 2 z 2 v 2. (24) First step applies OLS or LAD to (24), and gets fitted values, y i2 z i 2. These are inserted for y i2 to give LAD of y i1 on z i1, ŷ i2.the 2SLAD estimator relies on symmetry of the composite error 1 v 2 u 1 18

given z. If D u 1, v 2 z is centrally symmetric, can use a control function approach. Write u 1 1 v 2 e 1, (25) where e 1 given z would have a symmetric distribution. Get LAD residuals v i2 y i2 z i 2 and do LAD of y i1 on z i1, y i2, v i2.uset test on v i2 to test null that y 2 is exogenous. Interpretation of LAD in context of omitted variables is difficult unless lots of symmetry assumed. Abadie (2003) and Abadie, Angrist, and Imbens (2002) define and estimate policy parameters with a binary endogenous treatment, say D, 19

and binary instrumental variable, say Z. The potential outcomes are Y d, d 0, 1 that is, without treatment and with treatment, respectively. The counterfactuals for treatment are D z, z 0, 1. Observed are X, Z, D 1 Z D 0 ZD 1, and Y 1 D Y 0 DY 1. AAI study treatment effects for compliers, that is, the (unobserved) subpopulation with D 1 D 0. The assumptions are Y 1, Y 0, D 1, D 0 independent of Z conditional on X 0 P Z 1 X 1 P D 1 1 X P D 0 1 X P D 1 D 0 X 1. (26) (27) (28) (29) 20

Under these assumptions, treatment is unconfounded for compliers: D Y 0, Y 1 D, X, D 1 D 0 D Y 0, Y 1 X, D 1 D 0 (30) and so treatment effects can be defined based on D Y X, D, D 1 D 0, where Y is the observed outcome. AAI focus on quantile treatment effects (Abadie looks at other distributional features): Quant Y X, D, D 1 D 0 D X. (31) (This results in estimated differences for the quantiles of Y 1 and Y 0, not the quantile of the difference Y 1 Y 0. If the dummy variable C 1 D 1 D 0 could be observed, problem would be straightforward. Would like to use linear quantile estimation for the subpopulation C 1 because the parameters solve 21

min E C g Y, X, D,,, (32) where g Y, X, D,, c Y D X is the check function for a linear quantile estimation. Instead, can solve min, E U g Y, X, D,,, (33) where U Y, X, D and U P C 1 U. AAI show v U 1 D 1 v U 1 X 1 D v U X, (34) where v U P Z 1 U, and X P Z 1 X, which can both be estimated using observed data. Two-step estimator solves 22

min N i 1 1 v U i 0 v U i c Y i W i. (35) where W i D i, X i and contains and. The indicator function 1 v U i 0 ensures that only observations with nonnegative weights are used. Can use flexible parametric models (series) estimators for v u and x. Chernozhukov and Hansen (2005, 2006) consider identification and estimation of QTEs in a model with endogenous treatment. Let q d, x, denote the th quantile function for treatment level D d and covariates x. In the binary case, CH define the QTE as QTE x q 1, x, q 0, x,. (36) 23

CH use the representation that Y d, conditional on X x, can be expressed as Y d q d, x, U d (37) where U d Z ~Uniform 0, 1, (38) and Z is the instrumental variable for treatment assignment, D. Key assumptions are that q d, x, u is strictly increasing in u and a rank invariance condition, whose simplest form is conditional on X x and Z z, U d does not depend on d. CH show that, with the observed Y defined as Y q D, X, U D, 24

P Y q D, X, X, Z P Y q D, X, X, Z. (39) If we could take Z D, (39) would define the quantile Quant Y D, X. Generally, it defines conditional moment conditions E 1 Y q D, X, X, Z 0, (40) which is analogous to conditional moment conditions in models with additive errors. Chernozhukov and Hansen (2006) assume a linear functional form and obtain the quantile regression instrumental variables estimator. 4. Quantile Regression for Panel Data Without unobserved effects, easy to use quantile regression methods on panel data: 25

Quant y it x it x it, t 1,...,T. (41) Use pooled quantile regression. But need to generally account for serial correlation in the scores, Use s it x it 1 y it x it 0 1 1 y it x it 0. N B N 1 i 1 T t 1 T r 1 s it s ir (42) and then N Â 2Nh N 1 i 1 T t 1 1 û it h N x it x it. (43) 26

Explicitly allowing unobserved effects is harder. Quant y it x i, c i Quant y it x it, c i x it c i. (44) Fixed effects approach, where do not restrict D c i x i, is attractive. From Honoré (1992) applied to the uncensored case, LAD on the first differences is consistent when u it : t 1,...,T is an iid. sequence conditional on x i, c i, even if the common distribution is not symmetric. But this is a fairly strong assumption. When T 2, applying LAD on the first differences is equivalent to estimating the c i along with. Generally, an incidental parameters problem with small T. Alternative suggested by Abrevaya and Dahl (2006) for T 2. In 27

Chamberlain s correlated random effects linear model, E y t x 1, x 2 t x t x 1 1 x 2 2, t 1, (45) E y 1 x x 1 E y 2 x x 1. (46) Abrevaya and Dahl suggest modeling Quant y t x 1, x 2 as in (46) and then defining the partial effect as Quant y 1 x x 1 Quant y 2 x x 1. (47) Generally, correlated random effects approaches are hampered because finding quantiles of sums of random variables is difficult. Suppose we write c i x i a i and then 28

y it x it x i a i u it. (48) Generally, v it a i u it will not have zero conditional quantile. Could just estimate (48) by pooled quantile regression for different quantiles and use the ACF results on approximating quantiles. A little more flexibility if we start with median, y it x it c i u it,med u it x i, c i 0, (49) and make symmetry assumptions. If D u i x i D u i x i then all linear combinations of the errors have a symmetric distribution, and so we can apply LAD to the time-demeaned equation ÿ it ẍ it ü it, being sure to obtain fully robust standard errors for pooled LAD. If we impose the Chamberlain-Mundlak device as in (48), we can get 29

by with central symmetry of D a i, u it x i has a symmetric distribution around zero then D a i u it x i is symmetric about zero, and, if this holds for each t, pooled LAD of y it on 1, x it, and x i consistently estimates t,,. (If we use pooled OLS with x i included, we obtain the FE estimate.) Should use robust inference. 30

5. Quantile Methods for Censored Data Censored LAD applicable to data censoring and and corner solutions. Very useful for true data censoring, where parameters of underlying linear model are of interest. w i is the response variable (say, wealth or log of a duration) following w i x i u i, (50) but it is top coded or right censored at r i, then we can estimate under the assumption Med u i x i, r i 0 (51) because Med y i x i, r i min x i, r i where y i min y i, r i. Leads to Powell s (1986) CLAD estimator. (Need to always observe r i ;see 31

Honoré, Khan, and Powell (2002) to relax.) Less clear that CLAD is better than parametric models for corner solution responses. CLAD identifies a single feature of D y x, namely, Med y x. Models such as Tobit assume more but deliver more. Not just enough to estimate parameters. Common model for corner at zero: y max 0, x u, Med u x 0. (52) j measures the partial effects on Med y x max 0, x once Med y x 0. A model no more or less restrictive than (52) is y a exp x, E a x 1, (53) in which case E y x exp x is identified. Allows for corner because 32

P a 0 x 0 is allowed. How to interpret panel data applications of CLAD for corner solutions? Med y it x i, c i max 0, x it c i. (54) Honoré (1992), Honoré and Hu (2004) show how to estimate under exchangeability assumptions on the idiosyncratic errors in the latent variable model. The partial effect of x tj on Med y it x it x t, c i c is tj x t, c 1 x t c 0 j. (55) What values should we insert for c? We need to know something about D c i. The average of (55) across the distribution of unobserved heterogeneity would be average partial effects (on the median). Again, 33

we need to identify D c i.the j give us the sign and relative effects of the APEs. If c i has a Normal c, c 2 distribution, then it is easy to show E ci tj x t, c i c x t / c j. 34