Inference approaches for instrumental variable quantile regression

Similar documents
Efficient GMM LECTURE 12 GMM II

Resampling Methods. X (1/2), i.e., Pr (X i m) = 1/2. We order the data: X (1) X (2) X (n). Define the sample median: ( n.

INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION. 1. Introduction

Properties and Hypothesis Testing

First Year Quantitative Comp Exam Spring, Part I - 203A. f X (x) = 0 otherwise

Slide Set 13 Linear Model with Endogenous Regressors and the GMM estimator

Statistical Inference Based on Extremum Estimators

LECTURE 14 NOTES. A sequence of α-level tests {ϕ n (x)} is consistent if

Rank tests and regression rank scores tests in measurement error models

Lecture 6 Simple alternatives and the Neyman-Pearson lemma

x iu i E(x u) 0. In order to obtain a consistent estimator of β, we find the instrumental variable z which satisfies E(z u) = 0. z iu i E(z u) = 0.

STA6938-Logistic Regression Model

Random Variables, Sampling and Estimation

11 THE GMM ESTIMATION

DS 100: Principles and Techniques of Data Science Date: April 13, Discussion #10

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

Algebra of Least Squares

Overview. p 2. Chapter 9. Pooled Estimate of. q = 1 p. Notation for Two Proportions. Inferences about Two Proportions. Assumptions

POLS, GLS, FGLS, GMM. Outline of Linear Systems of Equations. Common Coefficients, Panel Data Model. Preliminaries

Final Examination Solutions 17/6/2010

Economics 241B Relation to Method of Moments and Maximum Likelihood OLSE as a Maximum Likelihood Estimator

Lecture 19: Convergence

Asymptotic distribution of the first-stage F-statistic under weak IVs

The standard deviation of the mean

Statistics Lecture 27. Final review. Administrative Notes. Outline. Experiments. Sampling and Surveys. Administrative Notes

ENGI 4421 Confidence Intervals (Two Samples) Page 12-01

Regression with an Evaporating Logarithmic Trend

Confidence intervals summary Conservative and approximate confidence intervals for a binomial p Examples. MATH1005 Statistics. Lecture 24. M.

Lesson 11: Simple Linear Regression

Topic 9: Sampling Distributions of Estimators

¹Y 1 ¹ Y 2 p s. 2 1 =n 1 + s 2 2=n 2. ¹X X n i. X i u i. i=1 ( ^Y i ¹ Y i ) 2 + P n

32 estimating the cumulative distribution function

CEE 522 Autumn Uncertainty Concepts for Geotechnical Engineering

[412] A TEST FOR HOMOGENEITY OF THE MARGINAL DISTRIBUTIONS IN A TWO-WAY CLASSIFICATION

STAT Homework 1 - Solutions

A statistical method to determine sample size to estimate characteristic value of soil parameters

Last Lecture. Wald Test

Zeros of Polynomials

Linear Regression Models

17. Joint distributions of extreme order statistics Lehmann 5.1; Ferguson 15

Summary. Recap ... Last Lecture. Summary. Theorem

Mathematical Statistics - MS

TMA4245 Statistics. Corrected 30 May and 4 June Norwegian University of Science and Technology Department of Mathematical Sciences.

Lecture 7: October 18, 2017

Lecture 33: Bootstrap

Problem Set 4 Due Oct, 12

MATHEMATICAL SCIENCES PAPER-II

Basics of Probability Theory (for Theory of Computation courses)

MA Advanced Econometrics: Properties of Least Squares Estimators

Chapter 6 Principles of Data Reduction

ECONOMETRIC THEORY. MODULE XIII Lecture - 34 Asymptotic Theory and Stochastic Regressors

MATH 320: Probability and Statistics 9. Estimation and Testing of Parameters. Readings: Pruim, Chapter 4

Quantile regression with multilayer perceptrons.

Continuous Data that can take on any real number (time/length) based on sample data. Categorical data can only be named or categorised

A NEW METHOD FOR CONSTRUCTING APPROXIMATE CONFIDENCE INTERVALS FOR M-ESTU1ATES. Dennis D. Boos

Tests of Hypotheses Based on a Single Sample (Devore Chapter Eight)

Since X n /n P p, we know that X n (n. Xn (n X n ) Using the asymptotic result above to obtain an approximation for fixed n, we obtain

Journal of Multivariate Analysis. Superefficient estimation of the marginals by exploiting knowledge on the copula

November 2002 Course 4 solutions

EECS564 Estimation, Filtering, and Detection Hwk 2 Solns. Winter p θ (z) = (2θz + 1 θ), 0 z 1

Econ 325 Notes on Point Estimator and Confidence Interval 1 By Hiro Kasahara

Mathematical Notation Math Introduction to Applied Statistics

3.2 Properties of Division 3.3 Zeros of Polynomials 3.4 Complex and Rational Zeros of Polynomials

Topic 9: Sampling Distributions of Estimators

ECE 901 Lecture 12: Complexity Regularization and the Squared Loss

4. Partial Sums and the Central Limit Theorem

LINEAR REGRESSION ANALYSIS. MODULE IX Lecture Multicollinearity

Statistical Inference (Chapter 10) Statistical inference = learn about a population based on the information provided by a sample.

Approximate Confidence Interval for the Reciprocal of a Normal Mean with a Known Coefficient of Variation

Section 14. Simple linear regression.

Logit regression Logit regression

Goodness-Of-Fit For The Generalized Exponential Distribution. Abstract

Direction: This test is worth 250 points. You are required to complete this test within 50 minutes.

Stat 139 Homework 7 Solutions, Fall 2015

Estimation for Complete Data

UNIVERSITY OF TORONTO Faculty of Arts and Science APRIL/MAY 2009 EXAMINATIONS ECO220Y1Y PART 1 OF 2 SOLUTIONS

Goodness-of-Fit Tests and Categorical Data Analysis (Devore Chapter Fourteen)

1 Inferential Methods for Correlation and Regression Analysis

Simple Linear Regression

10. Comparative Tests among Spatial Regression Models. Here we revisit the example in Section 8.1 of estimating the mean of a normal random

Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

Discrete Mathematics for CS Spring 2008 David Wagner Note 22

A quick activity - Central Limit Theorem and Proportions. Lecture 21: Testing Proportions. Results from the GSS. Statistics and the General Population

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence

11 Correlation and Regression

Simple Regression Model

Stat 319 Theory of Statistics (2) Exercises

Topic 9: Sampling Distributions of Estimators

Geometry of LS. LECTURE 3 GEOMETRY OF LS, PROPERTIES OF σ 2, PARTITIONED REGRESSION, GOODNESS OF FIT

Summary: CORRELATION & LINEAR REGRESSION. GC. Students are advised to refer to lecture notes for the GC operations to obtain scatter diagram.

Statistics 511 Additional Materials


Frequentist Inference

K. Grill Institut für Statistik und Wahrscheinlichkeitstheorie, TU Wien, Austria

Math 2784 (or 2794W) University of Connecticut

CSE 527, Additional notes on MLE & EM

POWER COMPARISON OF EMPIRICAL LIKELIHOOD RATIO TESTS: SMALL SAMPLE PROPERTIES THROUGH MONTE CARLO STUDIES*

R. van Zyl 1, A.J. van der Merwe 2. Quintiles International, University of the Free State

Statistical Properties of OLS estimators

Stat 200 -Testing Summary Page 1

Transcription:

Ecoomics Letters 95 (2007) 272 277 www.elsevier.com/locate/ecobase Iferece approaches for istrumetal variable quatile regressio Victor Cherozhukov a, Christia Hase b,, Michael Jasso c a Massachusetts Istitute of Techology, Uited States b Uiversity of Chicago, Graduate School of Busiess, 5807 S. Woodlaw Ave., Chicago, IL 60637, Uited States c Uiversity of Califoria-Berkeley, Uited States Received 7 February 2006; received i revised form 13 September 2006; accepted 25 October 2006 Abstract We cosider asymptotic ad fiite sample cofidece bouds i istrumetal variables quatile regressios of wages o schoolig with relatively weak istrumets. We fid practically importat differeces betwee the asymptotic ad fiite sample iterval estimates. 2006 Elsevier B.V. All rights reserved. Keywords: Quatile regressio; Istrumetal variables; Schoolig JEL classificatio: C10; J31 1. Itroductio I this ote, we outlie three approaches to obtaiig iferece statemets for quatile regressio (QR) models with edogeeity: a asymptotic approximatio provided i Cherozhukov ad Hase (2006) that exteds the results of Koeker ad Bassett (1978) for the QR model with all exogeous variables to a model with edogeeity whe istrumets are available, a alterate approach to iferece preseted i Cherozhukov ad Hase (2006) that will be asymptotically valid uder weak idetificatio, ad a approach to obtaiig fiite sample cofidece regios for parameters of a model defied by quatile restrictios developed i Cherozhukov, Hase, ad Jasso (2005). We compare Correspodig author. Tel.: +1 773 834 1702. E-mail address: chase1@chicagogsb.edu (C. Hase). 0165-1765/$ - see frot matter 2006 Elsevier B.V. All rights reserved. doi:10.1016/j.ecolet.2006.10.016

V. Cherozhukov et al. / Ecoomics Letters 95 (2007) 272 277 273 the three approaches i a example due to Card (1995) whichusescollegeproximityasaistrumet for years of educatio i a quatile regressio of wages o educatio. We fid substatial differeces betwee the three approaches that appear to be due to the weakess of the istrumets. The fidigs suggest cautio should be used whe relyig o asymptotic approximatios i QR models with edogeeity. 2. Iferece approaches We cosider a radom coefficiet model with structural equatio give by Y ¼ DVaðUÞþXVbðUÞ; where D ad U may be statistically depedet, ð2:1þ A1. D α(u )+X β(u ) is strictly icreasig i U for almost every D ad X, A2. U U(0, 1), ad A3. U is idepedet of X ad Z variables that do ot eter the structural equatio. That is, D is a vector of potetially edogeous variables with radom coefficiets α(u ); X is a vector of exogeous variables that eter the structural equatio with radom coefficiets β(u ), ad Z is a vector of exogeous variables that are excluded from the structural equatio where it is assumed that dim (Z ) dim(d). This model icorporates the covetioal liear QR model of Koeker ad Bassett (1978) where α(u ) 0 ad Z=X. Uder coditios A1. A3., the problem of depedece betwee U ad D is overcome through the presece of istrumets, Z, that affect D but are idepedet of U. From Eq. (2.1) ad the mootoicity assumed i A1., the evet {Y D α(τ)+x β(τ)} is equivalet to the evet {U τ}. It the follows uder A2. ad A3. that P½Y V DVaðsÞþX VbðsÞjZ; X Š¼s: ð2:2þ Eq. (2.2) provides a momet restrictio that ca be used to estimate the structural parameters α(τ) ad β (τ). For example, oe may use this set of momet restrictios to form a GMM estimator; see Pakes ad Pollard (1989). Momet coditios (2.2) also suggest a differet procedure for estimatig α(τ) adβ(τ). For a give value of the structural parameter, say α, ru the ordiary QR of Y D α o X ad Z to obtai (βˆ(α,τ), γˆ (α,τ)) where γˆ (α,τ) are the estimated coefficiets o the istrumets Z. The ote that Eq. (2.2) implies that 0 is the τth coditioal quatile of Y D α(τ) X β (τ) givez ad X, so oe may estimate α(τ) by fidig a value for α that makes the coefficiet o the istrumetal variable γˆ (α,τ) as close to 0 as possible. That is, âðsþ ¼arg if ½ĝða; sþvš AðaÞ½ ˆ ĝða; sþš; aaa ð2:3þ where A is the parameter space for α, Â(α)=A(α)+o p (1) ad A(α) is positive defiite uiformly i p aaa. ffiffi It is coveiet to set A(α) equal to the iverse of the asymptotic co-variace matrix of ðĝða; sþ gða; sþþ i which case W (α) is the Wald statistic for testig γ(α,τ)=0. The parameter

274 V. Cherozhukov et al. / Ecoomics Letters 95 (2007) 272 277 estimates are the give by (αˆ (τ), βˆ (τ))=(αˆ (τ), βˆ (αˆ (τ),τ)). Cherozhukov ad Hase (2006) provide additioal details, verify that the above procedure cosistetly estimates α(τ) ad β(τ), ad fid the limitig distributio from which covetioal asymptotic iferece immediately follows. The above procedure may be modified to obtai cofidece itervals that will be asymptotically valid whe there is oly a weak relatioship betwee D ad Z. The idea is to base iferece o the Wald statistic W (α) for testig whether the coefficiets o the istrumets are zero (i.e. whether γ(α,τ)=0). The ituitio for this approach is that at the true α(τ) the structural Eq. (2.1) implies that γ(α,τ)=0. Thus, fixig a value for α(τ), γ(α,τ) should be ear zero if this value is cosistet with the structural equatio. More formally, whe a ¼ aðsþ; W ðaþ Y d v 2 ðdimðgþþ. Thus, a valid cofidece regio for α(τ) ca be based o the iversio of this Wald statistic: CR p [α(τ)]:={α:w (α)bc p } cotais α(τ) with probability approachig p, where c p is the p-percetile of a χ 2 (dim(γ)) distributio. Cherozhukov ad Hase (2006) show that this procedure provides a valid cofidece regio for α(τ) without puttig restrictios o the depedece betwee D ad Z. The fial approach to iferece we cosider is based o the observatio that P[Y D α(τ)+x β(τ) Z,X ]=τ implies that the evet {Y D α(τ)+x β(τ)} is distributed exactly as a Beroulli(τ) coditioal o X ad Z regardless of the sample size. Thus, ay test statistic that depeds oly o this evet, X, adz will have a distributio that does ot deped o ay ukow parameters i fiite samples ad so ca be used to costruct valid fiite sample iferece statemets. This basic approach exteds work o fiite sample iferece for ucoditioal quatiles; see Walsh (1960) ad MacKio (1964). Cherozhukov et al. (2005) make use of the aforemetioed distributioal fact ad cosider iferece based o the GMM objective fuctio: L ða; bþ ¼ 1 2 1 p X ffiffiffi VW 1 m i ða; bþ! p X ffiffi! m i ða; bþ ; ð2:4þ where m i (α,β)=[τ 1(Y i D i α X i β)] (Z i,x i ). I this expressio, W is a positive defiite weight matrix, which is fixed coditioal o (X 1,Z 1 ),, (X,Z ). W ¼ 1 sð1 sþ 1 X ðz i V; X i VÞVðZ i V; X i VÞ! 1 ; which equals the iverse of the variace of 1=2 P m iðaðsþ; bðsþþ coditioal o (X 1,Z 1 ),, (X,Z ), is a atural choice of W. Sice statistic L (α,β) defied by Eq. (2.4) depeds oly o X, Z, ad {Y D α+x β} it is coditioally pivotal i fiite samples whe evaluated at (α,β)=(α(τ), β(τ) ) ; that is, its distributio at the true parameter values does ot deped o ay ukow parameters ad so exact fiite sample critical values for this statistic may be obtaied. Let c ( p) deotethepth quatile of the distributio of the statistic give i Eq. (2.4), ad ote that c ( p) may be obtaied easily through simulatio by, for j =1,, J, 1.drawig(B i, j, 1 p X V ffiffiffi m 1 i; j W ffiffi p X where mi, j = i ) as iid Beroulli(τ) radom variables, 2. computig L ;j ¼ 1 2 m i; j [τ B i, j ](Z, i X ), i ad 3. obtaiig c (α) astheα-quatile of the sample (L, j, j =1,, J ) for a large umber J.

V. Cherozhukov et al. / Ecoomics Letters 95 (2007) 272 277 275 Table 1 Schoolig coefficiet Estimatio method τ=0.25 τ =0.50 τ=0.75 A. Quatile regressio (o istrumets) α(τ) 0.074 0.074 0.079 Quatile regressio (asymptotic) (0.064,0.083) (0.065,0.083) (0.071,0.087) Fiite sample (0.047,0.100) (0.050,0.101) (0.057,0.098) B. IV Quatile regressio (proximity to 2 ad 4 year college as istrumets) α(τ) 0.175 0.033 0.103 Iverse quatile regressio (asymptotic) (0.081,0.269) ( 0.077,0.142) (0.025,0.180) Weak istrumet (asymptotic) (0.018,0.500] ( 0.053,0.500] (0.068,0.483) Fiite sample [ 0.100,0.500] [ 0.100,0.500] [ 0.100,0.500] Note: 95% level cofidece iterval estimates for the schoolig coefficiet i a simple wage model. Schoolig is treated as exogeous i Pael A ad as edogeous i Pael B. The first row i each pael reports the poit estimate, ad the secod row reports the iterval estimate obtaied usig the asymptotic approximatio. I Pael A, the third row reports a iterval estimate obtaied usig the fiite sample approach. I Pael B, the third row reports a weak-istrumet robust iterval, ad the fourth row reports the fiite sample iterval. Give c (p), a valid p-level cofidece regio for (α(τ), β(τ) ) may the be obtaied as the set CR( p) {(α, β ) : L (α, β ) c ( p)}. CR( p) defies a joit cofidece set for all of the model's parameters from which oe ca defie a cofidece boud for a real-valued fuctioal ψ(α(τ), β(τ), τ) as CR(p,ψ)={ψ(α, β,τ), (α,β ) CR(p)}. It is worth otig that the joit cofidece regio CR( p) is ot coservative, but cofidece bouds for fuctioals may be. Also, formig cofidece regios by ivertig a test-statistic will geerally be computatioally demadig. Cherozhukov et al. (2005) presets various feasible algorithms for computig cofidece regios ad bouds i this cotext. 3. Case study: Card's (1995) schoolig example We illustrate the differet approaches to iferece by estimatig a wage model that allows for heterogeeity i the effect of schoolig o wages as well as other heterogeeity i the effects of other cotrol variables. Because we are cocered that the level of schoolig ad earigs may be joitly determied, we istrumet for schoolig. We employ the same data ad idetificatio strategy as Card (1995). Specifically, we suppose that the log of the wage is determied by the followig liear quatile model: Y ¼ aðuþs þ X VbðUÞ where Y is the log of the hourly wage; S is years of completed schoolig; X cosists of a costat ad 14 demographic cotrols; ad U is a uobservable ormalized to follow a uiform distributio over (0,1). We might thik of U as idexig uobserved ability, i which case α(τ) may be thought of as the retur to schoolig for a idividual with uobserved ability τ. Sice we believe that years of schoolig may be joitly determied with uobserved ability, we use a idicator for residece ear a two year college i 1966 ad a idicator for residece ear a four year college i 1966 as istrumets for schoolig. Further

276 V. Cherozhukov et al. / Ecoomics Letters 95 (2007) 272 277 details about the data sources, descriptive statistics, ad argumets for the validity of the istrumets are i Card (1995). We summarize results for the schoolig coefficiet i Table 1. For quatile regressio, we report 95%- level cofidece itervals computed via the asymptotic approximatio ad fiite sample approach. For istrumetal variable quatile regressio, we report 95%-level cofidece itervals computed usig the asymptotic approximatio, the weak-istrumet robust approach, ad the fiite sample approach. Both the fiite sample ad weak-istrumet approach require that we ivert a test-statistic for a rage of potetial values for α(τ); i all cases, we cosider values i the iterval [.1,.5] which we feel covers essetially all plausible values for the retur to a additioal year of schoolig. The results have a umber of iterestig features. Lookig first at the results i Pael A of Table 1 which treat schoolig as exogeous, we see that the poit estimates across the various quatiles are quite close, suggestig little heterogeeity i how the quatiles of wages vary with schoolig. We also see that the fiite sample ad asymptotic itervals are qualitatively similar, though the fiite sample itervals are wider tha the asymptotic itervals. I the estimates which treat schoolig as edogeous, the differeces betwee the iferece approaches are strikig. The poit estimates suggest cosiderable heterogeeity i the returs to schoolig, though eve the usual asymptotic itervals are wide eough to make fidig statistical evidece for this quite ulikely. Cosiderig ext the weak istrumet robust itervals, we see that they are substatially wider tha the usual asymptotic itervals i all cases. For each quatile we cosider, the weak istrumet robust itervals iclude essetially the etire parameter space. The weak istrumet robust itervals do exclude 0 i the two cases where the usual itervals do. While the weak istrumet robust itervals rely o less striget assumptios tha do the usual asymptotic itervals, they still rely o a asymptotic argumet. Lookig last at the fiite sample itervals which are valid uder miimal assumptios ad do ot require ay asymptotic argumet, we see that they cover the etire parameter space i every case. The likely cause for the large discrepacies is the weakess of the relatioship betwee the istrumets ad schoolig. If oe regresses educatio o the cotrol variables ad the two proximity istrumets, the F-statistic o the excluded istrumets is 8.32 which is i the rage that may would cosider weak i the usual liear IV model. The weakess of the istrumets clearly explais the differece betwee the usual asymptotic ad weak istrumet asymptotic results. The differeces betwee the weak istrumet ad fiite sample itervals is the likely drive by the fact that with a large umber of covariates ad weak istrumets the effective sample size is quite small. It would be iterestig to explore these differeces ad develop formal procedures for judgig differeces betwee asymptotic ad fiite sample itervals. Overall, the differece betwee the asymptotic ad fiite sample iferece statemets calls the validity of the asymptotics ito questio i this example. Refereces Card, D., 1995. Usig geographic variatio i college proximity to estimate the retur to schoolig. I: Christofides, L., Grat, E.K., Swidisky, R. (Eds.), Aspects of Labour Ecoomics: Essays i Hoour of Joh Vaderkamp. Uiversity of Toroto Press. Cherozhukov, V., Hase, C., 2006. Istrumetal quatile regressio iferece for structural ad treatmet effect models. J. Ecoom. 132 (2), 491 525. Cherozhukov, V., Hase, C., Jasso, M., 2005. Fiite-Sample Iferece i Ecoo-metric Models via Quatile Restrictios, mimeo.

V. Cherozhukov et al. / Ecoomics Letters 95 (2007) 272 277 277 Koeker, R., Bassett, G.S., 1978. Regressio quatiles. Ecoometrica 46, 33 50. MacKio, W.J., 1964. Table for both the sig test ad distributio-free cofidece itervals of the media for sample sizes to 1000. J. Am. Stat. Assoc. 59, 935 956. Pakes, A., Pollard, D., 1989. Simulatio ad asymptotics of optimizatio estimators. Ecoometrica 57, 1027 1057. Walsh, J.E., 1960. Noparametric tests for media by iterpolatio from sig tests. A. Ist. Stat. Math. 11, 183 188.