Endogenous binary choice models with median restrictions: A comment

Similar documents
The relationship between treatment parameters within a latent variable framework

Simplified marginal effects in discrete choice models

Identification of Regression Models with Misclassified and Endogenous Binary Regressor

Estimation of growth convergence using a stochastic production frontier approach

Generalized Roy Model and Cost-Benefit Analysis of Social Programs 1

Generated Covariates in Nonparametric Estimation: A Short Review.

A Note on Adapting Propensity Score Matching and Selection Models to Choice Based Samples

Econometric Analysis of Games 1

Flexible Estimation of Treatment Effect Parameters

Microeconometrics. C. Hsiao (2014), Analysis of Panel Data, 3rd edition. Cambridge, University Press.

Instrumental Variables in Models with Multiple Outcomes: The General Unordered Case

NBER WORKING PAPER SERIES A NOTE ON ADAPTING PROPENSITY SCORE MATCHING AND SELECTION MODELS TO CHOICE BASED SAMPLES. James J. Heckman Petra E.

The Generalized Roy Model and Treatment Effects

Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models

GLS detrending and unit root testing

IDENTIFICATION OF MARGINAL EFFECTS IN NONSEPARABLE MODELS WITHOUT MONOTONICITY

Dummy Endogenous Variables in Weakly Separable Models

Estimation of Treatment Effects under Essential Heterogeneity

Made available courtesy of Elsevier:

On Uniform Inference in Nonlinear Models with Endogeneity

Supplementary material to: Tolerating deance? Local average treatment eects without monotonicity.

Bootstrap tests of multiple inequality restrictions on variance ratios

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Mathematical Social Sciences

IDENTIFICATION OF THE BINARY CHOICE MODEL WITH MISCLASSIFICATION

Least Squares Estimation of a Panel Data Model with Multifactor Error Structure and Endogenous Covariates

Semiparametric Identification in Panel Data Discrete Response Models

Economics 241B Review of Limit Theorems for Sequences of Random Variables

Identification in Nonparametric Limited Dependent Variable Models with Simultaneity and Unobserved Heterogeneity

Selection endogenous dummy ordered probit, and selection endogenous dummy dynamic ordered probit models

Parametric identification of multiplicative exponential heteroskedasticity ALYSSA CARLSON

Conditions for the Existence of Control Functions in Nonseparable Simultaneous Equations Models 1

A distance measure between cointegration spaces

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2014 Instructor: Victor Aguirregabiria

Panel Threshold Regression Models with Endogenous Threshold Variables

Identification in Discrete Choice Models with Fixed Effects

Identification for Difference in Differences with Cross-Section and Panel Data

Another Look at the Identification at Infinity of Sample Selection Models

Discussion of Bootstrap prediction intervals for linear, nonlinear, and nonparametric autoregressions, by Li Pan and Dimitris Politis

Comments on: Panel Data Analysis Advantages and Challenges. Manuel Arellano CEMFI, Madrid November 2006

Consistency and Asymptotic Normality for Equilibrium Models with Partially Observed Outcome Variables

ESTIMATION OF NONPARAMETRIC MODELS WITH SIMULTANEITY

Birkbeck Working Papers in Economics & Finance

Econometric Analysis of Cross Section and Panel Data

Control Functions in Nonseparable Simultaneous Equations Models 1

A Note on the Closed-form Identi cation of Regression Models with a Mismeasured Binary Regressor

A Note on the Correlated Random Coefficient Model. Christophe Kolodziejczyk

Identi cation of Positive Treatment E ects in. Randomized Experiments with Non-Compliance

Statistical models and likelihood functions

Information Structure and Statistical Information in Discrete Response Models

What do instrumental variable models deliver with discrete dependent variables?

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

A note on Two-warehouse inventory model with deterioration under FIFO dispatch policy

Revisiting independence and stochastic dominance for compound lotteries

Comparison of inferential methods in partially identified models in terms of error in coverage probability

Missing dependent variables in panel data models

What s New in Econometrics? Lecture 14 Quantile Methods

INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION. 1. Introduction

Efficient Estimation of Dynamic Panel Data Models: Alternative Assumptions and Simplified Estimation

An Overview of the Special Regressor Method

Identification and Estimation Using Heteroscedasticity Without Instruments: The Binary Endogenous Regressor Case

Federal Reserve Bank of Chicago

On IV estimation of the dynamic binary panel data model with fixed effects

IDENTIFICATION OF TREATMENT EFFECTS WITH SELECTIVE PARTICIPATION IN A RANDOMIZED TRIAL

Supplement to Quantile-Based Nonparametric Inference for First-Price Auctions

Testing for Rank Invariance or Similarity in Program Evaluation: The Effect of Training on Earnings Revisited

Specification Test for Instrumental Variables Regression with Many Instruments

FLEXIBLE CORRELATED RANDOM EFFECTS ESTIMATION IN PANEL MODELS WITH UNOBSERVED HETEROGENEITY

The Econometric Evaluation of Policy Design: Part I: Heterogeneity in Program Impacts, Modeling Self-Selection, and Parameters of Interest

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation

poisson: Some convergence issues

The Generalized Roy Model and the Cost-Benefit Analysis of Social Programs

Parametric Identification of Multiplicative Exponential Heteroskedasticity

SIMILAR-ON-THE-BOUNDARY TESTS FOR MOMENT INEQUALITIES EXIST, BUT HAVE POOR POWER. Donald W. K. Andrews. August 2011

UNIVERSITY OF NOTTINGHAM. Discussion Papers in Economics CONSISTENT FIRM CHOICE AND THE THEORY OF SUPPLY

Of Nickell Bias, Cross-Sectional Dependence, and Their Cures: Reply

Random Coefficients on Endogenous Variables in Simultaneous Equations Models

Nonparametric Identi cation of Regression Models Containing a Misclassi ed Dichotomous Regressor Without Instruments

Unconditional Quantile Regression with Endogenous Regressors

Identification in Nonparametric Models for Dynamic Treatment Effects

An Alternative Assumption to Identify LATE in Regression Discontinuity Designs

Identification of Nonparametric Simultaneous Equations Models with a Residual Index Structure

Weak Stochastic Increasingness, Rank Exchangeability, and Partial Identification of The Distribution of Treatment Effects

Contest Quiz 3. Question Sheet. In this quiz we will review concepts of linear regression covered in lecture 2.

Identification and Estimation of Bidders Risk Aversion in. First-Price Auctions

Using Instrumental Variables for Inference about Policy Relevant Treatment Parameters

Exam D0M61A Advanced econometrics

New Developments in Econometrics Lecture 16: Quantile Estimation

An axiomatic characterization of the Hirsch-index

A Generic Bound on Cycles in Two-Player Games

Comparing IV with Structural Models: What Simple IV Can and Cannot Identify

Trimming for Bounds on Treatment Effects with Missing Outcomes *

Principles Underlying Evaluation Estimators

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,

Scheduling jobs with agreeable processing times and due dates on a single batch processing machine

Nonparametric Identi cation of Regression Models Containing a Misclassi ed Dichotomous Regressor Without Instruments

Discriminative Learning can Succeed where Generative Learning Fails

Minimax-Regret Sample Design in Anticipation of Missing Data, With Application to Panel Data. Jeff Dominitz RAND. and

IDENTIFICATION IN NONPARAMETRIC SIMULTANEOUS EQUATIONS

Tobit and Interval Censored Regression Model

Transcription:

Available online at www.sciencedirect.com Economics Letters 98 (2008) 23 28 www.elsevier.com/locate/econbase Endogenous binary choice models with median restrictions: A comment Azeem M. Shaikh a, Edward Vytlacil b, a Department of Economics, University of Chicago, United States b Department of Economics, Columbia University, MC:3308, 420 West 118th Street, New York, NY 10027, United States Received 25 February 2005; received in revised form 1 March 2007; accepted 15 March 2007 Available online 29 June 2007 Abstract Hong and Tamer [Hong, H. and Tamer, E. (2003). Endogenous binary choice model with median restrictions. Economics Letters, 80 219 225] provide a sufficient condition for identification of a binary choice model with endogenous regressors. For a special case of their model, we show that this condition essentially requires that the endogenous regressor is degenerate conditional on the instrument with positive probability. Moreover, under weak assumptions, we show that this condition fails to rule out any possible value for the coefficient on the endogenous regressor. 2007 Elsevier B.V. All rights reserved. Keywords: Binary response; Endogeneity; Instrumental variables JEL classification: C14; C25; C35 1. Introduction Hong and Tamer (2003) consider the following binary choice model: y ¼ 1fyTz0g yt ¼ x V b þ ϵ; ð1þ The authors gratefully acknowledge financial support from the National Science Foundation, SES-05-51089. The authors thank two anonymous referees and the editor Eric Maskin for useful comments. Aprajit Mahajan also provided helpful discussions. Corresponding author. Tel.: +1 212 854 2512; fax: +1 212 854 8059. E-mail address: ev2156@columbia.edu (E. Vytlacil). 0165-1765/$ - see front matter 2007 Elsevier B.V. All rights reserved. doi:10.1016/j.econlet.2007.03.006

24 A.M. Shaikh, E. Vytlacil / Economics Letters 98 (2008) 23 28 where 1{A} denotes the indicator function of the event A, x R k is a vector of observed covariates and ϵ is an unobserved random variable. Let x j denote the jth element of x and let β j denote the corresponding coefficient. The first element of x, x 0, will be assumed to be identically equal to 1. While the maximum score approach of Manski (1985) imposes Med(ϵ x) = 0, Hong and Tamer (2003) instead propose an approach based on the weaker assumption that Med(ϵ z)=0 for some random vector z R d. Thus, they allow for x to be endogenous in the sense that Med(ϵ x) 0 and impose median independence in terms of a vector of instruments z instead. For ease of exposition, we will consider a simple example with one exogenous regressor and one endogenous regressor; that is, y ¼ 1fy T z 0g y T ¼ b 0 þ b 1 x 1 þ b 2 x 2 þ ϵ; with z=(x 1, w) and Med(ϵ z)=0. In addition, we will suppose further that β 1 is known to be strictly positive, so that without loss of generality we can impose the normalization β 1 =1. Our results, however, do not depend on these assumptions and will therefore be valid more generally. Denote by B the parameter space for β. In this case, B={(b 0, 1, b 2 ):(b 0, b 2 ) R 2 }. Under the assumptions that the model is given by Eq. (1), Med(ϵ z)=0, and the conditional density of ϵ given (x, z) is continuous and bounded away from zero in a neighborhood of zero uniformly in x and z, Hong and Tamer (2003) show that β is point identified if for all b B such that b β, Prfz : PrfxVb b 0 V xvbjzg ¼ 1 [ PrfxVb b 0 V xvbjzg ¼ 1gN 0: Hong and Tamer (2003) note that sufficient conditions for Eq. (3) can be characterized using support conditions on the distribution of (x, z). In particular, when x=z (so the regressors are instruments for themselves), Eq. (3) reduces to the condition for identification found in Manski (1985). Yet the analysis of Hong and Tamer (2003) leaves open the question of the extent to which Eq. (3) can be satisfied if x z, that is, if the regressors are not instruments for themselves. We show that their analysis for point identification only allows a mild relaxation of the assumption that the regressors are instruments for themselves. In fact, in the special case in which x 2 is a discrete endogenous regressor, a necessary condition for Eq. (3) to hold for all b B such that b β is that x 2 is degenerate conditional on z for a set of z values with positive probability. More generally, point identification requires that for all δn0 the support of x 2 conditional on z is contained in an interval of length δ for a set of z values with positive probability. When x 2 is discrete, this condition reduces to the one above; when x 2 is not discrete, however, the condition requires that the distribution of x 2 conditional on z is arbitrarily close to degenerate for a set of z values with positive probability. Hong and Tamer (2003) point out that even when Eq. (3) fails to point identify β, it may still be possible to partially identify β using Eq. (3). In particular, it follows from their analysis that β does not equal any value of b R that satisfies Eq. (3). Thus, β B I, where B I ¼fbaB : Prfz : PrfxVb b 0 V xvb jzg ¼ 1 [ PrfxVb b 0 V xvb j zg ¼ 1g ¼ 0g: It follows that β 2, the coefficient on the endogenous regressors, lies in the set B I,2, where B I;2 ¼fb 2 ar : a b 0 with ðb 0 ; 1; b 2 ÞaBg: Under mild restrictions on the joint distribution of the endogenous variable and the instrument, we show that B I,2 =R, so the bounds on β 2 derived from the restriction (3) are typically trivial. ð2þ ð3þ

A.M. Shaikh, E. Vytlacil / Economics Letters 98 (2008) 23 28 25 The rest of the paper is organized as follows. In Section 2, we consider a special case of the model (2) in which the endogenous regressor, x 2, is a dummy variable. This special case helps us illustrate ideas and also allows us to draw comparisons of this approach to dummy endogenous regressors in binary choice models with approaches based on identification-at-infinity arguments. We relax the assumption that x 2 is a dummy endogenous regressor in Section 3. Section 4 concludes. 2. Special case: dummy endogenous variable In this section, we consider the special case of the model (2) in which the endogenous regressor, x 2, is a dummy variable. The next proposition shows that unless x 2 is non-degenerate conditional on z for a set of z values with positive probability, Eq. (3) will not hold for some b B such that b β. In fact, it shows further that unless this condition is satisfied, B I,2 =R, so the bounds on β 2 are vacuous. Proposition 2.1. Suppose that y is determined according to Eq.(2), x 2 is a Bernoulli random variable, Med(ϵ z)=0, and z=(x 1, w). If Prfz : Prfx 2 ¼ 0jzgN0g ¼ 1 or Prfz : Prfx 2 ¼ 1jzgN0g ¼ 1; ð4þ then Eq.(3) does not hold for some b B such that b β. Furthermore, B I,2 =R. Proof. First suppose Pr{z:Pr{x 2 =0 z}n0}=1. Let b 2 R. For b=(β0, 1, b 2 ), we have that PrfxVb b 0 V xvbjzg ¼ Prfx 2 ¼ 1jzgPrfb 0 þ x 1 þ b T 2 b 0 V b 0 þ x 1 þ b 2 jz; x 2 ¼ 1g þ Prfx 2 ¼ 0jzgPrfb 0 þ x 1 b 0 V b 0 þ x 1 jz; x 2 ¼ 0g ¼ Prfx 2 ¼ 1jzg1fb 0 þ x 1 þ b T 2 b0v b 0 þ x 1 þ b 2 g þ Prfx 2 ¼ 0jzg1fb 0 þ x 1 b0vb 0 þ x 1 g: It follows that for any z such that Pr{x 2 =0 z}n0 we have Pr{x bb0 x β z}=1 only if β 0 +x 1 b0 β 0 +x 1. The same argument mutatis mutandis shows that Pr{x βb0 x b z}=1 only if β 0 +x 1 b0 β 0 +x 1 b0. Thus, Prfz : ðprfxvbb0v xvbjzg ¼ 1 [ PrfxVbb0V xvbjzg ¼ 1Þ\ Prfx 2 ¼ 0jzgN0g ¼ 0: Hence, Eq. (3) cannot hold for such a value of b. Moreover, since the choice of b 2 was arbitrary, we have that B I,2 =R. Now suppose Pr{z:Pr{x 2 =1 z}n0}=1. As before, let b 2 R. For b=(β0 +β 2 b 2, 1, b2 ), we have that PrfxVbb0V xvbjzg ¼ Prfx 2 ¼ 1jzgPrfb 0 þ x 1 þ b 2 b0vb 0 þ x 1 þ b 2 jz; x 2 ¼ 1g þ Prfx 2 ¼ 0jzgPrfb 0 þ b 2 b T 2 þ x 1 b0vb 0 þ x 1 jz; x 2 ¼ 0g ¼ Prfx 2 ¼ 1jzg1fb 0 þ x 1 þ b 2 b0vb 0 þ x 1 þ b 2 g þ Prfx 2 ¼ 0jzg1fb 0 þ b 2 b T 2 þ x 1 b0vb 0 þ x 1 g:

26 A.M. Shaikh, E. Vytlacil / Economics Letters 98 (2008) 23 28 It follows that for any z such that Pr{x 2 = 1 z} N0 we have Pr{x b b0 x β z} = 1 only if β 0 + x 1 + β 2 b0 β 0 + x 1 + β 2. The same argument mutatis mutandis shows that Pr{x β b0 x b z} = 1 only if β 0 + x 1 + β 2 b0 β 0 + x 1 + β 2. Thus, Prfz : ðprfxvbb0vxvbjzg ¼ 1 [ PrfxVbb0V xvbjzg ¼ 1Þ\ Prfx 2 ¼ 1jzgN0g ¼ 0: Hence, Eq. (3) cannot hold for such a value of b. Moreover, since the choice of b 2 was arbitrary, we have that B I,2 =R. Remark 2.1. It is interesting to compare the approach proposed by Hong and Tamer (2003) with the identification-at-infinity approach for selection models with a dummy endogenous regressor (see, for example, Heckman (1990)). 1 Proposition 2.1 shows that in order to identify the coefficient on the dummy endogenous regressor, Hong and Tamer (2003) require that the probability that the endogenous variable equals one conditional on covariates is exactly equal to one and zero with positive probability. In contrast, the identification-at-infinity approach only requires the weaker condition that the probability that the endogenous variable equals one conditional on covariates is arbitrarily close to one and zero with positive probability. On the other hand, this approach also imposes several restrictions not imposed by Hong and Tamer (2003); in particular, it requires that the error term is fully independent (instead of just median independent) of the instruments and that the dummy endogenous variable is determined by a threshold crossing model. 2 Of course, both approaches allow for the possibility of partial identification when the sufficient conditions for point identification are relaxed, but Proposition 2.1 shows that the restriction (3) does not rule out any possible value for the coefficient on the endogenous regressor unless the endogenous regressor is degenerate conditional on the instrument with positive probability. In contrast, it is possible to derive nontrivial bounds for certain parameters without such strong requirements on the joint distribution of the endogenous regressor and the instrument when the conditions for point identification under the identification-at-infinity approach are relaxed (see, for example, Shaikh and Vytlacil (2007)). 3. General case For the case of a dummy endogenous variable, we have shown in the preceding section that a necessary condition for Eq. (3) to be satisfied for all b B such that b β is that the endogenous variable is degenerate conditional on the instrument with positive probability. In this section, we first show that for Eq. (3) to be satisfied for all b B such that b β the endogenous variable must be arbitrarily close to degenerate conditional on the instrument with positive probability. We also show under a mild restriction on the joint distribution of the endogenous regressor and the instrument that Eq. (3) fails to rule out any possible value for β 2 ; that is, B I,2 =R. 1 Heckman (1990) formally assumes that the outcome equation is additively separable in the regressors and the error term, but his analysis extends immediately to the case of a binary choice model. See also Cameron and Heckman (1998) and Aakvik et al. (1998) for identification-at-infinity arguments in the context of a system of discrete choice equations. 2 It is worthwhile to note that since Eq. (3) is only a sufficient condition for identification, it does not preclude the possibility of an identification-at-infinity strategy for the model considered by Hong and Tamer (2003).

A.M. Shaikh, E. Vytlacil / Economics Letters 98 (2008) 23 28 27 Proposition 3.1. Suppose that y is determined according to Eq.(2), Med(ϵ z)=0, and z=(x 1, w). (i) If β 2 0 and there exists δn0 such that for a.e. value of z the support of x 2 conditional on z is not contained in any interval of δ, then Eq.(3) does not hold for some b B such that b β. (ii) If there exists a value x 2 such that x 2 is contained in the support of x 2 conditional on z for a.e. value of z, then Eq.(3) does not hold for some b B such that b β. Furthermore, B I,2 =R. Proof. (i) Suppose β 2 N 0. The same argument mutatis mutandis will establish the result for β 2 b 0. Let b 0 satisfy 0 b jb 0 b 0 j b db 2 ð5þ and define b=(b 0, 1, β 2 ). First consider z such that Pr{x β 0 z}=1. It follows that Prfx V bzdb 2 jzgn0: To see this, suppose by way of contradiction that Pr{x β δβ 2 z} = 0, which in turn implies that Pr{x β b δβ 2 z}=1. Since Pr{x β 0 z}=1 by assumption, we have as a result that Pr{0 x β b δβ 2 z}=1. Thus, ðb Pr 0 þ x 1 Þ b 2 V x 2 V ðb 0 þ x 1 Þ þ d j z b 2 ¼ 1; which implies that the support of x 2 conditional on z is contained in an interval of length δ. This contradiction establishes that Pr{x β δβ 2 z}n0. Next note that Eq. (5) implies that b 0 V b 0 þ db 2 : It therefore follows from Eq. (6) that Pr{x bn0 z}n0, which in turn implies that Pr{x bb0 x β z}b1. Hence, Pr{z:Pr{x bb0 x β z}=1}=0. Now consider z such that Pr{x β b0 z} = 1. An argument parallel to the one given above shows that Pr{z : Pr{x β b0 x b z} = 1} = 0. Thus, Eq. (3) does not hold for such a value of b. (ii) Let b 2 R. Suppose z is such that x2 is an element of the support of x2 conditional on z. Consider b=(b 0, 1, b2 ) with b0 =β0 +(β 2 b 2 )x2. For such b, we have that Z PrfxVb b 0 V xvbjzg ¼ 1fb T 0 þ x 1 þ b T 2 x 2 b 0 V b 0 þ x 1 þ b 2 x 2 gdfðx 2 jzþ: ð6þ Thus, in order to have Pr{x bb0 x β z}=1, we must have that b T 0 þ x 1 þ b T 2 x 2 b0vb 0 þ x 1 þ b 2 x 2 ð7þ for a.e. x 2 conditional on z. In particular, Eq. (7) must hold for x 2 = x 2, which implies that b0 + x1 + b 2 x2 = β0 + x 1 + β 2 x 2 b β0 + x 1 + β 2 x 2. The same argument mutatis mutandis shows that in order to have Pr{x b b0 x β z} = 1 we must have β 0 + x 1 + β 2 x 2 b β 0 + x 1 + β 2 x 2. Since x2 is an element of the support of x 2 conditional on z for a.e. value of z, it follows that Eq. (3) cannot hold for such a value of b. Moreover, since the choice of b 2 was arbitrary, we have that BI,2 = R.

28 A.M. Shaikh, E. Vytlacil / Economics Letters 98 (2008) 23 28 Remark 3.1. Note that in the special case in which x 2 is a dummy endogenous regressor the hypotheses of both part (i) and part (ii) are satisfied when Eq. (4) holds. Thus, Proposition 3.1 generalizes the results in the preceding section. Remark 3.2. As noted by Hong and Tamer (2003) and demonstrated by the above proposition, Eq. (3) is indeed a very strong condition. More surprising, however, is the fact that under the very mild restriction that there exists a value x 2 such that x2 is contained in the support of x2 conditional on z for a.e. value of z the restriction (3) fails to rule out any value of β 2. For example, this condition is satisfied trivially when x 2 =γz+u and u is distributed with support equal to the real line. Thus, the suggestion of Hong and Tamer (2003) to use Eq. (3) to partially identify β 2 may not be useful. Of course, when there is reason to believe that the hypotheses of part (ii) of Proposition 3.1 do not hold, then it may still be possible to follow this suggestion. Remark 3.3. Denote by Z a set of values for z such that x 2 is degenerate conditional on z when z Z. Obviously, Med(ϵ z)=0 implies Med(ϵ x)=0 for any z Z. Thus, if such a set Z with positive probability is known to the researcher, then one may proceed by following the approach of Manski (1985) after restricting attention to observations satisfying z Z. 4. Conclusion In a recent paper, Hong and Tamer (2003) suggest an approach for endogenous regressors in a binary choice model that exploits median independence of the latent error term from a vector of instruments. In this paper, we consider a special case of their model in which there is a single endogenous regressor and show that a necessary condition for point identification in this model is essentially that the endogenous regressor is degenerate conditional on the instrument with positive probability. Furthermore, under weak restrictions on the joint distribution of the endogenous regressor and the instrument, we show that this approach fails to rule out any possible value for the coefficient on the endogenous regressor. References Aakvik, A., Heckman, J., Vytlacil, E., 1998. Semiparametric program evaluation lessons from an evaluation of a Norwegian training program. Mimeo, University of Chicago. Cameron, S., Heckman, J., 1998. Life cycle schooling and dynamic selection bias: models and evidence for five cohorts of American males. Journal of Political Economy 106, 262 333. Heckman, J., 1990. Varieties of selection bias. American Economic Review 80, 313 318. Hong, H., Tamer, E., 2003. Endogenous binary choice model with median restrictions. Economics Letters 80, 219 225. Manski, C.F., 1985. Semiparametric analysis of discrete response: asymptotic properties of the maximum score estimator. Journal of Econometrics 27, 313 333. Shaikh, A., Vytlacil, E., 2007. Threshold crossing models and bounds on treatment effects: a nonparametric analysis. Mimeo, University of Chicago and Columbia University.