Marginal Quantile Treatment E ect

Size: px
Start display at page:

Download "Marginal Quantile Treatment E ect"

Transcription

1 Marginal Quantile Treatment E ect Ping Yu University of Auckland Started: April 23 First Version: October 23 This Version: April 24 Abstract This paper studies estimation and inference based on the marginal quantile treatment e ect. First, we illustrate the importance of the rank preservation assumption in the quantile treatment e ects evaluation, show the identi ability of the marginal quantile treatment e ect, and clarify the relationship between the marginal quantile treatment e ect and other quantile treatment parameters. Second, we develop sharp bounds for the quantile treatment e ect with and without the monotonicity assumption, and also su cient and necessary conditions for point identi cation. Third, we estimate the marginal quantile treatment e ect and associated quantile treatment e ect and integrated quantile treatment e ect based on the distribution regression, derive the corresponding weak limits and show the validity of the bootstrap inferences. The inference procedure can be used to construct uniform con dence bands for quantile treatment parameters and test unconfoundedness and stochastic dominance. We also develop goodness of t tests to choose regressors in the distribution regression. Fourth, we conduct two counterfactual analyses: deriving the transition matri and developing the relative marginal policy relevant quantile treatment e ect parameter under the policy invariance. Fifth, we compare the identi cation schemes in some important literature with that by the marginal quantile treatment e ect, and point out advantages and also weaknesses of each scheme, e.g., Chernozhukov and Hansen (25) concentrate mainly on the quantile treatment e ect with the selection select but without the essential heterogeneity; Abadie, Angrist and Imbens (22), Aakvik, Heckman and Vytlacil (25) and Chernozhukov and Hansen (26) su er from some obvious misspeci cation problems. Meanwhile, an alternative estimator of the local quantile treatment e ect is developed and its weak limit is derived. Finally, we apply the estimation methods to the famous return to schooling dataset of Angrist and Krueger (99) to illustrate the usefulness of the techniques developed in this paper to practitioners. Keywords: marginal quantile treatment e ect, local quantile treatment e ect, rank preservation, selection e ect, essential heterogeneity, sharp bound, point identi cation, distribution regression, two-step estimator, Hadamard di erentiability, weak limit, uniform con dence band, unconfoundedness, completeness, stochastic dominance, goodness of t test, transition matri, relative marginal policy relevant quantile treatment e ect, counterfactual analysis, policy invariance, bootstrap validity, return to schooling JEL-Classification: C2, C3, C4, C2, C26 p.yu@auckland.ac.nz.

2 Introduction Treatment e ect evaluation is one main task of econometric analysis. Most literature concentrates on the average treatment e ect evaluation; see Heckman and Vytlacil (27a,b) for a comprehensive summary. Meanwhile, as illustrated in Heckman (992), Heckman et al. (997) and Heckman and Smith (993, 998), questions of political economy or "social justice" requires knowledge of the distribution of the treatment e ect. As a result, distributional treatment e ects (especially when unconfoundedness does not hold) become natural parameters of interest among econometricians. Actually, distributional treatment e ects have been studied etensively in the empirical literature. For eample, Card (996) uses a panel data set to study the e ects of unions on the structure of wages; DiNardo et al. (996) presents a semiparametric procedure to analyze the e ects of institutional and labor market factors on changes in the U.S. distribution of wages; Bitler et al. (26) estimate quantile treatment e ects using random-assignment data from Connecticut s Job First waiver. Distributional treatment e ects are usually estimated based on quantile regression initiated by Koenker and Bassett (978) (see Koenker (25) for an introduction to quantile regression). One related eld that recently attracts much attention is the "general" semiparametric and nonparametric quantile regression with endogeneity. For the semiparametric setups, see, e.g, Hong and Tamer (23), Honoré and Hu (24), Ma and Koenker (26), Lee (27), Sakata (27) and Jun (28) among others. For nonparametric setups, see, e.g., Chesher (23), Chernozhukov et al. (27), Horowitz and Lee (27), Imbens and Newey (29), Chen and Pouzo (22), and Gagliardini and Scaillet (22) among others. However, the main interest of this paper concentrates on the special structure of the treatment model, namely, the endogenous variable is binary. A key parameter we will develop is the marginal quantile treatment e ect (MQTE), which is the counterpart of the marginal treatment e ect (MTE) in the average treatment e ect estimation. The idea of the MTE was rst introduced in the contet of a parametric normal generalized Roy model by Björklund and Mo tt (987), and was analyzed more generally by Heckman (997). In a choice (or selection, or participation) model with the latent variable structure, Heckman and Vytlacil (999, 2a) epress the conventional average treatment e ect parameters as di erent weighted averages of the MTE, and also identify the MTE by the local instrumental variable (LIV) estimator. Actually, Heckman and Vytlacil (27b) use the MTE to unify the econometric literature on the evaluation of social programs, so it is well recognized that the MTE is a convenient tool to organize the nonparametric literature on the average treatment e ect evaluation. An embarrassing situation is that the counterpart of the MTE in the quantile treatment e ect literature, the MQTE, is yet to be well understood. The purpose of this paper is to integrate the relevant literature on the quantile treatment e ect evaluation without unconfoundedness into one framework and provide some useful estimation and inference methods to practitioners based on the MQTE. There are two strands of literature concerning about the distributional treatment e ects, and they are interwined. Before reviewing the relevant literature, we must emphasize that the distributional treatment e ects are functionals of the distribution of Y Y, which requires the joint distribution of Y and Y, where Y and Y are the outcome under the treatment status and the control status, respectively. As mentioned in Section II.B of Manski (996) or footnote 5 of Manski (997), "knowledge of F (Y Y ) neither implying nor being implied by knowledge of F (Y ) and F (Y )", where F (X) is the cumulative distribution function (CDF) of X for a random variable X. Due to the fundamental problem of causal inference (page 947 of Holland (986)), Y and Y cannot be observed simultaneously. As a result, even in a random eperiment, the joint distribution F (Y ; Y ) or F (Y Y ) cannot be identi ed if without further restrictions although F (Y ) and F (Y ) can be identi ed. On the other hand, marginal distributions F (Y ) and F (Y ) are also of interest in econometric analysis. For eample, in Atkinson (97), Sen (997, 2), Manski (996, p74),

3 Imbens and Rubin (997, p558), Imbens and Wooldridge (29, p7), and Imbens (2, p49), the marginal distributions of outcomes are more relevant for a social planner choosing between two programs; see also the Introduction of Abadie (22) for an eample where only the marginal distributions are relevant. The rst strand of literature tackles the joint distribution F (Y ; Y ) directly. This strand of literature is mainly interested in the quantile of di erences of Y and Y. First, using the classical probability results due to Hoe ding (94) and Fréchet (95), Heckman et al. (997) and Heckman and Smith (993, 998) bound F (Y ; Y ) using F (Y ) and F (Y ) in a random eperiment. It turns out that this kind of bounds are too wide to be useful. Later, Aakvik et al. (25) impose more structures on the problem to get more stringent identi cation results. Basically, two key structures are imposed: (i) the error terms in the outcome equations and the choice equation are independent of all the covariates; (ii) the error terms follow a one-factor structure, i.e., the correlation among the error terms is only through the factor. Under these two assumptions, F (Y ; Y ) can be identi ed, so all interesting functionals of F (Y ; Y ) can be identi ed. For eample, the proportion of people who bene t from participation in the program (P (Y > Y )), gains to participants at selected levels of the no-treatment distribution (F (Y Y jy = y )) or treatment distribution (F (Y Y jy = y )), and a variety of other questions including the quantile treatment e ect (QTE) and the quantile treatment e ect on the treated (QTT) can be answered. Aakvik et al. (25) consider only two binary potential outcomes, and parametric one-factor models with cross-sectional data. Etensions to multiple (possibly continuous or mied discrete and continuous) outcomes, and semiparametric (or nonparametric) multiple-factor models with possibly panel data can be found in Aakvik et al. (999) and Carneiro et al. (2, 23). See Section 2 of Abbring and Heckman (27) for a summary of this strand of literature. The second strand of literature concentrates on the marginal distribution of Y and Y (maybe also conditional on some covariates or some speci ed population). However, as noted above, the distributional treatment e ects require the joint distribution of Y and Y. To circumvent this problem, this strand of literature eplicitly or implicitly assume some type of rank preservation (RP) condition. Such type of condition was initiated by Lehmann (974) and Doksum (974). Under this assumption, the distributional treatment e ects can be described by the di erence of quantiles of Y and Y. The rst part of this strand of literature bounds F (Y ) and F (Y ) without imposing any restrictions on the choice process. Under the RP assumption, these bounds imply bounds on the QTE. Such literature are summarized in Manski (994, 995, 23). When further restrictions on the choice process are imposed, point identifying some type of quantile treatment e ects is possible. The second part of this strand of literature estimates and conducts inferences on some type of quantile treatment e ects under point identi cation. Firgo (27) estimates the QTE and the QTT in observational studies under the unconfoundedness assumption. When the unconfoundedness assumption fails while only the selection e ect eists, Chernozhukov and Hansen (25) show that the QTE can be identi ed under some completeness assumption, and Chernozhukov and Hansen (26) provide a speci c estimation scheme; see also Chernozhukov and Hansen (23) for most updated developments along this line. When there is also the essential heterogeneity, the monotonicity assumption of Imbens and Angrist (994) or the uniformity assumption of Heckman and Vytlacil (25) is usually imposed. Under this assumption, Abadie et al. (22) estimate the local quantile treatment e ect (LQTE), which is the counterpart of the local average treatment e ect (LATE), using the identi cation results in Abadie (23); see also Imbens and Rubin (997) for identi cation of the marginal potential distributions of compliers when no covariates are present, and Abadie (22) for bootstrap tests of distributional treatment e ects in a similar framework. Carneiro and Lee (29) deal with the essential heterogeneity in an alternative way. They borrow a key assumption, the independence assumption (i) in the last paragraph, from the rst strand The LATE parameter is rst introduced by Imbens and Angrist (994). The MTE is a limit form of the LATE, see, e.g., Björklund and Mo tt (987), Heckman (997) and Angrist et al. (2). 2

4 of literature to identify the (conditional) marginal distributions of potential outcomes. These distributions imply the MQTE, which is also the main objective of this paper but we do not need the independence assumption. The above-mentioned literature concentrates on the cross-sectional data; Athey and Imbens (26) also use the panel data to identify the QTT through what they called change-in-change approach under the RP condition on the treated. Although these two strands of literature use di erent identi cation assumptions, their targets are the same, namely, identifying the joint distribution of Y and Y. This paper can be put in the second strand of literature, i.e., we impose some RP assumptions to identify F (Y ; Y ). Consequently, the quantile treatment e ect in this paper refers to the di erence of quantiles rather than the quantile of di erences. Meanwhile, we employ the framework in the rst strand of literature to study the di erence of quantiles. The rest of this paper is structured as follows. Section 2 sets up our treatment model, illustrates the importance of the RP assumption in the quantile treatment e ect evaluation, shows the identi ability of the MQTE, and clari es the relationship between the MQTE and other quantile treatment parameters. Section 3 develops sharp bounds and su cient and necessary conditions for point identi cation of the QTE with and without the monotonicity assumption. In Section 4, we estimate the MQTE based on the distribution regression introduced by Foresi and Peracchi (995), derive its weak limit and show the validity of the bootstrap inferences, and we also develop goodness of t tests to choose regressors. In Section 5, we conduct two counterfactual analyses: deriving the transition matri and developing the relative marginal policy relevant quantile treatment e ect parameter under the policy invariance. In Section 6, we comment some key literature in the two strands above, pointing out their weaknesses, underlying assumptions, and interactions with this paper. Section 7 presents an empirical application to the return to schooling and Section 8 concludes. All proofs are contained in an appendi. Some notations are collected here for future reference. d is always used for indicating the two treatment statuses, so is not written out eplicitly as "d = ; " throughout the paper. supp(x) for a random variable X denotes the support of the distribution of X. Both Q X () and Q (X) denote the th quantile of a random variable X. The capital letters such as X denote random variables and the corresponding lower case letter such as denote the potential values they may take. For any parameter, d is the dimension of. The space `(F) represents the space of real-valued bounded functions de ned on the inde set equipped with the supremum norm kk`(f). C (Y) is the space of continuous functions on Y. 2 The Setup and Parameters of Interest We use the nonlinear and nonseparable outcome model as in Heckman and Vytlacil (25), Y = (X; U ); Y = (X; U ): () Actually, the additively separable setup, Y d = d (X) + U d, does not lose generality since we can de ne the new U d as Y d Q Yd jx(jx) and all our analysis in this paper is conditional on X. The distribution of Y d may be discrete (e.g., employment status), continuous (e.g., wage), or mied discrete and continuous (e.g., in the national JTPA study 8 month impact sample used in Heckman et al. (997), a substantial proportion of persons has zero earnings in both distributions of Y and Y ). The participation decision D = ( D (X; Z) V ); (2) 3

5 where Z includes the instruments for the choice process. Both X and Z appearing as the arguments of D does not lose generality since D (X; Z) may not depend on all elements of X. By transforming D (X; Z) and V by F V jx;z, we can rewrite D = (p(x; Z) U D ); (3) where U D jx; Z U(; ) and p(x; Z) is the propensity score. We use these two formulations of D interchangeably throughout the paper. As shown in Vytlacil (26), there is a larger class of latent inde models that will have a representation of this form. Also, this setup of D implies the monotonicity assumption of Imbens and Angrist (994) as shown in Vytlacil (22). We impose the following assumptions on the outcome equation and the choice equation. (A) D (X; Z) is a nondegenerate random variable conditional on X. (A2) The random vectors (U ; V ) and (U ; V ) are independent of Z conditional on X. (A3) The distribution of V is absolutely continuous with respect to Lebesgue measure. (A4) X = X almost everywhere, where X d denote a value of X if D is set to d. (A5) > P (D = jx) >. (A6) Conditional on X =, V = v, Y and Y have the same rank: (A)-(A5) corresponds to (A-)-(A-3), (A-6) and (A-5) in Heckman and Vytlacil (25), respectively. These assumptions are prevalent in the literature with heterogeneous treatment e ects. A necessary condition for (A) is that Z contains a continuous variable. (A2) allows for both the selection e ect (U 6? DjX) and the essential heterogeneity ((U U ) 6? DjX). Also, (A2) implies the usual assumption in the control function approach, say, Z? (U ; U )j (X; V ). (A)-(A5), combined with () and (2), impose testable restrictions on the distribution of (Y; D; Z; X); see Heckman and Vytlacil (25) (page 678) for the inde su ciency restriction and the monotonicity restriction. We refer to Heckman and Vytlacil (25) for more detailed discussions on (A)-(A5). The assumption (A6) deserves further eamination. 2. The Rank Preservation Condition The key etra assumption beyond those in Heckman and Vytlacil (25) is the RP condition (A6). Chernozhukov and Hansen (25) state the RP assumption via the Skorohod representation. We try to do the same thing here although unlike them, this representation is not essential for the development of our identi cation scheme. Suppose Y d is continuous, and the th conditional quantile of Y d given X and V is q(d; X; V; ); then we can represent Y d = q(d; X; V; R d ) by the Skorohod representation, where R d j(x; V ) U(; ) is the rank variable which represents some unobserved characteristic of Y d, e.g., ability or proneness, among the slice of people with a speci c value of X and V. The RP assumption (A6) can be restated as R j(x; V ) = R j(x; V ). We now clarify two key points of the Skorohod representation. First, the Skorohod representation decomposes the information in U d of () into two components: the value information and the rank information. The former is incorporated in the quantile function q() and the later is included in R d. Second, because R d j(x; V ) U(; ) does not depend on (X; V ), it may be suspected that R d is independent of (X; V ). This is incorrect. This mistake is immediately clear if we rewrite Y d = q(d; X; V; R d (X; V )) ; in other words, R d must be understood as a conditional random variable. Suppose there are N distinct points on the support of (X; V ), and then there are N rank variables R d (X; V ). Although R d (X; V )j(x = ; V = v) U(; ) does not depend on (; v), the unconditional random variable R d may depend on (X; V ). The RP condition does not restrict the dependence between R d and (X; V ); rather, it restricts the total number of conditional rank variables 4

6 R d (X; V ) from 2N to N. To be consistent with the notation in the literature, R d is replaced by U d in the rest of this paper. The meaning of U d should be clear from the contet. For eample, when V appears as an argument of the representation of Y d, or Y d is represented as Y d = q(), U d means the rank variable. In this paper, we do not consider the quality of the evidence supporting the assumption (A6). Instead, we consider the evaluation of speci c programs under this assumption. Figure : Rank Preserved Conditional on U D BUT Unconditionally Unpreserved Our RP assumption is weaker than the usual assumption that Y and Y have the same rank conditional on X =. Think about the following eample. Suppose Z is the only covariate in the determination of D, and Z can take only and. So the only nontrivial values for U D are p() and p(). Suppose for each value of U D, there are only two persons. Figure shows that although the rank is preserved among the people with a speci c U D value, the rank is unpreserved if all people are taken into account. In other words, (A6) only requires the RP condition to hold locally (X = ; U D = u D ) instead of globally (X = ). Conditional on X =, the rank may not be maintained under the treatment. However, for a ner slice of individuals, the rank is maintained. Local rank preservation is much weaker than global rank preservation. The larger the conditional set on which the RP condition is imposed, the harder for the RP condition to hold. Actually, the analysis in Heckman et al. (997) show that the unconditional RP condition cannot hold although substantial departures from the perfect positive dependence across Y and Y are not credible in their contet; see also Carneiro et al. (23) for further evidences against the unconditional RP condition. The RP condition also imposes a restriction on the joint distribution of Y and Y given X = and U D = u D, namely, the joint distribution is fully determined by the marginal distribution. It is not hard to see that when the RP condition holds, P (Y y ; Y y jx = ; U D = u D ) = min fp (Y y jx = ; U D = u D ) ; P (Y y jx = ; U D = u D )g ; 5

7 which implies that the joint distribution of Y and Y given X = ; U D = u D is degenerate. To see how this joint distribution looks like, suppose Y d j (X = ; U D = u D ) is continuously distributed and supp(y d jx = ; U D = u D ) = [; ] to simplify the discussion. It turns out that only on the line y ; F Y jx;u D F YjX;U D (y j; u D )j; u D with y 2 [; ] there is probability. In other words, only on the Q-Q plot, (Y ; Y ) can occur simultaneously. An implication of this result is that if F YjX;U D (j; u D ) is the same as F YjX;U D (j; u D ), then the correlation between Y and Y conditional on X = ; U D = u D must be. Figure 2 shows a typical Q-Q plot of (Y ; Y ) conditional on X = ; U D = u D. In Figure 2, P (Y Y jy = y ; X = ; U D = u D ) = when y :6 and P (Y Y jy = y ; X = ; U D = u D ) = when y > :6. In other words, for the slice of people with Y = y ; X = ; U D = u D, the participant always bene ts as long as y :6, and vice versa. Nevertheless, it is more likely that P (Y Y jy = y ; X = ) 2 (; ), P (Y Y jx = ; U D = u D ) = F YjX;U D (:6j; u D ) 2 (; ) and P (Y Y jx = ) = R P (Y Y jx = ; U D = u D )du D 2 (; )..6.6 Figure 2: Q-Q Plot of (Y ; Y ) Conditional on X = ; U D = u D It should be emphasized that the RP condition is only for de ning various quantile treatment e ects. Even without this condition, we can still identify various marginal distributions which, as argued in the introduction, are useful for many other purposes. Under the RP assumption, we de ne the MQTE in Carneiro and Lee (29) as MQT E (; u D ) = Q YjX;U D (j; u D ) Q YjX;U D (j; u D ): If we strengthen the RP assumption to be conditional on X = or on X = ; D =, then we can de ne the QTE in Chernozhukov and Hansen (25, 26) and the QTT as QT E () = Q YjX(j) Q YjX(j) and QT T () = Q YjX;D(j; ) Q YjX;D(j; ); 6

8 respectively. If the RP assumption is conditional on X = ; u D < U D u D, then the LQTE of Abadie et al. (22) 2 is de ned as LQT E (; u D ; u D) = Q YjX;U D (j; (u D ; u D]) Q YjX;U D (j; (u D ; u D]): Finally, if the RP assumption holds unconditionally (with respect to X), 3 then we de ne the integrated QTE (IQTE) IQT E = Q Y () Q Y (); the integrated QTT (IQTT) IQT T = Q YjD(j) Q YjD(j) as in Firpo (27), 4 and the integrated LQTE (ILQTE) 2.2 Identi cation of the MQTE ILQT E (u D ; u D) = Q YjU D (j(u D ; u D]) Q YjU D (j(u D ; u D]): The following theorem states that the MQTE can be identi ed for a range of u D. Theorem Suppose assumptions (A)-(A6) hold. If u D is not an isolated point of P \P, then MQT E (; u D ) can be identi ed for any 2 (; ), where P d =supp(p(x; Z)jX = ; D = d). Proof. To simplify notations, we depress the conditioning on X =. Given the RP assumption (A6), we need only identify Q Yd ju D (ju D ) whose identi cation is equivalent to the identi cation of F Yd ju D (ju D ). We provide two methods to identify F Yd ju D (ju D ). Method : Note that P (Y yjp(z) = p; D = ) p = P (Y yjp(z) = p; D = ) P (D = jp(z) = p) and similarly, P (Y yjp(z) = p; D = ) ( p) = = P (Y yju D p) p = Z p Z p F YjU D (yju D )du D, so d [P (Y yjp(z) = p; D = ) p] = F YjU dp D (yjp); d [P (Y yjp(z) = p; D = ) ( p)] = F YjU dp D (yjp): F YjU D (yju D )du D ; 2 Abadie et al. (22) con ate issues of de nition of parameters with issues of identi cation; see Section 6.2 below for their de nition. Actually, LQT E (; u D ; u D ) can be de ned for any u D; u D 2 (; ) although it can only be identi ed for u D; u D on the support of p(; Z). 3 Note that if the RP assumption holds on X =, Y d jx can be epressed as Y d = q(d; X; U) by the Skorohod representation, where UjX = U jx = U jx. If the RP assumption holds unconditionally, then Y d can be epressed as Y d = q(d; U) by the Skorohod representation, where U = U = U 2. This by no means implies that information in X and Z is useless to the identi cation or e ciency improvement in the quantile treatment e ect evaluation. 4 Be careful about the terminology in the literature. Our IQTE and IQTT are the QTE and QTT of Firpo (27). Also, the MQTE of Cattaneo (2) means Q (Y ) and Q (Y ) rather than MQT E (; u D ), and the MQTE, QTE and QTT in the rst strand of literature mentioned in the introduction means Q Y Y jx;u D (j; u D ), Q Y Y jx(j) and Q Y Y jx;d(j; ) rather than MQT E (; u D ), QT E () and QT T (). 7

9 Method 2: As in Hahn (998), we can use DY and ( y D)Y to identify F Yd ju D (ju D ). Note that for any P (DY yjp(z) = p) ( p) = P (DY yjp(z) = p; D = ) P (D = jp(z) = p) + P (DY yjp(z) = p; D = ) P (D = jp(z) = p) ( p) = P (Y yju D p) p + P ( yju D > p) P (D = jp(z) = p) ( p) = so Z p F YjU D (yju D )du D + ( p) ( p) = For y <, repeat the analysis above, we have Similarly, and Z p F YjU D (yju D )du D ; d [P (DY yjp(z) = p) ( p)] dp dp (DY yjp(z) = p) dp d [P (( D) Y yjp(z) = p) p] dp dp (( D) Y yjp(z) = p) dp = F YjU D (yjp): (4) = F YjU D (yjp): (5) = F YjU D (yjp) when y ; (6) = F YjU D (yjp) when y < : (7) Inverting F Yd ju D (yjp) as a function of y, we can get Q Yd ju D (jp). Since p(z), P (Y yjp(z) = p; D = d), P (DY yjp(z) = p) and P (( D) Y yjp(z) = p) for y 2 R and p 2 P d can be identi ed, MQT E (u D ) = Q YjU D (ju D ) Q YjU D (ju D ) for all 2 (; ) and u D not being an isolated point of P\P can be identi ed. Figure 3: Intuition for Identi cation of F YjU D (yjp) with y and y < Figure 3 provides some intuition for the arguments in the second method. For y, P (DY yjp(z) = p) includes a point mass p at, and the remaining probability is 8 Z p F YjU D (yju D )du D, while for y <,

10 P (DY yjp(z) = p) does not include the point mass. This intuition is similar in spirit to that of the censored quantile regression models discussed in Powell (984, 986). The arguments in Theorem can be applied to the discrete Y d case. Suppose Y and Y have the same support fy ; ; y S g, and then the counterpart of the MQTE is P YjU D (y s ju D ) P YjU D (y s ju D ), s = ; ; S, where P Yd ju D (y s ju D ) is the point mass of Y d j (U D = u D ) at y s. We can still identify F Yd ju D (y s jp) by (4), (5), (6) and (7), and then P Yd ju D (y jp) = F Yd ju D (y jp) and P Yd ju D (y s jp) = F Yd ju D (y s jp) F Yd ju D (y s jp) for s = 2; ; S can be sequentially identi ed. If Y d can take only and, then the parameter of interest is P YjU D (ju D ) P YjU D (ju D ) which coincides with the MTE. Of course, we can also consider the case with mied discrete and continuous outcomes. Both the discrete case and the mied case are easier to handle than the continuous case, so we will concentrate on the continuous case in the rest of this paper unless stated otherwise. If we use the idea of LIV as in Heckman and Vytlacil (2a), we have and P (Y yjp(z) = p) = P (Y yjp(z) = p; D = ) p + P (Y yjp(z) = p; D = ) ( p) = Z p F YjU D (yju D )du D (Y yjp(z) = Z p F YjU D (yju D )du D ; = F YjU D (yjp) F YjU D (yjp); which is the di erence of CDFs in the two treatment statuses. So it is hard to identify the MQTE (Y yjp(z) = p) =@p. From Theorem, we can identify E[Y ju D = p] and E[Y ju D = p] separately, not just their di erence E[Y Y ju D = p] as in the LIV method of Heckman and Vytlacil (2a). Method of the proof is a special case of Theorem in Carneiro and Lee (29). We also discuss Method 2 to distinguish the di erence between the identi cation scheme of the MTE and the MQTE. For the MTE, E[DY jp(z) = p] = E [Y jp(z) = p; D = ] p = p] = E [Y jp(z) = p; D = ] ( p) = Z p Z p E [Y ju D = u D ] du D, and E[( D) Y jp(z) = E [Y ju D = u D ] du D, so the two methods in the proof are the same in the MTE identi cation. We close this subsection by a concrete eample. Suppose Y = V +2U; Y = 2V +U, and D = (Z V > ), where U V Z C A N (; ) with = :5 :5 It can be shown that MQT E (u D ) = :5 (u D ) + p :75 (). Figure 4 shows MQT E (u D ) for = :; :25; :5; :75 and :9. In this simple model, the spreading measure of the MQTE, e.g., MQT E (u D ) MQT E (u D ) for 2 (; :5), is the same for any u D, which may not be standard in practice. Also, MQT E (u D ) is a decreasing function of p, which indicates that the more likely will an individual participate in the program, the higher bene t will she receive. 5 In the gure, we also show MT E (u D ), QT E and AT E ( E[Y ] E[Y ]) for comparison. Note that in this eample, MT E (u D ) = MQT E :5 (u D ), and QT E = = AT E does not depend on. 6 Obviously, MQT E (u D ) provides more information than MT E (u D ), QT E, and AT E. C A : 5 Aakvik et al. (25) provide a converse eample. 6 It should be emphasized that QT E is not well de ned in this eample since the RP condition does not hold unconditionally given that Y and Y have the same marginal distribution but Corr(Y ; Y ) = 6:5=7 <. 9

11 Figure 4: MQT E (u D ) for = :; :25; :5; :75 and :9 in a Simple Eample 2.3 Relationship with Other Parameters of Treatment E ects In this subsection, we rst discuss the relationship between MQT E IQT E, IQT T, ILQT E (; u D ) and QT T (), QT E (), LQT E (; u D ; u D ),. It turns out that the building block is F Yd jx;u D (y d j; u D ) rather than MQT E (; u D ). Actually, MQT E (; u D ) is more relevant to the (conditional) quantile of Y Y. From the supplementary materials, we can show that QT T () = F Y jx;d (j; ) FY jx;d (j; ) and the quantile treatment e ect on the untreated (QTUT) where QT UT () = F Y jx;d (j; ) FY jx;d (j; ); F Yd jx;d(y d j; ) = F Yd jx;d(y d j; ) = Z Z F Yd jx;u D (y d j; u D )h T T (; u D )du D ; F Yd jx;u D (y d j; u D )h T UT (; u D )du D ; with h T T (; u D ) = F p(x;z)jx (u D j) =E[p(X; Z)jX = ] and h T UT (; u D ) = F p(x;z)jx (u D j)=e[ p(x; Z)jX = ]. Also, QT E () = F Y (j) F E jx Y jx (j); IQT = F Y () F (); IQT T = F Y (j) F UT jd Y jd (j); IQT = F Y (j) F jd Y (j) jd Y

12 where F Yd jx(y d j) = Z Z F Yd jx;u D (y d j; u D )du D ; F Yd (y d ) = F Yd jx(y d j)df X (); Z Z Z F Yd jd(yj) = F Yd jx;d(yj; )df XjD (j) = F Yd jx;u D (yj; u D ) Z Z Z F Yd jd(yj) = F Yd jx;d(yj; )df XjD (j) = Finally, where F p(x;z)jx(u D j) du D df X (); P (D = ) F Yd jx;u D (yj; u D ) F p(x;z)jx(u D j) du D df X (): P (D = ) LQT E (; u D ; u D) = F Y jx;u D (j; (u D ; u D]) F Y jx;u D (j; (u D ; u D]); ILQT E (u D ; u D) = F Y ju D (j(u D ; u D]) F Y ju D (j(u D ; u D]); Z u D F Yd jx;u D (y d j; (u D ; u D]) = u D u F Yd jx;u D (y d j; u D )du D ; D u Z D F Yd ju D (y d j(u D; u D ]) = F Yd jx;u D (y d j; (u D ; u D])dF Xjp(X;Z) (j(u D ; u D]) Z = F Yd jx;u D (y d j; (u D ; u D]) P (p(x; Z) 2 (u D; u D ]jx = ) P (p(x; Z) 2 (u D ; u D ]) df X (): See Appendi B. of Carneiro and Lee (29) for implementation of some of these parameters in practice. Z Z Note that QT E () 6= QT E (; u D )du D, and IQT E 6= QT E ()df X (), so it is hard to nd a relationship between these quantile treatment parameters and MQT E (; u D ). We can also identify the MTE Z MT E (; u D ) = y df YjX;U D (y j; u D ) Z Z y df YjX;U D (y j; u D ) = MQT E (; u D )d; so all the parameters that can be identi ed by MT E (; u D ) as listed in Table IA of Heckman and Vytlacil (25) can also be identi ed by MQT E (; u D ). In other words, MQT E (; u D ) is a more basic building block of the average treatment parameters. Note that to identify MT E, we do not need the RP assumption, but we need to assume E [jy d j] <. Heckman et al. (997) consider also the following parameters of treatment e ects: (a) the proportion of people taking the program who bene t from it, P (Y > Y jd = ); (b) the proportion of the total population that bene ts from the program, P (Y > Y jd = )P (D = ); (c) selected quantiles of the impact distribution, Q Y Y jd(j); (d) the distribution of gains at selected base state values, F Y Y jy ;D(jy ; ); 7 These parameters can be identi ed from MQT E (; u D ). It is not hard to show that under the RP assumption (A6), Z Z Z P (Y Y yjd = ) = ( MQT E (; u D ) y)d Fp(X;Z)jX (u D j) du D df X () P (D = ) 7 They also consider the option value of a social program, E[ma(Y ; Z)jD = ] E[Y jd = ], where Z is the option provided by the program.

13 by a similar derivation as in the epression of F Yd jd(yj), so Q Y Y jd(j) can be identi ed and P (Y > Y jd = ) = P (Y Y jd = ) can also be identi ed. 8 Actually, we can identify any conditional or unconditional quantile of Y Y of interest, e.g., Q Y Y jx;d(j; d), Q Y Y jx(j), Q Y Y jd(jd), Q Y Y () and Q Y Y jx;u D (j; (u D ; u D]), based on MQT E (; u D ). Since the corresponding weights can be similarly de ned as above, we neglect the details. Note that if only assumption (A6) holds, P (Y > Y jd = ) need not equal R R (QT T () > )ddf XjD (j) or R (IQT T > )d. They are equal only if the RP assumption holds on X = ; D = or D =. This observation can be used to test whether the RP assumption holds on a larger set than X = ; U D = u D. Because quantile is not a linear operator of the distribution function, Q Y Y () and Q Y () Q Y () are generally unequal (and do not have any identi able relationships), so the quantile treatment e ect and the quantile of the impact distribution are two di erent parameters. On the contrary, since mean is a linear operator of the distribution function, the average treatment e ect and the average of the impact distribution are the same parameter. In this paper, we concentrate on three most popular quantile treatment e ect parameters in the literature: MQT E (; u D ), QT E () and IQT E. We concentrate on di erence of quantiles rather than quantile of di erences because the latter may not be interesting. For eample, in the common e ect model, the distribution of Y Y is a point mass at a ed value. Even if the treatment e ect is not common, Y Y may still have discrete components in its distribution. See Section 3.2 of Aakvik et al. (25) for de nitions of the distributional counterparts of the MTE, ATE and ATT based on Y Y when the outcomes are binary, and see Section 2 of Abbring and Heckman (27) for de nitions of the distributional treatment e ects in more general settings. Finally, we study F Y Y jy ;D(jy ; ). We have already shown in Section 2. that under the RP assumption (A6), so P (Y Y yjy = y ; X = ; U D = u D ) = (Q YjX;U D (F YjX;U D (y j; u D )j; u D ) y + y ); P (Y Y yjy = y ; D = ) = P (Y Y yjy = y ; U D p(x; Z)) = R R h F UD ju ;X (p(;z)jf Y jx (y j);) R p(;z) P (Y Y yjy = y ; X = ; U D = u D )df UD ju ;X(pjF YjX(y j); ) df ZjX (zj)df XjY (jy ); where the equality is from the fact that F (UD ;X;Z)jY =y = F UD jy =y ;X;ZF ZjY=y ;XF XjY=y = F UD ju =F Y jx (y jx);x F ZjX F XjY=y, and U is de ned in the Skorohod representation of Y, Y jx = F Y jx (U jx). So this parameter is a complicated functional of Q YjX;U D (F YjX;U D (y j; u D )j; u D ) and is not easy to estimate. Actually, it is unknown whether it can be point identi ed since F UD ju ;X is hard to be nonparametrically identi ed without further structures on the model. i 3 Sharp Bounds for the QTE Although Q Yd jx;u D (j; u D ) can be point identi ed from Theorem, we show in this section that Q Yd jx(j) generally can only be partially identi ed, which implies that QT E () can only be partially identi ed. Here, we implicitly assume that the RP assumption on X = holds (i.e., Y d can be represented as Y d = q(d; X; U d ) 8 This parameter is useful, e.g., in the median-voter model, we need to check whether P (Y > Y jd = )P (D = ) > =2. 2

14 with U j(x = ) = U j(x = )), but we do not eplicitly eplore the information content in this assumption. 9 First, we impose the quantile independence assumption (QIA), Q Yd jx;z (jx; Z) = Q Yd jx (jx) for all 2 (; ). (8) This assumption is equivalent to (Y ; Y )? ZjX. This assumption is parallel to the usual IV assumption E[Y d jx; Z] = E[Y d jx] in the average treatment e ect evaluation. As in Heckman and Vytlacil (2b), we assume further that D = (p(x; Z) U D ) and Z? U D jx (9) to study the improvement on the bounds for QT E (). 3. Bounds Under the Quantile Independence Assumption From Proposition 2 and (36) of Manski (994), we have sharp bounds for Q Yd jx(j) under (8): sup L (; z) Q YjX(j) inf R (; z) ; z2z z2z sup L (; z) Q YjX(j) inf R (; z) ; z2z z2z where Z supp(zjx = ), ( QY L jx;z;d p(;z) ; z; ; (; z) = ; ( QY R jx;z;d p(;z) j; z; ; if p(; z) ; (; z) = ; otherwise, ( QY L jx;z;d p(;z) ; z; ; (; z) = ; ( QY R jx;z;d p(;z) ; z; ; (; z) = ; if p(; z) > ; otherwise, if p(; z) < ; otherwise, if p(; z) ; otherwise. () So I L () sup z2z L (; z) inf R (; z) QT E () inf R (; z) z2z z2z sup L (; z) I U (): () z2z This bound is trivial, since I L () = and I U () = if Y and Y are unbounded. Similar phenomena also happen in the average treatment e ect evaluation. To avoid such trivial results, we assume that P y l d() Y d y u d ()jx = ; Z = ; (2) where yd l (); yu d () 2 R does not depend on Z from (8). To simplify notations, we assume that yl () = y(), l denoted as y l (), and y u () = y u (), denoted as y u (). Then in () is changed to y l () and is changed to y u (). Let P supp(p(x; Z)jX = ), p sup = sup P and p inf = inf P. The width of the bounds is I U () I L (), a complicated epression to evaluate, especially if Z is uncountable. Note that the above bounds eactly identify QT E () if I L () = I U (). Note also that it is neither necessary nor su cient for p(; z) 9 In Section 6., we will show how Chernozhukov and Hansen (25) point identify Q Yd jx(j) by eploring the information content in this assumption and imposing some completeness conditions. 3

15 to be a nontrivial function of z for these bounds to improve upon the bounds when (8) is not imposed (i.e., sup and inf are dropped from ()); this is because Q Y jx;z;d (j; z; d) may depend on z through other z2z z2z channels than p(; z). Evaluating the bounds for QT E () requires knowledge of p(; z); Q Y jx;z;d p(; z) ; z; ; Q Y jx;z;d p(; z) ; z; ; y l (); y u (); Q Y jx;z;d p(; z) ; z; ; Q Y jx;z;d p(; z) ; z; for each z 2 Z ; estimators of these objects can be constructed in an obvious way, so are omitted here. The following theorem is parallel to Corollary and 2 of Proposition 6 in Manski (994). It develops necessary and su cient conditions on p sup and p inf Theorem 2 Suppose assumptions (8) and (2) hold. for point identifying QT E (). (i) p sup min f; g and p inf ma f; g are necessary for point identi cation of QT E (). Also, when p inf and p sup are achieved at some values that Z can take, p inf = and p sup = are su cient for point identi cation of QT E () for any ed 2 (; ). (ii) When (Y ; Y )? Dj (X; Z), p sup = and p inf = is su cient for point identi cation of QT E () for any ed 2 (; ). If assume further that Y d j (X = ) is continuously distributed with a positive density p inf on (y l (); y u ()), then p sup (). = and p inf = is also necessary for point identi cation of QT E () using For the average treatment e ect evaluation, Corollary of Proposition 6 in Manski (994) implies that =2 and p sup =2 are necessary for point identi cation of AT E () E[Y Y jx = ]. A key assumption for this result to hold is that the support of Y d j (X = ; Z) does not depend on Z. Our rst necessary condition requires only the support independence assumption rather that the full independence assumption (8). So these two sets of necessary conditions are comparable. When = =2, they are the same. To understand the su cient condition for point identi cation of QT E () in Theorem 2(i), we need to clarify the meaning of Q Y jx;z;d ((; z)j; z; d) when z 2 Z but cannot be taken by Z, where (; z) is the quantile inde as a function of and z. Since in this case Q Y jx;z;d ((; z)j; z; d) is not de ned, it should be understood as the continuous etension of Q Y jx;z;d ((; z)j; z; d) as z converges to z. The quantile functions in (3) below are similarly understood when p sup and p inf cannot be taken by p(; z). This etension is not required when Z is discretely distributed. When Z has a continuous component, we can assume that Z can take all values in Z, and assume (8) is satis ed for Z 2 Z. This assumption does not lose generality since rede ning a continuous random variable on a set with Lebesgue measure zero will not a ect its distribution at all. Under this etension, p inf and p sup must be achieved at some values in Z, so p inf = and p sup = are su cient for point identi cation of QT E of Z is discrete or continuous or a miture. Finally, note that p inf () for any ed 2 (; ) regardless = is essentially the usual = and p sup large support condition which entails identi cation-at-in nity. In the average treatment e ect evaluation, there are not su cient conditions for point identi cation of AT E () in the literature. From Manski (994), I L () AT E () I U (); 4

16 where I L () = sup z2z p(; z)e [Y jx = ; Z = z; D = ] + ( p(; z))y l (; z) inf f( p(; z)) E [Y jx = ; Z = z; D = ] + p(; z)y u (; z)g ; z2z I U () = inf z2z fp(; z)e [Y jx = ; Z = z; D = ] + ( p(; z))y u (; z)g sup ( p(; z)) E [Y jx = ; Z = z; D = ] + p(; z)y l (; z) ; z2z and y l (; z), y u (; z) 2 R satisfy P y l (; z) Y d y u (; z)jx = ; Z = z =. Note here that y l (; z) and y u (; z) depend on z if only the mean independence assumption, E[Y d jx; Z] = E[Y d jx], is imposed. As in Theorem 2(i), when p inf and p sup are achieved at some values that Z can take, p inf = and p sup = implies that E [Y jx = ; Z = z; D = ] E [Y jx = ; Z = z; D = ] I L () I U () E [Y jx = ; Z = z; D = ] E [Y jx = ; Z = z; D = ] ; so AT E () is point identi ed. Corollary 2 of Proposition 6 in Manski (994) implies that when (Y ; Y )? Dj (X; Z), AT E () is point identi ed using his bound (35) or [I L (); I U ()] above if and only if p sup = and p inf =. Our result parallels his result when Y d j (X = ) is continuously distributed with a positive density on (y l (); y u ()). It should be emphasized that when (Y ; Y )? Dj (X; Z), p sup = and p inf = is necessary for point identi cation of QT E () only when () is used. Actually, since Q Yd jx(j) = Q Yd jx;z(j; z) = Q Y jx;z;d (j; z; d), Q Yd jx(j) can be identi ed directly from Q Y jx;z;d (j; z; d). 3.2 Bounds Under the Nonparametric Selection Model The following theorem states the bounds for QT E () when assumption (9) is imposed. Theorem 3 Suppose assumptions (8), (9) and (2) hold. (i) QT E () has sharp bounds, L () R () QT E () R () L () ; (3) where ( QY L jx;p(x;z);d p () = sup ; p sup ; ; y l (); ( QY R jx;p(x;z);d p () = sup ; p sup ; ; if p sup ; y u (); otherwise, ( QY L jx;p(x;z);d p ; p inf () = inf ; ; y l (); ( QY R jx;p(x;z);d p ; p inf () = inf ; ; y u (); if p sup > ; otherwise, if p inf < ; otherwise, if p inf ; otherwise. (ii) p inf = and p sup = are su cient for point identi cation of QT E () for any ed 2 (; ). When ; D = ) and Y j X = ; p(x; Z) = p inf ; D = are continuously distributed Y j (X = ; p(x; Z) = p sup with a positive density on (y l (); y u ()), they are also necessary. 5

17 (iii) [I L (); I U ()] in () will simplify to the bounds in (3) under assumption (9). Figure 5: Intuition for L Q Y () R : p sup = :8, = :5, = :5, 2 = :46 and 3 = :9 Figure 5 provides some intuition for why L () Q YjX (j) R (); similar intuition can be applied to the bounds for Q YjX (j). From the proof of Theorem 3, P (Y yjp(z) = p sup ; D = ) p sup P (Y y) P (Y yjp(z) = p sup ; D = ) p sup + ( p sup ); where the conditioning on X = is depressed. Suppose (Y ; V ) N ;!! Z! p sup P (Y yjp(z) = p sup ; D = ) p sup = y (u D ) p du D : 2 ; then Figure 5 shows the bounds for P (Y y) when p sup = :8 and = :5. Inverting the bounds for P (Y y), we can get the bounds for Q Y (). When p sup, L = y l ; when > p sup, R = y u. Only if 2 ( p sup ; p sup ), both bounds are nontrivial. This is not always possible; only if p sup > ma(; ) =2 (p min < min(; )), neither the left nor the right bound for Q Y () (Q Y ()) is trivial. Pushing! or, we can see that there are nontrivial bounds for Q Y () (Q Y ()) for all if and only if p sup = (p min = ). Note that L d () and R d () are increasing functions of ; hence the bound for Q Yd jx(j) shifts to the right as increases. Also observe that p sup p sup and p inf p inf : Hence Q Y jx;p(x;z);d (j; p sup ; ) and Q Y jx;p(x;z);d j; p inf ; lie within the bound for Q YjX(j) and Q YjX(j), respectively. This implies that F YjX(j) = F Y jx;p(x;z);d (j; p sup ; ) and F YjX(j) = F Y jx;p(x;z);d (j; p inf ; ) are not rejectable in the absence of other information. 6

18 Evaluating the bounds for QT E () requires knowledge only of p inf ; p sup ; Q Y jx;p(x;z);d y l (); y u (); Q Y jx;p(x;z);d p inf p sup ; psup ; ; pinf ; ; Q Y jx;p(x;z);d ; Q Y jx;p(x;z);d p sup ; ; ; psup p inf ; pinf ; Estimators of these objects can be constructed in an obvious way, so are omitted here. The simpler structure of these bounds results from assumption (9). Also, it is both necessary and su cient for p(; z) to be a nontrivial function of z for the bounds in Theorem 3 to improve upon the bounds when (8) and (9) are not imposed. As shown in Section 4 of Heckman and Vytlacil (2b), p inf = and p sup : = are necessary and su cient for point identi cation of AT E () under assumption (9). Our result parallels their result when Y d j (X = ) is continuously distributed with a positive density on (y l (); y u ()). As shown in Section 6 of Heckman and Vytlacil (2b), the Manski IV bounds of AT E () simplify to their bounds under assumption (9); the last part of Theorem 3 parallels their result. Finally, note that the bounds for QT E () can be integrated (with respect to ) to get the bounds for IQT E. 3.3 Some Countereamples The bounds for QT E () in Theorem 3 can be applied to cases with discrete, continuous or mied response variables. Note that p inf Y j (X = ; p(x; Z) = p sup a positive density on (y l (); y u ()). = are necessary for point identi cation of QT E () only when ; D = are continuously distributed with = and p sup ; D = ) and Y j X = ; p(x; Z) = p inf The following eample illustrates that p inf = and p sup = is not necessary for point identi cation of QT E () when Y d is binary. The supplementary materials include another eample in a similar spirit where the distribution of Y d is a miture of continuous and discrete. Eample Suppose Y d 2 f; g. p sup P (Y = jx = ; p(x; Z) = psup ; D = ) 2 (; ) and p inf P (Y = jx = ; p(x; Z) = p inf ; D = ) 2 (; ). First check the bounds for Q YjX (j): ( L () = ( R () = ; ; ; if p sup > and p sup if p sup or [p sup > and p sup if p sup and p sup ; if p sup p sup > p sup ; < or [p sup and p sup > p sup ]: p sup ]; n o When ma ; < p sup p sup or p sup p sup p sup, L () = R () = ; when < p n o sup p sup < or ma ; < p sup p sup <, L p sup () = R () =. Similarly, when p inf p inf < n o min ; or p inf p inf, L p inf () = R () = ; when < p inf < or < p inf p n o inf p inf < min ;, L p inf () = R () =. Figure 6 shows the point identi cation combination of p sup (p inf ) and p sup (pinf ) for = :; :25; :5; :75; :9. Obviously, p inf for point identi cation of QT E also that p sup min f; g and p inf p inf 2 (; ) as predicted by Theorem 2. = and p sup = are not necessary (). Only if p sup = pinf =, p inf = and p sup = are necessary. Note ma f; g for point identi cation of QT E () for any p sup, 7

19 Figure 6: p sup (p inf and Blue Area for Q Yd jx (j) = ) and p sup (pinf ) for Point Identi cation of Q Yd jx (j): Red Area for Q Yd jx (j) = The net eample shows that [I L (); I U ()] in () may not simplify to the bounds in Theorem 3 if assumption (9) is not imposed. This eample parallels the eample in Section 6 of Heckman and Vytlacil (2b) where they show a similar result for AT E (). Eample 2 Suppose Z is binary and there are no other covariates. Take inf R (; z) as an eample; z2z suppose y l () =, y u () = and p() p(; ) > p(; ) p(). We want to show that it is possible to have min Q Y jz;d p() ; (p() ) + (p() < ); Q Y jz;d p() ; (p() ) + (p() < ) = Q Y jz;d p() ; (p() ) + (p() < ) < Q Y jz;d p() ; (p() ) + (p() < ): We must assume min fp(); p()g to make this result hold. If min fp(); p()g, we need only check q Q Y jz;d p() ; > Q Y jz;d p() ; q : First, the QIA needs to be satis ed. Without loss of generality, assume Y jz is uniformly distributed. Then the QIA is satis ed if F YjZ(y j) = F YjZ;D(y j; )( p()) + F YjZ;D(y j; )p() = y ; F YjZ(y j) = F YjZ;D(y j; )( p()) + F YjZ;D(y j; )p() = y ; (4) 8

20 for any y 2 [; ]. As long as F YjZ;D(q j; ) = q p() 2 (; ) or < q < + ( p()); F YjZ;D(q j; ) = q p() 2 (; ) or < q < + ( p()); we can nd quali ed F YjZ;D(y jz; d), z = ;, d = ; such that (4) is satis ed. For eample, let q q p() q F YjZ;D(y j; ) = y (y q ) + + q! p() y (y > q ); ( p()) q q q F YjZ;D(y j; ) = p() q y (y q ) + +! p() y (y > q ); p()q q q q q p() q F YjZ;D(y j; ) = y (y q ) + + q! p() y (y > q ); ( p()) q q q F YjZ;D(y j; ) = p() q y (y q ) + +! p() y (y > q ): p()q q q Figure 7 shows the case with = :5; p() = :6; p() = :7; q = :65 < :75 = q Figure 7: An Illustration of inf z2z R (; z) 6= R () When (9) is NOT Satis ed: = :5 It is useful to construct a test to check the hypothesis that the bounds [I L (); I U ()] and those in Theorem 3(i) coincide. Since I L () L () R () and I U () R () L () always hold, our null hypothesis is L () R () I L (), and I U () R () L () ; 9

Marginal Quantile Treatment E ect

Marginal Quantile Treatment E ect Marginal Quantile Treatment E ect Ping Yu y The University of Hong Kong Started: August 2 First Version: October 23 This Version: November 24 Abstract This paper studies estimation and inference based

More information

Identi cation of Positive Treatment E ects in. Randomized Experiments with Non-Compliance

Identi cation of Positive Treatment E ects in. Randomized Experiments with Non-Compliance Identi cation of Positive Treatment E ects in Randomized Experiments with Non-Compliance Aleksey Tetenov y February 18, 2012 Abstract I derive sharp nonparametric lower bounds on some parameters of the

More information

Flexible Estimation of Treatment Effect Parameters

Flexible Estimation of Treatment Effect Parameters Flexible Estimation of Treatment Effect Parameters Thomas MaCurdy a and Xiaohong Chen b and Han Hong c Introduction Many empirical studies of program evaluations are complicated by the presence of both

More information

The Econometric Evaluation of Policy Design: Part I: Heterogeneity in Program Impacts, Modeling Self-Selection, and Parameters of Interest

The Econometric Evaluation of Policy Design: Part I: Heterogeneity in Program Impacts, Modeling Self-Selection, and Parameters of Interest The Econometric Evaluation of Policy Design: Part I: Heterogeneity in Program Impacts, Modeling Self-Selection, and Parameters of Interest Edward Vytlacil, Yale University Renmin University, Department

More information

Principles Underlying Evaluation Estimators

Principles Underlying Evaluation Estimators The Principles Underlying Evaluation Estimators James J. University of Chicago Econ 350, Winter 2019 The Basic Principles Underlying the Identification of the Main Econometric Evaluation Estimators Two

More information

Testing for Rank Invariance or Similarity in Program Evaluation: The Effect of Training on Earnings Revisited

Testing for Rank Invariance or Similarity in Program Evaluation: The Effect of Training on Earnings Revisited Testing for Rank Invariance or Similarity in Program Evaluation: The Effect of Training on Earnings Revisited Yingying Dong University of California, Irvine Shu Shen University of California, Davis First

More information

Testing for Rank Invariance or Similarity in Program Evaluation

Testing for Rank Invariance or Similarity in Program Evaluation Testing for Rank Invariance or Similarity in Program Evaluation Yingying Dong University of California, Irvine Shu Shen University of California, Davis First version, February 2015; this version, October

More information

The relationship between treatment parameters within a latent variable framework

The relationship between treatment parameters within a latent variable framework Economics Letters 66 (2000) 33 39 www.elsevier.com/ locate/ econbase The relationship between treatment parameters within a latent variable framework James J. Heckman *,1, Edward J. Vytlacil 2 Department

More information

ECONOMETRICS II (ECO 2401) Victor Aguirregabiria. Spring 2018 TOPIC 4: INTRODUCTION TO THE EVALUATION OF TREATMENT EFFECTS

ECONOMETRICS II (ECO 2401) Victor Aguirregabiria. Spring 2018 TOPIC 4: INTRODUCTION TO THE EVALUATION OF TREATMENT EFFECTS ECONOMETRICS II (ECO 2401) Victor Aguirregabiria Spring 2018 TOPIC 4: INTRODUCTION TO THE EVALUATION OF TREATMENT EFFECTS 1. Introduction and Notation 2. Randomized treatment 3. Conditional independence

More information

What s New in Econometrics? Lecture 14 Quantile Methods

What s New in Econometrics? Lecture 14 Quantile Methods What s New in Econometrics? Lecture 14 Quantile Methods Jeff Wooldridge NBER Summer Institute, 2007 1. Reminders About Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile Regression

More information

Policy-Relevant Treatment Effects

Policy-Relevant Treatment Effects Policy-Relevant Treatment Effects By JAMES J. HECKMAN AND EDWARD VYTLACIL* Accounting for individual-level heterogeneity in the response to treatment is a major development in the econometric literature

More information

SIMILAR-ON-THE-BOUNDARY TESTS FOR MOMENT INEQUALITIES EXIST, BUT HAVE POOR POWER. Donald W. K. Andrews. August 2011

SIMILAR-ON-THE-BOUNDARY TESTS FOR MOMENT INEQUALITIES EXIST, BUT HAVE POOR POWER. Donald W. K. Andrews. August 2011 SIMILAR-ON-THE-BOUNDARY TESTS FOR MOMENT INEQUALITIES EXIST, BUT HAVE POOR POWER By Donald W. K. Andrews August 2011 COWLES FOUNDATION DISCUSSION PAPER NO. 1815 COWLES FOUNDATION FOR RESEARCH IN ECONOMICS

More information

Testing for Rank Invariance or Similarity in Program Evaluation: The Effect of Training on Earnings Revisited

Testing for Rank Invariance or Similarity in Program Evaluation: The Effect of Training on Earnings Revisited Testing for Rank Invariance or Similarity in Program Evaluation: The Effect of Training on Earnings Revisited Yingying Dong and Shu Shen UC Irvine and UC Davis Sept 2015 @ Chicago 1 / 37 Dong, Shen Testing

More information

A Test for Rank Similarity and Partial Identification of the Distribution of Treatment Effects Preliminary and incomplete

A Test for Rank Similarity and Partial Identification of the Distribution of Treatment Effects Preliminary and incomplete A Test for Rank Similarity and Partial Identification of the Distribution of Treatment Effects Preliminary and incomplete Brigham R. Frandsen Lars J. Lefgren August 1, 2015 Abstract We introduce a test

More information

Empirical Methods in Applied Microeconomics

Empirical Methods in Applied Microeconomics Empirical Methods in Applied Microeconomics Jörn-Ste en Pischke LSE November 2007 1 Nonlinearity and Heterogeneity We have so far concentrated on the estimation of treatment e ects when the treatment e

More information

Non-parametric Identi cation and Testable Implications of the Roy Model

Non-parametric Identi cation and Testable Implications of the Roy Model Non-parametric Identi cation and Testable Implications of the Roy Model Francisco J. Buera Northwestern University January 26 Abstract This paper studies non-parametric identi cation and the testable implications

More information

CALIFORNIA INSTITUTE OF TECHNOLOGY

CALIFORNIA INSTITUTE OF TECHNOLOGY DIVISION OF THE HUMANITIES AND SOCIAL SCIENCES CALIFORNIA INSTITUTE OF TECHNOLOGY PASADENA, CALIFORNIA 91125 IDENTIFYING TREATMENT EFFECTS UNDER DATA COMBINATION Yanqin Fan University of Washington Robert

More information

Limited Information Econometrics

Limited Information Econometrics Limited Information Econometrics Walras-Bowley Lecture NASM 2013 at USC Andrew Chesher CeMMAP & UCL June 14th 2013 AC (CeMMAP & UCL) LIE 6/14/13 1 / 32 Limited information econometrics Limited information

More information

Control Functions in Nonseparable Simultaneous Equations Models 1

Control Functions in Nonseparable Simultaneous Equations Models 1 Control Functions in Nonseparable Simultaneous Equations Models 1 Richard Blundell 2 UCL & IFS and Rosa L. Matzkin 3 UCLA June 2013 Abstract The control function approach (Heckman and Robb (1985)) in a

More information

Nonparametric Identi cation of Regression Models Containing a Misclassi ed Dichotomous Regressor Without Instruments

Nonparametric Identi cation of Regression Models Containing a Misclassi ed Dichotomous Regressor Without Instruments Nonparametric Identi cation of Regression Models Containing a Misclassi ed Dichotomous Regressor Without Instruments Xiaohong Chen Yale University Yingyao Hu y Johns Hopkins University Arthur Lewbel z

More information

New Developments in Econometrics Lecture 16: Quantile Estimation

New Developments in Econometrics Lecture 16: Quantile Estimation New Developments in Econometrics Lecture 16: Quantile Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. Review of Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile

More information

A Test for Rank Similarity and Partial Identification of the Distribution of Treatment Effects Preliminary and incomplete

A Test for Rank Similarity and Partial Identification of the Distribution of Treatment Effects Preliminary and incomplete A Test for Rank Similarity and Partial Identification of the Distribution of Treatment Effects Preliminary and incomplete Brigham R. Frandsen Lars J. Lefgren April 30, 2015 Abstract We introduce a test

More information

Lecture 8. Roy Model, IV with essential heterogeneity, MTE

Lecture 8. Roy Model, IV with essential heterogeneity, MTE Lecture 8. Roy Model, IV with essential heterogeneity, MTE Economics 2123 George Washington University Instructor: Prof. Ben Williams Heterogeneity When we talk about heterogeneity, usually we mean heterogeneity

More information

Generalized Roy Model and Cost-Benefit Analysis of Social Programs 1

Generalized Roy Model and Cost-Benefit Analysis of Social Programs 1 Generalized Roy Model and Cost-Benefit Analysis of Social Programs 1 James J. Heckman The University of Chicago University College Dublin Philipp Eisenhauer University of Mannheim Edward Vytlacil Columbia

More information

The Asymptotic Variance of Semi-parametric Estimators with Generated Regressors

The Asymptotic Variance of Semi-parametric Estimators with Generated Regressors The Asymptotic Variance of Semi-parametric stimators with Generated Regressors Jinyong Hahn Department of conomics, UCLA Geert Ridder Department of conomics, USC October 7, 00 Abstract We study the asymptotic

More information

The Generalized Roy Model and Treatment Effects

The Generalized Roy Model and Treatment Effects The Generalized Roy Model and Treatment Effects Christopher Taber University of Wisconsin November 10, 2016 Introduction From Imbens and Angrist we showed that if one runs IV, we get estimates of the Local

More information

Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models

Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models Using Matching, Instrumental Variables and Control Functions to Estimate Economic Choice Models James J. Heckman and Salvador Navarro The University of Chicago Review of Economics and Statistics 86(1)

More information

Identifying Structural E ects in Nonseparable Systems Using Covariates

Identifying Structural E ects in Nonseparable Systems Using Covariates Identifying Structural E ects in Nonseparable Systems Using Covariates Halbert White UC San Diego Karim Chalak Boston College October 16, 2008 Abstract This paper demonstrates the extensive scope of an

More information

Supplementary material to: Tolerating deance? Local average treatment eects without monotonicity.

Supplementary material to: Tolerating deance? Local average treatment eects without monotonicity. Supplementary material to: Tolerating deance? Local average treatment eects without monotonicity. Clément de Chaisemartin September 1, 2016 Abstract This paper gathers the supplementary material to de

More information

Job Displacement of Older Workers during the Great Recession: Tight Bounds on Distributional Treatment Effect Parameters

Job Displacement of Older Workers during the Great Recession: Tight Bounds on Distributional Treatment Effect Parameters Job Displacement of Older Workers during the Great Recession: Tight Bounds on Distributional Treatment Effect Parameters using Panel Data Brantly Callaway October 24, 205 Job Market Paper Abstract Older

More information

Exploring Marginal Treatment Effects

Exploring Marginal Treatment Effects Exploring Marginal Treatment Effects Flexible estimation using Stata Martin Eckhoff Andresen Statistics Norway Oslo, September 12th 2018 Martin Andresen (SSB) Exploring MTEs Oslo, 2018 1 / 25 Introduction

More information

Lecture Notes on Measurement Error

Lecture Notes on Measurement Error Steve Pischke Spring 2000 Lecture Notes on Measurement Error These notes summarize a variety of simple results on measurement error which I nd useful. They also provide some references where more complete

More information

A test of the conditional independence assumption in sample selection models

A test of the conditional independence assumption in sample selection models A test of the conditional independence assumption in sample selection models Martin Huber, Blaise Melly First draft: December 2006, Last changes: September 2012 Abstract: Identi cation in most sample selection

More information

Fuzzy Differences-in-Differences

Fuzzy Differences-in-Differences Review of Economic Studies (2017) 01, 1 30 0034-6527/17/00000001$02.00 c 2017 The Review of Economic Studies Limited Fuzzy Differences-in-Differences C. DE CHAISEMARTIN University of California at Santa

More information

Instrumental Variables in Models with Multiple Outcomes: The General Unordered Case

Instrumental Variables in Models with Multiple Outcomes: The General Unordered Case DISCUSSION PAPER SERIES IZA DP No. 3565 Instrumental Variables in Models with Multiple Outcomes: The General Unordered Case James J. Heckman Sergio Urzua Edward Vytlacil June 2008 Forschungsinstitut zur

More information

STRUCTURAL EQUATIONS, TREATMENT EFFECTS, AND ECONOMETRIC POLICY EVALUATION 1

STRUCTURAL EQUATIONS, TREATMENT EFFECTS, AND ECONOMETRIC POLICY EVALUATION 1 Econometrica, Vol. 73, No. 3 (May, 2005), 669 738 STRUCTURAL EQUATIONS, TREATMENT EFFECTS, AND ECONOMETRIC POLICY EVALUATION 1 BY JAMES J. HECKMAN ANDEDWARDVYTLACIL This paper uses the marginal treatment

More information

Differences-in-differences, differences of quantiles and quantiles of differences

Differences-in-differences, differences of quantiles and quantiles of differences Differences-in-differences, differences of quantiles and quantiles of differences Franco Peracchi October 13, 2006 1 A MOTIVATING EXAMPLE 1 A motivating example Economists have largely debated on the causes

More information

Lecture 11 Roy model, MTE, PRTE

Lecture 11 Roy model, MTE, PRTE Lecture 11 Roy model, MTE, PRTE Economics 2123 George Washington University Instructor: Prof. Ben Williams Roy Model Motivation The standard textbook example of simultaneity is a supply and demand system

More information

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed

More information

Nonparametric Identi cation of Regression Models Containing a Misclassi ed Dichotomous Regressor Without Instruments

Nonparametric Identi cation of Regression Models Containing a Misclassi ed Dichotomous Regressor Without Instruments Nonparametric Identi cation of Regression Models Containing a Misclassi ed Dichotomous Regressor Without Instruments Xiaohong Chen Yale University Yingyao Hu y Johns Hopkins University Arthur Lewbel z

More information

Lectures on Identi cation 2

Lectures on Identi cation 2 Lectures on Identi cation 2 Andrew Chesher CeMMAP & UCL April 16th 2008 Andrew Chesher (CeMMAP & UCL) Identi cation 2 4/16/2008 1 / 28 Topics 1 Monday April 14th. Motivation, history, de nitions, types

More information

Partial Identification of Nonseparable Models using Binary Instruments

Partial Identification of Nonseparable Models using Binary Instruments Partial Identification of Nonseparable Models using Binary Instruments Takuya Ishihara October 13, 2017 arxiv:1707.04405v2 [stat.me] 12 Oct 2017 Abstract In this study, we eplore the partial identification

More information

Lecture 4: Linear panel models

Lecture 4: Linear panel models Lecture 4: Linear panel models Luc Behaghel PSE February 2009 Luc Behaghel (PSE) Lecture 4 February 2009 1 / 47 Introduction Panel = repeated observations of the same individuals (e.g., rms, workers, countries)

More information

Chapter 1. GMM: Basic Concepts

Chapter 1. GMM: Basic Concepts Chapter 1. GMM: Basic Concepts Contents 1 Motivating Examples 1 1.1 Instrumental variable estimator....................... 1 1.2 Estimating parameters in monetary policy rules.............. 2 1.3 Estimating

More information

Simple Estimators for Semiparametric Multinomial Choice Models

Simple Estimators for Semiparametric Multinomial Choice Models Simple Estimators for Semiparametric Multinomial Choice Models James L. Powell and Paul A. Ruud University of California, Berkeley March 2008 Preliminary and Incomplete Comments Welcome Abstract This paper

More information

Estimation of Treatment Effects under Essential Heterogeneity

Estimation of Treatment Effects under Essential Heterogeneity Estimation of Treatment Effects under Essential Heterogeneity James Heckman University of Chicago and American Bar Foundation Sergio Urzua University of Chicago Edward Vytlacil Columbia University March

More information

Trimming for Bounds on Treatment Effects with Missing Outcomes *

Trimming for Bounds on Treatment Effects with Missing Outcomes * CENTER FOR LABOR ECONOMICS UNIVERSITY OF CALIFORNIA, BERKELEY WORKING PAPER NO. 5 Trimming for Bounds on Treatment Effects with Missing Outcomes * David S. Lee UC Berkeley and NBER March 2002 Abstract

More information

Simultaneous Choice Models: The Sandwich Approach to Nonparametric Analysis

Simultaneous Choice Models: The Sandwich Approach to Nonparametric Analysis Simultaneous Choice Models: The Sandwich Approach to Nonparametric Analysis Natalia Lazzati y November 09, 2013 Abstract We study collective choice models from a revealed preference approach given limited

More information

Testing Rank Similarity

Testing Rank Similarity Testing Rank Similarity Brigham R. Frandsen Lars J. Lefgren November 13, 2015 Abstract We introduce a test of the rank invariance or rank similarity assumption common in treatment effects and instrumental

More information

Generated Covariates in Nonparametric Estimation: A Short Review.

Generated Covariates in Nonparametric Estimation: A Short Review. Generated Covariates in Nonparametric Estimation: A Short Review. Enno Mammen, Christoph Rothe, and Melanie Schienle Abstract In many applications, covariates are not observed but have to be estimated

More information

Four Parameters of Interest in the Evaluation. of Social Programs. James J. Heckman Justin L. Tobias Edward Vytlacil

Four Parameters of Interest in the Evaluation. of Social Programs. James J. Heckman Justin L. Tobias Edward Vytlacil Four Parameters of Interest in the Evaluation of Social Programs James J. Heckman Justin L. Tobias Edward Vytlacil Nueld College, Oxford, August, 2005 1 1 Introduction This paper uses a latent variable

More information

Bounds on Average and Quantile Treatment E ects of Job Corps Training on Participants Wages

Bounds on Average and Quantile Treatment E ects of Job Corps Training on Participants Wages Bounds on Average and Quantile Treatment E ects of Job Corps Training on Participants Wages German Blanco Food and Resource Economics Department, University of Florida gblancol@u.edu Carlos A. Flores Department

More information

Lecture 9: Quantile Methods 2

Lecture 9: Quantile Methods 2 Lecture 9: Quantile Methods 2 1. Equivariance. 2. GMM for Quantiles. 3. Endogenous Models 4. Empirical Examples 1 1. Equivariance to Monotone Transformations. Theorem (Equivariance of Quantiles under Monotone

More information

STRUCTURAL EQUATIONS, TREATMENT EFFECTS AND ECONOMETRIC POLICY EVALUATION 1. By James J. Heckman and Edward Vytlacil

STRUCTURAL EQUATIONS, TREATMENT EFFECTS AND ECONOMETRIC POLICY EVALUATION 1. By James J. Heckman and Edward Vytlacil STRUCTURAL EQUATIONS, TREATMENT EFFECTS AND ECONOMETRIC POLICY EVALUATION 1 By James J. Heckman and Edward Vytlacil First Draft, August 2; Revised, June 21 Final Version, September 25, 23. 1 This paper

More information

NBER WORKING PAPER SERIES

NBER WORKING PAPER SERIES NBER WORKING PAPER SERIES IDENTIFICATION OF TREATMENT EFFECTS USING CONTROL FUNCTIONS IN MODELS WITH CONTINUOUS, ENDOGENOUS TREATMENT AND HETEROGENEOUS EFFECTS Jean-Pierre Florens James J. Heckman Costas

More information

Comparative Advantage and Schooling

Comparative Advantage and Schooling Comparative Advantage and Schooling Pedro Carneiro University College London, Institute for Fiscal Studies and IZA Sokbae Lee University College London and Institute for Fiscal Studies June 7, 2004 Abstract

More information

Quantile methods. Class Notes Manuel Arellano December 1, Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be

Quantile methods. Class Notes Manuel Arellano December 1, Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be Quantile methods Class Notes Manuel Arellano December 1, 2009 1 Unconditional quantiles Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be Q τ (Y ) q τ F 1 (τ) =inf{r : F

More information

Nonparametric Identi cation and Estimation of Truncated Regression Models with Heteroskedasticity

Nonparametric Identi cation and Estimation of Truncated Regression Models with Heteroskedasticity Nonparametric Identi cation and Estimation of Truncated Regression Models with Heteroskedasticity Songnian Chen a, Xun Lu a, Xianbo Zhou b and Yahong Zhou c a Department of Economics, Hong Kong University

More information

SIMILAR-ON-THE-BOUNDARY TESTS FOR MOMENT INEQUALITIES EXIST, BUT HAVE POOR POWER. Donald W. K. Andrews. August 2011 Revised March 2012

SIMILAR-ON-THE-BOUNDARY TESTS FOR MOMENT INEQUALITIES EXIST, BUT HAVE POOR POWER. Donald W. K. Andrews. August 2011 Revised March 2012 SIMILAR-ON-THE-BOUNDARY TESTS FOR MOMENT INEQUALITIES EXIST, BUT HAVE POOR POWER By Donald W. K. Andrews August 2011 Revised March 2012 COWLES FOUNDATION DISCUSSION PAPER NO. 1815R COWLES FOUNDATION FOR

More information

Unconditional Quantile Regressions

Unconditional Quantile Regressions Unconditional Regressions Sergio Firpo, Pontifícia Universidade Católica -Rio Nicole M. Fortin, and Thomas Lemieux University of British Columbia October 2006 Comments welcome Abstract We propose a new

More information

PSC 504: Instrumental Variables

PSC 504: Instrumental Variables PSC 504: Instrumental Variables Matthew Blackwell 3/28/2013 Instrumental Variables and Structural Equation Modeling Setup e basic idea behind instrumental variables is that we have a treatment with unmeasured

More information

Matching. James J. Heckman Econ 312. This draft, May 15, Intro Match Further MTE Impl Comp Gen. Roy Req Info Info Add Proxies Disc Modal Summ

Matching. James J. Heckman Econ 312. This draft, May 15, Intro Match Further MTE Impl Comp Gen. Roy Req Info Info Add Proxies Disc Modal Summ Matching James J. Heckman Econ 312 This draft, May 15, 2007 1 / 169 Introduction The assumption most commonly made to circumvent problems with randomization is that even though D is not random with respect

More information

Unconditional Quantile Regression with Endogenous Regressors

Unconditional Quantile Regression with Endogenous Regressors Unconditional Quantile Regression with Endogenous Regressors Pallab Kumar Ghosh Department of Economics Syracuse University. Email: paghosh@syr.edu Abstract This paper proposes an extension of the Fortin,

More information

Weak Stochastic Increasingness, Rank Exchangeability, and Partial Identification of The Distribution of Treatment Effects

Weak Stochastic Increasingness, Rank Exchangeability, and Partial Identification of The Distribution of Treatment Effects Weak Stochastic Increasingness, Rank Exchangeability, and Partial Identification of The Distribution of Treatment Effects Brigham R. Frandsen Lars J. Lefgren December 16, 2015 Abstract This article develops

More information

Continuous Treatments

Continuous Treatments Continuous Treatments Stefan Hoderlein Boston College Yuya Sasaki Brown University First Draft: July 15, 2009 This Draft: January 19, 2011 Abstract This paper explores the relationship between nonseparable

More information

Bounds on Population Average Treatment E ects with an Instrumental Variable

Bounds on Population Average Treatment E ects with an Instrumental Variable Bounds on Population Average Treatment E ects with an Instrumental Variable Preliminary (Rough) Draft Xuan Chen y Carlos A. Flores z Alfonso Flores-Lagunes x October 2013 Abstract We derive nonparametric

More information

MC3: Econometric Theory and Methods. Course Notes 4

MC3: Econometric Theory and Methods. Course Notes 4 University College London Department of Economics M.Sc. in Economics MC3: Econometric Theory and Methods Course Notes 4 Notes on maximum likelihood methods Andrew Chesher 25/0/2005 Course Notes 4, Andrew

More information

TECHNICAL WORKING PAPER SERIES LOCAL INSTRUMENTAL VARIABLES. James J. Heckman Edward J. Vytlacil

TECHNICAL WORKING PAPER SERIES LOCAL INSTRUMENTAL VARIABLES. James J. Heckman Edward J. Vytlacil TECHNICAL WORKING PAPER SERIES LOCAL INSTRUMENTAL VARIABLES James J. Heckman Edward J. Vytlacil Technical Working Paper 252 http://www.nber.org/papers/t252 NATIONAL BUREAU OF ECONOMIC RESEARCH 15 Massachusetts

More information

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria SOLUTION TO FINAL EXAM Friday, April 12, 2013. From 9:00-12:00 (3 hours) INSTRUCTIONS:

More information

A Note on the Closed-form Identi cation of Regression Models with a Mismeasured Binary Regressor

A Note on the Closed-form Identi cation of Regression Models with a Mismeasured Binary Regressor A Note on the Closed-form Identi cation of Regression Models with a Mismeasured Binary Regressor Xiaohong Chen Yale University Yingyao Hu y Johns Hopkins University Arthur Lewbel z Boston College First

More information

Counterfactual worlds

Counterfactual worlds Counterfactual worlds Andrew Chesher Adam Rosen The Institute for Fiscal Studies Department of Economics, UCL cemmap working paper CWP22/15 Counterfactual Worlds Andrew Chesher and Adam M. Rosen CeMMAP

More information

Stochastic Demand and Revealed Preference

Stochastic Demand and Revealed Preference Stochastic Demand and Revealed Preference Richard Blundell Dennis Kristensen Rosa Matzkin UCL & IFS, Columbia and UCLA November 2010 Blundell, Kristensen and Matzkin ( ) Stochastic Demand November 2010

More information

Labor Economics, Lecture 11: Partial Equilibrium Sequential Search

Labor Economics, Lecture 11: Partial Equilibrium Sequential Search Labor Economics, 14.661. Lecture 11: Partial Equilibrium Sequential Search Daron Acemoglu MIT December 6, 2011. Daron Acemoglu (MIT) Sequential Search December 6, 2011. 1 / 43 Introduction Introduction

More information

Econometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous

Econometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous Econometrics of causal inference Throughout, we consider the simplest case of a linear outcome equation, and homogeneous effects: y = βx + ɛ (1) where y is some outcome, x is an explanatory variable, and

More information

Impact Evaluation Technical Workshop:

Impact Evaluation Technical Workshop: Impact Evaluation Technical Workshop: Asian Development Bank Sept 1 3, 2014 Manila, Philippines Session 19(b) Quantile Treatment Effects I. Quantile Treatment Effects Most of the evaluation literature

More information

Speci cation of Conditional Expectation Functions

Speci cation of Conditional Expectation Functions Speci cation of Conditional Expectation Functions Econometrics Douglas G. Steigerwald UC Santa Barbara D. Steigerwald (UCSB) Specifying Expectation Functions 1 / 24 Overview Reference: B. Hansen Econometrics

More information

ted: a Stata Command for Testing Stability of Regression Discontinuity Models

ted: a Stata Command for Testing Stability of Regression Discontinuity Models ted: a Stata Command for Testing Stability of Regression Discontinuity Models Giovanni Cerulli IRCrES, Research Institute on Sustainable Economic Growth National Research Council of Italy 2016 Stata Conference

More information

IDENTIFICATION OF MARGINAL EFFECTS IN NONSEPARABLE MODELS WITHOUT MONOTONICITY

IDENTIFICATION OF MARGINAL EFFECTS IN NONSEPARABLE MODELS WITHOUT MONOTONICITY Econometrica, Vol. 75, No. 5 (September, 2007), 1513 1518 IDENTIFICATION OF MARGINAL EFFECTS IN NONSEPARABLE MODELS WITHOUT MONOTONICITY BY STEFAN HODERLEIN AND ENNO MAMMEN 1 Nonseparable models do not

More information

Econ 273B Advanced Econometrics Spring

Econ 273B Advanced Econometrics Spring Econ 273B Advanced Econometrics Spring 2005-6 Aprajit Mahajan email: amahajan@stanford.edu Landau 233 OH: Th 3-5 or by appt. This is a graduate level course in econometrics. The rst part of the course

More information

7(&+1,&$/:25.,1*3$3(56(5,(6,167580(17$/9$5,$%/(66(/(&7,2102'(/6$1' 7,*+7%281'6217+($9(5$*(75($70(17())(&7 -DPHV-+H NPDQ (GZDUG-9\WOD LO

7(&+1,&$/:25.,1*3$3(56(5,(6,167580(17$/9$5,$%/(66(/(&7,2102'(/6$1' 7,*+7%281'6217+($9(5$*(75($70(17())(&7 -DPHV-+H NPDQ (GZDUG-9\WOD LO 7(&+1,&$/:25.,1*3$3(56(5,(6,167580(17$/9$5,$%/(66(/(&7,2102'(/6$1' 7,*+7%281'6217+($9(5$*(75($70(17())(&7 -DPHV-+H NPDQ (GZDUG-9\WOD LO 7H KQL DO:RUNLQJ3DSHU KWWSZZZQEHURUJSDSHUV7 1$7,21$/%85($82)(&2120,&5(6($5&+

More information

Wageningen Summer School in Econometrics. The Bayesian Approach in Theory and Practice

Wageningen Summer School in Econometrics. The Bayesian Approach in Theory and Practice Wageningen Summer School in Econometrics The Bayesian Approach in Theory and Practice September 2008 Slides for Lecture on Qualitative and Limited Dependent Variable Models Gary Koop, University of Strathclyde

More information

Endogenous binary choice models with median restrictions: A comment

Endogenous binary choice models with median restrictions: A comment Available online at www.sciencedirect.com Economics Letters 98 (2008) 23 28 www.elsevier.com/locate/econbase Endogenous binary choice models with median restrictions: A comment Azeem M. Shaikh a, Edward

More information

Nonadditive Models with Endogenous Regressors

Nonadditive Models with Endogenous Regressors Nonadditive Models with Endogenous Regressors Guido W. Imbens First Draft: July 2005 This Draft: February 2006 Abstract In the last fifteen years there has been much work on nonparametric identification

More information

Simple Estimators for Monotone Index Models

Simple Estimators for Monotone Index Models Simple Estimators for Monotone Index Models Hyungtaik Ahn Dongguk University, Hidehiko Ichimura University College London, James L. Powell University of California, Berkeley (powell@econ.berkeley.edu)

More information

Granger Causality and Structural Causality in Cross-Section and Panel Data

Granger Causality and Structural Causality in Cross-Section and Panel Data Granger Causality and Structural Causality in Cross-Section and anel Data Xun Lu a, Liangjun Su b ; and Halbert White c a Department of Economics, Hong Kong University of Science and Technology b School

More information

Matching using Semiparametric Propensity Scores

Matching using Semiparametric Propensity Scores Matching using Semiparametric Propensity Scores Gregory Kordas Department of Economics University of Pennsylvania kordas@ssc.upenn.edu Steven F. Lehrer SPS and Department of Economics Queen s University

More information

Blinder-Oaxaca as a Reweighting Estimator

Blinder-Oaxaca as a Reweighting Estimator Blinder-Oaxaca as a Reweighting Estimator By Patrick Kline A large literature focuses on the use of propensity score methods as a semi-parametric alternative to regression for estimation of average treatments

More information

COUNTERFACTUAL MAPPING AND INDIVIDUAL TREATMENT EFFECTS IN NONSEPARABLE MODELS WITH DISCRETE ENDOGENEITY*

COUNTERFACTUAL MAPPING AND INDIVIDUAL TREATMENT EFFECTS IN NONSEPARABLE MODELS WITH DISCRETE ENDOGENEITY* COUNTERFACTUAL MAPPING AND INDIVIDUAL TREATMENT EFFECTS IN NONSEPARABLE MODELS WITH DISCRETE ENDOGENEITY* QUANG VUONG AND HAIQING XU ABSTRACT. This paper establishes nonparametric identification of individual

More information

INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION. 1. Introduction

INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION. 1. Introduction INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION VICTOR CHERNOZHUKOV CHRISTIAN HANSEN MICHAEL JANSSON Abstract. We consider asymptotic and finite-sample confidence bounds in instrumental

More information

GMM-based inference in the AR(1) panel data model for parameter values where local identi cation fails

GMM-based inference in the AR(1) panel data model for parameter values where local identi cation fails GMM-based inference in the AR() panel data model for parameter values where local identi cation fails Edith Madsen entre for Applied Microeconometrics (AM) Department of Economics, University of openhagen,

More information

Marginal treatment effects

Marginal treatment effects 11 Marginal treatment effects In this chapter, we review policy evaluation and Heckman and Vytlacil s [25, 27a] (HV) strategy for linking marginal treatment effects to other average treatment effects including

More information

Random Coefficients on Endogenous Variables in Simultaneous Equations Models

Random Coefficients on Endogenous Variables in Simultaneous Equations Models Review of Economic Studies (207) 00, 6 0034-6527/7/0000000$02.00 c 207 The Review of Economic Studies Limited Random Coefficients on Endogenous Variables in Simultaneous Equations Models MATTHEW A. MASTEN

More information

4.8 Instrumental Variables

4.8 Instrumental Variables 4.8. INSTRUMENTAL VARIABLES 35 4.8 Instrumental Variables A major complication that is emphasized in microeconometrics is the possibility of inconsistent parameter estimation due to endogenous regressors.

More information

Applied Microeconometrics. Maximilian Kasy

Applied Microeconometrics. Maximilian Kasy Applied Microeconometrics Maximilian Kasy 7) Distributional Effects, quantile regression (cf. Mostly Harmless Econometrics, chapter 7) Sir Francis Galton (Natural Inheritance, 1889): It is difficult to

More information

13 Endogeneity and Nonparametric IV

13 Endogeneity and Nonparametric IV 13 Endogeneity and Nonparametric IV 13.1 Nonparametric Endogeneity A nonparametric IV equation is Y i = g (X i ) + e i (1) E (e i j i ) = 0 In this model, some elements of X i are potentially endogenous,

More information

WORKING P A P E R. Unconditional Quantile Treatment Effects in the Presence of Covariates DAVID POWELL WR-816. December 2010

WORKING P A P E R. Unconditional Quantile Treatment Effects in the Presence of Covariates DAVID POWELL WR-816. December 2010 WORKING P A P E R Unconditional Quantile Treatment Effects in the Presence of Covariates DAVID POWELL WR-816 December 2010 This paper series made possible by the NIA funded RAND Center for the Study of

More information

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008 A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. Linear-in-Parameters Models: IV versus Control Functions 2. Correlated

More information

IDENTIFICATION OF TREATMENT EFFECTS WITH SELECTIVE PARTICIPATION IN A RANDOMIZED TRIAL

IDENTIFICATION OF TREATMENT EFFECTS WITH SELECTIVE PARTICIPATION IN A RANDOMIZED TRIAL IDENTIFICATION OF TREATMENT EFFECTS WITH SELECTIVE PARTICIPATION IN A RANDOMIZED TRIAL BRENDAN KLINE AND ELIE TAMER Abstract. Randomized trials (RTs) are used to learn about treatment effects. This paper

More information

Nonparametric Welfare Analysis for Discrete Choice

Nonparametric Welfare Analysis for Discrete Choice Nonparametric Welfare Analysis for Discrete Choice Debopam Bhattacharya University of Oxford September 26, 2014. Abstract We consider empirical measurement of exact equivalent/compensating variation resulting

More information

What do instrumental variable models deliver with discrete dependent variables?

What do instrumental variable models deliver with discrete dependent variables? What do instrumental variable models deliver with discrete dependent variables? Andrew Chesher Adam Rosen The Institute for Fiscal Studies Department of Economics, UCL cemmap working paper CWP10/13 What

More information