NBER WORKING PAPER SERIES MATCHING ON THE ESTIMATED PROPENSITY SCORE. Alberto Abadie Guido W. Imbens

Size: px
Start display at page:

Download "NBER WORKING PAPER SERIES MATCHING ON THE ESTIMATED PROPENSITY SCORE. Alberto Abadie Guido W. Imbens"

Transcription

1 BER WORKIG PAPER SERIES MATCHIG O THE ESTIMATED PROPESITY SCORE Alberto Abadie Guido W. Imbens Working Paper ATIOAL BUREAU OF ECOOMIC RESEARCH 050 Massachusetts Avenue Cambridge, MA 0238 August 2009 We are grateful to Ben Hansen, James Robins, Paul Rosenbaum, Donald Rubin, and participants in seminars at the Banff Center, Brown, Georgetown, Harvard/MIT, Montreal, and UPenn for comments and discussions. Software implementing these methods is available on our websites. The views expressed herein are those of the authors and do not necessarily reflect the views of the ational Bureau of Economic Research. BER working papers are circulated for discussion and comment purposes. They have not been peerreviewed or been subject to the review by the BER Board of Directors that accompanies official BER publications by Alberto Abadie and Guido W. Imbens. All rights reserved. Short sections of text, not to exceed two paragraphs, may be quoted without explicit permission provided that full credit, including notice, is given to the source.

2 Matching on the Estimated Propensity Score Alberto Abadie and Guido W. Imbens BER Working Paper o. 530 August 2009, Revised December 2009 JEL o. C3,C4 ABSTRACT Propensity score matching estimators Rosenbaum and Rubin, 983 are widely used in evaluation research to estimate average treatment effects. In this article, we derive the large sample distribution of propensity score matching estimators. Our derivations take into account that the propensity score is itself estimated in a first step, prior to matching. We prove that first step estimation of the propensity score affects the large sample distribution of propensity score matching estimators. Moreover, we derive an adjustment to the large sample variance of propensity score matching estimators that corrects for first step estimation of the propensity score. In spite of the great popularity of propensity score matching estimators, these results were previously unavailable in the literature. Alberto Abadie John F. Kennedy School of Government Harvard University 79 JFK Street Cambridge, MA 0238 and BER alberto_abadie@harvard.edu Guido W. Imbens Department of Economics Littauer Center Harvard University 805 Cambridge Street Cambridge, MA 0238 and BER imbens@fas.harvard.edu

3 I. Introduction Propensity score matching estimators Rosenbaum and Rubin, 983 are widely used to estimate treatment effects when all treatment confounders are measured. Rosenbaum and Rubin 983 define the propensity score as the conditional probability of assignment to a treatment given a vector of covariates including the values of all treatment confounders. Their key insight is that adjusting for the propensity score is enough to remove the bias created by all treatment confounders. Relative to matching directly on the covariates, propensity score matching has the advantage of reducing the dimensionality of matching to a single dimension. This greatly facilitates the matching process, because units with dissimilar covariate values may nevertheless have similar values in their propensity scores. Propensity score values are rarely observed in practice. Usually the propensity score has to be estimated prior to matching. In spite of the great popularity that propensity score matching methods have gained since they were proposed by Rosenbaum and Rubin in 983, their large sample distribution has not yet been derived for the case when the propensity score is estimated in a first step. A possible reason for this void in the literature is that matching estimators are highly non-smooth functionals of the distribution of the matching variables, which makes it difficult to establish an asymptotic approximation to the distribution of matching estimators when a matching variable is estimated in a first step. This has motivated the use of bootstrap standard errors for propensity score matching estimators. However, recently it has been shown that the bootstrap is not in general valid for matching estimators Abadie and Imbens, In this article, we derive the large sample distribution of propensity score matching estimators. Our derivations take into account that the propensity score is itself estimated in a first step. We prove that first step estimation of the propensity score affects the large sample distribution of propensity score matching estimators. Moreover, we derive In contexts different than matching, Hirano, Imbens and Ridder 2003, Abadie 2005, Wooldridge 2007, and Angrist and Kuersteiner 2009 derive large sample properties of statistics based on a first step estimator of the propensity score. In all these cases, the second step statistics are smooth functionals of the propensity scores and, therefore, standard expansions for two-step estimators apply see, e.g., ewey and McFadden, 994.

4 an adjustment to the large sample variance of propensity score matching estimators that corrects for first step estimation of the propensity score. Finally, we use a small simulation exercise to illustrate the implications of our theoretical results. To preview our results, let F x θ be a parametric model for the propensity score, with unknown parameters θ, and let ˆθ be the maximum likelihood estimator for θ. We show that, under regularity conditions, the estimator ˆτ, for the average treatment effect τ = EY Y 0, based on matching on the estimated propensity score F X i ˆθ, satisfies ˆτ τ d 0, σ 2 c I θ c. In this expression, σ 2 is the variance of the matching estimator based on matching on the true propensity score F X iθ which follows from results in Abadie and Imbens, 2006, I θ is the Fisher information matrix for the parametric model for the propensity score, and c is a vector that depends on the covariance between the covariates and the outcome, conditional on the propensity score and the treatment. Thus, matching on the estimated propensity score has a smaller asymptotic variance than matching on the true propensity score. This is in line with results in Rubin and Thomas 992ab who argue that, in settings with normally distributed covariates, matching on the estimated rather than the true propensity score improves the properties of matching estimators. Hirano, Imbens and Ridder 2003 obtain a similar result for weighting estimators. The rest of the article is organized as follows. Section II provides an introduction to propensity score matching. Section III is the main section of the article. In this section we derive the large sample properties of an estimator that match on estimated propensity scores. Section IV proposes an estimator for the adjusted standard errors derived in section III. In section V we report the results of a small simulation exercise. Section VI concludes. II. Matching on the Estimated Propensity Score In evaluation research the focus of the analysis is typically the effect of a binary treatment, represented in this paper by the indicator variable W, on some outcome variable, Y. More specifically, W = indicates exposure to treatment, while W = 0 indicates lack of exposure 2

5 to treatment. Following Rubin 974, we define treatment effects in terms of potential outcomes. We define Y as the potential outcome under exposure to treatment, and Y 0 as the potential outcome under no exposure to treatment. Our goal is to estimate the average treatment effect, τ = E Y Y 0, where the expectation is taken over the population of interest, based on a random sample from this population. Estimation of treatment effects is complicated by the fact that for each unit in the population, the observed outcome reflects only one of the potential outcomes: Y = { Y 0 if W = 0, Y if W =. Let X be a vector of covariates that includes treatment confounders, that is, variables that affect the probability of treatment exposure and the potential outcomes. The propensity score is px = PrW = X. The following assumption is often referred to as strong ignorability Rosenbaum and Rubin, 983. Assumption : i Y, Y 0 W X almost surely; ii 0 < px < almost surely. Assumption i will hold if all treatment confounders are included in X; so, after controlling for X, treatment exposure is independent of the potential outcomes. Assumption ii states that for almost all values of X the population includes treated and untreated units. Let µw, x = EY W = w, X = x and µw, p = EY W = w, px = p be the regression of the outcome on the treatment indicator and the covariates, and on the treatment indicator and the propensity score respectively. Rosenbaum and Rubin 983 prove that, under Assumption, τ = E µ, px µ0, px. In other words, adjusting for the propensity score is enough to eliminate the bias created by all treatment confounders. This result by Rosenbaum and Rubin 983 motivates the use of propensity score matching estimators. Following Rosenbaum and Rubin 983 and the vast majority of 3

6 the empirical literature, consider a generalized linear specification for the propensity score px = F x θ. 2 In empirical research the link function F is usually specified as logit or probit. Assume for the moment that the parameters of the propensity score, θ, are known. For each observation, i, let J M i, θ be a set of M observations in the treatment group opposite to i and with values of F X θ similar to F X iθ. A propensity score matching estimator can be defined as: τ θ = 2W i Y i M j J M i,θ In this article we will consider matching with replacement, so each unit in the sample can be used as a match multiple times. In the absence of matching ties, the sets J M i, θ can be defined as: { } J M i, θ = j : W j = W i, {Wk = W i } { F X i θ F X k θ F X i θ F X j θ } M. k= Let K,i θ be the number of times that observation i is used as a match when matching on F X θ: K,i θ = The estimator τ θ can be represented as: τ θ = k= {i JM k,θ}. Y j 2W i + K,iθ Y i. M In practice, propensity scores are not directly observed and estimators that match on the true propensity score are therefore unfeasible. For some random sample {Y i, W i, X i }, let θ be an estimator of θ. A matching estimator of τ that matches on estimated propensity. scores is given by: τ θ = 2W i Y i M j J M i, θ Y j. 2 It is easy to extend our results to more general parametric models for the propensity score. We restrict our attention to generalized linear specifications because in practice they are widely used to estimate propensity scores. 4

7 We assume, in concordance with the literature, that θ is the Maximum Likelihood estimator of θ. 3 In the next section, we derive the large sample distribution of τ θ. III. Large Sample Distribution We begin by introducing a decomposition of τ θ that will be used later in this section. Define T θ = τ θ τ = 2W i + K,iθ Y i τ. M otice that T θ = D θ + R θ, where and R θ = D θ = = µ, F X iθ µ0, F X iθ τ 2W i + K,iθ Y i µw i, F X M iθ, 2W i µ W i, F X iθ M j J M i µ W i, F X iθ. Let P θ be the distribution of Z = {Y, W, X}, induced by the propensity score, F x θ, the marginal distribution of X, and the conditional distribution of Y given X and W. To simplify the exposition, we will implicitly assume that Y and X are bounded, so all moments exist for these two variables. Consider Z,i = {Y,i, W,i, X,i } with distribution given by the local shift P θ with θ = θ + h/, where h is a conformable vector of constants. Assumption 2: i For some ε > 0, all x in the support of X, and all θ R k such that θ θ ε, the distribution of F X θ is continuous with support equal to an interval bounded away from zero and one. ii For all θ R k such that θ θ ε, all F in the 3 This is done to conform with empirical practice. Our results can be readily extended to estimators of θ other than Maximum Likelihood. 5

8 support of F X θ, and all w = 0,, the regression function EY,i W,i = w, F X,i θ = F is Lipschitz-continuous in F. Proposition : If Assumption 2 holds, R θ p 0 under P θ. All proof are provided in the appendix. and Proposition implies that T θ = D θ + o p under P θ. Let In addition, let θ = Λ θ θ = be the Fisher Information Matrix for θ. Assumption 3: Under P θ : log dp θ Z dp θ,i, X,i W,i F X,i θ F X,i θ F X,i θfx,iθ. fx θ 2 I θ = E F X θ F X θ XX, Λ θ θ = h θ 2 h I θ h + o p, θ d 0, I θ, 2 and θ θ = I θ θ + o p. 3 For regular parametric models, equation can be established using Proposition 2..2 in Bickel et al Also for regular parametric models, equation 2 is derived in the proof of Proposition 2..2 in Bickel et al Equation 3 can be established using the same set of results plus classical conditions for asymptotic linearity of maximum likelihood estimators see, e.g., van der Vaart 998 Theorem 5.39; Lehmann and Romano 2005 Theorem The following assumption collects some technical regularity conditions that will be used later in this section. 6

9 Assumption 4: i The function F has a continuous derivative. ii There is some ε > 0, such that for all θ such that θ θ ε the density of F X θ is bounded and bounded away from zero. iii For all bounded functions hy, W, X, E θ hy, W, X F X θ, W converges to EhY, W, X F X θ, W where E θ P θ. denotes an expectation with respect to Assumption 4i is satisfied in the most usual binary choice models employed for the estimation of the propensity score Probit, Logit. We adopt Assumption 4ii for technical reasons, because it simplifies matters considerably in the proof of our main theorem. This assumption typically implies some trimming on the population of interest to discard lowdensity values of the propensity score. To avoid cluttering, we leave such trimming implicit in our notation. Primitive conditions for assumption 4iii can be established using the results in Ganssler and Pfanzagl 97. Our derivation of the limit distribution of τ τ is based on the techniques developed in Andreou and Werker 2005 to analyze to limit distribution of residualbased statistics. We proceed in four steps. First, we derive the joint limit distribution of T θ, θ under P θ. The following result is useful in that respect. Proposition 2: Suppose that Assumption 3 holds. Then, under P θ : D θ d 0 σ 2 c,, θ 0 c where σ 2 is the asymptotic variance of T θ and c = E covx, µw, X F X θ, W fx θ otice that propositions and 2 imply: T θ θ under P θ. d 0 0 I θ W F X θ 2 + σ 2 c, c I θ W. F X θ 2, 4 Second, we use equation 4, along with Assumption 3, to obtain the joint limit distribution of T θ, θ θ, Λ θ θ under P θ : T θ 0 σ 2 c I θ θ d θ c h 0, I Λ θ θ h θ c I θ h. I θ h/2 h c h h I θ h 7

10 Third, applying Le Cam s third lemma, we obtain T θ d c h, θ θ h σ 2 c I θ I θ c I θ, or equivalently: T θ + h/ θ θ d c h 0, σ 2 c I θ I θ c I θ, under P θ, for any h R k. Finally, we calculate the limit distribution of T θ = τ τ as the limit distribution of T θ +h/ conditional on θ = θ +h/ i.e. θ θ = h, integrated over the distribution of θ θ. Theorem : Under P θ τ τ d 0, σ 2 c I θ c. The asymptotic variance of τ is adjusted by c I θ c to account for first-step estimation of the propensity score. In this case, the adjustment reduces the asymptotic variance. This need not be the case for matching estimators of other treatment parameters, such as the average treatment effect on the treated. Formally, the proof of Theorem requires a discretization of the first step estimator θ see Andreou and Werker, 2005, for details. This discretization can be arbitrarily fine and the result of Theorem arises in the limit, as we make the discretization increasingly finer. IV. Estimation of the Asymptotic Variance Let H J i, θ be the set of the J units with W = W i and closest values of F X θ to F X iθ, and let Ȳ J,θ i be the average of Y for for the units in {i H J i, θ}. Consider the following estimator of vary i F X iθ, W i : σ 2,iθ = J j {i H J i,θ} Y j Ȳ J,θ i Y j Ȳ J,θ i. Abadie and Imbens 2006 propose the following estimator for σ 2 θ: σ θ 2 = 2W i Y i Y j τ θ M 8 j J M i,θ 2

11 = 2 K,i θ + 2M K,i θ σ 2 M M M,iθ. Let J,θ X i be the averages of X for for the units in {i H J i, θ}. otice that Y = µw, X + ε, where Eε X, W = 0. As a result: covx, µw, X F X θ, W = covx, Y F X θ, W. Consider the following estimator of covx, Y F X θ, W : ĉovx i, Y i F X iθ, W i = J j {i H J i, θ } X j X J, θ i Y j Ȳ J, θ i. Our estimator of c is: ĉ = ĉovx i, Y i F X iθ, W i fx i θ W i F X i θ + W i. 2 F X i θ 2 Finally, let Î θ, = fx i θ 2 F X i θ F X i θ X ix i. Because I θ is non-singular, the inverse of Îθ, exists with probability approaching one. Our estimator of the large sample variance of the propensity score matching estimator, adjusted for first step estimation of the propensity score, is: σ 2 adj, θ = σ 2 θ ĉ Î θ,ĉ. Consistency of this estimator can be shown using the results in Abadie and Imbens 2006 and the contiguity arguments employed in section III. V. A Small Simulation Exercise In this section, we run a small Monte Carlo exercise to investigate the sampling distribution of propensity score matching estimators and of the approximation to that distribution that we propose in the article. We use a simple Monte Carlo design. The outcome variable is generated by Y = 5W + 4X + X 2 + U, where X and X 2 are independent and uniform on 0, and U 9

12 is a standard ormal variable independent of W, X, X 2. The treatment variable, W, is related to X, X 2 through the propensity score, which is logistic PrW = X = x, X 2 = x 2 = exp + x x 2 + exp + x x 2. Table I reports the results of our Monte Carlo simulation for M = and = As in our theoretical results, the variance of τ θ, the estimator that matches on the true propensity score, is larger than the variance of τ θ, the estimator that matches on the estimated propensity score. The estimator of the variance of τ θ proposed in Abadie and Imbens 2006, σ 2 θ, is centered at the variance of τ θ. σ 2 θ is the estimator of the variance that treats the first step estimate of the propensity score θ as if it was the true propensity score, and σ 2 adj, θ is the adjusted estimator of the variance that takes into account that the propensity score is itself estimated in a first step. Finally, the table reports also confidence interval constructed with adjusted and unadjusted standard errors. In concordance with out theoretical results, the simulation shows that σ 2 θ is biased and too large on average. In addition, confidence intervals constructed with σ 2 θ have larger than nominal coverage rates. In contrast, σ 2 adj, θ is unbiased and produce confidence intervals that have coverage rates close to nominal rates. VI. Conclusions In this article, we propose a method to correct to the asymptotic variance of propensity score matching estimators when the propensity scores are estimated in a first step. Our results allow valid large sample inference for propensity score matching estimators. 0

13 Appendix For the proof of Proposition we will need some preliminary lemmas. Lemma A.: Consider two independent samples of sizes n 0 and n from continuous distributions F 0 and F with common support: V 0,,..., V 0,n0 i.i.d. F 0 and V,,..., V,n i.i.d. F. Let = n 0 + n. Assume that the support of F 0 and F is an interval inside 0,. Let f 0 and f be the densities of F 0 and F, respectively. Suppose that for any v in the supports of F 0 and F, f v/f 0 v r. For i n and m M n 0, let U n0,n,i m be the m-th order statistic of { V,i V 0,,..., V,i V 0,n0 }. Then, for n 0 3: E n M M n U n0,n,i m r /2 n 3/4 0 + M n m= 0 exp n /4 0. /2 nm /4 Proof: Consider balls assigned at random among n bins of equal probability. It is known that the mean of the number of bins with exactly m balls is equal to m n m m n n see Johnson and Kotz, 977, p. 4. Because f v/f 0 v r, for any measurable set A: f v PrV,i A = f v dv = f 0 v dv r PrV 0,i A. f 0 v A A Divide the support of F 0 and F in n 3/4 0 cells of equal probability / n 3/4 0 under F 0. Let Z M,n0 be the number of such cells are not occupied by at least M observations from the second sample: V 0,,..., V 0,n0. For i =,...,. Let µ M,n0 = EZ M,n0. otice that n 0 3 implies n 3/ Then, M m µ M,n0 = n 3/4 n0 0 m m=0 n 3/4 n0 m 0 n 3/4 0 M m n 3/4 0 nm 0 m! m=0 n 3/4 n0 m 0 n 3/4 0 Mn M /4 0 n0 n 3/4. 0 Using Markov s inequality, PrZ M,n0 > 0 = PrZ M,n0 µ M,n0 Mn M /4 0 n0 n 3/4. 0 otice that for any positive a, we have that a loga. Therefore, for any b <, we have that log b/ b/ and b/ exp b. As a result, we obtain: n0 n0 n 3/4 = n/4 0 exp n /4 0. n 0 0

14 Putting together the last two displayed equations, we obtain the following exponential bound for PrZ M,n0 > 0: PrZ M,n0 > 0 Mn M /4 0 exp n /4 0. otice that U n0,n,i m. For 0 n n 3/4 0, let c n0,n = F n/ n 3/4 0, then M E U n0,n,i m ZM,n0 = 0 m= ow, n E M M U n0,n,i m m= = n 3/4 0 n= M r n 3/4 0 M r n 3/4 0. M c n0,n c n0,n Pr cn0,n V,i c n0,n ZM,n0 = 0 n 3/4 0 n= n = E M + E + = + E n M n M n cn0,n c n0,n M ZM,n0 U n0,n,i m = 0 PrZ M,n0 = 0 m= M ZM,n0 U n0,n,i m > 0 PrZ M,n0 > 0 m= M ZM,n0 U n0,n,i m = 0 m= /2 PrZ M,n 0 > 0 n M E U n0,n M,i m Z M,n0 = 0 n m= /2 PrZ M,n 0 > 0 n r /2 n 3/4 0 + M n 0 exp n /4 0. /2 nm /4 Lemma A.2: Inverse Moments of the Doubly Truncated Binomial Distribution Let 0 be a Binomial variable with parameters, p that is left-truncated for values smaller than M and right-truncated for values greater than M, where M < /2. Then, for any r > 0, there exist a constant C r, such that r E C r, 0 for all > 2M. Proof: Let = 0. For all q > 0, r r { } r { } E = E > q + E q

15 = M M r Pr r Pr > > q + q r 0 q + q r otice that: Pr > q = x M x> / q,x M x M x M x x p x p x p x p x For > 2M the denominator can be bounded away from zero. Therefore, for some positive constant C, and q > / p, Pr > q C x M x> / q,x M C x> / q x x p x p x p x p x C exp { 2 / q p 2 }, by Hoeffding s Inequality e.g. van der Vaart and Wellner 996, p Therefore E/ 0 r is uniformly bounded for > 2M. Lemma A.3: Suppose that the propensity score, PrW = X, is continuously distributed and that there exist c L > 0 and c U < such that c L PrW = X = x c U for all x X. Let f be the distribution of the propensity score conditional on W =, and let f 0 be the distribution of the propensity score conditional on W = 0. Then, the ratio f p/f 0 p is bounded and bounded away from zero. Proof: Use Bayes Theorem to show that f p/f 0 p = p/ pprw = / PrW = 0. Proof of Proposition : Let f θ be the distribution of the propensity score conditional on W =, and let f θ 0 be the distribution of the propensity score conditional on W = 0. By lemma A.3 the ratio f θ p/f θ 0 p is uniformly bounded by some constant r. Consider 0 and as in Lemma A.2. Let ψ M, 0, = r /2 3/4 0 + M 0 exp /4 0. /2 M /4 Then, ψ M, 0, p 0. Rearrange the observations in the sample so that the first observations have W = and the remaining 0 = observations have W = 0. For i and m M, let U 0,,i m be the m-th order statistic of { F X,i θ F X, + θ,..., F X,i θ F X, θ }. For + i and m M, 3

16 let U 0,,i m be the m-th order statistic of { F X,i θ F X, θ,..., F X,i θ F X, θ }. Lemma A. implies that for large enough : E M M m= U 0,,i m E ψ M, 0,. A. Therefore, to prove that the left-hand-side of equation A. converges to zero, it is enough to show that ψ M, 0, is asymptotically uniformly integrable: lim lim sup E ψ k M, 0, {ψ M, 0, > k} = 0 see, e.g., van der Vaart 998, p. 7. otice that the ratio 3/4 / 3/4 is bounded. This, in combination with Lemma A.2, implies that for all k > 0 and some positive constant C, E ψ M, 0, {ψ M, 0, > k} E ψ M, 0, E r /2 3/4 0 + M /2 3/4 M+/2 0 exp /4 0 0 Similarly, E = i= + C E C /4 E M M m= /2 3/4 0 3/4 3/4 0 U 0,,i m Using Markov s inequality, we obtain that for any ε > 0: Pr M M U 0,,i m > ε m= E 0. p 0. M M U 0,,i m m= ε 0. The result now follows from Lipschitz-continuity of the regression functions, EY,i W,i = w, F X,i θ = F for some ε > 0 and θ θ ε. Proof of Proposition 2 Sketch: In this proof, we first establish a extend the martingale representation of matching estimators Abadie and Imbens, 2009 to the propensity score matching estimator studied in this article. Consider the linear combination C = z D θ + z 2 θ. C = z µ, F X,iθ µ0, F X,iθ τ 4

17 + z + z 2 2W,i + K,iθ Y,i µw,i, F X M,iθ X,i W,i F X,i θ F X,i θ F X,i θ fx,iθ. C can be analyzed using martingale methods. otice that: C = 3 k= ξ,k, where ξ,k = z µ, F X,k θ µ0, F X,k θ τ for k, + z 2 E θ X,k F X,k θ W,k F X,k θ F X,k θ F X,k θ fx,k θ, ξ,k = z 2 X,k E θ X,k F X,k θ W,k F X,k θ fx,k θ F X,k θ F X,k θ + z 2W,k + K,k θ µw,k, X,k µw,k, F X,k M θ. for + k 2, ξ,k = z 2W,k 2 + K,k 2θ Y,k 2 µw,k 2, X,k 2, M for 2 + k 3. Consider the σ-fields F,k = σ{w,,..., W,k, X, θ,..., X,k θ } for k, F,k = σ{w,,..., W,, X, θ,..., X, θ, X,,..., X,k } for + k 2, and F,k = σ{w,,..., W,, X,,..., X,, Y,,..., Y,k } for 2 + k 3. Then, i ξ,j, F,i, i 3 j= is a martingale for each. Therefore, the limiting distribution of C can be studied using Martingale Central Limit Theorem e.g., Theorem 35.2 in Billingsley 995, p. 476; importantly, notice that this theorem allows that the probability space varies with. Because Y,i, X,i, and W,i are bounded random variables uniformly in, and because K,i has uniformly bounded moments see Abadie and Imbens, 2009, it follows that: 3 k= Eξ 2+δ,k 0 for some δ > 0. 5

18 Lindeberg s condition in Billingsley s theorem follows easily from last equation Lyapounov s condition. As a result, we obtain that under P θ where and C d 0, σ 2 + σ σ 2 3, σ 2 = plim σ 2 2 = plim σ 2 3 = plim E θ ξ,k 2 F,k, k= 2 k=+ 3 k=2+ E θ ξ 2,k F,k, E θ ξ 2,k F,k. After some algebra, we obtain: σ 2 = z 2 E µ, F X θ µ0, F X θ τ 2 + z 2E f 2 X θ F X θ F X θ EX F X θex F X θ z 2. Following the calculations in Abadie and Imbens 2006, additional proofs for the expectation of + K,i /M 2 : σ2 2 = z 2E f 2 X θ F X θ F X θ varx F X θ z 2 varµ, X F X + ze 2 θ F X + varµ0, X F X θ θ F X θ + z 2 2M E F X θ F X θ varµ, X F X θ + z 2 2M E F X θ F X θ varµ0, X F X θ + 2 z 2E covx, µw, X F X fx θ θ, W F X θ F X z. θ Here we use the fact that, conditional on the propensity score, X is independent of W. To derive the constant vector of the cross-product notice that: E cov X, µx, W F X θ, W W F X θ2w F X θ F X fx θ + K θ θ M = E cov X, µx, F X θ fx θ F X + K W θ = p θ M + E cov X, µx, 0 F X θ fx θ F X + K W θ = 0 p θ M 6

19 E cov X, µx, F X θ fx θ W = F X θ 2 p + E cov X, µx, 0 F X θ fx θ W = 0 F X θ 2 p W = E covx, µw, X F X θ, W fx θ F X θ 2 + W F X θ 2. Finally, vary X, W = σ3 2 = z 2 vary X, W = 0 E F X + θ F X θ + z 2 2M E F X θ F X θ vary X, W = + z 2 2M E F X θ F X θ vary X, W = 0. otice that for any integrable function gf X θ: E gf X θ varµw, X F X θ + vary X, W = w = E = E gf X θ As a result, under P θ : where z = z, z 2, and where gf X θ varµw, X F X θ + E vary X, W = w F X θ varµw, X F X θ, W = w + E vary X, W = w F X θ, W = w = E gf X θ vary F X θ, W = w. C d 0, z V z, σ 2 c V = c W c = E covx, µw, X F X θ, W fx θ F X θ 2 + W F X θ 2, and σ 2 is the asymptotic variance calculated in Abadie and Imbens for the case of a known propensity score. Applying the Cramer-Wold device, under P θ : D θ d 0, V. θ Proof of Theorem : Given our preliminary results, Theorem follows from Andreou and Werker I θ, 7

20 References Abadie, A Semiparametric Difference-in-Differences Estimators, Review of Economic Studies, vol. 72, no., -9. Abadie, A. and Imbens, G.W Large Sample Properties of Matching Estimators for Average Treatment Effects, Econometrica, vol. 74, no., Abadie, A. and Imbens, G.W On the Failure of the Bootstrap for Matching Estimators, Econometrica, vol. 76, no. 6, Abadie, A. and Imbens, G.W A Martingale Representation for Matching Estimators, BER working paper, no Andreou, E. and Werker, B.J.M An Alternative Asymptotic Analysis of Residual- Based Statistics, mimeo. Angrist, J.D. and Kuersteiner, G.M Causal Effects of Monetary Shocks: Semiparametric Conditional Independence Tests with a Multinomial Propensity Score, mimeo. Bickel, P.J., Klaassen, C.A., Ritov, Y. and Wellner, J.A. 998 Efficient and Adaptive Estimation for Semiparametric Models, Springer, ew York. Billingsley, P. 995, Probability and Measure, third edition. Wiley, ew York. Ganssler, P. and Pfanzagl, J. 97 Convergence of Conditional Expectations, The Annals of Mathematical Statistics, vol. 42, no., Hirano, K., G. Imbens, and G. Ridder 2003 Efficient Estimation of Average Treatment Effects Using the Estimated Propensity Score, Econometrica, vol. 7, no. 4, Johnson,. and Kotz, S. 977 Urn Models and Their Applications, John Wiley & Sons, ew York. ewey, W.K. and McFadden, D. 994 Large sample estimation and hypothesis testing. In: Engle, R.F., McFadden, D. Eds., Handbook of Econometrics, vol. 4. Elsevier Science, Amsterdam. Lehmann, E.L. and Romano, J.P Testing Statistical Hypothesis. Springer, ew York. Rosenbaum, P. and Rubin, D.B. 983 The Central Role of the Propensity Score in Observational Studies for Causal Effects, Biometrika, vol. 70, 455. Rubin, D.B. 974 Estimating Causal Effects of Treatments in Randomized and on-randomized Studies, Journal of Educational Psychology, vol. 66, Rubin, D., and. Thomas 992a Characterizing the effect of matching using linear propensity score methods with normal distributions, Biometrika, vol. 79, Rubin, D., and. Thomas 992b Affinely Invariant Matching Methods with Ellipsoidal Distributions, Annals of Statistics, vol. 20, no. 2,

21 van der Vaart, A. 998, Asymptotic Statistics, Cambridge University Press, ew York. van der Vaart, A.W. and Wellner, J.A. 996, Weak Convergence and Empirical Processes, Springer-Verlag, ew York. Wooldridge, J.M Inverse Probability Weighted Estimation for General Missing Data Problems, Journal of Econometrics, vol. 4,

22 Table I Simulation Results = 5000, umber of simulations = 0000 Variances over simulations Coverage of 95% C.I. asymp. s.e. = τ θ τ θ, σ 2 θ τ θ τ θ, σ 2 θ τ θ, σ adj, 2 θ Averages over simulations σ 2 θ σ 2 θ σ adj, 2 θ

Matching on the Estimated Propensity Score

Matching on the Estimated Propensity Score atching on the Estimated Propensity Score Alberto Abadie Harvard University and BER Guido W. Imbens Harvard University and BER October 29 Abstract Propensity score matching estimators Rosenbaum and Rubin,

More information

Estimation of the Conditional Variance in Paired Experiments

Estimation of the Conditional Variance in Paired Experiments Estimation of the Conditional Variance in Paired Experiments Alberto Abadie & Guido W. Imbens Harvard University and BER June 008 Abstract In paired randomized experiments units are grouped in pairs, often

More information

NBER WORKING PAPER SERIES A NOTE ON ADAPTING PROPENSITY SCORE MATCHING AND SELECTION MODELS TO CHOICE BASED SAMPLES. James J. Heckman Petra E.

NBER WORKING PAPER SERIES A NOTE ON ADAPTING PROPENSITY SCORE MATCHING AND SELECTION MODELS TO CHOICE BASED SAMPLES. James J. Heckman Petra E. NBER WORKING PAPER SERIES A NOTE ON ADAPTING PROPENSITY SCORE MATCHING AND SELECTION MODELS TO CHOICE BASED SAMPLES James J. Heckman Petra E. Todd Working Paper 15179 http://www.nber.org/papers/w15179

More information

What s New in Econometrics. Lecture 1

What s New in Econometrics. Lecture 1 What s New in Econometrics Lecture 1 Estimation of Average Treatment Effects Under Unconfoundedness Guido Imbens NBER Summer Institute, 2007 Outline 1. Introduction 2. Potential Outcomes 3. Estimands and

More information

SUPPLEMENT TO MATCHING ON THE ESTIMATED PROPENSITY SCORE (Econometrica, Vol. 84, No. 2, March 2016, )

SUPPLEMENT TO MATCHING ON THE ESTIMATED PROPENSITY SCORE (Econometrica, Vol. 84, No. 2, March 2016, ) Econometrica Supplementary Material SUPPLEMET TO MATCHIG O THE ESTIMATED PROPESITY SCORE Econometrica, Vol. 84, o. 2, March 26, 78 87 BY ALBERTO ABADIE ADGUIDO W. IMBES THE FIRST PART OF THIS SUPPLEMET

More information

Flexible Estimation of Treatment Effect Parameters

Flexible Estimation of Treatment Effect Parameters Flexible Estimation of Treatment Effect Parameters Thomas MaCurdy a and Xiaohong Chen b and Han Hong c Introduction Many empirical studies of program evaluations are complicated by the presence of both

More information

A Martingale Representation for Matching Estimators

A Martingale Representation for Matching Estimators A Martingale Representation for Matching Estimators Alberto Abadie Harvard University and BER Guido W. Imbens Harvard University and BER September 20 Abstract Matching estimators Rubin, 973a, 977; Rosenbaum,

More information

On-Line Appendix. Matching on the Estimated Propensity Score (Abadie and Imbens, 2015)

On-Line Appendix. Matching on the Estimated Propensity Score (Abadie and Imbens, 2015) On-Line Aendix Matching on the Estimated Proensity Score Abadie and Imbens, 205 Alberto Abadie and Guido W. Imbens Current version: August 0, 205 The first art of this aendix contains additional roofs.

More information

Implementing Matching Estimators for. Average Treatment Effects in STATA

Implementing Matching Estimators for. Average Treatment Effects in STATA Implementing Matching Estimators for Average Treatment Effects in STATA Guido W. Imbens - Harvard University West Coast Stata Users Group meeting, Los Angeles October 26th, 2007 General Motivation Estimation

More information

Implementing Matching Estimators for. Average Treatment Effects in STATA. Guido W. Imbens - Harvard University Stata User Group Meeting, Boston

Implementing Matching Estimators for. Average Treatment Effects in STATA. Guido W. Imbens - Harvard University Stata User Group Meeting, Boston Implementing Matching Estimators for Average Treatment Effects in STATA Guido W. Imbens - Harvard University Stata User Group Meeting, Boston July 26th, 2006 General Motivation Estimation of average effect

More information

Selection on Observables: Propensity Score Matching.

Selection on Observables: Propensity Score Matching. Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017

More information

Covariate Balancing Propensity Score for General Treatment Regimes

Covariate Balancing Propensity Score for General Treatment Regimes Covariate Balancing Propensity Score for General Treatment Regimes Kosuke Imai Princeton University October 14, 2014 Talk at the Department of Psychiatry, Columbia University Joint work with Christian

More information

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A.

Fall 2017 STAT 532 Homework Peter Hoff. 1. Let P be a probability measure on a collection of sets A. 1. Let P be a probability measure on a collection of sets A. (a) For each n N, let H n be a set in A such that H n H n+1. Show that P (H n ) monotonically converges to P ( k=1 H k) as n. (b) For each n

More information

Combining multiple observational data sources to estimate causal eects

Combining multiple observational data sources to estimate causal eects Department of Statistics, North Carolina State University Combining multiple observational data sources to estimate causal eects Shu Yang* syang24@ncsuedu Joint work with Peng Ding UC Berkeley May 23,

More information

Robustness to Parametric Assumptions in Missing Data Models

Robustness to Parametric Assumptions in Missing Data Models Robustness to Parametric Assumptions in Missing Data Models Bryan Graham NYU Keisuke Hirano University of Arizona April 2011 Motivation Motivation We consider the classic missing data problem. In practice

More information

Working Paper No Maximum score type estimators

Working Paper No Maximum score type estimators Warsaw School of Economics Institute of Econometrics Department of Applied Econometrics Department of Applied Econometrics Working Papers Warsaw School of Economics Al. iepodleglosci 64 02-554 Warszawa,

More information

Matching Techniques. Technical Session VI. Manila, December Jed Friedman. Spanish Impact Evaluation. Fund. Region

Matching Techniques. Technical Session VI. Manila, December Jed Friedman. Spanish Impact Evaluation. Fund. Region Impact Evaluation Technical Session VI Matching Techniques Jed Friedman Manila, December 2008 Human Development Network East Asia and the Pacific Region Spanish Impact Evaluation Fund The case of random

More information

Harvard University. A Note on the Control Function Approach with an Instrumental Variable and a Binary Outcome. Eric Tchetgen Tchetgen

Harvard University. A Note on the Control Function Approach with an Instrumental Variable and a Binary Outcome. Eric Tchetgen Tchetgen Harvard University Harvard University Biostatistics Working Paper Series Year 2014 Paper 175 A Note on the Control Function Approach with an Instrumental Variable and a Binary Outcome Eric Tchetgen Tchetgen

More information

Clustering as a Design Problem

Clustering as a Design Problem Clustering as a Design Problem Alberto Abadie, Susan Athey, Guido Imbens, & Jeffrey Wooldridge Harvard-MIT Econometrics Seminar Cambridge, February 4, 2016 Adjusting standard errors for clustering is common

More information

Increasing the Power of Specification Tests. November 18, 2018

Increasing the Power of Specification Tests. November 18, 2018 Increasing the Power of Specification Tests T W J A. H U A MIT November 18, 2018 A. This paper shows how to increase the power of Hausman s (1978) specification test as well as the difference test in a

More information

The propensity score with continuous treatments

The propensity score with continuous treatments 7 The propensity score with continuous treatments Keisuke Hirano and Guido W. Imbens 1 7.1 Introduction Much of the work on propensity score analysis has focused on the case in which the treatment is binary.

More information

Large Sample Properties of Matching Estimators for Average Treatment Effects

Large Sample Properties of Matching Estimators for Average Treatment Effects Revision in progress. December 7, 4 Large Sample Properties of atching Estimators for Average Treatment Effects Alberto Abadie Harvard University and BER Guido Imbens UC Bereley and BER First version:

More information

Causal Inference Basics

Causal Inference Basics Causal Inference Basics Sam Lendle October 09, 2013 Observed data, question, counterfactuals Observed data: n i.i.d copies of baseline covariates W, treatment A {0, 1}, and outcome Y. O i = (W i, A i,

More information

Lecture 8: Information Theory and Statistics

Lecture 8: Information Theory and Statistics Lecture 8: Information Theory and Statistics Part II: Hypothesis Testing and I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw December 23, 2015 1 / 50 I-Hsiang

More information

BOOTSTRAP INFERENCE FOR PROPENSITY SCORE MATCHING

BOOTSTRAP INFERENCE FOR PROPENSITY SCORE MATCHING BOOTSTRAP IFERECE FOR PROPESITY SCORE ATCHIG KARU ADUSUILLI Abstract. Propensity score matching, where the propensity scores are estimated in a first step, is widely used for estimating treatment effects.

More information

Imbens/Wooldridge, Lecture Notes 1, Summer 07 1

Imbens/Wooldridge, Lecture Notes 1, Summer 07 1 Imbens/Wooldridge, Lecture Notes 1, Summer 07 1 What s New in Econometrics NBER, Summer 2007 Lecture 1, Monday, July 30th, 9.00-10.30am Estimation of Average Treatment Effects Under Unconfoundedness 1.

More information

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing Primal-dual Covariate Balance and Minimal Double Robustness via (Joint work with Daniel Percival) Department of Statistics, Stanford University JSM, August 9, 2015 Outline 1 2 3 1/18 Setting Rubin s causal

More information

arxiv: v1 [math.st] 28 Feb 2017

arxiv: v1 [math.st] 28 Feb 2017 Bridging Finite and Super Population Causal Inference arxiv:1702.08615v1 [math.st] 28 Feb 2017 Peng Ding, Xinran Li, and Luke W. Miratrix Abstract There are two general views in causal analysis of experimental

More information

A better way to bootstrap pairs

A better way to bootstrap pairs A better way to bootstrap pairs Emmanuel Flachaire GREQAM - Université de la Méditerranée CORE - Université Catholique de Louvain April 999 Abstract In this paper we are interested in heteroskedastic regression

More information

A Note on Adapting Propensity Score Matching and Selection Models to Choice Based Samples

A Note on Adapting Propensity Score Matching and Selection Models to Choice Based Samples DISCUSSION PAPER SERIES IZA DP No. 4304 A Note on Adapting Propensity Score Matching and Selection Models to Choice Based Samples James J. Heckman Petra E. Todd July 2009 Forschungsinstitut zur Zukunft

More information

Large Sample Properties of Matching Estimators for Average Treatment Effects

Large Sample Properties of Matching Estimators for Average Treatment Effects Large Sample Properties of atching Estimators for Average Treatment Effects Alberto Abadie Harvard University and BER Guido Imbens UC Bereley and BER First version: July This version: January 4 A previous

More information

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i, A Course in Applied Econometrics Lecture 18: Missing Data Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. When Can Missing Data be Ignored? 2. Inverse Probability Weighting 3. Imputation 4. Heckman-Type

More information

Testing Overidentifying Restrictions with Many Instruments and Heteroskedasticity

Testing Overidentifying Restrictions with Many Instruments and Heteroskedasticity Testing Overidentifying Restrictions with Many Instruments and Heteroskedasticity John C. Chao, Department of Economics, University of Maryland, chao@econ.umd.edu. Jerry A. Hausman, Department of Economics,

More information

TECHNICAL WORKING PAPER SERIES ON THE FAILURE OF THE BOOTSTRAP FOR MATCHING ESTIMATORS. Alberto Abadie Guido W. Imbens

TECHNICAL WORKING PAPER SERIES ON THE FAILURE OF THE BOOTSTRAP FOR MATCHING ESTIMATORS. Alberto Abadie Guido W. Imbens TECHNICAL WORKING PAPER SERIES ON THE FAILURE OF THE BOOTSTRAP FOR MATCHING ESTIMATORS Alberto Abadie Guido W. Imbens Technical Working Paper 325 http://www.nber.org/papers/t0325 NATIONAL BUREAU OF ECONOMIC

More information

Identi cation of Positive Treatment E ects in. Randomized Experiments with Non-Compliance

Identi cation of Positive Treatment E ects in. Randomized Experiments with Non-Compliance Identi cation of Positive Treatment E ects in Randomized Experiments with Non-Compliance Aleksey Tetenov y February 18, 2012 Abstract I derive sharp nonparametric lower bounds on some parameters of the

More information

Learning 2 -Continuous Regression Functionals via Regularized Riesz Representers

Learning 2 -Continuous Regression Functionals via Regularized Riesz Representers Learning 2 -Continuous Regression Functionals via Regularized Riesz Representers Victor Chernozhukov MIT Whitney K. Newey MIT Rahul Singh MIT September 10, 2018 Abstract Many objects of interest can be

More information

Parametric Identification of Multiplicative Exponential Heteroskedasticity

Parametric Identification of Multiplicative Exponential Heteroskedasticity Parametric Identification of Multiplicative Exponential Heteroskedasticity Alyssa Carlson Department of Economics, Michigan State University East Lansing, MI 48824-1038, United States Dated: October 5,

More information

Program Evaluation with High-Dimensional Data

Program Evaluation with High-Dimensional Data Program Evaluation with High-Dimensional Data Alexandre Belloni Duke Victor Chernozhukov MIT Iván Fernández-Val BU Christian Hansen Booth ESWC 215 August 17, 215 Introduction Goal is to perform inference

More information

Notes, March 4, 2013, R. Dudley Maximum likelihood estimation: actual or supposed

Notes, March 4, 2013, R. Dudley Maximum likelihood estimation: actual or supposed 18.466 Notes, March 4, 2013, R. Dudley Maximum likelihood estimation: actual or supposed 1. MLEs in exponential families Let f(x,θ) for x X and θ Θ be a likelihood function, that is, for present purposes,

More information

arxiv: v1 [stat.me] 15 May 2011

arxiv: v1 [stat.me] 15 May 2011 Working Paper Propensity Score Analysis with Matching Weights Liang Li, Ph.D. arxiv:1105.2917v1 [stat.me] 15 May 2011 Associate Staff of Biostatistics Department of Quantitative Health Sciences, Cleveland

More information

Estimating the Marginal Odds Ratio in Observational Studies

Estimating the Marginal Odds Ratio in Observational Studies Estimating the Marginal Odds Ratio in Observational Studies Travis Loux Christiana Drake Department of Statistics University of California, Davis June 20, 2011 Outline The Counterfactual Model Odds Ratios

More information

Statement: With my signature I confirm that the solutions are the product of my own work. Name: Signature:.

Statement: With my signature I confirm that the solutions are the product of my own work. Name: Signature:. MATHEMATICAL STATISTICS Homework assignment Instructions Please turn in the homework with this cover page. You do not need to edit the solutions. Just make sure the handwriting is legible. You may discuss

More information

Summary and discussion of The central role of the propensity score in observational studies for causal effects

Summary and discussion of The central role of the propensity score in observational studies for causal effects Summary and discussion of The central role of the propensity score in observational studies for causal effects Statistics Journal Club, 36-825 Jessica Chemali and Michael Vespe 1 Summary 1.1 Background

More information

Chapter 3: Maximum Likelihood Theory

Chapter 3: Maximum Likelihood Theory Chapter 3: Maximum Likelihood Theory Florian Pelgrin HEC September-December, 2010 Florian Pelgrin (HEC) Maximum Likelihood Theory September-December, 2010 1 / 40 1 Introduction Example 2 Maximum likelihood

More information

Section 10: Inverse propensity score weighting (IPSW)

Section 10: Inverse propensity score weighting (IPSW) Section 10: Inverse propensity score weighting (IPSW) Fall 2014 1/23 Inverse Propensity Score Weighting (IPSW) Until now we discussed matching on the P-score, a different approach is to re-weight the observations

More information

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) 1 EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix) Taisuke Otsu London School of Economics Summer 2018 A.1. Summation operator (Wooldridge, App. A.1) 2 3 Summation operator For

More information

Goodness-of-Fit Tests for Time Series Models: A Score-Marked Empirical Process Approach

Goodness-of-Fit Tests for Time Series Models: A Score-Marked Empirical Process Approach Goodness-of-Fit Tests for Time Series Models: A Score-Marked Empirical Process Approach By Shiqing Ling Department of Mathematics Hong Kong University of Science and Technology Let {y t : t = 0, ±1, ±2,

More information

ECON 3150/4150, Spring term Lecture 6

ECON 3150/4150, Spring term Lecture 6 ECON 3150/4150, Spring term 2013. Lecture 6 Review of theoretical statistics for econometric modelling (II) Ragnar Nymoen University of Oslo 31 January 2013 1 / 25 References to Lecture 3 and 6 Lecture

More information

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Shigeyuki Hamori 1 Kaiji Motegi 1 Zheng Zhang 2 1 Kobe University 2 Renmin University of China Institute of Statistics

More information

An Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data

An Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data An Efficient Estimation Method for Longitudinal Surveys with Monotone Missing Data Jae-Kwang Kim 1 Iowa State University June 28, 2012 1 Joint work with Dr. Ming Zhou (when he was a PhD student at ISU)

More information

Econometric Analysis of Cross Section and Panel Data

Econometric Analysis of Cross Section and Panel Data Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND

More information

A Note on Bootstraps and Robustness. Tony Lancaster, Brown University, December 2003.

A Note on Bootstraps and Robustness. Tony Lancaster, Brown University, December 2003. A Note on Bootstraps and Robustness Tony Lancaster, Brown University, December 2003. In this note we consider several versions of the bootstrap and argue that it is helpful in explaining and thinking about

More information

The properties of L p -GMM estimators

The properties of L p -GMM estimators The properties of L p -GMM estimators Robert de Jong and Chirok Han Michigan State University February 2000 Abstract This paper considers Generalized Method of Moment-type estimators for which a criterion

More information

Additive Isotonic Regression

Additive Isotonic Regression Additive Isotonic Regression Enno Mammen and Kyusang Yu 11. July 2006 INTRODUCTION: We have i.i.d. random vectors (Y 1, X 1 ),..., (Y n, X n ) with X i = (X1 i,..., X d i ) and we consider the additive

More information

Microeconometrics. C. Hsiao (2014), Analysis of Panel Data, 3rd edition. Cambridge, University Press.

Microeconometrics. C. Hsiao (2014), Analysis of Panel Data, 3rd edition. Cambridge, University Press. Cheng Hsiao Microeconometrics Required Text: C. Hsiao (2014), Analysis of Panel Data, 3rd edition. Cambridge, University Press. A.C. Cameron and P.K. Trivedi (2005), Microeconometrics, Cambridge University

More information

Bootstrapping Sensitivity Analysis

Bootstrapping Sensitivity Analysis Bootstrapping Sensitivity Analysis Qingyuan Zhao Department of Statistics, The Wharton School University of Pennsylvania May 23, 2018 @ ACIC Based on: Qingyuan Zhao, Dylan S. Small, and Bhaswar B. Bhattacharya.

More information

A note on L convergence of Neumann series approximation in missing data problems

A note on L convergence of Neumann series approximation in missing data problems A note on L convergence of Neumann series approximation in missing data problems Hua Yun Chen Division of Epidemiology & Biostatistics School of Public Health University of Illinois at Chicago 1603 West

More information

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Joe Schafer Office of the Associate Director for Research and Methodology U.S. Census

More information

Notes on causal effects

Notes on causal effects Notes on causal effects Johan A. Elkink March 4, 2013 1 Decomposing bias terms Deriving Eq. 2.12 in Morgan and Winship (2007: 46): { Y1i if T Potential outcome = i = 1 Y 0i if T i = 0 Using shortcut E

More information

Quantile methods. Class Notes Manuel Arellano December 1, Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be

Quantile methods. Class Notes Manuel Arellano December 1, Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be Quantile methods Class Notes Manuel Arellano December 1, 2009 1 Unconditional quantiles Let F (r) =Pr(Y r). Forτ (0, 1), theτth population quantile of Y is defined to be Q τ (Y ) q τ F 1 (τ) =inf{r : F

More information

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling

Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Fractional Hot Deck Imputation for Robust Inference Under Item Nonresponse in Survey Sampling Jae-Kwang Kim 1 Iowa State University June 26, 2013 1 Joint work with Shu Yang Introduction 1 Introduction

More information

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification Todd MacKenzie, PhD Collaborators A. James O Malley Tor Tosteson Therese Stukel 2 Overview 1. Instrumental variable

More information

Lecture 7 Introduction to Statistical Decision Theory

Lecture 7 Introduction to Statistical Decision Theory Lecture 7 Introduction to Statistical Decision Theory I-Hsiang Wang Department of Electrical Engineering National Taiwan University ihwang@ntu.edu.tw December 20, 2016 1 / 55 I-Hsiang Wang IT Lecture 7

More information

The Value of Knowing the Propensity Score for Estimating Average Treatment Effects

The Value of Knowing the Propensity Score for Estimating Average Treatment Effects DISCUSSION PAPER SERIES IZA DP No. 9989 The Value of Knowing the Propensity Score for Estimating Average Treatment Effects Christoph Rothe June 2016 Forschungsinstitut zur Zukunft der Arbeit Institute

More information

Imbens/Wooldridge, IRP Lecture Notes 2, August 08 1

Imbens/Wooldridge, IRP Lecture Notes 2, August 08 1 Imbens/Wooldridge, IRP Lecture Notes 2, August 08 IRP Lectures Madison, WI, August 2008 Lecture 2, Monday, Aug 4th, 0.00-.00am Estimation of Average Treatment Effects Under Unconfoundedness, Part II. Introduction

More information

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai Weighting Methods Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Weighting Methods Stat186/Gov2002 Fall 2018 1 / 13 Motivation Matching methods for improving

More information

Calibration Estimation for Semiparametric Copula Models under Missing Data

Calibration Estimation for Semiparametric Copula Models under Missing Data Calibration Estimation for Semiparametric Copula Models under Missing Data Shigeyuki Hamori 1 Kaiji Motegi 1 Zheng Zhang 2 1 Kobe University 2 Renmin University of China Economics and Economic Growth Centre

More information

ECO Class 6 Nonparametric Econometrics

ECO Class 6 Nonparametric Econometrics ECO 523 - Class 6 Nonparametric Econometrics Carolina Caetano Contents 1 Nonparametric instrumental variable regression 1 2 Nonparametric Estimation of Average Treatment Effects 3 2.1 Asymptotic results................................

More information

Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design

Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design 1 / 32 Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design Changbao Wu Department of Statistics and Actuarial Science University of Waterloo (Joint work with Min Chen and Mary

More information

Reconstruction of individual patient data for meta analysis via Bayesian approach

Reconstruction of individual patient data for meta analysis via Bayesian approach Reconstruction of individual patient data for meta analysis via Bayesian approach Yusuke Yamaguchi, Wataru Sakamoto and Shingo Shirahata Graduate School of Engineering Science, Osaka University Masashi

More information

Propensity Score Methods for Causal Inference

Propensity Score Methods for Causal Inference John Pura BIOS790 October 2, 2015 Causal inference Philosophical problem, statistical solution Important in various disciplines (e.g. Koch s postulates, Bradford Hill criteria, Granger causality) Good

More information

A Simple Way to Calculate Confidence Intervals for Partially Identified Parameters. By Tiemen Woutersen. Draft, September

A Simple Way to Calculate Confidence Intervals for Partially Identified Parameters. By Tiemen Woutersen. Draft, September A Simple Way to Calculate Confidence Intervals for Partially Identified Parameters By Tiemen Woutersen Draft, September 006 Abstract. This note proposes a new way to calculate confidence intervals of partially

More information

Causal Inference in Observational Studies with Non-Binary Treatments. David A. van Dyk

Causal Inference in Observational Studies with Non-Binary Treatments. David A. van Dyk Causal Inference in Observational Studies with Non-Binary reatments Statistics Section, Imperial College London Joint work with Shandong Zhao and Kosuke Imai Cass Business School, October 2013 Outline

More information

What s New in Econometrics. Lecture 13

What s New in Econometrics. Lecture 13 What s New in Econometrics Lecture 13 Weak Instruments and Many Instruments Guido Imbens NBER Summer Institute, 2007 Outline 1. Introduction 2. Motivation 3. Weak Instruments 4. Many Weak) Instruments

More information

Asymptotic Statistics-III. Changliang Zou

Asymptotic Statistics-III. Changliang Zou Asymptotic Statistics-III Changliang Zou The multivariate central limit theorem Theorem (Multivariate CLT for iid case) Let X i be iid random p-vectors with mean µ and and covariance matrix Σ. Then n (

More information

Supplement to Quantile-Based Nonparametric Inference for First-Price Auctions

Supplement to Quantile-Based Nonparametric Inference for First-Price Auctions Supplement to Quantile-Based Nonparametric Inference for First-Price Auctions Vadim Marmer University of British Columbia Artyom Shneyerov CIRANO, CIREQ, and Concordia University August 30, 2010 Abstract

More information

Econ 273B Advanced Econometrics Spring

Econ 273B Advanced Econometrics Spring Econ 273B Advanced Econometrics Spring 2005-6 Aprajit Mahajan email: amahajan@stanford.edu Landau 233 OH: Th 3-5 or by appt. This is a graduate level course in econometrics. The rst part of the course

More information

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score Causal Inference with General Treatment Regimes: Generalizing the Propensity Score David van Dyk Department of Statistics, University of California, Irvine vandyk@stat.harvard.edu Joint work with Kosuke

More information

SOME CONVERSE LIMIT THEOREMS FOR EXCHANGEABLE BOOTSTRAPS

SOME CONVERSE LIMIT THEOREMS FOR EXCHANGEABLE BOOTSTRAPS SOME CONVERSE LIMIT THEOREMS OR EXCHANGEABLE BOOTSTRAPS Jon A. Wellner University of Washington The bootstrap Glivenko-Cantelli and bootstrap Donsker theorems of Giné and Zinn (990) contain both necessary

More information

A Bootstrap Test for Conditional Symmetry

A Bootstrap Test for Conditional Symmetry ANNALS OF ECONOMICS AND FINANCE 6, 51 61 005) A Bootstrap Test for Conditional Symmetry Liangjun Su Guanghua School of Management, Peking University E-mail: lsu@gsm.pku.edu.cn and Sainan Jin Guanghua School

More information

Large Sample Properties of Estimators in the Classical Linear Regression Model

Large Sample Properties of Estimators in the Classical Linear Regression Model Large Sample Properties of Estimators in the Classical Linear Regression Model 7 October 004 A. Statement of the classical linear regression model The classical linear regression model can be written in

More information

Independent and conditionally independent counterfactual distributions

Independent and conditionally independent counterfactual distributions Independent and conditionally independent counterfactual distributions Marcin Wolski European Investment Bank M.Wolski@eib.org Society for Nonlinear Dynamics and Econometrics Tokyo March 19, 2018 Views

More information

Brief Review on Estimation Theory

Brief Review on Estimation Theory Brief Review on Estimation Theory K. Abed-Meraim ENST PARIS, Signal and Image Processing Dept. abed@tsi.enst.fr This presentation is essentially based on the course BASTA by E. Moulines Brief review on

More information

Spring 2012 Math 541B Exam 1

Spring 2012 Math 541B Exam 1 Spring 2012 Math 541B Exam 1 1. A sample of size n is drawn without replacement from an urn containing N balls, m of which are red and N m are black; the balls are otherwise indistinguishable. Let X denote

More information

Invariant HPD credible sets and MAP estimators

Invariant HPD credible sets and MAP estimators Bayesian Analysis (007), Number 4, pp. 681 69 Invariant HPD credible sets and MAP estimators Pierre Druilhet and Jean-Michel Marin Abstract. MAP estimators and HPD credible sets are often criticized in

More information

Statistics 3858 : Maximum Likelihood Estimators

Statistics 3858 : Maximum Likelihood Estimators Statistics 3858 : Maximum Likelihood Estimators 1 Method of Maximum Likelihood In this method we construct the so called likelihood function, that is L(θ) = L(θ; X 1, X 2,..., X n ) = f n (X 1, X 2,...,

More information

Paul Schrimpf. January 23, UBC Economics 326. Statistics and Inference. Paul Schrimpf. Properties of estimators. Finite sample inference

Paul Schrimpf. January 23, UBC Economics 326. Statistics and Inference. Paul Schrimpf. Properties of estimators. Finite sample inference UBC Economics 326 January 23, 2018 1 2 3 4 Wooldridge (2013) appendix C Stock and Watson (2009) chapter 3 Angrist and Pischke (2014) chapter 1 appendix Diez, Barr, and Cetinkaya-Rundel (2012) chapters

More information

Consistency and Asymptotic Normality for Equilibrium Models with Partially Observed Outcome Variables

Consistency and Asymptotic Normality for Equilibrium Models with Partially Observed Outcome Variables Consistency and Asymptotic Normality for Equilibrium Models with Partially Observed Outcome Variables Nathan H. Miller Georgetown University Matthew Osborne University of Toronto November 25, 2013 Abstract

More information

Irr. Statistical Methods in Experimental Physics. 2nd Edition. Frederick James. World Scientific. CERN, Switzerland

Irr. Statistical Methods in Experimental Physics. 2nd Edition. Frederick James. World Scientific. CERN, Switzerland Frederick James CERN, Switzerland Statistical Methods in Experimental Physics 2nd Edition r i Irr 1- r ri Ibn World Scientific NEW JERSEY LONDON SINGAPORE BEIJING SHANGHAI HONG KONG TAIPEI CHENNAI CONTENTS

More information

Some Theoretical Properties and Parameter Estimation for the Two-Sided Length Biased Inverse Gaussian Distribution

Some Theoretical Properties and Parameter Estimation for the Two-Sided Length Biased Inverse Gaussian Distribution Journal of Probability and Statistical Science 14(), 11-4, Aug 016 Some Theoretical Properties and Parameter Estimation for the Two-Sided Length Biased Inverse Gaussian Distribution Teerawat Simmachan

More information

Semiparametric Efficiency in Irregularly Identified Models

Semiparametric Efficiency in Irregularly Identified Models Semiparametric Efficiency in Irregularly Identified Models Shakeeb Khan and Denis Nekipelov July 008 ABSTRACT. This paper considers efficient estimation of structural parameters in a class of semiparametric

More information

Testing Statistical Hypotheses

Testing Statistical Hypotheses E.L. Lehmann Joseph P. Romano Testing Statistical Hypotheses Third Edition 4y Springer Preface vii I Small-Sample Theory 1 1 The General Decision Problem 3 1.1 Statistical Inference and Statistical Decisions

More information

Propensity Score Matching

Propensity Score Matching Methods James H. Steiger Department of Psychology and Human Development Vanderbilt University Regression Modeling, 2009 Methods 1 Introduction 2 3 4 Introduction Why Match? 5 Definition Methods and In

More information

A Course in Applied Econometrics. Lecture 5. Instrumental Variables with Treatment Effect. Heterogeneity: Local Average Treatment Effects.

A Course in Applied Econometrics. Lecture 5. Instrumental Variables with Treatment Effect. Heterogeneity: Local Average Treatment Effects. A Course in Applied Econometrics Lecture 5 Outline. Introduction 2. Basics Instrumental Variables with Treatment Effect Heterogeneity: Local Average Treatment Effects 3. Local Average Treatment Effects

More information

Economics 583: Econometric Theory I A Primer on Asymptotics

Economics 583: Econometric Theory I A Primer on Asymptotics Economics 583: Econometric Theory I A Primer on Asymptotics Eric Zivot January 14, 2013 The two main concepts in asymptotic theory that we will use are Consistency Asymptotic Normality Intuition consistency:

More information

ESTIMATION OF NONLINEAR BERKSON-TYPE MEASUREMENT ERROR MODELS

ESTIMATION OF NONLINEAR BERKSON-TYPE MEASUREMENT ERROR MODELS Statistica Sinica 13(2003), 1201-1210 ESTIMATION OF NONLINEAR BERKSON-TYPE MEASUREMENT ERROR MODELS Liqun Wang University of Manitoba Abstract: This paper studies a minimum distance moment estimator for

More information

Lecture 8. Roy Model, IV with essential heterogeneity, MTE

Lecture 8. Roy Model, IV with essential heterogeneity, MTE Lecture 8. Roy Model, IV with essential heterogeneity, MTE Economics 2123 George Washington University Instructor: Prof. Ben Williams Heterogeneity When we talk about heterogeneity, usually we mean heterogeneity

More information

High Dimensional Propensity Score Estimation via Covariate Balancing

High Dimensional Propensity Score Estimation via Covariate Balancing High Dimensional Propensity Score Estimation via Covariate Balancing Kosuke Imai Princeton University Talk at Columbia University May 13, 2017 Joint work with Yang Ning and Sida Peng Kosuke Imai (Princeton)

More information

Inference For High Dimensional M-estimates. Fixed Design Results

Inference For High Dimensional M-estimates. Fixed Design Results : Fixed Design Results Lihua Lei Advisors: Peter J. Bickel, Michael I. Jordan joint work with Peter J. Bickel and Noureddine El Karoui Dec. 8, 2016 1/57 Table of Contents 1 Background 2 Main Results and

More information

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016 An Introduction to Causal Mediation Analysis Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016 1 Causality In the applications of statistics, many central questions

More information