Discussion of Identifiability and Estimation of Causal Effects in Randomized. Trials with Noncompliance and Completely Non-ignorable Missing Data

Size: px
Start display at page:

Download "Discussion of Identifiability and Estimation of Causal Effects in Randomized. Trials with Noncompliance and Completely Non-ignorable Missing Data"

Transcription

1 Biometrics 000, DOI: Discussion of Identifiability and Estimation of Causal Effects in Randomized Trials with Noncompliance and Completely Non-ignorable Missing Data Dylan S. Small Department of Statistics, The Wharton School, University of Pennsylvania, Philadelphia, PA 19104, U.S.A. and Jing Cheng Division of Biostatistics, University of Florida College of Medicine, Gainesville, FL 32610, U.S.A. 1

2 Discussion of CGZ Paper on Randomized Trials with Noncompliance and CN Missing Data 1 1. Introduction We congratulate Chen, Geng and Zhou (CGZ) on an important contribution to the analysis of randomized trials with noncompliance and missing data, two common complications in randomized trials. Most previous methods for analyzing trials in which noncompliance and missing data occur together have either assumed (i) the outcomes are missing at random or (ii) latent ignorability. Missing at random or latent ignorability assume that missingness is not related to the outcome within certain strata of random assignment, compliance and treatment received. For situations in which missingness is related to outcomes within these strata, CGZ develop novel methods of identifying causal effects. To emphasize the importance of CGZ s methods and illustrate several points, we consider a trial in which missingness might be related to outcome. We use the same notation as CGZ throughout. Ferro et al. (1996) conducted a trial of methods for teaching breast selfexamination (BSE) for breast cancer screening in which women were randomly assigned to either a standard treatment of receiving mailed information about BSE (Z = 0) or an enhanced treatment of attendance in a self-exam course in addition to receiving the mailed information (Z = 1). A woman was considered to have received the treatment (D = 1) if she attended the self-exam course. Of the women assigned to attend the self-exam course, only 55% of the women actually attended the course. The primary outcome for the study was whether or not a woman practiced BSE one year after the beginning of the study, where the outcome was obtained from a mailed self-administered questionnaire. Only 65% of the women returned the questionnaire (R = 1). The data, taken from Mealli et al. (2004) s Table 1, are shown in our Table 1. There are structural zeroes for Z = 0, D = 1 because women assigned to the standard treatment could not attend the self-exam course. Missingness might depend on the outcome because women who are practicing BSE might feel more commitment to returning a questionnaire for a study on BSE. Missingness might also

3 2 Biometrics, depend on compliance class and random assignment. For example, if lack of cooperativeness in returning the questionnaire is associated with lack of cooperativeness in following assigned protocol, we would expect never takers to have more missingness than compliers. [Table 1 about here.] CGZ s model of Section 3.1 handles the situation in which missingness depends on the outcome, but, conditional on the outcome, missingness does not depend on compliance class and random assignment. For situations in which missingness might depend on compliance class and/or random assignment conditional on the outcome, CGZ introduce additional identification strategies in Section 3.2. These strategies require covariates that predict outcomes but not missingness in a certain way. Such covariates are not always available. In the remainder of this contribution, we discuss additional analytical methods for situations in which missingness might depend not only on outcomes but also on compliance class and/or random assignment. We focus on the following topics: (1) we formulate a class of models for missingness that contains CGZ s model of Section 3.1; (2) we discuss how the goodness of fit of CGZ s models can be tested; (3) we discuss methods of sensitivity analysis for when a researcher is not completely confident in a model s assumptions. 2. Models for Missingness In this section, we discuss data without covariates and formulate a class of models for the observable data (Z i, D i, Y i, R i), where Y i = Y i if R i = 1 and Y i =? if R i = 0; CGZ s model in their Section 3.1 is a member of this class of models. We shall write the distribution of (Z i, D i, Y i, R i ) in terms of the partially unobservable (Z i, U i, Y i, R i ). To do this, we define the partially unobserved compliance class U i to be consistent with the observed (D i, Z i ) if P(D(Z i ) = D i U = U i ) > 0. Similarly, we define the partially unobserved Y i to be consistent with the observed (Y i, R i ) if P(Y = Y i R = R i, Y = Y i ) > 0. We can express

4 Discussion of CGZ Paper on Randomized Trials with Noncompliance and CN Missing Data 3 the distribution of (Z i, D i, Y i, R i) in terms of the distribution of (Z i, U i, Y i, R i ) as follows: P(Z i, D i, Y i, R i) = P(Z i, U i, Y i, R i ). (1) U i : U i consistent with (D i, Z i ) Y i : Y i consistent with (Yi, i) We now consider a class of models for the summands (Z i, U i, Y i, R i ) in (1). We decompose the probability distribution of (Z i, U i, Y i, R i ) as P(Z i, U i, Y i, R i ) = P(Z i, U i, Y i )P(R i Z i, U i, Y i ). For P(Z i, U i, Y i ), under CGZ s Assumptions 1-4, we have the following, where I( ) is the indicator function: P(Z i, U i, Y i ) = ξ Z i (1 ξ) 1 Z i ω I(U i=n) n θ Y ii(u i =a) 10a ω I(U i=a) a [1 ω a ω n ] I(U i=c) (1 θ 10a ) (1 Y i)i(u i =a) θ Y ii(u i =n) 11n (1 θ 11n ) (1 Y i)i(u i =n) θ Y ii(u i =c)(1 Z i ) 10c (1 θ 10c ) (1 Y i)i(u i =c)(1 Z i ) θ Y ii(u i =c)z i 11c (1 θ 11c ) (1 Y i)i(u i =c)z i. (2) For P(R i Z i, U i, Y i ), under the monotonicity assumption (Assumption 3 in CGZ), the saturated model can be written as logit[p(r i = 1 Z i, U i, Y i )] = α 0 + α 1 I(Y i = 1) + α 2 I(U i = a) + α 3 I(U i = n) + α 4 I(Z i = 1, U i = a) + α 5 I(Z i = 1, U i = n) + α 6 I(Z i = 1, U i = c). (3) Under model (2)-(3) for the partially unobservable data (Z i, U i, Y i, R i ), the model (1) for the observable data (Z i, D i, Y i, R i) contains 14 free parameters. However, the contingency table of the the observable data (Z i, D i, Y i, R i) (Table 3 in CGZ) contains 12 cells, meaning at most 11 free parameters can be identified. Thus, in order to identify all the parameters in model (2)-(3), we must impose at least three constraints. Table 2 presents nine models that impose constraints on the model for missingness (3) that enable identification. Note that as in CGZ s Theorem 1, there are regularity conditions needed beyond one of the models in

5 4 Biometrics, Table 2 holding to ensure identification. Also, note that if our goal is only to identify certain parameters, e.g., the intention to treat effect, and not the entire distribution of (Z, U, Y, R), less stringent conditions may be needed; see Imai (2007) for discussion of this. [Table 2 about here.] The first five models we consider do not allow the outcome to affect missingness within strata of (Z, U). 1. Missing completely at random (MCAR): α 1 = α 2 = α 3 = α 4 = α 5 = α 6 = 0. In this model, missingness does not depend on outcome, random assignment or compliance class. 2. Missing at random (MAR): α 1 = 0, α 2 + α 4 = α 6, α 3 = 0. In this model, missingness is independent of the value of Y given treatment assignment Z and treatment received D. 3. Latent ignorability and response exclusion restriction for never takers and always takers (LI-RER-NT&AT): α 1 = α 4 = α 5 = 0. This is the model formulated by Frangakis and Rubin (1999) and discussed by CGZ. Note that a subject s response behavior is said to satisfy the response exclusion restriction if the missingness of the subject s outcome would be the same irrespective of whether the subject was assigned to the treatment or control. 4. Latent ignorability and response exclusion restriction for compliers and always takers (LI-RER-C&AT): α 1 = α 4 = α 6 = 0. This alternative to model 3 (LI-RER-NT&AT) was formulated by Mealli et al. (2004) in a setting in which there were no always takers. 5. Latent ignorability and response exclusion restriction for all compliance classes (LI-RER- All): α 1 = α 4 = α 5 = α 6 = 0. This model combines the assumptions of models 3 (LI-RER-NT&AT) and 4 (LI-RER-C&AT). The remaining models we consider allow for outcome to affect missingness within strata of (Z, U).

6 Discussion of CGZ Paper on Randomized Trials with Noncompliance and CN Missing Data 5 6. Missingness affected only by outcome the completely non-ignorable without covariates (CN-WOCOV) model: α 2 = α 3 = α 4 = α 5 = α 6 = 0. This is the model developed by CGZ in their Section 3.1. CGZ s Assumption 6 implies P(R = 1 Z, Y, U) = P(R = 1 Y ) which imposes the constraints α 2 = α 3 = α 4 = α 5 = α 6 = 0 on the model (3) for missingness. 7. Missingness affected only by outcome and treatment received (M-YD): α 2 = α 6, α 3 = α 4 = α 5 = 0. This model implies the following for d = 0, 1, y = 0, 1: P(R i = 1 Y i = y, D i = d, Z i = 1) = P(R i = 1 Y i = y, D i = d, Z i = 0). (4) Imai (2007) formulates model (4) and shows how it enables identification of the complier average causal effect (CACE). Imai s model contains the CN-WOCOV model, but also allows for an effect of receiving the treatment on missingness. In the BSE study, Imai s model would be a useful extension of CN-WOCOV if attending the course on BSE creates a further commitment to returning the study questionnaire beyond that created just by practicing BSE. 8. Missingness affected only by outcome and random assignment (M-YZ): α 2 = α 3 = 0, α 4 = α 5 = α 6. This model also contains the CN-WOCOV model. It differs from Imai s model in that it allows for an effect of random assignment but not of treatment received once random assignment is controlled for. 9. Response exclusion restriction for all compliance classes (RER-All): α 4 = α 5 = α 6 = 0. This is another model that contains the CN-WOCOV model. It allows for the outcome and the compliance class to affect missingness, but not the random assignment.

7 6 Biometrics, Testing CGZ s Models 3.1 Testing the CN-WOCOV Model CGZ s CN-WOCOV model uses nine parameters to model the contingency table of observable data (Table 3 in CGZ) whereas a saturated model has 11 parameters. Consequently the goodness of fit of the CN-WOCOV model can be tested. For the flu shot data in CGZ s Table 3, a likelihood ratio test of the CN-WOCOV model versus its saturated counterpart was conducted (p < 0.01 on 2 d.f.) with a bootstrap-based test giving similar results. Thus, there is strong evidence that the CN-WOCOV model does not hold for the flu shot data. For the BSE study, a goodness of fit test does not reject the CN-WOCOV model: a likelihood ratio test of the CN-WOCOV model versus its saturated counterpart gives a p-value of 0.28 on 1 d.f. A goodness of fit test is useful for detecting certain settings in which the CN-WOCOV model does not hold. However, it is important to realize that the goodness of fit test is not consistent for some alternatives, i.e., there are members of the model (2)-(3) that are not part of the CN-WOCOV model, a submodel of (2)-(3), for which the power of the goodness of fit test does not converge to one asymptotically. The reason is as follows. Let θ denote the 14 parameters in model (2)-(3). We can divide the parameter space of θ into equivalence classes of the θ that give the same distribution for the observable data (Z i, D i, Y i, R i). Each of these equivalence classes is a three-dimensional manifold. An equivalence class contains at most one θ that is in the CN-WOCOV model. If the true θ is not in the CN-WOCOV model but is in an equivalence class that contains a point in the CN-WOCOV model, then the goodness of fit test is not consistent (because the true θ has the same distribution for the observed data as a θ in the CN-WOCOV model). We illustrate this in the BSE study. There are no always takers for the BSE study so ω a, θ 10a, α 2 and α 4 are not considered in (2)-(3) and the CN-WOCOV model is the submodel of (2)-(3) that constrains α 3 = α 5 = α 6 = 0. The MLE

8 Discussion of CGZ Paper on Randomized Trials with Noncompliance and CN Missing Data 7 of the CN-WOCOV model is ˆθ = (ˆξ = 0.501, ˆω n = 0.447, ˆθ 11n = 0.191, ˆθ 11c = 0.717, ˆθ 10c = 0.839, ˆα 0 = 0.913, ˆα 1 = 6.555, ˆα 3 = 0, ˆα 5 = 0, ˆα 6 = 0). There is a two-dimensional manifold of θ that give the same distribution for the observable data (Z i, D i, Y i, R i) as ˆθ. If we constrain α 6 = 0, there is a one-dimensional manifold of θ that give the same distribution for the observable data as ˆθ. If the true θ is in this manifold but not equal to ˆθ, the goodness of fit test is not consistent. The coordinates of (α 3, α 5 ) for the θ in this manifold that satisfy 2 < α 3 < 2 are plotted in Figure 1. [Figure 1 about here.] 3.2 Testing CGZ s Models That Use Covariates in Identification In CGZ s Section 3.2, they formulate two models for identifying the CACE that use covariates that are assumed to affect the outcome within certain strata of (Z, U) but are assumed not to affect missingness within these strata. The approach for testing the CN-WOCOV model described above can be adapted to CGZ s models that use covariates. In particular, if X is binary, then CGZ s model based on Assumption 7 has 18 parameters and CGZ s model based on Assumption 8 has 22 parameters, while the saturated model for the observable data has 23 parameters. Thus, the goodness of fit of both of CGZ s models can be tested. 4. Sensitivity Analysis For many studies in which an investigator thinks that one of CGZ s models is reasonable, the investigator will not be completely confident that the model holds. For such studies, a sensitivity analysis should be done. A sensitivity analysis examines the impact of substantively plausible violations on the inferences that can be made (Rosenbaum, 2002, Ch. 4). The output of a sensitivity analysis that we consider is a sensitivity interval, an analogue of a confidence interval (CI) for when there is uncertainty about a model s assumptions. We now illustrate how a sensitivity analysis can be done for the CN-WOCOV model,

9 8 Biometrics, using the BSE study as an example. For the BSE study, the CN-WOCOV model s estimate of the CACE is with a 95% CI of ( 0.265, 0.021). This CI does not fully reflect our uncertainty about the CACE if we are not completely confident that the CN-WOCOV model holds. One reason to be concerned about the CN-WOCOV model holding that was discussed in the introduction is that never takers might be less cooperative than compliers in responding to the questionnaire. In other words, α 3 might be less than zero in model (3). Another reason to be concerned about the CN-WOCOV model holding that was raised by Mealli et al. (2004) is that once never takers show that they are unwilling to follow their assigned protocol, as they do under assignment to the enhanced treatment (Z = 1) but not under assignment to the standard treatment (Z = 0), they might be less inclined to respond to the questionnaire. In other words, α 5 might be less than zero. For illustrative purposes, we will consider a range of concern for (α 3, α 5 ) of 2 α 3, α 5 0 in a clinical study we were involved in, we would consult with the clinical investigators to determine the range of concern for (α 3, α 5 ). Our procedure for forming a 100(1 γ)% sensitivity interval (SI) for the CACE is to first form a joint confidence region for (α 3, α 5, CACE) under the assumption that α 3 and α 5 belong to the range of concern (and α 6 = 0) and then to project this confidence region to form a CI for the CACE. To form the joint confidence region, our approach is to invert a test of H 0 : CACE = CACE, α 3 = α3, α 5 = α5 for (α 3, α 5 ) in the range of concern for (α 3, α 5 ), where we carry out the test of H 0 by testing H 0A : α 3 = α 3, α 5 = α 5 at level γ 1 < γ and H 0B : CACE = CACE α 3 = α 3, α 5 = α 5 at level γ γ 1, and then reject H 0 if H 0A or H 0B is rejected. By Bonferroni s inequality, the size of the test that rejects H 0 if H 0A or H 0B is rejected is at most γ. To put more emphasis on the test of H 0B, we chose γ 1 = for γ = 0.05 in our example. The SI is the set of all CACE s for which there exists an (α 3, α 5 ) such that H 0 : CACE = CACE, α 3 = α 3, α 5 = α 5 is not rejected. The SI is analogous to a

10 Discussion of CGZ Paper on Randomized Trials with Noncompliance and CN Missing Data 9 CI in that as long as the true (α 3, α 5 ) belongs to the range of concern, the SI will contain the true CACE at least 1 γ proportion of the time. See Small (2007) for further discussion of SIs. The SI is difficult to compute exactly but can easily be approximated by considering the (α 3, α 5 ) on a grid over the range of concern. Figure 2 shows for such a grid for the BSE study, the (α 3, α 5 ) that are accepted and rejected at level A summary of our procedure for finding the 95% SI is as follows: for each (α 3, α 5 ) that is accepted in Figure 2 (the solid circles), we form a 95.5% CI for the CACE assuming the given value of (α 3, α 5 ) holds (and α 6 = 0); the 95% SI is (minimum of lower end points of CIs, maximum of upper end points of CIs). Using this procedure, we obtain a 95% SI of ( 0.475, 0.245), which is more than twice as wide as the 95% CI of ( 0.265, 0.021) that is obtained if we are sure that the CN- WOCOV model holds. In summary, if we are concerned that (α 3, α 5 ) might be in the range of 2 α 3, α 5 0, then we are substantially more uncertain about the CACE than if we are confident that the CN-WOCOV model holds (which would mean α 3 = α 5 = 0). [Figure 2 about here.] 5. Conclusions CGZ make a valuable contribution to the analysis of randomized trials with noncompliance and missing outcomes by formulating identifiable models that allow for the outcome to affect missingness. In our discussion, we have shown how CGZ s models relate to a larger class of models. We have also discussed tests of CGZ s models and an approach to sensitivity analysis for CGZ s models. We recommend the following analysis strategy when one of CGZ s models (or one of the other models described in Section 2) is thought to be plausible. If the model has less parameters than the saturated model for the observed data, test the goodness of fit of the model as we did in Section 3. If the model is not rejected by the test (or cannot be

11 10 Biometrics, tested), then carry out a sensitivity analysis for the most plausible violations of the model s assumptions as we illustrated in Section 4. The resulting sensitivity interval for the CACE (or any other parameter of interest) expresses our uncertainty about the CACE, accounting for both sampling fluctuations and uncertainty about the assumptions of the model holding. References Ferro, S., Caroli, A., Nanni, O., Biggeri, A. and Gambi, A. (1996). A randomized trial on breast self-examination in Faenze (Northern Italy). Tumori 82, Frangakis, C.E. and Rubin, D.B. (1999). Addressing complications of intention-to-treat analysis in the combined presence of all-or-none treatment-noncompliance and subsequent missing outcomes. Biometrika 86, Imai, K. (2007). Statistical analysis of randomized experiments with nonignorable missing binary outcomes: application to a voting experiment. Technical report: Mealli, F., Imbens, G. W., Ferro, S. and Biggeri, A. (2004). Analyzing a randomized trial on breast self-examination with noncompliance and missing outcomes. Biostatistics, 5, Rosenbaum, P. R. (2002). Observational studies, 2nd edition. Springer, New York. Small, D. (2007). Sensitivity analysis for instrumental variables regression with overidentifying restrictions. Journal of the American Statistical Association, 102,

12 Discussion of CGZ Paper on Randomized Trials with Noncompliance and CN Missing Data 11 Observationally Equivalent Parameter Values α α 3 Figure 1. The (α 3, α 5 ) for the θ that lie on the manifold of parameters that give the same distribution for the observable data as ˆθ, the MLE from the BSE study.

13 12 Biometrics, Range of concern for α 3,α 5 α α 3,α 5 not rejected α 3,α 5 rejected α 3 Figure 2. Results of tests of (α 3, α 5 ) in the range of concern for the BSE study.

14 Discussion of CGZ Paper on Randomized Trials with Noncompliance and CN Missing Data 13 Table 1 Breast Self Examination Study (Mealli et al., 2004) Z = 0, D = 0 Z = 0, D = 1 Z = 1, D = 0 Z = 1, D = 1 R = 1, Y = R = 1, Y = R = 0, Y =?

15 14 Biometrics, Table 2 Models for Missingness. Sets of constraints on the model for missingness (3) that enable identification of the parameters in (2)-(3). The full names for the abbreviated model descriptions in the table are provided in Section 2. # Model Constraints Additional Description α 1 α 2 α 3 α 4 α 5 α 6 Constraints 1 MCAR MAR 0 0 α 2 + α 4 = α 6 3 LI-RER-NT&AT LI-RER-C&AT LI-RER-All CN-WOCOV M-YD α 2 = α 6 8 M-YZ 0 0 α 4 = α 5 = α 6 9 RER-All 0 0 0

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes Kosuke Imai Department of Politics Princeton University July 31 2007 Kosuke Imai (Princeton University) Nonignorable

More information

Relaxing Latent Ignorability in the ITT Analysis of Randomized Studies with Missing Data and Noncompliance

Relaxing Latent Ignorability in the ITT Analysis of Randomized Studies with Missing Data and Noncompliance UW Biostatistics Working Paper Series 2-19-2009 Relaxing Latent Ignorability in the ITT Analysis of Randomized Studies with Missing Data and Noncompliance L Taylor University of Washington, taylorl@u.washington.edu

More information

Bounds on Causal Effects in Three-Arm Trials with Non-compliance. Jing Cheng Dylan Small

Bounds on Causal Effects in Three-Arm Trials with Non-compliance. Jing Cheng Dylan Small Bounds on Causal Effects in Three-Arm Trials with Non-compliance Jing Cheng Dylan Small Department of Biostatistics and Department of Statistics University of Pennsylvania June 20, 2005 A Three-Arm Randomized

More information

Identifiability and Estimation of Causal Effects in Randomized Trials with Noncompliance and Completely Non-ignorable Missing-Data

Identifiability and Estimation of Causal Effects in Randomized Trials with Noncompliance and Completely Non-ignorable Missing-Data UW Biostatistics Working Paper Series 11-1-2007 Identifiability and Estimation of Causal Effects in Randomized Trials with Noncompliance and Completely Non-ignorable Missing-Data Hua Chen Peking University,

More information

Noncompliance in Randomized Experiments

Noncompliance in Randomized Experiments Noncompliance in Randomized Experiments Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Noncompliance in Experiments Stat186/Gov2002 Fall 2018 1 / 15 Encouragement

More information

DEALING WITH MULTIVARIATE OUTCOMES IN STUDIES FOR CAUSAL EFFECTS

DEALING WITH MULTIVARIATE OUTCOMES IN STUDIES FOR CAUSAL EFFECTS DEALING WITH MULTIVARIATE OUTCOMES IN STUDIES FOR CAUSAL EFFECTS Donald B. Rubin Harvard University 1 Oxford Street, 7th Floor Cambridge, MA 02138 USA Tel: 617-495-5496; Fax: 617-496-8057 email: rubin@stat.harvard.edu

More information

Identification Analysis for Randomized Experiments with Noncompliance and Truncation-by-Death

Identification Analysis for Randomized Experiments with Noncompliance and Truncation-by-Death Identification Analysis for Randomized Experiments with Noncompliance and Truncation-by-Death Kosuke Imai First Draft: January 19, 2007 This Draft: August 24, 2007 Abstract Zhang and Rubin 2003) derives

More information

Multiple Imputation Methods for Treatment Noncompliance and Nonresponse in Randomized Clinical Trials

Multiple Imputation Methods for Treatment Noncompliance and Nonresponse in Randomized Clinical Trials UW Biostatistics Working Paper Series 2-19-2009 Multiple Imputation Methods for Treatment Noncompliance and Nonresponse in Randomized Clinical Trials Leslie Taylor UW, taylorl@u.washington.edu Xiao-Hua

More information

University of Pennsylvania and The Children s Hospital of Philadelphia

University of Pennsylvania and The Children s Hospital of Philadelphia Submitted to the Annals of Applied Statistics arxiv: arxiv:0000.0000 ESTIMATION OF CAUSAL EFFECTS USING INSTRUMENTAL VARIABLES WITH NONIGNORABLE MISSING COVARIATES: APPLICATION TO EFFECT OF TYPE OF DELIVERY

More information

Harvard University. Harvard University Biostatistics Working Paper Series

Harvard University. Harvard University Biostatistics Working Paper Series Harvard University Harvard University Biostatistics Working Paper Series Year 2010 Paper 117 Estimating Causal Effects in Trials Involving Multi-treatment Arms Subject to Non-compliance: A Bayesian Frame-work

More information

Estimating causal effects in trials involving multitreatment arms subject to non-compliance: a Bayesian framework

Estimating causal effects in trials involving multitreatment arms subject to non-compliance: a Bayesian framework Appl. Statist. (2010) 59, Part 3, pp. 513 531 Estimating causal effects in trials involving multitreatment arms subject to non-compliance: a Bayesian framework Qi Long, Emory University, Atlanta, USA Roderick

More information

Bayesian Inference for Sequential Treatments under Latent Sequential Ignorability. June 19, 2017

Bayesian Inference for Sequential Treatments under Latent Sequential Ignorability. June 19, 2017 Bayesian Inference for Sequential Treatments under Latent Sequential Ignorability Alessandra Mattei, Federico Ricciardi and Fabrizia Mealli Department of Statistics, Computer Science, Applications, University

More information

arxiv: v1 [stat.me] 3 Feb 2016

arxiv: v1 [stat.me] 3 Feb 2016 Principal stratification analysis using principal scores Peng Ding and Jiannan Lu arxiv:602.096v [stat.me] 3 Feb 206 Abstract Practitioners are interested in not only the average causal effect of the treatment

More information

arxiv: v1 [stat.me] 8 Jun 2016

arxiv: v1 [stat.me] 8 Jun 2016 Principal Score Methods: Assumptions and Extensions Avi Feller UC Berkeley Fabrizia Mealli Università di Firenze Luke Miratrix Harvard GSE arxiv:1606.02682v1 [stat.me] 8 Jun 2016 June 9, 2016 Abstract

More information

Growth Mixture Modeling and Causal Inference. Booil Jo Stanford University

Growth Mixture Modeling and Causal Inference. Booil Jo Stanford University Growth Mixture Modeling and Causal Inference Booil Jo Stanford University booil@stanford.edu Conference on Advances in Longitudinal Methods inthe Socialand and Behavioral Sciences June 17 18, 2010 Center

More information

150C Causal Inference

150C Causal Inference 150C Causal Inference Instrumental Variables: Modern Perspective with Heterogeneous Treatment Effects Jonathan Mummolo May 22, 2017 Jonathan Mummolo 150C Causal Inference May 22, 2017 1 / 26 Two Views

More information

A Comparison of Methods for Estimating the Causal Effect of a Treatment in Randomized. Clinical Trials Subject to Noncompliance.

A Comparison of Methods for Estimating the Causal Effect of a Treatment in Randomized. Clinical Trials Subject to Noncompliance. Draft June 6, 006 A Comparison of Methods for Estimating the Causal Effect of a Treatment in Randomized Clinical Trials Subject to Noncompliance Roderick Little 1, Qi Long and Xihong Lin 3 Abstract We

More information

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs Michael J. Daniels and Chenguang Wang Jan. 18, 2009 First, we would like to thank Joe and Geert for a carefully

More information

Sensitivity checks for the local average treatment effect

Sensitivity checks for the local average treatment effect Sensitivity checks for the local average treatment effect Martin Huber March 13, 2014 University of St. Gallen, Dept. of Economics Abstract: The nonparametric identification of the local average treatment

More information

WORKSHOP ON PRINCIPAL STRATIFICATION STANFORD UNIVERSITY, Luke W. Miratrix (Harvard University) Lindsay C. Page (University of Pittsburgh)

WORKSHOP ON PRINCIPAL STRATIFICATION STANFORD UNIVERSITY, Luke W. Miratrix (Harvard University) Lindsay C. Page (University of Pittsburgh) WORKSHOP ON PRINCIPAL STRATIFICATION STANFORD UNIVERSITY, 2016 Luke W. Miratrix (Harvard University) Lindsay C. Page (University of Pittsburgh) Our team! 2 Avi Feller (Berkeley) Jane Furey (Abt Associates)

More information

Imbens, Lecture Notes 2, Local Average Treatment Effects, IEN, Miami, Oct 10 1

Imbens, Lecture Notes 2, Local Average Treatment Effects, IEN, Miami, Oct 10 1 Imbens, Lecture Notes 2, Local Average Treatment Effects, IEN, Miami, Oct 10 1 Lectures on Evaluation Methods Guido Imbens Impact Evaluation Network October 2010, Miami Methods for Estimating Treatment

More information

IsoLATEing: Identifying Heterogeneous Effects of Multiple Treatments

IsoLATEing: Identifying Heterogeneous Effects of Multiple Treatments IsoLATEing: Identifying Heterogeneous Effects of Multiple Treatments Peter Hull December 2014 PRELIMINARY: Please do not cite or distribute without permission. Please see www.mit.edu/~hull/research.html

More information

Basics of Modern Missing Data Analysis

Basics of Modern Missing Data Analysis Basics of Modern Missing Data Analysis Kyle M. Lang Center for Research Methods and Data Analysis University of Kansas March 8, 2013 Topics to be Covered An introduction to the missing data problem Missing

More information

Alexina Mason. Department of Epidemiology and Biostatistics Imperial College, London. 16 February 2010

Alexina Mason. Department of Epidemiology and Biostatistics Imperial College, London. 16 February 2010 Strategy for modelling non-random missing data mechanisms in longitudinal studies using Bayesian methods: application to income data from the Millennium Cohort Study Alexina Mason Department of Epidemiology

More information

A Course in Applied Econometrics. Lecture 5. Instrumental Variables with Treatment Effect. Heterogeneity: Local Average Treatment Effects.

A Course in Applied Econometrics. Lecture 5. Instrumental Variables with Treatment Effect. Heterogeneity: Local Average Treatment Effects. A Course in Applied Econometrics Lecture 5 Outline. Introduction 2. Basics Instrumental Variables with Treatment Effect Heterogeneity: Local Average Treatment Effects 3. Local Average Treatment Effects

More information

Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach

Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach Statistical Methods for Handling Incomplete Data Chapter 2: Likelihood-based approach Jae-Kwang Kim Department of Statistics, Iowa State University Outline 1 Introduction 2 Observed likelihood 3 Mean Score

More information

Treatment evaluation with multiple outcome periods under endogeneity and attrition

Treatment evaluation with multiple outcome periods under endogeneity and attrition Treatment evaluation with multiple outcome periods under endogeneity and attrition Markus Frölich*, Martin Huber** *Universität Mannheim and IZA (Bonn) **Universität St.Gallen Last changes: 27 January

More information

Pubh 8482: Sequential Analysis

Pubh 8482: Sequential Analysis Pubh 8482: Sequential Analysis Joseph S. Koopmeiners Division of Biostatistics University of Minnesota Week 8 P-values When reporting results, we usually report p-values in place of reporting whether or

More information

AGEC 661 Note Fourteen

AGEC 661 Note Fourteen AGEC 661 Note Fourteen Ximing Wu 1 Selection bias 1.1 Heckman s two-step model Consider the model in Heckman (1979) Y i = X iβ + ε i, D i = I {Z iγ + η i > 0}. For a random sample from the population,

More information

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case Maximilian Kasy Department of Economics, Harvard University 1 / 40 Agenda instrumental variables part I Origins of instrumental

More information

Instrumental Variables

Instrumental Variables Instrumental Variables Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Noncompliance in Experiments Stat186/Gov2002 Fall 2018 1 / 18 Instrumental Variables

More information

arxiv: v3 [stat.me] 20 Feb 2016

arxiv: v3 [stat.me] 20 Feb 2016 Posterior Predictive p-values with Fisher Randomization Tests in Noncompliance Settings: arxiv:1511.00521v3 [stat.me] 20 Feb 2016 Test Statistics vs Discrepancy Variables Laura Forastiere 1, Fabrizia Mealli

More information

Accounting for Population Uncertainty in Covariance Structure Analysis

Accounting for Population Uncertainty in Covariance Structure Analysis Accounting for Population Uncertainty in Structure Analysis Boston College May 21, 2013 Joint work with: Michael W. Browne The Ohio State University matrix among observed variables are usually implied

More information

Calibration Estimation for Semiparametric Copula Models under Missing Data

Calibration Estimation for Semiparametric Copula Models under Missing Data Calibration Estimation for Semiparametric Copula Models under Missing Data Shigeyuki Hamori 1 Kaiji Motegi 1 Zheng Zhang 2 1 Kobe University 2 Renmin University of China Economics and Economic Growth Centre

More information

Longitudinal Nested Compliance Class Model in the Presence of Time-Varying Noncompliance

Longitudinal Nested Compliance Class Model in the Presence of Time-Varying Noncompliance Longitudinal Nested Compliance Class Model in the Presence of Time-Varying Noncompliance Julia Y. Lin Thomas R. Ten Have Michael R. Elliott Julia Y. Lin is a doctoral candidate (E-mail: jlin@cceb.med.upenn.edu),

More information

Shu Yang and Jae Kwang Kim. Harvard University and Iowa State University

Shu Yang and Jae Kwang Kim. Harvard University and Iowa State University Statistica Sinica 27 (2017), 000-000 doi:https://doi.org/10.5705/ss.202016.0155 DISCUSSION: DISSECTING MULTIPLE IMPUTATION FROM A MULTI-PHASE INFERENCE PERSPECTIVE: WHAT HAPPENS WHEN GOD S, IMPUTER S AND

More information

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A. Linero and M. Daniels UF, UT-Austin SRC 2014, Galveston, TX 1 Background 2 Working model

More information

Bootstrapping Sensitivity Analysis

Bootstrapping Sensitivity Analysis Bootstrapping Sensitivity Analysis Qingyuan Zhao Department of Statistics, The Wharton School University of Pennsylvania May 23, 2018 @ ACIC Based on: Qingyuan Zhao, Dylan S. Small, and Bhaswar B. Bhattacharya.

More information

Statistical Methods. Missing Data snijders/sm.htm. Tom A.B. Snijders. November, University of Oxford 1 / 23

Statistical Methods. Missing Data  snijders/sm.htm. Tom A.B. Snijders. November, University of Oxford 1 / 23 1 / 23 Statistical Methods Missing Data http://www.stats.ox.ac.uk/ snijders/sm.htm Tom A.B. Snijders University of Oxford November, 2011 2 / 23 Literature: Joseph L. Schafer and John W. Graham, Missing

More information

Introduction to Causal Inference. Solutions to Quiz 4

Introduction to Causal Inference. Solutions to Quiz 4 Introduction to Causal Inference Solutions to Quiz 4 Teppei Yamamoto Tuesday, July 9 206 Instructions: Write your name in the space provided below before you begin. You have 20 minutes to complete the

More information

Estimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing

Estimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing Estimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing Alessandra Mattei Dipartimento di Statistica G. Parenti Università

More information

Identi cation of Positive Treatment E ects in. Randomized Experiments with Non-Compliance

Identi cation of Positive Treatment E ects in. Randomized Experiments with Non-Compliance Identi cation of Positive Treatment E ects in Randomized Experiments with Non-Compliance Aleksey Tetenov y February 18, 2012 Abstract I derive sharp nonparametric lower bounds on some parameters of the

More information

Nonrespondent subsample multiple imputation in two-phase random sampling for nonresponse

Nonrespondent subsample multiple imputation in two-phase random sampling for nonresponse Nonrespondent subsample multiple imputation in two-phase random sampling for nonresponse Nanhua Zhang Division of Biostatistics & Epidemiology Cincinnati Children s Hospital Medical Center (Joint work

More information

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random

Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Calibration Estimation of Semiparametric Copula Models with Data Missing at Random Shigeyuki Hamori 1 Kaiji Motegi 1 Zheng Zhang 2 1 Kobe University 2 Renmin University of China Institute of Statistics

More information

The propensity score with continuous treatments

The propensity score with continuous treatments 7 The propensity score with continuous treatments Keisuke Hirano and Guido W. Imbens 1 7.1 Introduction Much of the work on propensity score analysis has focused on the case in which the treatment is binary.

More information

Multivariate CACE Analysis with an Application to Arthritis Health Journal Study

Multivariate CACE Analysis with an Application to Arthritis Health Journal Study Multivariate CACE Analysis with an Application to Arthritis Health Journal Study by Yue Ma B.Sc., Nankai University, 206 Project Submitted in Partial Fulfillment of the Requirements for the Degree of Master

More information

Sharp Bounds on Causal Effects under Sample Selection*

Sharp Bounds on Causal Effects under Sample Selection* OXFORD BULLETIN OF ECONOMICS AND STATISTICS, 77, 1 (2015) 0305 9049 doi: 10.1111/obes.12056 Sharp Bounds on Causal Effects under Sample Selection* Martin Huber and Giovanni Mellace Department of Economics,

More information

Introduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015

Introduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015 Introduction to causal identification Nidhiya Menon IGC Summer School, New Delhi, July 2015 Outline 1. Micro-empirical methods 2. Rubin causal model 3. More on Instrumental Variables (IV) Estimating causal

More information

Exact Nonparametric Inference for a Binary. Endogenous Regressor

Exact Nonparametric Inference for a Binary. Endogenous Regressor Exact Nonparametric Inference for a Binary Endogenous Regressor Brigham R. Frandsen December 5, 2013 Abstract This paper describes a randomization-based estimation and inference procedure for the distribution

More information

Identification Problem for The Analysis of Binary Data with Non-ignorable Missing

Identification Problem for The Analysis of Binary Data with Non-ignorable Missing Identification Problem for The Analysis of Binary Data with Non-ignorable Missing Kosuke Morikawa, Yutaka Kano Division of Mathematical Science, Graduate School of Engineering Science, Osaka University,

More information

Estimating Post-Treatment Effect Modification With Generalized Structural Mean Models

Estimating Post-Treatment Effect Modification With Generalized Structural Mean Models Estimating Post-Treatment Effect Modification With Generalized Structural Mean Models Alisa Stephens Luke Keele Marshall Joffe December 5, 2013 Abstract In randomized controlled trials, the evaluation

More information

Targeted Maximum Likelihood Estimation in Safety Analysis

Targeted Maximum Likelihood Estimation in Safety Analysis Targeted Maximum Likelihood Estimation in Safety Analysis Sam Lendle 1 Bruce Fireman 2 Mark van der Laan 1 1 UC Berkeley 2 Kaiser Permanente ISPE Advanced Topics Session, Barcelona, August 2012 1 / 35

More information

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Joe Schafer Office of the Associate Director for Research and Methodology U.S. Census

More information

Reconstruction of individual patient data for meta analysis via Bayesian approach

Reconstruction of individual patient data for meta analysis via Bayesian approach Reconstruction of individual patient data for meta analysis via Bayesian approach Yusuke Yamaguchi, Wataru Sakamoto and Shingo Shirahata Graduate School of Engineering Science, Osaka University Masashi

More information

6 Pattern Mixture Models

6 Pattern Mixture Models 6 Pattern Mixture Models A common theme underlying the methods we have discussed so far is that interest focuses on making inference on parameters in a parametric or semiparametric model for the full data

More information

arxiv: v1 [stat.ap] 6 Mar 2018

arxiv: v1 [stat.ap] 6 Mar 2018 Biometrics, DOI: Using Survival Information in Truncation by Death Problems Without the Monotonicity Assumption arxiv:83.224v [stat.ap] 6 Mar 28 Fan Yang Department of Biostatistics and Informatics, University

More information

Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design

Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design 1 / 32 Empirical Likelihood Methods for Two-sample Problems with Data Missing-by-Design Changbao Wu Department of Statistics and Actuarial Science University of Waterloo (Joint work with Min Chen and Mary

More information

Analyzing Pilot Studies with Missing Observations

Analyzing Pilot Studies with Missing Observations Analyzing Pilot Studies with Missing Observations Monnie McGee mmcgee@smu.edu. Department of Statistical Science Southern Methodist University, Dallas, Texas Co-authored with N. Bergasa (SUNY Downstate

More information

7 Sensitivity Analysis

7 Sensitivity Analysis 7 Sensitivity Analysis A recurrent theme underlying methodology for analysis in the presence of missing data is the need to make assumptions that cannot be verified based on the observed data. If the assumption

More information

Evaluating the Effect of Training on Wages in the Presence of Noncompliance, Nonemployment, and Missing Outcome Data

Evaluating the Effect of Training on Wages in the Presence of Noncompliance, Nonemployment, and Missing Outcome Data This article was downloaded by: [Universita Degli Studi di Firenze On: 25 July 2012 At: 00:45 Publisher: Taylor & Francis Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered

More information

Survivor Bias and Effect Heterogeneity

Survivor Bias and Effect Heterogeneity Survivor Bias and Effect Heterogeneity Anton Strezhnev Draft April 27, 207 Abstract In multi-period studies where an outcome is observed at some initial point in time and at some later follow-up wave,

More information

Advanced Statistical Methods for Observational Studies L E C T U R E 0 6

Advanced Statistical Methods for Observational Studies L E C T U R E 0 6 Advanced Statistical Methods for Observational Studies L E C T U R E 0 6 class management Problem set 1 is posted Questions? design thus far We re off to a bad start. 1 2 1 2 1 2 2 2 1 1 1 2 1 1 2 2 2

More information

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i, A Course in Applied Econometrics Lecture 18: Missing Data Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. When Can Missing Data be Ignored? 2. Inverse Probability Weighting 3. Imputation 4. Heckman-Type

More information

Partial Identification of Average Treatment Effects in Program Evaluation: Theory and Applications

Partial Identification of Average Treatment Effects in Program Evaluation: Theory and Applications University of Miami Scholarly Repository Open Access Dissertations Electronic Theses and Dissertations 2013-07-11 Partial Identification of Average Treatment Effects in Program Evaluation: Theory and Applications

More information

Using Post Outcome Measurement Information in Censoring by Death Problems

Using Post Outcome Measurement Information in Censoring by Death Problems Using Post Outcome Measurement Information in Censoring by Death Problems Fan Yang University of Chicago, Chicago, USA. Dylan S. Small University of Pennsylvania, Philadelphia, USA. Summary. Many clinical

More information

A Reliable Constrained Method for Identity Link Poisson Regression

A Reliable Constrained Method for Identity Link Poisson Regression A Reliable Constrained Method for Identity Link Poisson Regression Ian Marschner Macquarie University, Sydney Australasian Region of the International Biometrics Society, Taupo, NZ, Dec 2009. 1 / 16 Identity

More information

An Empirical Comparison of Multiple Imputation Approaches for Treating Missing Data in Observational Studies

An Empirical Comparison of Multiple Imputation Approaches for Treating Missing Data in Observational Studies Paper 177-2015 An Empirical Comparison of Multiple Imputation Approaches for Treating Missing Data in Observational Studies Yan Wang, Seang-Hwane Joo, Patricia Rodríguez de Gil, Jeffrey D. Kromrey, Rheta

More information

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3 University of California, Irvine 2017-2018 1 Statistics (STATS) Courses STATS 5. Seminar in Data Science. 1 Unit. An introduction to the field of Data Science; intended for entering freshman and transfers.

More information

Whether to use MMRM as primary estimand.

Whether to use MMRM as primary estimand. Whether to use MMRM as primary estimand. James Roger London School of Hygiene & Tropical Medicine, London. PSI/EFSPI European Statistical Meeting on Estimands. Stevenage, UK: 28 September 2015. 1 / 38

More information

Flexible Estimation of Treatment Effect Parameters

Flexible Estimation of Treatment Effect Parameters Flexible Estimation of Treatment Effect Parameters Thomas MaCurdy a and Xiaohong Chen b and Han Hong c Introduction Many empirical studies of program evaluations are complicated by the presence of both

More information

Causal-inference and Meta-analytic Approaches to Surrogate Endpoint Validation

Causal-inference and Meta-analytic Approaches to Surrogate Endpoint Validation Causal-inference and Meta-analytic Approaches to Surrogate Endpoint Validation Tomasz Burzykowski I-BioStat, Hasselt University, Belgium, and International Drug Development Institute (IDDI), Belgium tomasz.burzykowski@uhasselt.be

More information

Bayesian methods for missing data: part 1. Key Concepts. Nicky Best and Alexina Mason. Imperial College London

Bayesian methods for missing data: part 1. Key Concepts. Nicky Best and Alexina Mason. Imperial College London Bayesian methods for missing data: part 1 Key Concepts Nicky Best and Alexina Mason Imperial College London BAYES 2013, May 21-23, Erasmus University Rotterdam Missing Data: Part 1 BAYES2013 1 / 68 Outline

More information

ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW

ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW SSC Annual Meeting, June 2015 Proceedings of the Survey Methods Section ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW Xichen She and Changbao Wu 1 ABSTRACT Ordinal responses are frequently involved

More information

Instrumental Variables

Instrumental Variables Instrumental Variables Teppei Yamamoto Keio University Introduction to Causal Inference Spring 2016 Noncompliance in Randomized Experiments Often we cannot force subjects to take specific treatments Units

More information

A note on multiple imputation for general purpose estimation

A note on multiple imputation for general purpose estimation A note on multiple imputation for general purpose estimation Shu Yang Jae Kwang Kim SSC meeting June 16, 2015 Shu Yang, Jae Kwang Kim Multiple Imputation June 16, 2015 1 / 32 Introduction Basic Setup Assume

More information

DISIA WORKING PAPER 2015/06

DISIA WORKING PAPER 2015/06 Bayesian inference for causal mechanisms with application to a randomized study for postoperative pain control Michela Baccini, Alessandra Mattei, Fabrizia Mealli DISIA WORKING PAPER 205/06 Copyright is

More information

Gov 2002: 4. Observational Studies and Confounding

Gov 2002: 4. Observational Studies and Confounding Gov 2002: 4. Observational Studies and Confounding Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last two weeks: randomized experiments. From here on: observational studies. What

More information

Estimating the Dynamic Effects of a Job Training Program with M. Program with Multiple Alternatives

Estimating the Dynamic Effects of a Job Training Program with M. Program with Multiple Alternatives Estimating the Dynamic Effects of a Job Training Program with Multiple Alternatives Kai Liu 1, Antonio Dalla-Zuanna 2 1 University of Cambridge 2 Norwegian School of Economics June 19, 2018 Introduction

More information

ECO Class 6 Nonparametric Econometrics

ECO Class 6 Nonparametric Econometrics ECO 523 - Class 6 Nonparametric Econometrics Carolina Caetano Contents 1 Nonparametric instrumental variable regression 1 2 Nonparametric Estimation of Average Treatment Effects 3 2.1 Asymptotic results................................

More information

arxiv: v1 [stat.me] 15 May 2011

arxiv: v1 [stat.me] 15 May 2011 Working Paper Propensity Score Analysis with Matching Weights Liang Li, Ph.D. arxiv:1105.2917v1 [stat.me] 15 May 2011 Associate Staff of Biostatistics Department of Quantitative Health Sciences, Cleveland

More information

9/12/17. Types of learning. Modeling data. Supervised learning: Classification. Supervised learning: Regression. Unsupervised learning: Clustering

9/12/17. Types of learning. Modeling data. Supervised learning: Classification. Supervised learning: Regression. Unsupervised learning: Clustering Types of learning Modeling data Supervised: we know input and targets Goal is to learn a model that, given input data, accurately predicts target data Unsupervised: we know the input only and want to make

More information

Package noncompliance

Package noncompliance Type Package Package noncompliance February 15, 2016 Title Causal Inference in the Presence of Treatment Noncompliance Under the Binary Instrumental Variable Model Version 0.2.2 Date 2016-02-11 A finite-population

More information

Nested Markov Compliance Class Model in the Presence of Time-Varying Noncompliance

Nested Markov Compliance Class Model in the Presence of Time-Varying Noncompliance Biometrics 65, 505 513 June 2009 DOI: 10.1111/j.1541-0420.2008.01113.x Nested Markov Compliance Class Model in the Presence of Time-Varying Noncompliance Julia Y. Lin, 1, Thomas R. Ten Have, 2 and Michael

More information

Causal Inference with Counterfactuals

Causal Inference with Counterfactuals Causal Inference with Counterfactuals Robin Evans robin.evans@stats.ox.ac.uk Hilary 2014 1 Introduction What does it mean to say that a (possibly random) variable X is a cause of the random variable Y?

More information

Paradoxes in instrumental variable studies with missing data and one-sided noncompliance

Paradoxes in instrumental variable studies with missing data and one-sided noncompliance Paradoxes in instrumental variable studies with missing data and one-sided noncompliance arxiv:1705.00506v2 [stat.me] 18 Apr 2018 Edward H. Kennedy Department of Statistics, Carnegie Mellon University

More information

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score Causal Inference with General Treatment Regimes: Generalizing the Propensity Score David van Dyk Department of Statistics, University of California, Irvine vandyk@stat.harvard.edu Joint work with Kosuke

More information

Beyond LATE: A Simple Method for Estimation of the. Average Treatment Effect with an Instrumental Variable

Beyond LATE: A Simple Method for Estimation of the. Average Treatment Effect with an Instrumental Variable Beyond LATE: A Simple Method for Estimation of the Average Treatment Effect with an Instrumental Variable Peter M. Aronow and Allison Carnegie August 30, 2012 Peter M. Aronow is Doctoral Candidate, Department

More information

leebounds: Lee s (2009) treatment effects bounds for non-random sample selection for Stata

leebounds: Lee s (2009) treatment effects bounds for non-random sample selection for Stata leebounds: Lee s (2009) treatment effects bounds for non-random sample selection for Stata Harald Tauchmann (RWI & CINCH) Rheinisch-Westfälisches Institut für Wirtschaftsforschung (RWI) & CINCH Health

More information

Unpacking the Black-Box: Learning about Causal Mechanisms from Experimental and Observational Studies

Unpacking the Black-Box: Learning about Causal Mechanisms from Experimental and Observational Studies Unpacking the Black-Box: Learning about Causal Mechanisms from Experimental and Observational Studies Kosuke Imai Princeton University Joint work with Keele (Ohio State), Tingley (Harvard), Yamamoto (Princeton)

More information

Latent Variable Model for Weight Gain Prevention Data with Informative Intermittent Missingness

Latent Variable Model for Weight Gain Prevention Data with Informative Intermittent Missingness Journal of Modern Applied Statistical Methods Volume 15 Issue 2 Article 36 11-1-2016 Latent Variable Model for Weight Gain Prevention Data with Informative Intermittent Missingness Li Qin Yale University,

More information

arxiv: v2 [stat.me] 17 Jan 2017

arxiv: v2 [stat.me] 17 Jan 2017 Semiparametric Estimation with Data Missing Not at Random Using an Instrumental Variable arxiv:1607.03197v2 [stat.me] 17 Jan 2017 BaoLuo Sun 1, Lan Liu 1, Wang Miao 1,4, Kathleen Wirth 2,3, James Robins

More information

arxiv: v2 [stat.me] 24 Apr 2018

arxiv: v2 [stat.me] 24 Apr 2018 Biometrika, pp. 1 4 Printed in Great Britain Identification and Estimation of Causal Effects with Outcomes Truncated by Death arxiv:1605.05309v2 [stat.me] 24 Apr 2018 BY LINBO WANG Department of Biostatistics,

More information

Causal Effect Estimation Under Linear and Log- Linear Structural Nested Mean Models in the Presence of Unmeasured Confounding

Causal Effect Estimation Under Linear and Log- Linear Structural Nested Mean Models in the Presence of Unmeasured Confounding University of Pennsylvania ScholarlyCommons Publicly Accessible Penn Dissertations Summer 8-13-2010 Causal Effect Estimation Under Linear and Log- Linear Structural Nested Mean Models in the Presence of

More information

An Introduction to Causal Analysis on Observational Data using Propensity Scores

An Introduction to Causal Analysis on Observational Data using Propensity Scores An Introduction to Causal Analysis on Observational Data using Propensity Scores Margie Rosenberg*, PhD, FSA Brian Hartman**, PhD, ASA Shannon Lane* *University of Wisconsin Madison **University of Connecticut

More information

Applied Statistics Lecture Notes

Applied Statistics Lecture Notes Applied Statistics Lecture Notes Kosuke Imai Department of Politics Princeton University February 2, 2008 Making statistical inferences means to learn about what you do not observe, which is called parameters,

More information

Quantile Regression for Residual Life and Empirical Likelihood

Quantile Regression for Residual Life and Empirical Likelihood Quantile Regression for Residual Life and Empirical Likelihood Mai Zhou email: mai@ms.uky.edu Department of Statistics, University of Kentucky, Lexington, KY 40506-0027, USA Jong-Hyeon Jeong email: jeong@nsabp.pitt.edu

More information

Estimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis.

Estimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis. Estimating the Mean Response of Treatment Duration Regimes in an Observational Study Anastasios A. Tsiatis http://www.stat.ncsu.edu/ tsiatis/ Introduction to Dynamic Treatment Regimes 1 Outline Description

More information

Statistical Inference

Statistical Inference Statistical Inference Classical and Bayesian Methods Revision Class for Midterm Exam AMS-UCSC Th Feb 9, 2012 Winter 2012. Session 1 (Revision Class) AMS-132/206 Th Feb 9, 2012 1 / 23 Topics Topics We will

More information

The Essential Role of Pair Matching in. Cluster-Randomized Experiments. with Application to the Mexican Universal Health Insurance Evaluation

The Essential Role of Pair Matching in. Cluster-Randomized Experiments. with Application to the Mexican Universal Health Insurance Evaluation The Essential Role of Pair Matching in Cluster-Randomized Experiments, with Application to the Mexican Universal Health Insurance Evaluation Kosuke Imai Princeton University Gary King Clayton Nall Harvard

More information

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang

Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features. Yangxin Huang Bayesian Inference on Joint Mixture Models for Survival-Longitudinal Data with Multiple Features Yangxin Huang Department of Epidemiology and Biostatistics, COPH, USF, Tampa, FL yhuang@health.usf.edu January

More information