Bootstrapping Sensitivity Analysis

Size: px
Start display at page:

Download "Bootstrapping Sensitivity Analysis"

Transcription

1 Bootstrapping Sensitivity Analysis Qingyuan Zhao Department of Statistics, The Wharton School University of Pennsylvania May 23, ACIC Based on: Qingyuan Zhao, Dylan S. Small, and Bhaswar B. Bhattacharya. Sensitivity analysis for inverse probability weighting estimators via the percentile bootstrap. arxiv: R package bootsens:

2 Why sensitivity analysis? Unless we have perfectly executed randomized experiment, causal inference is based on some unverifiable assumptions. In observational studies, the most commonly used assumption is ignorability or no unmeasured confounding: A Y (0), Y (1) X. We can only say this assumption is plausible. Sensitivity analysis asks: what if this assumption does not hold? Does our qualitative conclusion still hold? This question appears in multiple settings: 1. Confounded observational studies. 2. Survey sampling with missing not at random (MNAR). 3. Longitudinal study with non-ignorable dropout. In general, this means that the target parameter (e.g. average treatment effect) is only partially identified. 1/20

3 Overview: Bootstrapping sensitivity analysis Point-identified parameter: Efron s bootstrap Bootstrap Point estimator ============ Confidence interval Partially identified parameter: An analogy Optimization Percentile Bootstrap Minimax inequality Extrema estimator ============ Confidence interval Rest of the talk Apply this idea to IPW estimators in a marginal sensitivity model. 2/20

4 Some existing sensitivity models Generally, we need to specify how unconfoundedness is violated. 1. Y models: Consider a specific difference between the conditional distribution Y (a) X, A and Y (a) X. Usually called pattern mixture models. Robins (1999, 2002); Birmingham et al. (2003); Vansteelandt et al. (2006); Daniels and Hogan (2008). 2. A models: Consider a specific difference between the conditional distribution A X, Y (a) and A X. Usually called selection models. Scharfstein et al. (1999); Birmingham et al. (2003); Vansteelandt et al. (2006); Scharfstein et al. (2014). 3. Simultaneous models: Consider a range of A models and/or Y models and report the worst case result. Cornfield et al. (1959); Rosenbaum (2002); Ding and VanderWeele (2016); Fogarty (2017). Our sensitivity model: A hybrid of 2nd and 3rd. 3/20

5 Rosenbaum s sensitivity model Imagine there is an unobserved confounder U that summarizes all confounding, so A Y (0), Y (1) X, U. Let e 0 (x, u) = P 0 (A = 1 X = x, U = u). Rosenbaum s sensitivity model { 1 } R(Γ) = e(x, u) : Γ OR(e(x, u 1), e(x, u 2 )) Γ, x X, u 1, u 2, where OR(p 1, p 2 ) := [p 1 /(1 p 1 )]/[p 2 /(1 p 2 )] is the odds ratio. Rosenbaum s question: can we reject the sharp null hypothesis Y (0) Y (1) for every e 0 (x, u) R(Γ)? Robins (2002): we don t need to assume the existence of U. Let U = Y (1) when the goal is to estimate E[Y (1)]. 4/20

6 Our sensitivity model Let e 0 (x) = P 0 (A = 1 X = x) be the propensity score. Marginal sensitivity models { M(Γ) = e(x, y) : Compare this to Rosenbaum s model: R(Γ) = { e(x, u) : 1 Γ OR(e(x, y), e 0(x)) Γ, x X, y }. 1 } Γ OR(e(x, u 1), e(x, u 2 )) Γ, x X, u 1, u 2. Tan (2006) (first?) considered this model, but he did not consider statistical inference in finite sample. Relationship between the two models: M( Γ) R(Γ) M(Γ). 1 For observational studies, we assume both P 0 (A = 1 X, Y (1)), P 0 (A = 1 X, Y (0)) M(Γ). 1 The second part needs compatibility : e(x, y) marginalizes to e0(x). 5/20

7 Parametric extension In practice, the propensity score e 0 (X ) = P 0 (A = 1 X ) is often estimated by a parametric model. Definition (Parametric marginal sensitivity models) { 1 } M β0 (Γ) = e(x, y) : Γ OR(e(x, y), e β 0 (x)) Γ, x X, y, where e β0 (x) is the best parametric approximation of e 0(x). This model covers both 1. Model misspecification, that is, e β0 (x) e 0 (x); and 2. Missing not at random, that is, e 0 (x) e 0 (x, y). 6/20

8 Logistic representations 1. Rosenbaum s sensitivity model: logit(e(x, u)) = g(x) + γu, where 0 U 1 and γ = log Γ. 2. Marginal sensitivity model: logit(e (h) (x, y)) = logit(e 0 (x)) + h(x, y), where h = sup h(x, y) γ. Due to this reason we also call it marginal L -sensitivity model. 3. Parametric marginal sensitivity model: logit(e (h) (x, y)) = logit(e β0 (x)) + h(x, y), where h = sup h(x, y) γ. 7/20

9 Confidence interval I For simplicity, consider the missing data problem where Y = Y (1) is only observed if A = 1. Observe i.i.d. samples (A i, X i, A i Y i ), i = 1,..., n. The estimand is µ 0 = E 0 [Y ], however it is only partially identified under a simultaneous sensitivity model. Goal 1 (Coverage of true parameter) Construct a data-dependent interval [L, U] such that P 0 ( µ0 [L, U] ) 1 α whenever e 0 (X, Y ) = P 0 (A = 1 X, Y ) M(Γ). 8/20

10 Confidence interval II The inverse probability weighting (IPW) identity: Define [ E 0 [Y ] = E AY e 0 (X, Y ) ] [ MAR AY ] = E. e 0 (X ) [ ] µ (h) AY = E 0 e (h) (X, Y ) Partially identified region: {µ (h) : e (h) M(Γ)}. Goal 2 (Coverage of partially identified region) Construct a data-dependent interval [L, U] such that ) P 0 ({µ (h) : e (h) M(Γ)} [L, U] 1 α. Imbens and Manski (2004) have discussed the difference between these two Goals. 9/20

11 An intuitive idea: The Union Method Suppose for any h, we have a confidence interval [L (h), U (h) ] such that lim inf n P 0(µ (h) [L (h), U (h) ]) 1 α Let L = inf h L(h) and U = sup U (h), so [L, U] is the union interval. h Theorem 1. [L, U] satisfies Goal 1 asymptotically. 2. Furthermore if the intervals are congruent : α < α such that lim sup n ( P 0 µ (h) < L (h)) α (, lim sup P 0 µ (h) > U (h)) α α. n Then [L, U] satisfies Goal 2 asymptotically. 10/20

12 Practical challenge: How to take the union? Suppose ĝ(x) is an estimate of logit(e 0 (x)). For a specific difference h, we can estimate e (h) (x, y) by ê (h) (x, y) = e h(x,y) ĝ(x,y). This leads to an (stabilized) IPW estimate of µ (h) : ˆµ (h) = [ 1 n n i=1 A i ] 1 [ 1 ê (h) (X i, Y i ) n n i=1 Using the general Z-estimation theory, n ( ˆµ (h) µ (h)) d N(0, (σ (h) ) 2 ) ] A i Y i. ê (h) (X i, Y i ) Therefore we can use [L (h), U (h) ] = ˆµ (h) z α 2 ˆσ(h) n. However, computing the union interval requires solving a complicated optimization problem. 11/20

13 Bootstrapping sensitivity analysis Point-identified parameter: Efron s bootstrap Bootstrap Point estimator ============ Confidence interval Partially identified parameter: An analogy Optimization Percentile Bootstrap Minimax inequality Extrema estimator ============ Confidence interval A simple procedure for simultaneous sensitivity analysis 1. Generate B random resamples of the data. For each resample, compute the extrema of IPW estimates under M β0 (Γ). 2. Construct the confidence interval using L = Q α/2 of the B minima and U = Q 1 α/2 of the B maxima. Theorem [L, U] achieves Goal 2 for M β0 (Γ) asymptotically. 12/20

14 Proof of the Theorem Partially identified parameter: Three ideas Optimization 1. Percentile Bootstrap 2. Minimax inequality Extrema estimator ============ Confidence interval 1. The sampling variability of ˆµ (h) can be captured by bootstrap. The percentile bootstrap CI is given by [ ) )] Q α (ˆµ (h) 2 b, Q 1 α (ˆµ (h) 2 b. 2. Generalized minimax inequality: Q α 2 ( inf h ) ˆµ (h) b inf h Q α 2 Percentile Bootstrap CI ) (ˆµ (h) b sup h Union CI Q 1 α 2 ) (ˆµ (h) b Q 1 α 2 ( sup h ) ˆµ (h) b. 13/20

15 Computation Partially identified parameter: Three ideas 3. Optimization Percentile Bootstrap Minimax inequality Extrema estimator ============ Confidence interval 3. Computing extrema of ˆµ (h) is a linear fractional programming: Let z i = e h(x i,y i ), we just need to solve n i=1 max or min A ( iy i 1 + zi e ) ĝ(x i ) n i=1 A ( i 1 + zi e ), ĝ(x i ) subject to z i [Γ 1, Γ], i = 1,..., n. This can be converted to a linear programming. Moreover, the solution z must have the same/opposite order as Y, so the time complexity can be reduced to O(n) (optimal). The role of Bootstrap Comapred to the union method, the workflow is greatly simplified: 1. No need to derive σ (h) analytically (though we could). 2. No need to optimize σ (h) (which is very challenging). 14/20

16 Comparison with Rosenbaum s sensitivity analysis Rosenbaum s paradigm New bootstrap approach Population Sample Super-population Design Matching Weighting Sensitivity model M( Γ) R(Γ) M(Γ) Inference Bounding p-value CI for ATE/ATT Effect modification Constant effect Allow for heterogeneity Extension Carefully developed for observational studies Can be applied to missing data problems 15/20

17 Example Fish consumption and blood mercury 873 controls: 1 serving of fish per month. 234 treated: 12 servings of fish per month. Covariates: gender, age, income (very imblanced), race, education, ever smoked, # cigarettes. Implementation details Rosenbaum s method: 1-1 matching, CI constructed by Hodges-Lehmann (assuming causal effect is constant). Our method (percentile Bootstrap): stabilized IPW for ATT w/wo augmentation by outcome linear regression. 16/20

18 Results Recall that M( Γ) R(Γ) M(Γ). 4 3 Causal effect 2 1 Matching SIPW (ATT) SAIPW (ATT) Γ Figure: The solid error bars are the range of point estimates and the dashed error bars (together with the solid bars) are the confidence intervals. The circles/triangles/squares are the mid-points of the solid bars. 17/20

19 References I J. Birmingham, A. Rotnitzky, and G. M. Fitzmaurice. Pattern-mixture and selection models for analysing longitudinal data with monotone missing patterns. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 65(1): , J. Cornfield, W. Haenszel, E. C. Hammond, A. M. Lilienfeld, M. B. Shimkin, and E. L. Wynder. Smoking and lung cancer: recent evidence and a discussion of some questions. Journal of the National Cancer Institute, 22(1): , M. J. Daniels and J. W. Hogan. Missing data in longitudinal studies: Strategies for Bayesian modeling and sensitivity analysis. CRC Press, P. Ding and T. J. VanderWeele. Sensitivity analysis without assumptions. Epidemiology, 27(3): 368, C. B. Fogarty. Studentized sensitivity analysis for the sample average treatment effect in paired observational studies. arxiv preprint arxiv: , G. W. Imbens and C. F. Manski. Confidence intervals for partially identified parameters. Econometrica, 72(6): , J. M. Robins. Association, causation, and marginal structural models. Synthese, 121(1): , J. M. Robins. Comment on covariance adjustment in randomized experiments and observational studies. Statistical Science, 17(3): , P. R. Rosenbaum. Observational Studies. Springer New York, D. Scharfstein, A. McDermott, W. Olson, and F. Wiegand. Global sensitivity analysis for repeated measures studies with informative dropout: A fully parametric approach. Statistics in Biopharmaceutical Research, 6(4): , D. O. Scharfstein, A. Rotnitzky, and J. M. Robins. Adjusting for nonignorable drop-out using semiparametric nonresponse models. Journal of the American Statistical Association, 94(448): , /20

20 References II Z. Tan. A distributional approach for causal inference using propensity scores. Journal of the American Statistical Association, 101(476): , S. Vansteelandt, E. Goetghebeur, M. G. Kenward, and G. Molenberghs. Ignorance and uncertainty regions as inferential tools in a sensitivity analysis. Statistica Sinica, 16(3): , /20

21 Possible open problems for discussion We didn t require e (h) (x, y) to be compatible (marginalizes to e 0 (x)). Can we use this property to reduce the length of CI? Not a problem for Rosenbaum s analysis because he conditions on one treated in every pair. Compatibility between A model and Y model? Can we consider different sensitivity models? Example: require h(x, y) to be Lipschitz continuous. Considered in the paper, still a LFP. Can we do average case sensitivity analysis, e.g. L1 instead of L? How can we extend this framework to other semiparametric estimation problems (e.g. Longitudinal data with non-ignorable dropout)? 20/20

arxiv: v2 [stat.me] 8 Oct 2018

arxiv: v2 [stat.me] 8 Oct 2018 SENSITIVITY ANALYSIS FOR INVERSE PROBABILITY WEIGHTING ESTIMATORS VIA THE PERCENTILE BOOTSTRAP QINGYUAN ZHAO, DYLAN S. SMALL AND BHASWAR B. BHATTACHARYA arxiv:1711.1186v [stat.me] 8 Oct 018 Department

More information

SENSITIVITY ANALYSIS FOR INVERSE PROBABILITY WEIGHTING ESTIMATORS VIA THE PERCENTILE BOOTSTRAP

SENSITIVITY ANALYSIS FOR INVERSE PROBABILITY WEIGHTING ESTIMATORS VIA THE PERCENTILE BOOTSTRAP SENSITIVITY ANALYSIS FOR INVERSE PROBABILITY WEIGHTING ESTIMATORS VIA THE PERCENTILE BOOTSTRAP QINGYUAN ZHAO, DYLAN S. SMALL AND BHASWAR B. BHATTACHARYA Department of Statistics, The Wharton School, University

More information

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs Michael J. Daniels and Chenguang Wang Jan. 18, 2009 First, we would like to thank Joe and Geert for a carefully

More information

Sensitivity analysis and distributional assumptions

Sensitivity analysis and distributional assumptions Sensitivity analysis and distributional assumptions Tyler J. VanderWeele Department of Health Studies, University of Chicago 5841 South Maryland Avenue, MC 2007, Chicago, IL 60637, USA vanderweele@uchicago.edu

More information

Selective Inference for Effect Modification

Selective Inference for Effect Modification Inference for Modification (Joint work with Dylan Small and Ashkan Ertefaie) Department of Statistics, University of Pennsylvania May 24, ACIC 2017 Manuscript and slides are available at http://www-stat.wharton.upenn.edu/~qyzhao/.

More information

arxiv: v1 [stat.me] 15 May 2011

arxiv: v1 [stat.me] 15 May 2011 Working Paper Propensity Score Analysis with Matching Weights Liang Li, Ph.D. arxiv:1105.2917v1 [stat.me] 15 May 2011 Associate Staff of Biostatistics Department of Quantitative Health Sciences, Cleveland

More information

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A. Linero and M. Daniels UF, UT-Austin SRC 2014, Galveston, TX 1 Background 2 Working model

More information

Covariate Balancing Propensity Score for General Treatment Regimes

Covariate Balancing Propensity Score for General Treatment Regimes Covariate Balancing Propensity Score for General Treatment Regimes Kosuke Imai Princeton University October 14, 2014 Talk at the Department of Psychiatry, Columbia University Joint work with Christian

More information

Combining multiple observational data sources to estimate causal eects

Combining multiple observational data sources to estimate causal eects Department of Statistics, North Carolina State University Combining multiple observational data sources to estimate causal eects Shu Yang* syang24@ncsuedu Joint work with Peng Ding UC Berkeley May 23,

More information

Estimating the Marginal Odds Ratio in Observational Studies

Estimating the Marginal Odds Ratio in Observational Studies Estimating the Marginal Odds Ratio in Observational Studies Travis Loux Christiana Drake Department of Statistics University of California, Davis June 20, 2011 Outline The Counterfactual Model Odds Ratios

More information

High Dimensional Propensity Score Estimation via Covariate Balancing

High Dimensional Propensity Score Estimation via Covariate Balancing High Dimensional Propensity Score Estimation via Covariate Balancing Kosuke Imai Princeton University Talk at Columbia University May 13, 2017 Joint work with Yang Ning and Sida Peng Kosuke Imai (Princeton)

More information

Selection on Observables: Propensity Score Matching.

Selection on Observables: Propensity Score Matching. Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017

More information

Double Robustness. Bang and Robins (2005) Kang and Schafer (2007)

Double Robustness. Bang and Robins (2005) Kang and Schafer (2007) Double Robustness Bang and Robins (2005) Kang and Schafer (2007) Set-Up Assume throughout that treatment assignment is ignorable given covariates (similar to assumption that data are missing at random

More information

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed

More information

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016 An Introduction to Causal Mediation Analysis Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016 1 Causality In the applications of statistics, many central questions

More information

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai

Weighting Methods. Harvard University STAT186/GOV2002 CAUSAL INFERENCE. Fall Kosuke Imai Weighting Methods Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Weighting Methods Stat186/Gov2002 Fall 2018 1 / 13 Motivation Matching methods for improving

More information

A weighted simulation-based estimator for incomplete longitudinal data models

A weighted simulation-based estimator for incomplete longitudinal data models To appear in Statistics and Probability Letters, 113 (2016), 16-22. doi 10.1016/j.spl.2016.02.004 A weighted simulation-based estimator for incomplete longitudinal data models Daniel H. Li 1 and Liqun

More information

A Sampling of IMPACT Research:

A Sampling of IMPACT Research: A Sampling of IMPACT Research: Methods for Analysis with Dropout and Identifying Optimal Treatment Regimes Marie Davidian Department of Statistics North Carolina State University http://www.stat.ncsu.edu/

More information

Harvard University. Harvard University Biostatistics Working Paper Series

Harvard University. Harvard University Biostatistics Working Paper Series Harvard University Harvard University Biostatistics Working Paper Series Year 2015 Paper 197 On Varieties of Doubly Robust Estimators Under Missing Not at Random With an Ancillary Variable Wang Miao Eric

More information

Estimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing

Estimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing Estimating and Using Propensity Score in Presence of Missing Background Data. An Application to Assess the Impact of Childbearing on Wellbeing Alessandra Mattei Dipartimento di Statistica G. Parenti Università

More information

Propensity Score Weighting with Multilevel Data

Propensity Score Weighting with Multilevel Data Propensity Score Weighting with Multilevel Data Fan Li Department of Statistical Science Duke University October 25, 2012 Joint work with Alan Zaslavsky and Mary Beth Landrum Introduction In comparative

More information

This is the submitted version of the following book chapter: stat08068: Double robustness, which will be

This is the submitted version of the following book chapter: stat08068: Double robustness, which will be This is the submitted version of the following book chapter: stat08068: Double robustness, which will be published in its final form in Wiley StatsRef: Statistics Reference Online (http://onlinelibrary.wiley.com/book/10.1002/9781118445112)

More information

Sensitivity Analysis for matched pair analysis of binary data: From worst case to average case analysis

Sensitivity Analysis for matched pair analysis of binary data: From worst case to average case analysis Sensitivity Analysis for matched pair analysis of binary data: From worst case to average case analysis Raiden B. Hasegawa and Dylan S. Small arxiv:1707.09549v2 [stat.me] 16 May 2018 May 16, 2018 Abstract

More information

Methods for inferring short- and long-term effects of exposures on outcomes, using longitudinal data on both measures

Methods for inferring short- and long-term effects of exposures on outcomes, using longitudinal data on both measures Methods for inferring short- and long-term effects of exposures on outcomes, using longitudinal data on both measures Ruth Keogh, Stijn Vansteelandt, Rhian Daniel Department of Medical Statistics London

More information

6 Pattern Mixture Models

6 Pattern Mixture Models 6 Pattern Mixture Models A common theme underlying the methods we have discussed so far is that interest focuses on making inference on parameters in a parametric or semiparametric model for the full data

More information

Assess Assumptions and Sensitivity Analysis. Fan Li March 26, 2014

Assess Assumptions and Sensitivity Analysis. Fan Li March 26, 2014 Assess Assumptions and Sensitivity Analysis Fan Li March 26, 2014 Two Key Assumptions 1. Overlap: 0

More information

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing Primal-dual Covariate Balance and Minimal Double Robustness via (Joint work with Daniel Percival) Department of Statistics, Stanford University JSM, August 9, 2015 Outline 1 2 3 1/18 Setting Rubin s causal

More information

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes

Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes Statistical Analysis of Randomized Experiments with Nonignorable Missing Binary Outcomes Kosuke Imai Department of Politics Princeton University July 31 2007 Kosuke Imai (Princeton University) Nonignorable

More information

Global Sensitivity Analysis for Repeated Measures Studies with. Informative Drop-out: A Fully Parametric Approach

Global Sensitivity Analysis for Repeated Measures Studies with. Informative Drop-out: A Fully Parametric Approach Global Sensitivity Analysis for Repeated Measures Studies with Informative Drop-out: A Fully Parametric Approach Daniel Scharfstein (dscharf@jhsph.edu) Aidan McDermott (amcdermo@jhsph.edu) Department of

More information

Flexible Estimation of Treatment Effect Parameters

Flexible Estimation of Treatment Effect Parameters Flexible Estimation of Treatment Effect Parameters Thomas MaCurdy a and Xiaohong Chen b and Han Hong c Introduction Many empirical studies of program evaluations are complicated by the presence of both

More information

A Flexible Bayesian Approach to Monotone Missing. Data in Longitudinal Studies with Nonignorable. Missingness with Application to an Acute

A Flexible Bayesian Approach to Monotone Missing. Data in Longitudinal Studies with Nonignorable. Missingness with Application to an Acute A Flexible Bayesian Approach to Monotone Missing Data in Longitudinal Studies with Nonignorable Missingness with Application to an Acute Schizophrenia Clinical Trial Antonio R. Linero, Michael J. Daniels

More information

Propensity Score Methods for Causal Inference

Propensity Score Methods for Causal Inference John Pura BIOS790 October 2, 2015 Causal inference Philosophical problem, statistical solution Important in various disciplines (e.g. Koch s postulates, Bradford Hill criteria, Granger causality) Good

More information

ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW

ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW SSC Annual Meeting, June 2015 Proceedings of the Survey Methods Section ANALYSIS OF ORDINAL SURVEY RESPONSES WITH DON T KNOW Xichen She and Changbao Wu 1 ABSTRACT Ordinal responses are frequently involved

More information

Bayesian regression tree models for causal inference: regularization, confounding and heterogeneity

Bayesian regression tree models for causal inference: regularization, confounding and heterogeneity Bayesian regression tree models for causal inference: regularization, confounding and heterogeneity P. Richard Hahn, Jared Murray, and Carlos Carvalho June 22, 2017 The problem setting We want to estimate

More information

Propensity Score Analysis with Hierarchical Data

Propensity Score Analysis with Hierarchical Data Propensity Score Analysis with Hierarchical Data Fan Li Alan Zaslavsky Mary Beth Landrum Department of Health Care Policy Harvard Medical School May 19, 2008 Introduction Population-based observational

More information

Identification Problem for The Analysis of Binary Data with Non-ignorable Missing

Identification Problem for The Analysis of Binary Data with Non-ignorable Missing Identification Problem for The Analysis of Binary Data with Non-ignorable Missing Kosuke Morikawa, Yutaka Kano Division of Mathematical Science, Graduate School of Engineering Science, Osaka University,

More information

Causal Inference Lecture Notes: Selection Bias in Observational Studies

Causal Inference Lecture Notes: Selection Bias in Observational Studies Causal Inference Lecture Notes: Selection Bias in Observational Studies Kosuke Imai Department of Politics Princeton University April 7, 2008 So far, we have studied how to analyze randomized experiments.

More information

Statistica Sinica Preprint No: SS R1

Statistica Sinica Preprint No: SS R1 Statistica Sinica Preprint No: SS-2016-0133R1 Title Sensitivity Analysis for Unmeasured Confounding in Coarse Structural Nested Mean Models Manuscript ID SS-2016-0133R1 URL http://www.stat.sinica.edu.tw/statistica/

More information

arxiv: v2 [stat.me] 17 Jan 2017

arxiv: v2 [stat.me] 17 Jan 2017 Semiparametric Estimation with Data Missing Not at Random Using an Instrumental Variable arxiv:1607.03197v2 [stat.me] 17 Jan 2017 BaoLuo Sun 1, Lan Liu 1, Wang Miao 1,4, Kathleen Wirth 2,3, James Robins

More information

arxiv: v2 [stat.me] 24 Apr 2018

arxiv: v2 [stat.me] 24 Apr 2018 Biometrika, pp. 1 4 Printed in Great Britain Identification and Estimation of Causal Effects with Outcomes Truncated by Death arxiv:1605.05309v2 [stat.me] 24 Apr 2018 BY LINBO WANG Department of Biostatistics,

More information

Bayesian methods for missing data: part 1. Key Concepts. Nicky Best and Alexina Mason. Imperial College London

Bayesian methods for missing data: part 1. Key Concepts. Nicky Best and Alexina Mason. Imperial College London Bayesian methods for missing data: part 1 Key Concepts Nicky Best and Alexina Mason Imperial College London BAYES 2013, May 21-23, Erasmus University Rotterdam Missing Data: Part 1 BAYES2013 1 / 68 Outline

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2015 Paper 341 The Statistics of Sensitivity Analyses Alexander R. Luedtke Ivan Diaz Mark J. van der

More information

Inference on Breakdown Frontiers

Inference on Breakdown Frontiers Inference on Breakdown Frontiers Matthew A. Masten Alexandre Poirier May 12, 2017 Abstract A breakdown frontier is the boundary between the set of assumptions which lead to a specific conclusion and those

More information

A Sensitivity Analysis for Missing Outcomes Due to Truncation-by-Death under the Matched-Pairs Design

A Sensitivity Analysis for Missing Outcomes Due to Truncation-by-Death under the Matched-Pairs Design Research Article Statistics Received XXXX www.interscience.wiley.com DOI: 10.1002/sim.0000 A Sensitivity Analysis for Missing Outcomes Due to Truncation-by-Death under the Matched-Pairs Design Kosuke Imai

More information

What s New in Econometrics. Lecture 1

What s New in Econometrics. Lecture 1 What s New in Econometrics Lecture 1 Estimation of Average Treatment Effects Under Unconfoundedness Guido Imbens NBER Summer Institute, 2007 Outline 1. Introduction 2. Potential Outcomes 3. Estimands and

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2015 Paper 334 Targeted Estimation and Inference for the Sample Average Treatment Effect Laura B. Balzer

More information

Discussion of Identifiability and Estimation of Causal Effects in Randomized. Trials with Noncompliance and Completely Non-ignorable Missing Data

Discussion of Identifiability and Estimation of Causal Effects in Randomized. Trials with Noncompliance and Completely Non-ignorable Missing Data Biometrics 000, 000 000 DOI: 000 000 0000 Discussion of Identifiability and Estimation of Causal Effects in Randomized Trials with Noncompliance and Completely Non-ignorable Missing Data Dylan S. Small

More information

Mediation Analysis for Health Disparities Research

Mediation Analysis for Health Disparities Research Mediation Analysis for Health Disparities Research Ashley I Naimi, PhD Oct 27 2016 @ashley_naimi wwwashleyisaacnaimicom ashleynaimi@pittedu Orientation 24 Numbered Equations Slides at: wwwashleyisaacnaimicom/slides

More information

Help! Statistics! Mediation Analysis

Help! Statistics! Mediation Analysis Help! Statistics! Lunch time lectures Help! Statistics! Mediation Analysis What? Frequently used statistical methods and questions in a manageable timeframe for all researchers at the UMCG. No knowledge

More information

Balancing Covariates via Propensity Score Weighting

Balancing Covariates via Propensity Score Weighting Balancing Covariates via Propensity Score Weighting Kari Lock Morgan Department of Statistics Penn State University klm47@psu.edu Stochastic Modeling and Computational Statistics Seminar October 17, 2014

More information

Investigating mediation when counterfactuals are not metaphysical: Does sunlight exposure mediate the effect of eye-glasses on cataracts?

Investigating mediation when counterfactuals are not metaphysical: Does sunlight exposure mediate the effect of eye-glasses on cataracts? Investigating mediation when counterfactuals are not metaphysical: Does sunlight exposure mediate the effect of eye-glasses on cataracts? Brian Egleston Fox Chase Cancer Center Collaborators: Daniel Scharfstein,

More information

Causal Inference with Measurement Error

Causal Inference with Measurement Error Causal Inference with Measurement Error by Di Shu A thesis presented to the University of Waterloo in fulfillment of the thesis requirement for the degree of Doctor of Philosophy in Statistics Waterloo,

More information

7 Sensitivity Analysis

7 Sensitivity Analysis 7 Sensitivity Analysis A recurrent theme underlying methodology for analysis in the presence of missing data is the need to make assumptions that cannot be verified based on the observed data. If the assumption

More information

Flexible mediation analysis in the presence of non-linear relations: beyond the mediation formula.

Flexible mediation analysis in the presence of non-linear relations: beyond the mediation formula. FACULTY OF PSYCHOLOGY AND EDUCATIONAL SCIENCES Flexible mediation analysis in the presence of non-linear relations: beyond the mediation formula. Modern Modeling Methods (M 3 ) Conference Beatrijs Moerkerke

More information

Comparison of Three Approaches to Causal Mediation Analysis. Donna L. Coffman David P. MacKinnon Yeying Zhu Debashis Ghosh

Comparison of Three Approaches to Causal Mediation Analysis. Donna L. Coffman David P. MacKinnon Yeying Zhu Debashis Ghosh Comparison of Three Approaches to Causal Mediation Analysis Donna L. Coffman David P. MacKinnon Yeying Zhu Debashis Ghosh Introduction Mediation defined using the potential outcomes framework natural effects

More information

IP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS IPW and MSM

IP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS IPW and MSM IP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS 776 1 12 IPW and MSM IP weighting and marginal structural models ( 12) Outline 12.1 The causal question 12.2 Estimating IP weights via modeling

More information

finite-sample optimal estimation and inference on average treatment effects under unconfoundedness

finite-sample optimal estimation and inference on average treatment effects under unconfoundedness finite-sample optimal estimation and inference on average treatment effects under unconfoundedness Timothy Armstrong (Yale University) Michal Kolesár (Princeton University) September 2017 Introduction

More information

arxiv: v2 [stat.me] 21 Nov 2016

arxiv: v2 [stat.me] 21 Nov 2016 Biometrika, pp. 1 8 Printed in Great Britain On falsification of the binary instrumental variable model arxiv:1605.03677v2 [stat.me] 21 Nov 2016 BY LINBO WANG Department of Biostatistics, Harvard School

More information

Comment: Understanding OR, PS and DR

Comment: Understanding OR, PS and DR Statistical Science 2007, Vol. 22, No. 4, 560 568 DOI: 10.1214/07-STS227A Main article DOI: 10.1214/07-STS227 c Institute of Mathematical Statistics, 2007 Comment: Understanding OR, PS and DR Zhiqiang

More information

Marginal and Nested Structural Models Using Instrumental Variables

Marginal and Nested Structural Models Using Instrumental Variables Marginal and Nested Structural Models Using Instrumental Variables Zhiqiang TAN The objective of many scientific studies is to evaluate the effect of a treatment on an outcome of interest ceteris paribus.

More information

War and Wages: The Strength of Instrumental Variables and Their Sensitivity to Unobserved Biases

War and Wages: The Strength of Instrumental Variables and Their Sensitivity to Unobserved Biases University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 2008 War and Wages: The Strength of Instrumental Variables and Their Sensitivity to Unobserved Biases Dylan S. Small

More information

Propensity score modelling in observational studies using dimension reduction methods

Propensity score modelling in observational studies using dimension reduction methods University of Colorado, Denver From the SelectedWorks of Debashis Ghosh 2011 Propensity score modelling in observational studies using dimension reduction methods Debashis Ghosh, Penn State University

More information

Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference 1

Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference 1 Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference 1 Gary King GaryKing.org April 13, 2014 1 c Copyright 2014 Gary King, All Rights Reserved. Gary King ()

More information

Casual Mediation Analysis

Casual Mediation Analysis Casual Mediation Analysis Tyler J. VanderWeele, Ph.D. Upcoming Seminar: April 21-22, 2017, Philadelphia, Pennsylvania OXFORD UNIVERSITY PRESS Explanation in Causal Inference Methods for Mediation and Interaction

More information

Semi and Nonparametric Models in Econometrics

Semi and Nonparametric Models in Econometrics Semi and Nonparametric Models in Econometrics Part 4: partial identification Xavier d Haultfoeuille CREST-INSEE Outline Introduction First examples: missing data Second example: incomplete models Inference

More information

Propensity Score Methods for Estimating Causal Effects from Complex Survey Data

Propensity Score Methods for Estimating Causal Effects from Complex Survey Data Propensity Score Methods for Estimating Causal Effects from Complex Survey Data Dissertation Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy in the Graduate School

More information

Global Sensitivity Analysis for Repeated Measures Studies with Informative Drop-out: A Semi-Parametric Approach

Global Sensitivity Analysis for Repeated Measures Studies with Informative Drop-out: A Semi-Parametric Approach Global for Repeated Measures Studies with Informative Drop-out: A Semi-Parametric Approach Daniel Aidan McDermott Ivan Diaz Johns Hopkins University Ibrahim Turkoz Janssen Research and Development September

More information

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score Causal Inference with General Treatment Regimes: Generalizing the Propensity Score David van Dyk Department of Statistics, University of California, Irvine vandyk@stat.harvard.edu Joint work with Kosuke

More information

DOUBLY ROBUST NONPARAMETRIC MULTIPLE IMPUTATION FOR IGNORABLE MISSING DATA

DOUBLY ROBUST NONPARAMETRIC MULTIPLE IMPUTATION FOR IGNORABLE MISSING DATA Statistica Sinica 22 (2012), 149-172 doi:http://dx.doi.org/10.5705/ss.2010.069 DOUBLY ROBUST NONPARAMETRIC MULTIPLE IMPUTATION FOR IGNORABLE MISSING DATA Qi Long, Chiu-Hsieh Hsu and Yisheng Li Emory University,

More information

Calibration Estimation for Semiparametric Copula Models under Missing Data

Calibration Estimation for Semiparametric Copula Models under Missing Data Calibration Estimation for Semiparametric Copula Models under Missing Data Shigeyuki Hamori 1 Kaiji Motegi 1 Zheng Zhang 2 1 Kobe University 2 Renmin University of China Economics and Economic Growth Centre

More information

Selective Inference for Effect Modification: An Empirical Investigation

Selective Inference for Effect Modification: An Empirical Investigation Observational Studies () Submitted ; Published Selective Inference for Effect Modification: An Empirical Investigation Qingyuan Zhao Department of Statistics The Wharton School, University of Pennsylvania

More information

General Doubly Robust Identification and Estimation

General Doubly Robust Identification and Estimation General Doubly Robust Identification and Estimation Arthur Lewbel, Jin-Young Choi, and Zhuzhu Zhou original Nov. 2016, revised August 2018 THIS VERSION IS PRELIMINARY AND INCOMPLETE Abstract Consider two

More information

5 Methods Based on Inverse Probability Weighting Under MAR

5 Methods Based on Inverse Probability Weighting Under MAR 5 Methods Based on Inverse Probability Weighting Under MAR The likelihood-based and multiple imputation methods we considered for inference under MAR in Chapters 3 and 4 are based, either directly or indirectly,

More information

Part IV Statistics in Epidemiology

Part IV Statistics in Epidemiology Part IV Statistics in Epidemiology There are many good statistical textbooks on the market, and we refer readers to some of these textbooks when they need statistical techniques to analyze data or to interpret

More information

Ratio of Mediator Probability Weighting for Estimating Natural Direct and Indirect Effects

Ratio of Mediator Probability Weighting for Estimating Natural Direct and Indirect Effects Ratio of Mediator Probability Weighting for Estimating Natural Direct and Indirect Effects Guanglei Hong University of Chicago, 5736 S. Woodlawn Ave., Chicago, IL 60637 Abstract Decomposing a total causal

More information

Causal Inference Basics

Causal Inference Basics Causal Inference Basics Sam Lendle October 09, 2013 Observed data, question, counterfactuals Observed data: n i.i.d copies of baseline covariates W, treatment A {0, 1}, and outcome Y. O i = (W i, A i,

More information

Robustness to Parametric Assumptions in Missing Data Models

Robustness to Parametric Assumptions in Missing Data Models Robustness to Parametric Assumptions in Missing Data Models Bryan Graham NYU Keisuke Hirano University of Arizona April 2011 Motivation Motivation We consider the classic missing data problem. In practice

More information

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Overview In observational and experimental studies, the goal may be to estimate the effect

More information

Telescope Matching: A Flexible Approach to Estimating Direct Effects *

Telescope Matching: A Flexible Approach to Estimating Direct Effects * Telescope Matching: A Flexible Approach to Estimating Direct Effects * Matthew Blackwell Anton Strezhnev August 4, 2018 Abstract Estimating the direct effect of a treatment fixing the value of a consequence

More information

Balancing Covariates via Propensity Score Weighting: The Overlap Weights

Balancing Covariates via Propensity Score Weighting: The Overlap Weights Balancing Covariates via Propensity Score Weighting: The Overlap Weights Kari Lock Morgan Department of Statistics Penn State University klm47@psu.edu PSU Methodology Center Brown Bag April 6th, 2017 Joint

More information

Simulation-Extrapolation for Estimating Means and Causal Effects with Mismeasured Covariates

Simulation-Extrapolation for Estimating Means and Causal Effects with Mismeasured Covariates Observational Studies 1 (2015) 241-290 Submitted 4/2015; Published 10/2015 Simulation-Extrapolation for Estimating Means and Causal Effects with Mismeasured Covariates J.R. Lockwood Educational Testing

More information

Modification and Improvement of Empirical Likelihood for Missing Response Problem

Modification and Improvement of Empirical Likelihood for Missing Response Problem UW Biostatistics Working Paper Series 12-30-2010 Modification and Improvement of Empirical Likelihood for Missing Response Problem Kwun Chuen Gary Chan University of Washington - Seattle Campus, kcgchan@u.washington.edu

More information

Improved Doubly Robust Estimation when Data are Monotonely Coarsened, with Application to Longitudinal Studies with Dropout

Improved Doubly Robust Estimation when Data are Monotonely Coarsened, with Application to Longitudinal Studies with Dropout Biometrics 64, 1 23 December 2009 DOI: 10.1111/j.1541-0420.2005.00454.x Improved Doubly Robust Estimation when Data are Monotonely Coarsened, with Application to Longitudinal Studies with Dropout Anastasios

More information

A mean score method for sensitivity analysis. to departures from the missing at random. assumption in randomised trials

A mean score method for sensitivity analysis. to departures from the missing at random. assumption in randomised trials arxiv:1705.00951v1 [stat.me] 2 May 2017 A mean score method for sensitivity analysis to departures from the missing at random assumption in randomised trials Ian R. White 1,2,, James Carpenter 2,3, and

More information

Assessing the Assumption of Strongly Ignorable Treatment Assignment Under Assumed Causal Models

Assessing the Assumption of Strongly Ignorable Treatment Assignment Under Assumed Causal Models Assessing the Assumption of Strongly Ignorable Treatment Assignment Under Assumed Causal Models Takeshi Emura, Jingfang Wang and Hitomi Katsuyama April 23, 2008 Abstract The assumption of strong ignorability

More information

Estimating direct effects in cohort and case-control studies

Estimating direct effects in cohort and case-control studies Estimating direct effects in cohort and case-control studies, Ghent University Direct effects Introduction Motivation The problem of standard approaches Controlled direct effect models In many research

More information

Experiments & Observational Studies: Causal Inference in Statistics

Experiments & Observational Studies: Causal Inference in Statistics Experiments & Observational Studies: Causal Inference in Statistics Paul R. Rosenbaum Department of Statistics University of Pennsylvania Philadelphia, PA 19104-6340 1 A Causal Question At age 45, Ms.

More information

The propensity score with continuous treatments

The propensity score with continuous treatments 7 The propensity score with continuous treatments Keisuke Hirano and Guido W. Imbens 1 7.1 Introduction Much of the work on propensity score analysis has focused on the case in which the treatment is binary.

More information

A Measure of Robustness to Misspecification

A Measure of Robustness to Misspecification A Measure of Robustness to Misspecification Susan Athey Guido W. Imbens December 2014 Graduate School of Business, Stanford University, and NBER. Electronic correspondence: athey@stanford.edu. Graduate

More information

Unbiased estimation of exposure odds ratios in complete records logistic regression

Unbiased estimation of exposure odds ratios in complete records logistic regression Unbiased estimation of exposure odds ratios in complete records logistic regression Jonathan Bartlett London School of Hygiene and Tropical Medicine www.missingdata.org.uk Centre for Statistical Methodology

More information

Missing Data: Theory & Methods. Introduction

Missing Data: Theory & Methods. Introduction STAT 992/BMI 826 University of Wisconsin-Madison Missing Data: Theory & Methods Chapter 1 Introduction Lu Mao lmao@biostat.wisc.edu 1-1 Introduction Statistical analysis with missing data is a rich and

More information

Ignoring the matching variables in cohort studies - when is it valid, and why?

Ignoring the matching variables in cohort studies - when is it valid, and why? Ignoring the matching variables in cohort studies - when is it valid, and why? Arvid Sjölander Abstract In observational studies of the effect of an exposure on an outcome, the exposure-outcome association

More information

INVERSE PROBABILITY WEIGHTED ESTIMATION FOR GENERAL MISSING DATA PROBLEMS

INVERSE PROBABILITY WEIGHTED ESTIMATION FOR GENERAL MISSING DATA PROBLEMS IVERSE PROBABILITY WEIGHTED ESTIMATIO FOR GEERAL MISSIG DATA PROBLEMS Jeffrey M. Wooldridge Department of Economics Michigan State University East Lansing, MI 48824-1038 (517) 353-5972 wooldri1@msu.edu

More information

Moving the Goalposts: Addressing Limited Overlap in Estimation of Average Treatment Effects by Changing the Estimand

Moving the Goalposts: Addressing Limited Overlap in Estimation of Average Treatment Effects by Changing the Estimand Moving the Goalposts: Addressing Limited Overlap in Estimation of Average Treatment Effects by Changing the Estimand Richard K. Crump V. Joseph Hotz Guido W. Imbens Oscar Mitnik First Draft: July 2004

More information

Gov 2002: 13. Dynamic Causal Inference

Gov 2002: 13. Dynamic Causal Inference Gov 2002: 13. Dynamic Causal Inference Matthew Blackwell December 19, 2015 1 / 33 1. Time-varying treatments 2. Marginal structural models 2 / 33 1/ Time-varying treatments 3 / 33 Time-varying treatments

More information

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3

Prerequisite: STATS 7 or STATS 8 or AP90 or (STATS 120A and STATS 120B and STATS 120C). AP90 with a minimum score of 3 University of California, Irvine 2017-2018 1 Statistics (STATS) Courses STATS 5. Seminar in Data Science. 1 Unit. An introduction to the field of Data Science; intended for entering freshman and transfers.

More information

Marginal, crude and conditional odds ratios

Marginal, crude and conditional odds ratios Marginal, crude and conditional odds ratios Denitions and estimation Travis Loux Gradute student, UC Davis Department of Statistics March 31, 2010 Parameter Denitions When measuring the eect of a binary

More information

Propensity score weighting for causal inference with multi-stage clustered data

Propensity score weighting for causal inference with multi-stage clustered data Propensity score weighting for causal inference with multi-stage clustered data Shu Yang Department of Statistics, North Carolina State University arxiv:1607.07521v1 stat.me] 26 Jul 2016 Abstract Propensity

More information

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers: Use of Matching Methods for Causal Inference in Experimental and Observational Studies Kosuke Imai Department of Politics Princeton University April 13, 2009 Kosuke Imai (Princeton University) Matching

More information

arxiv: v1 [stat.me] 28 Jul 2017

arxiv: v1 [stat.me] 28 Jul 2017 Sensitivity Analysis for Unmeasured Confounding in Meta-Analyses Maya B. Mathur 1,2 and Tyler J. VanderWeele 1,3 arxiv:1707.09076v1 [stat.me] 28 Jul 2017 1 Department of Biostatistics, Harvard T. H. Chan

More information