Harvard University. Harvard University Biostatistics Working Paper Series

Similar documents
Harvard University. A Note on the Control Function Approach with an Instrumental Variable and a Binary Outcome. Eric Tchetgen Tchetgen

Sensitivity analysis and distributional assumptions

Harvard University. Harvard University Biostatistics Working Paper Series

A Unification of Mediation and Interaction. A 4-Way Decomposition. Tyler J. VanderWeele

Modification and Improvement of Empirical Likelihood for Missing Response Problem

DATA-ADAPTIVE VARIABLE SELECTION FOR

University of California, Berkeley

Instrumental variables estimation in the Cox Proportional Hazard regression model

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD

University of California, Berkeley

arxiv: v2 [stat.me] 21 Nov 2016

Ignoring the matching variables in cohort studies - when is it valid, and why?

Harvard University. Harvard University Biostatistics Working Paper Series. Sheng-Hsuan Lin Jessica G. Young Roger Logan

Harvard University. Harvard University Biostatistics Working Paper Series. Semiparametric Estimation of Models for Natural Direct and Indirect Effects

University of California, Berkeley

Unbiased estimation of exposure odds ratios in complete records logistic regression

Simple Sensitivity Analysis for Differential Measurement Error. By Tyler J. VanderWeele and Yige Li Harvard University, Cambridge, MA, U.S.A.

Casual Mediation Analysis

BIAS OF MAXIMUM-LIKELIHOOD ESTIMATES IN LOGISTIC AND COX REGRESSION MODELS: A COMPARATIVE SIMULATION STUDY

Rank preserving Structural Nested Distribution Model (RPSNDM) for Continuous

Instrumental variables as bias amplifiers with general outcome and confounding

Harvard University. Harvard University Biostatistics Working Paper Series

University of California, Berkeley

University of California, Berkeley

Harvard University. Harvard University Biostatistics Working Paper Series

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

A Hypothesis Test for the End of a Common Source Outbreak

Estimating direct effects in cohort and case-control studies

Additive and multiplicative models for the joint effect of two risk factors

Generalized confidence intervals for the ratio or difference of two means for lognormal populations with zeros

Assess Assumptions and Sensitivity Analysis. Fan Li March 26, 2014

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates

University of California, Berkeley

Standardization methods have been used in epidemiology. Marginal Structural Models as a Tool for Standardization ORIGINAL ARTICLE

Causal Modeling in Environmental Epidemiology. Joel Schwartz Harvard University

Natural direct and indirect effects on the exposed: effect decomposition under. weaker assumptions

Strategy of Bayesian Propensity. Score Estimation Approach. in Observational Study

Attributable Risk Function in the Proportional Hazards Model

Measurement error effects on bias and variance in two-stage regression, with application to air pollution epidemiology

University of California, Berkeley

Memorial Sloan-Kettering Cancer Center

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

Research Note: A more powerful test statistic for reasoning about interference between units

Causal inference in epidemiological practice

Extending causal inferences from a randomized trial to a target population

Measurement Error in Spatial Modeling of Environmental Exposures

Statistics in medicine

Harvard University. Inverse Odds Ratio-Weighted Estimation for Causal Mediation Analysis. Eric J. Tchetgen Tchetgen

Targeted Maximum Likelihood Estimation in Safety Analysis

Harvard University. Harvard University Biostatistics Working Paper Series

arxiv: v2 [stat.me] 24 Apr 2018

Power and Sample Size Calculations with the Additive Hazards Model

Harvard University. Harvard University Biostatistics Working Paper Series. On Partial Identification of the Pure Direct Effect

Causal exposure effect on a time-to-event response using an IV.

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

The GENIUS Approach to Robust Mendelian Randomization Inference

Sensitivity Analysis with Several Unmeasured Confounders

University of Michigan School of Public Health

Tests for the Odds Ratio in a Matched Case-Control Design with a Quantitative X

Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective. Anastasios (Butch) Tsiatis and Xiaofei Bai

Harvard University. The use of permutation tests for the analysis of parallel and stepped-wedge cluster randomized trials. Rui Wang Victor DeGruttola

Estimation of direct causal effects.

Survival Analysis I (CHL5209H)

Nonlinear Difference-in-Differences and Difference-in- Difference-in-Differences with Placebo and Surrogate Outcomes

Constructing Confidence Intervals of the Summary Statistics in the Least-Squares SROC Model

Propensity Score Methods for Causal Inference

UNIVERSITY OF CALIFORNIA, SAN DIEGO

The identi cation of synergism in the su cient-component cause framework

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Introduction to Statistical Analysis

On the Use of the Bross Formula for Prioritizing Covariates in the High-Dimensional Propensity Score Algorithm

The Harvard community has made this article openly available. Please share how this access benefits you. Your story matters

Confounding, mediation and colliding

University of California, Berkeley

SIMPLE EXAMPLES OF ESTIMATING CAUSAL EFFECTS USING TARGETED MAXIMUM LIKELIHOOD ESTIMATION

Control Function Instrumental Variable Estimation of Nonlinear Causal Effect Models

Selection on Observables: Propensity Score Matching.

Analysis of propensity score approaches in difference-in-differences designs

Estimating the Marginal Odds Ratio in Observational Studies

Causal Inference Basics

Marginal, crude and conditional odds ratios

Causal Inference. Prediction and causation are very different. Typical questions are:

The Use of Spatial Exposure Predictions in Health Effects Models: An Application to PM Epidemiology

Bounds on Direct Effects in the Presence of Confounded Intermediate Variables

An Introduction to Causal Analysis on Observational Data using Propensity Scores

Comparison of Three Approaches to Causal Mediation Analysis. Donna L. Coffman David P. MacKinnon Yeying Zhu Debashis Ghosh

Effects of multiple interventions

A proof of Bell s inequality in quantum mechanics using causal interactions

Approximate Median Regression via the Box-Cox Transformation

Specification Errors, Measurement Errors, Confounding

Empirical and counterfactual conditions for su cient cause interactions

Marginal Structural Cox Model for Survival Data with Treatment-Confounder Feedback

In some settings, the effect of a particular exposure may be

EXAMINATION: QUANTITATIVE EMPIRICAL METHODS. Yale University. Department of Political Science

Correlation and regression

Flexible modelling of the cumulative effects of time-varying exposures

The identification of synergism in the sufficient-component cause framework

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case

Statistical integration of disparate information for spatially-resolved PM exposure estimation

Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha

Transcription:

Harvard University Harvard University Biostatistics Working Paper Series Year 2015 Paper 192 Negative Outcome Control for Unobserved Confounding Under a Cox Proportional Hazards Model Eric J. Tchetgen Tchetgen Tamar Sofer David Richardson Harvard T.H. Chan School of Public Health, etchetge@hsph.harvard.edu University of Washington, tsofer@uw.edu University of North Carolina, david.richardson@unc.edu This working paper is hosted by The Berkeley Electronic Press (bepress) and may not be commercially reproduced without the permission of the copyright holder. http://biostats.bepress.com/harvardbiostat/paper192 Copyright c 2015 by the authors.

Commentary Negative outcome control for unobserved confounding under a Cox proportional hazards model Eric J. Tchetgen Tchetgen 1,2,Tamar Sofer 3 and David Richardson 4 Departments of Biostatistics 1 and Epidemiology 2, Harvard University Department of Biostatistics 3, University of Washington, Department of Epidemiology 4,University of North Carolina Corresponding author: Eric J. Tchetgen Tchetgen, Department of Epidemiology, Harvard School of Public Health 677 Huntington Avenue, Boston, MA 02115. 1 Hosted by The Berkeley Electronic Press

Unobserved confounding can seldom be ruled out with certainty in observational studies. An approach sometimes used to determine the extent to which a treatment effect on a primary outcome may be subject to confounding bias, is to evaluate whether the treatment of interest is associated with a so-called negative control outcome after adjusting for all observed confounders. 1 6 A secondary outcome is then said to be a valid negative control variable, to the extent that it is influenced by unobserved confounders of the treatment effects on the primary outcome, while not causally influenced by the treatment. 3 Thus, a negative control outcome found to be empirically associated with the treatment may provide compelling evidence of unobserved confounding for the primary outcome, provided that no unobserved confounder of the treatment-negative control outcome fails also to confound the treatment-primary outcome relation. 3 Suppose that in a certain application, beyond assessing the presence of an association between a negative control outcome and the treatment to detect confounding bias, one may wish to use the negative control outcome to correct for confounding bias. A natural next step might be to consider the magnitude of the estimated association as an unbiased estimate of bias due to unmeasured confounding, and one might be tempted to simply correct the confounded estimate of the treatment-outcome association by subtracting the estimated bias. While this ad-hoc bias correction is intuitively appealing, it has previously been noted that it is sensitive to the relative scales of the primary and negative control outcomes. 5 A notable diffi culty with the approach is that it requires interpreting the bias observed for the negative control outcome as somehow equivalent to the bias one would have observed between the treatment and the primary outcome under the null hypothesis of no causal effect of the treatment (more precisely between the treatment and the treatment-free potential outcome). A prerequisite for such "bias equivalence" is that the primary and negative control outcomes are measured on scales of comparable magnitude. An important case where bias equivalence can be expected to hold is when the control outcome is a baseline measure of the outcome process prior 2 http://biostats.bepress.com/harvardbiostat/paper192

to treatment. In such settings, the well known difference-in-differences approach to control for unobserved confounding may formally be justified as a negative outcome control approach. 5 However, outside of this special situation, the assumption of bias equivalence may not be appropriate if the outcomes are clearly measured on different scales, such as, say if the negative control outcome is dichotomous while the primary outcome is continuous. The assumption would then likely be violated since an additive association of the exposure with the control outcome would a priori be restricted by the binary nature of the outcome, while the additive association of the outcome in view with exposure would not. 5 In this note, we describe another prominent setting where indirect adjustment for unobserved confounding can be achieved by differencing regression estimates of the effect of treatment on the primary and negative outcome. Specifically, we show that when the primary and negative outcomes are time to event outcomes that follow a certain Cox proportional hazards model respectively, then under fairly reasonable monotonicity assumptions, a consistent estimate of the treatment causal effect can be obtained by simply subtracting the estimated log-hazards ratio for the treatmentnegative outcome association from that of the treatment-primary outcome association. Thus, we show that for the Cox proportional hazards regression model, the intuitive indirect adjustment for unobserved confounding by differencing effect estimates is in fact sound for a wide range of settings of common interest. To formally state the result requires introducing some notation. Let T denote the primary time to event outcome, N denote the negative control time to event outcome, A denote the exposure of interest, and X denote a set of pre-exposure covariates. In addition, suppose that the hazard of N conditional on A and X, h N A,X (n) is given by: h N A,X (n) = exp {β(a, X)} h N A=0,X (n) (1) 3 Hosted by The Berkeley Electronic Press

where the log hazards ratio function β(a, X) is independent of N, satisfies β(0, X) = 0, and therefore the proportional hazards assumption holds conditional on X. Likewise, suppose that the hazard function of T conditional on A, X also satisfies the proportional hazards assumption within levels of X: h T A,X (t) = exp {η(a, X)} h T A=0,X (t) (2) In addition, let N a and T a denote the potential outcomes under treatment a for the negative and primary outcomes respectively. Under consistency, we have that N a = N and T a = T if A = a. Under the assumption that N is a valid negative control outcome, we also have that N a = N 1 a = N, that is A has no individual causal effect on N. We further let U denote an unmeasure confounder which is a continous unobserved common cause of A, T and N. Unobserved confounding can naturally be incorporated algebraically by supposing that there exist functions h 0 and h 1 such that given X, the potential outcomes in the absence of treatment T 0 and N 0 = N satisfy T 0 = h 0 (U, X) (3) N = h 1 (U, X) (4) with h j (u, x) monotone increasing in u for all x, but otherwise unrestricted functions. We then have the following result. Result : Under assumptions (1) (4), we have that ψ (X) = h T 1 A=1,X (t) h T0 A=1,X (t) = exp (η(1, X) β(1, X)) and therefore the causal log hazards ratio ψ (X) encoding the effect of A for the exposed conditional 4 http://biostats.bepress.com/harvardbiostat/paper192

on X can be obtained by simply subtracting the log- hazards ratio of A for the negative control outcome from that for the primary outcome. The proof of Result is provided in the Supplemental Materials, where we show that the result continues to hold even if the unobserved confounder of the A-T association is distinct from that of the A-N association, provided that the association between the former and treatment is equivalent to that of the second in a sense made precise in the Supplemental Materials. 5 Hosted by The Berkeley Electronic Press

APPENDIX Proof of Result. Let S N A,X (n) = P {N n A, X} and F N A,X (n) = P {N < n A, X}. Let F T0 A,X and F U A,X be likewise defined. Then, by Theorem 1 of Sofer et al, 11 we have that under monotonicity assumptions (3) and (4), for all v in the unit interval, F T0 A,X F 1 T 0 A=0,X (v) (5) = F N A,X F 1 N A,X (v) (6) = F U A,X F 1 U A=0,X (v) (7) Note that, under the proportional hazards assumption, we have that S N A,X (n) = exp { exp (β(a, X)) H N A=0,X (n) }, where H N A=0,X (n) = n 0 h N A=0,X (m) dm, and F 1 N A,X (v) = S 1 N A,X (1 v) = H 1 N A=0,X { log (1 v) exp ( β(a, X))} therefore F N A=0,X F 1 N A,X (v) ( )} = 1 exp { H N A=0,X F 1 N A,X (v) ( )} = 1 exp { H N A=0,X H 1 N A=0,X { log (1 v) exp ( β(a, X))} = 1 exp {log (1 v) exp ( β(a, X))} 6 http://biostats.bepress.com/harvardbiostat/paper192

does not depend on the conditional cumulative hazard function H N A=0,X. Consider the causal quantile-quantile transformation ψ (v, A, X) = F T0 A,X F 1 T A A,X (v) Under the Cox model h T A,X (t) = exp (η(a, X)) h T A=0,X (t) holds, we then have that, ψ (v, A, X) = F T0 A,X F 1 T 0 A=0,X F T 0 A=0,X F 1 T A A,X (v) = F U A,X F 1 U A=0,X F T 0 A=0,X F 1 T A A,X (v) = F N A,X F 1 N A=0,X F T 0 A=0,X F 1 T A A,X (v) = 1 exp {log (1 v) exp (η(a, X) β(a, X))} or equivalently h TA A,X (t) h T0 A,X (t) = exp (η(a, X) β(a, X)), proving the result. The result continues to hold if we relax the assumption of a common unmeasured confounder for T as for N, thus allowing the unobserved confounder U of the A T association to be distinct from the unobserved confounder W of the A N association, provided that the association between U and A is similar to the association between W and A in the following sense. For all u in the unit interval, F U A,X F 1 U A=0,X (v) = F W A,X F 1 W A=0,X (v) (8) 7 Hosted by The Berkeley Electronic Press

This is the assumption of quantile-quantile equi-confounding introduced in Sofer et al 11. The assumption is weaker and thus implied by the more easier interprete assumption of distributional equi-confounding which states that the conditional distribution of U given A and X matches the conditional distribution of W given A, X, i.e. F U A,X ( ) = F W A,X ( ). The result then follows from the fact that equation (8) implies equations (6) and (7). References [1] Rosenbaum P. The Role of known effects in observational studies. Biometrics.1989, 45(2), 557-569. [2] Rosenbaum P. Observational Studies. 2002. Second Edition. Springer-Verlag, New York. [3] Lipsitch M, Tchetgen Tchetgen EJ, Cohen T. Negative Controls: A Tool for Detecting Confounding and Bias in Observational Studies Epidemiology. Epidemiology. 2010; 21(3): 383 388. [4] Flanders WD, Klein M, Strickland M, Darrow L, Sarnat S, Sarnat J, Waller L, Winquist A, Tolbert PE. A Method for Detection of Residual Confounding in Time-Series and Other Observational Studies. Epidemiology. 2011;22(1):59 67. [5] Tchetgen Tchetgen EJ. The control outcome calibration approach (COCA) for unobserved confounding (2013) American Journal of Epidemiology, 179(5):633-40. 8 http://biostats.bepress.com/harvardbiostat/paper192

[6] Schuemie, Martijn J., et al. "Interpreting observational studies: why empirical calibration is needed to correct p-values." Statistics in medicine 33.2 (2014): 209-218. [7] Groenwold, Rolf HH. "Falsification End Points for Observational Studies." JAMA 309.17 (2013): 1769-1771. [8] Richardson D, Cole S, Tchetgen Tchetgen EJ, Laurier D. Assessment and indirect adjustment for confounding by smoking in cohort studies using relative hazards models. 2014. American Journal of Epidemiology.180(9): 933-40. [9] Richardson D, Tchetgen Tchetgen EJ, Keil A,Cooper G. Negative Control Outcomes and the Analysis of Standardized Mortality Ratios. (2015). Epidemiology. In Press. [10] Dusetzina, S, Brookhart A, and Maciejewski, M. Control Outcomes and Exposures for Improving Internal Validity of Nonrandomized Studies. Health services research (2015). In Press. [11] Sofer T, Colicino E, Schwartz, Richardson, D,Tchetgen Tchetgen EJ.(2015) On simple relations between difference-in-differences and negative outcome control: identifying assumptions and a generalization. Submitted. 9 Hosted by The Berkeley Electronic Press