Causal exposure effect on a time-to-event response using an IV.

Size: px
Start display at page:

Download "Causal exposure effect on a time-to-event response using an IV."

Transcription

1 Faculty of Health Sciences Causal exposure effect on a time-to-event response using an IV. Torben Martinussen 1 Stijn Vansteelandt 2 Eric Tchetgen 3 1 Department of Biostatistics University of Copenhagen 2 Department of Applied Mathematics and Computer Science Ghent University 3 Department of Epidemiology and Department of Biostatistics Harvard 9. juli 2012 Slide 1/31

2 Causal effect and instrumental variables Inferring causation from observed observed association is problematic with epidemiologic data. Golden standard being the randomized controlled trial (RCT). RCT is not always possible when evaluating effect of say smoking, physical activity, Vit-D,... We are interested in eval. eff. of Vit-D on a number different responses: Cont., binary, time-to-event Collaboration with Tea Skaaby, Region H. Data from Monica10. Confounding is likely, and not fully understood Will estimate causal effect using so-called instrumental variables. Done here using so-called Mendelian randomization. Slide 2/31

3 Causal OC Setting is U G X Y X: Exposure (Vit.D) Y: Response U: Unmeasured confounders G: The instrument Mendelian randomization is when G is some genetical determinant Genes are transferred randomly from parents to offsprings. Here: filaggrin mutations (increases Vit.D). Individuals with filaggrin mutations are strongly predisposed to a severe form of dry skin Slide 3/31

4 Outline Explain what we mean by causal effect Y is continuous and we assume linear models: leads to 2SLS Y is binary; only approximate methods exists. Y is a time-to-event response (survival in our case). Slide 4/31

5 Pearls definition of causality Three variables, X, Y, and Z. Intervene on X. U x 0 U G X Y G X Y f (y, u, x, g) = f (y x, u)f (x u, g)f (u)f (g) f (y, u, g do(x = x 0 )) = f (y u, x 0 )f (u)f (g) f (y do(x = x 0 )) = u,g f (y u, x 0)f (u)f (g) f (y do(x = x 0 )) = u f (y u, x 0)f (u) G-computation formula. Slide 5/31

6 G-computation and potential outcomes Y x is the potential outcome of Y under treatment x Can think of Y x as Y do(x = x) Slide 6/31

7 The easy case Y is continuous E(Y X = x, U = u) = α + β 1 x + β 2 u E(X G = g, U = u) = γ + δ 1 g + δ 2 u β 1 is the causal effect but cannot be estimated directly as U is unobserved. E(Y do(x = x)) = y,u yf (y u, x)f (u) = u (α + β 1 x + β 2 u)f (u) = α + β 2 E(U) + β 1 x = α + β 1 x Also, E(Y X = x) = α + β 1 x + β 2 E(U X = x), and U and X are not independent. At first sight no easy solution. Slide 7/31

8 The easy case E(Y X = x, U = u) = α + β 1 x + β 2 u E(X G = g, U = u) = γ + δ 1 g + δ 2 u E(X G = g) = E(γ + δ 1 g + δ 2 u G = g) = γ + δ 1 g E(Y G = g) = E(α + β 1 x + β 2 u G = g) = α + β 1 E(X G = g) = ( α + β 1 γ) + β 1 δ 1 g Hence ˆβ 1 = r Y G /r X G Slide 8/31

9 -to-event response In rest of the talk I ll focus on a survival time response, T To my best knowledge, no solution to this exist. Mimicking workflow from linear models will not work here! Accelerated failure time models, SNF s (Robins and Tsiatis, 1991) do not work particularly well in the presence of censoring. artificial censoring; some that fails are considered censored numerical problems, see Joffe et al. (Biometrics, 2012). Slide 9/31

10 -to-event response First attempt: λ(t; X, G, U) = β 0 (t) + β X (t)x + β G (t)g + β U (t)u with β G (t) = 0. Note, that we cannot check this model since U is not observed. Center G so that E(G) = 0. (Replace G i with G i G). Slide 10/31

11 -to-event response Estimation E{Ge B X (t)x R(t)(dN(t) db X (t)x)} = 0 Proof: LHS = EE( F X,G,U t ) = E{Ge B X (t)x R(t)(dB 0 (t) + db U (t)u)} = EE( X, G, U) = E{Ge B X (t)x e B 0(t) B X (t)x B U (t)u (db 0 (t) + db U (t)u)} = E{G h(t, U)} = 0 Slide 11/31

12 -to-event response Estimation E{Ge B X (t)x R(t)(dN(t) db X (t)x)} = 0 Estimator: ˆB X (t) = t i G i eˆb X (s )X i dni (s) 0 i G i R i (s)eˆb X (s )X i with G i = G i G. Xi Note the recursive structure of ˆB X (t). Assuming β X (t) = β X, then may estimate β X by ˆβ X = τ 0 w(t)d ˆB X (t) with w(t) = w(t)/ τ 0 w(s) ds with w(t) = R (t). Slide 12/31

13 -to-event response We will no longer assume λ(t; X, G, U) = β 0 (t) + β X (t)x + β G (t)g + β U (t)u Instead we focus on the so-called local causal effect of exposure X: P(T x > t X = x, G) P(T > t X = x, G) P(T 0 = > t X = x, G) P(T 0 > t X = x, G) = e B X (t)x B X (t) is the effect of removing exposure for those who are exposed and keeping G fixed. Note that this model makes no explicit assumptions about the effect of U. We will show that ( ) = E{Ge B X (t)x R(t)(dN(t) db X (t)x)} = 0 still holds, and therefore the same estimator applies! Slide 13/31

14 Proof using potential outcomes Have: P(T 0 > t X, G) = e B X (t)x P(T > t X, G), giving λ T 0(t X, G) = λ T (t X, G) XdB X (t) Hence E(( )) = EE(( ) F t, X, G) = E(Ge B X (t)x R(t)λ T 0(t X, G))dt = EE( X, G) = E(GP(T 0 = t X, G))dt = E(GI(T 0 = t))dt = E(G)P(T 0 = t)dt = 0 since T 0 G. Slide 14/31

15 Simulations n=1000 G=rnorm(n,0,1); U=rnorm(n,0,1) gamma_g=.2 p.gu=exp( *u-gamma_g*g)/(1+exp( *u-gamma_g*g)) X=rbinom(n,1,p.GU) beta0=0.1; betag=0; betax=0.1; betau=0.05 doed=rexp(n)/(beta0+betag*g+betax*x+betau*u^2) ## Aalen model given G,X,U; B_X(t)=0.1*t ## In addition censoring resulting in around 50% ## Gives here correlation beta. X and G of around -0.1 ## Run now 9 simulations Slide 15/31

16 Simulations. Correlation approx -0.1 Slide 16/31

17 Simulations. Correlation approx -0.2 Slide 17/31

18 Simulations. Correlation approx -0.5 Slide 18/31

19 Asymptotics Estimator: ˆB X (t) = t i G i eˆb X (s )X i dni (s) 0 i G i R i (s)eˆb X (s )X i with G i = G i G. Xi Note the recursive structure of ˆB X (t). Actually, ˆB X (t) = ˆB X (t, ˆθ) where θ = E(G) and ˆθ = G And later this is extended to handle additional observed confounders C: µ(c; θ) = E(G C; θ) Slide 19/31

20 Asymptotics For known θ: ˆB X (t, θ) = t 0 H{s, ˆB X (s, θ)}dn(s), where the kth element of the n-vector H{s, ˆB X (t, θ)} is {G k µ(c k ; θ)}eˆb X (t,θ)x k / i {G i µ(c i ; θ)}eˆb X (t,θ)x i X i. Let V (t, θ) = n 1/2 {ˆB X (t, θ) B X (t)} and Ḣ the derivative of H with respect to second argument. V (t, θ) solves a Volterra-equation: t V (t, θ) =n 1/2 H(s, B X (s )) [dn(s) XdB X (s)] + 0 t Slide 20/31 0 V (s, θ)ḣ(s, B X (s))dn(s) + o p (1)

21 Asymptotics The solution to this equation is given by V (t, θ) = t 0 F(s, t)n 1/2 H(s, B X (s )) [dn(s) XdB X (s)], where F(s, t) = (s,t] { } 1 + Ḣ(, B X ( ))dn( ) This leads to the iid-representation n V (t, θ) = n 1/2 ɛ B i (t) with the ɛ B i (t) s being zero-mean iid terms. Specifically ɛ B i (t) = t 0 i=1 F(s, t)n 1/2 {H(s, B X (s ))} i [dn(s) XdB X (s)] i Slide 21/31

22 Asymptotics Rewrite n 1/2 {ˆB X (t, ˆθ) B X (t)} as n 1/2 {ˆB X (t, θ) B X (t)} + n 1/2 {ˆB X (t, ˆθ) ˆB X (t, θ)} =n 1/2 {ˆB X (t, θ) B X (t)} + D θ (ˆB X (t, θ)) ˆθ n1/2 (ˆθ θ) + o p (1), Gives an iid-decomposition of n 1/2 {ˆB X (t, ˆθ) B X (t)}: n n 1/2 {ˆB X (t, ˆθ) B X (t)} = n 1/2 ɛ B i (t, θ) + o p (1), i=1 Slide 22/31

23 Asymptotics Hence, n 1/2 {ˆB X (t, ˆθ) B X (t)} converges to a zero-mean Gaussian process with a variance that is consistently estimated by n n 1 ˆɛ B i (t, ˆθ) 2. i=1 Inference such as H 0 : β X (t) = β X also doable. If so, then ˆβ X = τ 0 w(t)d ˆB X (t) Slide 23/31

24 (n, ρ) t = 1 t = 2 t = 3 t = 4 (400,-0.3) Mean ˆB X (t) B X (t) % CP(ˆB X (t)) Mean ˆβ X β X % CP( ˆβ X ) (800,-0.3) Mean ˆB X (t) B X (t) < % CP(ˆB X (t)) Mean ˆβ X β X < % CP( ˆβ X ) (400,-0.5) Mean ˆB X (t) B X (t) % CP(ˆB X (t)) Mean ˆβ X β X < % CP( ˆβ X ) (800,-0.5) Mean ˆB X (t) B X (t) < % CP(ˆB X (t)) Mean ˆβ X β X < % CP( ˆβ X ) Slide 24/31

25 Causal effect of Vitamin d Vit D (nmol/l) > head(data_vitd) alder filmut vitd dod timedod time > table(data_vitd$filmut) > lm(formula = vitd ~ filmut, data = data_vitd) Coefficients: Std. Error t value Pr(> t ) (Intercept) < 2e-16 *** filmut ** Have also information on age. Need to adjust for that. Slide 25/31

26 Causal effect of Vitamin d > fit.aalen=aalen(surv(time,dod==1)~filmut+factor(vitdg)+alder,max.time=15,data=data_vitd) Additive Aalen Model Test for nonparametric terms Test for non-significant effects Supremum-test of significance p-value H_0: B(t)=0 (Intercept) filmut factor(vitdg)(44.7,60.9] factor(vitdg)(60.9,80.8] factor(vitdg)(80.8,204] alder Test for time invariant effects Kolmogorov-Smirnov test p-value H_0:constant effect (Intercept) filmut factor(vitdg)(44.7,60.9] factor(vitdg)(60.9,80.8] factor(vitdg)(80.8,204] alder Slide 26/31

27 Effect of Vitamin D (Intercept) filmut factor(vitdg)(44.7,60.9] Cumulative coefficients Cumulative coefficients Cumulative coefficients factor(vitdg)(60.9,80.8] factor(vitdg)(80.8,204] alder Cumulative coefficients Cumulative coefficients Cumulative coefficients Slide 27/31

28 Additional covariates Parameter is now given by P(T > t X = x, G, C) P(T 0 > t X = x, G, C) = e B X (t)x B X (t) is the effect of removing exposure for those who are exposed and keeping G and C fixed. Estimation. Can show that E{(G E(G C))e B X (t)x R(t)(dN(t) db X (t)x)} = 0 Estimator: ˆB X (t) = t 0 G C i i = G i E(G i C i ). Efficiency. eˆb X (s )X i dni (s) with R i i (s)eˆb X (s )X i Xi i G C i i i G C i Slide 28/31

29 Eff. of Vit. D - Discr. vers. of Vit D(>30) factor(vitddd)1 Cumulative coefficients Cumulative (years) Slide 29/31

30 Eff. of Vit. D - Discr. vers. of Vit D(>30) factor(vitddd)1 Cumulative coefficients Cumulative (years) Slide 30/31

31 Conlcuding remarks Work in progress... Censoring... May need to incorporate inverse probabilty weighting to accomodate different kinds of censoring. Extend to Cox model; seems difficult :( IV s? Attractive approach but it is based on assumptions... See paper "Instruments for causal inference: an epidemiologist s dream?"by Hernan and Robins for further discission on this. Efficiency considerations, optimal estimating equation. Slide 31/31

Causality II: How does causal inference fit into public health and what it is the role of statistics?

Causality II: How does causal inference fit into public health and what it is the role of statistics? Causality II: How does causal inference fit into public health and what it is the role of statistics? Statistics for Psychosocial Research II November 13, 2006 1 Outline Potential Outcomes / Counterfactual

More information

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model

Other Survival Models. (1) Non-PH models. We briefly discussed the non-proportional hazards (non-ph) model Other Survival Models (1) Non-PH models We briefly discussed the non-proportional hazards (non-ph) model λ(t Z) = λ 0 (t) exp{β(t) Z}, where β(t) can be estimated by: piecewise constants (recall how);

More information

Causal inference in biomedical sciences: causal models involving genotypes. Mendelian randomization genes as Instrumental Variables

Causal inference in biomedical sciences: causal models involving genotypes. Mendelian randomization genes as Instrumental Variables Causal inference in biomedical sciences: causal models involving genotypes Causal models for observational data Instrumental variables estimation and Mendelian randomization Krista Fischer Estonian Genome

More information

Harvard University. A Note on the Control Function Approach with an Instrumental Variable and a Binary Outcome. Eric Tchetgen Tchetgen

Harvard University. A Note on the Control Function Approach with an Instrumental Variable and a Binary Outcome. Eric Tchetgen Tchetgen Harvard University Harvard University Biostatistics Working Paper Series Year 2014 Paper 175 A Note on the Control Function Approach with an Instrumental Variable and a Binary Outcome Eric Tchetgen Tchetgen

More information

Extending causal inferences from a randomized trial to a target population

Extending causal inferences from a randomized trial to a target population Extending causal inferences from a randomized trial to a target population Issa Dahabreh Center for Evidence Synthesis in Health, Brown University issa dahabreh@brown.edu January 16, 2019 Issa Dahabreh

More information

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis

Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Statistics 262: Intermediate Biostatistics Non-parametric Survival Analysis Jonathan Taylor & Kristin Cobb Statistics 262: Intermediate Biostatistics p.1/?? Overview of today s class Kaplan-Meier Curve

More information

Harvard University. Harvard University Biostatistics Working Paper Series

Harvard University. Harvard University Biostatistics Working Paper Series Harvard University Harvard University Biostatistics Working Paper Series Year 214 Paper 179 Instrumental Variable Estimation in a Survival Context Eric J. Tchetgen Tchetgen Stefan Walter Stijn Vansteelandt

More information

Statistical Methods for Alzheimer s Disease Studies

Statistical Methods for Alzheimer s Disease Studies Statistical Methods for Alzheimer s Disease Studies Rebecca A. Betensky, Ph.D. Department of Biostatistics, Harvard T.H. Chan School of Public Health July 19, 2016 1/37 OUTLINE 1 Statistical collaborations

More information

Statistical inference in Mendelian randomization: From genetic association to epidemiological causation

Statistical inference in Mendelian randomization: From genetic association to epidemiological causation Statistical inference in Mendelian randomization: From genetic association to epidemiological causation Department of Statistics, The Wharton School, University of Pennsylvania March 1st, 2018 @ UMN Based

More information

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification Todd MacKenzie, PhD Collaborators A. James O Malley Tor Tosteson Therese Stukel 2 Overview 1. Instrumental variable

More information

Estimating direct effects in cohort and case-control studies

Estimating direct effects in cohort and case-control studies Estimating direct effects in cohort and case-control studies, Ghent University Direct effects Introduction Motivation The problem of standard approaches Controlled direct effect models In many research

More information

IV-estimators of the causal odds ratio for a continuous exposure in prospective and retrospective designs

IV-estimators of the causal odds ratio for a continuous exposure in prospective and retrospective designs IV-estimators of the causal odds ratio for a continuous exposure in prospective and retrospective designs JACK BOWDEN (corresponding author) MRC Biostatistics Unit, Institute of Public Health, Robinson

More information

Casual Mediation Analysis

Casual Mediation Analysis Casual Mediation Analysis Tyler J. VanderWeele, Ph.D. Upcoming Seminar: April 21-22, 2017, Philadelphia, Pennsylvania OXFORD UNIVERSITY PRESS Explanation in Causal Inference Methods for Mediation and Interaction

More information

Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials

Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials Application of Time-to-Event Methods in the Assessment of Safety in Clinical Trials Progress, Updates, Problems William Jen Hoe Koh May 9, 2013 Overview Marginal vs Conditional What is TMLE? Key Estimation

More information

Power and Sample Size Calculations with the Additive Hazards Model

Power and Sample Size Calculations with the Additive Hazards Model Journal of Data Science 10(2012), 143-155 Power and Sample Size Calculations with the Additive Hazards Model Ling Chen, Chengjie Xiong, J. Philip Miller and Feng Gao Washington University School of Medicine

More information

Comparison of Three Approaches to Causal Mediation Analysis. Donna L. Coffman David P. MacKinnon Yeying Zhu Debashis Ghosh

Comparison of Three Approaches to Causal Mediation Analysis. Donna L. Coffman David P. MacKinnon Yeying Zhu Debashis Ghosh Comparison of Three Approaches to Causal Mediation Analysis Donna L. Coffman David P. MacKinnon Yeying Zhu Debashis Ghosh Introduction Mediation defined using the potential outcomes framework natural effects

More information

Goodness-Of-Fit for Cox s Regression Model. Extensions of Cox s Regression Model. Survival Analysis Fall 2004, Copenhagen

Goodness-Of-Fit for Cox s Regression Model. Extensions of Cox s Regression Model. Survival Analysis Fall 2004, Copenhagen Outline Cox s proportional hazards model. Goodness-of-fit tools More flexible models R-package timereg Forthcoming book, Martinussen and Scheike. 2/38 University of Copenhagen http://www.biostat.ku.dk

More information

Two-stage predictor substitution for time-to-event data. Simon Lergenmuller Master s Thesis, Spring 2017

Two-stage predictor substitution for time-to-event data. Simon Lergenmuller Master s Thesis, Spring 2017 Two-stage predictor substitution for time-to-event data Simon Lergenmuller Master s Thesis, Spring 2017 This master s thesis is submitted under the master s programme Modelling and Data Analysis, with

More information

Variable selection and machine learning methods in causal inference

Variable selection and machine learning methods in causal inference Variable selection and machine learning methods in causal inference Debashis Ghosh Department of Biostatistics and Informatics Colorado School of Public Health Joint work with Yeying Zhu, University of

More information

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data

Approximation of Survival Function by Taylor Series for General Partly Interval Censored Data Malaysian Journal of Mathematical Sciences 11(3): 33 315 (217) MALAYSIAN JOURNAL OF MATHEMATICAL SCIENCES Journal homepage: http://einspem.upm.edu.my/journal Approximation of Survival Function by Taylor

More information

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data Outline Frailty modelling of Multivariate Survival Data Thomas Scheike ts@biostat.ku.dk Department of Biostatistics University of Copenhagen Marginal versus Frailty models. Two-stage frailty models: copula

More information

ST745: Survival Analysis: Nonparametric methods

ST745: Survival Analysis: Nonparametric methods ST745: Survival Analysis: Nonparametric methods Eric B. Laber Department of Statistics, North Carolina State University February 5, 2015 The KM estimator is used ubiquitously in medical studies to estimate

More information

Mendelian randomization (MR)

Mendelian randomization (MR) Mendelian randomization (MR) Use inherited genetic variants to infer causal relationship of an exposure and a disease outcome. 1 Concepts of MR and Instrumental variable (IV) methods motivation, assumptions,

More information

Causal Inference. Prediction and causation are very different. Typical questions are:

Causal Inference. Prediction and causation are very different. Typical questions are: Causal Inference Prediction and causation are very different. Typical questions are: Prediction: Predict Y after observing X = x Causation: Predict Y after setting X = x. Causation involves predicting

More information

Harvard University. Harvard University Biostatistics Working Paper Series

Harvard University. Harvard University Biostatistics Working Paper Series Harvard University Harvard University Biostatistics Working Paper Series Year 2015 Paper 192 Negative Outcome Control for Unobserved Confounding Under a Cox Proportional Hazards Model Eric J. Tchetgen

More information

Specification Errors, Measurement Errors, Confounding

Specification Errors, Measurement Errors, Confounding Specification Errors, Measurement Errors, Confounding Kerby Shedden Department of Statistics, University of Michigan October 10, 2018 1 / 32 An unobserved covariate Suppose we have a data generating model

More information

Sensitivity analysis and distributional assumptions

Sensitivity analysis and distributional assumptions Sensitivity analysis and distributional assumptions Tyler J. VanderWeele Department of Health Studies, University of Chicago 5841 South Maryland Avenue, MC 2007, Chicago, IL 60637, USA vanderweele@uchicago.edu

More information

Mendelian randomization as an instrumental variable approach to causal inference

Mendelian randomization as an instrumental variable approach to causal inference Statistical Methods in Medical Research 2007; 16: 309 330 Mendelian randomization as an instrumental variable approach to causal inference Vanessa Didelez Departments of Statistical Science, University

More information

Estimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis.

Estimating the Mean Response of Treatment Duration Regimes in an Observational Study. Anastasios A. Tsiatis. Estimating the Mean Response of Treatment Duration Regimes in an Observational Study Anastasios A. Tsiatis http://www.stat.ncsu.edu/ tsiatis/ Introduction to Dynamic Treatment Regimes 1 Outline Description

More information

From semi- to non-parametric inference in general time scale models

From semi- to non-parametric inference in general time scale models From semi- to non-parametric inference in general time scale models Thierry DUCHESNE duchesne@matulavalca Département de mathématiques et de statistique Université Laval Québec, Québec, Canada Research

More information

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers:

Use of Matching Methods for Causal Inference in Experimental and Observational Studies. This Talk Draws on the Following Papers: Use of Matching Methods for Causal Inference in Experimental and Observational Studies Kosuke Imai Department of Politics Princeton University April 27, 2007 Kosuke Imai (Princeton University) Matching

More information

Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference 1

Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference 1 Advanced Quantitative Research Methodology, Lecture Notes: Research Designs for Causal Inference 1 Gary King GaryKing.org April 13, 2014 1 c Copyright 2014 Gary King, All Rights Reserved. Gary King ()

More information

Estimating the treatment effect on the treated under time-dependent confounding in an application to the Swiss HIV Cohort Study

Estimating the treatment effect on the treated under time-dependent confounding in an application to the Swiss HIV Cohort Study Appl. Statist. (2018) 67, Part 1, pp. 103 125 Estimating the treatment effect on the treated under time-dependent confounding in an application to the Swiss HIV Cohort Study Jon Michael Gran, Oslo University

More information

Department of Biostatistics University of Copenhagen

Department of Biostatistics University of Copenhagen Comparison of five software solutions to mediation analysis Liis Starkopf Mikkel Porsborg Andersen Thomas Alexander Gerds Christian Torp-Pedersen Theis Lange Research Report 17/01 Department of Biostatistics

More information

Survival Analysis for Case-Cohort Studies

Survival Analysis for Case-Cohort Studies Survival Analysis for ase-ohort Studies Petr Klášterecký Dept. of Probability and Mathematical Statistics, Faculty of Mathematics and Physics, harles University, Prague, zech Republic e-mail: petr.klasterecky@matfyz.cz

More information

Estimating the Marginal Odds Ratio in Observational Studies

Estimating the Marginal Odds Ratio in Observational Studies Estimating the Marginal Odds Ratio in Observational Studies Travis Loux Christiana Drake Department of Statistics University of California, Davis June 20, 2011 Outline The Counterfactual Model Odds Ratios

More information

Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models

Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models NIH Talk, September 03 Efficient Semiparametric Estimators via Modified Profile Likelihood in Frailty & Accelerated-Failure Models Eric Slud, Math Dept, Univ of Maryland Ongoing joint project with Ilia

More information

Cox s proportional hazards model and Cox s partial likelihood

Cox s proportional hazards model and Cox s partial likelihood Cox s proportional hazards model and Cox s partial likelihood Rasmus Waagepetersen October 12, 2018 1 / 27 Non-parametric vs. parametric Suppose we want to estimate unknown function, e.g. survival function.

More information

The GENIUS Approach to Robust Mendelian Randomization Inference

The GENIUS Approach to Robust Mendelian Randomization Inference Original Article The GENIUS Approach to Robust Mendelian Randomization Inference Eric J. Tchetgen Tchetgen, BaoLuo Sun, Stefan Walter Department of Biostatistics,Harvard University arxiv:1709.07779v5 [stat.me]

More information

Goodness-of-fit test for the Cox Proportional Hazard Model

Goodness-of-fit test for the Cox Proportional Hazard Model Goodness-of-fit test for the Cox Proportional Hazard Model Rui Cui rcui@eco.uc3m.es Department of Economics, UC3M Abstract In this paper, we develop new goodness-of-fit tests for the Cox proportional hazard

More information

UNIVERSITY OF CALIFORNIA, SAN DIEGO

UNIVERSITY OF CALIFORNIA, SAN DIEGO UNIVERSITY OF CALIFORNIA, SAN DIEGO Estimation of the primary hazard ratio in the presence of a secondary covariate with non-proportional hazards An undergraduate honors thesis submitted to the Department

More information

Natural direct and indirect effects on the exposed: effect decomposition under. weaker assumptions

Natural direct and indirect effects on the exposed: effect decomposition under. weaker assumptions Biometrics 59, 1?? December 2006 DOI: 10.1111/j.1541-0420.2005.00454.x Natural direct and indirect effects on the exposed: effect decomposition under weaker assumptions Stijn Vansteelandt Department of

More information

Does Cox analysis of a randomized survival study yield a causal treatment effect?

Does Cox analysis of a randomized survival study yield a causal treatment effect? Published in final edited form as: Lifetime Data Analysis (2015), 21(4): 579 593 DOI: 10.1007/s10985-015-9335-y Does Cox analysis of a randomized survival study yield a causal treatment effect? Odd O.

More information

Causal Modeling in Environmental Epidemiology. Joel Schwartz Harvard University

Causal Modeling in Environmental Epidemiology. Joel Schwartz Harvard University Causal Modeling in Environmental Epidemiology Joel Schwartz Harvard University When I was Young What do I mean by Causal Modeling? What would have happened if the population had been exposed to a instead

More information

arxiv: v1 [math.st] 22 Oct 2018

arxiv: v1 [math.st] 22 Oct 2018 Subtleties in the interpretation of hazard ratios arxiv:1810.09192v1 [math.st] 22 Oct 2018 Torben Martinussen 1, Stijn Vansteelandt 2 and Per Kragh Andersen 1 1 Department of Biostatistics University of

More information

Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective. Anastasios (Butch) Tsiatis and Xiaofei Bai

Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective. Anastasios (Butch) Tsiatis and Xiaofei Bai Optimal Treatment Regimes for Survival Endpoints from a Classification Perspective Anastasios (Butch) Tsiatis and Xiaofei Bai Department of Statistics North Carolina State University 1/35 Optimal Treatment

More information

Mediation and Interaction Analysis

Mediation and Interaction Analysis Mediation and Interaction Analysis Andrea Bellavia abellavi@hsph.harvard.edu May 17, 2017 Andrea Bellavia Mediation and Interaction May 17, 2017 1 / 43 Epidemiology, public health, and clinical research

More information

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates

Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Improving Efficiency of Inferences in Randomized Clinical Trials Using Auxiliary Covariates Anastasios (Butch) Tsiatis Department of Statistics North Carolina State University http://www.stat.ncsu.edu/

More information

Survival Analysis I (CHL5209H)

Survival Analysis I (CHL5209H) Survival Analysis Dalla Lana School of Public Health University of Toronto olli.saarela@utoronto.ca January 7, 2015 31-1 Literature Clayton D & Hills M (1993): Statistical Models in Epidemiology. Not really

More information

IP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS IPW and MSM

IP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS IPW and MSM IP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS 776 1 12 IPW and MSM IP weighting and marginal structural models ( 12) Outline 12.1 The causal question 12.2 Estimating IP weights via modeling

More information

BIOS 6649: Handout Exercise Solution

BIOS 6649: Handout Exercise Solution BIOS 6649: Handout Exercise Solution NOTE: I encourage you to work together, but the work you submit must be your own. Any plagiarism will result in loss of all marks. This assignment is based on weight-loss

More information

You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?

You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What? You know I m not goin diss you on the internet Cause my mama taught me better than that I m a survivor (What?) I m not goin give up (What?) I m not goin stop (What?) I m goin work harder (What?) Sir David

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2011 Paper 288 Targeted Maximum Likelihood Estimation of Natural Direct Effect Wenjing Zheng Mark J.

More information

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score

Causal Inference with General Treatment Regimes: Generalizing the Propensity Score Causal Inference with General Treatment Regimes: Generalizing the Propensity Score David van Dyk Department of Statistics, University of California, Irvine vandyk@stat.harvard.edu Joint work with Kosuke

More information

Multistate Modeling and Applications

Multistate Modeling and Applications Multistate Modeling and Applications Yang Yang Department of Statistics University of Michigan, Ann Arbor IBM Research Graduate Student Workshop: Statistics for a Smarter Planet Yang Yang (UM, Ann Arbor)

More information

Beyond GLM and likelihood

Beyond GLM and likelihood Stat 6620: Applied Linear Models Department of Statistics Western Michigan University Statistics curriculum Core knowledge (modeling and estimation) Math stat 1 (probability, distributions, convergence

More information

Instrumental variables estimation in the Cox Proportional Hazard regression model

Instrumental variables estimation in the Cox Proportional Hazard regression model Instrumental variables estimation in the Cox Proportional Hazard regression model James O Malley, Ph.D. Department of Biomedical Data Science The Dartmouth Institute for Health Policy and Clinical Practice

More information

Final Exam. Economics 835: Econometrics. Fall 2010

Final Exam. Economics 835: Econometrics. Fall 2010 Final Exam Economics 835: Econometrics Fall 2010 Please answer the question I ask - no more and no less - and remember that the correct answer is often short and simple. 1 Some short questions a) For each

More information

Bios 6648: Design & conduct of clinical research

Bios 6648: Design & conduct of clinical research Bios 6648: Design & conduct of clinical research Section 2 - Formulating the scientific and statistical design designs 2.5(b) Binary 2.5(c) Skewed baseline (a) Time-to-event (revisited) (b) Binary (revisited)

More information

STAT Sample Problem: General Asymptotic Results

STAT Sample Problem: General Asymptotic Results STAT331 1-Sample Problem: General Asymptotic Results In this unit we will consider the 1-sample problem and prove the consistency and asymptotic normality of the Nelson-Aalen estimator of the cumulative

More information

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017

Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Ph.D. Qualifying Exam Friday Saturday, January 6 7, 2017 Put your solution to each problem on a separate sheet of paper. Problem 1. (5106) Let X 1, X 2,, X n be a sequence of i.i.d. observations from a

More information

Help! Statistics! Mediation Analysis

Help! Statistics! Mediation Analysis Help! Statistics! Lunch time lectures Help! Statistics! Mediation Analysis What? Frequently used statistical methods and questions in a manageable timeframe for all researchers at the UMCG. No knowledge

More information

Confounding, mediation and colliding

Confounding, mediation and colliding Confounding, mediation and colliding What types of shared covariates does the sibling comparison design control for? Arvid Sjölander and Johan Zetterqvist Causal effects and confounding A common aim of

More information

Measurement error effects on bias and variance in two-stage regression, with application to air pollution epidemiology

Measurement error effects on bias and variance in two-stage regression, with application to air pollution epidemiology Measurement error effects on bias and variance in two-stage regression, with application to air pollution epidemiology Chris Paciorek Department of Statistics, University of California, Berkeley and Adam

More information

1 Glivenko-Cantelli type theorems

1 Glivenko-Cantelli type theorems STA79 Lecture Spring Semester Glivenko-Cantelli type theorems Given i.i.d. observations X,..., X n with unknown distribution function F (t, consider the empirical (sample CDF ˆF n (t = I [Xi t]. n Then

More information

The decision theoretic approach to causal inference OR Rethinking the paradigms of causal modelling

The decision theoretic approach to causal inference OR Rethinking the paradigms of causal modelling The decision theoretic approach to causal inference OR Rethinking the paradigms of causal modelling A.P.Dawid 1 and S.Geneletti 2 1 University of Cambridge, Statistical Laboratory 2 Imperial College Department

More information

A comparison of 5 software implementations of mediation analysis

A comparison of 5 software implementations of mediation analysis Faculty of Health Sciences A comparison of 5 software implementations of mediation analysis Liis Starkopf, Thomas A. Gerds, Theis Lange Section of Biostatistics, University of Copenhagen Illustrative example

More information

arxiv: v3 [stat.me] 10 Mar 2016

arxiv: v3 [stat.me] 10 Mar 2016 Submitted to the Annals of Statistics ESTIMATING THE EFFECT OF JOINT INTERVENTIONS FROM OBSERVATIONAL DATA IN SPARSE HIGH-DIMENSIONAL SETTINGS arxiv:1407.2451v3 [stat.me] 10 Mar 2016 By Preetam Nandy,,

More information

Assessing In/Direct Effects: from Structural Equation Models to Causal Mediation Analysis

Assessing In/Direct Effects: from Structural Equation Models to Causal Mediation Analysis Assessing In/Direct Effects: from Structural Equation Models to Causal Mediation Analysis Part 1: Vanessa Didelez with help from Ryan M Andrews Leibniz Institute for Prevention Research & Epidemiology

More information

Causal modelling in Medical Research

Causal modelling in Medical Research Causal modelling in Medical Research Debashis Ghosh Department of Biostatistics and Informatics, Colorado School of Public Health Biostatistics Workshop Series Goals for today Introduction to Potential

More information

Flexible mediation analysis in the presence of non-linear relations: beyond the mediation formula.

Flexible mediation analysis in the presence of non-linear relations: beyond the mediation formula. FACULTY OF PSYCHOLOGY AND EDUCATIONAL SCIENCES Flexible mediation analysis in the presence of non-linear relations: beyond the mediation formula. Modern Modeling Methods (M 3 ) Conference Beatrijs Moerkerke

More information

Lecture Notes 22 Causal Inference

Lecture Notes 22 Causal Inference Lecture Notes 22 Causal Inference Prediction and causation are very different. Typical questions are: Prediction: Predict after observing = x Causation: Predict after setting = x. Causation involves predicting

More information

Causal Discovery by Computer

Causal Discovery by Computer Causal Discovery by Computer Clark Glymour Carnegie Mellon University 1 Outline 1. A century of mistakes about causation and discovery: 1. Fisher 2. Yule 3. Spearman/Thurstone 2. Search for causes is statistical

More information

Harvard University. Harvard University Biostatistics Working Paper Series

Harvard University. Harvard University Biostatistics Working Paper Series Harvard University Harvard University Biostatistics Working Paper Series Year 2014 Paper 176 A Simple Regression-based Approach to Account for Survival Bias in Birth Outcomes Research Eric J. Tchetgen

More information

Faculty of Health Sciences. Cox regression. Torben Martinussen. Department of Biostatistics University of Copenhagen. 20. september 2012 Slide 1/51

Faculty of Health Sciences. Cox regression. Torben Martinussen. Department of Biostatistics University of Copenhagen. 20. september 2012 Slide 1/51 Faculty of Health Sciences Cox regression Torben Martinussen Department of Biostatistics University of Copenhagen 2. september 212 Slide 1/51 Survival analysis Standard setup for right-censored survival

More information

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates

Analysis of Gamma and Weibull Lifetime Data under a General Censoring Scheme and in the presence of Covariates Communications in Statistics - Theory and Methods ISSN: 0361-0926 (Print) 1532-415X (Online) Journal homepage: http://www.tandfonline.com/loi/lsta20 Analysis of Gamma and Weibull Lifetime Data under a

More information

Lecture 5 Models and methods for recurrent event data

Lecture 5 Models and methods for recurrent event data Lecture 5 Models and methods for recurrent event data Recurrent and multiple events are commonly encountered in longitudinal studies. In this chapter we consider ordered recurrent and multiple events.

More information

Survival Regression Models

Survival Regression Models Survival Regression Models David M. Rocke May 18, 2017 David M. Rocke Survival Regression Models May 18, 2017 1 / 32 Background on the Proportional Hazards Model The exponential distribution has constant

More information

arxiv: v2 [stat.me] 31 Dec 2012

arxiv: v2 [stat.me] 31 Dec 2012 1 arxiv:1109.1070v2 [stat.me] 31 Dec 2012 1 Mediation Analysis Without Sequential Ignorability: Using Baseline Covariates Interacted with Random Assignment as Instrumental Variables Dylan S. Small University

More information

Sensitivity Analysis with Several Unmeasured Confounders

Sensitivity Analysis with Several Unmeasured Confounders Sensitivity Analysis with Several Unmeasured Confounders Lawrence McCandless lmccandl@sfu.ca Faculty of Health Sciences, Simon Fraser University, Vancouver Canada Spring 2015 Outline The problem of several

More information

Step-Stress Models and Associated Inference

Step-Stress Models and Associated Inference Department of Mathematics & Statistics Indian Institute of Technology Kanpur August 19, 2014 Outline Accelerated Life Test 1 Accelerated Life Test 2 3 4 5 6 7 Outline Accelerated Life Test 1 Accelerated

More information

Introduction to repairable systems STK4400 Spring 2011

Introduction to repairable systems STK4400 Spring 2011 Introduction to repairable systems STK4400 Spring 2011 Bo Lindqvist http://www.math.ntnu.no/ bo/ bo@math.ntnu.no Bo Lindqvist Introduction to repairable systems Definition of repairable system Ascher and

More information

Double Robustness. Bang and Robins (2005) Kang and Schafer (2007)

Double Robustness. Bang and Robins (2005) Kang and Schafer (2007) Double Robustness Bang and Robins (2005) Kang and Schafer (2007) Set-Up Assume throughout that treatment assignment is ignorable given covariates (similar to assumption that data are missing at random

More information

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Joe Schafer Office of the Associate Director for Research and Methodology U.S. Census

More information

Nonparametric two-sample tests of longitudinal data in the presence of a terminal event

Nonparametric two-sample tests of longitudinal data in the presence of a terminal event Nonparametric two-sample tests of longitudinal data in the presence of a terminal event Jinheum Kim 1, Yang-Jin Kim, 2 & Chung Mo Nam 3 1 Department of Applied Statistics, University of Suwon, 2 Department

More information

A Distinction between Causal Effects in Structural and Rubin Causal Models

A Distinction between Causal Effects in Structural and Rubin Causal Models A istinction between Causal Effects in Structural and Rubin Causal Models ionissi Aliprantis April 28, 2017 Abstract: Unspecified mediators play different roles in the outcome equations of Structural Causal

More information

Journal of Statistical Software

Journal of Statistical Software JSS Journal of Statistical Software January 2011, Volume 38, Issue 2. http://www.jstatsoft.org/ Analyzing Competing Risk Data Using the R timereg Package Thomas H. Scheike University of Copenhagen Mei-Jie

More information

8. Parametric models in survival analysis General accelerated failure time models for parametric regression

8. Parametric models in survival analysis General accelerated failure time models for parametric regression 8. Parametric models in survival analysis 8.1. General accelerated failure time models for parametric regression The accelerated failure time model Let T be the time to event and x be a vector of covariates.

More information

Simulating from Marginal Structural Models with Time-Dependent Confounding

Simulating from Marginal Structural Models with Time-Dependent Confounding Research Article Received XXXX (www.interscience.wiley.com) DOI: 10.1002/sim.0000 Simulating from Marginal Structural Models with Time-Dependent Confounding W. G. Havercroft and V. Didelez We discuss why

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2010 Paper 260 Collaborative Targeted Maximum Likelihood For Time To Event Data Ori M. Stitelman Mark

More information

A proof of Bell s inequality in quantum mechanics using causal interactions

A proof of Bell s inequality in quantum mechanics using causal interactions A proof of Bell s inequality in quantum mechanics using causal interactions James M. Robins, Tyler J. VanderWeele Departments of Epidemiology and Biostatistics, Harvard School of Public Health Richard

More information

arxiv: v1 [stat.me] 15 May 2011

arxiv: v1 [stat.me] 15 May 2011 Working Paper Propensity Score Analysis with Matching Weights Liang Li, Ph.D. arxiv:1105.2917v1 [stat.me] 15 May 2011 Associate Staff of Biostatistics Department of Quantitative Health Sciences, Cleveland

More information

Covariance and Correlation

Covariance and Correlation Covariance and Correlation ST 370 The probability distribution of a random variable gives complete information about its behavior, but its mean and variance are useful summaries. Similarly, the joint probability

More information

Survival Analysis. Lu Tian and Richard Olshen Stanford University

Survival Analysis. Lu Tian and Richard Olshen Stanford University 1 Survival Analysis Lu Tian and Richard Olshen Stanford University 2 Survival Time/ Failure Time/Event Time We will introduce various statistical methods for analyzing survival outcomes What is the survival

More information

Harvard University. Harvard University Biostatistics Working Paper Series. Semiparametric Estimation of Models for Natural Direct and Indirect Effects

Harvard University. Harvard University Biostatistics Working Paper Series. Semiparametric Estimation of Models for Natural Direct and Indirect Effects Harvard University Harvard University Biostatistics Working Paper Series Year 2011 Paper 129 Semiparametric Estimation of Models for Natural Direct and Indirect Effects Eric J. Tchetgen Tchetgen Ilya Shpitser

More information

SEMIPARAMETRIC METHODS FOR ESTIMATING CUMULATIVE TREATMENT EFFECTS IN THE PRESENCE OF NON-PROPORTIONAL HAZARDS AND DEPENDENT CENSORING

SEMIPARAMETRIC METHODS FOR ESTIMATING CUMULATIVE TREATMENT EFFECTS IN THE PRESENCE OF NON-PROPORTIONAL HAZARDS AND DEPENDENT CENSORING SEMIPARAMETRIC METHODS FOR ESTIMATING CUMULATIVE TREATMENT EFFECTS IN THE PRESENCE OF NON-PROPORTIONAL HAZARDS AND DEPENDENT CENSORING by Guanghui Wei A dissertation submitted in partial fulfillment of

More information

STAT 331. Accelerated Failure Time Models. Previously, we have focused on multiplicative intensity models, where

STAT 331. Accelerated Failure Time Models. Previously, we have focused on multiplicative intensity models, where STAT 331 Accelerated Failure Time Models Previously, we have focused on multiplicative intensity models, where h t z) = h 0 t) g z). These can also be expressed as H t z) = H 0 t) g z) or S t z) = e Ht

More information

A Unification of Mediation and Interaction. A 4-Way Decomposition. Tyler J. VanderWeele

A Unification of Mediation and Interaction. A 4-Way Decomposition. Tyler J. VanderWeele Original Article A Unification of Mediation and Interaction A 4-Way Decomposition Tyler J. VanderWeele Abstract: The overall effect of an exposure on an outcome, in the presence of a mediator with which

More information

Standardization methods have been used in epidemiology. Marginal Structural Models as a Tool for Standardization ORIGINAL ARTICLE

Standardization methods have been used in epidemiology. Marginal Structural Models as a Tool for Standardization ORIGINAL ARTICLE ORIGINAL ARTICLE Marginal Structural Models as a Tool for Standardization Tosiya Sato and Yutaka Matsuyama Abstract: In this article, we show the general relation between standardization methods and marginal

More information

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data

Outline. Frailty modelling of Multivariate Survival Data. Clustered survival data. Clustered survival data Outline Frailty modelling of Multivariate Survival Data Thomas Scheike ts@biostat.ku.dk Department of Biostatistics University of Copenhagen Marginal versus Frailty models. Two-stage frailty models: copula

More information