Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall
|
|
- Alyson Jackson
- 5 years ago
- Views:
Transcription
1 1 Structural Nested Mean Models for Assessing Time-Varying Effect Moderation Daniel Almirall Center for Health Services Research, Durham VAMC & Dept. of Biostatistics, Duke University Medical Joint work with Thomas Ten Have (UPenn) & Susan Murphy (UMich) NC State University, Dept. of Statistics September 27, 2007
2 Contents 2 Contents 1 Warm-up: Why condition on pre-treatment variables? 4 2 Time-Varying Effect Moderation 8 3 Robins Structural Nested Mean Model 11 4 Estimation in Time-Varying Setting 14 5 Bias-Variance Trade-off 20 6 Conclusions 25
3 Contents 3 7 Extra Slides 28
4 1 Warm-up: Why condition on pre-treatment variables? 4 1 Warm-up: Why condition on pre-treatment variables?...in a study of the effects of a treatment A (e.g., intervention to help reduce depression) on a response Y (e.g., depression)?
5 1 Warm-up: Why condition on pre-treatment variables? 5 We want the effect of A on Y. Why condition on (adjust for) pre-treatment variables S? 1. Confounding: S is correlated with both A and Y. In this case, S is known as a confounder of the effect of A on Y. 2. Precision: S may be a pre-treatment measure of Y, or any other variable highly correlated with Y. 3. Missing Data: The outcome Y is missing for some units, S and A predict missingness, and S is associated with Y. 4. Effect Heterogeneity: S may moderate, temper, or specify the effect of A on Y. In this case, S is known as a moderator of the effect of A on Y. Formalized in next slide.
6 1 Warm-up: Why condition on pre-treatment variables? 6 Effect Moderation in One Time Point Definition: Fix a 0. Let µ(s, a) E[Y (a) Y (0) S = s]. If µ(s, a) is non-constant in s, then S is said to be a moderator of the effect of a on Y. Example: If the structural model for the conditional mean of Y (a) given S is E[Y (a) S = s] = β 0 + γ 1 s + β 1 a + β 2 sa, then µ(s, a) = β 1 a + β 2 sa. In this example, S is a moderator of the effect of a on Y if β 2 0.
7 1 Warm-up: Why condition on pre-treatment variables? 7 Relevance of Effect Moderation Two main implications: Theoretical Implication: Understanding the heterogeneity of the effects of causes enhances our understanding of various/competing scientific theories; and it may suggest new scientific (etiologic) hypotheses to be tested. Practical Implication: Identifying types, or subgroups, of individuals for which treatment is not effective may suggest altering the treatment to suit the needs of that particular type of individual.
8 2 Time-Varying Effect Moderation 8 2 Time-Varying Effect Moderation The focus of this paper is time-varying effect moderation. The data structure in the time-varying setting is {S 1, a 1, S 2 (a 1 ), a 2, Y (a 1, a 2 )} = {ā 2, S 2 (a 1 ), Y (ā 2 )}. Running Example, from the PROSPECT Study: (a 1, a 2 ) Time-varying treatment pattern; a t is binary (0,1) Y (a 1, a 2 ) Depression at the end of the study; continuous S 1 S 2 (a 1 ) Suicidal Ideation at baseline visit; continuous Suicidal Ideation at second visit; continuous
9 2 Time-Varying Effect Moderation 9 Formal Definition of Time-Varying Causal Effects Recall the data structure {S 1, a 1, S 2 (a 1 ), a 2, Y (a 1, a 2 )}. Conditional Intermediate Causal Effect at t = 1: µ 1 (s 1, a 1 ) E[Y (a 1, 0) Y (0, 0) S 1 = s 1 ] Conditional Intermediate Causal Effect at t = 2: µ 2 ( s 2, ā 2 ) E[Y (a 1, a 2 ) Y (a 1, 0) S 1 = s 1, S 2 (a 1 ) = s 2 ]
10 2 Time-Varying Effect Moderation 10 As a Decomposition of the Marginal Causal Effect Recall the data structure {S 1, a 1, S 2 (a 1 ), a 2, Y (a 1, a 2 )}. Consider the following arithmetic decomposition of the causal effect of (a 1, a 2 ) on Y, using the covariates S 2 (a 1 ): E [ Y (a 1, a 2 ) Y (0, 0) ] = E [E [ Y (a 1, a 2 ) Y (a 1, 0) S 2 (a 1 ) ]] [ + E E [ ] ] Y (a 1, 0) Y (0, 0) S 1. The inner expectations represent the conditional intermediate causal effects µ 1 and µ 2, respectively.
11 3 Robins Structural Nested Mean Model 11 3 Robins Structural Nested Mean Model Recall the data structure {S 1, a 1, S 2 (a 1 ), a 2, Y (a 1, a 2 )}. The SNMM for the conditional mean of Y (a 1, a 2 ) given S 2 (a 1 ) is: E [ Y (a 1, a 2 ) S 2 (a 1 ) ] { } = E[Y (0, 0)] + E[Y (0, 0) S 1 ] E[Y (0, 0)] + {E [ ] } Y(a 1, 0) Y(0, 0) S 1 { + E[Y (a 1, 0) S } 2 (a 1 )] E[Y (a 1, 0) S 1 ] + {E [ Y(a 1, a 2 ) Y(a 1, 0) S 2 (a 1 ) ]}
12 3 Robins Structural Nested Mean Model 12 The Causal versus Nuisance Functionals Recall the data structure {S 1, a 1, S 2 (a 1 ), a 2, Y (a 1, a 2 )}. The decomposition can be written more succinctly as: E [ Y (a 1, a 2 ) S ] 2 (a 1 ) = s 2 = µ0 + ɛ 1 (s 1 ) + µ 1 (s 1, a 1 ) + ɛ 2 ( s 2, a 1 ) + µ 2 ( s 2, ā 2 ), where ɛ 2 ( s 2, a 1 ) = E[Y (a 1, 0) S 2 (a 1 ) = s 2 ] E[Y (a 1, 0) S 1 = s 1 ], ɛ 1 (s 1 ) = E[Y (0, 0) S 1 = s 1 ] E[Y (0, 0)], µ 2 ( s 2, a 2, 0) = 0 and µ 1 (s 1, 0) = 0, E S2 S 1 [ɛ 2 ( s 2, a 1 ) S 1 = s 1 ] = 0, and E S1 [ɛ 1 (s 1 )] = 0.
13 3 Robins Structural Nested Mean Model 13 Parametrizing the Intermediate Causal Effects Recall the data structure {S 1, a 1, S 2 (a 1 ), a 2, Y (a 1, a 2 )}. Conditional Intermediate Causal Effect at t = 1: µ 1 (s 1, a 1 ) E[Y (a 1, 0) Y (0, 0) S 1 = s 1 ] = a 1 H 1 β 1 (Linear Parameterization) say = a 1 (β 11 + β 12 f 1 (s 1 )) (Sample) Conditional Intermediate Causal Effect at t = 2: µ 2 ( s 2, ā 2 ) E[Y (a 1, a 2 ) Y (a 1, 0) S 1 = s 1, S 2 (a 1 ) = s 2 ] = a 2 H 2 β 2 (Linear Parameterization) say = a 2 (β 21 + β 22 f 2 (s 1, s 2 ) + β 23 a 1 ) (Sample)
14 4 Estimation in Time-Varying Setting 14 4 Estimation in Time-Varying Setting Assume: True models for µ t s Consistency Assumptions Sequential Ignorability (given S 2 (a 1 )) Assumption In the sequel, we consider three estimators for β: 1. Traditional (Naive) Regression Estimator 2. Proposed 2-Stage Regression Estimator 3. Robins Semi-parametric G-Estimator Two flavors of the Semi-parametric G-Estimator
15 4 Estimation in Time-Varying Setting 15 What s wrong with the Traditional Estimator? An Example of The Traditional Estimator: Apply OLS with E(Y S 2 = s 2, Ā2 = ā 2 ) = β 0 + η 1 s 1 + a 1 (β 1 + β 2s 1 ) + η 2 s 2 + a 2 (β 3 + β 4(s 1 + s 2 )/2) - Possibly incorrectly specified nuisance functions. - Two problems arise when using the traditional regression estimator. - These problems occur even in the absence of time-varying confounders (that is, even under Sequential Randomization).
16 4 Estimation in Time-Varying Setting 16 First problem with the Traditional Estimator E(Y S 2 = s 2, Ā2 = ā 2 ) = β 0 + η 1 s 1 + a 1 (β 1 + β 2s 1 ) + η 2 s 2 + a 2 (β 3 + β 4(s 1 + s 2 )/2) Wrong Effect Baseline 4-month Visit 8-month Visit a Set 1 a 2 = 0 S 1 S 2 (a 1 ) Y (a 1, 0) What about the effect transmitted through S 2 (a 1 )?
17 4 Estimation in Time-Varying Setting 17 Second problem with the Traditional Estimator E(Y S 2 = s 2, Ā2 = ā 2 ) = β0 + η 1 s 1 + a 1 (β1 + β2s 1 ) + η 2 s 2 + a 2 (β3 + β4(s 1 + s 2 )/2) Spurious Effect Baseline 4-month Visit 8-month Visit V 0 S 1 a Set 1 a 2 = 0 S 2 (a 1 ) Y (a 1, 0) Berkson s paradox; Judea Pearl s backdoor criterion
18 4 Estimation in Time-Varying Setting 18 Proposed 2-Stage Estimator Approach Recall the nuisance function constraints: E S2 S 1 [ɛ 2 ( s 2, a 1 ) S 1 = s 1 ] = 0, and E S1 [ɛ 1 (s 1 )] = 0. General steps for the 2-Stage Approach: 1. Model the distributions f 1 (S 1 ; γ 1 ) and f 2 (S 2 S 1, A 1 ; γ 2 ) 2. Using η for added flexibility, construct nuisance models ɛ 1 (s 1 ; γ 1, η 1 ) and ɛ 2 ( s 2, a 1 ; γ 2, η 2 ) that satisfy the constraints 3. Then solve for β in the following OLS estimating equations: { ( 2 2 ) 0 = P n Y µ 0 A t H t β t ɛ t ( s t, a t 1 ; γ t, η t ) A } 1H 1 t=1 t=1 A 2 H 2
19 4 Estimation in Time-Varying Setting 19 Semi-parametric G-Estimator Approach The G-Estimator is the solution to these estimating equations: { ( 0 = P n Y A 2 H 2 β 2 b 2 ( S ) ( 2, A 1 ) A 2 p 2 ( S ) 2, A 1 ) 0 H 2 } ( ) ( ) + Y A 2 H 2 β 2 A 1 H 1 β 1 b 1 (S 1 ) A 1 p 1 (S 1 ) H 1 (H 1 ) [ ] H 2 A 2 S 1, A 1 = 1 (H 1 ) = E E b 2 ( S 2, A 1 ) = E [ Y A 2 H 2 β 2 S ] 2, A 1 p 2 ( S 2, A 1 ) = P r [ A 2 = 1 S ] 2, A 1 b 1 (S 1 ) = E [Y A 2 H 2 β 2 A 1 H 1 β 1 S 1 ] p 1 (S 1 ) = P r [A 1 = 1 S 2 ] [ ] H 2 A 2 S 1, A 1 = 0
20 5 Bias-Variance Trade-off 20 5 Bias-Variance Trade-off This discussion assumes true models for the causal effects, the µ t s: Robins G-Estimator is unbiased if either p t or b t are correctly specified. So-called double-robustness property. Robins G-Estimator is semi-parametric efficient if p t, b t, and are all correctly specified. 2-Stage Regression Estimator is unbiased only if the nuisance functions, the ɛ t s, are correctly specified. 2-Stage Regression Estimator with correctly specified nuisance is more efficient than G-Estimator What happens as we mis-specify the nuisance functions, the ɛ t s?
21 5 Bias-Variance Trade-off 21 A Simulation Study of the Bias-Variance Trade-off 1. Generate Y ( S 3, Ā3) using a SNMM; use n = 300, K = 3 2. Fit 2-Stage Estimator and two versions of G-Estimator: Linear Version of 2-Stage Estimator Robins G-Estimator with b t = 0 Robins G-Estimator using 2-Stage Guesses for b t 3. Repeat N = 1000 times 4. Repeat for different ɛ t -mis-specified 2-Stage fits 5. Based on MSE of estimates of β, compare bias-variance trade-off of the 2-Stage Estimator versus the G-Estimator as you move away from true fit
22 5 Bias-Variance Trade-off 22 Specifying/Measuring Mis-specification of ɛ t How do we measure the distance of the fitted ɛ t (ν) from the true nuisance function ɛ TRUE t? SRMSD(ν) = K t E(ɛ TRUE t ɛ t (ν)) 2 V ar(y ) Note that this has an effect-size-like interpretation. Thus, SRMSD=0.5 is about a moderate amount of mis-specification.
23 5 Bias-Variance Trade-off 23 Simulation Experiment Results
24 5 Bias-Variance Trade-off 24 REL MSE (G Estimator to 2 Stage Estimator) SRMSD β 10 β 11 β 20 β 21 β 30 β SRMSD G Estimator with b = 0 G Estimator with b = 2 Stage Guess
25 6 Conclusions 25 6 Conclusions 1. Offered a way to think about time-varying effect moderation 2. The bias-variance trade-off is real and important. 3. G-Estimator starts to dominate (in terms of MSE) when amount of ɛ t -mis-specifiction is moderate 4. 2-Stage Estimator provides a principled method by which to construct high quality b t guesses for the G-Estimator Stage Estimator is intuitive, and can stand alone as its own estimator. 6. Linear implementation of 2-Stage Estimator is easy to carry out, but standard errors are difficult. (Bootstrap SEs?)
26 6 Conclusions 26 Main References Robins JM (1994). Correcting for Non-compliance in Randomized Trials Using Structural Nested Mean Models. Communications in Statistics, Theory and Methods, 23, Almirall D, Ten Have T, Murphy SA. Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Submitted.
27 6 Conclusions 27 Thank you! More Questions?
28 7 Extra Slides 28 7 Extra Slides
29 7 Extra Slides 29 Linear Implementation of the 2-Stage Approach
30 7 Extra Slides 30 Linear Parametrization of the Nuisance Functions So we must parameterize the nuisance functions correctly. Recall the constraints on the nuisance functions: ɛ 2 ( s 2, a 1 ) = E[Y (a 1, 0) S 2 (a 1 ) = s 2 ] E[Y (a 1, 0) S 1 = s 1 ], ɛ 1 (s 1 ) = E[Y (0, 0) S 1 = s 1 ] E[Y (0, 0)], E S2 S 1 [ɛ 2 ( s 2, a 1 ) S 1 = s 1 ] = 0, and E S1 [ɛ 1 (s 1 )] = 0. Example parameterizations for the nuisance functions: ɛ 1 (s 1 ) say ( = η 1,1 s1 E(S 1 ) ) ɛ 2 ( s 2, a 1 ) say = ( η 2,1 + η 2,2 s 1 )( s2 E(S 2 (a 1 ) S 1 = s 1 ) )
31 7 Extra Slides 31 Steps for Linear Implementation Recall that E [ Y (a 1, a 2 ) S 2 (a 1 ) = s 2 ] = µ2 ( s 2, ā 2 ; β 2 ) + ɛ 2 ( s 2, a 1 ; η 2, γ 2 ) + µ 1 (s 1, a 1 ; β 1 ) + ɛ 1 (s 1 ; η 1, γ 1 ) + µ We have models for the µ s: A 1 H 1 β 1 and A 2 H 2 β 2 ; Set aside 2. Model m 1 (γ 1 ) = E(S 1 ), estimate γ 1 with GLM; model m 2 (s 1, a 1 ; γ 2 ) = E(S 2 (a 1 ) S 1 = s 1 ), estimate γ 2 with GLM 3. Construct residuals ˆδ 1 = s 1 ˆm 1 (ˆγ 1 ) and ˆδ 2 = s 2 ˆm 2 (s 1, a 1 ; ˆγ 2 ) 4. Construct models for ɛ s: G 1ˆδ1 η 1 = G 1 η 1 and G 2ˆδ2 η 2 = G 2 η 2 5. Obtain ˆβ and ˆη using OLS of Y [1, G 1, A 1H 1, G 2, A 2H 2 ]
32 7 Extra Slides 32 An Illustrative Data Analysis
33 7 Extra Slides 33 The Data n = 277 geriatric primary care patients from PROSPECT Study K = 3: visits to clinic at baseline, 4-, 8-, and 12-months Data structure is {S 1, A 1, S 2, A 2, S 3, A 3, Y } S t = suicidal ideation, A t = adherence, Y = 12-month depression Treatment is defined as ever meeting with a health specialist Monotonic treatment pattern: (0, 0, 0), (1, 0, 0), (1, 1, 0), (1, 1, 1) Assumptions: - Consistency/SUTVA - Sequential Ignorability given S 3 (very likely violated) - Modeling assumptions
34 7 Extra Slides 34 Models Used in the Illustrative Analysis Causal effects: (expect β t0 < 0 and β t1 > 0) 1. µ 1 (S 1, a 1 ) = a 1 (β 10 + β 11 S 1 ), 2. µ 2 ( S 2, ā 2 ) = a 2 (β 20 + β 21 (S 1 + S 2 ) /2), and 3. µ 3 ( S 3, ā 3 ) = a 3 (β 30 + β 31 (S 1 + S 2 + S 3 )) /3). Nuisance functions: 1. ɛ 1 (S 1 ) = η 1,1 (S 1 γ 1,1 ) 2. ɛ 2 ( S 2, A 1 ) = η 2,1 (S 2 (γ 2,1 + γ 2,2 S 1 + γ 2,3 A 1 + γ 2,4 S 1 A 1 + γ 2,4 S1 2)) 3. ɛ 3 ( S 3, Ā2) = (η 3,1 + η 3,2 S 2 ) (S 3 (γ 3,1 + γ 3,2 S 1 + γ 3,3 A 1 + γ 3,4 S 1 A 1 + γ 3,5 S 2 + γ 3,6 S1 2 + γ 3,7S2 2)).
35 7 Extra Slides 35 Results 2-Stage Estimator Robins G-Estimator Parameters β ŜE( β) P r(> z ) β ŜE( β) P r(> z ) Intercept β < 0.01 ɛ 1 η 1, µ 1 β 1, β 1, ɛ 2 η 2, µ 2 β 2, β 2, ɛ 3 η 3, η 3, µ 3 β 3, < 0.01 β 3, < 0.01 G-Estimation does not produce estimates for the nuisance functions.
Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall
1 Structural Nested Mean Models for Assessing Time-Varying Effect Moderation Daniel Almirall Center for Health Services Research, Durham VAMC & Duke University Medical, Dept. of Biostatistics Joint work
More informationDaniel Almirall 1, Daniel F. McCaffrey 2, Beth Ann Griffin 2, Rajeev Ramchand 2, Susan A. Murphy 1
1 Examining moderated effects of additional adolescent substance use treatment: Structural nested mean model estimation using inverse-weighted regression-with-residuals Daniel Almirall 1, Daniel F. McCaffrey
More informationDaniel Almirall 1, Daniel F. McCaffrey 2, Beth Ann Griffin 2, Rajeev Ramchand 2, Susan A. Murphy 1
1 Examining moderated effects of additional adolescent substance use treatment: Structural nested mean model estimation using inverse-weighted regression-with-residuals Daniel Almirall 1, Daniel F. McCaffrey
More informationExtending causal inferences from a randomized trial to a target population
Extending causal inferences from a randomized trial to a target population Issa Dahabreh Center for Evidence Synthesis in Health, Brown University issa dahabreh@brown.edu January 16, 2019 Issa Dahabreh
More informationGeoffrey T. Wodtke. University of Toronto. Daniel Almirall. University of Michigan. Population Studies Center Research Report July 2015
Estimating Heterogeneous Causal Effects with Time-Varying Treatments and Time-Varying Effect Moderators: Structural Nested Mean Models and Regression-with-Residuals Geoffrey T. Wodtke University of Toronto
More informationFor more information about how to cite these materials visit
Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/
More informationLongitudinal Nested Compliance Class Model in the Presence of Time-Varying Noncompliance
Longitudinal Nested Compliance Class Model in the Presence of Time-Varying Noncompliance Julia Y. Lin Thomas R. Ten Have Michael R. Elliott Julia Y. Lin is a doctoral candidate (E-mail: jlin@cceb.med.upenn.edu),
More informationWhen Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?
When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University Joint
More informationCasual Mediation Analysis
Casual Mediation Analysis Tyler J. VanderWeele, Ph.D. Upcoming Seminar: April 21-22, 2017, Philadelphia, Pennsylvania OXFORD UNIVERSITY PRESS Explanation in Causal Inference Methods for Mediation and Interaction
More informationWhen Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?
When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Princeton University Asian Political Methodology Conference University of Sydney Joint
More informationPropensity Score Weighting with Multilevel Data
Propensity Score Weighting with Multilevel Data Fan Li Department of Statistical Science Duke University October 25, 2012 Joint work with Alan Zaslavsky and Mary Beth Landrum Introduction In comparative
More informationMarginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal
Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Overview In observational and experimental studies, the goal may be to estimate the effect
More informationTelescope Matching: A Flexible Approach to Estimating Direct Effects
Telescope Matching: A Flexible Approach to Estimating Direct Effects Matthew Blackwell and Anton Strezhnev International Methods Colloquium October 12, 2018 direct effect direct effect effect of treatment
More informationLecture 9: Learning Optimal Dynamic Treatment Regimes. Donglin Zeng, Department of Biostatistics, University of North Carolina
Lecture 9: Learning Optimal Dynamic Treatment Regimes Introduction Refresh: Dynamic Treatment Regimes (DTRs) DTRs: sequential decision rules, tailored at each stage by patients time-varying features and
More informationPersonalized Treatment Selection Based on Randomized Clinical Trials. Tianxi Cai Department of Biostatistics Harvard School of Public Health
Personalized Treatment Selection Based on Randomized Clinical Trials Tianxi Cai Department of Biostatistics Harvard School of Public Health Outline Motivation A systematic approach to separating subpopulations
More informationGov 2002: 4. Observational Studies and Confounding
Gov 2002: 4. Observational Studies and Confounding Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last two weeks: randomized experiments. From here on: observational studies. What
More informationMissing Covariate Data in Matched Case-Control Studies
Missing Covariate Data in Matched Case-Control Studies Department of Statistics North Carolina State University Paul Rathouz Dept. of Health Studies U. of Chicago prathouz@health.bsd.uchicago.edu with
More informationMeasurement error effects on bias and variance in two-stage regression, with application to air pollution epidemiology
Measurement error effects on bias and variance in two-stage regression, with application to air pollution epidemiology Chris Paciorek Department of Statistics, University of California, Berkeley and Adam
More informationDATA-ADAPTIVE VARIABLE SELECTION FOR
DATA-ADAPTIVE VARIABLE SELECTION FOR CAUSAL INFERENCE Group Health Research Institute Department of Biostatistics, University of Washington shortreed.s@ghc.org joint work with Ashkan Ertefaie Department
More informationIdentification and Inference in Causal Mediation Analysis
Identification and Inference in Causal Mediation Analysis Kosuke Imai Luke Keele Teppei Yamamoto Princeton University Ohio State University November 12, 2008 Kosuke Imai (Princeton) Causal Mediation Analysis
More informationSpecification Errors, Measurement Errors, Confounding
Specification Errors, Measurement Errors, Confounding Kerby Shedden Department of Statistics, University of Michigan October 10, 2018 1 / 32 An unobserved covariate Suppose we have a data generating model
More informationPubh 8482: Sequential Analysis
Pubh 8482: Sequential Analysis Joseph S. Koopmeiners Division of Biostatistics University of Minnesota Week 12 Review So far... We have discussed the role of phase III clinical trials in drug development
More informationWhen Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?
When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University
More informationCausal Mechanisms Short Course Part II:
Causal Mechanisms Short Course Part II: Analyzing Mechanisms with Experimental and Observational Data Teppei Yamamoto Massachusetts Institute of Technology March 24, 2012 Frontiers in the Analysis of Causal
More informationRegression #3: Properties of OLS Estimator
Regression #3: Properties of OLS Estimator Econ 671 Purdue University Justin L. Tobias (Purdue) Regression #3 1 / 20 Introduction In this lecture, we establish some desirable properties associated with
More informationAdaptive Trial Designs
Adaptive Trial Designs Wenjing Zheng, Ph.D. Methods Core Seminar Center for AIDS Prevention Studies University of California, San Francisco Nov. 17 th, 2015 Trial Design! Ethical:!eg.! Safety!! Efficacy!
More informationRunning Head: Effect Heterogeneity with Time-varying Treatments and Moderators ESTIMATING HETEROGENEOUS CAUSAL EFFECTS WITH TIME-VARYING
Running Head: Effect Heterogeneity with Time-varying Treatments and Moderators ESTIMATING HETEROGENEOUS CAUSAL EFFECTS WITH TIME-VARYING TREATMENTS AND TIME-VARYING EFFECT MODERATORS: STRUCTURAL NESTED
More informationCategorical Predictor Variables
Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively
More informationQuantitative Economics for the Evaluation of the European Policy
Quantitative Economics for the Evaluation of the European Policy Dipartimento di Economia e Management Irene Brunetti Davide Fiaschi Angela Parenti 1 25th of September, 2017 1 ireneb@ec.unipi.it, davide.fiaschi@unipi.it,
More informationDifference-in-Differences Methods
Difference-in-Differences Methods Teppei Yamamoto Keio University Introduction to Causal Inference Spring 2016 1 Introduction: A Motivating Example 2 Identification 3 Estimation and Inference 4 Diagnostics
More informationUnpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies
Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies Kosuke Imai Princeton University February 23, 2012 Joint work with L. Keele (Penn State)
More informationarxiv: v2 [stat.me] 31 Dec 2012
1 arxiv:1109.1070v2 [stat.me] 31 Dec 2012 1 Mediation Analysis Without Sequential Ignorability: Using Baseline Covariates Interacted with Random Assignment as Instrumental Variables Dylan S. Small University
More information6.3 How the Associational Criterion Fails
6.3. HOW THE ASSOCIATIONAL CRITERION FAILS 271 is randomized. We recall that this probability can be calculated from a causal model M either directly, by simulating the intervention do( = x), or (if P
More informationUnpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies
Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies Kosuke Imai Princeton University January 23, 2012 Joint work with L. Keele (Penn State)
More informationCausal Effect Estimation Under Linear and Log- Linear Structural Nested Mean Models in the Presence of Unmeasured Confounding
University of Pennsylvania ScholarlyCommons Publicly Accessible Penn Dissertations Summer 8-13-2010 Causal Effect Estimation Under Linear and Log- Linear Structural Nested Mean Models in the Presence of
More informationTargeted Maximum Likelihood Estimation in Safety Analysis
Targeted Maximum Likelihood Estimation in Safety Analysis Sam Lendle 1 Bruce Fireman 2 Mark van der Laan 1 1 UC Berkeley 2 Kaiser Permanente ISPE Advanced Topics Session, Barcelona, August 2012 1 / 35
More informationWhether to use MMRM as primary estimand.
Whether to use MMRM as primary estimand. James Roger London School of Hygiene & Tropical Medicine, London. PSI/EFSPI European Statistical Meeting on Estimands. Stevenage, UK: 28 September 2015. 1 / 38
More informationIP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS IPW and MSM
IP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS 776 1 12 IPW and MSM IP weighting and marginal structural models ( 12) Outline 12.1 The causal question 12.2 Estimating IP weights via modeling
More informationCombining multiple observational data sources to estimate causal eects
Department of Statistics, North Carolina State University Combining multiple observational data sources to estimate causal eects Shu Yang* syang24@ncsuedu Joint work with Peng Ding UC Berkeley May 23,
More informationHelp! Statistics! Mediation Analysis
Help! Statistics! Lunch time lectures Help! Statistics! Mediation Analysis What? Frequently used statistical methods and questions in a manageable timeframe for all researchers at the UMCG. No knowledge
More informationCausal Mediation Analysis in R. Quantitative Methodology and Causal Mechanisms
Causal Mediation Analysis in R Kosuke Imai Princeton University June 18, 2009 Joint work with Luke Keele (Ohio State) Dustin Tingley and Teppei Yamamoto (Princeton) Kosuke Imai (Princeton) Causal Mediation
More informationUniversity of California, Berkeley
University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2004 Paper 155 Estimation of Direct and Indirect Causal Effects in Longitudinal Studies Mark J. van
More informationA Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness
A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A. Linero and M. Daniels UF, UT-Austin SRC 2014, Galveston, TX 1 Background 2 Working model
More informationDiscussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs
Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs Michael J. Daniels and Chenguang Wang Jan. 18, 2009 First, we would like to thank Joe and Geert for a carefully
More informationSEQUENTIAL MULTIPLE ASSIGNMENT RANDOMIZATION TRIALS WITH ENRICHMENT (SMARTER) DESIGN
SEQUENTIAL MULTIPLE ASSIGNMENT RANDOMIZATION TRIALS WITH ENRICHMENT (SMARTER) DESIGN Ying Liu Division of Biostatistics, Medical College of Wisconsin Yuanjia Wang Department of Biostatistics & Psychiatry,
More informationOnline Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha
Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha January 18, 2010 A2 This appendix has six parts: 1. Proof that ab = c d
More informationSTA441: Spring Multiple Regression. More than one explanatory variable at the same time
STA441: Spring 2016 Multiple Regression More than one explanatory variable at the same time This slide show is a free open source document. See the last slide for copyright information. One Explanatory
More informationRegression I: Mean Squared Error and Measuring Quality of Fit
Regression I: Mean Squared Error and Measuring Quality of Fit -Applied Multivariate Analysis- Lecturer: Darren Homrighausen, PhD 1 The Setup Suppose there is a scientific problem we are interested in solving
More informationStatistical Analysis of Causal Mechanisms
Statistical Analysis of Causal Mechanisms Kosuke Imai Princeton University November 17, 2008 Joint work with Luke Keele (Ohio State) and Teppei Yamamoto (Princeton) Kosuke Imai (Princeton) Causal Mechanisms
More informationPropensity Score Methods for Causal Inference
John Pura BIOS790 October 2, 2015 Causal inference Philosophical problem, statistical solution Important in various disciplines (e.g. Koch s postulates, Bradford Hill criteria, Granger causality) Good
More informationChapter 2: simple regression model
Chapter 2: simple regression model Goal: understand how to estimate and more importantly interpret the simple regression Reading: chapter 2 of the textbook Advice: this chapter is foundation of econometrics.
More informationUniversity of California, Berkeley
University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2010 Paper 260 Collaborative Targeted Maximum Likelihood For Time To Event Data Ori M. Stitelman Mark
More informationStat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010
1 Linear models Y = Xβ + ɛ with ɛ N (0, σ 2 e) or Y N (Xβ, σ 2 e) where the model matrix X contains the information on predictors and β includes all coefficients (intercept, slope(s) etc.). 1. Number of
More informationCross-Sectional Regression after Factor Analysis: Two Applications
al Regression after Factor Analysis: Two Applications Joint work with Jingshu, Trevor, Art; Yang Song (GSB) May 7, 2016 Overview 1 2 3 4 1 / 27 Outline 1 2 3 4 2 / 27 Data matrix Y R n p Panel data. Transposable
More information1. The OLS Estimator. 1.1 Population model and notation
1. The OLS Estimator OLS stands for Ordinary Least Squares. There are 6 assumptions ordinarily made, and the method of fitting a line through data is by least-squares. OLS is a common estimation methodology
More information10. Time series regression and forecasting
10. Time series regression and forecasting Key feature of this section: Analysis of data on a single entity observed at multiple points in time (time series data) Typical research questions: What is the
More informationSIMPLE EXAMPLES OF ESTIMATING CAUSAL EFFECTS USING TARGETED MAXIMUM LIKELIHOOD ESTIMATION
Johns Hopkins University, Dept. of Biostatistics Working Papers 3-3-2011 SIMPLE EXAMPLES OF ESTIMATING CAUSAL EFFECTS USING TARGETED MAXIMUM LIKELIHOOD ESTIMATION Michael Rosenblum Johns Hopkins Bloomberg
More informationPropensity Score Analysis with Hierarchical Data
Propensity Score Analysis with Hierarchical Data Fan Li Alan Zaslavsky Mary Beth Landrum Department of Health Care Policy Harvard Medical School May 19, 2008 Introduction Population-based observational
More informationConceptual overview: Techniques for establishing causal pathways in programs and policies
Conceptual overview: Techniques for establishing causal pathways in programs and policies Antonio A. Morgan-Lopez, Ph.D. OPRE/ACF Meeting on Unpacking the Black Box of Programs and Policies 4 September
More informationUniversity of California, Berkeley
University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2011 Paper 290 Targeted Minimum Loss Based Estimation of an Intervention Specific Mean Outcome Mark
More informationPSC 504: Instrumental Variables
PSC 504: Instrumental Variables Matthew Blackwell 3/28/2013 Instrumental Variables and Structural Equation Modeling Setup e basic idea behind instrumental variables is that we have a treatment with unmeasured
More informationCausal Inference. Miguel A. Hernán, James M. Robins. May 19, 2017
Causal Inference Miguel A. Hernán, James M. Robins May 19, 2017 ii Causal Inference Part III Causal inference from complex longitudinal data Chapter 19 TIME-VARYING TREATMENTS So far this book has dealt
More informationGEE for Longitudinal Data - Chapter 8
GEE for Longitudinal Data - Chapter 8 GEE: generalized estimating equations (Liang & Zeger, 1986; Zeger & Liang, 1986) extension of GLM to longitudinal data analysis using quasi-likelihood estimation method
More informationChapter 1. Linear Regression with One Predictor Variable
Chapter 1. Linear Regression with One Predictor Variable 1.1 Statistical Relation Between Two Variables To motivate statistical relationships, let us consider a mathematical relation between two mathematical
More informationLinear models and their mathematical foundations: Simple linear regression
Linear models and their mathematical foundations: Simple linear regression Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/21 Introduction
More informationGov 2000: 9. Regression with Two Independent Variables
Gov 2000: 9. Regression with Two Independent Variables Matthew Blackwell Fall 2016 1 / 62 1. Why Add Variables to a Regression? 2. Adding a Binary Covariate 3. Adding a Continuous Covariate 4. OLS Mechanics
More informationSTA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.
STA441: Spring 2018 Multiple Regression This slide show is a free open source document. See the last slide for copyright information. 1 Least Squares Plane 2 Statistical MODEL There are p-1 explanatory
More informationBIOS 6649: Handout Exercise Solution
BIOS 6649: Handout Exercise Solution NOTE: I encourage you to work together, but the work you submit must be your own. Any plagiarism will result in loss of all marks. This assignment is based on weight-loss
More informationA Sampling of IMPACT Research:
A Sampling of IMPACT Research: Methods for Analysis with Dropout and Identifying Optimal Treatment Regimes Marie Davidian Department of Statistics North Carolina State University http://www.stat.ncsu.edu/
More informationSelection on Observables: Propensity Score Matching.
Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017
More informationK. Model Diagnostics. residuals ˆɛ ij = Y ij ˆµ i N = Y ij Ȳ i semi-studentized residuals ω ij = ˆɛ ij. studentized deleted residuals ɛ ij =
K. Model Diagnostics We ve already seen how to check model assumptions prior to fitting a one-way ANOVA. Diagnostics carried out after model fitting by using residuals are more informative for assessing
More informationCausal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions
Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Joe Schafer Office of the Associate Director for Research and Methodology U.S. Census
More informationGLS and FGLS. Econ 671. Purdue University. Justin L. Tobias (Purdue) GLS and FGLS 1 / 22
GLS and FGLS Econ 671 Purdue University Justin L. Tobias (Purdue) GLS and FGLS 1 / 22 In this lecture we continue to discuss properties associated with the GLS estimator. In addition we discuss the practical
More informationStatistical Distribution Assumptions of General Linear Models
Statistical Distribution Assumptions of General Linear Models Applied Multilevel Models for Cross Sectional Data Lecture 4 ICPSR Summer Workshop University of Colorado Boulder Lecture 4: Statistical Distributions
More informationTelescope Matching: A Flexible Approach to Estimating Direct Effects *
Telescope Matching: A Flexible Approach to Estimating Direct Effects * Matthew Blackwell Anton Strezhnev August 4, 2018 Abstract Estimating the direct effect of a treatment fixing the value of a consequence
More informationStatistical Methods. Missing Data snijders/sm.htm. Tom A.B. Snijders. November, University of Oxford 1 / 23
1 / 23 Statistical Methods Missing Data http://www.stats.ox.ac.uk/ snijders/sm.htm Tom A.B. Snijders University of Oxford November, 2011 2 / 23 Literature: Joseph L. Schafer and John W. Graham, Missing
More informationMeasurement Error in Spatial Modeling of Environmental Exposures
Measurement Error in Spatial Modeling of Environmental Exposures Chris Paciorek, Alexandros Gryparis, and Brent Coull August 9, 2005 Department of Biostatistics Harvard School of Public Health www.biostat.harvard.edu/~paciorek
More informationUniversity of California, Berkeley
University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2010 Paper 269 Diagnosing and Responding to Violations in the Positivity Assumption Maya L. Petersen
More informationExtending the results of clinical trials using data from a target population
Extending the results of clinical trials using data from a target population Issa Dahabreh Center for Evidence-Based Medicine, Brown School of Public Health Disclaimer Partly supported through PCORI Methods
More informationGov 2002: 13. Dynamic Causal Inference
Gov 2002: 13. Dynamic Causal Inference Matthew Blackwell December 19, 2015 1 / 33 1. Time-varying treatments 2. Marginal structural models 2 / 33 1/ Time-varying treatments 3 / 33 Time-varying treatments
More informationComparative effectiveness of dynamic treatment regimes
Comparative effectiveness of dynamic treatment regimes An application of the parametric g- formula Miguel Hernán Departments of Epidemiology and Biostatistics Harvard School of Public Health www.hsph.harvard.edu/causal
More information1 Motivation for Instrumental Variable (IV) Regression
ECON 370: IV & 2SLS 1 Instrumental Variables Estimation and Two Stage Least Squares Econometric Methods, ECON 370 Let s get back to the thiking in terms of cross sectional (or pooled cross sectional) data
More informationPractical Considerations Surrounding Normality
Practical Considerations Surrounding Normality Prof. Kevin E. Thorpe Dalla Lana School of Public Health University of Toronto KE Thorpe (U of T) Normality 1 / 16 Objectives Objectives 1. Understand the
More informationDouble Robustness. Bang and Robins (2005) Kang and Schafer (2007)
Double Robustness Bang and Robins (2005) Kang and Schafer (2007) Set-Up Assume throughout that treatment assignment is ignorable given covariates (similar to assumption that data are missing at random
More informationCausality II: How does causal inference fit into public health and what it is the role of statistics?
Causality II: How does causal inference fit into public health and what it is the role of statistics? Statistics for Psychosocial Research II November 13, 2006 1 Outline Potential Outcomes / Counterfactual
More informationIntegrated approaches for analysis of cluster randomised trials
Integrated approaches for analysis of cluster randomised trials Invited Session 4.1 - Recent developments in CRTs Joint work with L. Turner, F. Li, J. Gallis and D. Murray Mélanie PRAGUE - SCT 2017 - Liverpool
More informationCausal Inference with Big Data Sets
Causal Inference with Big Data Sets Marcelo Coca Perraillon University of Colorado AMC November 2016 1 / 1 Outlone Outline Big data Causal inference in economics and statistics Regression discontinuity
More informationComparing Adaptive Interventions Using Data Arising from a SMART: With Application to Autism, ADHD, and Mood Disorders
Comparing Adaptive Interventions Using Data Arising from a SMART: With Application to Autism, ADHD, and Mood Disorders Daniel Almirall, Xi Lu, Connie Kasari, Inbal N-Shani, Univ. of Michigan, Univ. of
More informationUnbiased estimation of exposure odds ratios in complete records logistic regression
Unbiased estimation of exposure odds ratios in complete records logistic regression Jonathan Bartlett London School of Hygiene and Tropical Medicine www.missingdata.org.uk Centre for Statistical Methodology
More informationWeighting. Homework 2. Regression. Regression. Decisions Matching: Weighting (0) W i. (1) -å l i. )Y i. (1-W i 3/5/2014. (1) = Y i.
Weighting Unconfounded Homework 2 Describe imbalance direction matters STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University
More informationSelective Inference for Effect Modification
Inference for Modification (Joint work with Dylan Small and Ashkan Ertefaie) Department of Statistics, University of Pennsylvania May 24, ACIC 2017 Manuscript and slides are available at http://www-stat.wharton.upenn.edu/~qyzhao/.
More informationPrimal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing
Primal-dual Covariate Balance and Minimal Double Robustness via (Joint work with Daniel Percival) Department of Statistics, Stanford University JSM, August 9, 2015 Outline 1 2 3 1/18 Setting Rubin s causal
More informationApplied Statistics and Econometrics
Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple
More informationUNIVERSITY OF TORONTO Faculty of Arts and Science
UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator
More informationGlobal Sensitivity Analysis for Repeated Measures Studies with Informative Drop-out: A Semi-Parametric Approach
Global for Repeated Measures Studies with Informative Drop-out: A Semi-Parametric Approach Daniel Aidan McDermott Ivan Diaz Johns Hopkins University Ibrahim Turkoz Janssen Research and Development September
More informationMS&E 226: Small Data
MS&E 226: Small Data Lecture 6: Bias and variance (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 49 Our plan today We saw in last lecture that model scoring methods seem to be trading off two different
More informationThe consequences of misspecifying the random effects distribution when fitting generalized linear mixed models
The consequences of misspecifying the random effects distribution when fitting generalized linear mixed models John M. Neuhaus Charles E. McCulloch Division of Biostatistics University of California, San
More informationEstimation of Optimal Treatment Regimes Via Machine Learning. Marie Davidian
Estimation of Optimal Treatment Regimes Via Machine Learning Marie Davidian Department of Statistics North Carolina State University Triangle Machine Learning Day April 3, 2018 1/28 Optimal DTRs Via ML
More informationRandomization-Based Inference With Complex Data Need Not Be Complex!
Randomization-Based Inference With Complex Data Need Not Be Complex! JITAIs JITAIs Susan Murphy 07.18.17 HeartSteps JITAI JITAIs Sequential Decision Making Use data to inform science and construct decision
More informationChapter 1 Statistical Inference
Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations
More information