Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Size: px
Start display at page:

Download "Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall"

Transcription

1 1 Structural Nested Mean Models for Assessing Time-Varying Effect Moderation Daniel Almirall Center for Health Services Research, Durham VAMC & Dept. of Biostatistics, Duke University Medical Joint work with Thomas Ten Have (UPenn) & Susan Murphy (UMich) NC State University, Dept. of Statistics September 27, 2007

2 Contents 2 Contents 1 Warm-up: Why condition on pre-treatment variables? 4 2 Time-Varying Effect Moderation 8 3 Robins Structural Nested Mean Model 11 4 Estimation in Time-Varying Setting 14 5 Bias-Variance Trade-off 20 6 Conclusions 25

3 Contents 3 7 Extra Slides 28

4 1 Warm-up: Why condition on pre-treatment variables? 4 1 Warm-up: Why condition on pre-treatment variables?...in a study of the effects of a treatment A (e.g., intervention to help reduce depression) on a response Y (e.g., depression)?

5 1 Warm-up: Why condition on pre-treatment variables? 5 We want the effect of A on Y. Why condition on (adjust for) pre-treatment variables S? 1. Confounding: S is correlated with both A and Y. In this case, S is known as a confounder of the effect of A on Y. 2. Precision: S may be a pre-treatment measure of Y, or any other variable highly correlated with Y. 3. Missing Data: The outcome Y is missing for some units, S and A predict missingness, and S is associated with Y. 4. Effect Heterogeneity: S may moderate, temper, or specify the effect of A on Y. In this case, S is known as a moderator of the effect of A on Y. Formalized in next slide.

6 1 Warm-up: Why condition on pre-treatment variables? 6 Effect Moderation in One Time Point Definition: Fix a 0. Let µ(s, a) E[Y (a) Y (0) S = s]. If µ(s, a) is non-constant in s, then S is said to be a moderator of the effect of a on Y. Example: If the structural model for the conditional mean of Y (a) given S is E[Y (a) S = s] = β 0 + γ 1 s + β 1 a + β 2 sa, then µ(s, a) = β 1 a + β 2 sa. In this example, S is a moderator of the effect of a on Y if β 2 0.

7 1 Warm-up: Why condition on pre-treatment variables? 7 Relevance of Effect Moderation Two main implications: Theoretical Implication: Understanding the heterogeneity of the effects of causes enhances our understanding of various/competing scientific theories; and it may suggest new scientific (etiologic) hypotheses to be tested. Practical Implication: Identifying types, or subgroups, of individuals for which treatment is not effective may suggest altering the treatment to suit the needs of that particular type of individual.

8 2 Time-Varying Effect Moderation 8 2 Time-Varying Effect Moderation The focus of this paper is time-varying effect moderation. The data structure in the time-varying setting is {S 1, a 1, S 2 (a 1 ), a 2, Y (a 1, a 2 )} = {ā 2, S 2 (a 1 ), Y (ā 2 )}. Running Example, from the PROSPECT Study: (a 1, a 2 ) Time-varying treatment pattern; a t is binary (0,1) Y (a 1, a 2 ) Depression at the end of the study; continuous S 1 S 2 (a 1 ) Suicidal Ideation at baseline visit; continuous Suicidal Ideation at second visit; continuous

9 2 Time-Varying Effect Moderation 9 Formal Definition of Time-Varying Causal Effects Recall the data structure {S 1, a 1, S 2 (a 1 ), a 2, Y (a 1, a 2 )}. Conditional Intermediate Causal Effect at t = 1: µ 1 (s 1, a 1 ) E[Y (a 1, 0) Y (0, 0) S 1 = s 1 ] Conditional Intermediate Causal Effect at t = 2: µ 2 ( s 2, ā 2 ) E[Y (a 1, a 2 ) Y (a 1, 0) S 1 = s 1, S 2 (a 1 ) = s 2 ]

10 2 Time-Varying Effect Moderation 10 As a Decomposition of the Marginal Causal Effect Recall the data structure {S 1, a 1, S 2 (a 1 ), a 2, Y (a 1, a 2 )}. Consider the following arithmetic decomposition of the causal effect of (a 1, a 2 ) on Y, using the covariates S 2 (a 1 ): E [ Y (a 1, a 2 ) Y (0, 0) ] = E [E [ Y (a 1, a 2 ) Y (a 1, 0) S 2 (a 1 ) ]] [ + E E [ ] ] Y (a 1, 0) Y (0, 0) S 1. The inner expectations represent the conditional intermediate causal effects µ 1 and µ 2, respectively.

11 3 Robins Structural Nested Mean Model 11 3 Robins Structural Nested Mean Model Recall the data structure {S 1, a 1, S 2 (a 1 ), a 2, Y (a 1, a 2 )}. The SNMM for the conditional mean of Y (a 1, a 2 ) given S 2 (a 1 ) is: E [ Y (a 1, a 2 ) S 2 (a 1 ) ] { } = E[Y (0, 0)] + E[Y (0, 0) S 1 ] E[Y (0, 0)] + {E [ ] } Y(a 1, 0) Y(0, 0) S 1 { + E[Y (a 1, 0) S } 2 (a 1 )] E[Y (a 1, 0) S 1 ] + {E [ Y(a 1, a 2 ) Y(a 1, 0) S 2 (a 1 ) ]}

12 3 Robins Structural Nested Mean Model 12 The Causal versus Nuisance Functionals Recall the data structure {S 1, a 1, S 2 (a 1 ), a 2, Y (a 1, a 2 )}. The decomposition can be written more succinctly as: E [ Y (a 1, a 2 ) S ] 2 (a 1 ) = s 2 = µ0 + ɛ 1 (s 1 ) + µ 1 (s 1, a 1 ) + ɛ 2 ( s 2, a 1 ) + µ 2 ( s 2, ā 2 ), where ɛ 2 ( s 2, a 1 ) = E[Y (a 1, 0) S 2 (a 1 ) = s 2 ] E[Y (a 1, 0) S 1 = s 1 ], ɛ 1 (s 1 ) = E[Y (0, 0) S 1 = s 1 ] E[Y (0, 0)], µ 2 ( s 2, a 2, 0) = 0 and µ 1 (s 1, 0) = 0, E S2 S 1 [ɛ 2 ( s 2, a 1 ) S 1 = s 1 ] = 0, and E S1 [ɛ 1 (s 1 )] = 0.

13 3 Robins Structural Nested Mean Model 13 Parametrizing the Intermediate Causal Effects Recall the data structure {S 1, a 1, S 2 (a 1 ), a 2, Y (a 1, a 2 )}. Conditional Intermediate Causal Effect at t = 1: µ 1 (s 1, a 1 ) E[Y (a 1, 0) Y (0, 0) S 1 = s 1 ] = a 1 H 1 β 1 (Linear Parameterization) say = a 1 (β 11 + β 12 f 1 (s 1 )) (Sample) Conditional Intermediate Causal Effect at t = 2: µ 2 ( s 2, ā 2 ) E[Y (a 1, a 2 ) Y (a 1, 0) S 1 = s 1, S 2 (a 1 ) = s 2 ] = a 2 H 2 β 2 (Linear Parameterization) say = a 2 (β 21 + β 22 f 2 (s 1, s 2 ) + β 23 a 1 ) (Sample)

14 4 Estimation in Time-Varying Setting 14 4 Estimation in Time-Varying Setting Assume: True models for µ t s Consistency Assumptions Sequential Ignorability (given S 2 (a 1 )) Assumption In the sequel, we consider three estimators for β: 1. Traditional (Naive) Regression Estimator 2. Proposed 2-Stage Regression Estimator 3. Robins Semi-parametric G-Estimator Two flavors of the Semi-parametric G-Estimator

15 4 Estimation in Time-Varying Setting 15 What s wrong with the Traditional Estimator? An Example of The Traditional Estimator: Apply OLS with E(Y S 2 = s 2, Ā2 = ā 2 ) = β 0 + η 1 s 1 + a 1 (β 1 + β 2s 1 ) + η 2 s 2 + a 2 (β 3 + β 4(s 1 + s 2 )/2) - Possibly incorrectly specified nuisance functions. - Two problems arise when using the traditional regression estimator. - These problems occur even in the absence of time-varying confounders (that is, even under Sequential Randomization).

16 4 Estimation in Time-Varying Setting 16 First problem with the Traditional Estimator E(Y S 2 = s 2, Ā2 = ā 2 ) = β 0 + η 1 s 1 + a 1 (β 1 + β 2s 1 ) + η 2 s 2 + a 2 (β 3 + β 4(s 1 + s 2 )/2) Wrong Effect Baseline 4-month Visit 8-month Visit a Set 1 a 2 = 0 S 1 S 2 (a 1 ) Y (a 1, 0) What about the effect transmitted through S 2 (a 1 )?

17 4 Estimation in Time-Varying Setting 17 Second problem with the Traditional Estimator E(Y S 2 = s 2, Ā2 = ā 2 ) = β0 + η 1 s 1 + a 1 (β1 + β2s 1 ) + η 2 s 2 + a 2 (β3 + β4(s 1 + s 2 )/2) Spurious Effect Baseline 4-month Visit 8-month Visit V 0 S 1 a Set 1 a 2 = 0 S 2 (a 1 ) Y (a 1, 0) Berkson s paradox; Judea Pearl s backdoor criterion

18 4 Estimation in Time-Varying Setting 18 Proposed 2-Stage Estimator Approach Recall the nuisance function constraints: E S2 S 1 [ɛ 2 ( s 2, a 1 ) S 1 = s 1 ] = 0, and E S1 [ɛ 1 (s 1 )] = 0. General steps for the 2-Stage Approach: 1. Model the distributions f 1 (S 1 ; γ 1 ) and f 2 (S 2 S 1, A 1 ; γ 2 ) 2. Using η for added flexibility, construct nuisance models ɛ 1 (s 1 ; γ 1, η 1 ) and ɛ 2 ( s 2, a 1 ; γ 2, η 2 ) that satisfy the constraints 3. Then solve for β in the following OLS estimating equations: { ( 2 2 ) 0 = P n Y µ 0 A t H t β t ɛ t ( s t, a t 1 ; γ t, η t ) A } 1H 1 t=1 t=1 A 2 H 2

19 4 Estimation in Time-Varying Setting 19 Semi-parametric G-Estimator Approach The G-Estimator is the solution to these estimating equations: { ( 0 = P n Y A 2 H 2 β 2 b 2 ( S ) ( 2, A 1 ) A 2 p 2 ( S ) 2, A 1 ) 0 H 2 } ( ) ( ) + Y A 2 H 2 β 2 A 1 H 1 β 1 b 1 (S 1 ) A 1 p 1 (S 1 ) H 1 (H 1 ) [ ] H 2 A 2 S 1, A 1 = 1 (H 1 ) = E E b 2 ( S 2, A 1 ) = E [ Y A 2 H 2 β 2 S ] 2, A 1 p 2 ( S 2, A 1 ) = P r [ A 2 = 1 S ] 2, A 1 b 1 (S 1 ) = E [Y A 2 H 2 β 2 A 1 H 1 β 1 S 1 ] p 1 (S 1 ) = P r [A 1 = 1 S 2 ] [ ] H 2 A 2 S 1, A 1 = 0

20 5 Bias-Variance Trade-off 20 5 Bias-Variance Trade-off This discussion assumes true models for the causal effects, the µ t s: Robins G-Estimator is unbiased if either p t or b t are correctly specified. So-called double-robustness property. Robins G-Estimator is semi-parametric efficient if p t, b t, and are all correctly specified. 2-Stage Regression Estimator is unbiased only if the nuisance functions, the ɛ t s, are correctly specified. 2-Stage Regression Estimator with correctly specified nuisance is more efficient than G-Estimator What happens as we mis-specify the nuisance functions, the ɛ t s?

21 5 Bias-Variance Trade-off 21 A Simulation Study of the Bias-Variance Trade-off 1. Generate Y ( S 3, Ā3) using a SNMM; use n = 300, K = 3 2. Fit 2-Stage Estimator and two versions of G-Estimator: Linear Version of 2-Stage Estimator Robins G-Estimator with b t = 0 Robins G-Estimator using 2-Stage Guesses for b t 3. Repeat N = 1000 times 4. Repeat for different ɛ t -mis-specified 2-Stage fits 5. Based on MSE of estimates of β, compare bias-variance trade-off of the 2-Stage Estimator versus the G-Estimator as you move away from true fit

22 5 Bias-Variance Trade-off 22 Specifying/Measuring Mis-specification of ɛ t How do we measure the distance of the fitted ɛ t (ν) from the true nuisance function ɛ TRUE t? SRMSD(ν) = K t E(ɛ TRUE t ɛ t (ν)) 2 V ar(y ) Note that this has an effect-size-like interpretation. Thus, SRMSD=0.5 is about a moderate amount of mis-specification.

23 5 Bias-Variance Trade-off 23 Simulation Experiment Results

24 5 Bias-Variance Trade-off 24 REL MSE (G Estimator to 2 Stage Estimator) SRMSD β 10 β 11 β 20 β 21 β 30 β SRMSD G Estimator with b = 0 G Estimator with b = 2 Stage Guess

25 6 Conclusions 25 6 Conclusions 1. Offered a way to think about time-varying effect moderation 2. The bias-variance trade-off is real and important. 3. G-Estimator starts to dominate (in terms of MSE) when amount of ɛ t -mis-specifiction is moderate 4. 2-Stage Estimator provides a principled method by which to construct high quality b t guesses for the G-Estimator Stage Estimator is intuitive, and can stand alone as its own estimator. 6. Linear implementation of 2-Stage Estimator is easy to carry out, but standard errors are difficult. (Bootstrap SEs?)

26 6 Conclusions 26 Main References Robins JM (1994). Correcting for Non-compliance in Randomized Trials Using Structural Nested Mean Models. Communications in Statistics, Theory and Methods, 23, Almirall D, Ten Have T, Murphy SA. Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Submitted.

27 6 Conclusions 27 Thank you! More Questions?

28 7 Extra Slides 28 7 Extra Slides

29 7 Extra Slides 29 Linear Implementation of the 2-Stage Approach

30 7 Extra Slides 30 Linear Parametrization of the Nuisance Functions So we must parameterize the nuisance functions correctly. Recall the constraints on the nuisance functions: ɛ 2 ( s 2, a 1 ) = E[Y (a 1, 0) S 2 (a 1 ) = s 2 ] E[Y (a 1, 0) S 1 = s 1 ], ɛ 1 (s 1 ) = E[Y (0, 0) S 1 = s 1 ] E[Y (0, 0)], E S2 S 1 [ɛ 2 ( s 2, a 1 ) S 1 = s 1 ] = 0, and E S1 [ɛ 1 (s 1 )] = 0. Example parameterizations for the nuisance functions: ɛ 1 (s 1 ) say ( = η 1,1 s1 E(S 1 ) ) ɛ 2 ( s 2, a 1 ) say = ( η 2,1 + η 2,2 s 1 )( s2 E(S 2 (a 1 ) S 1 = s 1 ) )

31 7 Extra Slides 31 Steps for Linear Implementation Recall that E [ Y (a 1, a 2 ) S 2 (a 1 ) = s 2 ] = µ2 ( s 2, ā 2 ; β 2 ) + ɛ 2 ( s 2, a 1 ; η 2, γ 2 ) + µ 1 (s 1, a 1 ; β 1 ) + ɛ 1 (s 1 ; η 1, γ 1 ) + µ We have models for the µ s: A 1 H 1 β 1 and A 2 H 2 β 2 ; Set aside 2. Model m 1 (γ 1 ) = E(S 1 ), estimate γ 1 with GLM; model m 2 (s 1, a 1 ; γ 2 ) = E(S 2 (a 1 ) S 1 = s 1 ), estimate γ 2 with GLM 3. Construct residuals ˆδ 1 = s 1 ˆm 1 (ˆγ 1 ) and ˆδ 2 = s 2 ˆm 2 (s 1, a 1 ; ˆγ 2 ) 4. Construct models for ɛ s: G 1ˆδ1 η 1 = G 1 η 1 and G 2ˆδ2 η 2 = G 2 η 2 5. Obtain ˆβ and ˆη using OLS of Y [1, G 1, A 1H 1, G 2, A 2H 2 ]

32 7 Extra Slides 32 An Illustrative Data Analysis

33 7 Extra Slides 33 The Data n = 277 geriatric primary care patients from PROSPECT Study K = 3: visits to clinic at baseline, 4-, 8-, and 12-months Data structure is {S 1, A 1, S 2, A 2, S 3, A 3, Y } S t = suicidal ideation, A t = adherence, Y = 12-month depression Treatment is defined as ever meeting with a health specialist Monotonic treatment pattern: (0, 0, 0), (1, 0, 0), (1, 1, 0), (1, 1, 1) Assumptions: - Consistency/SUTVA - Sequential Ignorability given S 3 (very likely violated) - Modeling assumptions

34 7 Extra Slides 34 Models Used in the Illustrative Analysis Causal effects: (expect β t0 < 0 and β t1 > 0) 1. µ 1 (S 1, a 1 ) = a 1 (β 10 + β 11 S 1 ), 2. µ 2 ( S 2, ā 2 ) = a 2 (β 20 + β 21 (S 1 + S 2 ) /2), and 3. µ 3 ( S 3, ā 3 ) = a 3 (β 30 + β 31 (S 1 + S 2 + S 3 )) /3). Nuisance functions: 1. ɛ 1 (S 1 ) = η 1,1 (S 1 γ 1,1 ) 2. ɛ 2 ( S 2, A 1 ) = η 2,1 (S 2 (γ 2,1 + γ 2,2 S 1 + γ 2,3 A 1 + γ 2,4 S 1 A 1 + γ 2,4 S1 2)) 3. ɛ 3 ( S 3, Ā2) = (η 3,1 + η 3,2 S 2 ) (S 3 (γ 3,1 + γ 3,2 S 1 + γ 3,3 A 1 + γ 3,4 S 1 A 1 + γ 3,5 S 2 + γ 3,6 S1 2 + γ 3,7S2 2)).

35 7 Extra Slides 35 Results 2-Stage Estimator Robins G-Estimator Parameters β ŜE( β) P r(> z ) β ŜE( β) P r(> z ) Intercept β < 0.01 ɛ 1 η 1, µ 1 β 1, β 1, ɛ 2 η 2, µ 2 β 2, β 2, ɛ 3 η 3, η 3, µ 3 β 3, < 0.01 β 3, < 0.01 G-Estimation does not produce estimates for the nuisance functions.

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall 1 Structural Nested Mean Models for Assessing Time-Varying Effect Moderation Daniel Almirall Center for Health Services Research, Durham VAMC & Duke University Medical, Dept. of Biostatistics Joint work

More information

Daniel Almirall 1, Daniel F. McCaffrey 2, Beth Ann Griffin 2, Rajeev Ramchand 2, Susan A. Murphy 1

Daniel Almirall 1, Daniel F. McCaffrey 2, Beth Ann Griffin 2, Rajeev Ramchand 2, Susan A. Murphy 1 1 Examining moderated effects of additional adolescent substance use treatment: Structural nested mean model estimation using inverse-weighted regression-with-residuals Daniel Almirall 1, Daniel F. McCaffrey

More information

Daniel Almirall 1, Daniel F. McCaffrey 2, Beth Ann Griffin 2, Rajeev Ramchand 2, Susan A. Murphy 1

Daniel Almirall 1, Daniel F. McCaffrey 2, Beth Ann Griffin 2, Rajeev Ramchand 2, Susan A. Murphy 1 1 Examining moderated effects of additional adolescent substance use treatment: Structural nested mean model estimation using inverse-weighted regression-with-residuals Daniel Almirall 1, Daniel F. McCaffrey

More information

Extending causal inferences from a randomized trial to a target population

Extending causal inferences from a randomized trial to a target population Extending causal inferences from a randomized trial to a target population Issa Dahabreh Center for Evidence Synthesis in Health, Brown University issa dahabreh@brown.edu January 16, 2019 Issa Dahabreh

More information

Geoffrey T. Wodtke. University of Toronto. Daniel Almirall. University of Michigan. Population Studies Center Research Report July 2015

Geoffrey T. Wodtke. University of Toronto. Daniel Almirall. University of Michigan. Population Studies Center Research Report July 2015 Estimating Heterogeneous Causal Effects with Time-Varying Treatments and Time-Varying Effect Moderators: Structural Nested Mean Models and Regression-with-Residuals Geoffrey T. Wodtke University of Toronto

More information

For more information about how to cite these materials visit

For more information about how to cite these materials visit Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/

More information

Longitudinal Nested Compliance Class Model in the Presence of Time-Varying Noncompliance

Longitudinal Nested Compliance Class Model in the Presence of Time-Varying Noncompliance Longitudinal Nested Compliance Class Model in the Presence of Time-Varying Noncompliance Julia Y. Lin Thomas R. Ten Have Michael R. Elliott Julia Y. Lin is a doctoral candidate (E-mail: jlin@cceb.med.upenn.edu),

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University Joint

More information

Casual Mediation Analysis

Casual Mediation Analysis Casual Mediation Analysis Tyler J. VanderWeele, Ph.D. Upcoming Seminar: April 21-22, 2017, Philadelphia, Pennsylvania OXFORD UNIVERSITY PRESS Explanation in Causal Inference Methods for Mediation and Interaction

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Princeton University Asian Political Methodology Conference University of Sydney Joint

More information

Propensity Score Weighting with Multilevel Data

Propensity Score Weighting with Multilevel Data Propensity Score Weighting with Multilevel Data Fan Li Department of Statistical Science Duke University October 25, 2012 Joint work with Alan Zaslavsky and Mary Beth Landrum Introduction In comparative

More information

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Overview In observational and experimental studies, the goal may be to estimate the effect

More information

Telescope Matching: A Flexible Approach to Estimating Direct Effects

Telescope Matching: A Flexible Approach to Estimating Direct Effects Telescope Matching: A Flexible Approach to Estimating Direct Effects Matthew Blackwell and Anton Strezhnev International Methods Colloquium October 12, 2018 direct effect direct effect effect of treatment

More information

Lecture 9: Learning Optimal Dynamic Treatment Regimes. Donglin Zeng, Department of Biostatistics, University of North Carolina

Lecture 9: Learning Optimal Dynamic Treatment Regimes. Donglin Zeng, Department of Biostatistics, University of North Carolina Lecture 9: Learning Optimal Dynamic Treatment Regimes Introduction Refresh: Dynamic Treatment Regimes (DTRs) DTRs: sequential decision rules, tailored at each stage by patients time-varying features and

More information

Personalized Treatment Selection Based on Randomized Clinical Trials. Tianxi Cai Department of Biostatistics Harvard School of Public Health

Personalized Treatment Selection Based on Randomized Clinical Trials. Tianxi Cai Department of Biostatistics Harvard School of Public Health Personalized Treatment Selection Based on Randomized Clinical Trials Tianxi Cai Department of Biostatistics Harvard School of Public Health Outline Motivation A systematic approach to separating subpopulations

More information

Gov 2002: 4. Observational Studies and Confounding

Gov 2002: 4. Observational Studies and Confounding Gov 2002: 4. Observational Studies and Confounding Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last two weeks: randomized experiments. From here on: observational studies. What

More information

Missing Covariate Data in Matched Case-Control Studies

Missing Covariate Data in Matched Case-Control Studies Missing Covariate Data in Matched Case-Control Studies Department of Statistics North Carolina State University Paul Rathouz Dept. of Health Studies U. of Chicago prathouz@health.bsd.uchicago.edu with

More information

Measurement error effects on bias and variance in two-stage regression, with application to air pollution epidemiology

Measurement error effects on bias and variance in two-stage regression, with application to air pollution epidemiology Measurement error effects on bias and variance in two-stage regression, with application to air pollution epidemiology Chris Paciorek Department of Statistics, University of California, Berkeley and Adam

More information

DATA-ADAPTIVE VARIABLE SELECTION FOR

DATA-ADAPTIVE VARIABLE SELECTION FOR DATA-ADAPTIVE VARIABLE SELECTION FOR CAUSAL INFERENCE Group Health Research Institute Department of Biostatistics, University of Washington shortreed.s@ghc.org joint work with Ashkan Ertefaie Department

More information

Identification and Inference in Causal Mediation Analysis

Identification and Inference in Causal Mediation Analysis Identification and Inference in Causal Mediation Analysis Kosuke Imai Luke Keele Teppei Yamamoto Princeton University Ohio State University November 12, 2008 Kosuke Imai (Princeton) Causal Mediation Analysis

More information

Specification Errors, Measurement Errors, Confounding

Specification Errors, Measurement Errors, Confounding Specification Errors, Measurement Errors, Confounding Kerby Shedden Department of Statistics, University of Michigan October 10, 2018 1 / 32 An unobserved covariate Suppose we have a data generating model

More information

Pubh 8482: Sequential Analysis

Pubh 8482: Sequential Analysis Pubh 8482: Sequential Analysis Joseph S. Koopmeiners Division of Biostatistics University of Minnesota Week 12 Review So far... We have discussed the role of phase III clinical trials in drug development

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University

More information

Causal Mechanisms Short Course Part II:

Causal Mechanisms Short Course Part II: Causal Mechanisms Short Course Part II: Analyzing Mechanisms with Experimental and Observational Data Teppei Yamamoto Massachusetts Institute of Technology March 24, 2012 Frontiers in the Analysis of Causal

More information

Regression #3: Properties of OLS Estimator

Regression #3: Properties of OLS Estimator Regression #3: Properties of OLS Estimator Econ 671 Purdue University Justin L. Tobias (Purdue) Regression #3 1 / 20 Introduction In this lecture, we establish some desirable properties associated with

More information

Adaptive Trial Designs

Adaptive Trial Designs Adaptive Trial Designs Wenjing Zheng, Ph.D. Methods Core Seminar Center for AIDS Prevention Studies University of California, San Francisco Nov. 17 th, 2015 Trial Design! Ethical:!eg.! Safety!! Efficacy!

More information

Running Head: Effect Heterogeneity with Time-varying Treatments and Moderators ESTIMATING HETEROGENEOUS CAUSAL EFFECTS WITH TIME-VARYING

Running Head: Effect Heterogeneity with Time-varying Treatments and Moderators ESTIMATING HETEROGENEOUS CAUSAL EFFECTS WITH TIME-VARYING Running Head: Effect Heterogeneity with Time-varying Treatments and Moderators ESTIMATING HETEROGENEOUS CAUSAL EFFECTS WITH TIME-VARYING TREATMENTS AND TIME-VARYING EFFECT MODERATORS: STRUCTURAL NESTED

More information

Categorical Predictor Variables

Categorical Predictor Variables Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively

More information

Quantitative Economics for the Evaluation of the European Policy

Quantitative Economics for the Evaluation of the European Policy Quantitative Economics for the Evaluation of the European Policy Dipartimento di Economia e Management Irene Brunetti Davide Fiaschi Angela Parenti 1 25th of September, 2017 1 ireneb@ec.unipi.it, davide.fiaschi@unipi.it,

More information

Difference-in-Differences Methods

Difference-in-Differences Methods Difference-in-Differences Methods Teppei Yamamoto Keio University Introduction to Causal Inference Spring 2016 1 Introduction: A Motivating Example 2 Identification 3 Estimation and Inference 4 Diagnostics

More information

Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies

Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies Kosuke Imai Princeton University February 23, 2012 Joint work with L. Keele (Penn State)

More information

arxiv: v2 [stat.me] 31 Dec 2012

arxiv: v2 [stat.me] 31 Dec 2012 1 arxiv:1109.1070v2 [stat.me] 31 Dec 2012 1 Mediation Analysis Without Sequential Ignorability: Using Baseline Covariates Interacted with Random Assignment as Instrumental Variables Dylan S. Small University

More information

6.3 How the Associational Criterion Fails

6.3 How the Associational Criterion Fails 6.3. HOW THE ASSOCIATIONAL CRITERION FAILS 271 is randomized. We recall that this probability can be calculated from a causal model M either directly, by simulating the intervention do( = x), or (if P

More information

Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies

Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies Kosuke Imai Princeton University January 23, 2012 Joint work with L. Keele (Penn State)

More information

Causal Effect Estimation Under Linear and Log- Linear Structural Nested Mean Models in the Presence of Unmeasured Confounding

Causal Effect Estimation Under Linear and Log- Linear Structural Nested Mean Models in the Presence of Unmeasured Confounding University of Pennsylvania ScholarlyCommons Publicly Accessible Penn Dissertations Summer 8-13-2010 Causal Effect Estimation Under Linear and Log- Linear Structural Nested Mean Models in the Presence of

More information

Targeted Maximum Likelihood Estimation in Safety Analysis

Targeted Maximum Likelihood Estimation in Safety Analysis Targeted Maximum Likelihood Estimation in Safety Analysis Sam Lendle 1 Bruce Fireman 2 Mark van der Laan 1 1 UC Berkeley 2 Kaiser Permanente ISPE Advanced Topics Session, Barcelona, August 2012 1 / 35

More information

Whether to use MMRM as primary estimand.

Whether to use MMRM as primary estimand. Whether to use MMRM as primary estimand. James Roger London School of Hygiene & Tropical Medicine, London. PSI/EFSPI European Statistical Meeting on Estimands. Stevenage, UK: 28 September 2015. 1 / 38

More information

IP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS IPW and MSM

IP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS IPW and MSM IP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS 776 1 12 IPW and MSM IP weighting and marginal structural models ( 12) Outline 12.1 The causal question 12.2 Estimating IP weights via modeling

More information

Combining multiple observational data sources to estimate causal eects

Combining multiple observational data sources to estimate causal eects Department of Statistics, North Carolina State University Combining multiple observational data sources to estimate causal eects Shu Yang* syang24@ncsuedu Joint work with Peng Ding UC Berkeley May 23,

More information

Help! Statistics! Mediation Analysis

Help! Statistics! Mediation Analysis Help! Statistics! Lunch time lectures Help! Statistics! Mediation Analysis What? Frequently used statistical methods and questions in a manageable timeframe for all researchers at the UMCG. No knowledge

More information

Causal Mediation Analysis in R. Quantitative Methodology and Causal Mechanisms

Causal Mediation Analysis in R. Quantitative Methodology and Causal Mechanisms Causal Mediation Analysis in R Kosuke Imai Princeton University June 18, 2009 Joint work with Luke Keele (Ohio State) Dustin Tingley and Teppei Yamamoto (Princeton) Kosuke Imai (Princeton) Causal Mediation

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2004 Paper 155 Estimation of Direct and Indirect Causal Effects in Longitudinal Studies Mark J. van

More information

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A. Linero and M. Daniels UF, UT-Austin SRC 2014, Galveston, TX 1 Background 2 Working model

More information

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs

Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs Discussion of Missing Data Methods in Longitudinal Studies: A Review by Ibrahim and Molenberghs Michael J. Daniels and Chenguang Wang Jan. 18, 2009 First, we would like to thank Joe and Geert for a carefully

More information

SEQUENTIAL MULTIPLE ASSIGNMENT RANDOMIZATION TRIALS WITH ENRICHMENT (SMARTER) DESIGN

SEQUENTIAL MULTIPLE ASSIGNMENT RANDOMIZATION TRIALS WITH ENRICHMENT (SMARTER) DESIGN SEQUENTIAL MULTIPLE ASSIGNMENT RANDOMIZATION TRIALS WITH ENRICHMENT (SMARTER) DESIGN Ying Liu Division of Biostatistics, Medical College of Wisconsin Yuanjia Wang Department of Biostatistics & Psychiatry,

More information

Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha

Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha January 18, 2010 A2 This appendix has six parts: 1. Proof that ab = c d

More information

STA441: Spring Multiple Regression. More than one explanatory variable at the same time

STA441: Spring Multiple Regression. More than one explanatory variable at the same time STA441: Spring 2016 Multiple Regression More than one explanatory variable at the same time This slide show is a free open source document. See the last slide for copyright information. One Explanatory

More information

Regression I: Mean Squared Error and Measuring Quality of Fit

Regression I: Mean Squared Error and Measuring Quality of Fit Regression I: Mean Squared Error and Measuring Quality of Fit -Applied Multivariate Analysis- Lecturer: Darren Homrighausen, PhD 1 The Setup Suppose there is a scientific problem we are interested in solving

More information

Statistical Analysis of Causal Mechanisms

Statistical Analysis of Causal Mechanisms Statistical Analysis of Causal Mechanisms Kosuke Imai Princeton University November 17, 2008 Joint work with Luke Keele (Ohio State) and Teppei Yamamoto (Princeton) Kosuke Imai (Princeton) Causal Mechanisms

More information

Propensity Score Methods for Causal Inference

Propensity Score Methods for Causal Inference John Pura BIOS790 October 2, 2015 Causal inference Philosophical problem, statistical solution Important in various disciplines (e.g. Koch s postulates, Bradford Hill criteria, Granger causality) Good

More information

Chapter 2: simple regression model

Chapter 2: simple regression model Chapter 2: simple regression model Goal: understand how to estimate and more importantly interpret the simple regression Reading: chapter 2 of the textbook Advice: this chapter is foundation of econometrics.

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2010 Paper 260 Collaborative Targeted Maximum Likelihood For Time To Event Data Ori M. Stitelman Mark

More information

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010 1 Linear models Y = Xβ + ɛ with ɛ N (0, σ 2 e) or Y N (Xβ, σ 2 e) where the model matrix X contains the information on predictors and β includes all coefficients (intercept, slope(s) etc.). 1. Number of

More information

Cross-Sectional Regression after Factor Analysis: Two Applications

Cross-Sectional Regression after Factor Analysis: Two Applications al Regression after Factor Analysis: Two Applications Joint work with Jingshu, Trevor, Art; Yang Song (GSB) May 7, 2016 Overview 1 2 3 4 1 / 27 Outline 1 2 3 4 2 / 27 Data matrix Y R n p Panel data. Transposable

More information

1. The OLS Estimator. 1.1 Population model and notation

1. The OLS Estimator. 1.1 Population model and notation 1. The OLS Estimator OLS stands for Ordinary Least Squares. There are 6 assumptions ordinarily made, and the method of fitting a line through data is by least-squares. OLS is a common estimation methodology

More information

10. Time series regression and forecasting

10. Time series regression and forecasting 10. Time series regression and forecasting Key feature of this section: Analysis of data on a single entity observed at multiple points in time (time series data) Typical research questions: What is the

More information

SIMPLE EXAMPLES OF ESTIMATING CAUSAL EFFECTS USING TARGETED MAXIMUM LIKELIHOOD ESTIMATION

SIMPLE EXAMPLES OF ESTIMATING CAUSAL EFFECTS USING TARGETED MAXIMUM LIKELIHOOD ESTIMATION Johns Hopkins University, Dept. of Biostatistics Working Papers 3-3-2011 SIMPLE EXAMPLES OF ESTIMATING CAUSAL EFFECTS USING TARGETED MAXIMUM LIKELIHOOD ESTIMATION Michael Rosenblum Johns Hopkins Bloomberg

More information

Propensity Score Analysis with Hierarchical Data

Propensity Score Analysis with Hierarchical Data Propensity Score Analysis with Hierarchical Data Fan Li Alan Zaslavsky Mary Beth Landrum Department of Health Care Policy Harvard Medical School May 19, 2008 Introduction Population-based observational

More information

Conceptual overview: Techniques for establishing causal pathways in programs and policies

Conceptual overview: Techniques for establishing causal pathways in programs and policies Conceptual overview: Techniques for establishing causal pathways in programs and policies Antonio A. Morgan-Lopez, Ph.D. OPRE/ACF Meeting on Unpacking the Black Box of Programs and Policies 4 September

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2011 Paper 290 Targeted Minimum Loss Based Estimation of an Intervention Specific Mean Outcome Mark

More information

PSC 504: Instrumental Variables

PSC 504: Instrumental Variables PSC 504: Instrumental Variables Matthew Blackwell 3/28/2013 Instrumental Variables and Structural Equation Modeling Setup e basic idea behind instrumental variables is that we have a treatment with unmeasured

More information

Causal Inference. Miguel A. Hernán, James M. Robins. May 19, 2017

Causal Inference. Miguel A. Hernán, James M. Robins. May 19, 2017 Causal Inference Miguel A. Hernán, James M. Robins May 19, 2017 ii Causal Inference Part III Causal inference from complex longitudinal data Chapter 19 TIME-VARYING TREATMENTS So far this book has dealt

More information

GEE for Longitudinal Data - Chapter 8

GEE for Longitudinal Data - Chapter 8 GEE for Longitudinal Data - Chapter 8 GEE: generalized estimating equations (Liang & Zeger, 1986; Zeger & Liang, 1986) extension of GLM to longitudinal data analysis using quasi-likelihood estimation method

More information

Chapter 1. Linear Regression with One Predictor Variable

Chapter 1. Linear Regression with One Predictor Variable Chapter 1. Linear Regression with One Predictor Variable 1.1 Statistical Relation Between Two Variables To motivate statistical relationships, let us consider a mathematical relation between two mathematical

More information

Linear models and their mathematical foundations: Simple linear regression

Linear models and their mathematical foundations: Simple linear regression Linear models and their mathematical foundations: Simple linear regression Steffen Unkel Department of Medical Statistics University Medical Center Göttingen, Germany Winter term 2018/19 1/21 Introduction

More information

Gov 2000: 9. Regression with Two Independent Variables

Gov 2000: 9. Regression with Two Independent Variables Gov 2000: 9. Regression with Two Independent Variables Matthew Blackwell Fall 2016 1 / 62 1. Why Add Variables to a Regression? 2. Adding a Binary Covariate 3. Adding a Continuous Covariate 4. OLS Mechanics

More information

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information. STA441: Spring 2018 Multiple Regression This slide show is a free open source document. See the last slide for copyright information. 1 Least Squares Plane 2 Statistical MODEL There are p-1 explanatory

More information

BIOS 6649: Handout Exercise Solution

BIOS 6649: Handout Exercise Solution BIOS 6649: Handout Exercise Solution NOTE: I encourage you to work together, but the work you submit must be your own. Any plagiarism will result in loss of all marks. This assignment is based on weight-loss

More information

A Sampling of IMPACT Research:

A Sampling of IMPACT Research: A Sampling of IMPACT Research: Methods for Analysis with Dropout and Identifying Optimal Treatment Regimes Marie Davidian Department of Statistics North Carolina State University http://www.stat.ncsu.edu/

More information

Selection on Observables: Propensity Score Matching.

Selection on Observables: Propensity Score Matching. Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017

More information

K. Model Diagnostics. residuals ˆɛ ij = Y ij ˆµ i N = Y ij Ȳ i semi-studentized residuals ω ij = ˆɛ ij. studentized deleted residuals ɛ ij =

K. Model Diagnostics. residuals ˆɛ ij = Y ij ˆµ i N = Y ij Ȳ i semi-studentized residuals ω ij = ˆɛ ij. studentized deleted residuals ɛ ij = K. Model Diagnostics We ve already seen how to check model assumptions prior to fitting a one-way ANOVA. Diagnostics carried out after model fitting by using residuals are more informative for assessing

More information

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Joe Schafer Office of the Associate Director for Research and Methodology U.S. Census

More information

GLS and FGLS. Econ 671. Purdue University. Justin L. Tobias (Purdue) GLS and FGLS 1 / 22

GLS and FGLS. Econ 671. Purdue University. Justin L. Tobias (Purdue) GLS and FGLS 1 / 22 GLS and FGLS Econ 671 Purdue University Justin L. Tobias (Purdue) GLS and FGLS 1 / 22 In this lecture we continue to discuss properties associated with the GLS estimator. In addition we discuss the practical

More information

Statistical Distribution Assumptions of General Linear Models

Statistical Distribution Assumptions of General Linear Models Statistical Distribution Assumptions of General Linear Models Applied Multilevel Models for Cross Sectional Data Lecture 4 ICPSR Summer Workshop University of Colorado Boulder Lecture 4: Statistical Distributions

More information

Telescope Matching: A Flexible Approach to Estimating Direct Effects *

Telescope Matching: A Flexible Approach to Estimating Direct Effects * Telescope Matching: A Flexible Approach to Estimating Direct Effects * Matthew Blackwell Anton Strezhnev August 4, 2018 Abstract Estimating the direct effect of a treatment fixing the value of a consequence

More information

Statistical Methods. Missing Data snijders/sm.htm. Tom A.B. Snijders. November, University of Oxford 1 / 23

Statistical Methods. Missing Data  snijders/sm.htm. Tom A.B. Snijders. November, University of Oxford 1 / 23 1 / 23 Statistical Methods Missing Data http://www.stats.ox.ac.uk/ snijders/sm.htm Tom A.B. Snijders University of Oxford November, 2011 2 / 23 Literature: Joseph L. Schafer and John W. Graham, Missing

More information

Measurement Error in Spatial Modeling of Environmental Exposures

Measurement Error in Spatial Modeling of Environmental Exposures Measurement Error in Spatial Modeling of Environmental Exposures Chris Paciorek, Alexandros Gryparis, and Brent Coull August 9, 2005 Department of Biostatistics Harvard School of Public Health www.biostat.harvard.edu/~paciorek

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2010 Paper 269 Diagnosing and Responding to Violations in the Positivity Assumption Maya L. Petersen

More information

Extending the results of clinical trials using data from a target population

Extending the results of clinical trials using data from a target population Extending the results of clinical trials using data from a target population Issa Dahabreh Center for Evidence-Based Medicine, Brown School of Public Health Disclaimer Partly supported through PCORI Methods

More information

Gov 2002: 13. Dynamic Causal Inference

Gov 2002: 13. Dynamic Causal Inference Gov 2002: 13. Dynamic Causal Inference Matthew Blackwell December 19, 2015 1 / 33 1. Time-varying treatments 2. Marginal structural models 2 / 33 1/ Time-varying treatments 3 / 33 Time-varying treatments

More information

Comparative effectiveness of dynamic treatment regimes

Comparative effectiveness of dynamic treatment regimes Comparative effectiveness of dynamic treatment regimes An application of the parametric g- formula Miguel Hernán Departments of Epidemiology and Biostatistics Harvard School of Public Health www.hsph.harvard.edu/causal

More information

1 Motivation for Instrumental Variable (IV) Regression

1 Motivation for Instrumental Variable (IV) Regression ECON 370: IV & 2SLS 1 Instrumental Variables Estimation and Two Stage Least Squares Econometric Methods, ECON 370 Let s get back to the thiking in terms of cross sectional (or pooled cross sectional) data

More information

Practical Considerations Surrounding Normality

Practical Considerations Surrounding Normality Practical Considerations Surrounding Normality Prof. Kevin E. Thorpe Dalla Lana School of Public Health University of Toronto KE Thorpe (U of T) Normality 1 / 16 Objectives Objectives 1. Understand the

More information

Double Robustness. Bang and Robins (2005) Kang and Schafer (2007)

Double Robustness. Bang and Robins (2005) Kang and Schafer (2007) Double Robustness Bang and Robins (2005) Kang and Schafer (2007) Set-Up Assume throughout that treatment assignment is ignorable given covariates (similar to assumption that data are missing at random

More information

Causality II: How does causal inference fit into public health and what it is the role of statistics?

Causality II: How does causal inference fit into public health and what it is the role of statistics? Causality II: How does causal inference fit into public health and what it is the role of statistics? Statistics for Psychosocial Research II November 13, 2006 1 Outline Potential Outcomes / Counterfactual

More information

Integrated approaches for analysis of cluster randomised trials

Integrated approaches for analysis of cluster randomised trials Integrated approaches for analysis of cluster randomised trials Invited Session 4.1 - Recent developments in CRTs Joint work with L. Turner, F. Li, J. Gallis and D. Murray Mélanie PRAGUE - SCT 2017 - Liverpool

More information

Causal Inference with Big Data Sets

Causal Inference with Big Data Sets Causal Inference with Big Data Sets Marcelo Coca Perraillon University of Colorado AMC November 2016 1 / 1 Outlone Outline Big data Causal inference in economics and statistics Regression discontinuity

More information

Comparing Adaptive Interventions Using Data Arising from a SMART: With Application to Autism, ADHD, and Mood Disorders

Comparing Adaptive Interventions Using Data Arising from a SMART: With Application to Autism, ADHD, and Mood Disorders Comparing Adaptive Interventions Using Data Arising from a SMART: With Application to Autism, ADHD, and Mood Disorders Daniel Almirall, Xi Lu, Connie Kasari, Inbal N-Shani, Univ. of Michigan, Univ. of

More information

Unbiased estimation of exposure odds ratios in complete records logistic regression

Unbiased estimation of exposure odds ratios in complete records logistic regression Unbiased estimation of exposure odds ratios in complete records logistic regression Jonathan Bartlett London School of Hygiene and Tropical Medicine www.missingdata.org.uk Centre for Statistical Methodology

More information

Weighting. Homework 2. Regression. Regression. Decisions Matching: Weighting (0) W i. (1) -å l i. )Y i. (1-W i 3/5/2014. (1) = Y i.

Weighting. Homework 2. Regression. Regression. Decisions Matching: Weighting (0) W i. (1) -å l i. )Y i. (1-W i 3/5/2014. (1) = Y i. Weighting Unconfounded Homework 2 Describe imbalance direction matters STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University

More information

Selective Inference for Effect Modification

Selective Inference for Effect Modification Inference for Modification (Joint work with Dylan Small and Ashkan Ertefaie) Department of Statistics, University of Pennsylvania May 24, ACIC 2017 Manuscript and slides are available at http://www-stat.wharton.upenn.edu/~qyzhao/.

More information

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing Primal-dual Covariate Balance and Minimal Double Robustness via (Joint work with Daniel Percival) Department of Statistics, Stanford University JSM, August 9, 2015 Outline 1 2 3 1/18 Setting Rubin s causal

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple

More information

UNIVERSITY OF TORONTO Faculty of Arts and Science

UNIVERSITY OF TORONTO Faculty of Arts and Science UNIVERSITY OF TORONTO Faculty of Arts and Science December 2013 Final Examination STA442H1F/2101HF Methods of Applied Statistics Jerry Brunner Duration - 3 hours Aids: Calculator Model(s): Any calculator

More information

Global Sensitivity Analysis for Repeated Measures Studies with Informative Drop-out: A Semi-Parametric Approach

Global Sensitivity Analysis for Repeated Measures Studies with Informative Drop-out: A Semi-Parametric Approach Global for Repeated Measures Studies with Informative Drop-out: A Semi-Parametric Approach Daniel Aidan McDermott Ivan Diaz Johns Hopkins University Ibrahim Turkoz Janssen Research and Development September

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 6: Bias and variance (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 49 Our plan today We saw in last lecture that model scoring methods seem to be trading off two different

More information

The consequences of misspecifying the random effects distribution when fitting generalized linear mixed models

The consequences of misspecifying the random effects distribution when fitting generalized linear mixed models The consequences of misspecifying the random effects distribution when fitting generalized linear mixed models John M. Neuhaus Charles E. McCulloch Division of Biostatistics University of California, San

More information

Estimation of Optimal Treatment Regimes Via Machine Learning. Marie Davidian

Estimation of Optimal Treatment Regimes Via Machine Learning. Marie Davidian Estimation of Optimal Treatment Regimes Via Machine Learning Marie Davidian Department of Statistics North Carolina State University Triangle Machine Learning Day April 3, 2018 1/28 Optimal DTRs Via ML

More information

Randomization-Based Inference With Complex Data Need Not Be Complex!

Randomization-Based Inference With Complex Data Need Not Be Complex! Randomization-Based Inference With Complex Data Need Not Be Complex! JITAIs JITAIs Susan Murphy 07.18.17 HeartSteps JITAI JITAIs Sequential Decision Making Use data to inform science and construct decision

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information