Daniel Almirall 1, Daniel F. McCaffrey 2, Beth Ann Griffin 2, Rajeev Ramchand 2, Susan A. Murphy 1

Size: px
Start display at page:

Download "Daniel Almirall 1, Daniel F. McCaffrey 2, Beth Ann Griffin 2, Rajeev Ramchand 2, Susan A. Murphy 1"

Transcription

1 1 Examining moderated effects of additional adolescent substance use treatment: Structural nested mean model estimation using inverse-weighted regression-with-residuals Daniel Almirall 1, Daniel F. McCaffrey 2, Beth Ann Griffin 2, Rajeev Ramchand 2, Susan A. Murphy 1 1 Univ of Michigan, Institute for Social Research 2 RAND, Statistics Institute of Mathematical Statistics Asia Pacific Rim Meeting July 2, 2012

2 1 Time-Varying Setting 2 1 Time-Varying Setting The data structure in the time-varying setting is: S 0 a 1 S 1 (a 1 ) a 2 S2 (ā 2 ) a 3 Y (ā 3 ) Motivating Example: Adolescents & Substance Use Treatment S 0 Age, intake, contr. env in p.90 a 1 S 1 (a 1 ) a 2 S 2 (a 1, a 2 ) a 3 Y (a 1, a 2, a 3 ) 0-3mo treatment; binary, a 1 = yes/no 0-3mo 3-6mo treatment; binary, a 2 = yes/no 3-6mo 6-9mo treatment; binary, a 3 = yes/no Substance use frequency 9-12mo

3 2 What Scientific Question of Interest? 3 2 What Scientific Question of Interest? The data structure: {S 0, a 1, S 1 (a 1 ), a 2, S 2 (a 1, a 2 ), a 3, Y (a 1, a 2 )}. We began wondering about: Cumulative effect of treatment? Observed treatment sequences in data are: (A 1, A 2, A 3 ), Rate (0,0,0), 11% (0,0,1), 2% (1,0,0), 41% (0,1,1), 2% (1,1,0), 19% (1,0,1), 5% (1,1,1), 17% (0,1,0), 2% More specific questions emerged: What are the incremental effects of additional substance use treatment? Are these effects heterogeneous? i.e., Do they differ as a function of severity at intake and improvements over time?

4 3 Time-Varying Effect Moderation 4 3 Time-Varying Effect Moderation The data structure: {S 0, a 1, S 1 (a 1 ), a 2, S 2 (a 1, a 2 ), a 3, Y (a 1, a 2 )}. Overarching question: What are the incremental effects of additional substance use treatment, as a function of severity at intake and improvements over time? More specifically, there are 3 types of causal effects of interest: 1. Distal moderated effect of initial treatment: What are the effects of (1,0,0) vs (0,0,0) on Y given S 0? 2. Medial moderated effect of cumulative treatment: What are the effects of (1,1,0) vs (1,0,0) on Y given (S 0, S 1 )? 3. Proximal moderated effect of cumulative treatment: What are effects of (1,1,1) vs (1,1,0) on Y given (S 0, S 1, S 2 )?

5 3 Time-Varying Effect Moderation 5 What are the distal moderated effects of initial treatment? What are the effects of (1,0,0) vs (0,0,0) on Y given S 0? µ 1 = E[Y (1, 0, 0) Y (0, 0, 0) S 0 = s 0 ] a 1 a 2 = 0 a 3 = 0 S 0 Y (ā 3 )

6 3 Time-Varying Effect Moderation 6 What are the medial moderated effects of cumulative initial treatment? What are the effects of (1,1,0) vs (1,0,0) on Y given (S 0, S 1 )? µ 2 = E[Y (1, 1, 0) Y (1, 0, 0) S 0 = s 0, S 1 (1) = s 1 ] a 1 a 2 a 3 = 0 S 0 S 1 (a 1 ) Y (ā 3 )

7 3 Time-Varying Effect Moderation 7 What are the proximal moderated effects of cumulative initial treatment? What are the effects of (1,1,1) vs (1,1,0) on Y given (S 0, S 1, S 2 )? µ 3 = E[Y (1, 1, 1) Y (1, 1, 0) S 2 (1, 1) = s 2 ] a 1 a 2 a 3 S 0 S 1 (a 1 ) S 2 (ā 2 ) Y (ā 3 )

8 4 Robins Structural Nested Mean Model 8 4 Robins Structural Nested Mean Model decomposes E(Y ) into nuisance and causal parts: E [ Y (a 1, a 2 ) S 0, S 1 (a 1 ) ] = E[Y (0, 0)] + { } E[Y (0, 0) S 0 ] E[Y (0, 0)] + {E [ ] } Y(a 1, 0) Y(0, 0) S 0 { + E[Y (a 1, 0) S } 1 (a 1 )] E[Y (a 1, 0) S 0 ] + {E [ Y(a 1, a 2 ) Y(a 1, 0) S 1 (a 1 ) ]} = µ 0 + ɛ 1 (s 0 ) + µ 1 (s 0, a 1 ) + ɛ 2 ( s 1, a 1 ) + µ 2 ( s 1, ā 2 ) Constraint: µ t = 0 when a t = 0 Constraint: E S1 S 0 [ɛ 2 ( s 1, a 1 ) S 0 = s 0 ] = 0, and E S0 [ɛ 1 (s 0 )] = 0

9 5 Problems with Traditional Regression 9 5 Problems with Traditional Regression Ex: Use the Traditional Estimator to model the t = 2 SNMM as: E(Y S 1 = s 1, Ā2 = ā 2 ) = β0 + η 1 s 0 + β1a 1 + β2a 1 s 0 + η 2 s 1 + β3a 2 + β4a 2 s 0 + β5a 2 s 1 Two problems arise from the way we condition on S t : (1)WRONG EFFECT, (2)SPURIOUS BIAS One problem arises from not adjusting for time-varying confounders: (3)TIME-VARYING CONFOUNDING BIAS

10 5 Problems with Traditional Regression 10 First problem with the Traditional Approach Wrong Effect a 1 a 2 = 0 S 0 S 1 Y (ā 2 ) But what about the effect transmitted through S 1 (a 1 )? So the end result is the term β 1 a 1 + β 2 a 1s 0 does not capture the total impact of (a 1, 0) vs (0, 0) on Y given values of S 0.

11 5 Problems with Traditional Regression 11 Second problem with the Traditional Approach Spurious Bias V a 1 a 2 = 0 S 0 S 1 Y (ā 2 ) This is also known as Berkson s paradox ; and is related to Judea Pearl s back-door criterion and collider bias

12 5 Problems with Traditional Regression 12 Intuition about the Spurious Bias Social Support Txt Subst Use Later Substance Use Imagine adolescent who is a high user despite getting treated: Q: What does this tell you in terms of his social support? A: There must be poor social support. Implication: Conditional on substance use, getting treated is associated with more substance use! Bias is 1( )( )( ) = +.

13 5 Problems with Traditional Regression 13 Proposed Regression with Residuals Estimator Instead of the traditional regression estimator E(Y S 1 = s 1, Ā2 = ā 2 ) = β 0 + η 1 s 0 + β 1a 1 + β 2a 1 s 0 we use the following + η 2 s 1 + β 3a 2 + β 4a 2 s 0 + β 5a 2 s 1, E(Y S 1 = s 1, Ā2 = ā 2 ) = β 0 + η 1 s 0 + β 1a 1 + β 2a 1 s 0 + η 2 ( s1 E(S 1 A 1, S 0 ) ) + β 3a 2 + β 4a 2 s 0 + β 5a 2 s 1. We call it regression with residuals because first we estimate E(S 1 A 1, S 0 ), then use the residual s 1 E(S1 A 1, S 0 ) in a second regression to get β s.

14 5 Problems with Traditional Regression 14 Proposed Regression with Residuals Estimator E(Y S 1 = s 1, Ā2 = ā 2 ) = β0 + η 1 s 0 + β1a 1 + β2a 1 s 0 ( + η 2 s1 E(S 1 A 1, S 0 ) ) + β3a 2 + β4a 2 s 0 + β5a 2 s 1. The proposed estimator is unbiased for the µ t s provided: 1. Correctly modeled SNMM, incl. the ɛ t s functions. 2. A 1 {Y (a 1, a 2 )} S 0, and 3. A 2 {Y (a 1, a 2 )} S 0, A 1, S 1 Together, 2. and 3. is a Sequential Ignorability Assumption. But there may be other measured time-varying confounders...

15 5 Problems with Traditional Regression 15 Third Problem with Traditional Approach Time-varying Confounding Bias: Time-varying covariates X t that are confounders, but not moderators of interest? X 0 X 1 A 1 A 2 S 0 S 1 What is X t? EPS + SPS + MAXCE - LRI + AGE - NONWHITE -...and so on. Y (ā 2 ) Use RR with S 1 The auxiliary variables X t may be high-dimensional.

16 5 Problems with Traditional Regression 16 Solution: Inverse-Probability-of-Treatment Weights We use IPTW version of the proposed 2-Stage RR Estimator: X 0 X 1 Use IPTW A 1 A 2 S 0 S 1 Use IPTW What is X t? EPS + SPS + MAXCE - LRI + AGE - NONWHITE -... Y (ā 2 ) Use RR with S 1 The proposed IPTW estimator is unbiased provided (1) correct SNMM, (2) sequential ignorability given ( S t, X t ), (3) consistency, and (4) get the right weights.

17 5 Problems with Traditional Regression 17 The Form of the IPT Weights W 2 = W 1 = 1 P r(a 1 = a 1 S 0 = s 0, X 0 = x 0 ) 1 P r(a 2 = a 2 S 0 = s 0, X 0 = x 0, A 1 = a 1, S 1 = s 1, X 1 = x 1 ) Assumes denominator probabilities are non-zero. We use logistic regressions to estimate the denominator probs.; models chosen to result in best balance. W 1 W 2 is used in the IPTW+RR estimator of the SNMM. Following Murphy, van der Laan, Robins (unpublished), we use a stabilized version where the numerator for W t is P r(a t = a t Āt 1, S t 1 ).

18 6 Data Analysis 18 6 Data Analysis From US substance abuse prgms (CSAT SAMHSA) GAIN: structured clinical interview; over 100 scales/indices n = 2870 adolescents; data every 3 months for 1 year {(S 0, X 0 ), A 1, (S 1, X 1 ), A 2, (S 2, X 2 ), A 3, Y } S 0 = hx controlled environment, age S t = substance frequency scale at intake, 0-3, 3-6 X t = measured time-varying confounders at intake, 0-3, 3-6 A t = none (0) vs some txt (1=outpt, inpt, or both) Y = substance frequency scale at 9-12mo

19 6 Data Analysis 19 The weights did a good job adjusting for X t. t = 1 t = 2 t = 3 Effect Size small medium large Effect Size small medium large Effect Size small medium large Unweighted Weighted B = B = Unweighted Weighted B = B = Unweighted Weighted B = B = 0.037

20 6 Data Analysis 20 EDA µ 1 = Distal effects of initial treatment, given sfs8p0, age, and baseline CE status No CE No CE Yes CE Yes CE Under or older Under or older (1,0,0) (0,0,0) 0.1 Y = sfs8p µ 2 = Medial effects of additional treatment, given sfs8p3, age, and baseline CE status No CE No CE Yes CE Yes CE Under or older Under or older (1,1,0) (1,0,0) µ 3 = Proximal effects of additional treatment, given sfs8p6, age, and baseline CE status No CE No CE Yes CE Yes CE Under or older Under or older (1,1,1) (1,1,0) Time varying moderator = sfs8pt

21 6 Data Analysis 21 Effect Estimates from SNMM, using RR+IPTW Contrast Subgroup Est. Eff.Sz. P-val µ 1 : Distal (1, 0, 0) vs (0, 0, 0) no intake sevrty, < 16yrs (1, 0, 0) vs (0, 0, 0) hi intake sevrty, 16yrs µ 2 : Medial (1, 1, 0) vs (1, 0, 0) no 0-3 severity (1, 1, 0) vs (1, 0, 0) hi 0-3 severity, yes ce (1, 1, 0) vs (1, 0, 0) hi 0-3 severity, no ce µ 3 : Proximal (1, 1, 1) vs (1, 1, 0) no 6-9 severity (1, 1, 1) vs (1, 1, 0) hi 6-9 severity < 0.01 (.,., 1) vs (.,., 0) no 6-9 severity (.,., 1) vs (.,., 0) hi 6-9 severity < 0.01

22 6 Data Analysis 22 Some conjectures about the substantive story Initial treatment alone may be iatrogenic for older kids with high severity at intake (evidence is not so strong here). An additional 3mos of treatment may be more helpful for kids still severe at the end of 3 months who have a hx of a controlled environment (evidence is very weak here). Providing full treatment is especially beneficial for the kids who are still looking bad after 6 months (evidence here is reasonably strong). There is not a lot of evidence for a treatment effect for kids who are not severe.

23 7 Summary 23 7 Summary Time-varying causal effect moderation: What is the incremental effect of additional community-based substance use treatment, as a function of severity at intake and improvements over time? Examine using Robins Structural Nested Mean Model Propose wtd regression with residuals estimator for SNMM Resembles traditional regression estimator; easy-to-use Adjust time-varying confounders via IPT Weighting Standard errors: In simulation experiments, we find bootstrap SEs to be better than ASEs in small samples

24 7 Summary 24 Acknowledgements NIDA Funding: The Methodology Center at Penn State University (P50-DA ; PIs: Collins, Murphy & Co-I: Almirall) RAND (R01-DA ; PIs: McCaffrey, Griffin & Co-I: Ramchand) NIMH Funding: Univ of Michigan (R01-MH ; PI: Murphy) Special Help From: Mary Ellen Slaughter, RAND Bobby Yuen, Graduate Student, Michigan Statistics

25 7 Summary 25 Thank you. Contact Information: Daniel Almirall, Univ of Michigan, ISR * Robins (1994), Communications in Statistics. * Almirall, Tenhave, Murphy (2010), Biometrics. * Almirall, McCaffrey, Ramchand, Murphy (2011), Prevention Science. Software to fit SNMM using RR on my website. * Almirall, McCaffrey, Griffin, Ramchand, Murphy (to submit).

26 8 Back Pocket Slides 26 8 Back Pocket Slides

27 8 Back Pocket Slides 27 Warm-up: Suppose we want A Y. S? A Y Examples S = pre-a covt A = txt/expsr Y = outcome Social Support Inpatient vs. Outpatient Substance Abuse Why condition on ( adjust for ) pre-exposure covariables S?

28 8 Back Pocket Slides 28 Suppose we want the effect of A on Y. Why condition on (adjust for) pre-treatment (or pre-exposure) variables S? 1. Confounding: S is correlated with both A and Y. In this case, S is known as a confounder of the effect of A on Y. 2. Precision: S may be a pre-treatment measure of Y, or any other variable highly correlated with Y. 3. Missing Data: The outcome Y is missing for some units, S and A predict missingness, and S is associated with Y. 4. Effect Heterogeneity: S may moderate, temper, or specify the effect of A on Y. In this case, S is known as a moderator of the effect of A on Y.

29 8 Back Pocket Slides 29 Suppose we want the effect of A on Y. Why condition on (adjust for) pre-treatment (or pre-exposure) variables S? S A Y 4. Effect Heterogeneity: S may moderate, temper, or specify the effect of A on Y. In this case, S is known as a moderator of the effect of A on Y. Formalized in next slide.

30 8 Back Pocket Slides 30 Final Warm-up: Mean Model in One Time Point Decomposition of the conditional mean E(Y (a) S) and the prototypical linear model: E(Y (a) S = s) = E(Y (0)) ( ) + E(Y (0) S = s) E(Y (0)) + E(Y (a) Y (0) S = s) = η 0 + ɛ(s) + µ(s, a) e.g. = η 0 + η 1 (s E(S)) + β 1 a + β 2 as. Boils down to what we always do anyway: that is, treatment covariate interaction terms to examine effect heterogeneity.

31 8 Back Pocket Slides 31 Effect Moderation in One Time Point µ(s, a) E(Y (a) Y (0) S = s) Y(a) = Substance Use: Low is better a = 1 = residential a = 0 = outpatient µ(s) = E( Y(inpat) Y(outpat) S=s ) µ = 0 = No Effect S = Social Support: High is better S = Social Support: High is better Outpatient substance abuse treatment is better than residential treatment for individuals with higher levels of social support.

32 8 Back Pocket Slides 32 Causal Effect Moderation in Context: Relevance? Theoretical Implication: Understanding the heterogeneity of treatment or exposures effects enhances our understanding of various (competing) scientific theories; and it may suggest new scientific hypotheses to be tested. Practical Implication: Identifying types, or subgroups, of individuals for which treatment or exposure is not effective may suggest altering the treatment to suit the needs of those types of individuals.

33 8 Back Pocket Slides 33 Prototypical Linear Parametric Model We use β for our causal parameters of interest: E(Y (a) S) = η 0 + φ(s) + µ(s, a; β) = η 0 + φ(s) + ahβ where H is a function of S. Sometimes we parameterize φ(s) using φ(s; η 0 ) = Gη 0, where G is a function of S. Example: Let G = (S) and H = (1, S): E(Y (a) S = s) = η 0 + η 1 s + a (β 1 + β 2 s). If a and S are binary, then this is the fully saturated model.

34 8 Back Pocket Slides 34 Estimation in One Time Point Consider three estimators for β in µ(s, a; β): 1. Traditional Regression 2. Semi-parametric Estimation Method: Robins E-Estimator 3. Inverse Probability of Treatment Weighted (IPTW) Regression We discuss these (and more) in turn, supposing that 1. a is binary (0,1), and 2. True model for µ(s, a) is µ(s, a; β) = ahβ for some H. Example: H = (1, S) ahβ = a(β 1 + β 2 s). An important consideration in estimation is how A comes about.

35 8 Back Pocket Slides 35 Traditional Ordinary Least Squares Regression Recall true model: E(Y (a) S) = η 0 + φ(s) + ahβ. Useful when S is sole confounder, and have good model for φ(s). Requires model for nuisance function: φ(s; η 0 ) = Gη 0. Regress Y [1, G, A H] to get ( η, β). The β estimates solve ( ( ) 0 = P n Y η 0 Gη 0 AHβ AH ). T β unbiased for β if φ(s; η 0 ) = Gη 0 is true model for φ(s) and A {Y (0), Y (1)} given S.

36 8 Back Pocket Slides 36 Semi-parametric E-Estimator Recall true model: E(Y (a) S) = η 0 + φ(s) + ahβ. Useful when S is sole confounder, but we have no model for φ(s). Does NOT require model for nuisance function φ(s). Get β by solving the following estimating equations ( ( 0 = P n Y b(s; )( ) ξ) AHβ A p(s; α) H ), T where b(s; ξ) is a guess for E(Y AHβ S) = η 0 + φ(s). β unbiased for β if p(s; α) is true model for P r(a = 1 S), and A {Y (0), Y (1)} given S. (Discuss double-robustness.)

37 8 Back Pocket Slides 37 IPT Weighted Regression (WLS) Recall true model: E(Y (a) S) = η 0 + φ(s) + ahβ. Useful when we have measured confounders V ( S). Requires model for nuisance function: φ(s; η 0 ) = Gη 0. Regress Y ŵ [1, G, A H] to get ( η, β), where weights are w(v, A) = A P r(a = 1 S) P r(a = 1 V ) + (1 A) P r(a = 0 S) P r(a = 0 V ). β unbiased for β if φ(s; η 0 ) = Gη 0 is true model for φ(s), and A {Y (0), Y (1)} given V.

38 8 Back Pocket Slides 38 Semi-parametric Regression Method (Encore) Now, model is: E(Y (a) V ) = η 0 + φ (V ) + ahβ. Useful with confounders V ( S), have no model φ (V ), and if we can assume that V S does not moderate impact of a on Y (a). Does NOT require model for nuisance function φ(v ). Get β by solving the following estimating equations ( ( 0 = P n Y b(v )( ) ; ξ) AHβ A p(v ; α) H ). T β unbiased for β if p(s; α) is true model for P r(a = 1 S), and A {Y (0), Y (1)} given V.

39 8 Back Pocket Slides 39 An Overview of Estimation Strategies Model A: E(Y (a) S) = η 0 + φ(s) + ahβ Model B: E(Y (a) V ) = φ 0 + φ(v ) + ahβ H is always a function of S Ex: H = (1, S) Model A Model B No S is Sole Confnders Modrtrs S, Confnders Confnder V Confndrs V φ Is OLS OLS IPTW OLS Known Regression φ Is Not OLS E-estimtr IPTW E-estimtr Known (if S A) E-estimtr just discussed need P r(a = 1 S) need P r(a = 1 V )

40 8 Back Pocket Slides 40 As a Decomposition of the Marginal Causal Effect Recall the data structure {S 1, a 1, S 2 (a 1 ), a 2, Y (a 1, a 2 )}. Consider the following arithmetic decomposition of the causal effect of (a 1, a 2 ) on Y, using the covariates S 2 (a 1 ): E [ Y (a 1, a 2 ) Y (0, 0) ] = E [E [ Y (a 1, a 2 ) Y (a 1, 0) S 2 (a 1 ) ]] [ + E E [ ] ] Y (a 1, 0) Y (0, 0) S 1. The inner expectations represent the conditional intermediate causal effects µ 1 and µ 2, respectively.

41 8 Back Pocket Slides 41 Robins Structural Nested Mean Model The SNMM for the conditional mean of Y (a 1, a 2 ) given S 2 (a 1 ) is: E [ Y (a 1, a 2 ) S 1, S 2 (a 1 ) ] { } = E[Y (0, 0)] + E[Y (0, 0) S 1 ] E[Y (0, 0)] + {E [ ] } Y(a 1, 0) Y(0, 0) S 1 { + E[Y (a 1, 0) S } 2 (a 1 )] E[Y (a 1, 0) S 1 ] + {E [ Y(a 1, a 2 ) Y(a 1, 0) S 2 (a 1 ) ]} = µ 0 + ɛ 1 (s 1 ) + µ 1 (s 1, a 1 ) + ɛ 2 ( s 2, a 1 ) + µ 2 ( s 2, ā 2 )

42 8 Back Pocket Slides 42 SNMM Property I: µ t = 0 when a t = 0. E [ Y (a 1, a 2 ) S 0, S 1 (a 1 ) ] = E[Y (0, 0)] + { } E[Y (0, 0) S 0 ] E[Y (0, 0)] + {E [ ] } Y(a 1, 0) Y(0, 0) S 0 { + E[Y (a 1, 0) S } 1 (a 1 )] E[Y (a 1, 0) S 0 ] + {E [ Y(a 1, a 2 ) Y(a 1, 0) S 1 (a 1 ) ]} = µ 0 + ɛ 1 (s 0 ) + µ 1 (s 0, a 1 ) + ɛ 2 ( s 1, a 1 ) + µ 2 ( s 1, ā 2 ) µ 1 (s 0, 0) = 0 Ex Model: a 1 (β 10 + β 11 s 0 ) µ 2 ( s 1, a 2, 0) = 0 Ex Model: a 2 (β 20 + β 21 s 1 )

43 8 Back Pocket Slides 43 SNMM Property II: ɛ t s are conditional mean zero. E [ Y (a 1, a 2 ) S 0, S 1 (a 1 ) ] = E[Y (0, 0)] + { } E[Y(0, 0) S 0 ] E[Y(0, 0)] + {E [ ] } Y (a 1, 0) Y (0, 0) S 0 { } + E[Y(a 1, 0) S 1 (a 1 )] E[Y(a 1, 0) S 0 ] + {E [ Y (a 1, a 2 ) Y (a 1, 0) S 1 (a 1 ) ]} = µ 0 + ɛ 1 (s 0 ) + µ 1 (s 0, a 1 ) + ɛ 2 ( s 1, a 1 ) + µ 2 ( s 1, ā 2 ) E S1 S 0 [ɛ 2 ( s 1, a 1 ) S 0 = s 0 ] = 0, and E S0 [ɛ 1 (s 0 )] = 0 The ɛ t s make the SNMM a non-standard regression model.

44 8 Back Pocket Slides 44 So what s wrong with the Traditional Estimator? Ex: Use the Traditional Estimator to model the t = 2 SNMM as: E(Y S 1 = s 1, Ā2 = ā 2 ) = β0 + η 1 s 0 + β1a 1 + β2a 1 s 0 + η 2 s 1 + β3a 2 + β4a 2 s 0 + β5a 2 s 1 Two problems arise with the interpretation of β1 and β 2. These two problems may occur even when We use the correct model for the conditional mean, or The sole time-varying confounder is the putative time-varying moderator S t, or There is no time-varying confounding bias at all!

45 8 Back Pocket Slides 45 Traditional approach to estimate µ 1 is problematic. To explain what is wrong with the traditional estimator, we focus on estimating µ 1 using the traditional approach. µ 1 (s 0, a 1 ) = E[Y (a 1, 0) Y (0, 0) S 0 = s 0 ] a 1 a 2 = 0 S 0 Y (ā 2 )

46 8 Back Pocket Slides 46 First problem with the Traditional Approach Wrong Effect a 1 a 2 = 0 S 0 S 1 Y (ā 2 ) But what about the effect transmitted through S 1 (a 1 )? So the end result is the term β 1 a 1 + β 2 a 1s 0 does not capture the total impact of (a 1, 0) vs (0, 0) on Y given values of S 0.

47 8 Back Pocket Slides 47 Second problem with the Traditional Approach Spurious Bias V a 1 a 2 = 0 S 0 S 1 Y (ā 2 ) This is also known as Berkson s paradox ; and is related to Judea Pearl s back-door criterion and collider bias

48 8 Back Pocket Slides 48 Intuition about the Spurious Bias Social Support Txt Subst Use Later Substance Use Imagine adolescent who is a high user despite getting treated: Q: What does this tell you in terms of his social support? A: There must be poor social support. Implication: Conditional on substance use, getting treated is associated with more substance use! Bias is 1( )( )( ) = +.

49 8 Back Pocket Slides 49 The old warning against adjusting for post-treatment measures. Robins, Hernan, Cole, van der Laan, Pearl, Vanderwheele, & many others have published countless articles on elucidating this problem. Rosenbaum has an early article on this issue as well. Berkson s paradox in the context of case-control studies using hospitalized samples is related to this problem. Clinical trialists have been warning against this for a very long time! This is part of the reason why they advocate for ITT.

50 8 Back Pocket Slides 50 Proposed 2-Stage Regression Estimator Recall that E [ Y (a 1, a 2 ) S 2 (a 1 ) = s 2 ] = µ2 ( s 2, ā 2 ; β 2 ) + ɛ 2 ( s 2, a 1 ; η 2, γ 2 ) + µ 1 (s 1, a 1 ; β 1 ) + ɛ 1 (s 1 ; η 1, γ 1 ) + µ We have models for the µ s: A 1 H 1 β 1 and A 2 H 2 β 2 ; Set aside 2. Model m 1 (γ 1 ) = E(S 1 ), estimate γ 1 with GLM; model m 2 (s 1, a 1 ; γ 2 ) = E(S 2 (a 1 ) S 1 = s 1 ), estimate γ 2 with GLM 3. Construct residuals ˆδ 1 = s 1 ˆm 1 (ˆγ 1 ) and ˆδ 2 = s 2 ˆm 2 (s 1, a 1 ; ˆγ 2 ) 4. Construct models for ɛ s: G 1ˆδ1 η 1 = G 1 η 1 and G 2ˆδ2 η 2 = G 2 η 2 5. Obtain ˆβ and ˆη using OLS of Y [1, G 1, A 1H 1, G 2, A 2H 2 ]

51 8 Back Pocket Slides 51 Data Descriptives: Treatment Trajectories Treatment (A 1, A 2, A 3 ) Frequency Proportion (0,0,0) % (0,1,0) 56 2% (1,0,0) % (1,1,0) % (0,0,1) 56 2% (0,1,1) 56 2% (1,0,1) 153 5% (1,1,1) %

52 8 Back Pocket Slides 52 Data Descriptives: Moderators and Outcomes Moderators Mean SD Range S 0 sfs8p (0, 0.89) b2a (12, 25) maxce (0, 90) S 1 sfs8p (0, 0.67) S 2 sfs8p (0, 0.73) Outcome Mean SD Range Y sfs8p (0, 0.78)

53 8 Back Pocket Slides 53 How did we choose our weights? BVAL=0.161 WBAL maxw BVAL=0.155 WBAL maxw BVAL=0.198 WBAL maxw imp= imp=2 imp= maxes maxes maxes ESS ESS ESS All confounders ES Step COR Step Selecting Denominator Model t=1 All confounders ES Step COR Step Selecting Denominator Model t=2 All confounders ES Step COR Step Selecting Denominator Model t=3

54 8 Back Pocket Slides 54 How did the weights do? Denom Pr. Denominator weights Balance t No. (min, max) (min, max) ESS J B M 1 19 (0.20, 0.98) (1.02, 32.83) (0.03, 0.95) (1.03, 15.43) (0.01, 0.97) (1.01, 37.93)

55 8 Back Pocket Slides 55 Existing Semi-parametric G-Estimator Recall our SNMM: E [ Y (a 1, a 2 ) S 1, S 2 (a 1 ) ] = µ 0 + ɛ 1 (s 1 ) + β 10 a 1 + β 11 a 1 s 1 + ɛ 2 ( s 2, a 1 ) + β 20 a 2 + β 21 a 2 s 1 + β 22 a 2 s 2 Robins G-Estimator models the ɛ t s implicitly, as part of an algorithm. It also allows for incorrect models for the ɛ t s if models for the time-varying propensity scores p t = P r(a t S t, A t 1 ) are correctly specified. That is, if either of the p t s or ɛ t s are correctly specified, then the G-Estimator yields unbiased estimates of the causal β s.

56 8 Back Pocket Slides 56 Robins Semi-parametric G-Estimator Robins G-Estimator is the solution to these estimating equations: { ( 0 = P n Y A 2 H 2 β 2 b 2 ( S ) ( 2, A 1 ) A 2 p 2 ( S ) 2, A 1 ) 0 H 2 } ( ) ( ) + Y A 2 H 2 β 2 A 1 H 1 β 1 b 1 (S 1 ) A 1 p 1 (S 1 ) H 1 (H 1 ) [ ] H 2 A 2 S 1, A 1 = 1 (H 1 ) = E E b 2 ( S 2, A 1 ) = E [ Y A 2 H 2 β 2 S ] 2, A 1 p 2 ( S 2, A 1 ) = P r [ A 2 = 1 S ] 2, A 1 b 1 (S 1 ) = E [Y A 2 H 2 β 2 A 1 H 1 β 1 S 1 ] p 1 (S 1 ) = P r [A 1 = 1 S 2 ] [ ] H 2 A 2 S 1, A 1 = 0

57 8 Back Pocket Slides 57 Bias-Variance Trade-off This discussion assumes true models for the causal effects, the µ t s: Robins G-Estimator is unbiased if either p t or b t are correctly specified. So-called double-robustness property. Robins G-Estimator is semi-parametric efficient if p t, b t, and are all correctly specified. 2-Stage Regression Estimator is unbiased only if the nuisance functions are correctly specified. 2-Stage Regression Estimator with correctly specified nuisance is more efficient than G-Estimator But what happens as we mis-specify the nuisance functions?

58 8 Back Pocket Slides 58 Mis-specifying ɛ t s using S = S N(1, sd = ν) Scaled Root Mean Squared Difference (SRMSD) true Larger values of υ correspond to worse fitting 2 Stage Regression estimators. MSD is the mean squared difference between the true nuisance function and the mis specified nuisance function. SRMSD is equal to root MSD divided by the standard deviation of the response Y υ

59 8 Back Pocket Slides 59 Results 1.0 Relative Mean Squared Error for β: MSE(Robins G Estimator) / MSE(2 Stage Estimator) Relative MSE versus level of Mis specification υ SRMSD a1 a1:s1 a2 a2:i((s1 + s2)/2) a3 a3:i((s1 + s2 + s3)/3) υ SRMSD

60 8 Back Pocket Slides 60 The Generative Model in Simulations nits = 1000 simulated data sets each of size n = δ 1 res 1. Then S δ Z Bin(n, p = 0.50). Then A 1 0 if Z = 0; otherwise A 1 Bin(n, p 1 = Λ( s 1 )) 3. δ 2 N n (0, sd = 0.75). Generate S 2 by setting S s a s s 1 a 1 + δ Set A 2 0 if A 1 = 0; otherwise A 2 Bin(n, p 2 = Λ( s s 2 )).

61 8 Back Pocket Slides δ 3 N n (0, sd = 0.51). Generate S 3 by setting S s a s a s Set A 3 0 if A 2 = 0; otherwise 0.04s a 1 s 1 + δ 3. A 3 Bin(n, p 3 = Λ( s 1 0.3s s 3 )). SNMM Generated as follows: Y intercept + ɛ TRUE 1 (s 1 ; η 1 ) + a 1 (β 1,1 + β 1,2 s 1 ) where + ɛ TRUE 2 ( s 2, a 1 ; η 2 ) + a 2 (β 2,1 + β 2,2 (s 1 + s 2 )/2) + ɛ TRUE 3 ( s 3, ā 2 ; η 3 ) + a 3 (β 1,1 + β 3,2 (s 1 + s 2 + s 3 )/3) + δ y,

62 8 Back Pocket Slides intercept = β 1,1 = β 2,1 = β 3,2 = 0.30, 3. β 1,2 = β 2,2 = β 3,1 = 0.30, 4. δ y is a random sample of size n from N(0, sd = 0.7), and where the true nuisance functions are defined as 1. ɛ TRUE 1 (s 1 ; η 1 ) = 0.45 δ 1, 2. ɛ TRUE 2 ( s 2, a 1 ; η 2 ) = ( s a a 1 s sin(4.5s 1 )) δ 2, 3. ɛ TRUE 3 ( s 3, ā 2 ; η 3 ) = ( s a a 2 s sin(2.5s 2 )) δ 3.

63 8 Back Pocket Slides 63 Scaled Root Mean Squared Difference This is how we measured amount of mis-specification: ( K E t=3 ɛtrue t ) 2 K t=1 ɛν t ( η, γ) SRMSD(ν) =, V ar(y ) where ν corresponds to a mis-specified 2-Stage Regression Estimator. The expectation E and variance V ar in SRMSD are over the data D = ( S 3, Ā3, Y ) for fixed ( η, γ). Calculated via Monte Carlo integration. I claim SRMSD has an effect-size-like interpretation.

64 8 Back Pocket Slides 64 Confounding in PROSPECT A 1 = HSANY 4 A 2 = HSANY 8 A 3 = HSANY 12 Before Weighting After Weighting Variable Absolute Effect Name Effect Size Sign Size HAMDA RE RE16N MCS MMSE SSI CAD DYSTH OPS HAMDA CAD POSAF CAD WHITE RP16N

65 8 Back Pocket Slides 65 Adolescent Substance Use and Community-based Treatment Motivating data set / application Collected by Center for Substance Abuse Treatment From a combination of major substance abuse programs Managed and cleaned by Chestnut Health Systems, IL (Michael Dennis) Global Appraisal of Individual Needs (GAIN): structured clinical interview, over 100 measures Full adolescent data set is n = 6000 and counting...

66 8 Back Pocket Slides 66 The Illustrative Data Set n = 2870 adolescents Interested in fitting a K = 2 time points SNMM S 1 A 1 S 2 A 2 Y = ERS S 1 = need0 = binary indicator of need/severity at baseline A 1 = anytxt3 = reported no treatment (0) versus some treatment (1=outpatient, inpatient, or both) at 3-months S 2 = need6 = binary indicator of need/severity at 6-months A 2 = anytxt9 = treatment indicator at at 9-months Y = ERS = Environmental Risk Scale at 12-months

67 8 Back Pocket Slides 67 What is the Scientific Question? µ 1 = What is the effect of receiving treatment versus not at 3-months (and not receiving treatment in the future) on 12-month ERS scores, conditional on baseline severity? µ 2 = What is the effect of receiving treatment versus not at 9-months on 12-month ERS scores, as a function of baseline severity, having received (or not) treatment at 3-months, and 6-month severity?

68 8 Back Pocket Slides 68

69 8 Back Pocket Slides 69

70 8 Back Pocket Slides 70 Specifying the Saturated SNMM Causal effects: 1. µ 1 = anytxt 3 (β 10 + β 11 need 0 ), 2. µ 2 = anytxt 9 (β 20 + β 21 need 0 + β 22 anytxt 3 + β 23 need 6 + β 24 need 0 anytxt 3 + β 25 need 0 need 6 + β 26 anytxt 3 need 6 + β 27 need 0 anytxt 3 need 6 ) Nuisance functions: 1. ɛ 1 = η 11 (need 0 Pr(need 0 = 1)), 2. ɛ 2 = (η 21 + η 22 need 0 + η 23 anytxt 3 + η 24 need 0 anytxt 3 ) (need 6 Pr( need 6 = 1 need 0, anytxt 3 )), where Pr(need 6 = 1 need 0, anytxt 3 ) = γ 20 + γ 21 need 0 + γ 22 anytxt 3 + γ 23 need 0 anytxt 3.

71 8 Back Pocket Slides 71 Estimates of the SNMM Using the 2-Stage Regression Estimator 2-Stage Estimator Parameters β SE P-val µ 0 Int β < 0.01 µ 1 Int β need 0 β µ 2 Int β need 0 β anytxt 3 β need 6 β need 0 anytxt 3 β need 0 need 6 β anytxt 3 need 6 β need 0 anytxt 3 need 6 β

72 8 Back Pocket Slides 72 RR+IPTW vs RR vs TRAD Different Estimators Contrast Subgroup RR+IPTW RR TRAD µ 1 : Distal (1, 0, 0) vs (0, 0, 0) no intake sevrty, < 16yrs (1, 0, 0) vs (0, 0, 0) hi intake sevrty, 16yrs µ 2 : Medial (1, 1, 0) vs (1, 0, 0) no 0-3 severity (1, 1, 0) vs (1, 0, 0) hi 0-3 severity, yes ce (1, 1, 0) vs (1, 0, 0) hi 0-3 severity, no ce µ 3 : Proximal (1, 1, 1) vs (1, 1, 0) no 3-6 severity (1, 1, 1) vs (1, 1, 0) hi 3-6 severity (.,., 1) vs (.,., 0) no 3-6 severity (.,., 1) vs (.,., 0) hi 3-6 severity

73 8 Back Pocket Slides 73 Some conjectures about the methodological story Spurious bias for the distal effect is probably POS (see arguments I made earlier) Confounding bias for the distal effect is probably NEG (good kids get (1,0,0) and also have better/lower Y ) Spurious and confounding bias cancel each other out and this is why we see TRAD approximately the same as RR+IPTW. Confounding bias for the proximal effect is probably POS (bad kids get (1,1,1) and also have worse/higher Y ) and this is why we see the estimated proximal effects under RR+IPTW (vs are much stronger NEG

74 9 Connections with the Marginal Structural Model 74 9 Connections with the Marginal Structural Model The MSM is a model for E(Y (a 1, a 2 ) S 0 ) The SNMM is a model for E(Y (a 1, a 2 ) S 0, S 1 (a 1 )) So the law of iterated expectations gives us the MSM: E [ ] Y (a 1, a 2 ) S 0 = s 0 ( = E S1 (a 1 ) S 0 E [ Y (a 1, a 2 ) S ] ) 1 (a 1 ) = s 1 ) = µ 0 + ɛ 1 (s 0 ) + µ 1 (s 0, a 1 ) + E S1 (a 1 ) S 0 (ɛ 2 ( s 1, a 1 ) + µ 2 ( s 1, ā 2 ) ) = µ 0 + ɛ 1 (s 0 ) + µ 1 (s 0, a 1 ) + E S1 (a 1 ) S 0 (µ 2 ( s 1, ā 2 )

75 9 Connections with the Marginal Structural Model 75 Connections with the Marginal Structural Model Due to linearity: If effect moderation, we can get MSM estimates by plugging-in the estimated stage 1 regression for the time-varying moderators in the µ t s. Think path analysis. But now we have to believe our stage 1 models are causal! If effect moderation, estimates for the µ t s are indeed estimates for the marginal effects. Just read them off. Regardless, it is possible to use the RR+IPTW to get a double robust estimate of the marginal effects. Useful when we fail to balance on some covariates. To do this, (i) employ the plug-in estimator above, and (ii) don t use the numerator propensity score model in the weights.

Daniel Almirall 1, Daniel F. McCaffrey 2, Beth Ann Griffin 2, Rajeev Ramchand 2, Susan A. Murphy 1

Daniel Almirall 1, Daniel F. McCaffrey 2, Beth Ann Griffin 2, Rajeev Ramchand 2, Susan A. Murphy 1 1 Examining moderated effects of additional adolescent substance use treatment: Structural nested mean model estimation using inverse-weighted regression-with-residuals Daniel Almirall 1, Daniel F. McCaffrey

More information

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall 1 Structural Nested Mean Models for Assessing Time-Varying Effect Moderation Daniel Almirall Center for Health Services Research, Durham VAMC & Dept. of Biostatistics, Duke University Medical Joint work

More information

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall 1 Structural Nested Mean Models for Assessing Time-Varying Effect Moderation Daniel Almirall Center for Health Services Research, Durham VAMC & Duke University Medical, Dept. of Biostatistics Joint work

More information

Propensity scores for repeated treatments: A tutorial for the iptw function in the twang package

Propensity scores for repeated treatments: A tutorial for the iptw function in the twang package Propensity scores for repeated treatments: A tutorial for the iptw function in the twang package Lane Burgette, Beth Ann Griffin and Dan McCaffrey RAND Corporation July, 07 Introduction While standard

More information

Geoffrey T. Wodtke. University of Toronto. Daniel Almirall. University of Michigan. Population Studies Center Research Report July 2015

Geoffrey T. Wodtke. University of Toronto. Daniel Almirall. University of Michigan. Population Studies Center Research Report July 2015 Estimating Heterogeneous Causal Effects with Time-Varying Treatments and Time-Varying Effect Moderators: Structural Nested Mean Models and Regression-with-Residuals Geoffrey T. Wodtke University of Toronto

More information

Propensity Scores for Repeated Treatments

Propensity Scores for Repeated Treatments C O R P O R A T I O N Propensity Scores for Repeated Treatments A Tutorial for the iptw Function in the TWANG Package Lane F. Burgette, Beth Ann Griffin, Daniel F. McCaffrey For more information on this

More information

Step-by-Step Guidelines for Propensity Score Weighting with Two Groups. Beth Ann Griffin Daniel McCaffrey

Step-by-Step Guidelines for Propensity Score Weighting with Two Groups. Beth Ann Griffin Daniel McCaffrey Step-by-Step Guidelines for Propensity Score Weighting with Two Groups Beth Ann Griffin Daniel McCaffrey 1 Four key steps 1) Choose the primary treatment effect of interest (ATE or ATT) 2) Estimate propensity

More information

Targeted Maximum Likelihood Estimation in Safety Analysis

Targeted Maximum Likelihood Estimation in Safety Analysis Targeted Maximum Likelihood Estimation in Safety Analysis Sam Lendle 1 Bruce Fireman 2 Mark van der Laan 1 1 UC Berkeley 2 Kaiser Permanente ISPE Advanced Topics Session, Barcelona, August 2012 1 / 35

More information

Comparing Adaptive Interventions Using Data Arising from a SMART: With Application to Autism, ADHD, and Mood Disorders

Comparing Adaptive Interventions Using Data Arising from a SMART: With Application to Autism, ADHD, and Mood Disorders Comparing Adaptive Interventions Using Data Arising from a SMART: With Application to Autism, ADHD, and Mood Disorders Daniel Almirall, Xi Lu, Connie Kasari, Inbal N-Shani, Univ. of Michigan, Univ. of

More information

Running Head: Effect Heterogeneity with Time-varying Treatments and Moderators ESTIMATING HETEROGENEOUS CAUSAL EFFECTS WITH TIME-VARYING

Running Head: Effect Heterogeneity with Time-varying Treatments and Moderators ESTIMATING HETEROGENEOUS CAUSAL EFFECTS WITH TIME-VARYING Running Head: Effect Heterogeneity with Time-varying Treatments and Moderators ESTIMATING HETEROGENEOUS CAUSAL EFFECTS WITH TIME-VARYING TREATMENTS AND TIME-VARYING EFFECT MODERATORS: STRUCTURAL NESTED

More information

Estimating the effects of timevarying treatments in the presence of time-varying confounding

Estimating the effects of timevarying treatments in the presence of time-varying confounding Estimating the effects of timevarying treatments in the presence of time-varying confounding An application to neighborhood effects on high school graduation David J. Harding, University of Michigan (joint

More information

Gov 2002: 4. Observational Studies and Confounding

Gov 2002: 4. Observational Studies and Confounding Gov 2002: 4. Observational Studies and Confounding Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last two weeks: randomized experiments. From here on: observational studies. What

More information

PSC 504: Instrumental Variables

PSC 504: Instrumental Variables PSC 504: Instrumental Variables Matthew Blackwell 3/28/2013 Instrumental Variables and Structural Equation Modeling Setup e basic idea behind instrumental variables is that we have a treatment with unmeasured

More information

Gov 2002: 13. Dynamic Causal Inference

Gov 2002: 13. Dynamic Causal Inference Gov 2002: 13. Dynamic Causal Inference Matthew Blackwell December 19, 2015 1 / 33 1. Time-varying treatments 2. Marginal structural models 2 / 33 1/ Time-varying treatments 3 / 33 Time-varying treatments

More information

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal

Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Marginal versus conditional effects: does it make a difference? Mireille Schnitzer, PhD Université de Montréal Overview In observational and experimental studies, the goal may be to estimate the effect

More information

An Introduction to Causal Analysis on Observational Data using Propensity Scores

An Introduction to Causal Analysis on Observational Data using Propensity Scores An Introduction to Causal Analysis on Observational Data using Propensity Scores Margie Rosenberg*, PhD, FSA Brian Hartman**, PhD, ASA Shannon Lane* *University of Wisconsin Madison **University of Connecticut

More information

Combining multiple observational data sources to estimate causal eects

Combining multiple observational data sources to estimate causal eects Department of Statistics, North Carolina State University Combining multiple observational data sources to estimate causal eects Shu Yang* syang24@ncsuedu Joint work with Peng Ding UC Berkeley May 23,

More information

Integrated approaches for analysis of cluster randomised trials

Integrated approaches for analysis of cluster randomised trials Integrated approaches for analysis of cluster randomised trials Invited Session 4.1 - Recent developments in CRTs Joint work with L. Turner, F. Li, J. Gallis and D. Murray Mélanie PRAGUE - SCT 2017 - Liverpool

More information

Missing Covariate Data in Matched Case-Control Studies

Missing Covariate Data in Matched Case-Control Studies Missing Covariate Data in Matched Case-Control Studies Department of Statistics North Carolina State University Paul Rathouz Dept. of Health Studies U. of Chicago prathouz@health.bsd.uchicago.edu with

More information

e author and the promoter give permission to consult this master dissertation and to copy it or parts of it for personal use. Each other use falls

e author and the promoter give permission to consult this master dissertation and to copy it or parts of it for personal use. Each other use falls e author and the promoter give permission to consult this master dissertation and to copy it or parts of it for personal use. Each other use falls under the restrictions of the copyright, in particular

More information

Bayesian regression tree models for causal inference: regularization, confounding and heterogeneity

Bayesian regression tree models for causal inference: regularization, confounding and heterogeneity Bayesian regression tree models for causal inference: regularization, confounding and heterogeneity P. Richard Hahn, Jared Murray, and Carlos Carvalho June 22, 2017 The problem setting We want to estimate

More information

Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies

Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies Kosuke Imai Princeton University February 23, 2012 Joint work with L. Keele (Penn State)

More information

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016 An Introduction to Causal Mediation Analysis Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016 1 Causality In the applications of statistics, many central questions

More information

Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies

Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies Unpacking the Black-Box of Causality: Learning about Causal Mechanisms from Experimental and Observational Studies Kosuke Imai Princeton University January 23, 2012 Joint work with L. Keele (Penn State)

More information

Propensity Score Methods for Causal Inference

Propensity Score Methods for Causal Inference John Pura BIOS790 October 2, 2015 Causal inference Philosophical problem, statistical solution Important in various disciplines (e.g. Koch s postulates, Bradford Hill criteria, Granger causality) Good

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University

More information

Propensity Score Weighting with Multilevel Data

Propensity Score Weighting with Multilevel Data Propensity Score Weighting with Multilevel Data Fan Li Department of Statistical Science Duke University October 25, 2012 Joint work with Alan Zaslavsky and Mary Beth Landrum Introduction In comparative

More information

Extending causal inferences from a randomized trial to a target population

Extending causal inferences from a randomized trial to a target population Extending causal inferences from a randomized trial to a target population Issa Dahabreh Center for Evidence Synthesis in Health, Brown University issa dahabreh@brown.edu January 16, 2019 Issa Dahabreh

More information

Estimating the Marginal Odds Ratio in Observational Studies

Estimating the Marginal Odds Ratio in Observational Studies Estimating the Marginal Odds Ratio in Observational Studies Travis Loux Christiana Drake Department of Statistics University of California, Davis June 20, 2011 Outline The Counterfactual Model Odds Ratios

More information

Semiparametric Generalized Linear Models

Semiparametric Generalized Linear Models Semiparametric Generalized Linear Models North American Stata Users Group Meeting Chicago, Illinois Paul Rathouz Department of Health Studies University of Chicago prathouz@uchicago.edu Liping Gao MS Student

More information

Causal Mechanisms Short Course Part II:

Causal Mechanisms Short Course Part II: Causal Mechanisms Short Course Part II: Analyzing Mechanisms with Experimental and Observational Data Teppei Yamamoto Massachusetts Institute of Technology March 24, 2012 Frontiers in the Analysis of Causal

More information

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions

Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Causal Inference with a Continuous Treatment and Outcome: Alternative Estimators for Parametric Dose-Response Functions Joe Schafer Office of the Associate Director for Research and Methodology U.S. Census

More information

Empirical Bayes Moderation of Asymptotically Linear Parameters

Empirical Bayes Moderation of Asymptotically Linear Parameters Empirical Bayes Moderation of Asymptotically Linear Parameters Nima Hejazi Division of Biostatistics University of California, Berkeley stat.berkeley.edu/~nhejazi nimahejazi.org twitter/@nshejazi github/nhejazi

More information

DATA-ADAPTIVE VARIABLE SELECTION FOR

DATA-ADAPTIVE VARIABLE SELECTION FOR DATA-ADAPTIVE VARIABLE SELECTION FOR CAUSAL INFERENCE Group Health Research Institute Department of Biostatistics, University of Washington shortreed.s@ghc.org joint work with Ashkan Ertefaie Department

More information

GEE for Longitudinal Data - Chapter 8

GEE for Longitudinal Data - Chapter 8 GEE for Longitudinal Data - Chapter 8 GEE: generalized estimating equations (Liang & Zeger, 1986; Zeger & Liang, 1986) extension of GLM to longitudinal data analysis using quasi-likelihood estimation method

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University Joint

More information

Empirical Bayes Moderation of Asymptotically Linear Parameters

Empirical Bayes Moderation of Asymptotically Linear Parameters Empirical Bayes Moderation of Asymptotically Linear Parameters Nima Hejazi Division of Biostatistics University of California, Berkeley stat.berkeley.edu/~nhejazi nimahejazi.org twitter/@nshejazi github/nhejazi

More information

BIOS 6649: Handout Exercise Solution

BIOS 6649: Handout Exercise Solution BIOS 6649: Handout Exercise Solution NOTE: I encourage you to work together, but the work you submit must be your own. Any plagiarism will result in loss of all marks. This assignment is based on weight-loss

More information

Controlling for Time Invariant Heterogeneity

Controlling for Time Invariant Heterogeneity Controlling for Time Invariant Heterogeneity Yona Rubinstein July 2016 Yona Rubinstein (LSE) Controlling for Time Invariant Heterogeneity 07/16 1 / 19 Observables and Unobservables Confounding Factors

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2010 Paper 269 Diagnosing and Responding to Violations in the Positivity Assumption Maya L. Petersen

More information

Causal Inference Basics

Causal Inference Basics Causal Inference Basics Sam Lendle October 09, 2013 Observed data, question, counterfactuals Observed data: n i.i.d copies of baseline covariates W, treatment A {0, 1}, and outcome Y. O i = (W i, A i,

More information

A Sampling of IMPACT Research:

A Sampling of IMPACT Research: A Sampling of IMPACT Research: Methods for Analysis with Dropout and Identifying Optimal Treatment Regimes Marie Davidian Department of Statistics North Carolina State University http://www.stat.ncsu.edu/

More information

Measurement error effects on bias and variance in two-stage regression, with application to air pollution epidemiology

Measurement error effects on bias and variance in two-stage regression, with application to air pollution epidemiology Measurement error effects on bias and variance in two-stage regression, with application to air pollution epidemiology Chris Paciorek Department of Statistics, University of California, Berkeley and Adam

More information

Specification Errors, Measurement Errors, Confounding

Specification Errors, Measurement Errors, Confounding Specification Errors, Measurement Errors, Confounding Kerby Shedden Department of Statistics, University of Michigan October 10, 2018 1 / 32 An unobserved covariate Suppose we have a data generating model

More information

For more information about how to cite these materials visit

For more information about how to cite these materials visit Author(s): Kerby Shedden, Ph.D., 2010 License: Unless otherwise noted, this material is made available under the terms of the Creative Commons Attribution Share Alike 3.0 License: http://creativecommons.org/licenses/by-sa/3.0/

More information

arxiv: v1 [stat.me] 15 May 2011

arxiv: v1 [stat.me] 15 May 2011 Working Paper Propensity Score Analysis with Matching Weights Liang Li, Ph.D. arxiv:1105.2917v1 [stat.me] 15 May 2011 Associate Staff of Biostatistics Department of Quantitative Health Sciences, Cleveland

More information

Previous lecture. P-value based combination. Fixed vs random effects models. Meta vs. pooled- analysis. New random effects testing.

Previous lecture. P-value based combination. Fixed vs random effects models. Meta vs. pooled- analysis. New random effects testing. Previous lecture P-value based combination. Fixed vs random effects models. Meta vs. pooled- analysis. New random effects testing. Interaction Outline: Definition of interaction Additive versus multiplicative

More information

Propensity Score Analysis with Hierarchical Data

Propensity Score Analysis with Hierarchical Data Propensity Score Analysis with Hierarchical Data Fan Li Alan Zaslavsky Mary Beth Landrum Department of Health Care Policy Harvard Medical School May 19, 2008 Introduction Population-based observational

More information

Discussion of Papers on the Extensions of Propensity Score

Discussion of Papers on the Extensions of Propensity Score Discussion of Papers on the Extensions of Propensity Score Kosuke Imai Princeton University August 3, 2010 Kosuke Imai (Princeton) Generalized Propensity Score 2010 JSM (Vancouver) 1 / 11 The Theme and

More information

IP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS IPW and MSM

IP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS IPW and MSM IP WEIGHTING AND MARGINAL STRUCTURAL MODELS (CHAPTER 12) BIOS 776 1 12 IPW and MSM IP weighting and marginal structural models ( 12) Outline 12.1 The causal question 12.2 Estimating IP weights via modeling

More information

Casual Mediation Analysis

Casual Mediation Analysis Casual Mediation Analysis Tyler J. VanderWeele, Ph.D. Upcoming Seminar: April 21-22, 2017, Philadelphia, Pennsylvania OXFORD UNIVERSITY PRESS Explanation in Causal Inference Methods for Mediation and Interaction

More information

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010

Stat/F&W Ecol/Hort 572 Review Points Ané, Spring 2010 1 Linear models Y = Xβ + ɛ with ɛ N (0, σ 2 e) or Y N (Xβ, σ 2 e) where the model matrix X contains the information on predictors and β includes all coefficients (intercept, slope(s) etc.). 1. Number of

More information

Quantitative Economics for the Evaluation of the European Policy

Quantitative Economics for the Evaluation of the European Policy Quantitative Economics for the Evaluation of the European Policy Dipartimento di Economia e Management Irene Brunetti Davide Fiaschi Angela Parenti 1 25th of September, 2017 1 ireneb@ec.unipi.it, davide.fiaschi@unipi.it,

More information

Selection on Observables: Propensity Score Matching.

Selection on Observables: Propensity Score Matching. Selection on Observables: Propensity Score Matching. Department of Economics and Management Irene Brunetti ireneb@ec.unipi.it 24/10/2017 I. Brunetti Labour Economics in an European Perspective 24/10/2017

More information

Summary and discussion of The central role of the propensity score in observational studies for causal effects

Summary and discussion of The central role of the propensity score in observational studies for causal effects Summary and discussion of The central role of the propensity score in observational studies for causal effects Statistics Journal Club, 36-825 Jessica Chemali and Michael Vespe 1 Summary 1.1 Background

More information

Standardization methods have been used in epidemiology. Marginal Structural Models as a Tool for Standardization ORIGINAL ARTICLE

Standardization methods have been used in epidemiology. Marginal Structural Models as a Tool for Standardization ORIGINAL ARTICLE ORIGINAL ARTICLE Marginal Structural Models as a Tool for Standardization Tosiya Sato and Yutaka Matsuyama Abstract: In this article, we show the general relation between standardization methods and marginal

More information

Addressing Analysis Issues REGRESSION-DISCONTINUITY (RD) DESIGN

Addressing Analysis Issues REGRESSION-DISCONTINUITY (RD) DESIGN Addressing Analysis Issues REGRESSION-DISCONTINUITY (RD) DESIGN Overview Assumptions of RD Causal estimand of interest Discuss common analysis issues In the afternoon, you will have the opportunity to

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Princeton University Asian Political Methodology Conference University of Sydney Joint

More information

Telescope Matching: A Flexible Approach to Estimating Direct Effects *

Telescope Matching: A Flexible Approach to Estimating Direct Effects * Telescope Matching: A Flexible Approach to Estimating Direct Effects * Matthew Blackwell Anton Strezhnev August 4, 2018 Abstract Estimating the direct effect of a treatment fixing the value of a consequence

More information

Chapter 1 Statistical Inference

Chapter 1 Statistical Inference Chapter 1 Statistical Inference causal inference To infer causality, you need a randomized experiment (or a huge observational study and lots of outside information). inference to populations Generalizations

More information

Telescope Matching: A Flexible Approach to Estimating Direct Effects

Telescope Matching: A Flexible Approach to Estimating Direct Effects Telescope Matching: A Flexible Approach to Estimating Direct Effects Matthew Blackwell and Anton Strezhnev International Methods Colloquium October 12, 2018 direct effect direct effect effect of treatment

More information

1 Warm-Up: 2 Adjusted R 2. Introductory Applied Econometrics EEP/IAS 118 Spring Sylvan Herskowitz Section #

1 Warm-Up: 2 Adjusted R 2. Introductory Applied Econometrics EEP/IAS 118 Spring Sylvan Herskowitz Section # Introductory Applied Econometrics EEP/IAS 118 Spring 2015 Sylvan Herskowitz Section #10 4-1-15 1 Warm-Up: Remember that exam you took before break? We had a question that said this A researcher wants to

More information

Causal inference in epidemiological practice

Causal inference in epidemiological practice Causal inference in epidemiological practice Willem van der Wal Biostatistics, Julius Center UMC Utrecht June 5, 2 Overview Introduction to causal inference Marginal causal effects Estimating marginal

More information

Chapter 2: simple regression model

Chapter 2: simple regression model Chapter 2: simple regression model Goal: understand how to estimate and more importantly interpret the simple regression Reading: chapter 2 of the textbook Advice: this chapter is foundation of econometrics.

More information

Difference-in-Differences Methods

Difference-in-Differences Methods Difference-in-Differences Methods Teppei Yamamoto Keio University Introduction to Causal Inference Spring 2016 1 Introduction: A Motivating Example 2 Identification 3 Estimation and Inference 4 Diagnostics

More information

The Linear Regression Model

The Linear Regression Model The Linear Regression Model Carlo Favero Favero () The Linear Regression Model 1 / 67 OLS To illustrate how estimation can be performed to derive conditional expectations, consider the following general

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2004 Paper 155 Estimation of Direct and Indirect Causal Effects in Longitudinal Studies Mark J. van

More information

Using Estimating Equations for Spatially Correlated A

Using Estimating Equations for Spatially Correlated A Using Estimating Equations for Spatially Correlated Areal Data December 8, 2009 Introduction GEEs Spatial Estimating Equations Implementation Simulation Conclusion Typical Problem Assess the relationship

More information

PSC 504: Dynamic Causal Inference

PSC 504: Dynamic Causal Inference PSC 504: Dynamic Causal Inference Matthew Blackwell 4/8/203 e problem Let s go back to a problem that we faced earlier, which is how to estimate causal effects with treatments that vary over time. We could

More information

Robustness to Parametric Assumptions in Missing Data Models

Robustness to Parametric Assumptions in Missing Data Models Robustness to Parametric Assumptions in Missing Data Models Bryan Graham NYU Keisuke Hirano University of Arizona April 2011 Motivation Motivation We consider the classic missing data problem. In practice

More information

Bayesian Causal Forests

Bayesian Causal Forests Bayesian Causal Forests for estimating personalized treatment effects P. Richard Hahn, Jared Murray, and Carlos Carvalho December 8, 2016 Overview We consider observational data, assuming conditional unconfoundedness,

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2011 Paper 288 Targeted Maximum Likelihood Estimation of Natural Direct Effect Wenjing Zheng Mark J.

More information

Weighting. Homework 2. Regression. Regression. Decisions Matching: Weighting (0) W i. (1) -å l i. )Y i. (1-W i 3/5/2014. (1) = Y i.

Weighting. Homework 2. Regression. Regression. Decisions Matching: Weighting (0) W i. (1) -å l i. )Y i. (1-W i 3/5/2014. (1) = Y i. Weighting Unconfounded Homework 2 Describe imbalance direction matters STA 320 Design and Analysis of Causal Studies Dr. Kari Lock Morgan and Dr. Fan Li Department of Statistical Science Duke University

More information

The propensity score with continuous treatments

The propensity score with continuous treatments 7 The propensity score with continuous treatments Keisuke Hirano and Guido W. Imbens 1 7.1 Introduction Much of the work on propensity score analysis has focused on the case in which the treatment is binary.

More information

Investigating mediation when counterfactuals are not metaphysical: Does sunlight exposure mediate the effect of eye-glasses on cataracts?

Investigating mediation when counterfactuals are not metaphysical: Does sunlight exposure mediate the effect of eye-glasses on cataracts? Investigating mediation when counterfactuals are not metaphysical: Does sunlight exposure mediate the effect of eye-glasses on cataracts? Brian Egleston Fox Chase Cancer Center Collaborators: Daniel Scharfstein,

More information

University of California, Berkeley

University of California, Berkeley University of California, Berkeley U.C. Berkeley Division of Biostatistics Working Paper Series Year 2011 Paper 290 Targeted Minimum Loss Based Estimation of an Intervention Specific Mean Outcome Mark

More information

Conceptual overview: Techniques for establishing causal pathways in programs and policies

Conceptual overview: Techniques for establishing causal pathways in programs and policies Conceptual overview: Techniques for establishing causal pathways in programs and policies Antonio A. Morgan-Lopez, Ph.D. OPRE/ACF Meeting on Unpacking the Black Box of Programs and Policies 4 September

More information

G-ESTIMATION OF STRUCTURAL NESTED MODELS (CHAPTER 14) BIOS G-Estimation

G-ESTIMATION OF STRUCTURAL NESTED MODELS (CHAPTER 14) BIOS G-Estimation G-ESTIMATION OF STRUCTURAL NESTED MODELS (CHAPTER 14) BIOS 776 1 14 G-Estimation ( G-Estimation of Structural Nested Models 14) Outline 14.1 The causal question revisited 14.2 Exchangeability revisited

More information

Rank preserving Structural Nested Distribution Model (RPSNDM) for Continuous

Rank preserving Structural Nested Distribution Model (RPSNDM) for Continuous Rank preserving Structural Nested Distribution Model (RPSNDM) for Continuous Y : X M Y a=0 = Y a a m = Y a cum (a) : Y a = Y a=0 + cum (a) an unknown parameter. = 0, Y a = Y a=0 = Y for all subjects Rank

More information

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification Todd MacKenzie, PhD Collaborators A. James O Malley Tor Tosteson Therese Stukel 2 Overview 1. Instrumental variable

More information

Statistical Analysis of List Experiments

Statistical Analysis of List Experiments Statistical Analysis of List Experiments Graeme Blair Kosuke Imai Princeton University December 17, 2010 Blair and Imai (Princeton) List Experiments Political Methodology Seminar 1 / 32 Motivation Surveys

More information

Causal Analysis in Social Research

Causal Analysis in Social Research Causal Analysis in Social Research Walter R Davis National Institute of Applied Statistics Research Australia University of Wollongong Frontiers in Social Statistics Metholodogy 8 February 2017 Walter

More information

Double Robustness. Bang and Robins (2005) Kang and Schafer (2007)

Double Robustness. Bang and Robins (2005) Kang and Schafer (2007) Double Robustness Bang and Robins (2005) Kang and Schafer (2007) Set-Up Assume throughout that treatment assignment is ignorable given covariates (similar to assumption that data are missing at random

More information

The Simple Linear Regression Model

The Simple Linear Regression Model The Simple Linear Regression Model Lesson 3 Ryan Safner 1 1 Department of Economics Hood College ECON 480 - Econometrics Fall 2017 Ryan Safner (Hood College) ECON 480 - Lesson 3 Fall 2017 1 / 77 Bivariate

More information

Mediation Analysis for Health Disparities Research

Mediation Analysis for Health Disparities Research Mediation Analysis for Health Disparities Research Ashley I Naimi, PhD Oct 27 2016 @ashley_naimi wwwashleyisaacnaimicom ashleynaimi@pittedu Orientation 24 Numbered Equations Slides at: wwwashleyisaacnaimicom/slides

More information

What s New in Econometrics. Lecture 1

What s New in Econometrics. Lecture 1 What s New in Econometrics Lecture 1 Estimation of Average Treatment Effects Under Unconfoundedness Guido Imbens NBER Summer Institute, 2007 Outline 1. Introduction 2. Potential Outcomes 3. Estimands and

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 12: Frequentist properties of estimators (v4) Ramesh Johari ramesh.johari@stanford.edu 1 / 39 Frequentist inference 2 / 39 Thinking like a frequentist Suppose that for some

More information

Gov 2000: 9. Regression with Two Independent Variables

Gov 2000: 9. Regression with Two Independent Variables Gov 2000: 9. Regression with Two Independent Variables Matthew Blackwell Fall 2016 1 / 62 1. Why Add Variables to a Regression? 2. Adding a Binary Covariate 3. Adding a Continuous Covariate 4. OLS Mechanics

More information

The returns to schooling, ability bias, and regression

The returns to schooling, ability bias, and regression The returns to schooling, ability bias, and regression Jörn-Steffen Pischke LSE October 4, 2016 Pischke (LSE) Griliches 1977 October 4, 2016 1 / 44 Counterfactual outcomes Scholing for individual i is

More information

6.3 How the Associational Criterion Fails

6.3 How the Associational Criterion Fails 6.3. HOW THE ASSOCIATIONAL CRITERION FAILS 271 is randomized. We recall that this probability can be calculated from a causal model M either directly, by simulating the intervention do( = x), or (if P

More information

Regression #3: Properties of OLS Estimator

Regression #3: Properties of OLS Estimator Regression #3: Properties of OLS Estimator Econ 671 Purdue University Justin L. Tobias (Purdue) Regression #3 1 / 20 Introduction In this lecture, we establish some desirable properties associated with

More information

Potential Outcomes Model (POM)

Potential Outcomes Model (POM) Potential Outcomes Model (POM) Relationship Between Counterfactual States Causality Empirical Strategies in Labor Economics, Angrist Krueger (1999): The most challenging empirical questions in economics

More information

Covariate Balancing Propensity Score for General Treatment Regimes

Covariate Balancing Propensity Score for General Treatment Regimes Covariate Balancing Propensity Score for General Treatment Regimes Kosuke Imai Princeton University October 14, 2014 Talk at the Department of Psychiatry, Columbia University Joint work with Christian

More information

Lecture 9: Learning Optimal Dynamic Treatment Regimes. Donglin Zeng, Department of Biostatistics, University of North Carolina

Lecture 9: Learning Optimal Dynamic Treatment Regimes. Donglin Zeng, Department of Biostatistics, University of North Carolina Lecture 9: Learning Optimal Dynamic Treatment Regimes Introduction Refresh: Dynamic Treatment Regimes (DTRs) DTRs: sequential decision rules, tailored at each stage by patients time-varying features and

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 6: Bias and variance (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 49 Our plan today We saw in last lecture that model scoring methods seem to be trading off two different

More information

1 Motivation for Instrumental Variable (IV) Regression

1 Motivation for Instrumental Variable (IV) Regression ECON 370: IV & 2SLS 1 Instrumental Variables Estimation and Two Stage Least Squares Econometric Methods, ECON 370 Let s get back to the thiking in terms of cross sectional (or pooled cross sectional) data

More information

Ignoring the matching variables in cohort studies - when is it valid, and why?

Ignoring the matching variables in cohort studies - when is it valid, and why? Ignoring the matching variables in cohort studies - when is it valid, and why? Arvid Sjölander Abstract In observational studies of the effect of an exposure on an outcome, the exposure-outcome association

More information

Mediation for the 21st Century

Mediation for the 21st Century Mediation for the 21st Century Ross Boylan ross@biostat.ucsf.edu Center for Aids Prevention Studies and Division of Biostatistics University of California, San Francisco Mediation for the 21st Century

More information

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness

A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A Bayesian Nonparametric Approach to Monotone Missing Data in Longitudinal Studies with Informative Missingness A. Linero and M. Daniels UF, UT-Austin SRC 2014, Galveston, TX 1 Background 2 Working model

More information

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing

Primal-dual Covariate Balance and Minimal Double Robustness via Entropy Balancing Primal-dual Covariate Balance and Minimal Double Robustness via (Joint work with Daniel Percival) Department of Statistics, Stanford University JSM, August 9, 2015 Outline 1 2 3 1/18 Setting Rubin s causal

More information