CAUSAL INFERENCE IN STATISTICS. A Gentle Introduction. Judea Pearl University of California Los Angeles (

Size: px
Start display at page:

Download "CAUSAL INFERENCE IN STATISTICS. A Gentle Introduction. Judea Pearl University of California Los Angeles (www.cs.ucla."

Transcription

1 CAUSAL INFERENCE IN STATISTICS A Gentle Introduction Judea Pearl University of California Los Angeles (

2 OUTLINE Inference: Statistical vs. Causal, distinctions, and mental barriers Unified conceptualization of counterfactuals, structural-equations, and graphs Inference to three types of claims: 1. Causal effects and confounding 2. Attribution (Causes of Effects) 3. Direct and indirect effects Frills external validity and transportability

3 TRADITIONAL STATISTICAL INFERENCE PARADIGM Data P Joint Distribution Q(P) (Aspects of P) Inference e.g., Estimate the mean of Q( P) E( ) xp ( x) x

4 TRADITIONAL STATISTICAL INFERENCE PARADIGM Data P Joint Distribution Q(P) (Aspects of P) Inference e.g., Estimate the probability that a customer who bought product A would also buy product B. Q = P(B A)

5 FROM STATISTICAL TO CAUSAL ANALSIS: 1. THE DIFFERENCES Probability and statistics deal with static relations Data P Joint Distribution change Inference P Joint Distribution Q(P) (Aspects of P) What happens when P changes? e.g., Estimate the probability that a customer who bought A would buy B if we were to double the price.

6 FROM STATISTICAL TO CAUSAL ANALSIS: 1. THE DIFFERENCES What remains invariant when P changes say, to satisfy P (price=2)=1 Data P Joint Distribution change Inference P Joint Distribution Q(P) (Aspects of P) Note: P (B) P (B price = 2) e.g., Doubling price seeing the price doubled. P does not tell us how it ought to change.

7 FROM STATISTICAL TO CAUSAL ANALSIS: 1. THE DIFFERENCES (CONT) 1. Causal and statistical concepts do not mix. CAUSAL Spurious correlation Randomization / Intervention Confounding / Effect Instrumental variable Ignorability / Exogeneity Explanatory variables 2. STATISTICAL Regression Association / Independence Controlling for / Conditioning Odd and risk ratios Collapsibility / Granger causality Propensity score 3. 4.

8

9 FROM STATISTICAL TO CAUSAL ANALSIS: 1. THE DIFFERENCES (CONT) 1. Causal and statistical concepts do not mix. CAUSAL Spurious correlation Randomization / Intervention Confounding / Effect Instrumental variable Ignorability / Exogeneity Explanatory variables 2. STATISTICAL Regression Association / Independence Controlling for / Conditioning Odd and risk ratios Collapsibility / Granger causality Propensity score 3. 4.

10 FROM STATISTICAL TO CAUSAL ANALSIS: 2. MENTAL BARRIERS 1. Causal and statistical concepts do not mix. CAUSAL STATISTICAL Spurious correlation Regression Randomization / Intervention Association / Independence Confounding / Effect Controlling for / Conditioning Instrumental variable Odds and risk ratios Ignorability / Exogeneity Collapsibility / Granger causality Explanatory variables Propensity score 2. No causes in no causes out (Cartwright, 1989) statistical assumptions + data causal assumptions causal conclusions 3. Causal assumptions cannot be expressed in the mathematical language of standard statistics. 4. Non-standard mathematics: a) Structural equation models (Wright, 1920; Simon, 1960) b) Counterfactuals (Neyman-Rubin ( x ), Lewis (x )) }

11 WH PHSICS IS COUNTERFACTUAL Scientific Equations (e.g., Hooke s Law) are non-algebraic e.g., Length () equals a constant (2) times the weight () = 2 = 1 Process information = 1 = 2 The solution

12 WH PHSICS IS COUNTERFACTUAL Scientific Equations (e.g., Hooke s Law) are non-algebraic e.g., Length () equals a constant (2) times the weight () Correct notation: := 2 = 3 = 1 Process information = 1 = 2 The solution Had been 3, would be 6. If we raise to 3, would be 6. Must wipe out = 1.

13 WH PHSICS IS COUNTERFACTUAL Scientific Equations (e.g., Hooke s Law) are non-algebraic e.g., Length () equals a constant (2) times the weight () Correct notation: (or) 2 = 3 = 1 Process information Had been 3, would be 6. If we raise to 3, would be 6. Must wipe out = 1. = 1 = 2 The solution

14 THE STRUCTURAL MODEL PARADIGM Data Joint Distribution Data Generating Model Q(M) (Aspects of M) M Inference M Invariant strategy (mechanism, recipe, law, protocol) by which Nature assigns values to variables in the analysis. Think Nature, not experiment!

15 FAMILIAR CAUSAL MODEL ORACLE FOR MANIPILATION INPUT OUTPUT

16 STRUCTURAL CAUSAL MODELS Definition: A structural causal model is a 4-tuple V,U, F, P(u), where V = {V 1,...,V n } are endogenous variables U = {U 1,...,U m } are background variables F = {f 1,..., f n } are functions determining V, v i = f i (v, u) e.g., y x u P(u) is a distribution over U P(u) and F induce a distribution P(v) over observable variables

17 STRUCTURAL MODELS AND CAUSAL DIAGRAMS The functions v i = f i (v,u) define a graph v i = f i (pa i,u i ) PA i V \ V i U i U Example: Price Quantity equations in economics U 1 I W U 2 q p b b 1 2 p q d d 1 i 2 w u 1 u 2 Q P PA Q

18 SIMULATING INTERVENTIONS IN STRUCTURAL MODELS do(x) Double the price Take a drug Raise taxes Make me laugh

19 SIMULATING INTERVENTIONS IN STRUCTURAL MODELS do(x) Let be a set of variables in V. The action do(x) sets to constants x regardless of the factors which previously determined. do(x) replaces all functions f i determining with the constant functions =x, to create a mutilated model M x q p b b 1 2 p q d d 1 i 2 w u 1 u 2 U 1 I W U 2 Q P

20 SIMULATING INTERVENTIONS IN STRUCTURAL MODELS Let be a set of variables in V. The action do(x) sets to constants x regardless of the factors which previously determined. do(x) replaces all functions f i determining with the constant functions =x, to create a mutilated model M x M p q p p b b 1 p 2 0 p q d d 1 i 2 w u 1 u 2 U 1 I W U 2 Q P P = p 0

21 CAUSAL MODELS AND COUNTERFACTUALS If I were a rich man Had we doubled the price

22 CAUSAL MODELS AND COUNTERFACTUALS Definition: The sentence: would be y (in situation u), had been x, denoted x (u) = y, means: The solution for in a mutilated model M x, (i.e., the equations for replaced by = x) with input U=u, is equal to y. The Fundamental Equation of Counterfactuals: x( u) M ( u) x

23 ), ( ), ' ( ) ( ) ( ) ( ' ) ' ( : ' ) ( : y x u P y x y PN u P y P y P y u u x x y u u x x In particular: do(x) CAUSAL MODELS AND COUNTERFACTUALS Definition: The sentence: would be y (in situation u), had been x, denoted x (u) = y, means: The solution for in a mutilated model M x, (i.e., the equations for replaced by = x) with input U=u, is equal to y. ) ( ), ( ) (, ) ( : u P z y P z u y u u w x w x Joint probabilities of counterfactuals:

24 READING COUNTERFACTUALS FROM SEM Score Treatment Study Time Data shows: = 0.7, = 0.5, = 0.4 A student named Joe, measured = 0.5, = 1.0, = 1.9 Q 1 : What would Joe s score be had he doubled his study time? x z x 2 y x z 3

25 READING COUNTERFACTUALS FROM SEM Score Treatment Study Time Data shows: = 0.7, = 0.5, = 0.4 A student named Joe, measured = 0.5, = 1.0, = 1.9 Q 1 : What would Joe s score be had he doubled his study time? Answer: =2 = = 2.30 x z x 2 y x z 3 z 2.0

26 REGRESSION VS. STRUCTURAL EQUATIONS (THE CONFUSION OF THE CENTUR) Regression (claimless, nonfalsifiable): = ax + Structural (empirical, falsifiable): = bx + u Claim: (regardless of distributions): E( do(x)) = E( do(x), do(z)) = bx The mothers of all questions: Q. When would b equal a? A. When all back-door paths are blocked, (u ) Q. When is b estimable by regression methods? A. Graphical criteria available

27 THE FIVE NECESSAR STEPS OF CAUSAL ANALSIS Define: Assume: Identify: Estimate: Test: Express the target quantity Q as property of the model M. Express causal assumptions in structural or graphical form. Determine if Q is identifiable. Estimate Q if it is identifiable; approximate it, if it is not. If M has testable implications Repeat if necessary

28 THE LOGIC OF CAUSAL ANALSIS A - CAUSAL ASSUMPTIONS Q Queries of interest Q(P) - Identified estimands Data (D) CAUSAL MODEL (M A ) A* - Logical implications of A Causal inference T(M A ) - Testable implications Statistical inference Q - Estimates of Q(P) Q( D, A) Provisional claims g(t) Model testing Goodness of fit

29 THE FIVE NECESSAR STEPS FOR EFFECT ESTIMATION Define: Assume: Identify: Estimate: Test: Express the target quantity Q as a property of the model M. P( x y) or P( y do( x)) Express causal assumptions in structural or graphical form. Determine if Q is identifiable. Estimate Q if it is identifiable; approximate it, if it is not. If M has testable implications

30 THE FIVE NECESSAR STEPS FOR AVERAGE TREATMENT EFFECT Define: Assume: Identify: Estimate: Test: Express the target quantity Q as a property of the model M. ATE E( do( x1 )) E( do( x0)) Express causal assumptions in structural or graphical form. Determine if Q is identifiable. Estimate Q if it is identifiable; approximate it, if it is not. If M has testable implications

31 THE FIVE NECESSAR STEPS FOR DNAMIC POLIC ANALSIS Define: Assume: Identify: Estimate: Test: Express the target quantity Q as a property of the model M. P( y do( x g( z)) Express causal assumptions in structural or graphical form. Determine if Q is identifiable. Estimate Q if it is identifiable; approximate it, if it is not. If M has testable implications

32 THE FIVE NECESSAR STEPS FOR TIME VARING POLIC ANALSIS Define: Assume: Identify: Estimate: Test: Express the target quantity Q as a property of the model M. P( y do( x, z, W w)) Express causal assumptions in structural or graphical form. Determine if Q is identifiable. Estimate Q if it is identifiable; approximate it, if it is not. If M has testable implications

33 THE FIVE NECESSAR STEPS FOR TREATMENT ON TREATED Define: Assume: Identify: Estimate: Test: Express the target quantity Q a property of the model M. ETT P( x y x') Express causal assumptions in structural or graphical form. Determine if Q is identifiable. Estimate Q if it is identifiable; approximate it, if it is not. If M has testable implications

34 THE FIVE NECESSAR STEPS FOR INDIRECT EFFECTS Define: Assume: Identify: Estimate: Test: Express the target quantity Q a property of the model M. IE E[ ) ] Express causal assumptions in structural or graphical form. Determine if Q is identifiable. Estimate Q if it is identifiable; approximate it, if it is not. If M has testable implications E[ x, ( x' x ]

35 THE FIVE NECESSAR STEPS FROM DEFINITION TO ASSUMPTIONS Define: Express the target quantity Q as a property of the model M. Assume: Identify: Estimate: Test: Express causal assumptions in structural or graphical form. Determine if Q is identifiable. Estimate Q if it is identifiable; approximate it, if it is not. If M has testable implications

36 testable? FORMULATING ASSUMPTIONS THREE LANGUAGES 1. English: Smoking (), Cancer (), Tar (), Genotypes (U) Not too friendly: 3. Structural: }, { ), ( ) ( ), ( ) ( ) ( ) ( ), ( ) ( u u u u u u u u z x zx z z zy y yx x 2. Counterfactuals: ), ( ),, ( ), ( x f z u z f y u f x U Consistent?, complete?, redundant?, plausible?,

37 FROM Q AND ASSUMPTIONS TO IDENTIFICATION Define: Assume: Identify: Estimate: Test: Express the target quantity Q as a function Q(M) that can be computed from any model M. Express causal assumptions in structural or graphical form. Determine if Q is identifiable. Estimate Q if it is identifiable; approximate it, if it is not. If M has testable implications SOLVED!!!

38 THE PROBLEM OF CONFOUNDING Find the effect of on, P(y do(x)), given measurements on auxiliary variables 1,..., k G Can P(y do(x)) be estimated if only a subset,, can be measured?

39 ELIMINATING CONFOUNDING BIAS THE BACK-DOOR CRITERION P(y do(x)) is estimable if there is a set of variables that d-separates from in G x. G G x Moreover, P(y do(x)) = P(y x,z) P(z) z ( adjusting for )

40 EFFECT OF INTERVENTION BEOND ADJUSTMENT Theorem (Tian-Pearl 2002) We can identify P(y do(x)) if there is no child of connected to by a confounding path. G

41 EFFECT OF WARM-UP ON INJUR (After Shrier & Platt, 2008) No, no!

42 COUNTERFACTUALS AT WORK ETT EFFECT OF TREATMENT ON THE TREATED 1. Regret: I took a pill to fall asleep. Perhaps I should not have? 2. Program evaluation: What would terminating a program do to those enrolled? P( x y x')

43 IDENTIFICATION OF COUNTERFACTUALS ETT P( x y x') ETT is identifiable in G iff P(y do(x),w) is identifiable in G G' Moreover, W ETT P( y x') x P( y do( x), w) Complete graphical criterion (Shpitser-Pearl, 2009) G' wx'

44 ETT - THE BACK-DOOR CRITERION ETT is identifiable in G if there is a set of variables that d-separates from in G x. G G x Moreover, ETT z P ( y x, z) P( z x' ) 6 Standardized morbidity

45 FROM IDENTIFICATION TO ESTIMATION Define: Assume: Identify: Estimate: Express the target quantity Q as a function Q(M) that can be computed from any model M. e.g., Q P( y do( x)) Formulate causal assumptions using ordinary scientific language and represent their structural part in graphical form. Determine if Q is identifiable. Estimate Q if it is identifiable; approximate it, if it is not.

46 PROPENSIT SCORE ESTIMATOR (Rosenbaum & Rubin, 1983) L ),,,, 1 ( ),,,, ( z z z z z P z z z z z L Adjustment for L replaces Adjustment for z l l L P x l L y P z P x z y P ) ( ), ( ) ( ), ( Theorem: P(y do(x)) =?

47 WHAT PROPENSIT SCORE (PS) PRACTITIONERS NEED TO KNOW z L( z) P( 1 z) P ( y z, x) P( z) P( y l, x) P( l) 1. The assymptotic bias of PS is EQUAL to that of ordinary adjustment (for same ). 2. Including an additional covariate in the analysis CAN SPOIL the bias-reduction potential of others. 3. Choosing sufficient set for PS, requires knowledge about the model. l

48 WHICH COVARIATES MA / SHOULD BE ADJUSTED FOR? Assignment Treatment B 1 Age M Hygiene Outcome B 2 Cost Follow-up Question: Which of these eight covariates may be included in the propensity score function (for matching) and which should be excluded. Answer: Must include: Must exclude: May include: Age B 1, M, B 2, Follow-up, Assignment without Age Cost, Hygiene, {Assignment + Age}, {Hygiene + Age + B 1 }, more...

49 WHICH COVARIATES MA / SHOULD BE ADJUSTED FOR? Assignment Treatment B 1 Age M Hygiene Outcome B 2 Cost Follow-up Question: Which of these eight covariates may be included in the propensity score function (for matching) and which should be excluded. Answer: Must include: Must exclude: May include: Age B 1, M, B 2, Follow-up, Assignment without Age Cost, Hygiene, {Assignment + Age}, {Hygiene + Age + B 1 }, more...

50 WHAT PROPENSIT SCORE (PS) PRACTITIONERS NEED TO KNOW z L( z) P( 1 z) P( y z, x) P( z) 1. The assymptotic bias of PS is EQUAL to that of ordinary adjustment (for same ). 2. Including an additional covariate in the analysis CAN SPOIL the bias-reduction potential of others. 3. Choosing sufficient set for PS, requires knowledge about the model. 4. That any empirical test of the bias-reduction potential of PS, can only be generalized to cases where the causal relationships among covariates, observed and unobserved is the same. 1 P( y l, x) P( l)

51 THE STRUCTURAL-COUNTERFACTUAL SMBIOSIS 1. Express assumptions in structural or graphical language. 2. Express queries in counterfactual language. 3. Translate (1) into (2) for algebraic analysis, Or (2) into (1) for graphical analysis. 4. Use either graphical or algebraic machinery to answer the query in (2).

52 GRAPHICAL COUNTERFACTUALS TRANSLATION Every causal graph expresses counterfactuals assumptions, e.g., 1. Missing arrows x, z ( u) x ( u) 2. Missing arcs x y consistent, and readable from the graph. Every theorem in SCM is a theorem in Potential-Outcome Model, and conversely.

53 DEMSTIFING CONDITIONAL IGNORABILIT { (0), (1)} (Ignorability) ( ) G (Back-door) Where in the graph are { (0), (1)}? W plays the role of { (0), (1)} in the graph W { } (0), (1) S (a) S (b)

54 DETERMINING THE CAUSES OF EFFECTS (The Attribution Problem) our Honor! My client (Mr. A) died BECAUSE he used that drug.

55 DETERMINING THE CAUSES OF EFFECTS (The Attribution Problem) our Honor! My client (Mr. A) died BECAUSE he used that drug. Court to decide if it is MORE PROBABLE THAN NOT that A would be alive BUT FOR the drug! PN = P(? A is dead, took the drug) > 0.50

56 THE ATTRIBUTION PROBLEM Definition: 1. What is the meaning of PN(x,y): Probability that event y would not have occurred if it were not for event x, given that x and y did in fact occur. Answer: Computable from M PN( x, y) P( ' y' x, y) x

57 THE ATTRIBUTION PROBLEM Definition: 1. What is the meaning of PN(x,y): Probability that event y would not have occurred if it were not for event x, given that x and y did in fact occur. Identification: 2. Under what condition can PN(x,y) be learned from statistical data, i.e., observational, experimental and combined.

58 PARTIAL IDENTIFICATION (Tian and Pearl, 2000) Bounds given combined nonexperimental and experimental data 0 P( y) P( y max P( x,y) x' ) PN 1 P( y' ) min x' P( x,y) Identifiability under monotonicity (Combined data) PN P( y x) P( y x' ) P( y x) P( y x' ) P( y P( x,y) x' ) corrected Excess-Risk-Ratio

59 CAN FREQUENC DATA DECIDE LEGAL RESPONSIBILIT? Experimental Nonexperimental do(x) do(x) x x Deaths (y) Survivals (y) ,000 1,000 1,000 1,000 Nonexperimental data: drug usage predicts longer life Experimental data: drug has negligible effect on survival Plaintiff: Mr. A is special. 1. He actually died 2. He used the drug by choice Court to decide (given both data): Is it more probable than not that A would be alive but for the drug? PN P( ' y' x, y) x 0.50

60 SOLUTION TO THE ATTRIBUTION PROBLEM WITH PROBABILIT ONE 1 P(y x x,y) 1 Combined data tell more that each study alone

61 EFFECT DECOMPOSITION (direct vs. indirect effects) 1. Why decompose effects? 2. What is the definition of direct and indirect effects? 3. What are the policy implications of direct and indirect effects? 4. When can direct and indirect effect be estimated consistently from experimental and nonexperimental data?

62 WH DECOMPOSE EFFECTS? 1. To understand how Nature works 2. To comply with legal requirements 3. To predict the effects of new type of interventions: Signal routing, rather than variable fixing

63 LEGAL IMPLICATIONS OF DIRECT EFFECT Can data prove an employer guilty of hiring discrimination? (Gender) (Qualifications) What is the direct effect of on? (averaged over z) (Hiring) E ( do( x1 ), do( z)) E( do( x0), do( z)) Adjust for? No! No!

64 NATURAL INTERPRETATION OF AVERAGE DIRECT EFFECTS Robins and Greenland (1992) Pure z = f (x, u) y = g (x, z, u) Natural Direct Effect of on : The expected change in, when we change from x 0 to x 1 and, for each u, we keep constant at whatever value it attained before the change. E[ x DE( x0, x1; ) 1 x x 0 0 In linear models, DE = Controlled Direct Effect ] ( x1 x0 )

65 DEFINITION OF INDIRECT EFFECTS z = f (x, u) y = g (x, z, u) IE( x0, x1; ) Indirect Effect of on : The expected change in when we keep constant, say at x 0, and let change to whatever value it would have attained had changed to x 1. E[ x 0x x 1 0 In linear models, IE = TE - DE ]

66 POLIC IMPLICATIONS OF INDIRECT EFFECTS What is the indirect effect of on? The effect of Gender on Hiring if sex discrimination is eliminated. GENDER QUALIFICATION IGNORE f HIRING Blocking a link a new type of intervention

67 MEDIATION FORMULAS IN UNCONFOUNDED MODELS DE IE TE IE TE DE [ E( x1, z) E( x0, z)] P( z z [ E( z E( x0, z)[ P( z x1 ) E( x0) Fraction of effect Fraction of effect x1 ) P( z explained owed z f ( x, u1) y g( x, z, u x0) x0)] x u 1 by mediation to mediation 2 u ) 2

68 SUMMAR OF MEDIATION RESULTS 1. Formal semantics of path-specific effects, based on disabling mechanisms, instead of value fixing. 2. Path-analytic techniques extended to nonlinear and nonparametric models. 3. Meaningful (graphical) conditions for estimating direct and indirect effects from experimental and nonexperimental data.

69 ETERNAL VALIDIT From Threats to Licenses `External validity asks the question of generalizability: To what population, settings, treatment variables, and measurement variables can this effect be generalized? (Shadish, Cook and Campbell 2002) An experiment is said to have `external validity if the distribution of outcomes realized by a treatment group is the same as the distribution of outcome that would be realized in an actual program. (Manski, 2007) "A threat to external validity is an explanation of how you might be wrong in making a generalization." (Wikipedia 2011, after Trochin) A license of validity is a set of theoretical assumptions that neutralizes all conceivable threats. (Anon, 2011)

70 TRANSPORTABILIT ACROSS DOMAINS 1. A Theory of causal transportability When can causal relations learned from experiments be transferred to a different environment in which no experiment can be conducted? 2. A Theory of statistical transportability When can statistical information learned in one domain be transferred to a different domain in which a. only a subset of variables can be observed? Or, b. only a few samples are available? 3. Applications to Meta Analysis Combining results from many diverse studies

71 MOTIVATION WHAT CAN EPERIMENTS IN LA TELL ABOUT NC? (Age) (Age) * (Intervention) (Outcome) (Observation) (Outcome) Experimental study in LA Measured: P( x, y, z) P( y do( x), z) Observational study in NC Measured: P* ( x, y, z) P* ( z) P( z) Needed: P* ( y do( x))? z P ( y do( x), z) P*( z) Transport Formula (calibration): F( P, P, P*) do

72 TRANSPORT FORMULAS DEPEND ON THE STOR S (a) a) represents age P*( y do( x)) z P( y do( x), z) P*( z)

73 TRANSPORT FORMULAS DEPEND ON THE STOR S S (a) (b) a) represents age P*( y do( x)) z b) represents language skill P( y do( x), z) P*( z) P *( y do( x)) P( y do( x))

74 TRANSPORT FORMULAS DEPEND ON THE STOR (a) S a) represents age P*( y b) represents language skill do( x)) P* ( y do( x)) z? (b) P( y S do( x), z) P*( z) S Factors producing differences

75 TRANSPORT FORMULAS DEPEND ON THE STOR (a) S a) represents age P*( y do( x)) z P( y b) represents language skill P* ( y do( x)) c) represents a bio-marker P* ( y do( x)) (b) S do( x), z) P*( z) P( y do( x)) z P ( y do( x), z) P*( z x) S (c)

76 GOAL: ALGORITHM TO DETERMINE IF AN EFFECT IS TRANSPORTABLE Back to Transportability U T S W V S INPUT: Annotated Causal Graph S Factors creating differences OUTPUT: 1. Transportable or not? 2. Measurements to be taken in the experimental study 3. Measurements to be taken in the target population 4. A transport formula P* ( y do( x)) f [ P( y, v, z, w, t, u do( x)); P* ( y, v, z, w, t, u)]

77 TRANSPORTABILIT REDUCED TO CALCULUS Theorem 1 A causal relation R is transportable from to * if and only if R(*) is reducible, using the rules of do-calculus, to an expression in which S appears only as a conditioning variable in do-free terms. R * P*( y do( x)) P( y do( x), s) w w w P ( y do( x), s, w) P( w do( x), s) P ( y do( x), w) P( w s) P ( y do( x), w) P*( w) S W

78 RESULT: ALGORITHM TO DETERMINE IF AN EFFECT IS TRANSPORTABLE U T S W V S INPUT: Annotated Causal Graph S Factors creating differences OUTPUT: 1. Transportable or not? 2. Measurements to be taken in the experimental study 3. Measurements to be taken in the target population 4. A transport formula P* ( y do( x)) z P ( y do( x), z) P* ( z w) P( w do( x), t) w t P* ( t)

79 WHICH MODEL LICENSES THE TRANSPORT OF THE CAUSAL EFFECT S External factors creating disparities es No es S S S (a) (b) (c) S es S es S No W (d) W (e) (f)

80 STATISTICAL TRANSPORTABILIT Why should we transport statistical information? i.e., Why not re-learn things from scratch? 1. Measurements are costly. Limit measurements to a subset V * of variables called scope. 2. Samples are scarce. Pooling samples from diverse populations will improve precision, if differences can be filtered out.

81 STATISTICAL TRANSPORTABILIT Definition: (Statistical Transportability) A statistical relation R(P) is said to be transportable from to * over V * if R(P*) is identified from P, P*(V *), where V* is a subset of variables. R=P* (y x) is transportable over S V* = {,}, i.e., R is estimable without re-measuring R P*( z x) P( z y) z Transfer Learning S If few samples (N 2 ) are available from * and many samples (N 1 ) from, then estimating R = P*(y x) by R P*( y x, z) P( z x) z achieves a much higher precision

82 META-ANALSIS OR MULTI-SOURCE LEARNING Target population * R = P*(y do(x)) (a) (b) (c) S W W W (d) (e) S (f) S W W W (g) (h) S (i) S W W W

83 CAN WE GET A BIAS-FREE ESTIMATE OF THE TARGET QUANTIT? Target population * R = P*(y do(x)) (a) (d) W W S Is R identifiable from (d) and (h)? R P*( y do( x), w) P*( w do( x)) w P w ( h) ( y do( x), w) P( d) ( w do( x)) P( h) ( y do( x), w) P( d) ( w x) w R(*) is identifiable from studies (d) and (h). R(*) is not identifiable from studies (d) and (i). (h) S (i) S W W

84 FROM META-ANALSIS TO META-SNTHESIS The problem How to combine results of several experimental and observational studies, each conducted on a different population and under a different set of conditions, so as to construct an aggregate measure of effect size that is "better" than any one study in isolation. Definition (Meta-Estimability) A relation R is said to be "meta estimable" from a set of populations { 1, 2,..., K } to a target population * iff it is identifiable from the information set I = {I( 1 ), I( 2 ),..., I( K ), I(P*)}.

85 FROM META-ANALSIS TO META-SNTHESIS (Cont.) Theorem { 1, 2,..., K } a set of studies. {D 1, D 2,..., D k } selection diagrams (relative to *). A relation R(*) is "meta estimable" if it can be decomposed into terms of the form: Q k P( Vk do( Wk ), k ) such that each Q k is transportable from D k. Open-problem: Systematic decomposition

86 BIAS VS. PRECISION IN META-SNTHESIS Principle 1: Calibrate estimands before pooling (to minimize bias) Principle 2: Decompose to sub-relations before calibrating (to improve precision) R ( *) P *( y do ( x )) (a) (g) (h) S (i) (d) S W W W P ( * g) ( y do( x)) Pooling W W Calibration P ( * ) ( y do( x)) d

87 BIAS VS. PRECISION IN META-SNTHESIS (Cont.) (a) (g) (h) S (i) (d) S W W W W W R( *) P *( y do( x)) P ( * g) ( y do( x)) P ( * ) ( y w, do( x)) P * ( ( )) h ( * ) ( w do( x)) P ( ) w do x i d Pooling P ( w do( )) Composition ( * i, d ) x P* ( i, d, k) ( y do( x)) w P* ( h) ( y w, do( x)) P* ( i, d) ( w do( x)) P* ( all) x Pooling ( y do( ))

88 CONCLUSIONS I TOLD OU CAUSALIT IS SIMPLE Principled methodology for causal and counterfactual inference (complete) Unification of the graphical, potential-outcome and structural equation approaches Friendly and formal solutions to century-old problems and confusions.

89 CONCLUSIONS He is wise who bases causal inference on an explicit causal structure that is defensible on scientific grounds. (Aristotle B.C.) From Charlie Poole

90 QUESTIONS??? Now is the time!

CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES. Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea)

CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES. Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea) CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea) OUTLINE Inference: Statistical vs. Causal distinctions and mental barriers Formal semantics

More information

OUTLINE THE MATHEMATICS OF CAUSAL INFERENCE IN STATISTICS. Judea Pearl University of California Los Angeles (www.cs.ucla.

OUTLINE THE MATHEMATICS OF CAUSAL INFERENCE IN STATISTICS. Judea Pearl University of California Los Angeles (www.cs.ucla. THE MATHEMATICS OF CAUSAL INFERENCE IN STATISTICS Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea) OUTLINE Modeling: Statistical vs. Causal Causal Models and Identifiability to

More information

OUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS. Judea Pearl University of California Los Angeles (www.cs.ucla.

OUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS. Judea Pearl University of California Los Angeles (www.cs.ucla. OUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea/) Statistical vs. Causal vs. Counterfactual inference: syntax and semantics

More information

OF CAUSAL INFERENCE THE MATHEMATICS IN STATISTICS. Department of Computer Science. Judea Pearl UCLA

OF CAUSAL INFERENCE THE MATHEMATICS IN STATISTICS. Department of Computer Science. Judea Pearl UCLA THE MATHEMATICS OF CAUSAL INFERENCE IN STATISTICS Judea earl Department of Computer Science UCLA OUTLINE Statistical vs. Causal Modeling: distinction and mental barriers N-R vs. structural model: strengths

More information

OUTLINE CAUSES AND COUNTERFACTUALS IN THE EMPIRICAL SCIENCES. Judea Pearl University of California Los Angeles (www.cs.ucla.

OUTLINE CAUSES AND COUNTERFACTUALS IN THE EMPIRICAL SCIENCES. Judea Pearl University of California Los Angeles (www.cs.ucla. CAUE AND COUNTERFACTUAL IN THE EMPIRICAL CIENCE Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea/jsm9) OUTLINE Inference: tatistical vs. Causal, distinctions, and mental barriers

More information

CAUSAL INFERENCE AS COMPUTATIONAL LEARNING. Judea Pearl University of California Los Angeles (

CAUSAL INFERENCE AS COMPUTATIONAL LEARNING. Judea Pearl University of California Los Angeles ( CAUSAL INFERENCE AS COMUTATIONAL LEARNING Judea earl University of California Los Angeles www.cs.ucla.edu/~judea OUTLINE Inference: Statistical vs. Causal distinctions and mental barriers Formal semantics

More information

CAUSES AND COUNTERFACTUALS: CONCEPTS, PRINCIPLES AND TOOLS

CAUSES AND COUNTERFACTUALS: CONCEPTS, PRINCIPLES AND TOOLS CAUSES AND COUNTERFACTUALS: CONCEPTS, PRINCIPLES AND TOOLS Judea Pearl Elias Bareinboim University of California, Los Angeles {judea, eb}@cs.ucla.edu NIPS 2013 Tutorial OUTLINE! Concepts: * Causal inference

More information

PANEL ON MEDIATION AND OTHER MIRACLES OF THE 21 ST CENTURY

PANEL ON MEDIATION AND OTHER MIRACLES OF THE 21 ST CENTURY PANEL ON EDIAION AND OHER IRACLES OF HE 21 S CENUR Judea Pearl UCLA With Kosuke Imai and many others HE QUESION OF EDIAION (direct vs. indirect effects) 1. Why decompose effects? 2. What is the definition

More information

THE MATHEMATICS OF CAUSAL INFERENCE With reflections on machine learning and the logic of science. Judea Pearl UCLA

THE MATHEMATICS OF CAUSAL INFERENCE With reflections on machine learning and the logic of science. Judea Pearl UCLA THE MATHEMATICS OF CAUSAL INFERENCE With reflections on machine learning and the logic of science Judea Pearl UCLA OUTLINE 1. The causal revolution from statistics to counterfactuals from Babylon to Athens

More information

From Causality, Second edition, Contents

From Causality, Second edition, Contents From Causality, Second edition, 2009. Preface to the First Edition Preface to the Second Edition page xv xix 1 Introduction to Probabilities, Graphs, and Causal Models 1 1.1 Introduction to Probability

More information

CAUSALITY. Models, Reasoning, and Inference 1 CAMBRIDGE UNIVERSITY PRESS. Judea Pearl. University of California, Los Angeles

CAUSALITY. Models, Reasoning, and Inference 1 CAMBRIDGE UNIVERSITY PRESS. Judea Pearl. University of California, Los Angeles CAUSALITY Models, Reasoning, and Inference Judea Pearl University of California, Los Angeles 1 CAMBRIDGE UNIVERSITY PRESS Preface page xiii 1 Introduction to Probabilities, Graphs, and Causal Models 1

More information

Taming the challenge of extrapolation: From multiple experiments and observations to valid causal conclusions. Judea Pearl and Elias Bareinboim, UCLA

Taming the challenge of extrapolation: From multiple experiments and observations to valid causal conclusions. Judea Pearl and Elias Bareinboim, UCLA Taming the challenge of extrapolation: From multiple experiments and observations to valid causal conclusions Judea Pearl and Elias Bareinboim, UCLA One distinct feature of big data applications, setting

More information

ANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS

ANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS ANALYTIC COMPARISON of Pearl and Rubin CAUSAL FRAMEWORKS Content Page Part I. General Considerations Chapter 1. What is the question? 16 Introduction 16 1. Randomization 17 1.1 An Example of Randomization

More information

Gov 2002: 4. Observational Studies and Confounding

Gov 2002: 4. Observational Studies and Confounding Gov 2002: 4. Observational Studies and Confounding Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last two weeks: randomized experiments. From here on: observational studies. What

More information

The Effects of Interventions

The Effects of Interventions 3 The Effects of Interventions 3.1 Interventions The ultimate aim of many statistical studies is to predict the effects of interventions. When we collect data on factors associated with wildfires in the

More information

Statistical Models for Causal Analysis

Statistical Models for Causal Analysis Statistical Models for Causal Analysis Teppei Yamamoto Keio University Introduction to Causal Inference Spring 2016 Three Modes of Statistical Inference 1. Descriptive Inference: summarizing and exploring

More information

Introduction to Causal Calculus

Introduction to Causal Calculus Introduction to Causal Calculus Sanna Tyrväinen University of British Columbia August 1, 2017 1 / 1 2 / 1 Bayesian network Bayesian networks are Directed Acyclic Graphs (DAGs) whose nodes represent random

More information

Causality in the Sciences

Causality in the Sciences Causality in the Sciences Phyllis McKay Illari University of Kent Federica Russo Université catholique de Louvain, University of Kent Jon Williamson University of Kent CLARENDON PRESS. OXFORD 2009 1 THE

More information

The Do-Calculus Revisited Judea Pearl Keynote Lecture, August 17, 2012 UAI-2012 Conference, Catalina, CA

The Do-Calculus Revisited Judea Pearl Keynote Lecture, August 17, 2012 UAI-2012 Conference, Catalina, CA The Do-Calculus Revisited Judea Pearl Keynote Lecture, August 17, 2012 UAI-2012 Conference, Catalina, CA Abstract The do-calculus was developed in 1995 to facilitate the identification of causal effects

More information

Technical Track Session I: Causal Inference

Technical Track Session I: Causal Inference Impact Evaluation Technical Track Session I: Causal Inference Human Development Human Network Development Network Middle East and North Africa Region World Bank Institute Spanish Impact Evaluation Fund

More information

Bounding the Probability of Causation in Mediation Analysis

Bounding the Probability of Causation in Mediation Analysis arxiv:1411.2636v1 [math.st] 10 Nov 2014 Bounding the Probability of Causation in Mediation Analysis A. P. Dawid R. Murtas M. Musio February 16, 2018 Abstract Given empirical evidence for the dependence

More information

Causal inference in statistics: An overview

Causal inference in statistics: An overview Statistics Surveys Vol. 3 (2009) 96 146 ISSN: 1935-7516 DOI: 10.1214/09-SS057 Causal inference in statistics: An overview Judea Pearl Computer Science Department University of California, Los Angeles,

More information

The Mathematics of Causal Inference

The Mathematics of Causal Inference In Joint Statistical Meetings Proceedings, Alexandria, VA: American Statistical Association, 2515-2529, 2013. JSM 2013 - Section on Statistical Education TECHNICAL REPORT R-416 September 2013 The Mathematics

More information

Causality II: How does causal inference fit into public health and what it is the role of statistics?

Causality II: How does causal inference fit into public health and what it is the role of statistics? Causality II: How does causal inference fit into public health and what it is the role of statistics? Statistics for Psychosocial Research II November 13, 2006 1 Outline Potential Outcomes / Counterfactual

More information

A Decision Theoretic Approach to Causality

A Decision Theoretic Approach to Causality A Decision Theoretic Approach to Causality Vanessa Didelez School of Mathematics University of Bristol (based on joint work with Philip Dawid) Bordeaux, June 2011 Based on: Dawid & Didelez (2010). Identifying

More information

5.3 Graphs and Identifiability

5.3 Graphs and Identifiability 218CHAPTER 5. CAUSALIT AND STRUCTURAL MODELS IN SOCIAL SCIENCE AND ECONOMICS.83 AFFECT.65.23 BEHAVIOR COGNITION Figure 5.5: Untestable model displaying quantitative causal information derived. method elucidates

More information

An Introduction to Causal Inference

An Introduction to Causal Inference To be published in The International Journal of Biostatistics, 2009. TECHNICAL REPORT R-354 February 2010 An Introduction to Causal Inference Judea Pearl University of California, Los Angeles Computer

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University Joint

More information

External Validity and Transportability: A Formal Approach

External Validity and Transportability: A Formal Approach Int. tatistical Inst.: Proc. 58th World tatistical Congress, 2011, Dublin (ession IP024) p.335 External Validity and Transportability: A Formal Approach Pearl, Judea University of California, Los Angeles,

More information

The Foundations of Causal Inference

The Foundations of Causal Inference The Foundations of Causal Inference Judea Pearl University of California, Los Angeles Computer Science Department Los Angeles, CA, 90095-1596, USA judea@cs.ucla.edu May 12, 2010 Abstract This paper reviews

More information

Probabilities of causation: Bounds and identification

Probabilities of causation: Bounds and identification Annals of Mathematics and Artificial Intelligence 28 (2000) 287 313 287 Probabilities of causation: Bounds and identification Jin Tian and Judea Pearl Cognitive Systems Laboratory, Computer Science Department,

More information

UCLA Department of Statistics Papers

UCLA Department of Statistics Papers UCLA Department of Statistics Papers Title Comment on `Causal inference, probability theory, and graphical insights' (by Stuart G. Baker) Permalink https://escholarship.org/uc/item/9863n6gg Journal Statistics

More information

Mediation for the 21st Century

Mediation for the 21st Century Mediation for the 21st Century Ross Boylan ross@biostat.ucsf.edu Center for Aids Prevention Studies and Division of Biostatistics University of California, San Francisco Mediation for the 21st Century

More information

An Introduction to Causal Analysis on Observational Data using Propensity Scores

An Introduction to Causal Analysis on Observational Data using Propensity Scores An Introduction to Causal Analysis on Observational Data using Propensity Scores Margie Rosenberg*, PhD, FSA Brian Hartman**, PhD, ASA Shannon Lane* *University of Wisconsin Madison **University of Connecticut

More information

THE SCIENCE OF CAUSE AND EFFECT

THE SCIENCE OF CAUSE AND EFFECT HE CIENCE OF CAUE AND EFFEC Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea/) OULINE 1. he causal revolution from associations to intervention to counterfactuals 2. he two fundamental

More information

Single World Intervention Graphs (SWIGs):

Single World Intervention Graphs (SWIGs): Single World Intervention Graphs (SWIGs): Unifying the Counterfactual and Graphical Approaches to Causality Thomas Richardson Department of Statistics University of Washington Joint work with James Robins

More information

A Linear Microscope for Interventions and Counterfactuals

A Linear Microscope for Interventions and Counterfactuals Forthcoming, Journal of Causal Inference (Causal, Casual, and Curious Section). TECHNICAL REPORT R-459 March 2017 A Linear Microscope for Interventions and Counterfactuals Judea Pearl University of California,

More information

Counterfactual Reasoning in Algorithmic Fairness

Counterfactual Reasoning in Algorithmic Fairness Counterfactual Reasoning in Algorithmic Fairness Ricardo Silva University College London and The Alan Turing Institute Joint work with Matt Kusner (Warwick/Turing), Chris Russell (Sussex/Turing), and Joshua

More information

Empirical approaches in public economics

Empirical approaches in public economics Empirical approaches in public economics ECON4624 Empirical Public Economics Fall 2016 Gaute Torsvik Outline for today The canonical problem Basic concepts of causal inference Randomized experiments Non-experimental

More information

CAUSALITY CORRECTIONS IMPLEMENTED IN 2nd PRINTING

CAUSALITY CORRECTIONS IMPLEMENTED IN 2nd PRINTING 1 CAUSALITY CORRECTIONS IMPLEMENTED IN 2nd PRINTING Updated 9/26/00 page v *insert* TO RUTH centered in middle of page page xv *insert* in second paragraph David Galles after Dechter page 2 *replace* 2000

More information

External Validity: From do-calculus to Transportability across Populations

External Validity: From do-calculus to Transportability across Populations ubmitted, tatistical cience. TECHNICAL REPORT R-400 May 2012 External Validity: From do-calculus to Transportability across Populations Judea Pearl and Elias Bareinboim University of California, Los Angeles

More information

Causal and Counterfactual Inference

Causal and Counterfactual Inference Forthcoming section in The Handbook of Rationality, MIT press. TECHNICAL REPORT R-485 October 2018 Causal and Counterfactual Inference Judea Pearl University of California, Los Angeles Computer Science

More information

Ignoring the matching variables in cohort studies - when is it valid, and why?

Ignoring the matching variables in cohort studies - when is it valid, and why? Ignoring the matching variables in cohort studies - when is it valid, and why? Arvid Sjölander Abstract In observational studies of the effect of an exposure on an outcome, the exposure-outcome association

More information

The Science and Ethics of Causal Modeling

The Science and Ethics of Causal Modeling you said to post r347 on publications page. should i add: To appear as chapter in A.T. Panter and Sonya Serba (Eds.), Handbook of Ethics in Quantitative Methodology, Taylor & Francis Group, 2009. r347

More information

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies Kosuke Imai Department of Politics Princeton University November 13, 2013 So far, we have essentially assumed

More information

Comments on The Role of Large Scale Assessments in Research on Educational Effectiveness and School Development by Eckhard Klieme, Ph.D.

Comments on The Role of Large Scale Assessments in Research on Educational Effectiveness and School Development by Eckhard Klieme, Ph.D. Comments on The Role of Large Scale Assessments in Research on Educational Effectiveness and School Development by Eckhard Klieme, Ph.D. David Kaplan Department of Educational Psychology The General Theme

More information

Single World Intervention Graphs (SWIGs):

Single World Intervention Graphs (SWIGs): Single World Intervention Graphs (SWIGs): Unifying the Counterfactual and Graphical Approaches to Causality Thomas Richardson Department of Statistics University of Washington Joint work with James Robins

More information

Flexible Estimation of Treatment Effect Parameters

Flexible Estimation of Treatment Effect Parameters Flexible Estimation of Treatment Effect Parameters Thomas MaCurdy a and Xiaohong Chen b and Han Hong c Introduction Many empirical studies of program evaluations are complicated by the presence of both

More information

Confounding Equivalence in Causal Inference

Confounding Equivalence in Causal Inference In Proceedings of UAI, 2010. In P. Grunwald and P. Spirtes, editors, Proceedings of UAI, 433--441. AUAI, Corvallis, OR, 2010. TECHNICAL REPORT R-343 July 2010 Confounding Equivalence in Causal Inference

More information

A Linear Microscope for Interventions and Counterfactuals

A Linear Microscope for Interventions and Counterfactuals Journal of Causal Inference, 5(1):March2017. (Errata marked; pages 2 and 6.) J. Causal Infer. 2017; 20170003 TECHNICAL REPORT R-459 March 2017 Causal, Casual and Curious Judea Pearl A Linear Microscope

More information

Single World Intervention Graphs (SWIGs): A Unification of the Counterfactual and Graphical Approaches to Causality

Single World Intervention Graphs (SWIGs): A Unification of the Counterfactual and Graphical Approaches to Causality Single World Intervention Graphs (SWIGs): A Unification of the Counterfactual and Graphical Approaches to Causality Thomas S. Richardson University of Washington James M. Robins Harvard University Working

More information

Identification of Conditional Interventional Distributions

Identification of Conditional Interventional Distributions Identification of Conditional Interventional Distributions Ilya Shpitser and Judea Pearl Cognitive Systems Laboratory Department of Computer Science University of California, Los Angeles Los Angeles, CA.

More information

Propensity Score Methods for Causal Inference

Propensity Score Methods for Causal Inference John Pura BIOS790 October 2, 2015 Causal inference Philosophical problem, statistical solution Important in various disciplines (e.g. Koch s postulates, Bradford Hill criteria, Granger causality) Good

More information

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD

Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification. Todd MacKenzie, PhD Causal Hazard Ratio Estimation By Instrumental Variables or Principal Stratification Todd MacKenzie, PhD Collaborators A. James O Malley Tor Tosteson Therese Stukel 2 Overview 1. Instrumental variable

More information

Difference-in-Differences Methods

Difference-in-Differences Methods Difference-in-Differences Methods Teppei Yamamoto Keio University Introduction to Causal Inference Spring 2016 1 Introduction: A Motivating Example 2 Identification 3 Estimation and Inference 4 Diagnostics

More information

THE FOUNDATIONS OF CAUSAL INFERENCE

THE FOUNDATIONS OF CAUSAL INFERENCE TECHNICAL REPORT R-355 August 2010 3 THE FOUNDATIONS OF CAUSAL INFERENCE Judea Pearl* This paper reviews recent advances in the foundations of causal inference and introduces a systematic methodology for

More information

Revision list for Pearl s THE FOUNDATIONS OF CAUSAL INFERENCE

Revision list for Pearl s THE FOUNDATIONS OF CAUSAL INFERENCE Revision list for Pearl s THE FOUNDATIONS OF CAUSAL INFERENCE insert p. 90: in graphical terms or plain causal language. The mediation problem of Section 6 illustrates how such symbiosis clarifies the

More information

Confounding Equivalence in Causal Inference

Confounding Equivalence in Causal Inference Revised and submitted, October 2013. TECHNICAL REPORT R-343w October 2013 Confounding Equivalence in Causal Inference Judea Pearl Cognitive Systems Laboratory Computer Science Department University of

More information

Stability of causal inference: Open problems

Stability of causal inference: Open problems Stability of causal inference: Open problems Leonard J. Schulman & Piyush Srivastava California Institute of Technology UAI 2016 Workshop on Causality Interventions without experiments [Pearl, 1995] Genetic

More information

Estimating complex causal effects from incomplete observational data

Estimating complex causal effects from incomplete observational data Estimating complex causal effects from incomplete observational data arxiv:1403.1124v2 [stat.me] 2 Jul 2014 Abstract Juha Karvanen Department of Mathematics and Statistics, University of Jyväskylä, Jyväskylä,

More information

Reflections, Elaborations, and Discussions with Readers

Reflections, Elaborations, and Discussions with Readers CHAPTER ELEVEN Reflections, Elaborations, and Discussions with Readers As X-rays are to the surgeon, graphs are for causation. The author In this chapter, I reflect back on the material covered in Chapters

More information

Propensity Score Weighting with Multilevel Data

Propensity Score Weighting with Multilevel Data Propensity Score Weighting with Multilevel Data Fan Li Department of Statistical Science Duke University October 25, 2012 Joint work with Alan Zaslavsky and Mary Beth Landrum Introduction In comparative

More information

PEARL VS RUBIN (GELMAN)

PEARL VS RUBIN (GELMAN) PEARL VS RUBIN (GELMAN) AN EPIC battle between the Rubin Causal Model school (Gelman et al) AND the Structural Causal Model school (Pearl et al) a cursory overview Dokyun Lee WHO ARE THEY? Judea Pearl

More information

Causal Inference with Big Data Sets

Causal Inference with Big Data Sets Causal Inference with Big Data Sets Marcelo Coca Perraillon University of Colorado AMC November 2016 1 / 1 Outlone Outline Big data Causal inference in economics and statistics Regression discontinuity

More information

Bounds on Direct Effects in the Presence of Confounded Intermediate Variables

Bounds on Direct Effects in the Presence of Confounded Intermediate Variables Bounds on Direct Effects in the Presence of Confounded Intermediate Variables Zhihong Cai, 1, Manabu Kuroki, 2 Judea Pearl 3 and Jin Tian 4 1 Department of Biostatistics, Kyoto University Yoshida-Konoe-cho,

More information

causal inference at hulu

causal inference at hulu causal inference at hulu Allen Tran July 17, 2016 Hulu Introduction Most interesting business questions at Hulu are causal Business: what would happen if we did x instead of y? dropped prices for risky

More information

Estimating the Marginal Odds Ratio in Observational Studies

Estimating the Marginal Odds Ratio in Observational Studies Estimating the Marginal Odds Ratio in Observational Studies Travis Loux Christiana Drake Department of Statistics University of California, Davis June 20, 2011 Outline The Counterfactual Model Odds Ratios

More information

Identification and Overidentification of Linear Structural Equation Models

Identification and Overidentification of Linear Structural Equation Models In D. D. Lee and M. Sugiyama and U. V. Luxburg and I. Guyon and R. Garnett (Eds.), Advances in Neural Information Processing Systems 29, pre-preceedings 2016. TECHNICAL REPORT R-444 October 2016 Identification

More information

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case

Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case Econ 2148, fall 2017 Instrumental variables I, origins and binary treatment case Maximilian Kasy Department of Economics, Harvard University 1 / 40 Agenda instrumental variables part I Origins of instrumental

More information

Econometric Causality

Econometric Causality Econometric (2008) International Statistical Review, 76(1):1-27 James J. Heckman Spencer/INET Conference University of Chicago Econometric The econometric approach to causality develops explicit models

More information

Help! Statistics! Mediation Analysis

Help! Statistics! Mediation Analysis Help! Statistics! Lunch time lectures Help! Statistics! Mediation Analysis What? Frequently used statistical methods and questions in a manageable timeframe for all researchers at the UMCG. No knowledge

More information

Front-Door Adjustment

Front-Door Adjustment Front-Door Adjustment Ethan Fosse Princeton University Fall 2016 Ethan Fosse Princeton University Front-Door Adjustment Fall 2016 1 / 38 1 Preliminaries 2 Examples of Mechanisms in Sociology 3 Bias Formulas

More information

On the Identification of a Class of Linear Models

On the Identification of a Class of Linear Models On the Identification of a Class of Linear Models Jin Tian Department of Computer Science Iowa State University Ames, IA 50011 jtian@cs.iastate.edu Abstract This paper deals with the problem of identifying

More information

Causal Inference. Prediction and causation are very different. Typical questions are:

Causal Inference. Prediction and causation are very different. Typical questions are: Causal Inference Prediction and causation are very different. Typical questions are: Prediction: Predict Y after observing X = x Causation: Predict Y after setting X = x. Causation involves predicting

More information

Effect Modification and Interaction

Effect Modification and Interaction By Sander Greenland Keywords: antagonism, causal coaction, effect-measure modification, effect modification, heterogeneity of effect, interaction, synergism Abstract: This article discusses definitions

More information

Causal Transportability of Experiments on Controllable Subsets of Variables: z-transportability

Causal Transportability of Experiments on Controllable Subsets of Variables: z-transportability Causal Transportability of Experiments on Controllable Subsets of Variables: z-transportability Sanghack Lee and Vasant Honavar Artificial Intelligence Research Laboratory Department of Computer Science

More information

On Identifying Causal Effects

On Identifying Causal Effects 29 1 Introduction This paper deals with the problem of inferring cause-effect relationships from a combination of data and theoretical assumptions. This problem arises in diverse fields such as artificial

More information

Generalized Quantifiers Logical and Linguistic Aspects

Generalized Quantifiers Logical and Linguistic Aspects Generalized Quantifiers Logical and Linguistic Aspects Lecture 1: Formal Semantics and Generalized Quantifiers Dag Westerståhl University of Gothenburg SELLC 2010 Institute for Logic and Cognition, Sun

More information

Transportability of Causal and Statistical Relations: A Formal Approach

Transportability of Causal and Statistical Relations: A Formal Approach In Proceedings of the 25th AAAI Conference on Artificial Intelligence, August 7 11, 2011, an Francisco, CA. AAAI Press, Menlo Park, CA, pp. 247-254. TECHNICAL REPORT R-372-A August 2011 Transportability

More information

Causal Inference & Reasoning with Causal Bayesian Networks

Causal Inference & Reasoning with Causal Bayesian Networks Causal Inference & Reasoning with Causal Bayesian Networks Neyman-Rubin Framework Potential Outcome Framework: for each unit k and each treatment i, there is a potential outcome on an attribute U, U ik,

More information

The Doubly-Modifiable Structural Model (DMSM) The Modifiable Structural Model (MSM)

The Doubly-Modifiable Structural Model (DMSM) The Modifiable Structural Model (MSM) unified theory of counterfactual reasoning hristopher G. Lucas cglucas@cmu.edu Department of Psychology arnegie Mellon University harles Kemp ckemp@cmu.edu Department of Psychology arnegie Mellon University

More information

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016

An Introduction to Causal Mediation Analysis. Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016 An Introduction to Causal Mediation Analysis Xu Qin University of Chicago Presented at the Central Iowa R User Group Meetup Aug 10, 2016 1 Causality In the applications of statistics, many central questions

More information

Unit 1. Propositional Logic Reading do all quick-checks Propositional Logic: Ch. 2.intro, 2.2, 2.3, 2.4. Review 2.9

Unit 1. Propositional Logic Reading do all quick-checks Propositional Logic: Ch. 2.intro, 2.2, 2.3, 2.4. Review 2.9 Unit 1. Propositional Logic Reading do all quick-checks Propositional Logic: Ch. 2.intro, 2.2, 2.3, 2.4. Review 2.9 Typeset September 23, 2005 1 Statements or propositions Defn: A statement is an assertion

More information

Causal Directed Acyclic Graphs

Causal Directed Acyclic Graphs Causal Directed Acyclic Graphs Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Causal DAGs Stat186/Gov2002 Fall 2018 1 / 15 Elements of DAGs (Pearl. 2000.

More information

What s New in Econometrics. Lecture 1

What s New in Econometrics. Lecture 1 What s New in Econometrics Lecture 1 Estimation of Average Treatment Effects Under Unconfoundedness Guido Imbens NBER Summer Institute, 2007 Outline 1. Introduction 2. Potential Outcomes 3. Estimands and

More information

CompSci Understanding Data: Theory and Applications

CompSci Understanding Data: Theory and Applications CompSci 590.6 Understanding Data: Theory and Applications Lecture 17 Causality in Statistics Instructor: Sudeepa Roy Email: sudeepa@cs.duke.edu Fall 2015 1 Today s Reading Rubin Journal of the American

More information

Causality. Pedro A. Ortega. 18th February Computational & Biological Learning Lab University of Cambridge

Causality. Pedro A. Ortega. 18th February Computational & Biological Learning Lab University of Cambridge Causality Pedro A. Ortega Computational & Biological Learning Lab University of Cambridge 18th February 2010 Why is causality important? The future of machine learning is to control (the world). Examples

More information

Causes of Effects and Effects of Causes

Causes of Effects and Effects of Causes Short version forthcoming, Journal of Sociological Methods and Research. TECHNICAL REPORT R-431-L October 2014 Causes of Effects and Effects of Causes Judea Pearl University of California, Los Angeles

More information

Structural Causal Models and the Specification of Time-Series-Cross-Section Models

Structural Causal Models and the Specification of Time-Series-Cross-Section Models Structural Causal Models and the Specification of Time-Series-Cross-Section Models Adam N. Glynn Kevin M. Quinn March 13, 2013 Abstract The structural causal models (SCM) of Pearl (1995, 2000, 2009) provide

More information

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall

Structural Nested Mean Models for Assessing Time-Varying Effect Moderation. Daniel Almirall 1 Structural Nested Mean Models for Assessing Time-Varying Effect Moderation Daniel Almirall Center for Health Services Research, Durham VAMC & Dept. of Biostatistics, Duke University Medical Joint work

More information

Targeted Maximum Likelihood Estimation in Safety Analysis

Targeted Maximum Likelihood Estimation in Safety Analysis Targeted Maximum Likelihood Estimation in Safety Analysis Sam Lendle 1 Bruce Fireman 2 Mark van der Laan 1 1 UC Berkeley 2 Kaiser Permanente ISPE Advanced Topics Session, Barcelona, August 2012 1 / 35

More information

A Unification of Mediation and Interaction. A 4-Way Decomposition. Tyler J. VanderWeele

A Unification of Mediation and Interaction. A 4-Way Decomposition. Tyler J. VanderWeele Original Article A Unification of Mediation and Interaction A 4-Way Decomposition Tyler J. VanderWeele Abstract: The overall effect of an exposure on an outcome, in the presence of a mediator with which

More information

Fernando Martel García* and Leonard Wantchekon A Graphical Approximation to Generalization: Definitions and Diagrams

Fernando Martel García* and Leonard Wantchekon A Graphical Approximation to Generalization: Definitions and Diagrams JGD 2015; 6(1): 71 86 Open Access Fernando Martel García* and Leonard Wantchekon A Graphical Approximation to Generalization: Definitions and Diagrams Abstract: The fundamental problem of external validity

More information

IDENTIFICATION OF TREATMENT EFFECTS WITH SELECTIVE PARTICIPATION IN A RANDOMIZED TRIAL

IDENTIFICATION OF TREATMENT EFFECTS WITH SELECTIVE PARTICIPATION IN A RANDOMIZED TRIAL IDENTIFICATION OF TREATMENT EFFECTS WITH SELECTIVE PARTICIPATION IN A RANDOMIZED TRIAL BRENDAN KLINE AND ELIE TAMER Abstract. Randomized trials (RTs) are used to learn about treatment effects. This paper

More information

Introduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015

Introduction to causal identification. Nidhiya Menon IGC Summer School, New Delhi, July 2015 Introduction to causal identification Nidhiya Menon IGC Summer School, New Delhi, July 2015 Outline 1. Micro-empirical methods 2. Rubin causal model 3. More on Instrumental Variables (IV) Estimating causal

More information

For all For every For each For any There exists at least one There exists There is Some

For all For every For each For any There exists at least one There exists There is Some Section 1.3 Predicates and Quantifiers Assume universe of discourse is all the people who are participating in this course. Also let us assume that we know each person in the course. Consider the following

More information

Quantifying Causal Effects on Query Answering in Databases

Quantifying Causal Effects on Query Answering in Databases Quantifying Causal Effects on Query Answering in Databases Babak Salimi University of Washington February 2016 Collaborators: Leopoldo Bertossi (Carleton University), Dan Suciu (University of Washington),

More information

Causal Analysis After Haavelmo

Causal Analysis After Haavelmo After Haavelmo University College London UCL Department Seminar September 3, 2013 Economics has forgotten its own past. (Judea Pearl, 2012) Haavemo s Contributions to Causality: Two seminal papers (1943,

More information

Casual Mediation Analysis

Casual Mediation Analysis Casual Mediation Analysis Tyler J. VanderWeele, Ph.D. Upcoming Seminar: April 21-22, 2017, Philadelphia, Pennsylvania OXFORD UNIVERSITY PRESS Explanation in Causal Inference Methods for Mediation and Interaction

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data? Kosuke Imai Princeton University Asian Political Methodology Conference University of Sydney Joint

More information