Taming the challenge of extrapolation: From multiple experiments and observations to valid causal conclusions. Judea Pearl and Elias Bareinboim, UCLA

Size: px
Start display at page:

Download "Taming the challenge of extrapolation: From multiple experiments and observations to valid causal conclusions. Judea Pearl and Elias Bareinboim, UCLA"

Transcription

1 Taming the challenge of extrapolation: From multiple experiments and observations to valid causal conclusions Judea Pearl and Elias Bareinboim, UCLA One distinct feature of big data applications, setting them apart from traditional causal analyses, is that data are pooled together from multiple sources. Each represents a different population, studied under different conditions, some experimental and some observational, and each contaminated by different confounding factors and different sample selection mechanisms. All of these studies need be pooled together to answer causal questions in yet a new environment, unmatched by any of those studied. The problem, as described above, appears utterly hopeless, almost as hopeless as confronting physics problems in the 16th century, before the advent of algebra. My talk will demonstrate that the solutions to the two types of problems, data pooling on the one hand and algebraic equations on the other, are strikingly similar; they can be unraveled by symbolic operations and managed at ease by anyone who masters the two basic tools of causal inference: graphical models and the logic of causation. The net result is the emergence of algorithms that decide what piece of information must be extracted from each of the available data sources and how to combine those pieces together so as to provide valid answers to a variety of causal questions: hat if? How? and hy? To gain an appreciation for the task, the audience may wish to consider three toy problems described in Figure 3 of this introductory paper All three problems involve the same task: e gather causal information in one environment and we wish to generalize it to another, which differs from the first in a set of characteristics. How do we adjust the available information so as to account for differences between the two populations? It turns out, as readers will immediately discover, that there is no universal adjustment formula for this task; the adjustment should vary from case to case depending on the location of the set in the causal scheme of things. Traditional schemes like post stratification or re weighting would not work, except in special cases. Fortunately, however, once the set is allocated in the causal graph, the proper adjustment can be generated automatically using symbolic operations, in much the same way that we solve algebraic equations. The talk will describe how we go from two to multiple environments, how we characterize the idiosyncratic features of each population or environment, and how we can generalize from unrepresentative samples to population level effects, Related papers can be viewed here: Tutorials reprint.pdf Extrapolation problem

2 FUION + ETRAPOLATION: FROM MULTIPLE EPERIMENT AND OBERVATION TO VALID CAUAL CONCLUION Judea Pearl Elias Bareinboim University of California, Los Angeles ( THE DATA POOLING PROBLEM The problem How to combine results of several experimental and observational studies, each conducted on a different population and under a different set of conditions, so as to construct a valid estimate of effect size in yet a new population, unmatched by any of those studied. 1

3 THE PROBLEM IN REAL LIFE Target population * Query of interest: Q = P*(y do(x)) (a) Arkansas urvey data available (b) New ork urvey data Resembling target (c) Los Angeles urvey data ounger population (d) Boston Age not recorded Mostly successful lawyers (g) Toronto Randomized trial College students (e) an Francisco High post-treatment blood pressure (h) Utah RCT, paid volunteers, unemployed (f) Texas Mostly panish subjects High attrition (i) yoming RCT, young athletes THE PROBLEM IN MATHEMATIC Target population * Query of interest: Q = P*(y do(x)) (a) (b) (c) (d) (e) (f) (g) (h) (i) 2

4 FIVE LEON FROM THE THEATRE OF CAUAL INFERENCE 1. Every causal inference task must rely on judgmental, extra-data assumptions. 2. e have ways of encoding those assumptions mathematically. 3. e have a mathematical machinery to take those assumptions, combine them with data and derive answers to questions of interest. 4. e have a way of doing (2) and (3) in a language that permits us to judge the scientific plausibility of our assumptions and to derive their ramifications swiftly and transparently. 5. Items (2)-(4) make the fusion problem manageable. OUTLINE Three useful tools Reading a diagram Representing interventions Do-calculus: evaluating interventions Transportability: The two-population problem Three toy examples and their solutions hy transportation requires causal calculus The mathematics of solving extrapolation problems The inference engine Examples 6 3

5 READING A DIAGRAM Graph (G) (prinkler) C (Climate) (etness) R (Rain) Model (M) C f C (U C ) f (C,U ) R f R (C,U R ) f (, R,U ) Miracles do happen If the U 's are independent, the observed distribution P(C,R,,) satisfies constraints that are: (1) independent of the f 's and of P(U), (2) readable from the graph. READING A DIAGRAM (Cont.) Graph (G) C (Climate) Model (M) C f C (U C ) R f (C,U ) (prinkler) (Rain) R f R (C,U R ) (etness) f (, R,U ) Every missing arrow advertises an independency, conditional on a separating set. e.g., C (, R) R C Applications: 1. Model testing 2. tructure learning 3. Reducing "what if I do" questions to symbolic calculus 4. Reducing extrapolation to symbolic calculus 4

6 REPREENTING INTERVENTION Definition: Given a model M, the effect of setting to x, P( = y do (=x)), is equal to the probability of = y in a mutilated model M x, in which the equation for is replaced by = x. M U M x U (u) (u) = x x (u) The Fundamental Equation of Interventions: P( y do( x)) P M x ( y) P( x y) THE TO FUNDAMENTAL LA OF CAUAL INFERENCE 1. The Law of Counterfactuals (and Interventions) x (u) M x (u) (M generates and evaluates all counterfactuals.) 2. The Law of Conditional Independence (d-separation) ( sep ) G(M ) ( ) P(v) (eparation in the model independence in the distribution.) 5

7 ELIMINATING CONFOUNDING BIA THE BACK-DOOR CRITERION P(y do(x)) is estimable if there is a set of variables that d-separates from in Gx ("conditional ignorability ) x G G x Moreover, P(y do((x)) ( adjusting for z) 6 P(y x,z)p(z) z GOING BEOND ADJUTMENT Goal: Find the effect of moking on Cancer, P(c do(s)), given samples from P(, T, C), when latent variables confound the relationship -C. Query Genotype (Unobserved) Data moking Tar Cancer 6

8 IDENTIFICATION REDUCED TO CALCULU Query Genotype (Unobserved) moking Tar Cancer P(c do(s)) t P(c do(s),t)p(t do(s)) Probability Axioms t P(c do(s),do(t))p(t do(s)) Rule 2 t P(c do(s),do(t))p(t s) Rule 2 t P(c do(t)p(t s) Rule 3 s' t P(c do(t),s')p(s' do(t))p(t s) Probability Axioms s' t P(c t,s')p(s' do(t))p(t s) Rule 2 s' t P(c t,s')p(s')p(t s) Rule 3 Estimand DO-CALCULU (IDENTIFICATION REDUCED TO CALCULU) The following transformations are valid for every interventional distribution generated by a structural causal model M: Rule 1: Ignoring observations P(y do(x),z,w) P(y do(x),w), Rule 2: Action/observation exchange P(y do(x),do(z),w) P(y do(x), z,w), Rule 3: Ignoring actions P(y do(x),do(z),w) P(y do(x),w), 7

9 UMMAR: THE IDENTIFICATION PROBLEM I OLVED (NONPARAMETRICALL) The estimability of any expression of the form Q P(y 1, y 2,..., y n do(x 1, x 2,..., x m ), z 1,z 2,...,z k ) can be decided in polynomial time. If Q is estimable, then its estimand can be derived in polynomial time. The algorithm is complete. ame for ETT (hpitser 2008). THE TO POPULATION PROBLEM HAT CAN EPERIMENT IN LA TELL U ABOUT NC? (Age) (LA) *(N) (Intervention) (Outcome) Experimental study in LA Measured: Needed: Transport Formula: P(x, y,z) P(y do(x), z) Observational study in NC Measured: P*(x, y,z) P*(z) P(z) Q P*(y do(x))? P(y do(x), z)p*(z) z Q F(P,P do,p*) 8

10 TRANPORT FORMULA DEPEND ON THE CAUAL TOR Lesson: Not every dissimilarity deserves re-weighting. (a) a) represents age (b) z b) represents language skill? c) represents a bio-marker? (c) P *(y do(x)) P(y do(x), z)p *(z) P *(y do(x)) P(y do(x)) P *(y do(x)) P(y do(x), z)p *(z x) TRANPORT FORMULA DEPEND ON THE CAUAL TOR Lesson: There are limits to -ignorability. (a) (b) (c) a) represents age P *(y do(x)) P(y do(x), z)p *(z) z b) represents language skill P *(y do(x)) P(y do(x)) c) represents a bio-marker P *(y do(x)) P(y do(x), z)p *(z x) 9

11 TRANPORTABILIT REDUCED TO CALCULU Theorem A causal relation R is transportable from to * if and only if it is reducible, using the rules of do-calculus, to an expression in which is separated from do( ). Query R * P*(y do(x)) P(y do(x),s) P(y do(x), s,w)p(w do(x), s) w Estimand P(y do(x),w)p(w s) w P(y do(x),w)p*(w) w 19 REULT: ALGORITHM TO DETERMINE IF AN EFFECT I TRANPORTABLE U T ' V INPUT: Annotated Causal Graph Factors creating differences OUTPUT: 1. Transportable or not? 2. Measurements to be taken in the experimental study 3. Measurements to be taken in the target population 4. A transport formula 5. Completeness (Bareinboim, 2012) P*(y do(x)) P(y do(x), z) P *(z w) P(w do(w),t)p *(t) z w t 20 10

12 HICH MODEL LICENE THE TRANPORT OF THE CAUAL EFFECT External factors creating disparities es No es (a) (b) (c) es es No (d) (e) (f) FROM TO TO MULTIPLE ENVIRONMENT Target population * (a) (b) R = P*(y do(x)) (c) (d) (e) (f) (g) (h) (i) 11

13 UMMAR OF TRANPORTABILIT REULT Nonparametric transportability of experimental results from multiple environments can be determined provided that commonalities and differences are encoded in selection diagrams. hen transportability is feasible, the transport formula can be derived in polynomial time. The algorithm is complete. TRANPORTABILIT VERU AMPLING ELECTION BIA Transportability (Beach proximity) (Age) election Bias (Age) (ads) (Outcome) = disparity-producing factors P *(y do(x)) P(y do(x), 1) P(y do(x), z)p(z 1) z (ads) (Outcome) (Army service) = sampling mechanism P *(y do(x)) P(y do(x)) P(y do(x), z, 1) P(z) z 12

14 THE INFERENCE ENGINE Query (The thing I want estimated) Calculus Model (My perception of reality) Design (Type of data collected and how) Goal olution (Estimand) A problem is solved if the model permits the calculus to transform the Query into a format deliverable by the design. BAB INFERENCE ENGINE Query P(, ) (The thing I want estimated) Data {P(), P()} Calculus Model (My perception of reality) Design (Type of data collected and how) Goal olution (Estimand) License: P(, ) P( )P( ) P()P( ) A problem is solved if the model permits the calculus to transform the Query into a format deliverable by the design. 13

15 JUVENILE INFERENCE ENGINE Query P(y do(x)) (The thing I want estimated) Data P(, ) Calculus Model (My perception of reality) Design (Type of data collected and how) Goal License: No back-door olution (Estimand) P( ) A problem is solved if the model permits the calculus to transform the Query into a format deliverable by the design. THE TRANPORT ENGINE Query P*(y do(x)) = P(y do(x), = s) (The thing I want estimated) Data P(y do(x), ) P(,, = s) Design (Type of data collected and how) Calculus Goal Model (My perception of reality) P(y do(x),z)p(z x,s) z olution (Estimand) = s A problem is solved if the model permits the calculus to transform the Query into a format deliverable by the design. 14

16 THE ELECTION-BIA ENGINE Data P(y do(x),, = 1) P(,, ) Design (Type of data collected and how) Query P*(y do(x)) = P(y do(x)) (The thing I want estimated) Calculus Goal Model (My perception of reality) P(y do(x), z, 1)P(z x) z olution (Estimand) A problem is solved if the model permits the calculus to transform the Query into a format deliverable by the design. FIVE LEON FROM THE THEATRE OF CAUAL INFERENCE 1. Every causal inference task must rely on judgmental, extra-data assumptions. 2. e have ways of encoding those assumptions mathematically. 3. e have a mathematical machinery to take those assumptions, combine them with data and derive answers to questions of interest. 4. e have a way of doing (2) and (3) in a language that permits us to judge the scientific plausibility of our assumptions and to derive their ramifications swiftly and transparently. 5. Items (2)-(4) make causal inference easy, fun, and extremely productive. 15

17 CONCLUION More has been learned about causal inference in the last few decades than the sum total of everything that had been learned about it in all prior recorded history. (Gary King, Harvard, 2014) More will be learned about causal inference in the next decade than most of us imagine today. Thank you 16

18 More:

THE MATHEMATICS OF CAUSAL INFERENCE With reflections on machine learning and the logic of science. Judea Pearl UCLA

THE MATHEMATICS OF CAUSAL INFERENCE With reflections on machine learning and the logic of science. Judea Pearl UCLA THE MATHEMATICS OF CAUSAL INFERENCE With reflections on machine learning and the logic of science Judea Pearl UCLA OUTLINE 1. The causal revolution from statistics to counterfactuals from Babylon to Athens

More information

CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES. Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea)

CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES. Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea) CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea) OUTLINE Inference: Statistical vs. Causal distinctions and mental barriers Formal semantics

More information

OUTLINE THE MATHEMATICS OF CAUSAL INFERENCE IN STATISTICS. Judea Pearl University of California Los Angeles (www.cs.ucla.

OUTLINE THE MATHEMATICS OF CAUSAL INFERENCE IN STATISTICS. Judea Pearl University of California Los Angeles (www.cs.ucla. THE MATHEMATICS OF CAUSAL INFERENCE IN STATISTICS Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea) OUTLINE Modeling: Statistical vs. Causal Causal Models and Identifiability to

More information

CAUSES AND COUNTERFACTUALS: CONCEPTS, PRINCIPLES AND TOOLS

CAUSES AND COUNTERFACTUALS: CONCEPTS, PRINCIPLES AND TOOLS CAUSES AND COUNTERFACTUALS: CONCEPTS, PRINCIPLES AND TOOLS Judea Pearl Elias Bareinboim University of California, Los Angeles {judea, eb}@cs.ucla.edu NIPS 2013 Tutorial OUTLINE! Concepts: * Causal inference

More information

Introduction to Causal Calculus

Introduction to Causal Calculus Introduction to Causal Calculus Sanna Tyrväinen University of British Columbia August 1, 2017 1 / 1 2 / 1 Bayesian network Bayesian networks are Directed Acyclic Graphs (DAGs) whose nodes represent random

More information

CAUSAL INFERENCE IN STATISTICS. A Gentle Introduction. Judea Pearl University of California Los Angeles (www.cs.ucla.

CAUSAL INFERENCE IN STATISTICS. A Gentle Introduction. Judea Pearl University of California Los Angeles (www.cs.ucla. CAUSAL INFERENCE IN STATISTICS A Gentle Introduction Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea/jsm12) OUTLINE Inference: Statistical vs. Causal, distinctions, and mental

More information

OUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS. Judea Pearl University of California Los Angeles (www.cs.ucla.

OUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS. Judea Pearl University of California Los Angeles (www.cs.ucla. OUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea/) Statistical vs. Causal vs. Counterfactual inference: syntax and semantics

More information

OUTLINE CAUSES AND COUNTERFACTUALS IN THE EMPIRICAL SCIENCES. Judea Pearl University of California Los Angeles (www.cs.ucla.

OUTLINE CAUSES AND COUNTERFACTUALS IN THE EMPIRICAL SCIENCES. Judea Pearl University of California Los Angeles (www.cs.ucla. CAUE AND COUNTERFACTUAL IN THE EMPIRICAL CIENCE Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea/jsm9) OUTLINE Inference: tatistical vs. Causal, distinctions, and mental barriers

More information

External Validity and Transportability: A Formal Approach

External Validity and Transportability: A Formal Approach Int. tatistical Inst.: Proc. 58th World tatistical Congress, 2011, Dublin (ession IP024) p.335 External Validity and Transportability: A Formal Approach Pearl, Judea University of California, Los Angeles,

More information

OF CAUSAL INFERENCE THE MATHEMATICS IN STATISTICS. Department of Computer Science. Judea Pearl UCLA

OF CAUSAL INFERENCE THE MATHEMATICS IN STATISTICS. Department of Computer Science. Judea Pearl UCLA THE MATHEMATICS OF CAUSAL INFERENCE IN STATISTICS Judea earl Department of Computer Science UCLA OUTLINE Statistical vs. Causal Modeling: distinction and mental barriers N-R vs. structural model: strengths

More information

The Do-Calculus Revisited Judea Pearl Keynote Lecture, August 17, 2012 UAI-2012 Conference, Catalina, CA

The Do-Calculus Revisited Judea Pearl Keynote Lecture, August 17, 2012 UAI-2012 Conference, Catalina, CA The Do-Calculus Revisited Judea Pearl Keynote Lecture, August 17, 2012 UAI-2012 Conference, Catalina, CA Abstract The do-calculus was developed in 1995 to facilitate the identification of causal effects

More information

ANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS

ANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS ANALYTIC COMPARISON of Pearl and Rubin CAUSAL FRAMEWORKS Content Page Part I. General Considerations Chapter 1. What is the question? 16 Introduction 16 1. Randomization 17 1.1 An Example of Randomization

More information

Transportability of Causal and Statistical Relations: A Formal Approach

Transportability of Causal and Statistical Relations: A Formal Approach In Proceedings of the 25th AAAI Conference on Artificial Intelligence, August 7 11, 2011, an Francisco, CA. AAAI Press, Menlo Park, CA, pp. 247-254. TECHNICAL REPORT R-372-A August 2011 Transportability

More information

CAUSAL INFERENCE AS COMPUTATIONAL LEARNING. Judea Pearl University of California Los Angeles (

CAUSAL INFERENCE AS COMPUTATIONAL LEARNING. Judea Pearl University of California Los Angeles ( CAUSAL INFERENCE AS COMUTATIONAL LEARNING Judea earl University of California Los Angeles www.cs.ucla.edu/~judea OUTLINE Inference: Statistical vs. Causal distinctions and mental barriers Formal semantics

More information

CAUSALITY. Models, Reasoning, and Inference 1 CAMBRIDGE UNIVERSITY PRESS. Judea Pearl. University of California, Los Angeles

CAUSALITY. Models, Reasoning, and Inference 1 CAMBRIDGE UNIVERSITY PRESS. Judea Pearl. University of California, Los Angeles CAUSALITY Models, Reasoning, and Inference Judea Pearl University of California, Los Angeles 1 CAMBRIDGE UNIVERSITY PRESS Preface page xiii 1 Introduction to Probabilities, Graphs, and Causal Models 1

More information

PANEL ON MEDIATION AND OTHER MIRACLES OF THE 21 ST CENTURY

PANEL ON MEDIATION AND OTHER MIRACLES OF THE 21 ST CENTURY PANEL ON EDIAION AND OHER IRACLES OF HE 21 S CENUR Judea Pearl UCLA With Kosuke Imai and many others HE QUESION OF EDIAION (direct vs. indirect effects) 1. Why decompose effects? 2. What is the definition

More information

UCLA Department of Statistics Papers

UCLA Department of Statistics Papers UCLA Department of Statistics Papers Title Comment on `Causal inference, probability theory, and graphical insights' (by Stuart G. Baker) Permalink https://escholarship.org/uc/item/9863n6gg Journal Statistics

More information

Causality. Pedro A. Ortega. 18th February Computational & Biological Learning Lab University of Cambridge

Causality. Pedro A. Ortega. 18th February Computational & Biological Learning Lab University of Cambridge Causality Pedro A. Ortega Computational & Biological Learning Lab University of Cambridge 18th February 2010 Why is causality important? The future of machine learning is to control (the world). Examples

More information

Statistical Models for Causal Analysis

Statistical Models for Causal Analysis Statistical Models for Causal Analysis Teppei Yamamoto Keio University Introduction to Causal Inference Spring 2016 Three Modes of Statistical Inference 1. Descriptive Inference: summarizing and exploring

More information

External Validity: From do-calculus to Transportability across Populations

External Validity: From do-calculus to Transportability across Populations ubmitted, tatistical cience. TECHNICAL REPORT R-400 May 2012 External Validity: From do-calculus to Transportability across Populations Judea Pearl and Elias Bareinboim University of California, Los Angeles

More information

THE SCIENCE OF CAUSE AND EFFECT

THE SCIENCE OF CAUSE AND EFFECT HE CIENCE OF CAUE AND EFFEC Judea Pearl University of California Los Angeles (www.cs.ucla.edu/~judea/) OULINE 1. he causal revolution from associations to intervention to counterfactuals 2. he two fundamental

More information

From Causality, Second edition, Contents

From Causality, Second edition, Contents From Causality, Second edition, 2009. Preface to the First Edition Preface to the Second Edition page xv xix 1 Introduction to Probabilities, Graphs, and Causal Models 1 1.1 Introduction to Probability

More information

The Mathematics of Causal Inference

The Mathematics of Causal Inference In Joint Statistical Meetings Proceedings, Alexandria, VA: American Statistical Association, 2515-2529, 2013. JSM 2013 - Section on Statistical Education TECHNICAL REPORT R-416 September 2013 The Mathematics

More information

Counterfactual Reasoning in Algorithmic Fairness

Counterfactual Reasoning in Algorithmic Fairness Counterfactual Reasoning in Algorithmic Fairness Ricardo Silva University College London and The Alan Turing Institute Joint work with Matt Kusner (Warwick/Turing), Chris Russell (Sussex/Turing), and Joshua

More information

Causal Inference. Prediction and causation are very different. Typical questions are:

Causal Inference. Prediction and causation are very different. Typical questions are: Causal Inference Prediction and causation are very different. Typical questions are: Prediction: Predict Y after observing X = x Causation: Predict Y after setting X = x. Causation involves predicting

More information

A Linear Microscope for Interventions and Counterfactuals

A Linear Microscope for Interventions and Counterfactuals Forthcoming, Journal of Causal Inference (Causal, Casual, and Curious Section). TECHNICAL REPORT R-459 March 2017 A Linear Microscope for Interventions and Counterfactuals Judea Pearl University of California,

More information

Front-Door Adjustment

Front-Door Adjustment Front-Door Adjustment Ethan Fosse Princeton University Fall 2016 Ethan Fosse Princeton University Front-Door Adjustment Fall 2016 1 / 38 1 Preliminaries 2 Examples of Mechanisms in Sociology 3 Bias Formulas

More information

A Linear Microscope for Interventions and Counterfactuals

A Linear Microscope for Interventions and Counterfactuals Journal of Causal Inference, 5(1):March2017. (Errata marked; pages 2 and 6.) J. Causal Infer. 2017; 20170003 TECHNICAL REPORT R-459 March 2017 Causal, Casual and Curious Judea Pearl A Linear Microscope

More information

The Effects of Interventions

The Effects of Interventions 3 The Effects of Interventions 3.1 Interventions The ultimate aim of many statistical studies is to predict the effects of interventions. When we collect data on factors associated with wildfires in the

More information

Recovering Causal Effects from Selection Bias

Recovering Causal Effects from Selection Bias In Sven Koenig and Blai Bonet (Eds.), Proceedings of the Twenty-Ninth AAAI Conference on Artificial Intelligence, Palo Alto, CA: AAAI Press, 2015, forthcoming. TECHNICAL REPORT R-445 DECEMBER 2014 Recovering

More information

Identification of Conditional Interventional Distributions

Identification of Conditional Interventional Distributions Identification of Conditional Interventional Distributions Ilya Shpitser and Judea Pearl Cognitive Systems Laboratory Department of Computer Science University of California, Los Angeles Los Angeles, CA.

More information

Lecture Notes 22 Causal Inference

Lecture Notes 22 Causal Inference Lecture Notes 22 Causal Inference Prediction and causation are very different. Typical questions are: Prediction: Predict after observing = x Causation: Predict after setting = x. Causation involves predicting

More information

Complete Identification Methods for the Causal Hierarchy

Complete Identification Methods for the Causal Hierarchy Journal of Machine Learning Research 9 (2008) 1941-1979 Submitted 6/07; Revised 2/08; Published 9/08 Complete Identification Methods for the Causal Hierarchy Ilya Shpitser Judea Pearl Cognitive Systems

More information

Estimating complex causal effects from incomplete observational data

Estimating complex causal effects from incomplete observational data Estimating complex causal effects from incomplete observational data arxiv:1403.1124v2 [stat.me] 2 Jul 2014 Abstract Juha Karvanen Department of Mathematics and Statistics, University of Jyväskylä, Jyväskylä,

More information

Bounding the Probability of Causation in Mediation Analysis

Bounding the Probability of Causation in Mediation Analysis arxiv:1411.2636v1 [math.st] 10 Nov 2014 Bounding the Probability of Causation in Mediation Analysis A. P. Dawid R. Murtas M. Musio February 16, 2018 Abstract Given empirical evidence for the dependence

More information

On Identifying Causal Effects

On Identifying Causal Effects 29 1 Introduction This paper deals with the problem of inferring cause-effect relationships from a combination of data and theoretical assumptions. This problem arises in diverse fields such as artificial

More information

Local Characterizations of Causal Bayesian Networks

Local Characterizations of Causal Bayesian Networks In M. Croitoru, S. Rudolph, N. Wilson, J. Howse, and O. Corby (Eds.), GKR 2011, LNAI 7205, Berlin Heidelberg: Springer-Verlag, pp. 1-17, 2012. TECHNICAL REPORT R-384 May 2011 Local Characterizations of

More information

Causal Directed Acyclic Graphs

Causal Directed Acyclic Graphs Causal Directed Acyclic Graphs Kosuke Imai Harvard University STAT186/GOV2002 CAUSAL INFERENCE Fall 2018 Kosuke Imai (Harvard) Causal DAGs Stat186/Gov2002 Fall 2018 1 / 15 Elements of DAGs (Pearl. 2000.

More information

Summary of the Bayes Net Formalism. David Danks Institute for Human & Machine Cognition

Summary of the Bayes Net Formalism. David Danks Institute for Human & Machine Cognition Summary of the Bayes Net Formalism David Danks Institute for Human & Machine Cognition Bayesian Networks Two components: 1. Directed Acyclic Graph (DAG) G: There is a node for every variable D: Some nodes

More information

Confounding Equivalence in Causal Inference

Confounding Equivalence in Causal Inference In Proceedings of UAI, 2010. In P. Grunwald and P. Spirtes, editors, Proceedings of UAI, 433--441. AUAI, Corvallis, OR, 2010. TECHNICAL REPORT R-343 July 2010 Confounding Equivalence in Causal Inference

More information

Bayes Nets III: Inference

Bayes Nets III: Inference 1 Hal Daumé III (me@hal3.name) Bayes Nets III: Inference Hal Daumé III Computer Science University of Maryland me@hal3.name CS 421: Introduction to Artificial Intelligence 10 Apr 2012 Many slides courtesy

More information

Identification and Estimation of Causal Effects from Dependent Data

Identification and Estimation of Causal Effects from Dependent Data Identification and Estimation of Causal Effects from Dependent Data Eli Sherman esherman@jhu.edu with Ilya Shpitser Johns Hopkins Computer Science 12/6/2018 Eli Sherman Identification and Estimation of

More information

Announcements. CS 188: Artificial Intelligence Spring Bayes Net Semantics. Probabilities in BNs. All Conditional Independences

Announcements. CS 188: Artificial Intelligence Spring Bayes Net Semantics. Probabilities in BNs. All Conditional Independences CS 188: Artificial Intelligence Spring 2011 Announcements Assignments W4 out today --- this is your last written!! Any assignments you have not picked up yet In bin in 283 Soda [same room as for submission

More information

Chapter 1: Climate and the Atmosphere

Chapter 1: Climate and the Atmosphere Chapter 1: Climate and the Atmosphere ECC: 1.2.1 WARM-UP Students complete and discuss their responses to prompts in an Anticipation Guide. (10 min) Anticipation Guide. The expectation is that you will

More information

Stability of causal inference: Open problems

Stability of causal inference: Open problems Stability of causal inference: Open problems Leonard J. Schulman & Piyush Srivastava California Institute of Technology UAI 2016 Workshop on Causality Interventions without experiments [Pearl, 1995] Genetic

More information

Causal Inference & Reasoning with Causal Bayesian Networks

Causal Inference & Reasoning with Causal Bayesian Networks Causal Inference & Reasoning with Causal Bayesian Networks Neyman-Rubin Framework Potential Outcome Framework: for each unit k and each treatment i, there is a potential outcome on an attribute U, U ik,

More information

Fernando Martel García* and Leonard Wantchekon A Graphical Approximation to Generalization: Definitions and Diagrams

Fernando Martel García* and Leonard Wantchekon A Graphical Approximation to Generalization: Definitions and Diagrams JGD 2015; 6(1): 71 86 Open Access Fernando Martel García* and Leonard Wantchekon A Graphical Approximation to Generalization: Definitions and Diagrams Abstract: The fundamental problem of external validity

More information

Announcements. CS 188: Artificial Intelligence Fall Causality? Example: Traffic. Topology Limits Distributions. Example: Reverse Traffic

Announcements. CS 188: Artificial Intelligence Fall Causality? Example: Traffic. Topology Limits Distributions. Example: Reverse Traffic CS 188: Artificial Intelligence Fall 2008 Lecture 16: Bayes Nets III 10/23/2008 Announcements Midterms graded, up on glookup, back Tuesday W4 also graded, back in sections / box Past homeworks in return

More information

CS 188: Artificial Intelligence Spring Announcements

CS 188: Artificial Intelligence Spring Announcements CS 188: Artificial Intelligence Spring 2011 Lecture 16: Bayes Nets IV Inference 3/28/2011 Pieter Abbeel UC Berkeley Many slides over this course adapted from Dan Klein, Stuart Russell, Andrew Moore Announcements

More information

Gov 2002: 4. Observational Studies and Confounding

Gov 2002: 4. Observational Studies and Confounding Gov 2002: 4. Observational Studies and Confounding Matthew Blackwell September 10, 2015 Where are we? Where are we going? Last two weeks: randomized experiments. From here on: observational studies. What

More information

Ignoring the matching variables in cohort studies - when is it valid, and why?

Ignoring the matching variables in cohort studies - when is it valid, and why? Ignoring the matching variables in cohort studies - when is it valid, and why? Arvid Sjölander Abstract In observational studies of the effect of an exposure on an outcome, the exposure-outcome association

More information

Gov 2002: 5. Matching

Gov 2002: 5. Matching Gov 2002: 5. Matching Matthew Blackwell October 1, 2015 Where are we? Where are we going? Discussed randomized experiments, started talking about observational data. Last week: no unmeasured confounders

More information

Instrumental Sets. Carlos Brito. 1 Introduction. 2 The Identification Problem

Instrumental Sets. Carlos Brito. 1 Introduction. 2 The Identification Problem 17 Instrumental Sets Carlos Brito 1 Introduction The research of Judea Pearl in the area of causality has been very much acclaimed. Here we highlight his contributions for the use of graphical languages

More information

Mediation analysis for different types of Causal questions: Effect of Cause and Cause of Effect

Mediation analysis for different types of Causal questions: Effect of Cause and Cause of Effect Università degli Studi di Cagliari Dipartimento di Matematica e Informatica Dottorato di Ricerca in Matematica e Calcolo Scientifico Ciclo XXVIII Ph.D. Thesis Mediation analysis for different types of

More information

Probabilistic Causal Models

Probabilistic Causal Models Probabilistic Causal Models A Short Introduction Robin J. Evans www.stat.washington.edu/ rje42 ACMS Seminar, University of Washington 24th February 2011 1/26 Acknowledgements This work is joint with Thomas

More information

Today. Why do we need Bayesian networks (Bayes nets)? Factoring the joint probability. Conditional independence

Today. Why do we need Bayesian networks (Bayes nets)? Factoring the joint probability. Conditional independence Bayesian Networks Today Why do we need Bayesian networks (Bayes nets)? Factoring the joint probability Conditional independence What is a Bayes net? Mostly, we discuss discrete r.v. s What can you do with

More information

Causal inference from big data: Theoretical foundations and the data-fusion problem Elias Bareinboim and Judea Pearl

Causal inference from big data: Theoretical foundations and the data-fusion problem Elias Bareinboim and Judea Pearl Written for the Proceedings of the National Academy of ciences. r450 2015/6/30 23:48 page 1 #1 Causal inference from big data: Theoretical foundations and the data-fusion problem Elias Bareinboim and Judea

More information

Generalized Do-Calculus with Testable Causal Assumptions

Generalized Do-Calculus with Testable Causal Assumptions eneralized Do-Calculus with Testable Causal Assumptions Jiji Zhang Division of Humanities and Social Sciences California nstitute of Technology Pasadena CA 91125 jiji@hss.caltech.edu Abstract A primary

More information

Causality II: How does causal inference fit into public health and what it is the role of statistics?

Causality II: How does causal inference fit into public health and what it is the role of statistics? Causality II: How does causal inference fit into public health and what it is the role of statistics? Statistics for Psychosocial Research II November 13, 2006 1 Outline Potential Outcomes / Counterfactual

More information

Propensity Score Analysis with Hierarchical Data

Propensity Score Analysis with Hierarchical Data Propensity Score Analysis with Hierarchical Data Fan Li Alan Zaslavsky Mary Beth Landrum Department of Health Care Policy Harvard Medical School May 19, 2008 Introduction Population-based observational

More information

What Causality Is (stats for mathematicians)

What Causality Is (stats for mathematicians) What Causality Is (stats for mathematicians) Andrew Critch UC Berkeley August 31, 2011 Introduction Foreword: The value of examples With any hard question, it helps to start with simple, concrete versions

More information

The decision theoretic approach to causal inference OR Rethinking the paradigms of causal modelling

The decision theoretic approach to causal inference OR Rethinking the paradigms of causal modelling The decision theoretic approach to causal inference OR Rethinking the paradigms of causal modelling A.P.Dawid 1 and S.Geneletti 2 1 University of Cambridge, Statistical Laboratory 2 Imperial College Department

More information

Revision list for Pearl s THE FOUNDATIONS OF CAUSAL INFERENCE

Revision list for Pearl s THE FOUNDATIONS OF CAUSAL INFERENCE Revision list for Pearl s THE FOUNDATIONS OF CAUSAL INFERENCE insert p. 90: in graphical terms or plain causal language. The mediation problem of Section 6 illustrates how such symbiosis clarifies the

More information

5.3 Graphs and Identifiability

5.3 Graphs and Identifiability 218CHAPTER 5. CAUSALIT AND STRUCTURAL MODELS IN SOCIAL SCIENCE AND ECONOMICS.83 AFFECT.65.23 BEHAVIOR COGNITION Figure 5.5: Untestable model displaying quantitative causal information derived. method elucidates

More information

Confounding Equivalence in Causal Inference

Confounding Equivalence in Causal Inference Revised and submitted, October 2013. TECHNICAL REPORT R-343w October 2013 Confounding Equivalence in Causal Inference Judea Pearl Cognitive Systems Laboratory Computer Science Department University of

More information

Single World Intervention Graphs (SWIGs): A Unification of the Counterfactual and Graphical Approaches to Causality

Single World Intervention Graphs (SWIGs): A Unification of the Counterfactual and Graphical Approaches to Causality Single World Intervention Graphs (SWIGs): A Unification of the Counterfactual and Graphical Approaches to Causality Thomas S. Richardson University of Washington James M. Robins Harvard University Working

More information

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data?

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Panel Data? Kosuke Imai Department of Politics Center for Statistics and Machine Learning Princeton University Joint

More information

A little (more) about me.

A little (more) about me. Comparative Effectiveness Research Methods Training Module 2: Research Designs J. Michael Oakes, PhD Associate Professor Division of Epidemiology University of Minnesota oakes007@umn.edu A little (more)

More information

Causal inference in statistics: An overview

Causal inference in statistics: An overview Statistics Surveys Vol. 3 (2009) 96 146 ISSN: 1935-7516 DOI: 10.1214/09-SS057 Causal inference in statistics: An overview Judea Pearl Computer Science Department University of California, Los Angeles,

More information

LEARNING CAUSAL EFFECTS: BRIDGING INSTRUMENTS AND BACKDOORS

LEARNING CAUSAL EFFECTS: BRIDGING INSTRUMENTS AND BACKDOORS LEARNING CAUSAL EFFECTS: BRIDGING INSTRUMENTS AND BACKDOORS Ricardo Silva Department of Statistical Science and Centre for Computational Statistics and Machine Learning, UCL Fellow, Alan Turing Institute

More information

Bayes Nets: Independence

Bayes Nets: Independence Bayes Nets: Independence [These slides were created by Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley. All CS188 materials are available at http://ai.berkeley.edu.] Bayes Nets A Bayes

More information

m-transportability: Transportability of a Causal Effect from Multiple Environments

m-transportability: Transportability of a Causal Effect from Multiple Environments m-transportability: Transportability of a Causal Effect from Multiple Environments Sanghack Lee and Vasant Honavar Artificial Intelligence Research Laboratory Department of Computer Science Iowa State

More information

Causal inference from big data: Theoretical foundations and the data-fusion problem Elias Bareinboim and Judea Pearl

Causal inference from big data: Theoretical foundations and the data-fusion problem Elias Bareinboim and Judea Pearl Forthcoming, Proceedings of the National Academy of ciences. r450 2015/12/4 15:47 page 1 #1 Causal inference from big data: Theoretical foundations and the data-fusion problem Elias Bareinboim and Judea

More information

Causal analysis after Haavelmo. 8th/last Lecture - Hedibert Lopes

Causal analysis after Haavelmo. 8th/last Lecture - Hedibert Lopes Causal analysis after Haavelmo 8th/last Lecture - Hedibert Lopes Insper - Institute of Education and Research December 1st, 2015 Hedibert Lopes (Insper) Heckman & Pinto (2014) December 1st, 2015 1 / 39

More information

Part 1: You are given the following system of two equations: x + 2y = 16 3x 4y = 2

Part 1: You are given the following system of two equations: x + 2y = 16 3x 4y = 2 Solving Systems of Equations Algebraically Teacher Notes Comment: As students solve equations throughout this task, have them continue to explain each step using properties of operations or properties

More information

Identifying Linear Causal Effects

Identifying Linear Causal Effects Identifying Linear Causal Effects Jin Tian Department of Computer Science Iowa State University Ames, IA 50011 jtian@cs.iastate.edu Abstract This paper concerns the assessment of linear cause-effect relationships

More information

The International Journal of Biostatistics

The International Journal of Biostatistics The International Journal of Biostatistics Volume 7, Issue 1 2011 Article 16 A Complete Graphical Criterion for the Adjustment Formula in Mediation Analysis Ilya Shpitser, Harvard University Tyler J. VanderWeele,

More information

1 Impact Evaluation: Randomized Controlled Trial (RCT)

1 Impact Evaluation: Randomized Controlled Trial (RCT) Introductory Applied Econometrics EEP/IAS 118 Fall 2013 Daley Kutzman Section #12 11-20-13 Warm-Up Consider the two panel data regressions below, where i indexes individuals and t indexes time in months:

More information

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation?

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation? Did You Mean Association Or Correlation? AP Statistics Chapter 8 Be careful not to use the word correlation when you really mean association. Often times people will incorrectly use the word correlation

More information

Transportability from Multiple Environments with Limited Experiments

Transportability from Multiple Environments with Limited Experiments In C.J.C. Burges, L. Bottou, M. Welling,. Ghahramani, and K.Q. Weinberger (Eds.), Advances in Neural Information Processing Systems 6 (NIPS Proceedings), 36-44, 03. TECHNICAL REPORT R-49 December 03 Transportability

More information

CS 5522: Artificial Intelligence II

CS 5522: Artificial Intelligence II CS 5522: Artificial Intelligence II Bayes Nets: Independence Instructor: Alan Ritter Ohio State University [These slides were adapted from CS188 Intro to AI at UC Berkeley. All materials available at http://ai.berkeley.edu.]

More information

Difference-in-Differences Methods

Difference-in-Differences Methods Difference-in-Differences Methods Teppei Yamamoto Keio University Introduction to Causal Inference Spring 2016 1 Introduction: A Motivating Example 2 Identification 3 Estimation and Inference 4 Diagnostics

More information

Causal Bayesian networks. Peter Antal

Causal Bayesian networks. Peter Antal Causal Bayesian networks Peter Antal antal@mit.bme.hu A.I. 4/8/2015 1 Can we represent exactly (in)dependencies by a BN? From a causal model? Suff.&nec.? Can we interpret edges as causal relations with

More information

Probabilistic Graphical Models and Bayesian Networks. Artificial Intelligence Bert Huang Virginia Tech

Probabilistic Graphical Models and Bayesian Networks. Artificial Intelligence Bert Huang Virginia Tech Probabilistic Graphical Models and Bayesian Networks Artificial Intelligence Bert Huang Virginia Tech Concept Map for Segment Probabilistic Graphical Models Probabilistic Time Series Models Particle Filters

More information

Recovering from Selection Bias in Causal and Statistical Inference

Recovering from Selection Bias in Causal and Statistical Inference Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence Recovering from Selection Bias in Causal and Statistical Inference Elias Bareinboim Cognitive Systems Laboratory Computer Science

More information

Announcements. CS 188: Artificial Intelligence Spring Probability recap. Outline. Bayes Nets: Big Picture. Graphical Model Notation

Announcements. CS 188: Artificial Intelligence Spring Probability recap. Outline. Bayes Nets: Big Picture. Graphical Model Notation CS 188: Artificial Intelligence Spring 2010 Lecture 15: Bayes Nets II Independence 3/9/2010 Pieter Abbeel UC Berkeley Many slides over the course adapted from Dan Klein, Stuart Russell, Andrew Moore Current

More information

Probabilistic Reasoning. (Mostly using Bayesian Networks)

Probabilistic Reasoning. (Mostly using Bayesian Networks) Probabilistic Reasoning (Mostly using Bayesian Networks) Introduction: Why probabilistic reasoning? The world is not deterministic. (Usually because information is limited.) Ways of coping with uncertainty

More information

Causal and Counterfactual Inference

Causal and Counterfactual Inference Forthcoming section in The Handbook of Rationality, MIT press. TECHNICAL REPORT R-485 October 2018 Causal and Counterfactual Inference Judea Pearl University of California, Los Angeles Computer Science

More information

Compare and Contrast T-CHART

Compare and Contrast T-CHART Using the Language of Comparison and Contrast Compare and Contrast T-CHART List the things that are similar or the same for (x+ 1)( x+ 4) and x + 9x+4on one side of the T- Chart and things that are different

More information

Math 10C Course Outline

Math 10C Course Outline Math 10C Course Outline Mr. Collin Dillon St. Joseph Catholic High School 8:40am-9:55am Phone (780) 532-7779 Email CollinDillon@gpcsd.ca Mr. Dillon s Website The essence of mathematics is not to make simple

More information

Algebra II Honors Curriculum

Algebra II Honors Curriculum Algebra II Honors Curriculum Algebra II Honors Course Overview Course Description Honors Algebra 2 emphasizes the structure of algebra. The students will study complex numbers, relations and functions,

More information

Causal Models with Hidden Variables

Causal Models with Hidden Variables Causal Models with Hidden Variables Robin J. Evans www.stats.ox.ac.uk/ evans Department of Statistics, University of Oxford Quantum Networks, Oxford August 2017 1 / 44 Correlation does not imply causation

More information

Rewriting Absolute Value Functions as Piece-wise Defined Functions

Rewriting Absolute Value Functions as Piece-wise Defined Functions Rewriting Absolute Value Functions as Piece-wise Defined Functions Consider the absolute value function f ( x) = 2x+ 4-3. Sketch the graph of f(x) using the strategies learned in Algebra II finding the

More information

Single World Intervention Graphs (SWIGs):

Single World Intervention Graphs (SWIGs): Single World Intervention Graphs (SWIGs): Unifying the Counterfactual and Graphical Approaches to Causality Thomas Richardson Department of Statistics University of Washington Joint work with James Robins

More information

Automatic Causal Discovery

Automatic Causal Discovery Automatic Causal Discovery Richard Scheines Peter Spirtes, Clark Glymour Dept. of Philosophy & CALD Carnegie Mellon 1 Outline 1. Motivation 2. Representation 3. Discovery 4. Using Regression for Causal

More information

Please bring the task to your first physics lesson and hand it to the teacher.

Please bring the task to your first physics lesson and hand it to the teacher. Pre-enrolment task for 2014 entry Physics Why do I need to complete a pre-enrolment task? This bridging pack serves a number of purposes. It gives you practice in some of the important skills you will

More information

Causal Reasoning with Ancestral Graphs

Causal Reasoning with Ancestral Graphs Journal of Machine Learning Research 9 (2008) 1437-1474 Submitted 6/07; Revised 2/08; Published 7/08 Causal Reasoning with Ancestral Graphs Jiji Zhang Division of the Humanities and Social Sciences California

More information

Limits and Continuity

Limits and Continuity Limits and Continuity MATH 151 Calculus for Management J. Robert Buchanan Department of Mathematics Fall 2018 Objectives After this lesson we will be able to: Determine the left-hand and right-hand limits

More information

Intelligent Systems I

Intelligent Systems I Intelligent Systems I 00 INTRODUCTION Stefan Harmeling & Philipp Hennig 24. October 2013 Max Planck Institute for Intelligent Systems Dptmt. of Empirical Inference Which Card? Opening Experiment Which

More information

WIDER Working Paper No. 2013/082

WIDER Working Paper No. 2013/082 WIDER Working Paper No. 2013/082 A graphical approximation to generalization Definitions and diagrams Fernando Martel García 1 and Leonard Wantchekon 2 September 2013 Abstract The fundamental problem of

More information