Instructor: Sudeepa Roy

Similar documents
Actual Causality: A Survey

Decision Making in Robots and Autonomous Agents

Causes and Explanations: A Structural-Model Approach. Part I: Causes

Quantifying Causal Effects on Query Answering in Databases

From Causality, Second edition, Contents

Cause, Responsibility, and Blame: A Structural-Model Approach

The problem of disjunctive causal factors. Following Hitchcock: fix K and do everything within a single cell K (which we don t mention).

Pearl s Causality In a Logical Setting

CAUSALITY. Models, Reasoning, and Inference 1 CAMBRIDGE UNIVERSITY PRESS. Judea Pearl. University of California, Los Angeles

ARTIFICIAL INTELLIGENCE

Structural Equations and Beyond

CAUSAL INFERENCE IN THE EMPIRICAL SCIENCES. Judea Pearl University of California Los Angeles (

Bounding the Probability of Causation in Mediation Analysis

OF CAUSAL INFERENCE THE MATHEMATICS IN STATISTICS. Department of Computer Science. Judea Pearl UCLA

Causes and Explanations: A Structural-Model Approach. Part II: Explanations

Cognitive Systems 300: Probability and Causality (cont.)

Lecture 31 Woodward on Actual Causes

Causes and Explanations: A Structural-Model Approach Part II: Explanations

Tools for causal analysis and philosophical theories of causation. Isabelle Drouet (IHPST)

Defaults and Normality in Causal Structures

OUTLINE THE MATHEMATICS OF CAUSAL INFERENCE IN STATISTICS. Judea Pearl University of California Los Angeles (

Lecture 34 Woodward on Manipulation and Causation

CS 5522: Artificial Intelligence II

Computational Complexity of Bayesian Networks

CMPT Machine Learning. Bayesian Learning Lecture Scribe for Week 4 Jan 30th & Feb 4th

From Causal Models To Counterfactual Structures

CAUSES AND EXPLANATIONS IN THE STRUCTURAL-MODEL APPROACH: TRACTABLE CASES

Causal Bayesian networks. Peter Antal

Causal Bayesian networks. Peter Antal

COMP219: Artificial Intelligence. Lecture 19: Logic for KR

Propositional Logic. Spring Propositional Logic Spring / 32

Statistical Models for Causal Analysis

OUTLINE CAUSAL INFERENCE: LOGICAL FOUNDATION AND NEW RESULTS. Judea Pearl University of California Los Angeles (

Appendix A Lewis s Counterfactuals

INTRODUCTION TO LOGIC 3 Formalisation in Propositional Logic

Discrete Mathematics and Applications COT3100

Bayesian networks as causal models. Peter Antal

Discrete Structures of Computer Science Propositional Logic I

CS 188: Artificial Intelligence Fall 2008

COMP219: Artificial Intelligence. Lecture 19: Logic for KR

Truth Table Definitions of Logical Connectives

Counterfactual Reasoning in Algorithmic Fairness

Causality II: How does causal inference fit into public health and what it is the role of statistics?

Knowledge representation DATA INFORMATION KNOWLEDGE WISDOM. Figure Relation ship between data, information knowledge and wisdom.

CS 5522: Artificial Intelligence II

David Lewis. Void and Object

The Computational Complexity of Structure-Based Causality

THE MATHEMATICS OF CAUSAL INFERENCE With reflections on machine learning and the logic of science. Judea Pearl UCLA

Logic, Human Logic, and Propositional Logic. Human Logic. Fragments of Information. Conclusions. Foundations of Semantics LING 130 James Pustejovsky

Warm-Up Problem. Is the following true or false? 1/35

2/2/2018. CS 103 Discrete Structures. Chapter 1. Propositional Logic. Chapter 1.1. Propositional Logic

cis32-ai lecture # 18 mon-3-apr-2006

CompSci Understanding Data: Theory and Applications

Logic. Propositional Logic: Syntax

Econometric Causality

1 Propositional Logic

Introduction to Artificial Intelligence Propositional Logic & SAT Solving. UIUC CS 440 / ECE 448 Professor: Eyal Amir Spring Semester 2010

Propositional logic ( ): Review from Mat 1348

Philosophy 5340 Epistemology. Topic 3: Analysis, Analytically Basic Concepts, Direct Acquaintance, and Theoretical Terms. Part 2: Theoretical Terms

Graded Causation and Defaults

REASONING WITH CAUSE AND EFFECT

Machine Learning

CAUSALITY: FROM HUME, VIA SCHLICK TO LEWIS* Christian Damböck Institute Vienna Circle University of Vienna

EPISTEMOLOGY, COMPUTATION AND THE LAWS OF PHYSICS

Completeness in the Monadic Predicate Calculus. We have a system of eight rules of proof. Let's list them:

Solvers for the Problem of Boolean Satisfiability (SAT) Will Klieber Aug 31, 2011

Probabilistic Models

Overview of Today s Lecture

AI Programming CS S-09 Knowledge Representation

PROBABILITIES OF CAUSATION: THREE COUNTERFACTUAL INTERPRETATIONS AND THEIR IDENTIFICATION

CHAPTER 2 INTRODUCTION TO CLASSICAL PROPOSITIONAL LOGIC

Intermediate Logic. Natural Deduction for TFL

Indicative conditionals

Product rule. Chain rule

CAUSAL GAN: LEARNING CAUSAL IMPLICIT GENERATIVE MODELS WITH ADVERSARIAL TRAINING

Comments on The Role of Large Scale Assessments in Research on Educational Effectiveness and School Development by Eckhard Klieme, Ph.D.

Naïve Bayes Classifiers

CSC321 Lecture 18: Learning Probabilistic Models

CHAPTER 1 - LOGIC OF COMPOUND STATEMENTS

Bayesian networks (1) Lirong Xia

Propositional Logic. Fall () Propositional Logic Fall / 30

The statement calculus and logic

CS188 Outline. CS 188: Artificial Intelligence. Today. Inference in Ghostbusters. Probability. We re done with Part I: Search and Planning!

CS 320, Fall Dr. Geri Georg, Instructor 320 NP 1

ANALYTIC COMPARISON. Pearl and Rubin CAUSAL FRAMEWORKS

Treatment Effects. Christopher Taber. September 6, Department of Economics University of Wisconsin-Madison

KB Agents and Propositional Logic

CS 343: Artificial Intelligence

Discrete Mathematics and Probability Theory Fall 2014 Anant Sahai Note 15. Random Variables: Distributions, Independence, and Expectations

Tracing Data Errors Using View-Conditioned Causality

Causes and Explanations: A Structural-Model Approach Part 1: Causes

a. See the textbook for examples of proving logical equivalence using truth tables. b. There is a real number x for which f (x) < 0. (x 1) 2 > 0.

CS 188: Artificial Intelligence Fall 2009

PROPOSITIONAL CALCULUS

Overview. Knowledge-Based Agents. Introduction. COMP219: Artificial Intelligence. Lecture 19: Logic for KR

I N F S Y S R E S E A R C H R E P O R T

Propositional Logic: Syntax

Bayes Nets: Independence

Bayesian Networks: Independencies and Inference

Numbers that are divisible by 2 are even. The above statement could also be written in other logically equivalent ways, such as:

Transcription:

CompSci 590.6 Understanding Data: Theory and Applications Lecture 15 Causality in AI Instructor: Sudeepa Roy Email: sudeepa@cs.duke.edu 1

Today s Reading 1. Causes and Explanations: A Structural- Model Approach Part I: Causes Halpern- Pearl, UAI 2001 2. Causes and Explanations: A Structural- Model Approach Part Ii: Explanations Halpern- Pearl, IJCAI 2001 3. Probabilities of Causation: Three Counterfactual Interpretations and their Identification Pearl 1999, UCLA Technical Report R- 260 A revised version in Tian- Pearl, 2000, UCLA Technical Report R- 271- A Also see numerous surveys/talks by Pearl An excellent survey of history of causality can be found in his lecture slides (with transcripts): 1996 Faculty Research Lecture: The Art and Science of Cause and Effect 2

Acknowledgement: Some of the following slides are from the tutorial Causality and Explanations in Databases, VLDB 2014, Meliou- Roy- Suciu, and are due to Dr. Alexandra Meliou, University of Massachusetts- Amherst. 3

Why Care About Causality? Databases Why is my program taking so long? Why do/don t I see this result? Education Does this special program in a high school encourage higher studies? Politics Why did X win the presidential election in year Y? Healthcare/Medicine Why did patient X survive cancer while patient Y did not? Does this drug help cure melanoma? Agriculture Does this fertilizer help increase plant growth? Social science Does the new housing initiative encourage a population buy houses? 4

Causality in science Science seeks to understand and explain physical observations Why doesn t the wheel turn? What if I make the beam half as thick, will it carry the load? How do I shape the beam so it will carry the load? 5

But laws in science do not tell us about causality F = m a F Does acceleration cause the force? Does the force cause the acceleration? Does the force cause the mass? 6

Today: Causality in AI Pearl s causality model Next two lectures Adoption of Pearl s model for causality in databases Causality in Statistics (Rubin s potential outcome model) 7

Pearl 1996, 1999 Hume s Legacy Analytical vs. Empirical Claims Analytical = product of thoughts Empirical = matter of facts David Hume (1711-1776) Scottish Philosopher The author of A Treatise of Human Nature (1738) Causal claims are empirical human experience is the source Thus we remember to have seen that species of object we call Flame, and to have felt that species of sensation we call Heat. We likewise call to mind their constant conjunction in all past instances. Without any farther ceremony, we call the one Cause and the other Effect, and infer the existence of the one from that of the other. Leads to two riddles of causation 8

Pearl 1996, 1999 Two Riddles of Causation 1. Learning of causal connection What empirical evidence legitimizes a cause- effect connection? How do people ever acquire knowledge of causation e.g. does a rooster cause the sun to rise? succession, correlations are not sufficient e.g. roosters crow before dawn, both ice cream sales and crime rate increase at the same time (in summer months) 2. Usage of causal connection What inferences can be drawn from causal information and how? e.g. what would change if the rooster were to cause the sun to rise, can we make the night shorter by waking him up early? Major focus of Pearl s work is in (2) More of (1) in Rubin s model 9

Main Concepts in Pearl s model Concept Causation Intervention Mechanisms Formalization Encoding of behavior under intervention Surgeries on mechanisms Functional relationships by equations and graphs Devise a computational scheme for causality to facilitate prediction of the effects of actions Use Intervention for Action as actions are external entities originating outside the theory 10

Main Concepts in Pearl s model Concept Causation Intervention Mechanisms Formalization Encoding of behavior under intervention Surgeries on mechanisms Functional relationships by equations and graphs Mechanism: Autonomous physical laws or mechanisms of interest we can change one without changing the others e.g. logic gates of a circuit, mechanical linkages in a machine 11

Main Concepts in Pearl s model Concept Causation Intervention Mechanisms Formalization Encoding of behavior under intervention Surgeries on mechanisms Functional relationships by equations and graphs Intervention Breakdown of a mechanism = surgery 12

Main Concepts in Pearl s model Concept Causation Intervention Mechanisms Formalization Encoding of behavior under intervention Surgeries on mechanisms Functional relationships by equations and graphs Causality Which mechanism is to be surgically modified by a given action Q. Why are these non- trivial? A. A number of factors to take into account 13

Pearl 1996, 1999 Example- 1 If the grass is wet, then it rained If we break this bottle, the grass gets wet Conclusion If we break this bottle, then it rained (!) 14

Example- 2 Lin 1995 Pearl 1999 A suitcase will open iff both locks are open The right lock is open What happens if we open the left lock? Not sure the right lock might get closed (!) 15

Overview: Causal Model Action sentences B (would be true) if we do A Counterfactuals B would change to B (B would be different) if it were A Explanation B occurred because of A 16

Overview: Causal Model Extension Action sentences B if we do A With probability p Counterfactuals B would change to B if it were A With probability p Explanation B occurred because of A With probability p 17

Pearl s Model: At a glance Modeling causality Causal networks and structural equations Reasoning about causality Counterfactual causes Actual causes (Halpern & Pearl) Measuring causality Responsibility Probability of necessity, Probability of sufficiency 18

Modeling Causality Causal Model M = (U, V, F) U = Exogenous variables Values are determined by factors outside the model V = Endogenous variables values are described by structural equations F is a set of structural equations {F X X in V} (endogenous) F X is a mapping, tells us the value of X given the values of all the other variables in U and V represents a mechanism or law in the world 19

Example A forest fire could be caused by either lightning or a lit match by an arsonist Endogenous variables (Boolean) F for fire L for lightning ML for match lit Exogenous variables U whether the wood is dry whether there is enough oxygen in the air F F (U, L, ML) is such that F = 1 if L= 1 or ML = 1 20

Structural Equations as Pearl 2000 Causal networks Causal structural models: Variables: A, B, Y Structural equations: Y = A v B A=1 B =1 Y =A_B Modeling problems: E.g., A bottle breaks if either Alice or Bob throw a rock at it. Endogenous variables: Alice throws a rock (A) Bob throws a rock (B) The bottle breaks (Y) Exogenous variables: Alice s aim, speed of the wind, bottle material etc. 21

Woodward 2003, Hagmeyer 2007 Intervention / contingency External interventions modify the structural equations or values of the variables. A=1 A=1 Y =A_Y 1 Y =A_Y 1 B =1 Y 1 =ĀB B =1 Y 1 =0 Intervention on Y 1 : Y 1 =0 22

Counterfactual vs. Actual Cause Is Counterfactual the right notion of cause? Example 1: Fire A approaches house. The house is burnt. In absence of the fire A, the house would not be burnt. Fire A is the cause Example 2: Fire A and B approach house. Fire A reaches first. The house is burnt. In absence of the fire A, the house would still be burnt. But still Fire A will be a cause (e.g. for legal purposes, replace Fire by a Burglar) Not counterfactual but Cause in Fact Fire B = 0 is the contingency Resemblance with necessary and sufficient conditions 23

Hume 1748, Menzies 2008, Lewis 1973 Counterfactual Cause If not A then not φ In the absence of a cause, the effect doesn t occur C = A B, A =1 B =1 Both counterfactual Problem: Disjunctive causes If Alice doesn t throw a rock, the bottle still breaks (because of Bob) Neither Alice nor Bob are counterfactual causes C = A B, A =1 B =1 No counterfactual causes 24

Halpern- Pearl 2001, Halpern- Pearl 2005 Actual Cause [simplification] A variable X is an actual cause of an effect Y if there exists a contingency that makes X counterfactual for Y. C = A B, A =1 B =1 A is a cause under the contingency B=0 25

A Formal Definition of Actual Cause Actual causes are of the form X 1 = x 1 X 2 = x 2.. X k = X k In short, X = x For X = x to be an actual cause of event Z The following three conditions should hold Both X = x and Z are true in the actual world Changing X to x and some other variables W from w to w changes Z from true to false Setting W to w does not have an effect on Z X is minimal- no subset of X satisfies the above two conditions 26

Example 1 Y = X 1 ^ X 2 X 1 = 1 is counterfactual for Y = 1 X 1 = 1, X 2 = 1 Y = 1 X 1 = 0, X 2 = 1 Y = 0 Example 2 Y = X 1 _ X 2 X 1 = 1 is not counterfactual for Y=1 X 1 = 1, X 2 = 1 Y = 1 X 1 = 1, X 2 = 0 Y = 1 X 1 = 0, X 2 = 0 Y = 0 X 1 = 1 is an actual cause for Y = 1, with contingency X 2 = 0 Example 3 Y =( X 1 ^ X 2 ) _ X 3 X 1 = 1 is not counterfactual for Y = 1 X 1 = 1 is not an actual cause for Y = 1 X 1 = 1, X 2 = 1, X 3 = 1 Y = 1 X 1 = 0, X 2 = 1, X 3 = 1 Y = 1 X 1 = 1, X 2 = 0, X 3 = 1 Y = 1 X 1 = 0, X 2 = 0, X 3 = 1 Y = 1. Y never changes by flipping X 1 for all combinations of X 2, X 3 27

Is the causality definition circular? Are we assuming causes and then inferring causes? No! Causal model represents physical laws or potential causes The goal is to find the cause of a single event in terms of values assigned to the variables e.g. whether arson caused the fire on 6/10/2000, given what is known or assumed about this particular fire The causes are (variable, value) pairs, assuming a causal network on the variables Difference with Rubin s causal model and causal inference in statistics by randomized experiments or observational data 28

Chockler- Halpern 2004 Responsibility A measure of the degree of causality = 1 1 + min size of the contingency set Example Y = A ^ (B _ C) A = B = C =1) Y =1 A=1 is counterfactual for Y=1 (ρ=1) B=1 is an actual cause for Y=1, with contingency C=0 (ρ=0.5) 29

Eiter- Lukasiewicz 2002 Complexity Actual Cause NP- complete for binary variables Σ 2P - complete for non- binary variables Proof sketch: Reduction from SAT. Given F, F is satisfiable iff X is an actual cause for X F 30

Tian- Pearl 2000 Submodel Model M = (U, V, F) exogenous variables U endogenous variables V structural equations F Submodel M X Set endogenous variables X to x as constants in F X Remaining variables/equations remain the same do(x = x) do algebra of Pearl 31

Tian- Pearl 2000 Potential Response in a submodel Model (U, V, F) Y = an endogenous variable in V The solution of Y from the structural equations F X gives the potential response of the action X = x Denoted by Y X = The value that Y would have obtained, had X been x 32

Probabilistic Causal Model Tian- Pearl 2000 A pair (M, P(u)) P(u) is a probability function defined over the exogenous variables U Each endogenous variable in V is a function of exogenous variables U also gives a distribution on V In turn gives the probability of counter factual statement Pr(Y X=x = y) or simply Pr(Y X = y) 33

Tian- Pearl 2000 Probabilistic model probability of necessity = Pr(Y X=x = y X = x, Y = y) = Pr(y x x, y) - - - - in short the probability that event y would not have occurred in the absence of event x, (= y x ), given that x and y did in fact occur probability of sufficiency = Pr(Y X=x = y X = x, Y = y ) = Pr(y x x, y ) - - - - in short the probability that setting x would produce y in a situation where x and y are in fact absent captures the capacity of x to produce y 34

Pearl 1999 Example: Conditional Probability vs. Action X 1 X 3 X 2 X 4 X 5 SEASON SPRINKLER RAIN WET SLIPPERY Structural equations x 1 = u 1 x 2 = f(x 1, u 2 ) x 3 = f(x 1, u 3 ) x 4 = f(x 2, x 3, u 4 ) x 5 = f(x 4, u 5 ) Exogenous vars U = {u 1, u 2, u 3, u 4, u 5 } e.g. u 4 = a pipe is broken assumed to be independent 35

Pearl 1999 Example: Conditional Probability vs. Action X 1 SEASON Joint probability distribution Pr(x 1, x 2, x 3, x 4, x 5 ) SPRINKLER X 3 X 2 RAIN = Pr(x 1 ) Pr(x 2 x 1 ) Pr(x 3 x 1 ) Pr(x 4 x 3, x 2 ) Pr(x 5 x 4 ) X 4 WET X 5 SLIPPERY 36

Pearl 1999 Example: Conditional Probability vs. Action X 1 SEASON Joint probability distribution on observing X 3 = ON Pr(x 1, x 2, x 3, x 4, x 5 x 3 = 1) X 3 X 2 = ON SPRINKLER RAIN X 4 X 5 WET SLIPPERY = Pr(x 1, x 2, x 3 = 1, x 4, x 5 ) / Pr(x 3 = 1) = [Pr(x 1 ) Pr(x 2 x 1 ) Pr(x 3 =1 x 1 ) Pr(x 4 x 3 =1, x 2 ) Pr(x 5 x 4 )] / Pr(x 3 = 1).. Equation (1) 37

Pearl 1999 Example: Conditional Probability vs. Action The link is broken (surgery!) SPRINKLER X 1 X 3 X 2 = ON X 4 SEASON WET RAIN Structural equations for the action do (X 3 = ON) x 1 = u 1 x 2 = f(x 1, u 2 ) x 3 = 1 x 4 = f(x 2, x 3, u 4 ) x 5 = f(x 4, u 5 ) X 5 SLIPPERY 38

Pearl 1999 Example: Conditional Probability vs. Action X 1 SEASON Joint probability distribution on action do (X 3 = ON) Pr(x 1, x 2, x 3, x 4, x 5 do (X 3 = ON) X 3 X 2 = ON SPRINKLER RAIN = Pr(x 1 ) Pr(x 2 x 1 ) Pr(x 4 x 3 =1, x 2 ) Pr(x 5 x 4 )].. Equation (2) X 4 WET x 3 is treated as a constant X 5 SLIPPERY 39

Computing probability of a Pearl 1999 counterfactual sentence Equation 2 and 1 are very different But the probability in equation 2 can still be computed if the causal graph and probabilities are available However, sometimes the graph and probabilities are not sufficient may need the functional forms f i in structural equations e.g. the pavement would be slippery if the sprinkler were off, given that currently the pavement is slippery 40

Computing probability of a Pearl 1999 counterfactual sentence In general, the conditional probability of a counterfactual sentence If it were A then B, given evidence e can be computed in three steps: Abduction update P (u) by the evidence e, to obtain P (u e). Action Modify M by the action do(a), where A is the antecedent of the counterfactual, to obtain the submodel M A. Deduction Use the updated probability P (u e) in conjunction with M A to compute the probability of the counterfactual consequence B. 41