Bayes Nets. CS 188: Artificial Intelligence Fall Example: Alarm Network. Bayes Net Semantics. Building the (Entire) Joint. Size of a Bayes Net

Similar documents
Announcements. CS 188: Artificial Intelligence Fall Example Bayes Net. Bayes Nets. Example: Traffic. Bayes Net Semantics

Product rule. Chain rule

Recap: Bayes Nets. CS 473: Artificial Intelligence Bayes Nets: Independence. Conditional Independence. Bayes Nets. Independence in a BN

CS 5522: Artificial Intelligence II

Outline. CSE 473: Artificial Intelligence Spring Bayes Nets: Big Picture. Bayes Net Semantics. Hidden Markov Models. Example Bayes Net: Car

CSE 473: Artificial Intelligence Autumn 2011

Bayes Nets: Independence

Announcements. CS 188: Artificial Intelligence Spring Bayes Net Semantics. Probabilities in BNs. All Conditional Independences

Announcements. CS 188: Artificial Intelligence Spring Probability recap. Outline. Bayes Nets: Big Picture. Graphical Model Notation

Bayesian Networks. Vibhav Gogate The University of Texas at Dallas

Artificial Intelligence Bayes Nets: Independence

Probabilistic Models. Models describe how (a portion of) the world works

Bayesian Networks. Vibhav Gogate The University of Texas at Dallas

CS 188: Artificial Intelligence Spring Announcements

CS 5522: Artificial Intelligence II

CS 343: Artificial Intelligence

CS 188: Artificial Intelligence Fall 2008

CS 188: Artificial Intelligence Fall 2009

Bayesian networks. Soleymani. CE417: Introduction to Artificial Intelligence Sharif University of Technology Spring 2018

Outline. CSE 573: Artificial Intelligence Autumn Bayes Nets: Big Picture. Bayes Net Semantics. Hidden Markov Models. Example Bayes Net: Car

Probabilistic Models

CS 188: Artificial Intelligence Spring Announcements

Bayesian networks (1) Lirong Xia

Announcements. CS 188: Artificial Intelligence Fall Causality? Example: Traffic. Topology Limits Distributions. Example: Reverse Traffic

Bayes Nets III: Inference

CSE 473: Ar+ficial Intelligence. Hidden Markov Models. Bayes Nets. Two random variable at each +me step Hidden state, X i Observa+on, E i

Bayesian networks. Chapter Chapter

Bayesian networks. Chapter 14, Sections 1 4

Directed Graphical Models

Bayesian Networks BY: MOHAMAD ALSABBAGH

Artificial Intelligence Bayes Nets

CS 2750: Machine Learning. Bayesian Networks. Prof. Adriana Kovashka University of Pittsburgh March 14, 2016

PROBABILISTIC REASONING SYSTEMS

Review: Bayesian learning and inference

Informatics 2D Reasoning and Agents Semester 2,

Bayes Networks. CS540 Bryan R Gibson University of Wisconsin-Madison. Slides adapted from those used by Prof. Jerry Zhu, CS540-1

CS 343: Artificial Intelligence

Probabilistic Reasoning. (Mostly using Bayesian Networks)

CS 188: Artificial Intelligence. Bayes Nets

Introduction to Artificial Intelligence Belief networks

Probability. CS 3793/5233 Artificial Intelligence Probability 1

Objectives. Probabilistic Reasoning Systems. Outline. Independence. Conditional independence. Conditional independence II.

Based on slides by Richard Zemel

Probabilistic Reasoning Systems

Bayesian Network. Outline. Bayesian Network. Syntax Semantics Exact inference by enumeration Exact inference by variable elimination

Quantifying uncertainty & Bayesian networks

CS 380: ARTIFICIAL INTELLIGENCE UNCERTAINTY. Santiago Ontañón

CS Lecture 3. More Bayesian Networks

CS 484 Data Mining. Classification 7. Some slides are from Professor Padhraic Smyth at UC Irvine

CSEP 573: Artificial Intelligence

Bayesian Networks. Semantics of Bayes Nets. Example (Binary valued Variables) CSC384: Intro to Artificial Intelligence Reasoning under Uncertainty-III

Introduction to Artificial Intelligence. Unit # 11

14 PROBABILISTIC REASONING

Bayesian Networks. Motivation

Lecture 10: Introduction to reasoning under uncertainty. Uncertainty

EE562 ARTIFICIAL INTELLIGENCE FOR ENGINEERS

Bayesian Networks. Axioms of Probability Theory. Conditional Probability. Inference by Enumeration. Inference by Enumeration CSE 473

Another look at Bayesian. inference

Directed Graphical Models

Recall from last time. Lecture 3: Conditional independence and graph structure. Example: A Bayesian (belief) network.

Bayes Net Representation. CS 188: Artificial Intelligence. Approximate Inference: Sampling. Variable Elimination. Sampling.

15-780: Graduate Artificial Intelligence. Bayesian networks: Construction and inference

Bayesian belief networks

Introduction to Bayes Nets. CS 486/686: Introduction to Artificial Intelligence Fall 2013

Uncertainty and Bayesian Networks

CSE 473: Artificial Intelligence Probability Review à Markov Models. Outline

qfundamental Assumption qbeyond the Bayes net Chain rule conditional independence assumptions,

Bayes Networks 6.872/HST.950

Probabilistic Reasoning. Kee-Eung Kim KAIST Computer Science

Midterm 2 V1. Introduction to Artificial Intelligence. CS 188 Spring 2015

Implementing Machine Reasoning using Bayesian Network in Big Data Analytics

This lecture. Reading. Conditional Independence Bayesian (Belief) Networks: Syntax and semantics. Chapter CS151, Spring 2004

Axioms of Probability? Notation. Bayesian Networks. Bayesian Networks. Today we ll introduce Bayesian Networks.

Uncertainty. Introduction to Artificial Intelligence CS 151 Lecture 2 April 1, CS151, Spring 2004

Lecture 8. Probabilistic Reasoning CS 486/686 May 25, 2006

Recall from last time: Conditional probabilities. Lecture 2: Belief (Bayesian) networks. Bayes ball. Example (continued) Example: Inference problem

Bayesian networks: Modeling

Uncertainty and Belief Networks. Introduction to Artificial Intelligence CS 151 Lecture 1 continued Ok, Lecture 2!

Outline. CSE 573: Artificial Intelligence Autumn Agent. Partial Observability. Markov Decision Process (MDP) 10/31/2012

Introduction to Causal Calculus

Guest Speaker. CS 416 Artificial Intelligence. First-order logic. Diagnostic Rules. Causal Rules. Causal Rules. Page 1

CMPSCI 240: Reasoning about Uncertainty

CMPT Machine Learning. Bayesian Learning Lecture Scribe for Week 4 Jan 30th & Feb 4th

Bayesian Networks. Philipp Koehn. 6 April 2017

Conditional probabilities and graphical models

What Causality Is (stats for mathematicians)

Learning Bayesian Networks (part 1) Goals for the lecture

COMP5211 Lecture Note on Reasoning under Uncertainty

CS 188: Artificial Intelligence Fall 2011

Markov Networks.

COS402- Artificial Intelligence Fall Lecture 10: Bayesian Networks & Exact Inference

Cavities. Independence. Bayes nets. Why do we care about variable independence? P(W,CY,T,CH) = P(W )P(CY)P(T CY)P(CH CY)

Probabilistic representation and reasoning

Covariance. if X, Y are independent

last two digits of your SID

Example. Bayesian networks. Outline. Example. Bayesian networks. Example contd. Topology of network encodes conditional independence assertions:

Graphical Models - Part I

Outline. Bayesian networks. Example. Bayesian networks. Example contd. Example. Syntax. Semantics Parameterized distributions. Chapter 14.

2 : Directed GMs: Bayesian Networks

Probabilistic representation and reasoning

Transcription:

CS 188: Artificial Intelligence Fall 2010 Lecture 15: ayes Nets II Independence 10/14/2010 an Klein UC erkeley A ayes net is an efficient encoding of a probabilistic model of a domain ayes Nets Questions we can ask: Inference: given a fixed N, what is P( e)? epresentation: given a N graph, what kinds of distributions can it encode? Modeling: what N is most appropriate for a given domain? 3 ayes Net Semantics Example: Network Let s formalize the semantics of a ayes net A set of nodes, one per variable A directed, acyclic graph A conditional distribution for each node A collection of distributions over, one for each combination of parents values CP: conditional probability table escription of a noisy causal process A ayes net = opology (graph) + Local Conditional Probabilities A 1 A n 4 P() +b 0.001 b 0.999 urglary John calls A J P(J A) +a +j 0.9 +a j 0.1 a +j 0.05 a j 0.95 Earthqk Mary calls A M P(M A) +a +m 0.7 +a m 0.3 a +m 0.01 a m 0.99 E P(E) +e 0.002 e 0.998 E A P(A,E) +b +e +a 0.95 +b +e a 0.05 +b e +a 0.94 +b e a 0.06 b +e +a 0.29 b +e a 0.71 b e +a 0.001 b e a 0.999 uilding the (Entire) Joint We can take a ayes net and build any entry from the full joint distribution it encodes ypically, there s no reason to build ALL of it We build what we need on the fly o emphasize: every N over a domain implicitly defines a joint distribution over that domain, specified by local probabilities and graph structure 6 Size of a ayes Net How big is a joint distribution over N oolean variables? 2 N How big is an N-node net if nodes have up to k parents? O(N * 2 k+1 ) oth give you the power to calculate Ns: Huge space savings! Also easier to elicit local CPs Also turns out to be faster to answer queries (coming) 7 1

ayes Nets So Far We now know: What is a ayes net? What joint distribution does a ayes net encode? Now: properties of that joint distribution (independence) Key idea: conditional independence Last class: assembled Ns using an intuitive notion of conditional independence as causality oday: formalize these ideas Main goal: answer queries about conditional independence and influence Next: how to compute posteriors quickly (inference) 8 Conditional Independence eminder: independence and are independent if and are conditionally independent given (Conditional) independence is a property of a distribution 9 Example: Independence opology Limits istributions For this graph, you can fiddle with θ (the CPs) all you want, but you won t be able to represent any distribution in which the flips are dependent! 1 2 All distributions 10 Given some graph topology G, only certain joint distributions can be encoded he graph structure guarantees certain (conditional) independences (here might be more independence) Adding arcs increases the set of distributions, but has several costs Full conditioning can encode any distribution 11 Independence in a N Important question about a N: Are two nodes independent given certain evidence? If yes, can prove using algebra (tedious in general) If no, can prove with a counter example Example: Causal Chains his configuration is a causal chain Is independent of given? : Low pressure : ain : raffic Question: are and necessarily independent? Answer: no. Example: low pressure causes rain, which causes traffic. can influence, can influence (via ) Addendum: they could be independent: how?! Evidence along the chain blocks the influence 13 2

Common Cause Common Effect Another basic configuration: two effects of the same cause Are and independent? Are and independent given? Observing the cause blocks influence between effects.! : Project due : Newsgroup busy : Lab full 14 Last configuration: two causes of one effect (v-structures) Are and independent? : the ballgame and the rain cause traffic, but they are not correlated Still need to prove they must be (try it!) Are and independent given? No: seeing traffic puts the rain and the ballgame in competition as explanation? his is backwards from the other cases Observing an effect activates influence between possible causes. : aining : allgame : raffic 15 he General Case eachability Any complex example can be analyzed using these three canonical cases General question: in a given N, are two variables independent (given evidence)? ecipe: shade evidence nodes Attempt 1: if two nodes are connected by an undirected path not blocked by a shaded node, they are conditionally independent L Solution: analyze the graph Almost works, but not quite Where does it break? Answer: the v-structure at doesn t count as a link in a path unless active 16 17 eachability (-Separation) Example Question: Are and conditionally independent given evidence vars {}?, if and separated by Look for active paths from to No active paths = independence! Active riples Inactive riples A path is active if each triple is active: Causal chain A C where is unobserved (either direction) Common cause A C where is unobserved Common effect (aka v-structure) A C where or one of its descendents is observed All it takes to block a path is a single inactive segment [emo] 19 3

Example Example L Variables: : aining : raffic : oof drips S: I m sad Questions: S 20 21 Causality? When ayes nets reflect the true causal patterns: Often simpler (nodes have fewer parents) Often easier to think about Often easier to elicit from experts Example: raffic asic traffic net Let s multiply out the joint Ns need not actually be causal Sometimes no causal net exists over the domain E.g. consider the variables raffic and rips End up with arrows that reflect correlation, not causation What do the arrows really mean? opology may happen to encode causal structure opology only guaranteed to encode conditional independence r 1/4 r 3/4 r t 3/4 t 1/4 r t 3/16 r t 1/16 r t 6/16 r t 6/16 r t 1/2 22 t 1/2 23 Example: everse raffic everse causality? Example: Coins Extra arcs don t prevent representing independence, just allow non-independence t 9/16 t 7/16 t r 1/3 r 2/3 t r 1/7 r 6/7 r t 3/16 r t 1/16 r t 6/16 r t 6/16 24 1 2 Adding unneeded arcs isn t wrong, it s just inefficient 1 2 h t h t 25 4

Changing ayes Net Structure Example: Alternate he same joint distribution can be encoded in many different ayes nets Causal structure tends to be the simplest urglary Earthquake If we reverse the edges, we make different conditional independence assumptions Analysis question: given some edges, what other edges do you need to add? One answer: fully connect the graph etter answer: don t make any false conditional independence assumptions John calls Mary calls John calls Mary calls 26 o capture the same joint distribution, we have to add more edges to the graph urglary Earthquake 27 Summary ayes nets compactly encode joint distributions Guaranteed independencies of distributions can be deduced from N graph structure -separation gives precise conditional independence guarantees from graph alone A ayes net s joint distribution may have further (conditional) independence that is not detectable until you inspect its specific distribution 28 5