Machine learning: lecture 20. Tommi S. Jaakkola MIT CSAIL

Similar documents
Machine learning: lecture 20. Tommi S. Jaakkola MIT CSAIL

Introduction to Bayesian Networks

2 : Directed GMs: Bayesian Networks

Bayesian Networks. Vibhav Gogate The University of Texas at Dallas

Bayesian Networks. Vibhav Gogate The University of Texas at Dallas

Introduction to Artificial Intelligence. Unit # 11

Outline. CSE 473: Artificial Intelligence Spring Bayes Nets: Big Picture. Bayes Net Semantics. Hidden Markov Models. Example Bayes Net: Car

Directed Graphical Models or Bayesian Networks

Probability Theory and Simulation Methods

2 : Directed GMs: Bayesian Networks

Probabilistic Models

Bayesian Networks. Material used

Outline. CSE 573: Artificial Intelligence Autumn Bayes Nets: Big Picture. Bayes Net Semantics. Hidden Markov Models. Example Bayes Net: Car

CS 188: Artificial Intelligence Spring Announcements

Probabilistic Reasoning Systems

CSE 473: Artificial Intelligence Autumn 2011

Introduction to Bayesian Learning

Objectives. Probabilistic Reasoning Systems. Outline. Independence. Conditional independence. Conditional independence II.

Directed Graphical Models

Introduction to Artificial Intelligence Belief networks

Probabilistic Representation and Reasoning

Review: Bayesian learning and inference

Announcements. CS 188: Artificial Intelligence Spring Probability recap. Outline. Bayes Nets: Big Picture. Graphical Model Notation

University of Washington Department of Electrical Engineering EE512 Spring, 2006 Graphical Models

STATISTICAL METHODS IN AI/ML Vibhav Gogate The University of Texas at Dallas. Bayesian networks: Representation

Probability theory for Networks (Part 1) CS 249B: Science of Networks Week 02: Monday, 02/04/08 Daniel Bilar Wellesley College Spring 2008

CS 188: Artificial Intelligence Spring Announcements

CS839: Probabilistic Graphical Models. Lecture 2: Directed Graphical Models. Theo Rekatsinas

CS 188: Artificial Intelligence Fall 2009

Lecture 5: Bayesian Network

CS 5522: Artificial Intelligence II

Uncertainty. Introduction to Artificial Intelligence CS 151 Lecture 2 April 1, CS151, Spring 2004

Bayesian networks. Soleymani. CE417: Introduction to Artificial Intelligence Sharif University of Technology Spring 2018

Cognitive Systems 300: Probability and Causality (cont.)

CS 343: Artificial Intelligence

Lecture 10: Introduction to reasoning under uncertainty. Uncertainty

Quantifying uncertainty & Bayesian networks

Probabilistic Reasoning. (Mostly using Bayesian Networks)

Implementing Machine Reasoning using Bayesian Network in Big Data Analytics

TDT70: Uncertainty in Artificial Intelligence. Chapter 1 and 2

CS 188: Artificial Intelligence Fall 2008

Bayesian Networks Representation

CS Belief networks. Chapter

Announcements. CS 188: Artificial Intelligence Spring Bayes Net Semantics. Probabilities in BNs. All Conditional Independences

Cheng Soon Ong & Christian Walder. Canberra February June 2018

Bayesian Network. Outline. Bayesian Network. Syntax Semantics Exact inference by enumeration Exact inference by variable elimination

Directed and Undirected Graphical Models

Bayesian belief networks

Chap 4 Probability p227 The probability of any outcome in a random phenomenon is the proportion of times the outcome would occur in a long series of

COMP5211 Lecture Note on Reasoning under Uncertainty

CMPSCI 240: Reasoning about Uncertainty

Bayesian belief networks. Inference.

Preliminary Statistics Lecture 2: Probability Theory (Outline) prelimsoas.webs.com

Bayesian Learning. Bayesian Learning Criteria

PROBABILISTIC REASONING SYSTEMS

Bayesian Networks (Part II)

6.867 Machine learning, lecture 23 (Jaakkola)

This lecture. Reading. Conditional Independence Bayesian (Belief) Networks: Syntax and semantics. Chapter CS151, Spring 2004

Probabilistic Models. Models describe how (a portion of) the world works

CS 2750: Machine Learning. Bayesian Networks. Prof. Adriana Kovashka University of Pittsburgh March 14, 2016

Introduction to Bayesian Statistics

Bayes Networks. CS540 Bryan R Gibson University of Wisconsin-Madison. Slides adapted from those used by Prof. Jerry Zhu, CS540-1

Bayesian networks: Modeling

Figure 1: Bayesian network for problem 1. P (A = t) = 0.3 (1) P (C = t) = 0.6 (2) Table 1: CPTs for problem 1. C P (D = t) f 0.9 t 0.

Bayesian Networks. Semantics of Bayes Nets. Example (Binary valued Variables) CSC384: Intro to Artificial Intelligence Reasoning under Uncertainty-III

Uncertainty and Belief Networks. Introduction to Artificial Intelligence CS 151 Lecture 1 continued Ok, Lecture 2!

Learning Bayesian Networks (part 1) Goals for the lecture

Bayesian Networks Introduction to Machine Learning. Matt Gormley Lecture 24 April 9, 2018

Machine Learning for Data Science (CS4786) Lecture 19

Graphical Models - Part I

15-780: Graduate Artificial Intelligence. Bayesian networks: Construction and inference

CS 484 Data Mining. Classification 7. Some slides are from Professor Padhraic Smyth at UC Irvine

Recall from last time. Lecture 3: Conditional independence and graph structure. Example: A Bayesian (belief) network.

Probabilistic representation and reasoning

Outline } Conditional independence } Bayesian networks: syntax and semantics } Exact inference } Approximate inference AIMA Slides cstuart Russell and

Data Mining 2018 Bayesian Networks (1)

Online Learning of Probabilistic Graphical Models

Bayesian networks (1) Lirong Xia

Introduction to Graphical Models. Srikumar Ramalingam School of Computing University of Utah

Bayesian networks. Chapter Chapter

A Tutorial on Learning with Bayesian Networks

Another look at Bayesian. inference

Bayesian networks. Chapter Chapter Outline. Syntax Semantics Parameterized distributions. Chapter

Introduction to Graphical Models. Srikumar Ramalingam School of Computing University of Utah

Markov properties for directed graphs

Intelligent Systems: Reasoning and Recognition. Reasoning with Bayesian Networks

Bayesian Networks. Motivation

Rapid Introduction to Machine Learning/ Deep Learning

ECE521 Tutorial 11. Topic Review. ECE521 Winter Credits to Alireza Makhzani, Alex Schwing, Rich Zemel and TAs for slides. ECE521 Tutorial 11 / 4

EE562 ARTIFICIAL INTELLIGENCE FOR ENGINEERS

Probabilistic representation and reasoning

Y1 Y2 Y3 Y4 Y1 Y2 Y3 Y4 Z1 Z2 Z3 Z4

A Brief Introduction to Graphical Models. Presenter: Yijuan Lu November 12,2004

Bayesian Networks: Construction, Inference, Learning and Causal Interpretation. Volker Tresp Summer 2016

Announcements. CS 188: Artificial Intelligence Fall Causality? Example: Traffic. Topology Limits Distributions. Example: Reverse Traffic

Bayesian networks. Chapter AIMA2e Slides, Stuart Russell and Peter Norvig, Completed by Kazim Fouladi, Fall 2008 Chapter 14.

Bayesian networks. Chapter Chapter

Example. Bayesian networks. Outline. Example. Bayesian networks. Example contd. Topology of network encodes conditional independence assertions:

Introduction to Probabilistic Reasoning. Image credit: NASA. Assignment

1 : Introduction. 1 Course Overview. 2 Notation. 3 Representing Multivariate Distributions : Probabilistic Graphical Models , Spring 2014

Transcription:

Machine learning: lecture 20 ommi. Jaakkola MI AI tommi@csail.mit.edu

Bayesian networks examples, specification graphs and independence associated distribution Outline ommi Jaakkola, MI AI 2

Bayesian networks Bayesian networks are directed acyclic graphs, where the nodes represent variables and directed edges capture dependencies "parent of x" A mixture model as a Bayesian network "i influences x" "i causes x" "x depends on i" i x P (i)p (x i) "child of i" ommi Jaakkola, MI AI 3

Bayesian networks Bayesian networks are directed acyclic graphs, where the nodes represent variables and directed edges capture dependencies "parent of x" A mixture model as a Bayesian network "i influences x" "i causes x" "x depends on i" i x P (i)p (x i) Graph semantics: "child of i" graph separation properties independence Association with probability distributions: independence family of distributions ommi Jaakkola, MI AI 4

Example A simple Bayesian network: coin tosses x 1 x 2 ommi Jaakkola, MI AI 5

Example A simple Bayesian network: coin tosses P (x 1 ) : x 1 x 2 P (x2 ) : ommi Jaakkola, MI AI 6

Example A simple Bayesian network: coin tosses P (x 1 ) : x 1 x 2 P (x2 ) : x 3 = same? ommi Jaakkola, MI AI 7

Example A simple Bayesian network: coin tosses P (x 1 ) : x 1 x 2 P (x2 ) : P (x 3 x 1, x 2 ) : x 3 = same? hh ht th tt y 1.0 0.0 0.0 1.0 n 0.0 1.0 1.0 0.0 ommi Jaakkola, MI AI 8

Example A simple Bayesian network: coin tosses P (x 1 ) : x 1 x 2 P (x2 ) : x 3 = same? hh ht th tt y 1.0 0.0 0.0 1.0 P (x 3 x 1, x 2 ) : n 0.0 1.0 1.0 0.0 wo levels of description 1. graph structure (dependencies, independencies) 2. associated probability distribution ommi Jaakkola, MI AI 9

Example cont d What can the graph alone tell us? x 1 x 2 x 3 = same? ommi Jaakkola, MI AI 10

Example cont d What can the graph alone tell us? x 1 x 2 x 3 = same? x 1 and x 2 are marginally independent ommi Jaakkola, MI AI 11

Example cont d What can the graph alone tell us? x 1 x 2 x 3 = same? x 1 and x 2 are marginally independent x 1 x 2 x 3 = same? x 1 and x 2 become dependent if we know x 3 (the dependence concerns our beliefs about the outcomes) ommi Jaakkola, MI AI 12

raffic example = X is nice? = traffic light = X decides to stop? = the other car turns left? = crash? ommi Jaakkola, MI AI 13

raffic example = X is nice? = traffic light = X decides to stop? = the other car turns left? = crash? ommi Jaakkola, MI AI 14

raffic example = X is nice? = traffic light = X decides to stop? = the other car turns left? = crash? ommi Jaakkola, MI AI 15

raffic example = X is nice? = traffic light = X decides to stop? = the other car turns left? = crash? ommi Jaakkola, MI AI 16

raffic example = X is nice? = traffic light = X decides to stop? = the other car turns left? = crash? ommi Jaakkola, MI AI 17

raffic example = X is nice? = traffic light = X decides to stop? = the other car turns left? = crash? If we only know that X decided to stop, can X s character (variable ) tell us anything about the other car turning (variable )? ommi Jaakkola, MI AI 18

Graph, independence, d-separation Are and independent given? ommi Jaakkola, MI AI 19

Graph, independence, d-separation Are and independent given? Definition: Variables and are D-separated given if separates them in the moralized ancestral graph ommi Jaakkola, MI AI 20

Graph, independence, d-separation Are and independent given? Definition: Variables and are D-separated given if separates them in the moralized ancestral graph original ommi Jaakkola, MI AI 21

Graph, independence, d-separation Are and independent given? Definition: Variables and are D-separated given if separates them in the moralized ancestral graph original ancestral ommi Jaakkola, MI AI 22

Graph, independence, d-separation Are and independent given? Definition: Variables and are D-separated given if separates them in the moralized ancestral graph original ancestral moralized ancestral ommi Jaakkola, MI AI 23

Graph, independence, d-separation Are and independent given? Definition: Variables and are D-separated given if separates them in the moralized ancestral graph original ancestral moralized ancestral ommi Jaakkola, MI AI 24

Graphs and distributions A graph is a compact representation of a large collection of independence properties ommi Jaakkola, MI AI 25

Graphs and distributions A graph is a compact representation of a large collection of independence properties heorem: Any probability distribution that is consistent with a directed graph G has to factor according to node given parents : d P (x G) = P (x i x pai ) i=1 where x pai are the parents of x i and d is the number of nodes (variables) in the graph. ommi Jaakkola, MI AI 26

Model Explaining away phenomenon Earthquake Burglary Radio report Alarm ommi Jaakkola, MI AI 27

Model Explaining away phenomenon Earthquake Burglary Radio report Alarm Evidence, competing causes Earthquake Burglary Radio report Alarm ommi Jaakkola, MI AI 28

Model Explaining away phenomenon Earthquake Burglary Radio report Alarm Evidence, competing causes Earthquake Burglary Radio report Alarm Additional evidence and explaining away Earthquake Burglary Radio report Alarm ommi Jaakkola, MI AI 29