Exact Inference Algorithms Bucket-elimination

Similar documents
Exact Inference Algorithms Bucket-elimination

COMPSCI 276 Fall 2007

Bayesian Networks: Representation, Variable Elimination

Capturing Independence Graphically; Undirected Graphs

Capturing Independence Graphically; Undirected Graphs

Sampling Algorithms for Probabilistic Graphical models

Variable Elimination (VE) Barak Sternberg

Inference and Representation

Reasoning with Deterministic and Probabilistic graphical models Class 2: Inference in Constraint Networks Rina Dechter

Bounded Inference Non-iteratively; Mini-Bucket Elimination

From Bayesian Networks to Markov Networks. Sargur Srihari

CS6220: DATA MINING TECHNIQUES

Implementing Machine Reasoning using Bayesian Network in Big Data Analytics

Probabilistic Reasoning; Network-based reasoning

Lecture 17: May 29, 2002

Inference in Graphical Models Variable Elimination and Message Passing Algorithm

Probabilistic Graphical Models (I)

Message Passing and Junction Tree Algorithms. Kayhan Batmanghelich

CS6220: DATA MINING TECHNIQUES

Context-specific independence Parameter learning: MLE

Probabilistic Reasoning; Network-based reasoning

Variable Elimination: Algorithm

Probabilistic Reasoning; Network-based reasoning

STATISTICAL METHODS IN AI/ML Vibhav Gogate The University of Texas at Dallas. Bayesian networks: Representation

Weighted Heuristic Anytime Search

Junction Tree, BP and Variational Methods

Intelligent Systems: Reasoning and Recognition. Reasoning with Bayesian Networks

Computational Complexity of Inference

Variable Elimination: Algorithm

STATISTICAL METHODS IN AI/ML Vibhav Gogate The University of Texas at Dallas. Markov networks: Representation

Probabilistic Partial Evaluation: Exploiting rule structure in probabilistic inference

Probabilistic Graphical Models. Guest Lecture by Narges Razavian Machine Learning Class April

Graphical Probability Models for Inference and Decision Making

4 : Exact Inference: Variable Elimination

Mini-Buckets: A General Scheme for Bounded Inference

Lecture 9: PGM Learning

Artificial Intelligence

{ p if x = 1 1 p if x = 0

Exact Inference I. Mark Peot. In this lecture we will look at issues associated with exact inference. = =

Machine Learning 4771

Machine Learning Lecture 14

UNCERTAIN KNOWLEDGE AND REASONING

Rapid Introduction to Machine Learning/ Deep Learning

Defining Things in Terms of Joint Probability Distribution. Today s Lecture. Lecture 17: Uncertainty 2. Victor R. Lesser

Algorithms for the Nearest Assignment Problem

15-780: Graduate Artificial Intelligence. Bayesian networks: Construction and inference

On finding minimal w-cutset problem

Introduc)on to Ar)ficial Intelligence

CS281A/Stat241A Lecture 19

Directed Graphical Models or Bayesian Networks

Statistical Approaches to Learning and Discovery

MACHINE LEARNING 2 UGM,HMMS Lecture 7

Undirected Graphical Models: Markov Random Fields

Graphical Models. Lecture 10: Variable Elimina:on, con:nued. Andrew McCallum

Improving Bound Propagation

Intro to AI: Lecture 8. Volker Sorge. Introduction. A Bayesian Network. Inference in. Bayesian Networks. Bayesian Networks.

Probability. CS 3793/5233 Artificial Intelligence Probability 1

Module 10. Reasoning with Uncertainty - Probabilistic reasoning. Version 2 CSE IIT,Kharagpur

COMP538: Introduction to Bayesian Networks

Bayesian networks. Chapter 14, Sections 1 4

Introduction to Bayesian Networks. Probabilistic Models, Spring 2009 Petri Myllymäki, University of Helsinki 1

Bayesian Networks. Exact Inference by Variable Elimination. Emma Rollon and Javier Larrosa Q

Introduction to Bayes Nets. CS 486/686: Introduction to Artificial Intelligence Fall 2013

Objectives. Probabilistic Reasoning Systems. Outline. Independence. Conditional independence. Conditional independence II.

Inference as Optimization

CS 484 Data Mining. Classification 7. Some slides are from Professor Padhraic Smyth at UC Irvine

Bayesian Machine Learning

Introduction to Bayesian Networks

Machine Learning 4771

Probabilistic Reasoning Systems

UNIVERSITY OF CALIFORNIA, IRVINE. Fixing and Extending the Multiplicative Approximation Scheme THESIS

Undirected Graphical Models 4 Bayesian Networks and Markov Networks. Bayesian Networks to Markov Networks

Variable and Value Ordering for MPE Search

Order Statistics for Probabilistic Graphical Models

Introduction to Probabilistic Reasoning. Image credit: NASA. Assignment

Quantifying uncertainty & Bayesian networks

Studies in Solution Sampling

Introduction to Bayesian Learning. Machine Learning Fall 2018

Lecture 15: MCMC Sanjeev Arora Elad Hazan. COS 402 Machine Learning and Artificial Intelligence Fall 2016

Part I. C. M. Bishop PATTERN RECOGNITION AND MACHINE LEARNING CHAPTER 8: GRAPHICAL MODELS

Complexity of Inference in Graphical Models

From Stochastic Planning to Marginal MAP

Bayesian networks. Chapter Chapter

Bayesian & Markov Networks: A unified view

EE562 ARTIFICIAL INTELLIGENCE FOR ENGINEERS

Final Exam December 12, 2017

Lecture 5: Bayesian Network

The Bucket Elimination Algorithm for Belief Net Inference R Greiner October 22, 2006

Introduction to Artificial Intelligence. Unit # 11

Bayesian Networks Inference with Probabilistic Graphical Models

Outline. CSE 573: Artificial Intelligence Autumn Agent. Partial Observability. Markov Decision Process (MDP) 10/31/2012

Graphical Models and Kernel Methods

Graph structure in polynomial systems: chordal networks

Approximating the Partition Function by Deleting and then Correcting for Model Edges (Extended Abstract)

Relax then Compensate: On Max-Product Belief Propagation and More

Arithmetic circuits of the noisy-or models

CSEP 573: Artificial Intelligence

Structure Learning: the good, the bad, the ugly

Bayesian Networks BY: MOHAMAD ALSABBAGH

Clique trees & Belief Propagation. Siamak Ravanbakhsh Winter 2018

Transcription:

Exact Inference Algorithms Bucket-elimination COMPSCI 276, Spring 2013 Class 5: Rina Dechter (Reading: class notes chapter 4, Darwiche chapter 6) 1

Belief Updating Smoking lung Cancer Bronchitis X-ray Dyspnoea P (lung cancer=yes smoking=no, dyspnoea=yes ) =? 2

Probabilistic Inference Tasks Belief updating: E is a subset {X1,,Xn}, Y subset X-E, P(Y=y E=e) P(e)? BEL(X i ) = P(Xi = xi evidence) Finding most probable explanation (MPE) x* = argmax P(x,e) Finding maximum a-posteriory hypothesis * * (a1,...,ak) = argmax P(x, e) a X/A Finding maximum-expected-utility (MEU) decision * * (d1,...,dk ) = argmax P(x, e)u(x) d X/D x A X : hypothesis variables D X : decision variables U( x) : utility function 4

Belief updating is NP-hard 5

A simple network Given: A B C D How can we compute P(D)?, P(D A=0)? P(A D=0)? Brute force O(k^4) Maybe O(4k^2) 6

Belief updating: P(X evidence)=? A P(a e=0) P(a,e=0)= B C e = 0, d, c, b P(a)P(b a)p(c a)p(d b,a)p(e b,c)= D E Moral graph P(a) e=0 d P(c a) c b P(b a)p(d b,a)p(e b,c) Variable Elimination h B ( a, d, c, e) 10

A B C D E 11

A B C D E 12

A B C D E Using a different 13

14

A Bayesian network ordering: C,B,E,D,G 20

A Bayesian network ordering: C,B,E,D,G 21

A different ordering 22

A Bayesian network processed along 2 orderings 23

A Bucket elimination Algorithm BE-bel (Dechter 1996) b bucket B: bucket C: bucket D: bucket E: bucket A: P(b a) P(d b,a) P(e b,c) P(c a) e=0 P(a) P(a e=0) h B (a,d,c,e) h C (a,d,e) h D (a,e) h E (a) Elimination operator W*=4 induced width (max clique size) D B B C D E A E C 24

BE-BEL 25

26

Student Network example Difficulty Intelligence P(J)? Grade SAT Letter Apply Job

E D C B A B C D E A 28

Complexity of elimination w * ( d ) O( n exp( w * ( d)) the induced width of moral graph along ordering d The effect of the ordering: A B C E D B C D C E B D E Moral graph A A * * w ( d 1 ) = 4 w ( d 2 ) = 2 31

BE-BEL More accurately: O(r exp(w*(d)) where r is the number of cpts. For Bayesian networks r=n. For Markov networks? 32

34

The impact of observations G G G G B B B B C C C C D D D D F F F F A A A A (a) (b) (c) (d) Ordered graph Induced graph Ordered conditioned graph 35

A B C D E Moral graph BE-BEL Use the ancestral graph only 36

Probabilistic Inference Tasks Belief updating: BEL(X i ) = P(Xi = x i evidence) Finding most probable explanation (MPE) x* = argmax P(x,e) Finding maximum a-posteriory hypothesis * * (a1,...,ak) = argmax P(x, e) a x X/A A X : hypothesis variables 42

43 Finding Algorithm BE-mpe max P(x) MPE x = ), ( ), ( ) ( ) ( ) ( max by replaced is,,,, c b e P b a d P a b P a c P a P MPE : b c d e a = max A D E C B

Finding x Algorithm elim-mpe (Dechter 1996) MPE = bucket B: bucket C: bucket D: bucket E: bucket A: max b P(b a) P(d b,a) P(e b,c) P(c a) e=0 P(a) MPE MPE = is replaced by max : max P( a) P( c a) P( b a) P( d a, e, d, c, b h B (a,d,c,e) h C (a,d,e) h D (a,e) h E (a) max P(x) a, b) P( e Elimination operator W*=4 induced width (max clique size) D b, c) B B C D E A A E C 44

Generating the MPE-tuple 5. b' = arg max P(b a' ) b P(d' b,a' ) P(e' b,c' ) B: P(b a) P(d b,a) P(e b,c) 4. c' = h arg max P(c a' B c (a',d',c,e' ) ) C: P(c a) h B (a,d,c,e) 3. d' = arg max h d C (a',d,e' ) D: h C (a,d,e) 2. e' = 0 E: e=0 h D (a, e) 1. a' = arg max P(a) h a E (a) A: P(a) h E (a) Return (a',b',c',d',e' ) 45

47

49 Finding MAP Algorithm BE-map ), ( ), ( ) ( ) ( ) ( max, c b e P b a d P a b P a c P a P MPE e,d,b c a = : max and A D E C B

50

Algorithm BE-MAP Variable ordering: Restricted: Max buckets should Be processed after sum buckets 52

BE for Markov networks queries 53

O(nexp(w*+1)) and O(n exp(w*)), respectively More accurately: O(r exp(w*(d)) where r is the number of cpts. For Bayesian networks r=n. For Markov networks? 54

Finding small induced-width NP-complete A tree has induced-width of? Greedy algorithms: Min width Min induced-width Max-cardinality Fill-in (thought as the best) See anytime min-width (Gogate and Dechter) 56

Type of graphs 57

The induced width 58

Min-width ordering Proposition: algorithm min-width finds a min-width ordering of a graph 59

Greedy orderings heuristics Theorem: A graph is a tree iff it has both width and induced-width of 1. 60

Different Induced-graphs 61

Induced-width for chordal graphs Definition: A graph is chordal if every cycle of length at least 4 has a chord Finding w* over chordal graph is easy using the maxcardinality ordering: order vertices from 1 to n, always assigning the next number to the node connected to a largest set of previously numbered nodes. Lets d be such an ordering A graph along max-cardinality order has no fill-in edges iff it is chordal. On chordal graphs width=induced-width. 62

Max-cardinality ordering What is the complexity of min-fill? Min-induced-width? 63

K-trees 64

Which greedy algorithm is best? 65

Recent work in my group Vibhav Gogate and Rina Dechter. "A Complete Anytime Algorithm for Treewidth". In UAI 2004. Andrew E. Gelfand, Kalev Kask, and Rina Dechter. "Stopping Rules for Randomized Greedy Triangulation Schemes" in Proceedings of AAAI 2011. Potential project 66