Directed Graphical Models. William W. Cohen Machine Learning


1 Directed Graphical Models William W. Cohen Machine Learning

2 MOTIVATION FOR GRAPHICAL MODELS

3 Recap: A paradox of induction A black crow seems to support the hypothesis "all crows are black." A pink highlighter supports the hypothesis "all non-black things are non-crows." Thus, a pink highlighter supports the hypothesis "all crows are black": ∀x CROW(x) → BLACK(x), or equivalently ∀x ¬BLACK(x) → ¬CROW(x). whut?

4 whut? [2×2 diagram: crows vs. non-crows, black vs. not black.] B = black, C = crow; collect statistics for P(B=b | C=c).

5 Logical reasoning versus common-sense reasoning [Diagram of facts about jim: BLACK(jim), FLY(jim), BIRD(jim), EATS(jim, carrion).]

6 Another difficult problem: commonsense reasoning Tweety is a bird; most birds can fly: B(tweety); *∀x: B(x) → F(x). Opus is a penguin; penguins are birds; penguins cannot fly: Pg(opus); ∀x: Pg(x) → B(x); ∀x: Pg(x) → ¬F(x). We'd like to be able to conclude: Opus cannot fly, and Tweety can: ¬F(opus), F(tweety). But logically we can derive both F(opus) and ¬F(opus). This is default reasoning.

7 Another difficult problem: commonsense reasoning Tweety is a bird; most birds can fly: B(tweety); *∀x: B(x) ∧ ¬Pg(x) → F(x). Opus is a penguin; penguins are birds; penguins cannot fly: Pg(opus); ∀x: Pg(x) → B(x); ∀x: Pg(x) → ¬F(x). We'd like to be able to conclude: Opus cannot fly, and Tweety can. ¬F(opus) follows, but F(tweety)? NO: F(tweety) is only provable if he's provably NOT a penguin. Default reasoning.


9 Another difficult problem: commonsense reasoning Tweety is a bird; most birds can fly: B(tweety); *∀x: B(x) ∧ ¬Pg(x) ∧ ¬Dodo(x) ∧ ¬Dead(x) ∧ ... → F(x). Opus is a penguin; penguins are birds; penguins cannot fly. We'd like to be able to conclude: Opus cannot fly, and Tweety can. ¬F(opus) follows, but F(tweety)? NO: F(tweety) is only provable if he's provably not a penguin, and not a dodo, and not dead, and ... Default reasoning.

10 Recap: The Joint Distribution Example: Boolean variables A, B, C. Recipe for making a joint distribution of M variables: 1. Make a truth table listing all combinations of values of your variables (if there are M Boolean variables then the table will have 2^M rows). 2. For each combination of values, say how probable it is. [Table with columns A, B, C, Prob; one probability per row.]
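The recipe is mechanical enough to run as code. A minimal sketch in Python (the probability values here are hypothetical, since the slide leaves the table blank):

import itertools

# Step 1: the truth table over Boolean A, B, C has 2^3 = 8 rows.
# Step 2: assign each row a probability (made-up weights, normalized to sum to 1).
weights = [3, 1, 2, 2, 1, 3, 2, 2]
joint = {row: w / sum(weights)
         for row, w in zip(itertools.product([0, 1], repeat=3), weights)}

assert abs(sum(joint.values()) - 1.0) < 1e-12
# Any query can now be answered from the table, e.g. P(A=1):
print(sum(p for (a, b, c), p in joint.items() if a == 1))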

11 Another difficult problem: commonsense reasoning B(tweety); *∀x: B(x) ∧ ¬Pg(x) → F(x); Pg(opus); ∀x: Pg(x) → B(x); ∀x: Pg(x) → ¬F(x); desired conclusions: F(tweety), ¬F(opus).

12 Another difficult problem: commonsense reasoning Pr(F, B, Pg) = Pr(F | B, Pg) Pr(B | Pg) Pr(Pg). [Network: Pg → B; Pg, B → F; with CPTs Pr(Pg), Pr(B | Pg), Pr(F | B, Pg).]

13 Pr(F, B, Pg) = Pr(F | B, Pg) Pr(B | Pg) Pr(Pg). The joint for an experiment: I pick an object, say in Frick Park, and measure: can it fly, is it a bird, is it a penguin. Tweety is a bird. Most non-penguin birds can fly: *∀x: B(x) ∧ ¬Pg(x) → F(x). Opus is a penguin. Penguins are birds: ∀x: Pg(x) → B(x). Penguins cannot fly. [CPT skeletons: Pr(Pg=0), Pr(Pg=1); Pr(B=0 | Pg), Pr(B=1 | Pg); Pr(F=0 | Pg, B), Pr(F=1 | Pg, B).]

14 Pr(F, B, Pg) = Pr(F | B, Pg) Pr(B | Pg) Pr(Pg). Tweety is a bird. Most birds can fly: *∀x: B(x) ∧ ¬Pg(x) → F(x). Opus is a penguin. Penguins are birds. Penguins cannot fly. Can Opus fly?
Pr(F=1 | Pg=1) = Σ_{b=0,1} Pr(F=1 | B=b, Pg=1) Pr(B=b | Pg=1)
= Pr(F=1 | B=1, Pg=1) Pr(B=1 | Pg=1) + Pr(F=1 | B=0, Pg=1) Pr(B=0 | Pg=1)
Unlikely: Pr(F=1 | B=1, Pg=1) = 0 and Pr(B=1 | Pg=1) = 1, so Pr(F=1 | Pg=1) = 0.

15 Pr(F, B, Pg) = Pr(F | B, Pg) Pr(B | Pg) Pr(Pg). Tweety is a bird. Most birds can fly. Opus is a penguin. Penguins are birds. Penguins cannot fly. Can Tweety fly? If flying penguins are rare, it depends: do non-penguin birds fly? Pr(F=1 | B=1, Pg=0). Are all or most birds non-penguins? Pr(B=1 | Pg=0). Are non-penguins common? Pr(Pg=0).
Pr(F=1 | B=1) = Σ_{pg=0,1} Pr(F=1 | B=1, Pg=pg) Pr(Pg=pg | B=1), where Pr(Pg=pg | B=1) ∝ Pr(B=1 | Pg=pg) Pr(Pg=pg)
= Pr(F=1 | B=1, Pg=1) Pr(Pg=1 | B=1) [Tweety is a flying penguin]
+ Pr(F=1 | B=1, Pg=0) Pr(Pg=0 | B=1) [Tweety is a flying non-penguin bird]
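Plugging in numbers makes the point concrete. A sketch with hypothetical CPT values (the slides leave them open; only the structure of the computation comes from the decomposition above):

# Assumed CPTs: penguins are rare, penguins are always birds, most other birds fly.
p_pg = {1: 0.001, 0: 0.999}          # P(Pg=pg)
p_b1_given_pg = {1: 1.0, 0: 0.1}     # P(B=1 | Pg=pg)
p_f1_given_b1_pg = {1: 0.0, 0: 0.9}  # P(F=1 | B=1, Pg=pg)

# Numerator: P(F=1, B=1) = sum over pg of P(F=1 | B=1, Pg) P(B=1 | Pg) P(Pg)
num = sum(p_f1_given_b1_pg[pg] * p_b1_given_pg[pg] * p_pg[pg] for pg in (0, 1))
# Normalizer: P(B=1) = sum over pg of P(B=1 | Pg) P(Pg)
den = sum(p_b1_given_pg[pg] * p_pg[pg] for pg in (0, 1))
print(num / den)  # P(F=1 | B=1) is about 0.89: Tweety can probably fly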

16 Quiz. Pr(F=1 | B=1) = Σ_{pg=0,1} Pr(F=1 | B=1, Pg=pg) Pr(Pg=pg | B=1) = Pr(F=1 | B=1, Pg=1) Pr(Pg=1 | B=1) + Pr(F=1 | B=1, Pg=0) Pr(Pg=0 | B=1).

17 Another difficult problem: commonsense reasoning Pr(F, B, Pg) = Pr(F | B, Pg) Pr(B | Pg) Pr(Pg). Have we solved the common-sense reasoning problem? No: how do we (1) choose the conditional probabilities needed to model a task, and (2) use them algorithmically to answer questions? No: how do we invent numbers for all the rows of the CPTs Pr(Pg), Pr(B | Pg), Pr(F | B, Pg)?

18 Another difficult problem: commonsense reasoning Pr(F, B, Pg) = Pr(F | B, Pg) Pr(B | Pg) Pr(Pg). Have we solved the common-sense reasoning problem? Yes: we use directed graphical models. Semantics: how to specify them. Inference: how to use them. Learning: how to find parameters.

19 Probabilities and probabilistic inference Why is logic attractive? - There are well-understood algorithms for reasoning with a logical theory. E.g., we can use a computer to determine if B(x) → F(x). What about probabilities? - We can do some math manually and answer many questions. Not really satisfying. - We can answer questions algorithmically with the joint. E.g., we can compute Pr(F=1 | B=1). But this is not tractable for large models. - How can we answer questions algorithmically and efficiently? Answer: graphical models.

20 Probabilities and probabilistic inference Directed graphical models - Today: examples and semantics - Wednesday: inference algorithms - Next week: learning in graphical models using graphical models to specify learning algorithms: Naïve Bayes, LDA, HMMs,

21 GRAPHICAL MODELS: SEMANTICS AND DEFINITIONS

22 Example: practical problem 1 made easy I have 3 standard d20 dice, 1 loaded die. Experiment: (1) pick a d20 uniformly at random, then (2) roll it. Let A = "d20 picked is fair" and B = "roll 19 or 20 with that die." What is P(B)? [Network: A → B; P(A,B) = P(B | A) P(A).]
A, P(A): Fair 0.75; Loaded 0.25.
A, B, P(B | A): Fair, Critical, 0.1; Fair, Noncritical, 0.9; Loaded, Critical, 0.5; Loaded, Noncritical, 0.5.

23 Example: practical problem 1 made easy We have the information we need to answer other questions as well. I have 3 standard d20 dice, 1 loaded die. Experiment: (1) pick a d20 uniformly at random, then (2) roll it. Let A = "d20 picked is fair" and B = "roll 19 or 20 with that die." What is P(B)? What is Pr(A=1 | B=1), i.e., Pr(A=fair | B=critical)? An example of inference. [Same network and CPTs as the previous slide.]
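Both queries fall out of the two CPTs. A minimal sketch using the numbers above:

p_a = {"fair": 0.75, "loaded": 0.25}
p_b_given_a = {("critical", "fair"): 0.1, ("noncritical", "fair"): 0.9,
               ("critical", "loaded"): 0.5, ("noncritical", "loaded"): 0.5}

# Marginalization: P(B=critical) = sum over a of P(B=critical | A=a) P(A=a)
p_b_crit = sum(p_b_given_a[("critical", a)] * p_a[a] for a in p_a)
print(p_b_crit)  # 0.2

# Inference by Bayes' rule: P(A=fair | B=critical)
print(p_b_given_a[("critical", "fair")] * p_a["fair"] / p_b_crit)  # 0.375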

24 Example: practical problem 1 made easy In general, any chain-rule decomposition gives a DGM G: G has one node per random variable. If P(X | Y1, ..., Yk) is a factor in the decomposition, then G has edges Y1 → X, ..., Yk → X, and X is annotated with a conditional probability table (CPT) encoding P(X=x | Y1=y1, ..., Yk=yk) for each tuple (x, y1, ..., yk). [Dice example again: A → B, with P(A,B) = P(B | A) P(A) by the chain rule and the same CPTs as before.]

25 There's more than one network for any distribution I have 3 standard d20 dice, 1 loaded die. Experiment: (1) pick a d20 uniformly at random, then (2) roll it. Let A = "d20 picked is fair" and B = "roll 19 or 20 with that die." [Network: B → A; P(A,B) = P(A | B) P(B).]
B, P(B): Critical 0.2; Noncritical 0.8.
B, A, P(A | B): Critical, Fair, 0.375; Noncritical, Fair, 0.84375; Critical, Loaded, 0.625; Noncritical, Loaded, 0.15625.

26 There's more than one network for any distribution I have 3 standard d20 dice, 1 loaded die. Experiment: (1) pick a d20 uniformly at random, then (2) roll it. Let A = "d20 picked is fair" and B = "roll 19 or 20 with that die." What is P(B)? [B → A with P(A,B) = P(A | B) P(B), versus A → B with P(A,B) = P(B | A) P(A).] The moral: we have two things here: a generative story (a causal model) and a joint probability distribution, e.g. P(A,B); one decomposition P(A,B) = P(B | A) P(A); another decomposition P(A,B) = P(A | B) P(B), totally valid! It's usually cleaner to pick one that fits a generative story.

27 There's more than one network for any distribution
P(A,B,C,D) = P(A | B,C,D) P(B,C,D) = P(A | B,C,D) P(B | C,D) P(C,D) = P(A | B,C,D) P(B | C,D) P(C | D) P(D)
P(A,B,C,D) = P(D | A,B,C) P(A,B,C) = P(D | A,B,C) P(C | A,B) P(A,B) = P(D | A,B,C) P(C | A,B) P(B | A) P(A)
There are lots of decompositions of a model with N variables. They are all correct. Some are better than others.

28 There's more than one network for any distribution
P(A,B,C,D) = P(A | B,C,D) P(B,C,D) = P(A | B,C,D) P(B | C,D) P(C,D) = P(A | B,C,D) P(B | C,D) P(C | D) P(D)
Suppose there are some conditional independencies: P(A | B,C,D) = P(A | B) and P(B | C,D) = P(B | C). Then the first decomposition can be simplified and compressed to P(A | B) P(B | C) P(C | D) P(D); the second, P(D | A,B,C) P(C | A,B) P(B | A) P(A), can't.

29 The (highly practical) Monty Hall problem You're in a game show. Behind one door is a prize. Behind the others, goats. You pick one of three doors, say #1. The host, Monty Hall, opens one door, revealing a goat! You now can either: stick with your guess; always change doors; or flip a coin and pick a new door randomly according to the coin.

30 Example: practical problem 2

31 The (highly practical) Monty Hall problem You're in a game show. Behind one door is a prize. Behind the others, goats. You pick one of three doors, say #1. The host, Monty Hall, opens one door, revealing a goat! You now can either stick with your guess or change doors.
[Network: A (first guess) and B (the money) are parents of C (the revealed goat); A, C, and D (stick or swap?) are parents of E (second guess), with CPTs P(A), P(B); P(D): Stick 0.5, Swap 0.5.]
P(C=c | A=a, B=b) = 1.0 if (a ≠ b) ∧ (c ∉ {a, b}); 0.5 if (a = b) ∧ (c ∉ {a, b}); 0 otherwise.

32 The (highly practical) Monty Hall problem
P(E=e | A=a, C=c, D=d) = 1.0 if (e = a) ∧ (d = stick); 1.0 if (e ∉ {a, c}) ∧ (d = swap); 0 otherwise.
If you stick, you win if your first guess was right. If you swap, you win if your first guess was wrong.
[Same network: first guess A and the money B → revealed goat C; A, C, and stick-or-swap D → second guess E, with CPTs P(A), P(B), P(C | A,B), P(E | A,C,D).]

33 The (highly practical) Monty Hall problem We could construct the joint and compute P(E=B | D=swap). Again, by the chain rule:
P(A,B,C,D,E) = P(E | A,C,D) · P(D) · P(C | A,B) · P(B) · P(A)
[Network as before: A, B → C; A, C, D → E.]
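The model is small enough to enumerate exactly. A sketch that builds the factored joint from the CPTs on the previous slides, assuming uniform P(A) and P(B) (their tables were not transcribed), and answers P(E=B | D=swap):

import itertools
from fractions import Fraction

DOORS = (1, 2, 3)

def p_c(c, a, b):
    # Monty never opens the guessed door a or the prize door b.
    if a != b:
        return Fraction(c not in (a, b))              # one legal door
    return Fraction(1, 2) if c != a else Fraction(0)  # two legal doors

def p_e(e, a, c, d):
    # Stick: keep a. Swap: the one door that is neither a nor c.
    return Fraction(e == a) if d == "stick" else Fraction(e != a and e != c)

def joint(a, b, c, d, e):
    # P(A,B,C,D,E) = P(E|A,C,D) P(D) P(C|A,B) P(B) P(A), with P(D=swap)=1/2
    # and uniform P(A) = P(B) = 1/3 (assumed).
    return p_e(e, a, c, d) * Fraction(1, 2) * p_c(c, a, b) * Fraction(1, 9)

num = den = Fraction(0)
for a, b, c, e in itertools.product(DOORS, repeat=4):
    p = joint(a, b, c, "swap", e)
    den += p
    num += p if e == b else 0
print(num / den)  # P(E=B | D=swap) = 2/3: swapping wins two times out of three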

34 The (highly practical) Monty Hall problem We could construct the joint and compute P(E=B | D=swap) again by the chain rule, this time without dropping any conditioning variables:
P(A,B,C,D,E) = P(E | A,B,C,D) · P(D | A,B,C) · P(C | A,B) · P(B | A) · P(A)
[Network as before.]

35 The (highly practical) Monty Hall problem The joint table has 3·3·3·2·3 = 162 rows. The conditional probability tables (CPTs) shown have 3 + 3 + 3·3·3 + 2·3·3 = 51 rows < 162 rows. Big questions: why are the CPTs smaller? How much smaller are the CPTs than the joint? Can we compute the answers to queries like P(E=B | d) without building the joint probability tables, just using the CPTs? [Network and CPTs as before.]

36 The (highly practical) Monty Hall problem Why is the CPT representation smaller? Follow the money! (B)
P(E=e | A=a, B=b, C=c, D=d) = P(E=e | A=a, C=c, D=d), since
P(E=e | A,C,D) = 1.0 if (e = a) ∧ (d = stick); 1.0 if (e ∉ {a, c}) ∧ (d = swap); 0 otherwise
makes no reference to B. E is conditionally independent of B given A, C, D: E ⊥ B | A, C, D, i.e. I<E, {A,C,D}, B>.

37 Conditional Independence formalized Definition: R and L are conditionally independent given M if, for all x, y, z in {T, F}: P(R=x | M=y ∧ L=z) = P(R=x | M=y). More generally: let S1, S2, and S3 be sets of variables. Set-of-variables S1 and set-of-variables S2 are conditionally independent given S3 if, for all assignments of values to the variables in the sets, P(S1's assignments | S2's assignments ∧ S3's assignments) = P(S1's assignments | S3's assignments).
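The definition translates directly into a brute-force test against a joint table. A sketch for Boolean variables, where `joint` is assumed to map value tuples to probabilities as in the earlier examples:

import itertools

def cond_indep(joint, i, j, ks, tol=1e-9):
    # Test I<X_i, {X_k : k in ks}, X_j>: for every assignment with positive
    # probability, P(xi | xj, xks) must equal P(xi | xks).
    def marg(fix):
        return sum(p for row, p in joint.items()
                   if all(row[v] == val for v, val in fix.items()))
    for vals in itertools.product([0, 1], repeat=len(ks) + 2):
        xi, xj, xks = vals[0], vals[1], vals[2:]
        fix_k = dict(zip(ks, xks))
        pk, pjk = marg(fix_k), marg({j: xj, **fix_k})
        if pk == 0 or pjk == 0:
            continue  # conditional probabilities undefined here
        if abs(marg({i: xi, j: xj, **fix_k}) / pjk -
               marg({i: xi, **fix_k}) / pk) > tol:
            return False
    return True

# e.g. cond_indep(joint, 0, 2, [1]) asks whether X0 and X2 are
# conditionally independent given X1.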

38 The (highly practical) Monty Hall problem What are the conditional independencies? I<A, {B}, C>? I<A, {C}, B>? I<E, {A,C}, B>? I<D, {E}, B>? [Network: first guess A and the money B → the revealed goat C; A, C, and stick-or-swap D → second guess E.]


40 Recap: Bayes Nets Formalized A Bayes net (also called a belief network) is an augmented directed acyclic graph, represented by the pair (V, E) where: - V is a set of vertices. - E is a set of directed edges joining vertices. No directed cycles of any length are allowed. Each vertex in V contains the following information: - The name of a random variable. - A probability distribution table indicating how the probability of this variable's values depends on all possible combinations of parent values.

41 Building a Bayes Net Choose a set of relevant variables. Choose an ordering for them; call them X1 .. Xm (where X1 is first in the ordering, etc.). For i = 1 to m: - Add the Xi node to the network. - Set Parents(Xi) to be a minimal subset of {X1 ... Xi-1} such that Xi is conditionally independent of all other members of {X1 ... Xi-1} given Parents(Xi). - Define the probability table P(Xi = k | assignments of Parents(Xi)).

42 The general case
P(X1=x1 ∧ X2=x2 ∧ ... ∧ Xn-1=xn-1 ∧ Xn=xn)
= P(Xn=xn | Xn-1=xn-1 ∧ ... ∧ X1=x1) · P(Xn-1=xn-1 ∧ ... ∧ X1=x1)
= P(Xn=xn | Xn-1=xn-1 ∧ ... ∧ X1=x1) · P(Xn-1=xn-1 | Xn-2=xn-2 ∧ ... ∧ X1=x1) · P(Xn-2=xn-2 ∧ ... ∧ X1=x1)
= ...
= Π_{i=1}^{n} P(Xi=xi | Xi-1=xi-1 ∧ ... ∧ X1=x1)
= Π_{i=1}^{n} P(Xi=xi | assignments of Parents(Xi))
So any entry in the joint pdf table can be computed, and so any conditional probability can be computed.
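In code, one joint entry is just a product of CPT lookups. A generic sketch; the representation (parents as index lists, CPTs keyed by a value and a tuple of parent values) is an assumed convention, not from the slides:

from math import prod

def joint_entry(x, parents, cpts):
    # P(x1 ... xn) = product over i of P(Xi = xi | assignments of Parents(Xi))
    return prod(cpts[i][(x[i], tuple(x[p] for p in parents[i]))]
                for i in range(len(x)))

# Dice example: A has no parents, B's only parent is A.
parents = [[], [0]]
cpts = [{("fair", ()): 0.75, ("loaded", ()): 0.25},
        {("critical", ("fair",)): 0.1, ("noncritical", ("fair",)): 0.9,
         ("critical", ("loaded",)): 0.5, ("noncritical", ("loaded",)): 0.5}]
print(joint_entry(("loaded", "critical"), parents, cpts))  # 0.125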

43 Question: given a network, can I find a chain-rule decomposition of the joint?

44 GRAPHICAL MODELS: DETERMINING CONDITIONAL INDEPENDENCIES

45 What Independencies does a Bayes Net Model? In order for a Bayesian network to model a probability distribution, the following must be true: each variable is conditionally independent of all its non-descendants in the graph, given the values of all its parents. This follows from
P(X1, ..., Xn) = Π_{i=1}^{n} P(Xi | parents(Xi)) = Π_{i=1}^{n} P(Xi | X1, ..., Xi-1)
But what else does it imply?

46 What Independencies does a Bayes Net Model? Example: [chain Z → Y → X]. Given Y, does learning the value of Z tell us nothing new about X? I.e., is P(X | Y, Z) equal to P(X | Y)? Yes: since we know the values of all of X's parents (namely, Y), and Z is not a descendant of X, X is conditionally independent of Z given Y. Also, since independence is symmetric, P(Z | Y, X) = P(Z | Y).

47 Quick proof that independence is symmetric Assume: P(X | Y, Z) = P(X | Y). Then:
P(Z | X, Y) = P(X, Y, Z) / P(X, Y)   (Bayes's rule)
= P(X | Y, Z) P(Z | Y) P(Y) / (P(X | Y) P(Y))   (chain rule)
= P(X | Y) P(Z | Y) P(Y) / (P(X | Y) P(Y))   (by assumption)
= P(Z | Y)

48 What Independencies does a Bayes Net Model? Let I<X, Y, Z> represent "X and Z are conditionally independent given Y." [Fork: Y → X and Y → Z.] I<X, Y, Z>? Yes, just as in the previous example: all of X's parents are given, and Z is not a descendant of X.

49 What Independencies does a Bayes Net Model? [Diagram: U and V are each parents of both X and Z.] I<X, {U}, Z>? No. I<X, {U, V}, Z>? Yes. Maybe I<X, S, Z> iff S acts as a cutset between X and Z in an undirected version of the graph?

50 Things get a little more confusing [Collider: X → Y ← Z.] X has no parents, so we know all its parents' values trivially. Z is not a descendant of X. So I<X, {}, Z>, even though there's an undirected path from X to Z through an unknown variable Y. What if we do know the value of Y, though? Or one of its descendants?

51 The Burglar Alarm example Your house has a twitchy burglar alarm that is also sometimes triggered by earthquakes; Earth arguably doesn't care whether your house is currently being burgled. While you are on vacation, one of your neighbors calls and tells you your home's burglar alarm is ringing. Uh oh! [Network: Burglar → Alarm ← Earthquake; Alarm → Phone Call.]

52 Things get a lot more confusing But now suppose you learn that there was a medium-sized earthquake in your neighborhood. Oh, whew! Probably not a burglar after all. Earthquake "explains away" the hypothetical burglar. But then it must not be the case that I<Burglar, {Phone Call}, Earthquake>, even though I<Burglar, {}, Earthquake>! [Same network: Burglar → Alarm ← Earthquake; Alarm → Phone Call.]


54 d-separation to the rescue Fortunately, there is a relatively simple algorithm for determining whether two variables in a Bayesian network are conditionally independent: d-separation. Definition: X and Z are d-separated by a set of evidence variables E iff every undirected path from X to Z is blocked, where a path is blocked iff one or more of the following conditions is true... I.e., X and Z may be dependent iff there exists an unblocked path.

55 A path is blocked when... There exists a variable Y on the path such that - it is in the evidence set E, and - the arcs putting Y in the path are tail-to-tail. (Unknown common causes of X and Z impose dependency.) Or, there exists a variable Y on the path such that - it is in the evidence set E, and - the arcs putting Y in the path are tail-to-head. (Unknown causal chains connecting X and Z impose dependency.) Or, ...

56 A path is blocked when (the funky case)... Or, there exists a variable Y on the path such that - it is NOT in the evidence set E, - neither are any of its descendants, and - the arcs putting Y on the path are head-to-head. (Known common symptoms of X and Z impose dependencies: X may explain away Z.)

57 d-separation to the rescue, cont'd Theorem [Verma & Pearl, 1988]: - If a set of evidence variables E d-separates X and Z in a Bayesian network's graph, then I<X, E, Z>. d-separation can be computed in linear time using a depth-first-search-like algorithm. Be careful: d-separation finds what must be conditionally independent. Variables may actually be independent even when they are not d-separated, depending on the actual probabilities involved.
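For small teaching graphs, a transparent (though not linear-time) way to check d-separation is to enumerate the undirected simple paths and apply the three blocking rules above. A sketch:

def d_separated(edges, x, z, evidence):
    # edges: a set of directed (parent, child) pairs; evidence: a set of nodes.
    nodes = {n for e in edges for n in e}
    children = {n: {c for p, c in edges if p == n} for n in nodes}

    def descendants(n):  # n plus everything reachable below it
        out, frontier = set(), {n}
        while frontier:
            c = frontier.pop()
            out.add(c)
            frontier |= children[c] - out
        return out

    def active(path):  # is this path unblocked given the evidence?
        for prev, node, nxt in zip(path, path[1:], path[2:]):
            head_to_head = (prev, node) in edges and (nxt, node) in edges
            if head_to_head:
                # collider: blocked unless it or a descendant is observed
                if not (descendants(node) & evidence):
                    return False
            elif node in evidence:
                return False  # chain or fork: blocked when observed
        return True

    def paths(cur, seen):  # all simple undirected paths from x to z
        if cur == z:
            yield seen
            return
        nbrs = children[cur] | {p for p, c in edges if c == cur}
        for n in nbrs - set(seen):
            yield from paths(n, seen + [n])

    return not any(active(p) for p in paths(x, [x]))

# Burglar alarm example, including explaining away:
E = {("Burglar", "Alarm"), ("Earthquake", "Alarm"), ("Alarm", "PhoneCall")}
print(d_separated(E, "Burglar", "Earthquake", set()))          # True
print(d_separated(E, "Burglar", "Earthquake", {"Alarm"}))      # False
print(d_separated(E, "Burglar", "Earthquake", {"PhoneCall"}))  # False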

58 d-separation example [Graph over nodes A through J.] I<C, {}, D>? I<C, {A}, D>? I<C, {A, B}, D>? I<C, {A, B, J}, D>? I<C, {A, B, E, J}, D>?
