Probabilistic Reasoning. Kee-Eung Kim KAIST Computer Science

Size: px
Start display at page:

Download "Probabilistic Reasoning. Kee-Eung Kim KAIST Computer Science"

Transcription

1 Probabilistic Reasoning Kee-Eung Kim KAIST Computer Science

2 Outline #1 Acting under uncertainty Probabilities Inference with Probabilities Independence and Bayes Rule Bayesian networks Inference in Bayesian networks

3 Uncertainty Let action A t = leave for airport t minutes before flight Will A t get me there on time? Problems Partial observability (road state, other driver s plans, etc.) Noisy sensors (often wrong traffic reports) Uncertainty in action outcomes (flat tire, etc.) Immense complexity of modeling and predicting traffic A logical agent faced with such a problem has to handle uncertain knowledge??? Agent s knowledge can at best provide only a degree of belief in the relevant sentences (probability theory is your friend)

4 Probability Probability provides a way of summarizing the uncertainty that comes from Laziness: failure to enumerate all exceptions, qualifications, etc. Ignorance: lack of relevant facts, initial conditions, etc. Degree of belief vs. degree of truth Probability of 0.8 does not mean 80% true (fuzzy logic/ambiguity) Rather, 80% degree of belief (probability/chance) Making decisions under uncertainty: Which action to choose? Depends on my preference Utility theory is used to represent and infer preferences Decision Theory = utility theory + probability theory

5 Basic Probability : the sample space E.g., 6 possible outcomes from a roll of a die 2 is a sample point/possible world/atomic event A probability space or probability model is a sample space with an assignment P( ) for every 2 s.t. 0 P( ) 1 P( ) = 1 An event A µ : P(A) = 2 A P( ) A random variable is a function from sample points to some range P induces a probability distribution for any random variable X: P(X = x i ) = :X( )=xi P( )

6 Prior Probability Prior or unconditional probabilities of propositions correspond to belief prior to arrival of any (new) evidence E.g., P(Cavity = true) = 0.1 and P(Weather = sunny) = 0.72 Probability distribution gives values for all possible assignments P(Weather) = [0.72, 0.1, 0.08, 0.1] Joint probability distribution for a set of random variables P(Weather, Cavity) = a 4 2 matrix of values: Weather= sunny rain cloudy snow Cavity=true Cavity=false Every question about a domain can be answered by the joint distribution because every event is a sum of sample points

7 Conditional Probability Conditional or posterior probabilities E.g. P(cavity toothache) = 0.8 i.e., given that toothache is all I know, NOT if toothache then 80% chance of cavity Implementation P(Cavity Toothache) = 2-element vector of 2-element vectors If we know more P(cavity toothache,cavity) = 1 New evidence may be irrelevant P(cavity toothache,rain) = P(cavity toothache) = 0.8 Domain knowledge is necessary for this kind of inference Useful rules Product rule: P(a Æ b) = P(a b)p(b) = P(b a)p(a) Chain rule: P(X 1,, X n ) = P(X 1,, X n-1 ) P(X n X 1,,X n-1 ) = i P(X i X 1,,X i-1 )

8 Inference by Enumeration Given the joint distribution toothache : toothache catch : catch catch : catch cavity : cavity For any proposition, sum the atomic events where P(cavity Ç toothache) =? P(: cavity toothache) =? P(Cavity toothache) = P(Cavity, toothache) = [P(Cavity, toothache, catch) + P(Cavity, toothache, : catch)] Compute distribution on query variable by fixing evidence variables and summing over hidden variables

9 Inference by Enumeration Let X be all random variables. Typically we want The posterior joint distribution of the query variables Y Given specific values e for the evidence variables E Let the hidden variables be H = X Y E P(Y E = e) = P(Y, E = e) = h P(Y, E = e, H = h) Obvious problems Worst-case time complexity O(d^n) where d is the largest arity Space complexity O(d^n) to store joint distribution How to find the numbers for O(d^n) entries??

10 Independence A and B are independent iff P(A B) = P(A), or P(B A) = P(B), or P(A,B) = P(A)P(B) P(Toothache,Catch,Cavity,Weather) = P(Toothache,Catch,Cavity) P(Weather) 32 entries reduced to 12; for n independent biased coins, 2^n! N Absolute independence powerful but rare Dentistry is a large field with hundreds of variables, none of which are independent What to do?

11 Conditional Independence P(Toothache,Cavity,Catch) has = 7 independent entries However, if we have If I have a cavity, the probability that the probe catches in it doesn t dependent on whether I have a toothache: P(catch toothache,cavity) = P(catch cavity) The same independence holds if I don t have a cavity: P(catch toothache,: cavity) = P(catch : cavity) Probe catch is conditionally independent of Toothache given Cavity: P(Catch Toothache,Cavity) = P(Catch Cavity) Equivalent statements: P(Toothache Catch,Cavity) = P(Toothache Cavity) P(Toothache,Catch Cavity) = P(Toothache Cavity)P(Catch Cavity) P(Toothache,Cavity,Catch) = P(Toothache Catch,Cavity)P(Catch Cavity)P(Cavity) = P(Toothache Cavity)P(Catch Cavity)P(Cavity) Ã = 5 entries Conditional independence is our most basic and robust form of knowledge about uncertain environments

12 Bayes Rule From product rule P(a Æ b) = P(a b)p(b) = P(b a)p(a) ) Bayes rule Or in distribution form Useful for accessing diagnostic probability from causal probability Extreme conditional independence: naïve Bayes model

13 Bayesian Networks (= Graphical Models) A simple, graphical notation for conditional independence assertions and hence for compact specification of full joint distributions Syntax A set of nodes, one per random variable A directed, acyclic graph (link ¼ directly influences ) A conditional distribution for each node given its parents Conditional distribution represented as a conditional probability table (CPT) giving the distribution over X i for each combination of parent values.

14 Example: The Alarm Network I have a burglar alarm installed, but it also responds on occasion to earthquakes. My neighbors John and Mary promise to call me at work when they hear alarm. John calls to say alarm is ringing, but Mary doesn t. Is there a burglar? Variables: Burglary, Earthquake, Alarm, JohnCalls, MaryCalls Knowledge to be encoded into topology: A burglar can set the alarm off An earthquake can set the alarm off The alarm can cause Mary to call The alarm can cause John to call

15 Semantics of Bayesian Networks Global semantics defines the full joint distribution as the product of the local conditional distributions E.g., P(j Æ m Æ a Æ : b Æ : e) = P(j a) P(m a) P(a : b, : e) P(: b) P(: e) Local semantics: each node is conditionally independent of its non-descendants given its parents Mental Exercise: Derive global semantics from local semantics and vice versa Global semantics and local semantics are equivalent

16 Constructing Bayesian Networks Choose an ordering of random variables: MaryCalls, JohnCalls, Alarm, Burglary, Earthquake MaryCalls JohnCalls Alarm Burglary Earthquake P(JohnCalls MaryCalls) = P(JohnCalls)? No P(Alarm JohnCalls,MaryCalls) = P(Alarm JohnCalls)? No P(Alarm JohnCalls,MaryCalls) = P(Alarm)? No P(Burglary Alarm,JohnCalls,MaryCalls) = P(Burglary Alarm)? Yes P(Burglary Alarm,JohnCalls,MaryCalls) = P(Burglary)? No P(Earthquake B,Alarm,J,M) = P(Earthquake Alarm)? No (!!) P(Earthquake Burglary,Alarm,J,M) = P(Earthquake Burglary,Alarm)? Yes

17 Constructing Bayesian Networks 1. Choose an ordering of variables X 1,,X n 2. For i = 1 to n Add X i to the network Select parents from X 1,,X i-1 such that This choice of parents guarantees global semantics (by chain rule) (by construction)

18 Inference Example: Car Diagnosis Initial evidence: Car won t start Testable (observed) variables: green broken, so fix it (hypothesis) variables: orange Hidden variables: gray

19 Inference by Enumeration in BN Simple query on the alarm network: Given that John called and Mary called, what is the probability of burglary? P(B j,m) = P(B, j, m) / P(j, m) = P(B, j, m) = e a P(B, e, a, j, m) Rewrite full joint probability using CPT: P(B j, m) = e a P(B) P(e) P(a B,e) P(j a) P(m a) = P(B) e P(e) a P(a B,e) P(j a) P(m a) Recursive depth-first enumeration: O(n) space, O(d^n) time

20 Evaluation Tree Evaluation tree of P(B) e P(e) a P(a B,e) P(j a) P(m a) Enumeration is inefficient: repeated computation Computes P(j a) P(m a) for each value of e

21 Inference by Variable Elimination Variable elimination: carry out summations right-to-left, storing intermediate results (factors) to avoid recomputation 2-element vector 2-element vector matrix 2 2 matrix 2-element vector

22 Basic Operations for Variable Elimination Summing out a variable from a product of factors Move any constant factors outside the summation Add up sub-matrices in pointwise product of remaining factors Assume f 1,, f i do not depend on X: Pointwise product of factors f 1 and f 2 E.g.

23 Irrelevant Variables Consider query P(JohnCalls Burglary=true) P(J b) = P(b) e P(e) a P(a b,e) P(J a) m P(m a) Sum over m is identically 1 (M is irrelevant to the query) Theorem: Y is irrelevant unless Y 2 Ancestors({X} [ E) where {X} is the set of query nodes X = JohnCalls, E = {Burglary} Ancestors({X} [ E) = {Alarm, Earthquake} MaryCalls is irrelevant Remember backward chaining algorithm?

24 Complexity of Exact Inference Singly connected networks (or polytrees) Any two nodes are connected by at most one path Time and space cost of variable elimination are O(d^k n) Multiply connected networks We can reduce 3-SAT problem to inference ) NP-hard To be exact, the problem is as hard as counting number of satisfying assignments in 3-SAT ) #P-hard

25 Summary Uncertainty arises from laziness and ignorance Inherent in complex, dynamic, or inaccessible worlds Probabilities express the agent s inability to reach a definite decision regarding the truth of sentence Probabilities summarize the agent s beliefs Prior probabilities, conditional probabilities, full joint probability distributions Bayesian networks represent conditional independence relationships in the domain in a concise way The topology of the network encodes conditional independence Exact inference by variable elimination Polytime on polytrees, NP-hard on general graphs Very sensitive to topology

26 Bayesian Learning

27 Outline #2 Bayesian learning Maximum a posteriori and maximum likelihood learning Bayes net learning ML parameter learning with complete data Linear regression ML parameter learning with hidden attribute data (EM algorithm)

28 Full Bayesian Learning View learning as Bayesian updating of a probability distribution over the hypothesis space Given H = {h 1, h 2, } and prior distribution P(H) j-th observation d j is the outcome of random variable D j Training data d = d 1,, d N Given the data so far, each hypothesis has a posterior probability P(h i d) = P(d h i ) P(h i ) where P(d h i ) is called the likelihood Prediction use a likelihood-weighted average over the hypotheses: P(X d) = i P(X d,h i ) P(h i d) = i P(X h i ) P(h i d) No need to pick one best-guess hypothesis!!!

29 Example Suppose there are five kinds of bags of candies: 10% are h 1 : 100% cherry candies 20% are h 2 : 75% cherry candies + 25% lime candies 40% are h 3 : 50% cherry candies + 50% lime candies 20% are h 4 : 25% cherry candies + 75% lime candies 10% are h 5 : 100% lime candies Then we observe candies drawn from some bag: What kind of bag is it? What flavour will the next candy be?

30 Posterior/Prediction Prob. of Hypotheses Posterior probability: P(h i d) = P(d h i ) P(h i ) Prediction probability: P(X d) = i P(X h i ) p(h i d)

31 MAP Approximation Summing over the hypothesis space is often intractable e.g., 18,446,744,073,709,551,616 Boolean functions of 6 attributes Instead, make prediction based on a single most probable hypothesis Maximum a posteriori (MAP) learning Choose h MAP maximizing P(h i d) i.e., maximize P(d h i ) P(h i ) or log P(d h i ) + log P(h i ) Log terms can be viewed as (negative of) bits to encode data given hypothesis + bits to encode hypothesis This is the basic idea of minimum description length (MDL) learning For deterministic hypotheses, P(d h i ) is 1 if consistent, 0 otherwise ) MAP = simplest consistent hypothesis

32 ML Approximation For large data sets, prior becomes irrelevant The bits to encode data given hypothesis dominates the bits to encode hypothesis Maximum likelihood (ML) learning Choose h ML maximizing P(d h i ) i.e., simply get the best fit to the data; identical to MAP for uniform prior (which is reasonable if all hypotheses are of the same complexity) ML is the standard (non-bayesian) statistical learning method

33 ML Parameter Learning in Bayes Nets Bag from a new manufacturer; fraction of cherry candies? A Bayesian network with one node Flavor The task is to learn P(Flavor) = Any is possible; continuum of hypotheses h is a parameter for this simple (binomial) family of models Suppose we unwrap N candies, c cherries and l = N c limes These are i.i.d. (independent, identically distributed) observations, so Maximize this w.r.t. which is easier for the log-likelihood: However, given small dataset, ML learning can assign zero probability to unobserved events

34 ML Learning with Multiple Parameters Red/green wrapper depends probabilistically on flavor Likelihood for, e.g., cherry candy in green wrapper: N candies, r c red-wrapped cherry candies, etc.

35 ML Learning with Multiple Parameters Log likelihood: Derivatives of L contain only the relevant parameter: With complete data, parameters can be learned separately

36 Example: Linear Gaussian Model y = 1 x + 2 plus Gaussian noise Maximize P(y x) w.r.t. parameters 1 and 2 = Minimize Minimizing the sum of squared errors gives the ML solution for a linear fit assuming Gaussian noise of fixed variance

37 Learning Bayes Nets w. Hidden Variables Candies from either Bag 1 or Bag 2 In addition to Flavor and Wrapper, some candies have a Hole Parameters: : probability of candy coming from Bag 1 F1, F2 : conditional probabilities of flavor being cherry, given coming from Bag 1 or Bag 2 W1, W2 : conditional probabilities of wrapper being red H1, H2 : conditional probabilities of candy having a hole

38 Learning Bayes Nets w. Hidden Variables EM (expectation-maximization) algorithm An iterative method for maximizing the likelihood with hidden vars Basic idea: Pretend that we know the parameters of the network and compute the posterior probabilities (E-step) Then using those probabilities, maximize the likelihood by adjusting parameters (M-step) Repeat the two steps until convergence Data with 1000 instances generated from a true model True parameters: =0.5, F1 = W1 = H1 =0.8, F2 = W2 = H2 =0.3 The counts from the data: W = red W = green H = 1 H = 0 H = 1 H = 0 F = cherry F = lime

39 MAY SKIP Learning Bayes Nets w. Hidden Variables Start with (arbitrary) initial parameters For at iteration 1: Calculate the expected count (E-step): Find the maximum likelihood parameter (M-step): [Exercise]

40 MAY SKIP Learning Bayes Nets w. Hidden Variables For F1 at iteration 1: Calculate the expected count (E-step): Find the maximum likelihood parameter (M-step): [Exercise] Compute EM steps for the rest of parameters, and repeat

41 Markov Chain Monte Carlo (MCMC): A Universal Inference Algorithm

42

43

44

45

46

47

48

49

50

51

52

53

54

55

56

57

58

59

60

61

62

63

64

65

66

67

68

69

70

Quantifying uncertainty & Bayesian networks

Quantifying uncertainty & Bayesian networks Quantifying uncertainty & Bayesian networks CE417: Introduction to Artificial Intelligence Sharif University of Technology Spring 2016 Soleymani Artificial Intelligence: A Modern Approach, 3 rd Edition,

More information

Probabilistic Reasoning Systems

Probabilistic Reasoning Systems Probabilistic Reasoning Systems Dr. Richard J. Povinelli Copyright Richard J. Povinelli rev 1.0, 10/7/2001 Page 1 Objectives You should be able to apply belief networks to model a problem with uncertainty.

More information

Uncertainty and Bayesian Networks

Uncertainty and Bayesian Networks Uncertainty and Bayesian Networks Tutorial 3 Tutorial 3 1 Outline Uncertainty Probability Syntax and Semantics for Uncertainty Inference Independence and Bayes Rule Syntax and Semantics for Bayesian Networks

More information

Objectives. Probabilistic Reasoning Systems. Outline. Independence. Conditional independence. Conditional independence II.

Objectives. Probabilistic Reasoning Systems. Outline. Independence. Conditional independence. Conditional independence II. Copyright Richard J. Povinelli rev 1.0, 10/1//2001 Page 1 Probabilistic Reasoning Systems Dr. Richard J. Povinelli Objectives You should be able to apply belief networks to model a problem with uncertainty.

More information

Uncertainty. Chapter 13

Uncertainty. Chapter 13 Uncertainty Chapter 13 Outline Uncertainty Probability Syntax and Semantics Inference Independence and Bayes Rule Uncertainty Let s say you want to get to the airport in time for a flight. Let action A

More information

Pengju XJTU 2016

Pengju XJTU 2016 Introduction to AI Chapter13 Uncertainty Pengju Ren@IAIR Outline Uncertainty Probability Syntax and Semantics Inference Independence and Bayes Rule Wumpus World Environment Squares adjacent to wumpus are

More information

Probabilistic Reasoning

Probabilistic Reasoning Probabilistic Reasoning Philipp Koehn 4 April 2017 Outline 1 Uncertainty Probability Inference Independence and Bayes Rule 2 uncertainty Uncertainty 3 Let action A t = leave for airport t minutes before

More information

Artificial Intelligence

Artificial Intelligence Artificial Intelligence Dr Ahmed Rafat Abas Computer Science Dept, Faculty of Computers and Informatics, Zagazig University arabas@zu.edu.eg http://www.arsaliem.faculty.zu.edu.eg/ Uncertainty Chapter 13

More information

Probabilistic Reasoning. (Mostly using Bayesian Networks)

Probabilistic Reasoning. (Mostly using Bayesian Networks) Probabilistic Reasoning (Mostly using Bayesian Networks) Introduction: Why probabilistic reasoning? The world is not deterministic. (Usually because information is limited.) Ways of coping with uncertainty

More information

Artificial Intelligence Uncertainty

Artificial Intelligence Uncertainty Artificial Intelligence Uncertainty Ch. 13 Uncertainty Let action A t = leave for airport t minutes before flight Will A t get me there on time? A 25, A 60, A 3600 Uncertainty: partial observability (road

More information

Bayesian networks. Chapter Chapter

Bayesian networks. Chapter Chapter Bayesian networks Chapter 14.1 3 Chapter 14.1 3 1 Outline Syntax Semantics Parameterized distributions Chapter 14.1 3 2 Bayesian networks A simple, graphical notation for conditional independence assertions

More information

Bayesian Network. Outline. Bayesian Network. Syntax Semantics Exact inference by enumeration Exact inference by variable elimination

Bayesian Network. Outline. Bayesian Network. Syntax Semantics Exact inference by enumeration Exact inference by variable elimination Outline Syntax Semantics Exact inference by enumeration Exact inference by variable elimination s A simple, graphical notation for conditional independence assertions and hence for compact specication

More information

Uncertain Knowledge and Reasoning

Uncertain Knowledge and Reasoning Uncertainty Part IV Uncertain Knowledge and Reasoning et action A t = leave for airport t minutes before flight Will A t get me there on time? Problems: 1) partial observability (road state, other drivers

More information

COMP9414/ 9814/ 3411: Artificial Intelligence. 14. Uncertainty. Russell & Norvig, Chapter 13. UNSW c AIMA, 2004, Alan Blair, 2012

COMP9414/ 9814/ 3411: Artificial Intelligence. 14. Uncertainty. Russell & Norvig, Chapter 13. UNSW c AIMA, 2004, Alan Blair, 2012 COMP9414/ 9814/ 3411: Artificial Intelligence 14. Uncertainty Russell & Norvig, Chapter 13. COMP9414/9814/3411 14s1 Uncertainty 1 Outline Uncertainty Probability Syntax and Semantics Inference Independence

More information

Bayesian networks. Chapter 14, Sections 1 4

Bayesian networks. Chapter 14, Sections 1 4 Bayesian networks Chapter 14, Sections 1 4 Artificial Intelligence, spring 2013, Peter Ljunglöf; based on AIMA Slides c Stuart Russel and Peter Norvig, 2004 Chapter 14, Sections 1 4 1 Bayesian networks

More information

Uncertainty. Introduction to Artificial Intelligence CS 151 Lecture 2 April 1, CS151, Spring 2004

Uncertainty. Introduction to Artificial Intelligence CS 151 Lecture 2 April 1, CS151, Spring 2004 Uncertainty Introduction to Artificial Intelligence CS 151 Lecture 2 April 1, 2004 Administration PA 1 will be handed out today. There will be a MATLAB tutorial tomorrow, Friday, April 2 in AP&M 4882 at

More information

Uncertainty. Outline

Uncertainty. Outline Uncertainty Chapter 13 Outline Uncertainty Probability Syntax and Semantics Inference Independence and Bayes' Rule 1 Uncertainty Let action A t = leave for airport t minutes before flight Will A t get

More information

Uncertainty. 22c:145 Artificial Intelligence. Problem of Logic Agents. Foundations of Probability. Axioms of Probability

Uncertainty. 22c:145 Artificial Intelligence. Problem of Logic Agents. Foundations of Probability. Axioms of Probability Problem of Logic Agents 22c:145 Artificial Intelligence Uncertainty Reading: Ch 13. Russell & Norvig Logic-agents almost never have access to the whole truth about their environments. A rational agent

More information

Probabilistic representation and reasoning

Probabilistic representation and reasoning Probabilistic representation and reasoning Applied artificial intelligence (EDA132) Lecture 09 2017-02-15 Elin A. Topp Material based on course book, chapter 13, 14.1-3 1 Show time! Two boxes of chocolates,

More information

Statistical learning. Chapter 20, Sections 1 4 1

Statistical learning. Chapter 20, Sections 1 4 1 Statistical learning Chapter 20, Sections 1 4 Chapter 20, Sections 1 4 1 Outline Bayesian learning Maximum a posteriori and maximum likelihood learning Bayes net learning ML parameter learning with complete

More information

Statistical learning. Chapter 20, Sections 1 3 1

Statistical learning. Chapter 20, Sections 1 3 1 Statistical learning Chapter 20, Sections 1 3 Chapter 20, Sections 1 3 1 Outline Bayesian learning Maximum a posteriori and maximum likelihood learning Bayes net learning ML parameter learning with complete

More information

Uncertainty. Chapter 13

Uncertainty. Chapter 13 Uncertainty Chapter 13 Uncertainty Let action A t = leave for airport t minutes before flight Will A t get me there on time? Problems: 1. partial observability (road state, other drivers' plans, noisy

More information

Uncertainty. Outline. Probability Syntax and Semantics Inference Independence and Bayes Rule. AIMA2e Chapter 13

Uncertainty. Outline. Probability Syntax and Semantics Inference Independence and Bayes Rule. AIMA2e Chapter 13 Uncertainty AIMA2e Chapter 13 1 Outline Uncertainty Probability Syntax and Semantics Inference Independence and ayes Rule 2 Uncertainty Let action A t = leave for airport t minutes before flight Will A

More information

Uncertainty. Chapter 13. Chapter 13 1

Uncertainty. Chapter 13. Chapter 13 1 Uncertainty Chapter 13 Chapter 13 1 Outline Uncertainty Probability Syntax and Semantics Inference Independence and Bayes Rule Chapter 13 2 Uncertainty Let action A t = leave for airport t minutes before

More information

PROBABILISTIC REASONING SYSTEMS

PROBABILISTIC REASONING SYSTEMS PROBABILISTIC REASONING SYSTEMS In which we explain how to build reasoning systems that use network models to reason with uncertainty according to the laws of probability theory. Outline Knowledge in uncertain

More information

Bayesian networks. Soleymani. CE417: Introduction to Artificial Intelligence Sharif University of Technology Spring 2018

Bayesian networks. Soleymani. CE417: Introduction to Artificial Intelligence Sharif University of Technology Spring 2018 Bayesian networks CE417: Introduction to Artificial Intelligence Sharif University of Technology Spring 2018 Soleymani Slides have been adopted from Klein and Abdeel, CS188, UC Berkeley. Outline Probability

More information

Uncertainty. Chapter 13, Sections 1 6

Uncertainty. Chapter 13, Sections 1 6 Uncertainty Chapter 13, Sections 1 6 Artificial Intelligence, spring 2013, Peter Ljunglöf; based on AIMA Slides c Stuart Russel and Peter Norvig, 2004 Chapter 13, Sections 1 6 1 Outline Uncertainty Probability

More information

Probabilistic representation and reasoning

Probabilistic representation and reasoning Probabilistic representation and reasoning Applied artificial intelligence (EDAF70) Lecture 04 2019-02-01 Elin A. Topp Material based on course book, chapter 13, 14.1-3 1 Show time! Two boxes of chocolates,

More information

Outline. Uncertainty. Methods for handling uncertainty. Uncertainty. Making decisions under uncertainty. Probability. Uncertainty

Outline. Uncertainty. Methods for handling uncertainty. Uncertainty. Making decisions under uncertainty. Probability. Uncertainty Outline Uncertainty Uncertainty Chapter 13 Probability Syntax and Semantics Inference Independence and ayes Rule Chapter 13 1 Chapter 13 2 Uncertainty et action A t =leaveforairportt minutes before flight

More information

Review: Bayesian learning and inference

Review: Bayesian learning and inference Review: Bayesian learning and inference Suppose the agent has to make decisions about the value of an unobserved query variable X based on the values of an observed evidence variable E Inference problem:

More information

Bayesian networks (1) Lirong Xia

Bayesian networks (1) Lirong Xia Bayesian networks (1) Lirong Xia Random variables and joint distributions Ø A random variable is a variable with a domain Random variables: capital letters, e.g. W, D, L values: small letters, e.g. w,

More information

Uncertainty. Logic and Uncertainty. Russell & Norvig. Readings: Chapter 13. One problem with logical-agent approaches: C:145 Artificial

Uncertainty. Logic and Uncertainty. Russell & Norvig. Readings: Chapter 13. One problem with logical-agent approaches: C:145 Artificial C:145 Artificial Intelligence@ Uncertainty Readings: Chapter 13 Russell & Norvig. Artificial Intelligence p.1/43 Logic and Uncertainty One problem with logical-agent approaches: Agents almost never have

More information

Graphical Models - Part I

Graphical Models - Part I Graphical Models - Part I Oliver Schulte - CMPT 726 Bishop PRML Ch. 8, some slides from Russell and Norvig AIMA2e Outline Probabilistic Models Bayesian Networks Markov Random Fields Inference Outline Probabilistic

More information

Chapter 13 Quantifying Uncertainty

Chapter 13 Quantifying Uncertainty Chapter 13 Quantifying Uncertainty CS5811 - Artificial Intelligence Nilufer Onder Department of Computer Science Michigan Technological University Outline Probability basics Syntax and semantics Inference

More information

Pengju

Pengju Introduction to AI Chapter13 Uncertainty Pengju Ren@IAIR Outline Uncertainty Probability Syntax and Semantics Inference Independence and Bayes Rule Example: Car diagnosis Wumpus World Environment Squares

More information

PROBABILISTIC REASONING Outline

PROBABILISTIC REASONING Outline PROBABILISTIC REASONING Outline Danica Kragic Uncertainty Probability Syntax and Semantics Inference Independence and Bayes Rule September 23, 2005 Uncertainty - Let action A t = leave for airport t minutes

More information

14 PROBABILISTIC REASONING

14 PROBABILISTIC REASONING 228 14 PROBABILISTIC REASONING A Bayesian network is a directed graph in which each node is annotated with quantitative probability information 1. A set of random variables makes up the nodes of the network.

More information

Probabilistic Models

Probabilistic Models Bayes Nets 1 Probabilistic Models Models describe how (a portion of) the world works Models are always simplifications May not account for every variable May not account for all interactions between variables

More information

Quantifying Uncertainty & Probabilistic Reasoning. Abdulla AlKhenji Khaled AlEmadi Mohammed AlAnsari

Quantifying Uncertainty & Probabilistic Reasoning. Abdulla AlKhenji Khaled AlEmadi Mohammed AlAnsari Quantifying Uncertainty & Probabilistic Reasoning Abdulla AlKhenji Khaled AlEmadi Mohammed AlAnsari Outline Previous Implementations What is Uncertainty? Acting Under Uncertainty Rational Decisions Basic

More information

Statistical learning. Chapter 20, Sections 1 3 1

Statistical learning. Chapter 20, Sections 1 3 1 Statistical learning Chapter 20, Sections 1 3 Chapter 20, Sections 1 3 1 Outline Bayesian learning Maximum a posteriori and maximum likelihood learning Bayes net learning ML parameter learning with complete

More information

CSE 473: Artificial Intelligence Autumn 2011

CSE 473: Artificial Intelligence Autumn 2011 CSE 473: Artificial Intelligence Autumn 2011 Bayesian Networks Luke Zettlemoyer Many slides over the course adapted from either Dan Klein, Stuart Russell or Andrew Moore 1 Outline Probabilistic models

More information

Uncertainty (Chapter 13, Russell & Norvig) Introduction to Artificial Intelligence CS 150 Lecture 14

Uncertainty (Chapter 13, Russell & Norvig) Introduction to Artificial Intelligence CS 150 Lecture 14 Uncertainty (Chapter 13, Russell & Norvig) Introduction to Artificial Intelligence CS 150 Lecture 14 Administration Last Programming assignment will be handed out later this week. I am doing probability

More information

Bayesian Networks. Vibhav Gogate The University of Texas at Dallas

Bayesian Networks. Vibhav Gogate The University of Texas at Dallas Bayesian Networks Vibhav Gogate The University of Texas at Dallas Intro to AI (CS 6364) Many slides over the course adapted from either Dan Klein, Luke Zettlemoyer, Stuart Russell or Andrew Moore 1 Outline

More information

Probabilistic Models. Models describe how (a portion of) the world works

Probabilistic Models. Models describe how (a portion of) the world works Probabilistic Models Models describe how (a portion of) the world works Models are always simplifications May not account for every variable May not account for all interactions between variables All models

More information

CS 484 Data Mining. Classification 7. Some slides are from Professor Padhraic Smyth at UC Irvine

CS 484 Data Mining. Classification 7. Some slides are from Professor Padhraic Smyth at UC Irvine CS 484 Data Mining Classification 7 Some slides are from Professor Padhraic Smyth at UC Irvine Bayesian Belief networks Conditional independence assumption of Naïve Bayes classifier is too strong. Allows

More information

Lecture 10: Introduction to reasoning under uncertainty. Uncertainty

Lecture 10: Introduction to reasoning under uncertainty. Uncertainty Lecture 10: Introduction to reasoning under uncertainty Introduction to reasoning under uncertainty Review of probability Axioms and inference Conditional probability Probability distributions COMP-424,

More information

CS 188: Artificial Intelligence Spring Announcements

CS 188: Artificial Intelligence Spring Announcements CS 188: Artificial Intelligence Spring 2011 Lecture 14: Bayes Nets II Independence 3/9/2011 Pieter Abbeel UC Berkeley Many slides over the course adapted from Dan Klein, Stuart Russell, Andrew Moore Announcements

More information

CS 561: Artificial Intelligence

CS 561: Artificial Intelligence CS 561: Artificial Intelligence Instructor: TAs: Sofus A. Macskassy, macskass@usc.edu Nadeesha Ranashinghe (nadeeshr@usc.edu) William Yeoh (wyeoh@usc.edu) Harris Chiu (chiciu@usc.edu) Lectures: MW 5:00-6:20pm,

More information

Bayesian Networks. Vibhav Gogate The University of Texas at Dallas

Bayesian Networks. Vibhav Gogate The University of Texas at Dallas Bayesian Networks Vibhav Gogate The University of Texas at Dallas Intro to AI (CS 4365) Many slides over the course adapted from either Dan Klein, Luke Zettlemoyer, Stuart Russell or Andrew Moore 1 Outline

More information

An AI-ish view of Probability, Conditional Probability & Bayes Theorem

An AI-ish view of Probability, Conditional Probability & Bayes Theorem An AI-ish view of Probability, Conditional Probability & Bayes Theorem Review: Uncertainty and Truth Values: a mismatch Let action A t = leave for airport t minutes before flight. Will A 15 get me there

More information

10/18/2017. An AI-ish view of Probability, Conditional Probability & Bayes Theorem. Making decisions under uncertainty.

10/18/2017. An AI-ish view of Probability, Conditional Probability & Bayes Theorem. Making decisions under uncertainty. An AI-ish view of Probability, Conditional Probability & Bayes Theorem Review: Uncertainty and Truth Values: a mismatch Let action A t = leave for airport t minutes before flight. Will A 15 get me there

More information

CS 188: Artificial Intelligence Fall 2009

CS 188: Artificial Intelligence Fall 2009 CS 188: Artificial Intelligence Fall 2009 Lecture 14: Bayes Nets 10/13/2009 Dan Klein UC Berkeley Announcements Assignments P3 due yesterday W2 due Thursday W1 returned in front (after lecture) Midterm

More information

CS 5522: Artificial Intelligence II

CS 5522: Artificial Intelligence II CS 5522: Artificial Intelligence II Bayes Nets Instructor: Alan Ritter Ohio State University [These slides were adapted from CS188 Intro to AI at UC Berkeley. All materials available at http://ai.berkeley.edu.]

More information

This lecture. Reading. Conditional Independence Bayesian (Belief) Networks: Syntax and semantics. Chapter CS151, Spring 2004

This lecture. Reading. Conditional Independence Bayesian (Belief) Networks: Syntax and semantics. Chapter CS151, Spring 2004 This lecture Conditional Independence Bayesian (Belief) Networks: Syntax and semantics Reading Chapter 14.1-14.2 Propositions and Random Variables Letting A refer to a proposition which may either be true

More information

CS 343: Artificial Intelligence

CS 343: Artificial Intelligence CS 343: Artificial Intelligence Bayes Nets Prof. Scott Niekum The University of Texas at Austin [These slides based on those of Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley. All CS188

More information

Uncertainty and Belief Networks. Introduction to Artificial Intelligence CS 151 Lecture 1 continued Ok, Lecture 2!

Uncertainty and Belief Networks. Introduction to Artificial Intelligence CS 151 Lecture 1 continued Ok, Lecture 2! Uncertainty and Belief Networks Introduction to Artificial Intelligence CS 151 Lecture 1 continued Ok, Lecture 2! This lecture Conditional Independence Bayesian (Belief) Networks: Syntax and semantics

More information

Web-Mining Agents Data Mining

Web-Mining Agents Data Mining Web-Mining Agents Data Mining Prof. Dr. Ralf Möller Dr. Özgür L. Özçep Universität zu Lübeck Institut für Informationssysteme Tanya Braun (Übungen) 2 Uncertainty AIMA Chapter 13 3 Outline Agents Uncertainty

More information

Bayesian Networks BY: MOHAMAD ALSABBAGH

Bayesian Networks BY: MOHAMAD ALSABBAGH Bayesian Networks BY: MOHAMAD ALSABBAGH Outlines Introduction Bayes Rule Bayesian Networks (BN) Representation Size of a Bayesian Network Inference via BN BN Learning Dynamic BN Introduction Conditional

More information

Probabilistic Robotics

Probabilistic Robotics Probabilistic Robotics Overview of probability, Representing uncertainty Propagation of uncertainty, Bayes Rule Application to Localization and Mapping Slides from Autonomous Robots (Siegwart and Nourbaksh),

More information

CS 380: ARTIFICIAL INTELLIGENCE UNCERTAINTY. Santiago Ontañón

CS 380: ARTIFICIAL INTELLIGENCE UNCERTAINTY. Santiago Ontañón CS 380: ARTIFICIAL INTELLIGENCE UNCERTAINTY Santiago Ontañón so367@drexel.edu Summary Probability is a rigorous formalism for uncertain knowledge Joint probability distribution specifies probability of

More information

Informatics 2D Reasoning and Agents Semester 2,

Informatics 2D Reasoning and Agents Semester 2, Informatics 2D Reasoning and Agents Semester 2, 2017 2018 Alex Lascarides alex@inf.ed.ac.uk Lecture 23 Probabilistic Reasoning with Bayesian Networks 15th March 2018 Informatics UoE Informatics 2D 1 Where

More information

Ch.6 Uncertain Knowledge. Logic and Uncertainty. Representation. One problem with logical approaches: Department of Computer Science

Ch.6 Uncertain Knowledge. Logic and Uncertainty. Representation. One problem with logical approaches: Department of Computer Science Ch.6 Uncertain Knowledge Representation Hantao Zhang http://www.cs.uiowa.edu/ hzhang/c145 The University of Iowa Department of Computer Science Artificial Intelligence p.1/39 Logic and Uncertainty One

More information

Basic Probability and Decisions

Basic Probability and Decisions Basic Probability and Decisions Chris Amato Northeastern University Some images and slides are used from: Rob Platt, CS188 UC Berkeley, AIMA Uncertainty Let action A t = leave for airport t minutes before

More information

Cartesian-product sample spaces and independence

Cartesian-product sample spaces and independence CS 70 Discrete Mathematics for CS Fall 003 Wagner Lecture 4 The final two lectures on probability will cover some basic methods for answering questions about probability spaces. We will apply them to the

More information

Statistical Learning. Philipp Koehn. 10 November 2015

Statistical Learning. Philipp Koehn. 10 November 2015 Statistical Learning Philipp Koehn 10 November 2015 Outline 1 Learning agents Inductive learning Decision tree learning Measuring learning performance Bayesian learning Maximum a posteriori and maximum

More information

Bayesian Networks. Motivation

Bayesian Networks. Motivation Bayesian Networks Computer Sciences 760 Spring 2014 http://pages.cs.wisc.edu/~dpage/cs760/ Motivation Assume we have five Boolean variables,,,, The joint probability is,,,, How many state configurations

More information

EE562 ARTIFICIAL INTELLIGENCE FOR ENGINEERS

EE562 ARTIFICIAL INTELLIGENCE FOR ENGINEERS EE562 ARTIFICIAL INTELLIGENCE FOR ENGINEERS Lecture 16, 6/1/2005 University of Washington, Department of Electrical Engineering Spring 2005 Instructor: Professor Jeff A. Bilmes Uncertainty & Bayesian Networks

More information

Resolution or modus ponens are exact there is no possibility of mistake if the rules are followed exactly.

Resolution or modus ponens are exact there is no possibility of mistake if the rules are followed exactly. THE WEAKEST LINK Resolution or modus ponens are exact there is no possibility of mistake if the rules are followed exactly. These methods of inference (also known as deductive methods) require that information

More information

CS 188: Artificial Intelligence Fall 2008

CS 188: Artificial Intelligence Fall 2008 CS 188: Artificial Intelligence Fall 2008 Lecture 14: Bayes Nets 10/14/2008 Dan Klein UC Berkeley 1 1 Announcements Midterm 10/21! One page note sheet Review sessions Friday and Sunday (similar) OHs on

More information

School of EECS Washington State University. Artificial Intelligence

School of EECS Washington State University. Artificial Intelligence School of EECS Washington State University Artificial Intelligence 1 } Full joint probability distribution Can answer any query But typically too large } Conditional independence Can reduce the number

More information

Introduction to Bayesian Learning

Introduction to Bayesian Learning Course Information Introduction Introduction to Bayesian Learning Davide Bacciu Dipartimento di Informatica Università di Pisa bacciu@di.unipi.it Apprendimento Automatico: Fondamenti - A.A. 2016/2017 Outline

More information

COMP5211 Lecture Note on Reasoning under Uncertainty

COMP5211 Lecture Note on Reasoning under Uncertainty COMP5211 Lecture Note on Reasoning under Uncertainty Fangzhen Lin Department of Computer Science and Engineering Hong Kong University of Science and Technology Fangzhen Lin (HKUST) Uncertainty 1 / 33 Uncertainty

More information

Reasoning Under Uncertainty

Reasoning Under Uncertainty Reasoning Under Uncertainty Introduction Representing uncertain knowledge: logic and probability (a reminder!) Probabilistic inference using the joint probability distribution Bayesian networks The Importance

More information

Brief Intro. to Bayesian Networks. Extracted and modified from four lectures in Intro to AI, Spring 2008 Paul S. Rosenbloom

Brief Intro. to Bayesian Networks. Extracted and modified from four lectures in Intro to AI, Spring 2008 Paul S. Rosenbloom Brief Intro. to Bayesian Networks Extracted and modified from four lectures in Intro to AI, Spring 2008 Paul S. Rosenbloom Factor Graph Review Tame combinatorics in many calculations Decoding codes (origin

More information

Learning with Probabilities

Learning with Probabilities Learning with Probabilities CS194-10 Fall 2011 Lecture 15 CS194-10 Fall 2011 Lecture 15 1 Outline Bayesian learning eliminates arbitrary loss functions and regularizers facilitates incorporation of prior

More information

UNCERTAINTY. In which we see what an agent should do when not all is crystal-clear.

UNCERTAINTY. In which we see what an agent should do when not all is crystal-clear. UNCERTAINTY In which we see what an agent should do when not all is crystal-clear. Outline Uncertainty Probabilistic Theory Axioms of Probability Probabilistic Reasoning Independency Bayes Rule Summary

More information

CS Belief networks. Chapter

CS Belief networks. Chapter CS 580 1 Belief networks Chapter 15.1 2 CS 580 2 Outline Conditional independence Bayesian networks: syntax and semantics Exact inference Approximate inference CS 580 3 Independence Two random variables

More information

Bayesian Belief Network

Bayesian Belief Network Bayesian Belief Network a! b) = a) b) toothache, catch, cavity, Weather = cloudy) = = Weather = cloudy) toothache, catch, cavity) The decomposition of large probabilistic domains into weakly connected

More information

COMP9414: Artificial Intelligence Reasoning Under Uncertainty

COMP9414: Artificial Intelligence Reasoning Under Uncertainty COMP9414, Monday 16 April, 2012 Reasoning Under Uncertainty 2 COMP9414: Artificial Intelligence Reasoning Under Uncertainty Overview Problems with Logical Approach What Do the Numbers Mean? Wayne Wobcke

More information

Uncertainty (Chapter 13, Russell & Norvig)

Uncertainty (Chapter 13, Russell & Norvig) Uncertainty (Chapter 13, Russell & Norvig) Introduction to Artificial Intelligence CS 150 Administration Midterm next Tuesday!!! I will try to find an old one to post. The MT will cover chapters 1-6, with

More information

Outline. CSE 573: Artificial Intelligence Autumn Bayes Nets: Big Picture. Bayes Net Semantics. Hidden Markov Models. Example Bayes Net: Car

Outline. CSE 573: Artificial Intelligence Autumn Bayes Nets: Big Picture. Bayes Net Semantics. Hidden Markov Models. Example Bayes Net: Car CSE 573: Artificial Intelligence Autumn 2012 Bayesian Networks Dan Weld Many slides adapted from Dan Klein, Stuart Russell, Andrew Moore & Luke Zettlemoyer Outline Probabilistic models (and inference)

More information

Bayesian Networks. Philipp Koehn. 6 April 2017

Bayesian Networks. Philipp Koehn. 6 April 2017 Bayesian Networks Philipp Koehn 6 April 2017 Outline 1 Bayesian Networks Parameterized distributions Exact inference Approximate inference 2 bayesian networks Bayesian Networks 3 A simple, graphical notation

More information

Introduction to Artificial Intelligence Belief networks

Introduction to Artificial Intelligence Belief networks Introduction to Artificial Intelligence Belief networks Chapter 15.1 2 Dieter Fox Based on AIMA Slides c S. Russell and P. Norvig, 1998 Chapter 15.1 2 0-0 Outline Bayesian networks: syntax and semantics

More information

Uncertainty. Russell & Norvig Chapter 13.

Uncertainty. Russell & Norvig Chapter 13. Uncertainty Russell & Norvig Chapter 13 http://toonut.com/wp-content/uploads/2011/12/69wp.jpg Uncertainty Let A t be the action of leaving for the airport t minutes before your flight Will A t get you

More information

Bayesian networks. Chapter Chapter

Bayesian networks. Chapter Chapter Bayesian networks Chapter 14.1 3 Chapter 14.1 3 1 Outline Syntax Semantics Parameterized distributions Chapter 14.1 3 2 Bayesian networks A simple, graphical notation for conditional independence assertions

More information

CS 2750: Machine Learning. Bayesian Networks. Prof. Adriana Kovashka University of Pittsburgh March 14, 2016

CS 2750: Machine Learning. Bayesian Networks. Prof. Adriana Kovashka University of Pittsburgh March 14, 2016 CS 2750: Machine Learning Bayesian Networks Prof. Adriana Kovashka University of Pittsburgh March 14, 2016 Plan for today and next week Today and next time: Bayesian networks (Bishop Sec. 8.1) Conditional

More information

Bayes Nets: Independence

Bayes Nets: Independence Bayes Nets: Independence [These slides were created by Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley. All CS188 materials are available at http://ai.berkeley.edu.] Bayes Nets A Bayes

More information

Bayesian networks: Modeling

Bayesian networks: Modeling Bayesian networks: Modeling CS194-10 Fall 2011 Lecture 21 CS194-10 Fall 2011 Lecture 21 1 Outline Overview of Bayes nets Syntax and semantics Examples Compact conditional distributions CS194-10 Fall 2011

More information

Bayesian networks. Chapter AIMA2e Slides, Stuart Russell and Peter Norvig, Completed by Kazim Fouladi, Fall 2008 Chapter 14.

Bayesian networks. Chapter AIMA2e Slides, Stuart Russell and Peter Norvig, Completed by Kazim Fouladi, Fall 2008 Chapter 14. Bayesian networks Chapter 14.1 3 AIMA2e Slides, Stuart Russell and Peter Norvig, Completed by Kazim Fouladi, Fall 2008 Chapter 14.1 3 1 Outline Syntax Semantics Parameterized distributions AIMA2e Slides,

More information

CMPSCI 240: Reasoning about Uncertainty

CMPSCI 240: Reasoning about Uncertainty CMPSCI 240: Reasoning about Uncertainty Lecture 17: Representing Joint PMFs and Bayesian Networks Andrew McGregor University of Massachusetts Last Compiled: April 7, 2017 Warm Up: Joint distributions Recall

More information

13.4 INDEPENDENCE. 494 Chapter 13. Quantifying Uncertainty

13.4 INDEPENDENCE. 494 Chapter 13. Quantifying Uncertainty 494 Chapter 13. Quantifying Uncertainty table. In a realistic problem we could easily have n>100, makingo(2 n ) impractical. The full joint distribution in tabular form is just not a practical tool for

More information

Reasoning with Uncertainty. Chapter 13

Reasoning with Uncertainty. Chapter 13 Reasoning with Uncertainty Chapter 13 1 Outline Uncertainty Probability Syntax and Semantics Inference Independence and Bayes Rule 2 The real world is an uncertain place... Example: I need a plan that

More information

From inductive inference to machine learning

From inductive inference to machine learning From inductive inference to machine learning ADAPTED FROM AIMA SLIDES Russel&Norvig:Artificial Intelligence: a modern approach AIMA: Inductive inference AIMA: Inductive inference 1 Outline Bayesian inferences

More information

Foundations of Artificial Intelligence

Foundations of Artificial Intelligence Foundations of Artificial Intelligence 12. Making Simple Decisions under Uncertainty Probability Theory, Bayesian Networks, Other Approaches Wolfram Burgard, Maren Bennewitz, and Marco Ragni Albert-Ludwigs-Universität

More information

Bayesian Networks. Philipp Koehn. 29 October 2015

Bayesian Networks. Philipp Koehn. 29 October 2015 Bayesian Networks Philipp Koehn 29 October 2015 Outline 1 Bayesian Networks Parameterized distributions Exact inference Approximate inference 2 bayesian networks Bayesian Networks 3 A simple, graphical

More information

ARTIFICIAL INTELLIGENCE. Uncertainty: probabilistic reasoning

ARTIFICIAL INTELLIGENCE. Uncertainty: probabilistic reasoning INFOB2KI 2017-2018 Utrecht University The Netherlands ARTIFICIAL INTELLIGENCE Uncertainty: probabilistic reasoning Lecturer: Silja Renooij These slides are part of the INFOB2KI Course Notes available from

More information

Bayesian networks. Chapter Chapter Outline. Syntax Semantics Parameterized distributions. Chapter

Bayesian networks. Chapter Chapter Outline. Syntax Semantics Parameterized distributions. Chapter Bayesian networks Chapter 14.1 3 Chapter 14.1 3 1 Outline Syntax Semantics Parameterized distributions Chapter 14.1 3 2 Bayesian networks A simple, graphical notation for conditional independence assertions

More information

CS 188: Artificial Intelligence Fall 2009

CS 188: Artificial Intelligence Fall 2009 CS 188: Artificial Intelligence Fall 2009 Lecture 13: Probability 10/8/2009 Dan Klein UC Berkeley 1 Announcements Upcoming P3 Due 10/12 W2 Due 10/15 Midterm in evening of 10/22 Review sessions: Probability

More information

Reasoning Under Uncertainty

Reasoning Under Uncertainty Reasoning Under Uncertainty Introduction Representing uncertain knowledge: logic and probability (a reminder!) Probabilistic inference using the joint probability distribution Bayesian networks (theory

More information

Another look at Bayesian. inference

Another look at Bayesian. inference Another look at Bayesian A general scenario: - Query variables: X inference - Evidence (observed) variables and their values: E = e - Unobserved variables: Y Inference problem: answer questions about the

More information