Probabilistic Reasoning. Kee-Eung Kim, KAIST Computer Science
1 Probabilistic Reasoning Kee-Eung Kim KAIST Computer Science
2 Outline #1
- Acting under uncertainty
- Probabilities
- Inference with probabilities
- Independence and Bayes rule
- Bayesian networks
- Inference in Bayesian networks
3 Uncertainty
Let action A_t = leave for airport t minutes before flight. Will A_t get me there on time?
Problems:
- Partial observability (road state, other drivers' plans, etc.)
- Noisy sensors (often-wrong traffic reports)
- Uncertainty in action outcomes (flat tire, etc.)
- Immense complexity of modeling and predicting traffic
A logical agent faced with such a problem has to handle uncertain knowledge: the agent's knowledge can at best provide only a degree of belief in the relevant sentences (probability theory is your friend).
4 Probability
Probability provides a way of summarizing the uncertainty that comes from:
- Laziness: failure to enumerate all exceptions, qualifications, etc.
- Ignorance: lack of relevant facts, initial conditions, etc.
Degree of belief vs. degree of truth: a probability of 0.8 does not mean 80% true (that would be degree of truth, as in fuzzy logic); rather, it means an 80% degree of belief.
Making decisions under uncertainty: which action to choose? Depends on my preferences.
- Utility theory is used to represent and infer preferences
- Decision theory = utility theory + probability theory
5 Basic Probability
Ω: the sample space, e.g., the 6 possible outcomes from a roll of a die.
ω ∈ Ω is a sample point / possible world / atomic event.
A probability space or probability model is a sample space with an assignment P(ω) for every ω ∈ Ω such that 0 ≤ P(ω) ≤ 1 and Σ_ω P(ω) = 1.
An event A ⊆ Ω: P(A) = Σ_{ω ∈ A} P(ω).
A random variable is a function from sample points to some range. P induces a probability distribution for any random variable X: P(X = x_i) = Σ_{ω : X(ω) = x_i} P(ω).
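To make these definitions concrete, here is a minimal Python sketch (not from the slides) of a probability space for a fair die, an event, and the distribution a random variable induces:

```python
from fractions import Fraction

# Sample space: outcomes of one roll of a fair die, each with P(omega) = 1/6
P = {omega: Fraction(1, 6) for omega in range(1, 7)}
assert sum(P.values()) == 1

# An event is a subset of the sample space: P(A) = sum of P(omega) over A
def prob(event):
    return sum(P[omega] for omega in event)

print(prob({2, 4, 6}))  # P(roll is even) = 1/2

# A random variable maps sample points to a range; it induces a distribution
X = lambda omega: omega % 2          # X = parity of the roll
dist = {}
for omega, p in P.items():
    dist[X(omega)] = dist.get(X(omega), 0) + p
print(dist)                          # parity 0 and parity 1 each get 1/2
```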
6 Prior Probability
Prior or unconditional probabilities of propositions correspond to belief prior to the arrival of any (new) evidence.
E.g., P(Cavity = true) = 0.1 and P(Weather = sunny) = 0.72.
A probability distribution gives values for all possible assignments: P(Weather) = [0.72, 0.1, 0.08, 0.1].
Joint probability distribution for a set of random variables: P(Weather, Cavity) is a 4 × 2 matrix of values, indexed by Weather ∈ {sunny, rain, cloudy, snow} and Cavity ∈ {true, false}.
Every question about a domain can be answered by the joint distribution, because every event is a sum of sample points.
7 Conditional Probability
Conditional or posterior probabilities, e.g., P(cavity | toothache) = 0.8: given that toothache is all I know — NOT "if toothache then 80% chance of cavity".
Implementation: P(Cavity | Toothache) = a 2-element vector of 2-element vectors.
If we know more: P(cavity | toothache, cavity) = 1.
New evidence may be irrelevant: P(cavity | toothache, rain) = P(cavity | toothache) = 0.8. Domain knowledge is necessary for this kind of inference.
Useful rules:
- Product rule: P(a ∧ b) = P(a | b) P(b) = P(b | a) P(a)
- Chain rule: P(X_1, …, X_n) = P(X_1, …, X_{n-1}) P(X_n | X_1, …, X_{n-1}) = Π_i P(X_i | X_1, …, X_{i-1})
8 Inference by Enumeration
Given the full joint distribution over Toothache, Catch, and Cavity: a 2 × 2 × 2 table indexed by toothache / ¬toothache, catch / ¬catch, and cavity / ¬cavity.
For any proposition, sum the atomic events where it holds: P(cavity ∨ toothache) = ? P(¬cavity | toothache) = ?
P(Cavity | toothache) = α P(Cavity, toothache) = α [P(Cavity, toothache, catch) + P(Cavity, toothache, ¬catch)], where α is the normalization constant.
Compute the distribution on the query variable by fixing the evidence variables and summing over the hidden variables.
9 Inference by Enumeration
Let X be the set of all random variables. Typically we want the posterior joint distribution of the query variables Y, given specific values e for the evidence variables E.
Let the hidden variables be H = X − Y − E. Then
P(Y | E = e) = α P(Y, E = e) = α Σ_h P(Y, E = e, H = h)
Obvious problems:
- Worst-case time complexity O(d^n), where d is the largest arity
- Space complexity O(d^n) to store the joint distribution
- How to find the numbers for O(d^n) entries??
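A minimal sketch (not from the slides) of this procedure over a full joint table; the joint values below are the standard textbook numbers for the dental example, assumed here for illustration:

```python
# Full joint P(Toothache, Catch, Cavity); the eight entries are the usual
# dental-example values (assumed for illustration).
joint = {
    (True,  True,  True):  0.108, (True,  False, True):  0.012,
    (True,  True,  False): 0.016, (True,  False, False): 0.064,
    (False, True,  True):  0.072, (False, False, True):  0.008,
    (False, True,  False): 0.144, (False, False, False): 0.576,
}
VARS = ("toothache", "catch", "cavity")

def enumerate_query(query_var, evidence):
    """P(query_var | evidence): fix evidence, sum out hidden variables."""
    dist = {}
    for world, p in joint.items():
        w = dict(zip(VARS, world))
        if all(w[v] == val for v, val in evidence.items()):
            dist[w[query_var]] = dist.get(w[query_var], 0.0) + p
    z = sum(dist.values())               # alpha = 1/z normalizes
    return {v: p / z for v, p in dist.items()}

print(enumerate_query("cavity", {"toothache": True}))
# {True: 0.6, False: 0.4}  -> P(cavity | toothache) = 0.6
```

Note that the normalization constant α lets us skip computing P(E = e) explicitly: we just rescale the summed entries so they add to 1.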
10 Independence
A and B are independent iff P(A | B) = P(A), or P(B | A) = P(B), or P(A, B) = P(A) P(B).
E.g., P(Toothache, Catch, Cavity, Weather) = P(Toothache, Catch, Cavity) P(Weather): 32 entries reduced to 12; for n independent biased coins, 2^n → n.
Absolute independence is powerful but rare. Dentistry is a large field with hundreds of variables, none of which are independent. What to do?
11 Conditional Independence
P(Toothache, Cavity, Catch) has 2³ − 1 = 7 independent entries.
However, if I have a cavity, the probability that the probe catches in it doesn't depend on whether I have a toothache: P(catch | toothache, cavity) = P(catch | cavity).
The same independence holds if I don't have a cavity: P(catch | toothache, ¬cavity) = P(catch | ¬cavity).
Catch is conditionally independent of Toothache given Cavity: P(Catch | Toothache, Cavity) = P(Catch | Cavity).
Equivalent statements:
- P(Toothache | Catch, Cavity) = P(Toothache | Cavity)
- P(Toothache, Catch | Cavity) = P(Toothache | Cavity) P(Catch | Cavity)
Then P(Toothache, Cavity, Catch) = P(Toothache | Catch, Cavity) P(Catch | Cavity) P(Cavity) = P(Toothache | Cavity) P(Catch | Cavity) P(Cavity) ⇒ 2 + 2 + 1 = 5 independent entries.
Conditional independence is our most basic and robust form of knowledge about uncertain environments.
12 Bayes Rule
From the product rule P(a ∧ b) = P(a | b) P(b) = P(b | a) P(a), we obtain Bayes' rule (written out below).
Useful for computing diagnostic probability from causal probability.
Extreme conditional independence: the naïve Bayes model.
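The rule, its distribution form, and the two uses the slide names, written out (a reconstruction of the standard formulas, since the slide's equations were displayed as images):

```latex
P(a \mid b) = \frac{P(b \mid a)\,P(a)}{P(b)},
\qquad
\mathbf{P}(Y \mid X) = \frac{\mathbf{P}(X \mid Y)\,\mathbf{P}(Y)}{\mathbf{P}(X)}
  = \alpha\,\mathbf{P}(X \mid Y)\,\mathbf{P}(Y)

P(\mathrm{cause} \mid \mathrm{effect})
  = \frac{P(\mathrm{effect} \mid \mathrm{cause})\,P(\mathrm{cause})}{P(\mathrm{effect})}
\qquad \text{(diagnostic from causal)}

P(\mathrm{Cause}, \mathrm{Effect}_1, \dots, \mathrm{Effect}_n)
  = P(\mathrm{Cause}) \prod_i P(\mathrm{Effect}_i \mid \mathrm{Cause})
\qquad \text{(na\"ive Bayes)}
```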
13 Bayesian Networks (= Graphical Models)
A simple, graphical notation for conditional independence assertions, and hence for compact specification of full joint distributions.
Syntax:
- A set of nodes, one per random variable
- A directed, acyclic graph (link ≈ "directly influences")
- A conditional distribution for each node given its parents
The conditional distribution is represented as a conditional probability table (CPT), giving the distribution over X_i for each combination of parent values.
14 Example: The Alarm Network
I have a burglar alarm installed, but it also responds on occasion to earthquakes. My neighbors John and Mary promise to call me at work when they hear the alarm. John calls to say the alarm is ringing, but Mary doesn't. Is there a burglar?
Variables: Burglary, Earthquake, Alarm, JohnCalls, MaryCalls.
Knowledge to be encoded into the topology:
- A burglar can set the alarm off
- An earthquake can set the alarm off
- The alarm can cause Mary to call
- The alarm can cause John to call
15 Semantics of Bayesian Networks
Global semantics defines the full joint distribution as the product of the local conditional distributions (see the formula below).
E.g., P(j ∧ m ∧ a ∧ ¬b ∧ ¬e) = P(j | a) P(m | a) P(a | ¬b, ¬e) P(¬b) P(¬e).
Local semantics: each node is conditionally independent of its non-descendants given its parents.
Mental exercise: derive the global semantics from the local semantics and vice versa. The global semantics and local semantics are equivalent.
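The general form of the global semantics, stated explicitly (the standard factorization that the alarm example above instantiates):

```latex
P(x_1, \dots, x_n) \;=\; \prod_{i=1}^{n} P\bigl(x_i \mid \mathit{parents}(X_i)\bigr)
```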
16 Constructing Bayesian Networks
Choose an ordering of the random variables: MaryCalls, JohnCalls, Alarm, Burglary, Earthquake.
- P(JohnCalls | MaryCalls) = P(JohnCalls)? No
- P(Alarm | JohnCalls, MaryCalls) = P(Alarm | JohnCalls)? No
- P(Alarm | JohnCalls, MaryCalls) = P(Alarm)? No
- P(Burglary | Alarm, JohnCalls, MaryCalls) = P(Burglary | Alarm)? Yes
- P(Burglary | Alarm, JohnCalls, MaryCalls) = P(Burglary)? No
- P(Earthquake | Burglary, Alarm, JohnCalls, MaryCalls) = P(Earthquake | Alarm)? No (!!)
- P(Earthquake | Burglary, Alarm, JohnCalls, MaryCalls) = P(Earthquake | Burglary, Alarm)? Yes
17 Constructing Bayesian Networks
1. Choose an ordering of variables X_1, …, X_n
2. For i = 1 to n: add X_i to the network and select parents from X_1, …, X_{i-1} such that P(X_i | Parents(X_i)) = P(X_i | X_1, …, X_{i-1})
This choice of parents guarantees the global semantics: P(X_1, …, X_n) = Π_i P(X_i | X_1, …, X_{i-1}) (by the chain rule) = Π_i P(X_i | Parents(X_i)) (by construction).
18 Inference Example: Car Diagnosis
Initial evidence: the car won't start.
In the network figure, testable (observed) variables are shown in green; "broken, so fix it" (hypothesis) variables in orange; hidden variables in gray.
19 Inference by Enumeration in BN
Simple query on the alarm network: given that John called and Mary called, what is the probability of a burglary?
P(B | j, m) = P(B, j, m) / P(j, m) = α P(B, j, m) = α Σ_e Σ_a P(B, e, a, j, m)
Rewrite the full joint probability using CPT entries:
P(B | j, m) = α Σ_e Σ_a P(B) P(e) P(a | B, e) P(j | a) P(m | a) = α P(B) Σ_e P(e) Σ_a P(a | B, e) P(j | a) P(m | a)
Recursive depth-first enumeration: O(n) space, O(d^n) time.
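A minimal sketch (not from the slides) of this query; the CPT numbers are the usual textbook values for the alarm network, assumed here for illustration:

```python
# CPTs for the alarm network (standard textbook values, assumed here).
P_B = {True: 0.001, False: 0.999}
P_E = {True: 0.002, False: 0.998}
P_A = {(True, True): 0.95, (True, False): 0.94,
       (False, True): 0.29, (False, False): 0.001}   # P(a=true | B, E)
P_J = {True: 0.90, False: 0.05}                      # P(j=true | A)
P_M = {True: 0.70, False: 0.01}                      # P(m=true | A)

def p_b_jm(b):
    """Unnormalized P(B=b, j, m) = P(b) sum_e P(e) sum_a P(a|b,e) P(j|a) P(m|a)."""
    total = 0.0
    for e in (True, False):
        for a in (True, False):
            pa = P_A[(b, e)] if a else 1 - P_A[(b, e)]
            total += P_E[e] * pa * P_J[a] * P_M[a]
    return P_B[b] * total

unnorm = {b: p_b_jm(b) for b in (True, False)}
z = sum(unnorm.values())
print({b: p / z for b, p in unnorm.items()})  # P(b | j, m) is about 0.284
```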
20 Evaluation Tree
Evaluation tree of P(B) Σ_e P(e) Σ_a P(a | B, e) P(j | a) P(m | a).
Enumeration is inefficient due to repeated computation: it computes P(j | a) P(m | a) for each value of e.
21 Inference by Variable Elimination
Variable elimination: carry out the summations right-to-left, storing intermediate results (factors) to avoid recomputation. In the alarm query, the intermediate factors range from 2-element vectors to 2 × 2 matrices.
22 Basic Operations for Variable Elimination
Summing out a variable from a product of factors:
- Move any constant factors outside the summation
- Add up the sub-matrices in the pointwise product of the remaining factors
Assuming f_1, …, f_i do not depend on X: Σ_x f_1 × … × f_n = f_1 × … × f_i × Σ_x (f_{i+1} × … × f_n).
Pointwise product of factors f_1 and f_2: e.g., f_1(A, B) × f_2(B, C) yields a factor f(A, B, C) with f(a, b, c) = f_1(a, b) f_2(b, c).
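A minimal sketch (not from the slides) of the two factor operations, with a factor stored as a tuple of variable names plus a table from assignment tuples to numbers; the example factor values are made up for illustration:

```python
from itertools import product

def pointwise_product(f1, f2):
    """Multiply two factors, aligning them on their shared variables."""
    (v1, t1), (v2, t2) = f1, f2
    vs = v1 + tuple(v for v in v2 if v not in v1)    # union of variables
    table = {}
    for asg in product((True, False), repeat=len(vs)):
        a = dict(zip(vs, asg))
        table[asg] = (t1[tuple(a[v] for v in v1)]
                      * t2[tuple(a[v] for v in v2)])
    return vs, table

def sum_out(var, factor):
    """Eliminate var by adding the sub-tables for var=true and var=false."""
    vs, t = factor
    i = vs.index(var)
    keep = vs[:i] + vs[i + 1:]
    table = {}
    for asg, val in t.items():
        key = asg[:i] + asg[i + 1:]
        table[key] = table.get(key, 0.0) + val
    return keep, table

# E.g., sum B out of the pointwise product f1(A, B) x f2(B, C):
f1 = (("A", "B"), {(True, True): 0.3, (True, False): 0.7,
                   (False, True): 0.9, (False, False): 0.1})
f2 = (("B", "C"), {(True, True): 0.2, (True, False): 0.8,
                   (False, True): 0.6, (False, False): 0.4})
print(sum_out("B", pointwise_product(f1, f2)))
```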
23 Irrelevant Variables
Consider the query P(JohnCalls | Burglary = true):
P(J | b) = α P(b) Σ_e P(e) Σ_a P(a | b, e) P(J | a) Σ_m P(m | a)
The sum over m is identically 1 (M is irrelevant to the query).
Theorem: Y is irrelevant unless Y ∈ Ancestors({X} ∪ E), where {X} is the set of query nodes and E the evidence. Here X = JohnCalls, E = {Burglary}, and Ancestors({X} ∪ E) = {Alarm, Earthquake}, so MaryCalls is irrelevant.
Remember the backward chaining algorithm?
24 Complexity of Exact Inference
Singly connected networks (or polytrees): any two nodes are connected by at most one path; the time and space cost of variable elimination is O(d^k n).
Multiply connected networks: we can reduce the 3-SAT problem to inference ⇒ NP-hard. To be exact, the problem is as hard as counting the satisfying assignments of a 3-SAT formula ⇒ #P-hard.
25 Summary
- Uncertainty arises from laziness and ignorance; it is inherent in complex, dynamic, or inaccessible worlds
- Probabilities express the agent's inability to reach a definite decision regarding the truth of a sentence; they summarize the agent's beliefs
- Prior probabilities, conditional probabilities, full joint probability distributions
- Bayesian networks represent conditional independence relationships in the domain in a concise way; the topology of the network encodes conditional independence
- Exact inference by variable elimination: polytime on polytrees, NP-hard on general graphs; very sensitive to topology
26 Bayesian Learning
27 Outline #2
- Bayesian learning
- Maximum a posteriori and maximum likelihood learning
- Bayes net learning
- ML parameter learning with complete data
- Linear regression
- ML parameter learning with hidden variables (the EM algorithm)
28 Full Bayesian Learning
View learning as Bayesian updating of a probability distribution over the hypothesis space.
Given H = {h_1, h_2, …} and a prior distribution P(H), the j-th observation d_j is the outcome of random variable D_j; the training data is d = d_1, …, d_N.
Given the data so far, each hypothesis has a posterior probability P(h_i | d) = α P(d | h_i) P(h_i), where P(d | h_i) is called the likelihood.
Predictions use a likelihood-weighted average over the hypotheses: P(X | d) = Σ_i P(X | d, h_i) P(h_i | d) = Σ_i P(X | h_i) P(h_i | d).
No need to pick one best-guess hypothesis!!!
29 Example
Suppose there are five kinds of bags of candies:
- 10% are h_1: 100% cherry candies
- 20% are h_2: 75% cherry candies + 25% lime candies
- 40% are h_3: 50% cherry candies + 50% lime candies
- 20% are h_4: 25% cherry candies + 75% lime candies
- 10% are h_5: 100% lime candies
Then we observe candies drawn from some bag. What kind of bag is it? What flavour will the next candy be?
30 Posterior/Prediction Prob. of Hypotheses
Posterior probability: P(h_i | d) = α P(d | h_i) P(h_i)
Prediction probability: P(X | d) = Σ_i P(X | h_i) P(h_i | d)
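A minimal sketch (not from the slides) of these two formulas on the candy example: updating the posterior over the five hypotheses as lime candies are observed, and predicting the next flavor:

```python
priors = [0.1, 0.2, 0.4, 0.2, 0.1]          # P(h1..h5) from the slide
p_lime = [0.0, 0.25, 0.5, 0.75, 1.0]        # P(lime | h_i)

def posterior(n_limes):
    """P(h_i | d) after observing n_limes lime candies (i.i.d. draws)."""
    unnorm = [p * (q ** n_limes) for p, q in zip(priors, p_lime)]
    z = sum(unnorm)
    return [u / z for u in unnorm]

def predict_lime(n_limes):
    """P(next candy is lime | d) = sum_i P(lime | h_i) P(h_i | d)."""
    return sum(q * w for q, w in zip(p_lime, posterior(n_limes)))

print(posterior(10))      # posterior concentrates on h5 (all-lime bag)
print(predict_lime(10))   # prediction approaches 1.0
```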
31 MAP Approximation
Summing over the hypothesis space is often intractable: e.g., there are 2^{2^6} = 18,446,744,073,709,551,616 Boolean functions of 6 attributes.
Instead, make predictions based on a single most probable hypothesis: maximum a posteriori (MAP) learning chooses h_MAP maximizing P(h_i | d), i.e., maximizing P(d | h_i) P(h_i), or equivalently log P(d | h_i) + log P(h_i).
The log terms can be viewed as the (negative of the) bits to encode the data given the hypothesis plus the bits to encode the hypothesis. This is the basic idea of minimum description length (MDL) learning.
For deterministic hypotheses, P(d | h_i) is 1 if consistent, 0 otherwise ⇒ MAP = simplest consistent hypothesis.
32 ML Approximation
For large data sets, the prior becomes irrelevant: the bits to encode the data given the hypothesis dominate the bits to encode the hypothesis.
Maximum likelihood (ML) learning: choose h_ML maximizing P(d | h_i), i.e., simply get the best fit to the data; identical to MAP for a uniform prior (which is reasonable if all hypotheses are of the same complexity).
ML is the standard (non-Bayesian) statistical learning method.
33 ML Parameter Learning in Bayes Nets
A bag from a new manufacturer; what fraction of the candies are cherry? Model it as a Bayesian network with one node, Flavor. The task is to learn P(Flavor = cherry) = θ. Any θ ∈ [0, 1] is possible: a continuum of hypotheses h_θ, where θ is a parameter for this simple (binomial) family of models.
Suppose we unwrap N candies: c cherries and ℓ = N − c limes. These are i.i.d. (independent, identically distributed) observations, so P(d | h_θ) = θ^c (1 − θ)^ℓ.
Maximize this w.r.t. θ, which is easier for the log-likelihood (see below).
However, given a small dataset, ML learning can assign zero probability to unobserved events.
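The maximization spelled out (the standard derivation, reconstructed since the slide's equations were displayed as images):

```latex
L(\theta) = \log P(\mathbf{d} \mid h_\theta) = c \log\theta + \ell \log(1-\theta)

\frac{dL}{d\theta} = \frac{c}{\theta} - \frac{\ell}{1-\theta} = 0
\quad\Longrightarrow\quad
\theta = \frac{c}{c+\ell} = \frac{c}{N}
```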
34 ML Learning with Multiple Parameters
Now the red/green wrapper depends probabilistically on the flavor, with parameters θ_1 = P(W = red | F = cherry) and θ_2 = P(W = red | F = lime).
Likelihood for, e.g., a cherry candy in a green wrapper: P(F = cherry, W = green | h_{θ,θ_1,θ_2}) = θ (1 − θ_1).
With N candies — r_c red-wrapped cherry candies, g_c green-wrapped cherries, r_ℓ red-wrapped limes, g_ℓ green-wrapped limes — the likelihood is P(d | h_{θ,θ_1,θ_2}) = θ^c (1−θ)^ℓ · θ_1^{r_c} (1−θ_1)^{g_c} · θ_2^{r_ℓ} (1−θ_2)^{g_ℓ}.
35 ML Learning with Multiple Parameters
Log likelihood: L = c log θ + ℓ log(1 − θ) + r_c log θ_1 + g_c log(1 − θ_1) + r_ℓ log θ_2 + g_ℓ log(1 − θ_2).
Each derivative of L contains only the relevant parameter, so setting them to zero gives θ = c/(c + ℓ), θ_1 = r_c/(r_c + g_c), θ_2 = r_ℓ/(r_ℓ + g_ℓ).
With complete data, the parameters can be learned separately.
36 Example: Linear Gaussian Model
y = θ_1 x + θ_2 plus Gaussian noise of fixed variance.
Maximizing P(y | x) w.r.t. the parameters θ_1 and θ_2 = minimizing the sum of squared errors E = Σ_j (y_j − (θ_1 x_j + θ_2))².
Minimizing the sum of squared errors gives the ML solution for a linear fit, assuming Gaussian noise of fixed variance.
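Why the two are equivalent (a short reconstruction of the standard step):

```latex
\log P(\mathbf{y} \mid \mathbf{x})
  = \sum_{j=1}^{N} \log\frac{1}{\sqrt{2\pi}\,\sigma}
    \;-\; \sum_{j=1}^{N} \frac{\bigl(y_j - (\theta_1 x_j + \theta_2)\bigr)^2}{2\sigma^2}
```

The first term is constant in θ_1, θ_2, so maximizing the log-likelihood is exactly minimizing the sum of squared errors.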
37 Learning Bayes Nets w. Hidden Variables
Candies come from either Bag 1 or Bag 2; in addition to Flavor and Wrapper, some candies have a Hole. The bag is not observed, so it is a hidden variable.
Parameters:
- θ: probability of a candy coming from Bag 1
- θ_F1, θ_F2: conditional probabilities of the flavor being cherry, given coming from Bag 1 or Bag 2
- θ_W1, θ_W2: conditional probabilities of the wrapper being red
- θ_H1, θ_H2: conditional probabilities of the candy having a hole
38 Learning Bayes Nets w. Hidden Variables
EM (expectation-maximization) algorithm: an iterative method for maximizing the likelihood in the presence of hidden variables.
Basic idea:
- Pretend that we know the parameters of the network and compute the posterior probabilities of the hidden variable (E-step)
- Then, using those probabilities, maximize the likelihood by adjusting the parameters (M-step)
- Repeat the two steps until convergence
Data: 1000 instances generated from a true model with parameters θ = 0.5, θ_F1 = θ_W1 = θ_H1 = 0.8, θ_F2 = θ_W2 = θ_H2 = 0.3. The data reduce to counts over the eight combinations of F ∈ {cherry, lime}, W ∈ {red, green}, and H ∈ {1, 0}.
39 (MAY SKIP) Learning Bayes Nets w. Hidden Variables
Start with (arbitrary) initial parameters.
For θ at iteration 1: calculate the expected count of candies from Bag 1 (E-step), then find the maximum-likelihood parameter θ^(1) (M-step). [Exercise]
40 (MAY SKIP) Learning Bayes Nets w. Hidden Variables
For θ_F1 at iteration 1: calculate the expected count of cherry candies from Bag 1 (E-step), then find the maximum-likelihood parameter θ_F1^(1) (M-step). [Exercise]
Compute the EM steps for the rest of the parameters, and repeat.
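The update equations these two exercise slides refer to, reconstructed in their standard form (a sketch; here θ_1 = θ, θ_2 = 1 − θ, N is the number of candies, and (f_j, w_j, h_j) is the j-th observation):

```latex
\text{E-step:}\quad
P(\mathrm{Bag}{=}1 \mid f_j, w_j, h_j)
 = \frac{\theta\, P(f_j \mid \mathrm{Bag}{=}1)\, P(w_j \mid \mathrm{Bag}{=}1)\, P(h_j \mid \mathrm{Bag}{=}1)}
        {\sum_{i \in \{1,2\}} \theta_i\, P(f_j \mid \mathrm{Bag}{=}i)\, P(w_j \mid \mathrm{Bag}{=}i)\, P(h_j \mid \mathrm{Bag}{=}i)}

\text{M-step:}\quad
\theta^{(1)} = \frac{1}{N} \sum_{j=1}^{N} P(\mathrm{Bag}{=}1 \mid f_j, w_j, h_j),
\qquad
\theta_{F1}^{(1)} = \frac{\sum_{j:\, f_j = \mathrm{cherry}} P(\mathrm{Bag}{=}1 \mid f_j, w_j, h_j)}
                         {\sum_{j=1}^{N} P(\mathrm{Bag}{=}1 \mid f_j, w_j, h_j)}
```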
41 Markov Chain Monte Carlo (MCMC): A Universal Inference Algorithm