Decision Theory Intro: Preferences and Utility

Size: px
Start display at page:

Download "Decision Theory Intro: Preferences and Utility"

Transcription

1 Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29 March 22, 2006 Textbook 9.5 Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 1

2 Lecture Overview Recap Decision Theory Intro Preferences Utility Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 2

3 Markov chain A Markov chain is a special sort of belief network: S 0 S 1 S 2 S 3 S 4 Thus P (S t+1 S 0,..., S t ) = P (S t+1 S t ). The past is independent of the future given the present. A stationary Markov chain is when for all t > 0, t > 0, P (S t+1 S t ) = P (S t +1 S t ). We specify P (S0 ) and P (S t+1 S t ). Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 3

4 Hidden Markov Model A Hidden Markov Model (HMM) starts with a Markov chain, and adds a noisy observation about the state at each time step: S 0 S 1 S 2 S 3 S 4 O 0 O 1 O 2 O 3 O 4 P (S 0 ) specifies initial conditions P (S t+1 S t ) specifies the dynamics P (O t S t ) specifies the sensor model Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 4

5 Lecture Overview Recap Decision Theory Intro Preferences Utility Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 5

6 Decisions Under Uncertainty In the first part of the course we focused on decision making in domains where the environment was understood with certainty Search/CSPs: single decisions Planning: sequential decisions In uncertain domains, we ve so far only considered how to represent and update beliefs What if an agent has to make decisions in a domain that involves uncertainty? this is likely: one of the main reasons to represent the world probabilistically is to be able to use these beliefs as the basis for making decisions Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 6

7 Decisions Under Uncertainty An agent s decision will depend on: 1. what actions are available 2. what beliefs the agent has note: this replaces state from the deterministic setting 3. the agent s goals We ve spoken quite a lot about (1) and (2). today let s consider (3) we ll move from all-or-nothing goals to a richer notion: rating how happy the agent is in different situations Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 7

8 Lecture Overview Recap Decision Theory Intro Preferences Utility Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 8

9 Preferences Actions result in outcomes Agents have preferences over outcomes A rational agent will take the action that leads to the outcome which he most prefers Sometimes agents don t know the outcomes of the actions, but they still need to compare actions Agents have to act (doing nothing is (often) an action). Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 9

10 Preferences Over Outcomes If o 1 and o 2 are outcomes o 1 o 2 means o 1 is at least as desirable as o 2. read this as the agent weakly prefers o 1 to o 2 o 1 o 2 means o 1 o 2 and o 2 o 1. read this as the agent is indifferent between o 1 and o 2. o 1 o 2 means o 1 o 2 and o 2 o 1 read this as the agent strictly prefers o1 to o 2 Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 10

11 Lotteries An agent may not know the outcomes of his actions, but may instead only have a probability distribution over the outcomes. A lottery is a probability distribution over outcomes. It is written [p 1 : o 1, p 2 : o 2,..., p k : o k ] where the o i are outcomes and p i > 0 such that p i = 1 i The lottery specifies that outcome o i occurs with probability p i. We will consider lotteries to be outcomes. Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 11

12 Our Goal We want to reason about preferences mathematically To do this, we must give some rules that allow us to allow us to relate and transform expressions involving the symbols, and, as well as lotteries. Just as we did with probabilities, we will axiomatize preferences. These rules will allow us to derive consequences of preference statements In the end, one has to either accept these consequences or reject one of the axioms Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 12

13 Preference Axioms Completeness: A preference relationship must be defined between every pair of outcomes: o 1 o 2 o 1 o 2 or o 2 o 1 Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 13

14 Preference Axioms Transitivity: Preferences must be transitive: if o 1 o 2 and o 2 o 3 then o 1 o 3 This makes good sense: otherwise o 1 o 2 and o 2 o 3 and o 3 o 1. An agent should be prepared to pay some amount to swap between an outcome they prefer less and an outcome they prefer more Intransitive preferences mean we can construct a money pump! Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 14

15 Preference Axioms Monotonicity: An agent prefers a larger chance of getting a better outcome than a smaller chance: If o 1 o 2 and p > q then [p : o 1, 1 p : o 2 ] [q : o 1, 1 q : o 2 ] Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 15

16 Consequence of axioms Suppose o 1 o 2 and o 2 o 3. Consider whether the agent would prefer o 2 the lottery [p : o1, 1 p : o 3 ] for different values of p [0, 1]. You can plot which one is preferred as a function of p: o 2 - lottery Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 16

17 Preference Axioms Continuity: Suppose o 1 o 2 and o 2 o 3, then there exists a p [0, 1] such that o 2 [p : o 1, 1 p : o 3 ] Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 17

18 Preference Axioms Decomposability: ( no fun in gambling ). An agent is indifferent between lotteries that have same probabilities and outcomes. This includes lotteries over lotteries. For example: [p : o 1, 1 p : [q : o 2, 1 q : o 3 ]] [p : o 1, (1 p)q : o 2, (1 p)(1 q) : o 3 ] Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 18

19 Preference Axioms Substitutivity: if o 1 o 2 then the agent is indifferent between lotteries that only differ by o 1 and o 2 : [p : o 1, 1 p : o 3 ] [p : o 2, 1 p : o 3 ] Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 19

20 Lecture Overview Recap Decision Theory Intro Preferences Utility Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 20

21 What we would like We would like a measure of preference that can be combined with probabilities. So that value([p : o 1, 1 p : o 2 ]) = p value(o 1 ) + (1 p) value(o 2 ) Can we use money as this measure of preference? Would you you prefer $1, 000, 000 or [0.5 : $0, 0.5 : $2, 000, 000]? Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 21

22 What we would like We would like a measure of preference that can be combined with probabilities. So that value([p : o 1, 1 p : o 2 ]) = p value(o 1 ) + (1 p) value(o 2 ) Can we use money as this measure of preference? Would you you prefer $1, 000, 000 or [0.5 : $0, 0.5 : $2, 000, 000]? Money is not going to work. Let s invent an abstract concept called utility. Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 21

23 Utility as a function of money 1 Utility Risk averse Risk neutral Risk seeking 0 $0 $2,000,000 Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 22

24 Theorem Is it possible that preferences are too complex and muti-faceted to be represented by single numbers? If preferences satisfy the preceding axioms, then preferences can be measured by a function such that utility : outcomes [0, 1] 1. o 1 o 2 if and only if utility(o 1 ) utility(o2). 2. Utilities are linear with probabilities: utility([p 1 : o 1, p 2 : o 2,..., p k : o k ]) k = p i utility(o i ) i=1 Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 23

25 Proof Part 1: o 1 o 2 if and only if utility(o 1 ) utility(o2). If all outcomes are equally preferred, set utility(o i ) = 0 for all outcomes o i. Otherwise, suppose the best outcome is best and the worst outcome is worst. For any outcome o i, define utility(o i ) to be the number u i such that o i [u i : best, 1 u i : worst] This exists by the Continuity property. Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 24

26 Proof (cont.) Part 1: o 1 o 2 if and only if utility(o 1 ) utility(o2). Suppose o 1 o 2 and utility(o i ) = u i, then by Substitutivity, [u 1 : best, 1 u 1 : worst] [u 2 : best, 1 u 2 : worst] Which, by completeness and monotonicity implies u 1 u 2. Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 25

27 Proof (cont.) Part 2: utility([p 1 : o 1,..., p k : o k ]) = k i=1 p i utility(o i ) Suppose p = utility([p 1 : o 1, p 2 : o 2,..., p k : o k ]). Suppose utility(o i ) = u i. We know: o i [u i : best, 1 u i : worst] By substitutivity, we can replace each o i by [u i : best, 1 u i : worst], so p = utility( [ p 1 : [u 1 : best, 1 u 1 : worst]... p k : [u k : best, 1 u k : worst]]) Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 26

28 Proof (cont.) Part 2: utility([p 1 : o 1,..., p k : o k ]) = k i=1 p i utility(o i ) By decomposability, this is equivalent to: p = utility( [ p 1 u p k u k : best, Thus, by definition of utility, p 1 (1 u 1 ) + + p k (1 u k ) : worst]]) p = p 1 u p k u k Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 27

Reasoning Under Uncertainty: Introduction to Probability

Reasoning Under Uncertainty: Introduction to Probability Reasoning Under Uncertainty: Introduction to Probability CPSC 322 Uncertainty 1 Textbook 6.1 Reasoning Under Uncertainty: Introduction to Probability CPSC 322 Uncertainty 1, Slide 1 Lecture Overview 1

More information

Reasoning Under Uncertainty: Introduction to Probability

Reasoning Under Uncertainty: Introduction to Probability Reasoning Under Uncertainty: Introduction to Probability CPSC 322 Lecture 23 March 12, 2007 Textbook 9 Reasoning Under Uncertainty: Introduction to Probability CPSC 322 Lecture 23, Slide 1 Lecture Overview

More information

Decision Theory: Markov Decision Processes

Decision Theory: Markov Decision Processes Decision Theory: Markov Decision Processes CPSC 322 Lecture 33 March 31, 2006 Textbook 12.5 Decision Theory: Markov Decision Processes CPSC 322 Lecture 33, Slide 1 Lecture Overview Recap Rewards and Policies

More information

Evaluation for Pacman. CS 188: Artificial Intelligence Fall Iterative Deepening. α-β Pruning Example. α-β Pruning Pseudocode.

Evaluation for Pacman. CS 188: Artificial Intelligence Fall Iterative Deepening. α-β Pruning Example. α-β Pruning Pseudocode. CS 188: Artificial Intelligence Fall 2008 Evaluation for Pacman Lecture 7: Expectimax Search 9/18/2008 [DEMO: thrashing, smart ghosts] Dan Klein UC Berkeley Many slides over the course adapted from either

More information

CS 188: Artificial Intelligence Fall 2008

CS 188: Artificial Intelligence Fall 2008 CS 188: Artificial Intelligence Fall 2008 Lecture 7: Expectimax Search 9/18/2008 Dan Klein UC Berkeley Many slides over the course adapted from either Stuart Russell or Andrew Moore 1 1 Evaluation for

More information

Introduction to Artificial Intelligence (AI)

Introduction to Artificial Intelligence (AI) Introduction to Artificial Intelligence (AI) Computer Science cpsc502, Lecture 9 Oct, 11, 2011 Slide credit Approx. Inference : S. Thrun, P, Norvig, D. Klein CPSC 502, Lecture 9 Slide 1 Today Oct 11 Bayesian

More information

Lecture 14: Introduction to Decision Making

Lecture 14: Introduction to Decision Making Lecture 14: Introduction to Decision Making Preferences Utility functions Maximizing exected utility Value of information Actions and consequences So far, we have focused on ways of modeling a stochastic,

More information

Introduction to Artificial Intelligence (AI)

Introduction to Artificial Intelligence (AI) Introduction to Artificial Intelligence (AI) Computer Science cpsc502, Lecture 10 Oct, 13, 2011 CPSC 502, Lecture 10 Slide 1 Today Oct 13 Inference in HMMs More on Robot Localization CPSC 502, Lecture

More information

Decision Theory: Q-Learning

Decision Theory: Q-Learning Decision Theory: Q-Learning CPSC 322 Decision Theory 5 Textbook 12.5 Decision Theory: Q-Learning CPSC 322 Decision Theory 5, Slide 1 Lecture Overview 1 Recap 2 Asynchronous Value Iteration 3 Q-Learning

More information

Reasoning Under Uncertainty: Bayesian networks intro

Reasoning Under Uncertainty: Bayesian networks intro Reasoning Under Uncertainty: Bayesian networks intro Alan Mackworth UBC CS 322 Uncertainty 4 March 18, 2013 Textbook 6.3, 6.3.1, 6.5, 6.5.1, 6.5.2 Lecture Overview Recap: marginal and conditional independence

More information

Reasoning under Uncertainty: Intro to Probability

Reasoning under Uncertainty: Intro to Probability Reasoning under Uncertainty: Intro to Probability Computer Science cpsc322, Lecture 24 (Textbook Chpt 6.1, 6.1.1) March, 15, 2010 CPSC 322, Lecture 24 Slide 1 To complete your Learning about Logics Review

More information

Last update: April 15, Rational decisions. CMSC 421: Chapter 16. CMSC 421: Chapter 16 1

Last update: April 15, Rational decisions. CMSC 421: Chapter 16. CMSC 421: Chapter 16 1 Last update: April 15, 2010 Rational decisions CMSC 421: Chapter 16 CMSC 421: Chapter 16 1 Outline Rational preferences Utilities Money Multiattribute utilities Decision networks Value of information CMSC

More information

Computer Science CPSC 322. Lecture 23 Planning Under Uncertainty and Decision Networks

Computer Science CPSC 322. Lecture 23 Planning Under Uncertainty and Decision Networks Computer Science CPSC 322 Lecture 23 Planning Under Uncertainty and Decision Networks 1 Announcements Final exam Mon, Dec. 18, 12noon Same general format as midterm Part short questions, part longer problems

More information

Probability and Time: Hidden Markov Models (HMMs)

Probability and Time: Hidden Markov Models (HMMs) Probability and Time: Hidden Markov Models (HMMs) Computer Science cpsc322, Lecture 32 (Textbook Chpt 6.5.2) Nov, 25, 2013 CPSC 322, Lecture 32 Slide 1 Lecture Overview Recap Markov Models Markov Chain

More information

This corresponds to a within-subject experiment: see same subject make choices from different menus.

This corresponds to a within-subject experiment: see same subject make choices from different menus. Testing Revealed Preference Theory, I: Methodology The revealed preference theory developed last time applied to a single agent. This corresponds to a within-subject experiment: see same subject make choices

More information

Intelligent Systems (AI-2)

Intelligent Systems (AI-2) Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 11 Oct, 3, 2016 CPSC 422, Lecture 11 Slide 1 422 big picture: Where are we? Query Planning Deterministic Logics First Order Logics Ontologies

More information

CS 798: Multiagent Systems

CS 798: Multiagent Systems CS 798: Multiagent Systems and Utility Kate Larson Cheriton School of Computer Science University of Waterloo January 6, 2010 Outline 1 Self-Interested Agents 2 3 4 5 Self-Interested Agents We are interested

More information

Rational preferences. Rational decisions. Outline. Rational preferences contd. Maximizing expected utility. Preferences

Rational preferences. Rational decisions. Outline. Rational preferences contd. Maximizing expected utility. Preferences Rational preferences Rational decisions hapter 16 Idea: preferences of a rational agent must obey constraints. Rational preferences behavior describable as maximization of expected utility onstraints:

More information

1 Uncertainty and Insurance

1 Uncertainty and Insurance Uncertainty and Insurance Reading: Some fundamental basics are in Varians intermediate micro textbook (Chapter 2). A good (advanced, but still rather accessible) treatment is in Kreps A Course in Microeconomic

More information

Hidden Markov Models. Vibhav Gogate The University of Texas at Dallas

Hidden Markov Models. Vibhav Gogate The University of Texas at Dallas Hidden Markov Models Vibhav Gogate The University of Texas at Dallas Intro to AI (CS 4365) Many slides over the course adapted from either Dan Klein, Luke Zettlemoyer, Stuart Russell or Andrew Moore 1

More information

Basic Probability and Decisions

Basic Probability and Decisions Basic Probability and Decisions Chris Amato Northeastern University Some images and slides are used from: Rob Platt, CS188 UC Berkeley, AIMA Uncertainty Let action A t = leave for airport t minutes before

More information

Reasoning Under Uncertainty: Belief Network Inference

Reasoning Under Uncertainty: Belief Network Inference Reasoning Under Uncertainty: Belief Network Inference CPSC 322 Uncertainty 5 Textbook 10.4 Reasoning Under Uncertainty: Belief Network Inference CPSC 322 Uncertainty 5, Slide 1 Lecture Overview 1 Recap

More information

12735: Urban Systems Modeling. Loss and decisions. instructor: Matteo Pozzi. Lec : Urban Systems Modeling Lec. 11 Loss and decisions

12735: Urban Systems Modeling. Loss and decisions. instructor: Matteo Pozzi. Lec : Urban Systems Modeling Lec. 11 Loss and decisions 1735: Urban Systems Modeling Lec. 11 Loss and decisions instructor: Matteo Pozzi 1 outline introduction example of decision under uncertainty attitude toward risk principle of minimum expected loss Bayesian

More information

Intro Utility theory Utility & Loss Functions Value of Information Summary. Decision Theory. (CIV Machine Learning for Civil Engineers)

Intro Utility theory Utility & Loss Functions Value of Information Summary. Decision Theory. (CIV Machine Learning for Civil Engineers) Decision Theory (CIV654 - Machine Learning for Civil Engineers) Professor: James-A. Goulet Département des génies civil, géologique et des mines Chapter 14 Goulet (219). Probabilistic Machine Learning

More information

Choice under Uncertainty

Choice under Uncertainty In the Name of God Sharif University of Technology Graduate School of Management and Economics Microeconomics 2 44706 (1394-95 2 nd term) Group 2 Dr. S. Farshad Fatemi Chapter 6: Choice under Uncertainty

More information

Propositional Definite Clause Logic: Syntax, Semantics and Bottom-up Proofs

Propositional Definite Clause Logic: Syntax, Semantics and Bottom-up Proofs Propositional Definite Clause Logic: Syntax, Semantics and Bottom-up Proofs Computer Science cpsc322, Lecture 20 (Textbook Chpt 5.1.2-5.2.2 ) June, 6, 2017 CPSC 322, Lecture 20 Slide 1 Lecture Overview

More information

Computing Minmax; Dominance

Computing Minmax; Dominance Computing Minmax; Dominance CPSC 532A Lecture 5 Computing Minmax; Dominance CPSC 532A Lecture 5, Slide 1 Lecture Overview 1 Recap 2 Linear Programming 3 Computational Problems Involving Maxmin 4 Domination

More information

14.12 Game Theory Lecture Notes Theory of Choice

14.12 Game Theory Lecture Notes Theory of Choice 14.12 Game Theory Lecture Notes Theory of Choice Muhamet Yildiz (Lecture 2) 1 The basic theory of choice We consider a set X of alternatives. Alternatives are mutually exclusive in the sense that one cannot

More information

Reasoning Under Uncertainty: Conditional Probability

Reasoning Under Uncertainty: Conditional Probability Reasoning Under Uncertainty: Conditional Probability CPSC 322 Uncertainty 2 Textbook 6.1 Reasoning Under Uncertainty: Conditional Probability CPSC 322 Uncertainty 2, Slide 1 Lecture Overview 1 Recap 2

More information

Computing Minmax; Dominance

Computing Minmax; Dominance Computing Minmax; Dominance CPSC 532A Lecture 5 Computing Minmax; Dominance CPSC 532A Lecture 5, Slide 1 Lecture Overview 1 Recap 2 Linear Programming 3 Computational Problems Involving Maxmin 4 Domination

More information

18.600: Lecture 3 What is probability?

18.600: Lecture 3 What is probability? 18.600: Lecture 3 What is probability? Scott Sheffield MIT Outline Formalizing probability Sample space DeMorgan s laws Axioms of probability Outline Formalizing probability Sample space DeMorgan s laws

More information

CS 5522: Artificial Intelligence II

CS 5522: Artificial Intelligence II CS 5522: Artificial Intelligence II Hidden Markov Models Instructor: Alan Ritter Ohio State University [These slides were adapted from CS188 Intro to AI at UC Berkeley. All materials available at http://ai.berkeley.edu.]

More information

CS 5522: Artificial Intelligence II

CS 5522: Artificial Intelligence II CS 5522: Artificial Intelligence II Hidden Markov Models Instructor: Wei Xu Ohio State University [These slides were adapted from CS188 Intro to AI at UC Berkeley.] Pacman Sonar (P4) [Demo: Pacman Sonar

More information

3 Intertemporal Risk Aversion

3 Intertemporal Risk Aversion 3 Intertemporal Risk Aversion 3. Axiomatic Characterization This section characterizes the invariant quantity found in proposition 2 axiomatically. The axiomatic characterization below is for a decision

More information

Rational decisions. Chapter 16. Chapter 16 1

Rational decisions. Chapter 16. Chapter 16 1 Rational decisions Chapter 16 Chapter 16 1 Outline Rational preferences Utilities Money Multiattribute utilities Decision networks Value of information Chapter 16 2 Preferences An agent chooses among prizes

More information

CS 4649/7649 Robot Intelligence: Planning

CS 4649/7649 Robot Intelligence: Planning CS 4649/7649 Robot Intelligence: Planning Probability Primer Sungmoon Joo School of Interactive Computing College of Computing Georgia Institute of Technology S. Joo (sungmoon.joo@cc.gatech.edu) 1 *Slides

More information

1 Uncertainty. These notes correspond to chapter 2 of Jehle and Reny.

1 Uncertainty. These notes correspond to chapter 2 of Jehle and Reny. These notes correspond to chapter of Jehle and Reny. Uncertainty Until now we have considered our consumer s making decisions in a world with perfect certainty. However, we can extend the consumer theory

More information

Axiomatic Decision Theory

Axiomatic Decision Theory Decision Theory Decision theory is about making choices It has a normative aspect what rational people should do... and a descriptive aspect what people do do Not surprisingly, it s been studied by economists,

More information

CSE 573: Artificial Intelligence

CSE 573: Artificial Intelligence CSE 573: Artificial Intelligence Autumn 2010 Lecture 5: Expectimax Search 10/14/2008 Luke Zettlemoyer Most slides over the course adapted from either Dan Klein, Stuart Russell or Andrew Moore 1 Announcements

More information

An AI-ish view of Probability, Conditional Probability & Bayes Theorem

An AI-ish view of Probability, Conditional Probability & Bayes Theorem An AI-ish view of Probability, Conditional Probability & Bayes Theorem Review: Uncertainty and Truth Values: a mismatch Let action A t = leave for airport t minutes before flight. Will A 15 get me there

More information

10/18/2017. An AI-ish view of Probability, Conditional Probability & Bayes Theorem. Making decisions under uncertainty.

10/18/2017. An AI-ish view of Probability, Conditional Probability & Bayes Theorem. Making decisions under uncertainty. An AI-ish view of Probability, Conditional Probability & Bayes Theorem Review: Uncertainty and Truth Values: a mismatch Let action A t = leave for airport t minutes before flight. Will A 15 get me there

More information

CS188 Outline. CS 188: Artificial Intelligence. Today. Inference in Ghostbusters. Probability. We re done with Part I: Search and Planning!

CS188 Outline. CS 188: Artificial Intelligence. Today. Inference in Ghostbusters. Probability. We re done with Part I: Search and Planning! CS188 Outline We re done with art I: Search and lanning! CS 188: Artificial Intelligence robability art II: robabilistic Reasoning Diagnosis Speech recognition Tracking objects Robot mapping Genetics Error

More information

Hidden Markov Models,99,100! Markov, here I come!

Hidden Markov Models,99,100! Markov, here I come! Hidden Markov Models,99,100! Markov, here I come! 16.410/413 Principles of Autonomy and Decision-Making Pedro Santana (psantana@mit.edu) October 7 th, 2015. Based on material by Brian Williams and Emilio

More information

Two-Stage-Partitional Representation and Dynamic Consistency 1

Two-Stage-Partitional Representation and Dynamic Consistency 1 Two-Stage-Partitional Representation and Dynamic Consistency 1 Suguru Ito June 5 th, 2015 Abstract In this paper, we study an individual who faces a three-period decision problem when she accepts the partitional

More information

CSE 473: Artificial Intelligence Probability Review à Markov Models. Outline

CSE 473: Artificial Intelligence Probability Review à Markov Models. Outline CSE 473: Artificial Intelligence Probability Review à Markov Models Daniel Weld University of Washington [These slides were created by Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley.

More information

Our Status in CSE 5522

Our Status in CSE 5522 Our Status in CSE 5522 We re done with Part I Search and Planning! Part II: Probabilistic Reasoning Diagnosis Speech recognition Tracking objects Robot mapping Genetics Error correcting codes lots more!

More information

Rational decisions From predicting the effect to optimal intervention

Rational decisions From predicting the effect to optimal intervention Rational decisions From predicting the effect to optimal intervention ADAPTED FROM AIMA SLIDES Russel&Norvig:Artificial Intelligence: a modern approach AIMA: Rational decisions AIMA: Rational decisions

More information

Propositional Logic: Semantics and an Example

Propositional Logic: Semantics and an Example Propositional Logic: Semantics and an Example CPSC 322 Logic 2 Textbook 5.2 Propositional Logic: Semantics and an Example CPSC 322 Logic 2, Slide 1 Lecture Overview 1 Recap: Syntax 2 Propositional Definite

More information

CS 188: Artificial Intelligence Spring 2009

CS 188: Artificial Intelligence Spring 2009 CS 188: Artificial Intelligence Spring 2009 Lecture 21: Hidden Markov Models 4/7/2009 John DeNero UC Berkeley Slides adapted from Dan Klein Announcements Written 3 deadline extended! Posted last Friday

More information

Announcements. CS 188: Artificial Intelligence Fall Markov Models. Example: Markov Chain. Mini-Forward Algorithm. Example

Announcements. CS 188: Artificial Intelligence Fall Markov Models. Example: Markov Chain. Mini-Forward Algorithm. Example CS 88: Artificial Intelligence Fall 29 Lecture 9: Hidden Markov Models /3/29 Announcements Written 3 is up! Due on /2 (i.e. under two weeks) Project 4 up very soon! Due on /9 (i.e. a little over two weeks)

More information

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India August 2012

Game Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India August 2012 Game Theory Lecture Notes By Y. Narahari Department of Computer Science and Automation Indian Institute of Science Bangalore, India August 2012 Chapter 7: Von Neumann - Morgenstern Utilities Note: This

More information

Reasoning with Uncertainty

Reasoning with Uncertainty Reasoning with Uncertainty Representing Uncertainty Manfred Huber 2005 1 Reasoning with Uncertainty The goal of reasoning is usually to: Determine the state of the world Determine what actions to take

More information

Outline. CSE 573: Artificial Intelligence Autumn Agent. Partial Observability. Markov Decision Process (MDP) 10/31/2012

Outline. CSE 573: Artificial Intelligence Autumn Agent. Partial Observability. Markov Decision Process (MDP) 10/31/2012 CSE 573: Artificial Intelligence Autumn 2012 Reasoning about Uncertainty & Hidden Markov Models Daniel Weld Many slides adapted from Dan Klein, Stuart Russell, Andrew Moore & Luke Zettlemoyer 1 Outline

More information

Lecture 7. Intro to Statistical Decision Analysis. Example. A numerical measures of outcomes. Today: Valuing outcomes - Utility.

Lecture 7. Intro to Statistical Decision Analysis. Example. A numerical measures of outcomes. Today: Valuing outcomes - Utility. Lecture 7 Professor Scott Schmidler Duke University Today: Valuing outcomes -. Stat 340 TuTh 11:45-1:00 Fall 2016 Reference: Lectures draw heavily from Lindley Making Decisions, 2nd ed. Textbook reading:

More information

CS188 Outline. We re done with Part I: Search and Planning! Part II: Probabilistic Reasoning. Part III: Machine Learning

CS188 Outline. We re done with Part I: Search and Planning! Part II: Probabilistic Reasoning. Part III: Machine Learning CS188 Outline We re done with Part I: Search and Planning! Part II: Probabilistic Reasoning Diagnosis Speech recognition Tracking objects Robot mapping Genetics Error correcting codes lots more! Part III:

More information

Reasoning under Uncertainty: Intro to Probability

Reasoning under Uncertainty: Intro to Probability Reasoning under Uncertainty: Intro to Probability Computer Science cpsc322, Lecture 24 (Textbook Chpt 6.1, 6.1.1) Nov, 2, 2012 CPSC 322, Lecture 24 Slide 1 Tracing Datalog proofs in AIspace You can trace

More information

Rational decisions. Chapter 16 1

Rational decisions. Chapter 16 1 Rational decisions Chapter 16 Chapter 16 1 Rational preferences Utilities Money Multiattribute utilities Decision networks Value of information Outline Chapter 16 2 Preferences An agent chooses among prizes

More information

CS 343: Artificial Intelligence

CS 343: Artificial Intelligence CS 343: Artificial Intelligence Probability Prof. Scott Niekum The University of Texas at Austin [These slides based on those of Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley. All CS188

More information

Almost essential: Consumption and Uncertainty Probability Distributions MICROECONOMICS

Almost essential: Consumption and Uncertainty Probability Distributions MICROECONOMICS Prerequisites Almost essential: Consumption and Uncertainty Probability Distributions RISK MICROECONOMICS Principles and Analysis Frank Cowell July 2017 1 Risk and uncertainty In dealing with uncertainty

More information

Engineering Decisions

Engineering Decisions GSOE9210 vicj@cse.unsw.edu.au www.cse.unsw.edu.au/~gs9210 1 Preferences to values Outline 1 Preferences to values Evaluating outcomes and actions Example (Bus or train?) Would Alice prefer to catch the

More information

Recap Social Choice Fun Game Voting Paradoxes Properties. Social Choice. Lecture 11. Social Choice Lecture 11, Slide 1

Recap Social Choice Fun Game Voting Paradoxes Properties. Social Choice. Lecture 11. Social Choice Lecture 11, Slide 1 Social Choice Lecture 11 Social Choice Lecture 11, Slide 1 Lecture Overview 1 Recap 2 Social Choice 3 Fun Game 4 Voting Paradoxes 5 Properties Social Choice Lecture 11, Slide 2 Formal Definition Definition

More information

Homework #6 (10/18/2017)

Homework #6 (10/18/2017) Homework #6 (0/8/207). Let G be the set of compound gambles over a finite set of deterministic payoffs {a, a 2,...a n } R +. A decision maker s preference relation over compound gambles can be represented

More information

CSE 473: Artificial Intelligence

CSE 473: Artificial Intelligence CSE 473: Artificial Intelligence Hidden Markov Models Dieter Fox --- University of Washington [Most slides were created by Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley. All CS188 materials

More information

Where are we in CS 440?

Where are we in CS 440? Where are we in CS 440? Now leaving: sequential deterministic reasoning Entering: probabilistic reasoning and machine learning robability: Review of main concepts Chapter 3 Making decisions under uncertainty

More information

Markov Decision Processes

Markov Decision Processes Markov Decision Processes Noel Welsh 11 November 2010 Noel Welsh () Markov Decision Processes 11 November 2010 1 / 30 Annoucements Applicant visitor day seeks robot demonstrators for exciting half hour

More information

Announcements. CS 188: Artificial Intelligence Fall Adversarial Games. Computing Minimax Values. Evaluation Functions. Recap: Resource Limits

Announcements. CS 188: Artificial Intelligence Fall Adversarial Games. Computing Minimax Values. Evaluation Functions. Recap: Resource Limits CS 188: Artificial Intelligence Fall 2009 Lecture 7: Expectimax Search 9/17/2008 Announcements Written 1: Search and CSPs is up Project 2: Multi-agent Search is up Want a partner? Come to the front after

More information

CS 188: Artificial Intelligence Fall Announcements

CS 188: Artificial Intelligence Fall Announcements CS 188: Artificial Intelligence Fall 2009 Lecture 7: Expectimax Search 9/17/2008 Dan Klein UC Berkeley Many slides over the course adapted from either Stuart Russell or Andrew Moore 1 Announcements Written

More information

Probabilistic Robotics

Probabilistic Robotics Probabilistic Robotics Overview of probability, Representing uncertainty Propagation of uncertainty, Bayes Rule Application to Localization and Mapping Slides from Autonomous Robots (Siegwart and Nourbaksh),

More information

Probability. CS 3793/5233 Artificial Intelligence Probability 1

Probability. CS 3793/5233 Artificial Intelligence Probability 1 CS 3793/5233 Artificial Intelligence 1 Motivation Motivation Random Variables Semantics Dice Example Joint Dist. Ex. Axioms Agents don t have complete knowledge about the world. Agents need to make decisions

More information

Hidden Markov Models. AIMA Chapter 15, Sections 1 5. AIMA Chapter 15, Sections 1 5 1

Hidden Markov Models. AIMA Chapter 15, Sections 1 5. AIMA Chapter 15, Sections 1 5 1 Hidden Markov Models AIMA Chapter 15, Sections 1 5 AIMA Chapter 15, Sections 1 5 1 Consider a target tracking problem Time and uncertainty X t = set of unobservable state variables at time t e.g., Position

More information

Probability and Decision Theory

Probability and Decision Theory Probability and Decision Theory Robert Platt Northeastern University Some images and slides are used from: 1. AIMA 2. Chris Amato 3. Stacy Marsella QUANTIFYING UNCERTAINTY WITH PROBABILITIES Generally

More information

Propositional Logic: Bottom-Up Proofs

Propositional Logic: Bottom-Up Proofs Propositional Logic: Bottom-Up Proofs CPSC 322 Logic 3 Textbook 5.2 Propositional Logic: Bottom-Up Proofs CPSC 322 Logic 3, Slide 1 Lecture Overview 1 Recap 2 Bottom-Up Proofs 3 Soundness of Bottom-Up

More information

Confronting Theory with Experimental Data and vice versa. Lecture I Choice under risk. The Norwegian School of Economics Nov 7-11, 2011

Confronting Theory with Experimental Data and vice versa. Lecture I Choice under risk. The Norwegian School of Economics Nov 7-11, 2011 Confronting Theory with Experimental Data and vice versa Lecture I Choice under risk The Norwegian School of Economics Nov 7-11, 2011 Preferences Let X be some set of alternatives (consumption set). Formally,

More information

CSE 473: Artificial Intelligence

CSE 473: Artificial Intelligence CSE 473: Artificial Intelligence Probability Steve Tanimoto University of Washington [These slides were created by Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley. All CS188 materials

More information

CSE 473: Artificial Intelligence Spring 2014

CSE 473: Artificial Intelligence Spring 2014 CSE 473: Artificial Intelligence Spring 2014 Hidden Markov Models Hanna Hajishirzi Many slides adapted from Dan Weld, Pieter Abbeel, Dan Klein, Stuart Russell, Andrew Moore & Luke Zettlemoyer 1 Outline

More information

Recitation 7: Uncertainty. Xincheng Qiu

Recitation 7: Uncertainty. Xincheng Qiu Econ 701A Fall 2018 University of Pennsylvania Recitation 7: Uncertainty Xincheng Qiu (qiux@sas.upenn.edu 1 Expected Utility Remark 1. Primitives: in the basic consumer theory, a preference relation is

More information

Economics 201B Economic Theory (Spring 2017) Bargaining. Topics: the axiomatic approach (OR 15) and the strategic approach (OR 7).

Economics 201B Economic Theory (Spring 2017) Bargaining. Topics: the axiomatic approach (OR 15) and the strategic approach (OR 7). Economics 201B Economic Theory (Spring 2017) Bargaining Topics: the axiomatic approach (OR 15) and the strategic approach (OR 7). The axiomatic approach (OR 15) Nash s (1950) work is the starting point

More information

Utility and Decision Making 1

Utility and Decision Making 1 Utility and Decision Making 1 Suppose that a casino offers to keep flipping a coin until it comes up heads for the first time. If it comes up heads on the 1st flip, you win $1. If it comes up heads for

More information

Introduction to Mobile Robotics Probabilistic Robotics

Introduction to Mobile Robotics Probabilistic Robotics Introduction to Mobile Robotics Probabilistic Robotics Wolfram Burgard 1 Probabilistic Robotics Key idea: Explicit representation of uncertainty (using the calculus of probability theory) Perception Action

More information

15-381: Artificial Intelligence. Hidden Markov Models (HMMs)

15-381: Artificial Intelligence. Hidden Markov Models (HMMs) 15-381: Artificial Intelligence Hidden Markov Models (HMMs) What s wrong with Bayesian networks Bayesian networks are very useful for modeling joint distributions But they have their limitations: - Cannot

More information

Approximate Inference

Approximate Inference Approximate Inference Simulation has a name: sampling Sampling is a hot topic in machine learning, and it s really simple Basic idea: Draw N samples from a sampling distribution S Compute an approximate

More information

CSEP 573: Artificial Intelligence

CSEP 573: Artificial Intelligence CSEP 573: Artificial Intelligence Hidden Markov Models Luke Zettlemoyer Many slides over the course adapted from either Dan Klein, Stuart Russell, Andrew Moore, Ali Farhadi, or Dan Weld 1 Outline Probabilistic

More information

Course 16:198:520: Introduction To Artificial Intelligence Lecture 13. Decision Making. Abdeslam Boularias. Wednesday, December 7, 2016

Course 16:198:520: Introduction To Artificial Intelligence Lecture 13. Decision Making. Abdeslam Boularias. Wednesday, December 7, 2016 Course 16:198:520: Introduction To Artificial Intelligence Lecture 13 Decision Making Abdeslam Boularias Wednesday, December 7, 2016 1 / 45 Overview We consider probabilistic temporal models where the

More information

Bayesian Reasoning. Adapted from slides by Tim Finin and Marie desjardins.

Bayesian Reasoning. Adapted from slides by Tim Finin and Marie desjardins. Bayesian Reasoning Adapted from slides by Tim Finin and Marie desjardins. 1 Outline Probability theory Bayesian inference From the joint distribution Using independence/factoring From sources of evidence

More information

Lecture 10: Introduction to reasoning under uncertainty. Uncertainty

Lecture 10: Introduction to reasoning under uncertainty. Uncertainty Lecture 10: Introduction to reasoning under uncertainty Introduction to reasoning under uncertainty Review of probability Axioms and inference Conditional probability Probability distributions COMP-424,

More information

Discrete Probability and State Estimation

Discrete Probability and State Estimation 6.01, Fall Semester, 2007 Lecture 12 Notes 1 MASSACHVSETTS INSTITVTE OF TECHNOLOGY Department of Electrical Engineering and Computer Science 6.01 Introduction to EECS I Fall Semester, 2007 Lecture 12 Notes

More information

Reinforcement Learning Wrap-up

Reinforcement Learning Wrap-up Reinforcement Learning Wrap-up Slides courtesy of Dan Klein and Pieter Abbeel University of California, Berkeley [These slides were created by Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley.

More information

Hidden Markov models 1

Hidden Markov models 1 Hidden Markov models 1 Outline Time and uncertainty Markov process Hidden Markov models Inference: filtering, prediction, smoothing Most likely explanation: Viterbi 2 Time and uncertainty The world changes;

More information

Intro to Probability. Andrei Barbu

Intro to Probability. Andrei Barbu Intro to Probability Andrei Barbu Some problems Some problems A means to capture uncertainty Some problems A means to capture uncertainty You have data from two sources, are they different? Some problems

More information

Our Status. We re done with Part I Search and Planning!

Our Status. We re done with Part I Search and Planning! Probability [These slides were created by Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley. All CS188 materials are available at http://ai.berkeley.edu.] Our Status We re done with Part

More information

Uncertainty. Michael Peters December 27, 2013

Uncertainty. Michael Peters December 27, 2013 Uncertainty Michael Peters December 27, 20 Lotteries In many problems in economics, people are forced to make decisions without knowing exactly what the consequences will be. For example, when you buy

More information

Preliminary Statistics Lecture 2: Probability Theory (Outline) prelimsoas.webs.com

Preliminary Statistics Lecture 2: Probability Theory (Outline) prelimsoas.webs.com 1 School of Oriental and African Studies September 2015 Department of Economics Preliminary Statistics Lecture 2: Probability Theory (Outline) prelimsoas.webs.com Gujarati D. Basic Econometrics, Appendix

More information

Expected Utility Framework

Expected Utility Framework Expected Utility Framework Preferences We want to examine the behavior of an individual, called a player, who must choose from among a set of outcomes. Let X be the (finite) set of outcomes with common

More information

Intertemporal Risk Aversion, Stationarity, and Discounting

Intertemporal Risk Aversion, Stationarity, and Discounting Traeger, CES ifo 10 p. 1 Intertemporal Risk Aversion, Stationarity, and Discounting Christian Traeger Department of Agricultural & Resource Economics, UC Berkeley Introduce a more general preference representation

More information

Lecture 3: ASR: HMMs, Forward, Viterbi

Lecture 3: ASR: HMMs, Forward, Viterbi Original slides by Dan Jurafsky CS 224S / LINGUIST 285 Spoken Language Processing Andrew Maas Stanford University Spring 2017 Lecture 3: ASR: HMMs, Forward, Viterbi Fun informative read on phonetics The

More information

CS 188: Artificial Intelligence Fall 2011

CS 188: Artificial Intelligence Fall 2011 CS 188: Artificial Intelligence Fall 2011 Lecture 12: Probability 10/4/2011 Dan Klein UC Berkeley 1 Today Probability Random Variables Joint and Marginal Distributions Conditional Distribution Product

More information

Probabilistic Robotics. Slides from Autonomous Robots (Siegwart and Nourbaksh), Chapter 5 Probabilistic Robotics (S. Thurn et al.

Probabilistic Robotics. Slides from Autonomous Robots (Siegwart and Nourbaksh), Chapter 5 Probabilistic Robotics (S. Thurn et al. robabilistic Robotics Slides from Autonomous Robots Siegwart and Nourbaksh Chapter 5 robabilistic Robotics S. Thurn et al. Today Overview of probability Representing uncertainty ropagation of uncertainty

More information

Computer Science CPSC 322. Lecture 18 Marginalization, Conditioning

Computer Science CPSC 322. Lecture 18 Marginalization, Conditioning Computer Science CPSC 322 Lecture 18 Marginalization, Conditioning Lecture Overview Recap Lecture 17 Joint Probability Distribution, Marginalization Conditioning Inference by Enumeration Bayes Rule, Chain

More information

CS 188: Artificial Intelligence. Our Status in CS188

CS 188: Artificial Intelligence. Our Status in CS188 CS 188: Artificial Intelligence Probability Pieter Abbeel UC Berkeley Many slides adapted from Dan Klein. 1 Our Status in CS188 We re done with Part I Search and Planning! Part II: Probabilistic Reasoning

More information

Artificial Intelligence Programming Probability

Artificial Intelligence Programming Probability Artificial Intelligence Programming Probability Chris Brooks Department of Computer Science University of San Francisco Department of Computer Science University of San Francisco p.1/?? 13-0: Uncertainty

More information