Decision Theory Intro: Preferences and Utility
|
|
- Horatio Hutchinson
- 5 years ago
- Views:
Transcription
1 Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29 March 22, 2006 Textbook 9.5 Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 1
2 Lecture Overview Recap Decision Theory Intro Preferences Utility Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 2
3 Markov chain A Markov chain is a special sort of belief network: S 0 S 1 S 2 S 3 S 4 Thus P (S t+1 S 0,..., S t ) = P (S t+1 S t ). The past is independent of the future given the present. A stationary Markov chain is when for all t > 0, t > 0, P (S t+1 S t ) = P (S t +1 S t ). We specify P (S0 ) and P (S t+1 S t ). Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 3
4 Hidden Markov Model A Hidden Markov Model (HMM) starts with a Markov chain, and adds a noisy observation about the state at each time step: S 0 S 1 S 2 S 3 S 4 O 0 O 1 O 2 O 3 O 4 P (S 0 ) specifies initial conditions P (S t+1 S t ) specifies the dynamics P (O t S t ) specifies the sensor model Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 4
5 Lecture Overview Recap Decision Theory Intro Preferences Utility Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 5
6 Decisions Under Uncertainty In the first part of the course we focused on decision making in domains where the environment was understood with certainty Search/CSPs: single decisions Planning: sequential decisions In uncertain domains, we ve so far only considered how to represent and update beliefs What if an agent has to make decisions in a domain that involves uncertainty? this is likely: one of the main reasons to represent the world probabilistically is to be able to use these beliefs as the basis for making decisions Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 6
7 Decisions Under Uncertainty An agent s decision will depend on: 1. what actions are available 2. what beliefs the agent has note: this replaces state from the deterministic setting 3. the agent s goals We ve spoken quite a lot about (1) and (2). today let s consider (3) we ll move from all-or-nothing goals to a richer notion: rating how happy the agent is in different situations Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 7
8 Lecture Overview Recap Decision Theory Intro Preferences Utility Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 8
9 Preferences Actions result in outcomes Agents have preferences over outcomes A rational agent will take the action that leads to the outcome which he most prefers Sometimes agents don t know the outcomes of the actions, but they still need to compare actions Agents have to act (doing nothing is (often) an action). Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 9
10 Preferences Over Outcomes If o 1 and o 2 are outcomes o 1 o 2 means o 1 is at least as desirable as o 2. read this as the agent weakly prefers o 1 to o 2 o 1 o 2 means o 1 o 2 and o 2 o 1. read this as the agent is indifferent between o 1 and o 2. o 1 o 2 means o 1 o 2 and o 2 o 1 read this as the agent strictly prefers o1 to o 2 Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 10
11 Lotteries An agent may not know the outcomes of his actions, but may instead only have a probability distribution over the outcomes. A lottery is a probability distribution over outcomes. It is written [p 1 : o 1, p 2 : o 2,..., p k : o k ] where the o i are outcomes and p i > 0 such that p i = 1 i The lottery specifies that outcome o i occurs with probability p i. We will consider lotteries to be outcomes. Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 11
12 Our Goal We want to reason about preferences mathematically To do this, we must give some rules that allow us to allow us to relate and transform expressions involving the symbols, and, as well as lotteries. Just as we did with probabilities, we will axiomatize preferences. These rules will allow us to derive consequences of preference statements In the end, one has to either accept these consequences or reject one of the axioms Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 12
13 Preference Axioms Completeness: A preference relationship must be defined between every pair of outcomes: o 1 o 2 o 1 o 2 or o 2 o 1 Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 13
14 Preference Axioms Transitivity: Preferences must be transitive: if o 1 o 2 and o 2 o 3 then o 1 o 3 This makes good sense: otherwise o 1 o 2 and o 2 o 3 and o 3 o 1. An agent should be prepared to pay some amount to swap between an outcome they prefer less and an outcome they prefer more Intransitive preferences mean we can construct a money pump! Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 14
15 Preference Axioms Monotonicity: An agent prefers a larger chance of getting a better outcome than a smaller chance: If o 1 o 2 and p > q then [p : o 1, 1 p : o 2 ] [q : o 1, 1 q : o 2 ] Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 15
16 Consequence of axioms Suppose o 1 o 2 and o 2 o 3. Consider whether the agent would prefer o 2 the lottery [p : o1, 1 p : o 3 ] for different values of p [0, 1]. You can plot which one is preferred as a function of p: o 2 - lottery Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 16
17 Preference Axioms Continuity: Suppose o 1 o 2 and o 2 o 3, then there exists a p [0, 1] such that o 2 [p : o 1, 1 p : o 3 ] Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 17
18 Preference Axioms Decomposability: ( no fun in gambling ). An agent is indifferent between lotteries that have same probabilities and outcomes. This includes lotteries over lotteries. For example: [p : o 1, 1 p : [q : o 2, 1 q : o 3 ]] [p : o 1, (1 p)q : o 2, (1 p)(1 q) : o 3 ] Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 18
19 Preference Axioms Substitutivity: if o 1 o 2 then the agent is indifferent between lotteries that only differ by o 1 and o 2 : [p : o 1, 1 p : o 3 ] [p : o 2, 1 p : o 3 ] Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 19
20 Lecture Overview Recap Decision Theory Intro Preferences Utility Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 20
21 What we would like We would like a measure of preference that can be combined with probabilities. So that value([p : o 1, 1 p : o 2 ]) = p value(o 1 ) + (1 p) value(o 2 ) Can we use money as this measure of preference? Would you you prefer $1, 000, 000 or [0.5 : $0, 0.5 : $2, 000, 000]? Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 21
22 What we would like We would like a measure of preference that can be combined with probabilities. So that value([p : o 1, 1 p : o 2 ]) = p value(o 1 ) + (1 p) value(o 2 ) Can we use money as this measure of preference? Would you you prefer $1, 000, 000 or [0.5 : $0, 0.5 : $2, 000, 000]? Money is not going to work. Let s invent an abstract concept called utility. Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 21
23 Utility as a function of money 1 Utility Risk averse Risk neutral Risk seeking 0 $0 $2,000,000 Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 22
24 Theorem Is it possible that preferences are too complex and muti-faceted to be represented by single numbers? If preferences satisfy the preceding axioms, then preferences can be measured by a function such that utility : outcomes [0, 1] 1. o 1 o 2 if and only if utility(o 1 ) utility(o2). 2. Utilities are linear with probabilities: utility([p 1 : o 1, p 2 : o 2,..., p k : o k ]) k = p i utility(o i ) i=1 Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 23
25 Proof Part 1: o 1 o 2 if and only if utility(o 1 ) utility(o2). If all outcomes are equally preferred, set utility(o i ) = 0 for all outcomes o i. Otherwise, suppose the best outcome is best and the worst outcome is worst. For any outcome o i, define utility(o i ) to be the number u i such that o i [u i : best, 1 u i : worst] This exists by the Continuity property. Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 24
26 Proof (cont.) Part 1: o 1 o 2 if and only if utility(o 1 ) utility(o2). Suppose o 1 o 2 and utility(o i ) = u i, then by Substitutivity, [u 1 : best, 1 u 1 : worst] [u 2 : best, 1 u 2 : worst] Which, by completeness and monotonicity implies u 1 u 2. Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 25
27 Proof (cont.) Part 2: utility([p 1 : o 1,..., p k : o k ]) = k i=1 p i utility(o i ) Suppose p = utility([p 1 : o 1, p 2 : o 2,..., p k : o k ]). Suppose utility(o i ) = u i. We know: o i [u i : best, 1 u i : worst] By substitutivity, we can replace each o i by [u i : best, 1 u i : worst], so p = utility( [ p 1 : [u 1 : best, 1 u 1 : worst]... p k : [u k : best, 1 u k : worst]]) Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 26
28 Proof (cont.) Part 2: utility([p 1 : o 1,..., p k : o k ]) = k i=1 p i utility(o i ) By decomposability, this is equivalent to: p = utility( [ p 1 u p k u k : best, Thus, by definition of utility, p 1 (1 u 1 ) + + p k (1 u k ) : worst]]) p = p 1 u p k u k Decision Theory Intro: Preferences and Utility CPSC 322 Lecture 29, Slide 27
Reasoning Under Uncertainty: Introduction to Probability
Reasoning Under Uncertainty: Introduction to Probability CPSC 322 Uncertainty 1 Textbook 6.1 Reasoning Under Uncertainty: Introduction to Probability CPSC 322 Uncertainty 1, Slide 1 Lecture Overview 1
More informationReasoning Under Uncertainty: Introduction to Probability
Reasoning Under Uncertainty: Introduction to Probability CPSC 322 Lecture 23 March 12, 2007 Textbook 9 Reasoning Under Uncertainty: Introduction to Probability CPSC 322 Lecture 23, Slide 1 Lecture Overview
More informationDecision Theory: Markov Decision Processes
Decision Theory: Markov Decision Processes CPSC 322 Lecture 33 March 31, 2006 Textbook 12.5 Decision Theory: Markov Decision Processes CPSC 322 Lecture 33, Slide 1 Lecture Overview Recap Rewards and Policies
More informationEvaluation for Pacman. CS 188: Artificial Intelligence Fall Iterative Deepening. α-β Pruning Example. α-β Pruning Pseudocode.
CS 188: Artificial Intelligence Fall 2008 Evaluation for Pacman Lecture 7: Expectimax Search 9/18/2008 [DEMO: thrashing, smart ghosts] Dan Klein UC Berkeley Many slides over the course adapted from either
More informationCS 188: Artificial Intelligence Fall 2008
CS 188: Artificial Intelligence Fall 2008 Lecture 7: Expectimax Search 9/18/2008 Dan Klein UC Berkeley Many slides over the course adapted from either Stuart Russell or Andrew Moore 1 1 Evaluation for
More informationIntroduction to Artificial Intelligence (AI)
Introduction to Artificial Intelligence (AI) Computer Science cpsc502, Lecture 9 Oct, 11, 2011 Slide credit Approx. Inference : S. Thrun, P, Norvig, D. Klein CPSC 502, Lecture 9 Slide 1 Today Oct 11 Bayesian
More informationLecture 14: Introduction to Decision Making
Lecture 14: Introduction to Decision Making Preferences Utility functions Maximizing exected utility Value of information Actions and consequences So far, we have focused on ways of modeling a stochastic,
More informationIntroduction to Artificial Intelligence (AI)
Introduction to Artificial Intelligence (AI) Computer Science cpsc502, Lecture 10 Oct, 13, 2011 CPSC 502, Lecture 10 Slide 1 Today Oct 13 Inference in HMMs More on Robot Localization CPSC 502, Lecture
More informationDecision Theory: Q-Learning
Decision Theory: Q-Learning CPSC 322 Decision Theory 5 Textbook 12.5 Decision Theory: Q-Learning CPSC 322 Decision Theory 5, Slide 1 Lecture Overview 1 Recap 2 Asynchronous Value Iteration 3 Q-Learning
More informationReasoning Under Uncertainty: Bayesian networks intro
Reasoning Under Uncertainty: Bayesian networks intro Alan Mackworth UBC CS 322 Uncertainty 4 March 18, 2013 Textbook 6.3, 6.3.1, 6.5, 6.5.1, 6.5.2 Lecture Overview Recap: marginal and conditional independence
More informationReasoning under Uncertainty: Intro to Probability
Reasoning under Uncertainty: Intro to Probability Computer Science cpsc322, Lecture 24 (Textbook Chpt 6.1, 6.1.1) March, 15, 2010 CPSC 322, Lecture 24 Slide 1 To complete your Learning about Logics Review
More informationLast update: April 15, Rational decisions. CMSC 421: Chapter 16. CMSC 421: Chapter 16 1
Last update: April 15, 2010 Rational decisions CMSC 421: Chapter 16 CMSC 421: Chapter 16 1 Outline Rational preferences Utilities Money Multiattribute utilities Decision networks Value of information CMSC
More informationComputer Science CPSC 322. Lecture 23 Planning Under Uncertainty and Decision Networks
Computer Science CPSC 322 Lecture 23 Planning Under Uncertainty and Decision Networks 1 Announcements Final exam Mon, Dec. 18, 12noon Same general format as midterm Part short questions, part longer problems
More informationProbability and Time: Hidden Markov Models (HMMs)
Probability and Time: Hidden Markov Models (HMMs) Computer Science cpsc322, Lecture 32 (Textbook Chpt 6.5.2) Nov, 25, 2013 CPSC 322, Lecture 32 Slide 1 Lecture Overview Recap Markov Models Markov Chain
More informationThis corresponds to a within-subject experiment: see same subject make choices from different menus.
Testing Revealed Preference Theory, I: Methodology The revealed preference theory developed last time applied to a single agent. This corresponds to a within-subject experiment: see same subject make choices
More informationIntelligent Systems (AI-2)
Intelligent Systems (AI-2) Computer Science cpsc422, Lecture 11 Oct, 3, 2016 CPSC 422, Lecture 11 Slide 1 422 big picture: Where are we? Query Planning Deterministic Logics First Order Logics Ontologies
More informationCS 798: Multiagent Systems
CS 798: Multiagent Systems and Utility Kate Larson Cheriton School of Computer Science University of Waterloo January 6, 2010 Outline 1 Self-Interested Agents 2 3 4 5 Self-Interested Agents We are interested
More informationRational preferences. Rational decisions. Outline. Rational preferences contd. Maximizing expected utility. Preferences
Rational preferences Rational decisions hapter 16 Idea: preferences of a rational agent must obey constraints. Rational preferences behavior describable as maximization of expected utility onstraints:
More information1 Uncertainty and Insurance
Uncertainty and Insurance Reading: Some fundamental basics are in Varians intermediate micro textbook (Chapter 2). A good (advanced, but still rather accessible) treatment is in Kreps A Course in Microeconomic
More informationHidden Markov Models. Vibhav Gogate The University of Texas at Dallas
Hidden Markov Models Vibhav Gogate The University of Texas at Dallas Intro to AI (CS 4365) Many slides over the course adapted from either Dan Klein, Luke Zettlemoyer, Stuart Russell or Andrew Moore 1
More informationBasic Probability and Decisions
Basic Probability and Decisions Chris Amato Northeastern University Some images and slides are used from: Rob Platt, CS188 UC Berkeley, AIMA Uncertainty Let action A t = leave for airport t minutes before
More informationReasoning Under Uncertainty: Belief Network Inference
Reasoning Under Uncertainty: Belief Network Inference CPSC 322 Uncertainty 5 Textbook 10.4 Reasoning Under Uncertainty: Belief Network Inference CPSC 322 Uncertainty 5, Slide 1 Lecture Overview 1 Recap
More information12735: Urban Systems Modeling. Loss and decisions. instructor: Matteo Pozzi. Lec : Urban Systems Modeling Lec. 11 Loss and decisions
1735: Urban Systems Modeling Lec. 11 Loss and decisions instructor: Matteo Pozzi 1 outline introduction example of decision under uncertainty attitude toward risk principle of minimum expected loss Bayesian
More informationIntro Utility theory Utility & Loss Functions Value of Information Summary. Decision Theory. (CIV Machine Learning for Civil Engineers)
Decision Theory (CIV654 - Machine Learning for Civil Engineers) Professor: James-A. Goulet Département des génies civil, géologique et des mines Chapter 14 Goulet (219). Probabilistic Machine Learning
More informationChoice under Uncertainty
In the Name of God Sharif University of Technology Graduate School of Management and Economics Microeconomics 2 44706 (1394-95 2 nd term) Group 2 Dr. S. Farshad Fatemi Chapter 6: Choice under Uncertainty
More informationPropositional Definite Clause Logic: Syntax, Semantics and Bottom-up Proofs
Propositional Definite Clause Logic: Syntax, Semantics and Bottom-up Proofs Computer Science cpsc322, Lecture 20 (Textbook Chpt 5.1.2-5.2.2 ) June, 6, 2017 CPSC 322, Lecture 20 Slide 1 Lecture Overview
More informationComputing Minmax; Dominance
Computing Minmax; Dominance CPSC 532A Lecture 5 Computing Minmax; Dominance CPSC 532A Lecture 5, Slide 1 Lecture Overview 1 Recap 2 Linear Programming 3 Computational Problems Involving Maxmin 4 Domination
More information14.12 Game Theory Lecture Notes Theory of Choice
14.12 Game Theory Lecture Notes Theory of Choice Muhamet Yildiz (Lecture 2) 1 The basic theory of choice We consider a set X of alternatives. Alternatives are mutually exclusive in the sense that one cannot
More informationReasoning Under Uncertainty: Conditional Probability
Reasoning Under Uncertainty: Conditional Probability CPSC 322 Uncertainty 2 Textbook 6.1 Reasoning Under Uncertainty: Conditional Probability CPSC 322 Uncertainty 2, Slide 1 Lecture Overview 1 Recap 2
More informationComputing Minmax; Dominance
Computing Minmax; Dominance CPSC 532A Lecture 5 Computing Minmax; Dominance CPSC 532A Lecture 5, Slide 1 Lecture Overview 1 Recap 2 Linear Programming 3 Computational Problems Involving Maxmin 4 Domination
More information18.600: Lecture 3 What is probability?
18.600: Lecture 3 What is probability? Scott Sheffield MIT Outline Formalizing probability Sample space DeMorgan s laws Axioms of probability Outline Formalizing probability Sample space DeMorgan s laws
More informationCS 5522: Artificial Intelligence II
CS 5522: Artificial Intelligence II Hidden Markov Models Instructor: Alan Ritter Ohio State University [These slides were adapted from CS188 Intro to AI at UC Berkeley. All materials available at http://ai.berkeley.edu.]
More informationCS 5522: Artificial Intelligence II
CS 5522: Artificial Intelligence II Hidden Markov Models Instructor: Wei Xu Ohio State University [These slides were adapted from CS188 Intro to AI at UC Berkeley.] Pacman Sonar (P4) [Demo: Pacman Sonar
More information3 Intertemporal Risk Aversion
3 Intertemporal Risk Aversion 3. Axiomatic Characterization This section characterizes the invariant quantity found in proposition 2 axiomatically. The axiomatic characterization below is for a decision
More informationRational decisions. Chapter 16. Chapter 16 1
Rational decisions Chapter 16 Chapter 16 1 Outline Rational preferences Utilities Money Multiattribute utilities Decision networks Value of information Chapter 16 2 Preferences An agent chooses among prizes
More informationCS 4649/7649 Robot Intelligence: Planning
CS 4649/7649 Robot Intelligence: Planning Probability Primer Sungmoon Joo School of Interactive Computing College of Computing Georgia Institute of Technology S. Joo (sungmoon.joo@cc.gatech.edu) 1 *Slides
More information1 Uncertainty. These notes correspond to chapter 2 of Jehle and Reny.
These notes correspond to chapter of Jehle and Reny. Uncertainty Until now we have considered our consumer s making decisions in a world with perfect certainty. However, we can extend the consumer theory
More informationAxiomatic Decision Theory
Decision Theory Decision theory is about making choices It has a normative aspect what rational people should do... and a descriptive aspect what people do do Not surprisingly, it s been studied by economists,
More informationCSE 573: Artificial Intelligence
CSE 573: Artificial Intelligence Autumn 2010 Lecture 5: Expectimax Search 10/14/2008 Luke Zettlemoyer Most slides over the course adapted from either Dan Klein, Stuart Russell or Andrew Moore 1 Announcements
More informationAn AI-ish view of Probability, Conditional Probability & Bayes Theorem
An AI-ish view of Probability, Conditional Probability & Bayes Theorem Review: Uncertainty and Truth Values: a mismatch Let action A t = leave for airport t minutes before flight. Will A 15 get me there
More information10/18/2017. An AI-ish view of Probability, Conditional Probability & Bayes Theorem. Making decisions under uncertainty.
An AI-ish view of Probability, Conditional Probability & Bayes Theorem Review: Uncertainty and Truth Values: a mismatch Let action A t = leave for airport t minutes before flight. Will A 15 get me there
More informationCS188 Outline. CS 188: Artificial Intelligence. Today. Inference in Ghostbusters. Probability. We re done with Part I: Search and Planning!
CS188 Outline We re done with art I: Search and lanning! CS 188: Artificial Intelligence robability art II: robabilistic Reasoning Diagnosis Speech recognition Tracking objects Robot mapping Genetics Error
More informationHidden Markov Models,99,100! Markov, here I come!
Hidden Markov Models,99,100! Markov, here I come! 16.410/413 Principles of Autonomy and Decision-Making Pedro Santana (psantana@mit.edu) October 7 th, 2015. Based on material by Brian Williams and Emilio
More informationTwo-Stage-Partitional Representation and Dynamic Consistency 1
Two-Stage-Partitional Representation and Dynamic Consistency 1 Suguru Ito June 5 th, 2015 Abstract In this paper, we study an individual who faces a three-period decision problem when she accepts the partitional
More informationCSE 473: Artificial Intelligence Probability Review à Markov Models. Outline
CSE 473: Artificial Intelligence Probability Review à Markov Models Daniel Weld University of Washington [These slides were created by Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley.
More informationOur Status in CSE 5522
Our Status in CSE 5522 We re done with Part I Search and Planning! Part II: Probabilistic Reasoning Diagnosis Speech recognition Tracking objects Robot mapping Genetics Error correcting codes lots more!
More informationRational decisions From predicting the effect to optimal intervention
Rational decisions From predicting the effect to optimal intervention ADAPTED FROM AIMA SLIDES Russel&Norvig:Artificial Intelligence: a modern approach AIMA: Rational decisions AIMA: Rational decisions
More informationPropositional Logic: Semantics and an Example
Propositional Logic: Semantics and an Example CPSC 322 Logic 2 Textbook 5.2 Propositional Logic: Semantics and an Example CPSC 322 Logic 2, Slide 1 Lecture Overview 1 Recap: Syntax 2 Propositional Definite
More informationCS 188: Artificial Intelligence Spring 2009
CS 188: Artificial Intelligence Spring 2009 Lecture 21: Hidden Markov Models 4/7/2009 John DeNero UC Berkeley Slides adapted from Dan Klein Announcements Written 3 deadline extended! Posted last Friday
More informationAnnouncements. CS 188: Artificial Intelligence Fall Markov Models. Example: Markov Chain. Mini-Forward Algorithm. Example
CS 88: Artificial Intelligence Fall 29 Lecture 9: Hidden Markov Models /3/29 Announcements Written 3 is up! Due on /2 (i.e. under two weeks) Project 4 up very soon! Due on /9 (i.e. a little over two weeks)
More informationGame Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India August 2012
Game Theory Lecture Notes By Y. Narahari Department of Computer Science and Automation Indian Institute of Science Bangalore, India August 2012 Chapter 7: Von Neumann - Morgenstern Utilities Note: This
More informationReasoning with Uncertainty
Reasoning with Uncertainty Representing Uncertainty Manfred Huber 2005 1 Reasoning with Uncertainty The goal of reasoning is usually to: Determine the state of the world Determine what actions to take
More informationOutline. CSE 573: Artificial Intelligence Autumn Agent. Partial Observability. Markov Decision Process (MDP) 10/31/2012
CSE 573: Artificial Intelligence Autumn 2012 Reasoning about Uncertainty & Hidden Markov Models Daniel Weld Many slides adapted from Dan Klein, Stuart Russell, Andrew Moore & Luke Zettlemoyer 1 Outline
More informationLecture 7. Intro to Statistical Decision Analysis. Example. A numerical measures of outcomes. Today: Valuing outcomes - Utility.
Lecture 7 Professor Scott Schmidler Duke University Today: Valuing outcomes -. Stat 340 TuTh 11:45-1:00 Fall 2016 Reference: Lectures draw heavily from Lindley Making Decisions, 2nd ed. Textbook reading:
More informationCS188 Outline. We re done with Part I: Search and Planning! Part II: Probabilistic Reasoning. Part III: Machine Learning
CS188 Outline We re done with Part I: Search and Planning! Part II: Probabilistic Reasoning Diagnosis Speech recognition Tracking objects Robot mapping Genetics Error correcting codes lots more! Part III:
More informationReasoning under Uncertainty: Intro to Probability
Reasoning under Uncertainty: Intro to Probability Computer Science cpsc322, Lecture 24 (Textbook Chpt 6.1, 6.1.1) Nov, 2, 2012 CPSC 322, Lecture 24 Slide 1 Tracing Datalog proofs in AIspace You can trace
More informationRational decisions. Chapter 16 1
Rational decisions Chapter 16 Chapter 16 1 Rational preferences Utilities Money Multiattribute utilities Decision networks Value of information Outline Chapter 16 2 Preferences An agent chooses among prizes
More informationCS 343: Artificial Intelligence
CS 343: Artificial Intelligence Probability Prof. Scott Niekum The University of Texas at Austin [These slides based on those of Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley. All CS188
More informationAlmost essential: Consumption and Uncertainty Probability Distributions MICROECONOMICS
Prerequisites Almost essential: Consumption and Uncertainty Probability Distributions RISK MICROECONOMICS Principles and Analysis Frank Cowell July 2017 1 Risk and uncertainty In dealing with uncertainty
More informationEngineering Decisions
GSOE9210 vicj@cse.unsw.edu.au www.cse.unsw.edu.au/~gs9210 1 Preferences to values Outline 1 Preferences to values Evaluating outcomes and actions Example (Bus or train?) Would Alice prefer to catch the
More informationRecap Social Choice Fun Game Voting Paradoxes Properties. Social Choice. Lecture 11. Social Choice Lecture 11, Slide 1
Social Choice Lecture 11 Social Choice Lecture 11, Slide 1 Lecture Overview 1 Recap 2 Social Choice 3 Fun Game 4 Voting Paradoxes 5 Properties Social Choice Lecture 11, Slide 2 Formal Definition Definition
More informationHomework #6 (10/18/2017)
Homework #6 (0/8/207). Let G be the set of compound gambles over a finite set of deterministic payoffs {a, a 2,...a n } R +. A decision maker s preference relation over compound gambles can be represented
More informationCSE 473: Artificial Intelligence
CSE 473: Artificial Intelligence Hidden Markov Models Dieter Fox --- University of Washington [Most slides were created by Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley. All CS188 materials
More informationWhere are we in CS 440?
Where are we in CS 440? Now leaving: sequential deterministic reasoning Entering: probabilistic reasoning and machine learning robability: Review of main concepts Chapter 3 Making decisions under uncertainty
More informationMarkov Decision Processes
Markov Decision Processes Noel Welsh 11 November 2010 Noel Welsh () Markov Decision Processes 11 November 2010 1 / 30 Annoucements Applicant visitor day seeks robot demonstrators for exciting half hour
More informationAnnouncements. CS 188: Artificial Intelligence Fall Adversarial Games. Computing Minimax Values. Evaluation Functions. Recap: Resource Limits
CS 188: Artificial Intelligence Fall 2009 Lecture 7: Expectimax Search 9/17/2008 Announcements Written 1: Search and CSPs is up Project 2: Multi-agent Search is up Want a partner? Come to the front after
More informationCS 188: Artificial Intelligence Fall Announcements
CS 188: Artificial Intelligence Fall 2009 Lecture 7: Expectimax Search 9/17/2008 Dan Klein UC Berkeley Many slides over the course adapted from either Stuart Russell or Andrew Moore 1 Announcements Written
More informationProbabilistic Robotics
Probabilistic Robotics Overview of probability, Representing uncertainty Propagation of uncertainty, Bayes Rule Application to Localization and Mapping Slides from Autonomous Robots (Siegwart and Nourbaksh),
More informationProbability. CS 3793/5233 Artificial Intelligence Probability 1
CS 3793/5233 Artificial Intelligence 1 Motivation Motivation Random Variables Semantics Dice Example Joint Dist. Ex. Axioms Agents don t have complete knowledge about the world. Agents need to make decisions
More informationHidden Markov Models. AIMA Chapter 15, Sections 1 5. AIMA Chapter 15, Sections 1 5 1
Hidden Markov Models AIMA Chapter 15, Sections 1 5 AIMA Chapter 15, Sections 1 5 1 Consider a target tracking problem Time and uncertainty X t = set of unobservable state variables at time t e.g., Position
More informationProbability and Decision Theory
Probability and Decision Theory Robert Platt Northeastern University Some images and slides are used from: 1. AIMA 2. Chris Amato 3. Stacy Marsella QUANTIFYING UNCERTAINTY WITH PROBABILITIES Generally
More informationPropositional Logic: Bottom-Up Proofs
Propositional Logic: Bottom-Up Proofs CPSC 322 Logic 3 Textbook 5.2 Propositional Logic: Bottom-Up Proofs CPSC 322 Logic 3, Slide 1 Lecture Overview 1 Recap 2 Bottom-Up Proofs 3 Soundness of Bottom-Up
More informationConfronting Theory with Experimental Data and vice versa. Lecture I Choice under risk. The Norwegian School of Economics Nov 7-11, 2011
Confronting Theory with Experimental Data and vice versa Lecture I Choice under risk The Norwegian School of Economics Nov 7-11, 2011 Preferences Let X be some set of alternatives (consumption set). Formally,
More informationCSE 473: Artificial Intelligence
CSE 473: Artificial Intelligence Probability Steve Tanimoto University of Washington [These slides were created by Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley. All CS188 materials
More informationCSE 473: Artificial Intelligence Spring 2014
CSE 473: Artificial Intelligence Spring 2014 Hidden Markov Models Hanna Hajishirzi Many slides adapted from Dan Weld, Pieter Abbeel, Dan Klein, Stuart Russell, Andrew Moore & Luke Zettlemoyer 1 Outline
More informationRecitation 7: Uncertainty. Xincheng Qiu
Econ 701A Fall 2018 University of Pennsylvania Recitation 7: Uncertainty Xincheng Qiu (qiux@sas.upenn.edu 1 Expected Utility Remark 1. Primitives: in the basic consumer theory, a preference relation is
More informationEconomics 201B Economic Theory (Spring 2017) Bargaining. Topics: the axiomatic approach (OR 15) and the strategic approach (OR 7).
Economics 201B Economic Theory (Spring 2017) Bargaining Topics: the axiomatic approach (OR 15) and the strategic approach (OR 7). The axiomatic approach (OR 15) Nash s (1950) work is the starting point
More informationUtility and Decision Making 1
Utility and Decision Making 1 Suppose that a casino offers to keep flipping a coin until it comes up heads for the first time. If it comes up heads on the 1st flip, you win $1. If it comes up heads for
More informationIntroduction to Mobile Robotics Probabilistic Robotics
Introduction to Mobile Robotics Probabilistic Robotics Wolfram Burgard 1 Probabilistic Robotics Key idea: Explicit representation of uncertainty (using the calculus of probability theory) Perception Action
More information15-381: Artificial Intelligence. Hidden Markov Models (HMMs)
15-381: Artificial Intelligence Hidden Markov Models (HMMs) What s wrong with Bayesian networks Bayesian networks are very useful for modeling joint distributions But they have their limitations: - Cannot
More informationApproximate Inference
Approximate Inference Simulation has a name: sampling Sampling is a hot topic in machine learning, and it s really simple Basic idea: Draw N samples from a sampling distribution S Compute an approximate
More informationCSEP 573: Artificial Intelligence
CSEP 573: Artificial Intelligence Hidden Markov Models Luke Zettlemoyer Many slides over the course adapted from either Dan Klein, Stuart Russell, Andrew Moore, Ali Farhadi, or Dan Weld 1 Outline Probabilistic
More informationCourse 16:198:520: Introduction To Artificial Intelligence Lecture 13. Decision Making. Abdeslam Boularias. Wednesday, December 7, 2016
Course 16:198:520: Introduction To Artificial Intelligence Lecture 13 Decision Making Abdeslam Boularias Wednesday, December 7, 2016 1 / 45 Overview We consider probabilistic temporal models where the
More informationBayesian Reasoning. Adapted from slides by Tim Finin and Marie desjardins.
Bayesian Reasoning Adapted from slides by Tim Finin and Marie desjardins. 1 Outline Probability theory Bayesian inference From the joint distribution Using independence/factoring From sources of evidence
More informationLecture 10: Introduction to reasoning under uncertainty. Uncertainty
Lecture 10: Introduction to reasoning under uncertainty Introduction to reasoning under uncertainty Review of probability Axioms and inference Conditional probability Probability distributions COMP-424,
More informationDiscrete Probability and State Estimation
6.01, Fall Semester, 2007 Lecture 12 Notes 1 MASSACHVSETTS INSTITVTE OF TECHNOLOGY Department of Electrical Engineering and Computer Science 6.01 Introduction to EECS I Fall Semester, 2007 Lecture 12 Notes
More informationReinforcement Learning Wrap-up
Reinforcement Learning Wrap-up Slides courtesy of Dan Klein and Pieter Abbeel University of California, Berkeley [These slides were created by Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley.
More informationHidden Markov models 1
Hidden Markov models 1 Outline Time and uncertainty Markov process Hidden Markov models Inference: filtering, prediction, smoothing Most likely explanation: Viterbi 2 Time and uncertainty The world changes;
More informationIntro to Probability. Andrei Barbu
Intro to Probability Andrei Barbu Some problems Some problems A means to capture uncertainty Some problems A means to capture uncertainty You have data from two sources, are they different? Some problems
More informationOur Status. We re done with Part I Search and Planning!
Probability [These slides were created by Dan Klein and Pieter Abbeel for CS188 Intro to AI at UC Berkeley. All CS188 materials are available at http://ai.berkeley.edu.] Our Status We re done with Part
More informationUncertainty. Michael Peters December 27, 2013
Uncertainty Michael Peters December 27, 20 Lotteries In many problems in economics, people are forced to make decisions without knowing exactly what the consequences will be. For example, when you buy
More informationPreliminary Statistics Lecture 2: Probability Theory (Outline) prelimsoas.webs.com
1 School of Oriental and African Studies September 2015 Department of Economics Preliminary Statistics Lecture 2: Probability Theory (Outline) prelimsoas.webs.com Gujarati D. Basic Econometrics, Appendix
More informationExpected Utility Framework
Expected Utility Framework Preferences We want to examine the behavior of an individual, called a player, who must choose from among a set of outcomes. Let X be the (finite) set of outcomes with common
More informationIntertemporal Risk Aversion, Stationarity, and Discounting
Traeger, CES ifo 10 p. 1 Intertemporal Risk Aversion, Stationarity, and Discounting Christian Traeger Department of Agricultural & Resource Economics, UC Berkeley Introduce a more general preference representation
More informationLecture 3: ASR: HMMs, Forward, Viterbi
Original slides by Dan Jurafsky CS 224S / LINGUIST 285 Spoken Language Processing Andrew Maas Stanford University Spring 2017 Lecture 3: ASR: HMMs, Forward, Viterbi Fun informative read on phonetics The
More informationCS 188: Artificial Intelligence Fall 2011
CS 188: Artificial Intelligence Fall 2011 Lecture 12: Probability 10/4/2011 Dan Klein UC Berkeley 1 Today Probability Random Variables Joint and Marginal Distributions Conditional Distribution Product
More informationProbabilistic Robotics. Slides from Autonomous Robots (Siegwart and Nourbaksh), Chapter 5 Probabilistic Robotics (S. Thurn et al.
robabilistic Robotics Slides from Autonomous Robots Siegwart and Nourbaksh Chapter 5 robabilistic Robotics S. Thurn et al. Today Overview of probability Representing uncertainty ropagation of uncertainty
More informationComputer Science CPSC 322. Lecture 18 Marginalization, Conditioning
Computer Science CPSC 322 Lecture 18 Marginalization, Conditioning Lecture Overview Recap Lecture 17 Joint Probability Distribution, Marginalization Conditioning Inference by Enumeration Bayes Rule, Chain
More informationCS 188: Artificial Intelligence. Our Status in CS188
CS 188: Artificial Intelligence Probability Pieter Abbeel UC Berkeley Many slides adapted from Dan Klein. 1 Our Status in CS188 We re done with Part I Search and Planning! Part II: Probabilistic Reasoning
More informationArtificial Intelligence Programming Probability
Artificial Intelligence Programming Probability Chris Brooks Department of Computer Science University of San Francisco Department of Computer Science University of San Francisco p.1/?? 13-0: Uncertainty
More information