A Partition-Based First-Order Probabilistic Logic to Represent Interactive Beliefs
|
|
- Blaze Lionel Oliver
- 5 years ago
- Views:
Transcription
1 A Partition-Based First-Order Probabilistic Logic to Represent Interactive Beliefs Alessandro Panella and Piotr Gmytrasiewicz Fifth International Conference on Scalable Uncertainty Management Dayton, OH October 10, 2011 Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
2 Outline 1 Quick Look 2 Introduction The Problem Related Work 3 Proposed Formalization 0-th Level Beliefs 1st Level Beliefs n-th Level Beliefs 4 Conclusion 5 Bibliography Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
3 Contribution Quick Look Formalization of a theoretical framework that allows to compactly represent interactive beliefs Probability theory (First-Order) Logic Maximum Entropy Main idea: recursive partitioning of the belief simplices... (a) (b) (c) (d) Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
4 Introduction The Problem Stochastic Planning The need for compact representations: Use of first-order logic: Describe sets of states Capture regularities Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
5 Introduction The Problem Stochastic Planning The need for compact representations: Use of first-order logic: Describe sets of states Capture regularities Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
6 Introduction The Problem Stochastic Planning An Example n n grid world Actions: UP, DOWN, LEFT, RIGHT Probabilistic transition function: For every location, P(succeed) =.9 GO RIGHT GO DOWN Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
7 Introduction The Problem Stochastic Planning An Example n n grid world Actions: UP, DOWN, LEFT, RIGHT Probabilistic transition function: For every location, P(succeed) =.9 GO RIGHT GO DOWN Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
8 Introduction The Problem Stochastic Planning An Example n n grid world Actions: UP, DOWN, LEFT, RIGHT Probabilistic transition function: For every location, P(succeed) =.9 GO RIGHT GO DOWN Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
9 Interactive Settings Introduction The Problem Representation needs even more stringent Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
10 Interactive Settings Introduction The Problem Representation needs even more stringent Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
11 Introduction The Problem Interactive Settings Representation needs even more stringent Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
12 Introduction The Problem Interactive Settings Representation needs even more stringent Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
13 Introduction The Problem (Finitely Nested) Interactive POMDPs Level-n belief b i,n (IS i,n ), where IS i,0 = S IS i,1 = S (IS j,0 ). IS i,n = S (IS j,n 1 ) Value function of (I-)POMDPs is Piecewise linear and convex. Divides the simplex into behavior-equivalent partitions. (0, 1) s 1 (1, 0) (0, 0) s 2 From Kaelbling et al. (1998) Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
14 Introduction The Problem (Finitely Nested) Interactive POMDPs Level-n belief b i,n (IS i,n ), where IS i,0 = S IS i,1 = S (IS j,0 ). IS i,n = S (IS j,n 1 ) Value function of (I-)POMDPs is Piecewise linear and convex. Divides the simplex into behavior-equivalent partitions. (0, 1) s 1 (1, 0) (0, 0) s 2 From Kaelbling et al. (1998) Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
15 Introduction Related Work Related Work First Order Probabilistic Languages: Seminal theoretical work (Nilsson, 1986; Halpern, 1989) Recent practical approaches: BLOG (Milch et al., 2005), Markov Logic (Richardson and Domingos, 2006),... Relational stochastic planning: Relational MDPs (Boutilier et al., 2001; Sanner and Boutilier, 2009) Relational POMDPs (Sanner and Kersting, 2010; Wang and Khardon, 2010) Belief hierarchies Extensive treatment in Game Theory, starting from Bayesian Games (Harsanyi, 1967; Aumann, 1999) Probabilistic modal logics (Fagin and Halpern, 1994; Shirazi and Amir, 2008) Interactive POMDPs (Gmytrasiewicz and Doshi, 2005) Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
16 Introduction Related Work Related Work First Order Probabilistic Languages: Seminal theoretical work (Nilsson, 1986; Halpern, 1989) Recent practical approaches: BLOG (Milch et al., 2005), Markov Logic (Richardson and Domingos, 2006),... Relational stochastic planning: Relational MDPs (Boutilier et al., 2001; Sanner and Boutilier, 2009) Relational POMDPs (Sanner and Kersting, 2010; Wang and Khardon, 2010) Belief hierarchies Extensive treatment in Game Theory, starting from Bayesian Games (Harsanyi, 1967; Aumann, 1999) Probabilistic modal logics (Fagin and Halpern, 1994; Shirazi and Amir, 2008) Interactive POMDPs (Gmytrasiewicz and Doshi, 2005) Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
17 Introduction Related Work Related Work First Order Probabilistic Languages: Seminal theoretical work (Nilsson, 1986; Halpern, 1989) Recent practical approaches: BLOG (Milch et al., 2005), Markov Logic (Richardson and Domingos, 2006),... Relational stochastic planning: Relational MDPs (Boutilier et al., 2001; Sanner and Boutilier, 2009) Relational POMDPs (Sanner and Kersting, 2010; Wang and Khardon, 2010) Belief hierarchies Extensive treatment in Game Theory, starting from Bayesian Games (Harsanyi, 1967; Aumann, 1999) Probabilistic modal logics (Fagin and Halpern, 1994; Shirazi and Amir, 2008) Interactive POMDPs (Gmytrasiewicz and Doshi, 2005) Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
18 Grid World Example Proposed Formalization 0-th Level Beliefs n n grid Agent i tagging a moving target j Uncertainty about target s position: predicate jpos(x, y) Auxiliary deterministic predicates: geq(x, k) x k leq(x, k) x k Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
19 Level-0 Belief Base Proposed Formalization 0-th Level Beliefs i s belief about the state of the world B i,0 = φ 1, α 1 : : φ m, α m φ 0 S ψ 0 ψ 1 ψ 2 φ 1 φ k s are arbitrary sentences in predicate logic, and α k [0, 1]; ψ s are the induced partitions (Ψ, 2 Ψ, p i,0 ) Only partial specification of distribution To obtain unique distribution: Maximum Entropy (max-ent): ( max ) p i,0 (S(ψ)) log p i,0 (S(ψ)) p i,0 ψ Ψ B Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
20 Proposed Formalization Grid World Example (cont d) 0-th Level Beliefs Assume agent interested in horizontal position of target w.r.t. center: B i,0 = x, y(jpos(x, y) leq(x, n/2 )), 0.8 x, y(jpos(x, y) geq(x, n/2 )), 0.5 S ψ 2 ψ 1 ψ 0 φ 0 φ 1 In this case, unique distribution p i,0 = (0.5, 0.3, 0.2). Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
21 Level-1 Belief Base Proposed Formalization 1st Level Beliefs i s belief about j s belief B i,1 = φ j,0 1, α1 : : φ j,0 m, α m (1) φ j,0 k is of the form P j (φ) β, {<,, =,, >}, β [0, 1]; The sentences φ j,0 k induce a partitioning on j s L-0 belief simplex; Ψ B Ψ j,0 B pi,1 ψ Ψ j,0 B Φ B Φ j,0 B S (Ψ B) i (a) (b) (c) Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
22 Proposed Formalization Grid World Example (cont d) 1st Level Beliefs Target j models agent i s beliefs about j s position B j,1 = Pi(φ0) 0.4, 0.4 P i(φ 1) > 0.5, 0.7 State of the world: S i's L0 simplex: ψ 2 ψ 2 p i(φ 0)=0.4 ψ i,0 2 ψ 1 ψ 0 ψ i,0 0 ψ i,0 1 φ 0 φ1 ψ p 0 i(φ 1)=0.5 ψ 1 φ i,0 0 φ i,0 1 Unique consistent distribution p j,1 = (0.3, 0.1, 0.6). Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
23 Level-n Belief Base Proposed Formalization n-th Level Beliefs Generalize the procedure to an arbitrary nesting level: i s beliefs about j s beliefs about i s beliefs about... B n i = φ j,n 1 1, α 1 : : φ j,n 1 m α m φ j,n 1 k is of the form P j (φ i,n 2 ) β; The core idea is that the partitions at level l 1 constitute the vertices of the simplex at level l... (a) (b) (c) (d) Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
24 Level-n Belief Base Proposed Formalization n-th Level Beliefs Generalize the procedure to an arbitrary nesting level: i s beliefs about j s beliefs about i s beliefs about... B n i = φ j,n 1 1, α 1 : : φ j,n 1 m α m φ j,n 1 k is of the form P j (φ i,n 2 ) β; The core idea is that the partitions at level l 1 constitute the vertices of the simplex at level l... (a) (b) (c) (d) Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
25 Proposed Formalization Grid World Example (cont d) n-th Level Beliefs Agent i models target j s beliefs about i s beliefs about j s position B i,2 = Pj(Pi(φ0) 0.4) < 0.4, 0.2 P j(p i(φ 1) > 0.5) < 0.7, 0.6 State of the world: i's L0 simplex: ψ 2 j's L1 simplex: S ψ i,1 2 ψ 0 ψ 1 ψ 2 pi(φ0) =0.4 ψ i,0 0 ψ i,0 2 ψ i,0 1 pi(φ i,0 0 )=0.6 ψ j,1 3 ψ j,1 0 ψ j,1 2 ψ j,1 1 φ 0 φ1 ψ pi(φ1) =0.5 0 ψ 1 ψ i,0 pi(φ i,0 0 ψ i,0 0 )=0.7 1 φ i,0 0 φ i,0 1 φ j,1 0 φ j,1 1 Max-ent optimization gives p i,2 = (0.48, 0.32, 0.08, 0.12). Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
26 Conclusion Conclusion and Future Work Contribution: Formalization of a FOPL that allows to represent interactive beliefs through recursively partitioning the belief simplices. Future Work: Practical implementation of (subset of) the proposed formalism; Extension of existing approaches to interactive beliefs semantics; E.g. BLOG, Markov Logic, Relational Probabilistic Models,... Embed the formalism in a (multi-agent) stochastic planning algorithm Piecewise linearity of value functions in (I-)POMDPs makes the proposed approach promising; Extend approaches like Sanner et al., 2010 (Sanner and Kersting, 2010). Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
27 Conclusion Conclusion and Future Work Contribution: Formalization of a FOPL that allows to represent interactive beliefs through recursively partitioning the belief simplices. Future Work: Practical implementation of (subset of) the proposed formalism; Extension of existing approaches to interactive beliefs semantics; E.g. BLOG, Markov Logic, Relational Probabilistic Models,... Embed the formalism in a (multi-agent) stochastic planning algorithm Piecewise linearity of value functions in (I-)POMDPs makes the proposed approach promising; Extend approaches like Sanner et al., 2010 (Sanner and Kersting, 2010). Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
28 Conclusion The end. Any questions? Thank you! Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
29 Bibliography Bibliography R. J. Aumann. Interactive epistemology II: Probability. International Journal of Game Theory, 28(3): , C. Boutilier, R. Reiter, and B. Price. Symbolic Dynamic Programming for First-Order MDPs. In IJCAI 01, pages Morgan Kaufmann, R. Fagin and J. Y. Halpern. Reasoning about knowledge and probability. J. ACM, 41: , March ISSN P. J. Gmytrasiewicz and P. Doshi. A framework for sequential planning in multi-agent settings. Journal of Artificial Intelligence Research, 24:24 49, J. Y. Halpern. An analysis of first-order logics of probability. In Proceedings of the 11th international joint conference on Artificial intelligence - Volume 2, pages , San Francisco, CA, USA, Morgan Kaufmann Publishers Inc. J. C. Harsanyi. Games with Incomplete Information Played by Bayesian Players, I-III. Part I. The Basic Model. Management Science, 14(3): , doi: / L. P. Kaelbling, M. L. Littman, and A. R. Cassandra. Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101:99 134, B. Milch, B. Marthi, S. J. Russell, D. Sontag, D. L. Ong, and A. Kolobov. Blog: Probabilistic models with unknown objects. In Probabilistic, Logical and Relational Learning, N. J. Nilsson. Probabilistic logic. Artif. Intell., 28:71 88, February ISSN doi: / (86) M. Richardson and P. Domingos. Markov logic networks. Mach. Learn., 62: , February ISSN doi: /s S. Sanner and C. Boutilier. Practical solution techniques for first-order MDPs. Artif. Intell., 173: , April ISSN doi: /j.artint S. Sanner and K. Kersting. Symbolic dynamic programming for first-order POMDPs. In AAAI 10, pages , A. Shirazi and E. Amir. Factored models for probabilistic modal logic. In Proceedings of the 23rd national conference on Artificial intelligence - Volume 1, pages AAAI Press, ISBN C. Wang and R. Khardon. Relational partially observable MDPs. In AAAI 10, pages , Panella and Gmytrasiewicz (CS Dept. - UIC) A FOPL for Interactive Beliefs October 10, / 18
A Partition-Based First-Order Probabilistic Logic to Represent Interactive Beliefs
A Partition-ased First-Order Probabilistic Logic to Represent nteractive eliefs Alessandro Panella and Piotr Gmytrasiewicz University of llinois at Chicago Department of Computer Science Chicago, L 60607
More informationProbabilistic Representation and Reasoning
Probabilistic Representation and Reasoning Alessandro Panella Department of Computer Science University of Illinois at Chicago May 4, 2010 Alessandro Panella (CS Dept. - UIC) Probabilistic Representation
More informationSymbolic Dynamic Programming for First-order POMDPs
Symbolic Dynamic Programming for First-order POMDPs Scott Sanner NICTA & ANU Canberra, Australia scott.sanner@nicta.com.au Kristian Kersting Fraunhofer IAIS Sankt Augustin, Germany kristian.kersting@iais.fraunhofer.de
More informationDecayed Markov Chain Monte Carlo for Interactive POMDPs
Decayed Markov Chain Monte Carlo for Interactive POMDPs Yanlin Han Piotr Gmytrasiewicz Department of Computer Science University of Illinois at Chicago Chicago, IL 60607 {yhan37,piotr}@uic.edu Abstract
More informationRelational Partially Observable MDPs
Proceedings of the Twenty-Fourth AAAI Conference on Artificial Intelligence (AAAI-10) elational Partially Observable MDPs Chenggang Wang and oni Khardon Department of Computer Science Tufts University
More informationOn Prediction and Planning in Partially Observable Markov Decision Processes with Large Observation Sets
On Prediction and Planning in Partially Observable Markov Decision Processes with Large Observation Sets Pablo Samuel Castro pcastr@cs.mcgill.ca McGill University Joint work with: Doina Precup and Prakash
More informationSymmetry Breaking for Relational Weighted Model Finding
Symmetry Breaking for Relational Weighted Model Finding Tim Kopp Parag Singla Henry Kautz WL4AI July 27, 2015 Outline Weighted logics. Symmetry-breaking in SAT. Symmetry-breaking for weighted logics. Weighted
More informationFirst-Order Open-Universe POMDPs
First-Order Open-Universe POMDPs Siddharth Srivastava and Stuart Russell and Paul Ruan and Xiang Cheng Computer Science Division University of California Berkeley, CA 94720 Abstract Open-universe probability
More informationProbabilistic and Logical Beliefs
Probabilistic and Logical Beliefs J.W. Lloyd 1 and K.S. Ng 2 1 Computer Sciences Laboratory College of Engineering and Computer Science The Australian National University jwl@cecs.anu.edu.au 2 National
More informationMarkov Games of Incomplete Information for Multi-Agent Reinforcement Learning
Interactive Decision Theory and Game Theory: Papers from the 2011 AAAI Workshop (WS-11-13) Markov Games of Incomplete Information for Multi-Agent Reinforcement Learning Liam Mac Dermed, Charles L. Isbell,
More informationALLEGRO: Belief-based Programming in Stochastic Dynamical Domains
ALLEGRO: Belief-based Programming in Stochastic Dynamical Domains Vaishak Belle Dept. of Computer Science KU Leuven Belgium vaishak@cs.kuleuven.be Hector J. Levesque Dept. of Computer Science University
More informationChanging Types. Dominik Klein Eric Pacuit. April 24, 2011
Changing Types Dominik Klein Eric Pacuit April 24, 2011 The central thesis of the epistemic program in game theory (Brandenburger, 2007) is that the basic mathematical models of a game situation should
More informationLearning in Zero-Sum Team Markov Games using Factored Value Functions
Learning in Zero-Sum Team Markov Games using Factored Value Functions Michail G. Lagoudakis Department of Computer Science Duke University Durham, NC 27708 mgl@cs.duke.edu Ronald Parr Department of Computer
More informationWidth and Complexity of Belief Tracking in Non-Deterministic Conformant and Contingent Planning
Width and Complexity of Belief Tracking in Non-Deterministic Conformant and Contingent Planning Blai Bonet Universidad Simón Bolívar Caracas, Venezuela bonet@ldc.usb.ve Hector Geffner ICREA & Universitat
More informationPOMDPs and Policy Gradients
POMDPs and Policy Gradients MLSS 2006, Canberra Douglas Aberdeen Canberra Node, RSISE Building Australian National University 15th February 2006 Outline 1 Introduction What is Reinforcement Learning? Types
More informationPartially Observable Markov Decision Processes (POMDPs)
Partially Observable Markov Decision Processes (POMDPs) Geoff Hollinger Sequential Decision Making in Robotics Spring, 2011 *Some media from Reid Simmons, Trey Smith, Tony Cassandra, Michael Littman, and
More informationOn the Progression of Situation Calculus Basic Action Theories: Resolving a 10-year-old Conjecture
On the Progression of Situation Calculus Basic Action Theories: Resolving a 10-year-old Conjecture Stavros Vassos and Hector J Levesque Department of Computer Science University of Toronto Toronto, ON,
More informationSymbolic Variable Elimination in Discrete and Continuous Graphical Models. Scott Sanner Ehsan Abbasnejad
Symbolic Variable Elimination in Discrete and Continuous Graphical Models Scott Sanner Ehsan Abbasnejad Inference for Dynamic Tracking No one previously did this inference exactly in closed-form! Exact
More informationPlanning and Acting in Partially Observable Stochastic Domains
Planning and Acting in Partially Observable Stochastic Domains Leslie Pack Kaelbling*, Michael L. Littman**, Anthony R. Cassandra*** *Computer Science Department, Brown University, Providence, RI, USA
More informationOn Markov Games Played by Bayesian and Boundedly-Rational Players
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence (AAAI-17) On Markov Games Played by Bayesian and Boundedly-Rational Players Muthukumaran Chandrasekaran THINC Lab, University
More informationExpressive Equivalence of Formalisms for Planning with Sensing
From: ICAPS-03 Proceedings. Copyright 2003, AAAI (www.aaai.org). All rights reserved. Expressive Equivalence of Formalisms for Planning with Sensing Jussi Rintanen Albert-Ludwigs-Universität Freiburg,
More informationPROBABILISTIC LOGIC. J. Webster (ed.), Wiley Encyclopedia of Electrical and Electronics Engineering Copyright c 1999 John Wiley & Sons, Inc.
J. Webster (ed.), Wiley Encyclopedia of Electrical and Electronics Engineering Copyright c 1999 John Wiley & Sons, Inc. PROBABILISTIC LOGIC A deductive argument is a claim of the form: If P 1, P 2,...,andP
More informationEfficient Maximization in Solving POMDPs
Efficient Maximization in Solving POMDPs Zhengzhu Feng Computer Science Department University of Massachusetts Amherst, MA 01003 fengzz@cs.umass.edu Shlomo Zilberstein Computer Science Department University
More informationA Probabilistic Extension of the Stable Model Semantics
A Probabilistic Extension of the Stable Model Semantics Joohyung Lee and Yi Wang School of Computing, Informatics, and Decision Systems Engineering Arizona State University, Tempe, USA {joolee, ywang485}@asu.edu
More informationMulti-agent actions under uncertainty: situation calculus, discrete time, plans and policies
Multi-agent actions under uncertainty: situation calculus, discrete time, plans and policies David Poole Department of Computer Science University of British Columbia Vancouver, B.C., Canada V6T 1Z4 poole@cs.ubc.ca
More informationSymbolic Dynamic Programming for Continuous State and Observation POMDPs
Symbolic Dynamic Programming for Continuous State and Observation POMDPs Zahra Zamani ANU & NICTA Canberra, Australia zahra.zamani@anu.edu.au Pascal Poupart U. of Waterloo Waterloo, Canada ppoupart@uwaterloo.ca
More informationData Structures for Efficient Inference and Optimization
Data Structures for Efficient Inference and Optimization in Expressive Continuous Domains Scott Sanner Ehsan Abbasnejad Zahra Zamani Karina Valdivia Delgado Leliane Nunes de Barros Cheng Fang Discrete
More informationSymbolic Dynamic Programming for Continuous State and Observation POMDPs
Symbolic Dynamic Programming for Continuous State and Observation POMDPs Zahra Zamani ANU & NICTA Canberra, Australia zahra.zamani@anu.edu.au Pascal Poupart U. of Waterloo Waterloo, Canada ppoupart@uwaterloo.ca
More informationFirst Order Decision Diagrams for Relational MDPs
To appear in IJCAI 27 First Order Decision Diagrams for Relational MDPs Chenggang Wang and Saket Joshi and Roni Khardon Department of Computer Science Tufts University 161 College Avenue Medford, MA 2155,
More informationMinimizing Communication Cost in a Distributed Bayesian Network Using a Decentralized MDP
Minimizing Communication Cost in a Distributed Bayesian Network Using a Decentralized MDP Jiaying Shen Department of Computer Science University of Massachusetts Amherst, MA 0003-460, USA jyshen@cs.umass.edu
More informationCourse 16:198:520: Introduction To Artificial Intelligence Lecture 13. Decision Making. Abdeslam Boularias. Wednesday, December 7, 2016
Course 16:198:520: Introduction To Artificial Intelligence Lecture 13 Decision Making Abdeslam Boularias Wednesday, December 7, 2016 1 / 45 Overview We consider probabilistic temporal models where the
More informationProbabilistic and Logical Beliefs
Probabilistic and Logical Beliefs J.W. Lloyd 1 and K.S. Ng 2 1 Computer Sciences Laboratory College of Engineering and Computer Science The Australian National University jwl@mail.rsise.anu.edu.au 2 Symbolic
More informationIdentity Uncertainty
Identity Uncertainty Stuart Russell Computer Science Division University of California, Berkeley, CA 94720, USA russell@cs.berkeley.edu Abstract We are often uncertain about the identity of objects. This
More informationInteractive POMDP Lite: Towards Practical Planning to Predict and Exploit Intentions for Interacting with Self-Interested Agents
Proceedings of the Twenty-Third International Joint Conference on Artificial Intelligence Interactive POMDP Lite: Towards Practical Planning to Predict and Exploit Intentions for Interacting with Self-Interested
More informationGrundlagen der Künstlichen Intelligenz
Grundlagen der Künstlichen Intelligenz Formal models of interaction Daniel Hennes 27.11.2017 (WS 2017/18) University Stuttgart - IPVS - Machine Learning & Robotics 1 Today Taxonomy of domains Models of
More informationProbabilistic Models for Agents Beliefs and Decisions
Probabilistic Models for Agents Beliefs and Decisions Brian Milch Computer Science Department Stanford University Stanford, CA 94305-9010 milch@cs.stanford.edu Daphne Koller Computer Science Department
More informationI N F S Y S R E S E A R C H R E P O R T
I N F S Y S R E S E A R C H R E P O R T INSTITUT FÜR INFORMATIONSSYSTEME ABTEILUNG WISSENSBASIERTE SYSTEME STRUCTURE-BASED CAUSES AND EXPLANATIONS IN THE INDEPENDENT CHOICE LOGIC Alberto Finzi and Thomas
More informationCS 7180: Behavioral Modeling and Decision- making in AI
CS 7180: Behavioral Modeling and Decision- making in AI Hidden Markov Models Prof. Amy Sliva October 26, 2012 Par?ally observable temporal domains POMDPs represented uncertainty about the state Belief
More informationAdding Modal Operators to the Action Language A
Adding Modal Operators to the Action Language A Aaron Hunter Simon Fraser University Burnaby, B.C. Canada V5A 1S6 amhunter@cs.sfu.ca Abstract The action language A is a simple high-level language for describing
More informationPartially Observable Markov Decision Processes (POMDPs) Pieter Abbeel UC Berkeley EECS
Partially Observable Markov Decision Processes (POMDPs) Pieter Abbeel UC Berkeley EECS Many slides adapted from Jur van den Berg Outline POMDPs Separation Principle / Certainty Equivalence Locally Optimal
More informationReasoning about Partially Observed Actions
Appears in Proceedings of 21st National Conference on Artificial Intelligence (AAAI 6). 1 Reasoning about Partially Observed Actions Megan Nance Adam Vogel Eyal Amir Computer Science Department University
More informationEncoding the Consistency of Relational Bayesian Networks
Encoding the Consistency of Relational Bayesian Networks Glauber De Bona 1, Fabio G. Cozman 1 1 Escola Politécnica Universidade de São Paulo (EP-USP) Abstract. Given an input structure, a relational Bayesian
More informationDeductive Algorithmic Knowledge
Deductive Algorithmic Knowledge Riccardo Pucella Department of Computer Science Cornell University Ithaca, NY 14853 riccardo@cs.cornell.edu Abstract The framework of algorithmic knowledge assumes that
More informationChapter 2 Background. 2.1 A Basic Description Logic
Chapter 2 Background Abstract Description Logics is a family of knowledge representation formalisms used to represent knowledge of a domain, usually called world. For that, it first defines the relevant
More informationProbabilistic Modal Logic
Probabilistic Modal Logic Afsaneh Shirazi and Eyal Amir Computer Science Department, University of Illinois at Urbana-Champaign Urbana, IL 61801, USA {hajiamin, eyal}@uiuc.edu Abstract A modal logic is
More informationInterpreting Low and High Order Rules: A Granular Computing Approach
Interpreting Low and High Order Rules: A Granular Computing Approach Yiyu Yao, Bing Zhou and Yaohua Chen Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail:
More informationAmbiguous Language and Differences in Beliefs
Proceedings of the Thirteenth International Conference on Principles of Knowledge Representation and Reasoning Ambiguous Language and Differences in Beliefs Joseph Y. Halpern Computer Science Dept. Cornell
More informationRL 14: POMDPs continued
RL 14: POMDPs continued Michael Herrmann University of Edinburgh, School of Informatics 06/03/2015 POMDPs: Points to remember Belief states are probability distributions over states Even if computationally
More informationProbabilistic and Logistic Circuits: A New Synthesis of Logic and Machine Learning
Probabilistic and Logistic Circuits: A New Synthesis of Logic and Machine Learning Guy Van den Broeck KULeuven Symposium Dec 12, 2018 Outline Learning Adding knowledge to deep learning Logistic circuits
More informationUsing first-order logic, formalize the following knowledge:
Probabilistic Artificial Intelligence Final Exam Feb 2, 2016 Time limit: 120 minutes Number of pages: 19 Total points: 100 You can use the back of the pages if you run out of space. Collaboration on the
More informationReasoning Under Uncertainty: Conditioning, Bayes Rule & the Chain Rule
Reasoning Under Uncertainty: Conditioning, Bayes Rule & the Chain Rule Alan Mackworth UBC CS 322 Uncertainty 2 March 13, 2013 Textbook 6.1.3 Lecture Overview Recap: Probability & Possible World Semantics
More informationPlanning in Partially Observable Domains with Fuzzy Epistemic States and Probabilistic Dynamics
Planning in Partially Observable Domains with Fuzzy Epistemic States and Probabilistic Dynamics Nicolas Drougard 1, Didier Dubois 2, Jean-Loup Farges 1, and Florent Teichteil-Königsbuch 1 1 Onera The French
More informationExploiting Logical Structure in Lifted Probabilistic Inference
Exploiting Logical Structure in Lifted Probabilistic Inference Vibhav Gogate and Pedro Domingos Computer Science & Engineering University of Washington Seattle, WA 9895, U.S.A. {vgogate,pedrod}@cs.washington.edu
More informationSystem f2lp Computing Answer Sets of First-Order Formulas
System f2lp Computing Answer Sets of First-Order Formulas Joohyung Lee and Ravi Palla Computer Science and Engineering Arizona State University, Tempe, AZ, USA {joolee,ravi.palla}@asu.edu Abstract. We
More information1 Lifted First-Order Probabilistic Inference
1 Lifted First-Order Probabilistic Inference Rodrigo de Salvo Braz, Eyal Amir and Dan Roth Most probabilistic inference algorithms are specified and processed on a propositional level, even though many
More informationA Comparative Study of Noncontextual and Contextual Dependencies
A Comparative Study of Noncontextual Contextual Dependencies S.K.M. Wong 1 C.J. Butz 2 1 Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: wong@cs.uregina.ca
More informationReasoning about Actions with Sensing under Qualitative and Probabilistic Uncertainty
Reasoning about Actions with Sensing under Qualitative and Probabilistic Uncertainty Luca Iocchi and homas Lukasiewicz 2 and Daniele Nardi and Riccardo Rosati Abstract. We focus on the aspect of sensing
More informationReasoning with Probabilities. Eric Pacuit Joshua Sack. Outline. Basic probability logic. Probabilistic Epistemic Logic.
Reasoning with July 28, 2009 Plan for the Course Day 1: Introduction and Background Day 2: s Day 3: Dynamic s Day 4: Reasoning with Day 5: Conclusions and General Issues Probability language Let Φ be a
More informationComputing Optimal Policies for Partially Observable Decision Processes using Compact Representations
Computing Optimal Policies for Partially Observable Decision Processes using Compact epresentations Craig Boutilier and David Poole Department of Computer Science niversity of British Columbia Vancouver,
More informationLogical Markov Decision Programs
Logical Markov Decision Programs Kristian Kersting and Luc De Raedt Machine Learning Lab, University of Freiburg, Georges-Koehler-Allee, Building 079, 79110 Freiburg, Germany Abstract Motivated by the
More informationA First-Order Stochastic Modeling Language for Diagnosis
A First-Order Stochastic Modeling Language for Diagnosis Chayan Chakrabarti, Roshan Rammohan and George F. Luger Department of Computer Science University of New Mexico Albuquerque, NM 87131 {cc, roshan,
More informationRough Sets for Uncertainty Reasoning
Rough Sets for Uncertainty Reasoning S.K.M. Wong 1 and C.J. Butz 2 1 Department of Computer Science, University of Regina, Regina, Canada, S4S 0A2, wong@cs.uregina.ca 2 School of Information Technology
More informationLogic and Artificial Intelligence Lecture 6
Logic and Artificial Intelligence Lecture 6 Eric Pacuit Currently Visiting the Center for Formal Epistemology, CMU Center for Logic and Philosophy of Science Tilburg University ai.stanford.edu/ epacuit
More informationNested Epistemic Logic Programs
Nested Epistemic Logic Programs Kewen Wang 1 and Yan Zhang 2 1 Griffith University, Australia k.wang@griffith.edu.au 2 University of Western Sydney yan@cit.uws.edu.au Abstract. Nested logic programs and
More informationA First-Order Logic of Probability and Only Knowing in Unbounded Domains
A First-Order Logic of Probability and Only Knowing in Unbounded Domains Vaishak Belle Dept. of Computer Science KU Leuven Belgium vaishak@cs.kuleuven.be Gerhard Lakemeyer Dept. of Computer Science RWTH
More informationIntroduction to Artificial Intelligence Propositional Logic & SAT Solving. UIUC CS 440 / ECE 448 Professor: Eyal Amir Spring Semester 2010
Introduction to Artificial Intelligence Propositional Logic & SAT Solving UIUC CS 440 / ECE 448 Professor: Eyal Amir Spring Semester 2010 Today Representation in Propositional Logic Semantics & Deduction
More informationPlanning in Factored Action Spaces with Symbolic Dynamic Programming
Proceedings of the Twenty-Sixth AAAI Conference on Artificial Intelligence Planning in Factored Action Spaces with Symbolic Dynamic Programming Aswin Raghavan, Saket Joshi, Alan Fern, Prasad Tadepalli
More informationProbabilistic Model Checking and Strategy Synthesis for Robot Navigation
Probabilistic Model Checking and Strategy Synthesis for Robot Navigation Dave Parker University of Birmingham (joint work with Bruno Lacerda, Nick Hawes) AIMS CDT, Oxford, May 2015 Overview Probabilistic
More informationDon t Plan for the Unexpected: Planning Based on Plausibility Models
Don t Plan for the Unexpected: Planning Based on Plausibility Models Thomas Bolander, DTU Informatics, Technical University of Denmark Joint work with Mikkel Birkegaard Andersen and Martin Holm Jensen
More informationOptimally Solving Dec-POMDPs as Continuous-State MDPs
Optimally Solving Dec-POMDPs as Continuous-State MDPs Jilles Dibangoye (1), Chris Amato (2), Olivier Buffet (1) and François Charpillet (1) (1) Inria, Université de Lorraine France (2) MIT, CSAIL USA IJCAI
More informationComputing Optimal Policies for Partially Observable Decision Processes using Compact Representations
Computing Optimal Policies for Partially Observable Decision Processes using Compact epresentations Craig Boutilier and David Poole Department of Computer Science niversity of British Columbia Vancouver,
More informationPlanning with Predictive State Representations
Planning with Predictive State Representations Michael R. James University of Michigan mrjames@umich.edu Satinder Singh University of Michigan baveja@umich.edu Michael L. Littman Rutgers University mlittman@cs.rutgers.edu
More informationCSL302/612 Artificial Intelligence End-Semester Exam 120 Minutes
CSL302/612 Artificial Intelligence End-Semester Exam 120 Minutes Name: Roll Number: Please read the following instructions carefully Ø Calculators are allowed. However, laptops or mobile phones are not
More informationUndecidability in Epistemic Planning
Undecidability in Epistemic Planning Thomas Bolander, DTU Compute, Tech Univ of Denmark Joint work with: Guillaume Aucher, Univ Rennes 1 Bolander: Undecidability in Epistemic Planning p. 1/17 Introduction
More informationarxiv: v1 [cs.ai] 14 Sep 2018
On Plans With Loops and Noise Vaishak Belle University of Edinburgh & Alan Turing Institute vaishak@ed.ac.uk arxiv:1809.05309v1 [cs.ai] 14 Sep 2018 ABSTRACT In an influential paper, Levesque proposed a
More informationMaximal Introspection of Agents
Electronic Notes in Theoretical Computer Science 70 No. 5 (2002) URL: http://www.elsevier.nl/locate/entcs/volume70.html 16 pages Maximal Introspection of Agents Thomas 1 Informatics and Mathematical Modelling
More informationThe Information Bottleneck Revisited or How to Choose a Good Distortion Measure
The Information Bottleneck Revisited or How to Choose a Good Distortion Measure Peter Harremoës Centrum voor Wiskunde en Informatica PO 94079, 1090 GB Amsterdam The Nederlands PHarremoes@cwinl Naftali
More informationToday s Outline. Recap: MDPs. Bellman Equations. Q-Value Iteration. Bellman Backup 5/7/2012. CSE 473: Artificial Intelligence Reinforcement Learning
CSE 473: Artificial Intelligence Reinforcement Learning Dan Weld Today s Outline Reinforcement Learning Q-value iteration Q-learning Exploration / exploitation Linear function approximation Many slides
More informationA Simple Model for Sequences of Relational State Descriptions
A Simple Model for Sequences of Relational State Descriptions Ingo Thon, Niels Landwehr, and Luc De Raedt Department of Computer Science, Katholieke Universiteit Leuven, Celestijnenlaan 200A, 3001 Heverlee,
More informationPei Wang( 王培 ) Temple University, Philadelphia, USA
Pei Wang( 王培 ) Temple University, Philadelphia, USA Artificial General Intelligence (AGI): a small research community in AI that believes Intelligence is a general-purpose capability Intelligence should
More informationProbabilistic Knowledge-Based Programs
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence (IJCAI 2015) Probabilistic Knowledge-Based Programs Jérôme Lang CNRS-LAMSADE Université Paris-Dauphine, France
More informationA Polynomial-time Nash Equilibrium Algorithm for Repeated Games
A Polynomial-time Nash Equilibrium Algorithm for Repeated Games Michael L. Littman mlittman@cs.rutgers.edu Rutgers University Peter Stone pstone@cs.utexas.edu The University of Texas at Austin Main Result
More informationSOME SEMANTICS FOR A LOGICAL LANGUAGE FOR THE GAME OF DOMINOES
SOME SEMANTICS FOR A LOGICAL LANGUAGE FOR THE GAME OF DOMINOES Fernando R. Velázquez-Quesada Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas Universidad Nacional Autónoma de México
More informationLogical Markov Decision Programs
Logical Markov Decision Programs Kristian Kersting kersting@informatik.uni-freiburg.de Luc De Raedt deraedt@informatik.uni-freiburg.de Machine Learning Lab, University of Freiburg, Georges-Koehler-Allee,
More informationOutline. CSE 573: Artificial Intelligence Autumn Agent. Partial Observability. Markov Decision Process (MDP) 10/31/2012
CSE 573: Artificial Intelligence Autumn 2012 Reasoning about Uncertainty & Hidden Markov Models Daniel Weld Many slides adapted from Dan Klein, Stuart Russell, Andrew Moore & Luke Zettlemoyer 1 Outline
More informationRobust Learning Equilibrium
Robust Learning Equilibrium Itai Ashlagi Dov Monderer Moshe Tennenholtz Faculty of Industrial Engineering and Management Technion Israel Institute of Technology Haifa 32000, Israel Abstract We introduce
More informationPartially Observable Markov Decision Processes (POMDPs)
Partially Observable Markov Decision Processes (POMDPs) Sachin Patil Guest Lecture: CS287 Advanced Robotics Slides adapted from Pieter Abbeel, Alex Lee Outline Introduction to POMDPs Locally Optimal Solutions
More informationRegion-Based Dynamic Programming for Partially Observable Markov Decision Processes
Region-Based Dynamic Programming for Partially Observable Markov Decision Processes Zhengzhu Feng Department of Computer Science University of Massachusetts Amherst, MA 01003 fengzz@cs.umass.edu Abstract
More informationMarkov Logic in Infinite Domains
Markov Logic in Infinite Domains Parag Singla Pedro Domingos Department of Computer Science and Engineering University of Washington Seattle, WA 98195-2350, U.S.A. {parag, pedrod}@cs.washington.edu Abstract
More informationOn analysis of the unicity of Jeffrey s rule of conditioning in a possibilistic framework
Abstract Conditioning is an important task for designing intelligent systems in artificial intelligence. This paper addresses an issue related to the possibilistic counterparts of Jeffrey s rule of conditioning.
More informationLogic. Introduction to Artificial Intelligence CS/ECE 348 Lecture 11 September 27, 2001
Logic Introduction to Artificial Intelligence CS/ECE 348 Lecture 11 September 27, 2001 Last Lecture Games Cont. α-β pruning Outline Games with chance, e.g. Backgammon Logical Agents and thewumpus World
More informationDialogue management: Parametric approaches to policy optimisation. Dialogue Systems Group, Cambridge University Engineering Department
Dialogue management: Parametric approaches to policy optimisation Milica Gašić Dialogue Systems Group, Cambridge University Engineering Department 1 / 30 Dialogue optimisation as a reinforcement learning
More informationEpistemic and Statistical Probabilistic Ontologies
Epistemic and Statistical Probabilistic Ontologies Fabrizio Riguzzi, Elena Bellodi, Evelina Lamma, Riccardo Zese ENDIF University of Ferrara, Italy fabrizio.riguzzi@unife.it, elena.bellodi@unife.it, evelina.lamma@unife.it,
More informationDecentralized Control of Cooperative Systems: Categorization and Complexity Analysis
Journal of Artificial Intelligence Research 22 (2004) 143-174 Submitted 01/04; published 11/04 Decentralized Control of Cooperative Systems: Categorization and Complexity Analysis Claudia V. Goldman Shlomo
More informationBelief Tracking for Planning with Sensing
Belief Tracking for Planning with Sensing Blai Bonet Universidad Simón Boĺıvar 2nd Brazilian Conf. on Intelligent Systems (BRACIS) Fortaleza, Brazil 2013 (joint work with Hector Geffner) Recap on Early
More informationA Generalized Decision Logic in Interval-set-valued Information Tables
A Generalized Decision Logic in Interval-set-valued Information Tables Y.Y. Yao 1 and Qing Liu 2 1 Department of Computer Science, University of Regina Regina, Saskatchewan, Canada S4S 0A2 E-mail: yyao@cs.uregina.ca
More informationA Stochastic l-calculus
A Stochastic l-calculus Content Areas: probabilistic reasoning, knowledge representation, causality Tracking Number: 775 Abstract There is an increasing interest within the research community in the design
More informationHindsight Optimization for Probabilistic Planning with Factored Actions
Hindsight Optimization for Probabilistic Planning with Factored Actions Murugeswari Issakkimuthu a, Alan Fern b, Roni Khardon a, Prasad Tadepalli b, Shan Xue b a Department of Computer Science, Tufts University,
More informationUnderstanding the Brandenburger-Keisler Belief Paradox
Understanding the Brandenburger-Keisler Belief Paradox Eric Pacuit Institute of Logic, Language and Information University of Amsterdam epacuit@staff.science.uva.nl staff.science.uva.nl/ epacuit March
More informationarxiv: v2 [cs.ai] 27 Jul 2017
Domain Recursion for Lifted Inference with Existential Quantifiers Seyed Mehran Kazemi 1, Angelika Kimmig 2, Guy Van den Broeck 3 and David Poole 1 1 University of British Columbia, {smkazemi,poole}@cs.ubc.ca
More information