Games A game is a tuple = (I; (S i ;r i ) i2i) where ffl I is a set of players (i 2 I) ffl S i is a set of (pure) strategies (s i 2 S i ) Q ffl r i :
|
|
- Simon Caldwell
- 6 years ago
- Views:
Transcription
1 On the Connection between No-Regret Learning, Fictitious Play, & Nash Equilibrium Amy Greenwald Brown University Gunes Ercal, David Gondek, Amir Jafari July,
2 Games A game is a tuple = (I; (S i ;r i ) i2i) where ffl I is a set of players (i 2 I) ffl S i is a set of (pure) strategies (s i 2 S i ) Q ffl r i : S! R is a reward function (s 2 S = S i i) A repeated game is a sequence of tuples T
3 Nash Equilibrium A mixed strategy q i is a probability distribution over pure strategy set S i. Given opposing strategy q i, a best-response q Λ i strategy that maximizes (one-step) reward: q Λ i = arg max q i r i (q i ;q i ) is a A Nash equilibrium q Λ = (q Λ ;qλ ) is a strategy profile s.t. i i q Λ is a best-response to q Λ, for all players i. i i All (finite) games have mixed strategy Nash equilibria.
4 No-Regret Regret is the expected difference in rewards for playing mixed strategy q t i rather than pure strategy s i: ρ t i(s i ) = r i (s i ;q t i) r i (q t i ;qt i) An on-line learning algorithm exhibits no-regret iff 8s i : lim T!1 1 T TX t=1 ρ t i(s i )» No-Regret Learning Algorithms Narendra and Thathachar Learning Automata Freund and Schapire Multiplicative Updating Foster and Vohra The Mixing Method Hart and Mas-Colell Correlated Equilibrium Hannan, Banos, Megiddo, etc.
5 No-Regret Regret is the expected difference in rewards for playing mixed strategy q t i rather than pure strategy s i: ρ t i(s i ) = r i (s i ;q t i) r i (q t i ;qt i) An on-line learning algorithm exhibits no-regret iff: lim T!1 TX t=1 ρ t i(s i )» Learning Algorithms Narendra and Thathachar Learning Automata Freund and Schapire Multiplicative Updating Foster and Vohra The Mixing Method Hart and Mas-Colell Correlated Equilibrium Hannan, Banos, Megiddo, etc.
6 Observation In an infinitely repeated game, if all players learn via no-regret algorithms, and if q i t! μq i for all players i, then μq = q Λ : i.e., μq is a mixed strategy Nash equilibrium.
7 Proof By assumption, r i (s i ;q t i)! r i (s i ; μq i ) r i (q t i ;qt i)! r i (q Λ i ; μq i) Therefore, lim T!1 lim T!1 1 T 1 T TX t=1 TX t=1 r i (s i ;q i) t = r i (s i ; μq i) r i (q t i ;qt i) = r i (μq i ; μq i ) By the no-regret property r i (s i ; μq i )» r i (μq i ; μq i ); for all s i ) r i (s Λ i ; μq i)» r i (μq i ; μq i ) where s Λ i 2 arg max s i 2S i r i (s i ; μq i ): i.e., μq = q Λ.
8 Simulations Informed Settings ρ T s i = q T +1 s i = TX t=1 P r i (s i ;q t i) (1 + ff) ρt s i μs i 2S i (1 + ff) ρt μs i [Freund and Schapire 1995] Naive Settings ^r(s i ;q t i) = 1 t s i ^q T +1 i = (1 ffl)q T +1 i + [Auer, Cesa-Bianchi, Freund, and Schapire 1996] ffl ff > is the learning rate ffl ffl > is the exploration rate
9 Simulations Informed Settings ρ T s i = TX t=1 r i (s i ;q t i) q T +1 s i = [Freund and Schapire 1995] Naive Settings ^r(s i ;q t i) = 1 t s i r(s i ;q t i ) ^q t s i ^q T +1 i = (1 ffl)q T +1 i + ffl js i j [Auer, Cesa-Bianchi, Freund, and Schapire 1996] ffl ff > is the learning rate ffl ffl > is the exploration rate
10 Pure Strategy Nash Equilibria Prisoners' Dilemma 1 2 C D C 4,4,5 D 5, 1,1 Battle of the Sexes 1 2 B F B 2,1, F, 1,2
11 Prisoners' Dilemma Informed Setting Weights Time Naive Setting Weights Time
12 Battle of the Sexes Informed Setting Weights Time Naive Setting Weights Time
13 Mixed Strategy Nash Equilibria Matching Pennies 1 2 H T H 1,,1 T,1 1, Shapley Game 1 2 L C R T 1,,1, M, 1,,1 B,1, 1,
14 Matching Pennies Mixed Strategies Weights Time Empirical Frequencies Frequencies Time
15 Shapley Game Mixed Strategies Weights Time Empirical Frequencies Frequencies Time
16 Shapley Game Smoothed Mixed Strategies Weights Time Smoothed Empirical Frequencies Frequencies Time
17 Summary of Simulations Theoretical ffl If no-regret learning converges, it converges to Nash equilibrium. Experimental ffl No-regret learning converges to pure strategy Nash equilibria in games for which such equilibria exist. ffl No-regret learning does not converge otherwise. Empirical frequencies converge to Nash in zero-sum games. Empirical frequencies converge to Nash in non-zero sum games of 2 strategies. Smoothed empirical frequencies converge to Nash in games of > 2 strategies.
18 No-Regret and Fictitious Play Prisoners' Dilemma Prisoner s Dilemma 9 % Fictitous Play Iterations Battle of the Sexes Battle of the Sexes 9 7 % Fictitous Play Iterations
19 No-Regret and Fictitious Play Matching Pennies Matching Pennies 9 7 % Fictitous Play Iterations 9.5 Matching Pennies % Fictitous Play Iterations
20 No-Regret and Fictitious Play Shapley Game Shapley Game 9 7 % Fictitous Play Iterations Shapley Game 9 7 % Fictitous Play Iterations
21 No-Regret and Fictitious Play Shapley Game Shapley Game 9 7 % Fictitous Play Iterations Shapley Game 9 7 % Fictitous Play Iterations
22 Papers ffl Shopbots and Pricebots [w/ Kephart] ffl Santa Fe Bar Problem [w/ Mishra & Parikh] ffl Learning in Networks [w/ Friedman & Shenker]
Game-Theoretic Learning:
Game-Theoretic Learning: Regret Minimization vs. Utility Maximization Amy Greenwald with David Gondek, Amir Jafari, and Casey Marks Brown University University of Pennsylvania November 17, 2004 Background
More informationC 4,4 0,5 0,0 0,0 B 0,0 0,0 1,1 1 2 T 3,3 0,0. Freund and Schapire: Prisoners Dilemma
On No-Regret Learning, Fictitious Play, and Nash Equilibrium Amir Jafari Department of Mathematics, Brown University, Providence, RI 292 Amy Greenwald David Gondek Department of Computer Science, Brown
More informationMultiagent Value Iteration in Markov Games
Multiagent Value Iteration in Markov Games Amy Greenwald Brown University with Michael Littman and Martin Zinkevich Stony Brook Game Theory Festival July 21, 2005 Agenda Theorem Value iteration converges
More informationConvergence and No-Regret in Multiagent Learning
Convergence and No-Regret in Multiagent Learning Michael Bowling Department of Computing Science University of Alberta Edmonton, Alberta Canada T6G 2E8 bowling@cs.ualberta.ca Abstract Learning in a multiagent
More informationBlackwell s Approachability Theorem: A Generalization in a Special Case. Amy Greenwald, Amir Jafari and Casey Marks
Blackwell s Approachability Theorem: A Generalization in a Special Case Amy Greenwald, Amir Jafari and Casey Marks Department of Computer Science Brown University Providence, Rhode Island 02912 CS-06-01
More informationAdvanced Machine Learning
Advanced Machine Learning Learning and Games MEHRYAR MOHRI MOHRI@ COURANT INSTITUTE & GOOGLE RESEARCH. Outline Normal form games Nash equilibrium von Neumann s minimax theorem Correlated equilibrium Internal
More informationLecture 14: Approachability and regret minimization Ramesh Johari May 23, 2007
MS&E 336 Lecture 4: Approachability and regret minimization Ramesh Johari May 23, 2007 In this lecture we use Blackwell s approachability theorem to formulate both external and internal regret minimizing
More informationPrediction and Playing Games
Prediction and Playing Games Vineel Pratap vineel@eng.ucsd.edu February 20, 204 Chapter 7 : Prediction, Learning and Games - Cesa Binachi & Lugosi K-Person Normal Form Games Each player k (k =,..., K)
More informationBelief-based Learning
Belief-based Learning Algorithmic Game Theory Marcello Restelli Lecture Outline Introdutcion to multi-agent learning Belief-based learning Cournot adjustment Fictitious play Bayesian learning Equilibrium
More informationImproving Convergence Rates in Multiagent Learning Through Experts and Adaptive Consultation
Improving Convergence Rates in Multiagent Learning Through Experts and Adaptive Consultation Greg Hines Cheriton School of Computer Science University of Waterloo Waterloo, Ontario, N2L 3G1 Kate Larson
More information4. Opponent Forecasting in Repeated Games
4. Opponent Forecasting in Repeated Games Julian and Mohamed / 2 Learning in Games Literature examines limiting behavior of interacting players. One approach is to have players compute forecasts for opponents
More informationGame Theory. Professor Peter Cramton Economics 300
Game Theory Professor Peter Cramton Economics 300 Definition Game theory is the study of mathematical models of conflict and cooperation between intelligent and rational decision makers. Rational: each
More informationGame Theory, Evolutionary Dynamics, and Multi-Agent Learning. Prof. Nicola Gatti
Game Theory, Evolutionary Dynamics, and Multi-Agent Learning Prof. Nicola Gatti (nicola.gatti@polimi.it) Game theory Game theory: basics Normal form Players Actions Outcomes Utilities Strategies Solutions
More informationNotes on Coursera s Game Theory
Notes on Coursera s Game Theory Manoel Horta Ribeiro Week 01: Introduction and Overview Game theory is about self interested agents interacting within a specific set of rules. Self-Interested Agents have
More informationComputing Minmax; Dominance
Computing Minmax; Dominance CPSC 532A Lecture 5 Computing Minmax; Dominance CPSC 532A Lecture 5, Slide 1 Lecture Overview 1 Recap 2 Linear Programming 3 Computational Problems Involving Maxmin 4 Domination
More informationWalras-Bowley Lecture 2003
Walras-Bowley Lecture 2003 Sergiu Hart This version: September 2004 SERGIU HART c 2004 p. 1 ADAPTIVE HEURISTICS A Little Rationality Goes a Long Way Sergiu Hart Center for Rationality, Dept. of Economics,
More informationSmooth Calibration, Leaky Forecasts, Finite Recall, and Nash Dynamics
Smooth Calibration, Leaky Forecasts, Finite Recall, and Nash Dynamics Sergiu Hart August 2016 Smooth Calibration, Leaky Forecasts, Finite Recall, and Nash Dynamics Sergiu Hart Center for the Study of Rationality
More informationUnified Convergence Proofs of Continuous-time Fictitious Play
Unified Convergence Proofs of Continuous-time Fictitious Play Jeff S. Shamma and Gurdal Arslan Department of Mechanical and Aerospace Engineering University of California Los Angeles {shamma,garslan}@seas.ucla.edu
More information6.254 : Game Theory with Engineering Applications Lecture 8: Supermodular and Potential Games
6.254 : Game Theory with Engineering Applications Lecture 8: Supermodular and Asu Ozdaglar MIT March 2, 2010 1 Introduction Outline Review of Supermodular Games Reading: Fudenberg and Tirole, Section 12.3.
More informationExponential Moving Average Based Multiagent Reinforcement Learning Algorithms
Exponential Moving Average Based Multiagent Reinforcement Learning Algorithms Mostafa D. Awheda Department of Systems and Computer Engineering Carleton University Ottawa, Canada KS 5B6 Email: mawheda@sce.carleton.ca
More informationStrategic Games: Social Optima and Nash Equilibria
Strategic Games: Social Optima and Nash Equilibria Krzysztof R. Apt CWI & University of Amsterdam Strategic Games:Social Optima and Nash Equilibria p. 1/2 Basic Concepts Strategic games. Nash equilibrium.
More informationCalibration and Nash Equilibrium
Calibration and Nash Equilibrium Dean Foster University of Pennsylvania and Sham Kakade TTI September 22, 2005 What is a Nash equilibrium? Cartoon definition of NE: Leroy Lockhorn: I m drinking because
More informationOnline Learning Class 12, 20 March 2006 Andrea Caponnetto, Sanmay Das
Online Learning 9.520 Class 12, 20 March 2006 Andrea Caponnetto, Sanmay Das About this class Goal To introduce the general setting of online learning. To describe an online version of the RLS algorithm
More informationThe Multi-Armed Bandit Problem
Università degli Studi di Milano The bandit problem [Robbins, 1952]... K slot machines Rewards X i,1, X i,2,... of machine i are i.i.d. [0, 1]-valued random variables An allocation policy prescribes which
More informationLecture 19: UCB Algorithm and Adversarial Bandit Problem. Announcements Review on stochastic multi-armed bandit problem
Lecture 9: UCB Algorithm and Adversarial Bandit Problem EECS598: Prediction and Learning: It s Only a Game Fall 03 Lecture 9: UCB Algorithm and Adversarial Bandit Problem Prof. Jacob Abernethy Scribe:
More informationGambling in a rigged casino: The adversarial multi-armed bandit problem
Gambling in a rigged casino: The adversarial multi-armed bandit problem Peter Auer Institute for Theoretical Computer Science University of Technology Graz A-8010 Graz (Austria) pauer@igi.tu-graz.ac.at
More informationPotential Games. Krzysztof R. Apt. CWI, Amsterdam, the Netherlands, University of Amsterdam. Potential Games p. 1/3
Potential Games p. 1/3 Potential Games Krzysztof R. Apt CWI, Amsterdam, the Netherlands, University of Amsterdam Potential Games p. 2/3 Overview Best response dynamics. Potential games. Congestion games.
More informationGame Theory: Spring 2017
Game Theory: Spring 207 Ulle Endriss Institute for Logic, Language and Computation University of Amsterdam Ulle Endriss Plan for Today We have seen that every normal-form game has a Nash equilibrium, although
More information1 Equilibrium Comparisons
CS/SS 241a Assignment 3 Guru: Jason Marden Assigned: 1/31/08 Due: 2/14/08 2:30pm We encourage you to discuss these problems with others, but you need to write up the actual homework alone. At the top of
More informationAdvanced Machine Learning
Advanced Machine Learning Bandit Problems MEHRYAR MOHRI MOHRI@ COURANT INSTITUTE & GOOGLE RESEARCH. Multi-Armed Bandit Problem Problem: which arm of a K-slot machine should a gambler pull to maximize his
More informationSatisfaction Equilibrium: Achieving Cooperation in Incomplete Information Games
Satisfaction Equilibrium: Achieving Cooperation in Incomplete Information Games Stéphane Ross and Brahim Chaib-draa Department of Computer Science and Software Engineering Laval University, Québec (Qc),
More informationExponential Moving Average Based Multiagent Reinforcement Learning Algorithms
Artificial Intelligence Review manuscript No. (will be inserted by the editor) Exponential Moving Average Based Multiagent Reinforcement Learning Algorithms Mostafa D. Awheda Howard M. Schwartz Received:
More informationHybrid Machine Learning Algorithms
Hybrid Machine Learning Algorithms Umar Syed Princeton University Includes joint work with: Rob Schapire (Princeton) Nina Mishra, Alex Slivkins (Microsoft) Common Approaches to Machine Learning!! Supervised
More informationIterated Strict Dominance in Pure Strategies
Iterated Strict Dominance in Pure Strategies We know that no rational player ever plays strictly dominated strategies. As each player knows that each player is rational, each player knows that his opponents
More informationComputing Minmax; Dominance
Computing Minmax; Dominance CPSC 532A Lecture 5 Computing Minmax; Dominance CPSC 532A Lecture 5, Slide 1 Lecture Overview 1 Recap 2 Linear Programming 3 Computational Problems Involving Maxmin 4 Domination
More informationOn Learning Algorithms for Nash Equilibria
On Learning Algorithms for Nash Equilibria The MIT Faculty has made this article openly available. Please share how this access benefits you. Your story matters. Citation As Published Publisher Daskalakis,
More informationBounded Rationality, Strategy Simplification, and Equilibrium
Bounded Rationality, Strategy Simplification, and Equilibrium UPV/EHU & Ikerbasque Donostia, Spain BCAM Workshop on Interactions, September 2014 Bounded Rationality Frequently raised criticism of game
More informationReducing contextual bandits to supervised learning
Reducing contextual bandits to supervised learning Daniel Hsu Columbia University Based on joint work with A. Agarwal, S. Kale, J. Langford, L. Li, and R. Schapire 1 Learning to interact: example #1 Practicing
More informationCyclic Equilibria in Markov Games
Cyclic Equilibria in Markov Games Martin Zinkevich and Amy Greenwald Department of Computer Science Brown University Providence, RI 02912 {maz,amy}@cs.brown.edu Michael L. Littman Department of Computer
More informationCorrelated Equilibria: Rationality and Dynamics
Correlated Equilibria: Rationality and Dynamics Sergiu Hart June 2010 AUMANN 80 SERGIU HART c 2010 p. 1 CORRELATED EQUILIBRIA: RATIONALITY AND DYNAMICS Sergiu Hart Center for the Study of Rationality Dept
More informationA Modified Q-Learning Algorithm for Potential Games
Preprints of the 19th World Congress The International Federation of Automatic Control A Modified Q-Learning Algorithm for Potential Games Yatao Wang Lacra Pavel Edward S. Rogers Department of Electrical
More informationMechanism Design: Implementation. Game Theory Course: Jackson, Leyton-Brown & Shoham
Game Theory Course: Jackson, Leyton-Brown & Shoham Bayesian Game Setting Extend the social choice setting to a new setting where agents can t be relied upon to disclose their preferences honestly Start
More informationRegret in the On-Line Decision Problem
Games and Economic Behavior 29, 7 35 (1999) Article ID game.1999.0740, available online at http://www.idealibrary.com on Regret in the On-Line Decision Problem Dean P. Foster Department of Statistics,
More informationInternal Regret in On-line Portfolio Selection
Internal Regret in On-line Portfolio Selection Gilles Stoltz (gilles.stoltz@ens.fr) Département de Mathématiques et Applications, Ecole Normale Supérieure, 75005 Paris, France Gábor Lugosi (lugosi@upf.es)
More informationLearning correlated equilibria in games with compact sets of strategies
Learning correlated equilibria in games with compact sets of strategies Gilles Stoltz a,, Gábor Lugosi b a cnrs and Département de Mathématiques et Applications, Ecole Normale Supérieure, 45 rue d Ulm,
More informationarxiv: v1 [cs.gt] 1 Sep 2015
HC selection for MCTS in Simultaneous Move Games Analysis of Hannan Consistent Selection for Monte Carlo Tree Search in Simultaneous Move Games arxiv:1509.00149v1 [cs.gt] 1 Sep 2015 Vojtěch Kovařík vojta.kovarik@gmail.com
More informationTheory and Applications of A Repeated Game Playing Algorithm. Rob Schapire Princeton University [currently visiting Yahoo!
Theory and Applications of A Repeated Game Playing Algorithm Rob Schapire Princeton University [currently visiting Yahoo! Research] Learning Is (Often) Just a Game some learning problems: learn from training
More informationLearning, Games, and Networks
Learning, Games, and Networks Abhishek Sinha Laboratory for Information and Decision Systems MIT ML Talk Series @CNRG December 12, 2016 1 / 44 Outline 1 Prediction With Experts Advice 2 Application to
More informationIntroduction to Game Theory
COMP323 Introduction to Computational Game Theory Introduction to Game Theory Paul G. Spirakis Department of Computer Science University of Liverpool Paul G. Spirakis (U. Liverpool) Introduction to Game
More informationConvergence to Pareto Optimality in General Sum Games via Learning Opponent s Preference
Convergence to Pareto Optimality in General Sum Games via Learning Opponent s Preference Dipyaman Banerjee Department of Math & CS University of Tulsa Tulsa, OK, USA dipyaman@gmail.com Sandip Sen Department
More informationSolving Extensive Form Games
Chapter 8 Solving Extensive Form Games 8.1 The Extensive Form of a Game The extensive form of a game contains the following information: (1) the set of players (2) the order of moves (that is, who moves
More informationarxiv: v2 [cs.gt] 23 Jan 2019
Analysis of Hannan Consistent Selection for Monte Carlo Tree Search in Simultaneous Move Games Vojtěch Kovařík, Viliam Lisý vojta.kovarik@gmail.com, viliam.lisy@agents.fel.cvut.cz arxiv:1804.09045v2 [cs.gt]
More informationChapter 9. Mixed Extensions. 9.1 Mixed strategies
Chapter 9 Mixed Extensions We now study a special case of infinite strategic games that are obtained in a canonic way from the finite games, by allowing mixed strategies. Below [0, 1] stands for the real
More informationPerformance and Convergence of Multi-user Online Learning
Performance and Convergence of Multi-user Online Learning Cem Tekin, Mingyan Liu Department of Electrical Engineering and Computer Science University of Michigan, Ann Arbor, Michigan, 4809-222 Email: {cmtkn,
More informationDECENTRALIZED LEARNING IN GENERAL-SUM MATRIX GAMES: AN L R I LAGGING ANCHOR ALGORITHM. Xiaosong Lu and Howard M. Schwartz
International Journal of Innovative Computing, Information and Control ICIC International c 20 ISSN 349-498 Volume 7, Number, January 20 pp. 0 DECENTRALIZED LEARNING IN GENERAL-SUM MATRIX GAMES: AN L R
More informationBrown s Original Fictitious Play
manuscript No. Brown s Original Fictitious Play Ulrich Berger Vienna University of Economics, Department VW5 Augasse 2-6, A-1090 Vienna, Austria e-mail: ulrich.berger@wu-wien.ac.at March 2005 Abstract
More informationAchieving Pareto Optimality Through Distributed Learning
1 Achieving Pareto Optimality Through Distributed Learning Jason R. Marden, H. Peyton Young, and Lucy Y. Pao Abstract We propose a simple payoff-based learning rule that is completely decentralized, and
More informationGame Theory and Control
Game Theory and Control Lecture 4: Potential games Saverio Bolognani, Ashish Hota, Maryam Kamgarpour Automatic Control Laboratory ETH Zürich 1 / 40 Course Outline 1 Introduction 22.02 Lecture 1: Introduction
More informationREPEATED GAMES. Jörgen Weibull. April 13, 2010
REPEATED GAMES Jörgen Weibull April 13, 2010 Q1: Can repetition induce cooperation? Peace and war Oligopolistic collusion Cooperation in the tragedy of the commons Q2: Can a game be repeated? Game protocols
More informationBetter-Reply Dynamics with Bounded Recall
` Kyiv School of Economics & Kyiv Economics Institute DISCUSSION PAPER SERIES Better-Reply Dynamics with Bounded Recall Andriy Zapechelnyuk (Kyiv School of Economics and Kyiv Economics Institute) First
More informationA Multiplayer Generalization of the MinMax Theorem
A Multiplayer Generalization of the MinMax Theorem Yang Cai Ozan Candogan Constantinos Daskalakis Christos Papadimitriou Abstract We show that in zero-sum polymatrix games, a multiplayer generalization
More informationSelecting Efficient Correlated Equilibria Through Distributed Learning. Jason R. Marden
1 Selecting Efficient Correlated Equilibria Through Distributed Learning Jason R. Marden Abstract A learning rule is completely uncoupled if each player s behavior is conditioned only on his own realized
More informationLEARNING IN CONCAVE GAMES
LEARNING IN CONCAVE GAMES P. Mertikopoulos French National Center for Scientific Research (CNRS) Laboratoire d Informatique de Grenoble GSBE ETBC seminar Maastricht, October 22, 2015 Motivation and Preliminaries
More informationLearning to play partially-specified equilibrium
Learning to play partially-specified equilibrium Ehud Lehrer and Eilon Solan August 29, 2007 August 29, 2007 First draft: June 2007 Abstract: In a partially-specified correlated equilibrium (PSCE ) the
More informationEC3224 Autumn Lecture #04 Mixed-Strategy Equilibrium
Reading EC3224 Autumn Lecture #04 Mixed-Strategy Equilibrium Osborne Chapter 4.1 to 4.10 By the end of this week you should be able to: find a mixed strategy Nash Equilibrium of a game explain why mixed
More informationA General Class of Adaptive Strategies 1
Journal of Economic Theory 98, 2654 (2001) doi:10.1006jeth.2000.2746, available online at http:www.idealibrary.com on A General Class of Adaptive Strategies 1 Sergiu Hart 2 Center for Rationality and Interactive
More informationA (Brief) Introduction to Game Theory
A (Brief) Introduction to Game Theory Johanne Cohen PRiSM/CNRS, Versailles, France. Goal Goal is a Nash equilibrium. Today The game of Chicken Definitions Nash Equilibrium Rock-paper-scissors Game Mixed
More informationMixed Strategies. Krzysztof R. Apt. CWI, Amsterdam, the Netherlands, University of Amsterdam. (so not Krzystof and definitely not Krystof)
Mixed Strategies Krzysztof R. Apt (so not Krzystof and definitely not Krystof) CWI, Amsterdam, the Netherlands, University of Amsterdam Mixed Strategies p. 1/1 Mixed Extension of a Finite Game Probability
More informationLearning to Compete, Compromise, and Cooperate in Repeated General-Sum Games
Learning to Compete, Compromise, and Cooperate in Repeated General-Sum Games Jacob W. Crandall Michael A. Goodrich Computer Science Department, Brigham Young University, Provo, UT 84602 USA crandall@cs.byu.edu
More informationIndustrial Organization Lecture 3: Game Theory
Industrial Organization Lecture 3: Game Theory Nicolas Schutz Nicolas Schutz Game Theory 1 / 43 Introduction Why game theory? In the introductory lecture, we defined Industrial Organization as the economics
More informationThe Query Complexity of Correlated Equilibria
The Query Complexity of Correlated Equilibria arxiv:1305.4874v2 [cs.gt] 25 Dec 2017 Sergiu Hart Noam Nisan December 27, 2017 Abstract We consider the complexity of finding a correlated equilibrium of an
More informationThe Multi-Armed Bandit Problem
The Multi-Armed Bandit Problem Electrical and Computer Engineering December 7, 2013 Outline 1 2 Mathematical 3 Algorithm Upper Confidence Bound Algorithm A/B Testing Exploration vs. Exploitation Scientist
More informationThe Multi-Arm Bandit Framework
The Multi-Arm Bandit Framework A. LAZARIC (SequeL Team @INRIA-Lille) ENS Cachan - Master 2 MVA SequeL INRIA Lille MVA-RL Course In This Lecture A. LAZARIC Reinforcement Learning Algorithms Oct 29th, 2013-2/94
More informationExtensive Form Games I
Extensive Form Games I Definition of Extensive Form Game a finite game tree X with nodes x X nodes are partially ordered and have a single root (minimal element) terminal nodes are z Z (maximal elements)
More informationGame Theory for Linguists
Fritz Hamm, Roland Mühlenbernd 4. Mai 2016 Overview Overview 1. Exercises 2. Contribution to a Public Good 3. Dominated Actions Exercises Exercise I Exercise Find the player s best response functions in
More informationAdvanced Machine Learning
Advanced Machine Learning Follow-he-Perturbed Leader MEHRYAR MOHRI MOHRI@ COURAN INSIUE & GOOGLE RESEARCH. General Ideas Linear loss: decomposition as a sum along substructures. sum of edge losses in a
More informationBiological Applications of Game Theory
Biological Applications of Game Theory Frank Thuijsman 1 st Joint China-Dutch Seminar on Game Theory and 4 th China Meeting on Game Theory and its Applications Presentation Outline Two examples of Biological
More informationRegret Based Dynamics: Convergence in Weakly Acyclic Games
Regret Based Dynamics: Convergence in Weakly Acyclic Games Jason R. Marden Department of Mechanical and Aerospace Engineering University of California Los Angeles, CA 90095 marden@ucla.edu Gürdal Arslan
More informationBELIEFS & EVOLUTIONARY GAME THEORY
1 / 32 BELIEFS & EVOLUTIONARY GAME THEORY Heinrich H. Nax hnax@ethz.ch & Bary S. R. Pradelski bpradelski@ethz.ch May 15, 217: Lecture 1 2 / 32 Plan Normal form games Equilibrium invariance Equilibrium
More informationGame Theory Fall 2003
Game Theory Fall 2003 Problem Set 1 [1] In this problem (see FT Ex. 1.1) you are asked to play with arbitrary 2 2 games just to get used to the idea of equilibrium computation. Specifically, consider the
More informationInternal Regret in On-line Portfolio Selection
Internal Regret in On-line Portfolio Selection Gilles Stoltz gilles.stoltz@ens.fr) Département de Mathématiques et Applications, Ecole Normale Supérieure, 75005 Paris, France Gábor Lugosi lugosi@upf.es)
More informationDynamic Fictitious Play, Dynamic Gradient Play, and Distributed Convergence to Nash Equilibria
Dynamic Fictitious Play, Dynamic Gradient Play, and Distributed Convergence to Nash Equilibria Jeff S. Shamma and Gurdal Arslan Abstract We consider a continuous-time form of repeated matrix games in which
More informationEASINESS IN BANDITS. Gergely Neu. Pompeu Fabra University
EASINESS IN BANDITS Gergely Neu Pompeu Fabra University EASINESS IN BANDITS Gergely Neu Pompeu Fabra University THE BANDIT PROBLEM Play for T rounds attempting to maximize rewards THE BANDIT PROBLEM Play
More informationExtending Fictitious Play with Pattern Recognition
Extending Fictitious Play with Pattern Recognition November 13, 2013 Author: Ronald Chu (info@ronaldchu.nl) Supervisor: dr. G.A.W. Vreeswijk Abstract Fictitious play, an algorithm to predict the opponents
More informationLearning to Coordinate Efficiently: A Model-based Approach
Journal of Artificial Intelligence Research 19 (2003) 11-23 Submitted 10/02; published 7/03 Learning to Coordinate Efficiently: A Model-based Approach Ronen I. Brafman Computer Science Department Ben-Gurion
More informationStatic (or Simultaneous- Move) Games of Complete Information
Static (or Simultaneous- Move) Games of Complete Information Introduction to Games Normal (or Strategic) Form Representation Teoria dos Jogos - Filomena Garcia 1 Outline of Static Games of Complete Information
More informationFrom External to Internal Regret
From External to Internal Regret Avrim Blum School of Computer Science Carnegie Mellon University, Pittsburgh, PA 15213. Yishay Mansour School of Computer Science, Tel Aviv University, Tel Aviv, ISRAEL.
More informationDeterministic Calibration and Nash Equilibrium
University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 2-2008 Deterministic Calibration and Nash Equilibrium Sham M. Kakade Dean P. Foster University of Pennsylvania Follow
More informationGame Theory and Algorithms Lecture 2: Nash Equilibria and Examples
Game Theory and Algorithms Lecture 2: Nash Equilibria and Examples February 24, 2011 Summary: We introduce the Nash Equilibrium: an outcome (action profile) which is stable in the sense that no player
More informationFake news. Xiaofan Li & Andrew Whinston. University of Texas at Austin December 30, 2017
Fake news Xiaofan Li & Andrew Whinston University of Texas at Austin li.x@utexas.edu December 30, 2017 Xiaofan Li & Andrew Whinston (UT Austin) Fake news December 30, 2017 1 / 23 Overview 1 Introduction
More informationAdaptive State Feedback Nash Strategies for Linear Quadratic Discrete-Time Games
Adaptive State Feedbac Nash Strategies for Linear Quadratic Discrete-Time Games Dan Shen and Jose B. Cruz, Jr. Intelligent Automation Inc., Rocville, MD 2858 USA (email: dshen@i-a-i.com). The Ohio State
More informationBayesian-Nash Equilibrium
Bayesian-Nash Equilibrium A Bayesian game models uncertainty over types of opponents a player faces The game was defined in terms of players, their types, their available actions A player s beliefs about
More informationNo-regret algorithms for structured prediction problems
No-regret algorithms for structured prediction problems Geoffrey J. Gordon December 21, 2005 CMU-CALD-05-112 School of Computer Science Carnegie-Mellon University Pittsburgh, PA 15213 Abstract No-regret
More informationAn Introduction to Game Theory
An Introduction to Game Theory Bruce Hajek Department of Electrical and Computer Engineering University of Illinois at Urbana-Champaign December 10, 2018 c 2018 by Bruce Hajek All rights reserved. Permission
More informationPrisoner s Dilemma. Veronica Ciocanel. February 25, 2013
n-person February 25, 2013 n-person Table of contents 1 Equations 5.4, 5.6 2 3 Types of dilemmas 4 n-person n-person GRIM, GRIM, ALLD Useful to think of equations 5.4 and 5.6 in terms of cooperation and
More informationFictitious Self-Play in Extensive-Form Games
Johannes Heinrich, Marc Lanctot, David Silver University College London, Google DeepMind July 9, 05 Problem Learn from self-play in games with imperfect information. Games: Multi-agent decision making
More informationSolving Zero-Sum Extensive-Form Games. Branislav Bošanský AE4M36MAS, Fall 2013, Lecture 6
Solving Zero-Sum Extensive-Form Games ranislav ošanský E4M36MS, Fall 2013, Lecture 6 Imperfect Information EFGs States Players 1 2 Information Set ctions Utility Solving II Zero-Sum EFG with perfect recall
More informationBandit models: a tutorial
Gdt COS, December 3rd, 2015 Multi-Armed Bandit model: general setting K arms: for a {1,..., K}, (X a,t ) t N is a stochastic process. (unknown distributions) Bandit game: a each round t, an agent chooses
More informationMultiagent Learning. Foundations and Recent Trends. Stefano Albrecht and Peter Stone
Multiagent Learning Foundations and Recent Trends Stefano Albrecht and Peter Stone Tutorial at IJCAI 2017 conference: http://www.cs.utexas.edu/~larg/ijcai17_tutorial Overview Introduction Multiagent Models
More informationMultiagent Learning Using a Variable Learning Rate
Multiagent Learning Using a Variable Learning Rate Michael Bowling, Manuela Veloso Computer Science Department, Carnegie Mellon University, Pittsburgh, PA 15213-3890 Abstract Learning to act in a multiagent
More information