A (Brief) Introduction to Game Theory

Similar documents
Game Theory: introduction and applications to computer networks

Quantum Games. Quantum Strategies in Classical Games. Presented by Yaniv Carmeli

Introduction to Game Theory

Game Theory and Control

Normal-form games. Vincent Conitzer

Game Theory, Evolutionary Dynamics, and Multi-Agent Learning. Prof. Nicola Gatti

First Prev Next Last Go Back Full Screen Close Quit. Game Theory. Giorgio Fagiolo

Game Theory and Algorithms Lecture 2: Nash Equilibria and Examples

Evolution & Learning in Games

Introduction to Game Theory

Prisoner s Dilemma. Veronica Ciocanel. February 25, 2013

Non-zero-sum Game and Nash Equilibarium

Game Theory. Professor Peter Cramton Economics 300

Belief-based Learning

Algorithmic Game Theory. Alexander Skopalik

Game Theory -- Lecture 4. Patrick Loiseau EURECOM Fall 2016

Extensive Form Games with Perfect Information

Chapter 2. Equilibrium. 2.1 Complete Information Games

Static (or Simultaneous- Move) Games of Complete Information

Game Theory. Greg Plaxton Theory in Programming Practice, Spring 2004 Department of Computer Science University of Texas at Austin

Introduction to Game Theory. Outline. Topics. Recall how we model rationality. Notes. Notes. Notes. Notes. Tyler Moore.

Industrial Organization Lecture 3: Game Theory

Introduction to Game Theory Lecture Note 2: Strategic-Form Games and Nash Equilibrium (2)

Iterated Strict Dominance in Pure Strategies

Bounded Rationality, Strategy Simplification, and Equilibrium

General-sum games. I.e., pretend that the opponent is only trying to hurt you. If Column was trying to hurt Row, Column would play Left, so

MS&E 246: Lecture 17 Network routing. Ramesh Johari

Game Theory: Spring 2017

Bounded Rationality Lecture 2. Full (Substantive, Economic) Rationality

Game Theory and Evolution

Mohammad Hossein Manshaei 1394

CSC304 Lecture 5. Game Theory : Zero-Sum Games, The Minimax Theorem. CSC304 - Nisarg Shah 1

Basic Game Theory. Kate Larson. January 7, University of Waterloo. Kate Larson. What is Game Theory? Normal Form Games. Computing Equilibria

Outline for today. Stat155 Game Theory Lecture 16: Evolutionary game theory. Evolutionarily stable strategies. Nash equilibrium.

Chapter 2. Equilibrium. 2.1 Complete Information Games

6.207/14.15: Networks Lecture 16: Cooperation and Trust in Networks

CS 798: Multiagent Systems

For general queries, contact

UC Berkeley Haas School of Business Game Theory (EMBA 296 & EWMBA 211) Summer 2016

Microeconomics. 2. Game Theory

Strategic Games: Social Optima and Nash Equilibria

Microeconomics for Business Practice Session 3 - Solutions

Entanglement enhanced multiplayer quantum games

Evolutionary Game Theory

Lecture December 2009 Fall 2009 Scribe: R. Ring In this lecture we will talk about

Games and Their Equilibria

1 Basic Game Modelling

Correlated Equilibria: Rationality and Dynamics

Computing Minmax; Dominance

Computational Evolutionary Game Theory and why I m never using PowerPoint for another presentation involving maths ever again

Quantum Games Have No News for Economists 1

Observations on Cooperation

EC3224 Autumn Lecture #04 Mixed-Strategy Equilibrium

Lecture Notes on Game Theory

arxiv:quant-ph/ v3 18 Aug 2004

6.207/14.15: Networks Lecture 11: Introduction to Game Theory 3

EC3224 Autumn Lecture #03 Applications of Nash Equilibrium

Satisfaction Equilibrium: Achieving Cooperation in Incomplete Information Games

MAS and Games. Department of Informatics Clausthal University of Technology SS 18

1 Equilibrium Comparisons

ALGORITHMIC GAME THEORY. Incentive and Computation

Advanced Microeconomics

EC319 Economic Theory and Its Applications, Part II: Lecture 2

On Selfish Behavior in CSMA/CA Networks

6.254 : Game Theory with Engineering Applications Lecture 8: Supermodular and Potential Games

MS&E 246: Lecture 4 Mixed strategies. Ramesh Johari January 18, 2007

C31: Game Theory, Lecture 1

Computation of Nash Equilibria of Two-Person Non-Cooperative Games with Maple

Selfishness vs Altruism vs Balance

Computing Minmax; Dominance

VII. Cooperation & Competition

Collective Evolution of Turn-taking Norm in Dispersion Games

An Introduction to Evolutionary Game Theory

Game Theory: Lecture 3

Other Equilibrium Notions

Bayesian-Nash Equilibrium

EVOLUTIONARY STABILITY FOR TWO-STAGE HAWK-DOVE GAMES

Mixed Nash Equilibria

Draft notes for ECE 586 GT: Game theory. Bruce Hajek

princeton univ. F 17 cos 521: Advanced Algorithm Design Lecture 17: Games, Min-Max and Equilibria, and PPAD

Advantage of a quantum player over a classical one in 2 2 quantum games

Economics 703 Advanced Microeconomics. Professor Peter Cramton Fall 2017

STOCHASTIC STABILITY OF GROUP FORMATION IN COLLECTIVE ACTION GAMES. Toshimasa Maruta 1 and Akira Okada 2

Algorithmic Game Theory and Applications. Lecture 4: 2-player zero-sum games, and the Minimax Theorem

Game Theory for Linguists

6.207/14.15: Networks Lecture 10: Introduction to Game Theory 2

Recap Social Choice Fun Game Voting Paradoxes Properties. Social Choice. Lecture 11. Social Choice Lecture 11, Slide 1

CS364A: Algorithmic Game Theory Lecture #13: Potential Games; A Hierarchy of Equilibria

Game Theory. Wolfgang Frimmel. Perfect Bayesian Equilibrium

Bargaining Efficiency and the Repeated Prisoners Dilemma. Bhaskar Chakravorti* and John Conley**

Quantum Solution of Coordination Problems

Other-Regarding Preferences: Theory and Evidence

6 The Principle of Optimality

A Few Games and Geometric Insights

Basics of Game Theory

Mathematical Games and Random Walks

Game-Theoretic Learning:

Equilibrium Computation

Selecting Efficient Correlated Equilibria Through Distributed Learning. Jason R. Marden

Potential Games. Krzysztof R. Apt. CWI, Amsterdam, the Netherlands, University of Amsterdam. Potential Games p. 1/3

Transcription:

A (Brief) Introduction to Game Theory Johanne Cohen PRiSM/CNRS, Versailles, France.

Goal

Goal is a Nash equilibrium.

Today The game of Chicken Definitions Nash Equilibrium Rock-paper-scissors Game Mixed strategy Mixed Nash Equilibrium Prisoner s dilemma Bonus : Toward learning equilibria

Outline The game of Chicken Definitions Nash Equilibrium Rock-paper-scissors Game Mixed strategy Mixed Nash Equilibrium Prisoner s dilemma Bonus : Toward learning equilibria

The game of Chicken (Hawk-dove game) A single lane bridge Two drivers Bob and Alice want to go cross it from opposite directions. Each driver can Cross or Stop

The game of Chicken (Hawk-dove game) A single lane bridge Two drivers Bob and Alice want to go cross it from opposite directions. Each driver can Cross or Stop Both drivers want to minimize the time spent to reach other side.

The game of Chicken (Hawk-dove game) A single lane bridge Two drivers Bob and Alice want to go cross it from opposite directions. Each driver can Cross or Stop if both attempt to cross, the result is a fatal traffic accident. There are 4 outcomes depending on the choices made by each of the 2 drivers

Model. 1. Who play? 2. Which actions/strategies? 3. Which payoff according to strategy profile?

Model. 1. Who play? Alice and Bob 2. Which actions/strategies? Cross or Stop 3. Which payoff according to strategy profile? cost = transport time

Model. 1. Who play? Alice and Bob 2. Which actions/strategies? Cross or Stop 3. Which payoff according to strategy profile? cost = transport time Alice Bob Cross Stop Cross (1, 2) Stop (2, 1)

Model. 1. Who play? Alice and Bob 2. Which actions/strategies? Cross or Stop 3. Which payoff according to strategy profile? cost = transport time Alice Bob Cross Stop Cross (60, 60) (1, 2) Stop (2, 1)

Model. 1. Who play? Alice and Bob 2. Which actions/strategies? Cross or Stop 3. Which payoff according to strategy profile? cost = transport time Alice Bob Cross Stop Cross (60, 60) (1, 2) Stop (2, 1) (5, 5)

Games in standard form Alice Bob Cross Stop Cross (60, 60) (1, 2) Stop (2, 1) (5, 5) The two strategies of Bob correspond to the two columns. The entries of the matrix are the outcomes incurred by the players in each situation.

Rational behavior rational behavior of player : select strategy which minimizes its cost. Alice Bob Cross Stop Cross (60, 60) (1, 2) Stop (2, 1) (5, 5) For example : 1. If Bob selects to Cross, then Alice would select to Stop. 2. If Bob selects to Stop, then Alice would select to Cross.

Rational behavior rational behavior of player : select strategy which minimizes its cost. Alice Bob Cross Stop Cross (60, 60) (1, 2) Stop (2, 1) (5, 5) For example : 1. If Bob selects to Cross, then Alice would select to Stop. 2. If Bob selects to Stop, then Alice would select to Cross. Alice has a rational behavior.

Best responses of player i : Best response = the strategies which produce the most favorable outcome for a player Alice Bob Cross Stop Cross (60, 60) (1, 2) Stop (2, 1) (5, 5)

Best responses of player i : Best response = the strategies which produce the most favorable outcome for a player Alice Bob Cross Stop Cross (60, 60) (1, 2) Stop (2, 1) (5, 5) Equilibrium = mutual best responses

Nash Equilibrium Consider a game with a set of n players {1,..., n} Each player has a set of possible strategies S i s = (s 1,, s n ) is a vector of strategies selected by the players A pure strategy Nash Equilibrium (NE) is a vector of strategies s = (s 1,, s n ) such that i, s i, we have c i(s 1,, s i,, s n ) c i (s 1,, s i,, s n).

Nash Equilibrium Consider a game with a set of n players {1,..., n} Each player has a set of possible strategies S i s = (s 1,, s n ) is a vector of strategies selected by the players A pure strategy Nash Equilibrium (NE) is a vector of strategies s = (s 1,, s n ) such that i, s i we have c i (s i, s i ) c i (s i, s i).

Back to example Alice Bob Cross Stop Cross (60, 60) (1, 2) Stop (2, 1) (5, 5) 2 pure strategy Nash Equilibria : (Cross, Stop ) and (Stop, Cross )

Back to example Alice Bob Cross Stop Cross (60, 60) (1, 2) Stop (2, 1) (5, 5) 2 pure strategy Nash Equilibria : (Cross, Stop ) and (Stop, Cross ) But which equilibrium should be selected? Which one will be selected by the system if its converges?

Back to example Alice Bob Cross Stop Cross (60, 60) (1, 2) Stop (2, 1) (5, 5) 2 pure strategy Nash Equilibria : (Cross, Stop ) and (Stop, Cross ) But which equilibrium should be selected? Which one will be selected by the system if its converges? One solution

Outline The game of Chicken Definitions Nash Equilibrium Rock-paper-scissors Game Mixed strategy Mixed Nash Equilibrium Prisoner s dilemma Bonus : Toward learning equilibria

Rock-paper-scissors Game Rules : 2 players select one strategy from Standard form : Rock/Paper/scissors. Scissors Paper Rock Scissors (0, 0) ( 1, 1) (1, 1) Paper (1, 1) (0, 0) ( 1, 1) Rock ( 1, 1) (1, 1) (0, 0) @wikipedia No pure strategy Nash equilibrium.

Rock-paper-scissors Game Rules : 2 players select one strategy from Standard form : Rock/Paper/scissors. Scissors Paper Rock Scissors (0, 0) ( 1, 1) (1, 1) Paper (1, 1) (0, 0) ( 1, 1) Rock ( 1, 1) (1, 1) (0, 0) No pure strategy Nash equilibrium. But if players select strategies at random, p i (Rock) + p i (Paper) + p i (Scissors) = 1

Rock-paper-scissors Game Rules : 2 players select one strategy from Rock/Paper/scissors. Standard form : Scissors Paper Rock Scissors (0, 0) ( 1, 1) (1, 1) Paper (1, 1) (0, 0) ( 1, 1) Rock ( 1, 1) (1, 1) (0, 0) No pure strategy Nash equilibrium. But if players select strategies at random, p i (Rock) + p i (Paper) + p i (Scissors) = 1 And if one player prefers one strategy (Paper) to the others then his opponent prefers the corresponding winning strategy (Scissors) :

Rock-paper-scissors Game Rules : 2 players select one strategy from Rock/Paper/scissors. Standard form : Scissors Paper Rock Scissors (0, 0) ( 1, 1) (1, 1) Paper (1, 1) (0, 0) ( 1, 1) Rock ( 1, 1) (1, 1) (0, 0) No pure strategy Nash equilibrium. But if players select strategies at random, p i (Rock) + p i (Paper) + p i (Scissors) = 1 If each player picks each of his 3 strategies with probability 1/3, then nobody can improve its payoff. Nash equilibrium of mixed strategies.

Using the random selection method. Consider a game with a set of n players {1,..., n} Each player has a set of possible pure strategies S i a cost function c i : S 1 S n N A mixed strategy is a probability distribution p i over his set of possible pure strategies (actions). i, s S i p i (s) = 1 A mixed profile p is a vector of n elements (p 1,..., p n ) such that player i selects actions using probability p i.

Expected cost The expected cost C i of player i with the game profile p is C i (p) = E[c i (p)] Alice Bob Cross Stop Cross (60, 60) (1, 2) Stop (2, 1) (5, 5) Assume that player Bob decides to pick Cross with probability 1/3, that player Alice decides to pick Cross with probability 1/2 C Bob (p Bob, p Alice ) =

Expected cost The expected cost C i of player i with the game profile p is C i (p) = E[c i (p)] Alice Bob Cross Stop Cross (60, 60) (1, 2) Stop (2, 1) (5, 5) Assume that player Bob decides to pick Cross with probability 1/3, that player Alice decides to pick Cross with probability 1/2 C Bob (p Bob, p Alice ) = 1 3 1 2 60+

Expected cost The expected cost C i of player i with the game profile p is C i (p) = E[c i (p)] Alice Bob Cross Stop Cross (60, 60) (1, 2) Stop (2, 1) (5, 5) Assume that player Bob decides to pick Cross with probability 1/3, that player Alice decides to pick Cross with probability 1/2 C Bob (p Bob, p Alice ) = 1 3 1 2 60+ 1 6 1 + 1 3 2 + 1 3 5

Expected cost The expected cost C i of player i with the game profile p is C i (p) = E[c i (p)] Alice Bob Cross Stop Cross (60, 60) (1, 2) Stop (2, 1) (5, 5) Assume that player Bob decides to pick Cross with probability 1/3, that player Alice decides to pick Cross with probability 1/2 C Bob (p Bob, p Alice ) = 75 6

Best response of a mixed strategy Alice Bob Cross Stop Cross (60, 60) (1, 2) Stop (2, 1) (5, 5) { Cross with probability q Bob picks Stop with probability 1 q What happens for Alice? When does Alice select the action Cross?

Best response of a mixed strategy Alice Bob Cross Stop Cross (60, 60) (1, 2) Stop (2, 1) (5, 5) { Cross with probability q Bob picks Stop with probability 1 q What happens for Alice? if Alice selects the action Cross, then expected cost = 60q + 1(1 q) if Alice selects the action Stop, then expected cost = 2q + 5(1 q) When does Alice select the action Cross?

Best response of a mixed strategy Alice Bob Cross Stop Cross (60, 60) (1, 2) Stop (2, 1) (5, 5) { Cross with probability q Bob picks Stop with probability 1 q What happens for Alice? if Alice selects the action Cross, then expected cost = 60q + 1(1 q) if Alice selects the action Stop, then expected cost = 2q + 5(1 q) When does Alice select the action Cross? if 60q + 1(1 q) < 2q + 5(1 q), in others words

Best response of a mixed strategy Alice Bob Cross Stop Cross (60, 60) (1, 2) Stop (2, 1) (5, 5) { Cross with probability q Bob picks Stop with probability 1 q What happens for Alice? if Alice selects the action Cross, then expected cost = 60q + 1(1 q) if Alice selects the action Stop, then expected cost = 2q + 5(1 q) When does Alice select the action Cross? if q < 2/31

Using the random selection method... 1. Alice selects Cross if q < 2/31 Cross Stop q 0 2/31 1

Using the random selection method... p Cross stop 1. Alice selects Cross if q < 2/31 2. Using the same argument as previously : Assume that Alice selects { Cross with probability p Stop with probability 1 p, Bob selects Cross if p < 2/31 2/31 Stop Cross q 0 2/31 1

Using the random selection method... p Cross stop 1. Alice selects Cross if q < 2/31 2. Using the same argument as previously : Assume that Alice selects { Cross with probability p Stop with probability 1 p, Bob selects Cross if p < 2/31 3. Nash Equilibrium = intersection of the both lines. 2/31 Stop Cross q 0 2/31 1

Using the random selection method... p Cross stop 1. Alice selects Cross if q < 2/31 2. Using the same argument as previously : Assume that Alice selects { Cross with probability p Stop with probability 1 p, Bob selects Cross if p < 2/31 3. Nash Equilibrium = intersection of the both lines. 2/31 Two Nash Equilibria of pure strategies Stop Cross q 0 2/31 1

Using the random selection method... p Cross stop 1. Alice selects Cross if q < 2/31 2. Using the same argument as previously : Assume that Alice selects { Cross with probability p Stop with probability 1 p, Bob selects Cross if p < 2/31 3. Nash Equilibrium = intersection of the both lines. 2/31 Two Nash Equilibria of pure strategies Stop Cross q 0 2/31 1 One Nash Equilibrium of mixed strategies

Mixed Nash equilibrium Consider a game with a set of n players {1,..., n} Each player has a set of possible pure strategies S i a cost function c i : S 1 S n N A mixed Nash equilibrium is a profile p = (p 1,, p n) such that i, p i P i we have C i (p i, p i ) C i(p i, p i ). P i the set of mixed strategies of i. back to example

Nash s Theorem Théorème [Nash51] Every finite game (with a finite set of players and finite set of strategies) has a mixed strategy Nash equilibrium Recall : there is a game without pure strategy Nash equilibrium.

Outline The game of Chicken Definitions Nash Equilibrium Rock-paper-scissors Game Mixed strategy Mixed Nash Equilibrium Prisoner s dilemma Bonus : Toward learning equilibria

Prisoner s dilemma : statement Bob and Alice that committed a crime are interviewed separately by the police. The offer of the police is the following : 1. If only one of them confess, then he/she will be relaxed and the other will get a sentense of 10 years. 2. If they both remain silent, then they both will have to serve prison sentences of 1 year. 3. if they both confess then they both will get a sentense of 8 years. They have two strategies : Confess or Silent.

Standard form Two strategies : Confess or Silent. Alice Bob Confess Silent Confess (8, 8) (0, 10) Silent (10, 0) (1, 1) The strategy Confess dominates strategy Silent. s S i c i (s, s i ) c i (Confess, s i )

Standard form Two strategies : Confess or Silent. Alice Bob Confess Silent Confess (8, 8) (0, 10) Silent (10, 0) (1, 1) The strategy Confess dominates strategy Silent. s S i c i (s, s i ) c i (Confess, s i ) (Confess, Confess) is a Nash equilibrium (Silent, Silent) is more favorable than (Confess, Confess)

Domination in the sense of Pareto Alice Bob Confess Silent Confess (8, 8) (0, 10) Silent (10, 0) (1, 1) Definition : Profile ŝ Pareto-dominates profile s if 1. i, c i (ŝ) c i (s), 2. j, c j (ŝ) < c j (s), Remark : (Silent, Silent) Pareto-dominates (Confess, Confess).

Domination in the sense of Pareto Notion of cooperation Definition : Profile ŝ Pareto-dominates profile s if 1. i, c i (ŝ) c i (s), 2. j, c j (ŝ) < c j (s), Remark : (Silent, Silent) Pareto-dominates (Confess, Confess).

And if games are repeated? a fixed number k of times, Confess dominates Silent at step k of the repeated game ; the two players hence play Confess. same reasoning for the last but one step. players play Confess at time k, k 1,, 1

And if games are repeated? a fixed number k of times, Confess dominates Silent at step k of the repeated game ; the two players hence play Confess. same reasoning for the last but one step. players play Confess at time k, k 1,, 1 Introduction of a probability δ that the game continues for one more step

And if games are repeated? a fixed number k of times, Confess dominates Silent at step k of the repeated game ; the two players hence play Confess. same reasoning for the last but one step. players play Confess at time k, k 1,, 1 Introduction of a probability δ that the game continues for one more step in a infinite number of steps, Strategies = mixed strategies in the static game. Construction of a strategy of behaviors that correspond to a simulation of mixed strategy S and if a player i deviates, then it is punished

Outline The game of Chicken Definitions Nash Equilibrium Rock-paper-scissors Game Mixed strategy Mixed Nash Equilibrium Prisoner s dilemma Bonus : Toward learning equilibria

Toward learning equilibria The same game is repeated at each step. At every step t, player i has to solve the following problem : Which action to play at time t, given the past history of the game? that is to say for all players i, v i (t) = f (Q), where f i is a function that gives the behavior of i in function of history Q.

A dynamic : fictitious player A player is a fictitious player if the player has the following behavior : the player will play a best response in function of the past statistic of of strategies of his/her adversary, That is to say If player 2 used n j times the action j between step 1 and t 1, then player 1 will estimate that player 2 will play the action i with probability q 2,j (t) = n j t 1 at time t.

A dynamic : fictitious player player 1 player 2 1 2 1 (3,1) (0,3) 2 (1,2) (2,0) Going from a discrete time to a continuous time The system = the couple (q 1,1, q 2,1 ) with q i,1 = probability that player i plays strategy 1.

A dynamic : fictitious player q2,1 (1, 0) q2,1 (1, 1) (1, 1) (1, 0) D C D C A B A B For zone A : (0, 0) (1, 0) direction of the dynamic in zone A q1,1 (0, 0) (1, 0) example of behavior of the dynamic player 1 will be willing to use pure strategy 2, and player 2 pure strategy 1. the dynamic (q 1,1, q 2,1 ) will stay in A up to time t + τ for small τ > 0. So q 2,1 (t + τ) = tq 2,1(t). By making converging τ 0, we obtain t+τ q 2,1(t) = q 2,1(t). t q1,1

Questions?

Prisoner s dilemma (other interpretation) J 1 J 2 transmit transmit transmit (1-c, 1-c) (-c, 1) transmit (1, -c) (0, 0) c > 0 is the cost of traffic, 1 represents the fact that packets reach the destination.

Domination in the sense of Pareto J 1 J 2 transmit transmit transmit (1-c, 1-c) (-c, 1) transmit (1, -c) (0, 0) Remark : (transmit,transmit) is more favorable than (transmit,transmit). Definition : The profile ŝ Pareto-domine the profil s si 1. i, u i (ŝ) u i (s), 2. j, u j (ŝ) > u j (s),

Domination in the sense of Pareto Notion of cooperation Remark : (transmit,transmit) is more favorable than (transmit,transmit). Definition : The profile ŝ Pareto-domine the profil s si 1. i, u i (ŝ) u i (s), 2. j, u j (ŝ) > u j (s),

Questions?