Game Theory: introduction and applications to computer networks

Similar documents
A (Brief) Introduction to Game Theory

Game Theory: introduction and applications to computer networks

Game Theory. Professor Peter Cramton Economics 300

Quantum Games. Quantum Strategies in Classical Games. Presented by Yaniv Carmeli

Introduction to Game Theory

Game Theory: Spring 2017

MS&E 246: Lecture 18 Network routing. Ramesh Johari

MS&E 246: Lecture 17 Network routing. Ramesh Johari

Game Theory and Control

Normal-form games. Vincent Conitzer

Strategic Games: Social Optima and Nash Equilibria

Algorithmic Game Theory. Alexander Skopalik

Introduction to Game Theory Lecture Note 2: Strategic-Form Games and Nash Equilibrium (2)

For general queries, contact

CSC304 Lecture 5. Game Theory : Zero-Sum Games, The Minimax Theorem. CSC304 - Nisarg Shah 1

Game Theory and Evolution

Game Theory for Linguists

ALGORITHMIC GAME THEORY. Incentive and Computation

MS&E 246: Lecture 4 Mixed strategies. Ramesh Johari January 18, 2007

A Polynomial-time Nash Equilibrium Algorithm for Repeated Games

General-sum games. I.e., pretend that the opponent is only trying to hurt you. If Column was trying to hurt Row, Column would play Left, so

Game Theory. 2.1 Zero Sum Games (Part 2) George Mason University, Spring 2018

Iterated Strict Dominance in Pure Strategies

Introduction to Game Theory

Game Theory and its Applications to Networks - Part I: Strict Competition

EC3224 Autumn Lecture #03 Applications of Nash Equilibrium

Games and Their Equilibria

First Prev Next Last Go Back Full Screen Close Quit. Game Theory. Giorgio Fagiolo

Outline for today. Stat155 Game Theory Lecture 17: Correlated equilibria and the price of anarchy. Correlated equilibrium. A driving example.

Game Theory DR. ÖZGÜR GÜRERK UNIVERSITY OF ERFURT WINTER TERM 2012/13. Strict and nonstrict equilibria

Traffic Games Econ / CS166b Feb 28, 2012

Prisoner s Dilemma. Veronica Ciocanel. February 25, 2013

Extensive Form Games I

Hypothetical Bargaining and Equilibrium Refinement in Non-Cooperative Games

Computation of Nash Equilibria of Two-Person Non-Cooperative Games with Maple

Selfishness vs Altruism vs Balance

C31: Game Theory, Lecture 1

Potential Games. Krzysztof R. Apt. CWI, Amsterdam, the Netherlands, University of Amsterdam. Potential Games p. 1/3

Pareto-Improving Congestion Pricing on General Transportation Networks

Equilibrium Computation

Routing Games 1. Sandip Chakraborty. Department of Computer Science and Engineering, INDIAN INSTITUTE OF TECHNOLOGY KHARAGPUR.

News. Good news. Bad news. Ugly news

Mohammad Hossein Manshaei 1394

Game Theory and Algorithms Lecture 2: Nash Equilibria and Examples

User Equilibrium CE 392C. September 1, User Equilibrium

Static (or Simultaneous- Move) Games of Complete Information

Oligopoly Theory 2 Bertrand Market Games

Other-Regarding Preferences: Theory and Evidence

Bargaining, Contracts, and Theories of the Firm. Dr. Margaret Meyer Nuffield College

Computing Minmax; Dominance

Basic Game Theory. Kate Larson. January 7, University of Waterloo. Kate Larson. What is Game Theory? Normal Form Games. Computing Equilibria

arxiv:quant-ph/ v3 18 Aug 2004

Microeconomics for Business Practice Session 3 - Solutions

Algorithmic Game Theory and Applications. Lecture 4: 2-player zero-sum games, and the Minimax Theorem

EVOLUTIONARY STABILITY FOR TWO-STAGE HAWK-DOVE GAMES

Economics 703 Advanced Microeconomics. Professor Peter Cramton Fall 2017

Industrial Organization Lecture 3: Game Theory

Variants of Nash Equilibrium. CMPT 882 Computational Game Theory Simon Fraser University Spring 2010 Instructor: Oliver Schulte

Chapter 2. Equilibrium. 2.1 Complete Information Games

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program June 2016

Long-Run versus Short-Run Player

Efficiency Loss in a Network Resource Allocation Game

University of Warwick, Department of Economics Spring Final Exam. Answer TWO questions. All questions carry equal weight. Time allowed 2 hours.

Game Theory Through The Computational Lens

Game Theory- Normal Form Games

6.254 : Game Theory with Engineering Applications Lecture 13: Extensive Form Games

Non-zero-sum Game and Nash Equilibarium

Satisfaction Equilibrium: Achieving Cooperation in Incomplete Information Games

Quantum Games. Colin Benjamin. School of Physical Sciences, National Institute of Science Education and Research, Bhubaneswar, India

arxiv: v1 [cs.ai] 22 Feb 2018

Game Theory -- Lecture 4. Patrick Loiseau EURECOM Fall 2016

Introduction to game theory LECTURE 1

Chapter 2. Equilibrium. 2.1 Complete Information Games

Computing Solution Concepts of Normal-Form Games. Song Chong EE, KAIST

Computational Evolutionary Game Theory and why I m never using PowerPoint for another presentation involving maths ever again

Game Theory: Lecture 2

Lecture December 2009 Fall 2009 Scribe: R. Ring In this lecture we will talk about

6.207/14.15: Networks Lecture 16: Cooperation and Trust in Networks

Efficient Mechanism Design

Lecture 19: Common property resources

Mathematical Economics - PhD in Economics

NOTES ON COOPERATIVE GAME THEORY AND THE CORE. 1. Introduction

Taking rational decisions: An Introduction to Game Theory

The Paradox Severity Linear Latency General Latency Extensions Conclusion. Braess Paradox. Julian Romero. January 22, 2008.

Computing Equilibria of Repeated And Dynamic Games

1 Basic Game Modelling

1 Equilibrium Comparisons

Review of topics since what was covered in the midterm: Topics that we covered before the midterm (also may be included in final):

Other Equilibrium Notions

Game Theory Lecture 10+11: Knowledge

Game Theory Fall 2003

Game Theory. Greg Plaxton Theory in Programming Practice, Spring 2004 Department of Computer Science University of Texas at Austin

Computation of Efficient Nash Equilibria for experimental economic games

SF2972 Game Theory Written Exam with Solutions June 10, 2011

Computing Minmax; Dominance

1 Games in Normal Form (Strategic Form)

Lecture Notes on Game Theory

Title: The Castle on the Hill. Author: David K. Levine. Department of Economics UCLA. Los Angeles, CA phone/fax

Solving Extensive Form Games

: Cryptography and Game Theory Ran Canetti and Alon Rosen. Lecture 8

Transcription:

Game Theory: introduction and applications to computer networks Introduction Giovanni Neglia INRIA EPI Maestro 27 January 2014 Part of the slides are based on a previous course with D. Figueiredo (UFRJ) and H. Zhang (Suffolk University)

Mixed strategies equilibria Same idea of equilibrium each player plays a mixed strategy (equalizing strategy), that equalizes the opponent payoffs how to calculate it? Rose Colin A B A 5, 0-1, 4 B 3, 2 2, 1

Mixed strategies equilibria Same idea of equilibrium each player plays a mixed strategy, that equalizes the opponent payoffs how to calculate it? Rose Colin A B A -0-4 B -2-1 Rose considers Colin s game 4 1 1/5 4/5

Mixed strategies equilibria Same idea of equilibrium each player plays a mixed strategy, that equalizes the opponent payoffs how to calculate it? Rose Colin A B A 5-1 B 3 2 Colin considers Rose s game 3/5 2/5

Mixed strategies equilibria Same idea of equilibrium each player plays a mixed strategy, that equalizes the opponent payoffs how to calculate it? Rose Colin A B A 5, 0-1, 4 B 3, 2 2, 1 Rose playing (1/5,4/5) Colin playing (3/5,2/5) is an equilibrium Rose gains 13/5 Colin gains 8/5

Good news: Nash s theorem [1950] Every two-person games has at least one equilibrium either in pure strategies or in mixed strategies Proved using fixed point theorem generalized to N person game This equilibrium concept called Nash equilibrium in his honor A vector of strategies (a profile) is a Nash Equilibrium (NE) if no player can unilaterally change its strategy and increase its payoff

A useful property Given a finite game, a profile is a mixed NE of the game if and only if for every player i, every pure strategy used by i with non-null probability is a best response to other players mixed strategies in the profile see Osborne and Rubinstein, A course in game theory, Lemma 33.2

Bad news: what do we lose? equivalence interchangeability identity of equalizing strategies with prudential strategies main cause at equilibrium every player is considering the opponent s payoffs ignoring its payoffs. New problematic aspect group rationality versus individual rationality (cooperation versus competition) absent in zero-sum games Ø we lose the idea of the solution

Game of Chicken 2 2 Game of Chicken (aka. Hawk-Dove Game) Driver 1 driver who swerves looses Driver 2 swerve stay swerve 0, 0-1, 5 stay 5, -1-10, -10 Drivers want to do opposite of one another Two equilibria: not equivalent not interchangeable! playing an equilibrium strategy does not lead to equilibrium

The Prisoner s Dilemma One of the most studied and used games proposed in 1950 Two suspects arrested for joint crime each suspect when interrogated separately, has option to confess Suspect 1 Suspect 2 NC C NC 2, 2 10, 1 C 1, 10 5, 5 payoff is years in jail (smaller is better) better outcome single NE

Pareto Optimal Suspect 1 Suspect 2 NC C NC 2, 2 10, 1 C 1, 10 5, 5 Pareto Optimal Def: outcome o* is Pareto Optimal if no other outcome would give to all the players a payoff not smaller and a payoff higher to at least one of them Pareto Principle: to be acceptable as a solution of a game, an outcome should be Pareto Optimal o the NE of the Prisoner s dilemma is not! Conflict between group rationality (Pareto principle) and individual rationality (dominance principle)

Rose Payoff polygon Colin A B A 5, 0-1, 4 B 3, 2 2, 1 All the points in the convex hull of the pure strategy payoffs correspond to payoffs obtainable by mixed strategies A,B Colin s payoff B,B B,A The north-east boundary contains the Pareto optimal points A,A NE Rose s payoff

Another possible approach to equilibria NE ó equalizing strategies What about prudential strategies?

Prudential strategies Each player tries to minimize its maximum loss (then it plays in its own game) Rose Colin A B A 5, 0-1, 4 B 3, 2 2, 1

Prudential strategies Rose assumes that Colin would like to minimize her gain Rose plays in Rose s game Saddle point in BB B is Rose s prudential strategy and guarantees to Rose at least 2 (Rose s security level) Rose Colin A B A 5-1 B 3 2

Prudential strategies Colin assumes that Rose would like to minimize his gain (maximize his loss) Colin plays in Colin s game mixed strategy equilibrium, (3/5,2/5) is Colin s prudential strategy and guarantees Colin a gain not smaller than 8/5 Rose Colin A B A 0-4 B -2-1

Prudential strategies Prudential strategies Rose plays B, Colin plays A w. prob. 3/5, B w. 2/5 Rose gains 13/5 (>2), Colin gains 8/5 Is it stable? No, if Colin thinks that Rose plays B, he would be better off by playing A (Colin s counter-prudential strategy) Rose Colin A B A 5, 0-1, 4 B 3, 2 2, 1

Prudential strategies are not the solution neither: do not lead to equilibria do not solve the group rationality versus individual rationality conflict dual basic problem: look at your payoff, ignoring the payoffs of the opponents

Exercises Find NE and Pareto optimal outcomes: NC C NC 2, 2 10, 1 C 1, 10 5, 5 A B A 2, 3 3, 2 B 1, 0 0, 1 swerve stay swerve 0, 0-1, 5 stay 5, -1-10, -10 A B A 2, 4 1, 0 B 3, 1 0, 4

Performance Evaluation Routing as a Potential game Giovanni Neglia INRIA EPI Maestro

Routing games Delay? 1 2 Traffic (cars#) Possible in the Internet?

Overlay networks Overlay Internet Underlay

Routing games 1 4 underlay route 2 3 An Overlay for routing: Resilient Overlay Routing 1 3 4 route allowed by the overlay Users can ignore ISP choices

Traffic demand 2 1 a f b f 2,3 1,3 e c d 3 4 unit traffic demands between pair of nodes

Delay costs f 1,3 R 1,3 = {a,b}, R 2,3 ={b} a 2 b f a =f 1,3, f b = f 1,3 + f 2,3, f c =f d =0 1 c e 4 f 2,3 d 3 c α (f α ), α ε E={a,b,c,d,e}, Non-negative, non decreasing functions Social cost: C S = Σ α ε E f α *c α (f α ) User cost: C 1,3 (f)= Σ α ε R1,3 c α (f α )

Pigou s example transit_time a =2 hour 1 2 transit_time b =x hours Two possible roads between 1 and 2 ( time a) a longer highway (almost constant transit b) shorter but traffic sensitive city road 2 Selfish users (choose the road in order to minimize their delay) Colin a b Rose a -2, -2-2, -1 b -1, -2-2, -2

Pigou s example transit_time a =2 hour 4 3 Social cost 1 2 transit_time b =x hours 0 1 2 f b Two possible roads between 1 and 2 ( time a) a longer highway (almost constant transit b) shorter but traffic sensitive city road 2 Selfish users (choose the road in order to minimize their delay) There is 1 (pure-strategy) NE where they all choose the city road... even if the optimal allocation is not worse for the single user! What if transit_time a =2+ε? In what follows we only consider pure strategy NE

What is the cost of user selfishness for the community? Loss of Efficiency (LoE) given a NE with social cost C S (f NE ) and the traffic allocation with minimum social cost C S (f Opt ) LoE = C S (f NE ) / C S (f Opt )

Pigou s example transit_time a =2 hour 1 2 transit_time b =x hours The LoE of (b,b) is 4/3 The LoE of (b,a) and (a,b) is 1 Rose Colin a b a -2, -2-2, -1 b -1, -2-2, -2

Braess's paradox c(x)=x 3 c(x)=2+ε 1 2 c(x)=2+ε 4 c(x)=x User cost: 3+ε Social cost: C NE = 6+2ε (=C Opt )

Braess's paradox transit_time a =3+ε hours 3 c(x)=x c(x)=2+ε 1 c(x)=0 2 c(x)=2+ε 4 c(x)=x

Braess's paradox transit_time a =3+ε hours 3 c(x)=x c(x)=2+ε 1 c(x)=0 2 c(x)=2+ε 4 c(x)=x User cost: 4 Social cost: C NE = 8 > 6+ε (C Opt ) LoE = 8/(6+ε) -> 4/3 ε->0

Routing games 1. Is there always a (pure strategy) NE? 2. Can we always find a NE with a small Loss of Efficiency (LoE)?

Always an equilibrium? Best Response dynamics Start from a given routing and let each player play its Best Response strategy What if after some time there is no change?

BR dynamics c(x)=x 3 c(x)=2+ε 1 2 c(x)=2+ε 4 c(x)=x 1. Users costs: (3+ε, 3+ε) 2. Blue plays BR, costs: (3, 4+ε) 3. Pink plays BR, costs: (4, 4) 4. Nothing changes.

Always an equilibrium? Best Response dynamics Start from a given routing and let each player play its Best Response strategy What if after some time there is no change? Are we sure to stop?

Games with no saddle-point There are games with no saddle-point! An example? R P S min R R 0-1 1-1 P P 1 0-1 -1 maximin S S -1 1 0-1 max max 1 1 1 minimax maximin <> minimax

Always an equilibrium? Best Response dynamics Start from a given routing and let each player play its Best Response strategy What if after some time there is no change? Are we sure to stop? In some cases we can define a potential function that keeps decreasing at each BR until a minimum is reached. Is the social cost a good candidate?

Potential for routing games f 1,3 R 1,3 = {a,b}, R 2,3 ={b} a 2 b f a =f 1,3, f b = f 1,3 + f 2,3, f c =f d =0 1 c e 4 f 2,3 d 3 c α (f α ), α ε E={a,b,c,d,e}, Non-negative, non decreasing functions Potential : P =Σ α ε E P α (f α )=Σ α ε E Σ t=1, fα c α (t)

Potential decreases at every BR c(x)=x 3 c(x)=2+ε 1 2 c(x)=2+ε 4 c(x)=x 1. User costs: (3+ε, 3+ε), P=6+2ε 2. Blue plays BR, costs: (3, 4+ε), P=6+ε 3. Pink plays BR, costs: (4, 4), P=6 4. Nothing changes.

Potential decreases at every BR c(x)=x 3 c(x)=2+ε From route R to route R 1 2 c(x)=2+ε 4 c(x)=x f α =f α +1 if α in R -R, f α =f α -1 if α in R-R P α -P α =-c α (f α +1) if α in R -R, P α -P α =c α (f α ) if α in R-R P-P =Σ α ε R c α (f α )-Σ α ε R c α (f α )= =user difference cost between R and R >0

BR dynamics converges to an equilibrium The potential decreases at every step There is a finite number of possible potential values After a finite number of steps a potential local minimum is reached The final routes identify a (pure strategy) NE