Population Games and Evolutionary Dynamics

Similar documents
Survival of Dominated Strategies under Evolutionary Dynamics. Josef Hofbauer University of Vienna. William H. Sandholm University of Wisconsin

Evolutionary Game Theory: Overview and Recent Results

Deterministic Evolutionary Dynamics

Survival of Dominated Strategies under Evolutionary Dynamics

Survival of dominated strategies under evolutionary dynamics

Pairwise Comparison Dynamics and Evolutionary Foundations for Nash Equilibrium

Stochastic Evolutionary Game Dynamics: Foundations, Deterministic Approximation, and Equilibrium Selection

Evolution & Learning in Games

Evolutionary Dynamics and Extensive Form Games by Ross Cressman. Reviewed by William H. Sandholm *

Pairwise Comparison Dynamics for Games with Continuous Strategy Space

The projection dynamic and the replicator dynamic

Distributional stability and equilibrium selection Evolutionary dynamics under payoff heterogeneity (II) Dai Zusai

Stable Games and their Dynamics

Irrational behavior in the Brown von Neumann Nash dynamics

Local Stability of Strict Equilibria under Evolutionary Game Dynamics

Gains in evolutionary dynamics: unifying rational framework for dynamic stability

Population Games and Evolutionary Dynamics

Gains in evolutionary dynamics. Dai ZUSAI

Deterministic Evolutionary Game Dynamics

Decompositions and Potentials for Normal Form Games

Irrational behavior in the Brown von Neumann Nash dynamics

Computational Evolutionary Game Theory and why I m never using PowerPoint for another presentation involving maths ever again

Stochastic Imitative Game Dynamics with Committed Agents

Deterministic Evolutionary Game Dynamics

Population Dynamics Approach for Resource Allocation Problems. Ashkan Pashaie

Spatial Economics and Potential Games

EVOLUTIONARY GAMES WITH GROUP SELECTION

Evolution & Learning in Games

6.254 : Game Theory with Engineering Applications Lecture 8: Supermodular and Potential Games

Mathematical Economics - PhD in Economics

Sample Path Large Deviations for Stochastic Evolutionary Game Dynamics

Nonaggregable evolutionary dynamics under payoff heterogeneity

Evolutionary Games and Periodic Fitness

arxiv: v1 [cs.sy] 13 Sep 2017

Evolutionary Internalization of Congestion Externalities When Values of Time are Unknown

MS&E 246: Lecture 17 Network routing. Ramesh Johari

Brown s Original Fictitious Play

LEARNING IN CONCAVE GAMES

DETERMINISTIC AND STOCHASTIC SELECTION DYNAMICS

DISCRETIZED BEST-RESPONSE DYNAMICS FOR THE ROCK-PAPER-SCISSORS GAME. Peter Bednarik. Josef Hofbauer. (Communicated by William H.

The Role of Population Games and Evolutionary Dynamics in Distributed Control Systems

Evolutionary Stochastic Games

EVOLUTIONARY GAMES AND LOCAL DYNAMICS

Unified Convergence Proofs of Continuous-time Fictitious Play

Sampling best response dynamics and deterministic equilibrium selection

Distributed Learning based on Entropy-Driven Game Dynamics

An Introduction to ABED: Agent-Based Simulation of Evolutionary Game Dynamics

Economics 201B Economic Theory (Spring 2017) Bargaining. Topics: the axiomatic approach (OR 15) and the strategic approach (OR 7).

Conjectural Variations in Aggregative Games: An Evolutionary Perspective

The Limits of ESS Methodology

Supermodular Games. Ichiro Obara. February 6, 2012 UCLA. Obara (UCLA) Supermodular Games February 6, / 21

Game Theory -- Lecture 4. Patrick Loiseau EURECOM Fall 2016

Ex Post Cheap Talk : Value of Information and Value of Signals

Computation of Efficient Nash Equilibria for experimental economic games

Reinforcement Learning in Evolutionary Games

Other-Regarding Preferences: Theory and Evidence

The Dynamics of Generalized Reinforcement Learning

Equilibria in Games with Weak Payoff Externalities

Pure Strategy Equilibria in Symmetric Two-Player Zero-Sum Games

Basic Game Theory. Kate Larson. January 7, University of Waterloo. Kate Larson. What is Game Theory? Normal Form Games. Computing Equilibria

The Dynamic Instability of Dispersed Price Equilibria.

Correlated Equilibria: Rationality and Dynamics

Introduction to game theory LECTURE 1

1 Lattices and Tarski s Theorem

SF2972 Game Theory Exam with Solutions March 15, 2013

Robust Mechanism Design and Robust Implementation

Game Theory, Evolutionary Dynamics, and Multi-Agent Learning. Prof. Nicola Gatti

Bargaining, Contracts, and Theories of the Firm. Dr. Margaret Meyer Nuffield College

CS364A: Algorithmic Game Theory Lecture #13: Potential Games; A Hierarchy of Equilibria

Near-Potential Games: Geometry and Dynamics

A finite population ESS and a long run equilibrium in an n players coordination game

Large deviations and stochastic stability in the small noise double limit

1 Basic Game Modelling

Recap Social Choice Functions Fun Game Mechanism Design. Mechanism Design. Lecture 13. Mechanism Design Lecture 13, Slide 1

Recap Social Choice Fun Game Voting Paradoxes Properties. Social Choice. Lecture 11. Social Choice Lecture 11, Slide 1

For general queries, contact

Strategic Properties of Heterogeneous Serial Cost Sharing

Large Deviations and Stochastic Stability in the Small Noise Double Limit

Game Theory: Spring 2017

An Introduction to Dynamo: Diagrams for Evolutionary Game Dynamics

The Ecology of Defensive Medicine and Malpractice Litigation

Other Equilibrium Notions

1 Games in Normal Form (Strategic Form)

CS364A: Algorithmic Game Theory Lecture #16: Best-Response Dynamics

Game Theory. Greg Plaxton Theory in Programming Practice, Spring 2004 Department of Computer Science University of Texas at Austin

2.1 Definition and graphical representation for games with up to three players

First Prev Next Last Go Back Full Screen Close Quit. Game Theory. Giorgio Fagiolo

Lecture December 2009 Fall 2009 Scribe: R. Ring In this lecture we will talk about

Belief-based Learning

Evolutionary Game Theory

Evolutionary Game Theory Notes

6.891 Games, Decision, and Computation February 5, Lecture 2

CS 798: Multiagent Systems

Potential Games. Krzysztof R. Apt. CWI, Amsterdam, the Netherlands, University of Amsterdam. Potential Games p. 1/3

Algorithmic Game Theory. Alexander Skopalik

Economics 3012 Strategic Behavior Andy McLennan October 20, 2006

Influencing Social Evolutionary Dynamics

Robust Learning Equilibrium

epub WU Institutional Repository

OD-Matrix Estimation using Stable Dynamic Model

Transcription:

Population Games and Evolutionary Dynamics (MIT Press, 200x; draft posted on my website) 1. Population games 2. Revision protocols and evolutionary dynamics 3. Potential games and their applications 4. Survival of dominated strategies William H. Sandholm University of Wisconsin a hypnodisk 1

Population Games and Evolutionary Dynamics Population games model strategic interactions in which: 1. The number of agents is large. 2. Individual agents are small. 3. The number of roles is finite. 4. Agents interact anonymously. 5. Payoffs are continuous. Examples: economics: externalities, macroeconomic spillovers, centralized markets biology: animal conflict, genetic natural selection transportation science: highway congestion, mode choice computer science: selfish routing 2

We consider games played by a single unit-mass population of agents. S = {1,, n} strategies X = {x R + n : F i : X R = 1} population states/mixed strategies x is i payoffs to strategy i (Lipschitz continuous) F: X R n payoffs to all strategies = the payoff vector field (geometry: see Sandholm, Dokumaci, and Lahkar (2006)) State x X is a Nash equilibrium (x NE(F)) if x i > 0 F i (x) F j (x) for all j S 3

Example: Congestion games (Beckmann, McGuire, and Winsten (1956)) Home and Work are connected by paths i S consisting of links. The payoff to choosing path i is (the delay on path i) = (the sum of the delays on the links on path i) formally: F i (x) = c (u (x)) payoff to path i x i u (x) = c (u ) i i: i x i mass of players choosing path i utilization of link cost of delay on link Examples: highway congestion c increasing positive externalities c decreasing 4

Revision Protocols and Evolutionary Dynamics Traditionally, predictions in game theory are based on some notion of equilibrium. These concepts in turn rely on the assumption of equilibrium knowledge. In contexts with large numbers of agents, this assumption seems untenable. As an alternative, we consider an explicitly dynamic model of individual choice. 5

Microfoundations for evolutionary dynamics The choice procedure individual agents follow is called a revision protocol. : R n X R + nn ij (F(x), x) revision protocol conditional switch rate from strategy i to strategy j An all-purpose interpretation: Each agent is equipped with a rate R Poisson alarm clock (R = n max i,j,x ij (F(x), x)). When an i player s clock rings, he receives a revision opportunity. He switches to strategy j with probability ij (F(x), x)/r. Often, simpler interpretations work for specific revision protocols. 6

The mean dynamic population game F revision protocol population size N Markov process {X t N } Over finite time spans, as N grows large, the process {X t N } is well-approximated by solutions to an ODE. (Kurtz (1970), Benaim and Weibull (2003), Sandholm (2003)) This ODE, the mean dynamic, is defined by the expected increments of {X t N }. (D) x i = V F i (x) = x j ji (F(x), x) x i ij (F(x), x) js js = inflow into i outflow from i 7

Example 1: Proportional imitation (Schlag (1998)) ij = x j [F j F i ] + The x j term suggests a different interpretation for : pick an opponent at random; then decide whether to imitate him. x i = x i (F i (x) F(x)), where F(x) = x j F js j (x) = the replicator dynamic (Taylor and Jonker (Mathematical Biosciences 1978)) Example 2: Proportional switching with direct selection of alternatives ij = [F j F i ] + x i = x j [F i (x) F j (x)] + x i [F j (x) F i (x)] + js js = the pairwise difference dynamic (Smith (Transportation Science 1984)) 8

Classes of population games Can we find broad classes of population games that allow simple characterizations of Nash equilibrium sets; global convergence results for general classes of evolutionary dynamics? Three important classes of examples: 1. Potential games Monderer and Shapley (1996), Sandholm (2001) 2. Supermodular games Topkis (1979), Hofbauer and Sandholm (2005) 3. Stable games Hofbauer and Sandholm (2006a) (cf Maynard Smith and Price (1973)) (cf Minty (1967), Smith (1979), Dafermos (1980))) 9

1. Potential Games F is a potential game if there exists an f: X R s.t. f (x) = F(x) for all x X externality symmetry: F i x j (x) = F j x i (x) for all i, j X and x X. 10

Example 1: Pure coordination F(x) = x = x 1 x 2 x 3 f(x) = 1 2 (x 1 2 + x 2 2 + x 3 2 ) F(x) = x (projected onto the tangent space of X) 11

Example 2: Congestion games (Beckmann, McGuire, and Winsten (1956)) u (x) F i (x) = c (u (x)) f(x) = c (z)dz i i i 0 u (x)c (x) = F(x) Note: if the c are strictly increasing, then f is strictly concave, and so has a unique maximizer (= the unique NE (see below)). 12

Global convergence of evolutionary dynamics (D) x i = V F i (x) = x j ji (F(x), x) x i ij (F(x), x) js We say that the dynamic (D) satisfies positive correlation if js (PC) V F (x) F(x) = n Cov( V F (x), F(x)) 0, with equality if and only if x NE(F). (PC) illustrated: The replicator dynamic in 123 Coordination 13

Theorem: If F is a potential game, and the dynamic V F satisfies (PC), then nonstationary solutions of V F ascend the potential function f. Therefore, every solution trajectory converges to a set of NE. Proof 1: d f (x ) = f (x dt t t ) x t = F(x t ) V F (x t ) 0. Proof 2: below f(x) = 1 (x 2 2 1 + x 2 2 + x 2 3 ) F(x) = x 14

Example 3: Games generated by variable externality pricing schemes Let F be an arbitrary game. Let E i (x) = js x j F j x i (x) = the total (marginal) externalities created by an i player Let ˆF = F + E be the population game obtained by imposing pricing scheme E on F. Theorem: ˆF is a potential game whose potential function is F, the average payoff function of the original game. Proof: F(x) = F x x i x j F j (x) = F j (x) + j x j i js js x i (x) = F j (x) + E j (x). 15

Application: Evolutionary Mechanism Design ex. A planner would like to ensure efficient behavior in a highway network. He knows the cost functions c, but he does not know any agent s valuation for driving to work. He therefore cannot compute the efficient state. Theorem (Sandholm (2002)): By imposing the pricing scheme E, the planner can ensure that regardless of the agents valuations, efficient behavior is (i) the unique Nash equilibrium, and (ii) globally stable under any dynamic satisfying (PC). (That the original game is a congestion game is irrelevant: see Sandholm (2005, 2006).) 16

Survival of Dominated Strategies under Evolutionary Dynamics In potential games, a wide class of evolutionary dynamics converge to Nash equilibrium. Can evolutionary dynamics provide foundations for rationality-based solution concepts more generally? For Nash equilibrium, the answer is no: there are games in which most solutions of any reasonable evolutionary dynamic fail to converge to Nash equilibrium (Hofbauer and Swinkels (1996), Hart and Mas-Colell (2003)). We therefore consider a more modest goal: elimination of strictly dominated strategies. (Strategy i S is strictly dominated by strategy j S if F j (x) > F i (x) for all x X.) Dominance is the mildest requirement employed in standard game-theoretic analysis, so it is natural to expect evolutionary dynamics to accord with it. 17

But: There is no a priori reason to expect dominated strategies to be eliminated by evolutionary dynamics. Evolutionary dynamics are based on the idea that agents switch to strategies whose current payoffs are reasonably good. Even if a strategy is dominated, it can have reasonably good payoffs in many states. Put differently: decision making in evolutionary models is local ; domination is global. 18

The result (PC) Positive correlation: V F (x) F(x) 0, with equality if and only if x NE(F). (IN) Innovation: If x NE(F), x i = 0, and i BR F (x), then V i F (x) > 0. Theorem (Hofbauer and Sandholm (2006b)): Any continuous dynamic that satisfies (PC) and (IN) does not eliminate strictly dominated strategies in all games. 19

Proof of special cases (BNN dynamic, PD dynamic) (cf Berger and Hofbauer (2006)) Step 1: Cycling in bad RPS Suppose agents are randomly matched to play bad Rock-Paper-Scissors. A = 0 2 1 1 0 2 2 1 0 F(x) = F R (x) F P (x) = F S (x) x S 2x P x R 2x S x P 2x R Unless the initial state is the unique NE x* = ( 1 3, 1 3, 1 3 ), play approaches a limit cycle. Proof: Use a Lyapunov function. (The function depends on the dynamic at issue). 20

the BNN dynamic in bad RPS the PD dynamic in bad RPS 21

Step 2: Add a twin to bad RPS Add a fourth strategy to bad RPS that duplicates Scissors. A = 0 2 1 1 1 0 2 2 2 1 0 0 2 1 0 0 Except from the line of NE {x X: x = (( 1 3, 1 3, 1 6 c, 1 6 + c)}, play converges to a cycle on the plane where x S = x T. 22

the PD dynamic in bad RPS with a twin 23

Step 3: A feeble twin Uniformly reduce the payoff of the twin by : A = 0 2 1 1 1 0 2 2 2 1 0 0 2 1 Since the dynamic depends continuously on the game s payoffs, the attracting interior limit cycle persists when is slightly greater than zero. The feeble twin, a strictly dominated strategy, survives. 24

the PD dynamic in bad RPS + twin the PD dynamic in bad RPS + evil twin 25

Where does this argument fail for arbitrary dynamics that satisfy (PC)? 1. In a game with identical twins, (PC) does not imply convergence to the plane where the twins receive equal weight. 2. In Bad RPS, (PC) is not enough to ensure cycling. Item 1 is handled using tools from dynamical systems theory. For item 2, we need a game that is worse than bad RPS. 26

The hypnodisk game 27

Population Games and Evolutionary Dynamics: Summary 1. Population games 2. Revision protocols and evolutionary dynamics 3. Potential games and their applications 4. Survival of dominated strategies under evolutionary dynamics Some open topics New classes of dynamics, perhaps based on psychologically motivated models of choice Prevalence of convergence/of cycling/of survival of dominated strategies Applications of all kinds 28