Polynomial-time Computation of Exact Correlated Equilibrium in Compact Games

Similar documents
Polynomial-time Computation of Exact Correlated Equilibrium in Compact Games

Computing an Extensive-Form Correlated Equilibrium in Polynomial Time

princeton univ. F 13 cos 521: Advanced Algorithm Design Lecture 17: Duality and MinMax Theorem Lecturer: Sanjeev Arora

Computing Correlated Equilibria in Multi-Player Games

New Perspectives and Challenges in Routing Games: Query models & Signaling. Chaitanya Swamy University of Waterloo

LINEAR PROGRAMMING III

Theory and Internet Protocols

Local-Effect Games. Kevin Leyton-Brown. Moshe Tennenholtz. Stanford UBC. Technion

Part II: Integral Splittable Congestion Games. Existence and Computation of Equilibria Integral Polymatroids

Game Theory: Lecture 3

CS675: Convex and Combinatorial Optimization Fall 2014 Combinatorial Problems as Linear Programs. Instructor: Shaddin Dughmi

of a bimatrix game David Avis McGill University Gabriel Rosenberg Yale University Rahul Savani University of Warwick

Zero-Sum Games Public Strategies Minimax Theorem and Nash Equilibria Appendix. Zero-Sum Games. Algorithmic Game Theory.

1 PROBLEM DEFINITION. i=1 z i = 1 }.

CS599: Convex and Combinatorial Optimization Fall 2013 Lecture 17: Combinatorial Problems as Linear Programs III. Instructor: Shaddin Dughmi

Mechanism Design: Implementation. Game Theory Course: Jackson, Leyton-Brown & Shoham

Massachusetts Institute of Technology 6.854J/18.415J: Advanced Algorithms Friday, March 18, 2016 Ankur Moitra. Problem Set 6

CS675: Convex and Combinatorial Optimization Fall 2016 Combinatorial Problems as Linear and Convex Programs. Instructor: Shaddin Dughmi

Game Theory. Greg Plaxton Theory in Programming Practice, Spring 2004 Department of Computer Science University of Texas at Austin

Computational complexity of solution concepts

Computing Nash Equilibria of Action-Graph Games

Lectures 6, 7 and part of 8

3 Nash is PPAD Complete

Algorithmic Game Theory and Economics: A very short introduction. Mysore Park Workshop August, 2012

7. Lecture notes on the ellipsoid algorithm

Lecture #21. c T x Ax b. maximize subject to

On the Impossibility of Black-Box Truthfulness Without Priors

Approximate Nash Equilibria with Near Optimal Social Welfare

A Note on Approximate Nash Equilibria

Theoretical Computer Science

Computing Equilibria in Multi-Player Games

Selfish Routing. Simon Fischer. December 17, Selfish Routing in the Wardrop Model. l(x) = x. via both edes. Then,

Duality. Peter Bro Mitersen (University of Aarhus) Optimization, Lecture 9 February 28, / 49

האוניברסיטה העברית בירושלים

On the Value of Correlation

Today: Linear Programming (con t.)

Tight Bounds for Cost-Sharing in Weighted Congestion Games

CS 6820 Fall 2014 Lectures, October 3-20, 2014

BBM402-Lecture 20: LP Duality

Lecture December 2009 Fall 2009 Scribe: R. Ring In this lecture we will talk about

The Computational Aspect of Risk in Playing Non-Cooperative Games

6.891 Games, Decision, and Computation February 5, Lecture 2

A Few Games and Geometric Insights

Lecture 3: Semidefinite Programming

CS 573: Algorithmic Game Theory Lecture date: January 23rd, 2008

Well-Supported vs. Approximate Nash Equilibria: Query Complexity of Large Games

Symmetries and the Complexity of Pure Nash Equilibrium

Linear Programming. Scheduling problems

Existence of sparsely supported correlated equilibria

Settling Some Open Problems on 2-Player Symmetric Nash Equilibria

Optimal Machine Strategies to Commit to in Two-Person Repeated Games

Lecture notes on the ellipsoid algorithm

A Multiplayer Generalization of the MinMax Theorem

CS675: Convex and Combinatorial Optimization Spring 2018 The Ellipsoid Algorithm. Instructor: Shaddin Dughmi

On the Complexity of Two-Player Win-Lose Games

Technische Universität München, Zentrum Mathematik Lehrstuhl für Angewandte Geometrie und Diskrete Mathematik. Combinatorial Optimization (MA 4502)

Existence of sparsely supported correlated equilibria

Correlated Equilibria: Rationality and Dynamics

EECS 495: Combinatorial Optimization Lecture Manolis, Nima Mechanism Design with Rounding

The Distribution of Optimal Strategies in Symmetric Zero-sum Games

5.2 A characterization of the nonemptiness of the core

CS364A: Algorithmic Game Theory Lecture #13: Potential Games; A Hierarchy of Equilibria

Integer Programming ISE 418. Lecture 12. Dr. Ted Ralphs

The Steiner Network Problem

Discrete Optimization

Large Supports are required for Well-Supported Nash Equilibria

The price of anarchy of finite congestion games

CMU Noncooperative games 4: Stackelberg games. Teacher: Ariel Procaccia

CS599: Algorithm Design in Strategic Settings Fall 2012 Lecture 12: Approximate Mechanism Design in Multi-Parameter Bayesian Settings

Scheduling on Unrelated Parallel Machines. Approximation Algorithms, V. V. Vazirani Book Chapter 17

Duality of LPs and Applications

CS261: A Second Course in Algorithms Lecture #12: Applications of Multiplicative Weights to Games and Linear Programs

Single parameter FPT-algorithms for non-trivial games

Learning symmetric non-monotone submodular functions

Motivating examples Introduction to algorithms Simplex algorithm. On a particular example General algorithm. Duality An application to game theory

15-850: Advanced Algorithms CMU, Fall 2018 HW #4 (out October 17, 2018) Due: October 28, 2018

Optimal Auctions with Correlated Bidders are Easy

ON A THEOREM OF KALAI AND SAMET

Combinatorial Agency of Threshold Functions

On the Informed Principal Model with Common Values

SEQUENTIAL EQUILIBRIA IN BAYESIAN GAMES WITH COMMUNICATION. Dino Gerardi and Roger B. Myerson. December 2005

2.1 Definition and graphical representation for games with up to three players

Structure of Extreme Correlated Equilibria: a Zero-Sum Example and its Implications

1 Basic Game Modelling

Near-Potential Games: Geometry and Dynamics

Linear Programming. Linear Programming I. Lecture 1. Linear Programming. Linear Programming

Game Theory and Algorithms Lecture 7: PPAD and Fixed-Point Theorems

On the Existence of Optimal Taxes for Network Congestion Games with Heterogeneous Users

SURPLUS SHARING WITH A TWO-STAGE MECHANISM. By Todd R. Kaplan and David Wettstein 1. Ben-Gurion University of the Negev, Israel. 1.

Chapter 2: Linear Programming Basics. (Bertsimas & Tsitsiklis, Chapter 1)

General-sum games. I.e., pretend that the opponent is only trying to hurt you. If Column was trying to hurt Row, Column would play Left, so

Outline for today. Stat155 Game Theory Lecture 17: Correlated equilibria and the price of anarchy. Correlated equilibrium. A driving example.

Outline for today. Stat155 Game Theory Lecture 16: Evolutionary game theory. Evolutionarily stable strategies. Nash equilibrium.

CO759: Algorithmic Game Theory Spring 2015

CSC Linear Programming and Combinatorial Optimization Lecture 8: Ellipsoid Algorithm

Routing Games 1. Sandip Chakraborty. Department of Computer Science and Engineering, INDIAN INSTITUTE OF TECHNOLOGY KHARAGPUR.

LINEAR PROGRAMMING I. a refreshing example standard form fundamental questions geometry linear algebra simplex algorithm

1 Overview. 2 Extreme Points. AM 221: Advanced Optimization Spring 2016

CSC373: Algorithm Design, Analysis and Complexity Fall 2017 DENIS PANKRATOV NOVEMBER 1, 2017

Lecture slides by Kevin Wayne

Transcription:

Polynomial-time Computation of Exact Correlated Equilibrium in Compact Games Albert Xin Jiang Kevin Leyton-Brown Department of Computer Science University of British Columbia

Outline 1 Computing Correlated Equilibrium 2 Papadimitriou and Roughgarden s algorithm 3 Numerical Precision Issues 4 Algorithm for Exact Correlated Equilibrium

Outline 1 Computing Correlated Equilibrium 2 Papadimitriou and Roughgarden s algorithm 3 Numerical Precision Issues 4 Algorithm for Exact Correlated Equilibrium

Correlated Equilibrium correlated equilibrium (CE) [Aumann, 1974; Aumann, 1987]

Correlated Equilibrium correlated equilibrium (CE) [Aumann, 1974; Aumann, 1987] n players player p s pure strategy s p S p pure strategy profile s S = n p=1 S p utility for p under pure strategy profile s is integer u p s

Correlated Equilibrium correlated equilibrium (CE) [Aumann, 1974; Aumann, 1987] n players player p s pure strategy s p S p pure strategy profile s S = n p=1 S p utility for p under pure strategy profile s is integer u p s a CE is a distribution x over S: a trusted intermediary draws a strategy profile s from this distribution announce to each player p (privately) her own component s p p will have no incentive to choose another strategy, assuming others follow suggestions

LP formulation incentive constraints: for all players p and all i, j S p : [u p is up js ]x is 0 s S p write as Ux 0.

LP formulation incentive constraints: for all players p and all i, j S p : [u p is up js ]x is 0 s S p write as Ux 0. x is a distribution: x 0, s x s = 1

LP formulation incentive constraints: for all players p and all i, j S p : [u p is up js ]x is 0 s S p write as Ux 0. x is a distribution: x 0, s x s = 1 m n variables, nm 2 constraints polynomial in the size of normal form

Computing CE for Compact Game Representations Representations for games with structured utility functions symmetric games / anonymous games graphical games [Kearns, Littman & Singh, 2001] congestion games [Rosenthal, 1973] action-graph games [Jiang, Leyton-Brown & Bhat, 2011]

Computing CE for Compact Game Representations Representations for games with structured utility functions symmetric games / anonymous games graphical games [Kearns, Littman & Singh, 2001] congestion games [Rosenthal, 1973] action-graph games [Jiang, Leyton-Brown & Bhat, 2011] Goal: computing a sample CE in time polynomial in the size of representation LP would have exponential number of variables (m n ) writing a solution (i.e. CE) explicitly requires m n numbers

Outline 1 Computing Correlated Equilibrium 2 Papadimitriou and Roughgarden s algorithm 3 Numerical Precision Issues 4 Algorithm for Exact Correlated Equilibrium

Papadimitriou and Roughgarden [2008] Polynomial-time algorithm for computing a CE when the representation satisfies: polynomial type: # of players and # of actions for each player are bounded by polynomials in the size of the representation. polynomial expectation property: poly-time algorithm for computing expected utility under any product distribution x is a product distribution when each player p is randomizing independently over her actions according to some distribution x p, i.e. x s = p xp s p.

Papadimitriou and Roughgarden [2008] Polynomial-time algorithm for computing a CE when the representation satisfies: polynomial type: # of players and # of actions for each player are bounded by polynomials in the size of the representation. polynomial expectation property: poly-time algorithm for computing expected utility under any product distribution x is a product distribution when each player p is randomizing independently over her actions according to some distribution x p, i.e. x s = p xp s p. computes CEs that are mixtures of polynomial number of product distributions

Existence Proof The algorithm is based on proofs of the existence of CE via LP duality [Hart & Shmeidler 1989], [Nau & Mcardle 1990], [Myerson 1997] consider the linear program (P): max s x s Ux 0, x 0

Existence Proof The algorithm is based on proofs of the existence of CE via LP duality [Hart & Shmeidler 1989], [Nau & Mcardle 1990], [Myerson 1997] consider the linear program (P): max s x s Ux 0, x 0 (P) either has x = 0 as its optimal solution or is unbounded. In the latter case the game has a correlated equilibrium.

Existence Proof The algorithm is based on proofs of the existence of CE via LP duality [Hart & Shmeidler 1989], [Nau & Mcardle 1990], [Myerson 1997] consider the linear program (P): max s x s Ux 0, x 0 (P) either has x = 0 as its optimal solution or is unbounded. In the latter case the game has a correlated equilibrium. can prove the existence of CE by showing the infeasibility of its dual (D): U T y 1 y 0

Infeasibility of the Dual U T y 1 y 0 Lemma ([Papadimitriou & Roughgarden, 2008]) For every dual vector y 0, there is a product distribution x such that xu T y = 0.

Infeasibility of the Dual U T y 1 y 0 Lemma ([Papadimitriou & Roughgarden, 2008]) For every dual vector y 0, there is a product distribution x such that xu T y = 0. The lemma implies that the dual program (D) is infeasible (and therefore a CE must exist). This is because xu T y is a convex combination of the left hand sides of the rows of (D), and for any feasible y the result must be less than or equal to -1.

Infeasibility of the Dual U T y 1 y 0 Lemma ([Papadimitriou & Roughgarden, 2008]) For every dual vector y 0, there is a product distribution x such that xu T y = 0. The lemma implies that the dual program (D) is infeasible (and therefore a CE must exist). This is because xu T y is a convex combination of the left hand sides of the rows of (D), and for any feasible y the result must be less than or equal to -1. there is a polynomial-time algorithm that computes such an x given y.

Ellipsoid Against Hope

Ellipsoid Against Hope U T y 1 y 0 (D) has nm 2 variables, about m n constraints

Ellipsoid Against Hope U T y 1 y 0 (D) has nm 2 variables, about m n constraints run the ellipsoid algorithm on (D), with the following Product Separation Oracle: given a vector y (i), the corresponding product distribution x (i) is generated according to the Lemma, and [x (i) U T ]y 1 is given to the ellipsoid algorithm as a cutting plane.

Ellipsoid Against Hope U T y 1 y 0 (D) has nm 2 variables, about m n constraints run the ellipsoid algorithm on (D), with the following Product Separation Oracle: given a vector y (i), the corresponding product distribution x (i) is generated according to the Lemma, and [x (i) U T ]y 1 is given to the ellipsoid algorithm as a cutting plane. The ellipsoid algorithm will stop after a polynomial number of steps and determine that the program is infeasible.

Ellipsoid Against Hope (cont d) Let X be the matrix whose rows are the generated product distributions x (1),..., x (L). Consider the linear program (D ): [XU T ]y 1, y 0 If we apply the same ellipsoid method, with a separation oracle that returns the cut x (i) U T y 1 given query y (i), it would go through the same sequence of queries y (i) and return infeasible. Therefore (D ) is infeasible (presuming that numerical problems do not arise).

Ellipsoid Against Hope (cont d) Let X be the matrix whose rows are the generated product distributions x (1),..., x (L). Consider the linear program (D ): [XU T ]y 1, y 0 If we apply the same ellipsoid method, with a separation oracle that returns the cut x (i) U T y 1 given query y (i), it would go through the same sequence of queries y (i) and return infeasible. Therefore (D ) is infeasible (presuming that numerical problems do not arise). This implies that its dual program (P ): [UX T ]α 0, α 0 is unbounded and has polynomial size. Given such a nonzero α vector, scaled to be a distribution, X T α satisfies the incentive constraints and is therefore a correlated equilibrium.

Computational Issues Although the matrix XU T is polynomial in size, computing it using matrix multiplication would involve an exponential number of operations.

Computational Issues Although the matrix XU T is polynomial in size, computing it using matrix multiplication would involve an exponential number of operations. On the other hand, entries of XU T are differences of expected utilities under product distributions, thus can be computed in polynomial time.

Outline 1 Computing Correlated Equilibrium 2 Papadimitriou and Roughgarden s algorithm 3 Numerical Precision Issues 4 Algorithm for Exact Correlated Equilibrium

Numerical Precision Issues a run of the ellipsoid method requires as inputs initial ball with radius R volume lower bound v correct values of R and v depend on the max encoding size of a constraint of the LP

Numerical Precision Issues a run of the ellipsoid method requires as inputs initial ball with radius R volume lower bound v correct values of R and v depend on the max encoding size of a constraint of the LP a constraint of (D ), e.g. x (i) U T y 1, may require more bits than any of the constraints of (D) running ellipsoid on (D ) with the same R, v as the ellipsoid run on (D) would no longer be valid infeasibility of (D ) is not guaranteed

Numerical Precision Issues a run of the ellipsoid method requires as inputs initial ball with radius R volume lower bound v correct values of R and v depend on the max encoding size of a constraint of the LP a constraint of (D ), e.g. x (i) U T y 1, may require more bits than any of the constraints of (D) running ellipsoid on (D ) with the same R, v as the ellipsoid run on (D) would no longer be valid infeasibility of (D ) is not guaranteed Papadimitriou and Roughgarden [2008] proposed a method to overcome this issue Stein, Parrilo & Ozdaglar [2010] showed that it is insufficient to compute an exact CE. a slightly modified version computes approximate CE in time polynomial in log 1 ɛ and representation size

What about exact CE? Stein et al. [2010] showed that if an algorithm 1 outputs a rational solution 2 outputs a convex combination of product distributions 3 outputs a convex combination of symmetric product distributions when the game is symmetric then there is a symmetric game such that the algorithm fails to find an exact CE.

What about exact CE? Stein et al. [2010] showed that if an algorithm 1 outputs a rational solution 2 outputs a convex combination of product distributions 3 outputs a convex combination of symmetric product distributions when the game is symmetric then there is a symmetric game such that the algorithm fails to find an exact CE. The Product Separation Oracle returns a symmetric product distribution given symmetric game and symmetric y.

What about exact CE? Stein et al. [2010] showed that if an algorithm 1 outputs a rational solution 2 outputs a convex combination of product distributions 3 outputs a convex combination of symmetric product distributions when the game is symmetric then there is a symmetric game such that the algorithm fails to find an exact CE. The Product Separation Oracle returns a symmetric product distribution given symmetric game and symmetric y. On the other hand, there always exists an exact rational CE each vertex of the polytope of the set of CE is rational (correspond to basic feasible solutions) such a CE has O(nm 2 ) non-zero entries, i.e. polynomial-sized support

Outline 1 Computing Correlated Equilibrium 2 Papadimitriou and Roughgarden s algorithm 3 Numerical Precision Issues 4 Algorithm for Exact Correlated Equilibrium

Our Results A variant of the Ellipsoid Against Hope algorithm of [Papadimitriou & Roughgarden, 2008] that computes an exact, rational CE in polynomial time given a representation satisfying polynomial type and polynomial expectation property; outputs a CE that is a vertex of the set of CE, which has polynomial-sized support.

Overview of Our Approach We replace the Product Separation Oracle with a modified version (Purified Separation Oracle) that generates cuts corresponding to pure strategy profiles: given y 0, output (U s ) T y 1 that is violated at y.

Overview of Our Approach We replace the Product Separation Oracle with a modified version (Purified Separation Oracle) that generates cuts corresponding to pure strategy profiles: given y 0, output (U s ) T y 1 that is violated at y. Now each constraint of (D ) is one of the original constraints of (D). any run of ellipsoid method that is valid for (D) is also valid for (D ) no longer requiring special mechanism to deal with numerical issues

Overview of Our Approach We replace the Product Separation Oracle with a modified version (Purified Separation Oracle) that generates cuts corresponding to pure strategy profiles: given y 0, output (U s ) T y 1 that is violated at y. Now each constraint of (D ) is one of the original constraints of (D). any run of ellipsoid method that is valid for (D) is also valid for (D ) no longer requiring special mechanism to deal with numerical issues A solution of (P ) is a mixture of polynomial number of pure-strategy profiles. Get vertex by applying a standard algorithm for finding basic feasible solutions given a feasible solution.

Purified Separation Oracle: Existence Lemma Given any dual vector y 0, there exists a pure strategy profile s such that (U s ) T y 0.

Purified Separation Oracle: Existence Lemma Given any dual vector y 0, there exists a pure strategy profile s such that (U s ) T y 0. Proof. we know there exists a product distribution x such that xu T y = 0. x[u T y] is the expected value of (U s ) T y under distribution x, which we denote E s x [(U s ) T y] there must exist s such that (U s ) T y xu T y = 0.

Purified Separation Oracle: Existence Lemma Given any dual vector y 0, there exists a pure strategy profile s such that (U s ) T y 0. Proof. we know there exists a product distribution x such that xu T y = 0. x[u T y] is the expected value of (U s ) T y under distribution x, which we denote E s x [(U s ) T y] there must exist s such that (U s ) T y xu T y = 0. not efficiently constructive

Purified Separation Oracle: Existence Lemma Given any dual vector y 0, there exists a pure strategy profile s such that (U s ) T y 0. Proof. we know there exists a product distribution x such that xu T y = 0. x[u T y] is the expected value of (U s ) T y under distribution x, which we denote E s x [(U s ) T y] there must exist s such that (U s ) T y xu T y = 0. not efficiently constructive sampling from x yields approximate cutting planes

Purified Separation Oracle: Derandomization Derandomize using the method of conditional probabilities 1 Given y 0, compute product distribution x satisfying xu T y = 0, i.e. E s x [(U s ) T y] = 0.

Purified Separation Oracle: Derandomization Derandomize using the method of conditional probabilities 1 Given y 0, compute product distribution x satisfying xu T y = 0, i.e. E s x [(U s ) T y] = 0. 2 For each player p, pick s p S p such that the conditional expectation E s x [(U s ) T y s 1,..., s p ] 0. such s p must exists because E s x[(u s) T y s 1,..., s p 1] 0 each conditional expectation can be efficiently computed using the expected utility algorithm

Purified Separation Oracle: Derandomization Derandomize using the method of conditional probabilities 1 Given y 0, compute product distribution x satisfying xu T y = 0, i.e. E s x [(U s ) T y] = 0. 2 For each player p, pick s p S p such that the conditional expectation E s x [(U s ) T y s 1,..., s p ] 0. such s p must exists because E s x[(u s) T y s 1,..., s p 1] 0 each conditional expectation can be efficiently computed using the expected utility algorithm 3 Output s = (s 1,..., s p ), and cutting plane (U s ) T y 1.

Purified Separation Oracle: Derandomization Derandomize using the method of conditional probabilities 1 Given y 0, compute product distribution x satisfying xu T y = 0, i.e. E s x [(U s ) T y] = 0. 2 For each player p, pick s p S p such that the conditional expectation E s x [(U s ) T y s 1,..., s p ] 0. such s p must exists because E s x[(u s) T y s 1,..., s p 1] 0 each conditional expectation can be efficiently computed using the expected utility algorithm 3 Output s = (s 1,..., s p ), and cutting plane (U s ) T y 1. Can return asymmetric cuts even for symmetric games and symmetric y.

Conclusion A variant of Ellipsoid Against Hope algorithm that computes an exact CE in polynomial time derandomization of the Product Separation Oracle as a result the algorithm is simplified polynomial-sized support

Conclusion A variant of Ellipsoid Against Hope algorithm that computes an exact CE in polynomial time derandomization of the Product Separation Oracle as a result the algorithm is simplified polynomial-sized support Our derandomization approach can be applied to other related algorithms, e.g. Huang and Von Stengel [2008] polynomial-time algorithm for extensive-form correlated equilibria.

Conclusion A variant of Ellipsoid Against Hope algorithm that computes an exact CE in polynomial time derandomization of the Product Separation Oracle as a result the algorithm is simplified polynomial-sized support Our derandomization approach can be applied to other related algorithms, e.g. Huang and Von Stengel [2008] polynomial-time algorithm for extensive-form correlated equilibria. Practical computation of CE by replacing the ellipsoid method with a cutting-plane method

Aumann, R. (1974). Subjectivity and correlation in randomized strategies. Journal of Mathematical Economics, 1(1), 67 96. Aumann, R. (1987). Correlated equilibrium as an expression of Bayesian rationality. Econometrica: Journal of the Econometric Society, 1 18. Huang, W., & Von Stengel, B. (2008). Computing an extensive-form correlated equilibrium in polynomial time. WINE: Proceedings of the Workshop on Internet and Network Economics (pp. 506 513). Papadimitriou, C., & Roughgarden, T. (2008). Computing correlated equilibria in multi-player games. Journal of the ACM, 55(3), 14. Rosenthal, R. (1973). A class of games possessing pure-strategy Nash equilibria. International Journal of Game Theory, 2, 65 67. Stein, N. D., Parrilo, P. A., & Ozdaglar, A. (2010). Exchangeable equilibria contradict exactness of the Papadimitriou-Roughgarden algorithm. http://arxiv.org/abs/1010.2871v1.