Randomized Nash Equilibrium

Similar documents
6.891 Games, Decision, and Computation February 5, Lecture 2

KAKUTANI S FIXED POINT THEOREM AND THE MINIMAX THEOREM IN GAME THEORY

Non-zero-sum Game and Nash Equilibarium

MS&E 246: Lecture 4 Mixed strategies. Ramesh Johari January 18, 2007

15 Real Analysis II Sequences and Limits , Math for Economists Fall 2004 Lecture Notes, 10/21/2004

Lecture 9: Dynamics in Load Balancing

MS&E 246: Lecture 18 Network routing. Ramesh Johari

6.896: Topics in Algorithmic Game Theory

CS 573: Algorithmic Game Theory Lecture date: January 23rd, 2008

CHAPTER 7. Connectedness

Discrete Optimization 2010 Lecture 12 TSP, SAT & Outlook

Game Theory and Algorithms Lecture 7: PPAD and Fixed-Point Theorems

Course 212: Academic Year Section 1: Metric Spaces

Math 730 Homework 6. Austin Mohr. October 14, 2009

FUNDAMENTAL GROUPS AND THE VAN KAMPEN S THEOREM

Math 6120 Fall 2012 Assignment #1

Efficiency Loss in a Network Resource Allocation Game

1 The Local-to-Global Lemma

Discrete Optimization 2010 Lecture 12 TSP, SAT & Outlook

The fundamental theorem of calculus for definite integration helped us to compute If has an anti-derivative,

Mathematical Preliminaries

Lecture 6: April 25, 2006

1 Some loose ends from last time

Weak Dominance and Never Best Responses

a i,1 a i,j a i,m be vectors with positive entries. The linear programming problem associated to A, b and c is to find all vectors sa b

THE FUNDAMENTAL GROUP AND BROUWER S FIXED POINT THEOREM AMANDA BOWER

Routing Games : From Altruism to Egoism

Linear Algebra. Preliminary Lecture Notes

Impagliazzo s Hardcore Lemma

LECTURE 15: COMPLETENESS AND CONVEXITY

6.254 : Game Theory with Engineering Applications Lecture 7: Supermodular Games

Rose-Hulman Undergraduate Mathematics Journal

Set, functions and Euclidean space. Seungjin Han

6.207/14.15: Networks Lecture 10: Introduction to Game Theory 2

Mathematical Methods in Economics (Part I) Lecture Note

Games and Their Equilibria

Convergence Rate of Best Response Dynamics in Scheduling Games with Conflicting Congestion Effects

Linear Algebra. Preliminary Lecture Notes

6.254 : Game Theory with Engineering Applications Lecture 8: Supermodular and Potential Games

Nash Equilibrium: Existence & Computation

EC3224 Autumn Lecture #04 Mixed-Strategy Equilibrium

1 More finite deterministic automata

Convex Sets with Applications to Economics

Variants of Nash Equilibrium. CMPT 882 Computational Game Theory Simon Fraser University Spring 2010 Instructor: Oliver Schulte

V (v i + W i ) (v i + W i ) is path-connected and hence is connected.

MS&E 246: Lecture 17 Network routing. Ramesh Johari

Multimarket Oligopolies with Restricted Market Access

Mixed Strategies. Krzysztof R. Apt. CWI, Amsterdam, the Netherlands, University of Amsterdam. (so not Krzystof and definitely not Krystof)

Def. A topological space X is disconnected if it admits a non-trivial splitting: (We ll abbreviate disjoint union of two subsets A and B meaning A B =

Convex Sets Strict Separation. in the Minimax Theorem

Lecture 6: September 19

Lecture 24: April 12

Brouwer Fixed-Point Theorem

6.207/14.15: Networks Lecture 11: Introduction to Game Theory 3

A Truthful Mechanism for Fair Load Balancing in Distributed Systems Λ

Lecture 1: Convex Sets January 23

Math General Topology Fall 2012 Homework 1 Solutions

Chapter 9. Mixed Extensions. 9.1 Mixed strategies

Lecture 1: September 25, A quick reminder about random variables and convexity

Munkres Chapter 9. The Fundamental Group

Math 117: Continuity of Functions

A Discrete Duality Between Nonmonotonic Consequence Relations and Convex Geometries

Efficiency, Fairness and Competitiveness in Nash Bargaining Games

Topological properties

Bayesian Persuasion Online Appendix

The effect of learning on membership and welfare in an International Environmental Agreement

A history of Topology

Economics Noncooperative Game Theory Lectures 3. October 15, 1997 Lecture 3

CS 598RM: Algorithmic Game Theory, Spring Practice Exam Solutions

THE UNIQUENESS PROPERTY FOR NETWORKS WITH SEVERAL ORIGIN-DESTINATION PAIRS

Efficiency and Braess Paradox under Pricing

Metric Spaces and Topology

Strong Equilibrium in Congestion Games

Constrained Optimization and Lagrangian Duality

After taking the square and expanding, we get x + y 2 = (x + y) (x + y) = x 2 + 2x y + y 2, inequality in analysis, we obtain.

A Generic Bound on Cycles in Two-Player Games

The Distribution of Optimal Strategies in Symmetric Zero-sum Games

Existence of sparsely supported correlated equilibria

Cauchy s Theorem (rigorous) In this lecture, we will study a rigorous proof of Cauchy s Theorem. We start by considering the case of a triangle.

THE CANTOR GAME: WINNING STRATEGIES AND DETERMINACY. by arxiv: v1 [math.ca] 29 Jan 2017 MAGNUS D. LADUE

Appendix E : Note on regular curves in Euclidean spaces

Constructive Proof of the Fan-Glicksberg Fixed Point Theorem for Sequentially Locally Non-constant Multi-functions in a Locally Convex Space

Writing proofs for MATH 51H Section 2: Set theory, proofs of existential statements, proofs of uniqueness statements, proof by cases

Lecture Overview. 2 Weak Induction

ON SPACE-FILLING CURVES AND THE HAHN-MAZURKIEWICZ THEOREM

CS243, Logic and Computation Nondeterministic finite automata

(x, y) = d(x, y) = x y.

Lecture 1: Overview of percolation and foundational results from probability theory 30th July, 2nd August and 6th August 2007

Connectedness. Proposition 2.2. The following are equivalent for a topological space (X, T ).

Flows and Cuts. 1 Concepts. CS 787: Advanced Algorithms. Instructor: Dieter van Melkebeek

Introduction to Topology

STRONGLY CONNECTED SPACES

Algorithmic Game Theory and Economics: A very short introduction. Mysore Park Workshop August, 2012

CS 573: Algorithmic Game Theory Lecture date: Feb 6, 2008

Theorems. Theorem 1.11: Greatest-Lower-Bound Property. Theorem 1.20: The Archimedean property of. Theorem 1.21: -th Root of Real Numbers

princeton univ. F 17 cos 521: Advanced Algorithm Design Lecture 6: Provable Approximation via Linear Programming

g 2 (x) (1/3)M 1 = (1/3)(2/3)M.

Notes on Fixed Point Theorems

: Cryptography and Game Theory Ran Canetti and Alon Rosen. Lecture 8

Informational Braess Paradox: The Effect of Information on Traffic Congestion

Transcription:

CS 684 Algorithmic Game Theory Scribe: Muthu Venkitasubramaniam Instructor: Eva Tardos October 26, 2004 The construction itself is an art, its application to the world an evil parasite. - L E J Brouwer Randomized Nash Euilibrium In this lecture, we discuss the proof of existence of Nash euilibrium in a randomized game. We consider games where the number of players and the strategy sets are finite. We remark here that the finiteness of the strategy set and player set is crucial to the main theorem. We have discussed several games in the past lectures where the players set and the strategy set are finite and infinite, some of which are listed below. Job-scheduling game - The number of jobs (i.e. players) is finite, and the number of machines (strategy set) each job could be assigned is finite too. Facility-location game - The number of players is finite, and the number of locations where each player could open a facility is finite. Job splitting game - The jobs were considered a large number of infinitesimally small jobs, in effect assuming infinite jobs, with finite strategy. There is also a related game with finite jobs with infinite strategies where users control a finite amount of flow, and may split flow. Johari Tsitsiklis game - The player set is finite, while the number of strategies is infinite. Main theorem There are n players, and each player i has a finite strategy set S i from which it plays. We consider a valuation function v i for player i. v i (s 1, s 2,..., s n ) is the benefit function for player i when the players strategies are s 1, s 2,..., s n (, i.e. player j plays strategy s j ). In the games discussed so far, we have considered only pure Nash euilibrium. In a pure Nash, each player plays exactly one strategy from the strategy set. In this lecture, we consider randomized Nash where each player can play several strategies with some probability. Games in general, need not have a pure Nash euilibrium, while randomized Nash euilibrium always exists. A randomized strategy is a probability distribution p i over the strategy set S i for each player i. p i (s) is the probability with which player i plays strategy s S i. Since this is a probability distribution, p i (s) 0 and p i (s) = 1. s S i Each player tries to maximize its benefit in expectation. Given a probability distribution p j for each player j, the benefit for player i is E(v i (s 1, s 2,..., s n )). This can be thought of as the benefit player i will get on the average, when the game is played several times with the same randomized strategy. E(v i (s 1, s 2,..., s n )) = p 1 (s 1 )p 2 (s 2 ) p n (s n )v i (s 1, s 2,..., s n ) s i S i, i

To make our notation cleaner we will use v i (p 1, p 2,..., p n ) to denote the above expectation. Note that p i here is a probability distribution for player i. We now state the main theorem for the lecture, Theorem 1 (J. F. Nash) For any game with finite players and finite strategy sets, there exists a randomized Nash euilibrium. We take a slight detour into real analysis, to develop machinery to solve the main theorem. Brouwer s Fixed Point theorem Theorem 2 (L. E. J. Brouwer) Every continuous mapping f from a closed, bounded and convex set C n to itself has a fixed point, i.e. there exists x C such that f (x) = x. Refer to Appendix A for the definitions. We do not prove this theorem here, and refer to [2] for the proof. We given some intuition as to why this is true. Consider continuous functions f : [0, 1] [0, 1]. [0, 1] is a closed, bounded and convex, and f is continuous. Since these functions satisfy the conditions of the theorem, there exists an x [0, 1] such that f (x) = x. As seen in the Figure 1, f (0) must lie somewhere on the line segment AB and f (1) must lie somewhere on the line segment CD. Since the function is continuous, the path traced by f is a curve from f (0) to f (1). This should meet line segment AD at some point (x in Figure 1). This point is a fixed point for function f. Figure 1: Fixed point in 1-dimension Figure 2: Fixed point in 1-dimension We illustrate with another example as to why convexity is reuired. Consider the annulus shown in Figure 2. This is a closed and bounded set in 2, but not convex. We define a function f on this, that rotates the annulus by some angle. This function is continuous, but does not have a fixed point. Having set the machinery, we now proceed on to the main theorem. Proof of Theorem 1: The basic idea is that we invoke the Brouwer s fixed point theorem. The set C under consideration is {(p 1, p 2,..., p n )} where p i is the probability distribution over the strategy set S i. Therefore, C S 1 + S 2 +...+ S n. C is closed, bounded and convex, since we have the constraints p i (s) 0 and s S i p i (s) = 1. Given (p 1, p 2,..., p n ), define i to be the best response for player i, that is, we define i to be the probability distribution maximizing max v i (p 1, p 2,..., p i 1,, p i+1,..., p n )).

We now define the mapping f : C C as follows, f (p 1, p 2,..., p n ) = ( 1, 2,..., n ) Clearly, a fixed point for the function f is a Nash euilibrium. If f satisfies the conditions for Brouwer s Theorem, we are done. But we are not uite there, with respect to satisfying the conditions of Brouwer s Theorem for the following reasons: The best response i may not be uniue for player i. We have seen this in several of our games, such as job scheduling, atomic routing, etc. Hence, f is not well defined. Suppose, we chose one of the possible best responses (say the lexicographically first one) as i then f will not be continuous around the points that had multiple best responses. To resolve these issues, there are two approaches, 1. Kakutani s Fixed Point Theorem: This approach deals with more general notions of functions where the function maps the points in the set C to closed and convex subsets of C (in our case the set of best responses), and the theorem states that for such functions there is a point x such that x f (x). This is exactly what we need. 2. So we don t have to deal with extending the fixed point theorem, the proof of Kakutani s fixed point theorem, defined in approach 1, for this case we define a point function f which satisfies the conditions of Brouwer s fixed point theorem and hence prove the theorem. Approach 2 We define i as the probability distribution that maximizes [ max vi (p 1, p 2,..., p i 1,, p i+1,..., p n ) p i 2] We need to show that, 1. f is a function: We claim that there exists an uniue that maximizes the right hand side. Intuitively, this is because p i i 2 is strictly concave. 2. f is continuous: With minimal calculus this can be proved. 3. We claim that, if x = (p 1, p 2,..., p n ) is a fixed point, then it is a Nash. To prove this, we assume it is not a Nash and arrive at a contradiction. Suppose, for some player i, there exists a better strategy as shown in Figure 3, and suppose the value v i improved by δ when moving from p i to. Now consider moving move along a straight line from p i to. We claim that the function increases [ vi (p 1, p 2,..., p i 1,, p i+1,..., p n ) p i 2], and hence is not maximized at p i, and hence (p 1,..., p n ) is not a fixed point. How do we see that the function decreases? The rate of change on the first term is δ/ p i where p j is fixed for j i, since the expectation is linear in.

On the other hand the initial rate of change in the second term p i 2 is 0, as the derivative of x 2 is 2x whose derivative at 0 vanishes ([x 2 ] x=0 = 0). This implies that there exists some point (in the direction of from p i possibly nearby p i ) that has a greater value for v i (p 1, p 2,..., p i 1,, p i+1,..., p n ) p i 2, thus contradicting the maximality of i. p i Therefore, by Brouwer s Theorem, we know there exists a fixed point x and that is a Nash euilibrium. Finding a Nash In this lecture, we have shown that finding a Nash euilibrium is the same as finding the fixed point of a function, satisfying conditions of Brouwer s theorem(brouwer function). It is interesting to note that the converse is also true as proved in [1]. They prove that, given a Brouwer function, there exists a game, whose Nash euilibrium yields a fixed point for the function. In effect, they show that finding a fixed point and finding a Nash has same complexity. References [1] C. Daskalakis, P. W. Goldberg, and C. H. Papadimitriou. The complexity of computing a nash euilibrium. Electronic collouium on computational complexity, 2005. http://eccc.uni-trier.de/ecccreports/2005/tr05-115/. [2] James R. Munkres. Topology. Prentice-Hall. A Definitions from Topology We give some definitions from topology. For this discussion we assume the Euclidean metric space n. Definition 1 A set C n is closed, if every limit point of an infinite seuence is contained in C. A general topological space is defined in terms of its open sets and closed sets are defined as those whose complements are open. In, finite union of closed intervals form closed sets. In a general topological space, the above definition is called a complete set and in the context of the Euclidean space it turns out to be closed too. Definition 2 A set C n is bounded, if there exists a real number r such that, for any x and y in C, x y 2 < r.

Definition 3 A set C n is convex, if for x and y in C, the line joining x and y is contained in C. Definition 4 A mapping f : n n is continuous if it the inverse image of every open set is an open set. Intuitively, this means that for every point y = f (x) in the range of f, if we take a small (open) ball y covering y, then is an (open) ball x covering x such that f ( x ) y.