Game Theory. 2.1 Zero Sum Games (Part 2) George Mason University, Spring 2018

Similar documents
Game Theory. Greg Plaxton Theory in Programming Practice, Spring 2004 Department of Computer Science University of Texas at Austin

Algorithmic Game Theory and Applications. Lecture 4: 2-player zero-sum games, and the Minimax Theorem

Game Theory Correlated equilibrium 1

Zero-Sum Games Public Strategies Minimax Theorem and Nash Equilibria Appendix. Zero-Sum Games. Algorithmic Game Theory.

Game Theory: Spring 2017

Game Theory: introduction and applications to computer networks

Computational Game Theory Spring Semester, 2005/6. Lecturer: Yishay Mansour Scribe: Ilan Cohen, Natan Rubin, Ophir Bleiberg*

Game Theory DR. ÖZGÜR GÜRERK UNIVERSITY OF ERFURT WINTER TERM 2012/13. Strict and nonstrict equilibria

Basic Game Theory. Kate Larson. January 7, University of Waterloo. Kate Larson. What is Game Theory? Normal Form Games. Computing Equilibria

Game Theory. 4. Algorithms. Bernhard Nebel and Robert Mattmüller. May 2nd, Albert-Ludwigs-Universität Freiburg

1 Motivation. Game Theory. 2 Linear Programming. Motivation. 4. Algorithms. Bernhard Nebel and Robert Mattmüller May 15th, 2017

SF2972 Game Theory Exam with Solutions March 15, 2013

April 29, 2010 CHAPTER 13: EVOLUTIONARY EQUILIBRIUM

Introduction to Game Theory Lecture Note 2: Strategic-Form Games and Nash Equilibrium (2)

Game Theory: Lecture 3

1 Basic Game Modelling

MS&E 246: Lecture 17 Network routing. Ramesh Johari

Normal-form games. Vincent Conitzer

Economics 3012 Strategic Behavior Andy McLennan October 20, 2006

Game Theory. Professor Peter Cramton Economics 300

Mixed Strategies. Krzysztof R. Apt. CWI, Amsterdam, the Netherlands, University of Amsterdam. (so not Krzystof and definitely not Krystof)

Belief-based Learning

Area I: Contract Theory Question (Econ 206)

Basics of Game Theory

Lecture 3: Probability

The Distribution of Optimal Strategies in Symmetric Zero-sum Games

On the Complexity of Two-Player Win-Lose Games

3.3.3 Illustration: Infinitely repeated Cournot duopoly.

EC3224 Autumn Lecture #03 Applications of Nash Equilibrium

princeton univ. F 13 cos 521: Advanced Algorithm Design Lecture 17: Duality and MinMax Theorem Lecturer: Sanjeev Arora

MS&E 246: Lecture 12 Static games of incomplete information. Ramesh Johari

Mixed Nash Equilibria

Oligopoly Theory. This might be revision in parts, but (if so) it is good stu to be reminded of...

Mixed strategy equilibria (msne) with N players

University of Warwick, Department of Economics Spring Final Exam. Answer TWO questions. All questions carry equal weight. Time allowed 2 hours.

Module 16: Signaling

CSC304 Lecture 5. Game Theory : Zero-Sum Games, The Minimax Theorem. CSC304 - Nisarg Shah 1

MANAGEMENT SCIENCE doi /mnsc ec pp. ec1 ec6

Selecting Efficient Correlated Equilibria Through Distributed Learning. Jason R. Marden

Notes on Blackwell s Comparison of Experiments Tilman Börgers, June 29, 2009

Exercise 11. Solution. EVOLUTION AND THE THEORY OF GAMES (Spring 2009) EXERCISES Show that every 2 2 matrix game with payoff matrix

MATH4250 Game Theory 1. THE CHINESE UNIVERSITY OF HONG KONG Department of Mathematics MATH4250 Game Theory

CS364A: Algorithmic Game Theory Lecture #13: Potential Games; A Hierarchy of Equilibria

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program June 2016

Today: Linear Programming (con t.)

Lecture 6: April 25, 2006

Review of topics since what was covered in the midterm: Topics that we covered before the midterm (also may be included in final):

Advanced Microeconomics

Models of Reputation with Bayesian Updating

1 The General Definition

Msc Micro I exam. Lecturer: Todd Kaplan.

Chapter 9. Mixed Extensions. 9.1 Mixed strategies

On Hotelling s Stability in Competition

Industrial Organization, Fall 2011: Midterm Exam Solutions and Comments Date: Wednesday October

M340(921) Solutions Practice Problems (c) 2013, Philip D Loewen

TI /1 Tinbergen Institute Discussion Paper Maximin Play in Two-Person Bimatrix Games. Vitaly Pruzhansky. Amsterdam, and Tinbergen Institute.

General-sum games. I.e., pretend that the opponent is only trying to hurt you. If Column was trying to hurt Row, Column would play Left, so

Game Theory: Lecture 2

Motivating examples Introduction to algorithms Simplex algorithm. On a particular example General algorithm. Duality An application to game theory

6.254 : Game Theory with Engineering Applications Lecture 13: Extensive Form Games

Game theory lecture 4. September 24, 2012

Solutions to Exercises of Section III.1. a (13/4, 3) (22/4, 3/4). b (4, 10/4) (21/4, 2)

SF2972 Game Theory Written Exam with Solutions June 10, 2011

Game Theory: Spring 2017

Game theory and market power

A (Brief) Introduction to Game Theory

Industrial Organization Lecture 3: Game Theory

Patience and Ultimatum in Bargaining

האוניברסיטה העברית בירושלים

Variants of Nash Equilibrium. CMPT 882 Computational Game Theory Simon Fraser University Spring 2010 Instructor: Oliver Schulte

EC3224 Autumn Lecture #04 Mixed-Strategy Equilibrium

Bounded Rationality, Strategy Simplification, and Equilibrium

On revealed preferences in oligopoly games

Computing Minmax; Dominance

Fake news. Xiaofan Li & Andrew Whinston. University of Texas at Austin December 30, 2017

Economics 385: Suggested Solutions 2

4: Dynamic games. Concordia February 6, 2017

BELIEFS & EVOLUTIONARY GAME THEORY

1 Primals and Duals: Zero Sum Games

Algebraic Expressions and Identities

Non-zero-sum Game and Nash Equilibarium

Static (or Simultaneous- Move) Games of Complete Information

Lecture December 2009 Fall 2009 Scribe: R. Ring In this lecture we will talk about

Political Economy of Institutions and Development: Problem Set 1. Due Date: Thursday, February 23, in class.

LECTURE 22, WEDNESDAY in lowest terms by H(x) = max{ p, q } and proved. Last time, we defined the height of a rational number x = p q

Calculus in Business. By Frederic A. Palmliden December 7, 1999

Introduction to Game Theory

Guilt in Games. P. Battigalli and M. Dufwenberg (AER, 2007) Presented by Luca Ferocino. March 21 st,2014

Lecture Notes on Game Theory

September 7, Formal Definition of a Nondeterministic Finite Automaton

Game-Theoretic Learning:

Extensive Form Games I

Open Sequential Equilibria of Multi-Stage Games with Infinite Sets of Types and Actions

Game Theory and Algorithms Lecture 2: Nash Equilibria and Examples

CS 598RM: Algorithmic Game Theory, Spring Practice Exam Solutions

Common Knowledge: The Math

Economics 703 Advanced Microeconomics. Professor Peter Cramton Fall 2017

The Nash Bargaining Solution in Gen Title Cooperative Games. Citation Journal of Economic Theory, 145(6):

Game Theory for Linguists

Sequential Equilibrium in Computational Games

Transcription:

Game Theory 2.1 Zero Sum Games (Part 2) George Mason University, Spring 2018

The Nash equilibria of two-player, zero-sum games have various nice properties. Minimax Condition A pair of strategies is in equilibrium if (but not only if) the outcome determined by the strategies equals the minimal value of the row and the maximal value of the column. These solutions can be determined using the maximinimizer method. All maximinimizer strategy pairs are Nash equilibria. All maximinimizer strategy pairs have the same value, and this is the value of the game.

Let min(a j ) designate the lowest utility obtainable by performing act a j A j. Def 2.1.4. A maximinimizer for player j is an action aj A j where min(aj ) = max a j A j (min(a j )). Thm 2.1.1. a 1, a 2 is a Nash equilibrium of the two-player zero sum game G only if a 1 A 1 is a maximinimizer for player 1 and a 2 A 2 is a maximinimizer for player 2.

a 1 a 2 a 3 a 1 8 8 7 a 2 0-10 -4 a 3 9 0-1 Recall that a 1, a 3 is the only Nash equilibrium of this game.

a 1 a 2 a 3 a 1 8 8 7 a 2 0-10 -4 a 3 9 0-1 a 1 A 1 is a maximinimizer for player 1.

a 1 a 2 a 3 a 1 8 8 7 a 2 0-10 -4 a 3 9 0-1 a 3 A 2 is a maximinimizer for player 2.

a 1 a 2 a 3 a 1 0 8 3 a 2 0 1 10 a 3-2 6 5 Recall that a 1, a 1 and a 2, a 1 are the Nash equilibria of this game.

a 1 a 2 a 3 a 1 0 8 3 a 2 0 1 10 a 3-2 6 5 a 1 A 1 and a 2 A 1 are maximinimizers for player 1.

a 1 a 2 a 3 a 1 0 8 3 a 2 0 1 10 a 3-2 6 5 a 1 A 2 is a maximinimizer for player 2.

Proof of Thm 2.1.1. (If you re interested) If a 1, a 2 is a Nash equilibrium, then u 2(g( a 1, a 2 )) u 2(g( a 1, a 2 )) for each a 2 A 2, so u 1 (g( a 1, a 2 )) u 1(g( a 1, a 2 )) for each a 2 A 2. Hence, min(a 1 ) = u 1(g( a 1, a 2 )) max a 1 A 1 (min(a 1 )). If a 1, a 2 is a Nash equilibrium, then u 1(g( a 1, a 2 )) u 1(g( a 1, a 2 )) for each a 1 A 1, so u 1 (g( a 1, a 2 )) min(a 1) for each a 1 A 1. Hence, u 1 (g( a 1, a 2 )) max a 1 A 1 (min(a 1 )). Thus, min(a 1 ) = u 1(g( a 1, a 2 )) = max a 1 A 1 (min(a 1 )). That is, a 1 is a maximinimizer for player 1. Similar reasoning establishes that min(a 2 ) = u 2(g( a 1, a 2 )) = max a 2 A 2 (min(a 2 )).

Thm 2.1.2. If the two-player zero sum game G has a Nash equilibrium, then a1 and a 2 are maximinimizers for players 1 and 2 respectively only if a1, a 2 is a Nash equilibrium of G.

Proof of Thm 2.1.2. (If you re interested) Suppose that G has a Nash equilibrium. Then from the Proof of Thm 2.1.1, we know that max a1 A 1 (min(a 1 )) = max a2 A 2 (min(a 2 )). Let max a1 A 1 (min(a 1 )) = v. Then max a2 A 2 (min(a 2 )) = v. Since a 1 is a maximinimizer for 1, u 1(g( a 1, a 2 )) v for all a 2 A 2, so u 1 (g( a 1, a 2 )) v. Since a 2 is a maximinimizer for 2, u 2(g( a 1, a 2 )) v for all a 1 A 1, so u 1 (g( a 1, a 2 )) v. Thus, u 1 (g( a 1, a 2 )) = v and u 2 (g( a 1, a 2 )) = v, so a 1, a 2 is a Nash equilibrium of G.

Thm 2.1.3. a1, a 2 and a 1, a 2 are Nash equilibria of the two-player zero sum game G only if u 1 (g( a1, a 2 )) = u 1(g( a1, a 2 )) (moreover, u 2 (g( a1, a 2 )) = u 2(g( a1, a 2 ))). Less formally, all Nash equilibria of G have the same utilities. The equilibrium payoff v to player 1 is the value of the game.

Proof of Thm 2.1.3. (If you re interested) Suppose that a1, a 2 and are Nash equilibria of G. a1, a 2 From the Proof of Thm 2.1.1, we know that u 1 (g( a1, a 2 )) = u 1(g( a1, a 2 )) = max a 1 A 1 (min(a 1 )). We also know that u 2 (g( a 1, a 2 )) = u 2(g( a 1, a 2 )) = max a 2 A 2 (min(a 2 )).

a 1 a 2 a 3 a 1 8 8 7 a 2 0-10 -4 a 3 9 0-1 The value of this game is 7.

a 1 a 2 a 3 a 1 0 8 3 a 2 0 1 10 a 3-2 6 5 The value of this game is 0.

Thm 2.1.4 (Coordination Theorem for Zero Sum Games). a1, a 2 and a 1, a 2 are Nash equilibria of the two-player zero sum game G only if a1, a 2 and a 1, a 2 are Nash equilibria of G.

Proof of Thm 2.1.4. Suppose that a1, a 2 and a equilibria of G. 1, a 2 are Nash By Thm 2.1.1, we know that a1 A 1 and a1 A 1 are maximinimizers for player 1, and a2 A 2 and a2 A 2 are maximinimizers for player 2. Thus, by Thm 2.1.2, we know that both a1, a 2 and a 1, a 2 are Nash equilibria of G.

a 1 a 2 a 3 a 4 a 1 1 2 3 1 a 2 0 5 0 0 a 3 1 6 4 1 a 1, a 1 and a 3, a 4 are Nash equilbria.

a 1 a 2 a 3 a 4 a 1 1 2 3 1 a 2 0 5 0 0 a 3 1 6 4 1 So a 1, a 4 and a 3, a 1 are too.

Unfortunately, not every two-player zero sum game has an action profile in equilibrium. paper rock scissors paper 0,0 1,-1-1,1 rock -1,1 0,0 1,-1 scissors 1,-1-1,1 0,0 max a1 A 1 (min(a 1 )) = 1 max a2 A 2 (min(a 2 )) = 1 max a1 A 1 (min(a 1 )) max a2 A 2 (min(a 2 )).

Fortunately, once we allow for mixed strategies, every two-player zero sum game has a pair of pure or mixed strategies in Nash equilibirum.

A player s choices can now be nondeterministic. [ 1 4 ] [ 1 2 ] [ 1 4 ] paper rock scissors [ 1 3 ] paper 0 1-1 [ 1 3 ] rock -1 0 1 [ 1 3 ] scissors 1-1 0 Player 1 s mixed strategy is to play paper with probability 1 3, rock with probability 1 3, and scissors with probability 1 3. Player 2 s mixed strategy is to play paper with probability 1 4, rock with probability 1 2, and scissors with probability 1 4.

A player s choices can now be nondeterministic. [ 1 4 ] [ 1 2 ] [ 1 4 ] paper rock scissors [ 1 3 ] paper 0 1-1 [ 1 3 ] rock -1 0 1 [ 1 3 ] scissors 1-1 0 Viewing mixed strategies naïvely, we can think of player 1 and player 2 as committing themselves to randomized chance mechanisms that select actions with various probabilities.

Each a i A i is a pure strategy of player i. Let (A i ) designate the set of probability measures over A i. Each α i (A i ) is a mixed strategy of player i. α i (a i ) = p i iff player i chooses a i A i with probability p i. The mixed strategy α i for player i N with A i = n where a 1 A i is chosen with probability p 1, a 2 A i is chosen with probability p 2,..., and a n A i is chosen with probability p n can be written as a 1 [p 1 ],..., a n [p n ]. paper[ 1 3 ], rock[ 1 3 ], scissors[ 1 3 ] (A 1). paper[ 1 4 ], rock[ 1 2 ], scissors[ 1 4 ] (A 2). Note that pure strategies can be regarded as special cases of mixed strategies. For instance, rock = paper[0], rock[1], scissors[0].

Before we worked with action profiles in i N A i. We will now work with mixed strategy profiles in i N (A i ). For instance, paper[ 1 3 ], rock[ 1 3 ], scissors[ 1 3 ], paper[ 1 4 ], rock[ 1 2 ], scissors[ 1 4 ] is a mixed strategy profile in (A 1 ) (A 2 ).

What is the expected utility of a mixed strategy profile α 1,..., α N for each player i N? Note that each mixed strategy profile induces a probability distribution over the set i N A i of action profiles. paper rock scissors paper 0 [ 1 12 ] 1 [ 1 6 ] -1 [ 1 12 ] rock -1 [ 1 12 ] 0 [ 1 6 ] 1 [ 1 12 ] scissors 1 [ 1 12 ] -1 [ 1 6 ] 0 [ 1 12 ] In general, the probability of pure strategy profile a 1,..., a N i N A i is Π i N α i (a i ). The expected utility of mixed strategy profile α 1,..., α N i N α i for player i N is Σ a1,...,a N i N A i Π i N α i (a i )u i (g( a 1,..., a N )).

What is the expected utility of a mixed strategy profile α 1,..., α N for each player i N? Note that each mixed strategy profile induces a probability distribution over the set i N A i of action profiles. paper rock scissors paper 0 [ 1 12 ] 1 [ 1 6 ] -1 [ 1 12 ] rock -1 [ 1 12 ] 0 [ 1 6 ] 1 [ 1 12 ] scissors 1 [ 1 12 ] -1 [ 1 6 ] 0 [ 1 12 ] For example, EU 1 ( paper[ 1 3 ], rock[ 1 3 ], scissors[ 1 3 ], paper[ 1 4 ], rock[ 1 2 ], scissors[ 1 4 ] ) = 0 1 12 +1 1 6 1 1 12 1 1 12 +0 1 6 +1 1 12 +1 1 12 1 1 6 +0 1 12 = 0.

What is the expected utility of a mixed strategy profile α 1,..., α N for each player i N? Note that each mixed strategy profile induces a probability distribution over the set i N A i of action profiles. paper rock scissors paper 0 [ 1 12 ] 1 [ 1 6 ] -1 [ 1 12 ] rock -1 [ 1 12 ] 0 [ 1 6 ] 1 [ 1 12 ] scissors 1 [ 1 12 ] -1 [ 1 6 ] 0 [ 1 12 ] For example, EU 2 ( paper[ 1 3 ], rock[ 1 3 ], scissors[ 1 3 ], paper[ 1 4 ], rock[ 1 2 ], scissors[ 1 4 ] ) = 0 1 12 1 1 6 +1 1 12 +1 1 12 +0 1 6 1 1 12 1 1 12 +1 1 6 +0 1 12 = 0.

We can now extend the concept of Nash equilibrium to cover mixed strategies. Def 2.1.6. A mixed strategy Nash equilibrium of G is a mixed strategy profile α i N α i such that for every player j N, the following condition holds: EU j ( α j, α j ) EU j( α j, α j ) for each α j (A j ). In other words, given the other players equilibrium mixed strategy profile α j, the equilibrium mixed strategy α j of player j is optimal. Osborne and Rubinstein: No player can profitably deviate, given the actions of the other players. Pure strategy Nash equilibria can be regarded as special cases of mixed strategy Nash equilibria.

[ 1 4 ] [ 1 2 ] [ 1 4 ] paper rock scissors [ 1 3 ] paper 0 1-1 [ 1 3 ] rock -1 0 1 [ 1 3 ] scissors 1-1 0 Is paper[ 1 3 ], rock[ 1 3 ], scissors[ 1 3 ], paper[ 1 4 ], rock[ 1 2 ], scissors[ 1 4 ] a mixed strategy Nash equilibrium of Rock, Paper, & Scissors? No. Player 1 does well to play paper[1], rock[0], scissors[0] instead. EU 1 ( paper[ 1 3 ], rock[ 1 3 ], scissors[ 1 3 ], paper[ 1 4 ], rock[ 1 2 ], scissors[ 1 4 ] ) = 0. EU 1 ( paper[1], rock[0], scissors[0], paper[ 1 4 ], rock[ 1 2 ], scissors[ 1 4 ] ) = 1 4.

[ 1 3 ] [ 1 3 ] [ 1 3 ] paper rock scissors [ 1 3 ] paper 0 1-1 [ 1 3 ] rock -1 0 1 [ 1 3 ] scissors 1-1 0 Is paper[ 1 3 ], rock[ 1 3 ], scissors[ 1 3 ], paper[ 1 3 ], rock[ 1 3 ], scissors[ 1 3 ] a mixed strategy Nash equilibrium of Rock, Paper, & Scissors? Yes. If either player plays the equiprobable mixed strategy, the expected utilities of both players are 0.

[ 1 2 ] [ 1 2 ] a 1 a 2 [ 1 2 ] a 1 6 3 [ 1 2 ] a 2 2 4 Is a 1 [ 1 2 ], a 2[ 1 2 ], a 1[ 1 2 ], a 2[ 1 2 ] a mixed strategy Nash equilibrium? No. EU 1 ( a 1 [ 1 2 ], a 2[ 1 2 ], a 1[ 1 2 ], a 2[ 1 2 ] ) = 15 4. But EU 1 ( a 1 [1], a 2 [0], a 1 [ 1 2 ], a 2[ 1 2 ] ) = 18 4. Does this game even have any mixed strategy Nash equilibria?

Thm 2.1.5 (Maximin Theorem for Two-player Zero Sum Games). Every two-person zero sum game has at least one mixed strategy Nash equilibrium. Moreover, the expected utilities of each Nash equilibrium mixed strategy profile are the same.

Partial Proof of Thm 2.1.5. We show that any 2x2 zero sum game in standard form has a mixed strategy Nash equilibrium. a 1 a 2 a 1 a b a 2 c d a, b, c, d R[0, ) where a > c, d > b, and d > c. Note that this game does not have any pure strategy Nash equilibria.

[q] [1 q] a 1 a 2 [p] a 1 a b [1 p] a 2 c d EU 1 ( a 1 [p], a 2 [1 p], a 1 [q], a 2 [1 q] ) = apq + bp(1 q) + c(1 p)q + d(1 p)(1 q) = apq + bp bpq + cq cpq + d dp dq + dpq = (a c + d b)pq (d b)p (d c)q + d = Apq Bp Cq + D = A((p C A )(q B DA BC A )) + A where A = a b + d c > 0, B = d b > 0, C = d c > 0, D = d > 0.

EU 1 ( a 1 [p], a 2 [1 p], a 1 [q], a 2 [1 q] ) = A((p C A )(q B DA BC A )) + A. EU 2 ( a 1 [p], a 2 [1 p], a 1 [q], a 2 [1 q] ) = A((p C A )(q B DA BC A )) A. **Player 1 can prevent EU 1 from falling below DA BC A by setting p = C A. **Player 2 can prevent EU 2 from falling below DA BC A q = B A. by setting a 1 [ C A ], a 2[1 C A ], a 1[ B A ], a 2[1 B A ] is a mixed strategy equilibrium. The value of the game is DA BC A.

a 1 a 2 a 1 6 3 a 2 2 4 What is a mixed strategy Nash equilibrium of this game?

a 1 a 2 a 1 6 3 a 2 2 4 A = a b + d c = B = d b = C = d c = D = d = p = C A = q = B A = EU 1 ( a 1 [p], a 2 [1 p], a 1 [q], a 2 [1 q] ) =

a 1 a 2 a 1 6 3 a 2 2 4 A = a b + d c = 5 B = d b = 1 C = d c = 2 D = d = 4 p = C A = 2 5 q = B A = 1 5 EU 1 ( a 1 [ 2 5 ], a 2[ 3 5 ], a 1[ 1 5 ], a 2[ 4 5 ] ) = 2 25 6+ 8 25 3+ 3 25 EU 1 ( a 1 [ 2 5 ], a 2[ 3 5 ], a 1[ 1 5 ], a 2[ 4 DA BC 5 ] ) = A = 18 5 12 90 2+ 25 4 = 25

a 1 a 2 a 1-3 1 a 2 2 0 What is a mixed strategy Nash equilibrium of this game?

a 1 a 2 a 1 0 4 a 2 5 3 First translation: add the constant +3.

a 1 a 2 a 2 5 3 a 1 0 4 Second translation: switch rows.

a 1 a 2 a 2 5 3 a 1 0 4 A = a b + d c = B = d b = C = d c = D = d = p = C A = q = B A = EU 1 ( a 1 [p], a 2 [1 p], a 1 [q], a 2 [1 q] ) = DA BC A =

a 1 a 2 a 2 5 3 a 1 0 4 A = a b + d c = 6 B = d b = 1 C = d c = 4 D = d = 4 p = C A = 2 3 q = B A = 1 6 EU 1 ( a 1 [ 1 3 ], a 2[ 2 3 ], a 1[ 1 6 ], a 2[ 5 DA BC 6 ] ) = A = 20 6

Before we re finished, we need to transform it back to the original utilities! a 1 a 2 a 1-3 1 a 2 2 0 EU 1 ( a 1 [ 1 3 ], a 2[ 2 3 ], a 1[ 1 6 ], a 2[ 5 6 ] ) = 20 6 3 = 1 3

And now for a shortcut. We now have a proof that every two-by-two, two-person, zero-sum game has at least one pure or mixed-strategy Nash equilibrium. So we can construct a recipe for finding them: 1. First, use the maximin strategy to determine if the game has any pure strategy Nash equilibria. If it does, that s the solution to the game. The value of those equilibria is the value of the game.

2. If this strategy comes up empty, then, assume the game has the following structure: [q] [1 q] C 1 C 2 [p] R 1 a b [1 p] R 2 c d Unlike the standard form game needed for the proof, there are no restrictions on a, b, c, and d in this strategy. Our aim is to find values for p and q for which (pr 1, (1 p)r 2 ); (qc 1, (1 q)c 2 )

It turns out that: p = (d c)/[(a + d) (b + c)] and q = (d b)/[(a + d) (b + c)] The value of the game is ap + c(1 p). Why does this work? When Row plays her half of an equilibrium mixed-strategy pair, she fixes the value of the game no matter what Col does. And the same goes for Col. If this is true, the EU of Row s equilibrium strategy against C 1 must be the same as the EU of that strategy against C 2, and the same holds for Col. The EU for Row against C 1 is ap + (1 p)c and against C 2 is bp + (1 p)d. Since they are equal, we have ap + (1 p)c = bp + (1 p)d. Solving for p gives us the equation above, and we can do the same thing for q.

a 1 a 2 a 1-3 1 a 2 2 0 What is a mixed strategy Nash equilibrium of this game?

a 1 a 2 a 1-3 1 a 2 2 0 p = (d c)/[(a + d) (b + c)] = and q = (d b)/[(a + d) (b + c)] = The value of the game is ap + c(1 p) =

a 1 a 2 a 1-3 1 a 2 2 0 p = (d c)/[(a + d) (b + c)] = and q = (d b)/[(a + d) (b + c)] = (0 2) ( 3+0) (1+2) = 2 6 = 1 3 0 1 ( 3+0) (1+2) = 1 6 = 1 6 The value of the game is ap + c(1 p) = 3( 1 3 ) + 2(1 1 3 ) = 1 3