Iterated Strict Dominance in Pure Strategies

Similar documents
First Prev Next Last Go Back Full Screen Close Quit. Game Theory. Giorgio Fagiolo

Game Theory and Algorithms Lecture 2: Nash Equilibria and Examples

Game Theory and Rationality

C31: Game Theory, Lecture 1

Introduction to Game Theory Lecture Note 2: Strategic-Form Games and Nash Equilibrium (2)

Microeconomics for Business Practice Session 3 - Solutions

Computing Solution Concepts of Normal-Form Games. Song Chong EE, KAIST

Belief-based Learning

Weak Dominance and Never Best Responses

Game Theory for Linguists

Conservative Belief and Rationality

Industrial Organization Lecture 3: Game Theory

Notes on Coursera s Game Theory

Computing Minmax; Dominance

A Primer on Strategic Games

Computing Minmax; Dominance

Introduction to Game Theory

6.254 : Game Theory with Engineering Applications Lecture 8: Supermodular and Potential Games

1 Games in Normal Form (Strategic Form)

6.891 Games, Decision, and Computation February 5, Lecture 2

UC Berkeley Haas School of Business Game Theory (EMBA 296 & EWMBA 211) Summer 2016

EC3224 Autumn Lecture #04 Mixed-Strategy Equilibrium

Advanced Microeconomics

Game Theory Fall 2003

Static (or Simultaneous- Move) Games of Complete Information

Suggested solutions to the 6 th seminar, ECON4260

Lecture Notes on Game Theory

ANSWER KEY 2 GAME THEORY, ECON 395

Game Theory Lecture 10+11: Knowledge

Game Theory. Professor Peter Cramton Economics 300

Game Theory, Evolutionary Dynamics, and Multi-Agent Learning. Prof. Nicola Gatti

6.254 : Game Theory with Engineering Applications Lecture 7: Supermodular Games

MS&E 246: Lecture 12 Static games of incomplete information. Ramesh Johari

Computation of Efficient Nash Equilibria for experimental economic games

Satisfaction Equilibrium: Achieving Cooperation in Incomplete Information Games

EC3224 Autumn Lecture #03 Applications of Nash Equilibrium

Computing Equilibria of Repeated And Dynamic Games

Advanced Microeconomics

Algorithmic Game Theory and Applications. Lecture 4: 2-player zero-sum games, and the Minimax Theorem

Basics of Game Theory

Mixed Strategies. Krzysztof R. Apt. CWI, Amsterdam, the Netherlands, University of Amsterdam. (so not Krzystof and definitely not Krystof)

arxiv: v4 [cs.gt] 24 Jan 2018

Solving Extensive Form Games

Mathematical Economics - PhD in Economics

On revealed preferences in oligopoly games

Equilibrium Refinements

Equilibria in Games with Weak Payoff Externalities

Game Theory: Spring 2017

Tijmen Daniëls Universiteit van Amsterdam. Abstract

Political Economy of Institutions and Development: Problem Set 1. Due Date: Thursday, February 23, in class.

Ex Post Cheap Talk : Value of Information and Value of Signals

Game Theory: introduction and applications to computer networks

GAMES: MIXED STRATEGIES

Quantum Games. Quantum Strategies in Classical Games. Presented by Yaniv Carmeli

A (Brief) Introduction to Game Theory

Reputation and Conflict

MS&E 246: Lecture 4 Mixed strategies. Ramesh Johari January 18, 2007

WHEN ORDER MATTERS FOR ITERATED STRICT DOMINANCE *

Economics 3012 Strategic Behavior Andy McLennan October 20, 2006

Epistemic Game Theory

Non-zero-sum Game and Nash Equilibarium

Bargaining, Contracts, and Theories of the Firm. Dr. Margaret Meyer Nuffield College

Algorithms for cautious reasoning in games

Game Theory: Lecture 3

Game Theory. Monika Köppl-Turyna. Winter 2017/2018. Institute for Analytical Economics Vienna University of Economics and Business

Strategic Games: Social Optima and Nash Equilibria

Approval Voting: Three Examples

M340(921) Solutions Practice Problems (c) 2013, Philip D Loewen

Extensive Form Games with Perfect Information

Supplementary appendix to the paper Hierarchical cheap talk Not for publication

SF2972 Game Theory Written Exam with Solutions June 10, 2011

Theory Field Examination Game Theory (209A) Jan Question 1 (duopoly games with imperfect information)

Prisoner s Dilemma. Veronica Ciocanel. February 25, 2013

Bounded Rationality Lecture 2. Full (Substantive, Economic) Rationality

CSC304 Lecture 5. Game Theory : Zero-Sum Games, The Minimax Theorem. CSC304 - Nisarg Shah 1

Bargaining Efficiency and the Repeated Prisoners Dilemma. Bhaskar Chakravorti* and John Conley**

Basic Game Theory. Kate Larson. January 7, University of Waterloo. Kate Larson. What is Game Theory? Normal Form Games. Computing Equilibria

Review of topics since what was covered in the midterm: Topics that we covered before the midterm (also may be included in final):

EC319 Economic Theory and Its Applications, Part II: Lecture 2

General-sum games. I.e., pretend that the opponent is only trying to hurt you. If Column was trying to hurt Row, Column would play Left, so

Normal-form games. Vincent Conitzer

Introduction to Game Theory. Outline. Topics. Recall how we model rationality. Notes. Notes. Notes. Notes. Tyler Moore.

We set up the basic model of two-sided, one-to-one matching

Recap Social Choice Fun Game Voting Paradoxes Properties. Social Choice. Lecture 11. Social Choice Lecture 11, Slide 1

Chapter 2. Equilibrium. 2.1 Complete Information Games

Other-Regarding Preferences: Theory and Evidence

EconS Advanced Microeconomics II Handout on Subgame Perfect Equilibrium (SPNE)

Perfect Bayesian Equilibrium

WEAKLY DOMINATED STRATEGIES: A MYSTERY CRACKED

Multiple Equilibria in the Citizen-Candidate Model of Representative Democracy.

BELIEFS & EVOLUTIONARY GAME THEORY

Introduction to game theory LECTURE 1

Epistemic Implementation and The Arbitrary-Belief Auction Jing Chen, Silvio Micali, and Rafael Pass

INFORMATIONAL ROBUSTNESS AND SOLUTION CONCEPTS. Dirk Bergemann and Stephen Morris. December 2014 COWLES FOUNDATION DISCUSSION PAPER NO.

Lecture 1. Evolution of Market Concentration

האוניברסיטה העברית בירושלים

Strongly Consistent Self-Confirming Equilibrium

Known Unknowns: Power Shifts, Uncertainty, and War.

Periodic Strategies a New Solution Concept- Algorithm for non-trivial Strategic Form Games

Industrial Organization Lecture 7: Product Differentiation

Transcription:

Iterated Strict Dominance in Pure Strategies We know that no rational player ever plays strictly dominated strategies. As each player knows that each player is rational, each player knows that his opponents will not play strictly dominated strategies and thus all opponents know that effectively they are facing a "smaller" game. As rationality is a common knowledge, everyone knows that everyone knows that the game is effectively smaller. Thus everyone knows, that nobody will play strictly dominated strategies in the smaller game (and such strategies may indeed exist). Because it is a common knowledge that all players will perform this kind of reasoning again, the process can continue until no more strictly dominated strategies can be eliminated. 35

IESDS The previous reasoning yields the Iterated Elimination of Strictly Dominated Strategies (IESDS): Define a sequence D 0 i, D 1 i, D 2 i,... of strategy sets of player i. (Denote by G k DS the game obtained from G by restricting to Dk i, i N.) 1. Initialize k = 0 and D 0 i = S i for each i N. 2. For all players i N: Let D k+1 be the set of all pure strategies of i D k that are not strictly dominated in G k i DS. 3. Let k := k + 1 and go to 2. We say that s i S i survives IESDS if s i D k i for all k = 0, 1, 2,... Definition 10 A strategy profile s = (s 1,..., s n ) S is an IESDS equilibrium if each s i survives IESDS. A game is IESDS solvable if it has a unique IESDS equilibrium. Remark: If all S i are finite, then in 2. we may remove only some of the strictly dominated strategies (not necessarily all). The result is not affected by the order of elimination since strictly dominated strategies remain strictly dominated even after removing some other strictly dominated strategies. 36

37 IESDS Examples In the Prisoner s dilemma: C S C 5, 5 0, 20 S 20, 0 1, 1 (C, C) is the only one surviving the first round of IESDS. In the Battle of Sexes: O F O 2, 1 0, 0 F 0, 0 1, 2 all strategies survive all rounds (i.e. IESDS anything may happen, sorry)

38 A Bit More Interesting Example IESDS on greenboard! L C R L 4, 3 5, 1 6, 2 C 2, 1 8, 4 3, 6 R 3, 0 9, 6 2, 8

39 Political Science Example: Median Voter Theorem Hotelling (1929) and Downs (1957) N = {1, 2} S i = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10} (political and ideological spectrum) 10 voters belong to each position (Here 10 means ten percent in the real-world) Voters vote for the closest candidate. If there is a tie, then 1 2 got to each candidate Payoff: The number of voters for the candidate, each candidate (selfishly) strives to maximize this number

Political Science Example: Median Voter Theorem 1 and 10 are the (only) strictly dominated strategies D 1 = 1 D1 = {2,..., 9} 2 in G 1, 2 and 9 are the (only) strictly dominated strategies DS D 2 = 1 D2 = {3,..., 8} 2... only 5, 6 survive IESDS 40

41 Belief & Best Response IESDS eliminated apparently unreasonable behavior (leaving "reasonable" behavior implicitly untouched). What if we rather want to actively preserve reasonable behavior? What is reasonable?... what we believe is reasonable :-). Intuition: Imagine that your colleague did something stupid What would you ask him? Usually something like "What were you thinking?" The colleague may respond with a reasonable description of his belief in which his action was (one of) the best he could do (You may of course question reasonableness of the belief) Let us formalize this type of reasoning...

42 Belief & Best Response Definition 11 A belief of player i is a pure strategy profile s i S i of his opponents. Definition 12 A strategy s i S i of player i is a best response to a belief s i S i if u i (s i, s i ) u i (s i, s i) for all s i S i Claim 3 A rational player who believes that his opponents will play s i S i always chooses a best response to s i S i. Definition 13 A strategy s i S i is never best response if it is not a best response to any belief s i S i. A rational player never plays any strategy that is never best response.

43 Best Response vs Strict Dominance Proposition 1 If s i is strictly dominated for player i, then it is never best response. The opposite does not have to be true in pure strategies: X Y A 1, 1 1, 1 B 2, 1 0, 1 C 0, 1 2, 1 Here A is never best response but is strictly dominated neither by B, nor by C.

Elimination of Stupid Strategies = Rationalizability Using similar iterated reasoning as for IESDS, strategies that are never best response can be iteratively eliminated. Define a sequence R 0, R 1, R 2,... of strategy sets of player i. i i i (Denote by G k the game obtained from G by restricting to R k, i N.) Rat i 1. Initialize k = 0 and R 0 i = S i for each i N. 2. For all players i N: Let R k+1 be the set of all strategies of R k i i that are best responses to some beliefs in G k Rat. 3. Let k := k + 1 and go to 2. We say that s i S i is rationalizable if s i R k i for all k = 0, 1, 2,... Definition 14 A strategy profile s = (s 1,..., s n ) S is a rationalizable equilibrium if each s i is rationalizable. We say that a game is solvable by rationalizability if it has a unique rationalizable equilibrium. (Warning: For some reasons, rationalizable strategies are almost always defined using mixed strategies!) 44

45 Rationalizability Examples In the Prisoner s dilemma: C S C 5, 5 0, 20 S 20, 0 1, 1 (C, C) is the only rationalizable equilibrium. In the Battle of Sexes: all strategies are rationalizable. O F O 2, 1 0, 0 F 0, 0 1, 2

46 Cournot Duopoly G = (N, (S i ) i N, (u i ) i N ) N = {1, 2} S i = [0, ) u 1 (q 1, q 2 ) = q 1 (κ q 1 q 2 ) q 1 c 1 = (κ c 1 )q 1 q 2 1 q 1q 2 u 2 (q 1, q 2 ) = q 2 (κ q 2 q 1 ) q 2 c 2 = (κ c 2 )q 2 q 2 2 q 2q 1 Assume for simplicity that c 1 = c 2 = c and denote θ = κ c. What is a best response of player 1 to a given q 2? Solve δu 1 δq 1 = θ 2q 1 q 2 = 0, which gives that q 1 = (θ q 2 )/2 is the only best response of player 1 to q 2. Similarly, q 2 = (θ q 1 )/2 is the only best response of player 2 to q 1. Since q 2 0, we obtain that q 1 is never best response iff q 1 > θ/2. Similarly q 2 is never best response iff q 2 > θ/2. Thus R 1 1 = R1 2 = [0, θ/2].

47 Cournot Duopoly G = (N, (S i ) i N, (u i ) i N ) N = {1, 2} S i = [0, ) u 1 (q 1, q 2 ) = q 1 (κ q 1 q 2 ) q 1 c 1 = (κ c 1 )q 1 q 2 1 q 1q 2 u 2 (q 1, q 2 ) = q 2 (κ q 2 q 1 ) q 2 c 2 = (κ c 2 )q 2 q 2 2 q 2q 1 Assume for simplicity that c 1 = c 2 = c and denote θ = κ c. Now, in G 1 Rat, we still have that q 1 = (θ q 2 )/2 is the best response to q 2, and q 2 = (θ q 1 )/2 the best resp. to q 1 Since q 2 R 1 2 = [0, θ/2], we obtain that q 1 is never best response iff q 1 [0, θ/4) Similarly q 2 is never best response iff q 2 [0, θ/4) Thus R 2 1 = R2 2... = [θ/4, θ/2].

Cournot Duopoly (cont.) G = (N, (S i ) i N, (u i ) i N ) N = {1, 2} S i = [0, ) u 1 (q 1, q 2 ) = q 1 (κ q 1 q 2 ) q 1 c 1 = (κ c 1 )q 1 q 2 1 q 1q 2 u 2 (q 1, q 2 ) = q 2 (κ q 2 q 1 ) q 2 c 2 = (κ c 2 )q 2 q 2 2 q 2q 1 Assume for simplicity that c 1 = c 2 = c and denote θ = κ c. In general, after 2k iterations we have R 2k i r k = (θ l k 1 )/2 for k 1 l k = (θ r k )/2 for k 1 and l 0 = 0 Solving the recurrence we obtain l k = θ/3 ( 1 4) k θ/3 = R 2k i = [l k, r k ] where r k = θ/3 + ( 1 4) k 1 θ/6 Hence, lim k l k = lim k r k = θ/3 and thus (θ/3, θ/3) is the only rationalizable equilibrium. 48

49 Cournot Duopoly (cont.) G = (N, (S i ) i N, (u i ) i N ) N = {1, 2} S i = [0, ) u 1 (q 1, q 2 ) = q 1 (κ q 1 q 2 ) q 1 c 1 = (κ c 1 )q 1 q 2 1 q 1q 2 u 2 (q 1, q 2 ) = q 2 (κ q 2 q 1 ) q 2 c 2 = (κ c 2 )q 2 q 2 2 q 2q 1 Assume for simplicity that c 1 = c 2 = c and denote θ = κ c. Are q i = θ/3 Pareto optimal? NO! u 1 (θ/3, θ/3) = u 2 (θ/3, θ/3) = θ 2 /9 but u 1 (θ/4, θ/4) = u 2 (θ/4, θ/4) = θ 2 /8

50 IESDS vs Rationalizability in Pure Strategies Theorem 15 Assume that S is finite. Then for all k we have that R k D k. That is, i i in particular, all rationalizable strategies survive IESDS. The opposite inclusion does not have to be true in pure strategies: X Y A 1, 1 1, 1 B 2, 1 0, 1 C 0, 1 2, 1 Recall that A is never best response but is strictly dominated by neither B, nor C. That is, A survives IESDS but is not rationalizable.

Proof of Theorem 15 By induction on k. For k = 0 we have that R 0 = S i i = D 0 by definition. i Now assume that R k D k for some k 0. i i We prove that R k+1 D k+1 i i by showing the following: For all s R k D k : If s D k+1, then s R k+1 i i i i i i i Let us fix s R k such that s D k+1. By definition, it suffices to i i i i prove that for every s k R k there exists s k R i i i i such that u i (s k i, s k i ) > u i(s i, sk i ) (1) (In words, for every possible behavior of opponents of player i in G k Rat, player i has a strictly better strategy than s i in G k Rat ) As s D k+1, the strategy s must be strictly dominated in G k i i i DS by a strategy s i. That is for all s k D k R k we have i i i u i ( s i, s k i ) > u i(s i, sk i ) (2) (Now note that if s i R k D k, then we are done. Indeed, it suffices to put i i s k := s i i and the equation (1) will be satisfied for all s k D k R k. However, i i i it does not have to be the case that s i R k ) i 51

52 Proof of Theorem 15 (cont.) Clearly, there is l k such that s i R l i. (Note that s i does not have to strictly dominate s i in G l Rat since R l i may be larger than D k i ) Recall that we need to find s k R k for every given s k R k so that i i i i the inequality (1) holds. (That is, s k may be different for different s k s) i i Let us fix s k i R k i D k i. Let s k R l be a strategy maximizing u i i i (s i, s k i ) over all s i R l i. In particular, we obtain the inequality (1): u i (s k i, s k i ) u i( s i, s k i ) > u i(s i, sk i ) Finally, note that s k R k follows immediately from the fact that s k is a i i i best response to s k in all games G l i Rat,..., Gk Rat (Indeed, even after removing some strategies (other than s k i remains a best resp. to s k ) i and s k i ), sk i

53 Pinning Down Beliefs Nash Equilibria Criticism of previous approaches: Strictly dominant strategy equilibria often do not exist IESDS and rationalizability may not remove any strategies Typical example is Battle of Sexes: O F O 2, 1 0, 0 F 0, 0 1, 2 Here all strategies are equally reasonable according to the above concepts. But are all strategy profiles really equally reasonable?

54 Pinning Down Beliefs Nash Equilibria O F O 2, 1 0, 0 F 0, 0 1, 2 Assume that each player has a belief about strategies of other players. By Claim 3, each player plays a best response to his beliefs. Is (O, F) as reasonable as (O, O) in this respect? Note that if player 1 believes that player 2 plays O, then playing O is reasonable, and if player 2 believes that player 1 plays F, then playing F is reasonable. But such beliefs cannot be correct together! (O, O) can be obtained as a profile where each player plays the best response to his belief and the beliefs are correct.

55 Nash Equilibrium Nash equilibrium can be defined as a set of beliefs (one for each player) and a strategy profile in which every player plays a best response to his belief and each strategy of each player is consistent with beliefs of his opponents. A usual definition is following: Definition 16 A pure-strategy profile s = (s 1,..., s n) S is a (pure) Nash equilibrium if s i is a best response to s i for each i N, that is u i (s i, s i ) u i(s i, s i ) for all s i S i and all i N Note that this definition is equivalent to the previous one in the sense that s i may be considered as the (consistent) belief of player i to which he plays a best response s i

56 Nash Equilibria Examples In the Prisoner s dilemma: C S C 5, 5 0, 20 S 20, 0 1, 1 (C, C) is the only Nash equilibrium. In the Battle of Sexes: O F O 2, 1 0, 0 F 0, 0 1, 2 only (O, O) and (F, F) are Nash equilibria. In Cournot Duopoly, (θ/3, θ/3) is the only Nash equilibrium. (Best response relations: q 1 = (θ q 2 )/2 and q 2 = (θ q 1 )/2 are both satisfied only by q 1 = q 2 = θ/3)

Example: Stag Hunt Story: Two (in some versions more than two) hunters, players 1 and 2, can each choose to hunt stag (S) = a large tasty meal hare (H) = also tasty but small Hunting stag is much more demanding and forces of both players need to be joined (hare can be hunted individually) Strategy-form game model: N = {1, 2}, S 1 = S 2 = {S, H}, the payoff: S H S 5, 5 0, 3 H 3, 0 3, 3 Two NE: (S, S), and (H, H), where the former Pareto dominates the latter! Which one is more reasonable? 57

Example: Stag Hunt Strategy-form game model: N = {1, 2}, S 1 = S 2 = {S, H}, the payoff: S H S 5, 5 0, 3 H 3, 0 3, 3 Two NE: (S, S), and (H, H), where the former Pareto dominates the latter! Which one is more reasonable? If each player believes that the other one will go for hare, then (H, H) is a reasonable outcome a society of individualists who do not cooperate at all. If each player believes that the other will cooperate, then this anticipation is self-fulfilling and results in what can be called a cooperative society. This is supposed to explain that in real world there are societies that have similar endowments, access to technology and physical environment but have very different achievements, all because of self-fulfilling beliefs (or norms of behavior). 58

59 Example: Stag Hunt Strategy-form game model: N = {1, 2}, S 1 = S 2 = {S, H}, the payoff: S H S 5, 5 0, 3 H 3, 0 3, 3 Two NE: (S, S), and (H, H), where the former Pareto dominates the latter! Which one is more reasonable? Another point of view: (H, H) is less risky Minimum secured by playing S is 0 as opposed to 3 by playing H (We will get to this minimax principle later) So it seems to be rational to expect (H, H) (?)

60 Nash Equilibria vs Previous Concepts Theorem 17 1. If s is a strictly dominant strategy equilibrium, then it is the unique Nash equilibrium. 2. Each Nash equilibrium is rationalizable and survives IESDS. 3. If S is finite, neither rationalizability, nor IESDS creates new Nash equilibria. Proof: Homework! Corollary 18 Assume that S is finite. If rationalizability or IESDS result in a unique strategy profile, then this profile is a Nash equilibrium.

Interpretations of Nash Equilibria Except the two definitions, usual interpretations are following: When the goal is to give advice to all of the players in a game (i.e., to advise each player what strategy to choose), any advice that was not an equilibrium would have the unsettling property that there would always be some player for whom the advice was bad, in the sense that, if all other players followed the parts of the advice directed to them, it would be better for some player to do differently than he was advised. If the advice is an equilibrium, however, this will not be the case, because the advice to each player is the best response to the advice given to the other players. When the goal is prediction rather than prescription, a Nash equilibrium can also be interpreted as a potential stable point of a dynamic adjustment process in which individuals adjust their behavior to that of the other players in the game, searching for strategy choices that will give them better results. 61