Economics 3012 Strategic Behavior Andy McLennan October 20, 2006

Similar documents
EC3224 Autumn Lecture #03 Applications of Nash Equilibrium

Bayesian Games and Mechanism Design Definition of Bayes Equilibrium

University of Warwick, Department of Economics Spring Final Exam. Answer TWO questions. All questions carry equal weight. Time allowed 2 hours.

Game Theory. Monika Köppl-Turyna. Winter 2017/2018. Institute for Analytical Economics Vienna University of Economics and Business

6.207/14.15: Networks Lecture 11: Introduction to Game Theory 3

Introduction to Game Theory Lecture Note 2: Strategic-Form Games and Nash Equilibrium (2)

1 The General Definition

Perfect Bayesian Equilibrium. Definition. The single-crossing property. This is a draft; me with comments, typos, clarifications, etc.

Theory of Auctions. Carlos Hurtado. Jun 23th, Department of Economics University of Illinois at Urbana-Champaign

EC319 Economic Theory and Its Applications, Part II: Lecture 2

Economics 703 Advanced Microeconomics. Professor Peter Cramton Fall 2017

Game Theory Review Questions

Lecture Note II-3 Static Games of Incomplete Information. Games of incomplete information. Cournot Competition under Asymmetric Information (cont )

Lecture 6 Games with Incomplete Information. November 14, 2008

Game Theory. Wolfgang Frimmel. Perfect Bayesian Equilibrium

Game Theory and Algorithms Lecture 2: Nash Equilibria and Examples

Game theory lecture 4. September 24, 2012

Lecture Notes on Game Theory

Basics of Game Theory

Game Theory. Strategic Form Games with Incomplete Information. Levent Koçkesen. Koç University. Levent Koçkesen (Koç University) Bayesian Games 1 / 15

6.207/14.15: Networks Lecture 10: Introduction to Game Theory 2

SF2972 Game Theory Exam with Solutions March 15, 2013

MS&E 246: Lecture 12 Static games of incomplete information. Ramesh Johari

UC Berkeley Haas School of Business Game Theory (EMBA 296 & EWMBA 211) Summer 2016

Multiple Equilibria in the Citizen-Candidate Model of Representative Democracy.

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program June 2016

Game Theory: Spring 2017

Advanced Microeconomics

Introduction to Game Theory

Microeconomics for Business Practice Session 3 - Solutions

Lecture 4. 1 Examples of Mechanism Design Problems

Extensive Form Games with Perfect Information

Economics 201A Economic Theory (Fall 2009) Extensive Games with Perfect and Imperfect Information

Introduction to Game Theory

Review of topics since what was covered in the midterm: Topics that we covered before the midterm (also may be included in final):

Microeconomics II Lecture 4: Incomplete Information Karl Wärneryd Stockholm School of Economics November 2016

Solution to Tutorial 9

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program May 2012

. Introduction to Game Theory Lecture Note 8: Dynamic Bayesian Games. HUANG Haifeng University of California, Merced

Two hours UNIVERSITY OF MANCHESTER SCHOOL OF COMPUTER SCIENCE. Date: Thursday 17th May 2018 Time: 09:45-11:45. Please answer all Questions.

4: Dynamic games. Concordia February 6, 2017

Outline for Static Games of Incomplete Information

Mechanism Design: Bayesian Incentive Compatibility

Basic Game Theory. Kate Larson. January 7, University of Waterloo. Kate Larson. What is Game Theory? Normal Form Games. Computing Equilibria

ANSWER KEY 2 GAME THEORY, ECON 395

Extensive games (with perfect information)

Game Theory: Spring 2017

Cournot Competition Under Asymmetric Information

July Framing Game Theory. Hitoshi Matsushima. University of Tokyo. February 2018

Lecture Notes on Game Theory

Equilibrium Refinements

6.254 : Game Theory with Engineering Applications Lecture 13: Extensive Form Games

Theory Field Examination Game Theory (209A) Jan Question 1 (duopoly games with imperfect information)

Solving Extensive Form Games

On revealed preferences in oligopoly games

The Revenue Equivalence Theorem 1

Backwards Induction. Extensive-Form Representation. Backwards Induction (cont ) The player 2 s optimization problem in the second stage

Lecture 10: Mechanism Design

Algorithmic Game Theory Introduction to Mechanism Design

Microeconomics. 2. Game Theory

EconS Microeconomic Theory II Midterm Exam #2 - Answer Key

Advanced Microeconomics

APPLIED MECHANISM DESIGN FOR SOCIAL GOOD

Perfect Bayesian Equilibrium

Mechanism Design II. Terence Johnson. University of Notre Dame. Terence Johnson (ND) Mechanism Design II 1 / 30

On Renegotiation-Proof Collusion under Imperfect Public Information*

Government 2005: Formal Political Theory I

Recap Social Choice Fun Game Voting Paradoxes Properties. Social Choice. Lecture 11. Social Choice Lecture 11, Slide 1

Sequential Bidding in the Bailey-Cavallo Mechanism

EconS Advanced Microeconomics II Handout on Subgame Perfect Equilibrium (SPNE)

Introduction to Game Theory

Algorithmic Game Theory and Applications

Oligopoly. Oligopoly. Xiang Sun. Wuhan University. March 23 April 6, /149

C31: Game Theory, Lecture 1

Robust Mechanism Design and Robust Implementation

SF2972 Game Theory Written Exam with Solutions June 10, 2011

October 16, 2018 Notes on Cournot. 1. Teaching Cournot Equilibrium

CS 573: Algorithmic Game Theory Lecture date: April 11, 2008

Msc Micro I exam. Lecturer: Todd Kaplan.

1 Games in Normal Form (Strategic Form)

EconS Sequential Competition

Answer Key for M. A. Economics Entrance Examination 2017 (Main version)

Deceptive Advertising with Rational Buyers

9 A Class of Dynamic Games of Incomplete Information:

EC319 Economic Theory and Its Applications, Part II: Lecture 7

Industrial Organization Lecture 7: Product Differentiation

Game Theory for Linguists

Symmetric Separating Equilibria in English Auctions 1

Microeconomic Theory (501b) Problem Set 10. Auctions and Moral Hazard Suggested Solution: Tibor Heumann

Repeated Downsian Electoral Competition

Limit pricing models and PBE 1

Chapter 2. Equilibrium. 2.1 Complete Information Games

Economics 703: Microeconomics II Modelling Strategic Behavior 1

Virtual Robust Implementation and Strategic Revealed Preference

Game Theory Lecture 10+11: Knowledge

The Condorcet Jur(ies) Theorem

CPS 173 Mechanism design. Vincent Conitzer

Static (or Simultaneous- Move) Games of Complete Information

Refinements - change set of equilibria to find "better" set of equilibria by eliminating some that are less plausible

Microeconomics Qualifying Exam

Transcription:

Economics 301 Strategic Behavior Andy McLennan October 0, 006 Lecture 11 Topics Problem Set 9 Extensive Games of Imperfect Information An Example General Description Strategies and Nash Equilibrium Beliefs and Behavior Strategies Weak Sequential Equilibrium Announcements I will be at a conference next week, hence no office hours. Next Friday there will be a guest lecture by Professor Rohan Pitchford. 1

Problem Set 9 Exercise 87.1 of Osborne Problem Statement: Two firms compete as per Cournot. The inverse demand curve is P(Q) = α Q where Q = q 1 + q is aggregate supply. Firm i s profit is q i P(Q) c i q i where q i is the firm s output and c i is its cost. Firm 1 s cost is c, and firm s cost may be either c L or c H where c H > c L > 0. Firm 1 believes that the probability of c L is θ, where 0 < θ < 1. Firm knows both its own cost and firm 1 s belief. Find the Nash equilibria for those parameter values where all outputs are positive, and compare with the equilibria of the games in which c L is known for sure and c H is known for sure.

Analysis: An equilibrium is a triple (q 1, q L, q H ) where q 1 is firm 1 s output and q L and q H are the outputs of firm when the marginal cost is c L and c H respectively. The profit of firm when its marginal cost is c L is q L (α q 1 q L ) c L q L. Differentiating this with respect to q L, setting the derivative equal to 0, and solving for q L gives q L = α q 1 c L Similarly, the best response to q 1 when the marginal cost is c H is. q H = α q 1 c H. 3

The expected profit of firm 1 is θ(q 1 (α q 1 q L ) cq 1 )+(1 θ)(q 1 (α q 1 q H ) cq 1 ) = q 1 (α q 1 (θq L + (1 θ)q H )) cq 1. Proceeding as above, we find that the best response for firm 1 is q 1 = α (θq L + (1 θ)q H ) c. To solve for the equilibrium we substitute the best response values of q L and q H : q 1 = α (θ α q 1 c L + (1 θ) α q 1 c H ) c. Isolating q 1 gives q 1 = α + (θc L + (1 θ)c H ) c. 3 4

Substituting this expression for q 1 into the formulas for q L and q H gives and q L = α α+(θc L+(1 θ)c H ) c 3 c L = α + c c L 1 (1 θ)(c H c L ) 3 q H = α α+(θc L+(1 θ)c H ) c 3 c H = α + c c H 1 θ(c L c H ). 3 The Nash equilibrium of the game in which firm 1 knows that the cost of firm is c L is obtained by taking q 1 and q L in the case θ = 1; in comparison with this, q 1 is somewhat higher and q L is somewhat lower. The Nash equilibrium of the game in which firm 1 knows that the cost of firm is c H is obtained by taking q 1 and q H in the case θ = 0; in comparison with this, q 1 is somewhat lower and q H is somewhat higher. 5

Exercise 307.1 of Osborne Problem Statement: Two citizens each vote for candidate 1, vote for candidate, or abstain. The candidate receiving the most votes is elected, with ties broken by coin flips. Both citizens prefer that candidate 1 be elected if the state is A and that candidate be elected if the state is B. Citizen 1 knows which state has occurred, and citizen believes that the probability of state A is 0.9. Formulate this as a Bayesian game, find its pure Nash equilibria, show that one involves a weakly dominated strategy, and explain the phrase swing voter s curse. 6

Analysis: To display this situation as a Bayesian game we describe each of the items in Definition 79.1: The set of players is the set of citizens {1, }. The set of states is {A, B}. The set of actions for player i is {v i1, a i, v i } whose elements represent voting for 1, abstaining, and voting for. The signal of citizen 1 is the state. Citizen receives the same signal in both states. Citizen 1 is completely informed, and the belief of citizen is that the probability that the state is A is 0.9. 7

The (common) payoffs in the two states are given in the tables: A v 1 a v 1 v 11 1 1 1 a 1 1 0 1 v 1 0 0 B v 1 a v 1 v 11 0 0 1 a 1 0 1 1 v 1 1 1 8

Equilibria: In the first pure equilibrium: Citizen 1 votes for candidate 1 in state A and candidate in state B. Citizen abstains. In the second pure equilibrium: Citizen votes for candidate 1. Citizen 1 abstains in state A and votes for candidate in state B. Abstaining in state A is weakly dominated by voting for A. The phrase swing voter s curse recalls the phrase winner s curse from auction theory. In both cases there is a situation in which doing what seems likely to be right, based only on one s own information, can lead to a bad outcome. 9

To see that these are the only pure equilibria consider that: For citizen to vote for candidate is strictly dominated by voting for candidate 1. The unique best response for citizen 1 when citizen abstains is given by the first equilibrium. If citizen votes for candidate 1, citizen must vote for candidate in state B, and can either abstain (as in the equilibrium) or vote for candidate 1 in state A. If citizen 1 is voting for candidate 1 in state A and for candidate in state B, then the unique best response for citizen is to abstain. 10

Exercise 310. of Osborne Problem Statement: Two bidders have valuations that are independently and uniformly distributed on [0, 1]. They compete in a second price sealed bid auction with reserve price r, meaning that if the bids are b i and b j with b i > b j, then the object is unsold if b i < r and awarded to i if b i r, with i paying max{r, b j }. Show that bidding one s valuation is weakly dominant, compute the expected revenue for the auctioneer when both agents do so, and find the value of r that maximizes this expected revenue. 11

Analysis: In comparison with bidding v i, a bid b i > v i can only change the outcome if v i < max{r, b j } < b i, in which case the result is that agent i pays more than v i for the object. Bidding b i < v i can change the outcome, in comparison with bidding v i, only if v i > max{r, b j } > b i, in which case the result is that agent i does not win the object, but could have done so profitably. Therefore setting b i = v i is weakly dominant. 1

Expected Revenue: The revenue corresponding to the case min{v 1, v } r is 1 r 1 r min{v 1, v } dv dv 1 = = = 1 r 1 r 1 r ( v1 r v dv + 1 ) v 1 dv dv 1 v 1 1 (v 1 r ) + (1 v 1 )v 1 dv 1 v 1 1 v 1 1 r dv 1 = 1 (1 r ) 1 6 (1 r3 ) 1 r (1 r) = 1 3 r + 3 r3. 13

With probability r both valuations are below r, in which case there is no revenue. With probability r(1 r) we have v 1 > r > v, in which case the revenue is r. Similarly, v > r > v 1 occurs with probability r(1 r), and the revenue in this case is also r. Total expected revenue is r (1 r) + 1 3 r + 3 r3 = 1 3 + r 4 3 r3. Setting the derivative of this equal to zero gives 0 = r 4r = r(1 r). The extreme point is r = 1/, and the second derivative test shows that this is a maximum. Maximized revenue is 1 3 + 1 4 4 3 1 8 = 5 1. This is greater than the expected revenue of 1 3 generated by the second price auction, which corresponds to the case r = 0. 14

Extensive Games with Imperfect Information We begin with Example 315.1, which is simplified version of poker: [ 1 1 Chance High( 1 ] ) Low(1 ) See 1 1 See [ ] 1 1 Raise Raise Pass Meet Pass Meet [ ] [ ] [ ] [ ] 1 1 1 1 First Chance or Nature gives player 1 either a high card or a low card. Player 1 then either sees or raises., If Player 1 raises, player decides whether to pass or meet without knowing whether player 1 s card is high or low. 15

An extensive game (with imperfect information and chance moves) has the following features A set of players. A set of terminal histories. A terminal history cannot be a proper subhistory of another terminal history. A player function assigning either a player or chance to every proper subhistory (including the null history ) of a terminal history. Probabilities of moves chosen by Chance. For each player, a partition (called the information partition) of the set of subhistories assigned to that player. for each player, preferences over lotteries over terminal histories. 16

In Example 315.1: The set of players is {1, }. The set of terminal histories is {HS, HRP, HRM, LS, LRP, LRM} where H = High, L = Low, S = See, R = Raise, P = Pass, and M = Meet. The player function P is given by P( ) = Chance, P(H) = 1, P(L) = 1, P(HR) =, and P(LR) =. The probability that Chance chooses High and the probability that Chance chooses Low are both 1. Player 1 information partition is {{H}, {L}}, and player s information partition is {HR, LR}. The preferences over lotteries are given by taken expected payoffs using the numbers in the payoff vectors attached to terminal histories. 17

This is a very general model, encompassing most of the models of games studied earlier in the course. Any strategic form game can be viewed as an extensive game with imperfect information. ( B S ) B (, 1) (0, 0) S (0, 0) (, 1) B 1 B S B S [ ] [ ] [ ] [ ] 0 0 1 1 0 0 In the extensive form version the players choose actions in some sequence, but when a player chooses she doesn t know what others have already chosen. S 18

Any extensive game of perfect information is an extensive game of the sort discussed here. In an extensive game of perfect information each player s information partition is trivial, consisting entirely of sets with a single element. To represent a Bayesian game as an extensive game of imperfect information we first have Chance choose a state. The players then take turns choosing actions, with their information partitions given by the maps from states to types. In Example 84.1 the set of states is Ω = {α, β, γ}. The probability of α is 9 13, the probability of β is 3 13, and the probability of γ is 1 13. Player 1 gets one signal in states α and β and a different signal in state γ. Player gets one signal in states β and γ and a different signal in state α. 19

Both players choose between left and right: α β ( l r ) ( l r ) L (, ) (0, 0) L (, ) (0, 0) R (3, 0) (1, 1) R (0, 0) (1, 1) γ ( l r ) L (, ) (0, 0). R (0, 0) (1, 1) This is the extensive form version of this game. Chance α( 9 13 ) β( 3 13 ) γ( 1 13 ) 1 1 L R L R L R l r l r l r l r l r l r [ ][ ] [ ][ ] [ ][ ] [ ][ ] [ ][ ] [ ][ ] 0 3 1 0 0 1 0 0 1 0 0 1 0 0 1 0 0 1 0

Strategies and Nash Equilibrium At an information set of a player, the set of actions that can be chosen must be the same at every node in the information set. If I i is an information set for player i, let A(I i ) be the set of actions that can be chosen at I i. A pure strategy for player i is a function that assigns an element of A(I i ) to each information set I i in player i s information partition. A mixed strategy for player i is a probability distribution over player i s pure strategies. A mixed strategy profile α is a Nash equilibrium if, for each player i and every mixed strategy α i for player i, player i s expected payoff at the profile α is at least as large as her expected payoff at the profile (α i, α i ). 1

Example 317.1 In the following game a Challenger can: enter ready for a fight, enter unready, or stay out. R Challenger U Incumbent A F A F Out [ ] [ ] [ ] [ ] 3 1 4 0 3 1 3 The Incumbent [ ] 4 prefers to acquiesce if the entrant is ready, and prefers to fight if the entrant is unready. The Challenger would like to enter (preferably unready) if there is no fight and stay out if there would be a fight.

It is usually easier to find the Nash equilibria if we pass to the strategic form: A F R (3, 3) (1, 1) U (4, 3) (0, ). Out (, 4) (, 4) There is a pure equilibrium in which the Challenger plays U and the entrant acquiesces. There also are Nash equilibria in which the Challenger always stays out and the Incumbent fights with probability 1/ or more. These equilibria are implausible. After either R or U the Incumbent does better to acquiesce. The Challenger knows this and should confidently expect acquiescence. 3

The problem is that Nash equilibrium, applied to an extensive game of perfect information, does not require rationality in every contingency. For extensive games of perfect information the concept of subgame perfect equilibrium handled this problem. If there was only one way to enter, that would work here as well: Challenger A R [ ] [ ] 3 1 3 1 Incumbent F Out [ ] 4 Incumbent A Challenger U Out F [ ] [ ] 4 0 3 [ ] 4 4

We would like to extend the ideas underlying subgame perfect equilibrium to this example, saying that for any belief that the Incumbent might hold once the information set has occurred, her expected payoff is greater if she acquiesces. To develop a general concept that allows us to express these ideas we need to: Introduce beliefs. A belief at an information set is a probability distribution over the partial histories in the information set. Introduce a different notion of strategy that allows us to compute expected payoffs everywhere in the tree. 5

Behavior Strategies A behavior strategy β i for a player i is an assignment of a probability β i (a) to each action that is available at some information set where i chooses. Formally, we can think of β i as a function β i : A(I i ) [0, 1], where the union is over all information sets where i chooses, such that β i (a) = 1 a A(I i ) for each such information set I i. When you play a game you can either: Formulate a plan for the entire game. A mixed strategy is a probability distribution over such plans. Choose actions one at a time, as described by a behavior strategy. 6

A behavior strategy profile is an n-tuple β = (β 1,..., β n ) specifying a behavior strategy for each agent. Given a behavior strategy profile β, one can compute an agent s expected payoff after any partial history. R Challenger U Incumbent A F A F [ ] 3 [ ] 1 [ ] 4 3 1 3 F [ 0 Out Ch ger B ] [ ] 1 4 [ ] 4 Suppose β 1 (R) = 1/3, β 1 (U) = 1/, β 1 (Out) = 1/6, β 1 (F) = 1/4, β 1 (B) = 3/4, β (A) = /5, and β (F) = 3/5. The Incumbent s expected payoff after U is 5 3 + 3 5 (1 4 + 3 4 4) = 33 10. 7

Systems of Beliefs A system of beliefs µ i for a player i is an assignment of a probability µ i (h) to each history in each information set where i chooses. Formally, we can think of µ i as a function µ i : I i [0, 1], where the union is over all information sets where i chooses, such that h I i µ i (h) = 1 for each such information set I i. A belief system is an n-tuple µ = (µ 1,..., µ n ) specifying a system of beliefs for each agent. An assessment is a pair (β, µ) in which β is a behavior strategy profile and µ is a belief system. 8

Given an assessment (β, µ), one can compute an agent s expected payoff at any of her information sets. R Challenger U Incumbent A F A F [ ] 3 [ ] 1 [ ] 4 3 1 3 Let β be as before. F [ 0 Out Ch ger B ] [ ] 1 4 [ ] 4 The Incumbent s expected payoff after U is 33 10. The Incumbent s expected payoff after R is 5 3 + 3 5 1 = 9 5. The Challenger belief is that µ (R) = 3/7 and µ (U) = 4/7. The Challenger s expected payoff at her information set is 3 7 9 5 + 4 7 33 10 = 186 70. 9

Weak Sequential Equilibrium An assessment (β, µ) is sequentially rational if, for each agent i, each information set I i, and each alternative behavior strategy β i for i, the expected payoff at I i under (β, µ) is at least as large as the expected payoff at I i under ((β i, β i), µ). An assessment (β, µ) is weakly consistent if, for each agent i, each information set I i for i, and each partial history h I i, if the probability Pr(I i according to β) = h I i Pr(h according to β) of reaching I i is positive, then player i s belief at I i must assign probability to h. Pr(h according to β) Pr(I i according to β) (β, µ) is a weak sequential equilibrium if it is sequentially rational and consistent. 30

Consistency A behavior strategy profile is totally mixed if every action at every information set is assigned positive probability. If (β, µ) is weakly consistent and β is totally mixed, then every partial history occurs with positive probability, so for each agent i, each information set I i at which i chooses, and each h I i, agent i s belief at I i has to assign probability to h. Pr(h according to β) Pr(I i according to β) That is, if β is totally mixed, there is a unique belief system µ such that (β, µ) is weakly consistent. 31

An assessment (β, µ) is consistent if there is are sequences {β r } r=1 and {µ r } r=1 such that: (a) for each r, (i) β r is totally mixed, and (ii) (β r, µ r ) is weakly consistent, (b) β r β, and (c) µ r µ. A sequential equilibrium is an assessment (β, µ) that is sequentially rational and consistent. The additional strength of consistency, over and above weak consistency, has many natural and desirable consequences. One of these is explored in the coming week s problem set. In fact the consequences of consistency are not fully or satisfactorily understood. Since it involves limits, it is difficult to work with computationally. 3

Solving for Weak Sequential Equilibrium When there is not too much simultaneity the general idea of backwards induction can give us the weak sequential equilibria. R Challenger U Incumbent A F A F [ ] 3 [ ] 1 [ ] 4 3 1 3 F [ 0 Out Ch ger B ] [ ] 1 4 [ ] 4 Let (β, µ) be a weak sequential equilibrium. Sequential rationality implies that β 1 (F) = 1 and β 1 (B) = 0. For any belief that the Incumbent might hold, sequential rationality implies that β (A) = 1 and β (F) = 0. 33

In view of this, sequential rationality implies that β 1 (U) = 1, so that β 1 (R) = 0 and β 1 (Out) = 0. Now weak consistency implies that µ (R) = 0 and µ (U) = 1. The problem of computing the weak sequential equilibria is, in full generality, very complex. (Computing the sequential equilibria is even harder.) Both for the sorts of problems that are considered in this course, and in actual computational practice, it is a process of eliminating possibilities until every case has been fully resolved. 34