Games with Perfect Information

Similar documents
4: Dynamic games. Concordia February 6, 2017

Extensive games (with perfect information)

Extensive Form Games I

6.207/14.15: Networks Lecture 11: Introduction to Game Theory 3

Microeconomics. 2. Game Theory

6.207/14.15: Networks Lecture 10: Introduction to Game Theory 2

Negotiation: Strategic Approach

6.254 : Game Theory with Engineering Applications Lecture 13: Extensive Form Games

Solving Extensive Form Games

Industrial Organization Lecture 3: Game Theory

Backwards Induction. Extensive-Form Representation. Backwards Induction (cont ) The player 2 s optimization problem in the second stage

Basic Game Theory. Kate Larson. January 7, University of Waterloo. Kate Larson. What is Game Theory? Normal Form Games. Computing Equilibria

Game Theory. Wolfgang Frimmel. Perfect Bayesian Equilibrium

3.3.3 Illustration: Infinitely repeated Cournot duopoly.

Basics of Game Theory

Introduction to Game Theory

Bargaining, Contracts, and Theories of the Firm. Dr. Margaret Meyer Nuffield College

September 5 Exercises: Nash equilibrium and dominant strategy equilibrium: p. 44: 1, 3, 4

EconS Advanced Microeconomics II Handout on Subgame Perfect Equilibrium (SPNE)

Static (or Simultaneous- Move) Games of Complete Information

Game Theory and Social Psychology

Dynamic Games, II: Bargaining

Lecture 7. Simple Dynamic Games

Economics 209A Theory and Application of Non-Cooperative Games (Fall 2013) Extensive games with perfect information OR6and7,FT3,4and11

Game Theory. School on Systems and Control, IIT Kanpur. Ankur A. Kulkarni

Game Theory. Kuhn s Theorem. Bernhard Nebel, Robert Mattmüller, Stefan Wölfl, Christian Becker-Asano

MS&E 246: Lecture 4 Mixed strategies. Ramesh Johari January 18, 2007

Advanced Microeconomics

Extensive Form Games with Perfect Information

Economics 703 Advanced Microeconomics. Professor Peter Cramton Fall 2017

UC Berkeley Haas School of Business Game Theory (EMBA 296 & EWMBA 211) Summer 2016

Game Theory. Professor Peter Cramton Economics 300

Evolutionary Dynamics and Extensive Form Games by Ross Cressman. Reviewed by William H. Sandholm *

Economics 3012 Strategic Behavior Andy McLennan October 20, 2006

1 Extensive Form Games

Dynamic stochastic game and macroeconomic equilibrium

Game Theory and its Applications to Networks - Part I: Strict Competition

Political Economy of Institutions and Development: Problem Set 1. Due Date: Thursday, February 23, in class.

SF2972 Game Theory Exam with Solutions March 15, 2013

Extensive Form Games with Perfect Information

Microeconomics for Business Practice Session 3 - Solutions

Equilibrium Refinements

Refinements - change set of equilibria to find "better" set of equilibria by eliminating some that are less plausible

Long-Run versus Short-Run Player

Economics 201A Economic Theory (Fall 2009) Extensive Games with Perfect and Imperfect Information

MS&E 246: Lecture 12 Static games of incomplete information. Ramesh Johari

Notes on Coursera s Game Theory

EconS Sequential Competition

: Cryptography and Game Theory Ran Canetti and Alon Rosen. Lecture 8

Game Theory. Bargaining Theory. ordi Massó. International Doctorate in Economic Analysis (IDEA) Universitat Autònoma de Barcelona (UAB)

Game theory lecture 4. September 24, 2012

Game Theory, Evolutionary Dynamics, and Multi-Agent Learning. Prof. Nicola Gatti

1 The General Definition

Quantum Games. Quantum Strategies in Classical Games. Presented by Yaniv Carmeli

Economics 201B Economic Theory (Spring 2017) Bargaining. Topics: the axiomatic approach (OR 15) and the strategic approach (OR 7).

Open Sequential Equilibria of Multi-Stage Games with Infinite Sets of Types and Actions

Game Theory and Algorithms Lecture 2: Nash Equilibria and Examples

A New and Robust Subgame Perfect Equilibrium in a model of Triadic Power Relations *

Mixed Nash Equilibria

GS/ECON 5010 Answers to Assignment 3 W2005

ON FORWARD INDUCTION

Majority bargaining for resource division

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program June 2016

Confronting Theory with Experimental Data and vice versa. European University Institute. May-Jun Lectures 7-8: Equilibrium

Dynamic Games and Bargaining. Johan Stennek

Cournot Competition Under Asymmetric Information

Economics 209B Behavioral / Experimental Game Theory (Spring 2008) Lecture 3: Equilibrium refinements and selection

Game Theory with Information: Introducing the Witsenhausen Intrinsic Model

SF2972 Game Theory Written Exam with Solutions June 10, 2011

REPEATED GAMES. Jörgen Weibull. April 13, 2010

BELIEFS & EVOLUTIONARY GAME THEORY

Equilibrium Computation

Unique Nash Implementation for a Class of Bargaining Solutions

ECO421: Reputation. Marcin P ski. March 29, 2018

Consistent Beliefs in Extensive Form Games

EC3224 Autumn Lecture #04 Mixed-Strategy Equilibrium

Implementation of the Ordinal Shapley Value for a three-agent economy 1

MAS and Games. Department of Informatics Clausthal University of Technology SS 18

Belief-based Learning

Area I: Contract Theory Question (Econ 206)

Iterated Strict Dominance in Pure Strategies

Computing Equilibria of Repeated And Dynamic Games

Computing Minmax; Dominance

Bounded Rationality, Strategy Simplification, and Equilibrium

Oligopoly. Oligopoly. Xiang Sun. Wuhan University. March 23 April 6, /149

Synthesis weakness of standard approach. Rational Synthesis

First Prev Next Last Go Back Full Screen Close Quit. Game Theory. Giorgio Fagiolo

A Dynamic Level-k Model in Games

Computing Minmax; Dominance

Advanced Microeconomics

Introduction to Game Theory

A strategic implementation of the Talmud rule based on concede-and-divide algorithm

CMSC 474, Introduction to Game Theory Maxmin and Minmax Strategies

EVOLUTIONARY STABILITY FOR TWO-STAGE HAWK-DOVE GAMES

Classic Oligopoly Models: Bertrand and Cournot

EconS Advanced Microeconomics II Handout on Repeated Games

Sequential Equilibria of Multi-Stage Games with Infinite Sets of Types and Actions

Patience and Ultimatum in Bargaining

Game Theory: introduction and applications to computer networks

arxiv: v2 [cs.gt] 17 Jul 2015

Transcription:

Games with Perfect Information Yiling Chen September 7, 2011 Non-Cooperative Game Theory What is it? Mathematical study of interactions between rational and self-interested agents. Non-Cooperative Focus on agents who make their own individual decisions without coalition

Perfect Information ll players know the game structure. Each player, when making any decision, is perfectly informed of all the events that have previously occurred. Definition of Normal-Form Game finite n-person game, G =< N,, u > N = {1, 2,..., n} is the set of players. = { 1, 2,..., n } is a set of available actions. a =(a 1, a 2,..., a n ) is an action profile (or a pure strategy profile). u = {u 1, u 2,...u n } is a set of utility functions for n agents. strategy is a complete contingent plan that defines the action an agent will take in all states of the world. Pure strategies (as opposed to mixed strategies) are the same as actions of agents. Players move simultaneously. Matrix representation for 2-person games

Example: attle of the Sexes husband and wife want to go to movies. They can select between Devils wear Parada and Iron Man. They prefer to go to the same movie, but while the wife prefers Devils wear Parada the husband prefers Iron Man. They need to make the decision independently. Husband DWP IM DWP 10, 5 0, 0 Wife IM 0, 0 5, 10 Example: attle of the Sexes husband and wife want to go to movies. They can select between Devils wear Parada and Iron Man. They prefer to go to the same movie, but while the wife prefers Devils wear Parada the husband prefers Iron Man. They need to make the decision independently. Husband DWP IM DWP 10, 5 0, 0 Wife IM 0, 0 5, 10

est-response Correspondences The best-response correspondence R i (a i ) i are the set of strategies that maximizes agent i s utility given other agents strategy a i. Compute every agent s best-response correspondences. The fixed points of the best-response correspondences, i.e. a R(a ), are the NEs of the game. Example: Continuous Strategy Space Cournot Competition Two suppliers producing a homogeneous good need to choose their production quantity, q i and q 2. The demand that they are facing is p(q) =1000 Q, where Q = q 1 + q 2. Unit cost c > 0. Utility: u 1 (q 1, q 2 )=q 1 [p(q 1 + q 2 ) c] est-response of supplier 1 is R 1 (q 2 )=argmax q1 u 1 (q 1, q 2 )

Example: Matching Pennies Each of the two players has a penny. They independently choose to display either heads or tails. If the two pennies are the same, player 1 takes both pennies. If they are different, player 2 takes both pennies. Heads Tails Heads 1, -1-1, 1 Tails -1, 1 1, -1 Mixed Strategies mixed strategy of agent i, σ i ( i ), defines a probability, σ i (a i )foreachpurestrategya i i. gent i s expected utility of following strategy σ i is u i (σ) = a i P σ (a)u i (a) The support of σ i is the set of pure strategies {a i : σ(a i ) > 0}. Pure strategies are special cases of mixed strategies.

Mixed Strategy Nash Equilibrium Mixed-strategy prole σ = {σ 1, σ 2,..., σ n} is a Nash equilibrium in a game if, for all i, u i (σ i, σ i) u i (σ i, σ i), σ i ( i ). Theorem (Nash 1951): Every game with a finite number of players and action profiles has at least one Nash equilibrium. ll pure strategies in the support of agent i at a mixed strategy Nash Equilibrium have the same expected utility. Finding Mixed Strategy Nash Equilibrium Heads Tails Find the fixed point of the est-response correspondences. Heads 1, -1-1, 1 Tails -1, 1 1, -1

Fixed Point Player 1 r Heads r (p) p (r) Tails Tails Heads Player 2 p Extensive-Form Game with Perfect Information

Example: The Sharing Game and try to split two indivisible and identical gifts. First, suggests a split: which can be keeps both, they each keep one, and keeps both. Then, chooses whether to ccept or Reject the split. If accepts the split, they each get what the split specifies. If rejects, they each get nothing. 2-0 1-1 0-2 R R R (2,0) (0,0) (1,1) (0,0) (0,2) (0,0) Loosely Speaking... Extensive Form detailed description of the sequential structure of the decision problems encountered by the players in a game. Often represented as a game tree Perfect Information ll players know the game structure (including the payoff functions at every outcome). Each player, when making any decision, is perfectly informed of all the events that have previously occurred.

Def. of Perfect-Information Extensive-Form Games perfect-information extensive-form game, G =(N, H, P, u) N = {1, 2,...,n} is the set of players. N={, } 2-0 1-1 0-2 R R R (2,0) (0,0) (1,1) (0,0) (0,2) (0,0) Def. of Perfect-Information Extensive-Form Games perfect-information extensive-form game, G =(N, H, P, u) H is a set of sequences (finite or infinite) Φ H h =(a k ) k=1,..., H is a history If (a k ) k=1,..., H and L <, then (a k ) k=1,...,l H (a k ) k=1 H if (ak ) k=1,...,l H for all positive L Z is the set of terminal histories. H = {Φ, 2 0, 1 2, (2 0, ), (2 0, R), (1 1, ), (1 1, R), (0 2, ), (0 2, R)} 2-0 1-1 0-2 R R R (2,0) (0,0) (1,1) (0,0) (0,2) (0,0)

Def. of Perfect-Information Extensive-Form Games perfect-information extensive-form game, G =(N, H, P, u) P is the player function, P : H\Z N. P(Φ)= P(2 0)= P(1 1) = P(0 2) = 2-0 1-1 0-2 R R R (2,0) (0,0) (1,1) (0,0) (0,2) (0,0) Def. of Perfect-Information Extensive-Form Games perfect-information extensive-form game, G =(N, H, P, u) u = {u1, u 2,...u n } is a set of utility functions, u i : Z R. u 1 ((2 0, )) = 2 u 2 ((2 0, )) = 0... 2-0 1-1 0-2 R R R (2,0) (0,0) (1,1) (0,0) (0,2) (0,0)

Pure Strategies in Perfect-Information Extensive-Form Games pure strategy of player i N in an extensive-form game with perfect information, G =(N, H, P, u), is a function that assigns an action in (h) toeachnon-terminal history h H\Z for which P(h) =i. (h) ={a :(h, a) H} pure strategy is a contingent plan that specifies the action for player i at every decision node of i. Pure Strategies for the Sharing Game 2-0 1-1 0-2 R R R (2,0) (0,0) (1,1) (0,0) (0,2) (0,0) S = {S 1, S 2 } E.g. s 1 =(2 0 if h = Φ), s 2 =( if h =2 0; R if h = 1-1; R if h =0 2).

Pure Strategies S = {S 1, S 2 } E.g. s 1 =( if h = Φ; if h = F ) s 2 =(C if h = ; F if h = ) Normal-Form Representation perfect-information extensive-form game normal-form game 2-0 1-1 0-2 R R R (2,0) (0,0) (1,1) (0,0) (0,2) (0,0) (,,) (,,R) (,R,) (,R,R) (R,,) (R,,R) (R,R,) (R,R,R) 2-0 1-1 0-2 2,0 2,0 2,0 2,0 0,0 0,0 0,0 0,0 1,1 1,1 0,0 0,0 1,1 1,1 0,0 0,0 0,2 0,0 0,2 0,0 0,2 0,0 0,2 0,0 normal-form game perfect-information extensive-form game

Normal-Form Representation: Example 2 perfect-information extensive-form game normal-form game (C, E) (C, F) (D, E) (D, F) (, ) (, ) (, ) (, ) 2,10 normal-form game perfect-information extensive-form game 1, 10 Pure Strategy Nash Equilibrium in Perfect-Information Extensive-Form Games pure strategy profile s is a weak Nash Equilibrium if, for all agents i and for all strategies s i = s i, u i (s i, s i ) u i (s i, s i). (Same as in normal-form games) (C, E) (C, F) (D, E) (D, F) (, ) (, ) (, ) (, ) 2,10 1, 10

Pure Strategy Nash Equilibrium in Perfect-Information Extensive-Form Games pure strategy profile s is a weak Nash Equilibrium if, for all agents i and for all strategies s i = s i, u i (s i, s i ) u i (s i, s i). (Same as in normal-form games) (C, E) (C, F) (D, E) (D, F) (, ) (, ) (, ) (, ) 2,10 1, 10 Pure Strategy Nash Equilibrium in Perfect-Information Extensive-Form Games pure strategy profile s is a weak Nash Equilibrium if, for all agents i and for all strategies s i = s i, u i (s i, s i ) u i (s i, s i). (Same as in normal-form games) (C, E) (C, F) (D, E) (D, F) (, ) (, ) (, ) (, ) 2,10 1, 10

Pure Strategy Nash Equilibrium in Perfect-Information Extensive-Form Games pure strategy profile s is a weak Nash Equilibrium if, for all agents i and for all strategies s i = s i, u i (s i, s i ) u i (s i, s i). (Same as in normal-form games) (C, E) (C, F) (D, E) (D, F) (, ) (, ) (, ) (, ) 2,10 1, 10 Pure Strategy Nash Equilibrium in Perfect-Information Extensive-Form Games pure strategy profile s is a weak Nash Equilibrium if, for all agents i and for all strategies s i = s i, u i (s i, s i ) u i (s i, s i). (Same as in normal-form games) (C, E) (C, F) (D, E) (D, F) (, ) (, ) (, ) (, ) 2,10 1, 10

Nash Equilibrium and Non-Credible Threat Nash Equilibrium is not a very satisfactory solution concept for perfect-information extensive-form games. (C, E) (C, F) (D, E) (D, F) (, ) (, ) (, ) (, ) 2,10 1, 10 Let s play a game now You are part of a fictitious pride of lions that has a strict hierarchy. You are ordered from largest (left) to smallest (right) ll lions prefer to become larger and can do so by eating the next largest lion if that lion is asleep Lions sleep after eating One day a deer passes by the pride. The largest lion would like to eat it, but is afraid of being eaten by the second largest lion once he falls asleep. Should the largest lion eat the deer?

Subgame Perfect Equilibrium Sequential Rationality: player s equilibrium strategy should specify optimal actions at every point in the game tree. Subgame Perfect Equilibrium (SPE) of a perfect-information extensive-form game G is a strategy profile s such that for any subgame G of G, the restriction of s to G is a NE. Every SPE is a NE, but not vice versa. Thm: Every finite extensive-form game with perfect information has a subgame perfect equilibrium. Finite: The set of sequences H is finite. Find SPE: ackward Induction

Find SPE: ackward Induction (2,10) Find SPE: ackward Induction (2,10) (2,10)

Find SPE: ackward Induction (3,8) (2,10) (2,10) Find SPE: ackward Induction (3,8) (3,8) (2,10) (2,10)

Note on Computational Complexity Finding NE for general normal-form games requires time exponential in the size of the normal form. The induced normal form of an extensive-form game is exponentially larger than the original representation. lgorithm of backward induction requires time linear in the size of the extensive-form game. (Depth-first transverse) For zero-sum extensive-form games, we can slightly improve the running time. argaining Game: Split-the-Pie Two players trying to split a desirable pie. The set of all possible agreements X is the set of all divisions of the pie, X = {(x 1, x 2 ):x i 0fori =1, 2andx 1 + x 2 =1}. The first move of the game occurs in period 0, when player 1 makes a proposal x 0 X, then player 2 either accepts or rejects. cceptance ends the game while rejection leads to period 1, in which player 2 makes a proposal x 1 X, which player 1 has to accept or reject. gain, acceptance ends the game; rejection leads to period 2, in which it is once again player 1 s turn to make aproposal.thegamecontinuesinthisfashionsolongas no offer has been accepted. u i (x, t) =δ t x i if proposal x has been accepted in period t, δ (0, 1). u i =0ifnoagreementhasreached.

Split-the-Pie as Perfect-Information Extensive-Form Game 1 2 x 0 2 R (x 0 1, x 0 2 ) 1 x 1 R (δx 1 1, δx 1 2 ) Nash Equilibria of the Split-the-Pie Game The set of NEs is very large. For example, for any x X there is a NE in which the players immediately agree on x. E.g. Player 1 always propose (0.99, 0.01) and only accepts a proposal (0.99, 0.01).

SPE of the Split-the-Pie Game The unique SPE of the game is Player 1 always proposes ( 1 1+δ, δ 1+δ )andaccepts proposals that has x 1 δ 1+δ. Player 2 always proposes ( δ 1+δ, 1 1+δ )andaccepts proposals that has x 2 δ 1+δ.