Walras-Bowley Lecture 2003

Size: px
Start display at page:

Download "Walras-Bowley Lecture 2003"

Transcription

1 Walras-Bowley Lecture 2003 Sergiu Hart This version: September 2004 SERGIU HART c 2004 p. 1

2 ADAPTIVE HEURISTICS A Little Rationality Goes a Long Way Sergiu Hart Center for Rationality, Dept. of Economics, Dept. of Mathematics The Hebrew University of Jerusalem SERGIU HART c 2004 p. 2

3 Most of this talk is based on joint work with Andreu Mas-Colell (Universitat Pompeu Fabra, Barcelona) SERGIU HART c 2004 p. 3

4 Most of this talk is based on joint work with Andreu Mas-Colell (Universitat Pompeu Fabra, Barcelona) All papers and this presentation are available on my home page SERGIU HART c 2004 p. 3

5 Papers SERGIU HART c 2004 p. 4

6 Papers Econometrica (2000) Journal of Economic Theory (2001) Economic Essays (2001) Games and Economic Behavior (2003) American Economic Review (2003) SERGIU HART c 2004 p. 4

7 PART I Introduction: Dynamics SERGIU HART c 2004 p. 5

8 Dynamics SERGIU HART c 2004 p. 6

9 Dynamics Learning SERGIU HART c 2004 p. 6

10 Dynamics Learning START: prior beliefs STEP: observe update (Bayes) optimize (best-reply) REPEAT SERGIU HART c 2004 p. 6

11 Dynamics Evolution SERGIU HART c 2004 p. 7

12 Dynamics Evolution populations each individual fixed action ( gene ) frequencies of each action in the population mixed strategy SERGIU HART c 2004 p. 7

13 Dynamics Evolution populations each individual fixed action ( gene ) frequencies of each action in the population mixed strategy Change: Selection higher payoff higher frequency Mutation random and relatively rare SERGIU HART c 2004 p. 7

14 Dynamics Adaptive Heuristics SERGIU HART c 2004 p. 8

15 Dynamics Adaptive Heuristics rules of thumb myopic simple stimulus response, reinforcement behavioral, experiments non-bayesian SERGIU HART c 2004 p. 8

16 Dynamics Adaptive Heuristics rules of thumb myopic simple stimulus response, reinforcement behavioral, experiments non-bayesian Example: Fictitious Play SERGIU HART c 2004 p. 8

17 Dynamics Adaptive Heuristics rules of thumb myopic simple stimulus response, reinforcement behavioral, experiments non-bayesian Example: Fictitious Play (Play optimally against the empirical distribution of past play of the other player) SERGIU HART c 2004 p. 8

18 ?. jevolution. jheuristics. Learning? SERGIU HART c 2004 p. 9

19 Rationality. jevolution. jheuristics. Learning Rationality SERGIU HART c 2004 p. 9

20 Rationality. jevolution. jheuristics. Learning Rationality most of this talk SERGIU HART c 2004 p. 9

21 Can simple adaptive heuristics lead to sophisticated rational behavior? SERGIU HART c 2004 p. 10

22 Game N-person game in strategic (normal) form Players i = 1, 2,..., N SERGIU HART c 2004 p. 11

23 Game N-person game in strategic (normal) form Players i = 1, 2,..., N For each player i: Actions s i in S i SERGIU HART c 2004 p. 11

24 Game N-person game in strategic (normal) form Players i = 1, 2,..., N For each player i: Actions s i in S i For each player i: Payoffs (utilities) u i (s) u i (s 1, s 2,..., s N ) SERGIU HART c 2004 p. 11

25 Dynamics Dynamics Time t = 1, 2,... SERGIU HART c 2004 p. 12

26 Dynamics Dynamics Time t = 1, 2,... At time t each player i chooses an action s i t in S i SERGIU HART c 2004 p. 12

27 PART II Regret Matching SERGIU HART c 2004 p. 13

28 Advertisement DON T YOU FEEL A PANG OF REGRET? 47.15% YIELD 47.15% January May % 4.43% XYZ Nasdaq Dow Jones Don t wait! Ask your broker today Haaretz June 3, 2003 SERGIU HART c 2004 p. 14

29 Regret Matching REGRET MATCHING = Switch next period to a different action with a probability that is proportional to the regret for that action SERGIU HART c 2004 p. 15

30 Regret Matching REGRET MATCHING = Switch next period to a different action with a probability that is proportional to the regret for that action REGRET = increase in payoff had such a change always been made in the past SERGIU HART c 2004 p. 15

31 Regret U = average payoff up to now SERGIU HART c 2004 p. 16

32 Regret U = average payoff up to now V (k) = average payoff if action k had been played instead of the current action j every time in the past that j was played SERGIU HART c 2004 p. 16

33 Regret U = average payoff up to now V (k) = average payoff if action k had been played instead of the current action j every time in the past that j was played R(k) = [V (k) U] + = regret for action k SERGIU HART c 2004 p. 16

34 Regret U = average payoff up to now V (k) = average payoff if action k had been played instead of the current action j every time in the past that j was played R(k) = [V (k) U] + = regret for action k R(k) R i T (j k) = [ 1 T ) t T : s (u ] it = j i (k, s i t ) u i (s t ) + SERGIU HART c 2004 p. 16

35 Regret U = average payoff up to now V (k) = average payoff if action k had been played instead of the current action j every time in the past that j was played R(k) = [V (k) U] + = regret for action k R(k) R i T (j k) = [ 1 T ) t T : s (u ] it = j i (k, s i t ) u i (s t ) + SERGIU HART c 2004 p. 16

36 Regret U = average payoff up to now V (k) = average payoff if action k had been played instead of the current action j every time in the past that j was played R(k) = [V (k) U] + = regret for action k R(k) R i T (j k) = [ 1 T ) t T : s (u ] it = j i (k, s i t ) u i (s t ) + SERGIU HART c 2004 p. 16

37 Regret U = average payoff up to now V (k) = average payoff if action k had been played instead of the current action j every time in the past that j was played R(k) = [V (k) U] + = regret for action k R(k) R i T (j k) = [ 1 T ) t T : s (u ] it = j i (k, s i t ) u i (s t ) + SERGIU HART c 2004 p. 16

38 Regret U = average payoff up to now V (k) = average payoff if action k had been played instead of the current action j every time in the past that j was played R(k) = [V (k) U] + = regret for action k R(k) R i T (j k) = [ 1 T ) t T : s (u ] it = j i (k, s i t ) u i (s t ) + SERGIU HART c 2004 p. 16

39 Regret Matching Next period play: Switch to action k with a probability that is proportional to the regret R(k) (for k j) SERGIU HART c 2004 p. 17

40 Regret Matching Next period play: Switch to action k with a probability that is proportional to the regret R(k) (for k j) Play the same action j of last period with the remaining probability SERGIU HART c 2004 p. 17

41 Regret Matching Next period play: Switch to action k with a probability that is proportional to the regret R(k) (for k j) Play the same action j of last period with the remaining probability σ(k) σ i T+1 (k) = cr(k), for k j σ(j) σ i T+1 (j) = 1 k j cr(k) SERGIU HART c 2004 p. 17

42 Regret Matching Next period play: Switch to action k with a probability that is proportional to the regret R(k) (for k j) Play the same action j of last period with the remaining probability σ(k) σ i T+1 (k) = cr(k), for k j σ(j) σ i T+1 (j) = 1 k j cr(k) c = a fixed positive constant (so that the probability of not switching is > 0) SERGIU HART c 2004 p. 17

43 Regret Matching Theorem Theorem If all players play Regret Matching then the joint distribution of play converges to the set of CORRELATED EQUILIBRIA of the game SERGIU HART c 2004 p. 18

44 Joint Distribution of Play Joint distribution of play z T = The relative frequencies that the N-tuples of actions have been played up to time T SERGIU HART c 2004 p. 19

45 Joint Distribution of Play Joint distribution of play z T = The relative frequencies that the N-tuples of actions have been played up to time T SERGIU HART c 2004 p. 19

46 Joint Distribution of Play Joint distribution of play z T = The relative frequencies that the N-tuples of actions have been played up to time T T = 1 SERGIU HART c 2004 p. 19

47 Joint Distribution of Play Joint distribution of play z T = The relative frequencies that the N-tuples of actions have been played up to time T T = z 1 SERGIU HART c 2004 p. 19

48 Joint Distribution of Play Joint distribution of play z T = The relative frequencies that the N-tuples of actions have been played up to time T T = /2 1/2 0 0 z 2 SERGIU HART c 2004 p. 19

49 Joint Distribution of Play Joint distribution of play z T = The relative frequencies that the N-tuples of actions have been played up to time T T = /3 1/3 0 0 z 3 SERGIU HART c 2004 p. 19

50 Joint Distribution of Play Joint distribution of play z T = The relative frequencies that the N-tuples of actions have been played up to time T T = 10 3/10 0 2/10 1/10 3/10 1/10 z 10 SERGIU HART c 2004 p. 19

51 Joint Distribution of Play Note 1: The fact that the players randomize independently at each period does not imply that the joint distribution is independent! T = 10 3/10 0 2/10 1/10 3/10 1/10 z 10 SERGIU HART c 2004 p. 19

52 Joint Distribution of Play Note 1: The fact that the players randomize independently at each period does not imply that the joint distribution is independent! Note 2: Players observe the joint distribution (the history of play) SERGIU HART c 2004 p. 19

53 Joint Distribution of Play Note 1: The fact that the players randomize independently at each period does not imply that the joint distribution is independent! Note 2: Players observe the joint distribution (the history of play) Note 3: Players react to the joint distribution (patterns, coincidences, communication, signals,...) SERGIU HART c 2004 p. 19

54 Correlated Equilibrium A Correlated Equilibrium is a Nash equilibrium when the players receive payoff-irrelevant signals before playing the game (Aumann 1974) SERGIU HART c 2004 p. 20

55 Correlated Equilibrium A Correlated Equilibrium is a Nash equilibrium when the players receive payoff-irrelevant signals before playing the game (Aumann 1974) Examples: SERGIU HART c 2004 p. 20

56 Correlated Equilibrium A Correlated Equilibrium is a Nash equilibrium when the players receive payoff-irrelevant signals before playing the game (Aumann 1974) Examples: Independent signals SERGIU HART c 2004 p. 20

57 Correlated Equilibrium A Correlated Equilibrium is a Nash equilibrium when the players receive payoff-irrelevant signals before playing the game (Aumann 1974) Examples: Independent signals Nash equilibrium SERGIU HART c 2004 p. 20

58 Correlated Equilibrium A Correlated Equilibrium is a Nash equilibrium when the players receive payoff-irrelevant signals before playing the game (Aumann 1974) Examples: Independent signals Nash equilibrium Public signals ( sunspots ) SERGIU HART c 2004 p. 20

59 Correlated Equilibrium A Correlated Equilibrium is a Nash equilibrium when the players receive payoff-irrelevant signals before playing the game (Aumann 1974) Examples: Independent signals Nash equilibrium Public signals ( sunspots ) convex combinations of Nash equilibria SERGIU HART c 2004 p. 20

60 Correlated Equilibrium A Correlated Equilibrium is a Nash equilibrium when the players receive payoff-irrelevant signals before playing the game (Aumann 1974) Examples: Independent signals Nash equilibrium Public signals ( sunspots ) convex combinations of Nash equilibria Butterflies play the Chicken Game ( Speckled Wood Pararge aegeria) SERGIU HART c 2004 p. 20

61 Correlated Equilibria "Chicken" game LEAVE STAY LEAVE 5, 5 3, 6 STAY 6, 3 0, 0 SERGIU HART c 2004 p. 21

62 Correlated Equilibria "Chicken" game LEAVE STAY LEAVE 5, 5 3, 6 STAY 6, 3 0, 0 a Nash equilibrium SERGIU HART c 2004 p. 21

63 Correlated Equilibria "Chicken" game LEAVE STAY LEAVE 5, 5 3, 6 STAY 6, 3 0, 0 another Nash equilibriumi SERGIU HART c 2004 p. 21

64 Correlated Equilibria "Chicken" game LEAVE STAY LEAVE 5, 5 3, 6 STAY 6, 3 0, 0 L 0 1/2 1/2 0 a (publicly) correlated equilibrium SERGIU HART c 2004 p. 21

65 Correlated Equilibria "Chicken" game LEAVE STAY LEAVE 5, 5 3, 6 STAY 6, 3 0, 0 L S L 1/3 1/3 S 1/3 0 another correlated equilibrium after signal L play LEAVE after signal S play STAY SERGIU HART c 2004 p. 21

66 Correlated Equilibrium A Correlated Equilibrium is a Nash equilibrium when the players receive payoff-irrelevant signals before playing the game (Aumann 1974) Examples: Independent signals Nash equilibrium Public signals ( sunspots ) convex combinations of Nash equilibria Butterflies play the Chicken Game ( Speckled Wood Pararge aegeria) SERGIU HART c 2004 p. 22

67 Correlated Equilibrium A Correlated Equilibrium is a Nash equilibrium when the players receive payoff-irrelevant signals before playing the game (Aumann 1974) Examples: Independent signals Nash equilibrium Public signals ( sunspots ) convex combinations of Nash equilibria Butterflies play the Chicken Game ( Speckled Wood Pararge aegeria) Boston Celtics front line SERGIU HART c 2004 p. 22

68 Correlated Equilibrium Signals (public, correlated) are unavoidable SERGIU HART c 2004 p. 23

69 Correlated Equilibrium Signals (public, correlated) are unavoidable Bayesian Rationality Correlated Equilibrium (Aumann 1987) SERGIU HART c 2004 p. 23

70 Correlated Equilibrium Signals (public, correlated) are unavoidable Bayesian Rationality Correlated Equilibrium (Aumann 1987) A joint distribution z is a correlated equilibrium u(j, s i )z(j, s i ) u(k, s i )z(j, s i ) s i s i for all i N and all j, k S i SERGIU HART c 2004 p. 23

71 Regret Matching Theorem [recall] Theorem If all players play Regret Matching then the joint distribution of play converges to the set of CORRELATED EQUILIBRIA of the game SERGIU HART c 2004 p. 24

72 Regret Matching Theorem CE = set of correlated equilibria z T = joint distribution of play up to time T distance(z T, CE) 0 as T (a.s.) SERGIU HART c 2004 p. 25

73 Regret Matching Theorem CE = set of correlated equilibria z T = joint distribution of play up to time T distance(z T, CE) 0 as T (a.s.) z T is approximately a correlated equilibrium (or z T is a correlated approximate equilibrium) from some time on (for all large enough T ) SERGIU HART c 2004 p. 25

74 Regret Matching Theorem Proof z T is a correlated equilibrium there is no regret: RT i (j k) = 0 for all players and all actions SERGIU HART c 2004 p. 26

75 Regret Matching Theorem Proof z T is a correlated equilibrium there is no regret: RT i (j k) = 0 for all players and all actions Regret Matching all regrets converge to 0 (Proof: Blackwell Approachability for the vector of regrets + approximate eigenvector probabilities by transition probabilities) SERGIU HART c 2004 p. 26

76 Regret Matching Theorem Proof z T is a correlated equilibrium there is no regret: RT i (j k) = 0 for all players and all actions Regret Matching all regrets converge to 0 (Proof: Blackwell Approachability for the vector of regrets + approximate eigenvector probabilities by transition probabilities) Note: z T converges to the set CE, not to a point SERGIU HART c 2004 p. 26

77 Remarks Correlating device: the history of play SERGIU HART c 2004 p. 27

78 Remarks Correlating device: the history of play Other procedures leading to correlated equilibria: Foster Vohra 1997 Calibrated Learning: best-reply to calibrated forecasts Fudenberg Levine 1999 Conditional Smooth Fictitious Play Eigenvector strategy SERGIU HART c 2004 p. 27

79 Remarks Correlating device: the history of play Other procedures leading to correlated equilibria: Foster Vohra 1997 Calibrated Learning: best-reply to calibrated forecasts Fudenberg Levine 1999 Conditional Smooth Fictitious Play Eigenvector strategy Not heuristics! SERGIU HART c 2004 p. 27

80 Behavioral Aspects Behavioral aspects of Regret Matching: SERGIU HART c 2004 p. 28

81 Behavioral Aspects Behavioral aspects of Regret Matching: Commonly used rules of behavior SERGIU HART c 2004 p. 28

82 Behavioral Aspects Behavioral aspects of Regret Matching: Commonly used rules of behavior Never change a winning team SERGIU HART c 2004 p. 28

83 Behavioral Aspects Behavioral aspects of Regret Matching: Commonly used rules of behavior Never change a winning team The higher would have been the payoff from another action the higher the tendency to switch to it SERGIU HART c 2004 p. 28

84 Behavioral Aspects Behavioral aspects of Regret Matching: Commonly used rules of behavior Never change a winning team The higher would have been the payoff from another action the higher the tendency to switch to it Small probability of switching (the status quo bias") SERGIU HART c 2004 p. 28

85 Behavioral Aspects Behavioral aspects of Regret Matching: Commonly used rules of behavior Never change a winning team The higher would have been the payoff from another action the higher the tendency to switch to it Small probability of switching (the status quo bias") Stimulus-response, reinforcement SERGIU HART c 2004 p. 28

86 Behavioral Aspects Behavioral aspects of Regret Matching: Commonly used rules of behavior Never change a winning team The higher would have been the payoff from another action the higher the tendency to switch to it Small probability of switching (the status quo bias") Stimulus-response, reinforcement No beliefs (defined directly on actions) No best-reply (better-reply?) SERGIU HART c 2004 p. 28

87 Behavioral Aspects Similar to models of learning, experimental and behavioral economics: SERGIU HART c 2004 p. 29

88 Behavioral Aspects Similar to models of learning, experimental and behavioral economics: Bush Mosteller 1955 Erev Roth 1995, 1998 Camerer Ho 1997, 1998, SERGIU HART c 2004 p. 29

89 Behavioral Aspects Similar to models of learning, experimental and behavioral economics: Bush Mosteller 1955 Erev Roth 1995, 1998 Camerer Ho 1997, 1998, SERGIU HART c 2004 p. 29

90 Behavioral Aspects Similar to models of learning, experimental and behavioral economics: Bush Mosteller 1955 Erev Roth 1995, 1998 Camerer Ho 1997, 1998, N. Camille et al, The Involvement of the Orbitofrontal Cortex in the Experience of Regret Science May 2004 (304: ) SERGIU HART c 2004 p. 29

91 PART III Generalized Regret Matching SERGIU HART c 2004 p. 30

92 Questions How special is Regret Matching? SERGIU HART c 2004 p. 31

93 Questions How special is Regret Matching? Why does conditional smooth fictitious play work? SERGIU HART c 2004 p. 31

94 Questions How special is Regret Matching? Why does conditional smooth fictitious play work? Any connections? SERGIU HART c 2004 p. 31

95 Generalized Regret Matching Regret Matching = Switching probabilities are proportional to the regrets: σ(k) = cr(k) SERGIU HART c 2004 p. 32

96 Generalized Regret Matching Regret Matching = Switching probabilities are proportional to the regrets: σ(k) = cr(k) Generalized Regret Matching = Switching probabilities are a function of the regrets: σ(k) = f(r(k)) SERGIU HART c 2004 p. 32

97 Generalized Regret Matching Regret Matching = Switching probabilities are proportional to the regrets: σ(k) = cr(k) Generalized Regret Matching = Switching probabilities are a function of the regrets: σ(k) = f(r(k)) f is a sign-preserving function: f(0) = 0, and x > 0 f(x) > 0 f is a Lipschitz continuous function (in fact, much more general: f k,j, potential) SERGIU HART c 2004 p. 32

98 Generalized Regret Matching Theorem If all players play Generalized Regret Matching then the joint distribution of play converges to the set of correlated equilibria of the game SERGIU HART c 2004 p. 33

99 Generalized Regret Matching Theorem If all players play Generalized Regret Matching then the joint distribution of play converges to the set of correlated equilibria of the game Proof: Universal approachability strategies + Amotz Cahn, M.Sc. thesis, 2000 SERGIU HART c 2004 p. 33

100 Special Cases Play probabilities proportional to the m-th power of the regrets (f(x) = cx m, for m 1) SERGIU HART c 2004 p. 34

101 Special Cases Play probabilities proportional to the m-th power of the regrets (f(x) = cx m, for m 1) m = 1: Regret Matching SERGIU HART c 2004 p. 34

102 Special Cases Play probabilities proportional to the m-th power of the regrets (f(x) = cx m, for m 1) m = 1: Regret Matching m = : Positive probability only to actions with maximal regret SERGIU HART c 2004 p. 34

103 Special Cases Play probabilities proportional to the m-th power of the regrets (f(x) = cx m, for m 1) m = 1: Regret Matching m = : Positive probability only to actions with maximal regret Conditional Fictitious Play SERGIU HART c 2004 p. 34

104 Special Cases Play probabilities proportional to the m-th power of the regrets (f(x) = cx m, for m 1) m = 1: Regret Matching m = : Positive probability only to actions with maximal regret Conditional Fictitious Play But: Not continuous SERGIU HART c 2004 p. 34

105 Special Cases Play probabilities proportional to the m-th power of the regrets (f(x) = cx m, for m 1) m = 1: Regret Matching m = : Positive probability only to actions with maximal regret Conditional Fictitious Play But: Not continuous Therefore: Smooth Conditional Fictitious Play SERGIU HART c 2004 p. 34

106 PART IV Unknown Game SERGIU HART c 2004 p. 35

107 Unknown Game The case of the Unknown game : The player knows only Its own set of actions Its own past actions and payoffs SERGIU HART c 2004 p. 36

108 Unknown Game The case of the Unknown game : The player knows only Its own set of actions Its own past actions and payoffs The player does not know the game (other players, actions, payoff functions, history of other players actions and payoffs) SERGIU HART c 2004 p. 36

109 Proxy Regret Unknown game Unknown regret (The player does not know what the payoff would have been if he had played a different action k) SERGIU HART c 2004 p. 37

110 Proxy Regret Unknown game Unknown regret (The player does not know what the payoff would have been if he had played a different action k) Proxy Regret for k: Use the payoffs received when k has been actually played in the past SERGIU HART c 2004 p. 37

111 Proxy Regret Unknown game Unknown regret (The player does not know what the payoff would have been if he had played a different action k) Proxy Regret for k: Use the payoffs received when k has been actually played in the past Theorem. If all players play strategies based on proxy regret, then the joint distribution of play converges to the set of correlated equilibria of the game SERGIU HART c 2004 p. 37

112 PART V Uncoupled Dynamics SERGIU HART c 2004 p. 38

113 Nash Equilibrium Question: Adaptive heuristics Nash equilibria? SERGIU HART c 2004 p. 39

114 Nash Equilibrium Question: Adaptive heuristics Nash equilibria? In SPECIAL classes of games: YES SERGIU HART c 2004 p. 39

115 Nash Equilibrium Question: Adaptive heuristics Nash equilibria? In SPECIAL classes of games: YES Fictitious play, Regret-based,... Two-person zero-sum games Two-person potential games Supermodular games... SERGIU HART c 2004 p. 39

116 Nash Equilibrium Question: Adaptive heuristics Nash equilibria? In SPECIAL classes of games: YES Fictitious play, Regret-based,... Two-person zero-sum games Two-person potential games Supermodular games... In GENERAL games: NO SERGIU HART c 2004 p. 39

117 Nash Equilibrium Question: Adaptive heuristics Nash equilibria? In SPECIAL classes of games: YES Fictitious play, Regret-based,... Two-person zero-sum games Two-person potential games Supermodular games... In GENERAL games: NO WHY? SERGIU HART c 2004 p. 39

118 Uncoupled Dynamics General dynamic for 2-person games: ẋ(t) = F ( x(t), y(t) ; u 1, u 2 ) ẏ(t) = G ( x(t), y(t) ; u 1, u 2 ) x(t) (S 1 ), y(t) (S 2 ) SERGIU HART c 2004 p. 40

119 Uncoupled Dynamics General dynamic for 2-person games: ẋ(t) = F ( x(t), y(t) ; u 1, u 2 ) ẏ(t) = G ( x(t), y(t) ; u 1, u 2 ) Uncoupled dynamic: ẋ(t) = F ( x(t), y(t) ; u 1 ) ẏ(t) = G ( x(t), y(t) ; u 2 ) x(t) (S 1 ), y(t) (S 2 ) SERGIU HART c 2004 p. 40

120 Uncoupled Dynamics Adaptive ( rational ) dynamics (best-reply, better-reply, payoff-improving, monotonic, fictitious play, regret-based, replicator dynamics,...) are uncoupled SERGIU HART c 2004 p. 41

121 Uncoupled Dynamics Adaptive ( rational ) dynamics (best-reply, better-reply, payoff-improving, monotonic, fictitious play, regret-based, replicator dynamics,...) are uncoupled Uncoupledness is a natural informational condition SERGIU HART c 2004 p. 41

122 Nash-Convergent Dynamics Consider a family of games, each having a unique Nash equilibrium (no coordination problems ) SERGIU HART c 2004 p. 42

123 Nash-Convergent Dynamics Consider a family of games, each having a unique Nash equilibrium (no coordination problems ) A dynamic is Nash-convergent if it always converges to the unique Nash equilibrium Regularity conditions: The unique Nash equilibrium is a stable rest-point of the dynamic SERGIU HART c 2004 p. 42

124 Impossibility There exist no uncoupled dynamics which guarantee Nash convergence SERGIU HART c 2004 p. 43

125 Impossibility There exist no uncoupled dynamics which guarantee Nash convergence There are simple families of games whose unique Nash equilibrium is unstable for every uncoupled dynamic SERGIU HART c 2004 p. 43

126 Impossibility Adaptive ( rational ) dynamics (best-reply, better-reply, payoff-improving, monotonic, fictitious play, regret-based, replicator dynamics,...) are uncoupled SERGIU HART c 2004 p. 44

127 Impossibility Adaptive ( rational ) dynamics (best-reply, better-reply, payoff-improving, monotonic, fictitious play, regret-based, replicator dynamics,...) are uncoupled cannot always converge to Nash equilibria SERGIU HART c 2004 p. 44

128 Nash vs Correlated Correlated equilibria Uncoupled dynamics SERGIU HART c 2004 p. 45

129 Nash vs Correlated Correlated equilibria Uncoupled dynamics Nash equilibria Coupled dynamics SERGIU HART c 2004 p. 45

130 Nash vs Correlated Correlated equilibria Uncoupled dynamics Nash equilibria Coupled dynamics Law of Conservation of Coordination There must be coordination either in the equilibrium concept or in the dynamic SERGIU HART c 2004 p. 45

131 Summary SERGIU HART c 2004 p. 46

132 Where Do We Go From Here? SERGIU HART c 2004 p. 47

133 Where Do We Go From Here? Dynamics and equilibria Which equilibria? Which dynamics? SERGIU HART c 2004 p. 47

134 Where Do We Go From Here? Dynamics and equilibria Which equilibria? Which dynamics? Correlated equilibria: theory and practice Coordination Communication Bounded complexity SERGIU HART c 2004 p. 47

135 Where Do We Go From Here? Dynamics and equilibria Which equilibria? Which dynamics? Correlated equilibria: theory and practice Coordination Communication Bounded complexity Experiments, empirics Theory SERGIU HART c 2004 p. 47

136 Where Do We Go From Here? Dynamics and equilibria Which equilibria? Which dynamics? Correlated equilibria: theory and practice Coordination Communication Bounded complexity Experiments, empirics Theory Joint distribution of play (instead of just the marginals) SERGIU HART c 2004 p. 47

137 Summary SERGIU HART c 2004 p. 48

138 Summary Therejis a simple adaptive heuristic always leading to correlated equilibria SERGIU HART c 2004 p. 48

139 Summary Therejis a simple adaptive heuristic always leading to correlated equilibria (Regret Matching) SERGIU HART c 2004 p. 48

140 Summary Therejis a simple adaptive heuristic always leading to correlated equilibria (Regret Matching) There are many adaptive heuristics always leading to correlated equilibria SERGIU HART c 2004 p. 48

141 Summary Therejis a simple adaptive heuristic always leading to correlated equilibria (Regret Matching) There are many adaptive heuristics always leading to correlated equilibria (Generalized Regret Matching) SERGIU HART c 2004 p. 48

142 Summary Therejis a simple adaptive heuristic always leading to correlated equilibria (Regret Matching) There are many adaptive heuristics always leading to correlated equilibria (Generalized Regret Matching) There can be jno adaptive heuristics always leading to Nash equilibria SERGIU HART c 2004 p. 48

143 Summary Therejis a simple adaptive heuristic always leading to correlated equilibria (Regret Matching) There are many adaptive heuristics always leading to correlated equilibria (Generalized Regret Matching) There can be jno adaptive heuristics always leading to Nash equilibria (Uncoupledness) SERGIU HART c 2004 p. 48

144 Summary ADAPTIVE HEURISTICS SERGIU HART c 2004 p. 49

145 Summary BEHAVIORAL ADAPTIVE HEURISTICS SERGIU HART c 2004 p. 49

146 Summary BEHAVIORAL ADAPTIVE HEURISTICS CORRELATED EQUILIBRIA SERGIU HART c 2004 p. 49

147 Summary BEHAVIORAL ADAPTIVE HEURISTICS CORRELATED EQUILIBRIA Regret Matching SERGIU HART c 2004 p. 49

148 Summary BEHAVIORAL ADAPTIVE HEURISTICS CORRELATED EQUILIBRIA Generalized Regret Matching SERGIU HART c 2004 p. 49

149 Summary BEHAVIORAL ADAPTIVE HEURISTICS CORRELATED EQUILIBRIA Uncoupledness SERGIU HART c 2004 p. 49

150 Question Can simple adaptive heuristics lead to sophisticated rational behavior? SERGIU HART c 2004 p. 50

151 Answer Q Can simple adaptive heuristics lead to sophisticated rational behavior? YES! SERGIU HART c 2004 p. 50

152 Answer Q Can simple adaptive heuristics lead to sophisticated rational behavior? YES! in time... SERGIU HART c 2004 p. 50

153 Summary Macro BEHAVIORAL RATIONAL SERGIU HART c 2004 p. 51

154 Summary Macro BEHAVIORAL RATIONAL ADAPTIVE HEURISTICS SERGIU HART c 2004 p. 51

155 Title ADAPTIVE HEURISTICS ( A Little Rationality Goes a Long Way) SERGIU HART c 2004 p. 52

156 Title ADAPTIVE HEURISTICS (A Little Rationality Goes a Long Way) Rationality Takes Time SERGIU HART c 2004 p. 52

157 Title ADAPTIVE HEURISTICS (A Little Rationality Goes a Long Way) Rationality Takes Time Regret?... No Regret! SERGIU HART c 2004 p. 52

158 Title ADAPTIVE HEURISTICS (A Little Rationality Goes a Long Way) Rationality Takes Time Regret?... No Regret! SERGIU HART c 2004 p. 52

Correlated Equilibria: Rationality and Dynamics

Correlated Equilibria: Rationality and Dynamics Correlated Equilibria: Rationality and Dynamics Sergiu Hart June 2010 AUMANN 80 SERGIU HART c 2010 p. 1 CORRELATED EQUILIBRIA: RATIONALITY AND DYNAMICS Sergiu Hart Center for the Study of Rationality Dept

More information

Smooth Calibration, Leaky Forecasts, Finite Recall, and Nash Dynamics

Smooth Calibration, Leaky Forecasts, Finite Recall, and Nash Dynamics Smooth Calibration, Leaky Forecasts, Finite Recall, and Nash Dynamics Sergiu Hart August 2016 Smooth Calibration, Leaky Forecasts, Finite Recall, and Nash Dynamics Sergiu Hart Center for the Study of Rationality

More information

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission.

Each copy of any part of a JSTOR transmission must contain the same copyright notice that appears on the screen or printed page of such transmission. Adaptive Heuristics Author(s): Sergiu Hart Source: Econometrica, Vol. 73, No. 5 (Sep., 2005), pp. 1401-1430 Published by: The Econometric Society Stable URL: http://www.jstor.org/stable/3598879 Accessed:

More information

Calibration and Nash Equilibrium

Calibration and Nash Equilibrium Calibration and Nash Equilibrium Dean Foster University of Pennsylvania and Sham Kakade TTI September 22, 2005 What is a Nash equilibrium? Cartoon definition of NE: Leroy Lockhorn: I m drinking because

More information

Lecture 14: Approachability and regret minimization Ramesh Johari May 23, 2007

Lecture 14: Approachability and regret minimization Ramesh Johari May 23, 2007 MS&E 336 Lecture 4: Approachability and regret minimization Ramesh Johari May 23, 2007 In this lecture we use Blackwell s approachability theorem to formulate both external and internal regret minimizing

More information

Deterministic Calibration and Nash Equilibrium

Deterministic Calibration and Nash Equilibrium University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 2-2008 Deterministic Calibration and Nash Equilibrium Sham M. Kakade Dean P. Foster University of Pennsylvania Follow

More information

Brown s Original Fictitious Play

Brown s Original Fictitious Play manuscript No. Brown s Original Fictitious Play Ulrich Berger Vienna University of Economics, Department VW5 Augasse 2-6, A-1090 Vienna, Austria e-mail: ulrich.berger@wu-wien.ac.at March 2005 Abstract

More information

Self-stabilizing uncoupled dynamics

Self-stabilizing uncoupled dynamics Self-stabilizing uncoupled dynamics Aaron D. Jaggard 1, Neil Lutz 2, Michael Schapira 3, and Rebecca N. Wright 4 1 U.S. Naval Research Laboratory, Washington, DC 20375, USA. aaron.jaggard@nrl.navy.mil

More information

Irrational behavior in the Brown von Neumann Nash dynamics

Irrational behavior in the Brown von Neumann Nash dynamics Irrational behavior in the Brown von Neumann Nash dynamics Ulrich Berger a and Josef Hofbauer b a Vienna University of Economics and Business Administration, Department VW 5, Augasse 2-6, A-1090 Wien,

More information

Existence of sparsely supported correlated equilibria

Existence of sparsely supported correlated equilibria Existence of sparsely supported correlated equilibria Fabrizio Germano fabrizio.germano@upf.edu Gábor Lugosi lugosi@upf.es Departament d Economia i Empresa Universitat Pompeu Fabra Ramon Trias Fargas 25-27

More information

Existence of sparsely supported correlated equilibria

Existence of sparsely supported correlated equilibria Existence of sparsely supported correlated equilibria Fabrizio Germano fabrizio.germano@upf.edu Gábor Lugosi lugosi@upf.es Departament d Economia i Empresa Universitat Pompeu Fabra Ramon Trias Fargas 25-27

More information

Irrational behavior in the Brown von Neumann Nash dynamics

Irrational behavior in the Brown von Neumann Nash dynamics Games and Economic Behavior 56 (2006) 1 6 www.elsevier.com/locate/geb Irrational behavior in the Brown von Neumann Nash dynamics Ulrich Berger a,, Josef Hofbauer b a Vienna University of Economics and

More information

Game-Theoretic Learning:

Game-Theoretic Learning: Game-Theoretic Learning: Regret Minimization vs. Utility Maximization Amy Greenwald with David Gondek, Amir Jafari, and Casey Marks Brown University University of Pennsylvania November 17, 2004 Background

More information

האוניברסיטה העברית בירושלים

האוניברסיטה העברית בירושלים האוניברסיטה העברית בירושלים THE HEBREW UNIVERSITY OF JERUSALEM TOWARDS A CHARACTERIZATION OF RATIONAL EXPECTATIONS by ITAI ARIELI Discussion Paper # 475 February 2008 מרכז לחקר הרציונליות CENTER FOR THE

More information

6.254 : Game Theory with Engineering Applications Lecture 8: Supermodular and Potential Games

6.254 : Game Theory with Engineering Applications Lecture 8: Supermodular and Potential Games 6.254 : Game Theory with Engineering Applications Lecture 8: Supermodular and Asu Ozdaglar MIT March 2, 2010 1 Introduction Outline Review of Supermodular Games Reading: Fudenberg and Tirole, Section 12.3.

More information

Games A game is a tuple = (I; (S i ;r i ) i2i) where ffl I is a set of players (i 2 I) ffl S i is a set of (pure) strategies (s i 2 S i ) Q ffl r i :

Games A game is a tuple = (I; (S i ;r i ) i2i) where ffl I is a set of players (i 2 I) ffl S i is a set of (pure) strategies (s i 2 S i ) Q ffl r i : On the Connection between No-Regret Learning, Fictitious Play, & Nash Equilibrium Amy Greenwald Brown University Gunes Ercal, David Gondek, Amir Jafari July, Games A game is a tuple = (I; (S i ;r i ) i2i)

More information

1 The General Definition

1 The General Definition MS&E 336 Lecture 1: Dynamic games Ramesh Johari April 4, 2007 1 The General Definition A dynamic game (or extensive game, or game in extensive form) consists of: A set of players N; A set H of sequences

More information

MS&E 246: Lecture 12 Static games of incomplete information. Ramesh Johari

MS&E 246: Lecture 12 Static games of incomplete information. Ramesh Johari MS&E 246: Lecture 12 Static games of incomplete information Ramesh Johari Incomplete information Complete information means the entire structure of the game is common knowledge Incomplete information means

More information

Network Games: Learning and Dynamics

Network Games: Learning and Dynamics Network Games: Learning and Dynamics Asu Ozdaglar Conference on Decision and Control (CDC) December 2008 Department of Electrical Engineering & Computer Science Massachusetts Institute of Technology, USA

More information

EC319 Economic Theory and Its Applications, Part II: Lecture 7

EC319 Economic Theory and Its Applications, Part II: Lecture 7 EC319 Economic Theory and Its Applications, Part II: Lecture 7 Leonardo Felli NAB.2.14 27 February 2014 Signalling Games Consider the following Bayesian game: Set of players: N = {N, S, }, Nature N strategy

More information

1 Equilibrium Comparisons

1 Equilibrium Comparisons CS/SS 241a Assignment 3 Guru: Jason Marden Assigned: 1/31/08 Due: 2/14/08 2:30pm We encourage you to discuss these problems with others, but you need to write up the actual homework alone. At the top of

More information

A General Class of Adaptive Strategies 1

A General Class of Adaptive Strategies 1 Journal of Economic Theory 98, 2654 (2001) doi:10.1006jeth.2000.2746, available online at http:www.idealibrary.com on A General Class of Adaptive Strategies 1 Sergiu Hart 2 Center for Rationality and Interactive

More information

Possibility and Impossibility of Learning with Limited Behavior Rules

Possibility and Impossibility of Learning with Limited Behavior Rules Possibility and Impossibility of Learning with Limited Behavior Rules Takako Fujiwara-Greve Dept. of Economics Keio University, Tokyo, Japan and Norwegian School of Management BI, Sandvika, Norway and

More information

NOTE. A 2 2 Game without the Fictitious Play Property

NOTE. A 2 2 Game without the Fictitious Play Property GAMES AND ECONOMIC BEHAVIOR 14, 144 148 1996 ARTICLE NO. 0045 NOTE A Game without the Fictitious Play Property Dov Monderer and Aner Sela Faculty of Industrial Engineering and Management, The Technion,

More information

Satisfaction Equilibrium: Achieving Cooperation in Incomplete Information Games

Satisfaction Equilibrium: Achieving Cooperation in Incomplete Information Games Satisfaction Equilibrium: Achieving Cooperation in Incomplete Information Games Stéphane Ross and Brahim Chaib-draa Department of Computer Science and Software Engineering Laval University, Québec (Qc),

More information

Reinforcement Learning

Reinforcement Learning 5 / 28 Reinforcement Learning Based on a simple principle: More likely to repeat an action, if it had to a positive outcome. 6 / 28 Reinforcement Learning Idea of reinforcement learning first formulated

More information

EC319 Economic Theory and Its Applications, Part II: Lecture 2

EC319 Economic Theory and Its Applications, Part II: Lecture 2 EC319 Economic Theory and Its Applications, Part II: Lecture 2 Leonardo Felli NAB.2.14 23 January 2014 Static Bayesian Game Consider the following game of incomplete information: Γ = {N, Ω, A i, T i, µ

More information

Distributional stability and equilibrium selection Evolutionary dynamics under payoff heterogeneity (II) Dai Zusai

Distributional stability and equilibrium selection Evolutionary dynamics under payoff heterogeneity (II) Dai Zusai Distributional stability Dai Zusai 1 Distributional stability and equilibrium selection Evolutionary dynamics under payoff heterogeneity (II) Dai Zusai Tohoku University (Economics) August 2017 Outline

More information

Learning Equilibrium as a Generalization of Learning to Optimize

Learning Equilibrium as a Generalization of Learning to Optimize Learning Equilibrium as a Generalization of Learning to Optimize Dov Monderer and Moshe Tennenholtz Faculty of Industrial Engineering and Management Technion Israel Institute of Technology Haifa 32000,

More information

Bayes Correlated Equilibrium and Comparing Information Structures

Bayes Correlated Equilibrium and Comparing Information Structures Bayes Correlated Equilibrium and Comparing Information Structures Dirk Bergemann and Stephen Morris Spring 2013: 521 B Introduction game theoretic predictions are very sensitive to "information structure"

More information

Mechanism Design: Implementation. Game Theory Course: Jackson, Leyton-Brown & Shoham

Mechanism Design: Implementation. Game Theory Course: Jackson, Leyton-Brown & Shoham Game Theory Course: Jackson, Leyton-Brown & Shoham Bayesian Game Setting Extend the social choice setting to a new setting where agents can t be relied upon to disclose their preferences honestly Start

More information

Game Theory and Control

Game Theory and Control Game Theory and Control Lecture 4: Potential games Saverio Bolognani, Ashish Hota, Maryam Kamgarpour Automatic Control Laboratory ETH Zürich 1 / 40 Course Outline 1 Introduction 22.02 Lecture 1: Introduction

More information

BELIEFS & EVOLUTIONARY GAME THEORY

BELIEFS & EVOLUTIONARY GAME THEORY 1 / 32 BELIEFS & EVOLUTIONARY GAME THEORY Heinrich H. Nax hnax@ethz.ch & Bary S. R. Pradelski bpradelski@ethz.ch May 15, 217: Lecture 1 2 / 32 Plan Normal form games Equilibrium invariance Equilibrium

More information

epub WU Institutional Repository

epub WU Institutional Repository epub WU Institutional Repository Ulrich Berger Learning in games with strategic complementarities revisited Article (Submitted) Original Citation: Berger, Ulrich (2008) Learning in games with strategic

More information

6.254 : Game Theory with Engineering Applications Lecture 7: Supermodular Games

6.254 : Game Theory with Engineering Applications Lecture 7: Supermodular Games 6.254 : Game Theory with Engineering Applications Lecture 7: Asu Ozdaglar MIT February 25, 2010 1 Introduction Outline Uniqueness of a Pure Nash Equilibrium for Continuous Games Reading: Rosen J.B., Existence

More information

Models of Strategic Reasoning Lecture 2

Models of Strategic Reasoning Lecture 2 Models of Strategic Reasoning Lecture 2 Eric Pacuit University of Maryland, College Park ai.stanford.edu/~epacuit August 7, 2012 Eric Pacuit: Models of Strategic Reasoning 1/30 Lecture 1: Introduction,

More information

Game Theory. Bargaining Theory. ordi Massó. International Doctorate in Economic Analysis (IDEA) Universitat Autònoma de Barcelona (UAB)

Game Theory. Bargaining Theory. ordi Massó. International Doctorate in Economic Analysis (IDEA) Universitat Autònoma de Barcelona (UAB) Game Theory Bargaining Theory J International Doctorate in Economic Analysis (IDEA) Universitat Autònoma de Barcelona (UAB) (International Game Theory: Doctorate Bargainingin Theory Economic Analysis (IDEA)

More information

The Query Complexity of Correlated Equilibria

The Query Complexity of Correlated Equilibria The Query Complexity of Correlated Equilibria arxiv:1305.4874v2 [cs.gt] 25 Dec 2017 Sergiu Hart Noam Nisan December 27, 2017 Abstract We consider the complexity of finding a correlated equilibrium of an

More information

Belief-based Learning

Belief-based Learning Belief-based Learning Algorithmic Game Theory Marcello Restelli Lecture Outline Introdutcion to multi-agent learning Belief-based learning Cournot adjustment Fictitious play Bayesian learning Equilibrium

More information

Correlated Equilibrium in Games with Incomplete Information

Correlated Equilibrium in Games with Incomplete Information Correlated Equilibrium in Games with Incomplete Information Dirk Bergemann and Stephen Morris Econometric Society Summer Meeting June 2012 Robust Predictions Agenda game theoretic predictions are very

More information

Game Theory: Spring 2017

Game Theory: Spring 2017 Game Theory: Spring 2017 Ulle Endriss Institute for Logic, Language and Computation University of Amsterdam Ulle Endriss 1 Plan for Today So far, our players didn t know the strategies of the others, but

More information

Robust Learning Equilibrium

Robust Learning Equilibrium Robust Learning Equilibrium Itai Ashlagi Dov Monderer Moshe Tennenholtz Faculty of Industrial Engineering and Management Technion Israel Institute of Technology Haifa 32000, Israel Abstract We introduce

More information

CS364A: Algorithmic Game Theory Lecture #13: Potential Games; A Hierarchy of Equilibria

CS364A: Algorithmic Game Theory Lecture #13: Potential Games; A Hierarchy of Equilibria CS364A: Algorithmic Game Theory Lecture #13: Potential Games; A Hierarchy of Equilibria Tim Roughgarden November 4, 2013 Last lecture we proved that every pure Nash equilibrium of an atomic selfish routing

More information

Advanced Machine Learning

Advanced Machine Learning Advanced Machine Learning Learning and Games MEHRYAR MOHRI MOHRI@ COURANT INSTITUTE & GOOGLE RESEARCH. Outline Normal form games Nash equilibrium von Neumann s minimax theorem Correlated equilibrium Internal

More information

Learning correlated equilibria in games with compact sets of strategies

Learning correlated equilibria in games with compact sets of strategies Learning correlated equilibria in games with compact sets of strategies Gilles Stoltz a,, Gábor Lugosi b a cnrs and Département de Mathématiques et Applications, Ecole Normale Supérieure, 45 rue d Ulm,

More information

Economics 703 Advanced Microeconomics. Professor Peter Cramton Fall 2017

Economics 703 Advanced Microeconomics. Professor Peter Cramton Fall 2017 Economics 703 Advanced Microeconomics Professor Peter Cramton Fall 2017 1 Outline Introduction Syllabus Web demonstration Examples 2 About Me: Peter Cramton B.S. Engineering, Cornell University Ph.D. Business

More information

Learning in Mean Field Games

Learning in Mean Field Games Learning in Mean Field Games P. Cardaliaguet Paris-Dauphine Joint work with S. Hadikhanloo (Dauphine) Nonlinear PDEs: Optimal Control, Asymptotic Problems and Mean Field Games Padova, February 25-26, 2016.

More information

MS&E 246: Lecture 4 Mixed strategies. Ramesh Johari January 18, 2007

MS&E 246: Lecture 4 Mixed strategies. Ramesh Johari January 18, 2007 MS&E 246: Lecture 4 Mixed strategies Ramesh Johari January 18, 2007 Outline Mixed strategies Mixed strategy Nash equilibrium Existence of Nash equilibrium Examples Discussion of Nash equilibrium Mixed

More information

: Cryptography and Game Theory Ran Canetti and Alon Rosen. Lecture 8

: Cryptography and Game Theory Ran Canetti and Alon Rosen. Lecture 8 0368.4170: Cryptography and Game Theory Ran Canetti and Alon Rosen Lecture 8 December 9, 2009 Scribe: Naama Ben-Aroya Last Week 2 player zero-sum games (min-max) Mixed NE (existence, complexity) ɛ-ne Correlated

More information

Evolution & Learning in Games

Evolution & Learning in Games 1 / 28 Evolution & Learning in Games Econ 243B Jean-Paul Carvalho Lecture 5. Revision Protocols and Evolutionary Dynamics 2 / 28 Population Games played by Boundedly Rational Agents We have introduced

More information

Consistent Beliefs in Extensive Form Games

Consistent Beliefs in Extensive Form Games Games 2010, 1, 415-421; doi:10.3390/g1040415 OPEN ACCESS games ISSN 2073-4336 www.mdpi.com/journal/games Article Consistent Beliefs in Extensive Form Games Paulo Barelli 1,2 1 Department of Economics,

More information

Global Games I. Mehdi Shadmehr Department of Economics University of Miami. August 25, 2011

Global Games I. Mehdi Shadmehr Department of Economics University of Miami. August 25, 2011 Global Games I Mehdi Shadmehr Department of Economics University of Miami August 25, 2011 Definition What is a global game? 1 A game of incomplete information. 2 There is an unknown and uncertain parameter.

More information

Cooperative bargaining: independence and monotonicity imply disagreement

Cooperative bargaining: independence and monotonicity imply disagreement Cooperative bargaining: independence and monotonicity imply disagreement Shiran Rachmilevitch September 23, 2012 Abstract A unique bargaining solution satisfies restricted monotonicity, independence of

More information

4. Opponent Forecasting in Repeated Games

4. Opponent Forecasting in Repeated Games 4. Opponent Forecasting in Repeated Games Julian and Mohamed / 2 Learning in Games Literature examines limiting behavior of interacting players. One approach is to have players compute forecasts for opponents

More information

Bargaining Efficiency and the Repeated Prisoners Dilemma. Bhaskar Chakravorti* and John Conley**

Bargaining Efficiency and the Repeated Prisoners Dilemma. Bhaskar Chakravorti* and John Conley** Bargaining Efficiency and the Repeated Prisoners Dilemma Bhaskar Chakravorti* and John Conley** Published as: Bhaskar Chakravorti and John P. Conley (2004) Bargaining Efficiency and the repeated Prisoners

More information

Prediction and Playing Games

Prediction and Playing Games Prediction and Playing Games Vineel Pratap vineel@eng.ucsd.edu February 20, 204 Chapter 7 : Prediction, Learning and Games - Cesa Binachi & Lugosi K-Person Normal Form Games Each player k (k =,..., K)

More information

Small Sample of Related Literature

Small Sample of Related Literature UCLA IPAM July 2015 Learning in (infinitely) repeated games with n players. Prediction and stability in one-shot large (many players) games. Prediction and stability in large repeated games (big games).

More information

Area I: Contract Theory Question (Econ 206)

Area I: Contract Theory Question (Econ 206) Theory Field Exam Winter 2011 Instructions You must complete two of the three areas (the areas being (I) contract theory, (II) game theory, and (III) psychology & economics). Be sure to indicate clearly

More information

6.891 Games, Decision, and Computation February 5, Lecture 2

6.891 Games, Decision, and Computation February 5, Lecture 2 6.891 Games, Decision, and Computation February 5, 2015 Lecture 2 Lecturer: Constantinos Daskalakis Scribe: Constantinos Daskalakis We formally define games and the solution concepts overviewed in Lecture

More information

Selecting Efficient Correlated Equilibria Through Distributed Learning. Jason R. Marden

Selecting Efficient Correlated Equilibria Through Distributed Learning. Jason R. Marden 1 Selecting Efficient Correlated Equilibria Through Distributed Learning Jason R. Marden Abstract A learning rule is completely uncoupled if each player s behavior is conditioned only on his own realized

More information

1 Lattices and Tarski s Theorem

1 Lattices and Tarski s Theorem MS&E 336 Lecture 8: Supermodular games Ramesh Johari April 30, 2007 In this lecture, we develop the theory of supermodular games; key references are the papers of Topkis [7], Vives [8], and Milgrom and

More information

Computational Game Theory Spring Semester, 2005/6. Lecturer: Yishay Mansour Scribe: Ilan Cohen, Natan Rubin, Ophir Bleiberg*

Computational Game Theory Spring Semester, 2005/6. Lecturer: Yishay Mansour Scribe: Ilan Cohen, Natan Rubin, Ophir Bleiberg* Computational Game Theory Spring Semester, 2005/6 Lecture 5: 2-Player Zero Sum Games Lecturer: Yishay Mansour Scribe: Ilan Cohen, Natan Rubin, Ophir Bleiberg* 1 5.1 2-Player Zero Sum Games In this lecture

More information

Distributed Learning based on Entropy-Driven Game Dynamics

Distributed Learning based on Entropy-Driven Game Dynamics Distributed Learning based on Entropy-Driven Game Dynamics Bruno Gaujal joint work with Pierre Coucheney and Panayotis Mertikopoulos Inria Aug., 2014 Model Shared resource systems (network, processors)

More information

Recap Social Choice Functions Fun Game Mechanism Design. Mechanism Design. Lecture 13. Mechanism Design Lecture 13, Slide 1

Recap Social Choice Functions Fun Game Mechanism Design. Mechanism Design. Lecture 13. Mechanism Design Lecture 13, Slide 1 Mechanism Design Lecture 13 Mechanism Design Lecture 13, Slide 1 Lecture Overview 1 Recap 2 Social Choice Functions 3 Fun Game 4 Mechanism Design Mechanism Design Lecture 13, Slide 2 Notation N is the

More information

Algorithmic Strategy Complexity

Algorithmic Strategy Complexity Algorithmic Strategy Complexity Abraham Neyman aneyman@math.huji.ac.il Hebrew University of Jerusalem Jerusalem Israel Algorithmic Strategy Complexity, Northwesten 2003 p.1/52 General Introduction The

More information

Near-Potential Games: Geometry and Dynamics

Near-Potential Games: Geometry and Dynamics Near-Potential Games: Geometry and Dynamics Ozan Candogan, Asuman Ozdaglar and Pablo A. Parrilo January 29, 2012 Abstract Potential games are a special class of games for which many adaptive user dynamics

More information

Mathematical Economics - PhD in Economics

Mathematical Economics - PhD in Economics - PhD in Part 1: Supermodularity and complementarity in the one-dimensional and Paulo Brito ISEG - Technical University of Lisbon November 24, 2010 1 2 - Supermodular optimization 3 one-dimensional 4 Supermodular

More information

Game Theory Fall 2003

Game Theory Fall 2003 Game Theory Fall 2003 Problem Set 1 [1] In this problem (see FT Ex. 1.1) you are asked to play with arbitrary 2 2 games just to get used to the idea of equilibrium computation. Specifically, consider the

More information

An Experiment to Evaluate Bayesian Learning of Nash Equilibrium Play

An Experiment to Evaluate Bayesian Learning of Nash Equilibrium Play Games and Economic Behavior 34, 11 33 (2001) doi:10.1006/game.1999.0786, available online at http://www.idealibrary.com on An Experiment to Evaluate Bayesian Learning of Nash Equilibrium Play James C.

More information

The Game of Normal Numbers

The Game of Normal Numbers The Game of Normal Numbers Ehud Lehrer September 4, 2003 Abstract We introduce a two-player game where at each period one player, say, Player 2, chooses a distribution and the other player, Player 1, a

More information

Computational Evolutionary Game Theory and why I m never using PowerPoint for another presentation involving maths ever again

Computational Evolutionary Game Theory and why I m never using PowerPoint for another presentation involving maths ever again Computational Evolutionary Game Theory and why I m never using PowerPoint for another presentation involving maths ever again Enoch Lau 5 September 2007 Outline What is evolutionary game theory? Why evolutionary

More information

Incentive Compatible Regression Learning

Incentive Compatible Regression Learning Incentive Compatible Regression Learning Ofer Dekel 1 Felix Fischer 2 Ariel D. Procaccia 1 1 School of Computer Science and Engineering The Hebrew University of Jerusalem 2 Institut für Informatik Ludwig-Maximilians-Universität

More information

Nonlinear Dynamics between Micromotives and Macrobehavior

Nonlinear Dynamics between Micromotives and Macrobehavior Nonlinear Dynamics between Micromotives and Macrobehavior Saori Iwanaga & kira Namatame Dept. of Computer Science, National Defense cademy, Yokosuka, 239-8686, JPN, E-mail: {g38042, nama}@nda.ac.jp Tel:

More information

LONG PERSUASION GAMES

LONG PERSUASION GAMES LONG PERSUASION GAMES FRANÇOISE FORGES FRÉDÉRIC KOESSLER CESIFO WORKING PAPER NO. 1669 CATEGORY 9: INDUSTRIAL ORGANISATION FEBRUARY 2006 An electronic version of the paper may be downloaded from the SSRN

More information

Achieving Pareto Optimality Through Distributed Learning

Achieving Pareto Optimality Through Distributed Learning 1 Achieving Pareto Optimality Through Distributed Learning Jason R. Marden, H. Peyton Young, and Lucy Y. Pao Abstract We propose a simple payoff-based learning rule that is completely decentralized, and

More information

Minimax regret and strategic uncertainty

Minimax regret and strategic uncertainty Minimax regret and strategic uncertainty Ludovic Renou University of Leicester Karl H. Schlag Universitat Pompeu Fabra March 31, 2008 Abstract This paper introduces a new solution concept, a minimax regret

More information

Convergence and No-Regret in Multiagent Learning

Convergence and No-Regret in Multiagent Learning Convergence and No-Regret in Multiagent Learning Michael Bowling Department of Computing Science University of Alberta Edmonton, Alberta Canada T6G 2E8 bowling@cs.ualberta.ca Abstract Learning in a multiagent

More information

Coevolutionary Modeling in Networks 1/39

Coevolutionary Modeling in Networks 1/39 Coevolutionary Modeling in Networks Jeff S. Shamma joint work with Ibrahim Al-Shyoukh & Georgios Chasparis & IMA Workshop on Analysis and Control of Network Dynamics October 19 23, 2015 Jeff S. Shamma

More information

Sequential Equilibria of Multi-Stage Games with Infinite Sets of Types and Actions

Sequential Equilibria of Multi-Stage Games with Infinite Sets of Types and Actions Sequential Equilibria of Multi-Stage Games with Infinite Sets of Types and Actions by Roger B. Myerson and Philip J. Reny* Draft notes October 2011 http://home.uchicago.edu/~preny/papers/bigseqm.pdf Abstract:

More information

Stackelberg-solvable games and pre-play communication

Stackelberg-solvable games and pre-play communication Stackelberg-solvable games and pre-play communication Claude d Aspremont SMASH, Facultés Universitaires Saint-Louis and CORE, Université catholique de Louvain, Louvain-la-Neuve, Belgium Louis-André Gérard-Varet

More information

First Prev Next Last Go Back Full Screen Close Quit. Game Theory. Giorgio Fagiolo

First Prev Next Last Go Back Full Screen Close Quit. Game Theory. Giorgio Fagiolo Game Theory Giorgio Fagiolo giorgio.fagiolo@univr.it https://mail.sssup.it/ fagiolo/welcome.html Academic Year 2005-2006 University of Verona Summary 1. Why Game Theory? 2. Cooperative vs. Noncooperative

More information

Math 152: Applicable Mathematics and Computing

Math 152: Applicable Mathematics and Computing Math 152: Applicable Mathematics and Computing May 26, 2017 May 26, 2017 1 / 17 Announcements Homework 6 was posted on Wednesday, due next Wednesday. Last homework is Homework 7, posted next week (due

More information

Correlated Equilibria of Classical Strategic Games with Quantum Signals

Correlated Equilibria of Classical Strategic Games with Quantum Signals Correlated Equilibria of Classical Strategic Games with Quantum Signals Pierfrancesco La Mura Leipzig Graduate School of Management plamura@hhl.de comments welcome September 4, 2003 Abstract Correlated

More information

Title: The Castle on the Hill. Author: David K. Levine. Department of Economics UCLA. Los Angeles, CA phone/fax

Title: The Castle on the Hill. Author: David K. Levine. Department of Economics UCLA. Los Angeles, CA phone/fax Title: The Castle on the Hill Author: David K. Levine Department of Economics UCLA Los Angeles, CA 90095 phone/fax 310-825-3810 email dlevine@ucla.edu Proposed Running Head: Castle on the Hill Forthcoming:

More information

Game Theory. Professor Peter Cramton Economics 300

Game Theory. Professor Peter Cramton Economics 300 Game Theory Professor Peter Cramton Economics 300 Definition Game theory is the study of mathematical models of conflict and cooperation between intelligent and rational decision makers. Rational: each

More information

A (Brief) Introduction to Game Theory

A (Brief) Introduction to Game Theory A (Brief) Introduction to Game Theory Johanne Cohen PRiSM/CNRS, Versailles, France. Goal Goal is a Nash equilibrium. Today The game of Chicken Definitions Nash Equilibrium Rock-paper-scissors Game Mixed

More information

On the Stability of the Cournot Equilibrium: An Evolutionary Approach

On the Stability of the Cournot Equilibrium: An Evolutionary Approach On the Stability of the Cournot Equilibrium: An Evolutionary Approach Cars H. Hommes Marius I. Ochea Jan Tuinstra 15th December 2011 Abstract We construct an evolutionary version of Theocharis (1960 s

More information

General-sum games. I.e., pretend that the opponent is only trying to hurt you. If Column was trying to hurt Row, Column would play Left, so

General-sum games. I.e., pretend that the opponent is only trying to hurt you. If Column was trying to hurt Row, Column would play Left, so General-sum games You could still play a minimax strategy in general- sum games I.e., pretend that the opponent is only trying to hurt you But this is not rational: 0, 0 3, 1 1, 0 2, 1 If Column was trying

More information

Evolutionary Dynamics and Extensive Form Games by Ross Cressman. Reviewed by William H. Sandholm *

Evolutionary Dynamics and Extensive Form Games by Ross Cressman. Reviewed by William H. Sandholm * Evolutionary Dynamics and Extensive Form Games by Ross Cressman Reviewed by William H. Sandholm * Noncooperative game theory is one of a handful of fundamental frameworks used for economic modeling. It

More information

Hybrid Machine Learning Algorithms

Hybrid Machine Learning Algorithms Hybrid Machine Learning Algorithms Umar Syed Princeton University Includes joint work with: Rob Schapire (Princeton) Nina Mishra, Alex Slivkins (Microsoft) Common Approaches to Machine Learning!! Supervised

More information

Tijmen Daniëls Universiteit van Amsterdam. Abstract

Tijmen Daniëls Universiteit van Amsterdam. Abstract Pure strategy dominance with quasiconcave utility functions Tijmen Daniëls Universiteit van Amsterdam Abstract By a result of Pearce (1984), in a finite strategic form game, the set of a player's serially

More information

Secure Protocols or How Communication Generates Correlation

Secure Protocols or How Communication Generates Correlation journal of economic theory 83, 6989 (1998) article no. ET982444 Secure Protocols or How Communication Generates Correlation Olivier Gossner* CORE, Voie du Roman Pays 34, B-1348 Louvain-la-Neuve, Belgium

More information

Theory Field Examination Game Theory (209A) Jan Question 1 (duopoly games with imperfect information)

Theory Field Examination Game Theory (209A) Jan Question 1 (duopoly games with imperfect information) Theory Field Examination Game Theory (209A) Jan 200 Good luck!!! Question (duopoly games with imperfect information) Consider a duopoly game in which the inverse demand function is linear where it is positive

More information

Algorithmic Game Theory and Applications. Lecture 4: 2-player zero-sum games, and the Minimax Theorem

Algorithmic Game Theory and Applications. Lecture 4: 2-player zero-sum games, and the Minimax Theorem Algorithmic Game Theory and Applications Lecture 4: 2-player zero-sum games, and the Minimax Theorem Kousha Etessami 2-person zero-sum games A finite 2-person zero-sum (2p-zs) strategic game Γ, is a strategic

More information

Better-Reply Dynamics with Bounded Recall

Better-Reply Dynamics with Bounded Recall ` Kyiv School of Economics & Kyiv Economics Institute DISCUSSION PAPER SERIES Better-Reply Dynamics with Bounded Recall Andriy Zapechelnyuk (Kyiv School of Economics and Kyiv Economics Institute) First

More information

Uncoupled automata and pure Nash equilibria

Uncoupled automata and pure Nash equilibria Int J Gae Theory (200) 39:483 502 DOI 0.007/s0082-00-0227-9 ORIGINAL PAPER Uncoupled autoata and pure Nash equilibria Yakov Babichenko Accepted: 2 February 200 / Published online: 20 March 200 Springer-Verlag

More information

Game Theory. Greg Plaxton Theory in Programming Practice, Spring 2004 Department of Computer Science University of Texas at Austin

Game Theory. Greg Plaxton Theory in Programming Practice, Spring 2004 Department of Computer Science University of Texas at Austin Game Theory Greg Plaxton Theory in Programming Practice, Spring 2004 Department of Computer Science University of Texas at Austin Bimatrix Games We are given two real m n matrices A = (a ij ), B = (b ij

More information

Quantum Games Have No News for Economists 1

Quantum Games Have No News for Economists 1 Quantum Games Have No News for Economists 1 By David K. Levine 2 This version: September 1, 2005 First version: December 3, 2005 Quantum computing offers the possibility of massively parallel computing

More information

Informed Principal in Private-Value Environments

Informed Principal in Private-Value Environments Informed Principal in Private-Value Environments Tymofiy Mylovanov Thomas Tröger University of Bonn June 21, 2008 1/28 Motivation 2/28 Motivation In most applications of mechanism design, the proposer

More information

Learning to play partially-specified equilibrium

Learning to play partially-specified equilibrium Learning to play partially-specified equilibrium Ehud Lehrer and Eilon Solan August 29, 2007 August 29, 2007 First draft: June 2007 Abstract: In a partially-specified correlated equilibrium (PSCE ) the

More information