Smooth Calibration, Leaky Forecasts, Finite Recall, and Nash Dynamics

Size: px
Start display at page:

Download "Smooth Calibration, Leaky Forecasts, Finite Recall, and Nash Dynamics"

Transcription

1 Smooth Calibration, Leaky Forecasts, Finite Recall, and Nash Dynamics Sergiu Hart August 2016

2 Smooth Calibration, Leaky Forecasts, Finite Recall, and Nash Dynamics Sergiu Hart Center for the Study of Rationality Dept of Mathematics Dept of Economics The Hebrew University of Jerusalem

3 Joint work with Dean P. Foster University of Pennsylvania & Amazon Research

4 Papers

5 Papers Dean P. Foster and Sergiu Hart Smooth Calibration, Leaky Forecasts, Finite Recall, and Nash Dynamics 2012 revised:

6 Papers Dean P. Foster and Sergiu Hart Smooth Calibration, Leaky Forecasts, Finite Recall, and Nash Dynamics 2012 revised: Dean P. Foster and Sergiu Hart An Integral Approach to Calibration 2016 (in preparation)

7 Calibration

8 Calibration Forecaster says: "The chance of rain tomorrow is p"

9 Calibration Forecaster says: "The chance of rain tomorrow is p" Forecaster is CALIBRATED if for every p: the proportion of rainy days among those days when the forecast was p equals p (or is close to p in the long run)

10 Calibration Forecaster says: "The chance of rain tomorrow is p" Forecaster is CALIBRATED if for every p: the proportion of rainy days among those days when the forecast was p equals p (or is close to p in the long run) Dawid 1982

11 Calibration

12 Calibration CALIBRATION can be guaranteed (no matter what the weather is)

13 Calibration CALIBRATION can be guaranteed (no matter what the weather is) Foster and Vohra 1994 [publ 1998]

14 Calibration CALIBRATION can be guaranteed (no matter what the weather is) Forecaster uses mixed forecasting (e.g.: with probability 1/2, forecast = 25% with probability 1/2, forecast = 60%) Foster and Vohra 1994 [publ 1998]

15 Calibration CALIBRATION can be guaranteed (no matter what the weather is) Forecaster uses mixed forecasting (e.g.: with probability 1/2, forecast = 25% with probability 1/2, forecast = 60%) Foster and Vohra 1994 [publ 1998] Hart 1995: proof using Minimax Theorem

16 The MINIMAX Theorem

17 The MINIMAX Theorem THEOREM (von Neumann 1928) IF X R n, Y R m are compact convex sets, and f : X Y R is a continuous function that is convex-concave, i.e., f(, y) : X R is convex for fixed y, and f(x, ) : Y R is concave for fixed x, min max x X y Y THEN f(x, y) = max min y Y x X f(x, y).

18 The MINIMAX Theorem

19 The MINIMAX Theorem THEOREM (von Neumann 1928)

20 The MINIMAX Theorem THEOREM (von Neumann 1928) IF For every strategy of the opponent I have a strategy such that my payoff is at least v

21 The MINIMAX Theorem THEOREM (von Neumann 1928) IF For every strategy of the opponent I have a strategy such that my payoff is at least v THEN I have a strategy that guarantees that my payoff is at least v (for every strategy of the opponent)

22 The MINIMAX Theorem THEOREM (von Neumann 1928) IF For every strategy of the opponent I have a strategy such that my payoff is at least v THEN I have a strategy that guarantees that my payoff is at least v (for every strategy of the opponent) finite game;

23 The MINIMAX Theorem THEOREM (von Neumann 1928) IF For every strategy of the opponent I have a strategy such that my payoff is at least v THEN I have a strategy that guarantees that my payoff is at least v (for every strategy of the opponent) finite game; probabilistic (mixed) strategies

24 Calibration Proof: Minimax

25 Calibration Proof: Minimax FINITE δ-grid, FINITE HORIZON FINITE 2-person 0-sum game

26 Calibration Proof: Minimax FINITE δ-grid, FINITE HORIZON FINITE 2-person 0-sum game IF the strategy of the rainmaker IS KNOWN THEN the forecaster can get δ-calibrated forecasts

27 Calibration Proof: Minimax FINITE δ-grid, FINITE HORIZON FINITE 2-person 0-sum game IF the strategy of the rainmaker IS KNOWN THEN the forecaster can get δ-calibrated forecasts MINIMAX THEOREM the forecaster can GUARANTEE δ-calibrated forecasts (without knowing the rainmaker s strategy)

28 Calibration Proof: Minimax FINITE δ-grid, FINITE HORIZON FINITE 2-person 0-sum game IF the strategy of the rainmaker IS KNOWN THEN the forecaster can get δ-calibrated forecasts MINIMAX THEOREM the forecaster can GUARANTEE δ-calibrated forecasts (without knowing the rainmaker s strategy) Hart 1995

29 Calibration

30 Calibration CALIBRATION can be guaranteed (no matter what the weather is) Forecaster uses mixed forecasting (e.g.: with probability 1/2, forecast = 25% with probability 1/2, forecast = 60%) Foster and Vohra 1994 [publ 1998] Hart 1995: proof using Minimax Theorem

31 Calibration CALIBRATION can be guaranteed (no matter what the weather is) Forecaster uses mixed forecasting (e.g.: with probability 1/2, forecast = 25% with probability 1/2, forecast = 60%) Foster and Vohra 1994 [publ 1998] Hart 1995: proof using Minimax Theorem Hart and Mas-Colell 1996 [publ 2000]: proof using Blackwell s Approachability

32 Calibration CALIBRATION can be guaranteed (no matter what the weather is) Forecaster uses mixed forecasting (e.g.: with probability 1/2, forecast = 25% with probability 1/2, forecast = 60%) Foster and Vohra 1994 [publ 1998] Hart 1995: proof using Minimax Theorem Hart and Mas-Colell 1996 [publ 2000]: proof using Blackwell s Approachability Foster 1999: a simple way to calibrate

33 Calibration CALIBRATION can be guaranteed (no matter what the weather is) Forecaster uses mixed forecasting (e.g.: with probability 1/2, forecast = 25% with probability 1/2, forecast = 60%) BACK-casting (not fore-casting!) ("Politicians Lemma") Foster and Vohra 1994 [publ 1998] Hart 1995: proof using Minimax Theorem Hart and Mas-Colell 1996 [publ 2000]: proof using Blackwell s Approachability Foster 1999: a simple way to calibrate

34 No Calibration

35 No Calibration CALIBRATION cannot be guaranteed when:

36 No Calibration CALIBRATION cannot be guaranteed when: Forecast is known before the rain/no-rain decision is made ("LEAKY FORECASTS")

37 No Calibration CALIBRATION cannot be guaranteed when: Forecast is known before the rain/no-rain decision is made ("LEAKY FORECASTS") Forecaster uses a deterministic forecasting procedure

38 No Calibration CALIBRATION cannot be guaranteed when: Forecast is known before the rain/no-rain decision is made ("LEAKY FORECASTS") Forecaster uses a deterministic forecasting procedure Oakes 1985

39 Smooth Calibration

40 Smooth Calibration SMOOTH CALIBRATION: combine together the days when the forecast was close to p

41 Smooth Calibration SMOOTH CALIBRATION: combine together the days when the forecast was close to p (smooth out the calibration score)

42 Smooth Calibration SMOOTH CALIBRATION: combine together the days when the forecast was close to p (smooth out the calibration score) Main Result: There exists a deterministic procedure that is SMOOTHLY CALIBRATED.

43 Smooth Calibration SMOOTH CALIBRATION: combine together the days when the forecast was close to p (smooth out the calibration score) Main Result: There exists a deterministic procedure that is SMOOTHLY CALIBRATED. Deterministic result holds also when the forecasts are leaked

44 Calibration

45 Calibration Set of ACTIONS: A R m (finite set) Set of FORECASTS: C = (A) Example: A = {0, 1}, C = [0, 1]

46 Calibration Set of ACTIONS: A R m (finite set) Set of FORECASTS: C = (A) Example: A = {0, 1}, C = [0, 1] CALIBRATION SCORE at time T for a sequence (a t, c t ) t=1,2,... in A C:

47 Calibration Set of ACTIONS: A R m (finite set) Set of FORECASTS: C = (A) Example: A = {0, 1}, C = [0, 1] CALIBRATION SCORE at time T for a sequence (a t, c t ) t=1,2,... in A C: K T = 1 T ā t c t T where ā t = t=1 T s=1 1 c s =c t a s T s=1 1 c s =c t

48 Smooth Calibration

49 Smooth Calibration A "smoothing function" is a Lipschitz function Λ : C C [0, 1] with Λ(c, c) = 1 for every c.

50 Smooth Calibration A "smoothing function" is a Lipschitz function Λ : C C [0, 1] with Λ(c, c) = 1 for every c. Λ(x, c) = "weight" of x relative to c

51 Smooth Calibration A "smoothing function" is a Lipschitz function Λ : C C [0, 1] with Λ(c, c) = 1 for every c. Λ(x, c) = "weight" of x relative to c Example: Λ(x, c) = [δ x c ] + /δ

52 Indicator Function

53 Indicator Function 1 x=c 1 c x

54 Indicator and Λ Functions Λ(x, c) 1 x=c 1 c δ c c + δ x

55 Smooth Calibration A "smoothing function" is a Lipschitz function Λ : C C [0, 1] with Λ(c, c) = 1 for every c. Λ(x, c) = "weight" of x relative to c

56 Smooth Calibration A "smoothing function" is a Lipschitz function Λ : C C [0, 1] with Λ(c, c) = 1 for every c. Λ(x, c) = "weight" of x relative to c Λ-CALIBRATION SCORE at time T :

57 Smooth Calibration A "smoothing function" is a Lipschitz function Λ : C C [0, 1] with Λ(c, c) = 1 for every c. Λ(x, c) = "weight" of x relative to c Λ-CALIBRATION SCORE at time T : K Λ T = 1 T T ā Λ t cλ t t=1

58 Smooth Calibration A "smoothing function" is a Lipschitz function Λ : C C [0, 1] with Λ(c, c) = 1 for every c. Λ(x, c) = "weight" of x relative to c Λ-CALIBRATION SCORE at time T : K Λ T = 1 T T t=1 ā Λ t cλ t ā Λ t = T s=1 Λ(c s, c t ) a s T s=1 Λ(c s, c t ), c Λ t = T s=1 Λ(c s, c t ) c s T s=1 Λ(c s, c t )

59 The Calibration Game

60 The Calibration Game In each period t = 1, 2,... :

61 The Calibration Game In each period t = 1, 2,... : Player C ("forecaster") chooses c t C Player A ("action") chooses a t A

62 The Calibration Game In each period t = 1, 2,... : Player C ("forecaster") chooses c t C Player A ("action") chooses a t A a t and c t chosen simultaneously: REGULAR setup

63 The Calibration Game In each period t = 1, 2,... : Player C ("forecaster") chooses c t C Player A ("action") chooses a t A a t and c t chosen simultaneously: REGULAR setup a t chosen after c t is disclosed: LEAKY setup

64 The Calibration Game In each period t = 1, 2,... : Player C ("forecaster") chooses c t C Player A ("action") chooses a t A a t and c t chosen simultaneously: REGULAR setup a t chosen after c t is disclosed: LEAKY setup Full monitoring, perfect recall

65 Smooth Calibration

66 Smooth Calibration A strategy of Player C is

67 Smooth Calibration A strategy of Player C is (ε, L)-SMOOTHLY CALIBRATED

68 Smooth Calibration A strategy of Player C is (ε, L)-SMOOTHLY CALIBRATED if there is T 0 such that K Λ T ε holds for: every T T 0, every strategy of Player A, and every smoothing function Λ with Lipschitz constant L

69 Smooth Calibration: Result

70 Smooth Calibration: Result For every ε > 0 and L < there exists a procedure that is (ε, L)-SMOOTHLY CALIBRATED.

71 Smooth Calibration: Result For every ε > 0 and L < there exists a procedure that is (ε, L)-SMOOTHLY CALIBRATED. Moreover: it is deterministic,

72 Smooth Calibration: Result For every ε > 0 and L < there exists a procedure that is (ε, L)-SMOOTHLY CALIBRATED. Moreover: it is deterministic, it has finite recall (= finite window, stationary),

73 Smooth Calibration: Result For every ε > 0 and L < there exists a procedure that is (ε, L)-SMOOTHLY CALIBRATED. Moreover: it is deterministic, it has finite recall (= finite window, stationary), it uses a fixed finite grid, and

74 Smooth Calibration: Result For every ε > 0 and L < there exists a procedure that is (ε, L)-SMOOTHLY CALIBRATED. Moreover: it is deterministic, it has finite recall (= finite window, stationary), it uses a fixed finite grid, and forecasts may be leaked

75 Smooth Calibration: Implications

76 Smooth Calibration: Implications For forecasting:

77 Smooth Calibration: Implications For forecasting: nothing good... (easier to pass the test)

78 Smooth Calibration: Implications For forecasting: nothing good... (easier to pass the test) For game dynamics:

79 Smooth Calibration: Implications For forecasting: nothing good... (easier to pass the test) For game dynamics: Nash dynamics

80 Calibrated Learning

81 Calibrated Learning CALIBRATED LEARNING:

82 Calibrated Learning CALIBRATED LEARNING: Foster and Vohra 1997

83 Calibrated Learning CALIBRATED LEARNING: every player uses a calibrated forecast on the play of the other players Foster and Vohra 1997

84 Calibrated Learning CALIBRATED LEARNING: every player uses a calibrated forecast on the play of the other players every player best replies to his forecast Foster and Vohra 1997

85 Calibrated Learning CALIBRATED LEARNING: every player uses a calibrated forecast on the play of the other players every player best replies to his forecast time average of play (= empirical distribution of play) is an approximate CORRELATED EQUILIBRIUM Foster and Vohra 1997

86 Smooth Calibrated Learning

87 Smooth Calibrated Learning SMOOTH CALIBRATED LEARNING:

88 Smooth Calibrated Learning SMOOTH CALIBRATED LEARNING: (F) A smoothly calibrated deterministic procedure, which gives in each period t a "forecasted" joint distribution of play c t in (A) (Π i N A i )

89 Smooth Calibrated Learning SMOOTH CALIBRATED LEARNING: (F) A smoothly calibrated deterministic procedure, which gives in each period t a "forecasted" joint distribution of play c t in (A) (Π i N A i ) (P) A Lipschitz approximate best-reply mapping g i : (A) (A i ) for each player i

90 Smooth Calibrated Learning SMOOTH CALIBRATED LEARNING: (F) A smoothly calibrated deterministic procedure, which gives in each period t a "forecasted" joint distribution of play c t in (A) (Π i N A i ) (P) A Lipschitz approximate best-reply mapping g i : (A) (A i ) for each player i In each period t, each player i: 1. runs the procedure (F) to get c t 2. plays g i (c t ) given by (P)

91 Smooth Calibrated Learning

92 Smooth Calibrated Learning SMOOTH CALIBRATED LEARNING (with appropriate parameters):

93 Smooth Calibrated Learning SMOOTH CALIBRATED LEARNING (with appropriate parameters): is a stochastic uncoupled dynamic

94 Smooth Calibrated Learning SMOOTH CALIBRATED LEARNING (with appropriate parameters): is a stochastic uncoupled dynamic has finite MEMORY and is stationary

95 Smooth Calibrated Learning SMOOTH CALIBRATED LEARNING (with appropriate parameters): is a stochastic uncoupled dynamic has finite MEMORY and is stationary Nash ε-equilibria are played at least 1 ε of the time in the long run (a.s.)

96 Smooth Calibrated Learning: Proof

97 Smooth Calibrated Learning: Proof play t = g(c t )

98 Smooth Calibrated Learning: Proof smooth calibration play t = g(c t ) c t

99 Smooth Calibrated Learning: Proof smooth calibration play t = g(c t ) c t use: g is Lipschitz

100 Smooth Calibrated Learning: Proof smooth calibration play t = g(c t ) c t use: g is Lipschitz g approximate best reply play t is an approximate Nash equilibrium

101 Smooth Calibrated Learning: Proof smooth calibration play t = g(c t ) c t use: g is Lipschitz g approximate best reply play t is an approximate Nash equilibrium g(play t ) = g(g(c t )) g(c t ) = play t

102 Why Smooth?

103 Why Smooth? SMOOTH CALIBRATION

104 Why Smooth? SMOOTH CALIBRATION deterministic

105 Why Smooth? SMOOTH CALIBRATION deterministic same forecast for all players

106 Why Smooth? SMOOTH CALIBRATION deterministic same forecast for all players leaky

107 Why Smooth? SMOOTH CALIBRATION deterministic same forecast for all players leaky actions depend on forecast

108 Why Smooth? SMOOTH CALIBRATION deterministic same forecast for all players leaky actions depend on forecast calibrated

109 Why Smooth? SMOOTH CALIBRATION deterministic same forecast for all players leaky actions depend on forecast calibrated forecast equals actions

110 Why Smooth? SMOOTH CALIBRATION deterministic same forecast for all players leaky actions depend on forecast calibrated forecast equals actions FIXED POINT

111 Why Smooth? SMOOTH CALIBRATION deterministic same forecast for all players leaky actions depend on forecast calibrated forecast equals actions FIXED POINT SMOOTH BEST REPLY

112 Why Smooth? SMOOTH CALIBRATION deterministic same forecast for all players leaky actions depend on forecast calibrated forecast equals actions FIXED POINT SMOOTH BEST REPLY fixed point = NASH EQUILIBRIUM

113 Dynamics and Equilibrium

114 Dynamics and Equilibrium "LAW OF CONSERVATION OF COORDINATION":

115 Dynamics and Equilibrium "LAW OF CONSERVATION OF COORDINATION": There must be some COORDINATION

116 Dynamics and Equilibrium "LAW OF CONSERVATION OF COORDINATION": There must be some COORDINATION either in the EQUILIBRIUM notion,

117 Dynamics and Equilibrium "LAW OF CONSERVATION OF COORDINATION": There must be some COORDINATION either in the EQUILIBRIUM notion, or in the DYNAMIC

118 Dynamics and Equilibrium "LAW OF CONSERVATION OF COORDINATION": There must be some COORDINATION either in the EQUILIBRIUM notion, (CORRELATED EQUILIBRIUM) or in the DYNAMIC

119 Dynamics and Equilibrium "LAW OF CONSERVATION OF COORDINATION": There must be some COORDINATION either in the EQUILIBRIUM notion, (CORRELATED EQUILIBRIUM) or in the DYNAMIC (NASH EQUILIBRIUM)

120 Dynamics and Equilibrium "LAW OF CONSERVATION OF COORDINATION": There must be some COORDINATION either in the EQUILIBRIUM notion, (CORRELATED EQUILIBRIUM) or in the DYNAMIC (NASH EQUILIBRIUM) (Hart and Mas-Colell 2003)

121 Integral Approach to Calibration

122 Integral Approach to Calibration INTEGRAL CALIBRATION score:

123 Integral Approach to Calibration INTEGRAL CALIBRATION score: G Λ t (z) = 1 t t Λ(c s, z)(a s c s ) s=1

124 Integral Approach to Calibration INTEGRAL CALIBRATION score: G Λ t (z) = 1 t ( G Λ t 2 = t s=1 C Λ(c s, z)(a s c s ) G Λ t (z) 2 dζ(z) ) 1/2

125 Integral Approach to Calibration INTEGRAL CALIBRATION score: G Λ t (z) = 1 t ( G Λ t 2 = t s=1 C Λ(c s, z)(a s c s ) G Λ t (z) 2 dζ(z) ) 1/2 INTEGRAL CALIBRATION: Guarantee that G Λ t 2 ε for all t large enough, uniformly

126 Integral Calibration

127 Integral Calibration Choose the forecast c t such that Λ(c t, z) G Λ t 1 (z) dζ(z) (a c t) 0 for every a A

128 Integral Calibration Choose the forecast c t such that Λ(c t, z) G Λ t 1 (z) dζ(z) (a c t) 0 for every a A Existence of such c t is guaranteed by a Fixed Point Theorem

129 Integral Calibration Choose the forecast c t such that Λ(c t, z) G Λ t 1 (z) dζ(z) (a c t) 0 for every a A Existence of such c t is guaranteed by a Fixed Point Theorem Deterministic Integral Calibration

130 Integral Calibration Choose the forecast c t such that Λ(c t, z) G Λ t 1 (z) dζ(z) (a c t) 0 for every a A Existence of such c t is guaranteed by a Fixed Point Theorem Deterministic Integral Calibration Deterministic Smooth Calibration

131 Integral Calibration Choose the forecast c t such that Λ(c t, z) G Λ t 1 (z) dζ(z) (a c t) 0 for every a A Existence of such c t is guaranteed by a Fixed Point Theorem Deterministic Integral Calibration Deterministic Smooth Calibration Deterministic Weak Calibration

132 Integral Calibration Choose the forecast c t such that Λ(c t, z) G Λ t 1 (z) dζ(z) (a c t) 0 for every a A Existence of such c t is guaranteed by a Fixed Point Theorem Deterministic Integral Calibration Deterministic Smooth Calibration Deterministic Weak Calibration Almost Deterministic Calibration

133 Deterministic Nonlinear Calibration Choose the forecast c t such that Λ(c t, z) G Λ t 1 (z) dζ(z) (a c t) 0 for every a A Existence of such c t is guaranteed by a Fixed Point Theorem Deterministic Integral Calibration Deterministic Smooth Calibration Deterministic Weak Calibration Almost Deterministic Calibration

134 Integral Calibration

135 Integral Calibration Choose the distribution of the forecast c t s.t. [ ] E Λ(c t, z) G Λ t 1 (z) dζ(z) (a c t) 0 for every a A

136 Integral Calibration Choose the distribution of the forecast c t s.t. [ ] E Λ(c t, z) G Λ t 1 (z) dζ(z) (a c t) 0 for every a A Existence of such distribution is guaranteed by a Separation / Minimax Theorem

137 Integral Calibration Choose the distribution of the forecast c t s.t. [ ] E Λ(c t, z) G Λ t 1 (z) dζ(z) (a c t) 0 for every a A Existence of such distribution is guaranteed by a Separation / Minimax Theorem Probabilistic Calibration

138 Stochastic Linear Calibration Choose the distribution of the forecast c t s.t. [ ] E Λ(c t, z) G Λ t 1 (z) dζ(z) (a c t) 0 for every a A Existence of such distribution is guaranteed by a Separation / Minimax Theorem Probabilistic Calibration

139 Integral Approach to Calibration

140 Integral Approach to Calibration fixed point deterministic calibration

141 Integral Approach to Calibration fixed point deterministic calibration separation / minimax stochastic calibration

142 Previous Work

143 Previous Work Calibration and Correlated Equilibria:

144 Previous Work Calibration and Correlated Equilibria: Foster and Vohra 1998 Foster and Vohra 1997 Hart and Mas-Colell 2000,...

145 Previous Work Calibration and Correlated Equilibria: Foster and Vohra 1998 Foster and Vohra 1997 Hart and Mas-Colell 2000,... Weak Calibration (deterministic):

146 Previous Work Calibration and Correlated Equilibria: Foster and Vohra 1998 Foster and Vohra 1997 Hart and Mas-Colell 2000,... Weak Calibration (deterministic): Kakade and Foster 2004 / 2008 Foster and Kakade 2006

147 Previous Work

148 Previous Work Nash Dynamics:

149 Previous Work Nash Dynamics: Foster and Young 2003 Kakade and Foster 2004 / 2008 Foster and Young 2006 Hart and Mas-Colell 2006 Germano and Lugosi 2007 Young 2009 Babichenko 2012

150 Previous Work

151 Previous Work Online Regression Problem:

152 Previous Work Online Regression Problem: Foster 1991 J. Foster 1999 Vovk 2001 Azoury and Warmuth 2001 Cesa-Bianchi and Lugosi 2006

153 Successful Economic Forecasting

154 Successful Economic Forecasting correctly forecasting

155 Successful Economic Forecasting... correctly forecasting 8 of the last 5 recessions!

Calibration and Nash Equilibrium

Calibration and Nash Equilibrium Calibration and Nash Equilibrium Dean Foster University of Pennsylvania and Sham Kakade TTI September 22, 2005 What is a Nash equilibrium? Cartoon definition of NE: Leroy Lockhorn: I m drinking because

More information

Correlated Equilibria: Rationality and Dynamics

Correlated Equilibria: Rationality and Dynamics Correlated Equilibria: Rationality and Dynamics Sergiu Hart June 2010 AUMANN 80 SERGIU HART c 2010 p. 1 CORRELATED EQUILIBRIA: RATIONALITY AND DYNAMICS Sergiu Hart Center for the Study of Rationality Dept

More information

Walras-Bowley Lecture 2003

Walras-Bowley Lecture 2003 Walras-Bowley Lecture 2003 Sergiu Hart This version: September 2004 SERGIU HART c 2004 p. 1 ADAPTIVE HEURISTICS A Little Rationality Goes a Long Way Sergiu Hart Center for Rationality, Dept. of Economics,

More information

The No-Regret Framework for Online Learning

The No-Regret Framework for Online Learning The No-Regret Framework for Online Learning A Tutorial Introduction Nahum Shimkin Technion Israel Institute of Technology Haifa, Israel Stochastic Processes in Engineering IIT Mumbai, March 2013 N. Shimkin,

More information

Deterministic Calibration and Nash Equilibrium

Deterministic Calibration and Nash Equilibrium University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 2-2008 Deterministic Calibration and Nash Equilibrium Sham M. Kakade Dean P. Foster University of Pennsylvania Follow

More information

Prediction and Playing Games

Prediction and Playing Games Prediction and Playing Games Vineel Pratap vineel@eng.ucsd.edu February 20, 204 Chapter 7 : Prediction, Learning and Games - Cesa Binachi & Lugosi K-Person Normal Form Games Each player k (k =,..., K)

More information

Lecture 14: Approachability and regret minimization Ramesh Johari May 23, 2007

Lecture 14: Approachability and regret minimization Ramesh Johari May 23, 2007 MS&E 336 Lecture 4: Approachability and regret minimization Ramesh Johari May 23, 2007 In this lecture we use Blackwell s approachability theorem to formulate both external and internal regret minimizing

More information

Advanced Machine Learning

Advanced Machine Learning Advanced Machine Learning Learning and Games MEHRYAR MOHRI MOHRI@ COURANT INSTITUTE & GOOGLE RESEARCH. Outline Normal form games Nash equilibrium von Neumann s minimax theorem Correlated equilibrium Internal

More information

Learning correlated equilibria in games with compact sets of strategies

Learning correlated equilibria in games with compact sets of strategies Learning correlated equilibria in games with compact sets of strategies Gilles Stoltz a,, Gábor Lugosi b a cnrs and Département de Mathématiques et Applications, Ecole Normale Supérieure, 45 rue d Ulm,

More information

Self-stabilizing uncoupled dynamics

Self-stabilizing uncoupled dynamics Self-stabilizing uncoupled dynamics Aaron D. Jaggard 1, Neil Lutz 2, Michael Schapira 3, and Rebecca N. Wright 4 1 U.S. Naval Research Laboratory, Washington, DC 20375, USA. aaron.jaggard@nrl.navy.mil

More information

Game-Theoretic Learning:

Game-Theoretic Learning: Game-Theoretic Learning: Regret Minimization vs. Utility Maximization Amy Greenwald with David Gondek, Amir Jafari, and Casey Marks Brown University University of Pennsylvania November 17, 2004 Background

More information

arxiv: v1 [cs.lg] 8 Nov 2010

arxiv: v1 [cs.lg] 8 Nov 2010 Blackwell Approachability and Low-Regret Learning are Equivalent arxiv:0.936v [cs.lg] 8 Nov 200 Jacob Abernethy Computer Science Division University of California, Berkeley jake@cs.berkeley.edu Peter L.

More information

LEARNING IN CONCAVE GAMES

LEARNING IN CONCAVE GAMES LEARNING IN CONCAVE GAMES P. Mertikopoulos French National Center for Scientific Research (CNRS) Laboratoire d Informatique de Grenoble GSBE ETBC seminar Maastricht, October 22, 2015 Motivation and Preliminaries

More information

Blackwell s Approachability Theorem: A Generalization in a Special Case. Amy Greenwald, Amir Jafari and Casey Marks

Blackwell s Approachability Theorem: A Generalization in a Special Case. Amy Greenwald, Amir Jafari and Casey Marks Blackwell s Approachability Theorem: A Generalization in a Special Case Amy Greenwald, Amir Jafari and Casey Marks Department of Computer Science Brown University Providence, Rhode Island 02912 CS-06-01

More information

Game Theory. Greg Plaxton Theory in Programming Practice, Spring 2004 Department of Computer Science University of Texas at Austin

Game Theory. Greg Plaxton Theory in Programming Practice, Spring 2004 Department of Computer Science University of Texas at Austin Game Theory Greg Plaxton Theory in Programming Practice, Spring 2004 Department of Computer Science University of Texas at Austin Bimatrix Games We are given two real m n matrices A = (a ij ), B = (b ij

More information

Chapter 9. Mixed Extensions. 9.1 Mixed strategies

Chapter 9. Mixed Extensions. 9.1 Mixed strategies Chapter 9 Mixed Extensions We now study a special case of infinite strategic games that are obtained in a canonic way from the finite games, by allowing mixed strategies. Below [0, 1] stands for the real

More information

Existence of sparsely supported correlated equilibria

Existence of sparsely supported correlated equilibria Existence of sparsely supported correlated equilibria Fabrizio Germano fabrizio.germano@upf.edu Gábor Lugosi lugosi@upf.es Departament d Economia i Empresa Universitat Pompeu Fabra Ramon Trias Fargas 25-27

More information

The Query Complexity of Correlated Equilibria

The Query Complexity of Correlated Equilibria The Query Complexity of Correlated Equilibria arxiv:1305.4874v2 [cs.gt] 25 Dec 2017 Sergiu Hart Noam Nisan December 27, 2017 Abstract We consider the complexity of finding a correlated equilibrium of an

More information

Evolution & Learning in Games

Evolution & Learning in Games 1 / 28 Evolution & Learning in Games Econ 243B Jean-Paul Carvalho Lecture 5. Revision Protocols and Evolutionary Dynamics 2 / 28 Population Games played by Boundedly Rational Agents We have introduced

More information

A General Class of Adaptive Strategies 1

A General Class of Adaptive Strategies 1 Journal of Economic Theory 98, 2654 (2001) doi:10.1006jeth.2000.2746, available online at http:www.idealibrary.com on A General Class of Adaptive Strategies 1 Sergiu Hart 2 Center for Rationality and Interactive

More information

Better-Reply Dynamics with Bounded Recall

Better-Reply Dynamics with Bounded Recall ` Kyiv School of Economics & Kyiv Economics Institute DISCUSSION PAPER SERIES Better-Reply Dynamics with Bounded Recall Andriy Zapechelnyuk (Kyiv School of Economics and Kyiv Economics Institute) First

More information

Existence of sparsely supported correlated equilibria

Existence of sparsely supported correlated equilibria Existence of sparsely supported correlated equilibria Fabrizio Germano fabrizio.germano@upf.edu Gábor Lugosi lugosi@upf.es Departament d Economia i Empresa Universitat Pompeu Fabra Ramon Trias Fargas 25-27

More information

From Bandits to Experts: A Tale of Domination and Independence

From Bandits to Experts: A Tale of Domination and Independence From Bandits to Experts: A Tale of Domination and Independence Nicolò Cesa-Bianchi Università degli Studi di Milano N. Cesa-Bianchi (UNIMI) Domination and Independence 1 / 1 From Bandits to Experts: A

More information

Learning to play partially-specified equilibrium

Learning to play partially-specified equilibrium Learning to play partially-specified equilibrium Ehud Lehrer and Eilon Solan August 29, 2007 August 29, 2007 First draft: June 2007 Abstract: In a partially-specified correlated equilibrium (PSCE ) the

More information

The Online Approach to Machine Learning

The Online Approach to Machine Learning The Online Approach to Machine Learning Nicolò Cesa-Bianchi Università degli Studi di Milano N. Cesa-Bianchi (UNIMI) Online Approach to ML 1 / 53 Summary 1 My beautiful regret 2 A supposedly fun game I

More information

6.207/14.15: Networks Lecture 10: Introduction to Game Theory 2

6.207/14.15: Networks Lecture 10: Introduction to Game Theory 2 6.207/14.15: Networks Lecture 10: Introduction to Game Theory 2 Daron Acemoglu and Asu Ozdaglar MIT October 14, 2009 1 Introduction Outline Mixed Strategies Existence of Mixed Strategy Nash Equilibrium

More information

Games A game is a tuple = (I; (S i ;r i ) i2i) where ffl I is a set of players (i 2 I) ffl S i is a set of (pure) strategies (s i 2 S i ) Q ffl r i :

Games A game is a tuple = (I; (S i ;r i ) i2i) where ffl I is a set of players (i 2 I) ffl S i is a set of (pure) strategies (s i 2 S i ) Q ffl r i : On the Connection between No-Regret Learning, Fictitious Play, & Nash Equilibrium Amy Greenwald Brown University Gunes Ercal, David Gondek, Amir Jafari July, Games A game is a tuple = (I; (S i ;r i ) i2i)

More information

Tijmen Daniëls Universiteit van Amsterdam. Abstract

Tijmen Daniëls Universiteit van Amsterdam. Abstract Pure strategy dominance with quasiconcave utility functions Tijmen Daniëls Universiteit van Amsterdam Abstract By a result of Pearce (1984), in a finite strategic form game, the set of a player's serially

More information

Algorithmic Game Theory and Applications. Lecture 4: 2-player zero-sum games, and the Minimax Theorem

Algorithmic Game Theory and Applications. Lecture 4: 2-player zero-sum games, and the Minimax Theorem Algorithmic Game Theory and Applications Lecture 4: 2-player zero-sum games, and the Minimax Theorem Kousha Etessami 2-person zero-sum games A finite 2-person zero-sum (2p-zs) strategic game Γ, is a strategic

More information

The Complexity of Forecast Testing

The Complexity of Forecast Testing The Complexity of Forecast Testing LANCE FORTNOW and RAKESH V. VOHRA Northwestern University Consider a weather forecaster predicting the probability of rain for the next day. We consider tests that given

More information

Asymptotic calibration

Asymptotic calibration Biometrika (1998), 85, 2, pp. 379-390 Printed in Great Britain Asymptotic calibration BY DEAN P. FOSTER Department of Statistics, The Wharton School, University of Pennsylvania, Philadelphia, Pennsylvania

More information

Online Learning for Time Series Prediction

Online Learning for Time Series Prediction Online Learning for Time Series Prediction Joint work with Vitaly Kuznetsov (Google Research) MEHRYAR MOHRI MOHRI@ COURANT INSTITUTE & GOOGLE RESEARCH. Motivation Time series prediction: stock values.

More information

Game Theory and its Applications to Networks - Part I: Strict Competition

Game Theory and its Applications to Networks - Part I: Strict Competition Game Theory and its Applications to Networks - Part I: Strict Competition Corinne Touati Master ENS Lyon, Fall 200 What is Game Theory and what is it for? Definition (Roger Myerson, Game Theory, Analysis

More information

4. Opponent Forecasting in Repeated Games

4. Opponent Forecasting in Repeated Games 4. Opponent Forecasting in Repeated Games Julian and Mohamed / 2 Learning in Games Literature examines limiting behavior of interacting players. One approach is to have players compute forecasts for opponents

More information

Bandit View on Continuous Stochastic Optimization

Bandit View on Continuous Stochastic Optimization Bandit View on Continuous Stochastic Optimization Sébastien Bubeck 1 joint work with Rémi Munos 1 & Gilles Stoltz 2 & Csaba Szepesvari 3 1 INRIA Lille, SequeL team 2 CNRS/ENS/HEC 3 University of Alberta

More information

The Game of Normal Numbers

The Game of Normal Numbers The Game of Normal Numbers Ehud Lehrer September 4, 2003 Abstract We introduce a two-player game where at each period one player, say, Player 2, chooses a distribution and the other player, Player 1, a

More information

Theory and Applications of A Repeated Game Playing Algorithm. Rob Schapire Princeton University [currently visiting Yahoo!

Theory and Applications of A Repeated Game Playing Algorithm. Rob Schapire Princeton University [currently visiting Yahoo! Theory and Applications of A Repeated Game Playing Algorithm Rob Schapire Princeton University [currently visiting Yahoo! Research] Learning Is (Often) Just a Game some learning problems: learn from training

More information

Computational Game Theory Spring Semester, 2005/6. Lecturer: Yishay Mansour Scribe: Ilan Cohen, Natan Rubin, Ophir Bleiberg*

Computational Game Theory Spring Semester, 2005/6. Lecturer: Yishay Mansour Scribe: Ilan Cohen, Natan Rubin, Ophir Bleiberg* Computational Game Theory Spring Semester, 2005/6 Lecture 5: 2-Player Zero Sum Games Lecturer: Yishay Mansour Scribe: Ilan Cohen, Natan Rubin, Ophir Bleiberg* 1 5.1 2-Player Zero Sum Games In this lecture

More information

Learning, Games, and Networks

Learning, Games, and Networks Learning, Games, and Networks Abhishek Sinha Laboratory for Information and Decision Systems MIT ML Talk Series @CNRG December 12, 2016 1 / 44 Outline 1 Prediction With Experts Advice 2 Application to

More information

Near-Potential Games: Geometry and Dynamics

Near-Potential Games: Geometry and Dynamics Near-Potential Games: Geometry and Dynamics Ozan Candogan, Asuman Ozdaglar and Pablo A. Parrilo September 6, 2011 Abstract Potential games are a special class of games for which many adaptive user dynamics

More information

Unified Convergence Proofs of Continuous-time Fictitious Play

Unified Convergence Proofs of Continuous-time Fictitious Play Unified Convergence Proofs of Continuous-time Fictitious Play Jeff S. Shamma and Gurdal Arslan Department of Mechanical and Aerospace Engineering University of California Los Angeles {shamma,garslan}@seas.ucla.edu

More information

A second look at minsup points and the existence of pure Nash equilibria. Adib Bagh Departments of Economics and Mathematics University of Kentucky.

A second look at minsup points and the existence of pure Nash equilibria. Adib Bagh Departments of Economics and Mathematics University of Kentucky. A second look at minsup points and the existence of pure Nash equilibria Adib Bagh Departments of Economics and Mathematics University of Kentucky. Abstract. We prove the existence of pure strategy Nash

More information

Part IB Optimisation

Part IB Optimisation Part IB Optimisation Theorems Based on lectures by F. A. Fischer Notes taken by Dexter Chua Easter 2015 These notes are not endorsed by the lecturers, and I have modified them (often significantly) after

More information

1 Equilibrium Comparisons

1 Equilibrium Comparisons CS/SS 241a Assignment 3 Guru: Jason Marden Assigned: 1/31/08 Due: 2/14/08 2:30pm We encourage you to discuss these problems with others, but you need to write up the actual homework alone. At the top of

More information

Selecting Efficient Correlated Equilibria Through Distributed Learning. Jason R. Marden

Selecting Efficient Correlated Equilibria Through Distributed Learning. Jason R. Marden 1 Selecting Efficient Correlated Equilibria Through Distributed Learning Jason R. Marden Abstract A learning rule is completely uncoupled if each player s behavior is conditioned only on his own realized

More information

Regret Testing: A Simple Payo -Based Procedure for Learning Nash Equilibrium 1

Regret Testing: A Simple Payo -Based Procedure for Learning Nash Equilibrium 1 1 Regret Testing: A Simple Payo -Based Procedure for Learning Nash Equilibrium 1 DEAN P. FOSTER Department of Statistics, Wharton School, University of Pennsylvania H. PEYTON YOUNG Department of Economics,

More information

Worst-Case Bounds for Gaussian Process Models

Worst-Case Bounds for Gaussian Process Models Worst-Case Bounds for Gaussian Process Models Sham M. Kakade University of Pennsylvania Matthias W. Seeger UC Berkeley Abstract Dean P. Foster University of Pennsylvania We present a competitive analysis

More information

Notes for Week 3: Learning in non-zero sum games

Notes for Week 3: Learning in non-zero sum games CS 683 Learning, Games, and Electronic Markets Spring 2007 Notes for Week 3: Learning in non-zero sum games Instructor: Robert Kleinberg February 05 09, 2007 Notes by: Yogi Sharma 1 Announcements The material

More information

Optimization, Learning, and Games with Predictable Sequences

Optimization, Learning, and Games with Predictable Sequences Optimization, Learning, and Games with Predictable Sequences Alexander Rakhlin University of Pennsylvania Karthik Sridharan University of Pennsylvania Abstract We provide several applications of Optimistic

More information

Long-Run versus Short-Run Player

Long-Run versus Short-Run Player Repeated Games 1 Long-Run versus Short-Run Player a fixed simultaneous move stage game Player 1 is long-run with discount factor δ actions a A a finite set 1 1 1 1 2 utility u ( a, a ) Player 2 is short-run

More information

6.891 Games, Decision, and Computation February 5, Lecture 2

6.891 Games, Decision, and Computation February 5, Lecture 2 6.891 Games, Decision, and Computation February 5, 2015 Lecture 2 Lecturer: Constantinos Daskalakis Scribe: Constantinos Daskalakis We formally define games and the solution concepts overviewed in Lecture

More information

Algorithms, Games, and Networks January 17, Lecture 2

Algorithms, Games, and Networks January 17, Lecture 2 Algorithms, Games, and Networks January 17, 2013 Lecturer: Avrim Blum Lecture 2 Scribe: Aleksandr Kazachkov 1 Readings for today s lecture Today s topic is online learning, regret minimization, and minimax

More information

The Value of Information in Zero-Sum Games

The Value of Information in Zero-Sum Games The Value of Information in Zero-Sum Games Olivier Gossner THEMA, Université Paris Nanterre and CORE, Université Catholique de Louvain joint work with Jean-François Mertens July 2001 Abstract: It is not

More information

Time Series Prediction & Online Learning

Time Series Prediction & Online Learning Time Series Prediction & Online Learning Joint work with Vitaly Kuznetsov (Google Research) MEHRYAR MOHRI MOHRI@ COURANT INSTITUTE & GOOGLE RESEARCH. Motivation Time series prediction: stock values. earthquakes.

More information

Agnostic Online learnability

Agnostic Online learnability Technical Report TTIC-TR-2008-2 October 2008 Agnostic Online learnability Shai Shalev-Shwartz Toyota Technological Institute Chicago shai@tti-c.org ABSTRACT We study a fundamental question. What classes

More information

Mathematical Economics - PhD in Economics

Mathematical Economics - PhD in Economics - PhD in Part 1: Supermodularity and complementarity in the one-dimensional and Paulo Brito ISEG - Technical University of Lisbon November 24, 2010 1 2 - Supermodular optimization 3 one-dimensional 4 Supermodular

More information

MS&E 246: Lecture 12 Static games of incomplete information. Ramesh Johari

MS&E 246: Lecture 12 Static games of incomplete information. Ramesh Johari MS&E 246: Lecture 12 Static games of incomplete information Ramesh Johari Incomplete information Complete information means the entire structure of the game is common knowledge Incomplete information means

More information

6.207/14.15: Networks Lecture 11: Introduction to Game Theory 3

6.207/14.15: Networks Lecture 11: Introduction to Game Theory 3 6.207/14.15: Networks Lecture 11: Introduction to Game Theory 3 Daron Acemoglu and Asu Ozdaglar MIT October 19, 2009 1 Introduction Outline Existence of Nash Equilibrium in Infinite Games Extensive Form

More information

Near-Potential Games: Geometry and Dynamics

Near-Potential Games: Geometry and Dynamics Near-Potential Games: Geometry and Dynamics Ozan Candogan, Asuman Ozdaglar and Pablo A. Parrilo January 29, 2012 Abstract Potential games are a special class of games for which many adaptive user dynamics

More information

An Online Convex Optimization Approach to Blackwell s Approachability

An Online Convex Optimization Approach to Blackwell s Approachability Journal of Machine Learning Research 17 (2016) 1-23 Submitted 7/15; Revised 6/16; Published 8/16 An Online Convex Optimization Approach to Blackwell s Approachability Nahum Shimkin Faculty of Electrical

More information

The Distribution of Optimal Strategies in Symmetric Zero-sum Games

The Distribution of Optimal Strategies in Symmetric Zero-sum Games The Distribution of Optimal Strategies in Symmetric Zero-sum Games Florian Brandl Technische Universität München brandlfl@in.tum.de Given a skew-symmetric matrix, the corresponding two-player symmetric

More information

Game Theory: Lecture 2

Game Theory: Lecture 2 Game Theory: Lecture 2 Tai-Wei Hu June 29, 2011 Outline Two-person zero-sum games normal-form games Minimax theorem Simplex method 1 2-person 0-sum games 1.1 2-Person Normal Form Games A 2-person normal

More information

What kind of bonus point system makes the rugby teams more offensive?

What kind of bonus point system makes the rugby teams more offensive? What kind of bonus point system makes the rugby teams more offensive? XIV Congreso Dr. Antonio Monteiro - 2017 Motivation Rugby Union is a game in constant evolution Experimental law variations every year

More information

Achieving Pareto Optimality Through Distributed Learning

Achieving Pareto Optimality Through Distributed Learning 1 Achieving Pareto Optimality Through Distributed Learning Jason R. Marden, H. Peyton Young, and Lucy Y. Pao Abstract We propose a simple payoff-based learning rule that is completely decentralized, and

More information

Multiagent Value Iteration in Markov Games

Multiagent Value Iteration in Markov Games Multiagent Value Iteration in Markov Games Amy Greenwald Brown University with Michael Littman and Martin Zinkevich Stony Brook Game Theory Festival July 21, 2005 Agenda Theorem Value iteration converges

More information

Achieving Pareto Optimality Through Distributed Learning

Achieving Pareto Optimality Through Distributed Learning 1 Achieving Pareto Optimality Through Distributed Learning Jason R. Marden, H. Peyton Young, and Lucy Y. Pao Abstract We propose a simple payoff-based learning rule that is completely decentralized, and

More information

CS364A: Algorithmic Game Theory Lecture #13: Potential Games; A Hierarchy of Equilibria

CS364A: Algorithmic Game Theory Lecture #13: Potential Games; A Hierarchy of Equilibria CS364A: Algorithmic Game Theory Lecture #13: Potential Games; A Hierarchy of Equilibria Tim Roughgarden November 4, 2013 Last lecture we proved that every pure Nash equilibrium of an atomic selfish routing

More information

Survival of Dominated Strategies under Evolutionary Dynamics. Josef Hofbauer University of Vienna. William H. Sandholm University of Wisconsin

Survival of Dominated Strategies under Evolutionary Dynamics. Josef Hofbauer University of Vienna. William H. Sandholm University of Wisconsin Survival of Dominated Strategies under Evolutionary Dynamics Josef Hofbauer University of Vienna William H. Sandholm University of Wisconsin a hypnodisk 1 Survival of Dominated Strategies under Evolutionary

More information

Lecture 1. Stochastic Optimization: Introduction. January 8, 2018

Lecture 1. Stochastic Optimization: Introduction. January 8, 2018 Lecture 1 Stochastic Optimization: Introduction January 8, 2018 Optimization Concerned with mininmization/maximization of mathematical functions Often subject to constraints Euler (1707-1783): Nothing

More information

Extensive Form Games I

Extensive Form Games I Extensive Form Games I Definition of Extensive Form Game a finite game tree X with nodes x X nodes are partially ordered and have a single root (minimal element) terminal nodes are z Z (maximal elements)

More information

Online Learning Class 12, 20 March 2006 Andrea Caponnetto, Sanmay Das

Online Learning Class 12, 20 March 2006 Andrea Caponnetto, Sanmay Das Online Learning 9.520 Class 12, 20 March 2006 Andrea Caponnetto, Sanmay Das About this class Goal To introduce the general setting of online learning. To describe an online version of the RLS algorithm

More information

1. Introduction. 2. A Simple Model

1. Introduction. 2. A Simple Model . Introduction In the last years, evolutionary-game theory has provided robust solutions to the problem of selection among multiple Nash equilibria in symmetric coordination games (Samuelson, 997). More

More information

arxiv: v2 [cs.gt] 23 Jan 2019

arxiv: v2 [cs.gt] 23 Jan 2019 Analysis of Hannan Consistent Selection for Monte Carlo Tree Search in Simultaneous Move Games Vojtěch Kovařík, Viliam Lisý vojta.kovarik@gmail.com, viliam.lisy@agents.fel.cvut.cz arxiv:1804.09045v2 [cs.gt]

More information

Mean-field equilibrium: An approximation approach for large dynamic games

Mean-field equilibrium: An approximation approach for large dynamic games Mean-field equilibrium: An approximation approach for large dynamic games Ramesh Johari Stanford University Sachin Adlakha, Gabriel Y. Weintraub, Andrea Goldsmith Single agent dynamic control Two agents:

More information

Satisfaction Equilibrium: Achieving Cooperation in Incomplete Information Games

Satisfaction Equilibrium: Achieving Cooperation in Incomplete Information Games Satisfaction Equilibrium: Achieving Cooperation in Incomplete Information Games Stéphane Ross and Brahim Chaib-draa Department of Computer Science and Software Engineering Laval University, Québec (Qc),

More information

Economics 209B Behavioral / Experimental Game Theory (Spring 2008) Lecture 3: Equilibrium refinements and selection

Economics 209B Behavioral / Experimental Game Theory (Spring 2008) Lecture 3: Equilibrium refinements and selection Economics 209B Behavioral / Experimental Game Theory (Spring 2008) Lecture 3: Equilibrium refinements and selection Theory cannot provide clear guesses about with equilibrium will occur in games with multiple

More information

On Equilibria of Distributed Message-Passing Games

On Equilibria of Distributed Message-Passing Games On Equilibria of Distributed Message-Passing Games Concetta Pilotto and K. Mani Chandy California Institute of Technology, Computer Science Department 1200 E. California Blvd. MC 256-80 Pasadena, US {pilotto,mani}@cs.caltech.edu

More information

CS 573: Algorithmic Game Theory Lecture date: January 23rd, 2008

CS 573: Algorithmic Game Theory Lecture date: January 23rd, 2008 CS 573: Algorithmic Game Theory Lecture date: January 23rd, 2008 Instructor: Chandra Chekuri Scribe: Bolin Ding Contents 1 2-Player Zero-Sum Game 1 1.1 Min-Max Theorem for 2-Player Zero-Sum Game....................

More information

האוניברסיטה העברית בירושלים

האוניברסיטה העברית בירושלים האוניברסיטה העברית בירושלים THE HEBREW UNIVERSITY OF JERUSALEM TOWARDS A CHARACTERIZATION OF RATIONAL EXPECTATIONS by ITAI ARIELI Discussion Paper # 475 February 2008 מרכז לחקר הרציונליות CENTER FOR THE

More information

Defensive Forecasting: 4. Good probabilities mean less than we thought.

Defensive Forecasting: 4. Good probabilities mean less than we thought. Defensive Forecasting: Good probabilities mean less than we thought! Foundations of Probability Seminar Departments of Statistics and Philosophy November 2, 206 Glenn Shafer, Rutgers University 0. Game

More information

Internal Regret in On-line Portfolio Selection

Internal Regret in On-line Portfolio Selection Internal Regret in On-line Portfolio Selection Gilles Stoltz (gilles.stoltz@ens.fr) Département de Mathématiques et Applications, Ecole Normale Supérieure, 75005 Paris, France Gábor Lugosi (lugosi@upf.es)

More information

Online Learning and Online Convex Optimization

Online Learning and Online Convex Optimization Online Learning and Online Convex Optimization Nicolò Cesa-Bianchi Università degli Studi di Milano N. Cesa-Bianchi (UNIMI) Online Learning 1 / 49 Summary 1 My beautiful regret 2 A supposedly fun game

More information

Bargaining Efficiency and the Repeated Prisoners Dilemma. Bhaskar Chakravorti* and John Conley**

Bargaining Efficiency and the Repeated Prisoners Dilemma. Bhaskar Chakravorti* and John Conley** Bargaining Efficiency and the Repeated Prisoners Dilemma Bhaskar Chakravorti* and John Conley** Published as: Bhaskar Chakravorti and John P. Conley (2004) Bargaining Efficiency and the repeated Prisoners

More information

Patience and Ultimatum in Bargaining

Patience and Ultimatum in Bargaining Patience and Ultimatum in Bargaining Björn Segendorff Department of Economics Stockholm School of Economics PO Box 6501 SE-113 83STOCKHOLM SWEDEN SSE/EFI Working Paper Series in Economics and Finance No

More information

Brown s Original Fictitious Play

Brown s Original Fictitious Play manuscript No. Brown s Original Fictitious Play Ulrich Berger Vienna University of Economics, Department VW5 Augasse 2-6, A-1090 Vienna, Austria e-mail: ulrich.berger@wu-wien.ac.at March 2005 Abstract

More information

Population Games and Evolutionary Dynamics

Population Games and Evolutionary Dynamics Population Games and Evolutionary Dynamics (MIT Press, 200x; draft posted on my website) 1. Population games 2. Revision protocols and evolutionary dynamics 3. Potential games and their applications 4.

More information

Game Theory: introduction and applications to computer networks

Game Theory: introduction and applications to computer networks Game Theory: introduction and applications to computer networks Introduction Giovanni Neglia INRIA EPI Maestro 27 January 2014 Part of the slides are based on a previous course with D. Figueiredo (UFRJ)

More information

EC319 Economic Theory and Its Applications, Part II: Lecture 2

EC319 Economic Theory and Its Applications, Part II: Lecture 2 EC319 Economic Theory and Its Applications, Part II: Lecture 2 Leonardo Felli NAB.2.14 23 January 2014 Static Bayesian Game Consider the following game of incomplete information: Γ = {N, Ω, A i, T i, µ

More information

On the Impossibility of Predicting the Behavior of Rational Agents. Dean P. Foster* and H. Peyton Young ** February, 1999 This version: October, 2000

On the Impossibility of Predicting the Behavior of Rational Agents. Dean P. Foster* and H. Peyton Young ** February, 1999 This version: October, 2000 On the Impossibility of Predicting the Behavior of Rational Agents Dean P. Foster* and H. Peyton Young ** February, 1999 This version: October, 2000 We exhibit a class of games that almost surely cannot

More information

Lecture December 2009 Fall 2009 Scribe: R. Ring In this lecture we will talk about

Lecture December 2009 Fall 2009 Scribe: R. Ring In this lecture we will talk about 0368.4170: Cryptography and Game Theory Ran Canetti and Alon Rosen Lecture 7 02 December 2009 Fall 2009 Scribe: R. Ring In this lecture we will talk about Two-Player zero-sum games (min-max theorem) Mixed

More information

Mixed Nash Equilibria

Mixed Nash Equilibria lgorithmic Game Theory, Summer 2017 Mixed Nash Equilibria Lecture 2 (5 pages) Instructor: Thomas Kesselheim In this lecture, we introduce the general framework of games. Congestion games, as introduced

More information

Principal-Agent Games - Equilibria under Asymmetric Information -

Principal-Agent Games - Equilibria under Asymmetric Information - Principal-Agent Games - Equilibria under Asymmetric Information - Ulrich Horst 1 Humboldt-Universität zu Berlin Department of Mathematics and School of Business and Economics Work in progress - Comments

More information

Algorithmic Strategy Complexity

Algorithmic Strategy Complexity Algorithmic Strategy Complexity Abraham Neyman aneyman@math.huji.ac.il Hebrew University of Jerusalem Jerusalem Israel Algorithmic Strategy Complexity, Northwesten 2003 p.1/52 General Introduction The

More information

Regret in the On-Line Decision Problem

Regret in the On-Line Decision Problem Games and Economic Behavior 29, 7 35 (1999) Article ID game.1999.0740, available online at http://www.idealibrary.com on Regret in the On-Line Decision Problem Dean P. Foster Department of Statistics,

More information

Stable Games and their Dynamics

Stable Games and their Dynamics Stable Games and their Dynamics Josef Hofbauer and William H. Sandholm January 10, 2009 Abstract We study a class of population games called stable games. These games are characterized by self-defeating

More information

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program June 2016

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program June 2016 Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program June 2016 The time limit for this exam is four hours. The exam has four sections. Each section includes two questions.

More information

Improving Convergence Rates in Multiagent Learning Through Experts and Adaptive Consultation

Improving Convergence Rates in Multiagent Learning Through Experts and Adaptive Consultation Improving Convergence Rates in Multiagent Learning Through Experts and Adaptive Consultation Greg Hines Cheriton School of Computer Science University of Waterloo Waterloo, Ontario, N2L 3G1 Kate Larson

More information

Convergence and No-Regret in Multiagent Learning

Convergence and No-Regret in Multiagent Learning Convergence and No-Regret in Multiagent Learning Michael Bowling Department of Computing Science University of Alberta Edmonton, Alberta Canada T6G 2E8 bowling@cs.ualberta.ca Abstract Learning in a multiagent

More information

A Note on the Existence of Ratifiable Acts

A Note on the Existence of Ratifiable Acts A Note on the Existence of Ratifiable Acts Joseph Y. Halpern Cornell University Computer Science Department Ithaca, NY 14853 halpern@cs.cornell.edu http://www.cs.cornell.edu/home/halpern August 15, 2018

More information

Games and Their Equilibria

Games and Their Equilibria Chapter 1 Games and Their Equilibria The central notion of game theory that captures many aspects of strategic decision making is that of a strategic game Definition 11 (Strategic Game) An n-player strategic

More information