Smooth Calibration, Leaky Forecasts, Finite Recall, and Nash Dynamics
|
|
- Drusilla Warren
- 5 years ago
- Views:
Transcription
1 Smooth Calibration, Leaky Forecasts, Finite Recall, and Nash Dynamics Sergiu Hart August 2016
2 Smooth Calibration, Leaky Forecasts, Finite Recall, and Nash Dynamics Sergiu Hart Center for the Study of Rationality Dept of Mathematics Dept of Economics The Hebrew University of Jerusalem
3 Joint work with Dean P. Foster University of Pennsylvania & Amazon Research
4 Papers
5 Papers Dean P. Foster and Sergiu Hart Smooth Calibration, Leaky Forecasts, Finite Recall, and Nash Dynamics 2012 revised:
6 Papers Dean P. Foster and Sergiu Hart Smooth Calibration, Leaky Forecasts, Finite Recall, and Nash Dynamics 2012 revised: Dean P. Foster and Sergiu Hart An Integral Approach to Calibration 2016 (in preparation)
7 Calibration
8 Calibration Forecaster says: "The chance of rain tomorrow is p"
9 Calibration Forecaster says: "The chance of rain tomorrow is p" Forecaster is CALIBRATED if for every p: the proportion of rainy days among those days when the forecast was p equals p (or is close to p in the long run)
10 Calibration Forecaster says: "The chance of rain tomorrow is p" Forecaster is CALIBRATED if for every p: the proportion of rainy days among those days when the forecast was p equals p (or is close to p in the long run) Dawid 1982
11 Calibration
12 Calibration CALIBRATION can be guaranteed (no matter what the weather is)
13 Calibration CALIBRATION can be guaranteed (no matter what the weather is) Foster and Vohra 1994 [publ 1998]
14 Calibration CALIBRATION can be guaranteed (no matter what the weather is) Forecaster uses mixed forecasting (e.g.: with probability 1/2, forecast = 25% with probability 1/2, forecast = 60%) Foster and Vohra 1994 [publ 1998]
15 Calibration CALIBRATION can be guaranteed (no matter what the weather is) Forecaster uses mixed forecasting (e.g.: with probability 1/2, forecast = 25% with probability 1/2, forecast = 60%) Foster and Vohra 1994 [publ 1998] Hart 1995: proof using Minimax Theorem
16 The MINIMAX Theorem
17 The MINIMAX Theorem THEOREM (von Neumann 1928) IF X R n, Y R m are compact convex sets, and f : X Y R is a continuous function that is convex-concave, i.e., f(, y) : X R is convex for fixed y, and f(x, ) : Y R is concave for fixed x, min max x X y Y THEN f(x, y) = max min y Y x X f(x, y).
18 The MINIMAX Theorem
19 The MINIMAX Theorem THEOREM (von Neumann 1928)
20 The MINIMAX Theorem THEOREM (von Neumann 1928) IF For every strategy of the opponent I have a strategy such that my payoff is at least v
21 The MINIMAX Theorem THEOREM (von Neumann 1928) IF For every strategy of the opponent I have a strategy such that my payoff is at least v THEN I have a strategy that guarantees that my payoff is at least v (for every strategy of the opponent)
22 The MINIMAX Theorem THEOREM (von Neumann 1928) IF For every strategy of the opponent I have a strategy such that my payoff is at least v THEN I have a strategy that guarantees that my payoff is at least v (for every strategy of the opponent) finite game;
23 The MINIMAX Theorem THEOREM (von Neumann 1928) IF For every strategy of the opponent I have a strategy such that my payoff is at least v THEN I have a strategy that guarantees that my payoff is at least v (for every strategy of the opponent) finite game; probabilistic (mixed) strategies
24 Calibration Proof: Minimax
25 Calibration Proof: Minimax FINITE δ-grid, FINITE HORIZON FINITE 2-person 0-sum game
26 Calibration Proof: Minimax FINITE δ-grid, FINITE HORIZON FINITE 2-person 0-sum game IF the strategy of the rainmaker IS KNOWN THEN the forecaster can get δ-calibrated forecasts
27 Calibration Proof: Minimax FINITE δ-grid, FINITE HORIZON FINITE 2-person 0-sum game IF the strategy of the rainmaker IS KNOWN THEN the forecaster can get δ-calibrated forecasts MINIMAX THEOREM the forecaster can GUARANTEE δ-calibrated forecasts (without knowing the rainmaker s strategy)
28 Calibration Proof: Minimax FINITE δ-grid, FINITE HORIZON FINITE 2-person 0-sum game IF the strategy of the rainmaker IS KNOWN THEN the forecaster can get δ-calibrated forecasts MINIMAX THEOREM the forecaster can GUARANTEE δ-calibrated forecasts (without knowing the rainmaker s strategy) Hart 1995
29 Calibration
30 Calibration CALIBRATION can be guaranteed (no matter what the weather is) Forecaster uses mixed forecasting (e.g.: with probability 1/2, forecast = 25% with probability 1/2, forecast = 60%) Foster and Vohra 1994 [publ 1998] Hart 1995: proof using Minimax Theorem
31 Calibration CALIBRATION can be guaranteed (no matter what the weather is) Forecaster uses mixed forecasting (e.g.: with probability 1/2, forecast = 25% with probability 1/2, forecast = 60%) Foster and Vohra 1994 [publ 1998] Hart 1995: proof using Minimax Theorem Hart and Mas-Colell 1996 [publ 2000]: proof using Blackwell s Approachability
32 Calibration CALIBRATION can be guaranteed (no matter what the weather is) Forecaster uses mixed forecasting (e.g.: with probability 1/2, forecast = 25% with probability 1/2, forecast = 60%) Foster and Vohra 1994 [publ 1998] Hart 1995: proof using Minimax Theorem Hart and Mas-Colell 1996 [publ 2000]: proof using Blackwell s Approachability Foster 1999: a simple way to calibrate
33 Calibration CALIBRATION can be guaranteed (no matter what the weather is) Forecaster uses mixed forecasting (e.g.: with probability 1/2, forecast = 25% with probability 1/2, forecast = 60%) BACK-casting (not fore-casting!) ("Politicians Lemma") Foster and Vohra 1994 [publ 1998] Hart 1995: proof using Minimax Theorem Hart and Mas-Colell 1996 [publ 2000]: proof using Blackwell s Approachability Foster 1999: a simple way to calibrate
34 No Calibration
35 No Calibration CALIBRATION cannot be guaranteed when:
36 No Calibration CALIBRATION cannot be guaranteed when: Forecast is known before the rain/no-rain decision is made ("LEAKY FORECASTS")
37 No Calibration CALIBRATION cannot be guaranteed when: Forecast is known before the rain/no-rain decision is made ("LEAKY FORECASTS") Forecaster uses a deterministic forecasting procedure
38 No Calibration CALIBRATION cannot be guaranteed when: Forecast is known before the rain/no-rain decision is made ("LEAKY FORECASTS") Forecaster uses a deterministic forecasting procedure Oakes 1985
39 Smooth Calibration
40 Smooth Calibration SMOOTH CALIBRATION: combine together the days when the forecast was close to p
41 Smooth Calibration SMOOTH CALIBRATION: combine together the days when the forecast was close to p (smooth out the calibration score)
42 Smooth Calibration SMOOTH CALIBRATION: combine together the days when the forecast was close to p (smooth out the calibration score) Main Result: There exists a deterministic procedure that is SMOOTHLY CALIBRATED.
43 Smooth Calibration SMOOTH CALIBRATION: combine together the days when the forecast was close to p (smooth out the calibration score) Main Result: There exists a deterministic procedure that is SMOOTHLY CALIBRATED. Deterministic result holds also when the forecasts are leaked
44 Calibration
45 Calibration Set of ACTIONS: A R m (finite set) Set of FORECASTS: C = (A) Example: A = {0, 1}, C = [0, 1]
46 Calibration Set of ACTIONS: A R m (finite set) Set of FORECASTS: C = (A) Example: A = {0, 1}, C = [0, 1] CALIBRATION SCORE at time T for a sequence (a t, c t ) t=1,2,... in A C:
47 Calibration Set of ACTIONS: A R m (finite set) Set of FORECASTS: C = (A) Example: A = {0, 1}, C = [0, 1] CALIBRATION SCORE at time T for a sequence (a t, c t ) t=1,2,... in A C: K T = 1 T ā t c t T where ā t = t=1 T s=1 1 c s =c t a s T s=1 1 c s =c t
48 Smooth Calibration
49 Smooth Calibration A "smoothing function" is a Lipschitz function Λ : C C [0, 1] with Λ(c, c) = 1 for every c.
50 Smooth Calibration A "smoothing function" is a Lipschitz function Λ : C C [0, 1] with Λ(c, c) = 1 for every c. Λ(x, c) = "weight" of x relative to c
51 Smooth Calibration A "smoothing function" is a Lipschitz function Λ : C C [0, 1] with Λ(c, c) = 1 for every c. Λ(x, c) = "weight" of x relative to c Example: Λ(x, c) = [δ x c ] + /δ
52 Indicator Function
53 Indicator Function 1 x=c 1 c x
54 Indicator and Λ Functions Λ(x, c) 1 x=c 1 c δ c c + δ x
55 Smooth Calibration A "smoothing function" is a Lipschitz function Λ : C C [0, 1] with Λ(c, c) = 1 for every c. Λ(x, c) = "weight" of x relative to c
56 Smooth Calibration A "smoothing function" is a Lipschitz function Λ : C C [0, 1] with Λ(c, c) = 1 for every c. Λ(x, c) = "weight" of x relative to c Λ-CALIBRATION SCORE at time T :
57 Smooth Calibration A "smoothing function" is a Lipschitz function Λ : C C [0, 1] with Λ(c, c) = 1 for every c. Λ(x, c) = "weight" of x relative to c Λ-CALIBRATION SCORE at time T : K Λ T = 1 T T ā Λ t cλ t t=1
58 Smooth Calibration A "smoothing function" is a Lipschitz function Λ : C C [0, 1] with Λ(c, c) = 1 for every c. Λ(x, c) = "weight" of x relative to c Λ-CALIBRATION SCORE at time T : K Λ T = 1 T T t=1 ā Λ t cλ t ā Λ t = T s=1 Λ(c s, c t ) a s T s=1 Λ(c s, c t ), c Λ t = T s=1 Λ(c s, c t ) c s T s=1 Λ(c s, c t )
59 The Calibration Game
60 The Calibration Game In each period t = 1, 2,... :
61 The Calibration Game In each period t = 1, 2,... : Player C ("forecaster") chooses c t C Player A ("action") chooses a t A
62 The Calibration Game In each period t = 1, 2,... : Player C ("forecaster") chooses c t C Player A ("action") chooses a t A a t and c t chosen simultaneously: REGULAR setup
63 The Calibration Game In each period t = 1, 2,... : Player C ("forecaster") chooses c t C Player A ("action") chooses a t A a t and c t chosen simultaneously: REGULAR setup a t chosen after c t is disclosed: LEAKY setup
64 The Calibration Game In each period t = 1, 2,... : Player C ("forecaster") chooses c t C Player A ("action") chooses a t A a t and c t chosen simultaneously: REGULAR setup a t chosen after c t is disclosed: LEAKY setup Full monitoring, perfect recall
65 Smooth Calibration
66 Smooth Calibration A strategy of Player C is
67 Smooth Calibration A strategy of Player C is (ε, L)-SMOOTHLY CALIBRATED
68 Smooth Calibration A strategy of Player C is (ε, L)-SMOOTHLY CALIBRATED if there is T 0 such that K Λ T ε holds for: every T T 0, every strategy of Player A, and every smoothing function Λ with Lipschitz constant L
69 Smooth Calibration: Result
70 Smooth Calibration: Result For every ε > 0 and L < there exists a procedure that is (ε, L)-SMOOTHLY CALIBRATED.
71 Smooth Calibration: Result For every ε > 0 and L < there exists a procedure that is (ε, L)-SMOOTHLY CALIBRATED. Moreover: it is deterministic,
72 Smooth Calibration: Result For every ε > 0 and L < there exists a procedure that is (ε, L)-SMOOTHLY CALIBRATED. Moreover: it is deterministic, it has finite recall (= finite window, stationary),
73 Smooth Calibration: Result For every ε > 0 and L < there exists a procedure that is (ε, L)-SMOOTHLY CALIBRATED. Moreover: it is deterministic, it has finite recall (= finite window, stationary), it uses a fixed finite grid, and
74 Smooth Calibration: Result For every ε > 0 and L < there exists a procedure that is (ε, L)-SMOOTHLY CALIBRATED. Moreover: it is deterministic, it has finite recall (= finite window, stationary), it uses a fixed finite grid, and forecasts may be leaked
75 Smooth Calibration: Implications
76 Smooth Calibration: Implications For forecasting:
77 Smooth Calibration: Implications For forecasting: nothing good... (easier to pass the test)
78 Smooth Calibration: Implications For forecasting: nothing good... (easier to pass the test) For game dynamics:
79 Smooth Calibration: Implications For forecasting: nothing good... (easier to pass the test) For game dynamics: Nash dynamics
80 Calibrated Learning
81 Calibrated Learning CALIBRATED LEARNING:
82 Calibrated Learning CALIBRATED LEARNING: Foster and Vohra 1997
83 Calibrated Learning CALIBRATED LEARNING: every player uses a calibrated forecast on the play of the other players Foster and Vohra 1997
84 Calibrated Learning CALIBRATED LEARNING: every player uses a calibrated forecast on the play of the other players every player best replies to his forecast Foster and Vohra 1997
85 Calibrated Learning CALIBRATED LEARNING: every player uses a calibrated forecast on the play of the other players every player best replies to his forecast time average of play (= empirical distribution of play) is an approximate CORRELATED EQUILIBRIUM Foster and Vohra 1997
86 Smooth Calibrated Learning
87 Smooth Calibrated Learning SMOOTH CALIBRATED LEARNING:
88 Smooth Calibrated Learning SMOOTH CALIBRATED LEARNING: (F) A smoothly calibrated deterministic procedure, which gives in each period t a "forecasted" joint distribution of play c t in (A) (Π i N A i )
89 Smooth Calibrated Learning SMOOTH CALIBRATED LEARNING: (F) A smoothly calibrated deterministic procedure, which gives in each period t a "forecasted" joint distribution of play c t in (A) (Π i N A i ) (P) A Lipschitz approximate best-reply mapping g i : (A) (A i ) for each player i
90 Smooth Calibrated Learning SMOOTH CALIBRATED LEARNING: (F) A smoothly calibrated deterministic procedure, which gives in each period t a "forecasted" joint distribution of play c t in (A) (Π i N A i ) (P) A Lipschitz approximate best-reply mapping g i : (A) (A i ) for each player i In each period t, each player i: 1. runs the procedure (F) to get c t 2. plays g i (c t ) given by (P)
91 Smooth Calibrated Learning
92 Smooth Calibrated Learning SMOOTH CALIBRATED LEARNING (with appropriate parameters):
93 Smooth Calibrated Learning SMOOTH CALIBRATED LEARNING (with appropriate parameters): is a stochastic uncoupled dynamic
94 Smooth Calibrated Learning SMOOTH CALIBRATED LEARNING (with appropriate parameters): is a stochastic uncoupled dynamic has finite MEMORY and is stationary
95 Smooth Calibrated Learning SMOOTH CALIBRATED LEARNING (with appropriate parameters): is a stochastic uncoupled dynamic has finite MEMORY and is stationary Nash ε-equilibria are played at least 1 ε of the time in the long run (a.s.)
96 Smooth Calibrated Learning: Proof
97 Smooth Calibrated Learning: Proof play t = g(c t )
98 Smooth Calibrated Learning: Proof smooth calibration play t = g(c t ) c t
99 Smooth Calibrated Learning: Proof smooth calibration play t = g(c t ) c t use: g is Lipschitz
100 Smooth Calibrated Learning: Proof smooth calibration play t = g(c t ) c t use: g is Lipschitz g approximate best reply play t is an approximate Nash equilibrium
101 Smooth Calibrated Learning: Proof smooth calibration play t = g(c t ) c t use: g is Lipschitz g approximate best reply play t is an approximate Nash equilibrium g(play t ) = g(g(c t )) g(c t ) = play t
102 Why Smooth?
103 Why Smooth? SMOOTH CALIBRATION
104 Why Smooth? SMOOTH CALIBRATION deterministic
105 Why Smooth? SMOOTH CALIBRATION deterministic same forecast for all players
106 Why Smooth? SMOOTH CALIBRATION deterministic same forecast for all players leaky
107 Why Smooth? SMOOTH CALIBRATION deterministic same forecast for all players leaky actions depend on forecast
108 Why Smooth? SMOOTH CALIBRATION deterministic same forecast for all players leaky actions depend on forecast calibrated
109 Why Smooth? SMOOTH CALIBRATION deterministic same forecast for all players leaky actions depend on forecast calibrated forecast equals actions
110 Why Smooth? SMOOTH CALIBRATION deterministic same forecast for all players leaky actions depend on forecast calibrated forecast equals actions FIXED POINT
111 Why Smooth? SMOOTH CALIBRATION deterministic same forecast for all players leaky actions depend on forecast calibrated forecast equals actions FIXED POINT SMOOTH BEST REPLY
112 Why Smooth? SMOOTH CALIBRATION deterministic same forecast for all players leaky actions depend on forecast calibrated forecast equals actions FIXED POINT SMOOTH BEST REPLY fixed point = NASH EQUILIBRIUM
113 Dynamics and Equilibrium
114 Dynamics and Equilibrium "LAW OF CONSERVATION OF COORDINATION":
115 Dynamics and Equilibrium "LAW OF CONSERVATION OF COORDINATION": There must be some COORDINATION
116 Dynamics and Equilibrium "LAW OF CONSERVATION OF COORDINATION": There must be some COORDINATION either in the EQUILIBRIUM notion,
117 Dynamics and Equilibrium "LAW OF CONSERVATION OF COORDINATION": There must be some COORDINATION either in the EQUILIBRIUM notion, or in the DYNAMIC
118 Dynamics and Equilibrium "LAW OF CONSERVATION OF COORDINATION": There must be some COORDINATION either in the EQUILIBRIUM notion, (CORRELATED EQUILIBRIUM) or in the DYNAMIC
119 Dynamics and Equilibrium "LAW OF CONSERVATION OF COORDINATION": There must be some COORDINATION either in the EQUILIBRIUM notion, (CORRELATED EQUILIBRIUM) or in the DYNAMIC (NASH EQUILIBRIUM)
120 Dynamics and Equilibrium "LAW OF CONSERVATION OF COORDINATION": There must be some COORDINATION either in the EQUILIBRIUM notion, (CORRELATED EQUILIBRIUM) or in the DYNAMIC (NASH EQUILIBRIUM) (Hart and Mas-Colell 2003)
121 Integral Approach to Calibration
122 Integral Approach to Calibration INTEGRAL CALIBRATION score:
123 Integral Approach to Calibration INTEGRAL CALIBRATION score: G Λ t (z) = 1 t t Λ(c s, z)(a s c s ) s=1
124 Integral Approach to Calibration INTEGRAL CALIBRATION score: G Λ t (z) = 1 t ( G Λ t 2 = t s=1 C Λ(c s, z)(a s c s ) G Λ t (z) 2 dζ(z) ) 1/2
125 Integral Approach to Calibration INTEGRAL CALIBRATION score: G Λ t (z) = 1 t ( G Λ t 2 = t s=1 C Λ(c s, z)(a s c s ) G Λ t (z) 2 dζ(z) ) 1/2 INTEGRAL CALIBRATION: Guarantee that G Λ t 2 ε for all t large enough, uniformly
126 Integral Calibration
127 Integral Calibration Choose the forecast c t such that Λ(c t, z) G Λ t 1 (z) dζ(z) (a c t) 0 for every a A
128 Integral Calibration Choose the forecast c t such that Λ(c t, z) G Λ t 1 (z) dζ(z) (a c t) 0 for every a A Existence of such c t is guaranteed by a Fixed Point Theorem
129 Integral Calibration Choose the forecast c t such that Λ(c t, z) G Λ t 1 (z) dζ(z) (a c t) 0 for every a A Existence of such c t is guaranteed by a Fixed Point Theorem Deterministic Integral Calibration
130 Integral Calibration Choose the forecast c t such that Λ(c t, z) G Λ t 1 (z) dζ(z) (a c t) 0 for every a A Existence of such c t is guaranteed by a Fixed Point Theorem Deterministic Integral Calibration Deterministic Smooth Calibration
131 Integral Calibration Choose the forecast c t such that Λ(c t, z) G Λ t 1 (z) dζ(z) (a c t) 0 for every a A Existence of such c t is guaranteed by a Fixed Point Theorem Deterministic Integral Calibration Deterministic Smooth Calibration Deterministic Weak Calibration
132 Integral Calibration Choose the forecast c t such that Λ(c t, z) G Λ t 1 (z) dζ(z) (a c t) 0 for every a A Existence of such c t is guaranteed by a Fixed Point Theorem Deterministic Integral Calibration Deterministic Smooth Calibration Deterministic Weak Calibration Almost Deterministic Calibration
133 Deterministic Nonlinear Calibration Choose the forecast c t such that Λ(c t, z) G Λ t 1 (z) dζ(z) (a c t) 0 for every a A Existence of such c t is guaranteed by a Fixed Point Theorem Deterministic Integral Calibration Deterministic Smooth Calibration Deterministic Weak Calibration Almost Deterministic Calibration
134 Integral Calibration
135 Integral Calibration Choose the distribution of the forecast c t s.t. [ ] E Λ(c t, z) G Λ t 1 (z) dζ(z) (a c t) 0 for every a A
136 Integral Calibration Choose the distribution of the forecast c t s.t. [ ] E Λ(c t, z) G Λ t 1 (z) dζ(z) (a c t) 0 for every a A Existence of such distribution is guaranteed by a Separation / Minimax Theorem
137 Integral Calibration Choose the distribution of the forecast c t s.t. [ ] E Λ(c t, z) G Λ t 1 (z) dζ(z) (a c t) 0 for every a A Existence of such distribution is guaranteed by a Separation / Minimax Theorem Probabilistic Calibration
138 Stochastic Linear Calibration Choose the distribution of the forecast c t s.t. [ ] E Λ(c t, z) G Λ t 1 (z) dζ(z) (a c t) 0 for every a A Existence of such distribution is guaranteed by a Separation / Minimax Theorem Probabilistic Calibration
139 Integral Approach to Calibration
140 Integral Approach to Calibration fixed point deterministic calibration
141 Integral Approach to Calibration fixed point deterministic calibration separation / minimax stochastic calibration
142 Previous Work
143 Previous Work Calibration and Correlated Equilibria:
144 Previous Work Calibration and Correlated Equilibria: Foster and Vohra 1998 Foster and Vohra 1997 Hart and Mas-Colell 2000,...
145 Previous Work Calibration and Correlated Equilibria: Foster and Vohra 1998 Foster and Vohra 1997 Hart and Mas-Colell 2000,... Weak Calibration (deterministic):
146 Previous Work Calibration and Correlated Equilibria: Foster and Vohra 1998 Foster and Vohra 1997 Hart and Mas-Colell 2000,... Weak Calibration (deterministic): Kakade and Foster 2004 / 2008 Foster and Kakade 2006
147 Previous Work
148 Previous Work Nash Dynamics:
149 Previous Work Nash Dynamics: Foster and Young 2003 Kakade and Foster 2004 / 2008 Foster and Young 2006 Hart and Mas-Colell 2006 Germano and Lugosi 2007 Young 2009 Babichenko 2012
150 Previous Work
151 Previous Work Online Regression Problem:
152 Previous Work Online Regression Problem: Foster 1991 J. Foster 1999 Vovk 2001 Azoury and Warmuth 2001 Cesa-Bianchi and Lugosi 2006
153 Successful Economic Forecasting
154 Successful Economic Forecasting correctly forecasting
155 Successful Economic Forecasting... correctly forecasting 8 of the last 5 recessions!
Calibration and Nash Equilibrium
Calibration and Nash Equilibrium Dean Foster University of Pennsylvania and Sham Kakade TTI September 22, 2005 What is a Nash equilibrium? Cartoon definition of NE: Leroy Lockhorn: I m drinking because
More informationCorrelated Equilibria: Rationality and Dynamics
Correlated Equilibria: Rationality and Dynamics Sergiu Hart June 2010 AUMANN 80 SERGIU HART c 2010 p. 1 CORRELATED EQUILIBRIA: RATIONALITY AND DYNAMICS Sergiu Hart Center for the Study of Rationality Dept
More informationWalras-Bowley Lecture 2003
Walras-Bowley Lecture 2003 Sergiu Hart This version: September 2004 SERGIU HART c 2004 p. 1 ADAPTIVE HEURISTICS A Little Rationality Goes a Long Way Sergiu Hart Center for Rationality, Dept. of Economics,
More informationThe No-Regret Framework for Online Learning
The No-Regret Framework for Online Learning A Tutorial Introduction Nahum Shimkin Technion Israel Institute of Technology Haifa, Israel Stochastic Processes in Engineering IIT Mumbai, March 2013 N. Shimkin,
More informationDeterministic Calibration and Nash Equilibrium
University of Pennsylvania ScholarlyCommons Statistics Papers Wharton Faculty Research 2-2008 Deterministic Calibration and Nash Equilibrium Sham M. Kakade Dean P. Foster University of Pennsylvania Follow
More informationPrediction and Playing Games
Prediction and Playing Games Vineel Pratap vineel@eng.ucsd.edu February 20, 204 Chapter 7 : Prediction, Learning and Games - Cesa Binachi & Lugosi K-Person Normal Form Games Each player k (k =,..., K)
More informationLecture 14: Approachability and regret minimization Ramesh Johari May 23, 2007
MS&E 336 Lecture 4: Approachability and regret minimization Ramesh Johari May 23, 2007 In this lecture we use Blackwell s approachability theorem to formulate both external and internal regret minimizing
More informationAdvanced Machine Learning
Advanced Machine Learning Learning and Games MEHRYAR MOHRI MOHRI@ COURANT INSTITUTE & GOOGLE RESEARCH. Outline Normal form games Nash equilibrium von Neumann s minimax theorem Correlated equilibrium Internal
More informationLearning correlated equilibria in games with compact sets of strategies
Learning correlated equilibria in games with compact sets of strategies Gilles Stoltz a,, Gábor Lugosi b a cnrs and Département de Mathématiques et Applications, Ecole Normale Supérieure, 45 rue d Ulm,
More informationSelf-stabilizing uncoupled dynamics
Self-stabilizing uncoupled dynamics Aaron D. Jaggard 1, Neil Lutz 2, Michael Schapira 3, and Rebecca N. Wright 4 1 U.S. Naval Research Laboratory, Washington, DC 20375, USA. aaron.jaggard@nrl.navy.mil
More informationGame-Theoretic Learning:
Game-Theoretic Learning: Regret Minimization vs. Utility Maximization Amy Greenwald with David Gondek, Amir Jafari, and Casey Marks Brown University University of Pennsylvania November 17, 2004 Background
More informationarxiv: v1 [cs.lg] 8 Nov 2010
Blackwell Approachability and Low-Regret Learning are Equivalent arxiv:0.936v [cs.lg] 8 Nov 200 Jacob Abernethy Computer Science Division University of California, Berkeley jake@cs.berkeley.edu Peter L.
More informationLEARNING IN CONCAVE GAMES
LEARNING IN CONCAVE GAMES P. Mertikopoulos French National Center for Scientific Research (CNRS) Laboratoire d Informatique de Grenoble GSBE ETBC seminar Maastricht, October 22, 2015 Motivation and Preliminaries
More informationBlackwell s Approachability Theorem: A Generalization in a Special Case. Amy Greenwald, Amir Jafari and Casey Marks
Blackwell s Approachability Theorem: A Generalization in a Special Case Amy Greenwald, Amir Jafari and Casey Marks Department of Computer Science Brown University Providence, Rhode Island 02912 CS-06-01
More informationGame Theory. Greg Plaxton Theory in Programming Practice, Spring 2004 Department of Computer Science University of Texas at Austin
Game Theory Greg Plaxton Theory in Programming Practice, Spring 2004 Department of Computer Science University of Texas at Austin Bimatrix Games We are given two real m n matrices A = (a ij ), B = (b ij
More informationChapter 9. Mixed Extensions. 9.1 Mixed strategies
Chapter 9 Mixed Extensions We now study a special case of infinite strategic games that are obtained in a canonic way from the finite games, by allowing mixed strategies. Below [0, 1] stands for the real
More informationExistence of sparsely supported correlated equilibria
Existence of sparsely supported correlated equilibria Fabrizio Germano fabrizio.germano@upf.edu Gábor Lugosi lugosi@upf.es Departament d Economia i Empresa Universitat Pompeu Fabra Ramon Trias Fargas 25-27
More informationThe Query Complexity of Correlated Equilibria
The Query Complexity of Correlated Equilibria arxiv:1305.4874v2 [cs.gt] 25 Dec 2017 Sergiu Hart Noam Nisan December 27, 2017 Abstract We consider the complexity of finding a correlated equilibrium of an
More informationEvolution & Learning in Games
1 / 28 Evolution & Learning in Games Econ 243B Jean-Paul Carvalho Lecture 5. Revision Protocols and Evolutionary Dynamics 2 / 28 Population Games played by Boundedly Rational Agents We have introduced
More informationA General Class of Adaptive Strategies 1
Journal of Economic Theory 98, 2654 (2001) doi:10.1006jeth.2000.2746, available online at http:www.idealibrary.com on A General Class of Adaptive Strategies 1 Sergiu Hart 2 Center for Rationality and Interactive
More informationBetter-Reply Dynamics with Bounded Recall
` Kyiv School of Economics & Kyiv Economics Institute DISCUSSION PAPER SERIES Better-Reply Dynamics with Bounded Recall Andriy Zapechelnyuk (Kyiv School of Economics and Kyiv Economics Institute) First
More informationExistence of sparsely supported correlated equilibria
Existence of sparsely supported correlated equilibria Fabrizio Germano fabrizio.germano@upf.edu Gábor Lugosi lugosi@upf.es Departament d Economia i Empresa Universitat Pompeu Fabra Ramon Trias Fargas 25-27
More informationFrom Bandits to Experts: A Tale of Domination and Independence
From Bandits to Experts: A Tale of Domination and Independence Nicolò Cesa-Bianchi Università degli Studi di Milano N. Cesa-Bianchi (UNIMI) Domination and Independence 1 / 1 From Bandits to Experts: A
More informationLearning to play partially-specified equilibrium
Learning to play partially-specified equilibrium Ehud Lehrer and Eilon Solan August 29, 2007 August 29, 2007 First draft: June 2007 Abstract: In a partially-specified correlated equilibrium (PSCE ) the
More informationThe Online Approach to Machine Learning
The Online Approach to Machine Learning Nicolò Cesa-Bianchi Università degli Studi di Milano N. Cesa-Bianchi (UNIMI) Online Approach to ML 1 / 53 Summary 1 My beautiful regret 2 A supposedly fun game I
More information6.207/14.15: Networks Lecture 10: Introduction to Game Theory 2
6.207/14.15: Networks Lecture 10: Introduction to Game Theory 2 Daron Acemoglu and Asu Ozdaglar MIT October 14, 2009 1 Introduction Outline Mixed Strategies Existence of Mixed Strategy Nash Equilibrium
More informationGames A game is a tuple = (I; (S i ;r i ) i2i) where ffl I is a set of players (i 2 I) ffl S i is a set of (pure) strategies (s i 2 S i ) Q ffl r i :
On the Connection between No-Regret Learning, Fictitious Play, & Nash Equilibrium Amy Greenwald Brown University Gunes Ercal, David Gondek, Amir Jafari July, Games A game is a tuple = (I; (S i ;r i ) i2i)
More informationTijmen Daniëls Universiteit van Amsterdam. Abstract
Pure strategy dominance with quasiconcave utility functions Tijmen Daniëls Universiteit van Amsterdam Abstract By a result of Pearce (1984), in a finite strategic form game, the set of a player's serially
More informationAlgorithmic Game Theory and Applications. Lecture 4: 2-player zero-sum games, and the Minimax Theorem
Algorithmic Game Theory and Applications Lecture 4: 2-player zero-sum games, and the Minimax Theorem Kousha Etessami 2-person zero-sum games A finite 2-person zero-sum (2p-zs) strategic game Γ, is a strategic
More informationThe Complexity of Forecast Testing
The Complexity of Forecast Testing LANCE FORTNOW and RAKESH V. VOHRA Northwestern University Consider a weather forecaster predicting the probability of rain for the next day. We consider tests that given
More informationAsymptotic calibration
Biometrika (1998), 85, 2, pp. 379-390 Printed in Great Britain Asymptotic calibration BY DEAN P. FOSTER Department of Statistics, The Wharton School, University of Pennsylvania, Philadelphia, Pennsylvania
More informationOnline Learning for Time Series Prediction
Online Learning for Time Series Prediction Joint work with Vitaly Kuznetsov (Google Research) MEHRYAR MOHRI MOHRI@ COURANT INSTITUTE & GOOGLE RESEARCH. Motivation Time series prediction: stock values.
More informationGame Theory and its Applications to Networks - Part I: Strict Competition
Game Theory and its Applications to Networks - Part I: Strict Competition Corinne Touati Master ENS Lyon, Fall 200 What is Game Theory and what is it for? Definition (Roger Myerson, Game Theory, Analysis
More information4. Opponent Forecasting in Repeated Games
4. Opponent Forecasting in Repeated Games Julian and Mohamed / 2 Learning in Games Literature examines limiting behavior of interacting players. One approach is to have players compute forecasts for opponents
More informationBandit View on Continuous Stochastic Optimization
Bandit View on Continuous Stochastic Optimization Sébastien Bubeck 1 joint work with Rémi Munos 1 & Gilles Stoltz 2 & Csaba Szepesvari 3 1 INRIA Lille, SequeL team 2 CNRS/ENS/HEC 3 University of Alberta
More informationThe Game of Normal Numbers
The Game of Normal Numbers Ehud Lehrer September 4, 2003 Abstract We introduce a two-player game where at each period one player, say, Player 2, chooses a distribution and the other player, Player 1, a
More informationTheory and Applications of A Repeated Game Playing Algorithm. Rob Schapire Princeton University [currently visiting Yahoo!
Theory and Applications of A Repeated Game Playing Algorithm Rob Schapire Princeton University [currently visiting Yahoo! Research] Learning Is (Often) Just a Game some learning problems: learn from training
More informationComputational Game Theory Spring Semester, 2005/6. Lecturer: Yishay Mansour Scribe: Ilan Cohen, Natan Rubin, Ophir Bleiberg*
Computational Game Theory Spring Semester, 2005/6 Lecture 5: 2-Player Zero Sum Games Lecturer: Yishay Mansour Scribe: Ilan Cohen, Natan Rubin, Ophir Bleiberg* 1 5.1 2-Player Zero Sum Games In this lecture
More informationLearning, Games, and Networks
Learning, Games, and Networks Abhishek Sinha Laboratory for Information and Decision Systems MIT ML Talk Series @CNRG December 12, 2016 1 / 44 Outline 1 Prediction With Experts Advice 2 Application to
More informationNear-Potential Games: Geometry and Dynamics
Near-Potential Games: Geometry and Dynamics Ozan Candogan, Asuman Ozdaglar and Pablo A. Parrilo September 6, 2011 Abstract Potential games are a special class of games for which many adaptive user dynamics
More informationUnified Convergence Proofs of Continuous-time Fictitious Play
Unified Convergence Proofs of Continuous-time Fictitious Play Jeff S. Shamma and Gurdal Arslan Department of Mechanical and Aerospace Engineering University of California Los Angeles {shamma,garslan}@seas.ucla.edu
More informationA second look at minsup points and the existence of pure Nash equilibria. Adib Bagh Departments of Economics and Mathematics University of Kentucky.
A second look at minsup points and the existence of pure Nash equilibria Adib Bagh Departments of Economics and Mathematics University of Kentucky. Abstract. We prove the existence of pure strategy Nash
More informationPart IB Optimisation
Part IB Optimisation Theorems Based on lectures by F. A. Fischer Notes taken by Dexter Chua Easter 2015 These notes are not endorsed by the lecturers, and I have modified them (often significantly) after
More information1 Equilibrium Comparisons
CS/SS 241a Assignment 3 Guru: Jason Marden Assigned: 1/31/08 Due: 2/14/08 2:30pm We encourage you to discuss these problems with others, but you need to write up the actual homework alone. At the top of
More informationSelecting Efficient Correlated Equilibria Through Distributed Learning. Jason R. Marden
1 Selecting Efficient Correlated Equilibria Through Distributed Learning Jason R. Marden Abstract A learning rule is completely uncoupled if each player s behavior is conditioned only on his own realized
More informationRegret Testing: A Simple Payo -Based Procedure for Learning Nash Equilibrium 1
1 Regret Testing: A Simple Payo -Based Procedure for Learning Nash Equilibrium 1 DEAN P. FOSTER Department of Statistics, Wharton School, University of Pennsylvania H. PEYTON YOUNG Department of Economics,
More informationWorst-Case Bounds for Gaussian Process Models
Worst-Case Bounds for Gaussian Process Models Sham M. Kakade University of Pennsylvania Matthias W. Seeger UC Berkeley Abstract Dean P. Foster University of Pennsylvania We present a competitive analysis
More informationNotes for Week 3: Learning in non-zero sum games
CS 683 Learning, Games, and Electronic Markets Spring 2007 Notes for Week 3: Learning in non-zero sum games Instructor: Robert Kleinberg February 05 09, 2007 Notes by: Yogi Sharma 1 Announcements The material
More informationOptimization, Learning, and Games with Predictable Sequences
Optimization, Learning, and Games with Predictable Sequences Alexander Rakhlin University of Pennsylvania Karthik Sridharan University of Pennsylvania Abstract We provide several applications of Optimistic
More informationLong-Run versus Short-Run Player
Repeated Games 1 Long-Run versus Short-Run Player a fixed simultaneous move stage game Player 1 is long-run with discount factor δ actions a A a finite set 1 1 1 1 2 utility u ( a, a ) Player 2 is short-run
More information6.891 Games, Decision, and Computation February 5, Lecture 2
6.891 Games, Decision, and Computation February 5, 2015 Lecture 2 Lecturer: Constantinos Daskalakis Scribe: Constantinos Daskalakis We formally define games and the solution concepts overviewed in Lecture
More informationAlgorithms, Games, and Networks January 17, Lecture 2
Algorithms, Games, and Networks January 17, 2013 Lecturer: Avrim Blum Lecture 2 Scribe: Aleksandr Kazachkov 1 Readings for today s lecture Today s topic is online learning, regret minimization, and minimax
More informationThe Value of Information in Zero-Sum Games
The Value of Information in Zero-Sum Games Olivier Gossner THEMA, Université Paris Nanterre and CORE, Université Catholique de Louvain joint work with Jean-François Mertens July 2001 Abstract: It is not
More informationTime Series Prediction & Online Learning
Time Series Prediction & Online Learning Joint work with Vitaly Kuznetsov (Google Research) MEHRYAR MOHRI MOHRI@ COURANT INSTITUTE & GOOGLE RESEARCH. Motivation Time series prediction: stock values. earthquakes.
More informationAgnostic Online learnability
Technical Report TTIC-TR-2008-2 October 2008 Agnostic Online learnability Shai Shalev-Shwartz Toyota Technological Institute Chicago shai@tti-c.org ABSTRACT We study a fundamental question. What classes
More informationMathematical Economics - PhD in Economics
- PhD in Part 1: Supermodularity and complementarity in the one-dimensional and Paulo Brito ISEG - Technical University of Lisbon November 24, 2010 1 2 - Supermodular optimization 3 one-dimensional 4 Supermodular
More informationMS&E 246: Lecture 12 Static games of incomplete information. Ramesh Johari
MS&E 246: Lecture 12 Static games of incomplete information Ramesh Johari Incomplete information Complete information means the entire structure of the game is common knowledge Incomplete information means
More information6.207/14.15: Networks Lecture 11: Introduction to Game Theory 3
6.207/14.15: Networks Lecture 11: Introduction to Game Theory 3 Daron Acemoglu and Asu Ozdaglar MIT October 19, 2009 1 Introduction Outline Existence of Nash Equilibrium in Infinite Games Extensive Form
More informationNear-Potential Games: Geometry and Dynamics
Near-Potential Games: Geometry and Dynamics Ozan Candogan, Asuman Ozdaglar and Pablo A. Parrilo January 29, 2012 Abstract Potential games are a special class of games for which many adaptive user dynamics
More informationAn Online Convex Optimization Approach to Blackwell s Approachability
Journal of Machine Learning Research 17 (2016) 1-23 Submitted 7/15; Revised 6/16; Published 8/16 An Online Convex Optimization Approach to Blackwell s Approachability Nahum Shimkin Faculty of Electrical
More informationThe Distribution of Optimal Strategies in Symmetric Zero-sum Games
The Distribution of Optimal Strategies in Symmetric Zero-sum Games Florian Brandl Technische Universität München brandlfl@in.tum.de Given a skew-symmetric matrix, the corresponding two-player symmetric
More informationGame Theory: Lecture 2
Game Theory: Lecture 2 Tai-Wei Hu June 29, 2011 Outline Two-person zero-sum games normal-form games Minimax theorem Simplex method 1 2-person 0-sum games 1.1 2-Person Normal Form Games A 2-person normal
More informationWhat kind of bonus point system makes the rugby teams more offensive?
What kind of bonus point system makes the rugby teams more offensive? XIV Congreso Dr. Antonio Monteiro - 2017 Motivation Rugby Union is a game in constant evolution Experimental law variations every year
More informationAchieving Pareto Optimality Through Distributed Learning
1 Achieving Pareto Optimality Through Distributed Learning Jason R. Marden, H. Peyton Young, and Lucy Y. Pao Abstract We propose a simple payoff-based learning rule that is completely decentralized, and
More informationMultiagent Value Iteration in Markov Games
Multiagent Value Iteration in Markov Games Amy Greenwald Brown University with Michael Littman and Martin Zinkevich Stony Brook Game Theory Festival July 21, 2005 Agenda Theorem Value iteration converges
More informationAchieving Pareto Optimality Through Distributed Learning
1 Achieving Pareto Optimality Through Distributed Learning Jason R. Marden, H. Peyton Young, and Lucy Y. Pao Abstract We propose a simple payoff-based learning rule that is completely decentralized, and
More informationCS364A: Algorithmic Game Theory Lecture #13: Potential Games; A Hierarchy of Equilibria
CS364A: Algorithmic Game Theory Lecture #13: Potential Games; A Hierarchy of Equilibria Tim Roughgarden November 4, 2013 Last lecture we proved that every pure Nash equilibrium of an atomic selfish routing
More informationSurvival of Dominated Strategies under Evolutionary Dynamics. Josef Hofbauer University of Vienna. William H. Sandholm University of Wisconsin
Survival of Dominated Strategies under Evolutionary Dynamics Josef Hofbauer University of Vienna William H. Sandholm University of Wisconsin a hypnodisk 1 Survival of Dominated Strategies under Evolutionary
More informationLecture 1. Stochastic Optimization: Introduction. January 8, 2018
Lecture 1 Stochastic Optimization: Introduction January 8, 2018 Optimization Concerned with mininmization/maximization of mathematical functions Often subject to constraints Euler (1707-1783): Nothing
More informationExtensive Form Games I
Extensive Form Games I Definition of Extensive Form Game a finite game tree X with nodes x X nodes are partially ordered and have a single root (minimal element) terminal nodes are z Z (maximal elements)
More informationOnline Learning Class 12, 20 March 2006 Andrea Caponnetto, Sanmay Das
Online Learning 9.520 Class 12, 20 March 2006 Andrea Caponnetto, Sanmay Das About this class Goal To introduce the general setting of online learning. To describe an online version of the RLS algorithm
More information1. Introduction. 2. A Simple Model
. Introduction In the last years, evolutionary-game theory has provided robust solutions to the problem of selection among multiple Nash equilibria in symmetric coordination games (Samuelson, 997). More
More informationarxiv: v2 [cs.gt] 23 Jan 2019
Analysis of Hannan Consistent Selection for Monte Carlo Tree Search in Simultaneous Move Games Vojtěch Kovařík, Viliam Lisý vojta.kovarik@gmail.com, viliam.lisy@agents.fel.cvut.cz arxiv:1804.09045v2 [cs.gt]
More informationMean-field equilibrium: An approximation approach for large dynamic games
Mean-field equilibrium: An approximation approach for large dynamic games Ramesh Johari Stanford University Sachin Adlakha, Gabriel Y. Weintraub, Andrea Goldsmith Single agent dynamic control Two agents:
More informationSatisfaction Equilibrium: Achieving Cooperation in Incomplete Information Games
Satisfaction Equilibrium: Achieving Cooperation in Incomplete Information Games Stéphane Ross and Brahim Chaib-draa Department of Computer Science and Software Engineering Laval University, Québec (Qc),
More informationEconomics 209B Behavioral / Experimental Game Theory (Spring 2008) Lecture 3: Equilibrium refinements and selection
Economics 209B Behavioral / Experimental Game Theory (Spring 2008) Lecture 3: Equilibrium refinements and selection Theory cannot provide clear guesses about with equilibrium will occur in games with multiple
More informationOn Equilibria of Distributed Message-Passing Games
On Equilibria of Distributed Message-Passing Games Concetta Pilotto and K. Mani Chandy California Institute of Technology, Computer Science Department 1200 E. California Blvd. MC 256-80 Pasadena, US {pilotto,mani}@cs.caltech.edu
More informationCS 573: Algorithmic Game Theory Lecture date: January 23rd, 2008
CS 573: Algorithmic Game Theory Lecture date: January 23rd, 2008 Instructor: Chandra Chekuri Scribe: Bolin Ding Contents 1 2-Player Zero-Sum Game 1 1.1 Min-Max Theorem for 2-Player Zero-Sum Game....................
More informationהאוניברסיטה העברית בירושלים
האוניברסיטה העברית בירושלים THE HEBREW UNIVERSITY OF JERUSALEM TOWARDS A CHARACTERIZATION OF RATIONAL EXPECTATIONS by ITAI ARIELI Discussion Paper # 475 February 2008 מרכז לחקר הרציונליות CENTER FOR THE
More informationDefensive Forecasting: 4. Good probabilities mean less than we thought.
Defensive Forecasting: Good probabilities mean less than we thought! Foundations of Probability Seminar Departments of Statistics and Philosophy November 2, 206 Glenn Shafer, Rutgers University 0. Game
More informationInternal Regret in On-line Portfolio Selection
Internal Regret in On-line Portfolio Selection Gilles Stoltz (gilles.stoltz@ens.fr) Département de Mathématiques et Applications, Ecole Normale Supérieure, 75005 Paris, France Gábor Lugosi (lugosi@upf.es)
More informationOnline Learning and Online Convex Optimization
Online Learning and Online Convex Optimization Nicolò Cesa-Bianchi Università degli Studi di Milano N. Cesa-Bianchi (UNIMI) Online Learning 1 / 49 Summary 1 My beautiful regret 2 A supposedly fun game
More informationBargaining Efficiency and the Repeated Prisoners Dilemma. Bhaskar Chakravorti* and John Conley**
Bargaining Efficiency and the Repeated Prisoners Dilemma Bhaskar Chakravorti* and John Conley** Published as: Bhaskar Chakravorti and John P. Conley (2004) Bargaining Efficiency and the repeated Prisoners
More informationPatience and Ultimatum in Bargaining
Patience and Ultimatum in Bargaining Björn Segendorff Department of Economics Stockholm School of Economics PO Box 6501 SE-113 83STOCKHOLM SWEDEN SSE/EFI Working Paper Series in Economics and Finance No
More informationBrown s Original Fictitious Play
manuscript No. Brown s Original Fictitious Play Ulrich Berger Vienna University of Economics, Department VW5 Augasse 2-6, A-1090 Vienna, Austria e-mail: ulrich.berger@wu-wien.ac.at March 2005 Abstract
More informationPopulation Games and Evolutionary Dynamics
Population Games and Evolutionary Dynamics (MIT Press, 200x; draft posted on my website) 1. Population games 2. Revision protocols and evolutionary dynamics 3. Potential games and their applications 4.
More informationGame Theory: introduction and applications to computer networks
Game Theory: introduction and applications to computer networks Introduction Giovanni Neglia INRIA EPI Maestro 27 January 2014 Part of the slides are based on a previous course with D. Figueiredo (UFRJ)
More informationEC319 Economic Theory and Its Applications, Part II: Lecture 2
EC319 Economic Theory and Its Applications, Part II: Lecture 2 Leonardo Felli NAB.2.14 23 January 2014 Static Bayesian Game Consider the following game of incomplete information: Γ = {N, Ω, A i, T i, µ
More informationOn the Impossibility of Predicting the Behavior of Rational Agents. Dean P. Foster* and H. Peyton Young ** February, 1999 This version: October, 2000
On the Impossibility of Predicting the Behavior of Rational Agents Dean P. Foster* and H. Peyton Young ** February, 1999 This version: October, 2000 We exhibit a class of games that almost surely cannot
More informationLecture December 2009 Fall 2009 Scribe: R. Ring In this lecture we will talk about
0368.4170: Cryptography and Game Theory Ran Canetti and Alon Rosen Lecture 7 02 December 2009 Fall 2009 Scribe: R. Ring In this lecture we will talk about Two-Player zero-sum games (min-max theorem) Mixed
More informationMixed Nash Equilibria
lgorithmic Game Theory, Summer 2017 Mixed Nash Equilibria Lecture 2 (5 pages) Instructor: Thomas Kesselheim In this lecture, we introduce the general framework of games. Congestion games, as introduced
More informationPrincipal-Agent Games - Equilibria under Asymmetric Information -
Principal-Agent Games - Equilibria under Asymmetric Information - Ulrich Horst 1 Humboldt-Universität zu Berlin Department of Mathematics and School of Business and Economics Work in progress - Comments
More informationAlgorithmic Strategy Complexity
Algorithmic Strategy Complexity Abraham Neyman aneyman@math.huji.ac.il Hebrew University of Jerusalem Jerusalem Israel Algorithmic Strategy Complexity, Northwesten 2003 p.1/52 General Introduction The
More informationRegret in the On-Line Decision Problem
Games and Economic Behavior 29, 7 35 (1999) Article ID game.1999.0740, available online at http://www.idealibrary.com on Regret in the On-Line Decision Problem Dean P. Foster Department of Statistics,
More informationStable Games and their Dynamics
Stable Games and their Dynamics Josef Hofbauer and William H. Sandholm January 10, 2009 Abstract We study a class of population games called stable games. These games are characterized by self-defeating
More informationPh.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program June 2016
Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program June 2016 The time limit for this exam is four hours. The exam has four sections. Each section includes two questions.
More informationImproving Convergence Rates in Multiagent Learning Through Experts and Adaptive Consultation
Improving Convergence Rates in Multiagent Learning Through Experts and Adaptive Consultation Greg Hines Cheriton School of Computer Science University of Waterloo Waterloo, Ontario, N2L 3G1 Kate Larson
More informationConvergence and No-Regret in Multiagent Learning
Convergence and No-Regret in Multiagent Learning Michael Bowling Department of Computing Science University of Alberta Edmonton, Alberta Canada T6G 2E8 bowling@cs.ualberta.ca Abstract Learning in a multiagent
More informationA Note on the Existence of Ratifiable Acts
A Note on the Existence of Ratifiable Acts Joseph Y. Halpern Cornell University Computer Science Department Ithaca, NY 14853 halpern@cs.cornell.edu http://www.cs.cornell.edu/home/halpern August 15, 2018
More informationGames and Their Equilibria
Chapter 1 Games and Their Equilibria The central notion of game theory that captures many aspects of strategic decision making is that of a strategic game Definition 11 (Strategic Game) An n-player strategic
More information