KINETIC MODELS FOR DIFFERENTIAL GAMES
|
|
- Robyn Fisher
- 5 years ago
- Views:
Transcription
1 KINETIC MODELS FOR DIFFERENTIAL GAMES D. Brinkman, C. Ringhofer math.la.asu.edu/ chris Work supported by NSF KI-NET Partially based on prev. work with P. Degond (Imperial College) M. Herty (RWTH Aachen) and J.G. Liu (Duke U.)
2 Classical kinetic models: An ensemble of mechanical particles is driven by a potential energy and undergoes binary interactions between agents or with a background. Drift: d dt ξ n = x V(ξ n, ξ), ξ ξ t x V(ξ n, ξ), ξ = (ξ 1,.., ξ N ) Random Diffusion: ξ n q n ( ξ), dp[q n ( ξ) = r] = P n (r, ξ) dr Assumptions: 1. (IID) Many particles. All particles are the same. 2. Correlations between particles are approximated. 3. The ensemble tries to minimize a common energy functional V cooperatively.
3 Kinetic density: Let f (x, t) dx = dp[ξ n (t) = x] t f (x, t) x (f x V f ) = Q[f ] x: state of the agent. V f : mean field potential for a multi agent system. Q[f ] interaction; Boltzmann : Q integral operator: Q[f ](x) = P f (x, x )ω f (x ) dx ω f (x)f (x) Einstein - Brownian motion: Q[f ] = x (D f x f ). System driven to a global minimum.
4 SOCIAL, ECONOMIC OR BIOLOGICAL APPLICATIONS: Non - cooperative agents. Each agent has an individual cost functional (energy), which it tries to optimize, given the action of the other agents. This leads to a Pareto optimum and a Nash equilibrium instead of a global optimum.
5 THE NASH EQUILIBRUM NE for pure strategies: Take two players: A and B with strategies (states) x and y and costs C A and C B. Optimal response: x opt = φ A (y) = min z {C A (z, y)}, y opt = φ B (x) = min z {C B (x, z)} Nash equilibrium: Neither player can do better, given the other s state fixed point problem φ A (φ B (x)) = x φ B (φ A (y)) = y 2. NE for mixed strategies: Two ensembles of players (or two players picking strategies from distributions f A and f B ). f A opt = φ A [f B ] = inf g {E g,f B[C A ]}, f B opt = φ B [f A ] = inf g {E f A,g[C B ]} Again: A fixed point problem (for functionals of distributions) φ A [f B [f A ]] = f A φ B [f A [f B ]] = f B
6 x opt = φ A (y) = min z {C A (z, y)}, y opt = φ B (x) = min z {C B (x, z)} φ A (φ B (x)) = x φ B (φ A (y)) = y This is not as optimal as minimizing C A + C B in cooperation (Prisoner s dilemma). General framework of Non-Cooperative, Non-Atomic, Anonymous games with a Continuum of Players (NCNAACP). References: Aumann, Mas Colell, Schmeidler, Shapiro, Shapley,
7 ISSUES: The Nash equilibrium is an equilibrium concept and gives no indication about the dynamics of the system. It assumes global knowledge of each player. Goal: Derive kinetic theories from agent (particle) based models for the dynamics of an actual game based on cost functionals and optimal responses.
8 OUTLINE Dynamics. General framework for mixed strategies and ensembles of players. Predict the future. (Optimal control Mean field games.) Learn from the past. (Decisions based on past experience Behavioral game theory.) Survival of the best strategies (Darwinism) Evolutionary game theory. 2. A simple example (Dynamics of mixed strategies for a RPS game.) 3. Application: Designing insurance policies. Game between an insurer and an ensemble of clients. Numerics. Kinetic theory. 4. Conclusions and outlook.
9 MODELING DYNAMICS I (Estimating the future) Optimal control approach: Swarms of intelligent individuals. Each agent observes the current state of the others and optimizes its cost, based on this observation. (1a.) Estimate over a finite time horizon. (Mean field games.) Given a strategy trajectory y(t) of B, A optimizes the cost of the trajectory t+τ t C A (x(s), y(s)) ds over a finite horizon τ. Job: Find x(t) = x opt (t) such that t+τ t the constraint d dty = V(x, y). C(x(s), y(t)) ds min under Using adjoint calculus, this yields a two point BVP in time, namely a forward problem for y(t) (to satisfy the constraint), coupled to a backward in time problem for a Lagrange multiplier λ(t). Hamilton - Jacobi - Bellmann equtns. Mean-field games (Lasry,Lions)
10 (1b.) Intelligent Swarms. A optimizes the current cost C A (x(t), y(t)) and moves incrementally towards the Pareto optimum. Yields a local in time control and an equation of the type t f A (x, t) = ω(φ A [f B ] f A ). (1c.) Instantaneous control. A proceeds as in (1a) and (1b), but uses only a Taylor series expansion of t+τ t C A (x(s), y(t)) ds for τ 0 and a Taylor series solution of d dty = V(x, y). local in time control. (Degond, Herty, Liu,CR).
11 MODELING DYNAMICS II (Learning) 16 Behavioral game theory; Machine learning General setup: Two ensembles A and B, of distributed players. A uses a (distributed) strategy x for each game. B uses a strategy y. A records outcomes via an observable a, trying to learn the behavior of B. In the same way, B uses an observable b, trying to learn about A.
12 A MULTI AGENT MODEL 2 ensembles A and B with N and M players, each with a strategy x (for A) and y (for B), and an observable a (for A) and b (for B). Step 1: A picks the optimal strategy based on the experience a. x(t + t) = ξ(a(t)), C A (ξ(a), a) = min x C A (x, a) Step 2: They actually play. A picks a random member of B, giving costs C A (x, y) and C B (y, x) for A and B. Step 3: A and B update their observables a and b, based on the outcome. Symmetric picture for B, y, b. a(t + t) = a(t) +???
13 OBSERVABLE UPDATES (a and b) 19 Discount past experiences over time. Behavioral game theory learning and forgetting (Camerer et al) 1) Rolling averages: finite memory 2) Time weighted averages: Discount past experience with a time dependent weight. a(t) = t 0 w(t, τ)c A (x(τ), y(τ)) dτ, t 0 w(t, τ) dτ = 1 t Three types of weights usable in a kinetic approch: Exponential: w(t, τ) e γ(τ t) Geometric: w(t, τ) ( τ t )γ Constant: w(t, τ) 1
14 THE IID MODEL Independent identically distributed agents in ensembles A and B with N >> 1 and M >> 1 members. F(x 1,..x N, a 1,..a N, y 1,..y M, b 1,.., b M, t) = N n=1 f A (x n, a n ) M m=1 f B (y m, b m ) Yields a system of two coupled kinetic equations for f A (x, a, t) and f B (y, b, t). t f A (x, a) = Q A strat[f A ] + Q A obs[f A, f B ] Q A strat[f A ] = ω[δ(x φ A (a)) f A (x, a) dx f A ] +λ Q A obs[f A, f B ] = a (w(t, t)af A ) δ(a C A (x, y )f A (x, a )f B (y, b ) dy a b λf A (x, a)
15 GENERAL KINETIC MODEL FOR TWO PLAYER ENSEMBLES 22 t f A (x, a) = Q A strat[f A ] + Q A obs [f A, f B ], t f B (y, b) = Q B strat[f B ] + Q B obs [f B, f A ] Q A,B strat are simple linear relaxation operators, giving the strategies, based on the observables. Q A,B obs are nonlinear operators, coupling the observables based on the actual outcome of the game. Three time scales: ω: frequency of the games. λ: frequency of observable updates (learning). w(t, t): discounting past experiences (forgetting).
16 DYNAMIC CONVERGENCE TOWARDS A NASH EQUILIBRIUM 24 Theorem Assumption: If learning is successful, i.e. the estimated cost converges against the real cost, and lim t lim t f A (x, a) dx = f B (y, b, t) dy = C A (x, a) dx, C B (y, b) dy then the system converges against the Nash equilibrium for mixed strategies, when averaged over time. 1 lim t t 1 lim t t t 0 t 0 dτ dτ da f A (x, a, τ) = f A Nash(x) db f B (y, b, τ) = f B Nash(y)
17 A SIMPLE EXAMPLE 26 (Rock - Paper - Scissors) Cost to A: A B R P S C A (x, y) = R P S Remark 1: x, y discrete {R = 1, P = 2, S = 3} +λ t f A (x, a) = Q A strat[f A ] + Q A obs[f A, f B ] Q A strat[f A ] = ω[δ(x φ A (a)) f A (x, a) dx f A ] Q A obs[f A, f B ] = a (w(t, t)af A ) C A (x, y )f A (x, a )f B (y, b ) dx a y b λc A (x, y)f A (x, a)
18 Remark 2: RPS has no Nash equilibrium for pure strategies and a trivial NE for mixed strategies P[R, R, S] = ( 1 3, 1 3, 1 3 ). Remark 3: For successful learning we have to try (randomly) new strategies. Example: B plays R all the time and A tries to learn about B. Need to add Q A try: t f A (x, a) = Q A strat[f A ] + Q A try[f A ] + Q A obs [f A, f B ] with Relaxation: Q A try[f A ](x, a) = α[p(x) f A (x, a) dx f A (x, a)] or Brownian Motion: Q A try[f A ](x, a) = α x f A
19 EXAMPLE RPS 1 B plays RPS with f(r)=0.5,f(p)=0.25,f(s)=0.25. A learns. Strategies (left panel) and estimated profits (right panel)for A (R=blue, P=green, S=red) strategies for A strategy profits for A # of games # of games,
20 EXAMPLE RPS 2 A and B both learn convergence to the Nash equilibrium. strategies and profits for A. Strategies (left panel) and estimated profits (right panel)for A (R=blue, P=green, S=red) strategies for A strategy profits for A # of games # of games,
21 A and B both learn convergence to the Nash equilibrium. strategies and profits for B. Strategies (left panel) and estimated profits (right panel)for A (R=blue, P=green, S=red) strategies for B strategy profits for B # of games # of games,
22 Corollary: Steady state given by modified NE 1 lim t t 1 lim t t t dτ 0 t 0 dτ da f A (x, a, τ) = f A Nash(x)+p(x) db f B (y, b, τ) = f B Nash(y)+p(y)
23 OPTIMAL DESIGN OF INSURANCE POLICIES Motivation: 1995: a major US university went from a uniform health insurance plan for all employees to a staggered system with 3 plans with different rates and benefits. Outcome: All plans, except for the cheapest, died out very quickly.
24 OPTIMAL DESIGN OF INSURANCE POLICIES 32 A: ensemble of clients with N >> 1 members. B: one insurer M = 1, offers K insurance plans with annual rates r k, k = 1 : K and cutoffs y k, k = 1 : K. There is a zero benefit plan k = 0 with r 0 = 0 and y 0 = (no insurance). Strategies: Client strategy (A, discrete): x = k {0,.., K}, the plan it chooses (including k = 0 no insurance). Insurer strategy (B): Annual rates r 1,.., r K ; cutoff values y 1,.., y K for accepting a member of A with risk a into plan k = 1 : K. Observable (a, risk, average claims): Poisson process Modification: Claims, independent of x and y!!) A and B share the same observable a.
25 COSTS AND OPTIMAL RESPONSE Optimal response for a client in A: The cost to a member of A, trying to choose plan k = 0 : K is ( ) C A r(k) for k = 1 : K and a < y(k) (eligible) = a for k = 0 (no insurance) If choosing a plan k = 1 : K, the optimal choice is the cheapest for which A is eligible: ν(y, a) = min{k : k = 1 : K, a k < y k } A compares this to the expected out of pocket cost a: ν(y, a) if r ν(y,a) < a (choosing a plan) k opt (y, r, a) = 0 if a < r ν(y,a) (opting out) 0 if a > y K (not eligible) k opt (y, r, a) = H(a r ν(y,a) )H(y K a)ν(y, a)
26 COSTS AND OPTIMAL RESPONSE Insurer B: Insurer cost: Given an ensemble A with distributed risk a choosing plan k C B = K yk k=1 0 (αa r k)f A (k, a) da α < 1: discount factor; y k : risk cutoff in plan k. Optimal response of insurer B : (Has to include a rate strategy!) y opt k = r k af α, r k = β A (k,a) da f A (k,a) da, β > 1 β > 1: greed factor, guarantees that C B 0. C B = α K yk k=1 0 (a βy k )f A (k, a) da 0
27 NUMERICAL RESULTS FOR A SIMPLE MULTI - AGENT MODEL insured (A) 1 insurer (B),. K = 3 insurance plans (+ plan zero = no insurance). 2 risk groups Results extremely sensitive on the greed factor β > 1 which guarantees a profit for the insurer (as long as the plan is not empty).
28 Greed factor β 5 3 : desired profit = 2 3 of the expected risk. Outcome: All but the cheapest plan and plan zero die out.
29 Greed factor β = 4 3 : desired profit = 1 3 of the expected risk. Outcome: All three plans survive.
30 KINETIC INSURANCE MODEL 43 Simplification: The observable claims a of the client ensemble A are governed by a Poisson process, independent of the strategies. This allows for a closure in terms of ρ A k (t) = f A (k, a, t) da and u A (a, t) = k f A (k, a, t) t u A (a, t) = a (w(t, t)au A ) + λa(a) λu A (a, t) t fk A (a, t) = ω[ φ A (k, a, f B )u A (a) da fk A ], t f B k (y, t) = Q B strat[f B, u A, ρ A k ]
31 COMPUTATIONAL COMPLEXITY K + 1 transport equations for the strategies of A. K transport equations for the strategies of B. 1 transport equation for the risk. Three time scales: ω: game frequency, λ: updates of the risk, (claim frequency). w(t, t): discount factor (forgetting about old claims)
32 CONCLUSIONS Content: 1. General kinetic framework for two player non - cooperative games with mixed strategies. 2. Include dynamics via a behavioral game theory approach. 3. Leads to a system of transport equations for the probability densities of strategies and observables involving multiple time scales. 4. Applications to the design and evolution of insurance policies. Outlook: Further analysis of the kinetic model. Time scale separation, moment closures, Chapman - Enskog type expansions. Further investigation in rate strategies.
33 REFERENCES Available at: math.la.asu.edu/ chris Mean Field Games, Jean-Michel LASRY, Pierre-Louis LIONS, Japan. J. Math 2, (2007) DOI: /s Evolution of the distribution of wealth in a nonconservative economic environment driven by local Nash equilibria Pierre Degond, Jian-Guo Liu and Christian Ringhofer Proceedings of the Royal Society A (to be published 2015) Evolution of the Distribution of Wealth in an Economic Environment Driven by Local Nash Equilibria Pierre Degond, Jian-Guo Liu, Christian Ringhofer, Journal of Statistical Physics, published online (2013) Large-Scale Dynamics of Mean-Field Games Driven by Local Nash Equilibria Pierre Degond, Jian-Guo Liu, Christian Ringhofer, Journal of Nonlinear Science, published online (2013)
On the interplay between kinetic theory and game theory
1 On the interplay between kinetic theory and game theory Pierre Degond Department of mathematics, Imperial College London pdegond@imperial.ac.uk http://sites.google.com/site/degond/ Joint work with J.
More informationGame-Theoretic Learning:
Game-Theoretic Learning: Regret Minimization vs. Utility Maximization Amy Greenwald with David Gondek, Amir Jafari, and Casey Marks Brown University University of Pennsylvania November 17, 2004 Background
More informationDifferent types of phase transitions for a simple model of alignment of oriented particles
Different types of phase transitions for a simple model of alignment of oriented particles Amic Frouvelle Université Paris Dauphine Joint work with Jian-Guo Liu (Duke University, USA) and Pierre Degond
More informationHJB equations. Seminar in Stochastic Modelling in Economics and Finance January 10, 2011
Department of Probability and Mathematical Statistics Faculty of Mathematics and Physics, Charles University in Prague petrasek@karlin.mff.cuni.cz Seminar in Stochastic Modelling in Economics and Finance
More informationON MEAN FIELD GAMES. Pierre-Louis LIONS. Collège de France, Paris. (joint project with Jean-Michel LASRY)
Collège de France, Paris (joint project with Jean-Michel LASRY) I INTRODUCTION II A REALLY SIMPLE EXAMPLE III GENERAL STRUCTURE IV TWO PARTICULAR CASES V OVERVIEW AND PERSPECTIVES I. INTRODUCTION New class
More informationOnline Learning Class 12, 20 March 2006 Andrea Caponnetto, Sanmay Das
Online Learning 9.520 Class 12, 20 March 2006 Andrea Caponnetto, Sanmay Das About this class Goal To introduce the general setting of online learning. To describe an online version of the RLS algorithm
More informationMean-Field optimization problems and non-anticipative optimal transport. Beatrice Acciaio
Mean-Field optimization problems and non-anticipative optimal transport Beatrice Acciaio London School of Economics based on ongoing projects with J. Backhoff, R. Carmona and P. Wang Robust Methods in
More informationKnowing What Others Know: Coordination Motives in Information Acquisition
Knowing What Others Know: Coordination Motives in Information Acquisition Christian Hellwig and Laura Veldkamp UCLA and NYU Stern May 2006 1 Hellwig and Veldkamp Two types of information acquisition Passive
More informationSUMMER SCHOOL - KINETIC EQUATIONS LECTURE IV: HOMOGENIZATION OF KINETIC FLOWS IN NETWORKS. Shanghai, 2011.
SUMMER SCHOOL - KINETIC EQUATIONS LECTURE IV: HOMOGENIZATION OF KINETIC FLOWS IN NETWORKS. Shanghai, 2011. C. Ringhofer ringhofer@asu.edu, math.la.asu.edu/ chris OVERVIEW Quantum and classical description
More informationGame Theory, Evolutionary Dynamics, and Multi-Agent Learning. Prof. Nicola Gatti
Game Theory, Evolutionary Dynamics, and Multi-Agent Learning Prof. Nicola Gatti (nicola.gatti@polimi.it) Game theory Game theory: basics Normal form Players Actions Outcomes Utilities Strategies Solutions
More informationA (Brief) Introduction to Game Theory
A (Brief) Introduction to Game Theory Johanne Cohen PRiSM/CNRS, Versailles, France. Goal Goal is a Nash equilibrium. Today The game of Chicken Definitions Nash Equilibrium Rock-paper-scissors Game Mixed
More informationMulti-dimensional Stochastic Singular Control Via Dynkin Game and Dirichlet Form
Multi-dimensional Stochastic Singular Control Via Dynkin Game and Dirichlet Form Yipeng Yang * Under the supervision of Dr. Michael Taksar Department of Mathematics University of Missouri-Columbia Oct
More informationA model of alignment interaction for oriented particles with phase transition
A model of alignment interaction for oriented particles with phase transition Amic Frouvelle ACMAC Joint work with Jian-Guo Liu (Duke University, USA) and Pierre Degond (Institut de Mathématiques de Toulouse,
More informationSUMMER SCHOOL - KINETIC EQUATIONS LECTURE II: CONFINED CLASSICAL TRANSPORT Shanghai, 2011.
SUMMER SCHOOL - KINETIC EQUATIONS LECTURE II: CONFINED CLASSICAL TRANSPORT Shanghai, 2011. C. Ringhofer ringhofer@asu.edu, math.la.asu.edu/ chris OVERVIEW Quantum and classical description of many particle
More informationStochastic Volatility and Correction to the Heat Equation
Stochastic Volatility and Correction to the Heat Equation Jean-Pierre Fouque, George Papanicolaou and Ronnie Sircar Abstract. From a probabilist s point of view the Twentieth Century has been a century
More informationHyperbolic Models for Large Supply Chains. Christian Ringhofer (Arizona State University) Hyperbolic Models for Large Supply Chains p.
Hyperbolic Models for Large Supply Chains Christian Ringhofer (Arizona State University) Hyperbolic Models for Large Supply Chains p. /4 Introduction Topic: Overview of conservation law (traffic - like)
More informationIntroduction to multiscale modeling and simulation. Explicit methods for ODEs : forward Euler. y n+1 = y n + tf(y n ) dy dt = f(y), y(0) = y 0
Introduction to multiscale modeling and simulation Lecture 5 Numerical methods for ODEs, SDEs and PDEs The need for multiscale methods Two generic frameworks for multiscale computation Explicit methods
More informationBelief-based Learning
Belief-based Learning Algorithmic Game Theory Marcello Restelli Lecture Outline Introdutcion to multi-agent learning Belief-based learning Cournot adjustment Fictitious play Bayesian learning Equilibrium
More informationEvolution & Learning in Games
1 / 28 Evolution & Learning in Games Econ 243B Jean-Paul Carvalho Lecture 5. Revision Protocols and Evolutionary Dynamics 2 / 28 Population Games played by Boundedly Rational Agents We have introduced
More informationSome Aspects of First Principle Models for Production Systems and Supply Chains. Christian Ringhofer (Arizona State University)
Some Aspects of First Principle Models for Production Systems and Supply Chains Christian Ringhofer (Arizona State University) Some Aspects of First Principle Models for Production Systems and Supply Chains
More informationA model of alignment interaction for oriented particles with phase transition
A model of alignment interaction for oriented particles with phase transition Amic Frouvelle Archimedes Center for Modeling, Analysis & Computation (ACMAC) University of Crete, Heraklion, Crete, Greece
More informationBounded Rationality, Strategy Simplification, and Equilibrium
Bounded Rationality, Strategy Simplification, and Equilibrium UPV/EHU & Ikerbasque Donostia, Spain BCAM Workshop on Interactions, September 2014 Bounded Rationality Frequently raised criticism of game
More informationA model of alignment interaction for oriented particles with phase transition
A model of alignment interaction for oriented particles with phase transition Amic Frouvelle Institut de Mathématiques de Toulouse Joint work with Jian-Guo Liu (Duke Univ.) and Pierre Degond (IMT) Amic
More informationSurvival of Dominated Strategies under Evolutionary Dynamics. Josef Hofbauer University of Vienna. William H. Sandholm University of Wisconsin
Survival of Dominated Strategies under Evolutionary Dynamics Josef Hofbauer University of Vienna William H. Sandholm University of Wisconsin a hypnodisk 1 Survival of Dominated Strategies under Evolutionary
More informationEvolution & Learning in Games
1 / 27 Evolution & Learning in Games Econ 243B Jean-Paul Carvalho Lecture 2. Foundations of Evolution & Learning in Games II 2 / 27 Outline In this lecture, we shall: Take a first look at local stability.
More informationAn Introduction to Noncooperative Games
An Introduction to Noncooperative Games Alberto Bressan Department of Mathematics, Penn State University Alberto Bressan (Penn State) Noncooperative Games 1 / 29 introduction to non-cooperative games:
More informationDynamic Games with Asymmetric Information: Common Information Based Perfect Bayesian Equilibria and Sequential Decomposition
Dynamic Games with Asymmetric Information: Common Information Based Perfect Bayesian Equilibria and Sequential Decomposition 1 arxiv:1510.07001v1 [cs.gt] 23 Oct 2015 Yi Ouyang, Hamidreza Tavafoghi and
More informationIntroduction to Continuous-Time Dynamic Optimization: Optimal Control Theory
Econ 85/Chatterjee Introduction to Continuous-ime Dynamic Optimization: Optimal Control heory 1 States and Controls he concept of a state in mathematical modeling typically refers to a specification of
More informationCHAPTER 2 THE MAXIMUM PRINCIPLE: CONTINUOUS TIME. Chapter2 p. 1/67
CHAPTER 2 THE MAXIMUM PRINCIPLE: CONTINUOUS TIME Chapter2 p. 1/67 THE MAXIMUM PRINCIPLE: CONTINUOUS TIME Main Purpose: Introduce the maximum principle as a necessary condition to be satisfied by any optimal
More informationSatisfaction Equilibrium: Achieving Cooperation in Incomplete Information Games
Satisfaction Equilibrium: Achieving Cooperation in Incomplete Information Games Stéphane Ross and Brahim Chaib-draa Department of Computer Science and Software Engineering Laval University, Québec (Qc),
More informationErgodicity and Non-Ergodicity in Economics
Abstract An stochastic system is called ergodic if it tends in probability to a limiting form that is independent of the initial conditions. Breakdown of ergodicity gives rise to path dependence. We illustrate
More informationUniversity Of Calgary Department of Mathematics and Statistics
University Of Calgary Department of Mathematics and Statistics Hawkes Seminar May 23, 2018 1 / 46 Some Problems in Insurance and Reinsurance Mohamed Badaoui Department of Electrical Engineering National
More informationPROBABILITY: LIMIT THEOREMS II, SPRING HOMEWORK PROBLEMS
PROBABILITY: LIMIT THEOREMS II, SPRING 15. HOMEWORK PROBLEMS PROF. YURI BAKHTIN Instructions. You are allowed to work on solutions in groups, but you are required to write up solutions on your own. Please
More informationSUMMER SCHOOL - KINETIC EQUATIONS LECTURE I: TRANSPORT PICTURES AND DISSIPATION Shanghai, 2011.
SUMMER SCHOOL - KINETIC EQUATIONS LECTURE I: TRANSPORT PICTURES AND DISSIPATION Shanghai, 2011. C. Ringhofer ringhofer@asu.edu, math.la.asu.edu/ chris OVERVIEW Quantum and classical description of many
More informationProductivity Losses from Financial Frictions: Can Self-financing Undo Capital Misallocation?
Productivity Losses from Financial Frictions: Can Self-financing Undo Capital Misallocation? Benjamin Moll G Online Appendix: The Model in Discrete Time and with iid Shocks This Appendix presents a version
More informationIntroduction to Reachability Somil Bansal Hybrid Systems Lab, UC Berkeley
Introduction to Reachability Somil Bansal Hybrid Systems Lab, UC Berkeley Outline Introduction to optimal control Reachability as an optimal control problem Various shades of reachability Goal of This
More informationWeak solutions of mean-field stochastic differential equations
Weak solutions of mean-field stochastic differential equations Juan Li School of Mathematics and Statistics, Shandong University (Weihai), Weihai 26429, China. Email: juanli@sdu.edu.cn Based on joint works
More informationA scalar conservation law with discontinuous flux for supply chains with finite buffers.
A scalar conservation law with discontinuous flux for supply chains with finite buffers. Dieter Armbruster School of Mathematical and Statistical Sciences, Arizona State University & Department of Mechanical
More informationGains in evolutionary dynamics. Dai ZUSAI
Gains in evolutionary dynamics unifying rational framework for dynamic stability Dai ZUSAI Hitotsubashi University (Econ Dept.) Economic Theory Workshop October 27, 2016 1 Introduction Motivation and literature
More informationIntroduction to game theory LECTURE 1
Introduction to game theory LECTURE 1 Jörgen Weibull January 27, 2010 1 What is game theory? A mathematically formalized theory of strategic interaction between countries at war and peace, in federations
More informationELEMENTS OF PROBABILITY THEORY
ELEMENTS OF PROBABILITY THEORY Elements of Probability Theory A collection of subsets of a set Ω is called a σ algebra if it contains Ω and is closed under the operations of taking complements and countable
More informationDifferent types of phase transitions for a simple model of alignment of oriented particles
Different types of phase transitions for a simple model of alignment of oriented particles Amic Frouvelle CEREMADE Université Paris Dauphine Joint work with Jian-Guo Liu (Duke University, USA) and Pierre
More informationReflected Brownian Motion
Chapter 6 Reflected Brownian Motion Often we encounter Diffusions in regions with boundary. If the process can reach the boundary from the interior in finite time with positive probability we need to decide
More informationThe Mathematics of Continuous Time Contract Theory
The Mathematics of Continuous Time Contract Theory Ecole Polytechnique, France University of Michigan, April 3, 2018 Outline Introduction to moral hazard 1 Introduction to moral hazard 2 3 General formulation
More informationPopulation Games and Evolutionary Dynamics
Population Games and Evolutionary Dynamics (MIT Press, 200x; draft posted on my website) 1. Population games 2. Revision protocols and evolutionary dynamics 3. Potential games and their applications 4.
More informationLecture Notes 6: Dynamic Equations Part A: First-Order Difference Equations in One Variable
University of Warwick, EC9A0 Maths for Economists Peter J. Hammond 1 of 54 Lecture Notes 6: Dynamic Equations Part A: First-Order Difference Equations in One Variable Peter J. Hammond latest revision 2017
More informationEVOLUTIONARY GAMES WITH GROUP SELECTION
EVOLUTIONARY GAMES WITH GROUP SELECTION Martin Kaae Jensen Alexandros Rigos Department of Economics University of Leicester Controversies in Game Theory: Homo Oeconomicus vs. Homo Socialis ETH Zurich 12/09/2014
More informationPROBABILITY: LIMIT THEOREMS II, SPRING HOMEWORK PROBLEMS
PROBABILITY: LIMIT THEOREMS II, SPRING 218. HOMEWORK PROBLEMS PROF. YURI BAKHTIN Instructions. You are allowed to work on solutions in groups, but you are required to write up solutions on your own. Please
More informationLecture 3: Hamilton-Jacobi-Bellman Equations. Distributional Macroeconomics. Benjamin Moll. Part II of ECON Harvard University, Spring
Lecture 3: Hamilton-Jacobi-Bellman Equations Distributional Macroeconomics Part II of ECON 2149 Benjamin Moll Harvard University, Spring 2018 1 Outline 1. Hamilton-Jacobi-Bellman equations in deterministic
More information1 Introduction. 2 Diffusion equation and central limit theorem. The content of these notes is also covered by chapter 3 section B of [1].
1 Introduction The content of these notes is also covered by chapter 3 section B of [1]. Diffusion equation and central limit theorem Consider a sequence {ξ i } i=1 i.i.d. ξ i = d ξ with ξ : Ω { Dx, 0,
More informationOther-Regarding Preferences: Theory and Evidence
Other-Regarding Preferences: Theory and Evidence June 9, 2009 GENERAL OUTLINE Economic Rationality is Individual Optimization and Group Equilibrium Narrow version: Restrictive Assumptions about Objective
More informationDistributed Learning based on Entropy-Driven Game Dynamics
Distributed Learning based on Entropy-Driven Game Dynamics Bruno Gaujal joint work with Pierre Coucheney and Panayotis Mertikopoulos Inria Aug., 2014 Model Shared resource systems (network, processors)
More informationLinearization of Differential Equation Models
Linearization of Differential Equation Models 1 Motivation We cannot solve most nonlinear models, so we often instead try to get an overall feel for the way the model behaves: we sometimes talk about looking
More informationFunctional Limit theorems for the quadratic variation of a continuous time random walk and for certain stochastic integrals
Functional Limit theorems for the quadratic variation of a continuous time random walk and for certain stochastic integrals Noèlia Viles Cuadros BCAM- Basque Center of Applied Mathematics with Prof. Enrico
More informationAffine Processes. Econometric specifications. Eduardo Rossi. University of Pavia. March 17, 2009
Affine Processes Econometric specifications Eduardo Rossi University of Pavia March 17, 2009 Eduardo Rossi (University of Pavia) Affine Processes March 17, 2009 1 / 40 Outline 1 Affine Processes 2 Affine
More informationLecture 4: Introduction to stochastic processes and stochastic calculus
Lecture 4: Introduction to stochastic processes and stochastic calculus Cédric Archambeau Centre for Computational Statistics and Machine Learning Department of Computer Science University College London
More informationInformation Choice in Macroeconomics and Finance.
Information Choice in Macroeconomics and Finance. Laura Veldkamp New York University, Stern School of Business, CEPR and NBER Spring 2009 1 Veldkamp What information consumes is rather obvious: It consumes
More informationXt i Xs i N(0, σ 2 (t s)) and they are independent. This implies that the density function of X t X s is a product of normal density functions:
174 BROWNIAN MOTION 8.4. Brownian motion in R d and the heat equation. The heat equation is a partial differential equation. We are going to convert it into a probabilistic equation by reversing time.
More informationProving the Regularity of the Minimal Probability of Ruin via a Game of Stopping and Control
Proving the Regularity of the Minimal Probability of Ruin via a Game of Stopping and Control Erhan Bayraktar University of Michigan joint work with Virginia R. Young, University of Michigan K αρλoβασi,
More informationLecture 4. Chapter 4: Lyapunov Stability. Eugenio Schuster. Mechanical Engineering and Mechanics Lehigh University.
Lecture 4 Chapter 4: Lyapunov Stability Eugenio Schuster schuster@lehigh.edu Mechanical Engineering and Mechanics Lehigh University Lecture 4 p. 1/86 Autonomous Systems Consider the autonomous system ẋ
More informationProf. Erhan Bayraktar (University of Michigan)
September 17, 2012 KAP 414 2:15 PM- 3:15 PM Prof. (University of Michigan) Abstract: We consider a zero-sum stochastic differential controller-and-stopper game in which the state process is a controlled
More informationDeterministic Dynamic Programming
Deterministic Dynamic Programming 1 Value Function Consider the following optimal control problem in Mayer s form: V (t 0, x 0 ) = inf u U J(t 1, x(t 1 )) (1) subject to ẋ(t) = f(t, x(t), u(t)), x(t 0
More informationAn Analytic Method for Solving Uncertain Differential Equations
Journal of Uncertain Systems Vol.6, No.4, pp.244-249, 212 Online at: www.jus.org.uk An Analytic Method for Solving Uncertain Differential Equations Yuhan Liu Department of Industrial Engineering, Tsinghua
More informationStability of Stochastic Differential Equations
Lyapunov stability theory for ODEs s Stability of Stochastic Differential Equations Part 1: Introduction Department of Mathematics and Statistics University of Strathclyde Glasgow, G1 1XH December 2010
More informationUNCERTAINTY FUNCTIONAL DIFFERENTIAL EQUATIONS FOR FINANCE
Surveys in Mathematics and its Applications ISSN 1842-6298 (electronic), 1843-7265 (print) Volume 5 (2010), 275 284 UNCERTAINTY FUNCTIONAL DIFFERENTIAL EQUATIONS FOR FINANCE Iuliana Carmen Bărbăcioru Abstract.
More informationSpatial Economics and Potential Games
Outline Spatial Economics and Potential Games Daisuke Oyama Graduate School of Economics, Hitotsubashi University Hitotsubashi Game Theory Workshop 2007 Session Potential Games March 4, 2007 Potential
More informationSystemic Risk and Stochastic Games with Delay
Systemic Risk and Stochastic Games with Delay Jean-Pierre Fouque (with René Carmona, Mostafa Mousavi and Li-Hsien Sun) PDE and Probability Methods for Interactions Sophia Antipolis (France) - March 30-31,
More informationVariational approach to mean field games with density constraints
1 / 18 Variational approach to mean field games with density constraints Alpár Richárd Mészáros LMO, Université Paris-Sud (based on ongoing joint works with F. Santambrogio, P. Cardaliaguet and F. J. Silva)
More informationRobotics. Control Theory. Marc Toussaint U Stuttgart
Robotics Control Theory Topics in control theory, optimal control, HJB equation, infinite horizon case, Linear-Quadratic optimal control, Riccati equations (differential, algebraic, discrete-time), controllability,
More informationControl Theory in Physics and other Fields of Science
Michael Schulz Control Theory in Physics and other Fields of Science Concepts, Tools, and Applications With 46 Figures Sprin ger 1 Introduction 1 1.1 The Aim of Control Theory 1 1.2 Dynamic State of Classical
More informationEvolutionary Game Theory: Overview and Recent Results
Overviews: Evolutionary Game Theory: Overview and Recent Results William H. Sandholm University of Wisconsin nontechnical survey: Evolutionary Game Theory (in Encyclopedia of Complexity and System Science,
More informationNumerical Optimization Algorithms
Numerical Optimization Algorithms 1. Overview. Calculus of Variations 3. Linearized Supersonic Flow 4. Steepest Descent 5. Smoothed Steepest Descent Overview 1 Two Main Categories of Optimization Algorithms
More informationA HYPERBOLIC RELAXATION MODEL FOR PRODUCT FLOW IN COMPLEX PRODUCTION NETWORKS. Ali Unver, Christian Ringhofer and Dieter Armbruster
DISCRETE AND CONTINUOUS Website: www.aimsciences.org DYNAMICAL SYSTEMS Supplement 2009 pp. 790 799 A HYPERBOLIC RELAXATION MODEL FOR PRODUCT FLOW IN COMPLEX PRODUCTION NETWORKS Ali Unver, Christian Ringhofer
More informationOptimal Control. McGill COMP 765 Oct 3 rd, 2017
Optimal Control McGill COMP 765 Oct 3 rd, 2017 Classical Control Quiz Question 1: Can a PID controller be used to balance an inverted pendulum: A) That starts upright? B) That must be swung-up (perhaps
More informationQuantum Games. Quantum Strategies in Classical Games. Presented by Yaniv Carmeli
Quantum Games Quantum Strategies in Classical Games Presented by Yaniv Carmeli 1 Talk Outline Introduction Game Theory Why quantum games? PQ Games PQ penny flip 2x2 Games Quantum strategies 2 Game Theory
More informationCONTINUUM MODELS FOR FLOWS ON COMPLEX NETWORKS
CONTINUUM MODELS FOR FLOWS ON COMPLEX NETWORKS C. Ringhofer ringhofer@asu.edu, math.la.asu.edu/ chris OUTLINE 1 Introduction: Network Examples: Some examples of real world networks + theoretical structure.
More informationA Model of Evolutionary Dynamics with Quasiperiodic Forcing
paper presented at Society for Experimental Mechanics (SEM) IMAC XXXIII Conference on Structural Dynamics February 2-5, 205, Orlando FL A Model of Evolutionary Dynamics with Quasiperiodic Forcing Elizabeth
More informationNon-Zero-Sum Stochastic Differential Games of Controls and St
Non-Zero-Sum Stochastic Differential Games of Controls and Stoppings October 1, 2009 Based on two preprints: to a Non-Zero-Sum Stochastic Differential Game of Controls and Stoppings I. Karatzas, Q. Li,
More information6.891 Games, Decision, and Computation February 5, Lecture 2
6.891 Games, Decision, and Computation February 5, 2015 Lecture 2 Lecturer: Constantinos Daskalakis Scribe: Constantinos Daskalakis We formally define games and the solution concepts overviewed in Lecture
More informationPoisson Jumps in Credit Risk Modeling: a Partial Integro-differential Equation Formulation
Poisson Jumps in Credit Risk Modeling: a Partial Integro-differential Equation Formulation Jingyi Zhu Department of Mathematics University of Utah zhu@math.utah.edu Collaborator: Marco Avellaneda (Courant
More informationCourse 1 Solutions November 2001 Exams
Course Solutions November Exams . A For i =,, let R = event that a red ball is drawn form urn i i B = event that a blue ball is drawn from urn i. i Then if x is the number of blue balls in urn, ( R R)
More informationA quantum heat equation 5th Spring School on Evolution Equations, TU Berlin
A quantum heat equation 5th Spring School on Evolution Equations, TU Berlin Mario Bukal A. Jüngel and D. Matthes ACROSS - Centre for Advanced Cooperative Systems Faculty of Electrical Engineering and Computing
More informationStability of Feedback Solutions for Infinite Horizon Noncooperative Differential Games
Stability of Feedback Solutions for Infinite Horizon Noncooperative Differential Games Alberto Bressan ) and Khai T. Nguyen ) *) Department of Mathematics, Penn State University **) Department of Mathematics,
More informationSkorokhod Embeddings In Bounded Time
Skorokhod Embeddings In Bounded Time Stefan Ankirchner Institute for Applied Mathematics University of Bonn Endenicher Allee 6 D-535 Bonn ankirchner@hcm.uni-bonn.de Philipp Strack Bonn Graduate School
More informationComputation of Efficient Nash Equilibria for experimental economic games
International Journal of Mathematics and Soft Computing Vol.5, No.2 (2015), 197-212. ISSN Print : 2249-3328 ISSN Online: 2319-5215 Computation of Efficient Nash Equilibria for experimental economic games
More informationMean Field Games on networks
Mean Field Games on networks Claudio Marchi Università di Padova joint works with: S. Cacace (Rome) and F. Camilli (Rome) C. Marchi (Univ. of Padova) Mean Field Games on networks Roma, June 14 th, 2017
More informationQuantum Hydrodynamics models derived from the entropy principle
1 Quantum Hydrodynamics models derived from the entropy principle P. Degond (1), Ch. Ringhofer (2) (1) MIP, CNRS and Université Paul Sabatier, 118 route de Narbonne, 31062 Toulouse cedex, France degond@mip.ups-tlse.fr
More informationThe discrete-time second-best day-to-day dynamic pricing scheme
The discrete-time second-best day-to-day dynamic pricing scheme Linghui Han, David Z.W. Wang & Chengjuan Zhu 25-07-2017 School of Civil & Environmental Engineering Nanyang Technological University, Singapore
More information2.3 Calculus of variations
2.3 Calculus of variations 2.3.1 Euler-Lagrange equation The action functional S[x(t)] = L(x, ẋ, t) dt (2.3.1) which maps a curve x(t) to a number, can be expanded in a Taylor series { S[x(t) + δx(t)]
More informationA Primal-Dual Algorithm for Computing a Cost Allocation in the. Core of Economic Lot-Sizing Games
1 2 A Primal-Dual Algorithm for Computing a Cost Allocation in the Core of Economic Lot-Sizing Games 3 Mohan Gopaladesikan Nelson A. Uhan Jikai Zou 4 October 2011 5 6 7 8 9 10 11 12 Abstract We consider
More informationLecture 5: Single Step Methods
Lecture 5: Single Step Methods J.K. Ryan@tudelft.nl WI3097TU Delft Institute of Applied Mathematics Delft University of Technology 1 October 2012 () Single Step Methods 1 October 2012 1 / 44 Outline 1
More informationAlgorithmic Strategy Complexity
Algorithmic Strategy Complexity Abraham Neyman aneyman@math.huji.ac.il Hebrew University of Jerusalem Jerusalem Israel Algorithmic Strategy Complexity, Northwesten 2003 p.1/52 General Introduction The
More informationOn Stopping Times and Impulse Control with Constraint
On Stopping Times and Impulse Control with Constraint Jose Luis Menaldi Based on joint papers with M. Robin (216, 217) Department of Mathematics Wayne State University Detroit, Michigan 4822, USA (e-mail:
More informationEvolutionary Bargaining Strategies
Evolutionary Bargaining Strategies Nanlin Jin http://cswww.essex.ac.uk/csp/bargain Evolutionary Bargaining Two players alternative offering game x A =?? Player A Rubinstein 1982, 1985: Subgame perfect
More information6.254 : Game Theory with Engineering Applications Lecture 8: Supermodular and Potential Games
6.254 : Game Theory with Engineering Applications Lecture 8: Supermodular and Asu Ozdaglar MIT March 2, 2010 1 Introduction Outline Review of Supermodular Games Reading: Fudenberg and Tirole, Section 12.3.
More informationHyperbolicity of Systems Describing Value Functions in Differential Games which Model Duopoly Problems. Joanna Zwierzchowska
Decision Making in Manufacturing and Services Vol. 9 05 No. pp. 89 00 Hyperbolicity of Systems Describing Value Functions in Differential Games which Model Duopoly Problems Joanna Zwierzchowska Abstract.
More informationMean Field Competitive Binary MDPs and Structured Solutions
Mean Field Competitive Binary MDPs and Structured Solutions Minyi Huang School of Mathematics and Statistics Carleton University Ottawa, Canada MFG217, UCLA, Aug 28 Sept 1, 217 1 / 32 Outline of talk The
More informationStochastic Calculus. Kevin Sinclair. August 2, 2016
Stochastic Calculus Kevin Sinclair August, 16 1 Background Suppose we have a Brownian motion W. This is a process, and the value of W at a particular time T (which we write W T ) is a normally distributed
More informationEN Applied Optimal Control Lecture 8: Dynamic Programming October 10, 2018
EN530.603 Applied Optimal Control Lecture 8: Dynamic Programming October 0, 08 Lecturer: Marin Kobilarov Dynamic Programming (DP) is conerned with the computation of an optimal policy, i.e. an optimal
More informationLecture 1: March 7, 2018
Reinforcement Learning Spring Semester, 2017/8 Lecture 1: March 7, 2018 Lecturer: Yishay Mansour Scribe: ym DISCLAIMER: Based on Learning and Planning in Dynamical Systems by Shie Mannor c, all rights
More information