KINETIC MODELS FOR DIFFERENTIAL GAMES

Size: px
Start display at page:

Download "KINETIC MODELS FOR DIFFERENTIAL GAMES"

Transcription

1 KINETIC MODELS FOR DIFFERENTIAL GAMES D. Brinkman, C. Ringhofer math.la.asu.edu/ chris Work supported by NSF KI-NET Partially based on prev. work with P. Degond (Imperial College) M. Herty (RWTH Aachen) and J.G. Liu (Duke U.)

2 Classical kinetic models: An ensemble of mechanical particles is driven by a potential energy and undergoes binary interactions between agents or with a background. Drift: d dt ξ n = x V(ξ n, ξ), ξ ξ t x V(ξ n, ξ), ξ = (ξ 1,.., ξ N ) Random Diffusion: ξ n q n ( ξ), dp[q n ( ξ) = r] = P n (r, ξ) dr Assumptions: 1. (IID) Many particles. All particles are the same. 2. Correlations between particles are approximated. 3. The ensemble tries to minimize a common energy functional V cooperatively.

3 Kinetic density: Let f (x, t) dx = dp[ξ n (t) = x] t f (x, t) x (f x V f ) = Q[f ] x: state of the agent. V f : mean field potential for a multi agent system. Q[f ] interaction; Boltzmann : Q integral operator: Q[f ](x) = P f (x, x )ω f (x ) dx ω f (x)f (x) Einstein - Brownian motion: Q[f ] = x (D f x f ). System driven to a global minimum.

4 SOCIAL, ECONOMIC OR BIOLOGICAL APPLICATIONS: Non - cooperative agents. Each agent has an individual cost functional (energy), which it tries to optimize, given the action of the other agents. This leads to a Pareto optimum and a Nash equilibrium instead of a global optimum.

5 THE NASH EQUILIBRUM NE for pure strategies: Take two players: A and B with strategies (states) x and y and costs C A and C B. Optimal response: x opt = φ A (y) = min z {C A (z, y)}, y opt = φ B (x) = min z {C B (x, z)} Nash equilibrium: Neither player can do better, given the other s state fixed point problem φ A (φ B (x)) = x φ B (φ A (y)) = y 2. NE for mixed strategies: Two ensembles of players (or two players picking strategies from distributions f A and f B ). f A opt = φ A [f B ] = inf g {E g,f B[C A ]}, f B opt = φ B [f A ] = inf g {E f A,g[C B ]} Again: A fixed point problem (for functionals of distributions) φ A [f B [f A ]] = f A φ B [f A [f B ]] = f B

6 x opt = φ A (y) = min z {C A (z, y)}, y opt = φ B (x) = min z {C B (x, z)} φ A (φ B (x)) = x φ B (φ A (y)) = y This is not as optimal as minimizing C A + C B in cooperation (Prisoner s dilemma). General framework of Non-Cooperative, Non-Atomic, Anonymous games with a Continuum of Players (NCNAACP). References: Aumann, Mas Colell, Schmeidler, Shapiro, Shapley,

7 ISSUES: The Nash equilibrium is an equilibrium concept and gives no indication about the dynamics of the system. It assumes global knowledge of each player. Goal: Derive kinetic theories from agent (particle) based models for the dynamics of an actual game based on cost functionals and optimal responses.

8 OUTLINE Dynamics. General framework for mixed strategies and ensembles of players. Predict the future. (Optimal control Mean field games.) Learn from the past. (Decisions based on past experience Behavioral game theory.) Survival of the best strategies (Darwinism) Evolutionary game theory. 2. A simple example (Dynamics of mixed strategies for a RPS game.) 3. Application: Designing insurance policies. Game between an insurer and an ensemble of clients. Numerics. Kinetic theory. 4. Conclusions and outlook.

9 MODELING DYNAMICS I (Estimating the future) Optimal control approach: Swarms of intelligent individuals. Each agent observes the current state of the others and optimizes its cost, based on this observation. (1a.) Estimate over a finite time horizon. (Mean field games.) Given a strategy trajectory y(t) of B, A optimizes the cost of the trajectory t+τ t C A (x(s), y(s)) ds over a finite horizon τ. Job: Find x(t) = x opt (t) such that t+τ t the constraint d dty = V(x, y). C(x(s), y(t)) ds min under Using adjoint calculus, this yields a two point BVP in time, namely a forward problem for y(t) (to satisfy the constraint), coupled to a backward in time problem for a Lagrange multiplier λ(t). Hamilton - Jacobi - Bellmann equtns. Mean-field games (Lasry,Lions)

10 (1b.) Intelligent Swarms. A optimizes the current cost C A (x(t), y(t)) and moves incrementally towards the Pareto optimum. Yields a local in time control and an equation of the type t f A (x, t) = ω(φ A [f B ] f A ). (1c.) Instantaneous control. A proceeds as in (1a) and (1b), but uses only a Taylor series expansion of t+τ t C A (x(s), y(t)) ds for τ 0 and a Taylor series solution of d dty = V(x, y). local in time control. (Degond, Herty, Liu,CR).

11 MODELING DYNAMICS II (Learning) 16 Behavioral game theory; Machine learning General setup: Two ensembles A and B, of distributed players. A uses a (distributed) strategy x for each game. B uses a strategy y. A records outcomes via an observable a, trying to learn the behavior of B. In the same way, B uses an observable b, trying to learn about A.

12 A MULTI AGENT MODEL 2 ensembles A and B with N and M players, each with a strategy x (for A) and y (for B), and an observable a (for A) and b (for B). Step 1: A picks the optimal strategy based on the experience a. x(t + t) = ξ(a(t)), C A (ξ(a), a) = min x C A (x, a) Step 2: They actually play. A picks a random member of B, giving costs C A (x, y) and C B (y, x) for A and B. Step 3: A and B update their observables a and b, based on the outcome. Symmetric picture for B, y, b. a(t + t) = a(t) +???

13 OBSERVABLE UPDATES (a and b) 19 Discount past experiences over time. Behavioral game theory learning and forgetting (Camerer et al) 1) Rolling averages: finite memory 2) Time weighted averages: Discount past experience with a time dependent weight. a(t) = t 0 w(t, τ)c A (x(τ), y(τ)) dτ, t 0 w(t, τ) dτ = 1 t Three types of weights usable in a kinetic approch: Exponential: w(t, τ) e γ(τ t) Geometric: w(t, τ) ( τ t )γ Constant: w(t, τ) 1

14 THE IID MODEL Independent identically distributed agents in ensembles A and B with N >> 1 and M >> 1 members. F(x 1,..x N, a 1,..a N, y 1,..y M, b 1,.., b M, t) = N n=1 f A (x n, a n ) M m=1 f B (y m, b m ) Yields a system of two coupled kinetic equations for f A (x, a, t) and f B (y, b, t). t f A (x, a) = Q A strat[f A ] + Q A obs[f A, f B ] Q A strat[f A ] = ω[δ(x φ A (a)) f A (x, a) dx f A ] +λ Q A obs[f A, f B ] = a (w(t, t)af A ) δ(a C A (x, y )f A (x, a )f B (y, b ) dy a b λf A (x, a)

15 GENERAL KINETIC MODEL FOR TWO PLAYER ENSEMBLES 22 t f A (x, a) = Q A strat[f A ] + Q A obs [f A, f B ], t f B (y, b) = Q B strat[f B ] + Q B obs [f B, f A ] Q A,B strat are simple linear relaxation operators, giving the strategies, based on the observables. Q A,B obs are nonlinear operators, coupling the observables based on the actual outcome of the game. Three time scales: ω: frequency of the games. λ: frequency of observable updates (learning). w(t, t): discounting past experiences (forgetting).

16 DYNAMIC CONVERGENCE TOWARDS A NASH EQUILIBRIUM 24 Theorem Assumption: If learning is successful, i.e. the estimated cost converges against the real cost, and lim t lim t f A (x, a) dx = f B (y, b, t) dy = C A (x, a) dx, C B (y, b) dy then the system converges against the Nash equilibrium for mixed strategies, when averaged over time. 1 lim t t 1 lim t t t 0 t 0 dτ dτ da f A (x, a, τ) = f A Nash(x) db f B (y, b, τ) = f B Nash(y)

17 A SIMPLE EXAMPLE 26 (Rock - Paper - Scissors) Cost to A: A B R P S C A (x, y) = R P S Remark 1: x, y discrete {R = 1, P = 2, S = 3} +λ t f A (x, a) = Q A strat[f A ] + Q A obs[f A, f B ] Q A strat[f A ] = ω[δ(x φ A (a)) f A (x, a) dx f A ] Q A obs[f A, f B ] = a (w(t, t)af A ) C A (x, y )f A (x, a )f B (y, b ) dx a y b λc A (x, y)f A (x, a)

18 Remark 2: RPS has no Nash equilibrium for pure strategies and a trivial NE for mixed strategies P[R, R, S] = ( 1 3, 1 3, 1 3 ). Remark 3: For successful learning we have to try (randomly) new strategies. Example: B plays R all the time and A tries to learn about B. Need to add Q A try: t f A (x, a) = Q A strat[f A ] + Q A try[f A ] + Q A obs [f A, f B ] with Relaxation: Q A try[f A ](x, a) = α[p(x) f A (x, a) dx f A (x, a)] or Brownian Motion: Q A try[f A ](x, a) = α x f A

19 EXAMPLE RPS 1 B plays RPS with f(r)=0.5,f(p)=0.25,f(s)=0.25. A learns. Strategies (left panel) and estimated profits (right panel)for A (R=blue, P=green, S=red) strategies for A strategy profits for A # of games # of games,

20 EXAMPLE RPS 2 A and B both learn convergence to the Nash equilibrium. strategies and profits for A. Strategies (left panel) and estimated profits (right panel)for A (R=blue, P=green, S=red) strategies for A strategy profits for A # of games # of games,

21 A and B both learn convergence to the Nash equilibrium. strategies and profits for B. Strategies (left panel) and estimated profits (right panel)for A (R=blue, P=green, S=red) strategies for B strategy profits for B # of games # of games,

22 Corollary: Steady state given by modified NE 1 lim t t 1 lim t t t dτ 0 t 0 dτ da f A (x, a, τ) = f A Nash(x)+p(x) db f B (y, b, τ) = f B Nash(y)+p(y)

23 OPTIMAL DESIGN OF INSURANCE POLICIES Motivation: 1995: a major US university went from a uniform health insurance plan for all employees to a staggered system with 3 plans with different rates and benefits. Outcome: All plans, except for the cheapest, died out very quickly.

24 OPTIMAL DESIGN OF INSURANCE POLICIES 32 A: ensemble of clients with N >> 1 members. B: one insurer M = 1, offers K insurance plans with annual rates r k, k = 1 : K and cutoffs y k, k = 1 : K. There is a zero benefit plan k = 0 with r 0 = 0 and y 0 = (no insurance). Strategies: Client strategy (A, discrete): x = k {0,.., K}, the plan it chooses (including k = 0 no insurance). Insurer strategy (B): Annual rates r 1,.., r K ; cutoff values y 1,.., y K for accepting a member of A with risk a into plan k = 1 : K. Observable (a, risk, average claims): Poisson process Modification: Claims, independent of x and y!!) A and B share the same observable a.

25 COSTS AND OPTIMAL RESPONSE Optimal response for a client in A: The cost to a member of A, trying to choose plan k = 0 : K is ( ) C A r(k) for k = 1 : K and a < y(k) (eligible) = a for k = 0 (no insurance) If choosing a plan k = 1 : K, the optimal choice is the cheapest for which A is eligible: ν(y, a) = min{k : k = 1 : K, a k < y k } A compares this to the expected out of pocket cost a: ν(y, a) if r ν(y,a) < a (choosing a plan) k opt (y, r, a) = 0 if a < r ν(y,a) (opting out) 0 if a > y K (not eligible) k opt (y, r, a) = H(a r ν(y,a) )H(y K a)ν(y, a)

26 COSTS AND OPTIMAL RESPONSE Insurer B: Insurer cost: Given an ensemble A with distributed risk a choosing plan k C B = K yk k=1 0 (αa r k)f A (k, a) da α < 1: discount factor; y k : risk cutoff in plan k. Optimal response of insurer B : (Has to include a rate strategy!) y opt k = r k af α, r k = β A (k,a) da f A (k,a) da, β > 1 β > 1: greed factor, guarantees that C B 0. C B = α K yk k=1 0 (a βy k )f A (k, a) da 0

27 NUMERICAL RESULTS FOR A SIMPLE MULTI - AGENT MODEL insured (A) 1 insurer (B),. K = 3 insurance plans (+ plan zero = no insurance). 2 risk groups Results extremely sensitive on the greed factor β > 1 which guarantees a profit for the insurer (as long as the plan is not empty).

28 Greed factor β 5 3 : desired profit = 2 3 of the expected risk. Outcome: All but the cheapest plan and plan zero die out.

29 Greed factor β = 4 3 : desired profit = 1 3 of the expected risk. Outcome: All three plans survive.

30 KINETIC INSURANCE MODEL 43 Simplification: The observable claims a of the client ensemble A are governed by a Poisson process, independent of the strategies. This allows for a closure in terms of ρ A k (t) = f A (k, a, t) da and u A (a, t) = k f A (k, a, t) t u A (a, t) = a (w(t, t)au A ) + λa(a) λu A (a, t) t fk A (a, t) = ω[ φ A (k, a, f B )u A (a) da fk A ], t f B k (y, t) = Q B strat[f B, u A, ρ A k ]

31 COMPUTATIONAL COMPLEXITY K + 1 transport equations for the strategies of A. K transport equations for the strategies of B. 1 transport equation for the risk. Three time scales: ω: game frequency, λ: updates of the risk, (claim frequency). w(t, t): discount factor (forgetting about old claims)

32 CONCLUSIONS Content: 1. General kinetic framework for two player non - cooperative games with mixed strategies. 2. Include dynamics via a behavioral game theory approach. 3. Leads to a system of transport equations for the probability densities of strategies and observables involving multiple time scales. 4. Applications to the design and evolution of insurance policies. Outlook: Further analysis of the kinetic model. Time scale separation, moment closures, Chapman - Enskog type expansions. Further investigation in rate strategies.

33 REFERENCES Available at: math.la.asu.edu/ chris Mean Field Games, Jean-Michel LASRY, Pierre-Louis LIONS, Japan. J. Math 2, (2007) DOI: /s Evolution of the distribution of wealth in a nonconservative economic environment driven by local Nash equilibria Pierre Degond, Jian-Guo Liu and Christian Ringhofer Proceedings of the Royal Society A (to be published 2015) Evolution of the Distribution of Wealth in an Economic Environment Driven by Local Nash Equilibria Pierre Degond, Jian-Guo Liu, Christian Ringhofer, Journal of Statistical Physics, published online (2013) Large-Scale Dynamics of Mean-Field Games Driven by Local Nash Equilibria Pierre Degond, Jian-Guo Liu, Christian Ringhofer, Journal of Nonlinear Science, published online (2013)

On the interplay between kinetic theory and game theory

On the interplay between kinetic theory and game theory 1 On the interplay between kinetic theory and game theory Pierre Degond Department of mathematics, Imperial College London pdegond@imperial.ac.uk http://sites.google.com/site/degond/ Joint work with J.

More information

Game-Theoretic Learning:

Game-Theoretic Learning: Game-Theoretic Learning: Regret Minimization vs. Utility Maximization Amy Greenwald with David Gondek, Amir Jafari, and Casey Marks Brown University University of Pennsylvania November 17, 2004 Background

More information

Different types of phase transitions for a simple model of alignment of oriented particles

Different types of phase transitions for a simple model of alignment of oriented particles Different types of phase transitions for a simple model of alignment of oriented particles Amic Frouvelle Université Paris Dauphine Joint work with Jian-Guo Liu (Duke University, USA) and Pierre Degond

More information

HJB equations. Seminar in Stochastic Modelling in Economics and Finance January 10, 2011

HJB equations. Seminar in Stochastic Modelling in Economics and Finance January 10, 2011 Department of Probability and Mathematical Statistics Faculty of Mathematics and Physics, Charles University in Prague petrasek@karlin.mff.cuni.cz Seminar in Stochastic Modelling in Economics and Finance

More information

ON MEAN FIELD GAMES. Pierre-Louis LIONS. Collège de France, Paris. (joint project with Jean-Michel LASRY)

ON MEAN FIELD GAMES. Pierre-Louis LIONS. Collège de France, Paris. (joint project with Jean-Michel LASRY) Collège de France, Paris (joint project with Jean-Michel LASRY) I INTRODUCTION II A REALLY SIMPLE EXAMPLE III GENERAL STRUCTURE IV TWO PARTICULAR CASES V OVERVIEW AND PERSPECTIVES I. INTRODUCTION New class

More information

Online Learning Class 12, 20 March 2006 Andrea Caponnetto, Sanmay Das

Online Learning Class 12, 20 March 2006 Andrea Caponnetto, Sanmay Das Online Learning 9.520 Class 12, 20 March 2006 Andrea Caponnetto, Sanmay Das About this class Goal To introduce the general setting of online learning. To describe an online version of the RLS algorithm

More information

Mean-Field optimization problems and non-anticipative optimal transport. Beatrice Acciaio

Mean-Field optimization problems and non-anticipative optimal transport. Beatrice Acciaio Mean-Field optimization problems and non-anticipative optimal transport Beatrice Acciaio London School of Economics based on ongoing projects with J. Backhoff, R. Carmona and P. Wang Robust Methods in

More information

Knowing What Others Know: Coordination Motives in Information Acquisition

Knowing What Others Know: Coordination Motives in Information Acquisition Knowing What Others Know: Coordination Motives in Information Acquisition Christian Hellwig and Laura Veldkamp UCLA and NYU Stern May 2006 1 Hellwig and Veldkamp Two types of information acquisition Passive

More information

SUMMER SCHOOL - KINETIC EQUATIONS LECTURE IV: HOMOGENIZATION OF KINETIC FLOWS IN NETWORKS. Shanghai, 2011.

SUMMER SCHOOL - KINETIC EQUATIONS LECTURE IV: HOMOGENIZATION OF KINETIC FLOWS IN NETWORKS. Shanghai, 2011. SUMMER SCHOOL - KINETIC EQUATIONS LECTURE IV: HOMOGENIZATION OF KINETIC FLOWS IN NETWORKS. Shanghai, 2011. C. Ringhofer ringhofer@asu.edu, math.la.asu.edu/ chris OVERVIEW Quantum and classical description

More information

Game Theory, Evolutionary Dynamics, and Multi-Agent Learning. Prof. Nicola Gatti

Game Theory, Evolutionary Dynamics, and Multi-Agent Learning. Prof. Nicola Gatti Game Theory, Evolutionary Dynamics, and Multi-Agent Learning Prof. Nicola Gatti (nicola.gatti@polimi.it) Game theory Game theory: basics Normal form Players Actions Outcomes Utilities Strategies Solutions

More information

A (Brief) Introduction to Game Theory

A (Brief) Introduction to Game Theory A (Brief) Introduction to Game Theory Johanne Cohen PRiSM/CNRS, Versailles, France. Goal Goal is a Nash equilibrium. Today The game of Chicken Definitions Nash Equilibrium Rock-paper-scissors Game Mixed

More information

Multi-dimensional Stochastic Singular Control Via Dynkin Game and Dirichlet Form

Multi-dimensional Stochastic Singular Control Via Dynkin Game and Dirichlet Form Multi-dimensional Stochastic Singular Control Via Dynkin Game and Dirichlet Form Yipeng Yang * Under the supervision of Dr. Michael Taksar Department of Mathematics University of Missouri-Columbia Oct

More information

A model of alignment interaction for oriented particles with phase transition

A model of alignment interaction for oriented particles with phase transition A model of alignment interaction for oriented particles with phase transition Amic Frouvelle ACMAC Joint work with Jian-Guo Liu (Duke University, USA) and Pierre Degond (Institut de Mathématiques de Toulouse,

More information

SUMMER SCHOOL - KINETIC EQUATIONS LECTURE II: CONFINED CLASSICAL TRANSPORT Shanghai, 2011.

SUMMER SCHOOL - KINETIC EQUATIONS LECTURE II: CONFINED CLASSICAL TRANSPORT Shanghai, 2011. SUMMER SCHOOL - KINETIC EQUATIONS LECTURE II: CONFINED CLASSICAL TRANSPORT Shanghai, 2011. C. Ringhofer ringhofer@asu.edu, math.la.asu.edu/ chris OVERVIEW Quantum and classical description of many particle

More information

Stochastic Volatility and Correction to the Heat Equation

Stochastic Volatility and Correction to the Heat Equation Stochastic Volatility and Correction to the Heat Equation Jean-Pierre Fouque, George Papanicolaou and Ronnie Sircar Abstract. From a probabilist s point of view the Twentieth Century has been a century

More information

Hyperbolic Models for Large Supply Chains. Christian Ringhofer (Arizona State University) Hyperbolic Models for Large Supply Chains p.

Hyperbolic Models for Large Supply Chains. Christian Ringhofer (Arizona State University) Hyperbolic Models for Large Supply Chains p. Hyperbolic Models for Large Supply Chains Christian Ringhofer (Arizona State University) Hyperbolic Models for Large Supply Chains p. /4 Introduction Topic: Overview of conservation law (traffic - like)

More information

Introduction to multiscale modeling and simulation. Explicit methods for ODEs : forward Euler. y n+1 = y n + tf(y n ) dy dt = f(y), y(0) = y 0

Introduction to multiscale modeling and simulation. Explicit methods for ODEs : forward Euler. y n+1 = y n + tf(y n ) dy dt = f(y), y(0) = y 0 Introduction to multiscale modeling and simulation Lecture 5 Numerical methods for ODEs, SDEs and PDEs The need for multiscale methods Two generic frameworks for multiscale computation Explicit methods

More information

Belief-based Learning

Belief-based Learning Belief-based Learning Algorithmic Game Theory Marcello Restelli Lecture Outline Introdutcion to multi-agent learning Belief-based learning Cournot adjustment Fictitious play Bayesian learning Equilibrium

More information

Evolution & Learning in Games

Evolution & Learning in Games 1 / 28 Evolution & Learning in Games Econ 243B Jean-Paul Carvalho Lecture 5. Revision Protocols and Evolutionary Dynamics 2 / 28 Population Games played by Boundedly Rational Agents We have introduced

More information

Some Aspects of First Principle Models for Production Systems and Supply Chains. Christian Ringhofer (Arizona State University)

Some Aspects of First Principle Models for Production Systems and Supply Chains. Christian Ringhofer (Arizona State University) Some Aspects of First Principle Models for Production Systems and Supply Chains Christian Ringhofer (Arizona State University) Some Aspects of First Principle Models for Production Systems and Supply Chains

More information

A model of alignment interaction for oriented particles with phase transition

A model of alignment interaction for oriented particles with phase transition A model of alignment interaction for oriented particles with phase transition Amic Frouvelle Archimedes Center for Modeling, Analysis & Computation (ACMAC) University of Crete, Heraklion, Crete, Greece

More information

Bounded Rationality, Strategy Simplification, and Equilibrium

Bounded Rationality, Strategy Simplification, and Equilibrium Bounded Rationality, Strategy Simplification, and Equilibrium UPV/EHU & Ikerbasque Donostia, Spain BCAM Workshop on Interactions, September 2014 Bounded Rationality Frequently raised criticism of game

More information

A model of alignment interaction for oriented particles with phase transition

A model of alignment interaction for oriented particles with phase transition A model of alignment interaction for oriented particles with phase transition Amic Frouvelle Institut de Mathématiques de Toulouse Joint work with Jian-Guo Liu (Duke Univ.) and Pierre Degond (IMT) Amic

More information

Survival of Dominated Strategies under Evolutionary Dynamics. Josef Hofbauer University of Vienna. William H. Sandholm University of Wisconsin

Survival of Dominated Strategies under Evolutionary Dynamics. Josef Hofbauer University of Vienna. William H. Sandholm University of Wisconsin Survival of Dominated Strategies under Evolutionary Dynamics Josef Hofbauer University of Vienna William H. Sandholm University of Wisconsin a hypnodisk 1 Survival of Dominated Strategies under Evolutionary

More information

Evolution & Learning in Games

Evolution & Learning in Games 1 / 27 Evolution & Learning in Games Econ 243B Jean-Paul Carvalho Lecture 2. Foundations of Evolution & Learning in Games II 2 / 27 Outline In this lecture, we shall: Take a first look at local stability.

More information

An Introduction to Noncooperative Games

An Introduction to Noncooperative Games An Introduction to Noncooperative Games Alberto Bressan Department of Mathematics, Penn State University Alberto Bressan (Penn State) Noncooperative Games 1 / 29 introduction to non-cooperative games:

More information

Dynamic Games with Asymmetric Information: Common Information Based Perfect Bayesian Equilibria and Sequential Decomposition

Dynamic Games with Asymmetric Information: Common Information Based Perfect Bayesian Equilibria and Sequential Decomposition Dynamic Games with Asymmetric Information: Common Information Based Perfect Bayesian Equilibria and Sequential Decomposition 1 arxiv:1510.07001v1 [cs.gt] 23 Oct 2015 Yi Ouyang, Hamidreza Tavafoghi and

More information

Introduction to Continuous-Time Dynamic Optimization: Optimal Control Theory

Introduction to Continuous-Time Dynamic Optimization: Optimal Control Theory Econ 85/Chatterjee Introduction to Continuous-ime Dynamic Optimization: Optimal Control heory 1 States and Controls he concept of a state in mathematical modeling typically refers to a specification of

More information

CHAPTER 2 THE MAXIMUM PRINCIPLE: CONTINUOUS TIME. Chapter2 p. 1/67

CHAPTER 2 THE MAXIMUM PRINCIPLE: CONTINUOUS TIME. Chapter2 p. 1/67 CHAPTER 2 THE MAXIMUM PRINCIPLE: CONTINUOUS TIME Chapter2 p. 1/67 THE MAXIMUM PRINCIPLE: CONTINUOUS TIME Main Purpose: Introduce the maximum principle as a necessary condition to be satisfied by any optimal

More information

Satisfaction Equilibrium: Achieving Cooperation in Incomplete Information Games

Satisfaction Equilibrium: Achieving Cooperation in Incomplete Information Games Satisfaction Equilibrium: Achieving Cooperation in Incomplete Information Games Stéphane Ross and Brahim Chaib-draa Department of Computer Science and Software Engineering Laval University, Québec (Qc),

More information

Ergodicity and Non-Ergodicity in Economics

Ergodicity and Non-Ergodicity in Economics Abstract An stochastic system is called ergodic if it tends in probability to a limiting form that is independent of the initial conditions. Breakdown of ergodicity gives rise to path dependence. We illustrate

More information

University Of Calgary Department of Mathematics and Statistics

University Of Calgary Department of Mathematics and Statistics University Of Calgary Department of Mathematics and Statistics Hawkes Seminar May 23, 2018 1 / 46 Some Problems in Insurance and Reinsurance Mohamed Badaoui Department of Electrical Engineering National

More information

PROBABILITY: LIMIT THEOREMS II, SPRING HOMEWORK PROBLEMS

PROBABILITY: LIMIT THEOREMS II, SPRING HOMEWORK PROBLEMS PROBABILITY: LIMIT THEOREMS II, SPRING 15. HOMEWORK PROBLEMS PROF. YURI BAKHTIN Instructions. You are allowed to work on solutions in groups, but you are required to write up solutions on your own. Please

More information

SUMMER SCHOOL - KINETIC EQUATIONS LECTURE I: TRANSPORT PICTURES AND DISSIPATION Shanghai, 2011.

SUMMER SCHOOL - KINETIC EQUATIONS LECTURE I: TRANSPORT PICTURES AND DISSIPATION Shanghai, 2011. SUMMER SCHOOL - KINETIC EQUATIONS LECTURE I: TRANSPORT PICTURES AND DISSIPATION Shanghai, 2011. C. Ringhofer ringhofer@asu.edu, math.la.asu.edu/ chris OVERVIEW Quantum and classical description of many

More information

Productivity Losses from Financial Frictions: Can Self-financing Undo Capital Misallocation?

Productivity Losses from Financial Frictions: Can Self-financing Undo Capital Misallocation? Productivity Losses from Financial Frictions: Can Self-financing Undo Capital Misallocation? Benjamin Moll G Online Appendix: The Model in Discrete Time and with iid Shocks This Appendix presents a version

More information

Introduction to Reachability Somil Bansal Hybrid Systems Lab, UC Berkeley

Introduction to Reachability Somil Bansal Hybrid Systems Lab, UC Berkeley Introduction to Reachability Somil Bansal Hybrid Systems Lab, UC Berkeley Outline Introduction to optimal control Reachability as an optimal control problem Various shades of reachability Goal of This

More information

Weak solutions of mean-field stochastic differential equations

Weak solutions of mean-field stochastic differential equations Weak solutions of mean-field stochastic differential equations Juan Li School of Mathematics and Statistics, Shandong University (Weihai), Weihai 26429, China. Email: juanli@sdu.edu.cn Based on joint works

More information

A scalar conservation law with discontinuous flux for supply chains with finite buffers.

A scalar conservation law with discontinuous flux for supply chains with finite buffers. A scalar conservation law with discontinuous flux for supply chains with finite buffers. Dieter Armbruster School of Mathematical and Statistical Sciences, Arizona State University & Department of Mechanical

More information

Gains in evolutionary dynamics. Dai ZUSAI

Gains in evolutionary dynamics. Dai ZUSAI Gains in evolutionary dynamics unifying rational framework for dynamic stability Dai ZUSAI Hitotsubashi University (Econ Dept.) Economic Theory Workshop October 27, 2016 1 Introduction Motivation and literature

More information

Introduction to game theory LECTURE 1

Introduction to game theory LECTURE 1 Introduction to game theory LECTURE 1 Jörgen Weibull January 27, 2010 1 What is game theory? A mathematically formalized theory of strategic interaction between countries at war and peace, in federations

More information

ELEMENTS OF PROBABILITY THEORY

ELEMENTS OF PROBABILITY THEORY ELEMENTS OF PROBABILITY THEORY Elements of Probability Theory A collection of subsets of a set Ω is called a σ algebra if it contains Ω and is closed under the operations of taking complements and countable

More information

Different types of phase transitions for a simple model of alignment of oriented particles

Different types of phase transitions for a simple model of alignment of oriented particles Different types of phase transitions for a simple model of alignment of oriented particles Amic Frouvelle CEREMADE Université Paris Dauphine Joint work with Jian-Guo Liu (Duke University, USA) and Pierre

More information

Reflected Brownian Motion

Reflected Brownian Motion Chapter 6 Reflected Brownian Motion Often we encounter Diffusions in regions with boundary. If the process can reach the boundary from the interior in finite time with positive probability we need to decide

More information

The Mathematics of Continuous Time Contract Theory

The Mathematics of Continuous Time Contract Theory The Mathematics of Continuous Time Contract Theory Ecole Polytechnique, France University of Michigan, April 3, 2018 Outline Introduction to moral hazard 1 Introduction to moral hazard 2 3 General formulation

More information

Population Games and Evolutionary Dynamics

Population Games and Evolutionary Dynamics Population Games and Evolutionary Dynamics (MIT Press, 200x; draft posted on my website) 1. Population games 2. Revision protocols and evolutionary dynamics 3. Potential games and their applications 4.

More information

Lecture Notes 6: Dynamic Equations Part A: First-Order Difference Equations in One Variable

Lecture Notes 6: Dynamic Equations Part A: First-Order Difference Equations in One Variable University of Warwick, EC9A0 Maths for Economists Peter J. Hammond 1 of 54 Lecture Notes 6: Dynamic Equations Part A: First-Order Difference Equations in One Variable Peter J. Hammond latest revision 2017

More information

EVOLUTIONARY GAMES WITH GROUP SELECTION

EVOLUTIONARY GAMES WITH GROUP SELECTION EVOLUTIONARY GAMES WITH GROUP SELECTION Martin Kaae Jensen Alexandros Rigos Department of Economics University of Leicester Controversies in Game Theory: Homo Oeconomicus vs. Homo Socialis ETH Zurich 12/09/2014

More information

PROBABILITY: LIMIT THEOREMS II, SPRING HOMEWORK PROBLEMS

PROBABILITY: LIMIT THEOREMS II, SPRING HOMEWORK PROBLEMS PROBABILITY: LIMIT THEOREMS II, SPRING 218. HOMEWORK PROBLEMS PROF. YURI BAKHTIN Instructions. You are allowed to work on solutions in groups, but you are required to write up solutions on your own. Please

More information

Lecture 3: Hamilton-Jacobi-Bellman Equations. Distributional Macroeconomics. Benjamin Moll. Part II of ECON Harvard University, Spring

Lecture 3: Hamilton-Jacobi-Bellman Equations. Distributional Macroeconomics. Benjamin Moll. Part II of ECON Harvard University, Spring Lecture 3: Hamilton-Jacobi-Bellman Equations Distributional Macroeconomics Part II of ECON 2149 Benjamin Moll Harvard University, Spring 2018 1 Outline 1. Hamilton-Jacobi-Bellman equations in deterministic

More information

1 Introduction. 2 Diffusion equation and central limit theorem. The content of these notes is also covered by chapter 3 section B of [1].

1 Introduction. 2 Diffusion equation and central limit theorem. The content of these notes is also covered by chapter 3 section B of [1]. 1 Introduction The content of these notes is also covered by chapter 3 section B of [1]. Diffusion equation and central limit theorem Consider a sequence {ξ i } i=1 i.i.d. ξ i = d ξ with ξ : Ω { Dx, 0,

More information

Other-Regarding Preferences: Theory and Evidence

Other-Regarding Preferences: Theory and Evidence Other-Regarding Preferences: Theory and Evidence June 9, 2009 GENERAL OUTLINE Economic Rationality is Individual Optimization and Group Equilibrium Narrow version: Restrictive Assumptions about Objective

More information

Distributed Learning based on Entropy-Driven Game Dynamics

Distributed Learning based on Entropy-Driven Game Dynamics Distributed Learning based on Entropy-Driven Game Dynamics Bruno Gaujal joint work with Pierre Coucheney and Panayotis Mertikopoulos Inria Aug., 2014 Model Shared resource systems (network, processors)

More information

Linearization of Differential Equation Models

Linearization of Differential Equation Models Linearization of Differential Equation Models 1 Motivation We cannot solve most nonlinear models, so we often instead try to get an overall feel for the way the model behaves: we sometimes talk about looking

More information

Functional Limit theorems for the quadratic variation of a continuous time random walk and for certain stochastic integrals

Functional Limit theorems for the quadratic variation of a continuous time random walk and for certain stochastic integrals Functional Limit theorems for the quadratic variation of a continuous time random walk and for certain stochastic integrals Noèlia Viles Cuadros BCAM- Basque Center of Applied Mathematics with Prof. Enrico

More information

Affine Processes. Econometric specifications. Eduardo Rossi. University of Pavia. March 17, 2009

Affine Processes. Econometric specifications. Eduardo Rossi. University of Pavia. March 17, 2009 Affine Processes Econometric specifications Eduardo Rossi University of Pavia March 17, 2009 Eduardo Rossi (University of Pavia) Affine Processes March 17, 2009 1 / 40 Outline 1 Affine Processes 2 Affine

More information

Lecture 4: Introduction to stochastic processes and stochastic calculus

Lecture 4: Introduction to stochastic processes and stochastic calculus Lecture 4: Introduction to stochastic processes and stochastic calculus Cédric Archambeau Centre for Computational Statistics and Machine Learning Department of Computer Science University College London

More information

Information Choice in Macroeconomics and Finance.

Information Choice in Macroeconomics and Finance. Information Choice in Macroeconomics and Finance. Laura Veldkamp New York University, Stern School of Business, CEPR and NBER Spring 2009 1 Veldkamp What information consumes is rather obvious: It consumes

More information

Xt i Xs i N(0, σ 2 (t s)) and they are independent. This implies that the density function of X t X s is a product of normal density functions:

Xt i Xs i N(0, σ 2 (t s)) and they are independent. This implies that the density function of X t X s is a product of normal density functions: 174 BROWNIAN MOTION 8.4. Brownian motion in R d and the heat equation. The heat equation is a partial differential equation. We are going to convert it into a probabilistic equation by reversing time.

More information

Proving the Regularity of the Minimal Probability of Ruin via a Game of Stopping and Control

Proving the Regularity of the Minimal Probability of Ruin via a Game of Stopping and Control Proving the Regularity of the Minimal Probability of Ruin via a Game of Stopping and Control Erhan Bayraktar University of Michigan joint work with Virginia R. Young, University of Michigan K αρλoβασi,

More information

Lecture 4. Chapter 4: Lyapunov Stability. Eugenio Schuster. Mechanical Engineering and Mechanics Lehigh University.

Lecture 4. Chapter 4: Lyapunov Stability. Eugenio Schuster. Mechanical Engineering and Mechanics Lehigh University. Lecture 4 Chapter 4: Lyapunov Stability Eugenio Schuster schuster@lehigh.edu Mechanical Engineering and Mechanics Lehigh University Lecture 4 p. 1/86 Autonomous Systems Consider the autonomous system ẋ

More information

Prof. Erhan Bayraktar (University of Michigan)

Prof. Erhan Bayraktar (University of Michigan) September 17, 2012 KAP 414 2:15 PM- 3:15 PM Prof. (University of Michigan) Abstract: We consider a zero-sum stochastic differential controller-and-stopper game in which the state process is a controlled

More information

Deterministic Dynamic Programming

Deterministic Dynamic Programming Deterministic Dynamic Programming 1 Value Function Consider the following optimal control problem in Mayer s form: V (t 0, x 0 ) = inf u U J(t 1, x(t 1 )) (1) subject to ẋ(t) = f(t, x(t), u(t)), x(t 0

More information

An Analytic Method for Solving Uncertain Differential Equations

An Analytic Method for Solving Uncertain Differential Equations Journal of Uncertain Systems Vol.6, No.4, pp.244-249, 212 Online at: www.jus.org.uk An Analytic Method for Solving Uncertain Differential Equations Yuhan Liu Department of Industrial Engineering, Tsinghua

More information

Stability of Stochastic Differential Equations

Stability of Stochastic Differential Equations Lyapunov stability theory for ODEs s Stability of Stochastic Differential Equations Part 1: Introduction Department of Mathematics and Statistics University of Strathclyde Glasgow, G1 1XH December 2010

More information

UNCERTAINTY FUNCTIONAL DIFFERENTIAL EQUATIONS FOR FINANCE

UNCERTAINTY FUNCTIONAL DIFFERENTIAL EQUATIONS FOR FINANCE Surveys in Mathematics and its Applications ISSN 1842-6298 (electronic), 1843-7265 (print) Volume 5 (2010), 275 284 UNCERTAINTY FUNCTIONAL DIFFERENTIAL EQUATIONS FOR FINANCE Iuliana Carmen Bărbăcioru Abstract.

More information

Spatial Economics and Potential Games

Spatial Economics and Potential Games Outline Spatial Economics and Potential Games Daisuke Oyama Graduate School of Economics, Hitotsubashi University Hitotsubashi Game Theory Workshop 2007 Session Potential Games March 4, 2007 Potential

More information

Systemic Risk and Stochastic Games with Delay

Systemic Risk and Stochastic Games with Delay Systemic Risk and Stochastic Games with Delay Jean-Pierre Fouque (with René Carmona, Mostafa Mousavi and Li-Hsien Sun) PDE and Probability Methods for Interactions Sophia Antipolis (France) - March 30-31,

More information

Variational approach to mean field games with density constraints

Variational approach to mean field games with density constraints 1 / 18 Variational approach to mean field games with density constraints Alpár Richárd Mészáros LMO, Université Paris-Sud (based on ongoing joint works with F. Santambrogio, P. Cardaliaguet and F. J. Silva)

More information

Robotics. Control Theory. Marc Toussaint U Stuttgart

Robotics. Control Theory. Marc Toussaint U Stuttgart Robotics Control Theory Topics in control theory, optimal control, HJB equation, infinite horizon case, Linear-Quadratic optimal control, Riccati equations (differential, algebraic, discrete-time), controllability,

More information

Control Theory in Physics and other Fields of Science

Control Theory in Physics and other Fields of Science Michael Schulz Control Theory in Physics and other Fields of Science Concepts, Tools, and Applications With 46 Figures Sprin ger 1 Introduction 1 1.1 The Aim of Control Theory 1 1.2 Dynamic State of Classical

More information

Evolutionary Game Theory: Overview and Recent Results

Evolutionary Game Theory: Overview and Recent Results Overviews: Evolutionary Game Theory: Overview and Recent Results William H. Sandholm University of Wisconsin nontechnical survey: Evolutionary Game Theory (in Encyclopedia of Complexity and System Science,

More information

Numerical Optimization Algorithms

Numerical Optimization Algorithms Numerical Optimization Algorithms 1. Overview. Calculus of Variations 3. Linearized Supersonic Flow 4. Steepest Descent 5. Smoothed Steepest Descent Overview 1 Two Main Categories of Optimization Algorithms

More information

A HYPERBOLIC RELAXATION MODEL FOR PRODUCT FLOW IN COMPLEX PRODUCTION NETWORKS. Ali Unver, Christian Ringhofer and Dieter Armbruster

A HYPERBOLIC RELAXATION MODEL FOR PRODUCT FLOW IN COMPLEX PRODUCTION NETWORKS. Ali Unver, Christian Ringhofer and Dieter Armbruster DISCRETE AND CONTINUOUS Website: www.aimsciences.org DYNAMICAL SYSTEMS Supplement 2009 pp. 790 799 A HYPERBOLIC RELAXATION MODEL FOR PRODUCT FLOW IN COMPLEX PRODUCTION NETWORKS Ali Unver, Christian Ringhofer

More information

Optimal Control. McGill COMP 765 Oct 3 rd, 2017

Optimal Control. McGill COMP 765 Oct 3 rd, 2017 Optimal Control McGill COMP 765 Oct 3 rd, 2017 Classical Control Quiz Question 1: Can a PID controller be used to balance an inverted pendulum: A) That starts upright? B) That must be swung-up (perhaps

More information

Quantum Games. Quantum Strategies in Classical Games. Presented by Yaniv Carmeli

Quantum Games. Quantum Strategies in Classical Games. Presented by Yaniv Carmeli Quantum Games Quantum Strategies in Classical Games Presented by Yaniv Carmeli 1 Talk Outline Introduction Game Theory Why quantum games? PQ Games PQ penny flip 2x2 Games Quantum strategies 2 Game Theory

More information

CONTINUUM MODELS FOR FLOWS ON COMPLEX NETWORKS

CONTINUUM MODELS FOR FLOWS ON COMPLEX NETWORKS CONTINUUM MODELS FOR FLOWS ON COMPLEX NETWORKS C. Ringhofer ringhofer@asu.edu, math.la.asu.edu/ chris OUTLINE 1 Introduction: Network Examples: Some examples of real world networks + theoretical structure.

More information

A Model of Evolutionary Dynamics with Quasiperiodic Forcing

A Model of Evolutionary Dynamics with Quasiperiodic Forcing paper presented at Society for Experimental Mechanics (SEM) IMAC XXXIII Conference on Structural Dynamics February 2-5, 205, Orlando FL A Model of Evolutionary Dynamics with Quasiperiodic Forcing Elizabeth

More information

Non-Zero-Sum Stochastic Differential Games of Controls and St

Non-Zero-Sum Stochastic Differential Games of Controls and St Non-Zero-Sum Stochastic Differential Games of Controls and Stoppings October 1, 2009 Based on two preprints: to a Non-Zero-Sum Stochastic Differential Game of Controls and Stoppings I. Karatzas, Q. Li,

More information

6.891 Games, Decision, and Computation February 5, Lecture 2

6.891 Games, Decision, and Computation February 5, Lecture 2 6.891 Games, Decision, and Computation February 5, 2015 Lecture 2 Lecturer: Constantinos Daskalakis Scribe: Constantinos Daskalakis We formally define games and the solution concepts overviewed in Lecture

More information

Poisson Jumps in Credit Risk Modeling: a Partial Integro-differential Equation Formulation

Poisson Jumps in Credit Risk Modeling: a Partial Integro-differential Equation Formulation Poisson Jumps in Credit Risk Modeling: a Partial Integro-differential Equation Formulation Jingyi Zhu Department of Mathematics University of Utah zhu@math.utah.edu Collaborator: Marco Avellaneda (Courant

More information

Course 1 Solutions November 2001 Exams

Course 1 Solutions November 2001 Exams Course Solutions November Exams . A For i =,, let R = event that a red ball is drawn form urn i i B = event that a blue ball is drawn from urn i. i Then if x is the number of blue balls in urn, ( R R)

More information

A quantum heat equation 5th Spring School on Evolution Equations, TU Berlin

A quantum heat equation 5th Spring School on Evolution Equations, TU Berlin A quantum heat equation 5th Spring School on Evolution Equations, TU Berlin Mario Bukal A. Jüngel and D. Matthes ACROSS - Centre for Advanced Cooperative Systems Faculty of Electrical Engineering and Computing

More information

Stability of Feedback Solutions for Infinite Horizon Noncooperative Differential Games

Stability of Feedback Solutions for Infinite Horizon Noncooperative Differential Games Stability of Feedback Solutions for Infinite Horizon Noncooperative Differential Games Alberto Bressan ) and Khai T. Nguyen ) *) Department of Mathematics, Penn State University **) Department of Mathematics,

More information

Skorokhod Embeddings In Bounded Time

Skorokhod Embeddings In Bounded Time Skorokhod Embeddings In Bounded Time Stefan Ankirchner Institute for Applied Mathematics University of Bonn Endenicher Allee 6 D-535 Bonn ankirchner@hcm.uni-bonn.de Philipp Strack Bonn Graduate School

More information

Computation of Efficient Nash Equilibria for experimental economic games

Computation of Efficient Nash Equilibria for experimental economic games International Journal of Mathematics and Soft Computing Vol.5, No.2 (2015), 197-212. ISSN Print : 2249-3328 ISSN Online: 2319-5215 Computation of Efficient Nash Equilibria for experimental economic games

More information

Mean Field Games on networks

Mean Field Games on networks Mean Field Games on networks Claudio Marchi Università di Padova joint works with: S. Cacace (Rome) and F. Camilli (Rome) C. Marchi (Univ. of Padova) Mean Field Games on networks Roma, June 14 th, 2017

More information

Quantum Hydrodynamics models derived from the entropy principle

Quantum Hydrodynamics models derived from the entropy principle 1 Quantum Hydrodynamics models derived from the entropy principle P. Degond (1), Ch. Ringhofer (2) (1) MIP, CNRS and Université Paul Sabatier, 118 route de Narbonne, 31062 Toulouse cedex, France degond@mip.ups-tlse.fr

More information

The discrete-time second-best day-to-day dynamic pricing scheme

The discrete-time second-best day-to-day dynamic pricing scheme The discrete-time second-best day-to-day dynamic pricing scheme Linghui Han, David Z.W. Wang & Chengjuan Zhu 25-07-2017 School of Civil & Environmental Engineering Nanyang Technological University, Singapore

More information

2.3 Calculus of variations

2.3 Calculus of variations 2.3 Calculus of variations 2.3.1 Euler-Lagrange equation The action functional S[x(t)] = L(x, ẋ, t) dt (2.3.1) which maps a curve x(t) to a number, can be expanded in a Taylor series { S[x(t) + δx(t)]

More information

A Primal-Dual Algorithm for Computing a Cost Allocation in the. Core of Economic Lot-Sizing Games

A Primal-Dual Algorithm for Computing a Cost Allocation in the. Core of Economic Lot-Sizing Games 1 2 A Primal-Dual Algorithm for Computing a Cost Allocation in the Core of Economic Lot-Sizing Games 3 Mohan Gopaladesikan Nelson A. Uhan Jikai Zou 4 October 2011 5 6 7 8 9 10 11 12 Abstract We consider

More information

Lecture 5: Single Step Methods

Lecture 5: Single Step Methods Lecture 5: Single Step Methods J.K. Ryan@tudelft.nl WI3097TU Delft Institute of Applied Mathematics Delft University of Technology 1 October 2012 () Single Step Methods 1 October 2012 1 / 44 Outline 1

More information

Algorithmic Strategy Complexity

Algorithmic Strategy Complexity Algorithmic Strategy Complexity Abraham Neyman aneyman@math.huji.ac.il Hebrew University of Jerusalem Jerusalem Israel Algorithmic Strategy Complexity, Northwesten 2003 p.1/52 General Introduction The

More information

On Stopping Times and Impulse Control with Constraint

On Stopping Times and Impulse Control with Constraint On Stopping Times and Impulse Control with Constraint Jose Luis Menaldi Based on joint papers with M. Robin (216, 217) Department of Mathematics Wayne State University Detroit, Michigan 4822, USA (e-mail:

More information

Evolutionary Bargaining Strategies

Evolutionary Bargaining Strategies Evolutionary Bargaining Strategies Nanlin Jin http://cswww.essex.ac.uk/csp/bargain Evolutionary Bargaining Two players alternative offering game x A =?? Player A Rubinstein 1982, 1985: Subgame perfect

More information

6.254 : Game Theory with Engineering Applications Lecture 8: Supermodular and Potential Games

6.254 : Game Theory with Engineering Applications Lecture 8: Supermodular and Potential Games 6.254 : Game Theory with Engineering Applications Lecture 8: Supermodular and Asu Ozdaglar MIT March 2, 2010 1 Introduction Outline Review of Supermodular Games Reading: Fudenberg and Tirole, Section 12.3.

More information

Hyperbolicity of Systems Describing Value Functions in Differential Games which Model Duopoly Problems. Joanna Zwierzchowska

Hyperbolicity of Systems Describing Value Functions in Differential Games which Model Duopoly Problems. Joanna Zwierzchowska Decision Making in Manufacturing and Services Vol. 9 05 No. pp. 89 00 Hyperbolicity of Systems Describing Value Functions in Differential Games which Model Duopoly Problems Joanna Zwierzchowska Abstract.

More information

Mean Field Competitive Binary MDPs and Structured Solutions

Mean Field Competitive Binary MDPs and Structured Solutions Mean Field Competitive Binary MDPs and Structured Solutions Minyi Huang School of Mathematics and Statistics Carleton University Ottawa, Canada MFG217, UCLA, Aug 28 Sept 1, 217 1 / 32 Outline of talk The

More information

Stochastic Calculus. Kevin Sinclair. August 2, 2016

Stochastic Calculus. Kevin Sinclair. August 2, 2016 Stochastic Calculus Kevin Sinclair August, 16 1 Background Suppose we have a Brownian motion W. This is a process, and the value of W at a particular time T (which we write W T ) is a normally distributed

More information

EN Applied Optimal Control Lecture 8: Dynamic Programming October 10, 2018

EN Applied Optimal Control Lecture 8: Dynamic Programming October 10, 2018 EN530.603 Applied Optimal Control Lecture 8: Dynamic Programming October 0, 08 Lecturer: Marin Kobilarov Dynamic Programming (DP) is conerned with the computation of an optimal policy, i.e. an optimal

More information

Lecture 1: March 7, 2018

Lecture 1: March 7, 2018 Reinforcement Learning Spring Semester, 2017/8 Lecture 1: March 7, 2018 Lecturer: Yishay Mansour Scribe: ym DISCLAIMER: Based on Learning and Planning in Dynamical Systems by Shie Mannor c, all rights

More information