Inertial Game Dynamics

Similar documents
LEARNING IN CONCAVE GAMES

Distributed Learning based on Entropy-Driven Game Dynamics

From holonomy reductions of Cartan geometries to geometric compactifications

ECOLE POLYTECHNIQUE HIGHER ORDER LEARNING AND EVOLUTION IN GAMES. Cahier n Rida LARAKI Panayotis MERTIKOPOULOS

4.7 The Levi-Civita connection and parallel transport

Chapter 3. Riemannian Manifolds - I. The subject of this thesis is to extend the combinatorial curve reconstruction approach to curves

A brief introduction to Semi-Riemannian geometry and general relativity. Hans Ringström

Hessian Riemannian Gradient Flows in Convex Programming

Stability of Feedback Solutions for Infinite Horizon Noncooperative Differential Games

Differential Geometry II Lecture 1: Introduction and Motivation

CALCULUS ON MANIFOLDS. 1. Riemannian manifolds Recall that for any smooth manifold M, dim M = n, the union T M =

Geometric inequalities for black holes

Structural and Multidisciplinary Optimization. P. Duysinx and P. Tossings

DIFFERENTIAL GEOMETRY, LECTURE 16-17, JULY 14-17

Near-Potential Games: Geometry and Dynamics

DIFFERENTIAL GEOMETRY. LECTURE 12-13,

LECTURE 10: THE ATIYAH-GUILLEMIN-STERNBERG CONVEXITY THEOREM

Rigidity and Non-rigidity Results on the Sphere

Manifold Optimization Over the Set of Doubly Stochastic Matrices: A Second-Order Geometry

Steepest descent method on a Riemannian manifold: the convex case

Convergence Rate of Nonlinear Switched Systems

DETERMINISTIC AND STOCHASTIC SELECTION DYNAMICS

Near-Potential Games: Geometry and Dynamics

minimize x subject to (x 2)(x 4) u,

Choice of Riemannian Metrics for Rigid Body Kinematics

Pairwise Comparison Dynamics for Games with Continuous Strategy Space

Recovery of anisotropic metrics from travel times

Integration of non linear conservation laws?

ON THE FOLIATION OF SPACE-TIME BY CONSTANT MEAN CURVATURE HYPERSURFACES

Hessian Riemannian gradient flows in convex programming

On the mass of asymptotically hyperbolic manifolds

DIFFERENTIAL GEOMETRY HW 12

Alberto Bressan. Department of Mathematics, Penn State University

Non-existence of time-periodic dynamics in general relativity

LECTURE 15: COMPLETENESS AND CONVEXITY

RAJASTHAN PUBLIC SERVICE COMMISSION, AJMER

Algorithms for constrained local optimization

Spatial Economics and Potential Games

EUCLIDEAN CONFORMALLY FLAT SUBMANIFOLDS IN CODIMENSION TWO OBTAINED AS INTERSECTIONS

Stabilization of Relative Equilibria for Underactuated Systems on Riemannian Manifolds

PICARD S THEOREM STEFAN FRIEDL

1. Geometry of the unit tangent bundle

Existence of 1-harmonic map flow

Math 6455 Nov 1, Differential Geometry I Fall 2006, Georgia Tech

THEODORE VORONOV DIFFERENTIAL GEOMETRY. Spring 2009

Calderón-Zygmund inequality on noncompact Riem. manifolds

Matrix Manifolds: Second-Order Geometry

Key words. saddle-point dynamics, asymptotic convergence, convex-concave functions, proximal calculus, center manifold theory, nonsmooth dynamics

Neighbourhoods of Randomness and Independence

An introduction to Mathematical Theory of Control

II. DIFFERENTIABLE MANIFOLDS. Washington Mio CENTER FOR APPLIED VISION AND IMAGING SCIENCES

Course Summary Math 211

Loos Symmetric Cones. Jimmie Lawson Louisiana State University. July, 2018

BERGMAN KERNEL ON COMPACT KÄHLER MANIFOLDS

Deforming conformal metrics with negative Bakry-Émery Ricci Tensor on manifolds with boundary

arxiv: v6 [cs.sy] 5 Jul 2017

Stabilization and Passivity-Based Control

Game theory Lecture 19. Dynamic games. Game theory

Rigidity of Black Holes

On the Stability of the Best Reply Map for Noncooperative Differential Games

GEOMETRY OF GEODESIC SPHERES IN A COMPLEX PROJECTIVE SPACE IN TERMS OF THEIR GEODESICS

Exercises in Geometry II University of Bonn, Summer semester 2015 Professor: Prof. Christian Blohmann Assistant: Saskia Voss Sheet 1

CS 468, Lecture 11: Covariant Differentiation

4 Riemannian geometry

Stability theory is a fundamental topic in mathematics and engineering, that include every

Stability of Nonlinear Systems An Introduction

Nonlinear Systems and Control Lecture # 12 Converse Lyapunov Functions & Time Varying Systems. p. 1/1

Evolution & Learning in Games

Modeling and Analysis of Dynamic Systems

LECTURE 10: THE PARALLEL TRANSPORT

Closed characteristics on non-compact mechanical contact manifolds 1

INVERSE CASCADE on HYPERBOLIC PLANE

(x k ) sequence in F, lim x k = x x F. If F : R n R is a function, level sets and sublevel sets of F are any sets of the form (respectively);

Poisson Equation on Closed Manifolds

A CONSTRUCTION OF TRANSVERSE SUBMANIFOLDS

Constrained Optimization and Lagrangian Duality

+ dxk. dt 2. dt Γi km dxm. . Its equations of motion are second order differential equations. with intitial conditions

HOMEWORK 2 - RIEMANNIAN GEOMETRY. 1. Problems In what follows (M, g) will always denote a Riemannian manifold with a Levi-Civita connection.

MATH DIFFERENTIAL GEOMETRY. Contents

An introduction to General Relativity and the positive mass theorem

Population Games and Evolutionary Dynamics

The harmonic map flow

Lyapunov stability ORDINARY DIFFERENTIAL EQUATIONS

Theorem 2. Let n 0 3 be a given integer. is rigid in the sense of Guillemin, so are all the spaces ḠR n,n, with n n 0.

The existence of light-like homogeneous geodesics in homogeneous Lorentzian manifolds. Sao Paulo, 2013

Special and General Relativity (PHZ 4601/5606) Fall 2018 Classwork and Homework. Every exercise counts 10 points unless stated differently.

Reduction of Homogeneous Riemannian structures

The Steepest Descent Algorithm for Unconstrained Optimization

UNIVERSITY OF DUBLIN

3 Stability and Lyapunov Functions

Online Convex Optimization

Mostow Rigidity. W. Dison June 17, (a) semi-simple Lie groups with trivial centre and no compact factors and

Evolution & Learning in Games

arxiv: v2 [math.dg] 30 Nov 2016

Written Examination

Localizing solutions of the Einstein equations

Introduction to Information Geometry

State observers for invariant dynamics on a Lie group

1 Directional Derivatives and Differentiability

IE 5531: Engineering Optimization I

Transcription:

... Inertial Game Dynamics R. Laraki P. Mertikopoulos CNRS LAMSADE laboratory CNRS LIG laboratory ADGO'13 Playa Blanca, October 15, 2013

... Motivation Main Idea: use second order tools to derive efficient learning algorithms in games. The second order exponential learning dynamics (Rida's talk) have many pleasant properties, but also various limitations: Cannot converge to interior equilibria (not a problem in many applications, desirable in others). Convex programming properties not clear no damping mechanism. Lack of a bona fide "heavy ball with friction" interpretation. In this talk: use geometric ideas to derive a class of inertial (= admitting an energy function), second order dynamics for learning in games.

... Approach Breakdown The main steps of our approach will be as follows: 1. Endow the simplex with a Hessian Riemannian geometric structure. 2. Derive the equations of motion for a learner under the forcing of his unilateral gradient (taken w.r.t. the HR geometry on the simplex). 3. Derive an isometric embedding of the problem into an ambient Euclidean space. 4. Establish the well-posedness of the dynamics. 5. Use the system's energy function to derive the dynamics' asymptotic properties.

... Notation We will work with finite games G G(N, A, u) consisting of: A finite set of players: N = {1,, N}. The players' action sets A k = {α k,0, α k,1, }, k N. The players' payoff functions u k A k A k R, extended multilinearly to X k (A k ) if players use mixed strategies x k X k (A k ). Note: indices will be suppressed when possible. Special case: if u kα (x) u kβ (x) = [V(α; x k ) V(β; x k )] for some V X R, the game is called a potential game. Equilibrium: we will say that q X is a Nash equilibrium of G if u kα (q) u kβ (q) for all α supp(q k ), β A k, k N.

... Riemannian Metrics A Riemannian metric on an open set U R m is a smoothly varying scalar product on U д(x, Y) X, Y д = j,k X j д jk Y k, X, Y R m, where д д(x) is a smooth field of positive-definite matrices on U.

... Riemannian Metrics A Riemannian metric on an open set U R m is a smoothly varying scalar product on U д(x, Y) X, Y д = j,k X j д jk Y k, X, Y R m, where д д(x) is a smooth field of positive-definite matrices on U. The gradient of a scalar function V U R with respect to д is defined as: grad д V = д 1 ( V) or, in components, ( grad д V) j = k д 1 jk k V, where V = ( j V) n j=1 is the array of partial derivatives of V.

... Riemannian Metrics A Riemannian metric on an open set U R m is a smoothly varying scalar product on U д(x, Y) X, Y д = j,k X j д jk Y k, X, Y R m, where д д(x) is a smooth field of positive-definite matrices on U. The gradient of a scalar function V U R with respect to д is defined as: grad д V = д 1 ( V) or, in components, ( grad д V) j = k д 1 jk k V, where V = ( j V) n j=1 is the array of partial derivatives of V. Fundamental property of the gradient: d dt V(x(t)) = grad д V, ẋ д. More generally, the derivative of V along a vector field X on U will be: X f d f X = grad f, X.

... Parallel Transport How can we differentiate a vector field along another in a Riemannian setting? Definition Let X, Y be vector fields on U. A connection on U will be a map (X, Y) X Y s.t.: 1. f1 X 1 + f 2 X 2 Y = f 1 X1 Y + f 2 X2 Y f 1, f 2 C (U). 2. X (ay 1 + by 2) = a X Y 1 + b X Y 2 for all a, b R. 3. X ( f Y) = f X Y + X f Y for all f C (U). In components: ( X Y) k = i X i i Y k + i, j Γ k i jx i Y j, where Γ k i j are the connection's Christoffel symbols.

... Covariant Differentiation A Riemannian metric generates the so-called Levi-Civita connection with symbols Γ k i j = 1 2 l д 1 kl ( i д l j + j д li l д i j ) This leads to the notion of covariant differentiation along a curve x(t) of U: ( ẋ X) k Ẋ k + i, j Γ k i jx iẋ j If the field being differentiated is the velocity of x(t), we obtain the acceleration of x(t) D 2 x k Dt 2 = ẍ k + i, j Γ k i jẋ iẋ j. Definition A geodesic on U is a curve x(t) with zero acceleration: D 2 x Dt 2 = 0.

... Hessian Riemannian Metrics We will be interested in a specific class of Riemannian metrics on the positive orthant R m >0 of R m generated by a family of barrier functions. Definition Let θ [0, + ) R {+ } be a C function such that 1. θ(x) < for all x > 0. 2. lim x 0 + θ (x) =. 3. θ (x) > 0 and θ (x) < 0 for all x > 0. The Hessian Riemannian metric generated by θ on R n+1 >0 will be д(x) = Hess ( k θ(x k )) or, in components, д i j (x) = θ (x i)δ i j. The function θ will be called the kernel of д.

... Hessian Riemannian Metrics We will be interested in a specific class of Riemannian metrics on the positive orthant R m >0 of R m generated by a family of barrier functions. Definition Let θ [0, + ) R {+ } be a C function such that 1. θ(x) < for all x > 0. 2. lim x 0 + θ (x) =. 3. θ (x) > 0 and θ (x) < 0 for all x > 0. The Hessian Riemannian metric generated by θ on R n+1 >0 will be д(x) = Hess ( k θ(x k )) or, in components, д i j (x) = θ (x i)δ i j. The function θ will be called the kernel of д. Examples The Shahshahani metric: θ(x) = x log x д i j (x) = δ i j /x j. The log-barrier metric: θ(x) = log x д i j (x) = δ i j /x 2 j. The Euclidean metric (non-example): θ(x) = 1 2 x2 д i j (x) = δ i j.

... The Heavy Ball with Friction The heavy ball with friction dynamics (Attouch et al.) on R m are: ẍ = grad V ηẋ, (HBF) where V R m R is a smooth potential function and η > 0 is the friction coefficient which dissipates energy. Theorem (Alvarez 2000) If V is convex and arg min V, (HBF) converges to a minimizer of V.

... The Heavy Ball with Friction The heavy ball with friction dynamics (Attouch et al.) on R m are: ẍ = grad V ηẋ, (HBF) where V R m R is a smooth potential function and η > 0 is the friction coefficient which dissipates energy. Theorem (Alvarez 2000) If V is convex and arg min V, (HBF) converges to a minimizer of V. We wish to apply the above method to the unit simplex ; in the presence of inequality constraints however, (HBF) is no longer well-posed: it exits in finite time. We will take a two-step approach: 1. Endow with a Hessian Riemannian structure. 2. Derive the Riemannian analogue of (HBF).

... The Heavy Ball with Friction on the Simplex Let д be a Hessian Riemannian metric on R n+1 >0 with kernel θ. Then (HBF) becomes: or, in components: ẍ k = with u k = k V and Γ k i j = 1 2 D 2 x Dt 2 = grad д V ηẋ, 1 θ (x k ) u k n i, j=1 Γk i jẋ iẋ j ηẋ k, θ (x k ) θ (x k ) δ i jk.

... The Heavy Ball with Friction on the Simplex Let д be a Hessian Riemannian metric on R n+1 >0 with kernel θ. Then (HBF) becomes: or, in components: ẍ k = with u k = k V and Γ k i j = 1 2 D 2 x Dt 2 = grad д V ηẋ, 1 θ (x k ) u k n i, j=1 Γk i jẋ iẋ j ηẋ k, θ (x k ) θ (x k ) δ i jk. Using d'alembert's principle to project on the simplex, we obtain the inertial dynamics: ẍ k = 1 θ k [u k l (Θ h /θ l ) u l ] 1 1 2 Driving force θ k [θ k ẋ 2 k l (Θ h /θ l ) θ Constraint force l ẋ 2 l ] ηẋ k Friction (ID) where θ k = θ (x k ) and Θ h is the harmonic mean Θ h = ( l 1/θ l ) 1.

... Inertial Game Dynamics Tensoring over players, we obtain the inertial game dynamics: ẍ kα = 1 [u θ kα β (Θ k,h/θ kβ) u kβ ] kα 1 1 [θ kαẋ 2 2 θ kα l (Θ k,h/θ kβ) θ kβẋkβ] 2 ηẋ kα, kα where the players' payoffs u kα = u k are viewed as unilateral gradients. x kα (IGD)

... Inertial Game Dynamics Tensoring over players, we obtain the inertial game dynamics: ẍ kα = 1 [u θ kα β (Θ k,h/θ kβ) u kβ ] kα 1 1 [θ kαẋ 2 2 θ kα l (Θ k,h/θ kβ) θ kβẋkβ] 2 ηẋ kα, kα where the players' payoffs u kα = u k are viewed as unilateral gradients. x kα Examples (IGD) 1. The Gibbs kernel θ(x) = x log x generates the inertial replicator dynamics: ẍ kα = x kα (u kα β x kβ u kβ ) + 1 2 x kα (ẋ 2 kα/x 2 kα β ẋ 2 kβ/x kβ ) ηẋ kα. (I-RD) 2. The Burg kernel θ(x) = log x generates the inertial log-barrier dynamics: ẍ kα = x 2 kα (u kα r 2 k β xkβu 2 kβ ) + x 2 kα (ẋkα/x 2 3 kα r 2 k β ẋkβ/x 2 kβ ) ηẋ kα, (I-LD) where r 2 k = β xkβ. 2

... Energy, Damping and Convergence For a single player, the Riemannian structure on gives rise to the energy functional: E(x, v) = 1 v, v + V(x) 2 Under the inertial dynamics, energy is dissipated: Ė = D2 x Dt 2, ẋ + grad V, ẋ = grad V ηẋ, ẋ + grad V, ẋ = η ẋ 2 0

... Energy, Damping and Convergence For a single player, the Riemannian structure on gives rise to the energy functional: E(x, v) = 1 v, v + V(x) 2 Under the inertial dynamics, energy is dissipated: Ė = D2 x Dt 2, ẋ + grad V, ẋ = grad V ηẋ, ẋ + grad V, ẋ = η ẋ 2 0 As a result, inertial trajectories that exist for all time eventually slow down: Proposition If x(t) exists for all t 0, then lim t ẋ(t) = 0.

... Energy, Damping and Convergence For a single player, the Riemannian structure on gives rise to the energy functional: E(x, v) = 1 v, v + V(x) 2 Under the inertial dynamics, energy is dissipated: Ė = D2 x Dt 2, ẋ + grad V, ẋ = grad V ηẋ, ẋ + grad V, ẋ = η ẋ 2 0 As a result, inertial trajectories that exist for all time eventually slow down: Proposition If x(t) exists for all t 0, then lim t ẋ(t) = 0. Theorem Assume that the dynamics (ID) are well-posed, and let q be a local minimizer of V with Hess(V) 0 at q. If x(0) is sufficiently close to q and the system's initial kinetic energy K(0) = 1 2 ẋ(0) 2 is low enough, then lim t x(t) = q.

... The Folk Theorem of Evolutionary Game Theory First order gradient descent w.r.t. the Shahshahani metric д i j (x) = δ i j /x j leads to the replicator equation: ẋ kα = x kα [u kα β x kβ u kβ ] (RD 1 ) The most well known stability and convergence result is the folk theorem of evolutionary game theory which states that (RD 1) has the following properties: I. A state is stationary iff it is a restricted equilibrium i.e.u kα (q) = u kβ (q) if α, β supp(q k ). II. If an interior solution orbit converges, its limit is Nash. III. If a point is Lyapunov stable, then it is also Nash. IV. A point is asymptotically stable if and only if it is a strict equilibrium.

... An Inertial Folk Theorem In our inertial setting, we have the following folk-type theorem: Theorem Assume that the inertial dynamics (IGD) are well-posed, and let x(t) be a solution orbit of (IGD) for η k 0. Then: I. x(t) = q for all t 0 if and only if q is a restricted equilibrium. II. If x(t) is interior and lim t x(t) = q, then q is a restricted equilibrium of G. III. If every neighborhood U of q in X admits an interior orbit x U (t) such that x U (t) U for all t 0, then q is a restricted equilibrium of G. IV. If q is a strict equilibrium of G and x(t) starts close enough to q with sufficiently low speed ẋ(0), then x(t) remains close to q for all t 0 and lim t x(t) = q.

... An Isometric Embedding into Euclidean Space The above results all rely on the inertial dynamics being well-posed not obvious! We will study this by embedding the problem isometrically in an ambient Euclidean space. Proposition (Nash embedding) Let ξ α = ϕ(x α ) with ϕ (x) = θ (x), and set S = {(ϕ(x 0 ),, ϕ(x n )) x rel int( )}. Then S with the ambient metric of R n is isomorphic to rel int( ) with the Hessian Riemannian metric generated by θ. Examples 1. The open unit simplex R n+1 with the Shahshahani metric д i j = δ i j /x j is isometric to an open orthant of the radius-2 sphere in R n+1 (Akin, 1979). 2. The open unit simplex R n+1 with the log-barrier metric д i j = δ i j /x 2 j is isometric to the closed hypersurface S = {ξ R n+1 ξ α < 0 and β e ξ β = 1}.

... of the Inertial Dynamics In the Euclidean variables ξ = ϕ(x), the inertial dynamics become: 1 ξ α = (u α θ β (Θ h /θ β ) u β ) + 1 1 α 2 θ β Θ α h θ β /(θ β ) 2 ξ2 β η ξ α. By the Euclidean isometry property, this is just Newton's ordinary second law of motion for particles constrained to move on the hypersurface S = {ξ R n+1 β ϕ 1 (ξ β ) = 1}. Theorem The dynamics (ID) are well-posed if and only if S is a closed hypersurface of R n+1. Proof technical and hard, but intuition straightforward: if S is bounded in some direction, then orbits can escape from that part of S in finite time.

... Examples Nash embedding for the Shahshahani simplex: θ(x) = x log x, ξ = 2 x. The dynamics escape in finite time.

Motivation... Examples Nash embedding for the Burg simplex: θ(x) = log x, ξ = log x. The dynamics are well-posed. P. Mertikopoulos CNRS Laboratoire d'informatique de Grenoble

... Future Directions Some open problems for the coffee break: What do the dynamics look like for more general domains? When are they well posed? Conjecture: if the interior of the feasible set can be mapped isometrically to a closed submanifold of some ambient real space. What are the dynamics' global convergence properties for special classes of functions (convex, analytic, etc.)?