COMPUTATION OF GENERALIZED NASH EQUILIBRIA WITH

Similar documents
Solving a Class of Generalized Nash Equilibrium Problems

GENERALIZED NASH EQUILIBRIUM PROBLEMS. Preprint 279 August August 1, 2007

6.254 : Game Theory with Engineering Applications Lecture 8: Supermodular and Potential Games

Basics of Game Theory

Static (or Simultaneous- Move) Games of Complete Information

Solving Quadratic Multi-Leader-Follower Games by Smoothing the Follower s Best Response

An Introduction to Noncooperative Games

Lecture 8 Plus properties, merit functions and gap functions. September 28, 2008

A Distributed Regularized Jacobi-type ADMM-Method for Generalized Nash Equilibrium Problems in Hilbert Spaces

Part IB Optimisation

Downloaded 01/28/18 to Redistribution subject to SIAM license or copyright; see

A Stochastic Multiple Leader Stackelberg Model: Analysis, Computation, and Application

Lecture 1. Evolution of Market Concentration

Existence theorems for generalized Nash equilibrium problems: an analysis of assumptions

6.207/14.15: Networks Lecture 10: Introduction to Game Theory 2

arxiv: v2 [cs.gt] 1 Dec 2018

6.254 : Game Theory with Engineering Applications Lecture 7: Supermodular Games

Game Theory and Algorithms Lecture 2: Nash Equilibria and Examples

Hotelling games on networks

Game Theory. Professor Peter Cramton Economics 300

Bounded Rationality Lecture 2. Full (Substantive, Economic) Rationality

FPTAS for Computing a Symmetric Leontief Competitive Economy Equilibrium

EE291E/ME 290Q Lecture Notes 8. Optimal Control and Dynamic Games

Near-Potential Games: Geometry and Dynamics

Near-Potential Games: Geometry and Dynamics

Chapter 9. Mixed Extensions. 9.1 Mixed strategies

Spring 2016 Network Science. Solution of Quiz I

Mathematical Economics - PhD in Economics

Static Models of Oligopoly

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program June 2016

Oligopoly. Molly W. Dahl Georgetown University Econ 101 Spring 2009

Advanced Microeconomics

Oligopoly Theory 2 Bertrand Market Games

Stochastic Approximation Approaches to the Stochastic Variational Inequality Problem

Supermodular Games. Ichiro Obara. February 6, 2012 UCLA. Obara (UCLA) Supermodular Games February 6, / 21

Game Theoretic Scenarios of Cybersecurity. Investment Models in a Sellers-Buyers Market. Spencer R. R. Kirbyson. A Thesis.

Relaxation algorithms to find Nash equilibria with economic applications

Introduction to game theory LECTURE 1

UC Berkeley Haas School of Business Game Theory (EMBA 296 & EWMBA 211) Summer 2016

Computing Equilibria of Repeated And Dynamic Games

Martingale optimal transport with Monge s cost function

12. Interior-point methods

Constrained Optimization and Lagrangian Duality

MS&E 246: Lecture 12 Static games of incomplete information. Ramesh Johari

Nash Equilibria in Normal Games via Optimization Methods

Computing risk averse equilibrium in incomplete market. Henri Gerard Andy Philpott, Vincent Leclère

Collaborative Network Formation in Spatial Oligopolies

Scientific Computing: An Introductory Survey

Scientific Computing: An Introductory Survey

An inexact subgradient algorithm for Equilibrium Problems

Game theory and market power

Conjectural Variations in Aggregative Games: An Evolutionary Perspective

1 PROBLEM DEFINITION. i=1 z i = 1 }.

LEARNING IN CONCAVE GAMES

Supplemental Document to the Paper A Game-Theoretic Approach for Regulating Hazmat Transportation

Technische Universität Dresden Herausgeber: Der Rektor

6.891 Games, Decision, and Computation February 5, Lecture 2

Justification of Constrained Game Equilibrium Models 1

Ergodicity and Non-Ergodicity in Economics

Network Games: Learning and Dynamics

Convex Optimization. Newton s method. ENSAE: Optimisation 1/44

4: Dynamic games. Concordia February 6, 2017

Multimarket Oligopolies with Restricted Market Access

Formulating an n-person noncooperative game as a tensor complementarity problem

Sequential Quadratic Programming Method for Nonlinear Second-Order Cone Programming Problems. Hirokazu KATO

Interior Point Algorithms for Constrained Convex Optimization

Variational inequality formulation of chance-constrained games

On the Existence of Strong Nash Equilibria

Game Theory Correlated equilibrium 1

Lecture 3. Optimization Problems and Iterative Algorithms

Convex Optimization & Lagrange Duality

Algorithms for Constrained Optimization

REPEATED GAMES. Jörgen Weibull. April 13, 2010

Modeling, equilibria, power and risk

Constructive Proof of the Fan-Glicksberg Fixed Point Theorem for Sequentially Locally Non-constant Multi-functions in a Locally Convex Space

Computing Normalized Equilibria in Convex-Concave Games S. D. Flåm and A. Ruszczyński. April 26, 2006

Solving Dual Problems

Optimality conditions and complementarity, Nash equilibria and games, engineering and economic application

SEQUENTIAL ESTIMATION OF DYNAMIC DISCRETE GAMES. Victor Aguirregabiria (Boston University) and. Pedro Mira (CEMFI) Applied Micro Workshop at Minnesota

12. Interior-point methods

Dynamic Bertrand and Cournot Competition

A CHARACTERIZATION OF STRICT LOCAL MINIMIZERS OF ORDER ONE FOR STATIC MINMAX PROBLEMS IN THE PARAMETRIC CONSTRAINT CASE

Algorithms for constrained local optimization

A DC (DIFFERENCE OF CONVEX FUNCTIONS) APPROACH OF THE MPECS. Matthieu Marechal. Rafael Correa. (Communicated by the associate editor name)

EC3224 Autumn Lecture #04 Mixed-Strategy Equilibrium

Game theory Lecture 19. Dynamic games. Game theory

Robust Comparative Statics in Large Static Games

Outline. Scientific Computing: An Introductory Survey. Optimization. Optimization Problems. Examples: Optimization Problems

An (exhaustive?) overview of optimization methods

6.207/14.15: Networks Lecture 11: Introduction to Game Theory 3

Stackelberg oligopoly TU-games: characterization of the core and 1-concavity of the dual game

Extreme Abridgment of Boyd and Vandenberghe s Convex Optimization

Flow Control, Routing, and Performance with a For-profit Service Provider

Solving Extensive Form Games

Solving Dynamic Games with Newton s Method

Infeasibility Detection and an Inexact Active-Set Method for Large-Scale Nonlinear Optimization

3.3.3 Illustration: Infinitely repeated Cournot duopoly.

ECE Optimization for wireless networks Final. minimize f o (x) s.t. Ax = b,

Affine scaling interior Levenberg-Marquardt method for KKT systems. C S:Levenberg-Marquardt{)KKTXÚ

Motivation. Lecture 2 Topics from Optimization and Duality. network utility maximization (NUM) problem:

Transcription:

COMPUTATION OF GENERALIZED NASH EQUILIBRIA WITH THE GNE PACKAGE C. Dutang 1,2 1 ISFA - Lyon, 2 AXA GRM - Paris, 1/19 12/08/2011 user! 2011

OUTLINE 1 GAP FUNCTION MINIZATION APPROACH 2 FIXED-POINT APPROACH 3 KKT REFORMULATION 2/19 12/08/2011 user! 2011

WHAT IS A NASH EQUILIBRIUM (NE)? Some history, An equilibrium concept was first introduced by Cournot in 1838. But mathematical concepts had widely spread with The theory of games and economic behavior of von Neumann & Morgenstern book in 1947. John F. Nash introduces the NE equilibrium concept in a 1949 seminar. 3/19 12/08/2011 user! 2011

WHAT IS A NASH EQUILIBRIUM (NE)? Some history, An equilibrium concept was first introduced by Cournot in 1838. But mathematical concepts had widely spread with The theory of games and economic behavior of von Neumann & Morgenstern book in 1947. John F. Nash introduces the NE equilibrium concept in a 1949 seminar. Various names for various games Which type of players? Cooperative vs. non-cooperative Which type of for the strategy set? Finite vs. infinite games e.g. {squeal, silent} for Prisoner s dilemna vs. [0, 1] cake cutting game Is there any time involved? Static vs. dynamic games e.g. battle of the sexes vs. pursuit-evasion game Is it stochastic? Deterministic payoff vs. random payoff 3/19 12/08/2011 user! 2011

NON-COOPERATIVE GAMES Idea : to model competition among players Characterization by a pair of a set of stategies S for the n players a payoff function f such that f i (s) is the payoff for player i given the overall strategy s 4/19 12/08/2011 user! 2011

NON-COOPERATIVE GAMES Idea : to model competition among players Characterization by a pair of a set of stategies S for the n players a payoff function f such that f i (s) is the payoff for player i given the overall strategy s Nash equilibrium is defined as a strategy in which no player can improve unilateraly his payoff. 4/19 12/08/2011 user! 2011

NON-COOPERATIVE GAMES Idea : to model competition among players Characterization by a pair of a set of stategies S for the n players a payoff function f such that f i (s) is the payoff for player i given the overall strategy s Nash equilibrium is defined as a strategy in which no player can improve unilateraly his payoff. Duopoly example: Consider two profit-seeking firms sell an identical product on the same market. Each firm will choose its production rates. Let x i R + be the production of firm i and d, λ and ρ be constants. The market price is p(x) = d ρ(x 1 + x 2), from which we deduce the ith firm profit p(x)x i λx i. For d = 20, λ = 4, ρ = 1, we get x = (16/3, 16/3). 4/19 12/08/2011 user! 2011

EXTENSION TO GENERALIZED NE (GNE) DEFINITION (NEP) The Nash equilibrium problem NEP(N, θ ν, X) for N players consists in finding x solving N sub problems x ν solves min y ν X ν θ ν(y ν, x ν), ν = 1,..., N, where X ν is the action space of player ν. 5/19 12/08/2011 user! 2011

EXTENSION TO GENERALIZED NE (GNE) DEFINITION (NEP) The Nash equilibrium problem NEP(N, θ ν, X) for N players consists in finding x solving N sub problems x ν solves min y ν X ν θ ν(y ν, x ν), ν = 1,..., N, where X ν is the action space of player ν. GNE was first introduced by Debreu, [Deb52]. DEFINITION (GNEP) The generalized NE problem GNEP(N, θ ν, X ν) consists in finding x solving N sub problems x ν solves min y ν θ ν(y ν, x ν) such that x ν X ν(x ν), ν = 1,..., N, where X ν(x ν) is the action space of player ν given others player actions x ν. Remark: we study a jointly convex case X = {x, g(x) 0, ν, h ν(x ν) 0}. 5/19 12/08/2011 user! 2011

OUTLINE 1 GAP FUNCTION MINIZATION APPROACH 2 FIXED-POINT APPROACH 3 KKT REFORMULATION 6/19 12/08/2011 user! 2011

REFORMULATION AS A GAP DEFINITION (GP) Let ψ be a gap function. We define a merit function as ˆV (x) = sup ψ(x, y). y X(x) From [vhk09], the gap problem GP(N, X, ψ) is x X(x ) and ˆV (x ) = 0. 7/19 12/08/2011 user! 2011

REFORMULATION AS A GAP DEFINITION (GP) Let ψ be a gap function. We define a merit function as ˆV (x) = sup ψ(x, y). y X(x) From [vhk09], the gap problem GP(N, X, ψ) is x X(x ) and ˆV (x ) = 0. PROPOSITION Assuming continuity, ˆV (x) 0 and so x solves the GP(N, X, ψ) is equivalent to min ˆV (x) and ˆV (x ) = 0. x X(x) 7/19 12/08/2011 user! 2011

REFORMULATION AS A GAP DEFINITION (GP) Let ψ be a gap function. We define a merit function as ˆV (x) = sup ψ(x, y). y X(x) From [vhk09], the gap problem GP(N, X, ψ) is x X(x ) and ˆV (x ) = 0. PROPOSITION Assuming continuity, ˆV (x) 0 and so x solves the GP(N, X, ψ) is equivalent to min ˆV (x) and ˆV (x ) = 0. x X(x) Example: the Nikaido-Isoda function NX ψ(x, y) = [θ(x ν, x ν) θ(y ν, x ν)] α 2 x y 2. ν=1 7/19 12/08/2011 user! 2011

THE M I NGA P FUNCTION The gap function minimization consists in minimizing a gap function min V (x). The function mingap provides two optimization methods. Barzilai-Borwein method x n+1 = x n + t nd n, where direction is d n = V (x n) and stepsize BFGS method t n = d T n 1 d T n 1d n 1 ( V (xn) V (xn 1)). x n+1 = x n + t nh 1 n d n, where H n approximates the Hessian by a symmetric rank two update of the type H n+1 = H n + auu T + bvv T and t n is line searched. 8/19 12/08/2011 user! 2011

DUOPOLY EXAMPLE Let x i R + be the production of firm i and d = 20, λ = 4 and ρ = 1 be constants. The ith firm profit θ i(x) = (d ρ(x 1 + x 2))x i λx i. > mingap(c(10,20), V, GV, method="bb",...) **** k 0 x_k 10 20 **** k 1 x_k 9.947252 19.50739 **** k 2 x_k 8.42043 7.216102 **** k 3 x_k 6.53553 6.066531 **** k 4 x_k 5.333327 5.333322 $par [1] 5.333327 5.333322 $outer.counts Vhat gradvhat 5 9 $outer.iter [1] 3 $code [1] 0 > 9/19 12/08/2011 user! 2011

DUOPOLY EXAMPLE Let x i R + be the production of firm i and d = 20, λ = 4 and ρ = 1 be constants. The ith firm profit θ i(x) = (d ρ(x 1 + x 2))x i λx i. > mingap(c(10,20), V, GV, method="bb",...) **** k 0 x_k 10 20 **** k 1 x_k 9.947252 19.50739 **** k 2 x_k 8.42043 7.216102 **** k 3 x_k 6.53553 6.066531 **** k 4 x_k 5.333327 5.333322 $par [1] 5.333327 5.333322 Outer iterations 3 Inner iterations - V 141 Inner iterations - V 269 $outer.counts Vhat gradvhat 5 9 $outer.iter [1] 3 $code [1] 0 > 9/19 12/08/2011 user! 2011

DUOPOLY EXAMPLE Let x i R + be the production of firm i and d = 20, λ = 4 and ρ = 1 be constants. The ith firm profit θ i(x) = (d ρ(x 1 + x 2))x i λx i. > mingap(c(10,20), V, GV, method="bb",...) **** k 0 x_k 10 20 **** k 1 x_k 9.947252 19.50739 **** k 2 x_k 8.42043 7.216102 **** k 3 x_k 6.53553 6.066531 **** k 4 x_k 5.333327 5.333322 $par [1] 5.333327 5.333322 $outer.counts Vhat gradvhat 5 9 $outer.iter [1] 3 Outer iterations 3 Inner iterations - V 141 Inner iterations - V 269 Function calls - V 5 leading to calls - ψ 978 leading to calls - ψ 276 Function calls - V 9 leading to calls - ψ 2760 leading to calls - ψ 652 $code [1] 0 > 9/19 12/08/2011 user! 2011

OUTLINE 1 GAP FUNCTION MINIZATION APPROACH 2 FIXED-POINT APPROACH 3 KKT REFORMULATION 10/19 12/08/2011 user! 2011

REFORMULATION OF THE GNE PROBLEM AS A FIXED-POINT DEFINITION (REGULARIZED NIF) The regularized Nikaido-Isoda function is NX NIF α(x, y) = [θ(x ν, x ν) θ(y ν, x ν)] α 2 x y 2, ν=1 with a regularization parameter α > 0. Let y α : R n R n be the equilibrium response defined as y α(x) = (yα(x), 1..., yα N (x)), where yα(x) i = arg max NIF α(x, y). y i X i (x i ) 11/19 12/08/2011 user! 2011

THE F I X E D P O I N T FUNCTION Fixed-point methods for f(x) = 0 pure fixed point method Polynomial methods Relaxation algorithm x n+1 = f(x n). x n+1 = λ nf(x n) + (1 λ n)x n, where (λ n) n is either constant, decreasing or line-searched. RRE and MPE method x n+1 = x n + t n(f(x n) x n) where r n = f(x n) x n and v n = f(f(x n)) 2f(x n) + x n, with t n equals to < v n, r n > / < v n, v n > for RRE1, < r n, r n > / < v n, r n > for MPE1. Squaring method It consists in applying twice a cycle step to get the next iterate given t n such as RRE and MPE. Epsilon algorithms: not implemented x n+1 = x n 2t nr n + t 2 n vn, 12/19 12/08/2011 user! 2011

DUOPOLY EXAMPLE Let x i R + be the production of firm i and d = 20, λ = 4 and ρ = 1 be constants. The ith firm profit θ i(x) = (d ρ(x 1 + x 2))x i λx i. > fixedpoint(c(0,0), method="pure",...) $par [1] 5.333334 5.333333 $outer.counts yfunc Vfunc 24 24 $outer.iter [1] 24 $code [1] 0 > 13/19 12/08/2011 user! 2011

DUOPOLY EXAMPLE Let x i R + be the production of firm i and d = 20, λ = 4 and ρ = 1 be constants. The ith firm profit θ i(x) = (d ρ(x 1 + x 2))x i λx i. > fixedpoint(c(0,0), method="pure",...) $par [1] 5.333334 5.333333 Total calls ψ ψ Outer iter Gap - BB 317 90 2 $outer.counts yfunc Vfunc 24 24 $outer.iter [1] 24 $code [1] 0 > 13/19 12/08/2011 user! 2011

DUOPOLY EXAMPLE Let x i R + be the production of firm i and d = 20, λ = 4 and ρ = 1 be constants. The ith firm profit θ i(x) = (d ρ(x 1 + x 2))x i λx i. > fixedpoint(c(0,0), method="pure",...) $par [1] 5.333334 5.333333 $outer.counts yfunc Vfunc 24 24 $outer.iter [1] 24 Total calls ψ ψ Outer iter Gap - BB 317 90 2 Pure FP 2276 724 25 Decr. FP 1207 399 13 Line search FP 2492 802 14 $code [1] 0 > 13/19 12/08/2011 user! 2011

DUOPOLY EXAMPLE Let x i R + be the production of firm i and d = 20, λ = 4 and ρ = 1 be constants. The ith firm profit θ i(x) = (d ρ(x 1 + x 2))x i λx i. > fixedpoint(c(0,0), method="pure",...) $par [1] 5.333334 5.333333 $outer.counts yfunc Vfunc 24 24 $outer.iter [1] 24 $code [1] 0 > Total calls ψ ψ Outer iter Gap - BB 317 90 2 Pure FP 2276 724 25 Decr. FP 1207 399 13 Line search FP 2492 802 14 RRE FP 536 134 4 MPE FP 390 120 3 13/19 12/08/2011 user! 2011

DUOPOLY EXAMPLE Let x i R + be the production of firm i and d = 20, λ = 4 and ρ = 1 be constants. The ith firm profit θ i(x) = (d ρ(x 1 + x 2))x i λx i. > fixedpoint(c(0,0), method="pure",...) $par [1] 5.333334 5.333333 $outer.counts yfunc Vfunc 24 24 $outer.iter [1] 24 $code [1] 0 > Total calls ψ ψ Outer iter Gap - BB 317 90 2 Pure FP 2276 724 25 Decr. FP 1207 399 13 Line search FP 2492 802 14 RRE FP 536 134 4 MPE FP 390 120 3 SqRRE FP 491 197 4 SqMPE FP 530 166 3 13/19 12/08/2011 user! 2011

OUTLINE 1 GAP FUNCTION MINIZATION APPROACH 2 FIXED-POINT APPROACH 3 KKT REFORMULATION 14/19 12/08/2011 user! 2011

KKT SYSTEM REFORMULATION DEFINITION (KKT) From [FFP09], the first-order necessary conditions for ν subproblem states, there exists a Lagrangian multiplier λ ν R m such that L(x, λ) = xν θ xν (x) + X 1 j m λ νj xν g j(x) = 0 ( R nν ). 0 λ ν g(x) 0 ( R m ) 15/19 12/08/2011 user! 2011

KKT SYSTEM REFORMULATION DEFINITION (KKT) From [FFP09], the first-order necessary conditions for ν subproblem states, there exists a Lagrangian multiplier λ ν R m such that L(x, λ) = xν θ xν (x) + X 1 j m λ νj xν g j(x) = 0 ( R nν ). 0 λ ν g(x) 0 ( R m ) PROPOSITION The extended KKT system can be reformulated as a system of equations using complementarity functions. Let φ be a complementarity function. «L(x, λ) F φ (x, λ) = 0 where F φ (x, λ) =, φ.( g(x), λ) where φ. is the component-wise version of φ. Example: φ(a, b) = min(a, b). 15/19 12/08/2011 user! 2011

THE NE W T O NKKT FUNCTION The problem is Φ(z) = 0, with z = (x T λ T ) T. It is solved by an iterative scheme z n+1 = z n + d n, where the direction d n is computed in two different ways: Newton method: The direction solves the system with V n Φ(x n). V nd = Φ(x n), The Levenberg-Marquardt method: The direction solves the system (V T n V n + λ k I)d = V T n Φ(x n), where I denotes the identity matrix, λ k is the LM parameter, e.g. λ n = Φ(z n) 2. 16/19 12/08/2011 user! 2011

DUOPOLY EXAMPLE Let x i R + be the production of firm i and d = 20, λ = 4 and ρ = 1 be constants. The ith firm profit θ i(x) = (d ρ(x 1 + x 2))x i λx i. > NewtonKKT(rep(0, 4), "Leven",...) $par [1] 5.333 5.333-3e-08-3e-08 $value [1] 4.502713e-08 $counts phi jacphi 12 12 $iter [1] 11 > 17/19 12/08/2011 user! 2011

DUOPOLY EXAMPLE Let x i R + be the production of firm i and d = 20, λ = 4 and ρ = 1 be constants. The ith firm profit θ i(x) = (d ρ(x 1 + x 2))x i λx i. > NewtonKKT(rep(0, 4), "Leven",...) $par [1] 5.333 5.333-3e-08-3e-08 $value [1] 4.502713e-08 $counts phi jacphi 12 12 Total calls ψ ψ Outer iter Gap - BB 317 90 2 Decr. FP 1207 399 13 MPE FP 390 120 3 $iter [1] 11 > 17/19 12/08/2011 user! 2011

DUOPOLY EXAMPLE Let x i R + be the production of firm i and d = 20, λ = 4 and ρ = 1 be constants. The ith firm profit θ i(x) = (d ρ(x 1 + x 2))x i λx i. > NewtonKKT(rep(0, 4), "Leven",...) $par [1] 5.333 5.333-3e-08-3e-08 $value [1] 4.502713e-08 $counts phi jacphi 12 12 $iter [1] 11 > Total calls ψ ψ Outer iter Gap - BB 317 90 2 Decr. FP 1207 399 13 MPE FP 390 120 3 Newton 3 3 2 LM 12 12 11 17/19 12/08/2011 user! 2011

CONCLUSION The GNE package 1 provides base tools to compute generalized Nash equilibria for static infinite noncooperative games. Three methods : Gap function minimization: mingap, Fixed-point methods: fixedpoint, KKT reformulation: NewtonKKT. 1 hosted on R-forge. 18/19 12/08/2011 user! 2011

CONCLUSION The GNE package 1 provides base tools to compute generalized Nash equilibria for static infinite noncooperative games. Three methods : Gap function minimization: mingap, Fixed-point methods: fixedpoint, KKT reformulation: NewtonKKT. Future developments: extends to non jointly convex case for FP and GP methods, develop further the KKT reformulation, later, extends to finite noncooperative games, e.g. Lemke-Howson algorithm, far later, extends to cooperative games! 1 hosted on R-forge. 18/19 12/08/2011 user! 2011

References REFERENCES Gerard Debreu, A social equilibrium existence theorem, Proc. Nat. Acad. Sci. U.S.A. (1952). Francisco Facchinei, Andreas Fischer, and Veronica Piccialli, Generalized Nash equilibrium problems and Newton methods, Math. Program., Ser. B 117 (2009), 163 194. Anna von Heusinger and Christian Kanzow, Optimization reformulations of the generalized Nash equilibrium problem using the Nikaido-Isoda type functions, Computational Optimization and Applications 43 (2009), no. 3. Thank you for your attention! 19/19 12/08/2011 user! 2011