Leader-Follower strategies for a Multi-Plant differential game

Size: px
Start display at page:

Download "Leader-Follower strategies for a Multi-Plant differential game"

Transcription

1 008 American Control Conference Westin Seattle Hotel, Seattle, Washington, USA June -3, 008 FrA066 Leader-Follower strategies for a Multi-Plant differential game Manuel Jiménez-Lizárraga, Alex Poznyak and MA Alcorta Abstract In this paper the formulation of a concept for a type of robust Leader-Follower equilibrium for a Multi- Plant or multiple scenarios differential game is developed The game dynamic is given by a family of N different possible differential equations Multi-Model representation with no information about the trajectory which is realized The robust Leader-Follower strategy for each player must confront with all possible scenarios simultaneously The problem of each player is the designing of min-max strategies for each player which guarantee an equilibrium for the worst case scenario Based on the Robust Maximum Principle, the conditions for a game to be in Robust Leader-Follower Equilibrium are presented As in the Nash equilibrium case the initial min-max differential game may be converted into a standard static game given in a multidimensional simplex A numerical procedure for resolving the case of linear quadratic differential game is presented I INTRODUCTION Beginning from the seminal works of [], [], [3] the leader-follower or Stackelberg solution for an open loop information structure in a two person differential games has been well established see also [4], [5] This concept of a solution for a game, introduced in 934 by the economist H von Stackelberg [6], is suitable when one of the players is forced to subordinate to an authority that must announces his decision first, before play his own strategy, or when one of the players chooses to play the game passively, that is, composes just his optimal reaction solve an oneplayer optimization problem given that the other player has announced his strategy As it is shown in the above publications this kind of problems are still tractable using results of optimal control theory All of them tackle this problem when the model of the considered dynamics is exactly known In the case of one player optimization problems the robust version of the traditional Maximum Principle referred to as Robust Maximum Principle RMP see [7], [8] and [9] allows to design an "optimal policy" of a min-max type for a multi-plant or multi-scenario problem, where each possible scenario is seen as possible parametric realization of the dynamic equation So, it appears that we have N possible linear state dynamic equations, each of them describing a model and there is no a priory information which will be the active one The RMP is based on the concept of min-max control problem where the operation of the maximization is taken over a set of uncertainty a parameter from a finite M Jiménez-Lizárraga and MA Alcorta are with the Faculty of Physical and Mathematical Sciences Autonomous University of Nuevo León, San Nicolas de los Garza N L, México majimenez,aalcorta@fcfmuanlmx A Poznyak is with the Automatic Control Department CINVESTAV-IPN México DF AP apoznyak@ctrlcinvestavmx set and the operation of the minimization is taken over set of admissible control strategies For a game with multiparticipants, choosing Nash strategies, some similar concept of Multi-scenarios has been used to exemplify the discrepancies of the players in information sets, models, cost functions or even different amount of information that the players could hold of a large scale system, [0], [], [], and recently, applying the Robust Maximum Principle there was derived a type of Robust Nash equilibrium [3] for a multi-scenario game parametrized by a parameter belonging to a given finite parametric set and the problem was formulated as a min-max problem of the game The purpose of this paper is to develop the RMP for a family of two person Multi-Plant game when one player is considered as the leader and the second one as the follower We formulate the problem for both leader and follower as min-max problem, that to the best of our knowledge has never been considered before, but presents a great interest because of its high spread applications The focus is, as in [3], the designing of strategies that should provide a "Robust leader-follower equilibrium" being applied to all scenarios or models of the game simultaneously It is shown that the resulting robust strategies for each player again appears as a mixture with the weights fulfilling the leader-follower condition of the controls which are the leader-follower for each fixed parameter value This technique permits to transform the formulation of the game with a leader from a Banach space where it was initially formulated into a finite dimensional space where the strategies to be found are the preference weights of each scenario in the weighted sum of the individual optimal controls The problem of finding such equilibrium weights is solve by the implementation of a special numerical procedure which extends the method given in [3] A numerical example illustrate the effectiveness of this approach II STANDARD LEADER-FOLLOWER EQUILIBRIUM We begin with a brief ly review of the concept for leaderfollower equilibrium The basic idea of a leader-follower strategy for a static two-person game seems to be very simple Consider two players Player and Player The cost function associated with the players are J u,u for Player J u,u for Player Both players want to minimize their criteria that naturally provokes a conf lict situation Defining the setsu andu for the admissible strategies for the player and player /08/$ AACC 3839

2 The resolution of this problem is given by the following equilibrium concept Choosing the player as the leader and player as the follower The set of strategies are said to be in a Leader-Follower equilibria with Player as the leader and Player as the follower if: and J u,u u J u,u o u J u,u o u = min u U J u,u andu o =u o u is the optimal policy of the player for a given strategy of the leader, and as is usual u u = u o u This means that being player as the leader, he must advance his strategy to play first and because the player want to minimize his functionalj thenu o is the "optimal reaction" OR of player for the minimization of J givenu If Player chooses any other strategyu, then Player will choose a strategy u o that minimizesj, but the resulting cost for Player may be greater than or equal to that when the Leader-Follower strategy with Player as the leader is used III MULTI-MODEL TWO PLAYERS GAME Consider the following two players multi-model differential game: ẋ =f x,u,u,t 3 wherex R n is the state vector of the game at timet [t 0,T], u j R m j j =, are the control strategies of each player at time t and is the entire index from a finite set A :={,,,M} describing each possible -model of the dynamics game 3,M is the number of possible model Let the individual aim performance h i, of each player i=, for each-model scenario be given by h i, :=h i 0x T+ T g i x,u,u,t dt i=, 4 The worst-case with respect to a possible scenario cost functionalf i for each player under fixed admissible strategiesu U andu U is defined by F i u,u :=max hi, u,u A Robust leader-follower Equilibrium The set of strategies are said to be in a Robust leaderfollower equilibria with Player as the leader and Player as the follower if: for any admissible strategyu,u U U the next inequalities hold F u,u F u,u o u, u :=u o u where: 5 F u,u o u = min u U F u,u 6 Note here that the main difference with the Standard leader-follower equilibrium is the max operation taken over all possible scenarios B Open-Loop Robust leader-follower Strategies for Multi- Model Differential Games Proceeding first outlining the robust optimal reaction of the follower, let us represent the robust optimal control problem for the follower following the standard procedure in Optimal Control Theory For each possible scenario introduce the extended variables for the follower x,f = x,,x n,x,f n+ defined inr n+ and the last component given by x,f n+ T x,f n+ = or, in differential form, ẋ,f n+ =g g x,u,u,τ dτ x,u,u,t, x,f n+ t 0=0 7 Now the initial individual aim performance for the follower 4 can be represented in the Mayer form without an integral term: h, =h, 0 xt+x,f n+ T 8 Notice thath, 0 x does not depend on the last coordenate x,f n+, that is, x,f n+ extended conjugate vector-variable by satisfying ψ j =- n k= h, 0 x = 0 Define also the new ψ :=ψ,,ψ n R n ψ := ψ,,ψ n,ψ n+ R n+ f k x,u,u,t ψ k - g x,u,u,t x j with some terminal condition j=,,n+ x n+ ψ n+ ψ j T=b j 0 t T, j=,,n+ 0 where the vectorb will be defined below For the "superextended" vectors for the follower defined by x,f := x,,x n+;;x M,,x M n+ ψ := ψ,,ψ n+;;ψ M,,ψ M n+ f,f := f,,f n,g ;;f M,,fn M,g M f,f =f,,f n,g Rn+ these vectors represent the complete family of trajectories including then+ state variables for the follower Using the previous vectors, following [8], define the next "generalized" Hamiltonian function for the follower: H ψ, x,f,u,u,t x := ψ, f,f,f,u,u,t = ψ, x,f,u,u,t = [ H ψ j f j x,u,u,t +ψ n+g x,f,u,u,t ]

3 the direct 3, 7 and conjugate 9 ODE equations may be represented shortly in the standard Hamiltonian form as d H ψ, x,f,u,u,t dt x,f = ψ ψ, x,f,u,u,t H d dt ψ = x,f Again, as it follows from the definition 6, and as it is shown in [], [], the optimal reaction u o of the follower to and fixed leader strategyu of the leader, is solved as an standard optimal control problem, for the presented Multi- Model problem the solution is given as in [9], the follower should solve the following robust optimal control problem: max h, u,u u min u Uadm Uadm i :={ u i u,u } 3 o u that provides the robust Stackelberg strategy for the follower Following the result of [9] the necessary condition for a robust optimality of a strategy for the follower must fulfill the next condition: the maximality condition The control strategies for the followeru o t Uadm t [0,T] satisfies ψ, x,f,u,u,t ψ o, x,f,u,u,t H or, equivalently, u o Arg max H u Uadm H ψ, x,f,u,u o,t 4 the complementary slackness condition For every next conditions hold µ h, F =0 5 the transversality condition Follower For every and everyi=,,n b +µ gradh, x T=0 ψ T=b, ψ n+t+µ =0 6 To derive the robust optimal condition for the leader, using the same Mayer representation as before, the functional for the leader is represented as: where h, =h, 0 xt+x,l n+ T T x,l n+ = g x,u,u,τ dτ Now consider the next hamiltonian representation for the leader: H ψ, λ, λ,λ 3, x,l,u,u,t := λ, f,l x,l,u,u,t - λ H x +λ H,f 3 u = H ψ, λ, λ,λ 3, x,l,u,u,t = [ λ,j fj x,u,u,t + λ,n+g x,f,u,u,t - λ H +λ x,f 3 H u where the extended vector are defined as: x,l := x,,x,l n+ ;;xm,,x M,l n+ λ := λ,,,λ,n+ ;;λm,,,λm,n+ λ := λ,,,λ,n+ ;;λm,,,λm,n+ f,l := f,,f n,g ;;f M,,fn M,g M f,l =f,,f n,g R n+ with the adjoint variables satisfying conditions that must be satisfy by the leader d H dt λ ψ, λ, λ,λ 3, x,l,u,u,t = x,l H ψ, λ, λ,λ 3, x,l,u,u,t d dt λ = ψ λ t 0=0 The application of the result in [9], yields to the next conditions that must be satisfy by the leader the maximality condition The control strategies for the leaderu t Uadm t [t 0,T], satisfies: H ψ, λ, λ,λ 3, x,l,u,u,t H ψ, λ, λ,λ 3, x,l,u,u,t equivalently, u Arg max H u Uadm ψ, λ, λ,λ 3, x,l,u,u,t 7 the complementary slackness condition For every next conditions hold µ h, F =0 8 the transversality condition Follower For every and everyi=,,n c +µ [ gradh, x] T x h, x Tλ T =0; λ T =c λ n+t+µ =0 9 additionally the leader must satisfy the next relation: H ψ, λ, λ,λ 3, x,l,u,u,t u =0 0 C Open-Loop Robust Leader-Follower Strategies in LQ Differential Games Consider the next two-player linear quadratic multimodel game: ẋ t=a tx t+ B,j tu j t j= x t R n, x t 0 =x 0, u j t R mj, j=,; t [t 0,T], T < ; ={,,,M} 384

4 Suppose that the individual aim performance h i, of each i-player i=, for each-model scenario is given by h i, u [,u = x TQ i f x T+ T x Q i x + u j R ij u ]dt j j= As we mentioned before the robust optimal control problem for each player is formulated as 3 In what follows we illustrate how the construction given above is applied for the case of two players leader follower LQ multi-model differential games The Hamiltonians for the leader and follower LQ are: [ x Q x + H = ψ n+ j= u j R j u j + ψ A x +B, u +B, u H = [ λ,n+ x Q x + u j R j u j + j= λ A x +B, u +B, u -λ Q x +A, ψ + R u +ψ B, λ 3 3 The conditions 4-0 for the LQ multimodel plant are as follows: ψ =- x H =-A ψ -ψ n+ Q x ; ψ n+ t =0 [ ψ T =-µ grad x TQ f x T +x,f n+ ]= T µ Q x T; ψ n+ T =-µ λ = - x H = -A λ λ,n+q x +Q λ λ,n+t=0 [ ] λ T=µ Q f x T Q f λ T λ,n+t=-µ λ = - ψ H = -A λ +B, λ 3; λ t 0 =0 4 the robust Stackelberg strategies for the leader satisfies: R u t= µ B, λ 5 and for the follower R u t= µ B, ψ 6 and H u = [ - µ R u t+λ B, +R λ ] 3 = [ -µ µ R R B, ψ +λ B, +R λ ] 3 =0 7 finally λ 3 = R [ µ µ R R ] B, ψ B, λ 8 Since at least one index A is active we have: µ i >0Introducing the normalized adjoint variables with as { ψ ψ n t µ nt= if µ >0 0 if µ =0 { λ k,n t= λ n t µ if µ >0 0 if µ =0 k=,,3 we get ψ t =-A ψ - ψ ψ n+t=0; n+ Q x with the corresponding transversality conditions given by ψ T= Q x T, ψ n+t =- λ t=-a λ λ,n+q x +Q λ λ T=Q f x T Q f λ T; λ,n+t=- λ = -A λ +B, λ 3 ; λ t 0=0 the robust Stackelberg strategies becomes: u t= µ R µ B, λ = R ν, B, λ and for the follower 9 u t= µ R µ B, ψ = R ν, B, ψ 30 where the vectors ν i := ν,,,ν,m i=, belongs to the simplex: { S i,m := ν i R M= A :ν,i =µ i µ i 0, ν,i = Finally 3 λ 3 = R [ R R ] B, ψ B, λ 3 As one can see from 9 and 30, the Robust Optimal Control for both leader an follower is a mixture of the control actions optimal for each independent index 384

5 D Extended form for the game Now consider the next representation for the game A 0 Q i 0 A:= ;Q i := 0 A M 0 Q i Q i f 0 ν,i I 0 Q i f :=,Γ i := 0 Q i f 0 ν M,i I B i := [ B i, Bi,M ], I R n n i=, 33 In the extended form we obtain the general dynamics given by bold stand for extended vectors and matrices: ẋ=ax+b u +B u, x t 0 = x 0,,x M 0 ψ=-a ψ+q x; ψt =-Q f xt; λ =-A λ Q x+q λ ; λ T =Q f xt -Q f λ T λ = -Aλ +B λ 3 ; λ t 0 =0 λ 3 = R [ R R ] B ψ B λ u = R B Γ λ u = R B Γ ψ where x := x,,,x, n ;;x M,,,xn M, ψ ψ, :=,, ψ, n ;; ψ M,,, ψ M, n λ, M, M, k := λ, k,,, λ k,n ;; λ k,,, λ,n k=,,3 R nm R nm R nm Theorem: If for the two person linear quadratic differential game 34 with the following restrictions to the matrices: R ii > 0, R 0, Q i 0 and Q i f 0 i =, there exists a solution set of the following parametrized coupled differential equation [4]: Ṗ ν Λ,ν A B R B Λ,ν B R B P ν + P ν +A P ν +Γ Q =0; P ν T=Γ Q f ; P ν =P ν R nm nm A B R B Λ,ν B R B P ν + Λ,ν +A Λ,ν +Γ Q Γ Q Λ,ν =0; Λ,ν T=Γ Q f Q f Λ,ν T A B R B Λ,ν B R B P ν Λ,ν +Λ,ν +AΛ,ν +Γ B R R R B P ν Γ B R B Λ,ν =0; Γ Λ,ν t 0 =0 36 then the open-loop robust leader-follower equilibrium strategies with Player acting as the leader are: u = R B Λ,ν x u = R B P ν x 37 define the open-loop robust leader-follower equilibrium solution if the matrixγ i i=, in 33 contains the vectors ν i which satisfy the leader-follower equilibrium condition: J ν,ν ν J ν,ν o ν ν o ν :=arg min ν S N J ν,ν for anyν i S i,m 38 where J i ν,ν :=max hi, u,u i=, 39 with given by 37 parametrized byν andν ν i S i,m through 36 Proof: Let us try to represent Γ ψ as Γ ψ= P ν x, Γ λ = Λ,ν x, and Γ λ = Λ,ν x By 36 and by the commutation of the operators Γ i A =A Γ i andγ i Q i =Q i Γ i the result follows IV NUMERICAL PROCEDURE For the problem of finding the leader-follower equilibrium weights ν,ν ν we propose the use of the next minimizing numerical procedure To make this analytically is not a simple task Assuming that J i ν,ν ν >0for all ν i S i,m i =, if not, one can add to the cost functions any necessary positive constant that, evidently does not change anything, define the series of the vectors iterations { ν i,k} for any fixedn as { ν i,k+ =π S i,m ν i,k γ i,k + F J i ν,k,ν,k} i ν,k,ν,k ν i,0 S i,m, k=,, F i ν,k,ν,k [ h,i = ν,k,ν,k, h M,i ν,k,ν,k] J i ν,k,ν,k = max h,i ν,k,ν,k,n 40 where π S i,m{ } is the projector of an argument to the simplexs i,m and the new functional h,i is defined as: h,i ν,k,ν,k := δ ν i +h,i ν,k,ν,k 4 whether this algorithm converges to a unique point or not we let this discussion for a large version of this work Algorithm The algorithm for finding a solution for the robust leader-follower strategies is summarized as follows: Step Select an initial condition for the leader s weights Γ Step Applying the control action equal to the weighted combination of standard stackelberg strategies, calculate all possible for each scenario dynamics and the corresponding cost functionals Step 3 Using the corresponding cost functionals perform iteratively the minimizing procedure 40 to find the optimal response of the follower Γ for a given fixed strategy of the leader Step 4 Given the optimal response of the follower Γ keep fixed this 3843

6 values and use again 40 to minimize the leader s weights Γ Step 5 Continue form the Step making the initial conditions with values found in 3 until convergence A Solving Coupled equations For the solution of the set of coupled Riccati equations 36, we follow the work of [], which is based on the solution of an auxiliary system For the lack of space we omit the details of this method V NUMERICAL EXAMPLE Consider the next two-scenarios two-players LQ differential game given by ẋ=ax+b [ ] u +B u, A 0 3 A= 0 A ;x 0 = A = ;A = ; B, =B, = ;B 0, =B, = Q, =Q, = I ;Q, =Q, =5 I Q, f =3I ;Q, f =I ;Q, f =3I ;Q, f =I R ii =R ij =I i,j=, The tables below show the convergence to dependence of the cost functionals on the weights ν = ν, ν, ν = ν, ν, ν 0,ν 0 k ν ν h, h, Table Cost Function Leader k ν ν h, h, Table Cost Function Follower As one can see in the tables and the numerical procedure works efficiently finding the robust strategies, in this case the algorithm practically finish after one cycle of performing 5 iterations for the follower and 5 iterations for the leader At the end of the process the cost functionals arrives to the practically same values as we expected from the complementary slackness condition 5 and 8 VI CONCLUSIONS In this paper the formulation of a concept for a type of robust Leader-Follower equilibrium for a Multi-Plant differential game was presented The dynamic of the game for this kind of problems is given by a set of N different possible differential equations Multi-Plant problem with no information about the trajectory which is realized The problem is solved designing of min-max strategies for each player which guarantee an equilibrium for the worst case scenario The suggested approach was based on the Robust Maximum Principle, the conditions for a game to be in Robust Leader- Follower Equilibrium was given As in the Nash equilibrium case the initial min-max differential game is converted into a standard static game given in a multidimensional simplex The realization of the numerical procedure confirms the effectiveness of the suggested approach REFERENCES [] C Chen and J Cruz, Stackelberg solution for a two person games with biased information patterns, IEEE Transactions of Automatic Control, vol 7, no 6, pp , 97 [] M Simman and J Cruz, On the stackerberg strategy in nonzero games, Journal of Optimization Theory and Applications, vol, no 5, 973 [3], Additional aspects of the stackelberg strategy in nonzero-sum games, Journal of Optimization Theory and Applications, vol, no 6, 973 [4] T Basar and G Olsder, Dynamic Noncooperative Game Theory Academic Press, New York, 98 [5] B Tolwinski, A stackelberg solution of dynamic games, IEEE Transactions of Automatic Control, vol 8, pp 85 93, 983 [6] H Von Stackelberg, The theory of the market economy Oxford University Press, 95 [7] V Boltyanski and A Poznyak, Robust maximum principle in minimax control, International Journal of Control, vol 7, no 4, pp , 999 [8], Linear multi-model time optimization Optimal Control Applications and Methods, vol 3, pp 4 6, 00 [9] A Poznyak, T Duncan, B Pasik-Duncan, and V Boltyanski, Robust maximum principle for multi-model LQ-problem, International Journal of Control, vol 75, no 5, pp , 00 [0] H Khalil, Multimodel design of a nash strategy, Journal of Optimization Theory and Applications, vol 3, no 4, pp , 980 [] H Khalil and P Kokotovic, Control strategies for decision makers using different models of the same system, IEEE Transactions on Automatic Control, vol AC-3, no, pp 89 98, 978 [] V Saksena, J J Cruz, W R Perkins, and T Basar, Information induced multimodel solutions in multiple decisionmaker problems, IEEE Transactions on Automatic Control, vol AC-8, no 6, pp 76 78, 983 [3] M Jimenez-Lizarraga and A Poznyak, Robust nash equilibrium in multi-model lq differential games: Analysis and extraproximal numerical procedure, Optimal Control Applications and Methods, vol 8, no, pp 7 4, 007 [4] H Abou-Kandil, G Freiling, V Ionescu, and G Jank, Matrix Riccati Equations in Control and System Theory Birkhauser,

Min-Max Output Integral Sliding Mode Control for Multiplant Linear Uncertain Systems

Min-Max Output Integral Sliding Mode Control for Multiplant Linear Uncertain Systems Proceedings of the 27 American Control Conference Marriott Marquis Hotel at Times Square New York City, USA, July -3, 27 FrC.4 Min-Max Output Integral Sliding Mode Control for Multiplant Linear Uncertain

More information

An Application of Pontryagin s Maximum Principle in a Linear Quadratic Differential Game

An Application of Pontryagin s Maximum Principle in a Linear Quadratic Differential Game An Application of Pontryagin s Maximum Principle in a Linear Quadratic Differential Game Marzieh Khakestari (Corresponding author) Institute For Mathematical Research, Universiti Putra Malaysia, 43400

More information

Linear Quadratic Zero-Sum Two-Person Differential Games Pierre Bernhard June 15, 2013

Linear Quadratic Zero-Sum Two-Person Differential Games Pierre Bernhard June 15, 2013 Linear Quadratic Zero-Sum Two-Person Differential Games Pierre Bernhard June 15, 2013 Abstract As in optimal control theory, linear quadratic (LQ) differential games (DG) can be solved, even in high dimension,

More information

Generalized Riccati Equations Arising in Stochastic Games

Generalized Riccati Equations Arising in Stochastic Games Generalized Riccati Equations Arising in Stochastic Games Michael McAsey a a Department of Mathematics Bradley University Peoria IL 61625 USA mcasey@bradley.edu Libin Mou b b Department of Mathematics

More information

Floor Control (kn) Time (sec) Floor 5. Displacement (mm) Time (sec) Floor 5.

Floor Control (kn) Time (sec) Floor 5. Displacement (mm) Time (sec) Floor 5. DECENTRALIZED ROBUST H CONTROL OF MECHANICAL STRUCTURES. Introduction L. Bakule and J. Böhm Institute of Information Theory and Automation Academy of Sciences of the Czech Republic The results contributed

More information

Multi-Pursuer Single-Evader Differential Games with Limited Observations*

Multi-Pursuer Single-Evader Differential Games with Limited Observations* American Control Conference (ACC) Washington DC USA June 7-9 Multi-Pursuer Single- Differential Games with Limited Observations* Wei Lin Student Member IEEE Zhihua Qu Fellow IEEE and Marwan A. Simaan Life

More information

Noncooperative continuous-time Markov games

Noncooperative continuous-time Markov games Morfismos, Vol. 9, No. 1, 2005, pp. 39 54 Noncooperative continuous-time Markov games Héctor Jasso-Fuentes Abstract This work concerns noncooperative continuous-time Markov games with Polish state and

More information

arxiv: v1 [math.oc] 11 Dec 2017

arxiv: v1 [math.oc] 11 Dec 2017 A Non-Cooperative Game Approach to Autonomous Racing Alexander Liniger and John Lygeros Abstract arxiv:1712.03913v1 [math.oc] 11 Dec 2017 We consider autonomous racing of two cars and present an approach

More information

A Note on Open Loop Nash Equilibrium in Linear-State Differential Games

A Note on Open Loop Nash Equilibrium in Linear-State Differential Games Applied Mathematical Sciences, vol. 8, 2014, no. 145, 7239-7248 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ams.2014.49746 A Note on Open Loop Nash Equilibrium in Linear-State Differential

More information

Deterministic Dynamic Programming

Deterministic Dynamic Programming Deterministic Dynamic Programming 1 Value Function Consider the following optimal control problem in Mayer s form: V (t 0, x 0 ) = inf u U J(t 1, x(t 1 )) (1) subject to ẋ(t) = f(t, x(t), u(t)), x(t 0

More information

Algorithmic Game Theory and Applications. Lecture 4: 2-player zero-sum games, and the Minimax Theorem

Algorithmic Game Theory and Applications. Lecture 4: 2-player zero-sum games, and the Minimax Theorem Algorithmic Game Theory and Applications Lecture 4: 2-player zero-sum games, and the Minimax Theorem Kousha Etessami 2-person zero-sum games A finite 2-person zero-sum (2p-zs) strategic game Γ, is a strategic

More information

A New Algorithm for Solving Cross Coupled Algebraic Riccati Equations of Singularly Perturbed Nash Games

A New Algorithm for Solving Cross Coupled Algebraic Riccati Equations of Singularly Perturbed Nash Games A New Algorithm for Solving Cross Coupled Algebraic Riccati Equations of Singularly Perturbed Nash Games Hiroaki Mukaidani Hua Xu and Koichi Mizukami Faculty of Information Sciences Hiroshima City University

More information

Risk-Sensitive and Robust Mean Field Games

Risk-Sensitive and Robust Mean Field Games Risk-Sensitive and Robust Mean Field Games Tamer Başar Coordinated Science Laboratory Department of Electrical and Computer Engineering University of Illinois at Urbana-Champaign Urbana, IL - 6181 IPAM

More information

Linear Quadratic Zero-Sum Two-Person Differential Games

Linear Quadratic Zero-Sum Two-Person Differential Games Linear Quadratic Zero-Sum Two-Person Differential Games Pierre Bernhard To cite this version: Pierre Bernhard. Linear Quadratic Zero-Sum Two-Person Differential Games. Encyclopaedia of Systems and Control,

More information

EE291E/ME 290Q Lecture Notes 8. Optimal Control and Dynamic Games

EE291E/ME 290Q Lecture Notes 8. Optimal Control and Dynamic Games EE291E/ME 290Q Lecture Notes 8. Optimal Control and Dynamic Games S. S. Sastry REVISED March 29th There exist two main approaches to optimal control and dynamic games: 1. via the Calculus of Variations

More information

Feedback strategies for discrete-time linear-quadratic two-player descriptor games

Feedback strategies for discrete-time linear-quadratic two-player descriptor games Feedback strategies for discrete-time linear-quadratic two-player descriptor games Marc Jungers To cite this version: Marc Jungers. Feedback strategies for discrete-time linear-quadratic two-player descriptor

More information

Solving a Class of Generalized Nash Equilibrium Problems

Solving a Class of Generalized Nash Equilibrium Problems Journal of Mathematical Research with Applications May, 2013, Vol. 33, No. 3, pp. 372 378 DOI:10.3770/j.issn:2095-2651.2013.03.013 Http://jmre.dlut.edu.cn Solving a Class of Generalized Nash Equilibrium

More information

Chapter Introduction. A. Bensoussan

Chapter Introduction. A. Bensoussan Chapter 2 EXPLICIT SOLUTIONS OFLINEAR QUADRATIC DIFFERENTIAL GAMES A. Bensoussan International Center for Decision and Risk Analysis University of Texas at Dallas School of Management, P.O. Box 830688

More information

Game Theory Extra Lecture 1 (BoB)

Game Theory Extra Lecture 1 (BoB) Game Theory 2014 Extra Lecture 1 (BoB) Differential games Tools from optimal control Dynamic programming Hamilton-Jacobi-Bellman-Isaacs equation Zerosum linear quadratic games and H control Baser/Olsder,

More information

Static and Dynamic Optimization (42111)

Static and Dynamic Optimization (42111) Static and Dynamic Optimization (421) Niels Kjølstad Poulsen Build. 0b, room 01 Section for Dynamical Systems Dept. of Applied Mathematics and Computer Science The Technical University of Denmark Email:

More information

Optimization 4. GAME THEORY

Optimization 4. GAME THEORY Optimization GAME THEORY DPK Easter Term Saddle points of two-person zero-sum games We consider a game with two players Player I can choose one of m strategies, indexed by i =,, m and Player II can choose

More information

A DIFFERENTIAL GAMES APPROACH FOR ANALYSIS OF SPACECRAFT POST-DOCKING OPERATIONS

A DIFFERENTIAL GAMES APPROACH FOR ANALYSIS OF SPACECRAFT POST-DOCKING OPERATIONS A DIFFERENTIAL GAMES APPROACH FOR ANALYSIS OF SPACECRAFT POST-DOCKING OPERATIONS By TAKASHI HIRAMATSU A DISSERTATION PRESENTED TO THE GRADUATE SCHOOL OF THE UNIVERSITY OF FLORIDA IN PARTIAL FULFILLMENT

More information

An Introduction to Noncooperative Games

An Introduction to Noncooperative Games An Introduction to Noncooperative Games Alberto Bressan Department of Mathematics, Penn State University Alberto Bressan (Penn State) Noncooperative Games 1 / 29 introduction to non-cooperative games:

More information

Math-Net.Ru All Russian mathematical portal

Math-Net.Ru All Russian mathematical portal Math-Net.Ru All Russian mathematical portal Anna V. Tur, A time-consistent solution formula for linear-quadratic discrete-time dynamic games with nontransferable payoffs, Contributions to Game Theory and

More information

Formula Sheet for Optimal Control

Formula Sheet for Optimal Control Formula Sheet for Optimal Control Division of Optimization and Systems Theory Royal Institute of Technology 144 Stockholm, Sweden 23 December 1, 29 1 Dynamic Programming 11 Discrete Dynamic Programming

More information

Pontryagin s maximum principle

Pontryagin s maximum principle Pontryagin s maximum principle Emo Todorov Applied Mathematics and Computer Science & Engineering University of Washington Winter 2012 Emo Todorov (UW) AMATH/CSE 579, Winter 2012 Lecture 5 1 / 9 Pontryagin

More information

Sliding Mode Regulator as Solution to Optimal Control Problem for Nonlinear Polynomial Systems

Sliding Mode Regulator as Solution to Optimal Control Problem for Nonlinear Polynomial Systems 29 American Control Conference Hyatt Regency Riverfront, St. Louis, MO, USA June -2, 29 WeA3.5 Sliding Mode Regulator as Solution to Optimal Control Problem for Nonlinear Polynomial Systems Michael Basin

More information

Second Order Sufficient Conditions for Optimal Control Problems with Non-unique Minimizers

Second Order Sufficient Conditions for Optimal Control Problems with Non-unique Minimizers 2 American Control Conference Marriott Waterfront, Baltimore, MD, USA June 3-July 2, 2 WeA22. Second Order Sufficient Conditions for Optimal Control Problems with Non-unique Minimizers Christos Gavriel

More information

Genetic algorithm for closed-loop equilibrium of high-order linear quadratic dynamic games

Genetic algorithm for closed-loop equilibrium of high-order linear quadratic dynamic games Mathematics and Computers in Simulation 53 (2000) 39 47 Genetic algorithm for closed-loop equilibrium of high-order linear quadratic dynamic games Süheyla Özyıldırım Faculty of Business Administration,

More information

Adaptive State Feedback Nash Strategies for Linear Quadratic Discrete-Time Games

Adaptive State Feedback Nash Strategies for Linear Quadratic Discrete-Time Games Adaptive State Feedbac Nash Strategies for Linear Quadratic Discrete-Time Games Dan Shen and Jose B. Cruz, Jr. Intelligent Automation Inc., Rocville, MD 2858 USA (email: dshen@i-a-i.com). The Ohio State

More information

For general queries, contact

For general queries, contact PART I INTRODUCTION LECTURE Noncooperative Games This lecture uses several examples to introduce the key principles of noncooperative game theory Elements of a Game Cooperative vs Noncooperative Games:

More information

Discrete-Time H Gaussian Filter

Discrete-Time H Gaussian Filter Proceedings of the 17th World Congress The International Federation of Automatic Control Discrete-Time H Gaussian Filter Ali Tahmasebi and Xiang Chen Department of Electrical and Computer Engineering,

More information

4: Dynamic games. Concordia February 6, 2017

4: Dynamic games. Concordia February 6, 2017 INSE6441 Jia Yuan Yu 4: Dynamic games Concordia February 6, 2017 We introduce dynamic game with non-simultaneous moves. Example 0.1 (Ultimatum game). Divide class into two groups at random: Proposers,

More information

Hyperbolicity of Systems Describing Value Functions in Differential Games which Model Duopoly Problems. Joanna Zwierzchowska

Hyperbolicity of Systems Describing Value Functions in Differential Games which Model Duopoly Problems. Joanna Zwierzchowska Decision Making in Manufacturing and Services Vol. 9 05 No. pp. 89 00 Hyperbolicity of Systems Describing Value Functions in Differential Games which Model Duopoly Problems Joanna Zwierzchowska Abstract.

More information

Zero-Sum Games Public Strategies Minimax Theorem and Nash Equilibria Appendix. Zero-Sum Games. Algorithmic Game Theory.

Zero-Sum Games Public Strategies Minimax Theorem and Nash Equilibria Appendix. Zero-Sum Games. Algorithmic Game Theory. Public Strategies Minimax Theorem and Nash Equilibria Appendix 2013 Public Strategies Minimax Theorem and Nash Equilibria Appendix Definition Definition A zero-sum game is a strategic game, in which for

More information

Basics of Game Theory

Basics of Game Theory Basics of Game Theory Giacomo Bacci and Luca Sanguinetti Department of Information Engineering University of Pisa, Pisa, Italy {giacomo.bacci,luca.sanguinetti}@iet.unipi.it April - May, 2010 G. Bacci and

More information

Chapter 2 Optimal Control Problem

Chapter 2 Optimal Control Problem Chapter 2 Optimal Control Problem Optimal control of any process can be achieved either in open or closed loop. In the following two chapters we concentrate mainly on the first class. The first chapter

More information

Game of Defending a Target: A Linear Quadratic Differential Game Approach

Game of Defending a Target: A Linear Quadratic Differential Game Approach Game of Defending a Target: A Linear Quadratic Differential Game Approach Dongxu Li Jose B. Cruz, Jr. Department of Electrical and Computer Engineering, The Ohio State University, 5 Dreese Lab, 5 Neil

More information

Tilburg University. Uniqueness conditions for the affine open-loop linear quadratic differential games Engwerda, Jacob. Published in: Automatica

Tilburg University. Uniqueness conditions for the affine open-loop linear quadratic differential games Engwerda, Jacob. Published in: Automatica Tilburg University Uniqueness conditions for the affine open-loop linear quadratic differential games Engwerda, Jacob Published in: Automatica Publication date: 2008 Link to publication Citation for published

More information

Adaptive Nonlinear Model Predictive Control with Suboptimality and Stability Guarantees

Adaptive Nonlinear Model Predictive Control with Suboptimality and Stability Guarantees Adaptive Nonlinear Model Predictive Control with Suboptimality and Stability Guarantees Pontus Giselsson Department of Automatic Control LTH Lund University Box 118, SE-221 00 Lund, Sweden pontusg@control.lth.se

More information

Incentive-Based Pricing for Network Games with Complete and Incomplete Information

Incentive-Based Pricing for Network Games with Complete and Incomplete Information Incentive-Based Pricing for Network Games with Complete and Incomplete Information Hongxia Shen and Tamer Başar Coordinated Science Laboratory University of Illinois at Urbana-Champaign hshen1, tbasar@control.csl.uiuc.edu

More information

OPTIMAL CONTROL SYSTEMS

OPTIMAL CONTROL SYSTEMS SYSTEMS MIN-MAX Harry G. Kwatny Department of Mechanical Engineering & Mechanics Drexel University OUTLINE MIN-MAX CONTROL Problem Definition HJB Equation Example GAME THEORY Differential Games Isaacs

More information

First steps towards finding a solution of a dynamic investor-bank game

First steps towards finding a solution of a dynamic investor-bank game Delft University of Technology Delft Center for Systems and Control Technical report -36 First steps towards finding a solution of a dynamic investor-bank game K Staňková and B De Schutter If you want

More information

Min-Max Certainty Equivalence Principle and Differential Games

Min-Max Certainty Equivalence Principle and Differential Games Min-Max Certainty Equivalence Principle and Differential Games Pierre Bernhard and Alain Rapaport INRIA Sophia-Antipolis August 1994 Abstract This paper presents a version of the Certainty Equivalence

More information

H 1 optimisation. Is hoped that the practical advantages of receding horizon control might be combined with the robustness advantages of H 1 control.

H 1 optimisation. Is hoped that the practical advantages of receding horizon control might be combined with the robustness advantages of H 1 control. A game theoretic approach to moving horizon control Sanjay Lall and Keith Glover Abstract A control law is constructed for a linear time varying system by solving a two player zero sum dierential game

More information

MIN-MAX AND MIN-MIN STACKELBERG STRATEGIES WITH CLOSED-LOOP INFORMATION STRUCTURE

MIN-MAX AND MIN-MIN STACKELBERG STRATEGIES WITH CLOSED-LOOP INFORMATION STRUCTURE Journal of Dynamical and Control Systems, Vol. 17, No. 3, July 11, 387 45 c 11 MIN-MAX AND MIN-MIN STACKELBERG STRATEGIES WITH CLOSED-LOOP INFORMATION STRUCTURE M. JUNGERS, E. TRELAT, and H. ABOU-KANDIL

More information

Computing Minmax; Dominance

Computing Minmax; Dominance Computing Minmax; Dominance CPSC 532A Lecture 5 Computing Minmax; Dominance CPSC 532A Lecture 5, Slide 1 Lecture Overview 1 Recap 2 Linear Programming 3 Computational Problems Involving Maxmin 4 Domination

More information

Credibilistic Bi-Matrix Game

Credibilistic Bi-Matrix Game Journal of Uncertain Systems Vol.6, No.1, pp.71-80, 2012 Online at: www.jus.org.uk Credibilistic Bi-Matrix Game Prasanta Mula 1, Sankar Kumar Roy 2, 1 ISRO Satellite Centre, Old Airport Road, Vimanapura

More information

Computing Minmax; Dominance

Computing Minmax; Dominance Computing Minmax; Dominance CPSC 532A Lecture 5 Computing Minmax; Dominance CPSC 532A Lecture 5, Slide 1 Lecture Overview 1 Recap 2 Linear Programming 3 Computational Problems Involving Maxmin 4 Domination

More information

Game Theory. Professor Peter Cramton Economics 300

Game Theory. Professor Peter Cramton Economics 300 Game Theory Professor Peter Cramton Economics 300 Definition Game theory is the study of mathematical models of conflict and cooperation between intelligent and rational decision makers. Rational: each

More information

Dynamic Games with Asymmetric Information: Common Information Based Perfect Bayesian Equilibria and Sequential Decomposition

Dynamic Games with Asymmetric Information: Common Information Based Perfect Bayesian Equilibria and Sequential Decomposition Dynamic Games with Asymmetric Information: Common Information Based Perfect Bayesian Equilibria and Sequential Decomposition 1 arxiv:1510.07001v1 [cs.gt] 23 Oct 2015 Yi Ouyang, Hamidreza Tavafoghi and

More information

Feedback stabilisation with positive control of dissipative compartmental systems

Feedback stabilisation with positive control of dissipative compartmental systems Feedback stabilisation with positive control of dissipative compartmental systems G. Bastin and A. Provost Centre for Systems Engineering and Applied Mechanics (CESAME Université Catholique de Louvain

More information

Stackelberg equilibria for discrete-time dynamic games Part II: Stochastic games with deterministic information structure

Stackelberg equilibria for discrete-time dynamic games Part II: Stochastic games with deterministic information structure Delft University of Technology Delft Center for Systems and Control Technical report 0-06 Stacelberg equilibria for discrete-time dynamic games Part II: Stochastic games with deterministic information

More information

Noncausal Optimal Tracking of Linear Switched Systems

Noncausal Optimal Tracking of Linear Switched Systems Noncausal Optimal Tracking of Linear Switched Systems Gou Nakura Osaka University, Department of Engineering 2-1, Yamadaoka, Suita, Osaka, 565-0871, Japan nakura@watt.mech.eng.osaka-u.ac.jp Abstract. In

More information

Game Theory: Lecture 2

Game Theory: Lecture 2 Game Theory: Lecture 2 Tai-Wei Hu June 29, 2011 Outline Two-person zero-sum games normal-form games Minimax theorem Simplex method 1 2-person 0-sum games 1.1 2-Person Normal Form Games A 2-person normal

More information

Tilburg University. An equivalence result in linear-quadratic theory van den Broek, W.A.; Engwerda, Jacob; Schumacher, Hans. Published in: Automatica

Tilburg University. An equivalence result in linear-quadratic theory van den Broek, W.A.; Engwerda, Jacob; Schumacher, Hans. Published in: Automatica Tilburg University An equivalence result in linear-quadratic theory van den Broek, W.A.; Engwerda, Jacob; Schumacher, Hans Published in: Automatica Publication date: 2003 Link to publication Citation for

More information

General-sum games. I.e., pretend that the opponent is only trying to hurt you. If Column was trying to hurt Row, Column would play Left, so

General-sum games. I.e., pretend that the opponent is only trying to hurt you. If Column was trying to hurt Row, Column would play Left, so General-sum games You could still play a minimax strategy in general- sum games I.e., pretend that the opponent is only trying to hurt you But this is not rational: 0, 0 3, 1 1, 0 2, 1 If Column was trying

More information

Control Synthesis of Discrete Manufacturing Systems using Timed Finite Automata

Control Synthesis of Discrete Manufacturing Systems using Timed Finite Automata Control Synthesis of Discrete Manufacturing Systems using Timed Finite utomata JROSLV FOGEL Institute of Informatics Slovak cademy of Sciences ratislav Dúbravská 9, SLOVK REPULIC bstract: - n application

More information

Stable Matching Existence, Computation, Convergence Correlated Preferences. Stable Matching. Algorithmic Game Theory.

Stable Matching Existence, Computation, Convergence Correlated Preferences. Stable Matching. Algorithmic Game Theory. Existence, Computation, Convergence Correlated Preferences Existence, Computation, Convergence Correlated Preferences Stable Marriage Set of Women Y Set of Men X Existence, Computation, Convergence Correlated

More information

A sub-optimal second order sliding mode controller for systems with saturating actuators

A sub-optimal second order sliding mode controller for systems with saturating actuators 28 American Control Conference Westin Seattle Hotel, Seattle, Washington, USA June -3, 28 FrB2.5 A sub-optimal second order sliding mode for systems with saturating actuators Antonella Ferrara and Matteo

More information

Algorithmic Game Theory

Algorithmic Game Theory Nash Equilibria in Zero-Sum Games Algorithmic Game Theory Algorithmic game theory is not satisfied with merely an existence result for an important solution concept such as Nash equilibrium in mixed strategies.

More information

L 2 -induced Gains of Switched Systems and Classes of Switching Signals

L 2 -induced Gains of Switched Systems and Classes of Switching Signals L 2 -induced Gains of Switched Systems and Classes of Switching Signals Kenji Hirata and João P. Hespanha Abstract This paper addresses the L 2-induced gain analysis for switched linear systems. We exploit

More information

A Tropical Extremal Problem with Nonlinear Objective Function and Linear Inequality Constraints

A Tropical Extremal Problem with Nonlinear Objective Function and Linear Inequality Constraints A Tropical Extremal Problem with Nonlinear Objective Function and Linear Inequality Constraints NIKOLAI KRIVULIN Faculty of Mathematics and Mechanics St. Petersburg State University 28 Universitetsky Ave.,

More information

An introduction to Birkhoff normal form

An introduction to Birkhoff normal form An introduction to Birkhoff normal form Dario Bambusi Dipartimento di Matematica, Universitá di Milano via Saldini 50, 0133 Milano (Italy) 19.11.14 1 Introduction The aim of this note is to present an

More information

Scientiae Mathematicae Japonicae Online, Vol. 5, (2001), POSSIBILITY THAT THE CUSTOMERS GIVE UP PURCHASING THE MERCHANDISE

Scientiae Mathematicae Japonicae Online, Vol. 5, (2001), POSSIBILITY THAT THE CUSTOMERS GIVE UP PURCHASING THE MERCHANDISE Scientiae Mathematicae Japonicae Online, Vol. 5, (00), 7 79 7 A DUOPOLISTIC INVENTORY PROBLEM INCLUDING THE POSSIBILITY THAT THE CUSTOMERS GIVE UP PURCHASING THE MERCHANDISE HITOSHI HOHJO and YOSHINOBU

More information

OSNR Optimization in Optical Networks: Extension for Capacity Constraints

OSNR Optimization in Optical Networks: Extension for Capacity Constraints 5 American Control Conference June 8-5. Portland OR USA ThB3.6 OSNR Optimization in Optical Networks: Extension for Capacity Constraints Yan Pan and Lacra Pavel Abstract This paper builds on the OSNR model

More information

A Stackelberg Game Model of Dynamic Duopolistic Competition with Sticky Prices. Abstract

A Stackelberg Game Model of Dynamic Duopolistic Competition with Sticky Prices. Abstract A Stackelberg Game Model of Dynamic Duopolistic Competition with Sticky Prices Kenji Fujiwara School of Economics, Kwansei Gakuin University Abstract We develop the following Stackelberg game model of

More information

Electricity Market Modelling Using Dynamic LQ Games

Electricity Market Modelling Using Dynamic LQ Games XII International PhD Workshop OWD 1, 3-6 October 1 Electricity Market Modelling Using Dynamic LQ Games Michał Ganobis, Institute of Automatics, AGH ( 6.11.9, prof. Wojciech Mitkowski, University of Science

More information

Robust Stability. Robust stability against time-invariant and time-varying uncertainties. Parameter dependent Lyapunov functions

Robust Stability. Robust stability against time-invariant and time-varying uncertainties. Parameter dependent Lyapunov functions Robust Stability Robust stability against time-invariant and time-varying uncertainties Parameter dependent Lyapunov functions Semi-infinite LMI problems From nominal to robust performance 1/24 Time-Invariant

More information

Nash Bargaining Equilibria for Controllable Markov Chains Games

Nash Bargaining Equilibria for Controllable Markov Chains Games Preprints of the 0th World Congress The International Federation of Automatic Control Toulouse, France, July 9-4, 07 Nash Bargaining Equilibria for Controllable Markov Chains Games Kristal K. Trejo Julio

More information

arzelier

arzelier COURSE ON LMI OPTIMIZATION WITH APPLICATIONS IN CONTROL PART II.1 LMIs IN SYSTEMS CONTROL STATE-SPACE METHODS STABILITY ANALYSIS Didier HENRION www.laas.fr/ henrion henrion@laas.fr Denis ARZELIER www.laas.fr/

More information

OPERATIONS RESEARCH. Linear Programming Problem

OPERATIONS RESEARCH. Linear Programming Problem OPERATIONS RESEARCH Chapter 1 Linear Programming Problem Prof. Bibhas C. Giri Department of Mathematics Jadavpur University Kolkata, India Email: bcgiri.jumath@gmail.com MODULE - 2: Simplex Method for

More information

Stabilization of an Uncertain Simple Fishery Management Game Engwerda, Jacob

Stabilization of an Uncertain Simple Fishery Management Game Engwerda, Jacob Tilburg University Stabilization of an Uncertain Simple Fishery Management Game Engwerda, Jacob Document version: Early version, also known as pre-print Publication date: 27 Link to publication Citation

More information

L -Bounded Robust Control of Nonlinear Cascade Systems

L -Bounded Robust Control of Nonlinear Cascade Systems L -Bounded Robust Control of Nonlinear Cascade Systems Shoudong Huang M.R. James Z.P. Jiang August 19, 2004 Accepted by Systems & Control Letters Abstract In this paper, we consider the L -bounded robust

More information

DOMAIN OF ATTRACTION: ESTIMATES FOR NON-POLYNOMIAL SYSTEMS VIA LMIS. Graziano Chesi

DOMAIN OF ATTRACTION: ESTIMATES FOR NON-POLYNOMIAL SYSTEMS VIA LMIS. Graziano Chesi DOMAIN OF ATTRACTION: ESTIMATES FOR NON-POLYNOMIAL SYSTEMS VIA LMIS Graziano Chesi Dipartimento di Ingegneria dell Informazione Università di Siena Email: chesi@dii.unisi.it Abstract: Estimating the Domain

More information

Sufficient Statistics in Decentralized Decision-Making Problems

Sufficient Statistics in Decentralized Decision-Making Problems Sufficient Statistics in Decentralized Decision-Making Problems Ashutosh Nayyar University of Southern California Feb 8, 05 / 46 Decentralized Systems Transportation Networks Communication Networks Networked

More information

A Stable Block Model Predictive Control with Variable Implementation Horizon

A Stable Block Model Predictive Control with Variable Implementation Horizon American Control Conference June 8-,. Portland, OR, USA WeB9. A Stable Block Model Predictive Control with Variable Implementation Horizon Jing Sun, Shuhao Chen, Ilya Kolmanovsky Abstract In this paper,

More information

H -Optimal Control and Related Minimax Design Problems

H -Optimal Control and Related Minimax Design Problems Tamer Başar Pierre Bernhard H -Optimal Control and Related Minimax Design Problems A Dynamic Game Approach Second Edition 1995 Birkhäuser Boston Basel Berlin Contents Preface v 1 A General Introduction

More information

and the nite horizon cost index with the nite terminal weighting matrix F > : N?1 X J(z r ; u; w) = [z(n)? z r (N)] T F [z(n)? z r (N)] + t= [kz? z r

and the nite horizon cost index with the nite terminal weighting matrix F > : N?1 X J(z r ; u; w) = [z(n)? z r (N)] T F [z(n)? z r (N)] + t= [kz? z r Intervalwise Receding Horizon H 1 -Tracking Control for Discrete Linear Periodic Systems Ki Baek Kim, Jae-Won Lee, Young Il. Lee, and Wook Hyun Kwon School of Electrical Engineering Seoul National University,

More information

Closed-Loop Impulse Control of Oscillating Systems

Closed-Loop Impulse Control of Oscillating Systems Closed-Loop Impulse Control of Oscillating Systems A. N. Daryin and A. B. Kurzhanski Moscow State (Lomonosov) University Faculty of Computational Mathematics and Cybernetics Periodic Control Systems, 2007

More information

Nonlinear Control Design for Linear Differential Inclusions via Convex Hull Quadratic Lyapunov Functions

Nonlinear Control Design for Linear Differential Inclusions via Convex Hull Quadratic Lyapunov Functions Nonlinear Control Design for Linear Differential Inclusions via Convex Hull Quadratic Lyapunov Functions Tingshu Hu Abstract This paper presents a nonlinear control design method for robust stabilization

More information

Dual Interpretations and Duality Applications (continued)

Dual Interpretations and Duality Applications (continued) Dual Interpretations and Duality Applications (continued) Yinyu Ye Department of Management Science and Engineering Stanford University Stanford, CA 94305, U.S.A. http://www.stanford.edu/ yyye (LY, Chapters

More information

Game theory Lecture 19. Dynamic games. Game theory

Game theory Lecture 19. Dynamic games. Game theory Lecture 9. Dynamic games . Introduction Definition. A dynamic game is a game Γ =< N, x, {U i } n i=, {H i } n i= >, where N = {, 2,..., n} denotes the set of players, x (t) = f (x, u,..., u n, t), x(0)

More information

Stability Analysis for Linear Systems under State Constraints

Stability Analysis for Linear Systems under State Constraints Stabilit Analsis for Linear Sstems under State Constraints Haijun Fang Abstract This paper revisits the problem of stabilit analsis for linear sstems under state constraints New and less conservative sufficient

More information

Robust control and applications in economic theory

Robust control and applications in economic theory Robust control and applications in economic theory In honour of Professor Emeritus Grigoris Kalogeropoulos on the occasion of his retirement A. N. Yannacopoulos Department of Statistics AUEB 24 May 2013

More information

Game Theory and its Applications to Networks - Part I: Strict Competition

Game Theory and its Applications to Networks - Part I: Strict Competition Game Theory and its Applications to Networks - Part I: Strict Competition Corinne Touati Master ENS Lyon, Fall 200 What is Game Theory and what is it for? Definition (Roger Myerson, Game Theory, Analysis

More information

A Novel Integral-Based Event Triggering Control for Linear Time-Invariant Systems

A Novel Integral-Based Event Triggering Control for Linear Time-Invariant Systems 53rd IEEE Conference on Decision and Control December 15-17, 2014. Los Angeles, California, USA A Novel Integral-Based Event Triggering Control for Linear Time-Invariant Systems Seyed Hossein Mousavi 1,

More information

The servo problem for piecewise linear systems

The servo problem for piecewise linear systems The servo problem for piecewise linear systems Stefan Solyom and Anders Rantzer Department of Automatic Control Lund Institute of Technology Box 8, S-22 Lund Sweden {stefan rantzer}@control.lth.se Abstract

More information

McGill University Department of Electrical and Computer Engineering

McGill University Department of Electrical and Computer Engineering McGill University Department of Electrical and Computer Engineering ECSE 56 - Stochastic Control Project Report Professor Aditya Mahajan Team Decision Theory and Information Structures in Optimal Control

More information

CONVERGENCE PROOF FOR RECURSIVE SOLUTION OF LINEAR-QUADRATIC NASH GAMES FOR QUASI-SINGULARLY PERTURBED SYSTEMS. S. Koskie, D. Skataric and B.

CONVERGENCE PROOF FOR RECURSIVE SOLUTION OF LINEAR-QUADRATIC NASH GAMES FOR QUASI-SINGULARLY PERTURBED SYSTEMS. S. Koskie, D. Skataric and B. To appear in Dynamics of Continuous, Discrete and Impulsive Systems http:monotone.uwaterloo.ca/ journal CONVERGENCE PROOF FOR RECURSIVE SOLUTION OF LINEAR-QUADRATIC NASH GAMES FOR QUASI-SINGULARLY PERTURBED

More information

Contraction Based Adaptive Control of a Class of Nonlinear Systems

Contraction Based Adaptive Control of a Class of Nonlinear Systems 9 American Control Conference Hyatt Regency Riverfront, St. Louis, MO, USA June -, 9 WeB4.5 Contraction Based Adaptive Control of a Class of Nonlinear Systems B. B. Sharma and I. N. Kar, Member IEEE Abstract

More information

ECE Optimization for wireless networks Final. minimize f o (x) s.t. Ax = b,

ECE Optimization for wireless networks Final. minimize f o (x) s.t. Ax = b, ECE 788 - Optimization for wireless networks Final Please provide clear and complete answers. PART I: Questions - Q.. Discuss an iterative algorithm that converges to the solution of the problem minimize

More information

Near-Potential Games: Geometry and Dynamics

Near-Potential Games: Geometry and Dynamics Near-Potential Games: Geometry and Dynamics Ozan Candogan, Asuman Ozdaglar and Pablo A. Parrilo January 29, 2012 Abstract Potential games are a special class of games for which many adaptive user dynamics

More information

Optimal Control Feedback Nash in The Scalar Infinite Non-cooperative Dynamic Game with Discount Factor

Optimal Control Feedback Nash in The Scalar Infinite Non-cooperative Dynamic Game with Discount Factor Global Journal of Pure and Applied Mathematics. ISSN 0973-1768 Volume 12, Number 4 (2016), pp. 3463-3468 Research India Publications http://www.ripublication.com/gjpam.htm Optimal Control Feedback Nash

More information

arxiv: v1 [math.pr] 18 Oct 2016

arxiv: v1 [math.pr] 18 Oct 2016 An Alternative Approach to Mean Field Game with Major and Minor Players, and Applications to Herders Impacts arxiv:161.544v1 [math.pr 18 Oct 216 Rene Carmona August 7, 218 Abstract Peiqi Wang The goal

More information

A Stackelberg Network Game with a Large Number of Followers 1,2,3

A Stackelberg Network Game with a Large Number of Followers 1,2,3 A Stackelberg Network Game with a Large Number of Followers,,3 T. BAŞAR 4 and R. SRIKANT 5 Communicated by M. Simaan Abstract. We consider a hierarchical network game with multiple links, a single service

More information

Model reduction for linear systems by balancing

Model reduction for linear systems by balancing Model reduction for linear systems by balancing Bart Besselink Jan C. Willems Center for Systems and Control Johann Bernoulli Institute for Mathematics and Computer Science University of Groningen, Groningen,

More information

Convex Sets Strict Separation. in the Minimax Theorem

Convex Sets Strict Separation. in the Minimax Theorem Applied Mathematical Sciences, Vol. 8, 2014, no. 36, 1781-1787 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ams.2014.4271 Convex Sets Strict Separation in the Minimax Theorem M. A. M. Ferreira

More information

Riccati difference equations to non linear extended Kalman filter constraints

Riccati difference equations to non linear extended Kalman filter constraints International Journal of Scientific & Engineering Research Volume 3, Issue 12, December-2012 1 Riccati difference equations to non linear extended Kalman filter constraints Abstract Elizabeth.S 1 & Jothilakshmi.R

More information

Two hours UNIVERSITY OF MANCHESTER SCHOOL OF COMPUTER SCIENCE. Date: Thursday 17th May 2018 Time: 09:45-11:45. Please answer all Questions.

Two hours UNIVERSITY OF MANCHESTER SCHOOL OF COMPUTER SCIENCE. Date: Thursday 17th May 2018 Time: 09:45-11:45. Please answer all Questions. COMP 34120 Two hours UNIVERSITY OF MANCHESTER SCHOOL OF COMPUTER SCIENCE AI and Games Date: Thursday 17th May 2018 Time: 09:45-11:45 Please answer all Questions. Use a SEPARATE answerbook for each SECTION

More information