Stackelberg equilibria for discrete-time dynamic games Part II: Stochastic games with deterministic information structure

Size: px
Start display at page:

Download "Stackelberg equilibria for discrete-time dynamic games Part II: Stochastic games with deterministic information structure"

Transcription

1 Delft University of Technology Delft Center for Systems and Control Technical report 0-06 Stacelberg equilibria for discrete-time dynamic games Part II: Stochastic games with deterministic information structure K. Staňová and B. De Schutter If you want to cite this report, please use the following reference instead: K. Staňová and B. De Schutter, Stacelberg equilibria for discrete-time dynamic games Part II: Stochastic games with deterministic information structure, Proceedings of the 0 IEEE International Conference on Networing, Sensing and Control, Delft, The Netherlands, pp , Apr. 0. Delft Center for Systems and Control Delft University of Technology Meelweg, 68 CD Delft The Netherlands phone: secretary fax: URL: This report can also be downloaded viahttp://pub.deschutter.info/abs/0_06.html

2 Stacelberg Equilibria for Discrete-Time Dynamic Games Part II: Stochastic Games with Deterministic Information Structure Kateřina Staňová, Bart De Schutter Abstract We consider a two-person discrete-time dynamic game with a prespecified fixed duration. Each player maximizes her profit over the game horizon, taing decisions of the other player into account. Our goal is to find the Stacelberg equilibria for such a game. After having discussed deterministic Stacelberg games in the companion paper Stacelberg Equilibria for Discrete-Time Dynamic Games Part I: Deterministic Games, in this paper we focus on stochastic games with a deterministic information structure. While for the stochastic game with open-loop structure the solution procedure is straightforward and already reported in the literature, the problem with the closed-loop problem information structure for stochastic games remains a challenge. After discussing a rather standard approach to solve the open-loop stochastic game, we propose an approach to find suboptimal solutions of the closed-loop game. Moreover, we discuss solution approach for generalized games in which the leader has access to the follower s past actions, the so-called inverse Stacelberg games. Keywords: stochastic games, discrete-time infinite dynamic games, Stacelberg games, information structure, team problems I. INTRODUCTION & LITERATURE OVERVIEW This paper extends results of the companion paper Stacelberg Equilibria for Discrete-Time Dynamic Games Part I: Deterministic Games into the realm of stochastic games with a deterministic information structure. In such a game there is a noise in the state equation, but the players do not have a biased perception of the states of the game. The game is referred to as infinite, because the decision spaces of the players comprise an infinite number of alternatives. We focus on the noncooperative variant of this game [], [], in which the goals of individual players might be conflicting. More specifically, we deal with Stacelberg problems [] [6] as opposed to Nash problems [7], [8]. The open-loop Stacelberg solution concept in the infinite discrete-time dynamic games was first treated in [9]. Some other references that discuss the open-loop and the feedbac Stacelberg solutions in discrete-time infinite dynamic games are [3], [0], []. Applications of this concept in microeconomics can be found in []. In this paper we show a standard method how to solve open-loop stochastic games. This paper extends the theory of Stacelberg equilibria for stochastic games presented in [], [] under a closedloop information pattern. Not much attention has been given to such problems. The difficulties encountered when dealing with the closed-loop stochastic variant of the game were first K. Staňová and B. De Schutter are with the Delft Center for Systems & Control, Delft University of Technology, The Netherlands..stanova@tudelft.nl, b.deschutter@tudelft.nl pointed out in [3] and are stated in [], without proposing a solution strategy. In [4] the feedbac strategy solutions in linear-quadratic stochastic dynamic games with noisy observations are discussed. However, stochastic games with a closed-loop information structure remain a challenge. In this paper we suggest a methodology to find a suboptimal solution of such games. Moreover, we propose a solution procedure for a generalized variant of the game in which the leader has access to the follower s past actions. This paper is composed as follows. In Section II basic notions used in this paper are introduced. In Section III the open-loop variant of the stochastic game is dealt with. In Section IV we study the closed-loop variant of the game. In Section V the generalized variant of the stochastic game, also referred to as the inverse or reverse Stacelberg game, is considered. In Section VI the conclusions and possibilities for future research are discussed. II. PRELIMINARIES Let us consider a two-player Stacelberg game, in which P is the leader and P is the follower. Let K ={,...,K} denote the stages of the game, where K is the maximal number of moves the player P i i {,} is allowed to mae. Let X R and X R be nonempty closed intervals and let X def = X X be the state space of the game. The state of the game for the -th stage is then referred to as x = x,x T, where x is P s state and x is P s state. Let U R for each K be a closed interval called P s decision space. Its elements are P s permissible decisions at stage, announced by P at the beginning of the stage. Let U R for each K be P s decision space. Its elements are P s permissible decisions u at stage. In this paper we will extend the game to the stochastic infinite discrete-time dynamic game by adding an additional player, the so-called nature, into the set of players. The nature acts randomly, but according to an a priori nown probability law. The state of the game evolves according to the difference equation u x + F x,u,u,θ, K, with F : X U U Θ X. Here θ Θ is the decision variable of the nature at stage. The initial state x is then also a random variable and the joint probability distribution function of x,θ,...,θ K is assumed to be nown. For the sae of simplicity, in this paper we will consider a situation with F x,u,u def = f x,u,u + θ.

3 Then becomes x + f x,u,u + θ, K, with state function f : X U U X, where θ,...,θ K is a sequence of statistically i.i.d. Gaussian vectors with values in R and with covθ >0, K. Let the information gained and recalled by P i i {,} at stage of the game be determined by η i, an a priori nown selection from x,...,x ; x,..., x. Specifications of η i for all K characterize the information structure of the game for P i. Let N i def = { η i T } i. Let Γ be a prespecified class of measurable mappings γ i : N i U i, called P i s permissible strategies at stage. The aggregate mapping γ i = γ i,γi,...,γi K is Pi s strategy, and the class Γ i of all such mappings γ i so that γ i Γ i, K, is P i s strategy set. We will refer to a P s strategy based on the P s strategy as the P s response to the P s strategy. Definition.: Information structure In a -person discrete time dynamic game, we say that P i s i {,} information has for all K a an open-loop structure if η i = x, b a closed-loop structure if η i = x,...,x, c a feedbac structure if η i = x. In this paper we will consider cases a and b, case c is a subject of the future research. Let L i : X U U X U U... X U U R be called Pi s profit function. Each player maximizes L i, taing into account possible actions of the other player. A. Assumptions on state and profit functions In order to simplify the analysis, we will mostly assume unless stated differently, the following: The functions f,u,u, f x,,u, and f,u, are continuously differentiable on R +, U, and U, respectively, The profit functions are assumed to be stage-additive, i.e., there exists g i : X X U U R, for all K, so that L i u,u = K = g i x,u,u,x +, 3 i {,}, and continuously differentiable on U U. Furthermore, g i,u,u, and g i x,u,,x + are continuously differentiable on R and U, respectively. B. Stochastic game formulation An extensive form description of the game contains the set of players, the index set defining the stages of the game, the state space and the decision spaces, the state equation, the observation sets, the state-observation equation, For the sae of notation convenience we refer to the profit functions as L i u,u, with u i def = u i T.,...,ui K the information structure of the game, the information spaces, the strategy sets, and the profit functionals. Similarly as it is done in [] for a general N-person discrete-time game, we can transform the game into a normal-form game. For each fixed initial state x and for each pair γ,γ, where γ i Γ i, i {,}, the extensive form description leads to a unique vector u i def = γ i i η, i {,}, K, 4 because of the causal nature of the information structure and because the state evolves according to a stochastic difference equation. Then, substitution of 4 into L and L leads to a pair of functions reflecting the corresponding profits of the players. This further implies existence of a composite mapping J i : Γ Γ R for each i {,}, which is the strategy-dependent profit function. Then the permissible strategy spaces Γ,Γ together with J γ,γ,j γ,γ constitute the normal form description of the game for each fixed initial state x. If the game can be expressed in the normal form, techniques used for finding solutions of finite games can be adopted []. III. OPEN-LOOP GAME This section summarizes results presented in [], [4]. Consider that the system evolves according to x + f x,u,u + θ, K, 5 where θ,...,θ K is a sequence of statistically i.i.d. Gaussian vectors with values in R and with covθ >0, K. The profit functionals for P and P are given by 3. A. Open-loop information pattern for both players If the information has an open-loop structure for both players and is nown a priori, the solution can be found by converting the game into equivalent static normal form and by consequent utilizing methods used for finding the deterministic open-loop Stacelberg solution. Indeed, we can recursively substitute 5 into 3 and tae expected values of L i over the random variables θ,...,θ K to obtain functionals L i depending only on the players decisions u and u and on x. The game can be then treated as a static game. The solution depends on the statistical moments of the random disturbances in the state equation unless the system equation is linear and the profit functions are quadratic []. In such a case the solution may be independent of the disturbances.

4 B. Open-loop information pattern for the leader and closedloop information pattern for the follower For the deterministic Stacelberg games dealt with in the companion paper the optimal open-loop Stacelberg equilibrium strategy of P does not change if P has an additional state information. The optimal strategy of P then becomes any closed-loop representation of the open-loop response on the equilibrium trajectory associated with the open-loop Stacelberg solution; hence, the optimal strategy of P is nonunique. However, for the stochastic Stacelberg game with an open-loop information structure for both P and P the solution does not coincide with the solution of the game in which P has an open-loop information structure and P has a closed-loop information structure and the latter has to be obtained independently of the former. The steps to solve the game with the open-loop information structure for P and closed-loop information structure for P are see [] for its derivation: For any u U K maximize E K g x,u,u,x + u = γ x,...,x = subject to and over Γ. We will denote the maximizing strategy of P by γ o. Following [5], [6], we can see that with η def = x l,l any γ o satisfies the dynamic programming principle V,x= max E [ g x,u,u, f x,u,u with u U + θ +V +, f x,u =E [ g x,u,γ o η, f x,u η + θ,γ o +V +, f x,u V,x def = max Maximize u U E [ K i=,γ o,u + θ ] ] η + θ, 6 g i xi,u,u ],x i+. J γ,γ o =E [ L u,u u = γ x, u = γ o ] η, K over Γ, subject to 6 and the state equation with u replaced by γ -dependent γ o η. The solution of this optimization problem constitutes the Stacelberg strategy of the leader in the stochastic dynamic game under consideration, provided that the maximization of V provides a unique solution o γ,...,γ o K for each u,...,u K. Note that γ o is dependent on γ, but this dependence cannot be in general written in the closed-form. Steps and provide a straightforward procedure for finding Stacelberg equilibrium of the game defined by, 3. IV. CLOSED-LOOP STOCHASTIC GAME If P has access to dynamic information, but does not have direct access to decisions of P, a direct approach to find the solution cannot be applied, as any optimal response of P to a strategy announced by P cannot be expressed analytically in terms of strategy of P, even in the case of linear-quadratic games. Moreover, the indirect method proposed in the companion paper solving the deterministic game cannot be used here, since for stochastic game each strategy has a unique representation as opposed to the infinitely many closed-loop representations of a given strategy in a deterministic system. Consequently, derivation of closedloop solutions of stochastic dynamic games remains a challenge. If, however, we mae some structural assumptions on possible strategies of P - which means seeing suboptimal solutions instead of optimal solutions, then the problem may become tractable. In particular, if, under the structural assumptions on Γ, restricting Γ to a certain function set, the class of the permissible strategies of P can be described by a finite number of parameters, and if P s optimal response can be determined as a function of these parameters, then the original game may transform into the static one in which P selects her strategy from an Euclidian space of the corresponding dimension; such a static game is, in general, solvable [7], although more liely numerically than analytically. The following example illustrates such an approach and shows that even with very simple stochastic dynamic games and very restrictive assumptions on Γ, the corresponding sub-optimal solution methodology is not trivial. Example 4. Two-stage stochastic game: Let L = x 3 u, u L = x 3 u, x = x u + θ, x 3 = x u + θ, 7 where θ and θ are i.i.d. random variables with zero mean and variances σ, σ and x is nown a priori, and assume that the strategies of P are linear in x, i.e., we restrict ourselves to the strategies Γ ={γ γ x,x = ρ x + ρ x }. 8 Player P acts at the stage and has access to x and x, player P acts at stage and has only access to x. Player P maximizes J =E { x γ x,x γ x }, over γ Γ in order to determine her optimal response to any γ Γ chosen by P. With Γ defined by 8, the problem is equivalent to the problem of finding optimal ρ and ρ, which are x -

5 dependent. Under such restriction J has a unique maximum, leading to the optimal strategy for P γ,o = ρ ρ ρ x ρ ρ. + By substituting γ and γ,o into J γ,γ, together with the corresponding values of x 3 and x, we obtain x ρ ρ Fρ,ρ = + + ρ ρ σ ρ ρ + ρ ρ ρ x + ρ σ, which has to be maximized over ρ and ρ for fixed x. Note that: F is continuous in both ρ and ρ, Fρ,ρ 0 for all ρ,ρ R, and Fρ,ρ if ρ, ρ. Consequently by the Weierstrass theorem [8], there exists at least one pair ρ,ρ maximizing F for any given x,σ. The optimal ρ,ρ depends on x,σ, but not on σ. Hence, for each fixed x,σ the function Fρ,ρ can be maximized using classical optimization methods [9]. γ x,x = ρ x + ρ x is a suboptimal strategy, i.e., it does not lead to the team maximum of the game for P. V. GENERALIZED STOCHASTIC STACKELBERG GAME Let us now assume that P has information about the past states of the game and the past decisions of P. With such extended information structure a given strategy of P will have multiple representations, thus it might be possible to enforce her team maximum J γ,γ = max γ Γ max γ Γ J γ,γ. The goal of P is to find an optimal representation of the team-optimal strategy, i.e., the strategy which enforces the team maximum. Such a strategy may be dependent on decisions made by P. Example 5.: Let L = x 3 u, u L = x 3 u, x = x u + θ, x 3 = x u + θ. 9 Let P has access to u, in addition to x and x. By this way, the game is generalized into a so-called inverse Stacelberg game [0] []. The best outcome that P can achieve is the team maximum J γ,γ = max γ Γ max γ Γ J γ,γ, with J γ,γ =E { x γ x,x θ + γ x,x γ x }. Note that the profit shows dependence on u = γ x not only directly, but also through x. This team problem is in fact a linear-quadratic stochastic control problem [4] and the team-optimum strategies are γ = x 3, γ = 5x 4, 0 which is the unique maximizing pair on Γ Γ. It is not, however, unique in the extended strategy space of P, as for example the following parametrized strategy also constitutes an optimal solution, with κ R: γ κ x,x,u x = 3 + κ u 5 4 x, γ κ = 5x 4, which characterizes the class of optimal strategies linear in x, x, and u, all leading to the same maximum expected value for P. We will refer to these strategies as representations of γ under the team-optimum solution γ,γ. Among this family of strategy pairs we are looing for the one with the additional property: If P maximizes her expected profit function, then the strategy in Γ that leads to this maximum is still γ, resp. γ κ. The corresponding strategy for P would then correspond to the global inverse Stacelberg solution, yielding the team maximum profit for P. Let us now focus on linear representations, which leads to the quadratic maximization problem: E x γ α x,x,u θ + u, with x = x u + θ. Since x is independent of θ and θ, which have both zero mean, this problem is equivalent to the following deterministic optimization problem: max u u R x α u x u The solution to this problem is the pair γ x,x,u = x u x, γ 5 x = 4 x, which constitutes a global inverse Stacelberg solution. This is, in fact, the unique solution within the class of linear strategies. Remar 5.: In Example 5. P has access to all information that P has access to. However, in stochastic inverse Stacelberg problems the information may not always be nested for the leader. If in this example P had access to x and u, then the problem would be of a nonnested information structure. To such problems the methodology

6 shown in the example does not apply, since the dynamic information for P no longer exhibits redundancy. While such problems have not been so far studied in detail, they were discussed in [3], [4] and are also a subject of our future research. For stochastic inverse Stacelberg problems with nested information the approach used in Example 5. can be generalized as follows. Consider a two-person stochastic inverse Stacelberg problem with the profit functions L u,u,θ and L u,u,θ for P and P, respectively, where θ is some random vector with a nown probability distribution function. Let y = h θ be the estimate of P on Θ and y = h θ be the estimate of P on Θ, with the property, that what P nows is also nown by P, i.e., the sigma-field generated by y includes the sigma-field generated by y. Let Γ i be the set of all measurable strategies of the form u i = γ i y i, i=,, and ˆΓ be the set of all measurable policies of the form u = γ y,u. Let us introduce the pair of policies γ,,γ, def = arg max γ Γ,γ Γ E θ { L γ h θ, γ h θ,θ }, assuming that the underlying team problem admits a maximizing solution. Then an optimal representation of P s strategy γ, is ˆγ ˆΓ satisfying ˆγ h θ,γ, h θ = γ, h θ a.s. The following result now follows. Proposition 5.: For the stochastic inverse Stacelberg problem with nested information as formulated above, the pair ˆγ,γ, constitutes the team maximum outcome for P. Equivalently, if a strategy ˆγ ˆΓ satisfies, the optimum of the underlying stochastic decision problem for the leader exists. VI. CONCLUSIONS & FUTURE RESEARCH In this paper we have introduced a stochastic variant of discrete-time infinite dynamic games and have dealt with finding their Stacelberg equilibrium solutions. Such solutions depend on the information patterns of the games and vary with the characteristics of the individual problems. We have reviewed classical approaches used to solve the games with open-loop information structure and proposed an approach to find a suboptimum of the games with closedloop information structure. Moreover, we have shown how to find a solution to a generalized variant of the game with the closed-loop information structure, in which the leader has access to the follower s past actions, the so-called inverse Stacelberg game. The main subjects of future research are finding solutions of the generalized stochastic game with a nonnested information structure and study of the stochastic games with feedbac information structure. ACKNOWLEDGMENTS This research has been supported by the European 7th framewor STREP project Hierarchical and distributed model predictive control of large-scale systems HD-MPC, contract number INFSO-ICT-3854, and by the European 7th Framewor Networ of Excellence Highly-complex and networed control systems HYCON. The authors would lie to than dr. Ján Buša from the Department of Mathematics, Faculty of Electrical Engineering & Computer Science, The Technical University of Košice, for his helpful insights. REFERENCES [] A. Perea y Monsuwé, Information Structures in Non-Cooperative Games. Maastricht, The Netherlands: Unigraphic, 997. [] T. Başar and G. J. Olsder, Dynamic Noncooperative Game Theory. Philadelphia, Pennsylvania: SIAM, 999. [3] F. Kydland, Noncooperative and dominant player solutions in discrete dynamic games, International Journal of Economic Review, vol. 6, pp , 975. [4] A. Bagchi, Stacelberg Differential Games in Economical Models. Berlin, Germany: Springer-Verlag, 984. [5] H. Peters, Game Theory: A Multi-Leveled Approach. Dordrecht, The Netherlands: Springer, 008. [6] K. Staňová, On Stacelberg and Inverse Stacelberg Games & Their Applications in the Optimal Toll Design Problem, the Energy Maret Liberalization Problem, and in the Theory of Incentives, Ph.D. dissertation, Delft University of Technology, Delft, The Netherlands, 009. [7] J. Nash, Noncooperative games, Annals of Mathematics, vol. 54, pp , 95. [8] G. Olsder, Adaptive Nash strategies for repeated games resulting in Pareto solutions, Delft University of Technology, Department of Mathematics and Informatics, Reports of the Department of Mathematics and Informatics 86-09, 986. [9] M. Simaan and J. B. Cruz, Additional aspects of the Stacelberg strategy in the nonzero sum games, Journal of Optimization Theory and Applications, vol., pp , 973. [0] J. B. Cruz, Leader-follower strategies for multilevel systems, IEEE Transactions on Automatic Control, vol. AC-3, pp , 978. [] T. Başar, Information structures and equilibria in dynamic games, in New Trends in Dynamic System Theory and Economics, M. Aoi and A. Marzollo, Eds. New Yor and London: Academic Press, 979, pp [] K. Ouguchi, Expactations and Stability in Oligopoly Models, ser. Lecture Notes in Economics and Mathematical Systems. Berlin: Springer-Verlag, 976. [3] D. Castanon, Equilibria in stochastic dynamic games of stacelberg type, Ph.D. dissertation, M.I.T. Electronics Systems Laboratory, Cambridge, MA, 976. [4] T. Başar, Stochastic stagewise stacelberg strategies for linearquadratic systems, in Stochastic Control Theory and Stochastic Differential Systems, ser. Lecture Notes in Control and Information Sciences, M. Kohlmann and W. Vogel, Eds. New Yor: Springer-Verlag, 979, ch. 6, pp [5] R. Bellman, Dynamic Programming. Princeton, New Jersey: Princeton University Press, 957. [6] M. L. Puterman, Marov Decision Processes: Discrete Stochastic Dynamic Programming. New Yor: John Wiley & Sons, Inc., 994. [7] M. Bardi, T. E. S. Raghavan, and T. Parthasarathy, Eds., Stochastic and Differential Games: Theory and Numerical Methods, ser. Annals of International Society of Dynamic Games. Boston: Birhäuser, 999, vol. 4. [8] W. Rudin, Functional analysis. New Yor: McGraw-Hill, 973. [9] P. E. Gill, W. Murray, and M. H. Wright, Practical optimization. London, UK: Academic Press, 004. [0] G. J. Olsder, Phenomena in inverse Stacelberg problems, Mathematisches Forschungsinstitut Oberwolfach, Oberwolfach, Germany, Tech. Rep. /005, 005.

7 [] K. Staňová and G. J. Olsder, Inverse Stacelberg games versus adverse-selection principal-agent model theory, in Proceedings of the th International Symposium on Dynamic Games and Applications, Sophia-Antipolis, France, 006. [] K. Staňová, M. C. J. Bliemer, and G. J. Olsder, Inverse Stacelberg games and their application to dynamic bilevel optimal toll design problem, in Proc. th International Symposium on Dynamic Games and Applications, 006. [3] T. Başar, Affine incentive schemes for stochastic systems with dynamic information, SIAM Journal on Scientific and Statistical Computing, vol., pp. 99 0, 984. [4], Stochastic incentive problems with partial dynamic information and multiple levels of hierarchy, European Journal of Political Economy, pp. 03 7, 989.

First steps towards finding a solution of a dynamic investor-bank game

First steps towards finding a solution of a dynamic investor-bank game Delft University of Technology Delft Center for Systems and Control Technical report -36 First steps towards finding a solution of a dynamic investor-bank game K Staňková and B De Schutter If you want

More information

Reverse Stackelberg games, Part I: Basic framework

Reverse Stackelberg games, Part I: Basic framework Delft University of Technology Delft Center for Systems and Control Technical report 12-021 Reverse Stackelberg games, Part I: Basic framework N. Groot, B. De Schutter, and H. Hellendoorn If you want to

More information

Adaptive State Feedback Nash Strategies for Linear Quadratic Discrete-Time Games

Adaptive State Feedback Nash Strategies for Linear Quadratic Discrete-Time Games Adaptive State Feedbac Nash Strategies for Linear Quadratic Discrete-Time Games Dan Shen and Jose B. Cruz, Jr. Intelligent Automation Inc., Rocville, MD 2858 USA (email: dshen@i-a-i.com). The Ohio State

More information

An Introduction to Noncooperative Games

An Introduction to Noncooperative Games An Introduction to Noncooperative Games Alberto Bressan Department of Mathematics, Penn State University Alberto Bressan (Penn State) Noncooperative Games 1 / 29 introduction to non-cooperative games:

More information

Dynamic stochastic game and macroeconomic equilibrium

Dynamic stochastic game and macroeconomic equilibrium Dynamic stochastic game and macroeconomic equilibrium Tianxiao Zheng SAIF 1. Introduction We have studied single agent problems. However, macro-economy consists of a large number of agents including individuals/households,

More information

On max-algebraic models for transportation networks

On max-algebraic models for transportation networks K.U.Leuven Department of Electrical Engineering (ESAT) SISTA Technical report 98-00 On max-algebraic models for transportation networks R. de Vries, B. De Schutter, and B. De Moor If you want to cite this

More information

Strategic Interactions between Fiscal and Monetary Policies in a Monetary Union

Strategic Interactions between Fiscal and Monetary Policies in a Monetary Union Strategic Interactions between Fiscal and Monetary Policies in a Monetary Union Reinhard Neck Doris A. Behrens Preliminary Draft, not to be quoted January 2003 Corresponding Author s Address: Reinhard

More information

Distributed Game Strategy Design with Application to Multi-Agent Formation Control

Distributed Game Strategy Design with Application to Multi-Agent Formation Control 5rd IEEE Conference on Decision and Control December 5-7, 4. Los Angeles, California, USA Distributed Game Strategy Design with Application to Multi-Agent Formation Control Wei Lin, Member, IEEE, Zhihua

More information

Prediction-based adaptive control of a class of discrete-time nonlinear systems with nonlinear growth rate

Prediction-based adaptive control of a class of discrete-time nonlinear systems with nonlinear growth rate www.scichina.com info.scichina.com www.springerlin.com Prediction-based adaptive control of a class of discrete-time nonlinear systems with nonlinear growth rate WEI Chen & CHEN ZongJi School of Automation

More information

Information, Utility & Bounded Rationality

Information, Utility & Bounded Rationality Information, Utility & Bounded Rationality Pedro A. Ortega and Daniel A. Braun Department of Engineering, University of Cambridge Trumpington Street, Cambridge, CB2 PZ, UK {dab54,pao32}@cam.ac.uk Abstract.

More information

Optimal Convergence in Multi-Agent MDPs

Optimal Convergence in Multi-Agent MDPs Optimal Convergence in Multi-Agent MDPs Peter Vrancx 1, Katja Verbeeck 2, and Ann Nowé 1 1 {pvrancx, ann.nowe}@vub.ac.be, Computational Modeling Lab, Vrije Universiteit Brussel 2 k.verbeeck@micc.unimaas.nl,

More information

Noncooperative continuous-time Markov games

Noncooperative continuous-time Markov games Morfismos, Vol. 9, No. 1, 2005, pp. 39 54 Noncooperative continuous-time Markov games Héctor Jasso-Fuentes Abstract This work concerns noncooperative continuous-time Markov games with Polish state and

More information

Incentive-Based Pricing for Network Games with Complete and Incomplete Information

Incentive-Based Pricing for Network Games with Complete and Incomplete Information Incentive-Based Pricing for Network Games with Complete and Incomplete Information Hongxia Shen and Tamer Başar Coordinated Science Laboratory University of Illinois at Urbana-Champaign hshen1, tbasar@control.csl.uiuc.edu

More information

Robust Predictions in Games with Incomplete Information

Robust Predictions in Games with Incomplete Information Robust Predictions in Games with Incomplete Information joint with Stephen Morris (Princeton University) November 2010 Payoff Environment in games with incomplete information, the agents are uncertain

More information

Large-population, dynamical, multi-agent, competitive, and cooperative phenomena occur in a wide range of designed

Large-population, dynamical, multi-agent, competitive, and cooperative phenomena occur in a wide range of designed Definition Mean Field Game (MFG) theory studies the existence of Nash equilibria, together with the individual strategies which generate them, in games involving a large number of agents modeled by controlled

More information

An Adaptive Clustering Method for Model-free Reinforcement Learning

An Adaptive Clustering Method for Model-free Reinforcement Learning An Adaptive Clustering Method for Model-free Reinforcement Learning Andreas Matt and Georg Regensburger Institute of Mathematics University of Innsbruck, Austria {andreas.matt, georg.regensburger}@uibk.ac.at

More information

Equivalent Bilevel Programming Form for the Generalized Nash Equilibrium Problem

Equivalent Bilevel Programming Form for the Generalized Nash Equilibrium Problem Vol. 2, No. 1 ISSN: 1916-9795 Equivalent Bilevel Programming Form for the Generalized Nash Equilibrium Problem Lianju Sun College of Operations Research and Management Science, Qufu Normal University Tel:

More information

Preliminary Results on Social Learning with Partial Observations

Preliminary Results on Social Learning with Partial Observations Preliminary Results on Social Learning with Partial Observations Ilan Lobel, Daron Acemoglu, Munther Dahleh and Asuman Ozdaglar ABSTRACT We study a model of social learning with partial observations from

More information

The characteristic equation and minimal state space realization of SISO systems in the max algebra

The characteristic equation and minimal state space realization of SISO systems in the max algebra KULeuven Department of Electrical Engineering (ESAT) SISTA Technical report 93-57a The characteristic equation and minimal state space realization of SISO systems in the max algebra B De Schutter and B

More information

On Equilibria of Distributed Message-Passing Games

On Equilibria of Distributed Message-Passing Games On Equilibria of Distributed Message-Passing Games Concetta Pilotto and K. Mani Chandy California Institute of Technology, Computer Science Department 1200 E. California Blvd. MC 256-80 Pasadena, US {pilotto,mani}@cs.caltech.edu

More information

Tilburg University. An equivalence result in linear-quadratic theory van den Broek, W.A.; Engwerda, Jacob; Schumacher, Hans. Published in: Automatica

Tilburg University. An equivalence result in linear-quadratic theory van den Broek, W.A.; Engwerda, Jacob; Schumacher, Hans. Published in: Automatica Tilburg University An equivalence result in linear-quadratic theory van den Broek, W.A.; Engwerda, Jacob; Schumacher, Hans Published in: Automatica Publication date: 2003 Link to publication Citation for

More information

Math-Net.Ru All Russian mathematical portal

Math-Net.Ru All Russian mathematical portal Math-Net.Ru All Russian mathematical portal Anna V. Tur, A time-consistent solution formula for linear-quadratic discrete-time dynamic games with nontransferable payoffs, Contributions to Game Theory and

More information

Introduction to Game Theory

Introduction to Game Theory COMP323 Introduction to Computational Game Theory Introduction to Game Theory Paul G. Spirakis Department of Computer Science University of Liverpool Paul G. Spirakis (U. Liverpool) Introduction to Game

More information

Near-Potential Games: Geometry and Dynamics

Near-Potential Games: Geometry and Dynamics Near-Potential Games: Geometry and Dynamics Ozan Candogan, Asuman Ozdaglar and Pablo A. Parrilo January 29, 2012 Abstract Potential games are a special class of games for which many adaptive user dynamics

More information

6.254 : Game Theory with Engineering Applications Lecture 7: Supermodular Games

6.254 : Game Theory with Engineering Applications Lecture 7: Supermodular Games 6.254 : Game Theory with Engineering Applications Lecture 7: Asu Ozdaglar MIT February 25, 2010 1 Introduction Outline Uniqueness of a Pure Nash Equilibrium for Continuous Games Reading: Rosen J.B., Existence

More information

Feedback strategies for discrete-time linear-quadratic two-player descriptor games

Feedback strategies for discrete-time linear-quadratic two-player descriptor games Feedback strategies for discrete-time linear-quadratic two-player descriptor games Marc Jungers To cite this version: Marc Jungers. Feedback strategies for discrete-time linear-quadratic two-player descriptor

More information

Genetic algorithm for closed-loop equilibrium of high-order linear quadratic dynamic games

Genetic algorithm for closed-loop equilibrium of high-order linear quadratic dynamic games Mathematics and Computers in Simulation 53 (2000) 39 47 Genetic algorithm for closed-loop equilibrium of high-order linear quadratic dynamic games Süheyla Özyıldırım Faculty of Business Administration,

More information

A Nash Equilibrium Analysis for Interference Coupled Wireless Systems

A Nash Equilibrium Analysis for Interference Coupled Wireless Systems A Nash Equilibrium Analysis for Interference Coupled Wireless Systems Siddharth Nai Technical University of Berlin Heinrich Hertz Chair for Mobile Communications Einsteinufer 25, 10587 Berlin, Germany

More information

UC Berkeley Haas School of Business Game Theory (EMBA 296 & EWMBA 211) Summer 2016

UC Berkeley Haas School of Business Game Theory (EMBA 296 & EWMBA 211) Summer 2016 UC Berkeley Haas School of Business Game Theory (EMBA 296 & EWMBA 211) Summer 2016 More on strategic games and extensive games with perfect information Block 2 Jun 12, 2016 Food for thought LUPI Many players

More information

Advertising and Promotion in a Marketing Channel

Advertising and Promotion in a Marketing Channel Applied Mathematical Sciences, Vol. 13, 2019, no. 9, 405-413 HIKARI Ltd, www.m-hikari.com https://doi.org/10.12988/ams.2019.9240 Advertising and Promotion in a Marketing Channel Alessandra Buratto Dipartimento

More information

Optimal Decentralized Control of Coupled Subsystems With Control Sharing

Optimal Decentralized Control of Coupled Subsystems With Control Sharing IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 58, NO. 9, SEPTEMBER 2013 2377 Optimal Decentralized Control of Coupled Subsystems With Control Sharing Aditya Mahajan, Member, IEEE Abstract Subsystems that

More information

A New and Robust Subgame Perfect Equilibrium in a model of Triadic Power Relations *

A New and Robust Subgame Perfect Equilibrium in a model of Triadic Power Relations * A New and Robust Subgame Perfect Equilibrium in a model of Triadic Power Relations * by Magnus Hatlebakk ** Department of Economics, University of Bergen Abstract: We present a new subgame perfect equilibrium

More information

Channel Probing in Communication Systems: Myopic Policies Are Not Always Optimal

Channel Probing in Communication Systems: Myopic Policies Are Not Always Optimal Channel Probing in Communication Systems: Myopic Policies Are Not Always Optimal Matthew Johnston, Eytan Modiano Laboratory for Information and Decision Systems Massachusetts Institute of Technology Cambridge,

More information

Computation of Efficient Nash Equilibria for experimental economic games

Computation of Efficient Nash Equilibria for experimental economic games International Journal of Mathematics and Soft Computing Vol.5, No.2 (2015), 197-212. ISSN Print : 2249-3328 ISSN Online: 2319-5215 Computation of Efficient Nash Equilibria for experimental economic games

More information

1. The General Linear-Quadratic Framework

1. The General Linear-Quadratic Framework ECO 37 Economics of Uncertainty Fall Term 009 Notes for lectures Incentives for Effort - Multi-Dimensional Cases Here we consider moral hazard problems in the principal-agent framewor, restricting the

More information

CS364A: Algorithmic Game Theory Lecture #16: Best-Response Dynamics

CS364A: Algorithmic Game Theory Lecture #16: Best-Response Dynamics CS364A: Algorithmic Game Theory Lecture #16: Best-Response Dynamics Tim Roughgarden November 13, 2013 1 Do Players Learn Equilibria? In this lecture we segue into the third part of the course, which studies

More information

The Value of Information in Zero-Sum Games

The Value of Information in Zero-Sum Games The Value of Information in Zero-Sum Games Olivier Gossner THEMA, Université Paris Nanterre and CORE, Université Catholique de Louvain joint work with Jean-François Mertens July 2001 Abstract: It is not

More information

Game Theory with Information: Introducing the Witsenhausen Intrinsic Model

Game Theory with Information: Introducing the Witsenhausen Intrinsic Model Game Theory with Information: Introducing the Witsenhausen Intrinsic Model Michel De Lara and Benjamin Heymann Cermics, École des Ponts ParisTech France École des Ponts ParisTech March 15, 2017 Information

More information

Near-Potential Games: Geometry and Dynamics

Near-Potential Games: Geometry and Dynamics Near-Potential Games: Geometry and Dynamics Ozan Candogan, Asuman Ozdaglar and Pablo A. Parrilo September 6, 2011 Abstract Potential games are a special class of games for which many adaptive user dynamics

More information

Equilibria for games with asymmetric information: from guesswork to systematic evaluation

Equilibria for games with asymmetric information: from guesswork to systematic evaluation Equilibria for games with asymmetric information: from guesswork to systematic evaluation Achilleas Anastasopoulos anastas@umich.edu EECS Department University of Michigan February 11, 2016 Joint work

More information

Basics of Game Theory

Basics of Game Theory Basics of Game Theory Giacomo Bacci and Luca Sanguinetti Department of Information Engineering University of Pisa, Pisa, Italy {giacomo.bacci,luca.sanguinetti}@iet.unipi.it April - May, 2010 G. Bacci and

More information

Combinatorial Agency of Threshold Functions

Combinatorial Agency of Threshold Functions Combinatorial Agency of Threshold Functions Shaili Jain 1 and David C. Parkes 2 1 Yale University, New Haven, CT shaili.jain@yale.edu 2 Harvard University, Cambridge, MA parkes@eecs.harvard.edu Abstract.

More information

Best Guaranteed Result Principle and Decision Making in Operations with Stochastic Factors and Uncertainty

Best Guaranteed Result Principle and Decision Making in Operations with Stochastic Factors and Uncertainty Stochastics and uncertainty underlie all the processes of the Universe. N.N.Moiseev Best Guaranteed Result Principle and Decision Making in Operations with Stochastic Factors and Uncertainty by Iouldouz

More information

UTILIZING PRIOR KNOWLEDGE IN ROBUST OPTIMAL EXPERIMENT DESIGN. EE & CS, The University of Newcastle, Australia EE, Technion, Israel.

UTILIZING PRIOR KNOWLEDGE IN ROBUST OPTIMAL EXPERIMENT DESIGN. EE & CS, The University of Newcastle, Australia EE, Technion, Israel. UTILIZING PRIOR KNOWLEDGE IN ROBUST OPTIMAL EXPERIMENT DESIGN Graham C. Goodwin James S. Welsh Arie Feuer Milan Depich EE & CS, The University of Newcastle, Australia 38. EE, Technion, Israel. Abstract:

More information

Applications of Dynamic Pricing in Day-to-Day Equilibrium Models

Applications of Dynamic Pricing in Day-to-Day Equilibrium Models Applications of Dynamic Pricing in Day-to-Day Equilibrium Models Tarun Rambha Graduate Research Assistant The University of Texas at Austin Department of Civil, Architectural, and Environmental Engineering

More information

Finite-Horizon Optimal State-Feedback Control of Nonlinear Stochastic Systems Based on a Minimum Principle

Finite-Horizon Optimal State-Feedback Control of Nonlinear Stochastic Systems Based on a Minimum Principle Finite-Horizon Optimal State-Feedbac Control of Nonlinear Stochastic Systems Based on a Minimum Principle Marc P Deisenroth, Toshiyui Ohtsua, Florian Weissel, Dietrich Brunn, and Uwe D Hanebec Abstract

More information

Credibilistic Bi-Matrix Game

Credibilistic Bi-Matrix Game Journal of Uncertain Systems Vol.6, No.1, pp.71-80, 2012 Online at: www.jus.org.uk Credibilistic Bi-Matrix Game Prasanta Mula 1, Sankar Kumar Roy 2, 1 ISRO Satellite Centre, Old Airport Road, Vimanapura

More information

6.207/14.15: Networks Lecture 11: Introduction to Game Theory 3

6.207/14.15: Networks Lecture 11: Introduction to Game Theory 3 6.207/14.15: Networks Lecture 11: Introduction to Game Theory 3 Daron Acemoglu and Asu Ozdaglar MIT October 19, 2009 1 Introduction Outline Existence of Nash Equilibrium in Infinite Games Extensive Form

More information

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 53, NO. 7, AUGUST /$ IEEE

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 53, NO. 7, AUGUST /$ IEEE IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL 53, NO 7, AUGUST 2008 1643 Asymptotically Optimal Decentralized Control for Large Population Stochastic Multiagent Systems Tao Li, Student Member, IEEE, and

More information

1 PROBLEM DEFINITION. i=1 z i = 1 }.

1 PROBLEM DEFINITION. i=1 z i = 1 }. Algorithms for Approximations of Nash Equilibrium (003; Lipton, Markakis, Mehta, 006; Kontogiannis, Panagopoulou, Spirakis, and 006; Daskalakis, Mehta, Papadimitriou) Spyros C. Kontogiannis University

More information

Mathematical Economics - PhD in Economics

Mathematical Economics - PhD in Economics - PhD in Part 1: Supermodularity and complementarity in the one-dimensional and Paulo Brito ISEG - Technical University of Lisbon November 24, 2010 1 2 - Supermodular optimization 3 one-dimensional 4 Supermodular

More information

A Stackelberg Network Game with a Large Number of Followers 1,2,3

A Stackelberg Network Game with a Large Number of Followers 1,2,3 A Stackelberg Network Game with a Large Number of Followers,,3 T. BAŞAR 4 and R. SRIKANT 5 Communicated by M. Simaan Abstract. We consider a hierarchical network game with multiple links, a single service

More information

Cheap talk theoretic tools for distributed sensing in the presence of strategic sensors

Cheap talk theoretic tools for distributed sensing in the presence of strategic sensors 1 / 37 Cheap talk theoretic tools for distributed sensing in the presence of strategic sensors Cédric Langbort Department of Aerospace Engineering & Coordinated Science Laboratory UNIVERSITY OF ILLINOIS

More information

On Robust Arm-Acquiring Bandit Problems

On Robust Arm-Acquiring Bandit Problems On Robust Arm-Acquiring Bandit Problems Shiqing Yu Faculty Mentor: Xiang Yu July 20, 2014 Abstract In the classical multi-armed bandit problem, at each stage, the player has to choose one from N given

More information

CONTROL SYSTEMS, ROBOTICS AND AUTOMATION Vol. XI Stochastic Stability - H.J. Kushner

CONTROL SYSTEMS, ROBOTICS AND AUTOMATION Vol. XI Stochastic Stability - H.J. Kushner STOCHASTIC STABILITY H.J. Kushner Applied Mathematics, Brown University, Providence, RI, USA. Keywords: stability, stochastic stability, random perturbations, Markov systems, robustness, perturbed systems,

More information

A Note on Open Loop Nash Equilibrium in Linear-State Differential Games

A Note on Open Loop Nash Equilibrium in Linear-State Differential Games Applied Mathematical Sciences, vol. 8, 2014, no. 145, 7239-7248 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ams.2014.49746 A Note on Open Loop Nash Equilibrium in Linear-State Differential

More information

Single parameter FPT-algorithms for non-trivial games

Single parameter FPT-algorithms for non-trivial games Single parameter FPT-algorithms for non-trivial games Author Estivill-Castro, Vladimir, Parsa, Mahdi Published 2011 Journal Title Lecture Notes in Computer science DOI https://doi.org/10.1007/978-3-642-19222-7_13

More information

Mechanism Design: Basic Concepts

Mechanism Design: Basic Concepts Advanced Microeconomic Theory: Economics 521b Spring 2011 Juuso Välimäki Mechanism Design: Basic Concepts The setup is similar to that of a Bayesian game. The ingredients are: 1. Set of players, i {1,

More information

Total Expected Discounted Reward MDPs: Existence of Optimal Policies

Total Expected Discounted Reward MDPs: Existence of Optimal Policies Total Expected Discounted Reward MDPs: Existence of Optimal Policies Eugene A. Feinberg Department of Applied Mathematics and Statistics State University of New York at Stony Brook Stony Brook, NY 11794-3600

More information

Equilibria in Games with Weak Payoff Externalities

Equilibria in Games with Weak Payoff Externalities NUPRI Working Paper 2016-03 Equilibria in Games with Weak Payoff Externalities Takuya Iimura, Toshimasa Maruta, and Takahiro Watanabe October, 2016 Nihon University Population Research Institute http://www.nihon-u.ac.jp/research/institute/population/nupri/en/publications.html

More information

EQUIVALENCE OF TOPOLOGIES AND BOREL FIELDS FOR COUNTABLY-HILBERT SPACES

EQUIVALENCE OF TOPOLOGIES AND BOREL FIELDS FOR COUNTABLY-HILBERT SPACES EQUIVALENCE OF TOPOLOGIES AND BOREL FIELDS FOR COUNTABLY-HILBERT SPACES JEREMY J. BECNEL Abstract. We examine the main topologies wea, strong, and inductive placed on the dual of a countably-normed space

More information

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program June 2016

Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program June 2016 Ph.D. Preliminary Examination MICROECONOMIC THEORY Applied Economics Graduate Program June 2016 The time limit for this exam is four hours. The exam has four sections. Each section includes two questions.

More information

MDP Preliminaries. Nan Jiang. February 10, 2019

MDP Preliminaries. Nan Jiang. February 10, 2019 MDP Preliminaries Nan Jiang February 10, 2019 1 Markov Decision Processes In reinforcement learning, the interactions between the agent and the environment are often described by a Markov Decision Process

More information

Brown s Original Fictitious Play

Brown s Original Fictitious Play manuscript No. Brown s Original Fictitious Play Ulrich Berger Vienna University of Economics, Department VW5 Augasse 2-6, A-1090 Vienna, Austria e-mail: ulrich.berger@wu-wien.ac.at March 2005 Abstract

More information

EE291E/ME 290Q Lecture Notes 8. Optimal Control and Dynamic Games

EE291E/ME 290Q Lecture Notes 8. Optimal Control and Dynamic Games EE291E/ME 290Q Lecture Notes 8. Optimal Control and Dynamic Games S. S. Sastry REVISED March 29th There exist two main approaches to optimal control and dynamic games: 1. via the Calculus of Variations

More information

On the Inherent Robustness of Suboptimal Model Predictive Control

On the Inherent Robustness of Suboptimal Model Predictive Control On the Inherent Robustness of Suboptimal Model Predictive Control James B. Rawlings, Gabriele Pannocchia, Stephen J. Wright, and Cuyler N. Bates Department of Chemical & Biological Engineering Computer

More information

Cooperation Stimulation in Cooperative Communications: An Indirect Reciprocity Game

Cooperation Stimulation in Cooperative Communications: An Indirect Reciprocity Game IEEE ICC 202 - Wireless Networks Symposium Cooperation Stimulation in Cooperative Communications: An Indirect Reciprocity Game Yang Gao, Yan Chen and K. J. Ray Liu Department of Electrical and Computer

More information

Information Structures, the Witsenhausen Counterexample, and Communicating Using Actions

Information Structures, the Witsenhausen Counterexample, and Communicating Using Actions Information Structures, the Witsenhausen Counterexample, and Communicating Using Actions Pulkit Grover, Carnegie Mellon University Abstract The concept of information-structures in decentralized control

More information

Lecture 6 Random walks - advanced methods

Lecture 6 Random walks - advanced methods Lecture 6: Random wals - advanced methods 1 of 11 Course: M362K Intro to Stochastic Processes Term: Fall 2014 Instructor: Gordan Zitovic Lecture 6 Random wals - advanced methods STOPPING TIMES Our last

More information

A Few Games and Geometric Insights

A Few Games and Geometric Insights A Few Games and Geometric Insights Brian Powers Arizona State University brpowers@asu.edu January 20, 2017 1 / 56 Outline 1 Nash/Correlated Equilibria A Motivating Example The Nash Equilibrium Correlated

More information

MS&E 246: Lecture 18 Network routing. Ramesh Johari

MS&E 246: Lecture 18 Network routing. Ramesh Johari MS&E 246: Lecture 18 Network routing Ramesh Johari Network routing Last lecture: a model where N is finite Now: assume N is very large Formally: Represent the set of users as a continuous interval, [0,

More information

NASH STRATEGIES FOR DYNAMIC NONCOOPERATIVE LINEAR QUADRATIC SEQUENTIAL GAMES

NASH STRATEGIES FOR DYNAMIC NONCOOPERATIVE LINEAR QUADRATIC SEQUENTIAL GAMES NASH STRATEGIES FOR DYNAMIC NONCOOPERATIVE LINEAR QUADRATIC SEQUENTIAL GAMES DISSERTATION Presented in Partial Fulfillment of the Requirements for the Degree Doctor of Philosophy in the Graduate School

More information

Volume 30, Issue 3. A note on Kalman filter approach to solution of rational expectations models

Volume 30, Issue 3. A note on Kalman filter approach to solution of rational expectations models Volume 30, Issue 3 A note on Kalman filter approach to solution of rational expectations models Marco Maria Sorge BGSE, University of Bonn Abstract In this note, a class of nonlinear dynamic models under

More information

NON-LINEAR CONTROL OF OUTPUT PROBABILITY DENSITY FUNCTION FOR LINEAR ARMAX SYSTEMS

NON-LINEAR CONTROL OF OUTPUT PROBABILITY DENSITY FUNCTION FOR LINEAR ARMAX SYSTEMS Control 4, University of Bath, UK, September 4 ID-83 NON-LINEAR CONTROL OF OUTPUT PROBABILITY DENSITY FUNCTION FOR LINEAR ARMAX SYSTEMS H. Yue, H. Wang Control Systems Centre, University of Manchester

More information

Stochastic Realization of Binary Exchangeable Processes

Stochastic Realization of Binary Exchangeable Processes Stochastic Realization of Binary Exchangeable Processes Lorenzo Finesso and Cecilia Prosdocimi Abstract A discrete time stochastic process is called exchangeable if its n-dimensional distributions are,

More information

Optimal Tariffs on Exhaustible Resources: A Leader-Follower Model

Optimal Tariffs on Exhaustible Resources: A Leader-Follower Model Optimal Tariffs on Exhaustible Resources: A Leader-Follower Model Kenji Fujiwara Department of Economics, McGill University & School of Economics, Kwansei Gakuin University Ngo Van Long Department of Economics,

More information

Topic 6: Projected Dynamical Systems

Topic 6: Projected Dynamical Systems Topic 6: Projected Dynamical Systems John F. Smith Memorial Professor and Director Virtual Center for Supernetworks Isenberg School of Management University of Massachusetts Amherst, Massachusetts 01003

More information

Political Economy of Institutions and Development: Problem Set 1. Due Date: Thursday, February 23, in class.

Political Economy of Institutions and Development: Problem Set 1. Due Date: Thursday, February 23, in class. Political Economy of Institutions and Development: 14.773 Problem Set 1 Due Date: Thursday, February 23, in class. Answer Questions 1-3. handed in. The other two questions are for practice and are not

More information

Stationary Probabilities of Markov Chains with Upper Hessenberg Transition Matrices

Stationary Probabilities of Markov Chains with Upper Hessenberg Transition Matrices Stationary Probabilities of Marov Chains with Upper Hessenberg Transition Matrices Y. Quennel ZHAO Department of Mathematics and Statistics University of Winnipeg Winnipeg, Manitoba CANADA R3B 2E9 Susan

More information

A PREDICTION-BASED BEHAVIORAL MODEL FOR FINANCIAL MARKETS

A PREDICTION-BASED BEHAVIORAL MODEL FOR FINANCIAL MARKETS A PREDICTION-BASED BEHAVIORAL MODEL FOR FINANCIAL MARKETS László Gerencsér, Zalán Mátyás Computer and Automation Institute of the Hungarian Academy of Sciences, 3-7 Kende u., Budapest, Hungary gerencser@sztai.hu,

More information

Computing Equilibria of Repeated And Dynamic Games

Computing Equilibria of Repeated And Dynamic Games Computing Equilibria of Repeated And Dynamic Games Şevin Yeltekin Carnegie Mellon University ICE 2012 July 2012 1 / 44 Introduction Repeated and dynamic games have been used to model dynamic interactions

More information

Contracts under Asymmetric Information

Contracts under Asymmetric Information Contracts under Asymmetric Information 1 I Aristotle, economy (oiko and nemo) and the idea of exchange values, subsequently adapted by Ricardo and Marx. Classical economists. An economy consists of a set

More information

State Estimation and Prediction in a Class of Stochastic Hybrid Systems

State Estimation and Prediction in a Class of Stochastic Hybrid Systems State Estimation and Prediction in a Class of Stochastic Hybrid Systems Eugenio Cinquemani Mario Micheli Giorgio Picci Dipartimento di Ingegneria dell Informazione, Università di Padova, Padova, Italy

More information

PARAMETER ESTIMATION AND ORDER SELECTION FOR LINEAR REGRESSION PROBLEMS. Yngve Selén and Erik G. Larsson

PARAMETER ESTIMATION AND ORDER SELECTION FOR LINEAR REGRESSION PROBLEMS. Yngve Selén and Erik G. Larsson PARAMETER ESTIMATION AND ORDER SELECTION FOR LINEAR REGRESSION PROBLEMS Yngve Selén and Eri G Larsson Dept of Information Technology Uppsala University, PO Box 337 SE-71 Uppsala, Sweden email: yngveselen@ituuse

More information

Introduction. 1 University of Pennsylvania, Wharton Finance Department, Steinberg Hall-Dietrich Hall, 3620

Introduction. 1 University of Pennsylvania, Wharton Finance Department, Steinberg Hall-Dietrich Hall, 3620 May 16, 2006 Philip Bond 1 Are cheap talk and hard evidence both needed in the courtroom? Abstract: In a recent paper, Bull and Watson (2004) present a formal model of verifiability in which cheap messages

More information

State Estimation of Linear and Nonlinear Dynamic Systems

State Estimation of Linear and Nonlinear Dynamic Systems State Estimation of Linear and Nonlinear Dynamic Systems Part I: Linear Systems with Gaussian Noise James B. Rawlings and Fernando V. Lima Department of Chemical and Biological Engineering University of

More information

Asymptotics of Efficiency Loss in Competitive Market Mechanisms

Asymptotics of Efficiency Loss in Competitive Market Mechanisms Page 1 of 40 Asymptotics of Efficiency Loss in Competitive Market Mechanisms Jia Yuan Yu and Shie Mannor Department of Electrical and Computer Engineering McGill University Montréal, Québec, Canada jia.yu@mail.mcgill.ca

More information

Graduate Microeconomics II Lecture 5: Cheap Talk. Patrick Legros

Graduate Microeconomics II Lecture 5: Cheap Talk. Patrick Legros Graduate Microeconomics II Lecture 5: Cheap Talk Patrick Legros 1 / 35 Outline Cheap talk 2 / 35 Outline Cheap talk Crawford-Sobel Welfare 3 / 35 Outline Cheap talk Crawford-Sobel Welfare Partially Verifiable

More information

A Stackelberg Game Model of Dynamic Duopolistic Competition with Sticky Prices. Abstract

A Stackelberg Game Model of Dynamic Duopolistic Competition with Sticky Prices. Abstract A Stackelberg Game Model of Dynamic Duopolistic Competition with Sticky Prices Kenji Fujiwara School of Economics, Kwansei Gakuin University Abstract We develop the following Stackelberg game model of

More information

A note on the characteristic equation in the max-plus algebra

A note on the characteristic equation in the max-plus algebra K.U.Leuven Department of Electrical Engineering (ESAT) SISTA Technical report 96-30 A note on the characteristic equation in the max-plus algebra B. De Schutter and B. De Moor If you want to cite this

More information

ON THE PARAMETRIC UNCERTAINTY OF WEAKLY REVERSIBLE REALIZATIONS OF KINETIC SYSTEMS

ON THE PARAMETRIC UNCERTAINTY OF WEAKLY REVERSIBLE REALIZATIONS OF KINETIC SYSTEMS HUNGARIAN JOURNAL OF INDUSTRY AND CHEMISTRY VESZPRÉM Vol. 42(2) pp. 3 7 (24) ON THE PARAMETRIC UNCERTAINTY OF WEAKLY REVERSIBLE REALIZATIONS OF KINETIC SYSTEMS GYÖRGY LIPTÁK, GÁBOR SZEDERKÉNYI,2 AND KATALIN

More information

LEARNING IN CONCAVE GAMES

LEARNING IN CONCAVE GAMES LEARNING IN CONCAVE GAMES P. Mertikopoulos French National Center for Scientific Research (CNRS) Laboratoire d Informatique de Grenoble GSBE ETBC seminar Maastricht, October 22, 2015 Motivation and Preliminaries

More information

Hyperbolicity of Systems Describing Value Functions in Differential Games which Model Duopoly Problems. Joanna Zwierzchowska

Hyperbolicity of Systems Describing Value Functions in Differential Games which Model Duopoly Problems. Joanna Zwierzchowska Decision Making in Manufacturing and Services Vol. 9 05 No. pp. 89 00 Hyperbolicity of Systems Describing Value Functions in Differential Games which Model Duopoly Problems Joanna Zwierzchowska Abstract.

More information

Game Theory Extra Lecture 1 (BoB)

Game Theory Extra Lecture 1 (BoB) Game Theory 2014 Extra Lecture 1 (BoB) Differential games Tools from optimal control Dynamic programming Hamilton-Jacobi-Bellman-Isaacs equation Zerosum linear quadratic games and H control Baser/Olsder,

More information

Learning Approaches to the Witsenhausen Counterexample From a View of Potential Games

Learning Approaches to the Witsenhausen Counterexample From a View of Potential Games Learning Approaches to the Witsenhausen Counterexample From a View of Potential Games Na Li, Jason R. Marden and Jeff S. Shamma Abstract Since Witsenhausen put forward his remarkable counterexample in

More information

On the Pareto Efficiency of a Socially Optimal Mechanism for Monopoly Regulation

On the Pareto Efficiency of a Socially Optimal Mechanism for Monopoly Regulation MPRA Munich Personal RePEc Archive On the Pareto Efficiency of a Socially Optimal Mechanism for Monopoly Regulation Ismail Saglam Ipek University 4 May 2016 Online at https://mpra.ub.uni-muenchen.de/71090/

More information

Price and Capacity Competition

Price and Capacity Competition Price and Capacity Competition Daron Acemoglu, Kostas Bimpikis, and Asuman Ozdaglar October 9, 2007 Abstract We study the efficiency of oligopoly equilibria in a model where firms compete over capacities

More information

Maximization of Submodular Set Functions

Maximization of Submodular Set Functions Northeastern University Department of Electrical and Computer Engineering Maximization of Submodular Set Functions Biomedical Signal Processing, Imaging, Reasoning, and Learning BSPIRAL) Group Author:

More information

Blackwell s Informativeness Theorem Using Category Theory

Blackwell s Informativeness Theorem Using Category Theory Blackwell s Informativeness Theorem Using Category Theory Henrique de Oliveira March 29, 2016 1 Introduction An agent faces a decision problem under uncertainty, whose payoff u (a, ω) depends on her action

More information

EUSIPCO

EUSIPCO EUSIPCO 3 569736677 FULLY ISTRIBUTE SIGNAL ETECTION: APPLICATION TO COGNITIVE RAIO Franc Iutzeler Philippe Ciblat Telecom ParisTech, 46 rue Barrault 753 Paris, France email: firstnamelastname@telecom-paristechfr

More information