Research Article A Necessary Condition for Nash Equilibrium in Two-Person Zero-Sum Constrained Stochastic Games
|
|
- Sharyl Craig
- 5 years ago
- Views:
Transcription
1 Game Theory Volume 2013, Article ID , 5 pages Research Article A Necessary Condition for Nash Equilibrium in Two-Person Zero-Sum Constrained Stochastic Games Hyeong Soo Chang Department of Computer Science and Engineering, Sogang University, Seoul , Republic of Korea Correspondence should be addressed to Hyeong Soo Chang; hschang@sogang.ac.kr Received 15 September 2013; Accepted 19 November 2013 Academic Editor: Peijun Guo Copyright 2013 Hyeong Soo Chang. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. We provide a necessary condition that a constrained Nash-equilibrium (CNE) policy pair satisfies in two-person zero-sum constrained stochastic discounted-payoff games and discuss a general method of approximating CNE based on the condition. 1. Introduction Altman and Shwartz [1] established a sufficient condition for the existence of a stationary Markovian constrained Nash equilibrium (CNE) policy pair in a general model of finite two-person zero-sum constrained stochastic games and Alvarez-Mena and Hernández-Lerma [2] extended the result for infinite state and action spaces. Even though a few computational studies exist for average-payoff models with additional simplifying assumptions (see, e.g., [3 5]), there seems to be no work providing a meaningful necessary condition for CNE or any general approximation scheme for CNE within the general discounting-cost model. This brief paper establishes a necessary condition that a CNE policy pair satisfies by a novel characterization of the set of all feasible policies of one player when the other player s policy is fixed. This is done by identifying feasible mixed actions of one player at a current state when the expected total discounted constraint cost from each reachable next stateisgivenbyavaluefunctiondefinedoverthestatespace. The necessary condition provides a general method of testing whetheragivenpolicypairisacnepolicypairandcan induce a general approximation scheme for CNE. 2. Preliminaries Consider a two-person zero-sum Markov game (MG) [6] M = (X,A,,P,C),whereX is a finite state set, and A and are nonempty finite pure-action sets for the minimizer and the maximizer, respectively, at x in X with A = A and =. We denote the mixed action sets at x in X over A and to be D[A] and D[], respectively,withd[a] = D[A] and D[] = D[]. Once f in D[A] and g in D[] are simultaneously taken at x in X by the minimizer and the maximizer, respectively (with the complete knowledge of the state but without knowing each other s current action being taken), x makes a transition to a next state y by the probability P fg given as P fg = a A b f (a) g (b) p (y x, a, b). Here f(a) denotes the probability of selecting a, similarto g(b), and p(y x, a, b) denotes the probability of moving from x to y by a and b. Then the minimizer obtains an expected cost of C(x, f, g) given by C(x,f,g) = a A b c (x, y, a, b) p (y x, a, b) f (a) g (b), where c(x, y, a, b) in R is a payoff to the minimizer (the negative of this will be incurred to the maximizer). We define a stationary Markovian policy π of the minimizer as a function π : X D[A] with π D[A] (1) (2)
2 2 Game Theory for all anddenote Π to be the set of all possible such policies. A policy φ is similarly defined for the maximizer with D[], and we denote Φ to be the set of all possible such policies. Define the objective value of π in Π and φ in Φ with an initial state x in X as V (π, φ) =E[ γ t C (X t,π(x t ),φ(x t )) X 0 =x], t=0 where X t is a random variable denoting the state at time t by following π and φ,andγ (0, 1) is a fixed discounting factor. We let V δ (π, φ) = δv(π, φ) for a given initial state distribution δ over X. The MG M is associated with constraint functions κ i, i= 1, 2, defined over X and constraint-cost functions d i, i=1,2, where d i (x,y,a,b) R, x, y X, a A,andb, is a constraint-cost paid by the minimizer if i=1and by the maximizer if i=2. (For simplicity, we consider the model in [1, 2] with only one side constraint for each player.) Apolicyπ in Π is called δ-feasible with respect to φ in Φ if the pair of π and φ satisfies the constraint inequality of δj 1 (π, φ) δκ 1, wherej 1 is defined with β 1 (0, 1) such that J 1 (π, φ) =E[ β t 1 D 1 (X t,π(x t ),φ(x t )) X 0 =x], t=0 The expected constraint cost D 1 (x, f, g) is given by D 1 (x, f, g) = a A b (3). (4) d 1 (x,y,a,b)p(y x,a,b)f(a) g (b). Similarly, φ in Φ is δ-feasible with respect to π in Π if δj 2 (π, φ) δκ 2, wherej 2 is defined with D 2 and β 2 (0, 1).Wesaythatπ is feasible with respect to φ if, for all x in X, J 1 (π, φ) κ 1,andφ is feasible with respect to π if, for all x in X, J 2 (π, φ) κ 2.Notethatifπ is feasible with respect to φ, thenπ is δ-feasible with respect to φ for any δ. Let Π φ =π Π: δ J 1 (π, φ) δ κ 1 }, φ Φ, Φ π = φ Φ : δ J 2 (π, φ) δ κ 2 }, π Π. (6) That is, Π φ is the set of all δ-feasible policies in Π with respect to φ (when the maximizer s policy is fixed by φ); similarly Φ π (5) is obtained when the minimizer s policy is fixed by π. Then M has a constrained Nash equilibrium (CNE) if there exists a pair of π in Π and φ in Φ such that π is δ-feasible with respect to φ and φ is δ-feasible with respect to π,and V δ (π,φ) V δ (π,φ ) V δ (π, φ ) π Π φ, φ Φ π. 3. A Necessary Condition for CNE Let (X) be the set of all real-valued functions on X.Givenφ in Φ and V in (X), define φ-constrained feasible mixed action set with V for the minimizer: for all x in X, D[A ] φ,v = f D[A ] :D 1 (x, f, φ )+β 1 P fφ V (y) (7) V +(1 β 1 ) min (κ 1 (x ) V(x )) } }. } (8) We further define π-constrained feasible mixed action set with V forthemaximizerforπ in Π and V in (X):forallx in X, D[ ] π,v = g D[ ] :D 2 (x, π,g)+β 2 P πg V (y) V +(1 β 2 ) min (κ 2 (x ) V(x )) } }. } (9) Lemma 1. Given φ in Φ and V in (X), suppose that D[A] φ,v =0for all x in X.Thenanyπ in Π such that π D[A] φ,v for all x in X is feasible with respect to φ.similarly, given π in Π and V in (X), ifd[] π,v =0 for all x in X, then any φ in Π such that φ D[] π,v for all x in X is feasible with respect to π. Proof. Consider an operator T π,φ :(X) (X)given as T π,φ (V) =D 1 (x, π,φ)+β 1 P πφ V (y), x X, V (X). (10) Thenwehavethatforτ in R and u in (X) if T π,φ (u) u + τ for all x in X, then,forallx X, J 1 (π, φ) u+τ(1 β 1 ) 1 and lim n (T π,φ (u)) n = J 1 (π, φ) [6].
3 Game Theory 3 ecause π is in D[A] φ,v for all x in X, T π,φ (V) V + (1 β 1 )min (κ 1 (x ) V(x )) for all x in X. With τ=(1 β 1 )min (κ 1 (x ) V(x )),wehavethat,forallx in X, J 1 (π, φ) V + min (κ 1 (x ) V(x )) κ 1. (11) The second statement can be proven with the similar symmetrical reasoning. Some notable examples of choosing V in (X) for D[A] φ,v and D[] π,v are κ 1, κ 2, J 1 (π, φ), andj 2 (π, φ), forsomepairofπ and φ, and the zero function such that V = 0 for all x in X. Inparticular,ifV = 0 for all,thend[a] φ,v = f D[A] : D 1 (x, f, φ) (1 β 1 )min κ 1 (x )}. We now let Π φ,v = π Π : π D[A ] φ,v, }, φ Φ, V (X), Φ π,v = φ Φ : φ D[ ] π,v, }, π Π,V (X). (12) If D[A] φ,v = 0 for some x in X, thenπ φ,v = 0,and similarly Φ π,v =0if D[] π,v =0for some x in X. The following theorem characterizes the set of one player s all feasible policies with respect to a fixed policy of the other player. Theorem 2. Given φ in Φ, π in Π is feasible with respect to φ if and only if π is in V (X) Π φ,v.furthermore,givenπ Π, φ in Φ is feasible with respect to π if and only if φ is in V (X) Φ π,v. Proof. We prove only the first part of the statement for the minimizer case. Suppose that π is feasible with respect to φ. ThenbysettingV = J 1 (π, φ), (1 β 1 )min (κ 1 J 1 (π, φ)) 0.Itfollowsthat D 1 (x, π,φ)+β 1 =J 1 (π, φ) P πφ J 1 (π, φ) (y) J 1 (π, φ) +(1 β 1 ) min (κ 1 (x ) J 1 (π, φ) (x )) (13) and this implies that there exists V in (X) such that π is in D[A] φ,v for all x in X. For the other direction, if π is in V (X) Π φ,v,then,for some V (X), π is in D[A] φ,v for all x in X. y Lemma 1, π is feasible with respect to φ. ecause a policy of one player which is feasible with respect to a policy of the other player is δ-feasible with respect to the policy of the other player, the following necessary condition satisfied by a CNE policy pair is immediate. Corollary 3. If a pair of π in Π and φ in Φ is a CNE pair for a given δ, then the pair satisfies the following saddle-point inequality: V δ (π,φ) V δ (π,φ ) V δ (π, φ ), π Π φ,v, φ Φ π,v. V (X) V (X) (14) Given a pair of π in Π and φ in Φ, if we find some V, u in (X) such that the pair does not satisfy the saddle-point inequality over nonempty Π φ,v and Φ π,u, then the pair is not acnepair. 4. An Example of Approximation Scheme for CNE We now provide an example of a general approximation scheme for CNE based on the necessary condition. asically, we fix some u, V in (X) andtrytofindanequilibriumpolicy pair that satisfies the saddle-point inequalities over subsets of the feasible policy spaces induced with the selected u, V and the pair, which is an approximate CNE pair. We start with selecting arbitrary V 1, V 2 in (X) and define feasible mixed joint-action sets induced with V 1, V 2 :forallx X, D V 1,V 2 = (f, g) D [A ] D[ ] i=1,2 Let :D i (x, f, g) + β i P fg V i (y) V i +(1 β i ) min (κ i (x ) V i (x )) } }. } =(a, b) A :(f a,g b ) D V 1,V 2 }, (15) (16) where f a denotes the mixed action with f a (a) = 1 and g b with g b (b) = 1. Assume that =0for all x in X.Wethenobtaina pair of nonempty A A and ΔV 1,V 2 such that A ΔV 1,V 2. Let and similarly A φ,v =a A :f a D[A ] φ,v },, V (X), φ Φ π,v =b :g b D[ ] π,v },, V (X), π Π. (17) (18)
4 4 Game Theory That is, A φ,v is a φ-constrained feasible pure-action set with V at x for the minimizer and similarly π,v is for the maximizer. For any subset A A, x X with A = A, we denote D[A ] to be the set of all possible probability distributions over A that have zero probabilities for the actions in A\A.IfA = 0,then D[A ] = 0. ThenotationofD[ ] for is similarly denoted. (Note that D[A φ,v ] D[A] φ,v D[A] for all x in X in general.) We further let Π[A ]=π Π:π D[A ], }, Φ[ ]=φ Φ:φ D[ ], }. (19) If D[A ] = 0 for some x in X, Π[A ]=0,andsimilarly Φ[ ]=0if D[ ] = 0 for some x in X. y construction, the following result is immediate. For all, A φ Φ[Δ V 1,V 2 ] π Π[Δ V 1,V 2 A ] A φ,v1, π,v2, (20) which further implies that, for any ( π, φ) such that π D[ A ] for all x in X and φ D[ ] for all x in X, π is feasible with respect to φ and φ is feasible with respect to π. Consider now the unconstrained game M V 1,V 2 = (X, A,ΔV 1,V 2,P,C), where P and C are evaluated only at f in D[ A ] and g in D[ΔV 1,V 2 ] for all x in X, and denote the set of all NE policy pairs of M V 1,V 2 to be NE(M V 1,V 2 ). The above two results finally imply then that any ( π, φ) NE(M V 1,V 2 ) is a local CNE for M.Inotherwords, for any δ, π is δ-feasible with respect to φ and φ is δ-feasible with respect to π, andπ[ A ] Π φ and Φ[ΔV 1,V 2 ] Φ π, and V δ ( π, φ) V δ ( π, φ) V δ (π, φ), π Π [ A ], φ Φ[ΔV 1,V 2 ]. (21) That is, the local CNE pair ( π, φ) satisfies all of the conditions of CNE except that the saddle-point inequality is satisfied locally for the subsets of Π φ and Φ π. In fact, related solution concepts for games that are resistant to local deviations, called local NE, have been already established in economics (see, e.g., [7]). Projecting into the two sets of A and turns out to be equivalent to obtaining a complete bipartite subgraph or biclique subgraph from a (bipartite) graph. The problem of finding a biclique in a given bipartite graph is well studied in the graph theory literature (see, e.g., [8]). Another issue is how we set V 1 and V 2 such that is nonempty for all x in X. If there exists a pure-policy pair of π in Π and φ in Φ such that, for all x in X, π(a) = 1 for some a Aand φ(b) = 1 for some b and onepolicyisfeasiblewithrespecttotheotherpolicy,thenby setting V i =J i (π, φ), i=1,2,wehave(π, φ) D V 1,V 2 for all x in X, making =0 for all x in X. Wecan put the following feasibility assumption on M to assure the existence of such a pure-policy pair: for all x in X, min κ 1,κ 2 } max inf J 1 (π, φ), inf J (22) 2 (π, φ) }. π Π,φ Φ π Π,φ Φ In other words, by this assumption there exists at least one pure-policy pair such that one policy is feasible with respect to the other policy. The existence comes from the fact that there exists a pure-policy pair (π, φ) that achieves inf π,φ J 1(π,φ ) for all x in X in solving this Markov decision process problem [9] and also for the case of inf π,φ J 2(π,φ ) for all x in X. To illustrate how the above method works, we consider the following simple MG M = (X,A,,P,C),whereX = 1, 2}, A = = a, b}, x X,andp, c, d 1,andd 2 are given in terms of a matrix form, respectively: p aa =[ ], p ab = [ ], pba = [ ],andpbb = [ ]; caa = c ab = [ ], cba = c bb = [ ];anddaa 1 = d ab 1 = [ ], d ba 1 =d bb 1 =[ ], daa 2 =d ab 2 =[ ],anddba 2 =d bb 2 =[ ]. In this matrix form, for example, the (i, j)th entries of p ab and c ab refer to p(j i, a, b) and c(i, j, a, b), respectively.theother parameters are given such that γ = β 1 = 0.9, β 2 = 0.95, κ 1 (1) = κ 2 (1) = 40,andκ 1 (2) = κ 2 (2) = 50. We first observe that, for the pure-policy pair (π, φ) = ((a, a), (a, a)), one policy is feasible with respect to the other policy. The notation of π = (π(1), π(2)) here refers to π(1) D[A(1)] and π(2) D[A(2)]. Forsimplicity, we writeπ(1) as a for the distribution concentrated on the action a.(notethat we can obtain another such pure-policy pair ((a, a), (b, b)) which achieves inf π Π,φ Φ J 1 (π, φ) = 2, x X.) Now, by setting V i = J i (π, φ), i = 1,2,wefirstobtain = (a, a), (a, b)}, forall,therebyhavingthe sets of A = a} and ΔV 1,V 2 = a, b}, forall. Wesolvetheunconstrainedtwo-personzero-sumMGM V 1,V 2, obtaining a pure NE policy pair of ( π, φ) = ((a, a), (a, a)) for M V 1,V 2, which is then a local CNE for M. 5. Concluding Remark For simplicity, the model of the present note deals with only one side constraint for each player. Generalization of this into multiple side constraints per player would make the definitions complex, but the ideas must be the same to the one side-constraint case. Acknowledgment The author is with the Department of Computer Science and Engineering at Sogang University, Seoul, Republic of Korea,
5 Game Theory 5 References [1] E. Altman and A. Shwartz, Constrained Markov games: Nash equilibria, in Annals of the International Society of Dynamic Games,V.Gaitsgory,J.Filar,andK.Mizukami,Eds.,vol.5,pp , irkhäauser, oston, Mass, USA, [2] J. Alvarez-Mena and O. Hernández-Lerma, Existence of nash equilibria for constrained stochastic games, Mathematical Methods of Operations Research, vol. 63, no. 2, pp , [3] E. Altman, K. Avrachenkov, R. Marquez, and G. Miller, Zerosum constrained stochastic games with independent state processes, Mathematical Methods of Operations Research, vol. 62, no. 3, pp , [4] E. Altman, K. Avrachenkov, N. onneau, M. Debbah, R. El-Azouzi, and D. S. Menasche, Constrained cost-coupled stochastic games with independent state processes, Operations Research Letters,vol.36,no.2,pp ,2008. [5] N. Shimkin, Stochastic games with average cost constraints, in Annals of the International Society of Dynamic Games, T.asar anda.haurie,eds.,vol.1ofadvances in Dynamic Games and Applications, irkhäauser, oston, Mass, USA, [6] J. Filar and K. Vrieze, Competitive Markov Decision Processes, Springer,NewYork,NY,USA,1996. [7] C. Alós-Ferrer and A.. Ania, Local equilibria in economic games, Economics Letters, vol. 70, no. 2, pp , [8] D. S. Hochbaum, Approximating clique and biclique problems, Algorithms, vol. 29, no. 1, pp , [9] M. L. Puterman, Markov Decision Processes: Discrete Stochastic Dynamic Programming, John Wiley & Sons, New York, NY, USA, 1994.
6 Advances in Operations Research Advances in Decision Sciences Applied Mathematics Algebra Probability and Statistics The Scientific World Journal International Differential Equations Submit your manuscripts at International Advances in Combinatorics Mathematical Physics Complex Analysis International Mathematics and Mathematical Sciences Mathematical Problems in Engineering Mathematics Discrete Mathematics Discrete Dynamics in Nature and Society Function Spaces Abstract and Applied Analysis International Stochastic Analysis Optimization
Research Article Existence of Periodic Positive Solutions for Abstract Difference Equations
Discrete Dynamics in Nature and Society Volume 2011, Article ID 870164, 7 pages doi:10.1155/2011/870164 Research Article Existence of Periodic Positive Solutions for Abstract Difference Equations Shugui
More informationResearch Article A Note on the Central Limit Theorems for Dependent Random Variables
International Scholarly Research Network ISRN Probability and Statistics Volume 22, Article ID 92427, pages doi:.542/22/92427 Research Article A Note on the Central Limit Theorems for Dependent Random
More informationResearch Article The Solution Set Characterization and Error Bound for the Extended Mixed Linear Complementarity Problem
Journal of Applied Mathematics Volume 2012, Article ID 219478, 15 pages doi:10.1155/2012/219478 Research Article The Solution Set Characterization and Error Bound for the Extended Mixed Linear Complementarity
More informationResearch Article Identifying a Global Optimizer with Filled Function for Nonlinear Integer Programming
Discrete Dynamics in Nature and Society Volume 20, Article ID 7697, pages doi:0.55/20/7697 Research Article Identifying a Global Optimizer with Filled Function for Nonlinear Integer Programming Wei-Xiang
More informationResearch Article Existence and Duality of Generalized ε-vector Equilibrium Problems
Applied Mathematics Volume 2012, Article ID 674512, 13 pages doi:10.1155/2012/674512 Research Article Existence and Duality of Generalized ε-vector Equilibrium Problems Hong-Yong Fu, Bin Dan, and Xiang-Yu
More informationZero-Sum Average Semi-Markov Games: Fixed Point Solutions of the Shapley Equation
Zero-Sum Average Semi-Markov Games: Fixed Point Solutions of the Shapley Equation Oscar Vega-Amaya Departamento de Matemáticas Universidad de Sonora May 2002 Abstract This paper deals with zero-sum average
More informationResearch Article Strong Convergence of Parallel Iterative Algorithm with Mean Errors for Two Finite Families of Ćirić Quasi-Contractive Operators
Abstract and Applied Analysis Volume 01, Article ID 66547, 10 pages doi:10.1155/01/66547 Research Article Strong Convergence of Parallel Iterative Algorithm with Mean Errors for Two Finite Families of
More informationResearch Article The Entanglement of Independent Quantum Systems
Advances in Mathematical Physics Volume 2012, Article ID 104856, 6 pages doi:10.1155/2012/104856 Research Article The Entanglement of Independent Quantum Systems Shuilin Jin 1 and Li Xu 2 1 Department
More informationResearch Article Remarks on Asymptotic Centers and Fixed Points
Abstract and Applied Analysis Volume 2010, Article ID 247402, 5 pages doi:10.1155/2010/247402 Research Article Remarks on Asymptotic Centers and Fixed Points A. Kaewkhao 1 and K. Sokhuma 2 1 Department
More informationResearch Article A Nice Separation of Some Seiffert-Type Means by Power Means
International Mathematics and Mathematical Sciences Volume 2012, Article ID 40692, 6 pages doi:10.1155/2012/40692 Research Article A Nice Separation of Some Seiffert-Type Means by Power Means Iulia Costin
More informationNoncooperative continuous-time Markov games
Morfismos, Vol. 9, No. 1, 2005, pp. 39 54 Noncooperative continuous-time Markov games Héctor Jasso-Fuentes Abstract This work concerns noncooperative continuous-time Markov games with Polish state and
More informationTotal Expected Discounted Reward MDPs: Existence of Optimal Policies
Total Expected Discounted Reward MDPs: Existence of Optimal Policies Eugene A. Feinberg Department of Applied Mathematics and Statistics State University of New York at Stony Brook Stony Brook, NY 11794-3600
More informationResearch Article Global Attractivity of a Higher-Order Difference Equation
Discrete Dynamics in Nature and Society Volume 2012, Article ID 930410, 11 pages doi:10.1155/2012/930410 Research Article Global Attractivity of a Higher-Order Difference Equation R. Abo-Zeid Department
More informationResearch Article A Fictitious Play Algorithm for Matrix Games with Fuzzy Payoffs
Abstract and Applied Analysis Volume 2012, Article ID 950482, 12 pages doi:101155/2012/950482 Research Article A Fictitious Play Algorithm for Matrix Games with Fuzzy Payoffs Emrah Akyar Department of
More informationConvex Sets Strict Separation. in the Minimax Theorem
Applied Mathematical Sciences, Vol. 8, 2014, no. 36, 1781-1787 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ams.2014.4271 Convex Sets Strict Separation in the Minimax Theorem M. A. M. Ferreira
More informationContinuous-Time Markov Decision Processes. Discounted and Average Optimality Conditions. Xianping Guo Zhongshan University.
Continuous-Time Markov Decision Processes Discounted and Average Optimality Conditions Xianping Guo Zhongshan University. Email: mcsgxp@zsu.edu.cn Outline The control model The existing works Our conditions
More informationResearch Article Convergence Theorems for Infinite Family of Multivalued Quasi-Nonexpansive Mappings in Uniformly Convex Banach Spaces
Abstract and Applied Analysis Volume 2012, Article ID 435790, 6 pages doi:10.1155/2012/435790 Research Article Convergence Theorems for Infinite Family of Multivalued Quasi-Nonexpansive Mappings in Uniformly
More informationResearch Article On Decomposable Measures Induced by Metrics
Applied Mathematics Volume 2012, Article ID 701206, 8 pages doi:10.1155/2012/701206 Research Article On Decomposable Measures Induced by Metrics Dong Qiu 1 and Weiquan Zhang 2 1 College of Mathematics
More informationResearch Article New Oscillation Criteria for Second-Order Neutral Delay Differential Equations with Positive and Negative Coefficients
Abstract and Applied Analysis Volume 2010, Article ID 564068, 11 pages doi:10.1155/2010/564068 Research Article New Oscillation Criteria for Second-Order Neutral Delay Differential Equations with Positive
More informationOPTIMALITY OF RANDOMIZED TRUNK RESERVATION FOR A PROBLEM WITH MULTIPLE CONSTRAINTS
OPTIMALITY OF RANDOMIZED TRUNK RESERVATION FOR A PROBLEM WITH MULTIPLE CONSTRAINTS Xiaofei Fan-Orzechowski Department of Applied Mathematics and Statistics State University of New York at Stony Brook Stony
More informationResearch Article Strong Convergence of a Projected Gradient Method
Applied Mathematics Volume 2012, Article ID 410137, 10 pages doi:10.1155/2012/410137 Research Article Strong Convergence of a Projected Gradient Method Shunhou Fan and Yonghong Yao Department of Mathematics,
More informationResearch Article Almost Sure Central Limit Theorem of Sample Quantiles
Advances in Decision Sciences Volume 202, Article ID 67942, 7 pages doi:0.55/202/67942 Research Article Almost Sure Central Limit Theorem of Sample Quantiles Yu Miao, Shoufang Xu, 2 and Ang Peng 3 College
More informationResearch Article Convex Polyhedron Method to Stability of Continuous Systems with Two Additive Time-Varying Delay Components
Applied Mathematics Volume 202, Article ID 689820, 3 pages doi:0.55/202/689820 Research Article Convex Polyhedron Method to Stability of Continuous Systems with Two Additive Time-Varying Delay Components
More informationResearch Article Multiple-Decision Procedures for Testing the Homogeneity of Mean for k Exponential Distributions
Discrete Dynamics in Nature and Society, Article ID 70074, 5 pages http://dx.doi.org/0.55/204/70074 Research Article Multiple-Decision Procedures for Testing the Homogeneity of Mean for Exponential Distributions
More informationResearch Article Normal and Osculating Planes of Δ-Regular Curves
Abstract and Applied Analysis Volume 2010, Article ID 923916, 8 pages doi:10.1155/2010/923916 Research Article Normal and Osculating Planes of Δ-Regular Curves Sibel Paşalı Atmaca Matematik Bölümü, Fen-Edebiyat
More informationResearch Article Ulam-Hyers-Rassias Stability of a Hyperbolic Partial Differential Equation
International Scholarly Research Network ISRN Mathematical Analysis Volume 212, Article ID 69754, 1 pages doi:1.542/212/69754 Research Article Ulam-Hyers-Rassias Stability of a Hyperbolic Partial Differential
More informationResearch Article Attracting Periodic Cycles for an Optimal Fourth-Order Nonlinear Solver
Abstract and Applied Analysis Volume 01, Article ID 63893, 8 pages doi:10.1155/01/63893 Research Article Attracting Periodic Cycles for an Optimal Fourth-Order Nonlinear Solver Mi Young Lee and Changbum
More informationValue Iteration and Action ɛ-approximation of Optimal Policies in Discounted Markov Decision Processes
Value Iteration and Action ɛ-approximation of Optimal Policies in Discounted Markov Decision Processes RAÚL MONTES-DE-OCA Departamento de Matemáticas Universidad Autónoma Metropolitana-Iztapalapa San Rafael
More informationChapter 9. Mixed Extensions. 9.1 Mixed strategies
Chapter 9 Mixed Extensions We now study a special case of infinite strategic games that are obtained in a canonic way from the finite games, by allowing mixed strategies. Below [0, 1] stands for the real
More informationResearch Article Translative Packing of Unit Squares into Squares
International Mathematics and Mathematical Sciences Volume 01, Article ID 61301, 7 pages doi:10.1155/01/61301 Research Article Translative Packing of Unit Squares into Squares Janusz Januszewski Institute
More informationStatic Model of Decision-making over the Set of Coalitional Partitions
Applied Mathematical ciences, Vol. 8, 2014, no. 170, 8451-8457 HIKARI Ltd, www.m-hiari.com http://dx.doi.org/10.12988/ams.2014.410889 tatic Model of Decision-maing over the et of Coalitional Partitions
More informationResearch Article The Dirichlet Problem on the Upper Half-Space
Abstract and Applied Analysis Volume 2012, Article ID 203096, 5 pages doi:10.1155/2012/203096 Research Article The Dirichlet Problem on the Upper Half-Space Jinjin Huang 1 and Lei Qiao 2 1 Department of
More information6.254 : Game Theory with Engineering Applications Lecture 8: Supermodular and Potential Games
6.254 : Game Theory with Engineering Applications Lecture 8: Supermodular and Asu Ozdaglar MIT March 2, 2010 1 Introduction Outline Review of Supermodular Games Reading: Fudenberg and Tirole, Section 12.3.
More informationA Note on Open Loop Nash Equilibrium in Linear-State Differential Games
Applied Mathematical Sciences, vol. 8, 2014, no. 145, 7239-7248 HIKARI Ltd, www.m-hikari.com http://dx.doi.org/10.12988/ams.2014.49746 A Note on Open Loop Nash Equilibrium in Linear-State Differential
More informationResearch Article Asymptotic Behavior of the Solutions of System of Difference Equations of Exponential Form
Difference Equations Article ID 936302 6 pages http://dx.doi.org/10.1155/2014/936302 Research Article Asymptotic Behavior of the Solutions of System of Difference Equations of Exponential Form Vu Van Khuong
More informationRobustness of policies in Constrained Markov Decision Processes
1 Robustness of policies in Constrained Markov Decision Processes Alexander Zadorojniy and Adam Shwartz, Senior Member, IEEE Abstract We consider the optimization of finite-state, finite-action Markov
More informationComputational Game Theory Spring Semester, 2005/6. Lecturer: Yishay Mansour Scribe: Ilan Cohen, Natan Rubin, Ophir Bleiberg*
Computational Game Theory Spring Semester, 2005/6 Lecture 5: 2-Player Zero Sum Games Lecturer: Yishay Mansour Scribe: Ilan Cohen, Natan Rubin, Ophir Bleiberg* 1 5.1 2-Player Zero Sum Games In this lecture
More informationResearch Article Constrained Solutions of a System of Matrix Equations
Journal of Applied Mathematics Volume 2012, Article ID 471573, 19 pages doi:10.1155/2012/471573 Research Article Constrained Solutions of a System of Matrix Equations Qing-Wen Wang 1 and Juan Yu 1, 2 1
More informationResearch Article On Prime Near-Rings with Generalized Derivation
International Mathematics and Mathematical Sciences Volume 2008, Article ID 490316, 5 pages doi:10.1155/2008/490316 Research Article On Prime Near-Rings with Generalized Derivation Howard E. Bell Department
More informationResearch Article Taylor s Expansion Revisited: A General Formula for the Remainder
International Mathematics and Mathematical Sciences Volume 2012, Article ID 645736, 5 pages doi:10.1155/2012/645736 Research Article Taylor s Expansion Revisited: A General Formula for the Remainder José
More informationResearch Article Parametric Evaluations of the Rogers-Ramanujan Continued Fraction
International Mathematics and Mathematical Sciences Volume 011, Article ID 940839, 11 pages doi:10.1155/011/940839 Research Article Parametric Evaluations of the Rogers-Ramanujan Continued Fraction Nikos
More informationZero-sum constrained stochastic games with independent state processes
Math. Meth. Oper. Res. (2005) 62: 375 386 DOI 10.1007/s00186-005-0034-4 ORIGINAL ARTICLE Eitan Altman Konstantin Avrachenkov Richard Marquez Gregory Miller Zero-sum constrained stochastic games with independent
More informationReductions Of Undiscounted Markov Decision Processes and Stochastic Games To Discounted Ones. Jefferson Huang
Reductions Of Undiscounted Markov Decision Processes and Stochastic Games To Discounted Ones Jefferson Huang School of Operations Research and Information Engineering Cornell University November 16, 2016
More informationResearch Article Implicative Ideals of BCK-Algebras Based on the Fuzzy Sets and the Theory of Falling Shadows
International Mathematics and Mathematical Sciences Volume 2010, Article ID 819463, 11 pages doi:10.1155/2010/819463 Research Article Implicative Ideals of BCK-Algebras Based on the Fuzzy Sets and the
More informationResearch Article Special Approach to Near Set Theory
Mathematical Problems in Engineering Volume 2011, Article ID 168501, 10 pages doi:10.1155/2011/168501 Research Article Special Approach to Near Set Theory M. E. Abd El-Monsef, 1 H. M. Abu-Donia, 2 and
More informationResearch Article Wave Scattering in Inhomogeneous Strings
International Scholarly Research Network ISRN Mathematical Analysis Volume 011, Article ID 64049, 14 pages doi:10.540/011/64049 Research Article Wave Scattering in Inhomogeneous Strings Nezam Iraniparast
More informationNear-Potential Games: Geometry and Dynamics
Near-Potential Games: Geometry and Dynamics Ozan Candogan, Asuman Ozdaglar and Pablo A. Parrilo January 29, 2012 Abstract Potential games are a special class of games for which many adaptive user dynamics
More informationLecture notes for Analysis of Algorithms : Markov decision processes
Lecture notes for Analysis of Algorithms : Markov decision processes Lecturer: Thomas Dueholm Hansen June 6, 013 Abstract We give an introduction to infinite-horizon Markov decision processes (MDPs) with
More informationSelecting Efficient Correlated Equilibria Through Distributed Learning. Jason R. Marden
1 Selecting Efficient Correlated Equilibria Through Distributed Learning Jason R. Marden Abstract A learning rule is completely uncoupled if each player s behavior is conditioned only on his own realized
More informationResearch Article Some Generalizations of Fixed Point Results for Multivalued Contraction Mappings
International Scholarly Research Network ISRN Mathematical Analysis Volume 2011, Article ID 924396, 13 pages doi:10.5402/2011/924396 Research Article Some Generalizations of Fixed Point Results for Multivalued
More informationResearch Article Quasilinearization Technique for Φ-Laplacian Type Equations
International Mathematics and Mathematical Sciences Volume 0, Article ID 975760, pages doi:0.55/0/975760 Research Article Quasilinearization Technique for Φ-Laplacian Type Equations Inara Yermachenko and
More informationZero-sum semi-markov games in Borel spaces: discounted and average payoff
Zero-sum semi-markov games in Borel spaces: discounted and average payoff Fernando Luque-Vásquez Departamento de Matemáticas Universidad de Sonora México May 2002 Abstract We study two-person zero-sum
More informationGame Theory. Professor Peter Cramton Economics 300
Game Theory Professor Peter Cramton Economics 300 Definition Game theory is the study of mathematical models of conflict and cooperation between intelligent and rational decision makers. Rational: each
More informationResearch Article k-tuple Total Domination in Complementary Prisms
International Scholarly Research Network ISRN Discrete Mathematics Volume 2011, Article ID 68127, 13 pages doi:10.502/2011/68127 Research Article k-tuple Total Domination in Complementary Prisms Adel P.
More informationResearch Article Solution of Fuzzy Matrix Equation System
International Mathematics and Mathematical Sciences Volume 2012 Article ID 713617 8 pages doi:101155/2012/713617 Research Article Solution of Fuzzy Matrix Equation System Mahmood Otadi and Maryam Mosleh
More informationResearch Article A Data Envelopment Analysis Approach to Supply Chain Efficiency
Advances in Decision Sciences Volume 2011, Article ID 608324, 8 pages doi:10.1155/2011/608324 Research Article A Data Envelopment Analysis Approach to Supply Chain Efficiency Alireza Amirteimoori and Leila
More informationResearch Article Some New Fixed-Point Theorems for a (ψ, φ)-pair Meir-Keeler-Type Set-Valued Contraction Map in Complete Metric Spaces
Applied Mathematics Volume 2011, Article ID 327941, 11 pages doi:10.1155/2011/327941 Research Article Some New Fixed-Point Theorems for a (ψ, φ)-pair Meir-Keeler-Type Set-Valued Contraction Map in Complete
More informationEquilibria in Games with Weak Payoff Externalities
NUPRI Working Paper 2016-03 Equilibria in Games with Weak Payoff Externalities Takuya Iimura, Toshimasa Maruta, and Takahiro Watanabe October, 2016 Nihon University Population Research Institute http://www.nihon-u.ac.jp/research/institute/population/nupri/en/publications.html
More informationResearch Article r-costar Pair of Contravariant Functors
International Mathematics and Mathematical Sciences Volume 2012, Article ID 481909, 8 pages doi:10.1155/2012/481909 Research Article r-costar Pair of Contravariant Functors S. Al-Nofayee Department of
More informationResearch Article Semi-Online Scheduling on Two Machines with GoS Levels and Partial Information of Processing Time
e Scientific World Journal Volume 04, Article ID 57634, 6 pages http://dx.doi.org/0.55/04/57634 Research Article Semi-Online Scheduling on Two Machines with GoS Levels and Partial Information of Processing
More informationPayoff Continuity in Incomplete Information Games
journal of economic theory 82, 267276 (1998) article no. ET982418 Payoff Continuity in Incomplete Information Games Atsushi Kajii* Institute of Policy and Planning Sciences, University of Tsukuba, 1-1-1
More informationResearch Article Approximation of Analytic Functions by Bessel s Functions of Fractional Order
Abstract and Applied Analysis Volume 20, Article ID 923269, 3 pages doi:0.55/20/923269 Research Article Approximation of Analytic Functions by Bessel s Functions of Fractional Order Soon-Mo Jung Mathematics
More informationLong-Run versus Short-Run Player
Repeated Games 1 Long-Run versus Short-Run Player a fixed simultaneous move stage game Player 1 is long-run with discount factor δ actions a A a finite set 1 1 1 1 2 utility u ( a, a ) Player 2 is short-run
More informationResearch Article On the Difference Equation x n 1 x n x n k / x n k 1 a bx n x n k
Abstract and Applied Analysis Volume 2012, Article ID 108047, 9 pages doi:10.1155/2012/108047 Research Article On the Difference Equation x n 1 x n x n k / x n k 1 a bx n x n k Stevo Stević, 1 Josef Diblík,
More informationResearch Article Hybrid Algorithm of Fixed Point for Weak Relatively Nonexpansive Multivalued Mappings and Applications
Abstract and Applied Analysis Volume 2012, Article ID 479438, 13 pages doi:10.1155/2012/479438 Research Article Hybrid Algorithm of Fixed Point for Weak Relatively Nonexpansive Multivalued Mappings and
More informationResearch Article An Iterative Algorithm for the Split Equality and Multiple-Sets Split Equality Problem
Abstract and Applied Analysis, Article ID 60813, 5 pages http://dx.doi.org/10.1155/014/60813 Research Article An Iterative Algorithm for the Split Equality and Multiple-Sets Split Equality Problem Luoyi
More informationResearch Article Uniqueness Theorems on Difference Monomials of Entire Functions
Abstract and Applied Analysis Volume 202, Article ID 40735, 8 pages doi:0.55/202/40735 Research Article Uniqueness Theorems on Difference Monomials of Entire Functions Gang Wang, Deng-li Han, 2 and Zhi-Tao
More informationResearch Article On Maximal and Minimal Fuzzy Sets in I-Topological Spaces
International Mathematics and Mathematical Sciences Volume 2010, Article ID 180196, 11 pages doi:10.1155/2010/180196 Research Article On Maximal and Minimal Fuzzy Sets in I-Topological Spaces Samer Al
More informationResearch Article Fast Constructions of Quantum Codes Based on Residues Pauli Block Matrices
Advances in Mathematical Physics Volume 2010, Article ID 469124, 12 pages doi:10.1155/2010/469124 Research Article Fast Constructions of Quantum Codes Based on Residues Pauli Block Matrices Ying Guo, Guihu
More informationResearch Article Sufficient Optimality and Sensitivity Analysis of a Parameterized Min-Max Programming
Applied Mathematics Volume 2012, Article ID 692325, 9 pages doi:10.1155/2012/692325 Research Article Sufficient Optimality and Sensitivity Analysis of a Parameterized Min-Max Programming Huijuan Xiong,
More information6.254 : Game Theory with Engineering Applications Lecture 7: Supermodular Games
6.254 : Game Theory with Engineering Applications Lecture 7: Asu Ozdaglar MIT February 25, 2010 1 Introduction Outline Uniqueness of a Pure Nash Equilibrium for Continuous Games Reading: Rosen J.B., Existence
More informationResearch Article Merrifield-Simmons Index in Random Phenylene Chains and Random Hexagon Chains
Discrete Dynamics in Nature and Society Volume 015, Article ID 56896, 7 pages http://dx.doi.org/10.1155/015/56896 Research Article Merrifield-Simmons Index in Random Phenylene Chains and Random Hexagon
More informationResearch Article Fuzzy Soft Multiset Theory
Abstract and Applied Analysis Volume 22 Article ID 6 2 pages doi:/22/6 Research Article Fuzzy Soft Multiset Theory Shawkat Alkhazaleh and Abdul Razak Salleh School of Mathematical Sciences Faculty of Science
More informationContinuity of equilibria for two-person zero-sum games with noncompact action sets and unbounded payoffs
DOI 10.1007/s10479-017-2677-y FEINBERG: PROBABILITY Continuity of equilibria for two-person zero-sum games with noncompact action sets and unbounded payoffs Eugene A. Feinberg 1 Pavlo O. Kasyanov 2 Michael
More informationCorrespondence should be addressed to Abasalt Bodaghi;
Function Spaces, Article ID 532463, 5 pages http://dx.doi.org/0.55/204/532463 Research Article Approximation on the Quadratic Reciprocal Functional Equation Abasalt Bodaghi and Sang Og Kim 2 Department
More informationResearch Article A Partial Backlogging Inventory Model for Deteriorating Items with Fluctuating Selling Price and Purchasing Cost
Advances in Operations Research Volume 2012, Article ID 385371, 15 pages doi:10.1155/2012/385371 Research Article A Partial Backlogging Inventory Model for Deteriorating Items with Fluctuating Selling
More informationResearch Article Solvability of a Class of Integral Inclusions
Abstract and Applied Analysis Volume 212, Article ID 21327, 12 pages doi:1.1155/212/21327 Research Article Solvability of a Class of Integral Inclusions Ying Chen and Shihuang Hong Institute of Applied
More informationPure Strategy Equilibria in Symmetric Two-Player Zero-Sum Games
Pure Strategy Equilibria in Symmetric Two-Player Zero-Sum Games Peter Duersch Jörg Oechssler Burkhard C. Schipper November 22, 2010 Abstract We show that a symmetric two-player zero-sum game has a pure
More informationResearch Article On New Wilker-Type Inequalities
International Scholarly Research Network ISRN Mathematical Analysis Volume 2011, Article ID 681702, 7 pages doi:10.5402/2011/681702 Research Article On New Wilker-Type Inequalities Zhengjie Sun and Ling
More informationResearch Article Some New Explicit Values of Quotients of Ramanujan s Theta Functions and Continued Fractions
International Mathematics and Mathematical Sciences Article ID 534376 7 pages http://dx.doi.org/0.55/04/534376 Research Article Some New Explicit Values of Quotients of Ramanujan s Theta Functions and
More informationCorrespondence should be addressed to Serap Bulut;
The Scientific World Journal Volume 201, Article ID 17109, 6 pages http://dx.doi.org/10.1155/201/17109 Research Article Coefficient Estimates for Initial Taylor-Maclaurin Coefficients for a Subclass of
More informationResearch Article A Characterization of E-Benson Proper Efficiency via Nonlinear Scalarization in Vector Optimization
Applied Mathematics, Article ID 649756, 5 pages http://dx.doi.org/10.1155/2014/649756 Research Article A Characterization of E-Benson Proper Efficiency via Nonlinear Scalarization in Vector Optimization
More informationResearch Article A Constructive Analysis of Convex-Valued Demand Correspondence for Weakly Uniformly Rotund and Monotonic Preference
Advances in Decision Sciences Volume 2011, Article ID 960819, 7 pages doi:10.1155/2011/960819 Research Article A Constructive Analysis of Convex-Valued Demand Correspondence for Weakly Uniformly Rotund
More informationResearch Article A New Class of Meromorphically Analytic Functions with Applications to the Generalized Hypergeometric Functions
Abstract and Applied Analysis Volume 20, Article ID 59405, 0 pages doi:0.55/20/59405 Research Article A New Class of Meromorphically Analytic Functions with Applications to the Generalized Hypergeometric
More informationResearch Article On Factorizations of Upper Triangular Nonnegative Matrices of Order Three
Discrete Dynamics in Nature and Society Volume 015, Article ID 96018, 6 pages http://dx.doi.org/10.1155/015/96018 Research Article On Factorizations of Upper Triangular Nonnegative Matrices of Order Three
More informationResearch Article λ, μ -Fuzzy Version of Ideals, Interior Ideals, Quasi-Ideals, and Bi-Ideals
Applied Mathematics Volume 2012, Article ID 425890, 7 pages doi:10.1155/2012/425890 Research Article λ, μ -Fuzzy Version of Ideals, Interior Ideals, Quasi-Ideals, and Bi-Ideals Yuming Feng 1, 2 and P.
More informationAction-Independent Subjective Expected Utility without States of the World
heoretical Economics Letters, 013, 3, 17-1 http://dxdoiorg/10436/tel01335a004 Published Online September 013 (http://wwwscirporg/journal/tel) Action-Independent Subjective Expected Utility without States
More informationResearch Article Applications of Differential Subordination for Argument Estimates of Multivalent Analytic Functions
Abstract and Applied Analysis Volume 04, Article ID 63470, 4 pages http://dx.doi.org/0.55/04/63470 search Article Applications of Differential Subordination for Argument Estimates of Multivalent Analytic
More informationStochastic Dynamic Programming. Jesus Fernandez-Villaverde University of Pennsylvania
Stochastic Dynamic Programming Jesus Fernande-Villaverde University of Pennsylvania 1 Introducing Uncertainty in Dynamic Programming Stochastic dynamic programming presents a very exible framework to handle
More informationResearch Article A New Fractional Integral Inequality with Singularity and Its Application
Abstract and Applied Analysis Volume 212, Article ID 93798, 12 pages doi:1.1155/212/93798 Research Article A New Fractional Integral Inequality with Singularity and Its Application Qiong-Xiang Kong 1 and
More information1 Stochastic Dynamic Programming
1 Stochastic Dynamic Programming Formally, a stochastic dynamic program has the same components as a deterministic one; the only modification is to the state transition equation. When events in the future
More informationResearch Article Generalized α-ψ Contractive Type Mappings and Related Fixed Point Theorems with Applications
Abstract and Applied Analysis Volume 01, Article ID 793486, 17 pages doi:10.1155/01/793486 Research Article Generalized α-ψ Contractive Type Mappings and Related Fixed Point Theorems with Applications
More informationResearch Article Improved Estimators of the Mean of a Normal Distribution with a Known Coefficient of Variation
Probability and Statistics Volume 2012, Article ID 807045, 5 pages doi:10.1155/2012/807045 Research Article Improved Estimators of the Mean of a Normal Distribution with a Known Coefficient of Variation
More informationResearch Article On the Modified q-bernoulli Numbers of Higher Order with Weight
Abstract and Applied Analysis Volume 202, Article ID 948050, 6 pages doi:0.55/202/948050 Research Article On the Modified -Bernoulli Numbers of Higher Order with Weight T. Kim, J. Choi, 2 Y.-H. Kim, 2
More informationResearch Article Sunlet Decomposition of Certain Equipartite Graphs
International Combinatorics Volume 2013, Article ID 907249, 4 pages http://dx.doi.org/10.1155/2013/907249 Research Article Sunlet Decomposition of Certain Equipartite Graphs Abolape D. Akwu 1 and Deborah
More informationAlgorithmic Game Theory and Applications. Lecture 15: a brief taster of Markov Decision Processes and Stochastic Games
Algorithmic Game Theory and Applications Lecture 15: a brief taster of Markov Decision Processes and Stochastic Games Kousha Etessami warning 1 The subjects we will touch on today are so interesting and
More informationA Reinforcement Learning (Nash-R) Algorithm for Average Reward Irreducible Stochastic Games
Learning in Average Reward Stochastic Games A Reinforcement Learning (Nash-R) Algorithm for Average Reward Irreducible Stochastic Games Jun Li Jun.Li@warnerbros.com Kandethody Ramachandran ram@cas.usf.edu
More informationResearch Article Modified T-F Function Method for Finding Global Minimizer on Unconstrained Optimization
Mathematical Problems in Engineering Volume 2010, Article ID 602831, 11 pages doi:10.1155/2010/602831 Research Article Modified T-F Function Method for Finding Global Minimizer on Unconstrained Optimization
More informationResearch Article A New Class of Meromorphic Functions Associated with Spirallike Functions
Applied Mathematics Volume 202, Article ID 49497, 2 pages doi:0.55/202/49497 Research Article A New Class of Meromorphic Functions Associated with Spirallike Functions Lei Shi, Zhi-Gang Wang, and Jing-Ping
More informationGame theory Lecture 19. Dynamic games. Game theory
Lecture 9. Dynamic games . Introduction Definition. A dynamic game is a game Γ =< N, x, {U i } n i=, {H i } n i= >, where N = {, 2,..., n} denotes the set of players, x (t) = f (x, u,..., u n, t), x(0)
More information