Generalized Riccati Equations Arising in Stochastic Games

Similar documents
Tilburg University. An equivalence result in linear-quadratic theory van den Broek, W.A.; Engwerda, Jacob; Schumacher, Hans. Published in: Automatica

Research Article Indefinite LQ Control for Discrete-Time Stochastic Systems via Semidefinite Programming

On the linear quadratic optimization problems and associated Riccati equations for systems modeled by Ito linear differential equations

Observer design for a general class of triangular systems

Yuefen Chen & Yuanguo Zhu

ESTIMATES ON THE NUMBER OF LIMIT CYCLES OF A GENERALIZED ABEL EQUATION

ON ADAPTIVE CONTROL FOR THE CONTINUOUS TIME-VARYING JLQG PROBLEM

On Irregular Linear Quadratic Control: Deterministic Case

The Solvability Conditions for the Inverse Eigenvalue Problem of Hermitian and Generalized Skew-Hamiltonian Matrices and Its Approximation

On the decay of elements of inverse triangular Toeplitz matrix

Iterative Solution of a Matrix Riccati Equation Arising in Stochastic Control

On Solvability of an Indefinite Riccati Equation

Noncooperative continuous-time Markov games

Consensus Control of Multi-agent Systems with Optimal Performance

SOME NEW EXAMPLES OF INFINITE IMAGE PARTITION REGULAR MATRICES

Lecture notes: Applied linear algebra Part 1. Version 2

CONTINUITY OF SOLUTIONS OF RICCATI EQUATIONS FOR THE DISCRETE TIME JLQP

Chapter III. Stability of Linear Systems

Spectral Properties of Matrix Polynomials in the Max Algebra

H 1 optimisation. Is hoped that the practical advantages of receding horizon control might be combined with the robustness advantages of H 1 control.

AN ADDENDUM TO THE PROBLEM OF STOCHASTIC OBSERVABILITY

adopt the usual symbols R and C in order to denote the set of real and complex numbers, respectively. The open and closed right half-planes of C are d

w T 1 w T 2. w T n 0 if i j 1 if i = j

A New Algorithm for Solving Cross Coupled Algebraic Riccati Equations of Singularly Perturbed Nash Games

QUALITATIVE CONTROLLABILITY AND UNCONTROLLABILITY BY A SINGLE ENTRY

Alternative Characterization of Ergodicity for Doubly Stochastic Chains

On Euclidean distance matrices

Operators with Compatible Ranges

LOCAL TIMES OF RANKED CONTINUOUS SEMIMARTINGALES

Elementary linear algebra

LINEAR-CONVEX CONTROL AND DUALITY

Homogeneous Linear Systems and Their General Solutions

On Backward Product of Stochastic Matrices

c 1998 Society for Industrial and Applied Mathematics

On the adjacency matrix of a block graph

Z i Q ij Z j. J(x, φ; U) = X T φ(t ) 2 h + where h k k, H(t) k k and R(t) r r are nonnegative definite matrices (R(t) is uniformly in t nonsingular).

Scattered Data Interpolation with Polynomial Precision and Conditionally Positive Definite Functions

1 Lyapunov theory of stability

A note on linear differential equations with periodic coefficients.

Linear Quadratic Zero-Sum Two-Person Differential Games Pierre Bernhard June 15, 2013

Linear ODEs. Existence of solutions to linear IVPs. Resolvent matrix. Autonomous linear systems

A matrix over a field F is a rectangular array of elements from F. The symbol

Robotics. Control Theory. Marc Toussaint U Stuttgart

Deterministic Dynamic Programming

Linear-Quadratic Optimal Control: Full-State Feedback

Singular Value Inequalities for Real and Imaginary Parts of Matrices

Periodic solutions of weakly coupled superlinear systems

Analytic Number Theory Solutions

arxiv:math/ v1 [math.fa] 26 Oct 1993

6.241 Dynamic Systems and Control

Newtonian Mechanics. Chapter Classical space-time

and the nite horizon cost index with the nite terminal weighting matrix F > : N?1 X J(z r ; u; w) = [z(n)? z r (N)] T F [z(n)? z r (N)] + t= [kz? z r

Linear-Quadratic Stochastic Differential Games with General Noise Processes

1. Find the solution of the following uncontrolled linear system. 2 α 1 1

ON SUM OF SQUARES DECOMPOSITION FOR A BIQUADRATIC MATRIX FUNCTION

A Concise Course on Stochastic Partial Differential Equations

Weak and strong convergence theorems of modified SP-iterations for generalized asymptotically quasi-nonexpansive mappings

USE OF SEMIDEFINITE PROGRAMMING FOR SOLVING THE LQR PROBLEM SUBJECT TO RECTANGULAR DESCRIPTOR SYSTEMS

Banach Journal of Mathematical Analysis ISSN: (electronic)

Review of Basic Concepts in Linear Algebra

Functions of Several Variables

Hegselmann-Krause Dynamics: An Upper Bound on Termination Time

EE363 homework 2 solutions

ELEMENTARY LINEAR ALGEBRA

Convergence to Common Fixed Point for Two Asymptotically Quasi-nonexpansive Mappings in the Intermediate Sense in Banach Spaces

Dyadic diaphony of digital sequences

Time-Invariant Linear Quadratic Regulators!

Applied Mathematics Letters. Comparison theorems for a subclass of proper splittings of matrices

Strong Convergence Theorem by a Hybrid Extragradient-like Approximation Method for Variational Inequalities and Fixed Point Problems

ON COMPLEMENTED SUBSPACES OF SUMS AND PRODUCTS OF BANACH SPACES

Lecture Notes of EE 714

Stability of Feedback Solutions for Infinite Horizon Noncooperative Differential Games

Strictly positive definite functions on a real inner product space

NORMS ON SPACE OF MATRICES

7 Planar systems of linear ODE

Linear Algebra Solutions 1

Recursive Solutions of the Matrix Equations X + A T X 1 A = Q and X A T X 1 A = Q

Abstract. In this article, several matrix norm inequalities are proved by making use of the Hiroshima 2003 result on majorization relations.

ELA THE MINIMUM-NORM LEAST-SQUARES SOLUTION OF A LINEAR SYSTEM AND SYMMETRIC RANK-ONE UPDATES

Exercise Solutions to Functional Analysis

L 2 -induced Gains of Switched Systems and Classes of Switching Signals

Strictly Positive Definite Functions on a Real Inner Product Space

Prediction-based adaptive control of a class of discrete-time nonlinear systems with nonlinear growth rate

On the Skeel condition number, growth factor and pivoting strategies for Gaussian elimination

Properties of Matrices and Operations on Matrices

Switching, sparse and averaged control

Observability and forward-backward observability of discrete-time nonlinear systems

Math 350 Fall 2011 Notes about inner product spaces. In this notes we state and prove some important properties of inner product spaces.

Computations of Critical Groups at a Degenerate Critical Point for Strongly Indefinite Functionals

ISOLATED SEMIDEFINITE SOLUTIONS OF THE CONTINUOUS-TIME ALGEBRAIC RICCATI EQUATION

Discrete Population Models with Asymptotically Constant or Periodic Solutions

MATH 326: RINGS AND MODULES STEFAN GILLE

Application of Measure of Noncompactness for the System of Functional Integral Equations

On the Hermitian solutions of the

BESSEL MATRIX DIFFERENTIAL EQUATIONS: EXPLICIT SOLUTIONS OF INITIAL AND TWO-POINT BOUNDARY VALUE PROBLEMS

On the Simulatability Condition in Key Generation Over a Non-authenticated Public Channel

Hamilton Cycles in Digraphs of Unitary Matrices

Basic Concepts in Linear Algebra

Research Article Stabilization Analysis and Synthesis of Discrete-Time Descriptor Markov Jump Systems with Partially Unknown Transition Probabilities

Linear Quadratic Zero-Sum Two-Person Differential Games

Transcription:

Generalized Riccati Equations Arising in Stochastic Games Michael McAsey a a Department of Mathematics Bradley University Peoria IL 61625 USA mcasey@bradley.edu Libin Mou b b Department of Mathematics Bradley University Peoria IL 61625 USA mou@bradley.edu Corresponding author: Michael McAsey, Department of Mathematics, Bradley University, Peoria IL 61625, USA, email: mcasey@bradley.edu, phone: 309-677- 2491, FAX: 309-677-3999 Abstract. We study a class of rational matrix differential equations that generalize the Riccati differential equations. The generalization involves replacing positive definite weighting matrices in the usual Riccati equations with either semidefinite or indefinite matrices that arise in linear quadratic control problems and differential games both stochastic and deterministic. The purpose of this paper is to prove some fundamental properties such as comparison, monotonicity and existence theorems. These properties are well-known for classical Riccati differential equations when certain matrices are assumed definite. As applications, we obtain conditions for the existence of solutions to the algebraic Riccati equation and to equations with periodic coefficients. 1 Introduction We first state the problem to be considered and then discuss its motivation. Let A, B i, N, R ij and S i (i = 1, 2) be bounded measurable matrix functions on [0, T ] with appropriate dimensions as described below in (6). Let S n be the set of all symmetric n n matrices. Let Π = (π ij ) 0 i,j 2 be an operator on [0, T ] S n (see (7)). We consider the following rational matrix equation (or generalized Riccati equation) for a differentiable symmetric matrix function P (t), 0 t T : P (t) + N (t, P (t)) S (t, P (t))r(t, P (t)) S(t, P (t)) = 0; (1.1) P (T ) = G S n ; Range[S(t, P (t))] Range[R(t, P (t))]; (1.2) R 11 (t) + π 11 (t, P (t)) 0; R 22 (t) + π 22 (t, P (t)) 0, (1.3) (1) 1

where P (t) = d dt P (t), ( ) represents matrix pseudoinverse, ( ) represents matrix transpose, and N (t, P ) = P A (t) + A(t)P + π 00 (t, P ) + N(t); ( ) B S(t, P ) = 1 (t)p + π 10 (t, P ) + S 1 (t) B 2(t)P ; + π 20 (t, P ) + S 2 (t) ( ) R11 (t) + π R(t, P ) = 11 (t, P ) R 12 (t) + π 12 (t, P ). R 21 (t) + π 21 (t, P ) R 22 (t) + π 22 (t, P ) A typical problem where equation (1) arises is a linear quadratic (LQ) stochastic zero-sum game. Specifically, let π ij (t, P ) = D i (t)p D j(t) for i = 0, 1, 2, where D 0, D 1 and D 2 are bounded measurable matrix functions on [0, T ] with appropriate dimensions. Then equation (1) is the Riccati equation induced from a stochastic zero-sum game with the following linear state equation and quadratic index. dx = (Ax + B 1 u 1 + B 2 u 2 )dt + (D 0 x + D 1 u 1 + D 2 u 2 )dw (t), for 0 t T ; x(0) = x 0 J(u 1, u 2 ) = E[x (T )Gx(T )] + E T 0 [x Nx +2x (S 1u 1 + S 2u 2 ) + u 1R 11 u 1 + 2u 1R 12 u 2 + u 2R 22 u 2 ]dt where W is a standard Brownian motion over a probability space, x R n is the state variable, u 1 R k1 and u 2 R k2 are control variables taken by two players, and E[ ] is the expectation; see [1] (for LQ deterministic games) or [2]. In fact, it has recently been proved by the second author and J.M. Yong that equation (1) with π ij (t, P ) = D i (t)p D j(t) is equivalent to LQ game problems under appropriate conditions. This shows that equation (1) plays a central role in LQ games. It is worth pointing out that while the LQ game is stochastic (with deterministic coefficients), equation (1) is completely deterministic. Equation (1) is quite general. First, the weighting matrix R(t, P ) is typically indefinite, due to condition (1.3). This is in contrast to Riccati differential equations arising from LQ control problems, where the weighting matrices are necessarily semidefinite. For example, a minimization LQ stochastic control problem can be considered as a special game (3) with u 2 restricted to be 0 (i.e. k 2 = 0). In this case, R(t, P (t)) = R 11 (t) + D 1P D 1 and a necessary condition for the existence of an optimal control is R(t, P (t)) 0. Note that when D 1 0 (stochastic LQ controls), this condition may be satisfied even if R 11 (t) (the weight matrix of the control u 1 in the index J) is indefinite; see [3], [4], [5], [6], [7], and [8]. However, when D 1 = 0 (deterministic LQ controls), it is necessary that R(t, P (t)) = R 11 (t) 0. In addition, R(t, P (t)) is allowed to be singular here, while most of existing literature assumes that R(t, P (t)) is nonsingular. Finally, with the operator Π(, ), equation (1) contains Riccati equations arising from LQ controls and games with stochastic parameters having jumps; see [9], [10], [11] for such examples from LQ controls. (2) (3) 2

The purpose of this paper is to establish the fundamental properties for equation (1), including comparison and monotonicity theorems and conditions for existence of solutions. These properties form the basis for solving the equation and the problems where the equation arises. Specifically, in Section 3, we will study the structure of (1) and obtain a useful representation and interpretations for (1). In Section 4, we prove comparison theorems for solutions of (1), generalizing the related results in [10], [11], [12], [13], [14], [15], [16], [17] for special cases of (1). As an application of the comparison theorems, in Section 5 we give a necessary and sufficient condition for the existence of solutions to (1) satisfying a strengthened version of (1.3). As another application of the comparison theorems, we study two interesting special cases in Section 6, where we prove the monotonicity of solutions to (1) in case the coefficients are either constant (time-invariant) or periodic. Consequently, we obtain some necessary and sufficient conditions for the existence of a solution to the algebraic Riccati equation; i.e. a constant solution to (1.1). Finally, we give conditions guaranteeing periodic solutions to (1). 2 Assumptions In this section we state the assumptions for equation (1). Note that the block matrices in (2) are linear in P (t) and we have ( ) N (t, P ) S (t, P ) (4) S(t, P ) R(t, P ) A (t)p + P A(t) P B 1 (t) P B 2 (t) = T (t) + Π(t, P ) + B 1(t)P 0 0 B 2(t)P 0 0 where Π(, ) = (π ij (, )) 0 i,j 2, and T (t) = N(t) S 1(t) S 2(t) S 1 (t) R 11 (t) R 12 (t). (5) S 2 (t) R 21 (t) R 22 (t) We will state the assumptions for equation (1) in terms of the quadruple (A, B, T, Π) that determines (1). Before we continue, let us make some comments on notations. In the rest of paper, we will often suppress the variable t. To indicate that a definition or relationship holds for all t [0, T ], we will sometimes replace (t) by ( ) and drop the phrase for for all t [0, T ]. For example, we will write P ( ) 0 for P (t) 0 for all t [0, T ]. For a Hilbert space H, we denote by L (0, T ; H) the space of bounded measurable functions with values in H, and by L 1, (0, T ; H) the space of functions P L (0, T ; H) with dp dt L (0, T ; H). The notation R k n will denote the k n matrices over the reals. We make two standing assumptions on the quadruple (A, B, T, Π). Let n, k 1 and k 2 be nonnegative integers. 3

(A1). The matrix functions A, B i, T satisfy A L (0, T ; R n n ); B i L (0, T ; R n ki ); S i L (0, T ; R ki n ); R ij = R ji L (0, T ; R ki kj ); N L (0, T ; S n ); G S n. (6) (A2). The operator Π : [0, T ] S n S n+k1+k2 is Lipschitz and monotonically increasing. In other words, there exists a constant L such that for all t [0, T ] and P, Q S n, Π(t, P ) Π(t, Q) L P Q, if P Q, then Π(t, P ) Π(t, Q), (7) where is a norm on S n or S n+k1+k2. The range hypothesis (1.2) will be used in two slightly different ways. First, S(t, P (t)) and R(t, P (t)) are (k 1 + k 2 ) n and (k 1 + k 2 ) (k 1 + k 2 ) matrices, respectively. As such they act on vectors in R n and R k1+k2 both resulting in vectors in R k1+k2. (This viewpoint will be used in the proof of Proposition 1.) Second, S(t, P (t)) and R(t, P (t)) can be thought of as acting by multiplication on n (k 1 + k 2 ) and (k 1 + k 2 ) (k 1 + k 2 ) matrices, respectively, thus resulting in (k 1 + k 2 ) (k 1 + k 2 ) matrices. (This will be used in (16).) So hypothesis (1.2) makes sense in either interpretation. 3 Algebraic Descriptions of Riccati Equation (1) In this section we give a representation and some interpretations for equation (1). The representation is important to the proofs of our main results. We will assume that (A, B, T ) satisfies (A1) and Π satisfies (A2), and P L 1, (0, T ; S n ). We first recall some properties of the matrix pseudoinverse [18]. For M R k n, there exists a unique generalized inverse matrix M R n k that has the following properties: MM M = M, M MM = M, (MM ) = MM, (M M) = M M. (8) In addition, if M = M, then M = (M ), M 0 M 0, M M = MM. Conditions (1.1)-(1.3) in equation (1) have some algebraic interpretations. Consider the following quadratic form q of (x, u 1, u 2 ) R n R k1 R k2. ( ) ( ( ) x N S x q(x, u 1, u 2 ) = (9) u S R) u = x N x + 2x Su + u Ru, 4

where ( ) N S n, R S k1+k2 and S R (k1+k2) n are constant matrices and u = u1. In the following, we will sometimes use the row notation u = (u u 1, u 2 ), 2 for an element of R k1 R k2, and write q(x, u 1, u 2 ) = q(x, u). The notation is chosen here to mimic that in equation (1). Proposition 1 The following two statements are equivalent. (1). N, R and S satisfy the conditions N S R S = 0, (10.1) Range[S] Range[R], (10.2) R 11 0, R 22 0. (10.3) (10) (2) For every x R n, the quadratic function q(x, u 1, u 2 ) has a saddle point (û 1, û 2 ) R k1 R k2 with value 0, that is, for all (u 1, u 2 ) R k1 R k2, we have q(x, û 1, u 2 ) 0 q(x, u 1, û 2 ). (11) ) (û1 Proof. We first prove (2) (1). Suppose (û 1, û 2 ) satisfies (11), then û = is a critical point (depending on x) of q(x, u 1, u 2 ) in (9). So differentiating (9) with respect to u, we have Rû + Sx = 0. (12) Since x is arbitrary, (10.2) holds. Condition (10.3) follows from that (11) holds for all (u 1, u 2 ) R k1 R k2. We can rewrite quadratic q(x, u 1, u 2 ) as a sum of squares: q(x, u 1, u 2 ) = x (N S R S)x + (u û) R(u û). (13) Indeed, by using (12): Rû = Sx and (8): R = RR R, we have (Rû) = x S, and so (u û) R(u û) =u Ru 2u Rû + û Rû =u Ru + 2u Sx + (Rû) R Rû =u Ru + 2u Sx + x S R Sx. So (13) follows by recalling the definition of q(x, u 1, u 2 ). By (11) and (13), 0 = q(x, û 1, û 2 ) = x (N S R S)x for all x. So (10.1) holds. This finishes the proof of sufficiency. Now we show that (1) (2). Condition (10.2) implies that (12) has a solution û R (k1+k2) for every x R n. By (13) and (10.1) we have Then condition (10.3) implies (11). q(x, u 1, u 2 ) = (u û) R(u û). û 2 5

Algebraic Interpretations of Riccati Equations. Now we apply Proposition 1 to ( ) ( ) N S P (t) + N (t, P (t)) S = (t, P (t)), (14) S R S(t, P (t)) R(t, P (t)) ( ) u1 which defines the following quadratic of u = R u k1+k2 for each (t, x) 2 [0, T ] R n : x [ P (t) + N (t, P (t))]x + 2x S(t, P (t))u + u R(t, P (t))u. (15) We obtain the following interpretations of equation (1) for each (t, x) [0, T ] R n : (1) Condition (1.2) is equivalent to: quadratic (15) has a critical point û. (2) Condition (1.3) is equivalent to: a critical point of (15) is a saddle point. (3) Condition (1.1) says that the critical value is 0. Condition (10.2) is also equivalent to the statement that the matrix equation RK + S = 0 (16) has a solution K R (k1+k2) n. In this case, û = Kx satisfies (12) for every x R n. In fact, we can show that R S is such a solution. Indeed, since S = R K and (8): RR R = R, we have R { R S } = RR R K = R K = S. Let K R (k1+k2) n be arbitrary and u = Kx. Then q(x, u) = q(x, Kx) = x [N + 2K S + K RK]x. On the other hand, q(x, Kx) is represented by (13) with û = Kx. Since x R n is arbitrary, we obtain the following identity for K R (k1+k2) n, which may be verified directly. N (P ) S (P )R(P ) S(P ) + (K K) R(P )(K K) ( ) ( ( ) = N + 2K S + K I N S I RK =. K S R) K (17) Now apply (17) to the matrix in (14) for every (t, x) [0, T ] R n. The decomposition (4) implies that ( ) ( ) ( ) ( ) I N S ( ) ( ) I I P (t) + N (t, P (t)) S = (t, P (t)) I K S R K K S(t, P (t)) R(t, P (t)) K (18) = P (t) + Q 0 (t, K) + L(t, K; P (t)) where Q 0 (t, K) is a quadratic in K and L(t, K; P ) is linear in P, defined as follows. ( ) ( ) I I Q 0 (t, K) = T (t), K K ( ) ( ) I I L(t, K; P ) = Π(t, P ) + [A(t) + B(t)K] P + P [A(t) + B(t)K], K K (19) 6

where B = (B 1, B 2 ) and T is defined in (5). From (17) and (18), we obtain the following representations of (1). Proposition 2 Suppose for each t [0, T ], P (t) L 1, (0, T ; S n ) satisfies (1.2). Then for each K R (k1+k2) n, letting K(t, P ) = R (t, P )S(t, P ), we have P (t) + N (t, P (t)) S (t, P (t))r(t, P (t)) S(t, P (t)) = P (t) + Q 0 (t, K) + L(t, K; P ) (K K(t, P (t))) R(t, P (t))(k K(t, P (t))). 4 Comparison Theorems Comparison theorems are a fundamental part of the theory of Riccati equations. Such theorems are a basis for proving the existence and other properties of Riccati equations. For special cases of equation (1) (as mentioned in 1), comparison theorems have been proved in many papers. See [10], [11], [13], [14], [15], [17], [19], [20], [21] and references therein for more exposition on comparison theorems and some of their applications. We prove two comparison theorems (essentially equivalent) for equation (1). First consider an equation of the form (20) P (t) + A (t)p (t) + P (t)a(t) + π(t, P (t)) + N(t) = 0, P (T ) = G, (21) where the operator π(, ) : [0, T ] S n S n. We will write π(, ) 0 to mean that π(t, P ) 0 for all (t, P ) [0, T ] S n with P 0. Proposition 3 Let A, G, N be as in (6) and π(, ) be Lipschitz. Then equation (21) has a unique solution P L 1, (0, T ; S n ). In addition, if π(, ) 0, G 0 and N( ) 0, then P ( ) 0. Furthermore, if π(, ) 0, and either G > 0 or N( ) > 0, then P ( ) > 0. Proof. Following the idea in [17] we let Φ(t, s) be the fundamental matrix of A, that is, Φ(t, s) 1 = Φ(s, t) and tφ(s, t) = A(t)Φ(s, t). It follows that (21) is equivalent to T P (t) = Φ(T, t) GΦ(T, t) + Φ(s, t) [π(s, P (s)) + N(s)]Φ(s, t)ds. (22) t The Volterra equation (22) has a unique solution P, which can be found by successive approximations; say, {P m : m = 0, 1, 2,...} starting with P 0 = 0. If π(, ) 0, G 0 and N( ) 0, then P m ( ) 0 for all m 0, which implies that P ( ) = lim m P m ( ) 0. If either G > 0 or N( ) > 0, then for all t [0, T ], T P (t) Φ(T, t) NΦ(T, t) + Φ(s, t) G(s)Φ(s, t)ds > 0. t 7

Theorem 1 (Comparison) For i = 1, 2, let P i L 1, (0, T ; S n ) be solutions to equation (1) associated (A, B, T i, Π i ), which satisfy (A1) and (A2). Suppose Π 1 (, ) Π 2 (, ), T 1 ( ) T 2 ( ) and P 1 (T ) P 2 (T ). Then P 1 ( ) P 2 ( ). In addition, P 1 ( ) < ( P 2 ( ) if one of) the following conditions holds. N2 N (1) T 2 T 1 = 1 0 (i.e., T 0 0 2 and T 1 are different at only the (1, 1) entry) and N 2 ( ) N 1 ( ) > 0. (2) P 1 (T ) < P 2 (T ). Proof. For i = 1, 2, let S i (P i ), R i (P i ), K i (P i ), Q i0 (K), L i (K; P i ) be the notations defined in (2), Proposition 2 and (19) with (A, B, T i, Π i ) (with t suppressed). Using representation (20) for the equations of P 1 and P 2, we obtain for arbitrary K R (k1+k2) n, 0 = Q 20 (K) Q 10 (K) + P 2 P 1 + L 2 (K; P 2 ) L 1 (K; P 1 ) + M(K). (23) where M(K) is the difference of square terms in (20), that is M(K) =(K K 1 (P 1 )) R 1 (P 1 )(K K 1 (P 1 )) (K K 2 (P 2 )) R 2 (P 2 )(K K 2 (P 2 )). ( A key ) step of the proof is ( to choose ) K such that M(K) ( 0. Write ) K 1 (P 1 ) = K11 (P 1 ) K21 (P and K K 12 (P 1 ) 2 (P 2 ) = 2 ) K21 (P and choose K = 2 ). It follows K 22 (P 2 ) K 12 (P 1 ) that ( ) ( ) ( ) K21 (P K K 1 (P 1 ) = 2 ) K11 (P 1 ) K21 (P = 2 ) K 11 (P 1 ). K 12 (P 1 ) K 12 (P 1 ) 0 ( ) 0 Similarly K K 2 (P 2 ) =. Using the notation [ ] K 12 (P 1 ) K 21 (P 2 ) i,j to denote the (i, j) entry of a matrix, we know from condition (1.3) that [R 1 (P 1 )] 1,1 0 and [R 2 (P 2 )] 2,2 0. It follows that M(K) =(K 21 (P 2 ) K 11 (P 1 )) [R 1 (P 1 )] 1,1 (K 21 (P 2 ) K 11 (P 1 )) (K 12 (P 1 ) K 22 (P 2 )) [R 2 (P 2 )] 2,2 (K 12 (P 1 ) K 22 (P 2 )) 0. Denote P ( ) = P 2 ( ) P 1 ( ). Then P (T ) 0 and (23) implies that P satisfies ( ) ( I I 0 = [T K 2 T 1 + Π 2 (P + P 1 ) Π 1 (P 1 )] K) + P + (A + BK) P + P (A + BK) + M(K). (24) ( ) ( ) I I Now we apply Proposition 3 to (24) with N = M(K)+ [T K 2 T 1 ] K and Π( ) = Π 2 ( + P 1 ) Π 1 (P 1 ). Since M(K) 0 and T 2 ( ) T 1 ( ), we 8

have N( ) 0. In addition, since Π 2 (, ) Π 1 (, ) and Π 1 is monotonically increasing, we have, for every Q 0, Π(t, Q) = Π 2 (t, Q + P 1 ) Π 1 (t, P 1 ) Π 1 (t, Q + P 1 ) Π 1 (t, P 1 ) 0. So Π(, ) 0. By Proposition 3, P ( ) 0. Under condition (1), we have for t [0, T ], N = N 2 N 1 + M(K) N 2 N 1 > 0. So under either condition (1) or (2), P ( ) = P 2 ( ) P 1 ( ) > 0 by Proposition 3. The comparison theorem is often stated in the terms of upper and lower solutions. We say that P L 1, (0, T ; S n ) is a lower [upper, respectively] solution of (1) if it satisfies conditions (1.2) and (1.3) and the following inequalities for all t [0, T ]: P (t) + N (t, P (t)) S (t, P (t))r(t, P (t)) S(t, P (t)) 0; P (T ) G. [and, respectively, P (t) + N (t, P (t)) S (t, P (t))r(t, P (t)) S(t, P (t)) 0; P (T ) G] (25) The terminology may seem improperly named, but it is simply a reflection of the fact that the problems being solved are final value rather than initial value problems. Finally, if one of the inequalities in (25) is strict, then we say P is a strict lower (or upper) solution. For example, it is easy to see that P ( ) = 0 is a lower [an upper] solution of (1) if and only if for all t [0, T ], R 11 (t) 0, R 22 (t) 0, Range[S(t)] Range[R(t)]; N(t) S (t)r (t)s(t) 0, G 0. [N(t) S (t)r (t)s(t) 0, G 0, respectively.] Using Theorem 1, we now prove the following comparison result. (26) Theorem 2 (Comparison) Suppose (P 1, P 2 ) is a pair of lower-upper solutions of (1). Then P 1 ( ) P 2 ( ). If either P 1 is a strict upper solution or P 2 is a strict lower solution, then P 1 ( ) < P 2 ( ). Proof. The assumption implies that P i (t) + N (t, P i (t)) S (t, P i (t))r(t, P i (t)) S(t, P i (t)) + H i (t) = 0. for some matrices ( H 1 ( ) 0, ) H 2 ( ) 0 and ( for all t )[0, T ]. Apply Theorem N + H2 S 1 with T 2 = N + H1 S and T S R 1 =. Since T S R 2 T 1 and P 1 (T ) G P 2 (T ), P 1 ( ) P 2 ( ) by Theorem 1. The second conclusion follows from the second conclusion of Theorem 1. As a corollary, we obtain the following result. Corollary 1 Suppose condition (26) holds. Then the solution P of (1), if it exists, is positive [negative, respectively] semidefinite. 9

5 Existence Results Upper and lower solutions can usually be found more easily than solutions. In this section we give a condition for the existence of a solution to equation (1) in terms of upper and lower solutions. We consider solutions P L 1, (0, T ; S n ) that satisfy the strict inequalities in (1.3), that is, for t [0, T ], P (t) + N (t, P (t)) S (t, P (t))r(t, P (t)) S(t, P (t)) = 0, (27.1) P (T ) = G S n, (27) R 11 (t) + π 11 (t, P (t)) > 0; R 22 (t) + π 22 (t, P (t)) < 0. (27.2) We assert first that if P satisfies (27.2), then the weighting matrix R(P (t)) is invertible. The invertibility of R(t, P (t)) can ( be seen ) by writing its block A B matrix form, for convenience, as R(P (t)) = B, with A and C being C ( x square matrices, A > 0, and C < 0. If is in the kernel of R(P (t)), y) then Ax + By = 0 so that x = A 1 By. Then B x + Cy = 0 can be rewritten as ((C B A 1 B)y = 0. But C B A 1 B < 0, so y = 0 and it x follows that = 0. Thus R(t, P (t)) is invertible which then, of course, y) implies R(t, P (t)) 1 = R(t, P (t)). In this case, conditions (1.2) and (1.3) are automatically satisfied. The notion of lower and upper solutions for (27) is defined as in (25). We have Theorem 3 There exists a solution P L 1, (0, T ; S n ) to (27) if and only if (27) has a pair of lower-upper solutions (P 1, P 2 ). Proof. The necessity is trivial by taking P = P 1 = P 2. We prove the sufficiency. Since P 1 (T ) G P 2 (T ) and P 1 and P 2 satisfy (27.2), it follows that R 11 (T ) + π 11 (G) R 11 (T ) + π 11 (P 1 (T )) > 0, R 22 (T ) + π 22 (G) R 22 (T ) + π 22 (P 2 (T )) < 0. So (27.2) holds at t = T with P (T ) = G. The local existence theory of ODE implies that equation (27.1) has a solution P that exists in a maximal interval (τ, T ]. Theorem 2 implies that P 1 (t) P (t) P 2 (t) in (τ, T ]. This implies that P satisfies (27.2). By using the inequalities in (27.2) and equation (27.1), we see that P τ lim t τ+ P (t) exists and P 1 (τ) P τ P 2 (τ). It turns out that (27.2) holds for P (t) with t [τ, T ]. So P can be extended to a solution of (27.1) that satisfies (27.2) on [τ, T ]. We claim that τ = 0. For if τ > 0, then P could be extended further left beyond τ by the local existence theory of ODE. This would contradict the maximality of (τ, T ]. Therefore equation (27.1) has a solution on [0, T ] satisfying (27.2). 10

If the lower solution P 1 and upper solution P 2 satisfy (1.2) and (1.3), instead of (27.2), then it is an open question whether equation (1) has a solution. In the proof of Theorem 3 we used the following three properties of solutions to (1) that satisfy (27.2). (a) If P 1 and P 2 satisfy (27.2), then every function between P 1 and P 2 also satisfies (27.2). (b) The local existence of a solution of (27). (c) Every solution of (27) on an interval (τ, τ] [0, T ] can be extended to [τ, τ]. For equation (1.1) with (1.2) and (1.3), (a) can be verified under appropriate kernel conditions, as done in [15] for rational differential equations arising from stochastic controls. Properties (b) and (c) are not known due to the fact that the generalized inverse R(t, P (t)) is not continuous in its argument P (t). For this reason, the existence result Corollary 4.6 in [15] may need additional assumptions. We hope to address this question in a future study. We end this section with the following property of solutions to (27). Proposition 4 If (P 1, P 2 ) is a pair of lower and upper solutions of (27), then the set of solutions, X, of (27) satisfying P 1 X P 2, contains a minimal solution Z and a maximal solution Y. That is, for all such solutions X. Z( ) X( ) Y ( ) Proof. Let Y and Z be the solutions with Y (T ) = P 2 (T ) and Z(T ) = P 1 (T ). Now if X is any solution of (27) satisfying P 1 (T ) X(T ) P 2 (T ), then we have Z( ) X( ) Y ( ) by Theorem 2. 6 Monotonicity and existence of constant solutions In this section we study the monotonicity of a solution of (1) when all coefficient matrices are periodic or constant in t. As applications, we give conditions for the existence of periodic or constant solutions to (1). When all coefficient matrices are constant, the algebraic Riccati equation for P S n associated with (1) is E(P ) N (P ) S (P )R(P ) S(P ) = 0; (28.1) Range [S(P )] Range[R(P )]; (28.2) (28) R 11 + π 11 (P ) 0; R 22 + π 22 (P ) 0, (28.3) 11

where in analogy with (2) N (P ) = P A + AP + π 00 (P ) + N; ( ) B S(P ) = 1 P + π 10 (P ) + S 1 B 2P ; + π 20 (P ) + S 2 ( ) R11 + π R(P ) = 11 (P ) R 12 + π 12 (P ). R 21 + π 21 (P ) R 22 + π 22 (P ) Theorem 4 (Monotonicity) Suppose all of the coefficients in (1) are constant and P (t) is the solution of (1) in (s, T ] with G = P (T ). Suppose that Range[S(P )] Range[R(P )], R 11 + π 11 (G) 0; R 22 + π 22 (G) 0. Then we have (i) E(G) 0 if and only if P (t) is decreasing in (s, T ]. (ii) E(G) 0 if and only if P (t) is increasing in (s, T ]. Proof. (i) Since E(G) 0, G is a constant lower solution to (1) on (, T ]. By Theorem 2, P (t) G, for all t (s, T ]. For any number τ (0, T s), define P : (s + τ, T + τ] S n by P (t) = P (t τ). Since (1) is autonomous, P (t) is also a solution to (1) with P (T ) = P (T τ) G = P (T ). By Theorem 2 again, P (t) P (t) for t (s + τ, T ], or equivalently, P (t τ) P (t) for every τ (0, T s). In other words, P (t) is decreasing in (s, T ] as a function of t. Part (ii) is proved similarly. Extending terminology to the algebraic case, we will call P S n a lower solution to the algebraic Riccati equation (28) in case E(P ) 0 and an upper solution in case E(P ) 0. Using this terminology, Theorem 4 can be rephrased by saying P ( ) is decreasing if and only if the terminal value G is a lower solution. And similarly P ( ) is increasing if and only if the terminal value G is an upper solution. Theorem 4 implies that if P is a bounded solution to (1) on (, T ] with G = P (T ) being an upper or lower solution to (1), then P lim t P (t) exists. By combining Theorem 3 and Theorem 4, we obtain a necessary and sufficient condition for existence of solutions to the following equation: { E(P ) N (P ) S (P )R(P ) S(P ) = 0; (29.1) (29) R 11 + π 11 (P ) > 0; R 22 + π 22 (P ) < 0, (29.2) Note again that (29.2) is a stronger assumption than (28.2) and (28.3). Theorem 5 Equation (29) has a solution P S n if and only if it has an upper solution Y and a lower solution Z such that Y Z. The set [Z, Y ] = {P S n Z P Y } contains a minimal solution and maximal solution. 12

Proof. The necessity is obvious by choosing Y = Z = P. For the sufficiency, consider equation (27) with terminal values P (T ) = Y and Z, respectively. Since Y is an upper solution and Z is a lower solution to (27) in (, T ], by Theorem 3, there exist solutions P Y ( ) and P Z ( ) on (, T ] to (27) such that P Y (T ) = Y, P Z (T ) = Z and Y P Y ( ) P Z ( ) Z. By Theorem 4, both P Y ( ) and P Z ( ) are monotone. So both Y = lim t P Y (t) and Z = lim t P Z (t) exist. Clearly Y and Z are constant solutions to (27) satisfying Y Y Z Z. It follows that R 11 + π 11 (Y ) R 11 + π 11 (Z ) R 11 + π 11 (Z) > 0, R 22 + π 22 (Z ) R 11 + π 11 (Y ) R 11 + π 11 (Y ) < 0. Lastly consider the case in which all the coefficients of (1.1) are periodic with period θ. Theorem 6 Suppose the coefficients of (1.1) are continuous and θ-periodic. Equation (27) has a θ-periodic solution P if and only if (27) has an upper solution Y and a lower solution Z on [T θ, T ] such that Z(T ) Z(T θ) Y (T θ) Y (T ). Proof. The necessity is obvious by letting Y = Z = P. To show the sufficiency, we first show that for k = 0, 1, 2,..., (27) has a solution P i on [T θ, T ] with the following properties, for i = 1, 2,..., P 0 (T ) = Z(T ), P i+1 (T ) = P i (T θ), Z(t) P i (t) P i+1 (t) Y (t) for t [T θ, T ]. The existence of a solution P 0 (t) with Z(t) P 0 (t) Y (t) for t [T θ, T ] is guaranteed by Theorem 3. Since Z(T ) Z(T θ) P 0 (T θ) Y (T θ) Y (T ), the existence of P 1 (t) with Z(t) P 1 (t) Y (t) for t [T θ, T ] is guaranteed, again by Theorem 3. Since both P 1 (t) and P 0 (t) are solutions to (27) with P 1 (T ) = P 0 (T θ) P 0 (T ), the comparison theorem implies that P 0 (t) P 1 (t). The cases for i 1 are proved by induction. So {P i ( )} is a monotonically increasing bounded sequence. Therefore, P (t) = lim i P i (t) exists for each t [T θ, T ]. The property P i+1 (T ) = P i (T θ) implies that P (T ) = P (T θ), that is, P (t) is θ-periodic. Since each P i (t) satisfies (27) and Z(t) P i (t) Y (t) for t [T θ, T ], the limit P (t) satisfies (27) and Z(t) P (t) Y (t) for t [T θ, T ]. Extend P ( ) to (, T ] periodically, we get a periodic solution of (27). 13

It is a typical argument to show that P ( ) satisfies equation (27). Equation (27) for P i ( ) can be written as d dt P i(t) = E(t, P i (t)), where E(t, P ) = N (t, P ) S (t, P )R(t, P ) 1 S(t, P ). We show that E(t, P i (t)) is a bounded function uniformly for i 0. Write, ( ) A B R(t, P i (t)) = (R jk (t) + π jk (P i (t))) 1 j,k 2 = B. C where A = R 11 (t) + π 11 (t, P i (t)) R 11 (t) + π 11 (t, Z(t)) C = R 22 (t) + π 22 (t, P i (t)) R 22 (t) + π 22 (t, Y (t)) Since Y and Z satisfy (27.2) and the coefficients are continuous, we see that A 1 and C 1 (and hence (C B A 1 B) 1 ) are bounded functions uniformly for i 0. By the inverse formula ( ) 1 ( ) ( ) ( ) A B I 0 A 1 0 I A = 1 B C B A 1 I 0 (C B A 1 B) 1 0 I B we see that R(t, P i (t)) 1 is bounded uniformly for i 0. Consequently we know that E(t, P i (t)) is a bounded function uniformly for i 0. Therefore for all r and r + r [T θ, T ], we have Let i, we have that P i (t) r+ r r P (t) r+ r r = = r+ r r r+ r r E(t, P i (t))dt. E(t, P (t))dt Dividing both sides by r and taking limit as r 0, we see that P (t) satisfies (27) on [T θ, T ]. Note that the solution P (t) obtained in the proof of Theorem 6 is the minimal θ-periodic solution between Z and Y on [T θ, T ]. If we start with P 0 (T ) = Y (T ), then the solution P ( ) will be maximal θ-periodic solution between Z and Y on [T θ, T ]. In [14, Theorems 3.9 & 3.11], results similar to Theorem 6 are proved for equation (1) among periodic solutions P (t) such that R(t, P (t)) > 0. References [1] T. Basar, G.J. Olsder, Dynamic noncooperative game theory, second ed., Academic Press, London/New York, 1995. 14

[2] L. Mou, J.M. Yong, Two-person zero-sum linear quadratic stochastic differential games by a Hilbert space method, Nonlinear Dynamics and Systems Theory, in press. [3] M. Ait Rami, J.B. Moore, X.Y. Zhou, Indefinite stochastic linear quadratic control and generalized differential Riccati equation. SIAM J. Control Optim. 40 (2001/02) 1296-1311. [4] M. Ait Rami, X. Chen, J. B. Moore, X.Y. Zhou, Solvability and asymptotic behavior of generalized Riccati equations arising in indefinite stochastic LQ controls. IEEE Trans. Automat. Control 46 (2001) 428-440. [5] S. Chen, X. Li, X.Y. Zhou, Stochastic linear quadratic regulators with indefinite control weight costs, SIAM J. Control and Optim. 36 (1998) 1685-1702. [6] S. Chen, X.Y. Zhou, Stochastic linear quadratic regulators with indefinite control weight costs II, SIAM J. Control Optim., 39 (2000) 1065-1081. [7] G. Tessitore, Some remarks on the Riccati equation arising in an optimal control problem with state- and control- dependent noise, SIAM J. Control Optim. 30 (1992) 717-744. [8] J. Yong, X.Y. Zhou, Stochastic Controls, Hamiltonian Systems and HJB Equations, Springer, New York, 1999. [9] M. D. Fragoso, O.L.V. Costa, C. E. de Souza, A new approach to linearly perturbed Riccati equations arising in stochastic control, Appl. Math. Optim. 37 (1998) 99-126. [10] G. Freiling, G. Jank, Existence and comparison theorems for algebraic Riccati equations and Riccati differential and difference equations, J. Dynamic and Control Systems, 2 (1996) 529-547. [11] G. Freiling, G. Jank, H. Abou-Kandil, Generalized Riccati difference and differential equations, Linear Algebra and Its Applications, 241/243 (1996) 291-303. [12] S. Bittanti, A.J. Laub, J.C. Willems, The Riccati equations, Springer, New York, 1991. [13] R. F. Curtain, L. Rodman, Comparison theorems for infinite dimensional Riccati equations, System Control Letters 15 (1990) 153-159. [14] V. Dragan. G. Freiling, A. Hochhaus, T. Morozan, A class of nonlinear differential equations on the space of symmetric matrices. Electronic Journal of Differential Equations 2004 (2004) no. 96, 1-48. [15] G. Freiling, A. Hochhaus, On a class of rational matrix differential equations arising in stochastic control, Linear Algebra and Its Applications 379 (2004) 43-68. 15

[16] W.T. Reid, Riccati differential equations. Academic Press, New York and London, 1972. [17] W.M. Wonham, On a matrix Riccati equation of stochastic control. SIAM J. Control 6 (1968), 681-697. [18] R. Penrose, A generalized inverse of matrices, Proc. Cambridge Philos. Soc. 52 (1955) 17-19. [19] W.A. Coppel, Disconjugacy, Lecture Notes in Mathematics, vol. 220, Springer, Berlin, 1971. [20] A.C.M. Ran and R. Vreugdenhil, Existence and comparison theorems for algebraic Riccati equations for continuous- and discrete-time systems, Linear Algebra and Its Applications 99 (1988) 63-83. [21] C. W. Wimmer, Monotonicity of maximal solutions of algebraic Riccati equations, Syst. Contr. Lett. 5 (1985) 317-319. 16