A SURVEY OF PROPERTIES OF FINITE HORIZON DIFFERENTIAL GAMES UNDER ISAACS CONDITION. Contents
|
|
- Ruby Whitehead
- 5 years ago
- Views:
Transcription
1 A SURVEY OF PROPERTIES OF FINITE HORIZON DIFFERENTIAL GAMES UNDER ISAACS CONDITION BOTAO WU Abstract. In ths paper, we attempt to answer the followng questons about dfferental games: 1) when does a two-player, zero-sum dfferental game have a value and s t unque? 2) How can we characterze the value of the game and relate ths value to some Hamlton-Jacob equaton? 3) When n general does the Nash Equlbrum of the dfferental game exst? To answer these questons, we use control theory and vscosty soluton approaches, appealng to the exstence of the Isaacs condtons: the relaton between the upper and lower Hamltonans. Contents 1. Introducton to dfferental game and optmal control 1 2. Strateges 4 3. Propertes of the value functon 5 4. Vscosty solutons and another verfcaton theorem 9 5. characterzaton and exstence of Nash equlbrum 14 Acknowledgments 17 References Introducton to dfferental game and optmal control We consder the followng ODE: For t < T, and x R m, ẋ(t) = f(t, x(t), u 1 (t), u 2 (t),..., u N (t)) (1.1) x( ) = x 0 We can nterpret the dynamcs of ths equaton n a game theory perspectve. Each player 1, 2,..., N chooses a control u (t) : [, T ] U R m. We denote by Φ ( ) the set of all such controls u that are feasble for player at tme. The state varable x(t) s a soluton to the above system and a response to the controls u 1, u 2,..., u N. In order for (1.1) to have a unque soluton, we assume that f : [, T ] R m U 1... U N R m s bounded and Lpschtz contnuous n x: (1.2) f(t, x, u 1, u 2,..., u N ) f(t, y, u 1, u 2,..., u N ) C 1 x y, for some C 1 R and all t [, T ]. Date: AUGUST 30,
2 2 BOTAO WU We defne: P (, x 0, u 1, u 2,..., u N ) = h (t, x(t), u 1 (t), u 2 (t),..., u N (t))dt + g (x(t )) to be the pay-off functonal of player. We can nterpret g : R m R as the termnal pay-off that s bounded and Lpschtz contnuous n x: (1.3) g(x) g(y) C 2 x y Smlarly, h : [, T ] R m U 1,..., U N R can be understood to be the runnng cost. We also assume h s bounded and Lpschtz contnuous n x: (1.4) h(t, x, u 1, u 2,..., u N ) h(t, y, u 1, u 2,..., u N ) C 3 x y for some C 2, C 3 R, and for all t [, T ]. We consder two types of controls. One type s called the open-loop control. By usng ths type of control, a player cannot make any observaton of the current state of the system. The other type of control s called the feedback control, whch enable the player to observe the current state of the system. Formally, Defnton 1.5. u (t) s a open loop control f u (t) = φ(t), where φ : [, T ] U. and u (t) s a feedback control f u (t) = φ(t, x(t)), where φ : [.T ] R m U. Let us now consder a two player, zero-sum game. Ths zero-sum game s determned by one sngle payoff: P = P 1 = P 2, snce, by defnton, one player s gan of the pay-off s exactly balanced by the loss of the pay-off of the other player n the zero-sum game. Here we assume that the frst player tres to maxmze the pay-off by choosng a feedback control φ 1 Φ 1 whle the second player s mnmzng the pay-off by choosng a feedback control φ 2 Φ 2, where Φ 1 and Φ 2 are player 1 s and 2 s feasble set of controls respectvely We wrte the pay-off as: P (, x 0, φ 1, φ 2 ) = the soluton to followng system: (1.6) h(t, x(t), φ 1 (t, x(t)), φ 2 (t, x(t)))dt + g(x(t )), where x s ẋ(t) = f(x(t), φ 1 (t, x(t)), φ 2 (t, x(t))) x( ) = x 0 We assume that functons g and h satsfy assumptons (1.3) and (1.4). φ 1 Φ 1 Defnton 1.7. The upper value functon of a two-player, zero-sum dfferental game s gven by V + (, x 0 ) = P (, x 0, φ 1, φ 2 ). The lower value func- φ 2 Φ 2 ton s gven by V (, x 0 ) = P (, x 0, φ 1, φ 2 ). One can show that φ 2 Φ 2 φ 1 Φ 1 V V +. We say that the game has a value f V + = V, n whch case, the value s V = V + = V. Defnton 1.8. We defne the upper and lower Hamltonan functons:
3 A SURVEY OF PROPERTIES OF FINITE HORIZON DIFFERENTIAL GAMES UNDER ISAACS CONDITION3 H + (t, x, p) = h(t, x, φ 1, φ 2 ) + p f(x, φ 1, φ 2 ) } φ 2 Φ 2 φ 1 Φ 1 H (t, x, p) = h(t, x, φ 1, φ 2 ) + p f(x, φ 1, φ 2 ) } φ 2 Φ 2 φ 1 Φ 1 We assume that there exst φ 1 (t, x, p) arg φ 1 Φ 1 φ 2 Φ 2h(t, x, φ1, φ 2 )+p f(x, φ 1, φ 2 )}, and φ 2 (t, x, p) arg h(t, x, φ 1, φ 2 )+p f(x, φ 1, φ 2 )} such that the Isaacs φ 2 Φ 2 φ 1 Φ 1 condton, the equalty of the two Hamltonan functons, holds: (1.9) H(t, x, p) := H + (t, x, p) = H (t, x, p) The above defntons allow us to prove the Verfcaton Theorem. The theorem states that n order to check that a gven functon s ndeed the value functon of the game, t suffces to check that ths functon s a soluton to a partal dfferental equaton (PDE), called a Hamlton-Jacob equaton. Theorem Let V (t, x) be a C 1 functon on [, T ] R m satsfyng: V t (t, x) + H(t, x, D x V (x)) = 0 (1.11) V (T, x) = g(x) If φ 1 Φ 1 and φ 2 Φ 2 where Φ 1 and Φ 2 are sets of feasble controls of both players, then V s the value of the game. Moreover, φ 1 (t, x, D x V (t, x)) and φ 2 (t, x, D x V (t, x)) actually are the optmal controls of player I and player II,.e., V (, x 0 ) = P (, x 0, φ 1, φ 2 ) = P (, x 0, φ 1, φ 2 ) = P (, x 0, φ 1, φ 2 ). φ 1 Φ 1 φ 2 Φ 2 Proof. For any (, x 0 ) [, T ] R m, let φ 1 Φ 1, and let x( ) solve the (1.6) wth controls φ 1 ( ) and φ 2 ( ). We compute: t d dt [V (t, x(t)) + h(s, x(s), φ 1 (s, x(s)), φ 2 (s, x(s)))ds = V t (t, x(t))+d x V (t, x(t)) f(x(t), φ 1 (t, x(t)), φ 2 (t, x(t)))+h(t, x(t), φ(t, x(t)), φ 2 (t, x(t)) V t (t, x(t))+ φ 1 Φ 1 D x V (t, x(t)) f(x(t), φ 1, φ 2 (t, x(t)))+h(t, x(t), φ 1, φ 2 (t, x(t))} = V t (t, x(t)) + H(t, x(t), D x V (t, x(t))) = 0 by (1.9). Integratng over [, T ], we get V (, x 0 ) 0 V (T, x(t )) + h(t, x(t), φ(t, x(t)), φ 2 (t, x(t)))dt Snce V (T, x(t )) = g(x(t )) by assumpton, we get: V (, x 0 ) h(, x 0, φ 1, φ 2 )dt + g(x(t )) φ 1 Φ 1 φ 1 Φ 1 P (, x 0, φ 1, φ 2 ) = V + (, x 0 )
4 4 BOTAO WU Smlarly, we can show that V (, x 0 ) P (, x 0, φ 1, φ 2 ) = V (, x 0 ). φ 2 Φ 2 Snce V V +, we have V (, x 0 ) = P (, x 0, φ 1, φ 2 ) s the value of the game. Remark: In the nte horzon problem whch has a pay-off functon of the form: P (x 0, φ 1, φ 2 ) = 0 e λt h(x 0, φ 1, φ 2 )dt for some λ > 0, one can show n a smlar way that the assocated Hamlton-Jacob equaton s: λv (x) + H(x, DV (x)) = 0. It should be notced that nether the open-loop controls nor the feedback controls gve addtonal ormaton about the decsons made by the other player. Ths s not consstent wth the tradtonal game theory lterature where a player s acton s usually dependent upon others actons and s often nterpreted as the best response to the other player s actons. Therefore, we ntroduce the concept of strategy n dfferental games. 2. Strateges For convenence, we wrte Φ ( ) = Φ 1 ( )... Φ 1 ( ) Φ +1 ( )... Φ N ( ). We also wrte an N tuple of control (u ) = (u 1,..., u N ) and an N 1 tuple of control (u ) = (u 1,...u 1, u +1,..., u N ), where u Φ. Defnton 2.1. A map α : Φ ( ) Φ ( ) s a non-antcpated strategy f, for all t > and (u ), (v ) Φ ( ), the condton (u )( ) = (v )( ) on [, t] mples that α [(u )]( ) = α [(v )]( ) on [, t]. A map α : Φ ( ) Φ ( ) s a non-antcpated strategy wth delay τ f, for all t > and (u ), (v ) Φ ( ), the condton (u )( ) = (v )( ) on [, t] mples that α [(u )]( ) = α [(v )]( ) on [, t + τ]. We denote the space A d () to be the set of delay strateges of player. Intutvely, by usng a strategy α A d (), a player should be able to answer to each control from other players (u (t)) Φ ( ) at tme by a control from hs own feasble set u Φ ( ) such that u = α[(u )]. Formally: Theorem 2.2. If (α 1,..., α N ) A 1 d ()... A N d (), then there exst unque N-tuple controls (u ) Φ 1 ( )... Φ N ( ) such that α ((u )) = u on [, ]. Proof. It suffces to consder a two-player game snce each player could thnk of all the others as one sngle player wth strateges from Φ 1 ( ). Let α A d ( ) and β B d ( ) be player 1 s and 2 s strateges. We clam that for any k 1, there exst unque (u k, v k ) : [, + kτ] Φ 1 Φ 2 such that α(v k ) = u k and β(u k ) = v k on [, + kτ]. We prove by nducton. For k = 1. Pck any v Φ 2 ( ), and set u 1 = α[v] and v 1 = β[u 1 ]. Snce α s a delay strategy and v 1,v Φ 2 ( ) agrees on the measure zero set [, ], we have α[v 1 ] = α[v] = u 1 on [, + τ]. Suppose for any k 1, there exst unque (u k, v k ) Φ 1 ( ) Φ 2 ( ) such that α[v k ] = u k and β[u k ] = v k on [, + τ]. We set u k+1 = α[v k ], and v k+1 = β[u k+1 ]
5 A SURVEY OF PROPERTIES OF FINITE HORIZON DIFFERENTIAL GAMES UNDER ISAACS CONDITION5 on [, ]. From our constructon, u k+1 = α[v k ] = u k on [, + kτ]. Snce β s a non-antcpated strategy, we have v k = β[u k ] = β[u k+1 ] = v k+1 on [, + kτ]. Snce α s a non-antcpated strategy wth delay, we have α[v k ] = α[v k+1 ] = u k+1 on [, + kτ + τ]. By our constructon, the par (u k, v k ) s unque. So f we set (u, v)=(u k, v k ) on [ + kτ, + (k + 1)τ]. the par (u, v) satsfy the desred property. We now ntroduce non-antcpated-strategy to our two-player, zero-sum dfferental game from secton 1. Consder the dynamcal system: ẋ(t) = f(t, x(t), u(t), v(t)) (2.3) x( ) = x 0 Let α A d ( ), β B d ( ) be player 1 s and 2 s strateges respectvely. We modfy the upper and lower value functons: V + (, x 0 ) = α A d () where the pay-off s: P (, x 0, u, v) = P (, x 0, α[v], v), and V (, x 0 ) = P (, x 0, u, β[u]), v Φ 2 β B d () u Φ 1 h(t, x(t), u(t), v(t))dt + g(x(t )) 3. Propertes of the value functon We now turn to the basc propertes of the value functon. dynamc programmng property. The frst s the Theorem 3.1 (Dynamc Programmng Property). For t + σ T, x R m, V (, x 0 ) = V + (, x 0 ) = β B d () u Φ 1 () α A d () v Φ 2 () h(t, x(t), u(t), β[u](t))dt+v ( +σ, x( +σ)) h(t, x(t), u(t), β[u](t))dt + V + ( + σ, x( + σ)) where x( ) solves the ODE system (2.3) wth u( ) = α[v]( ), and v( ) = β[u]( ). In other words, we can splt the soluton to the value functons on the nterval [, T ] nto two separate problems. 1). As a frst step, we solve the problem on the subnterval [ + σ, T ] wth runnng cost h and termnal pay-off g. In ths way, we determne the value functons V + ( + σ, ) and V ( + σ, ); 2). As a second step, we solve the problem on the subnterval [, + σ] wth runnng cost h and termnal pay-offs V + ( + σ, ) and V ( + σ, ), determned by the frst step. We now present the proof of the dynamc programmng property. Proof. We prove the case for the lower value functon. The proof follows deas from [1]. Set W (, x 0 ) = β B d () u Φ 1 () h(t, x(t), u(t), β[u](t))dt + V ( + σ, x( + σ)).
6 6 BOTAO WU Then for any ɛ > 0, there exsts δ B d ( ) such that (3.2) W (, x 0 )+ɛ u Φ 1 () By defnton, V ( +σ, x( +σ)) = β B d (+σ) u Φ 1 (+σ) h(t, x(t), u(t), δ[u](t))dt+v ( +σ, x( +σ)). where x( ) solves the ODE (2.3) on [ + σ, T ]. +σ h(t, x(t), u(t), β[u](t))dt+g(x(t )) Thus, there exst a strategy δ x(to+σ) B d ( + σ) such that (3.3) V ( +σ, x( +σ))+ɛ h(t, x(t), u(t), δ x(t0+σ)[u](t))dt+g(x(t )). u Φ 1 (+σ) +σ So we defne the strategy β B d ( ): for any u Φ( ), set δ[u](t) t + σ β[u](t) = δ x(t0+σ)[u](t) + σ t T So for any u Φ( ), by (3.2), and (3.3), we have: W (, x 0 ) u Φ 1 () + u Φ 1 (+σ) +σ u Φ() h(t, x(t), u(t), δ[u](t))dt h(t, x(t), u(t), δ x(t0+σ)[u](t))dt + g(x(t )) 2ɛ h(t, x(t), u(t), β[u](t))dt + g(x(t )) 2ɛ Therefore, we have V (, x 0 ) W (, x 0 ) + 2ɛ. On the other hand, by the defnton of V (, x 0 ), for any ɛ > 0, there exsts a strategy β B d ( ), such that (3.4) V (, x 0 ) + ɛ h(t, x(t), u(t), β[u](t))dt + g(x(t )) u Φ 1 () Also, we have W (, x 0 ) u Φ 1 () h(t, x(t), u(t), β[u](t))dt + V ( + σ, x( + σ))
7 A SURVEY OF PROPERTIES OF FINITE HORIZON DIFFERENTIAL GAMES UNDER ISAACS CONDITION7 So there exsts u 1 Φ 1 ( ) such that (3.5) W (, x 0 ) Therefore, for any u Φ 1 ( + σ), we defne ũ(t) = h(t, x(t), u 1 (t), β[u 1 ](t))dt + V ( + σ, x( + σ)) + ɛ u 1 (t) u(t) and a strategy β B d ( + σ) by β[u](t) = β[ũ](t), for + σ t T. By defnton, V ( + σ, x( + σ)) So there exsts u 2 Φ 1 ( + σ) such that (3.6) V ( + σ, x( + σ)) t + σ + σ t T h(t, x(t), u(t), β[u](t))dt + g(x(t )) u Φ 1 (+σ) +σ +σ We now fnally defne u Φ 1 ( ) by u(t) = By (3.4), (3.5) and (3.6) above, we can wrte W (, x 0 ) h(t, x(t), u(t), β[u](t))dt + g(x(t )) + ɛ u 1 (t) u 2 (t) t + σ + σ t T h(t, x(t), u(t), β[u](t))dt + g(x(t )) + 2ɛ V (, x 0 ) + 3ɛ Takng ɛ arbtrarly small, we obtan W (, x 0 ) = V (, x 0 ). The proof for the upper value functon s dentcal. From the dynamc programmng property, we can deduce that the value functon s Lpschtz contnuous. Lemma 3.7. There exsts a constant C R such that for any [0, T ], and for any x 0,y 0 R m, we have: V (, x 0 ) V (, y 0 ) C x 0 y 0, and V + (, x 0 ) V + (, y 0 ) C x 0 y 0 Proof. We wll prove the lemma for the lower value functon. For any arbtrary controls u and v, let x(t) and y(t) be solutons of the ODE (2.3), wth the assocated ntal condtons x( ) = x 0 and y( ) = y 0 Snce the functon f s Lpschtz contnuous n x from (1.2), there exst C 1 R such that: ẋ(t) ẏ(t) = f(t, x(t), u(t), v(t)) f(t, y(t), u(t), v(t)) C 1 x(t) y(t). Thus, by Gronwall s nequalty, x(t) y(t) x 0 y 0 e C1(t t0). Usng the Lpschtz contnuty of functons g and h from (1.3) and (1.4), we wrte:
8 8 BOTAO WU P (, x 0, u, v) P (, y 0, u, v) h(t, x(t), u(t), v(t)) h(t, y(t), u(t), v(t)) dt + g(x(t )) g(y(t )) C 3 x(t) y(t) dt + C 2 x(t ) y(t ) C 3 e C1(T t0) (T ) x 0 y 0 + C 2 e C1(T t0) x 0 y 0 C x 0 y 0 By Theorem 2.2, there exsts unque strategy β B d such that β[u] = v. Hence, P (, x 0, u, β[u]) P (, y 0, u, β[u]) C x 0 y 0. Takng the remum over u and mum over β, we obtan V (, x 0 ) V (, y 0 ) C x 0 y 0 Theorem 3.8. V +,and V are bounded and Lpschtz contnuous n all varables. Proof. We wll prove the theorem for the lower value functon V. Snce V (, x 0 ) = h(t, x(t), u(t, x(t)), v(t, x(t)))dt + g(x(t )), and β B d () u Φ 1 both functons h and g are bounded, we must have V s bounded as well. We have shown n Lemma 3.7 that V s Lpschtz contnuous n the state varable. We now try to prove that V s also Lpschtz n the tme varable. Let x 0 R m, and le t 1 T. By the dynamc programmng property, we can wrte V (, x 0 ) = β B d () u Φ 1 () t1 h(t, x(t), u(t), β[u](t))dt + V (t 1, x(t 1 )) where x( ) solves the ODE (2.3) on [, t 1 ] wth controls u( ) and β[u]( ). Snce h s bounded by some constant C 3 R as we have assumed n (1.4), then for any u Φ( ), and β B d ( ), we have t1 h(t, x(t), u, β[u])dt C 3 t 1. Because ẋ(t) = f(t, x(t), u, β[u]), and f s bounded by some constant C 1 R, we can wrte: x(t 1 ) x 0 Lemma 3.7 mples that: t1 f(t, x(t), u, β[u]) dt C 1 t 1. V (t 1, x(t 1 )) V (t 1, x 0 ) C x(t 1 ) x 0 CC 1 t 1 = C 4 t 1. Thus, we have: V (t 1, x 0 ) (C 3 + C 4 )(t 1 ) V (t 1, x(t 1 )) + t1 h(t, x(t), u(t), β[u](t))dt V (t 1, x 0 ) + (C 3 + C 4 )(t 1 )
9 A SURVEY OF PROPERTIES OF FINITE HORIZON DIFFERENTIAL GAMES UNDER ISAACS CONDITION9 Denote M := C 3 + C 4 and take the remum over u and the mum over β[u]. By the dynamc programmng property, we have: V (t 1, x 0 ) M(t 1 ) V (, x 0 ) V (t 1, x 0 ) and thus, V (t 1, x 0 ) V (, x 0 ) M(t 1 ). 4. Vscosty solutons and another verfcaton theorem We showed n Secton 1 that f the value functon s C 1, then the value functon satsfes the assocated Hamlton-Jacob equaton when the Isaacs Condton holds. However, the value functon may not possess such regularty and t s possble that the value functon has ponts that are non-dfferentable. In ths secton, we shall weaken our assumpton about our value functons; we shall only assume t to be bounded and Lpschtz contnuous. To accomodate ths weakened regularty assumpton, we ntroduce the dea of vscosty solutons. Consder the followng Hamlton-Jacob equaton: u t + H(t, x, D x u) = 0 (t, x) [, T ] R m (4.1) u(t, x) = g(x) x R m Defnton 4.2. A bounded, and Lpschtz contnuous functon u : [, T ] R m R m s called a vscosty subsoluton to the Hamlton-Jacob equaton (4.1) provded that for any test functon φ C 1, f u φ assumes a local maxmum at (, x 0 ) [, T ] R m, then φ t (, x 0 ) + H(, x 0, Dφ(, x 0 )) 0. Smlarly, a bounded, and Lpschtz contnuous functon u : [, T ] R m R m s called a vscosty ersoluton to the Hamlton-Jacob equaton (4.1) provded that for any test functon φ C 1, f u φ assumes a local mnmum at (, x 0 ) [, T ] R m, then φ t (, x 0 ) + H(, x 0, Dφ(, x 0 )) 0. We call u a vscosty soluton f t s both a vscosty subsoluton and a vscosty ersoluton. In secton 1, we saw that Issacs condton ensures that the value functon satsfes the Hamlton-Jacob equaton f the value functon s C 1. We shall now show that the upper and lower functons V + and V, as defned n the prevous secton, whch we showed to be Lpschtz contnuous, satsfy the above Hamlton-Jacob equaton n the vscosty sense. Theorem 4.3. a). V + s a vscosty soluton of the Hamlton-Jacob equaton V t + + H + (t, x, D x V + ) = 0 (t, x) [, T ] R m (4.4) V + (T, x) = g(x) x R m where the Hamltonan satsfes H + (t, x, p) = v b). V s a vscosty soluton of the Hamlton-Jacob equaton (4.5) f(t, x, u, v) p + h(t, x, u, v)}. u Vt + H (t, x, D x V ) = 0 (t, x) [, T ] R m V (T, x) = g(x) x R m
10 10 BOTAO WU where the Hamltonan satsfes H (t, x, p) = u The proof depends on the followng estmates: Lemma 4.6. Assume that φ C 1. f(t, x, u, v) p + h(t, x, u, v)} v a).if φ satsfes that φ t (, x 0 ) + H + (, x 0, D x φ(, x 0 )) θ < 0, then for each suffcently small σ > 0, there exsts some v Φ 2 ( ) such that (4.7) φ t (t, x(t))+f(t, x(t), α[v](t), v(t)) D x φ(t, x(t))+h(t, x(t), α[v](t), v(t))dt θσ 2 for all α A d ( ) b) f φ satsfes that φ t (, x 0 ) + H (, x 0, D x φ(, x 0 )) θ > 0, then for each suffcently small σ > 0, there exsts some v Φ 2 ( ) such that (4.8) φ t (t, x(t))+f(t, x(t), α[v](t), v(t)) D x φ(t, x(t))+h(t, x(t), α[v](t), v(t))dt θσ 2 for all α A d ( ) Proof. Let J(t, x, u, v) = h(t, x, u, v) + f(t, x, u, v) D x φ(t, x) + φ t (t, x). By our assumpton, v Φ 2 () u Φ 1 () Then, there exsts v Φ 2 ( ) such that J(, x 0, u, v) θ < 0. J(, x 0, u, v ) 3θ u Φ 1 () 4 < 0. By contnuty, we have that J(t, x(t), u, v ) θ u Φ 1 () 2 < 0, for t + σ, where σ s suffcently small and x( ) solves the ODE (2.6)for any u( ) and v( ) wth x( ) = x 0. In partcular, for v( ) = v, and α A d ( ) wth α[v ] = u, we have, on t + σ: φ t (t, x(t)) + f(t, x(t), α[v](t), v(t)) D x φ(t, x(t)) + h(t, x(t), α[v](t), v(t)) θ 2. Integratng over [, + σ], we obtan (4.7) The proof of b) s smlar. We now prove Theorem 4.2. The proof follows from [1]. Proof. Suppose that V + φ attans a local maxmum at (, x 0 ) [, T ] R m. We want to show that φ t (, x 0 ) + H + (, x 0, D x φ(, x 0 )) 0. Suppose not, there exsts θ > 0 such that φ t (, x 0 ) + H + (, x 0, D x φ(, x 0 )) θ < 0. Then by Lemma 4.3, we have: φ t (t, x(t)) + f(t, x(t), α[v](t), v(t)) D x φ(t, x(t)) + h(t, x(t), α[v](t), v(t))dt
11 A SURVEY OF PROPERTIES OF FINITE HORIZON DIFFERENTIAL GAMES UNDER ISAACS CONDITION 11 θσ 2. Thus, we have: α A d () v Φ 2 () θσ 2 < 0. φ t (t, x(t))+f(t, x(t), α[v](t), v(t)) D x φ(t, x(t))+h(t, x(t), α[v](t), v(t))dt On the other hand, by the dynamc programmng prncple, we have: V + (, x 0 ) = α A d () v Φ 2 () h(t, x(t), u(t), β[u](t))dt+v + ( +σ, x( +σ)). Snce V + φ has a local maxmum at (, x 0 ), for σ small we have: V + (, x 0 ) φ(, x 0 ) V + ( + σ, x( + σ)) φ( + σ, x( + σ)), where x( ) solves the ODE on [, + σ] for any u( ) and v( ) wth x( ) = x 0. Thus, α A d () v Φ 2 () Notce that: φ( +σ, x( +σ)) φ(, x 0 ) = Therefore, we fnally get: α A d () v Φ 2 () 0, whch s a contradcton to (4.7). h(t, x(t), α[v](t), v(t))+φ( +σ, x( +σ)) φ(, x 0 ) 0. φ t (t, x(t))+f(t, x(t), α[v](t), v(t)) D x φ(t, x(t))dt. φ t (t, x(t))+f(t, x(t), α[v](t), v(t)) D x φ(t, x(t))+h(t, x(t), α[v](t), v(t))dt Therefore, we must have φ t (, x 0 ) + H + (, x 0, D x φ(, x 0 )) 0. We can prove smlarly that f V + φ acheves a local mnmum at (, x 0 ), then φ t (, x 0 ) + H + (, x 0, D x φ(, x 0 )) 0. Therefore, V + s a vscosty soluton of + H + (t, x, D x V + ) = 0 wth V + (T, x) = g(x). V + t Smlarly, we can prove that V s a vscosty soluton of the Hamlton-Jacob equaton for the lower Hamltonan. We now attempt to show that the game has a value functon under Isaacs condton n the vscosty sense, and that ths value s actually unque. To begn, we shall study comparson prncple for the solutons of Hamlton-Jacob equatons.
12 12 BOTAO WU We assume the Hamltonan functon H : [, T ] R m R m satsfes the Lpschtz condton: (4.9) H(t, x, p) H(s, y, p) C( t s + x y )(1 + p ) and (4.10) H(t, x, p) H(t, x, q) C p q for some constant C R Theorem (Comparson Prncple) Let u : [, T ] R m R be a vscosty subsoluton of the Hamtlton-Jacob equaton: (4.12) u t (t, x) + H(t, x, D x u(t, x)) = 0. Let v : [, T ] R m R be a vscosty ersoluton of the Hamlton-Jacob equaton: (4.13) v t (t, x) + H(t, x, D x v(t, x)) = 0. If u(t, x) v(t, x) for any x R m, then u(t, x) v(t, x) for any (t, x) [, T ] R m Proof. We sketch the proof here snce the proof s long and should be standard n any reference of vscosty solutons of Hamlton-Jacob equatons. It s enough to show that for any α > 0, we have u(t, x) v(t, x) α(t t) 0 for any (t, x) [, T ] R m. If not, then there exsts α > 0 such that: M := u(t, x) v(t, x) α(t t)} > 0. (t,x) [,T ] R m Consder the functon Φ ɛ (t, s, x, y) : [, T ] [, t] R m R m R such that: Φ ɛ (t, s, x, y) = u(x, t) v(y, s) α(t t) ɛ 2 ( x 2 + y 2 ) 1 2ɛ 2 ( t s 2 + x y 2 ). Thanks to the penalzaton terms, we can show that the above functon assumes a mamum value at some (t ɛ, s ɛ, x ɛ, y ɛ ) ([, T ] R m ) 2. We can show that t ɛ s ɛ 2 0 and x ɛ y ɛ 2 0 as ɛ 0, and therefore that lm 2ɛ 2 t ɛ s ɛ 2 + x ɛ y ɛ 2 = 0. We denote by (t, x ) the lmt pont. If t = T, then 0 < M u(t, x ) v(t, x ) 0, whch s a contradcton. Thus, t (, T ). ɛ 0 1 Now consder the functon: (t, x) u(x, t) v(y ɛ, s ɛ ) + α(t t) + ɛ 2 ( x 2 + y ɛ 2 ) + 1 2ɛ 2 ( t s ɛ 2 + x y ɛ 2 )} The functon assumes a local maxmum at (t ɛ, x ɛ ). Snce u s a vscosty subsoluton of (4.12), we have: α + 1 ɛ 2 (t ɛ s ɛ ) + H(t ɛ, x ɛ, 1 ɛ 2 (x ɛ y ɛ ) + ɛx ɛ ) 0. Smlarly, consder the functon:
13 A SURVEY OF PROPERTIES OF FINITE HORIZON DIFFERENTIAL GAMES UNDER ISAACS CONDITION 13 (s, y) v(s, y) u(x ɛ, t ɛ ) α(t t ɛ ) ɛ 2 ( x ɛ 2 + y 2 ) 1 2ɛ 2 ( t ɛ s 2 + x ɛ y 2 )} Snce v s a vscosty er- The functon assumes a local mnmum at (t ɛ, x ɛ ). soluton of (4.13), we have: 1 ɛ 2 (t ɛ s ɛ ) + H(s ɛ, y ɛ, 1 ɛ 2 (x ɛ y ɛ ) ɛy ɛ ) 0. Therefore, we can wrte: 0 < α H(t ɛ, x ɛ, 1 ɛ 2 (x ɛ y ɛ ) + ɛx ɛ ) H(s ɛ, y ɛ, 1 ɛ 2 (x ɛ y ɛ ) ɛy ɛ ) = H(t ɛ, x ɛ, 1 ɛ 2 (x ɛ y ɛ ) + ɛx ɛ ) H(s ɛ, y ɛ, 1 ɛ 2 (x ɛ y ɛ ) + ɛx ɛ ) +H(s ɛ, y ɛ, 1 ɛ 2 (x ɛ y ɛ ) + ɛx ɛ ) H(s ɛ, y ɛ, 1 ɛ 2 (x ɛ y ɛ ) ɛy ɛ ) C( t ɛ s ɛ + x ɛ y ɛ )(1 + 1 ɛ 2 (x ɛ y ɛ ) + ɛx ɛ ) + Cɛ( x ɛ + y ɛ ) 0 as ɛ 0 by (4.9) and (4.10), whch s a contradcton. Therefore, we must have that u(x, t) v(x, t) for any (t, x) [, T ] R m. Wth the comparson prncple, we are able to show that the game has a value satsfyng the Hamlton-Jacob equaton (4.1) f Isaacs condton holds. Ths s an analogous result to Theorem 1.10, where we assume that our value functon s smooth. Theorem Assume that Isaacs condton holds: H + (t, x, p) = H (t, x, p) for any (t, x, p) [, T ] R m R m. Then the game has a value V (t, x) = V + (t, x) = V (t, x) for any (t, x) [, T ] R m. Proof. Snce H + (t, x, p) = H (t, x, p) := H(t, x, p), then by Theorem 4.2, V + s a vscosty soluton of V t + (t, x) + H(t, x, D x V + (t, x)) = 0. In partcular, t s a vscosty subsoluton. Smlarly, V s a vscosty soluton of Vt (t, x) + H(t, x, D x V (t, x)) = 0. In partcular, t s a vscosty ersoluton. Moreover, g(x(t )) = V + (T, x) V (T, x) = g(x(t )). Therefore, V + (t, x) V (t, x) for any (t, x) [, T ] R m. Reversng the roles of V + and V, we can also get V + (t, x) V (t, x) for any (t, x) [, T ] R m. Therefore, the game has a value: V (t, x) := V + (t, x) = V (t, x), whch satsfes V t (t, x) + H(t, x, D x V (t, x)) = 0. We brefly menton here the nte horzon problem snce t shares smlar propertes wth the fnte tme horzon problems. Consder the followng ODE: For 0 t <, and x R m, (4.15) ẋ(t) = f(x(t), u(t), v(t)) x(0) = x 0
14 14 BOTAO WU The pay-off functonal s: P (x 0, u, v) = 0 e λt h(x(t), u(t), v(t))dt, where x( ) solves the ODE (4.13) for any u( ) Φ 1, and v( ) Φ 2. The upper and lower value functons are defned accordngly as before. We have the followng propertes of the value functons: Theorem The value functons V + and V are bounded and Lpschtz contnuous. Theorem The value functons satsfy the dynamc programmng property. For the upper value functon V +, the dynamc programmng property s: V + (x 0 ) = β B d u U 0 e λs h(x(s), u(s), β[u](s))ds + e λt V + (x(t)) As Secton 1 ponts out, the assocated Hamlton-Jacob equaton s λv (x) + H(x, DV (x)) = 0. The proofs of the above theorems are smlar to the proofs of the fnte tme problem. 5. characterzaton and exstence of Nash equlbrum In secton 1, we showed n the verfcaton theorem that V (, x 0 ) = P (, x 0, u, ṽ) = P (, x 0, ũ, v) = P (, x 0, ũ, ṽ). u Φ 1 v Φ 2 Notce that ũ, and ṽ closely resemble the Nash equlbrum strateges n the game theory lterature, where each control s the best response map to the control of the other player. In ths secton, we attempt to show rgorously the exstence of Nash equlbrum pay-offs from Isaacs condton n a dfferental game. We consder a N-player dfferental game. Defnton 5.1. Let (, x 0 ) [0, T ] R N. We say that (e 1, e 2,..., e n ) s a Nash equlbrum pay-off at (, x 0 ) f for any ɛ > 0, there exst (ᾱ ) A d... AN d of delay strateges such that for any 1, 2,..., N, we have: e P (, x 0, (ᾱ )) ɛ and P (, x 0, ᾱ, ᾱ ) P (, x 0, ᾱ, u ) ɛ, for any u Φ ( ). [3] suggests that t s suffcent to maxmze just the termnal pay-off as opposed to maxmzng both the ntegral and the termnal pay-offs. We shall, therefore, restrct: P (, x 0, (α )) = g (x(t )), where x( ) solves the ODE wth x( ) = x 0 and some u ( ) = α [(u )]( ). We see that Defnton 5.1 s the tradtonal Nash Equlbrum defnton n game theory lterature where t s emphaszed that no unlateral devaton n strategy by any sngle player s proftable. For convenence, we wll be prmarly workng on an equvalent defnton that [2] gves. Defnton 5.2. (Equvalent defnton of Nash equlbrum). (e ) s a Nash equlbrum pay-off f and only f t s
15 A SURVEY OF PROPERTIES OF FINITE HORIZON DIFFERENTIAL GAMES UNDER ISAACS CONDITION 15 (Reachable) For any 1, 2,..., N, e g (x(t )) ɛ. (Consstent) For any t [, T ], e V (t, x(t)) ɛ, where x( ) solves the ODE wth x( ) = x 0 and a N-tuple of controls (u ), and V s the lower value of the game wth (5.3) V (, x 0 ) := α A d () (u ) Φ () Isaacs condton n ths case states that: (u ) Φ u Φ f(x, u, (u )) p} = u Φ P (, x 0, α [(u )], u ) f(x, u, (u )) p} (u ) Φ for any (x, p) R N R N. We have shown that under ths condton, V the value of the game and satsfes the followng equalty: V (, x 0 ) = α A d () (u ) Φ () P (, x 0, α [(u )], u ). s ndeed Our goal s to prove the followng theorem: Theorem 5.4. If the functons f and g are bounded and Lpschtz contnuous, and Isaacs condton holds, then for any (, x 0 ) [, T ] R N, there exsts at least one Nash Equlbum at (, x 0 ). The proof of ths theorem follows deas from [2] and depends on the followng two lemmas. Lemma 5.5. For any (, x 0 ) [, T ] R N and for any ɛ > 0, there exst (ū ) Φ 1 ( )... Φ N ( ) such that for all 1, 2,..., N, and for all t [, T ], we have V (t, x(t)) V (, x 0 ) + ɛ, where x(t) satsfes the ODE (1.1). Proof. For any ɛ > 0, by defnton of the lower value functons, there exsts a delay strategy ᾱ A d () such that V (, x 0 ) ɛ P (, x 0, ᾱ, (u )) (u ) Φ () By Theorem 2.2, there exsts a unque N-tuple of controls (ū ) such that ᾱ [(ū )] = ū for any 1, 2,..., N. Let x(t) be the trajectory of the ODE wth x( ) = x 0 and controls (ū ). Then for any t 1 [, T ], we defne a new strategy α A d (t 1) by α [(u 1 )] = ᾱ[(ũ )] [t1,t ] where (ũ (ū )(t) t t 1 )(t) = (u for any (u ) U (t 1 ). Thus: )(t) t 1 t T V (t 1, x(t 1 )) = α A d (t 1) (u ) Φ (t 1) P (t 1, x(t 1 ), α, (u )) P (t 1, x(t 1 ), α, (u )) (u ) Φ (t 1) P (, x( ), ᾱ, (v )) (u ) Φ ()
16 16 BOTAO WU because f (v ) = (u ) on [, t 1 ], then we have the trajectores: x(t ; x( ) = x 0, ᾱ, (v )) = x(t ; x(t 1 ), α, (v ) [t1,t ]). Thus, we have: V (t 1, x(t 1 )) P (, x 0, ᾱ, (v )) V (v ) Φ (.x 0 ) ɛ () Lemma 5.6. For any (, x 0 ) [, T ] R N, and for any ɛ > 0, there exsts a N-tuple of control (u ) Φ 1 ( )... Φ N ( ), such that: V (s, x(s)) V (t, x(t)) + ɛ for any s t T where x(t) solves the ODE (1.1) wth x( ) = x 0 and controls (u ). (T t0)k Proof. For n > 1, let t k = + n. By Lemma 5.5, we can construct a N-tuple controls (u n) : [t k, t k+1 ) Φ 1... Φ N such that: V (t, x n (t)) V (t k, x n (t k )) 1 n for any t [t 2 k, t k+1 ), where x n ( ) s the ODE (1.1) trajectory wth x( ) = x 0 and controls (u n). Now for any s t T, we can fnd t k1 and t k2 such that t k1 s t k1+1 and t k2 t t k2+1. Snce we proved that the value functon V s Lpschtz contnuous n all varables and functon f s bounded, we can wrte: V (t, x n (t)) V (t k2, x n (t k2 )) V (t, x n (t)) V (t k2, x n (t)) + V (t k2, x n (t)) V (t k2, x n (t k2 )) M t k2+1 t k2 + C x n (t) x n (t k2 ) where M andc are the Lpschtz constants t M t k2+1 t k2 + f(s, x N (s), (u n) ds t k2 M t k2+1 t k2 + C 1 t k2+1 t k2 where C 1 s the bound of functon f. 1 n (M + C 1) := K n from our constructons of t k 2. We can prove smlarly that V V k 2 1 (s, x n (s)) V (t k1, x n (t k1 )) K n. Therefore, (t, x n (t)) V (s, x n (s)) V (t k2, x n (t k2 )) V (t k1+1, x n (t k1+1)) 2K n (V (t j+1, x n (t +1 )) V (t j, x n (t j ))) 2K n 1 n 2K ɛ for some n j=k 1+1 sutable choce of ɛ. We now prove Theorem 5.3 usng the Defnton 5.2, the equvalent defnton of Nash Equlbrum. Proof. Let the N-tuple controls (u n) be constructed as n Lemma 5.5 wth ɛ = 1/n. Let x n (t) be the ODE (1.1) trajectory wth x n ( ) = x 0 and controls (u n). The sequence x n s unformly bounded and Lpschtz contnuous on [, T ], and thus
17 A SURVEY OF PROPERTIES OF FINITE HORIZON DIFFERENTIAL GAMES UNDER ISAACS CONDITION 17 we can fnd a subsequence x nj unformly on [, T ]. From the contnuty of the value functon V whch converges to some contnuous trajectory x and the con- s non-decreasng: structon of the controls (u n), by Lemma 5.5 we have that V V (s, x(s)) V (t, x(t)) for any s t T and for any 1, 2,...N. Set (e ) = (g (x(t ))). Snce V V s non-decreasng, we have: (t, x(t)) V (T, x(t )) = e for any s t T and for any 1, 2,...N Agan, snce x n unformly converges to x and V s contnuous, for any ɛ > 0, we can fnd some n such that: e g (x n (T )) ɛ and V (t, x n (t)) e + ɛ for any t [, T ]. By Defnton 5.2 ths means that (e ) s Nash equlbrum pay-off. Acknowledgments. It s a pleasure to thank my mentor, Yan Zhang, for hs help and advce for ths paper. I would also lke to thank Prof. Peter May for organzng the REU program each year. References [1] L.C.Evans and P.E.Sougands. Dfferental Games and Representaton Formulas for Solutons of Hamlton-Jacob-Isaacs Equatons. [2] Cardalaguet Perre. Introducton to Dfferental Games. [3] Rufus Isaacs. Dfferental Games.
The Second Anti-Mathima on Game Theory
The Second Ant-Mathma on Game Theory Ath. Kehagas December 1 2006 1 Introducton In ths note we wll examne the noton of game equlbrum for three types of games 1. 2-player 2-acton zero-sum games 2. 2-player
More informationprinceton univ. F 17 cos 521: Advanced Algorithm Design Lecture 7: LP Duality Lecturer: Matt Weinberg
prnceton unv. F 17 cos 521: Advanced Algorthm Desgn Lecture 7: LP Dualty Lecturer: Matt Wenberg Scrbe: LP Dualty s an extremely useful tool for analyzng structural propertes of lnear programs. Whle there
More informationAPPENDIX A Some Linear Algebra
APPENDIX A Some Lnear Algebra The collecton of m, n matrces A.1 Matrces a 1,1,..., a 1,n A = a m,1,..., a m,n wth real elements a,j s denoted by R m,n. If n = 1 then A s called a column vector. Smlarly,
More informationFirst day August 1, Problems and Solutions
FOURTH INTERNATIONAL COMPETITION FOR UNIVERSITY STUDENTS IN MATHEMATICS July 30 August 4, 997, Plovdv, BULGARIA Frst day August, 997 Problems and Solutons Problem. Let {ε n } n= be a sequence of postve
More informationk t+1 + c t A t k t, t=0
Macro II (UC3M, MA/PhD Econ) Professor: Matthas Kredler Fnal Exam 6 May 208 You have 50 mnutes to complete the exam There are 80 ponts n total The exam has 4 pages If somethng n the queston s unclear,
More informationAppendix B. Criterion of Riemann-Stieltjes Integrability
Appendx B. Crteron of Remann-Steltes Integrablty Ths note s complementary to [R, Ch. 6] and [T, Sec. 3.5]. The man result of ths note s Theorem B.3, whch provdes the necessary and suffcent condtons for
More informationCOS 521: Advanced Algorithms Game Theory and Linear Programming
COS 521: Advanced Algorthms Game Theory and Lnear Programmng Moses Charkar February 27, 2013 In these notes, we ntroduce some basc concepts n game theory and lnear programmng (LP). We show a connecton
More informationComplete subgraphs in multipartite graphs
Complete subgraphs n multpartte graphs FLORIAN PFENDER Unverstät Rostock, Insttut für Mathematk D-18057 Rostock, Germany Floran.Pfender@un-rostock.de Abstract Turán s Theorem states that every graph G
More informationModelli Clamfim Equazioni differenziali 7 ottobre 2013
CLAMFIM Bologna Modell 1 @ Clamfm Equazon dfferenzal 7 ottobre 2013 professor Danele Rtell danele.rtell@unbo.t 1/18? Ordnary Dfferental Equatons A dfferental equaton s an equaton that defnes a relatonshp
More informationMore metrics on cartesian products
More metrcs on cartesan products If (X, d ) are metrc spaces for 1 n, then n Secton II4 of the lecture notes we defned three metrcs on X whose underlyng topologes are the product topology The purpose of
More informationAssortment Optimization under MNL
Assortment Optmzaton under MNL Haotan Song Aprl 30, 2017 1 Introducton The assortment optmzaton problem ams to fnd the revenue-maxmzng assortment of products to offer when the prces of products are fxed.
More informationProblem Set 9 Solutions
Desgn and Analyss of Algorthms May 4, 2015 Massachusetts Insttute of Technology 6.046J/18.410J Profs. Erk Demane, Srn Devadas, and Nancy Lynch Problem Set 9 Solutons Problem Set 9 Solutons Ths problem
More informationREAL ANALYSIS I HOMEWORK 1
REAL ANALYSIS I HOMEWORK CİHAN BAHRAN The questons are from Tao s text. Exercse 0.0.. If (x α ) α A s a collecton of numbers x α [0, + ] such that x α
More informationMath 217 Fall 2013 Homework 2 Solutions
Math 17 Fall 013 Homework Solutons Due Thursday Sept. 6, 013 5pm Ths homework conssts of 6 problems of 5 ponts each. The total s 30. You need to fully justfy your answer prove that your functon ndeed has
More informationTHE CHINESE REMAINDER THEOREM. We should thank the Chinese for their wonderful remainder theorem. Glenn Stevens
THE CHINESE REMAINDER THEOREM KEITH CONRAD We should thank the Chnese for ther wonderful remander theorem. Glenn Stevens 1. Introducton The Chnese remander theorem says we can unquely solve any par of
More information3 Basic boundary value problems for analytic function in the upper half plane
3 Basc boundary value problems for analytc functon n the upper half plane 3. Posson representaton formulas for the half plane Let f be an analytc functon of z throughout the half plane Imz > 0, contnuous
More informationAffine and Riemannian Connections
Affne and Remannan Connectons Semnar Remannan Geometry Summer Term 2015 Prof Dr Anna Wenhard and Dr Gye-Seon Lee Jakob Ullmann Notaton: X(M) space of smooth vector felds on M D(M) space of smooth functons
More informationThe Order Relation and Trace Inequalities for. Hermitian Operators
Internatonal Mathematcal Forum, Vol 3, 08, no, 507-57 HIKARI Ltd, wwwm-hkarcom https://doorg/0988/mf088055 The Order Relaton and Trace Inequaltes for Hermtan Operators Y Huang School of Informaton Scence
More informationCanonical transformations
Canoncal transformatons November 23, 2014 Recall that we have defned a symplectc transformaton to be any lnear transformaton M A B leavng the symplectc form nvarant, Ω AB M A CM B DΩ CD Coordnate transformatons,
More informationPerfect Competition and the Nash Bargaining Solution
Perfect Competton and the Nash Barganng Soluton Renhard John Department of Economcs Unversty of Bonn Adenauerallee 24-42 53113 Bonn, Germany emal: rohn@un-bonn.de May 2005 Abstract For a lnear exchange
More informationLecture 10 Support Vector Machines II
Lecture 10 Support Vector Machnes II 22 February 2016 Taylor B. Arnold Yale Statstcs STAT 365/665 1/28 Notes: Problem 3 s posted and due ths upcomng Frday There was an early bug n the fake-test data; fxed
More informationn α j x j = 0 j=1 has a nontrivial solution. Here A is the n k matrix whose jth column is the vector for all t j=0
MODULE 2 Topcs: Lnear ndependence, bass and dmenson We have seen that f n a set of vectors one vector s a lnear combnaton of the remanng vectors n the set then the span of the set s unchanged f that vector
More informationLectures - Week 4 Matrix norms, Conditioning, Vector Spaces, Linear Independence, Spanning sets and Basis, Null space and Range of a Matrix
Lectures - Week 4 Matrx norms, Condtonng, Vector Spaces, Lnear Independence, Spannng sets and Bass, Null space and Range of a Matrx Matrx Norms Now we turn to assocatng a number to each matrx. We could
More informationLecture 12: Discrete Laplacian
Lecture 12: Dscrete Laplacan Scrbe: Tanye Lu Our goal s to come up wth a dscrete verson of Laplacan operator for trangulated surfaces, so that we can use t n practce to solve related problems We are mostly
More informationEdge Isoperimetric Inequalities
November 7, 2005 Ross M. Rchardson Edge Isopermetrc Inequaltes 1 Four Questons Recall that n the last lecture we looked at the problem of sopermetrc nequaltes n the hypercube, Q n. Our noton of boundary
More informationPhysics 5153 Classical Mechanics. D Alembert s Principle and The Lagrangian-1
P. Guterrez Physcs 5153 Classcal Mechancs D Alembert s Prncple and The Lagrangan 1 Introducton The prncple of vrtual work provdes a method of solvng problems of statc equlbrum wthout havng to consder the
More informationMaximizing the number of nonnegative subsets
Maxmzng the number of nonnegatve subsets Noga Alon Hao Huang December 1, 213 Abstract Gven a set of n real numbers, f the sum of elements of every subset of sze larger than k s negatve, what s the maxmum
More informationThe Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction
ECONOMICS 5* -- NOTE (Summary) ECON 5* -- NOTE The Multple Classcal Lnear Regresson Model (CLRM): Specfcaton and Assumptons. Introducton CLRM stands for the Classcal Lnear Regresson Model. The CLRM s also
More informationSolutions to exam in SF1811 Optimization, Jan 14, 2015
Solutons to exam n SF8 Optmzaton, Jan 4, 25 3 3 O------O -4 \ / \ / The network: \/ where all lnks go from left to rght. /\ / \ / \ 6 O------O -5 2 4.(a) Let x = ( x 3, x 4, x 23, x 24 ) T, where the varable
More informationON A DETERMINATION OF THE INITIAL FUNCTIONS FROM THE OBSERVED VALUES OF THE BOUNDARY FUNCTIONS FOR THE SECOND-ORDER HYPERBOLIC EQUATION
Advanced Mathematcal Models & Applcatons Vol.3, No.3, 2018, pp.215-222 ON A DETERMINATION OF THE INITIAL FUNCTIONS FROM THE OBSERVED VALUES OF THE BOUNDARY FUNCTIONS FOR THE SECOND-ORDER HYPERBOLIC EUATION
More informationAppendix for Causal Interaction in Factorial Experiments: Application to Conjoint Analysis
A Appendx for Causal Interacton n Factoral Experments: Applcaton to Conjont Analyss Mathematcal Appendx: Proofs of Theorems A. Lemmas Below, we descrbe all the lemmas, whch are used to prove the man theorems
More informationMASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 12 10/21/2013. Martingale Concentration Inequalities and Applications
MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.65/15.070J Fall 013 Lecture 1 10/1/013 Martngale Concentraton Inequaltes and Applcatons Content. 1. Exponental concentraton for martngales wth bounded ncrements.
More informationEstimation: Part 2. Chapter GREG estimation
Chapter 9 Estmaton: Part 2 9. GREG estmaton In Chapter 8, we have seen that the regresson estmator s an effcent estmator when there s a lnear relatonshp between y and x. In ths chapter, we generalzed the
More informationSimultaneous Optimization of Berth Allocation, Quay Crane Assignment and Quay Crane Scheduling Problems in Container Terminals
Smultaneous Optmzaton of Berth Allocaton, Quay Crane Assgnment and Quay Crane Schedulng Problems n Contaner Termnals Necat Aras, Yavuz Türkoğulları, Z. Caner Taşkın, Kuban Altınel Abstract In ths work,
More informationOnline Appendix. t=1 (p t w)q t. Then the first order condition shows that
Artcle forthcomng to ; manuscrpt no (Please, provde the manuscrpt number!) 1 Onlne Appendx Appendx E: Proofs Proof of Proposton 1 Frst we derve the equlbrum when the manufacturer does not vertcally ntegrate
More informationPHYS 705: Classical Mechanics. Calculus of Variations II
1 PHYS 705: Classcal Mechancs Calculus of Varatons II 2 Calculus of Varatons: Generalzaton (no constrant yet) Suppose now that F depends on several dependent varables : We need to fnd such that has a statonary
More informationAppendix B. The Finite Difference Scheme
140 APPENDIXES Appendx B. The Fnte Dfference Scheme In ths appendx we present numercal technques whch are used to approxmate solutons of system 3.1 3.3. A comprehensve treatment of theoretcal and mplementaton
More informationCollege of Computer & Information Science Fall 2009 Northeastern University 20 October 2009
College of Computer & Informaton Scence Fall 2009 Northeastern Unversty 20 October 2009 CS7880: Algorthmc Power Tools Scrbe: Jan Wen and Laura Poplawsk Lecture Outlne: Prmal-dual schema Network Desgn:
More informationMATH 241B FUNCTIONAL ANALYSIS - NOTES EXAMPLES OF C ALGEBRAS
MATH 241B FUNCTIONAL ANALYSIS - NOTES EXAMPLES OF C ALGEBRAS These are nformal notes whch cover some of the materal whch s not n the course book. The man purpose s to gve a number of nontrval examples
More informationExercise Solutions to Real Analysis
xercse Solutons to Real Analyss Note: References refer to H. L. Royden, Real Analyss xersze 1. Gven any set A any ɛ > 0, there s an open set O such that A O m O m A + ɛ. Soluton 1. If m A =, then there
More informationAxiomatizations of Pareto Equilibria in Multicriteria Games
ames and Economc Behavor 28, 146154 1999. Artcle ID game.1998.0680, avalable onlne at http:www.dealbrary.com on Axomatzatons of Pareto Equlbra n Multcrtera ames Mark Voorneveld,* Dres Vermeulen, and Peter
More informationModule 3 LOSSY IMAGE COMPRESSION SYSTEMS. Version 2 ECE IIT, Kharagpur
Module 3 LOSSY IMAGE COMPRESSION SYSTEMS Verson ECE IIT, Kharagpur Lesson 6 Theory of Quantzaton Verson ECE IIT, Kharagpur Instructonal Objectves At the end of ths lesson, the students should be able to:
More informationGeneralized Linear Methods
Generalzed Lnear Methods 1 Introducton In the Ensemble Methods the general dea s that usng a combnaton of several weak learner one could make a better learner. More formally, assume that we have a set
More informationUniqueness of Weak Solutions to the 3D Ginzburg- Landau Model for Superconductivity
Int. Journal of Math. Analyss, Vol. 6, 212, no. 22, 195-114 Unqueness of Weak Solutons to the 3D Gnzburg- Landau Model for Superconductvty Jshan Fan Department of Appled Mathematcs Nanjng Forestry Unversty
More information10. Canonical Transformations Michael Fowler
10. Canoncal Transformatons Mchael Fowler Pont Transformatons It s clear that Lagrange s equatons are correct for any reasonable choce of parameters labelng the system confguraton. Let s call our frst
More informationModelli Clamfim Equazioni differenziali 22 settembre 2016
CLAMFIM Bologna Modell 1 @ Clamfm Equazon dfferenzal 22 settembre 2016 professor Danele Rtell danele.rtell@unbo.t 1/22? Ordnary Dfferental Equatons A dfferental equaton s an equaton that defnes a relatonshp
More informationTransfer Functions. Convenient representation of a linear, dynamic model. A transfer function (TF) relates one input and one output: ( ) system
Transfer Functons Convenent representaton of a lnear, dynamc model. A transfer functon (TF) relates one nput and one output: x t X s y t system Y s The followng termnology s used: x y nput output forcng
More informationOn the set of natural numbers
On the set of natural numbers by Jalton C. Ferrera Copyrght 2001 Jalton da Costa Ferrera Introducton The natural numbers have been understood as fnte numbers, ths wor tres to show that the natural numbers
More informationRepresentation theory and quantum mechanics tutorial Representation theory and quantum conservation laws
Representaton theory and quantum mechancs tutoral Representaton theory and quantum conservaton laws Justn Campbell August 1, 2017 1 Generaltes on representaton theory 1.1 Let G GL m (R) be a real algebrac
More information12. The Hamilton-Jacobi Equation Michael Fowler
1. The Hamlton-Jacob Equaton Mchael Fowler Back to Confguraton Space We ve establshed that the acton, regarded as a functon of ts coordnate endponts and tme, satsfes ( ) ( ) S q, t / t+ H qpt,, = 0, and
More informationECE559VV Project Report
ECE559VV Project Report (Supplementary Notes Loc Xuan Bu I. MAX SUM-RATE SCHEDULING: THE UPLINK CASE We have seen (n the presentaton that, for downlnk (broadcast channels, the strategy maxmzng the sum-rate
More informationMAT 578 Functional Analysis
MAT 578 Functonal Analyss John Qugg Fall 2008 Locally convex spaces revsed September 6, 2008 Ths secton establshes the fundamental propertes of locally convex spaces. Acknowledgment: although I wrote these
More informationAnti-van der Waerden numbers of 3-term arithmetic progressions.
Ant-van der Waerden numbers of 3-term arthmetc progressons. Zhanar Berkkyzy, Alex Schulte, and Mchael Young Aprl 24, 2016 Abstract The ant-van der Waerden number, denoted by aw([n], k), s the smallest
More informationInfinitely Split Nash Equilibrium Problems in Repeated Games
Infntely Splt ash Equlbrum Problems n Repeated Games Jnlu L Department of Mathematcs Shawnee State Unversty Portsmouth, Oho 4566 USA Abstract In ths paper, we ntroduce the concept of nfntely splt ash equlbrum
More informationGeneral viscosity iterative method for a sequence of quasi-nonexpansive mappings
Avalable onlne at www.tjnsa.com J. Nonlnear Sc. Appl. 9 (2016), 5672 5682 Research Artcle General vscosty teratve method for a sequence of quas-nonexpansve mappngs Cuje Zhang, Ynan Wang College of Scence,
More informationPoisson brackets and canonical transformations
rof O B Wrght Mechancs Notes osson brackets and canoncal transformatons osson Brackets Consder an arbtrary functon f f ( qp t) df f f f q p q p t But q p p where ( qp ) pq q df f f f p q q p t In order
More informationErrors for Linear Systems
Errors for Lnear Systems When we solve a lnear system Ax b we often do not know A and b exactly, but have only approxmatons  and ˆb avalable. Then the best thng we can do s to solve ˆx ˆb exactly whch
More informationErratum: A Generalized Path Integral Control Approach to Reinforcement Learning
Journal of Machne Learnng Research 00-9 Submtted /0; Publshed 7/ Erratum: A Generalzed Path Integral Control Approach to Renforcement Learnng Evangelos ATheodorou Jonas Buchl Stefan Schaal Department of
More informationNotes on Frequency Estimation in Data Streams
Notes on Frequency Estmaton n Data Streams In (one of) the data streamng model(s), the data s a sequence of arrvals a 1, a 2,..., a m of the form a j = (, v) where s the dentty of the tem and belongs to
More informationGames of Threats. Elon Kohlberg Abraham Neyman. Working Paper
Games of Threats Elon Kohlberg Abraham Neyman Workng Paper 18-023 Games of Threats Elon Kohlberg Harvard Busness School Abraham Neyman The Hebrew Unversty of Jerusalem Workng Paper 18-023 Copyrght 2017
More informationLecture 21: Numerical methods for pricing American type derivatives
Lecture 21: Numercal methods for prcng Amercan type dervatves Xaoguang Wang STAT 598W Aprl 10th, 2014 (STAT 598W) Lecture 21 1 / 26 Outlne 1 Fnte Dfference Method Explct Method Penalty Method (STAT 598W)
More informationSolutions to the 71st William Lowell Putnam Mathematical Competition Saturday, December 4, 2010
Solutons to the 7st Wllam Lowell Putnam Mathematcal Competton Saturday, December 4, 2 Kran Kedlaya and Lenny Ng A The largest such k s n+ 2 n 2. For n even, ths value s acheved by the partton {,n},{2,n
More information5 The Rational Canonical Form
5 The Ratonal Canoncal Form Here p s a monc rreducble factor of the mnmum polynomal m T and s not necessarly of degree one Let F p denote the feld constructed earler n the course, consstng of all matrces
More informationSTEINHAUS PROPERTY IN BANACH LATTICES
DEPARTMENT OF MATHEMATICS TECHNICAL REPORT STEINHAUS PROPERTY IN BANACH LATTICES DAMIAN KUBIAK AND DAVID TIDWELL SPRING 2015 No. 2015-1 TENNESSEE TECHNOLOGICAL UNIVERSITY Cookevlle, TN 38505 STEINHAUS
More informationModule 9. Lecture 6. Duality in Assignment Problems
Module 9 1 Lecture 6 Dualty n Assgnment Problems In ths lecture we attempt to answer few other mportant questons posed n earler lecture for (AP) and see how some of them can be explaned through the concept
More information3.1 Expectation of Functions of Several Random Variables. )' be a k-dimensional discrete or continuous random vector, with joint PMF p (, E X E X1 E X
Statstcs 1: Probablty Theory II 37 3 EPECTATION OF SEVERAL RANDOM VARIABLES As n Probablty Theory I, the nterest n most stuatons les not on the actual dstrbuton of a random vector, but rather on a number
More information20. Mon, Oct. 13 What we have done so far corresponds roughly to Chapters 2 & 3 of Lee. Now we turn to Chapter 4. The first idea is connectedness.
20. Mon, Oct. 13 What we have done so far corresponds roughly to Chapters 2 & 3 of Lee. Now we turn to Chapter 4. The frst dea s connectedness. Essentally, we want to say that a space cannot be decomposed
More informationFINITELY-GENERATED MODULES OVER A PRINCIPAL IDEAL DOMAIN
FINITELY-GENERTED MODULES OVER PRINCIPL IDEL DOMIN EMMNUEL KOWLSKI Throughout ths note, s a prncpal deal doman. We recall the classfcaton theorem: Theorem 1. Let M be a fntely-generated -module. (1) There
More informationGraph Reconstruction by Permutations
Graph Reconstructon by Permutatons Perre Ille and Wllam Kocay* Insttut de Mathémathques de Lumny CNRS UMR 6206 163 avenue de Lumny, Case 907 13288 Marselle Cedex 9, France e-mal: lle@ml.unv-mrs.fr Computer
More informationNP-Completeness : Proofs
NP-Completeness : Proofs Proof Methods A method to show a decson problem Π NP-complete s as follows. (1) Show Π NP. (2) Choose an NP-complete problem Π. (3) Show Π Π. A method to show an optmzaton problem
More informationStrong Markov property: Same assertion holds for stopping times τ.
Brownan moton Let X ={X t : t R + } be a real-valued stochastc process: a famlty of real random varables all defned on the same probablty space. Defne F t = nformaton avalable by observng the process up
More informationSupplement: Proofs and Technical Details for The Solution Path of the Generalized Lasso
Supplement: Proofs and Techncal Detals for The Soluton Path of the Generalzed Lasso Ryan J. Tbshran Jonathan Taylor In ths document we gve supplementary detals to the paper The Soluton Path of the Generalzed
More informationPHYS 705: Classical Mechanics. Canonical Transformation II
1 PHYS 705: Classcal Mechancs Canoncal Transformaton II Example: Harmonc Oscllator f ( x) x m 0 x U( x) x mx x LT U m Defne or L p p mx x x m mx x H px L px p m p x m m H p 1 x m p m 1 m H x p m x m m
More information763622S ADVANCED QUANTUM MECHANICS Solution Set 1 Spring c n a n. c n 2 = 1.
7636S ADVANCED QUANTUM MECHANICS Soluton Set 1 Sprng 013 1 Warm-up Show that the egenvalues of a Hermtan operator  are real and that the egenkets correspondng to dfferent egenvalues are orthogonal (b)
More informationCOMPLEX NUMBERS AND QUADRATIC EQUATIONS
COMPLEX NUMBERS AND QUADRATIC EQUATIONS INTRODUCTION We know that x 0 for all x R e the square of a real number (whether postve, negatve or ero) s non-negatve Hence the equatons x, x, x + 7 0 etc are not
More information2.3 Nilpotent endomorphisms
s a block dagonal matrx, wth A Mat dm U (C) In fact, we can assume that B = B 1 B k, wth B an ordered bass of U, and that A = [f U ] B, where f U : U U s the restrcton of f to U 40 23 Nlpotent endomorphsms
More informationBOUNDEDNESS OF THE RIESZ TRANSFORM WITH MATRIX A 2 WEIGHTS
BOUNDEDNESS OF THE IESZ TANSFOM WITH MATIX A WEIGHTS Introducton Let L = L ( n, be the functon space wth norm (ˆ f L = f(x C dx d < For a d d matrx valued functon W : wth W (x postve sem-defnte for all
More informationAffine transformations and convexity
Affne transformatons and convexty The purpose of ths document s to prove some basc propertes of affne transformatons nvolvng convex sets. Here are a few onlne references for background nformaton: http://math.ucr.edu/
More information6.854J / J Advanced Algorithms Fall 2008
MIT OpenCourseWare http://ocw.mt.edu 6.854J / 18.415J Advanced Algorthms Fall 2008 For nformaton about ctng these materals or our Terms of Use, vst: http://ocw.mt.edu/terms. 18.415/6.854 Advanced Algorthms
More informationSome modelling aspects for the Matlab implementation of MMA
Some modellng aspects for the Matlab mplementaton of MMA Krster Svanberg krlle@math.kth.se Optmzaton and Systems Theory Department of Mathematcs KTH, SE 10044 Stockholm September 2004 1. Consdered optmzaton
More informationModelli Clamfim Equazione del Calore Lezione ottobre 2014
CLAMFIM Bologna Modell 1 @ Clamfm Equazone del Calore Lezone 17 15 ottobre 2014 professor Danele Rtell danele.rtell@unbo.t 1/24? Convoluton The convoluton of two functons g(t) and f(t) s the functon (g
More informationON THE BURGERS EQUATION WITH A STOCHASTIC STEPPING STONE NOISY TERM
O THE BURGERS EQUATIO WITH A STOCHASTIC STEPPIG STOE OISY TERM Eaterna T. Kolovsa Comuncacón Técnca o I-2-14/11-7-22 PE/CIMAT On the Burgers Equaton wth a stochastc steppng-stone nosy term Eaterna T. Kolovsa
More informationLecture Notes on Linear Regression
Lecture Notes on Lnear Regresson Feng L fl@sdueducn Shandong Unversty, Chna Lnear Regresson Problem In regresson problem, we am at predct a contnuous target value gven an nput feature vector We assume
More informationMATH 5707 HOMEWORK 4 SOLUTIONS 2. 2 i 2p i E(X i ) + E(Xi 2 ) ä i=1. i=1
MATH 5707 HOMEWORK 4 SOLUTIONS CİHAN BAHRAN 1. Let v 1,..., v n R m, all lengths v are not larger than 1. Let p 1,..., p n [0, 1] be arbtrary and set w = p 1 v 1 + + p n v n. Then there exst ε 1,..., ε
More informationNumerical Heat and Mass Transfer
Master degree n Mechancal Engneerng Numercal Heat and Mass Transfer 06-Fnte-Dfference Method (One-dmensonal, steady state heat conducton) Fausto Arpno f.arpno@uncas.t Introducton Why we use models and
More informationTHERE ARE INFINITELY MANY FIBONACCI COMPOSITES WITH PRIME SUBSCRIPTS
Research and Communcatons n Mathematcs and Mathematcal Scences Vol 10, Issue 2, 2018, Pages 123-140 ISSN 2319-6939 Publshed Onlne on November 19, 2018 2018 Jyot Academc Press http://jyotacademcpressorg
More informationa b a In case b 0, a being divisible by b is the same as to say that
Secton 6.2 Dvsblty among the ntegers An nteger a ε s dvsble by b ε f there s an nteger c ε such that a = bc. Note that s dvsble by any nteger b, snce = b. On the other hand, a s dvsble by only f a = :
More informationEconomics 101. Lecture 4 - Equilibrium and Efficiency
Economcs 0 Lecture 4 - Equlbrum and Effcency Intro As dscussed n the prevous lecture, we wll now move from an envronment where we looed at consumers mang decsons n solaton to analyzng economes full of
More informationON MECHANICS WITH VARIABLE NONCOMMUTATIVITY
ON MECHANICS WITH VARIABLE NONCOMMUTATIVITY CIPRIAN ACATRINEI Natonal Insttute of Nuclear Physcs and Engneerng P.O. Box MG-6, 07725-Bucharest, Romana E-mal: acatrne@theory.npne.ro. Receved March 6, 2008
More informationApplied Stochastic Processes
STAT455/855 Fall 23 Appled Stochastc Processes Fnal Exam, Bref Solutons 1. (15 marks) (a) (7 marks) The dstrbuton of Y s gven by ( ) ( ) y 2 1 5 P (Y y) for y 2, 3,... The above follows because each of
More informationk(k 1)(k 2)(p 2) 6(p d.
BLOCK-TRANSITIVE 3-DESIGNS WITH AFFINE AUTOMORPHISM GROUP Greg Gamble Let X = (Z p d where p s an odd prme and d N, and let B X, B = k. Then t was shown by Praeger that the set B = {B g g AGL d (p} s the
More informationSELECTED SOLUTIONS, SECTION (Weak duality) Prove that the primal and dual values p and d defined by equations (4.3.2) and (4.3.3) satisfy p d.
SELECTED SOLUTIONS, SECTION 4.3 1. Weak dualty Prove that the prmal and dual values p and d defned by equatons 4.3. and 4.3.3 satsfy p d. We consder an optmzaton problem of the form The Lagrangan for ths
More informationDifferential Games II. Marc Quincampoix Université de Bretagne Occidentale ( Brest-France) SADCO, London, September 2011
Differential Games II Marc Quincampoix Université de Bretagne Occidentale ( Brest-France) SADCO, London, September 2011 Contents 1. I Introduction: A Pursuit Game and Isaacs Theory 2. II Strategies 3.
More informationStanford University CS254: Computational Complexity Notes 7 Luca Trevisan January 29, Notes for Lecture 7
Stanford Unversty CS54: Computatonal Complexty Notes 7 Luca Trevsan January 9, 014 Notes for Lecture 7 1 Approxmate Countng wt an N oracle We complete te proof of te followng result: Teorem 1 For every
More informationMath 702 Midterm Exam Solutions
Math 702 Mdterm xam Solutons The terms measurable, measure, ntegrable, and almost everywhere (a.e.) n a ucldean space always refer to Lebesgue measure m. Problem. [6 pts] In each case, prove the statement
More informationGame Theory. Lecture Notes By Y. Narahari. Department of Computer Science and Automation Indian Institute of Science Bangalore, India February 2008
Game Theory Lecture Notes By Y. Narahar Department of Computer Scence and Automaton Indan Insttute of Scence Bangalore, Inda February 2008 Chapter 10: Two Person Zero Sum Games Note: Ths s a only a draft
More informationSalmon: Lectures on partial differential equations. Consider the general linear, second-order PDE in the form. ,x 2
Salmon: Lectures on partal dfferental equatons 5. Classfcaton of second-order equatons There are general methods for classfyng hgher-order partal dfferental equatons. One s very general (applyng even to
More informationLecture 14: Bandits with Budget Constraints
IEOR 8100-001: Learnng and Optmzaton for Sequental Decson Makng 03/07/16 Lecture 14: andts wth udget Constrants Instructor: Shpra Agrawal Scrbed by: Zhpeng Lu 1 Problem defnton In the regular Mult-armed
More informationA combinatorial problem associated with nonograms
A combnatoral problem assocated wth nonograms Jessca Benton Ron Snow Nolan Wallach March 21, 2005 1 Introducton. Ths work was motvated by a queston posed by the second named author to the frst named author
More informationFoundations of Arithmetic
Foundatons of Arthmetc Notaton We shall denote the sum and product of numbers n the usual notaton as a 2 + a 2 + a 3 + + a = a, a 1 a 2 a 3 a = a The notaton a b means a dvdes b,.e. ac = b where c s an
More information