Distributionally Robust Stochastic Control with Conic Confidence Sets
|
|
- Christiana Kelley
- 6 years ago
- Views:
Transcription
1 Disribuionally Robus Sochasic Conrol wih Conic Confidence Ses Insoon Yang Absrac The heory of (sandard) sochasic opimal conrol is based on he assumpion ha he probabiliy disribuion of uncerain variables is fully known. In pracice, however, obaining an accurae disribuion is ofen challenging. To resolve his issue, we sudy a disribuionally robus sochasic conrol problem ha minimizes a cos funcion of ineres given ha he disribuion of uncerain variables is no known bu lies in a so-called ambiguiy se. We firs invesigae a dynamic programming approach and idenify condiions for he exisence and opimaliy of non-randomized Markov policies. We hen propose a dualiy-based reformulaion mehod for an associaed Bellman equaion in cases wih conic confidence ses. This reformulaion alleviaes he compuaional issues inheren in he infinie-dimensional minimax opimizaion problem in he Bellman equaion wihou sacrificing opimaliy. The effeciveness of he proposed mehod is demonsraed hrough an applicaion o a sochasic invenory conrol problem. I. INTRODUCTION Sandard sochasic conrol mehods assume ha he probabiliy disribuion of uncerain variables (e.g., disurbances) is available. However, his assumpion is ofen resricive in pracice because obaining an accurae disribuion requires large-scale, high-resoluion sensor measuremens over a long raining period or muliple periods. Siuaions in which uncerain variables are no direcly observed could be much more challenging; compuaional mehods, such as filering or saisical learning echniques, are ofen used o obain he (poserior) disribuion of he uncerain variables given limied observaions. The accuracy of he obained disribuion is ofen poor, as i is subjec o he qualiy of he observaions, compuaional mehods, and prior knowledge abou he variables. If poor disribuional informaion is employed in consrucing a sochasic opimal conroller, i does no guaranee opimaliy and can even cause caasrophic sysem behaviors [1], [2]. To overcome his issue of limied disribuion informaion in sochasic conrol, we invesigae a disribuionally robus conrol approach. This emerging sochasic conrol mehod minimizes a cos funcion of ineres, assuming ha he disribuion of uncerain variables is no compleely known bu conained in a pre-specified ambiguiy se of probabiliy disribuions. In finie-sae Markov decision process seings, several ypes of ambiguiy ses have been considered: [3], [4], [5] employ ambiguiy ses wih momen consrains, confidence inervals and Wassersein disance, respecively. However, for coninuous-sae conrol problems, only a few This work is pored in par by NSF under ECCS and CNS I. Yang is wih he Elecrical Engineering Deparmen, Universiy of Souhern California, Los Angeles, CA 90089, USA insoonya@usc.edu cases wih momen consrain-based and oal variaion disance ambiguiy are well sudied [6], [7], [8]. In his paper, we consider an imporan class of ambiguiy ses ha are characerized wih confidence ses and probabiliy inervals in a coninuous-sae sochasic conrol seing. The conribuions of his work are wofold. Firs, we invesigae a dynamic programming soluion o discree-ime disribuionally robus conrol problems in general finie-horizon cases and provide condiions for he exisence and opimaliy of non-randomized Markov policies. This exisence resul is based on he lower semi-coninuiy of an associaed dynamic programming operaor. Our dynamic programming approach idenifies an imporan srucural propery: he sysem sae is a sufficien saisic. Second, we propose a dual formulaion of he Bellman equaion in cases wih conic confidence ses. This approach convers he compuaionally challenging infinie-dimensional minimax problem ino a semi-infinie program, which can be solved by exising convergen algorihms. We also show ha he proposed reformulaion is exac using srong dualiy and he nesing condiion for confidence ses proposed by Wiesemann e al. [9]. The uiliy of our approach is demonsraed hrough an applicaion o a sochasic invenory conrol problem. The remainder of his paper is organized as follows. In Secion II, we inroduce he problem seup, including a dynamic game formulaion of disribuionally robus conrol problems. In Secion III, we provide condiions for he exisence and opimaliy of non-randomized Markov policies and anoher se of condiions for he convexiy of he value funcion. Based on hese analyical resuls, we develop a dualiy-based reformulaion mehod for he Bellman equaion in Secion IV. A sochasic invenory conrol problem is considered in Secion V as an applicaion of he proposed mehod. We use he following noaion hroughou he paper. Given a Borel space X, P(X) denoes he se of Borel probabiliy measures on X. Given a cone K, K represens is dual cone. We also le T := {0, 1,, T 1} and T := {0, 1,, T }. II. PROBLEM SETUP A. Ambiguiy in he Disribuion of Disurbances Consider he following sochasic sysem subjec o he disurbance {w } T =0 1, w, defined on a sandard probabiliy space (Ω, F, P): x +1 = f(x, u, w ), (1) where x R n is he sae, u R m is he conrol inpu a sage and f : R n R m R n is a measurable funcion. We assume ha w s and w are independen for s. In
2 many pracical siuaions, he full disurbance disribuion may no be available. To overcome his challenge, we will invesigae a disribuionally robus conrol approach ha minimizes he wors-case cos associaed wih he sysem evoluion under informaion consrains characerized as a se of probabiliy disribuions. We assume ha he rue probabiliy disribuion µ of w is no compleely known bu conained in a so-called ambiguiy se, D. Example 1 (Ambiguiy wih Confidence Ses). Consider he following se of probabiliy measures: D := {µ P( ) µ(c i ) [p i, pi ], i I }, (2) where p i, pi [0, 1] are model parameers, C i s are confidence ses, and I := {1,, N } is a given index se. Wih any measure in his se, he probabiliy ha w is conained in he se C i is beween p i and pi. Noe ha he user can specify he disribuional informaion, such as C i and (p i, pi ), ha he conroller russ given daa and saisical informaion abou he disurbance. Anoher popular ype of ambiguiy ses in single-sage opimizaion problems is based on momen consrains and confidence ses [10], [11], [12], [13], [14], [9]. Recenly, saisical disance-based ambiguiy ses also received a grea deal of aenion since hey can be easily designed from an empirical disribuion wihou requiring a large enough daa samples o esimae momens [15], [16], [17], [18], [19], [20]. In Secion IV, we consider ambiguiy ses of he form (2) ha are characerized wih conic confidence ses. This class of ambiguiy ses can be inuiively consruced from empirical disribuions, as will be illusraed in Secion V. B. Disribuionally Robus Conrol as a Dynamic Game Le H be he se of hisories up o sage, whose elemen is of he form 1 h = (x 0, u 0, w 0,, x 1, u 1, w 1, x ). The se of admissible conrol sraegies is chosen as Π := {π := (π 0,, π T 1 ) π (U(x ) h ) = 1 h H }, where U(x ) is he se of admissible acions given sae x, and π is a sochasic kernel from H o R m. Noe ha he sraegy space is broad enough o conain randomized non- Markov policies. 2 Similarly, we le H e be he se of exended hisories up o sage, whose elemen akes he form h e := (x 0, u 0, w 0, µ 0,, x 1, u 1, w 1, µ 1, x, u ). By viewing he disurbance as an adversarial player who chooses he disurbance disribuion given available informaion h e, we define he se of admissible disribuion sraegies as Γ := {γ = (γ 0,, γ T 1 ) γ (D h e ) = 1 h e H e }. 1 All he resuls in his paper are valid wih hisories of he form h := (x 0, u 0, w 0, µ 0,, x 1, u 1, w 1, µ 1, x ) ha also conains Player II s acions (µ 0,, µ 1 ). However, we use he reduced version of hisories because he realized disribuions may no be observable in pracice. 2 Suppose ha for each, π ( h ) is concenraed a a measurable funcion φ : R n R m such ha φ (x) U(x) for all x R n and for all h H. Then, π is a (non-randomized) Markov policy and by a sligh abuse of noaion π is considered o be idenical wih φ. Consider he following cos funcion associaed wih he sysem evoluion saring from x 0 = x: [ T 1 ] J x [π, γ] := E π,γ r(x, u ) + q(x T ), =0 where r : R n R m R and q : R n R are sagewise and erminal cos funcions of ineres. Here, E π,γ is he expecaion aken wih respec o he probabiliy measure P π,γ induced by he sraegy pair (π, γ). Our goal is o choose a conrol policy ha minimizes he wors-case cos given ambiguous informaion abou he probabiliy disribuion {µ } T =0 1 of he disurbance {w } T =0 1. To be more precise, we define he opimal disribuionally robus conrol policies as follows: Definiion 1. A conrol sraegy π Π is said o be an opimal disribuionally robus policy if i saisfies J x [π, γ] J x [π, γ ] π Π. γ Γ γ Γ A desired conrol sraegy can be obained by solving he following minimax conrol problem: inf J x [π, γ]. (3) π Π γ Γ The mos imporan par of his problem formulaion is he inner maximizaion of he cos funcion over all probabiliy disribuion policies in he sraegy space Γ, which encodes disribuional ambiguiy hrough D. One can view his problem as a wo-player zero-sum dynamic game, in which Player I chooses a conrol policy o minimize he cos and Player II selecs he disurbance s disribuion sraegy ha maximizes he cos. The proposed minimax formulaion implies he following propery: Proposiion 1. Suppose ha an opimal soluion o he disribuionally robus conrol problem (3) exiss and is denoed by (π, γ ). Then, he following inequaliies hold: J x [π, γ] J x [π, γ ] J x [π, γ ] (π, γ) Π Γ. γ Γ The firs and second equaliies hold wih γ = γ and π = π, respecively. In addiion, π is an opimal disribuionally robus policy. The firs inequaliy implies ha when he opimal policy π is employed, he wors-case cos value is equal o J x [π, γ ] for any disribuional error consisen wih he consrains in he ambiguiy se D for each. Thus, his approach provides a performance guaranee in he form of an upper-bound, J x [π, γ ], of he cos value, which is igh. Noe ha his performance guaranee may no be valid when a differen conrol policy is employed as shown in he second inequaliy. Furhermore, he second inequaliy confirms ha an opimal soluion o he dynamic game problem (3) provides an opimal disribuionally robus policy.
3 III. DYNAMIC PROGRAMMING SOLUTION A. Exisence of Opimal Disribuionally Robus Policies We begin by inroducing he following dynamic programming operaor T, T : [ ] T v(x) := inf r(x, u)+ v(f(x, u, w))dµ(w), u U(x) µ D where v is a measurable funcion on R n. We hen define he value funcion of he disribuionally robus conrol problem (3) as v (x) := T T +1 T T 1 q(x) for each T and v T (x) := q(x). By definiion, v (x) represens he minimal wors-case expeced cos value from sage o T given x = x. Under he following assumpion for he measurable selecion condiion, he value funcion is lower semi-coninuous and hus he disribuionally robus conrol problem (3) admis a non-randomized Markov policy, which is opimal. Assumpion 1. The following properies hold: (i) r(x, u) and q(x) are lower semi-coninuous and bounded below for all (x, u) R n R m such ha u U(x); (ii) For each bounded coninuous funcion g : R n R, he funcion ĝ (x, u, µ) := g(f(x, u, w))dµ(w) is coninuous for all (x, u, µ) R n R m D such ha u U(x); (iii) The se U(x) is compac for each x R n. In addiion, he se-valued mapping x U(x) is upper semiconinuous. Theorem 1. Suppose ha Assumpion 1 holds. Then, he value funcion v is lower semi-coninuous for each T. Furhermore, here exiss a measurable funcion φ : R n R m for each T such ha φ (x) U(x) and v (x) = µ D [ r(x, φ (x)) + ] v +1 (f(x, φ (x), w))dµ(w) for all x R n. The non-randomized Markov policy π := (φ 0,, φ T 1 ) Π is an opimal soluion o he disribuionally robus conrol problem (3), i.e., v 0 (x) = J x [π, γ]. γ Γ This heorem can be shown by exending he proof of Theorem 1 in [7] and Theorem 3.1 in [21]. The key idea is o show ha he lower semi-coninuiy of v is preserved hrough he dynamic programming operaor. We can hen use mahemaical inducion o show ha v is lower semiconinuous and hus, he ouer minimizaion problem in he definiion of value funcions admis an opimal soluion. Noe ha Theorem 1 allows us o idenify an imporan srucural propery of he disribuionally robus conrol problem: he sysem sae is a sufficien saisic for Player I (conroller) under Assumpion 1. This observaion yields he pracical advanage ha i suffices o focus on non-randomized Markov polices when designing an opimal disribuionally robus conroller. B. Bellman Equaion Applying he dynamic programming principle [22], [23], we can evaluae he value funcion backward in ime as follows: Proposiion 2. Suppose ha Assumpion 1 holds. Then, he value funcion v saisfies he following Bellman equaion: v (x) = min u U(x) [ r(x, u) + µ D wih v T (x) = q(x). ] (4) v +1 (f(x, u, w))dµ(w) Noe ha due o Theorem 1 he ouer minimizaion problem in he Bellman equaion admis an opimal soluion. From he numerical perspecive, however, solving he Bellman equaion is challenging. In addiion o he scalabiliy issue inheren in dynamic programming, his Bellman equaion involves an infinie-dimensional minimax opimizaion problem because he disurbance may have a coninuous densiy. In he nex secion, we will resolve his compuaional issue for an imporan class of disribuionally robus sochasic conrol problems in which confidence ses are specified. Wih such an ambiguiy se, we will show ha an opimal disribuionally robus policy can be obained by solving compuaionally racable semi-infinie programs if he value funcion is convex. C. Convexiy of he Value Funcion We now show ha he value funcion is convex under he following condiions: Assumpion 2. The disribuionally robus conrol problem (3) saisfies he followings: (i) r : R n R m R and q : R n R are convex funcions; (ii) f : R n R m R n is an affine funcion; (iii) For all λ (0, 1) and for all x 1, x 2 R n, if u i U(x i ), i = 1, 2, hen λu 1 + (1 λ)u 2 U(λx 1 + (1 λ)x 2 ). Proposiion 3. Suppose ha Assumpion 2 holds. Then, he value funcion v : R n R is convex for each T. Proof. We use mahemaical inducion. For = T, v T : R n R is convex since v T = q. Suppose ha v τ : R n R is convex for τ = T 1,, + 1. We now he consider he value funcion a sage. Fix λ (0, 1) and x 1, x 2 R n. For any ɛ > 0, here exiss an ɛ-opimal soluion u i U(x i ) o he ouer minimizaion problem in he Bellman equaion (4) for (, x i ), i.e., v (x i )+ɛ > r(x i, u i )+ v +1 (f(x i, u i, w))dµ(w). µ D
4 Le x λ := λx 1 + (1 λ)x 2 and u λ := λu 1 + (1 λ)u 2. Due o Assumpion 2 (iii), u λ U(x λ ). Thus, v (x λ ) r(x λ, u λ ) + v +1 (f(x λ, u λ, w))dµ(w). µ D We now noice ha f(x λ, u λ, w) = λf(x 1, u 1, w) + (1 λ)f(x 2, u 2, w) due o Assumpion 2 (ii). Since v +1 and r are convex, we obain ha v (x λ ) λr(x 1, u 1 ) + (1 λ)r(x 2, u 2 ) + λv +1 (f(x 1, u 1, w))dµ(w) µ D R l + (1 λ)v +1 (f(x 2, u 2, w))dµ(w) µ D < λv (x 1 ) + (1 λ)v (x 2 ) + ɛ. Leing ɛ 0, we conclude ha v is convex. Noe ha his proposiion does no require he exisence of opimal disribuionally robus policies. Furhermore, we do no impose any specific srucure on he ambiguiy se D for he convexiy of he value funcion. IV. STRONG DUALITY-BASED REFORMULATION A. Disribuional Ambiguiy wih Conic Confidence Ses We now focus on he disribuionally robus conrol problem wih a paricular ambiguiy se of he form (2) in Example 1: D := {µ P( ) µ(c i ) [p i, pi ], i I }, where I := {1,, N } and he confidence se C i has he following conic represenaion: C i = {w C i w K i d i }, where C i R Li l and d i R Li are model parameers, and K i is a proper cone. We impose he following assumpion: Assumpion 3. The ambiguiy se saisfies he following condiions: (i) The confidence se C N is compac and p N = p N = 1. (ii) There exiss a disribuion measure µ D such ha µ(c) i (p i, pi ) whenever p i < pi, i I. (iii) (Nesing Condiion) For each T and all i, i I such ha i i, we have eiher C i s C i, C i s C i or C i C i =, where X s Y represens ha X is a sric subse of Y. The firs condiion represens ha C N is he por of µ. The second condiion ensures ha here exiss a probabiliy disribuion ha saisfies he probabilisic consrains in D as sric inequaliies whenever p i < pi. These wo regulariy condiions will guaranee ha srong dualiy holds based on he generalized Slaer-ype resuls from Shapiro [24] when he Bellman equaion is reformulaed in he nex subsecion. The hird condiion is called he nesing condiion [9], which implies ha here exiss a sric parial order on he confidence ses regarding he se inclusion and ha any incomparable ses are disjoin. This nesing condiion, ogeher wih he wo regulariy condiions, provides a racable dual formulaion of he Bellman equaion wihou loss of opimaliy. B. Dual Bellman Equaion We now reformulae he infinie dimensional minimax opimizaion problem in he Bellman equaion (4) as a semiinfinie program, which can be numerically solved by exising convergen algorihms. Furhermore, his reformulaion based on srong dualiy is exac as shown in he following heorem. Theorem 2. Suppose ha Assumpions 1, 2 and 3 hold. Then, he following equaliy holds for all (, x) T R n : 3 v (x) = inf r(x, u) + (p i κ i p i u,κ,λ,ν λi ) i I s.. (C i w d i ) ν i + i A (i) (κ i λ i ) v +1 (f(x, u, w)) w C N i I u U(x), λ, κ R N +, ν i K i, where A (i) := {i} {i I C i s C i }, wih he erminal condiion v T (x) = q(x). Proof. By inroducing a slack variable z R, we can rewrie he Bellman equaion (4) in he following equivalen form: v (x) = inf z u U(x),z R { s.. r(x, u) µ D } + v +1 (f(x, u, w))dµ(w) z for each (, x) T R n. We firs focus on he maximizaion problem in he inequaliy consrain. I can be rewrien as he following infinie-dimensional linear program: r(x, u) + v +1 (f(x, u, w))dµ(w) µ P( ) R l s.. 1 {w C i C N }dµ(w) p i i I 1 {w C i }dµ(w) p i i I. C N Under Assumpion 3, he generalized Slaer condiion holds [24]. Thus, here is no dualiy gap and we have he following dual formulaion of he problem above wihou loss of opimaliy: inf κ,λ R N + r(x, u) + i I (p i κ i p i λi ) s.. v +1 (f(x, u, w)) + i I 1 {w C i }(λ i κ i ) 0 w C N. 3 In he reformulaed Bellman equaion, min u is merged wih inf κ,λ,ν for a compac represenaion. The minimizaion problem admis an opimal soluion u.
5 Le B (i) be he index ses of all he sric subses of C, i i.e., B (i) := {i I C i s C}. i We also le C i := C i \ C i. i B (i) Due o he nesing condiion, { C 1 N,, C } is a disjoin pariion of he por C N. Therefore, he inequaliy consrain of he dual problem can be rewrien as v +1 (f(x, u, w)) + (λ i κ i ) 0 w C i i I. i A (i) The inequaliy consrain associaed wih he index i I is equivalen o w C i v +1 (f(x, u, w)) + (λ i κ i ) 0. i A (i) Since w v +1 (f(x, u, w)) is convex for each (x, u), he objecive funcion is convex wih respec o w. Therefore, he maximum is aained a he ouer boundary of C. i We now observe ha he ouer boundary of C i corresponds o he ouer boundary of C i due o he nesing condiion [9]. Thus, we can rewrie he ih consrain as w C N v +1 (f(x, u, w)) + s.. C i w K i d i. i A (i) (λ i κ i ) Is dual is given by he following semi-infinie program: inf d i ν i + θ i + (λ i κ i ) ν i K i,θi R s.. w C N i A (i) v +1 (f(x, u, w)) (C i w) ν i θ i. Puing he reformulaion resuls all ogeher, we have ha v (x) = inf z s.. r(x, u) + i I (p i κ i p i λi ) z d i ν i + θ i + i A (i) (λ i κ i ) 0 i I v +1 (f(x, u, w)) (C i w) ν i θ i w C N i I u U(x), κ, λ R N +, ν i K i, z R, θ R N. Viewing z and θ as slack variables, we conclude ha he saemen in he heorem holds. Noe ha he convexiy of w v +1 (f(x, u, w)) and he nesing condiion play a criical role in preserving opimaliy in he proposed reformulaion as originally observed by Wiesemann e al. in he conex of single-sage opimizaion [9]. When (u, w) v +1 (f(x, u, w)) is also piecewise affine, our resul is consisen wih Theorem 1 in [9]. Theorem 2 allows us o avoid solving he compuaionally challenging infinie-dimensional minimax opimizaion problems in he original Bellman equaion. Insead, we can evaluae he value funcion backward in ime by solving a C 2 =[3.5, 8.5] C 1 =[5.5, 6.5] w C 3 =[0, 12] Fig. 1: The empirical disribuion µ of w and he confidence ses C i, i = 1, 2, 3. semi-infinie program a each (discreized) sae. This semiinfinie program can be solved by several convergen mehods such as primal-dual mehods, discreizaion mehods, homoopy mehods, exchange mehods, and consrain sampling mehods (see [25], [26], [27] and he references herein). Among hem, we will use he convergen discreizaion mehod developed by Reemsen [28] in he nex secion. V. APPLICATION TO INVENTORY CONTROL We consider a sochasic invenory conrol problem o demonsrae he performance of our disribuionally robus conrol mehod. We use he sandard seing of sochasic Newsvendor problems (e.g., [29]). Le x R be an invenory level of ineres a sage. Given he quaniy u U := [0, 10] ordered and he sochasic demand w a sage, he invenory level evolves as x +1 = x + u w, for T := {0, 1,, 6}. We assume ha any unsaisfied demand is backlogged for he nex sage and hus allow negaive sae values. The sage-wise cos funcion is given by r(x, u, w ) = c o (x + u w ) + + c u (w x u ) +, where c o = 1 is he overage (or sorage) cos and c u = 1 is he underage cos (or he cos of los sales). Fig. 1 shows he empirical disribuion µ of w for all T, and he confidence ses used in our simulaions. We choose p i and p i as 90% and 110% of µ (C). i Thus, µ is conained in he consruced ambiguiy se D. We compare our disribuionally robus conroller designed using D and he sandard sochasic opimal conroller consruced wih he empirical disribuion µ when x 0 = 10. Suppose ha he acual disribuion µ rue of w is uniform in each confidence se and µ rue (C) i = p i. Then, µrue D bu µ rue is differen from he empirical disribuion µ. In our simulaion wih 10 5 rajecories of {w } sampled from {µ rue }, he disribuionally robus conrol mehod reduces he oal expeced cos incurred by he sandard conroller by 29.6%. This resul confirms ha our conroller is robus agains errors in disurbance disribuions while he sandard conroller is no. To invesigae why he proposed conroller performs beer han he sandard conroller under disribuional ambiguiy,
6 invenory level ime (day) (a) ime (day) (b) Fig. 2: Tukey box plos of sae rajecories conrolled by (a) he sandard sochasic opimal conroller and (b) he disribuionally robus conroller. we now compare heir conrolled sae rajecories. As shown in Fig. 2, he disribuionally robus conroller drives he median (and he firs and hird quaniles) of sae rajecories closer o he origin han he sandard conroller. Since our conroller considers he wors-case disribuion in he ambiguiy se, i can conrol he sysem in a desirable manner (mainaining he invenory level close o zero) even when deviaes from he empirical disribuion µ. On he oher hand, he sandard conroller opimizes he sysem performance only when µ rue = µ ; oherwise, here is no performance guaranee. In paricular, he sandard conroller iniially increases he invenory level by using approximaely 97% of he maximum allowable conrol value. This aggressive conrol acion is inended o saisfy demand a laer sages wih he limied conrol range U := [0, 10]. However, µ rue as µ rue deviaes from µ, his sandard conrol sraegy generaes higher overage coss han expeced. On he oher hand, he disribuionally robus conroller is designed o ake ino accoun such possibiliies and is capable of balancing he overage and underage coss when µ rue µ. VI. CONCLUSION AND FUTURE WORK We have proposed a dualiy-based dynamic programming approach o disribuionally robus conrol problems wih conic confidence ses. The srucural propery we idenified allows us o focus on non-randomized Markov policies wih sae feedback. Our exac dualiy-based reformulaion mehod also alleviaes he compuaional issues in he original Bellman equaion ha involves infinie-dimensional minimax opimizaion problems. As a fuure research, i is of grea ineres o develop a scalable numerical mehod for he reformulaed Bellman equaion. Furhermore, adding risk consrains may help in sysemaically discouraging undesirable sysem behaviors. REFERENCES [1] A. Nilim and L. El Ghaoui, Robus conrol of Markov decision processes wih uncerain ransiion marices, Operaions Research, vol. 53, no. 5, pp , [2] S. Samuelson and I. Yang, Daa-driven disribuionally robus conrol of energy sorage o manage wind power flucuaions, in Proceedings of he 1s IEEE Conference on Conrol Technology and Applicaions, [3] H. Xu and S. Mannor, Disribuionally robus Markov decision processes, Mahemaics of Operaions Research, vol. 37, no. 2, pp , [4] P. Yu and H. Xu, Disribuionally robus counerpar in Markov decision processes, IEEE Transacions on Auomaic Conrol, vol. 61, no. 9, pp , [5] I. Yang, A convex opimizaion approach o disribuionally robus Markov decision processes wih Wassersein disance, IEEE Conrol Sysems Leers, vol. 1, no. 1, pp , [6] B. P. G. Van Parys, D. Kuhn, P. J. Goular, and M. Morari, Disribuionally robus conrol of consrained sochasic sysems, IEEE Transacions on Auomaic Conrol, vol. 61, no. 2, pp , [7] I. Yang, A dynamic game approach o disribuionally robus safey specificaions for sochasic sysems, arxiv: , [8] I. Tzorzis, C. D. Charalambous, and T. Charalambous, Dynamic programming subjec o oal variaion disance ambiguiy, SIAM Journal on Conrol and Opimizaion, vol. 53, no. 4, pp , [9] W. Wiesemann, D. Kuhn, and M. Sim, Disribuionally robus convex opimizaion, Operaions Research, vol. 62, no. 6, pp , [10] H. Scarf, K. J. Arrow, and S. Karlin, A min-max soluion of an invenory problem, Sudies in he Mahemaical Theory of Invenory and Producion, pp , [11] J. Dupačová, The minimax approach o sochasic programming and an lllusraive applicaion, Sochasics, vol. 20, pp , [12] E. Delage and Y. Ye, Disribuionally robus opimizaion under momen uncerainy wih applicaion o daa-driven problems, Operaions Research, vol. 58, no. 3, pp , [13] I. Popescu, Robus mean-covariance soluions for sochasic opimizaion, Operaions Research, vol. 55, no. 1, pp , [14] S. Zymler, D. Kuhn, and B. Rusem, Disribuionally robus join chance consrains wih second-order momen informaion, Mahemaical Programming, Ser. A, vol. 137, pp , [15] A. Ben-Tal, D. Den Herog, A. De Waegenaere, B. Melenberg, and G. Rennen, Robus soluions of opimizaion problems affeced by uncerain probabiliies, Managemen Science, vol. 59, no. 2, pp , [16] R. Jiang and Y. Guan, Daa-driven chance consrained sochasic program, Mahemaical Programming, Ser. A, vol. 158, pp , [17] H. Sun and H. Xu, Convergence analysis for disribuionally robus opimizaion and equilibrium problems, Mahemaics of Operaions Research, vol. 41, no. 2, pp , [18] E. Erdoğan and G. Iyengar, Ambiguous chance consrained problems and robus opimizaion, Mahemaical Programming, Ser. B, vol. 107, pp , [19] P. Mohajerin Esfahani and D. Kuhn, Daa-driven disribuionally robus opimizaion using he Wassersein meric: Performance guaranees and racable reformulaions, arxiv: , [20] R. Gao and A. J. Kleyweg, Disribuionally robus sochasic opimizaion wih Wassersein disance, arxiv: , [21] J. I. González-Trejo, O. Hernández-Lerma, and L. F. Hoyos-Reyes, Minimax conrol of discree-ime sochasic sysems, SIAM Journal on Conrol and Opimizaion, vol. 41, no. 5, pp , [22] R. Bellman, Dynamic programming and Lagrange mulipliers, Proceedings of he Naional Academy of Sciences, vol. 42, no. 10, pp , [23] O. Hernández-Lerma and J. B. Lasserre, Discree-Time Markov Conrol Processes: Basic Opimaliy Crieria. Springer, [24] A. Shapiro, On dualiy heory of conic linear problems, in Semi- Infinie Programming. Springer, 2001, pp [25] R. Heich and K. O. Koranek, Semi-infinie programming: Theory, mehods, and applicaions, SIAM Review, vol. 35, no. 3, pp , [26] M. López and G. Sill, Semi-infinie programming, European Journal of Operaional Research, vol. 180, pp , [27] G. Calafiore and M. C. Campi, Uncerain convex programs: randomized soluions and confidence levels, Mahemaical Programming, Ser. A, vol. 102, pp , [28] R. Reemsen, Discreizaion mehods for he soluion of semi-infinie programming problems, Journal of Opimizaion Theory and Applicaions, vol. 71, no. 1, pp , [29] R. Levi, R. O. Roundy, and D. B. Shmoys, Provably near-opimal sampling-based policies for sochasic invenory conrol models, Mahemaics of Operaions Research, vol. 32, no. 4, pp , 2007.
An introduction to the theory of SDDP algorithm
An inroducion o he heory of SDDP algorihm V. Leclère (ENPC) Augus 1, 2014 V. Leclère Inroducion o SDDP Augus 1, 2014 1 / 21 Inroducion Large scale sochasic problem are hard o solve. Two ways of aacking
More informationT L. t=1. Proof of Lemma 1. Using the marginal cost accounting in Equation(4) and standard arguments. t )+Π RB. t )+K 1(Q RB
Elecronic Companion EC.1. Proofs of Technical Lemmas and Theorems LEMMA 1. Le C(RB) be he oal cos incurred by he RB policy. Then we have, T L E[C(RB)] 3 E[Z RB ]. (EC.1) Proof of Lemma 1. Using he marginal
More informationLecture 20: Riccati Equations and Least Squares Feedback Control
34-5 LINEAR SYSTEMS Lecure : Riccai Equaions and Leas Squares Feedback Conrol 5.6.4 Sae Feedback via Riccai Equaions A recursive approach in generaing he marix-valued funcion W ( ) equaion for i for he
More informationA Primal-Dual Type Algorithm with the O(1/t) Convergence Rate for Large Scale Constrained Convex Programs
PROC. IEEE CONFERENCE ON DECISION AND CONTROL, 06 A Primal-Dual Type Algorihm wih he O(/) Convergence Rae for Large Scale Consrained Convex Programs Hao Yu and Michael J. Neely Absrac This paper considers
More informationApplication of a Stochastic-Fuzzy Approach to Modeling Optimal Discrete Time Dynamical Systems by Using Large Scale Data Processing
Applicaion of a Sochasic-Fuzzy Approach o Modeling Opimal Discree Time Dynamical Sysems by Using Large Scale Daa Processing AA WALASZE-BABISZEWSA Deparmen of Compuer Engineering Opole Universiy of Technology
More informationDecentralized Stochastic Control with Partial History Sharing: A Common Information Approach
1 Decenralized Sochasic Conrol wih Parial Hisory Sharing: A Common Informaion Approach Ashuosh Nayyar, Adiya Mahajan and Demoshenis Tenekezis arxiv:1209.1695v1 [cs.sy] 8 Sep 2012 Absrac A general model
More informationVehicle Arrival Models : Headway
Chaper 12 Vehicle Arrival Models : Headway 12.1 Inroducion Modelling arrival of vehicle a secion of road is an imporan sep in raffic flow modelling. I has imporan applicaion in raffic flow simulaion where
More informationExpert Advice for Amateurs
Exper Advice for Amaeurs Ernes K. Lai Online Appendix - Exisence of Equilibria The analysis in his secion is performed under more general payoff funcions. Wihou aking an explici form, he payoffs of he
More informationThis document was generated at 1:04 PM, 09/10/13 Copyright 2013 Richard T. Woodward. 4. End points and transversality conditions AGEC
his documen was generaed a 1:4 PM, 9/1/13 Copyrigh 213 Richard. Woodward 4. End poins and ransversaliy condiions AGEC 637-213 F z d Recall from Lecure 3 ha a ypical opimal conrol problem is o maimize (,,
More informationOn Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems
MATHEMATICS OF OPERATIONS RESEARCH Vol. 38, No. 2, May 2013, pp. 209 227 ISSN 0364-765X (prin) ISSN 1526-5471 (online) hp://dx.doi.org/10.1287/moor.1120.0562 2013 INFORMS On Boundedness of Q-Learning Ieraes
More informationTechnical Report Doc ID: TR March-2013 (Last revision: 23-February-2016) On formulating quadratic functions in optimization models.
Technical Repor Doc ID: TR--203 06-March-203 (Las revision: 23-Februar-206) On formulaing quadraic funcions in opimizaion models. Auhor: Erling D. Andersen Convex quadraic consrains quie frequenl appear
More informationZürich. ETH Master Course: L Autonomous Mobile Robots Localization II
Roland Siegwar Margaria Chli Paul Furgale Marco Huer Marin Rufli Davide Scaramuzza ETH Maser Course: 151-0854-00L Auonomous Mobile Robos Localizaion II ACT and SEE For all do, (predicion updae / ACT),
More informationSTATE-SPACE MODELLING. A mass balance across the tank gives:
B. Lennox and N.F. Thornhill, 9, Sae Space Modelling, IChemE Process Managemen and Conrol Subjec Group Newsleer STE-SPACE MODELLING Inroducion: Over he pas decade or so here has been an ever increasing
More informationRobust estimation based on the first- and third-moment restrictions of the power transformation model
h Inernaional Congress on Modelling and Simulaion, Adelaide, Ausralia, 6 December 3 www.mssanz.org.au/modsim3 Robus esimaion based on he firs- and hird-momen resricions of he power ransformaion Nawaa,
More informationt dt t SCLP Bellman (1953) CLP (Dantzig, Tyndall, Grinold, Perold, Anstreicher 60's-80's) Anderson (1978) SCLP
Coninuous Linear Programming. Separaed Coninuous Linear Programming Bellman (1953) max c () u() d H () u () + Gsusds (,) () a () u (), < < CLP (Danzig, yndall, Grinold, Perold, Ansreicher 6's-8's) Anderson
More informationMean-square Stability Control for Networked Systems with Stochastic Time Delay
JOURNAL OF SIMULAION VOL. 5 NO. May 7 Mean-square Sabiliy Conrol for Newored Sysems wih Sochasic ime Delay YAO Hejun YUAN Fushun School of Mahemaics and Saisics Anyang Normal Universiy Anyang Henan. 455
More informationNotes on Kalman Filtering
Noes on Kalman Filering Brian Borchers and Rick Aser November 7, Inroducion Daa Assimilaion is he problem of merging model predicions wih acual measuremens of a sysem o produce an opimal esimae of he curren
More informationTwo Popular Bayesian Estimators: Particle and Kalman Filters. McGill COMP 765 Sept 14 th, 2017
Two Popular Bayesian Esimaors: Paricle and Kalman Filers McGill COMP 765 Sep 14 h, 2017 1 1 1, dx x Bel x u x P x z P Recall: Bayes Filers,,,,,,, 1 1 1 1 u z u x P u z u x z P Bayes z = observaion u =
More informationOptimality Conditions for Unconstrained Problems
62 CHAPTER 6 Opimaliy Condiions for Unconsrained Problems 1 Unconsrained Opimizaion 11 Exisence Consider he problem of minimizing he funcion f : R n R where f is coninuous on all of R n : P min f(x) x
More informationOn a Discrete-In-Time Order Level Inventory Model for Items with Random Deterioration
Journal of Agriculure and Life Sciences Vol., No. ; June 4 On a Discree-In-Time Order Level Invenory Model for Iems wih Random Deerioraion Dr Biswaranjan Mandal Associae Professor of Mahemaics Acharya
More informationOnline Appendix to Solution Methods for Models with Rare Disasters
Online Appendix o Soluion Mehods for Models wih Rare Disasers Jesús Fernández-Villaverde and Oren Levinal In his Online Appendix, we presen he Euler condiions of he model, we develop he pricing Calvo block,
More informationCONTROL SYSTEMS, ROBOTICS AND AUTOMATION Vol. XI Control of Stochastic Systems - P.R. Kumar
CONROL OF SOCHASIC SYSEMS P.R. Kumar Deparmen of Elecrical and Compuer Engineering, and Coordinaed Science Laboraory, Universiy of Illinois, Urbana-Champaign, USA. Keywords: Markov chains, ransiion probabiliies,
More informationInformation Relaxations and Duality in Stochastic Dynamic Programs
OPERATIONS RESEARCH Vol. 58, No. 4, Par 1 of 2, July Augus 2010, pp. 785 801 issn 0030-364X eissn 1526-5463 10 5804 0785 informs doi 10.1287/opre.1090.0796 2010 INFORMS Informaion Relaxaions and Dualiy
More informationInventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions
Muli-Period Sochasic Models: Opimali of (s, S) Polic for -Convex Objecive Funcions Consider a seing similar o he N-sage newsvendor problem excep ha now here is a fixed re-ordering cos (> 0) for each (re-)order.
More informationRANDOM LAGRANGE MULTIPLIERS AND TRANSVERSALITY
ECO 504 Spring 2006 Chris Sims RANDOM LAGRANGE MULTIPLIERS AND TRANSVERSALITY 1. INTRODUCTION Lagrange muliplier mehods are sandard fare in elemenary calculus courses, and hey play a cenral role in economic
More informationInventory Control of Perishable Items in a Two-Echelon Supply Chain
Journal of Indusrial Engineering, Universiy of ehran, Special Issue,, PP. 69-77 69 Invenory Conrol of Perishable Iems in a wo-echelon Supply Chain Fariborz Jolai *, Elmira Gheisariha and Farnaz Nojavan
More information1. An introduction to dynamic optimization -- Optimal Control and Dynamic Programming AGEC
This documen was generaed a :45 PM 8/8/04 Copyrigh 04 Richard T. Woodward. An inroducion o dynamic opimizaion -- Opimal Conrol and Dynamic Programming AGEC 637-04 I. Overview of opimizaion Opimizaion is
More informationStability and Bifurcation in a Neural Network Model with Two Delays
Inernaional Mahemaical Forum, Vol. 6, 11, no. 35, 175-1731 Sabiliy and Bifurcaion in a Neural Nework Model wih Two Delays GuangPing Hu and XiaoLing Li School of Mahemaics and Physics, Nanjing Universiy
More informationModal identification of structures from roving input data by means of maximum likelihood estimation of the state space model
Modal idenificaion of srucures from roving inpu daa by means of maximum likelihood esimaion of he sae space model J. Cara, J. Juan, E. Alarcón Absrac The usual way o perform a forced vibraion es is o fix
More informationPhysics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle
Chaper 2 Newonian Mechanics Single Paricle In his Chaper we will review wha Newon s laws of mechanics ell us abou he moion of a single paricle. Newon s laws are only valid in suiable reference frames,
More informationO Q L N. Discrete-Time Stochastic Dynamic Programming. I. Notation and basic assumptions. ε t : a px1 random vector of disturbances at time t.
Econ. 5b Spring 999 C. Sims Discree-Time Sochasic Dynamic Programming 995, 996 by Chrisopher Sims. This maerial may be freely reproduced for educaional and research purposes, so long as i is no alered,
More information1. An introduction to dynamic optimization -- Optimal Control and Dynamic Programming AGEC
This documen was generaed a :37 PM, 1/11/018 Copyrigh 018 Richard T. Woodward 1. An inroducion o dynamic opimiaion -- Opimal Conrol and Dynamic Programming AGEC 64-018 I. Overview of opimiaion Opimiaion
More informationInternational Journal of Scientific & Engineering Research, Volume 4, Issue 10, October ISSN
Inernaional Journal of Scienific & Engineering Research, Volume 4, Issue 10, Ocober-2013 900 FUZZY MEAN RESIDUAL LIFE ORDERING OF FUZZY RANDOM VARIABLES J. EARNEST LAZARUS PIRIYAKUMAR 1, A. YAMUNA 2 1.
More informationLogic in computer science
Logic in compuer science Logic plays an imporan role in compuer science Logic is ofen called he calculus of compuer science Logic plays a similar role in compuer science o ha played by calculus in he physical
More information2. Nonlinear Conservation Law Equations
. Nonlinear Conservaion Law Equaions One of he clear lessons learned over recen years in sudying nonlinear parial differenial equaions is ha i is generally no wise o ry o aack a general class of nonlinear
More informationMODULE 3 FUNCTION OF A RANDOM VARIABLE AND ITS DISTRIBUTION LECTURES PROBABILITY DISTRIBUTION OF A FUNCTION OF A RANDOM VARIABLE
Topics MODULE 3 FUNCTION OF A RANDOM VARIABLE AND ITS DISTRIBUTION LECTURES 2-6 3. FUNCTION OF A RANDOM VARIABLE 3.2 PROBABILITY DISTRIBUTION OF A FUNCTION OF A RANDOM VARIABLE 3.3 EXPECTATION AND MOMENTS
More informationThe Asymptotic Behavior of Nonoscillatory Solutions of Some Nonlinear Dynamic Equations on Time Scales
Advances in Dynamical Sysems and Applicaions. ISSN 0973-5321 Volume 1 Number 1 (2006, pp. 103 112 c Research India Publicaions hp://www.ripublicaion.com/adsa.hm The Asympoic Behavior of Nonoscillaory Soluions
More informationarxiv: v1 [math.oc] 27 Jul 2009
PARTICLE METHODS FOR STOCHASTIC OPTIMAL CONTROL PROBLEMS PIERRE CARPENTIER GUY COHEN AND ANES DALLAGI arxiv:0907.4663v1 [mah.oc] 27 Jul 2009 Absrac. When dealing wih numerical soluion of sochasic opimal
More informationINTRODUCTION TO MACHINE LEARNING 3RD EDITION
ETHEM ALPAYDIN The MIT Press, 2014 Lecure Slides for INTRODUCTION TO MACHINE LEARNING 3RD EDITION alpaydin@boun.edu.r hp://www.cmpe.boun.edu.r/~ehem/i2ml3e CHAPTER 2: SUPERVISED LEARNING Learning a Class
More informationTwo New Uncertainty Programming Models of Inventory with Uncertain Costs
Journal of Informaion & Compuaional Science 8: 2 (211) 28 288 Available a hp://www.joics.com Two New Uncerainy Programming Models of Invenory wih Uncerain Coss Lixia Rong Compuer Science and Technology
More information23.5. Half-Range Series. Introduction. Prerequisites. Learning Outcomes
Half-Range Series 2.5 Inroducion In his Secion we address he following problem: Can we find a Fourier series expansion of a funcion defined over a finie inerval? Of course we recognise ha such a funcion
More informationGMM - Generalized Method of Moments
GMM - Generalized Mehod of Momens Conens GMM esimaion, shor inroducion 2 GMM inuiion: Maching momens 2 3 General overview of GMM esimaion. 3 3. Weighing marix...........................................
More informationSliding Mode Extremum Seeking Control for Linear Quadratic Dynamic Game
Sliding Mode Exremum Seeking Conrol for Linear Quadraic Dynamic Game Yaodong Pan and Ümi Özgüner ITS Research Group, AIST Tsukuba Eas Namiki --, Tsukuba-shi,Ibaraki-ken 5-856, Japan e-mail: pan.yaodong@ais.go.jp
More informationNotes for Lecture 17-18
U.C. Berkeley CS278: Compuaional Complexiy Handou N7-8 Professor Luca Trevisan April 3-8, 2008 Noes for Lecure 7-8 In hese wo lecures we prove he firs half of he PCP Theorem, he Amplificaion Lemma, up
More informationA DELAY-DEPENDENT STABILITY CRITERIA FOR T-S FUZZY SYSTEM WITH TIME-DELAYS
A DELAY-DEPENDENT STABILITY CRITERIA FOR T-S FUZZY SYSTEM WITH TIME-DELAYS Xinping Guan ;1 Fenglei Li Cailian Chen Insiue of Elecrical Engineering, Yanshan Universiy, Qinhuangdao, 066004, China. Deparmen
More informationSignaling equilibria for dynamic LQG games with. asymmetric information
Signaling equilibria for dynamic LQG games wih asymmeric informaion Deepanshu Vasal and Achilleas Anasasopoulos Absrac We consider a finie horizon dynamic game wih wo players who observe heir ypes privaely
More informationStable approximations of optimal filters
Sable approximaions of opimal filers Joaquin Miguez Deparmen of Signal Theory & Communicaions, Universidad Carlos III de Madrid. E-mail: joaquin.miguez@uc3m.es Join work wih Dan Crisan (Imperial College
More informationApplying Genetic Algorithms for Inventory Lot-Sizing Problem with Supplier Selection under Storage Capacity Constraints
IJCSI Inernaional Journal of Compuer Science Issues, Vol 9, Issue 1, No 1, January 2012 wwwijcsiorg 18 Applying Geneic Algorihms for Invenory Lo-Sizing Problem wih Supplier Selecion under Sorage Capaciy
More informationACE 562 Fall Lecture 5: The Simple Linear Regression Model: Sampling Properties of the Least Squares Estimators. by Professor Scott H.
ACE 56 Fall 005 Lecure 5: he Simple Linear Regression Model: Sampling Properies of he Leas Squares Esimaors by Professor Sco H. Irwin Required Reading: Griffihs, Hill and Judge. "Inference in he Simple
More information3.1.3 INTRODUCTION TO DYNAMIC OPTIMIZATION: DISCRETE TIME PROBLEMS. A. The Hamiltonian and First-Order Conditions in a Finite Time Horizon
3..3 INRODUCION O DYNAMIC OPIMIZAION: DISCREE IME PROBLEMS A. he Hamilonian and Firs-Order Condiions in a Finie ime Horizon Define a new funcion, he Hamilonian funcion, H. H he change in he oal value of
More information1 Review of Zero-Sum Games
COS 5: heoreical Machine Learning Lecurer: Rob Schapire Lecure #23 Scribe: Eugene Brevdo April 30, 2008 Review of Zero-Sum Games Las ime we inroduced a mahemaical model for wo player zero-sum games. Any
More information5. Stochastic processes (1)
Lec05.pp S-38.45 - Inroducion o Teleraffic Theory Spring 2005 Conens Basic conceps Poisson process 2 Sochasic processes () Consider some quaniy in a eleraffic (or any) sysem I ypically evolves in ime randomly
More informationChristos Papadimitriou & Luca Trevisan November 22, 2016
U.C. Bereley CS170: Algorihms Handou LN-11-22 Chrisos Papadimiriou & Luca Trevisan November 22, 2016 Sreaming algorihms In his lecure and he nex one we sudy memory-efficien algorihms ha process a sream
More informationOptimal Investment under Dynamic Risk Constraints and Partial Information
Opimal Invesmen under Dynamic Risk Consrains and Parial Informaion Wolfgang Puschögl Johann Radon Insiue for Compuaional and Applied Mahemaics (RICAM) Ausrian Academy of Sciences www.ricam.oeaw.ac.a 2
More informationA New Perturbative Approach in Nonlinear Singularity Analysis
Journal of Mahemaics and Saisics 7 (: 49-54, ISSN 549-644 Science Publicaions A New Perurbaive Approach in Nonlinear Singulariy Analysis Ta-Leung Yee Deparmen of Mahemaics and Informaion Technology, The
More informationLecture Notes 2. The Hilbert Space Approach to Time Series
Time Series Seven N. Durlauf Universiy of Wisconsin. Basic ideas Lecure Noes. The Hilber Space Approach o Time Series The Hilber space framework provides a very powerful language for discussing he relaionship
More informationSUFFICIENT CONDITIONS FOR EXISTENCE SOLUTION OF LINEAR TWO-POINT BOUNDARY PROBLEM IN MINIMIZATION OF QUADRATIC FUNCTIONAL
HE PUBLISHING HOUSE PROCEEDINGS OF HE ROMANIAN ACADEMY, Series A, OF HE ROMANIAN ACADEMY Volume, Number 4/200, pp 287 293 SUFFICIEN CONDIIONS FOR EXISENCE SOLUION OF LINEAR WO-POIN BOUNDARY PROBLEM IN
More information6. Stochastic calculus with jump processes
A) Trading sraegies (1/3) Marke wih d asses S = (S 1,, S d ) A rading sraegy can be modelled wih a vecor φ describing he quaniies invesed in each asse a each insan : φ = (φ 1,, φ d ) The value a of a porfolio
More informationSupplement for Stochastic Convex Optimization: Faster Local Growth Implies Faster Global Convergence
Supplemen for Sochasic Convex Opimizaion: Faser Local Growh Implies Faser Global Convergence Yi Xu Qihang Lin ianbao Yang Proof of heorem heorem Suppose Assumpion holds and F (w) obeys he LGC (6) Given
More informationLecture 33: November 29
36-705: Inermediae Saisics Fall 2017 Lecurer: Siva Balakrishnan Lecure 33: November 29 Today we will coninue discussing he boosrap, and hen ry o undersand why i works in a simple case. In he las lecure
More informationLecture 4 Notes (Little s Theorem)
Lecure 4 Noes (Lile s Theorem) This lecure concerns one of he mos imporan (and simples) heorems in Queuing Theory, Lile s Theorem. More informaion can be found in he course book, Bersekas & Gallagher,
More informationR t. C t P t. + u t. C t = αp t + βr t + v t. + β + w t
Exercise 7 C P = α + β R P + u C = αp + βr + v (a) (b) C R = α P R + β + w (c) Assumpions abou he disurbances u, v, w : Classical assumions on he disurbance of one of he equaions, eg. on (b): E(v v s P,
More informationarxiv: v1 [math.pr] 19 Feb 2011
A NOTE ON FELLER SEMIGROUPS AND RESOLVENTS VADIM KOSTRYKIN, JÜRGEN POTTHOFF, AND ROBERT SCHRADER ABSTRACT. Various equivalen condiions for a semigroup or a resolven generaed by a Markov process o be of
More information1 Subdivide the optimization horizon [t 0,t f ] into n s 1 control stages,
Opimal Conrol Formulaion Opimal Conrol Lecures 19-2: Direc Soluion Mehods Benoî Chachua Deparmen of Chemical Engineering Spring 29 We are concerned wih numerical soluion procedures for
More informationEnergy Storage Benchmark Problems
Energy Sorage Benchmark Problems Daniel F. Salas 1,3, Warren B. Powell 2,3 1 Deparmen of Chemical & Biological Engineering 2 Deparmen of Operaions Research & Financial Engineering 3 Princeon Laboraory
More informationStochastic Perishable Inventory Systems: Dual-Balancing and Look-Ahead Approaches
Sochasic Perishable Invenory Sysems: Dual-Balancing and Look-Ahead Approaches by Yuhe Diao A hesis presened o he Universiy Of Waerloo in fulfilmen of he hesis requiremen for he degree of Maser of Applied
More informationOptimal Control for LQG Systems on Graphs Part I: Structural Results
Opimal Conrol for LQG Sysems on Graphs Par I: Srucural Resuls Ashuosh Nayyar Lauren Lessard Absrac In his wo-par paper, we idenify a broad class of decenralized oupu-feedback LQG sysems for which he opimal
More informationOn Measuring Pro-Poor Growth. 1. On Various Ways of Measuring Pro-Poor Growth: A Short Review of the Literature
On Measuring Pro-Poor Growh 1. On Various Ways of Measuring Pro-Poor Growh: A Shor eview of he Lieraure During he pas en years or so here have been various suggesions concerning he way one should check
More information2.160 System Identification, Estimation, and Learning. Lecture Notes No. 8. March 6, 2006
2.160 Sysem Idenificaion, Esimaion, and Learning Lecure Noes No. 8 March 6, 2006 4.9 Eended Kalman Filer In many pracical problems, he process dynamics are nonlinear. w Process Dynamics v y u Model (Linearized)
More informationAn Introduction to Malliavin calculus and its applications
An Inroducion o Malliavin calculus and is applicaions Lecure 5: Smoohness of he densiy and Hörmander s heorem David Nualar Deparmen of Mahemaics Kansas Universiy Universiy of Wyoming Summer School 214
More informationPlanning in POMDPs. Dominik Schoenberger Abstract
Planning in POMDPs Dominik Schoenberger d.schoenberger@sud.u-darmsad.de Absrac This documen briefly explains wha a Parially Observable Markov Decision Process is. Furhermore i inroduces he differen approaches
More informationOBJECTIVES OF TIME SERIES ANALYSIS
OBJECTIVES OF TIME SERIES ANALYSIS Undersanding he dynamic or imedependen srucure of he observaions of a single series (univariae analysis) Forecasing of fuure observaions Asceraining he leading, lagging
More informationChapter 2. First Order Scalar Equations
Chaper. Firs Order Scalar Equaions We sar our sudy of differenial equaions in he same way he pioneers in his field did. We show paricular echniques o solve paricular ypes of firs order differenial equaions.
More informationUtility maximization in incomplete markets
Uiliy maximizaion in incomplee markes Marcel Ladkau 27.1.29 Conens 1 Inroducion and general seings 2 1.1 Marke model....................................... 2 1.2 Trading sraegy.....................................
More informationClass Meeting # 10: Introduction to the Wave Equation
MATH 8.5 COURSE NOTES - CLASS MEETING # 0 8.5 Inroducion o PDEs, Fall 0 Professor: Jared Speck Class Meeing # 0: Inroducion o he Wave Equaion. Wha is he wave equaion? The sandard wave equaion for a funcion
More informationMoving Horizon Estimation for Staged QP Problems
51s IEEE Conference on Decision and Conrol, Maui, HI, December 212 Moving Horizon Esimaion for Saged QP Problems Eric Chu, Arezou Keshavarz, Dimiry Gorinevsky, and Sephen Boyd Absrac his paper considers
More informationCash Flow Valuation Mode Lin Discrete Time
IOSR Journal of Mahemaics (IOSR-JM) e-issn: 2278-5728,p-ISSN: 2319-765X, 6, Issue 6 (May. - Jun. 2013), PP 35-41 Cash Flow Valuaion Mode Lin Discree Time Olayiwola. M. A. and Oni, N. O. Deparmen of Mahemaics
More information12: AUTOREGRESSIVE AND MOVING AVERAGE PROCESSES IN DISCRETE TIME. Σ j =
1: AUTOREGRESSIVE AND MOVING AVERAGE PROCESSES IN DISCRETE TIME Moving Averages Recall ha a whie noise process is a series { } = having variance σ. The whie noise process has specral densiy f (λ) = of
More informationExponential Weighted Moving Average (EWMA) Chart Under The Assumption of Moderateness And Its 3 Control Limits
DOI: 0.545/mjis.07.5009 Exponenial Weighed Moving Average (EWMA) Char Under The Assumpion of Moderaeness And Is 3 Conrol Limis KALPESH S TAILOR Assisan Professor, Deparmen of Saisics, M. K. Bhavnagar Universiy,
More informationMATHEMATICAL DESCRIPTION OF THEORETICAL METHODS OF RESERVE ECONOMY OF CONSIGNMENT STORES
MAHEMAICAL DESCIPION OF HEOEICAL MEHODS OF ESEVE ECONOMY OF CONSIGNMEN SOES Péer elek, József Cselényi, György Demeer Universiy of Miskolc, Deparmen of Maerials Handling and Logisics Absrac: Opimizaion
More informationGames Against Nature
Advanced Course in Machine Learning Spring 2010 Games Agains Naure Handous are joinly prepared by Shie Mannor and Shai Shalev-Shwarz In he previous lecures we alked abou expers in differen seups and analyzed
More informationChapter 2. Models, Censoring, and Likelihood for Failure-Time Data
Chaper 2 Models, Censoring, and Likelihood for Failure-Time Daa William Q. Meeker and Luis A. Escobar Iowa Sae Universiy and Louisiana Sae Universiy Copyrigh 1998-2008 W. Q. Meeker and L. A. Escobar. Based
More informationAn Introduction to Backward Stochastic Differential Equations (BSDEs) PIMS Summer School 2016 in Mathematical Finance.
1 An Inroducion o Backward Sochasic Differenial Equaions (BSDEs) PIMS Summer School 2016 in Mahemaical Finance June 25, 2016 Chrisoph Frei cfrei@ualbera.ca This inroducion is based on Touzi [14], Bouchard
More informationCENTRALIZED VERSUS DECENTRALIZED PRODUCTION PLANNING IN SUPPLY CHAINS
CENRALIZED VERSUS DECENRALIZED PRODUCION PLANNING IN SUPPLY CHAINS Georges SAHARIDIS* a, Yves DALLERY* a, Fikri KARAESMEN* b * a Ecole Cenrale Paris Deparmen of Indusial Engineering (LGI), +3343388, saharidis,dallery@lgi.ecp.fr
More informationA Specification Test for Linear Dynamic Stochastic General Equilibrium Models
Journal of Saisical and Economeric Mehods, vol.1, no.2, 2012, 65-70 ISSN: 2241-0384 (prin), 2241-0376 (online) Scienpress Ld, 2012 A Specificaion Tes for Linear Dynamic Sochasic General Equilibrium Models
More informationCompetitive and Cooperative Inventory Policies in a Two-Stage Supply-Chain
Compeiive and Cooperaive Invenory Policies in a Two-Sage Supply-Chain (G. P. Cachon and P. H. Zipkin) Presened by Shruivandana Sharma IOE 64, Supply Chain Managemen, Winer 2009 Universiy of Michigan, Ann
More informationEchocardiography Project and Finite Fourier Series
Echocardiography Projec and Finie Fourier Series 1 U M An echocardiagram is a plo of how a porion of he hear moves as he funcion of ime over he one or more hearbea cycles If he hearbea repeas iself every
More informationTHE BELLMAN PRINCIPLE OF OPTIMALITY
THE BELLMAN PRINCIPLE OF OPTIMALITY IOANID ROSU As I undersand, here are wo approaches o dynamic opimizaion: he Ponrjagin Hamilonian) approach, and he Bellman approach. I saw several clear discussions
More informationEXERCISES FOR SECTION 1.5
1.5 Exisence and Uniqueness of Soluions 43 20. 1 v c 21. 1 v c 1 2 4 6 8 10 1 2 2 4 6 8 10 Graph of approximae soluion obained using Euler s mehod wih = 0.1. Graph of approximae soluion obained using Euler
More informationPOSITIVE SOLUTIONS OF NEUTRAL DELAY DIFFERENTIAL EQUATION
Novi Sad J. Mah. Vol. 32, No. 2, 2002, 95-108 95 POSITIVE SOLUTIONS OF NEUTRAL DELAY DIFFERENTIAL EQUATION Hajnalka Péics 1, János Karsai 2 Absrac. We consider he scalar nonauonomous neural delay differenial
More informationEssential Maps and Coincidence Principles for General Classes of Maps
Filoma 31:11 (2017), 3553 3558 hps://doi.org/10.2298/fil1711553o Published by Faculy of Sciences Mahemaics, Universiy of Niš, Serbia Available a: hp://www.pmf.ni.ac.rs/filoma Essenial Maps Coincidence
More informationAn Introduction to Stochastic Programming: The Recourse Problem
An Inroducion o Sochasic Programming: he Recourse Problem George Danzig and Phil Wolfe Ellis Johnson, Roger Wes, Dick Cole, and Me John Birge Where o look in he ex pp. 6-7, Secion.2.: Inroducion o sochasic
More informationDiebold, Chapter 7. Francis X. Diebold, Elements of Forecasting, 4th Edition (Mason, Ohio: Cengage Learning, 2006). Chapter 7. Characterizing Cycles
Diebold, Chaper 7 Francis X. Diebold, Elemens of Forecasing, 4h Ediion (Mason, Ohio: Cengage Learning, 006). Chaper 7. Characerizing Cycles Afer compleing his reading you should be able o: Define covariance
More informationOptima and Equilibria for Traffic Flow on a Network
Opima and Equilibria for Traffic Flow on a Nework Albero Bressan Deparmen of Mahemaics, Penn Sae Universiy bressan@mah.psu.edu Albero Bressan (Penn Sae) Opima and equilibria for raffic flow 1 / 1 A Traffic
More informationScheduling of Crude Oil Movements at Refinery Front-end
Scheduling of Crude Oil Movemens a Refinery Fron-end Ramkumar Karuppiah and Ignacio Grossmann Carnegie Mellon Universiy ExxonMobil Case Sudy: Dr. Kevin Furman Enerprise-wide Opimizaion Projec March 15,
More informationSubway stations energy and air quality management
Subway saions energy and air qualiy managemen wih sochasic opimizaion Trisan Rigau 1,2,4, Advisors: P. Carpenier 3, J.-Ph. Chancelier 2, M. De Lara 2 EFFICACITY 1 CERMICS, ENPC 2 UMA, ENSTA 3 LISIS, IFSTTAR
More informationStochastic Model for Cancer Cell Growth through Single Forward Mutation
Journal of Modern Applied Saisical Mehods Volume 16 Issue 1 Aricle 31 5-1-2017 Sochasic Model for Cancer Cell Growh hrough Single Forward Muaion Jayabharahiraj Jayabalan Pondicherry Universiy, jayabharahi8@gmail.com
More informationThe Optimal Stopping Time for Selling an Asset When It Is Uncertain Whether the Price Process Is Increasing or Decreasing When the Horizon Is Infinite
American Journal of Operaions Research, 08, 8, 8-9 hp://wwwscirporg/journal/ajor ISSN Online: 60-8849 ISSN Prin: 60-8830 The Opimal Sopping Time for Selling an Asse When I Is Uncerain Wheher he Price Process
More informationSTABILITY OF PEXIDERIZED QUADRATIC FUNCTIONAL EQUATION IN NON-ARCHIMEDEAN FUZZY NORMED SPASES
Novi Sad J. Mah. Vol. 46, No. 1, 2016, 15-25 STABILITY OF PEXIDERIZED QUADRATIC FUNCTIONAL EQUATION IN NON-ARCHIMEDEAN FUZZY NORMED SPASES N. Eghbali 1 Absrac. We deermine some sabiliy resuls concerning
More informationHamilton- J acobi Equation: Weak S olution We continue the study of the Hamilton-Jacobi equation:
M ah 5 7 Fall 9 L ecure O c. 4, 9 ) Hamilon- J acobi Equaion: Weak S oluion We coninue he sudy of he Hamilon-Jacobi equaion: We have shown ha u + H D u) = R n, ) ; u = g R n { = }. ). In general we canno
More information