Distributionally Robust Stochastic Control with Conic Confidence Sets

Size: px
Start display at page:

Download "Distributionally Robust Stochastic Control with Conic Confidence Sets"

Transcription

1 Disribuionally Robus Sochasic Conrol wih Conic Confidence Ses Insoon Yang Absrac The heory of (sandard) sochasic opimal conrol is based on he assumpion ha he probabiliy disribuion of uncerain variables is fully known. In pracice, however, obaining an accurae disribuion is ofen challenging. To resolve his issue, we sudy a disribuionally robus sochasic conrol problem ha minimizes a cos funcion of ineres given ha he disribuion of uncerain variables is no known bu lies in a so-called ambiguiy se. We firs invesigae a dynamic programming approach and idenify condiions for he exisence and opimaliy of non-randomized Markov policies. We hen propose a dualiy-based reformulaion mehod for an associaed Bellman equaion in cases wih conic confidence ses. This reformulaion alleviaes he compuaional issues inheren in he infinie-dimensional minimax opimizaion problem in he Bellman equaion wihou sacrificing opimaliy. The effeciveness of he proposed mehod is demonsraed hrough an applicaion o a sochasic invenory conrol problem. I. INTRODUCTION Sandard sochasic conrol mehods assume ha he probabiliy disribuion of uncerain variables (e.g., disurbances) is available. However, his assumpion is ofen resricive in pracice because obaining an accurae disribuion requires large-scale, high-resoluion sensor measuremens over a long raining period or muliple periods. Siuaions in which uncerain variables are no direcly observed could be much more challenging; compuaional mehods, such as filering or saisical learning echniques, are ofen used o obain he (poserior) disribuion of he uncerain variables given limied observaions. The accuracy of he obained disribuion is ofen poor, as i is subjec o he qualiy of he observaions, compuaional mehods, and prior knowledge abou he variables. If poor disribuional informaion is employed in consrucing a sochasic opimal conroller, i does no guaranee opimaliy and can even cause caasrophic sysem behaviors [1], [2]. To overcome his issue of limied disribuion informaion in sochasic conrol, we invesigae a disribuionally robus conrol approach. This emerging sochasic conrol mehod minimizes a cos funcion of ineres, assuming ha he disribuion of uncerain variables is no compleely known bu conained in a pre-specified ambiguiy se of probabiliy disribuions. In finie-sae Markov decision process seings, several ypes of ambiguiy ses have been considered: [3], [4], [5] employ ambiguiy ses wih momen consrains, confidence inervals and Wassersein disance, respecively. However, for coninuous-sae conrol problems, only a few This work is pored in par by NSF under ECCS and CNS I. Yang is wih he Elecrical Engineering Deparmen, Universiy of Souhern California, Los Angeles, CA 90089, USA insoonya@usc.edu cases wih momen consrain-based and oal variaion disance ambiguiy are well sudied [6], [7], [8]. In his paper, we consider an imporan class of ambiguiy ses ha are characerized wih confidence ses and probabiliy inervals in a coninuous-sae sochasic conrol seing. The conribuions of his work are wofold. Firs, we invesigae a dynamic programming soluion o discree-ime disribuionally robus conrol problems in general finie-horizon cases and provide condiions for he exisence and opimaliy of non-randomized Markov policies. This exisence resul is based on he lower semi-coninuiy of an associaed dynamic programming operaor. Our dynamic programming approach idenifies an imporan srucural propery: he sysem sae is a sufficien saisic. Second, we propose a dual formulaion of he Bellman equaion in cases wih conic confidence ses. This approach convers he compuaionally challenging infinie-dimensional minimax problem ino a semi-infinie program, which can be solved by exising convergen algorihms. We also show ha he proposed reformulaion is exac using srong dualiy and he nesing condiion for confidence ses proposed by Wiesemann e al. [9]. The uiliy of our approach is demonsraed hrough an applicaion o a sochasic invenory conrol problem. The remainder of his paper is organized as follows. In Secion II, we inroduce he problem seup, including a dynamic game formulaion of disribuionally robus conrol problems. In Secion III, we provide condiions for he exisence and opimaliy of non-randomized Markov policies and anoher se of condiions for he convexiy of he value funcion. Based on hese analyical resuls, we develop a dualiy-based reformulaion mehod for he Bellman equaion in Secion IV. A sochasic invenory conrol problem is considered in Secion V as an applicaion of he proposed mehod. We use he following noaion hroughou he paper. Given a Borel space X, P(X) denoes he se of Borel probabiliy measures on X. Given a cone K, K represens is dual cone. We also le T := {0, 1,, T 1} and T := {0, 1,, T }. II. PROBLEM SETUP A. Ambiguiy in he Disribuion of Disurbances Consider he following sochasic sysem subjec o he disurbance {w } T =0 1, w, defined on a sandard probabiliy space (Ω, F, P): x +1 = f(x, u, w ), (1) where x R n is he sae, u R m is he conrol inpu a sage and f : R n R m R n is a measurable funcion. We assume ha w s and w are independen for s. In

2 many pracical siuaions, he full disurbance disribuion may no be available. To overcome his challenge, we will invesigae a disribuionally robus conrol approach ha minimizes he wors-case cos associaed wih he sysem evoluion under informaion consrains characerized as a se of probabiliy disribuions. We assume ha he rue probabiliy disribuion µ of w is no compleely known bu conained in a so-called ambiguiy se, D. Example 1 (Ambiguiy wih Confidence Ses). Consider he following se of probabiliy measures: D := {µ P( ) µ(c i ) [p i, pi ], i I }, (2) where p i, pi [0, 1] are model parameers, C i s are confidence ses, and I := {1,, N } is a given index se. Wih any measure in his se, he probabiliy ha w is conained in he se C i is beween p i and pi. Noe ha he user can specify he disribuional informaion, such as C i and (p i, pi ), ha he conroller russ given daa and saisical informaion abou he disurbance. Anoher popular ype of ambiguiy ses in single-sage opimizaion problems is based on momen consrains and confidence ses [10], [11], [12], [13], [14], [9]. Recenly, saisical disance-based ambiguiy ses also received a grea deal of aenion since hey can be easily designed from an empirical disribuion wihou requiring a large enough daa samples o esimae momens [15], [16], [17], [18], [19], [20]. In Secion IV, we consider ambiguiy ses of he form (2) ha are characerized wih conic confidence ses. This class of ambiguiy ses can be inuiively consruced from empirical disribuions, as will be illusraed in Secion V. B. Disribuionally Robus Conrol as a Dynamic Game Le H be he se of hisories up o sage, whose elemen is of he form 1 h = (x 0, u 0, w 0,, x 1, u 1, w 1, x ). The se of admissible conrol sraegies is chosen as Π := {π := (π 0,, π T 1 ) π (U(x ) h ) = 1 h H }, where U(x ) is he se of admissible acions given sae x, and π is a sochasic kernel from H o R m. Noe ha he sraegy space is broad enough o conain randomized non- Markov policies. 2 Similarly, we le H e be he se of exended hisories up o sage, whose elemen akes he form h e := (x 0, u 0, w 0, µ 0,, x 1, u 1, w 1, µ 1, x, u ). By viewing he disurbance as an adversarial player who chooses he disurbance disribuion given available informaion h e, we define he se of admissible disribuion sraegies as Γ := {γ = (γ 0,, γ T 1 ) γ (D h e ) = 1 h e H e }. 1 All he resuls in his paper are valid wih hisories of he form h := (x 0, u 0, w 0, µ 0,, x 1, u 1, w 1, µ 1, x ) ha also conains Player II s acions (µ 0,, µ 1 ). However, we use he reduced version of hisories because he realized disribuions may no be observable in pracice. 2 Suppose ha for each, π ( h ) is concenraed a a measurable funcion φ : R n R m such ha φ (x) U(x) for all x R n and for all h H. Then, π is a (non-randomized) Markov policy and by a sligh abuse of noaion π is considered o be idenical wih φ. Consider he following cos funcion associaed wih he sysem evoluion saring from x 0 = x: [ T 1 ] J x [π, γ] := E π,γ r(x, u ) + q(x T ), =0 where r : R n R m R and q : R n R are sagewise and erminal cos funcions of ineres. Here, E π,γ is he expecaion aken wih respec o he probabiliy measure P π,γ induced by he sraegy pair (π, γ). Our goal is o choose a conrol policy ha minimizes he wors-case cos given ambiguous informaion abou he probabiliy disribuion {µ } T =0 1 of he disurbance {w } T =0 1. To be more precise, we define he opimal disribuionally robus conrol policies as follows: Definiion 1. A conrol sraegy π Π is said o be an opimal disribuionally robus policy if i saisfies J x [π, γ] J x [π, γ ] π Π. γ Γ γ Γ A desired conrol sraegy can be obained by solving he following minimax conrol problem: inf J x [π, γ]. (3) π Π γ Γ The mos imporan par of his problem formulaion is he inner maximizaion of he cos funcion over all probabiliy disribuion policies in he sraegy space Γ, which encodes disribuional ambiguiy hrough D. One can view his problem as a wo-player zero-sum dynamic game, in which Player I chooses a conrol policy o minimize he cos and Player II selecs he disurbance s disribuion sraegy ha maximizes he cos. The proposed minimax formulaion implies he following propery: Proposiion 1. Suppose ha an opimal soluion o he disribuionally robus conrol problem (3) exiss and is denoed by (π, γ ). Then, he following inequaliies hold: J x [π, γ] J x [π, γ ] J x [π, γ ] (π, γ) Π Γ. γ Γ The firs and second equaliies hold wih γ = γ and π = π, respecively. In addiion, π is an opimal disribuionally robus policy. The firs inequaliy implies ha when he opimal policy π is employed, he wors-case cos value is equal o J x [π, γ ] for any disribuional error consisen wih he consrains in he ambiguiy se D for each. Thus, his approach provides a performance guaranee in he form of an upper-bound, J x [π, γ ], of he cos value, which is igh. Noe ha his performance guaranee may no be valid when a differen conrol policy is employed as shown in he second inequaliy. Furhermore, he second inequaliy confirms ha an opimal soluion o he dynamic game problem (3) provides an opimal disribuionally robus policy.

3 III. DYNAMIC PROGRAMMING SOLUTION A. Exisence of Opimal Disribuionally Robus Policies We begin by inroducing he following dynamic programming operaor T, T : [ ] T v(x) := inf r(x, u)+ v(f(x, u, w))dµ(w), u U(x) µ D where v is a measurable funcion on R n. We hen define he value funcion of he disribuionally robus conrol problem (3) as v (x) := T T +1 T T 1 q(x) for each T and v T (x) := q(x). By definiion, v (x) represens he minimal wors-case expeced cos value from sage o T given x = x. Under he following assumpion for he measurable selecion condiion, he value funcion is lower semi-coninuous and hus he disribuionally robus conrol problem (3) admis a non-randomized Markov policy, which is opimal. Assumpion 1. The following properies hold: (i) r(x, u) and q(x) are lower semi-coninuous and bounded below for all (x, u) R n R m such ha u U(x); (ii) For each bounded coninuous funcion g : R n R, he funcion ĝ (x, u, µ) := g(f(x, u, w))dµ(w) is coninuous for all (x, u, µ) R n R m D such ha u U(x); (iii) The se U(x) is compac for each x R n. In addiion, he se-valued mapping x U(x) is upper semiconinuous. Theorem 1. Suppose ha Assumpion 1 holds. Then, he value funcion v is lower semi-coninuous for each T. Furhermore, here exiss a measurable funcion φ : R n R m for each T such ha φ (x) U(x) and v (x) = µ D [ r(x, φ (x)) + ] v +1 (f(x, φ (x), w))dµ(w) for all x R n. The non-randomized Markov policy π := (φ 0,, φ T 1 ) Π is an opimal soluion o he disribuionally robus conrol problem (3), i.e., v 0 (x) = J x [π, γ]. γ Γ This heorem can be shown by exending he proof of Theorem 1 in [7] and Theorem 3.1 in [21]. The key idea is o show ha he lower semi-coninuiy of v is preserved hrough he dynamic programming operaor. We can hen use mahemaical inducion o show ha v is lower semiconinuous and hus, he ouer minimizaion problem in he definiion of value funcions admis an opimal soluion. Noe ha Theorem 1 allows us o idenify an imporan srucural propery of he disribuionally robus conrol problem: he sysem sae is a sufficien saisic for Player I (conroller) under Assumpion 1. This observaion yields he pracical advanage ha i suffices o focus on non-randomized Markov polices when designing an opimal disribuionally robus conroller. B. Bellman Equaion Applying he dynamic programming principle [22], [23], we can evaluae he value funcion backward in ime as follows: Proposiion 2. Suppose ha Assumpion 1 holds. Then, he value funcion v saisfies he following Bellman equaion: v (x) = min u U(x) [ r(x, u) + µ D wih v T (x) = q(x). ] (4) v +1 (f(x, u, w))dµ(w) Noe ha due o Theorem 1 he ouer minimizaion problem in he Bellman equaion admis an opimal soluion. From he numerical perspecive, however, solving he Bellman equaion is challenging. In addiion o he scalabiliy issue inheren in dynamic programming, his Bellman equaion involves an infinie-dimensional minimax opimizaion problem because he disurbance may have a coninuous densiy. In he nex secion, we will resolve his compuaional issue for an imporan class of disribuionally robus sochasic conrol problems in which confidence ses are specified. Wih such an ambiguiy se, we will show ha an opimal disribuionally robus policy can be obained by solving compuaionally racable semi-infinie programs if he value funcion is convex. C. Convexiy of he Value Funcion We now show ha he value funcion is convex under he following condiions: Assumpion 2. The disribuionally robus conrol problem (3) saisfies he followings: (i) r : R n R m R and q : R n R are convex funcions; (ii) f : R n R m R n is an affine funcion; (iii) For all λ (0, 1) and for all x 1, x 2 R n, if u i U(x i ), i = 1, 2, hen λu 1 + (1 λ)u 2 U(λx 1 + (1 λ)x 2 ). Proposiion 3. Suppose ha Assumpion 2 holds. Then, he value funcion v : R n R is convex for each T. Proof. We use mahemaical inducion. For = T, v T : R n R is convex since v T = q. Suppose ha v τ : R n R is convex for τ = T 1,, + 1. We now he consider he value funcion a sage. Fix λ (0, 1) and x 1, x 2 R n. For any ɛ > 0, here exiss an ɛ-opimal soluion u i U(x i ) o he ouer minimizaion problem in he Bellman equaion (4) for (, x i ), i.e., v (x i )+ɛ > r(x i, u i )+ v +1 (f(x i, u i, w))dµ(w). µ D

4 Le x λ := λx 1 + (1 λ)x 2 and u λ := λu 1 + (1 λ)u 2. Due o Assumpion 2 (iii), u λ U(x λ ). Thus, v (x λ ) r(x λ, u λ ) + v +1 (f(x λ, u λ, w))dµ(w). µ D We now noice ha f(x λ, u λ, w) = λf(x 1, u 1, w) + (1 λ)f(x 2, u 2, w) due o Assumpion 2 (ii). Since v +1 and r are convex, we obain ha v (x λ ) λr(x 1, u 1 ) + (1 λ)r(x 2, u 2 ) + λv +1 (f(x 1, u 1, w))dµ(w) µ D R l + (1 λ)v +1 (f(x 2, u 2, w))dµ(w) µ D < λv (x 1 ) + (1 λ)v (x 2 ) + ɛ. Leing ɛ 0, we conclude ha v is convex. Noe ha his proposiion does no require he exisence of opimal disribuionally robus policies. Furhermore, we do no impose any specific srucure on he ambiguiy se D for he convexiy of he value funcion. IV. STRONG DUALITY-BASED REFORMULATION A. Disribuional Ambiguiy wih Conic Confidence Ses We now focus on he disribuionally robus conrol problem wih a paricular ambiguiy se of he form (2) in Example 1: D := {µ P( ) µ(c i ) [p i, pi ], i I }, where I := {1,, N } and he confidence se C i has he following conic represenaion: C i = {w C i w K i d i }, where C i R Li l and d i R Li are model parameers, and K i is a proper cone. We impose he following assumpion: Assumpion 3. The ambiguiy se saisfies he following condiions: (i) The confidence se C N is compac and p N = p N = 1. (ii) There exiss a disribuion measure µ D such ha µ(c) i (p i, pi ) whenever p i < pi, i I. (iii) (Nesing Condiion) For each T and all i, i I such ha i i, we have eiher C i s C i, C i s C i or C i C i =, where X s Y represens ha X is a sric subse of Y. The firs condiion represens ha C N is he por of µ. The second condiion ensures ha here exiss a probabiliy disribuion ha saisfies he probabilisic consrains in D as sric inequaliies whenever p i < pi. These wo regulariy condiions will guaranee ha srong dualiy holds based on he generalized Slaer-ype resuls from Shapiro [24] when he Bellman equaion is reformulaed in he nex subsecion. The hird condiion is called he nesing condiion [9], which implies ha here exiss a sric parial order on he confidence ses regarding he se inclusion and ha any incomparable ses are disjoin. This nesing condiion, ogeher wih he wo regulariy condiions, provides a racable dual formulaion of he Bellman equaion wihou loss of opimaliy. B. Dual Bellman Equaion We now reformulae he infinie dimensional minimax opimizaion problem in he Bellman equaion (4) as a semiinfinie program, which can be numerically solved by exising convergen algorihms. Furhermore, his reformulaion based on srong dualiy is exac as shown in he following heorem. Theorem 2. Suppose ha Assumpions 1, 2 and 3 hold. Then, he following equaliy holds for all (, x) T R n : 3 v (x) = inf r(x, u) + (p i κ i p i u,κ,λ,ν λi ) i I s.. (C i w d i ) ν i + i A (i) (κ i λ i ) v +1 (f(x, u, w)) w C N i I u U(x), λ, κ R N +, ν i K i, where A (i) := {i} {i I C i s C i }, wih he erminal condiion v T (x) = q(x). Proof. By inroducing a slack variable z R, we can rewrie he Bellman equaion (4) in he following equivalen form: v (x) = inf z u U(x),z R { s.. r(x, u) µ D } + v +1 (f(x, u, w))dµ(w) z for each (, x) T R n. We firs focus on he maximizaion problem in he inequaliy consrain. I can be rewrien as he following infinie-dimensional linear program: r(x, u) + v +1 (f(x, u, w))dµ(w) µ P( ) R l s.. 1 {w C i C N }dµ(w) p i i I 1 {w C i }dµ(w) p i i I. C N Under Assumpion 3, he generalized Slaer condiion holds [24]. Thus, here is no dualiy gap and we have he following dual formulaion of he problem above wihou loss of opimaliy: inf κ,λ R N + r(x, u) + i I (p i κ i p i λi ) s.. v +1 (f(x, u, w)) + i I 1 {w C i }(λ i κ i ) 0 w C N. 3 In he reformulaed Bellman equaion, min u is merged wih inf κ,λ,ν for a compac represenaion. The minimizaion problem admis an opimal soluion u.

5 Le B (i) be he index ses of all he sric subses of C, i i.e., B (i) := {i I C i s C}. i We also le C i := C i \ C i. i B (i) Due o he nesing condiion, { C 1 N,, C } is a disjoin pariion of he por C N. Therefore, he inequaliy consrain of he dual problem can be rewrien as v +1 (f(x, u, w)) + (λ i κ i ) 0 w C i i I. i A (i) The inequaliy consrain associaed wih he index i I is equivalen o w C i v +1 (f(x, u, w)) + (λ i κ i ) 0. i A (i) Since w v +1 (f(x, u, w)) is convex for each (x, u), he objecive funcion is convex wih respec o w. Therefore, he maximum is aained a he ouer boundary of C. i We now observe ha he ouer boundary of C i corresponds o he ouer boundary of C i due o he nesing condiion [9]. Thus, we can rewrie he ih consrain as w C N v +1 (f(x, u, w)) + s.. C i w K i d i. i A (i) (λ i κ i ) Is dual is given by he following semi-infinie program: inf d i ν i + θ i + (λ i κ i ) ν i K i,θi R s.. w C N i A (i) v +1 (f(x, u, w)) (C i w) ν i θ i. Puing he reformulaion resuls all ogeher, we have ha v (x) = inf z s.. r(x, u) + i I (p i κ i p i λi ) z d i ν i + θ i + i A (i) (λ i κ i ) 0 i I v +1 (f(x, u, w)) (C i w) ν i θ i w C N i I u U(x), κ, λ R N +, ν i K i, z R, θ R N. Viewing z and θ as slack variables, we conclude ha he saemen in he heorem holds. Noe ha he convexiy of w v +1 (f(x, u, w)) and he nesing condiion play a criical role in preserving opimaliy in he proposed reformulaion as originally observed by Wiesemann e al. in he conex of single-sage opimizaion [9]. When (u, w) v +1 (f(x, u, w)) is also piecewise affine, our resul is consisen wih Theorem 1 in [9]. Theorem 2 allows us o avoid solving he compuaionally challenging infinie-dimensional minimax opimizaion problems in he original Bellman equaion. Insead, we can evaluae he value funcion backward in ime by solving a C 2 =[3.5, 8.5] C 1 =[5.5, 6.5] w C 3 =[0, 12] Fig. 1: The empirical disribuion µ of w and he confidence ses C i, i = 1, 2, 3. semi-infinie program a each (discreized) sae. This semiinfinie program can be solved by several convergen mehods such as primal-dual mehods, discreizaion mehods, homoopy mehods, exchange mehods, and consrain sampling mehods (see [25], [26], [27] and he references herein). Among hem, we will use he convergen discreizaion mehod developed by Reemsen [28] in he nex secion. V. APPLICATION TO INVENTORY CONTROL We consider a sochasic invenory conrol problem o demonsrae he performance of our disribuionally robus conrol mehod. We use he sandard seing of sochasic Newsvendor problems (e.g., [29]). Le x R be an invenory level of ineres a sage. Given he quaniy u U := [0, 10] ordered and he sochasic demand w a sage, he invenory level evolves as x +1 = x + u w, for T := {0, 1,, 6}. We assume ha any unsaisfied demand is backlogged for he nex sage and hus allow negaive sae values. The sage-wise cos funcion is given by r(x, u, w ) = c o (x + u w ) + + c u (w x u ) +, where c o = 1 is he overage (or sorage) cos and c u = 1 is he underage cos (or he cos of los sales). Fig. 1 shows he empirical disribuion µ of w for all T, and he confidence ses used in our simulaions. We choose p i and p i as 90% and 110% of µ (C). i Thus, µ is conained in he consruced ambiguiy se D. We compare our disribuionally robus conroller designed using D and he sandard sochasic opimal conroller consruced wih he empirical disribuion µ when x 0 = 10. Suppose ha he acual disribuion µ rue of w is uniform in each confidence se and µ rue (C) i = p i. Then, µrue D bu µ rue is differen from he empirical disribuion µ. In our simulaion wih 10 5 rajecories of {w } sampled from {µ rue }, he disribuionally robus conrol mehod reduces he oal expeced cos incurred by he sandard conroller by 29.6%. This resul confirms ha our conroller is robus agains errors in disurbance disribuions while he sandard conroller is no. To invesigae why he proposed conroller performs beer han he sandard conroller under disribuional ambiguiy,

6 invenory level ime (day) (a) ime (day) (b) Fig. 2: Tukey box plos of sae rajecories conrolled by (a) he sandard sochasic opimal conroller and (b) he disribuionally robus conroller. we now compare heir conrolled sae rajecories. As shown in Fig. 2, he disribuionally robus conroller drives he median (and he firs and hird quaniles) of sae rajecories closer o he origin han he sandard conroller. Since our conroller considers he wors-case disribuion in he ambiguiy se, i can conrol he sysem in a desirable manner (mainaining he invenory level close o zero) even when deviaes from he empirical disribuion µ. On he oher hand, he sandard conroller opimizes he sysem performance only when µ rue = µ ; oherwise, here is no performance guaranee. In paricular, he sandard conroller iniially increases he invenory level by using approximaely 97% of he maximum allowable conrol value. This aggressive conrol acion is inended o saisfy demand a laer sages wih he limied conrol range U := [0, 10]. However, µ rue as µ rue deviaes from µ, his sandard conrol sraegy generaes higher overage coss han expeced. On he oher hand, he disribuionally robus conroller is designed o ake ino accoun such possibiliies and is capable of balancing he overage and underage coss when µ rue µ. VI. CONCLUSION AND FUTURE WORK We have proposed a dualiy-based dynamic programming approach o disribuionally robus conrol problems wih conic confidence ses. The srucural propery we idenified allows us o focus on non-randomized Markov policies wih sae feedback. Our exac dualiy-based reformulaion mehod also alleviaes he compuaional issues in he original Bellman equaion ha involves infinie-dimensional minimax opimizaion problems. As a fuure research, i is of grea ineres o develop a scalable numerical mehod for he reformulaed Bellman equaion. Furhermore, adding risk consrains may help in sysemaically discouraging undesirable sysem behaviors. REFERENCES [1] A. Nilim and L. El Ghaoui, Robus conrol of Markov decision processes wih uncerain ransiion marices, Operaions Research, vol. 53, no. 5, pp , [2] S. Samuelson and I. Yang, Daa-driven disribuionally robus conrol of energy sorage o manage wind power flucuaions, in Proceedings of he 1s IEEE Conference on Conrol Technology and Applicaions, [3] H. Xu and S. Mannor, Disribuionally robus Markov decision processes, Mahemaics of Operaions Research, vol. 37, no. 2, pp , [4] P. Yu and H. Xu, Disribuionally robus counerpar in Markov decision processes, IEEE Transacions on Auomaic Conrol, vol. 61, no. 9, pp , [5] I. Yang, A convex opimizaion approach o disribuionally robus Markov decision processes wih Wassersein disance, IEEE Conrol Sysems Leers, vol. 1, no. 1, pp , [6] B. P. G. Van Parys, D. Kuhn, P. J. Goular, and M. Morari, Disribuionally robus conrol of consrained sochasic sysems, IEEE Transacions on Auomaic Conrol, vol. 61, no. 2, pp , [7] I. Yang, A dynamic game approach o disribuionally robus safey specificaions for sochasic sysems, arxiv: , [8] I. Tzorzis, C. D. Charalambous, and T. Charalambous, Dynamic programming subjec o oal variaion disance ambiguiy, SIAM Journal on Conrol and Opimizaion, vol. 53, no. 4, pp , [9] W. Wiesemann, D. Kuhn, and M. Sim, Disribuionally robus convex opimizaion, Operaions Research, vol. 62, no. 6, pp , [10] H. Scarf, K. J. Arrow, and S. Karlin, A min-max soluion of an invenory problem, Sudies in he Mahemaical Theory of Invenory and Producion, pp , [11] J. Dupačová, The minimax approach o sochasic programming and an lllusraive applicaion, Sochasics, vol. 20, pp , [12] E. Delage and Y. Ye, Disribuionally robus opimizaion under momen uncerainy wih applicaion o daa-driven problems, Operaions Research, vol. 58, no. 3, pp , [13] I. Popescu, Robus mean-covariance soluions for sochasic opimizaion, Operaions Research, vol. 55, no. 1, pp , [14] S. Zymler, D. Kuhn, and B. Rusem, Disribuionally robus join chance consrains wih second-order momen informaion, Mahemaical Programming, Ser. A, vol. 137, pp , [15] A. Ben-Tal, D. Den Herog, A. De Waegenaere, B. Melenberg, and G. Rennen, Robus soluions of opimizaion problems affeced by uncerain probabiliies, Managemen Science, vol. 59, no. 2, pp , [16] R. Jiang and Y. Guan, Daa-driven chance consrained sochasic program, Mahemaical Programming, Ser. A, vol. 158, pp , [17] H. Sun and H. Xu, Convergence analysis for disribuionally robus opimizaion and equilibrium problems, Mahemaics of Operaions Research, vol. 41, no. 2, pp , [18] E. Erdoğan and G. Iyengar, Ambiguous chance consrained problems and robus opimizaion, Mahemaical Programming, Ser. B, vol. 107, pp , [19] P. Mohajerin Esfahani and D. Kuhn, Daa-driven disribuionally robus opimizaion using he Wassersein meric: Performance guaranees and racable reformulaions, arxiv: , [20] R. Gao and A. J. Kleyweg, Disribuionally robus sochasic opimizaion wih Wassersein disance, arxiv: , [21] J. I. González-Trejo, O. Hernández-Lerma, and L. F. Hoyos-Reyes, Minimax conrol of discree-ime sochasic sysems, SIAM Journal on Conrol and Opimizaion, vol. 41, no. 5, pp , [22] R. Bellman, Dynamic programming and Lagrange mulipliers, Proceedings of he Naional Academy of Sciences, vol. 42, no. 10, pp , [23] O. Hernández-Lerma and J. B. Lasserre, Discree-Time Markov Conrol Processes: Basic Opimaliy Crieria. Springer, [24] A. Shapiro, On dualiy heory of conic linear problems, in Semi- Infinie Programming. Springer, 2001, pp [25] R. Heich and K. O. Koranek, Semi-infinie programming: Theory, mehods, and applicaions, SIAM Review, vol. 35, no. 3, pp , [26] M. López and G. Sill, Semi-infinie programming, European Journal of Operaional Research, vol. 180, pp , [27] G. Calafiore and M. C. Campi, Uncerain convex programs: randomized soluions and confidence levels, Mahemaical Programming, Ser. A, vol. 102, pp , [28] R. Reemsen, Discreizaion mehods for he soluion of semi-infinie programming problems, Journal of Opimizaion Theory and Applicaions, vol. 71, no. 1, pp , [29] R. Levi, R. O. Roundy, and D. B. Shmoys, Provably near-opimal sampling-based policies for sochasic invenory conrol models, Mahemaics of Operaions Research, vol. 32, no. 4, pp , 2007.

An introduction to the theory of SDDP algorithm

An introduction to the theory of SDDP algorithm An inroducion o he heory of SDDP algorihm V. Leclère (ENPC) Augus 1, 2014 V. Leclère Inroducion o SDDP Augus 1, 2014 1 / 21 Inroducion Large scale sochasic problem are hard o solve. Two ways of aacking

More information

T L. t=1. Proof of Lemma 1. Using the marginal cost accounting in Equation(4) and standard arguments. t )+Π RB. t )+K 1(Q RB

T L. t=1. Proof of Lemma 1. Using the marginal cost accounting in Equation(4) and standard arguments. t )+Π RB. t )+K 1(Q RB Elecronic Companion EC.1. Proofs of Technical Lemmas and Theorems LEMMA 1. Le C(RB) be he oal cos incurred by he RB policy. Then we have, T L E[C(RB)] 3 E[Z RB ]. (EC.1) Proof of Lemma 1. Using he marginal

More information

Lecture 20: Riccati Equations and Least Squares Feedback Control

Lecture 20: Riccati Equations and Least Squares Feedback Control 34-5 LINEAR SYSTEMS Lecure : Riccai Equaions and Leas Squares Feedback Conrol 5.6.4 Sae Feedback via Riccai Equaions A recursive approach in generaing he marix-valued funcion W ( ) equaion for i for he

More information

A Primal-Dual Type Algorithm with the O(1/t) Convergence Rate for Large Scale Constrained Convex Programs

A Primal-Dual Type Algorithm with the O(1/t) Convergence Rate for Large Scale Constrained Convex Programs PROC. IEEE CONFERENCE ON DECISION AND CONTROL, 06 A Primal-Dual Type Algorihm wih he O(/) Convergence Rae for Large Scale Consrained Convex Programs Hao Yu and Michael J. Neely Absrac This paper considers

More information

Application of a Stochastic-Fuzzy Approach to Modeling Optimal Discrete Time Dynamical Systems by Using Large Scale Data Processing

Application of a Stochastic-Fuzzy Approach to Modeling Optimal Discrete Time Dynamical Systems by Using Large Scale Data Processing Applicaion of a Sochasic-Fuzzy Approach o Modeling Opimal Discree Time Dynamical Sysems by Using Large Scale Daa Processing AA WALASZE-BABISZEWSA Deparmen of Compuer Engineering Opole Universiy of Technology

More information

Decentralized Stochastic Control with Partial History Sharing: A Common Information Approach

Decentralized Stochastic Control with Partial History Sharing: A Common Information Approach 1 Decenralized Sochasic Conrol wih Parial Hisory Sharing: A Common Informaion Approach Ashuosh Nayyar, Adiya Mahajan and Demoshenis Tenekezis arxiv:1209.1695v1 [cs.sy] 8 Sep 2012 Absrac A general model

More information

Vehicle Arrival Models : Headway

Vehicle Arrival Models : Headway Chaper 12 Vehicle Arrival Models : Headway 12.1 Inroducion Modelling arrival of vehicle a secion of road is an imporan sep in raffic flow modelling. I has imporan applicaion in raffic flow simulaion where

More information

Expert Advice for Amateurs

Expert Advice for Amateurs Exper Advice for Amaeurs Ernes K. Lai Online Appendix - Exisence of Equilibria The analysis in his secion is performed under more general payoff funcions. Wihou aking an explici form, he payoffs of he

More information

This document was generated at 1:04 PM, 09/10/13 Copyright 2013 Richard T. Woodward. 4. End points and transversality conditions AGEC

This document was generated at 1:04 PM, 09/10/13 Copyright 2013 Richard T. Woodward. 4. End points and transversality conditions AGEC his documen was generaed a 1:4 PM, 9/1/13 Copyrigh 213 Richard. Woodward 4. End poins and ransversaliy condiions AGEC 637-213 F z d Recall from Lecure 3 ha a ypical opimal conrol problem is o maimize (,,

More information

On Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems

On Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems MATHEMATICS OF OPERATIONS RESEARCH Vol. 38, No. 2, May 2013, pp. 209 227 ISSN 0364-765X (prin) ISSN 1526-5471 (online) hp://dx.doi.org/10.1287/moor.1120.0562 2013 INFORMS On Boundedness of Q-Learning Ieraes

More information

Technical Report Doc ID: TR March-2013 (Last revision: 23-February-2016) On formulating quadratic functions in optimization models.

Technical Report Doc ID: TR March-2013 (Last revision: 23-February-2016) On formulating quadratic functions in optimization models. Technical Repor Doc ID: TR--203 06-March-203 (Las revision: 23-Februar-206) On formulaing quadraic funcions in opimizaion models. Auhor: Erling D. Andersen Convex quadraic consrains quie frequenl appear

More information

Zürich. ETH Master Course: L Autonomous Mobile Robots Localization II

Zürich. ETH Master Course: L Autonomous Mobile Robots Localization II Roland Siegwar Margaria Chli Paul Furgale Marco Huer Marin Rufli Davide Scaramuzza ETH Maser Course: 151-0854-00L Auonomous Mobile Robos Localizaion II ACT and SEE For all do, (predicion updae / ACT),

More information

STATE-SPACE MODELLING. A mass balance across the tank gives:

STATE-SPACE MODELLING. A mass balance across the tank gives: B. Lennox and N.F. Thornhill, 9, Sae Space Modelling, IChemE Process Managemen and Conrol Subjec Group Newsleer STE-SPACE MODELLING Inroducion: Over he pas decade or so here has been an ever increasing

More information

Robust estimation based on the first- and third-moment restrictions of the power transformation model

Robust estimation based on the first- and third-moment restrictions of the power transformation model h Inernaional Congress on Modelling and Simulaion, Adelaide, Ausralia, 6 December 3 www.mssanz.org.au/modsim3 Robus esimaion based on he firs- and hird-momen resricions of he power ransformaion Nawaa,

More information

t dt t SCLP Bellman (1953) CLP (Dantzig, Tyndall, Grinold, Perold, Anstreicher 60's-80's) Anderson (1978) SCLP

t dt t SCLP Bellman (1953) CLP (Dantzig, Tyndall, Grinold, Perold, Anstreicher 60's-80's) Anderson (1978) SCLP Coninuous Linear Programming. Separaed Coninuous Linear Programming Bellman (1953) max c () u() d H () u () + Gsusds (,) () a () u (), < < CLP (Danzig, yndall, Grinold, Perold, Ansreicher 6's-8's) Anderson

More information

Mean-square Stability Control for Networked Systems with Stochastic Time Delay

Mean-square Stability Control for Networked Systems with Stochastic Time Delay JOURNAL OF SIMULAION VOL. 5 NO. May 7 Mean-square Sabiliy Conrol for Newored Sysems wih Sochasic ime Delay YAO Hejun YUAN Fushun School of Mahemaics and Saisics Anyang Normal Universiy Anyang Henan. 455

More information

Notes on Kalman Filtering

Notes on Kalman Filtering Noes on Kalman Filering Brian Borchers and Rick Aser November 7, Inroducion Daa Assimilaion is he problem of merging model predicions wih acual measuremens of a sysem o produce an opimal esimae of he curren

More information

Two Popular Bayesian Estimators: Particle and Kalman Filters. McGill COMP 765 Sept 14 th, 2017

Two Popular Bayesian Estimators: Particle and Kalman Filters. McGill COMP 765 Sept 14 th, 2017 Two Popular Bayesian Esimaors: Paricle and Kalman Filers McGill COMP 765 Sep 14 h, 2017 1 1 1, dx x Bel x u x P x z P Recall: Bayes Filers,,,,,,, 1 1 1 1 u z u x P u z u x z P Bayes z = observaion u =

More information

Optimality Conditions for Unconstrained Problems

Optimality Conditions for Unconstrained Problems 62 CHAPTER 6 Opimaliy Condiions for Unconsrained Problems 1 Unconsrained Opimizaion 11 Exisence Consider he problem of minimizing he funcion f : R n R where f is coninuous on all of R n : P min f(x) x

More information

On a Discrete-In-Time Order Level Inventory Model for Items with Random Deterioration

On a Discrete-In-Time Order Level Inventory Model for Items with Random Deterioration Journal of Agriculure and Life Sciences Vol., No. ; June 4 On a Discree-In-Time Order Level Invenory Model for Iems wih Random Deerioraion Dr Biswaranjan Mandal Associae Professor of Mahemaics Acharya

More information

Online Appendix to Solution Methods for Models with Rare Disasters

Online Appendix to Solution Methods for Models with Rare Disasters Online Appendix o Soluion Mehods for Models wih Rare Disasers Jesús Fernández-Villaverde and Oren Levinal In his Online Appendix, we presen he Euler condiions of he model, we develop he pricing Calvo block,

More information

CONTROL SYSTEMS, ROBOTICS AND AUTOMATION Vol. XI Control of Stochastic Systems - P.R. Kumar

CONTROL SYSTEMS, ROBOTICS AND AUTOMATION Vol. XI Control of Stochastic Systems - P.R. Kumar CONROL OF SOCHASIC SYSEMS P.R. Kumar Deparmen of Elecrical and Compuer Engineering, and Coordinaed Science Laboraory, Universiy of Illinois, Urbana-Champaign, USA. Keywords: Markov chains, ransiion probabiliies,

More information

Information Relaxations and Duality in Stochastic Dynamic Programs

Information Relaxations and Duality in Stochastic Dynamic Programs OPERATIONS RESEARCH Vol. 58, No. 4, Par 1 of 2, July Augus 2010, pp. 785 801 issn 0030-364X eissn 1526-5463 10 5804 0785 informs doi 10.1287/opre.1090.0796 2010 INFORMS Informaion Relaxaions and Dualiy

More information

Inventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions

Inventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions Muli-Period Sochasic Models: Opimali of (s, S) Polic for -Convex Objecive Funcions Consider a seing similar o he N-sage newsvendor problem excep ha now here is a fixed re-ordering cos (> 0) for each (re-)order.

More information

RANDOM LAGRANGE MULTIPLIERS AND TRANSVERSALITY

RANDOM LAGRANGE MULTIPLIERS AND TRANSVERSALITY ECO 504 Spring 2006 Chris Sims RANDOM LAGRANGE MULTIPLIERS AND TRANSVERSALITY 1. INTRODUCTION Lagrange muliplier mehods are sandard fare in elemenary calculus courses, and hey play a cenral role in economic

More information

Inventory Control of Perishable Items in a Two-Echelon Supply Chain

Inventory Control of Perishable Items in a Two-Echelon Supply Chain Journal of Indusrial Engineering, Universiy of ehran, Special Issue,, PP. 69-77 69 Invenory Conrol of Perishable Iems in a wo-echelon Supply Chain Fariborz Jolai *, Elmira Gheisariha and Farnaz Nojavan

More information

1. An introduction to dynamic optimization -- Optimal Control and Dynamic Programming AGEC

1. An introduction to dynamic optimization -- Optimal Control and Dynamic Programming AGEC This documen was generaed a :45 PM 8/8/04 Copyrigh 04 Richard T. Woodward. An inroducion o dynamic opimizaion -- Opimal Conrol and Dynamic Programming AGEC 637-04 I. Overview of opimizaion Opimizaion is

More information

Stability and Bifurcation in a Neural Network Model with Two Delays

Stability and Bifurcation in a Neural Network Model with Two Delays Inernaional Mahemaical Forum, Vol. 6, 11, no. 35, 175-1731 Sabiliy and Bifurcaion in a Neural Nework Model wih Two Delays GuangPing Hu and XiaoLing Li School of Mahemaics and Physics, Nanjing Universiy

More information

Modal identification of structures from roving input data by means of maximum likelihood estimation of the state space model

Modal identification of structures from roving input data by means of maximum likelihood estimation of the state space model Modal idenificaion of srucures from roving inpu daa by means of maximum likelihood esimaion of he sae space model J. Cara, J. Juan, E. Alarcón Absrac The usual way o perform a forced vibraion es is o fix

More information

Physics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle

Physics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle Chaper 2 Newonian Mechanics Single Paricle In his Chaper we will review wha Newon s laws of mechanics ell us abou he moion of a single paricle. Newon s laws are only valid in suiable reference frames,

More information

O Q L N. Discrete-Time Stochastic Dynamic Programming. I. Notation and basic assumptions. ε t : a px1 random vector of disturbances at time t.

O Q L N. Discrete-Time Stochastic Dynamic Programming. I. Notation and basic assumptions. ε t : a px1 random vector of disturbances at time t. Econ. 5b Spring 999 C. Sims Discree-Time Sochasic Dynamic Programming 995, 996 by Chrisopher Sims. This maerial may be freely reproduced for educaional and research purposes, so long as i is no alered,

More information

1. An introduction to dynamic optimization -- Optimal Control and Dynamic Programming AGEC

1. An introduction to dynamic optimization -- Optimal Control and Dynamic Programming AGEC This documen was generaed a :37 PM, 1/11/018 Copyrigh 018 Richard T. Woodward 1. An inroducion o dynamic opimiaion -- Opimal Conrol and Dynamic Programming AGEC 64-018 I. Overview of opimiaion Opimiaion

More information

International Journal of Scientific & Engineering Research, Volume 4, Issue 10, October ISSN

International Journal of Scientific & Engineering Research, Volume 4, Issue 10, October ISSN Inernaional Journal of Scienific & Engineering Research, Volume 4, Issue 10, Ocober-2013 900 FUZZY MEAN RESIDUAL LIFE ORDERING OF FUZZY RANDOM VARIABLES J. EARNEST LAZARUS PIRIYAKUMAR 1, A. YAMUNA 2 1.

More information

Logic in computer science

Logic in computer science Logic in compuer science Logic plays an imporan role in compuer science Logic is ofen called he calculus of compuer science Logic plays a similar role in compuer science o ha played by calculus in he physical

More information

2. Nonlinear Conservation Law Equations

2. Nonlinear Conservation Law Equations . Nonlinear Conservaion Law Equaions One of he clear lessons learned over recen years in sudying nonlinear parial differenial equaions is ha i is generally no wise o ry o aack a general class of nonlinear

More information

MODULE 3 FUNCTION OF A RANDOM VARIABLE AND ITS DISTRIBUTION LECTURES PROBABILITY DISTRIBUTION OF A FUNCTION OF A RANDOM VARIABLE

MODULE 3 FUNCTION OF A RANDOM VARIABLE AND ITS DISTRIBUTION LECTURES PROBABILITY DISTRIBUTION OF A FUNCTION OF A RANDOM VARIABLE Topics MODULE 3 FUNCTION OF A RANDOM VARIABLE AND ITS DISTRIBUTION LECTURES 2-6 3. FUNCTION OF A RANDOM VARIABLE 3.2 PROBABILITY DISTRIBUTION OF A FUNCTION OF A RANDOM VARIABLE 3.3 EXPECTATION AND MOMENTS

More information

The Asymptotic Behavior of Nonoscillatory Solutions of Some Nonlinear Dynamic Equations on Time Scales

The Asymptotic Behavior of Nonoscillatory Solutions of Some Nonlinear Dynamic Equations on Time Scales Advances in Dynamical Sysems and Applicaions. ISSN 0973-5321 Volume 1 Number 1 (2006, pp. 103 112 c Research India Publicaions hp://www.ripublicaion.com/adsa.hm The Asympoic Behavior of Nonoscillaory Soluions

More information

arxiv: v1 [math.oc] 27 Jul 2009

arxiv: v1 [math.oc] 27 Jul 2009 PARTICLE METHODS FOR STOCHASTIC OPTIMAL CONTROL PROBLEMS PIERRE CARPENTIER GUY COHEN AND ANES DALLAGI arxiv:0907.4663v1 [mah.oc] 27 Jul 2009 Absrac. When dealing wih numerical soluion of sochasic opimal

More information

INTRODUCTION TO MACHINE LEARNING 3RD EDITION

INTRODUCTION TO MACHINE LEARNING 3RD EDITION ETHEM ALPAYDIN The MIT Press, 2014 Lecure Slides for INTRODUCTION TO MACHINE LEARNING 3RD EDITION alpaydin@boun.edu.r hp://www.cmpe.boun.edu.r/~ehem/i2ml3e CHAPTER 2: SUPERVISED LEARNING Learning a Class

More information

Two New Uncertainty Programming Models of Inventory with Uncertain Costs

Two New Uncertainty Programming Models of Inventory with Uncertain Costs Journal of Informaion & Compuaional Science 8: 2 (211) 28 288 Available a hp://www.joics.com Two New Uncerainy Programming Models of Invenory wih Uncerain Coss Lixia Rong Compuer Science and Technology

More information

23.5. Half-Range Series. Introduction. Prerequisites. Learning Outcomes

23.5. Half-Range Series. Introduction. Prerequisites. Learning Outcomes Half-Range Series 2.5 Inroducion In his Secion we address he following problem: Can we find a Fourier series expansion of a funcion defined over a finie inerval? Of course we recognise ha such a funcion

More information

GMM - Generalized Method of Moments

GMM - Generalized Method of Moments GMM - Generalized Mehod of Momens Conens GMM esimaion, shor inroducion 2 GMM inuiion: Maching momens 2 3 General overview of GMM esimaion. 3 3. Weighing marix...........................................

More information

Sliding Mode Extremum Seeking Control for Linear Quadratic Dynamic Game

Sliding Mode Extremum Seeking Control for Linear Quadratic Dynamic Game Sliding Mode Exremum Seeking Conrol for Linear Quadraic Dynamic Game Yaodong Pan and Ümi Özgüner ITS Research Group, AIST Tsukuba Eas Namiki --, Tsukuba-shi,Ibaraki-ken 5-856, Japan e-mail: pan.yaodong@ais.go.jp

More information

Notes for Lecture 17-18

Notes for Lecture 17-18 U.C. Berkeley CS278: Compuaional Complexiy Handou N7-8 Professor Luca Trevisan April 3-8, 2008 Noes for Lecure 7-8 In hese wo lecures we prove he firs half of he PCP Theorem, he Amplificaion Lemma, up

More information

A DELAY-DEPENDENT STABILITY CRITERIA FOR T-S FUZZY SYSTEM WITH TIME-DELAYS

A DELAY-DEPENDENT STABILITY CRITERIA FOR T-S FUZZY SYSTEM WITH TIME-DELAYS A DELAY-DEPENDENT STABILITY CRITERIA FOR T-S FUZZY SYSTEM WITH TIME-DELAYS Xinping Guan ;1 Fenglei Li Cailian Chen Insiue of Elecrical Engineering, Yanshan Universiy, Qinhuangdao, 066004, China. Deparmen

More information

Signaling equilibria for dynamic LQG games with. asymmetric information

Signaling equilibria for dynamic LQG games with. asymmetric information Signaling equilibria for dynamic LQG games wih asymmeric informaion Deepanshu Vasal and Achilleas Anasasopoulos Absrac We consider a finie horizon dynamic game wih wo players who observe heir ypes privaely

More information

Stable approximations of optimal filters

Stable approximations of optimal filters Sable approximaions of opimal filers Joaquin Miguez Deparmen of Signal Theory & Communicaions, Universidad Carlos III de Madrid. E-mail: joaquin.miguez@uc3m.es Join work wih Dan Crisan (Imperial College

More information

Applying Genetic Algorithms for Inventory Lot-Sizing Problem with Supplier Selection under Storage Capacity Constraints

Applying Genetic Algorithms for Inventory Lot-Sizing Problem with Supplier Selection under Storage Capacity Constraints IJCSI Inernaional Journal of Compuer Science Issues, Vol 9, Issue 1, No 1, January 2012 wwwijcsiorg 18 Applying Geneic Algorihms for Invenory Lo-Sizing Problem wih Supplier Selecion under Sorage Capaciy

More information

ACE 562 Fall Lecture 5: The Simple Linear Regression Model: Sampling Properties of the Least Squares Estimators. by Professor Scott H.

ACE 562 Fall Lecture 5: The Simple Linear Regression Model: Sampling Properties of the Least Squares Estimators. by Professor Scott H. ACE 56 Fall 005 Lecure 5: he Simple Linear Regression Model: Sampling Properies of he Leas Squares Esimaors by Professor Sco H. Irwin Required Reading: Griffihs, Hill and Judge. "Inference in he Simple

More information

3.1.3 INTRODUCTION TO DYNAMIC OPTIMIZATION: DISCRETE TIME PROBLEMS. A. The Hamiltonian and First-Order Conditions in a Finite Time Horizon

3.1.3 INTRODUCTION TO DYNAMIC OPTIMIZATION: DISCRETE TIME PROBLEMS. A. The Hamiltonian and First-Order Conditions in a Finite Time Horizon 3..3 INRODUCION O DYNAMIC OPIMIZAION: DISCREE IME PROBLEMS A. he Hamilonian and Firs-Order Condiions in a Finie ime Horizon Define a new funcion, he Hamilonian funcion, H. H he change in he oal value of

More information

1 Review of Zero-Sum Games

1 Review of Zero-Sum Games COS 5: heoreical Machine Learning Lecurer: Rob Schapire Lecure #23 Scribe: Eugene Brevdo April 30, 2008 Review of Zero-Sum Games Las ime we inroduced a mahemaical model for wo player zero-sum games. Any

More information

5. Stochastic processes (1)

5. Stochastic processes (1) Lec05.pp S-38.45 - Inroducion o Teleraffic Theory Spring 2005 Conens Basic conceps Poisson process 2 Sochasic processes () Consider some quaniy in a eleraffic (or any) sysem I ypically evolves in ime randomly

More information

Christos Papadimitriou & Luca Trevisan November 22, 2016

Christos Papadimitriou & Luca Trevisan November 22, 2016 U.C. Bereley CS170: Algorihms Handou LN-11-22 Chrisos Papadimiriou & Luca Trevisan November 22, 2016 Sreaming algorihms In his lecure and he nex one we sudy memory-efficien algorihms ha process a sream

More information

Optimal Investment under Dynamic Risk Constraints and Partial Information

Optimal Investment under Dynamic Risk Constraints and Partial Information Opimal Invesmen under Dynamic Risk Consrains and Parial Informaion Wolfgang Puschögl Johann Radon Insiue for Compuaional and Applied Mahemaics (RICAM) Ausrian Academy of Sciences www.ricam.oeaw.ac.a 2

More information

A New Perturbative Approach in Nonlinear Singularity Analysis

A New Perturbative Approach in Nonlinear Singularity Analysis Journal of Mahemaics and Saisics 7 (: 49-54, ISSN 549-644 Science Publicaions A New Perurbaive Approach in Nonlinear Singulariy Analysis Ta-Leung Yee Deparmen of Mahemaics and Informaion Technology, The

More information

Lecture Notes 2. The Hilbert Space Approach to Time Series

Lecture Notes 2. The Hilbert Space Approach to Time Series Time Series Seven N. Durlauf Universiy of Wisconsin. Basic ideas Lecure Noes. The Hilber Space Approach o Time Series The Hilber space framework provides a very powerful language for discussing he relaionship

More information

SUFFICIENT CONDITIONS FOR EXISTENCE SOLUTION OF LINEAR TWO-POINT BOUNDARY PROBLEM IN MINIMIZATION OF QUADRATIC FUNCTIONAL

SUFFICIENT CONDITIONS FOR EXISTENCE SOLUTION OF LINEAR TWO-POINT BOUNDARY PROBLEM IN MINIMIZATION OF QUADRATIC FUNCTIONAL HE PUBLISHING HOUSE PROCEEDINGS OF HE ROMANIAN ACADEMY, Series A, OF HE ROMANIAN ACADEMY Volume, Number 4/200, pp 287 293 SUFFICIEN CONDIIONS FOR EXISENCE SOLUION OF LINEAR WO-POIN BOUNDARY PROBLEM IN

More information

6. Stochastic calculus with jump processes

6. Stochastic calculus with jump processes A) Trading sraegies (1/3) Marke wih d asses S = (S 1,, S d ) A rading sraegy can be modelled wih a vecor φ describing he quaniies invesed in each asse a each insan : φ = (φ 1,, φ d ) The value a of a porfolio

More information

Supplement for Stochastic Convex Optimization: Faster Local Growth Implies Faster Global Convergence

Supplement for Stochastic Convex Optimization: Faster Local Growth Implies Faster Global Convergence Supplemen for Sochasic Convex Opimizaion: Faser Local Growh Implies Faser Global Convergence Yi Xu Qihang Lin ianbao Yang Proof of heorem heorem Suppose Assumpion holds and F (w) obeys he LGC (6) Given

More information

Lecture 33: November 29

Lecture 33: November 29 36-705: Inermediae Saisics Fall 2017 Lecurer: Siva Balakrishnan Lecure 33: November 29 Today we will coninue discussing he boosrap, and hen ry o undersand why i works in a simple case. In he las lecure

More information

Lecture 4 Notes (Little s Theorem)

Lecture 4 Notes (Little s Theorem) Lecure 4 Noes (Lile s Theorem) This lecure concerns one of he mos imporan (and simples) heorems in Queuing Theory, Lile s Theorem. More informaion can be found in he course book, Bersekas & Gallagher,

More information

R t. C t P t. + u t. C t = αp t + βr t + v t. + β + w t

R t. C t P t. + u t. C t = αp t + βr t + v t. + β + w t Exercise 7 C P = α + β R P + u C = αp + βr + v (a) (b) C R = α P R + β + w (c) Assumpions abou he disurbances u, v, w : Classical assumions on he disurbance of one of he equaions, eg. on (b): E(v v s P,

More information

arxiv: v1 [math.pr] 19 Feb 2011

arxiv: v1 [math.pr] 19 Feb 2011 A NOTE ON FELLER SEMIGROUPS AND RESOLVENTS VADIM KOSTRYKIN, JÜRGEN POTTHOFF, AND ROBERT SCHRADER ABSTRACT. Various equivalen condiions for a semigroup or a resolven generaed by a Markov process o be of

More information

1 Subdivide the optimization horizon [t 0,t f ] into n s 1 control stages,

1 Subdivide the optimization horizon [t 0,t f ] into n s 1 control stages, Opimal Conrol Formulaion Opimal Conrol Lecures 19-2: Direc Soluion Mehods Benoî Chachua Deparmen of Chemical Engineering Spring 29 We are concerned wih numerical soluion procedures for

More information

Energy Storage Benchmark Problems

Energy Storage Benchmark Problems Energy Sorage Benchmark Problems Daniel F. Salas 1,3, Warren B. Powell 2,3 1 Deparmen of Chemical & Biological Engineering 2 Deparmen of Operaions Research & Financial Engineering 3 Princeon Laboraory

More information

Stochastic Perishable Inventory Systems: Dual-Balancing and Look-Ahead Approaches

Stochastic Perishable Inventory Systems: Dual-Balancing and Look-Ahead Approaches Sochasic Perishable Invenory Sysems: Dual-Balancing and Look-Ahead Approaches by Yuhe Diao A hesis presened o he Universiy Of Waerloo in fulfilmen of he hesis requiremen for he degree of Maser of Applied

More information

Optimal Control for LQG Systems on Graphs Part I: Structural Results

Optimal Control for LQG Systems on Graphs Part I: Structural Results Opimal Conrol for LQG Sysems on Graphs Par I: Srucural Resuls Ashuosh Nayyar Lauren Lessard Absrac In his wo-par paper, we idenify a broad class of decenralized oupu-feedback LQG sysems for which he opimal

More information

On Measuring Pro-Poor Growth. 1. On Various Ways of Measuring Pro-Poor Growth: A Short Review of the Literature

On Measuring Pro-Poor Growth. 1. On Various Ways of Measuring Pro-Poor Growth: A Short Review of the Literature On Measuring Pro-Poor Growh 1. On Various Ways of Measuring Pro-Poor Growh: A Shor eview of he Lieraure During he pas en years or so here have been various suggesions concerning he way one should check

More information

2.160 System Identification, Estimation, and Learning. Lecture Notes No. 8. March 6, 2006

2.160 System Identification, Estimation, and Learning. Lecture Notes No. 8. March 6, 2006 2.160 Sysem Idenificaion, Esimaion, and Learning Lecure Noes No. 8 March 6, 2006 4.9 Eended Kalman Filer In many pracical problems, he process dynamics are nonlinear. w Process Dynamics v y u Model (Linearized)

More information

An Introduction to Malliavin calculus and its applications

An Introduction to Malliavin calculus and its applications An Inroducion o Malliavin calculus and is applicaions Lecure 5: Smoohness of he densiy and Hörmander s heorem David Nualar Deparmen of Mahemaics Kansas Universiy Universiy of Wyoming Summer School 214

More information

Planning in POMDPs. Dominik Schoenberger Abstract

Planning in POMDPs. Dominik Schoenberger Abstract Planning in POMDPs Dominik Schoenberger d.schoenberger@sud.u-darmsad.de Absrac This documen briefly explains wha a Parially Observable Markov Decision Process is. Furhermore i inroduces he differen approaches

More information

OBJECTIVES OF TIME SERIES ANALYSIS

OBJECTIVES OF TIME SERIES ANALYSIS OBJECTIVES OF TIME SERIES ANALYSIS Undersanding he dynamic or imedependen srucure of he observaions of a single series (univariae analysis) Forecasing of fuure observaions Asceraining he leading, lagging

More information

Chapter 2. First Order Scalar Equations

Chapter 2. First Order Scalar Equations Chaper. Firs Order Scalar Equaions We sar our sudy of differenial equaions in he same way he pioneers in his field did. We show paricular echniques o solve paricular ypes of firs order differenial equaions.

More information

Utility maximization in incomplete markets

Utility maximization in incomplete markets Uiliy maximizaion in incomplee markes Marcel Ladkau 27.1.29 Conens 1 Inroducion and general seings 2 1.1 Marke model....................................... 2 1.2 Trading sraegy.....................................

More information

Class Meeting # 10: Introduction to the Wave Equation

Class Meeting # 10: Introduction to the Wave Equation MATH 8.5 COURSE NOTES - CLASS MEETING # 0 8.5 Inroducion o PDEs, Fall 0 Professor: Jared Speck Class Meeing # 0: Inroducion o he Wave Equaion. Wha is he wave equaion? The sandard wave equaion for a funcion

More information

Moving Horizon Estimation for Staged QP Problems

Moving Horizon Estimation for Staged QP Problems 51s IEEE Conference on Decision and Conrol, Maui, HI, December 212 Moving Horizon Esimaion for Saged QP Problems Eric Chu, Arezou Keshavarz, Dimiry Gorinevsky, and Sephen Boyd Absrac his paper considers

More information

Cash Flow Valuation Mode Lin Discrete Time

Cash Flow Valuation Mode Lin Discrete Time IOSR Journal of Mahemaics (IOSR-JM) e-issn: 2278-5728,p-ISSN: 2319-765X, 6, Issue 6 (May. - Jun. 2013), PP 35-41 Cash Flow Valuaion Mode Lin Discree Time Olayiwola. M. A. and Oni, N. O. Deparmen of Mahemaics

More information

12: AUTOREGRESSIVE AND MOVING AVERAGE PROCESSES IN DISCRETE TIME. Σ j =

12: AUTOREGRESSIVE AND MOVING AVERAGE PROCESSES IN DISCRETE TIME. Σ j = 1: AUTOREGRESSIVE AND MOVING AVERAGE PROCESSES IN DISCRETE TIME Moving Averages Recall ha a whie noise process is a series { } = having variance σ. The whie noise process has specral densiy f (λ) = of

More information

Exponential Weighted Moving Average (EWMA) Chart Under The Assumption of Moderateness And Its 3 Control Limits

Exponential Weighted Moving Average (EWMA) Chart Under The Assumption of Moderateness And Its 3 Control Limits DOI: 0.545/mjis.07.5009 Exponenial Weighed Moving Average (EWMA) Char Under The Assumpion of Moderaeness And Is 3 Conrol Limis KALPESH S TAILOR Assisan Professor, Deparmen of Saisics, M. K. Bhavnagar Universiy,

More information

MATHEMATICAL DESCRIPTION OF THEORETICAL METHODS OF RESERVE ECONOMY OF CONSIGNMENT STORES

MATHEMATICAL DESCRIPTION OF THEORETICAL METHODS OF RESERVE ECONOMY OF CONSIGNMENT STORES MAHEMAICAL DESCIPION OF HEOEICAL MEHODS OF ESEVE ECONOMY OF CONSIGNMEN SOES Péer elek, József Cselényi, György Demeer Universiy of Miskolc, Deparmen of Maerials Handling and Logisics Absrac: Opimizaion

More information

Games Against Nature

Games Against Nature Advanced Course in Machine Learning Spring 2010 Games Agains Naure Handous are joinly prepared by Shie Mannor and Shai Shalev-Shwarz In he previous lecures we alked abou expers in differen seups and analyzed

More information

Chapter 2. Models, Censoring, and Likelihood for Failure-Time Data

Chapter 2. Models, Censoring, and Likelihood for Failure-Time Data Chaper 2 Models, Censoring, and Likelihood for Failure-Time Daa William Q. Meeker and Luis A. Escobar Iowa Sae Universiy and Louisiana Sae Universiy Copyrigh 1998-2008 W. Q. Meeker and L. A. Escobar. Based

More information

An Introduction to Backward Stochastic Differential Equations (BSDEs) PIMS Summer School 2016 in Mathematical Finance.

An Introduction to Backward Stochastic Differential Equations (BSDEs) PIMS Summer School 2016 in Mathematical Finance. 1 An Inroducion o Backward Sochasic Differenial Equaions (BSDEs) PIMS Summer School 2016 in Mahemaical Finance June 25, 2016 Chrisoph Frei cfrei@ualbera.ca This inroducion is based on Touzi [14], Bouchard

More information

CENTRALIZED VERSUS DECENTRALIZED PRODUCTION PLANNING IN SUPPLY CHAINS

CENTRALIZED VERSUS DECENTRALIZED PRODUCTION PLANNING IN SUPPLY CHAINS CENRALIZED VERSUS DECENRALIZED PRODUCION PLANNING IN SUPPLY CHAINS Georges SAHARIDIS* a, Yves DALLERY* a, Fikri KARAESMEN* b * a Ecole Cenrale Paris Deparmen of Indusial Engineering (LGI), +3343388, saharidis,dallery@lgi.ecp.fr

More information

A Specification Test for Linear Dynamic Stochastic General Equilibrium Models

A Specification Test for Linear Dynamic Stochastic General Equilibrium Models Journal of Saisical and Economeric Mehods, vol.1, no.2, 2012, 65-70 ISSN: 2241-0384 (prin), 2241-0376 (online) Scienpress Ld, 2012 A Specificaion Tes for Linear Dynamic Sochasic General Equilibrium Models

More information

Competitive and Cooperative Inventory Policies in a Two-Stage Supply-Chain

Competitive and Cooperative Inventory Policies in a Two-Stage Supply-Chain Compeiive and Cooperaive Invenory Policies in a Two-Sage Supply-Chain (G. P. Cachon and P. H. Zipkin) Presened by Shruivandana Sharma IOE 64, Supply Chain Managemen, Winer 2009 Universiy of Michigan, Ann

More information

Echocardiography Project and Finite Fourier Series

Echocardiography Project and Finite Fourier Series Echocardiography Projec and Finie Fourier Series 1 U M An echocardiagram is a plo of how a porion of he hear moves as he funcion of ime over he one or more hearbea cycles If he hearbea repeas iself every

More information

THE BELLMAN PRINCIPLE OF OPTIMALITY

THE BELLMAN PRINCIPLE OF OPTIMALITY THE BELLMAN PRINCIPLE OF OPTIMALITY IOANID ROSU As I undersand, here are wo approaches o dynamic opimizaion: he Ponrjagin Hamilonian) approach, and he Bellman approach. I saw several clear discussions

More information

EXERCISES FOR SECTION 1.5

EXERCISES FOR SECTION 1.5 1.5 Exisence and Uniqueness of Soluions 43 20. 1 v c 21. 1 v c 1 2 4 6 8 10 1 2 2 4 6 8 10 Graph of approximae soluion obained using Euler s mehod wih = 0.1. Graph of approximae soluion obained using Euler

More information

POSITIVE SOLUTIONS OF NEUTRAL DELAY DIFFERENTIAL EQUATION

POSITIVE SOLUTIONS OF NEUTRAL DELAY DIFFERENTIAL EQUATION Novi Sad J. Mah. Vol. 32, No. 2, 2002, 95-108 95 POSITIVE SOLUTIONS OF NEUTRAL DELAY DIFFERENTIAL EQUATION Hajnalka Péics 1, János Karsai 2 Absrac. We consider he scalar nonauonomous neural delay differenial

More information

Essential Maps and Coincidence Principles for General Classes of Maps

Essential Maps and Coincidence Principles for General Classes of Maps Filoma 31:11 (2017), 3553 3558 hps://doi.org/10.2298/fil1711553o Published by Faculy of Sciences Mahemaics, Universiy of Niš, Serbia Available a: hp://www.pmf.ni.ac.rs/filoma Essenial Maps Coincidence

More information

An Introduction to Stochastic Programming: The Recourse Problem

An Introduction to Stochastic Programming: The Recourse Problem An Inroducion o Sochasic Programming: he Recourse Problem George Danzig and Phil Wolfe Ellis Johnson, Roger Wes, Dick Cole, and Me John Birge Where o look in he ex pp. 6-7, Secion.2.: Inroducion o sochasic

More information

Diebold, Chapter 7. Francis X. Diebold, Elements of Forecasting, 4th Edition (Mason, Ohio: Cengage Learning, 2006). Chapter 7. Characterizing Cycles

Diebold, Chapter 7. Francis X. Diebold, Elements of Forecasting, 4th Edition (Mason, Ohio: Cengage Learning, 2006). Chapter 7. Characterizing Cycles Diebold, Chaper 7 Francis X. Diebold, Elemens of Forecasing, 4h Ediion (Mason, Ohio: Cengage Learning, 006). Chaper 7. Characerizing Cycles Afer compleing his reading you should be able o: Define covariance

More information

Optima and Equilibria for Traffic Flow on a Network

Optima and Equilibria for Traffic Flow on a Network Opima and Equilibria for Traffic Flow on a Nework Albero Bressan Deparmen of Mahemaics, Penn Sae Universiy bressan@mah.psu.edu Albero Bressan (Penn Sae) Opima and equilibria for raffic flow 1 / 1 A Traffic

More information

Scheduling of Crude Oil Movements at Refinery Front-end

Scheduling of Crude Oil Movements at Refinery Front-end Scheduling of Crude Oil Movemens a Refinery Fron-end Ramkumar Karuppiah and Ignacio Grossmann Carnegie Mellon Universiy ExxonMobil Case Sudy: Dr. Kevin Furman Enerprise-wide Opimizaion Projec March 15,

More information

Subway stations energy and air quality management

Subway stations energy and air quality management Subway saions energy and air qualiy managemen wih sochasic opimizaion Trisan Rigau 1,2,4, Advisors: P. Carpenier 3, J.-Ph. Chancelier 2, M. De Lara 2 EFFICACITY 1 CERMICS, ENPC 2 UMA, ENSTA 3 LISIS, IFSTTAR

More information

Stochastic Model for Cancer Cell Growth through Single Forward Mutation

Stochastic Model for Cancer Cell Growth through Single Forward Mutation Journal of Modern Applied Saisical Mehods Volume 16 Issue 1 Aricle 31 5-1-2017 Sochasic Model for Cancer Cell Growh hrough Single Forward Muaion Jayabharahiraj Jayabalan Pondicherry Universiy, jayabharahi8@gmail.com

More information

The Optimal Stopping Time for Selling an Asset When It Is Uncertain Whether the Price Process Is Increasing or Decreasing When the Horizon Is Infinite

The Optimal Stopping Time for Selling an Asset When It Is Uncertain Whether the Price Process Is Increasing or Decreasing When the Horizon Is Infinite American Journal of Operaions Research, 08, 8, 8-9 hp://wwwscirporg/journal/ajor ISSN Online: 60-8849 ISSN Prin: 60-8830 The Opimal Sopping Time for Selling an Asse When I Is Uncerain Wheher he Price Process

More information

STABILITY OF PEXIDERIZED QUADRATIC FUNCTIONAL EQUATION IN NON-ARCHIMEDEAN FUZZY NORMED SPASES

STABILITY OF PEXIDERIZED QUADRATIC FUNCTIONAL EQUATION IN NON-ARCHIMEDEAN FUZZY NORMED SPASES Novi Sad J. Mah. Vol. 46, No. 1, 2016, 15-25 STABILITY OF PEXIDERIZED QUADRATIC FUNCTIONAL EQUATION IN NON-ARCHIMEDEAN FUZZY NORMED SPASES N. Eghbali 1 Absrac. We deermine some sabiliy resuls concerning

More information

Hamilton- J acobi Equation: Weak S olution We continue the study of the Hamilton-Jacobi equation:

Hamilton- J acobi Equation: Weak S olution We continue the study of the Hamilton-Jacobi equation: M ah 5 7 Fall 9 L ecure O c. 4, 9 ) Hamilon- J acobi Equaion: Weak S oluion We coninue he sudy of he Hamilon-Jacobi equaion: We have shown ha u + H D u) = R n, ) ; u = g R n { = }. ). In general we canno

More information