arxiv: v1 [math.oc] 27 Jul 2009

Size: px
Start display at page:

Download "arxiv: v1 [math.oc] 27 Jul 2009"

Transcription

1 PARTICLE METHODS FOR STOCHASTIC OPTIMAL CONTROL PROBLEMS PIERRE CARPENTIER GUY COHEN AND ANES DALLAGI arxiv: v1 [mah.oc] 27 Jul 2009 Absrac. When dealing wih numerical soluion of sochasic opimal conrol problems sochasic dynamic programming is he naural framework. In order o ry o overcome he so-called curse of dimensionaliy he sochasic programming school promoed anoher approach based on scenario rees which can be seen as he combinaion of Mone Carlo sampling ideas on he one hand and of a heurisic echnique o handle causaliy or nonanicipaiveness consrains on he oher hand. However if one considers ha he soluion of a sochasic opimal conrol problem is a feedback law which relaes conrol o sae variables he numerical resoluion of he opimizaion problem over a scenario ree should be compleed by a feedback synhesis sage in which a each ime sep of he scenario ree conrol values a nodes are ploed agains corresponding sae values o provide a firs discree shape of his feedback law from which a coninuous funcion can be finally inferred. From his poin of view he scenario ree approach faces an imporan difficuly: a he firs ime sages close o he ree roo here are a few nodes or Mone-Carlo paricles and herefore a relaively scarce amoun of informaion o guess a feedback law bu his informaion is generally of a good qualiy ha is viewed as a se of conrol value esimaes for some paricular sae values i has a small variance because he fuure of hose nodes is rich enough; on he conrary a he final ime sages near he ree leaves he number of nodes increases bu he variance ges large because he fuure of each node ges poor and someimes even deerminisic. Afer his dilemma has been confirmed by numerical experimens we have ried o derive new variaional approaches. Firs of all wo differen formulaions of he essenial consrain of nonanicipaiveness are considered: one is called algebraic and he oher one is called funcional. Nex in boh seings we obain opimaliy condiions for he corresponding opimal conrol problem. For he numerical resoluion of hose opimaliy condiions an adapive mesh discreizaion mehod is used in he sae space in order o provide informaion for feedback synhesis. This mesh is naurally derived from a bunch of sample noise rajecories which need no o be pu ino he form of a ree prior o numerical resoluion. In paricular an imporan consequence of his discrepancy wih he scenario ree approach is ha he same number of nodes or poins are available from he beginning o he end of he ime horizon. And his will be obained wihou sacrifying he qualiy of he resuls ha is he variance of he esimaes. Resuls of experimens wih a hydro-elecric dam producion managemen problem will be presened and will demonsrae he claimed improvemens. Key words. sochasic programming; measurabiliy consrains; discreizaion AMS subjec classificaions. 90C15 49M25 62L20 Inroducion. Taking ino accoun uncerainies in he decision process has become an imporan issue for all indusries. Facing he marke volailiies he weaher whims and he changing policies and regulaory consrains decision makers have o find an opimal way o inroduce heir decision process ino his uncerain framework. One way o ake uncerainies ino accoun is o use he sochasic opimizaion framework. In his approach he decision maker makes his decision by opimizing a mean value wih regard o he muliple possible scenarios weighed wih a probabiliy law. Sochasic opimizaion problems ofen involve informaion consrains: he decision maker makes his decision afer geing observaions abou he possible scenarios. In he lieraure wo differen communiies have deal wih his informaion issue using differen modelling echniques and herefore differen soluion mehods. The École Naionale Supérieure de Techniques Avancées 32 boulevard Vicor Paris Cedex 15 France Pierre.Carpenier@ensa.fr Universié de Paris-Es CERMICS École des Pons Champs sur Marne Marne la Vallée Cedex 2 France guy.cohen@mail.enpc.fr EDF R&D 1 avenue du Général de Gaulle Clamar Cedex France anes.dallagi@edf.fr 1

2 2 A. DALLAGI AND G. COHEN AND P. CARPENTIER quesion is: knowing ha sochasic opimizaion problems are ofen infinie dimensional problems how can we implemen racable soluion mehods which can be implemened using a compuer? To answer his quesion each communiy brough is own answers. The sochasic programming communiy models he informaion srucure by scenarios rees: his involves Mone Carlo sampling plus some manipulaions of he sample rajecories o enforce he ree srucure. Then racable soluion mehods for sochasic opimizaion problems consis in solving deerminisic problems over he decision rees. We refer o [ ] for furher deails on hese mehods. Neverheless sochasic programming faces an imporan difficuly: due o he ree srucure one has a few discreizaion poins nodes of he ree a he early sages which could represen a serious handicap when aemping o synhesizing a feedback law. On he conrary when approaching he las sages one has a large number of discreizaion poins bu wih a fuure which may be almos or compleely deerminisic. The sochasic opimal conrol communiy uses special srucures of he sochasic opimizaion problems ime sequenialiy and sae noions o model he informaion consrains hrough a funcional inerpreaion ha leads o he Dynamic Programming Principle. We refer o [4 5 6] for furher deails on his mehod. However sochasic dynamic programming is also confroned wih a serious obsacle known as he curse of dimensionaliy. In fac his mehod leads one o blindly discreize he whole sae space wihou aking ino accoun a generally nonuniform sae disribuion a he opimal soluion. This paper ries o bridge he gap beween hose wo communiies. We propose a racable soluion mehod for sochasic opimal conrol problems which makes use of he good ideas of boh Mone Carlo sampling and variaional mehods on he one hand funcional handling of he informaion srucure on he oher hand. Oher relaed works [8 20] are sill closer o he sochasic programming poin of view. In our approach he same number of discreizaion sample poins are used from he beginning o he end of he ime horizon and his discreizaion grid is adapive: when ransposed ino he sae space i reflecs he opimal sae disribuion a each ime sage. The proposed mehod is of a variaional naure in ha i is based on gradien calculaions which involve he forward sae variable inegraion and he backward adjoin sae co-sae evaluaion as in he Ponryagin minimum principle. The purpose is o solve he Kuhn-Tucker necessary opimaliy condiions of he considered problem including he informaion consrains. This paper is organized as follows. In 1 we discuss how o model he informaion srucure in sochasic opimizaion problems. Then we derive opimaliy condiions for sochasic opimizaion problems wih informaion consrains. In 2 hese opimaliy condiions are specialized o he siuaion of sochasic opimal conrol problems. A special aenion is paid o he so-called Markovian case in 2.4. The numerical implemenaion of a resoluion mehod based on Mone Carlo and funcional approximaions is presened in 3. Finally in 4 a case sudy namely a hydro-elecric dam managemen problem illusraes he proposed mehod and compares i o he sandard scenario ree approach. 1. Preliminaries. In his secion we presen he main framework of his paper: how o model sochasic opimizaion problems and how o represen he informaion srucure of such problems. This preliminary secion is direcly inspired from he

3 PARTICLE METHODS FOR STOCHASTIC OPTIMIZATION PROBLEMS 3 works of he Sysem and Opimizaion Working Group 1 [2 19 9]. In he sequel of he paper he random variables defined over a probabiliy space Ω A P will be denoed using bold leers e.g. ξ L 2 Ω A P; Ξ whereas heir realizaions will be denoed using normal leers e.g. ξ Ξ Modelling informaion. When unknown facors affec a process ha we ry o conrol we consider a se Ω of possible saes of naure ω among which he rue sae ω 0 is supposed o be. Roughly speaking an informaion srucure is a pariion of Ω ino a collecion of subses G. A poseriori observaions will help us o deermine in which paricular subse of his pariion he rue sae ω 0 lies. Two exreme cases may be menioned. 1. If he pariion is simply he crude pariion { Ω} hen he a poseriori observaions are useless. We will refer o his siuaion as an open-loop informaion srucure. 2. If he pariion is he fines possible one namely all subses G are singleons hen he a poseriori observaions will ell us exacly wha is he rue sae of naure ω 0. This is he siuaion of perfec knowledge prior o making our decisions. In beween afer he a poseriori observaions become available we will remain wih some uncerainy abou he rue ω 0 ha is we will know in which paricular G he rue sae lies bu all ω s in ha G are sill possible. Moreover in dynamic siuaions new observaions may become available a each ime sage and a pariion of Ω corresponding o his informaion srucure mus be considered a each ime sage. We refer o he general case as a closed-loop decision process. In order o have a framework in which various operaions on informaion srucures become possible probabiliy heory inroduces so-called σ-algebras or σ-fields random variables and a pre-order relaion beween random variables. The laer is called measurabiliy. 2 We refer he reader o [17] for furher deails on informaion srucure and probabiliy heory. As far as his paper is concerned we here sae a resul found in [17 Theorem 8 p.108] giving he main properies of he measurabiliy relaion beween random variables. Proposiion 1.1 Measurabiliy relaion. Le Ω A P be a probabiliy space le Y 1 : Ω Y 1 and Y 2 : Ω Y 2 be wo random variables aking heir values in Y 1 and Y 2 respecively. The following saemens are equivalen: 1. Y 1 Y 2 2. σy 1 σy 2 3. φ : Y 2 Y 1 a measurable mapping such ha Y 1 = φ Y 2 4. Y 1 = EY 1 Y 2 P-a.s Modelling a sochasic opimizaion problem. In his secion we consider wo inerpreaions of a sochasic opimizaion problem: an algebraic one in which he informaion consrain is modelled hrough he sandard measurabiliy relaion Saemen 1 of Proposiion 1.1 and a funcional inerpreaion in which we use 1 SOWG École Naionale des Pons e Chaussées: Laeiia Andrieu Kengy Bary Pierre Carpenier Jean-Philippe Chancelier Guy Cohen Anes Dallagi Michel De Lara Pierre Girardeau Babakar Seck Cyrille Srugarek. 2 A random variable Y 1 is measurable wih respec o anoher random variable Y 2 if and only if he σ-field generaed by he firs is included ino he one generaed by he second: σy 1 σy 2. In his paper we will no give furher deails on hese definiions. Insead we refer he reader o [7] for furher deails on he probabiliy and measurabiliy heory.

4 4 A. DALLAGI AND G. COHEN AND P. CARPENTIER he funcional equivalen of measurabiliy relaion Saemen 3 of Proposiion Algebraic inerpreaion. Le Ω A P be a probabiliy space and le ξ : Ω Ξ be a random variable aking values in Ξ := R d ξ noise space. We denoe by U := R du he conrol space and by U he funcional space L 2 Ω A P; U. The cos funcion J : U R is defined as he expecaion of a normal inegrand 3 j : U Ξ R JU := E juξ = j Uω ξω dpω. The feasible se U fe := U as U me accouns for wo differen consrains: almos sure consrains: Ω U as := { U U Uω Γ as ω P-a.s. } 1.1 where Γ as : Ω U is a measurable se-valued mapping see [18 Definiion 14.1] which is convex and closed valued and measurabiliy consrains: U me := { U U U is G measurable } 1.2 where G is a given sub-σ-field of A. From heir respecive definiions i is easy o prove ha U me is a closed subspace of U indeed L 2 Ω G P; U and ha U as is a closed convex subse of U. The opimizaion problem under consideraion is o minimize he cos funcion JU over he feasible subse U fe. The firs model represening he sochasic opimizaion problem is hus min JU s.. U U U Ufe. 1.3 This inerpreaion is called algebraic as i uses an algebraic relaion measurabiliy o define he informaion srucure of Problem Funcional inerpreaion. According o Proposiion 1.1 a funcional model for sochasic opimizaion problems is available in which he opimizaion is achieved wih respec o funcions called feedbacks. Indeed le Y : Ω Y be a random variable called he observaion aking value in Y := R dy Φ be he space L 2 Y B o Y P Y ; U where B o Y is he Borel σ-field of Y and P Y he image of he probabiliy measure P by Y Φ as be he subse of Φ defined by Φ as := { φ Φ φ Y ω Γ as ω P-a.s. }. We define he cos funcion J : Φ R as Jφ := E j φy ξ. We are ineresed in he funcional opimizaion problem: min J φ s.. φ Φ as. 1.4 φ Φ 3 See [18 Definiion 14.27]. We remind ha he normal inegrand assumpion is done o ensure measurabiliy properies [18 Proposiion 14.28].

5 PARTICLE METHODS FOR STOCHASTIC OPTIMIZATION PROBLEMS 5 Proposiion 1.2. If σy = G hen Problem 1.3 is equivalen o Problem 1.4 in he sense ha if U is soluion of 1.3 resp. φ soluion of 1.4 hen here exiss φ soluion of 1.4 resp. U soluion of 1.3 such ha U = φ Y P-a.s.. Proof. This is a sraighforward consequence of Proposiion 1.1 and of he definiion of he feasible ses in boh problems Opimaliy condiions for a sochasic opimizaion problem. Previous works deal wih opimaliy condiions for sochasic opimal conrol problems [15 13]. This paragraph can be viewed as a sligh exension of [15] when considering a sochasic opimizaion problem subjec o almos-sure and measurabiliy consrains. We consider Problem 1.3 presened in 1.2. We recall ha he feasible se U fe is he inersecion of a closed convex subse U as and of a linear subspace U me. In order o apply he resuls given in Appendix A we firs esablish he following lemma. Lemma 1.3. Le U as be he convex se defined by almos sure consrains 1.1 and le U me be he linear subspace defined by measurabiliy consrains 1.2. We assume ha Γ as is a G-mesurable and closed convex valued mapping. Then proj U as U me U me. Proof. We firs prove ha proj U as U ω = projγ as ω Uω P-a.s.. In- deed le FV := 1 2 V U 2 U + χ V. From he definiion of almos sure consrains we have FV = Ω f V ω ω U as dpω wih fv ω := 1 2 v Uω 2 U + χ Γ as ω v. By definiion of he projecion proj U as U is soluion of he opimizaion problem min f V ω ω dpω. V U Ω Using [18 Theorem 14.60] inerchange of minimizaion and inegraion we obain proju as U ω arg min fv ω = { } proj Γ as ω Uω P-a.s. 1.5 v U hence he claimed propery. Le us consider any U U me. We deduce from 1.5 ha proju as 1 U ω = argmin v Uω 2 v Γ as ω 2 U P-a.s.. From [1 Theorem ] measurabiliy of marginal funcions we deduce from he G-measurabiliy of boh U and Γ as ha he arg min funcion proj U as U is also a G-measurable funcion which means ha proj U as U U me. The main resul of his secion is given by he following heorem. Theorem 1.4. Assume ha Γ as is a G-mesurable and closed convex valued mapping ha funcion j is a normal inegrand such ha j ξ is differeniable P-a.s. and ha j u U ξ U for all U U. Le U be a soluion of Problem 1.3. Then E j u U ξ G χu asu. 1.6

6 6 A. DALLAGI AND G. COHEN AND P. CARPENTIER Proof. The differeniabiliy of J : U R is a sraighforward consequence of boh he inegral expression of J and he differeniabiliy assumpion on j. Moreover he expression of he derivaive of J is given by J Uω = j u Uω ξω P-a.s.. Le U be a soluion of 1.3. From Lemma 1.3 and Proposiion A.2 in he Appendix we obain proj U me J U χ U asu his las expression being equivalen o 1.6 hanks o he characerizaion of he condiional expecaion as a projecion and he expression of J. Using he equivalen saemens A.2c he opimaliy condiions given by Theorem 1.4 can be reformulaed as U = proj U as U ǫproj U me JU ǫ > 0. This las formulaion can be used in pracice o solve Problem 1.3 using a projeced gradien algorihm. The projecion over U as involves random variables bu we saw in he proof of Lemma 1.3 ha proj U as can be performed in a poinwise manner ω per ω so ha he implemenaion of he algorihm is effecive. Remark 1. The opimaliy condiions 1.6 which are given in erms of random variables also have a poinwise inerpreaion. As a maer of fac using he equivalen saemens A.2 and he poinwise inerpreaion of proj U as i is sraighforward o prove ha R χ U asu implies Rω χ Γ as ω U ω P-a.s.. 2. Sochasic opimal conrol problems. Sochasic opimal conrol problems res upon he same framework as sochasic opimizaion problems: hey can be modelled as closed-loop sochasic opimizaion problems wih a sequenial ime srucure. In his secion we deal wih he algebraic inerpreaion of sochasic opimal conrol problems in a discree ime framework. We will derive opimaliy condiions following he same principle as in Problem formulaion. We consider a sochasic opimal conrol problem in discree ime T denoing he ime horizon. A each sage = 0... T we denoe by W := R dw he noise space a ime. Le W be a se of random variables defined on Ω A P and aking values in W and le W := W 0 W T. The noise process of he problem is a random vecor W W such ha W = W 0...W T wih W W. A each sage = 0... T 1 we denoe by U := R du he conrol space and by U := L 2 Ω A P; U he space of square inegrable random variables aking values in U. A he decision maker makes a decision a conrol U U. Le U := U 0 U T 1. The conrol process of he problem is a random vecor U U such ha U = U 0... U T 1 wih U U. We also assume ha each conrol variable U is subjec o almos-sure consrains. More precisely for all = 0...T 1 le Γ as : Ω U be a se-valued mapping random se le U as := { U U ω Γ as ω P-a.s.} and le U as := U0 as UT as 1. The almos-sure consrains wries U U as = 0...T

7 PARTICLE METHODS FOR STOCHASTIC OPTIMIZATION PROBLEMS Dynamics. A each sage = 0...T we denoe by X := R dx he sae space a ime. Le X be a se of random variables defined on Ω A P and aking values in X and le X := X 0... X T. The sae process of he problem is a random vecor X X such ha X = X 0...X T wih X X. I arises from he dynamics f : X U W +1 X +1 of he sysem namely X 0 = W 0 P-a.s. X +1 = f X P-a.s. = 0...T a 2.2b Remark 2. The shif on he ime index beween he random variable X and he conrol U on he one hand and he noise W +1 on he oher hand enlighens he fac ha we are in he so-called decision-hazard scheme: a ime he decision maker chooses a conrol variable U before having any informaion abou he noise W +1 which affecs he dynamics of he sysem Cos funcion. We consider a each sage = 0...T 1 an inegral cos funcion L : X U W +1 R. Moreover a he final sage T we consider a final cos funcion K : X T R. Summing up all hese insananeous coss we obain he overall cos funcion of he problem jx u w := T 1 =0 L x u w +1 + Kx T and he decision maker has o choose he conrol variables U in order o minimize he overall cos expecaion T 1 JX U := E L X + KX T. 2.3 = Informaion srucure. Generally speaking he informaion available on he sysem a ime is modelled as a random variable Y : Ω Y where Y := R dy is he observaion space. Le Y be he se of random variables aking heir values in Y. We suppose ha here exiss an observaion mapping h : W 0 W T Y such ha Y = h W 0...W T P-a.s.. For all = 0...T we denoe by G = σy he sub-σ-field of A generaed by he random variable Y : 4 G = σ h W 0... W T. The decision maker knows Y when choosing he appropriae conrol U a ime { so ha he informaion consrain is U Y. Using he noaions U me = U U is G measurable } and U me = U0 me UT me 1 he measurabiliy consrains of he problem wries U U me = 0...T We also inroduce he sequence of σ-fields F associaed wih he noise =0...T process W where F is he σ-field generaed by he noises prior o : F = σ W 0...W = 0...T. This sequence is a filraion as i saisfies he inclusions F 0 F 1... F T A. When G = F we are in he case of complee causal informaion. 4 Noe ha he σ-fields G s are fixed in he sense ha hey do no depend on he conrol variable U.

8 8 A. DALLAGI AND G. COHEN AND P. CARPENTIER Opimizaion problem. According o he noaion given in he previous paragraphs he sochasic opimal conrol problem we consider here is min U X U X T 1 E L X + KX T 2.5a =0 subjec o boh he dynamics consrains 2.2 and he conrol consrains U U as U me = 0...T b We follow here he algebraic inerpreaion of an opimizaion problem given in 1.2 as he informaion srucure is defined using a measurabiliy relaion. In he way we modelled Problem 2.5 we have o minimize he cos funcion JUX wih respec o boh U and X under he dynamics consrains 2.2. Bu he sae variables X are in fac inermediary variables and i is possible o eliminae hem by recursively incorporaing he dynamics equaions 2.2 ino he cos funcion 2.3. The resuling cos funcion J only depends on he conrol variables U. Is expression is JU = E ju W wih ju w = T 1 =0 L u 0...u w 0... w +1 + Ku 0... u T 1 w 0...w T. In his seing Problem 2.5 is equivalen o min U U JU s.. U U as U me. 2.6 The derivaives of he cos funcion J can be obained from he derivaives of J using he well known adjoin sae mehod. This is based on he following resul saed here wihou proof. Proposiion 2.1. Assuming ha funcions f and L are coninuously differeniable wih respec o heir firs wo argumens and ha funcion K is coninuously differeniable he parial derivaives of j wih respec o u are given by j u u w = L u x u w +1 + λ +1f u x u w +1 where he sae vecor x 0...x T saisfies he forward dynamics equaion x 0 = w 0 x +1 = f x u w +1 whereas he adjoin sae or co-sae vecor λ 0...λ T is chosen o saisfy he backward dynamics equaion λ T = K x T λ = L x x u w +1 + f x x u w +1 λ Assumpions. In order o derive opimaliy condiions for Problem 2.5 we make he following assumpions. Assumpion 1 Consrains srucure. Γ as are closed convex se-valued mappings = 0...T 1.

9 PARTICLE METHODS FOR STOCHASTIC OPTIMIZATION PROBLEMS 9 Assumpion 2 Differeniabiliy. 1. Funcions f dynamics and L cos are coninuous differeniable wih respec o heir firs wo argumens sae and conrol = 0...T Funcion K final cos is coninuously differeniable. 3. Funcions L and f are normal inegrands = 0...T The derivaives of f L and K are square inegrable = 0... T 1. Assumpion 3 Nonanicipaiviy and measurabiliy. 1. G F = 0...T. 2. Mappings Γ as are G -measurable = 0...T 1. Assumpion 2 will allow us all inegraion and derivaion operaions needed for obaining opimaliy condiions. The measurabiliy condiion on Γ as in Assumpion 3 expresses ha he almos-sure consrains mus a leas have he same measurabiliy as he decision variables. The firs condiion in Assumpion 3 expresses he causaliy of he problem: he decision maker has no access o informaion in he fuure! Under his assumpion here exiss a measurable mapping h : W 0 W Y such ha he informaion variable Y wries Y = h W 0...W P-a.s.. From Assumpion 1 and using Lemma 1.3 he following propery is readily available. Proposiion 2.2 Consrains srucure. For all = 0...T 1 1. U as is a closed convex subse of U 2. proj U as U me U me Opimaliy condiions in sochasic opimal conrol problems. We presen here necessary opimaliy condiions for he sochasic opimal conrol problem 2.5 which are an exension of he condiions given in Theorem Non-adaped opimaliy condiions. A firs se of opimaliy condiions is given in he nex heorem. Theorem 2.3. Le he wo random processes X =0...T X and U =0...T 1 U be a soluion of Problem 2.5. Suppose ha Assumpion 1 2 and 3 are saisfied. Then here exiss a random process λ =0...T X such ha for all = 0... T 1 X 0 = W 0 X +1 = f X 2.7a 2.7b λ T = K X T λ = L x X + f x X λ c 2.7d E L ux + λ +1 f ux G χu asu. 2.7e Proof. From he equivalence beween Problem 2.5 and Problem 2.6 we obain using Theorem 1.4 ha he soluion U saisfies proj U me J U χ U asu

10 10 A. DALLAGI AND G. COHEN AND P. CARPENTIER and herefore E j u UW G χu asu for all = 0... T 1. The desired resul follows from Proposiion 2.1. The condiions given by Theorem 2.3 are called non-adaped opimaliy condiions because he dual random process λ =0...T is no adaped o he naural filraion F =0...T ha is λ generally depends on he fuure. We will see in he nex secion ha similar opimaliy condiions can be wrien wih help of an adaped dual random process Adaped opimaliy condiions. The following heorem presens opimaliy condiions involving an adaped dual random process. Theorem 2.4. Le he wo random processes X =0...T X and U =0...T 1 U be a soluion of Problem 2.5. Assume ha Assumpion 1 2 and 3 are saisfied. Then here exiss a process Λ =0...T X adaped o he filraion F =0...T such ha for all = 0... T 1 X 0 = W 0 X +1 = f X 2.8a 2.8b Λ T = K X T Λ = E L x X + f x X Λ +1 F 2.8c 2.8d E L ux + Λ +1 f ux G χu asu. 2.8e Proof. All assumpions of Theorem 2.3 are me so ha here exiss a random process λ =0...T saisfying 2.7. Define for all he random variable Λ by Λ := E λ F. By consrucion he process Λ =0...T is adaped o he filraion F =0...T. A sage T we have Λ T = Eλ T F T = E K X T F T = K X T because X T is F T -measurable. For all = T using he law of oal expecaion E F = E E F +1 F and since all variables X and W +1 are F +1 -measurable we deduce from 2.7 ha Λ = E L x X + f x X Eλ +1 F +1 F = E L x X + f x X Λ +1 F hence he adaped backward dynamics equaions given in 2.8. Assumpion 3 implies ha G F F +1. Using E G = E E F +1 G and he measurabiliy properies of X and W +1 he las opimaliy condiion in 2.7 becomes E L ux + Eλ +1 F +1f ux G χu asu hence he las opimaliy condiion given in 2.8. Noe ha in he opimaliy condiions given in Theorem 2.4 a each sage he gradien is projeced over he subspace generaed by he observaion σ-field G whereas he adaped dual random variable is projeced over he subspace generaed by F which corresponds o he naural filraion of Problem 2.5.

11 PARTICLE METHODS FOR STOCHASTIC OPTIMIZATION PROBLEMS Opimaliy condiions in he Markovian case. As far as informaion srucure is concerned he opimaliy condiions 2.7 and 2.8 were obained assuming only nonanicipaiveness see Assumpion 3 and he fac ha he observaion σ- fields G =0...T do no depend on he decisions U =0...T 1. The las assumpion allows us o avoid he so-called dual effec of conrol see [3] for furher deails. Anoher feaure ofen available in pracice for he informaion srucure is he perfec memory propery which inuiively means ha he informaion is no los over ime. The las propery implies ha G =0...T is a filraion namely G G +1 = 0...T 1. We will assume in he sequel ha he perfec memory propery holds and we will moreover assume complee causal noise observaion so ha G = F = 0...T. 5 Then boh opimaliy condiions 2.7 and 2.8 involve condiional expecaions wih respec o a random observaion variable he dimension of which increases wih ime a new noise random variable becomes available a each sage. This leads o a compuaional difficuly of he same naure as he so-called curse of dimensionaliy. To address his difficuly i would be an easier siuaion o have a consan dimension for he observaion space as in he sochasic dynamic programming principle [5 6] when he opimal conrol a only depends on he sae variable a he same ime sage. We hus consider new more resricive assumpions which mach he sochasic opimal conrol framework an lead us o he desired siuaion. Assumpion 4 Markovian case. 1. G = F = 0...T perfec memory and causal noise observaion. 2. The random variables W 0...W T are independen whie noise. 3. The mappings Γ as are consan deerminisic consrains: = 0...T 1 Γ as U Γ as ω = Γas P-a.s.. The sandard formulaion of a sochasic opimal conrol problem in he Markovian case is o assume ha he sae is compleely and perfecly observed. The problem formulaion is accordingly min U X U X T 1 E L X + KX T 2.9a =0 subjec o boh he dynamics consrains 2.2 and he conrol consrains U Γ as P-a.s. and U X = 0...T b We now consider he opimaliy condiions 2.7 and 2.8 and we specialize hem o he Markovian case Markovian case: non-adaped opimaliy condiions. We presen a non-adaped version of he opimaliy condiions of Problem 2.5 wih Markovian assumpions. We begin by presening a resul inspired by he sochasic dynamic programming principle. Theorem 2.5. Suppose ha Assumpions 1 2 and 4 are fulfilled and assume ha here exis wo random processes U =0...T 1 U and X =0...T X soluion 5 Noe ha complee causal noise observaion implies he perfec memory propery as far as F =0...T is a filraion.

12 12 A. DALLAGI AND G. COHEN AND P. CARPENTIER of Problem 2.5. Then here exiss a process λ =0...T 1 X saisfying 2.7 and such ha for all = 0...T 1 a λ +1 X +1 W +2...W T b U X c E L ux + λ +1 f ux F = E L u X + λ +1 f u X X P-a.s.. Proof. Firs Assumpion 3 being implied by Assumpion 4 he exisence of he process λ =0...T X saisfying 2.7 is given by Theorem 2.3. Denoing by H he Hamilonian a ime namely H x u w λ = L x u w+λ f x u w opimaliy condiions 2.7d and 2.7e wrie λ = H x X λ a E H u X λ +1 F χu asu. 2.10b The proof of saemens a and b is obained by inducion. For he sake of simpliciy we firs prove he resul when U as = U so ha 2.10b reduces o he equaliy condiion: E H ux λ +1 F = c A sage T we know from 2.7c ha λ T = µ T 1 X T wih µ T 1 being a measurable funcion 6 and hence λ T X T. 2.11a Then using 2.7b he opimaliy condiion 2.10c akes he form: E H T 1 u XT 1 U T 1 W T µ T 1 f T 1 X T 1 U T 1 W T FT 1 = 0. X T 1 and U T 1 being boh F T 1 -measurable random variables and W T being independen of F T 1 whie noise assumpion we deduce ha he condiional expecaion in he las expression reduces o an expecaion. Le G T 1 denoes he funcion resuling from is inegraion namely G T 1 x u = E H T 1 u x u WT µ T 1 f T 1 x u W T. G T 1 is a mesurable mapping 7 and he opimaliy condiion wries G T 1 X T 1 U T 1 = 0. Using he measurable selecion heorem available for implici measurable funcions [14 Theorem 8] 8 we deduce ha here exiss a measurable mapping γ T 1 : X T 1 U T 1 such ha G T 1 XT 1 γ T 1 X T 1 = 0. As a 6 In fac µ T 1 = K. From Assumpion 2 µ T 1 is a coninuous mapping. 7 in fac a coninuous one from Assumpion 2 8 See for insance [21 Secion 7] for a survey of measurable selecion heorems corresponding o he implici case. Noe ha [14 Theorem 8] needs a paricular assumpion concerning he σ-field equipping X T 1. We assume here ha such assumpion holds.

13 PARTICLE METHODS FOR STOCHASTIC OPTIMIZATION PROBLEMS 13 conclusion he conrol variable U T 1 = γ T 1 X T 1 saisfies he opimaliy condiion 2.10c a = T 1 and is such ha U T 1 X T b A sage assume ha λ +1 X +1 W +2...W T. Then here exiss a measurable funcion µ such ha λ +1 = µ X +1 W W T. 2.12a The opimaliy condiion 2.10c a sage akes he form E H u X µ f X W W T F = 0. Wih he same reasoning as a sage T we deduce ha his condiional expecaion reduces o an expecaion so ha he opimaliy condiion wries G X = 0 G being a measurable funcion given by G x u = E H u x u µ f x u W W T. Using again [14 Theorem 8] we deduce ha here exiss a measurable mapping γ : X U such ha U = γ X saisfies he opimaliy condiion 2.10c a. We have accordingly U X. 2.12b Ulimaely saring from he opimaliy condiion 2.10a namely λ = H x X λ +1 using he inducion assumpion 2.12a ogeher wih 2.12b and 2.7c we obain ha λ = H x X γ X µ f X γ X W W T. We conclude ha λ X... W T so ha he desired resul holds rue. 9 Le s go now o he general case U as U. From A.2 he opimaliy condiion 2.10b is again equivalen o an equaliy condiion: proj U as U ǫe H u X λ +1 F U = 0 and he same argumens as in he previous case remain valid. A las from U X λ +1 X... W T and he whie noise assumpion we deduce ha λ +1 depends on F only hrough X so ha E L ux + λ +1 f ux F = E L u X + λ +1 f u X X. 9 Noe ha we obained as an inermediae resul ha λ +1 `X... W T.

14 14 A. DALLAGI AND G. COHEN AND P. CARPENTIER Saemen c is hus saisfied and he proof is complee. In he Markovian case he opimal soluion of Problem 2.5 measurabiliy wih respec o all pas noise variables saisfies he measurabiliy consrains of Problem 2.9 measurabiliy wih respec o he curren sae variable. The wo problems are equivalen same min and argmin. In fac he feasible se of 2.5 conains he feasible se of 2.9 and Theorem 2.5 shows us ha any opimal soluion of 2.5 is feasible for 2.9 and is herefore also opimal also for 2.9. Ulimaely he opimaliy condiions of Problem 2.9 can be wrien as X 0 = W 0 X +1 = f X 2.13a 2.13b λ T = K X T λ = L x X + f x X λ c 2.13d E L u X +λ +1 f u X X χu asu. 2.13e Remark 3. Le G 1 =0...T 1 and G 2 =0...T 1 be he gradien processes associaed wih 2.7 and 2.13 respecively: G 1 :=E L u X + f u X λ +1 F G 2 :=E L u X + f u X λ +1 X. Unlike Problem 2.5 we are unable o compue he opimaliy condiions 2.13 of Problem 2.9 by differeniaing he Lagrangian funcion because he condiioning erm X depends iself on he conrol variables U. Consequenly G 2 is no claimed o represen he projeced gradien of Problem 2.9. The equaliy beween he gradien G 1 and G2 holds rue only a he opimum Markovian case: adaped opimaliy condiions. We now presen he adaped version of he opimaliy condiions of Problem 2.5 under Markovian assumpions. Theorem 2.6. Suppose ha Assumpions 1 2 and 4 are fulfilled and assume ha here exiss wo random processes U =0...T 1 U and X =0...T 1 X soluion of Problem 2.5. Then here exiss a process Λ =0...T 1 X saisfying 2.8 and such ha for all = 0...T 1 a Λ +1 X +1 b U X c E L u X + Λ +1 f u X F = E L ux + Λ +1 f ux X P-a.s.. Proof. The proof follows he same scheme as he one of Theorem 2.5. We jus poin ou ha λ +1 X +1 W W T and Λ +1 = Eλ +1 F +1 implies ha Λ +1 X +1.

15 PARTICLE METHODS FOR STOCHASTIC OPTIMIZATION PROBLEMS 15 From he equivalence beween Problem 2.5 measurabiliy wih respec o he pas noise and Problem 2.9 measurabiliy wih respec o he sae in he Markovian framework we may consider ha he opimaliy condiions of Problem 2.9 in he adaped version are X 0 = W 0 X +1 = f X 2.14a 2.14b Λ T = K X T Λ = E L x X + f x X Λ +1 X 2.14c 2.14d E L ux +Λ +1 f ux X χu asu. 2.14e The opimaliy condiions 2.13 and 2.14 involve condiional expecaions wih respec o he sae variable he dimension of which is in mos cases fixed ha is i does no depend on he ime sage. In order o solve Problem 2.5 and equivalenly Problem 2.9 we have o discreize hose condiions and in paricular o approximae he condiional expecaions. The lieraure on condiional expecaion approximaion only offers biased esimaors wih an inegraed squared error depending on he dimension of he condiioning erm. On he conrary approximaing an expecaion hrough a Mone-Carlo echnique involves non-biased esimaors he variance of which does no depend on he dimension of he underlining space. In he nex secion we propose a funcional inerpreaion of he sochasic opimal conrol problem in order o ge rid of hose condiional expecaions and deal only wih expecaions Opimaliy condiions from a funcional poin of view. Consider he sochasic opimal conrol Problem 2.5. Under Markovian assumpions we have shown in 2.3 ha i is equivalen o Problem 2.9 and ha 2.14 is a se of necessary opimaliy condiions. Hereafer we ransform he opimaliy condiions 2.14 using Theorem 2.6 and he funcional inerpreaion of he measurabiliy relaion beween random variables see Proposiion 1.1. Theorem 2.7. Suppose ha Assumpions 1 2 and 4 are fulfilled and assume ha here exis wo random processes U =0...T 1 U and X =0...T 1 X soluion of Problem 2.5. Le Λ =0...T 1 X be a random process saisfying he opimaliy condiions 2.8. Then here exiss wo sequences of mappings Λ =0...T and φ =0...T 1 Λ : X X and φ : X U such ha for all = 0...T 1 Λ = Λ X U = φ X 2.15a 2.15b Λ = E E L x ` φ + f x ` φ Λ+1`f φ 2.15c ` φ L u` φ + Λ +1`f φ f u χ Φ asφ 2.15d wih Φ as := { φ L 2 X B o X P X ; U φ x Γ as x X } and PX he probabiliy measure associaed wih X.

16 16 A. DALLAGI AND G. COHEN AND P. CARPENTIER Proof. By Theorem 2.6 he random processes U =0...T 1 X =0...T and Λ =0...T 1 are such ha U X and Λ X. By Proposiion 1.1 here exis measurable mappings φ : X U and Λ : X X such ha U = φ X and Λ = Λ X. From U U = L 2 Ω A P; U we deduce ha φ L 2 X B o X P X ; U : Ω U ω 2 dpω = Since U U as Ω φ X ω 2 dpω = X φ x 2 dp X x < +. φ X ω Γ as P-a.s. we obain ha φ Φ as. The co-sae dynamic equaions in 2.14 rewries Λ X = E L x `X φ X + f x `X φ X Λ+1X +1 which using X +1 = f X φ X becomes Λ X = E L x `X φ X X + f x `X φ X Λ+1`fX φ X X. From he whie noise assumpion he random variable W +1 is independen of X so ha he condiional expecaion urns ou o be jus an expecaion wih respec o he noise random variable. The co-sae dynamics equaion wries accordingly as a funcional equaliy: Λ = E L x φ + f x φ Λ +1`f φ. Using similar argumens we easily obain from he las condiion in 2.14: E L u φ + Λ +1`f φ f u φ χ Φ asφ which complees he proof. Theorem 2.7 provides he new funcional opimaliy condiions 2.15 for Problem 2.5 in he Markovian case. These opimaliy condiions do no involve condiional expecaions bu jus expecaions. Therefore we may hope ha in he approximaion process of hese condiions we will obain non biased esimaes he variance of which will no depend on he dimension of he sae space. 3. Adapive discreizaion echnique. In his secion we develop racable numerical mehods for obaining he soluion of Problem 2.5. We will limi ourselves here o he Markovian framework bu mehods for all cases described in 2 can be found in [9]. We briefly discuss wo classical soluion mehods namely sochasic programming and dynamic programming. Then we presen an adapive mesh algorihm which consiss in discreizing he opimaliy condiions obained a 2.4 and in using hem in a gradien-like algorihm Discree represenaion of a funcion. As far as numerical resoluion is concerned we need o manipulae funcions which are infinie dimensional objecs and which in mos cases do no have a closed-form expression. Thus we mus have a discree represenaion of such an objec. Le φ : X U be a funcion. We suppose ha we have a disposal a fixed or variable grid x in X ha is a collecion of elemens in X: x = x i i=1...n X n.

17 PARTICLE METHODS FOR STOCHASTIC OPTIMIZATION PROBLEMS 17 A poin x i will be called a paricle. In order o obain a discree represenaion of φ we define is race ha is a grid u on U which corresponds o he values of φ on x: u = φx i i=1...n Un. We denoe by T U : U X X n U n he race operaor which wih any funcion φ : X U associaes he couple of grids x u ha is he n poins x i φx i X U. On he oher hand knowing he race of φ we need o compue an approximaion of he value aken by φ a any x X. Le R U : X n U n U X be an inerpolaion-regression operaor which wih any couple of grids x u associaes φ : X U represening he iniial funcion. Such an inerpolaion-regression operaor may be defined in differen ways polynomial inerpolaion kernel approximaion closes neighbor ec Sochasic programming. One way o discreize sochasic opimal conrol problems of ype 2.5 is o model he informaion srucure as a decision ree. 1. Firs simulae a given number N of scenarios W kk=1...n =0...T of he noise process. Then by some ree generaion procedures see e.g. [16] or [12] organize hese scenarios in a scenario ree a any node of any ime sage here is a single pas rajecory bu muliple fuures: see Figure 3.1. =0 =1 =2 =3 =T =0 =1 =2 =3 =T N scenarios Scenarios ree Fig From scenarios o a ree 2. Second wrie he componens of he problem sae dynamics and cos expecaion on he scenario ree noe ha he informaion consrains are buil-in in such a ree srucure. Then he approximaion on he scenario ree of Problem 2.5 is solved using an appropriae deerminisic non-linear programming package. The opimal soluion consiss of sae and conrol paricles a each node of he ree. An inerpolaionregression procedure as suggesed a 3.1 has o be performed a each ime sage in order o synhesize a feedback law. Such a mehodology is relaively easy o implemen and need no in fac any Markovian assumpion. Neverheless i faces a serious drawback: a he firs ime

18 18 A. DALLAGI AND G. COHEN AND P. CARPENTIER sages close o he ree roo only a few paricles or nodes are available and experience shows ha he esimaes of he opimal feedback hey provide has a relaively weak variance because he number of pending scenarios a each such node is sill large enough; on he conrary a he final ime sages near he ree leaves a large number of paricles is available bu wih a huge variance. In boh cases he feedback synhesis he inerpolaion-regression process from he available grid will be inaccurae. Such an observaion will be highlighed laer on he case sudy Sochasic dynamic programming. We consider Problem 2.5 in he Markovian case which is hus equivalen o Problem 2.9. From Theorem 2.5 he opimal conrol process U =0...T 1 can be searched as a collecion of feedback laws φ =0...T 1 depending on he process sae X =0...T ha is U = φ X P-a.s.. According o he Dynamic Programming Principle he resoluion is buil up backward from = T o = 0 by solving he Bellman equaion a each ime sage for all sae values x X. Such a principle leads o he following algorihm [5 6]. Algorihm 1 Sochasic Dynamic Programming. A sage T define he Bellman funcion V T as V T x := Kx x X T. Recursively for = T compue he Bellman funcion V as V x = min E L x u + V +1 f x u x X u Γ as he opimal feedback law φ being obained as φ x = arg min E L x u + V +1 f x u x X. u Γ as This algorihm is only concepual because i operaes on infinie dimensional objecs V and expecaions canno always be evaluaed analyically. We mus indeed manipulae hose objecs as indicaed a 3.1. For every = 0...T le x := x i i=1...n be a fixed grid of n discreizaion poins in he sae space X. We approximae he funcions appearing in Algorihm 1 by heir race over ha grid: v i = V x i i = 1...n = 0...T u i = φ x i i = 1...n = 0...T 1. We also need o approximae he expecaions by he Mone Carlo mehod. Le W kk=1...n =0...T denoe N independen and idenically disribued scenarios of he random noise process. The discreized sochasic dynamic programming algorihm is as follows. Algorihm 2 Discreized Sochasic Dynamic Programming. A sage T compue he race v T of he Bellman funcion V T : v i T = V Tx i T i = 1... n T

19 PARTICLE METHODS FOR STOCHASTIC OPTIMIZATION PROBLEMS 19 Recursively for = T approximae V +1 by inerpolaion-regression: V +1 = R R x +1 v +1 compue he wo grids v and u ha is for each i = 1... n v i 1 = min u Γ as N u i = argmin u Γ as N [ k=1 1 N L x i u W k +1 + V+1 f x i u W k +1 ] N [ L x i u W k +1 + V+1 f x i u W k +1 ] k=1 and obain he feedback law as φ = R U x u. Remark 4. The inerpolaion-regression operaor is in mos cases mandaory in Algorihm 2. As a maer of fac for a given ime sage and a given conrol value u U here usually does no exis any index j such ha x j +1 = f x i u W+1 k which means ha V +1 has o be compued ou of he grid x +1. Noe however ha he Bellman funcion a ime sage T is known analyically so ha inerpolaion is no needed for V T. This mehod faces an imporan difficuly: he curse of dimensionaliy. In fac one generally discreizes each sae coordinae a each ime sage using a scalar grid and a fixed number of poins. Therefore he grid x is obained as he Caresian produc of he scalar grids over all he coordinaes. Thus he number of paricles of ha grid increases exponenially wih he sae space dimension. This is he wellknown drawback of mos mehods derived from Dynamic Programming which do no ake advanage of he repariion of he opimal sae paricles in he sae space in order o concenrae compuaions in significan pars of he sae space The adapive mesh algorihm. Considering he difficulies faced by boh sochasic and dynamic programming mehods we propose an alernaive mehod for solving Problem 2.5 in he Markovian case. The mehod based on opimaliy condiions 2.15 aims a dealing wih he same number of noise paricles from he beginning of he ime horizon o he end: we hus hope ha he generaed feedback law esimaors will have a reduced and fixed variance during all ime sages; aemping o alleviae he curse of dimensionaliy by operaing on an adapive discreizaion grid auomaically generaed from he primary noise discreizaion grid Approximaion. Le us denoe by W kk=1...n =0...T a se of N independen and idenically disribued scenarios obained from he noise random process. Given random conrol grids u := U k k=1...n for = 0...T 1 we can compue he sae random grids x := X k k=1...n by propagaing he sae dynamics equaion: X k 0 = W k 0 k = 1... N 3.1a X k +1 = f X k Uk k k = 1...N = 0...T b

20 20 A. DALLAGI AND G. COHEN AND P. CARPENTIER The feedback a ime sage is obained using an inerpolaion-regression operaor on he grids x u. Noe ha he sae space grids x are no a priori fixed as in Dynamic Programming mehods. They are in fac adaped o he conrol paricules as hey change whenever he decision maker changes is conrol sraegy. Remark 5. If he opimal sae is concenraed in a region of he sae space he opimal feedback will be synhesized only inside ha region. In fac we need no compue i elsewhere because he sae will hardly reach oher regions. In order o obain approximaions Λ of he co-sae funcions Λ inroduced a Theorem 2.7 we compue paricles Λ k by inegraing he co-sae backward dynamic equaion over he sae grids x. We hus obain co-sae grids l = Λ k k=1...n and make use of inerpolaion-regression operaors R X in order o compue he co-sae funcion for values ou of he curren grid. More specifically he process is iniiaed wih Λ T = Λ T = K ; hen for all = T one compues: Λ k = 1 N NX L x X k U k W+1 j + f x X k U k W+1 j Λ b +1 `fx k U k W+1 j j=1 bλ = R X x l. 3.2a 3.2b G k Ulimaely for all k = 1...N and for all = 0...T 1 he gradien paricles are obained as G k = 1 N NX» L u j=1 `Xk U k W+1 j + f u `Xk U k W j bλ f +1 `Xk U k W j. 3.3 As already noiced he direcion associaed wih hese paricles represens he gradien only a he opimum Algorihm. We can now derive a descen-like algorihm o solve Problem 2.5 under Markovian assumpions. A each ieraion sae paricles are propagaed forward wih no ineracion beween paricles hen co-sae paricles are propagaed backward now wih ineracion caused by he regression-inerpolaion operaions. Then gradien paricles are compued using 3.3 and he conrol paricles are updaed using a gradien-like mehod. Ulimaely a funcional represenaion of he feedback laws is obained hanks o a regression-inerpolaion operaor. Algorihm 3. Sep [0]. Le u [0] =0...T 1 = U [0]k Sep [l]. k=1...n be he iniial conrol grids. =0...T 1 1. Compue he sae grids x [l] =0...T by propagaing he dynamics 3.1 wih U = U [l]. 2. Compue boh co-sae grids l [l] =0...T and funcional approximaions Λ [l] =0...T 1 by propagaing he dynamics 3.2 wih U = U [l] and X = X [l]. 3. Compue he gradien paricles G [l]k k=1...n using Equaion 3.3 =0...T 1 wih U = U [l] X = X [l] and Λ = Λ [l].

21 PARTICLE METHODS FOR STOCHASTIC OPTIMIZATION PROBLEMS For all = 0...T 1 and k = 1...N updae he conrol paricles by performing a projeced gradien sep: U [l]k ρ [l] G [l]k. U [l+1]k = proj Γ as 5. Sop if some degree of accuracy is reached. Else se l = l+1 and ierae. Sep [ ]. 1. Se he grids: u [ ] x [ ] =0...T 1 = U [l]k k=1...n =0...T 1 = X [l]k =0...T 1 k=1...n =0...T Obain for all = 0... T 1 he feedback law: [ ] φ = R U x u [ ]. The only a priori discreizaion made during his algorihm is relaive o he noise sampling. Once such a discreizaion has been performed all oher grids used o ulimaely obain he feedback laws are derived by inegraing dynamic equaions. In addiion no condiional expecaion approximaions are involved in he process. We jus approximae expecaions using Mone Carlo echniques and i is well-known ha he variance of such an approximaion does no depend on he dimension of he underlying space. The only space-size dependen operaions are in fac he inerpolaion operaors used o approximae he co-sae mappings during he ieraive process plus he feedback laws once convergence is achieved. Furhermore his adapive discreizaion makes compuaions concenrae in effecive sae space regions unlike dynamic programming which explores he whole sae space. 4. Case sudy. We consider he producion managemen of an hydro-elecric dam. The problem is formulaed as a sochasic opimal conrol and we consider solving i by he hree mehods described in Model. The problem is formulaed in discree ime over 24 hours using a consan ime sep of one hour. The index = 0...T where T = 24 defines he ime discreizaion grid. The waer volume sored in he dam a ime sage = 0...T is a one dimensional random variable X L 2 Ω A P; R corresponding o he sysem sae. This sorage variable has o remain beween given bounds x minimal volume o be kep in he dam and x maximal waer volume he dam can conain so ha for all = 0...T he following almos-sure consrain mus hold: x X x P-a.s The waer inflow ino he dam a sage is denoed by A. I is a one dimensional random variable wih known probabiliy law. We denoe by U L 2 Ω A P; R he one dimensional random variable corresponding o he desired volume of waer o be urbinaed a sage and by E he effecively urbinaed waer volume during he same ime sage. In mos cases E = U. Bu his equaliy is no achievable if he dam goes under is minimal volume. Therefore we have for all = 0...T 1: E = minu X + A +1 x P-a.s.. 4.2

22 22 A. DALLAGI AND G. COHEN AND P. CARPENTIER The shif in he ime indices is due o he fac ha he decision o urbinae is supposed o be made before observing he waer inflow volume enering he dam: we are in a decision-hazard framework. Moreover he conrol variables U are subjec o he following bounds: for all = 0...T 1 u U u P-a.s Taking ino accoun a possible overflow he dam dynamics wrie for = 0...T 1: X +1 = minx E + A +1 x P-a.s Noe ha consrains 4.1 are fully aken ino accoun in he modelling of he dam dynamics. An elecriciy producion P is associaed wih he effecively urbinaed waer volume and i also depends on he waer sorage indeed on he waer level in he dam due o he fall heigh effec: P = gx E. 4.5 Le D =1...T denoes he elecriciy demand which is supposed o be a sochasic process wih known probabiliy law. In our decision-hazard framework producion P has o mee demand D +1 : eiher P D +1 and he producion excess is sold on he elecriciy marke or P D +1 and he gap mus be compensaed for eiher by buying power on he marke or by paying a penaly. The associaed cos is modelled as c D +1 P. 4.6 Ulimaely we suppose ha a penaly funcion K on he final sock X T is given and ha he iniial condiion X 0 is a random variable wih known probabiliy law. Le W =0...T be he noise random process defined as W 0 = X 0 W = A D = 1...T. We assume ha he noises are fully observed in a non-anicipaive way and ha he conrol variables are measurable wih respec o he pas noises. The dam managemen problem is hen he following. min U X T 1 E c D+1 gx E + KX T 4.7a =0 subjec o he consrains and o he measurabiliy consrains U W 0...W = 0...T b I precisely corresponds o he sochasic opimal conrol problem formulaion 2.5 when using he following noaions: w = a d L x u w = c d g x minu x + a x f x u w = min x minu x + a x + a x Γ as = [u u].

An introduction to the theory of SDDP algorithm

An introduction to the theory of SDDP algorithm An inroducion o he heory of SDDP algorihm V. Leclère (ENPC) Augus 1, 2014 V. Leclère Inroducion o SDDP Augus 1, 2014 1 / 21 Inroducion Large scale sochasic problem are hard o solve. Two ways of aacking

More information

T L. t=1. Proof of Lemma 1. Using the marginal cost accounting in Equation(4) and standard arguments. t )+Π RB. t )+K 1(Q RB

T L. t=1. Proof of Lemma 1. Using the marginal cost accounting in Equation(4) and standard arguments. t )+Π RB. t )+K 1(Q RB Elecronic Companion EC.1. Proofs of Technical Lemmas and Theorems LEMMA 1. Le C(RB) be he oal cos incurred by he RB policy. Then we have, T L E[C(RB)] 3 E[Z RB ]. (EC.1) Proof of Lemma 1. Using he marginal

More information

Chapter 2. First Order Scalar Equations

Chapter 2. First Order Scalar Equations Chaper. Firs Order Scalar Equaions We sar our sudy of differenial equaions in he same way he pioneers in his field did. We show paricular echniques o solve paricular ypes of firs order differenial equaions.

More information

Finish reading Chapter 2 of Spivak, rereading earlier sections as necessary. handout and fill in some missing details!

Finish reading Chapter 2 of Spivak, rereading earlier sections as necessary. handout and fill in some missing details! MAT 257, Handou 6: Ocober 7-2, 20. I. Assignmen. Finish reading Chaper 2 of Spiva, rereading earlier secions as necessary. handou and fill in some missing deails! II. Higher derivaives. Also, read his

More information

Physics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle

Physics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle Chaper 2 Newonian Mechanics Single Paricle In his Chaper we will review wha Newon s laws of mechanics ell us abou he moion of a single paricle. Newon s laws are only valid in suiable reference frames,

More information

Lecture 20: Riccati Equations and Least Squares Feedback Control

Lecture 20: Riccati Equations and Least Squares Feedback Control 34-5 LINEAR SYSTEMS Lecure : Riccai Equaions and Leas Squares Feedback Conrol 5.6.4 Sae Feedback via Riccai Equaions A recursive approach in generaing he marix-valued funcion W ( ) equaion for i for he

More information

3.1.3 INTRODUCTION TO DYNAMIC OPTIMIZATION: DISCRETE TIME PROBLEMS. A. The Hamiltonian and First-Order Conditions in a Finite Time Horizon

3.1.3 INTRODUCTION TO DYNAMIC OPTIMIZATION: DISCRETE TIME PROBLEMS. A. The Hamiltonian and First-Order Conditions in a Finite Time Horizon 3..3 INRODUCION O DYNAMIC OPIMIZAION: DISCREE IME PROBLEMS A. he Hamilonian and Firs-Order Condiions in a Finie ime Horizon Define a new funcion, he Hamilonian funcion, H. H he change in he oal value of

More information

10. State Space Methods

10. State Space Methods . Sae Space Mehods. Inroducion Sae space modelling was briefly inroduced in chaper. Here more coverage is provided of sae space mehods before some of heir uses in conrol sysem design are covered in he

More information

Vehicle Arrival Models : Headway

Vehicle Arrival Models : Headway Chaper 12 Vehicle Arrival Models : Headway 12.1 Inroducion Modelling arrival of vehicle a secion of road is an imporan sep in raffic flow modelling. I has imporan applicaion in raffic flow simulaion where

More information

14 Autoregressive Moving Average Models

14 Autoregressive Moving Average Models 14 Auoregressive Moving Average Models In his chaper an imporan parameric family of saionary ime series is inroduced, he family of he auoregressive moving average, or ARMA, processes. For a large class

More information

Application of a Stochastic-Fuzzy Approach to Modeling Optimal Discrete Time Dynamical Systems by Using Large Scale Data Processing

Application of a Stochastic-Fuzzy Approach to Modeling Optimal Discrete Time Dynamical Systems by Using Large Scale Data Processing Applicaion of a Sochasic-Fuzzy Approach o Modeling Opimal Discree Time Dynamical Sysems by Using Large Scale Daa Processing AA WALASZE-BABISZEWSA Deparmen of Compuer Engineering Opole Universiy of Technology

More information

STATE-SPACE MODELLING. A mass balance across the tank gives:

STATE-SPACE MODELLING. A mass balance across the tank gives: B. Lennox and N.F. Thornhill, 9, Sae Space Modelling, IChemE Process Managemen and Conrol Subjec Group Newsleer STE-SPACE MODELLING Inroducion: Over he pas decade or so here has been an ever increasing

More information

Subway stations energy and air quality management

Subway stations energy and air quality management Subway saions energy and air qualiy managemen wih sochasic opimizaion Trisan Rigau 1,2,4, Advisors: P. Carpenier 3, J.-Ph. Chancelier 2, M. De Lara 2 EFFICACITY 1 CERMICS, ENPC 2 UMA, ENSTA 3 LISIS, IFSTTAR

More information

23.2. Representing Periodic Functions by Fourier Series. Introduction. Prerequisites. Learning Outcomes

23.2. Representing Periodic Functions by Fourier Series. Introduction. Prerequisites. Learning Outcomes Represening Periodic Funcions by Fourier Series 3. Inroducion In his Secion we show how a periodic funcion can be expressed as a series of sines and cosines. We begin by obaining some sandard inegrals

More information

Class Meeting # 10: Introduction to the Wave Equation

Class Meeting # 10: Introduction to the Wave Equation MATH 8.5 COURSE NOTES - CLASS MEETING # 0 8.5 Inroducion o PDEs, Fall 0 Professor: Jared Speck Class Meeing # 0: Inroducion o he Wave Equaion. Wha is he wave equaion? The sandard wave equaion for a funcion

More information

Online Appendix to Solution Methods for Models with Rare Disasters

Online Appendix to Solution Methods for Models with Rare Disasters Online Appendix o Soluion Mehods for Models wih Rare Disasers Jesús Fernández-Villaverde and Oren Levinal In his Online Appendix, we presen he Euler condiions of he model, we develop he pricing Calvo block,

More information

Decentralized Stochastic Control with Partial History Sharing: A Common Information Approach

Decentralized Stochastic Control with Partial History Sharing: A Common Information Approach 1 Decenralized Sochasic Conrol wih Parial Hisory Sharing: A Common Informaion Approach Ashuosh Nayyar, Adiya Mahajan and Demoshenis Tenekezis arxiv:1209.1695v1 [cs.sy] 8 Sep 2012 Absrac A general model

More information

RANDOM LAGRANGE MULTIPLIERS AND TRANSVERSALITY

RANDOM LAGRANGE MULTIPLIERS AND TRANSVERSALITY ECO 504 Spring 2006 Chris Sims RANDOM LAGRANGE MULTIPLIERS AND TRANSVERSALITY 1. INTRODUCTION Lagrange muliplier mehods are sandard fare in elemenary calculus courses, and hey play a cenral role in economic

More information

BU Macro BU Macro Fall 2008, Lecture 4

BU Macro BU Macro Fall 2008, Lecture 4 Dynamic Programming BU Macro 2008 Lecure 4 1 Ouline 1. Cerainy opimizaion problem used o illusrae: a. Resricions on exogenous variables b. Value funcion c. Policy funcion d. The Bellman equaion and an

More information

Diebold, Chapter 7. Francis X. Diebold, Elements of Forecasting, 4th Edition (Mason, Ohio: Cengage Learning, 2006). Chapter 7. Characterizing Cycles

Diebold, Chapter 7. Francis X. Diebold, Elements of Forecasting, 4th Edition (Mason, Ohio: Cengage Learning, 2006). Chapter 7. Characterizing Cycles Diebold, Chaper 7 Francis X. Diebold, Elemens of Forecasing, 4h Ediion (Mason, Ohio: Cengage Learning, 006). Chaper 7. Characerizing Cycles Afer compleing his reading you should be able o: Define covariance

More information

Technical Report Doc ID: TR March-2013 (Last revision: 23-February-2016) On formulating quadratic functions in optimization models.

Technical Report Doc ID: TR March-2013 (Last revision: 23-February-2016) On formulating quadratic functions in optimization models. Technical Repor Doc ID: TR--203 06-March-203 (Las revision: 23-Februar-206) On formulaing quadraic funcions in opimizaion models. Auhor: Erling D. Andersen Convex quadraic consrains quie frequenl appear

More information

Notes on Kalman Filtering

Notes on Kalman Filtering Noes on Kalman Filering Brian Borchers and Rick Aser November 7, Inroducion Daa Assimilaion is he problem of merging model predicions wih acual measuremens of a sysem o produce an opimal esimae of he curren

More information

On Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems

On Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems MATHEMATICS OF OPERATIONS RESEARCH Vol. 38, No. 2, May 2013, pp. 209 227 ISSN 0364-765X (prin) ISSN 1526-5471 (online) hp://dx.doi.org/10.1287/moor.1120.0562 2013 INFORMS On Boundedness of Q-Learning Ieraes

More information

SZG Macro 2011 Lecture 3: Dynamic Programming. SZG macro 2011 lecture 3 1

SZG Macro 2011 Lecture 3: Dynamic Programming. SZG macro 2011 lecture 3 1 SZG Macro 2011 Lecure 3: Dynamic Programming SZG macro 2011 lecure 3 1 Background Our previous discussion of opimal consumpion over ime and of opimal capial accumulaion sugges sudying he general decision

More information

23.5. Half-Range Series. Introduction. Prerequisites. Learning Outcomes

23.5. Half-Range Series. Introduction. Prerequisites. Learning Outcomes Half-Range Series 2.5 Inroducion In his Secion we address he following problem: Can we find a Fourier series expansion of a funcion defined over a finie inerval? Of course we recognise ha such a funcion

More information

On Measuring Pro-Poor Growth. 1. On Various Ways of Measuring Pro-Poor Growth: A Short Review of the Literature

On Measuring Pro-Poor Growth. 1. On Various Ways of Measuring Pro-Poor Growth: A Short Review of the Literature On Measuring Pro-Poor Growh 1. On Various Ways of Measuring Pro-Poor Growh: A Shor eview of he Lieraure During he pas en years or so here have been various suggesions concerning he way one should check

More information

t is a basis for the solution space to this system, then the matrix having these solutions as columns, t x 1 t, x 2 t,... x n t x 2 t...

t is a basis for the solution space to this system, then the matrix having these solutions as columns, t x 1 t, x 2 t,... x n t x 2 t... Mah 228- Fri Mar 24 5.6 Marix exponenials and linear sysems: The analogy beween firs order sysems of linear differenial equaions (Chaper 5) and scalar linear differenial equaions (Chaper ) is much sronger

More information

Two Popular Bayesian Estimators: Particle and Kalman Filters. McGill COMP 765 Sept 14 th, 2017

Two Popular Bayesian Estimators: Particle and Kalman Filters. McGill COMP 765 Sept 14 th, 2017 Two Popular Bayesian Esimaors: Paricle and Kalman Filers McGill COMP 765 Sep 14 h, 2017 1 1 1, dx x Bel x u x P x z P Recall: Bayes Filers,,,,,,, 1 1 1 1 u z u x P u z u x z P Bayes z = observaion u =

More information

Linear Response Theory: The connection between QFT and experiments

Linear Response Theory: The connection between QFT and experiments Phys540.nb 39 3 Linear Response Theory: The connecion beween QFT and experimens 3.1. Basic conceps and ideas Q: How do we measure he conduciviy of a meal? A: we firs inroduce a weak elecric field E, and

More information

Cash Flow Valuation Mode Lin Discrete Time

Cash Flow Valuation Mode Lin Discrete Time IOSR Journal of Mahemaics (IOSR-JM) e-issn: 2278-5728,p-ISSN: 2319-765X, 6, Issue 6 (May. - Jun. 2013), PP 35-41 Cash Flow Valuaion Mode Lin Discree Time Olayiwola. M. A. and Oni, N. O. Deparmen of Mahemaics

More information

This document was generated at 1:04 PM, 09/10/13 Copyright 2013 Richard T. Woodward. 4. End points and transversality conditions AGEC

This document was generated at 1:04 PM, 09/10/13 Copyright 2013 Richard T. Woodward. 4. End points and transversality conditions AGEC his documen was generaed a 1:4 PM, 9/1/13 Copyrigh 213 Richard. Woodward 4. End poins and ransversaliy condiions AGEC 637-213 F z d Recall from Lecure 3 ha a ypical opimal conrol problem is o maimize (,,

More information

Some Basic Information about M-S-D Systems

Some Basic Information about M-S-D Systems Some Basic Informaion abou M-S-D Sysems 1 Inroducion We wan o give some summary of he facs concerning unforced (homogeneous) and forced (non-homogeneous) models for linear oscillaors governed by second-order,

More information

An Introduction to Backward Stochastic Differential Equations (BSDEs) PIMS Summer School 2016 in Mathematical Finance.

An Introduction to Backward Stochastic Differential Equations (BSDEs) PIMS Summer School 2016 in Mathematical Finance. 1 An Inroducion o Backward Sochasic Differenial Equaions (BSDEs) PIMS Summer School 2016 in Mahemaical Finance June 25, 2016 Chrisoph Frei cfrei@ualbera.ca This inroducion is based on Touzi [14], Bouchard

More information

Inventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions

Inventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions Muli-Period Sochasic Models: Opimali of (s, S) Polic for -Convex Objecive Funcions Consider a seing similar o he N-sage newsvendor problem excep ha now here is a fixed re-ordering cos (> 0) for each (re-)order.

More information

Simulation-Solving Dynamic Models ABE 5646 Week 2, Spring 2010

Simulation-Solving Dynamic Models ABE 5646 Week 2, Spring 2010 Simulaion-Solving Dynamic Models ABE 5646 Week 2, Spring 2010 Week Descripion Reading Maerial 2 Compuer Simulaion of Dynamic Models Finie Difference, coninuous saes, discree ime Simple Mehods Euler Trapezoid

More information

Chapter 6. Systems of First Order Linear Differential Equations

Chapter 6. Systems of First Order Linear Differential Equations Chaper 6 Sysems of Firs Order Linear Differenial Equaions We will only discuss firs order sysems However higher order sysems may be made ino firs order sysems by a rick shown below We will have a sligh

More information

PENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD

PENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD PENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD HAN XIAO 1. Penalized Leas Squares Lasso solves he following opimizaion problem, ˆβ lasso = arg max β R p+1 1 N y i β 0 N x ij β j β j (1.1) for some 0.

More information

Notes for Lecture 17-18

Notes for Lecture 17-18 U.C. Berkeley CS278: Compuaional Complexiy Handou N7-8 Professor Luca Trevisan April 3-8, 2008 Noes for Lecure 7-8 In hese wo lecures we prove he firs half of he PCP Theorem, he Amplificaion Lemma, up

More information

MATH 4330/5330, Fourier Analysis Section 6, Proof of Fourier s Theorem for Pointwise Convergence

MATH 4330/5330, Fourier Analysis Section 6, Proof of Fourier s Theorem for Pointwise Convergence MATH 433/533, Fourier Analysis Secion 6, Proof of Fourier s Theorem for Poinwise Convergence Firs, some commens abou inegraing periodic funcions. If g is a periodic funcion, g(x + ) g(x) for all real x,

More information

Section 3.5 Nonhomogeneous Equations; Method of Undetermined Coefficients

Section 3.5 Nonhomogeneous Equations; Method of Undetermined Coefficients Secion 3.5 Nonhomogeneous Equaions; Mehod of Undeermined Coefficiens Key Terms/Ideas: Linear Differenial operaor Nonlinear operaor Second order homogeneous DE Second order nonhomogeneous DE Soluion o homogeneous

More information

1. An introduction to dynamic optimization -- Optimal Control and Dynamic Programming AGEC

1. An introduction to dynamic optimization -- Optimal Control and Dynamic Programming AGEC This documen was generaed a :45 PM 8/8/04 Copyrigh 04 Richard T. Woodward. An inroducion o dynamic opimizaion -- Opimal Conrol and Dynamic Programming AGEC 637-04 I. Overview of opimizaion Opimizaion is

More information

Zürich. ETH Master Course: L Autonomous Mobile Robots Localization II

Zürich. ETH Master Course: L Autonomous Mobile Robots Localization II Roland Siegwar Margaria Chli Paul Furgale Marco Huer Marin Rufli Davide Scaramuzza ETH Maser Course: 151-0854-00L Auonomous Mobile Robos Localizaion II ACT and SEE For all do, (predicion updae / ACT),

More information

A Primal-Dual Type Algorithm with the O(1/t) Convergence Rate for Large Scale Constrained Convex Programs

A Primal-Dual Type Algorithm with the O(1/t) Convergence Rate for Large Scale Constrained Convex Programs PROC. IEEE CONFERENCE ON DECISION AND CONTROL, 06 A Primal-Dual Type Algorihm wih he O(/) Convergence Rae for Large Scale Consrained Convex Programs Hao Yu and Michael J. Neely Absrac This paper considers

More information

Optimality Conditions for Unconstrained Problems

Optimality Conditions for Unconstrained Problems 62 CHAPTER 6 Opimaliy Condiions for Unconsrained Problems 1 Unconsrained Opimizaion 11 Exisence Consider he problem of minimizing he funcion f : R n R where f is coninuous on all of R n : P min f(x) x

More information

Lecture 2-1 Kinematics in One Dimension Displacement, Velocity and Acceleration Everything in the world is moving. Nothing stays still.

Lecture 2-1 Kinematics in One Dimension Displacement, Velocity and Acceleration Everything in the world is moving. Nothing stays still. Lecure - Kinemaics in One Dimension Displacemen, Velociy and Acceleraion Everyhing in he world is moving. Nohing says sill. Moion occurs a all scales of he universe, saring from he moion of elecrons in

More information

Final Spring 2007

Final Spring 2007 .615 Final Spring 7 Overview The purpose of he final exam is o calculae he MHD β limi in a high-bea oroidal okamak agains he dangerous n = 1 exernal ballooning-kink mode. Effecively, his corresponds o

More information

Solutions to Assignment 1

Solutions to Assignment 1 MA 2326 Differenial Equaions Insrucor: Peronela Radu Friday, February 8, 203 Soluions o Assignmen. Find he general soluions of he following ODEs: (a) 2 x = an x Soluion: I is a separable equaion as we

More information

2. Nonlinear Conservation Law Equations

2. Nonlinear Conservation Law Equations . Nonlinear Conservaion Law Equaions One of he clear lessons learned over recen years in sudying nonlinear parial differenial equaions is ha i is generally no wise o ry o aack a general class of nonlinear

More information

On-line Adaptive Optimal Timing Control of Switched Systems

On-line Adaptive Optimal Timing Control of Switched Systems On-line Adapive Opimal Timing Conrol of Swiched Sysems X.C. Ding, Y. Wardi and M. Egersed Absrac In his paper we consider he problem of opimizing over he swiching imes for a muli-modal dynamic sysem when

More information

Two Coupled Oscillators / Normal Modes

Two Coupled Oscillators / Normal Modes Lecure 3 Phys 3750 Two Coupled Oscillaors / Normal Modes Overview and Moivaion: Today we ake a small, bu significan, sep owards wave moion. We will no ye observe waves, bu his sep is imporan in is own

More information

Hamilton- J acobi Equation: Explicit Formulas In this lecture we try to apply the method of characteristics to the Hamilton-Jacobi equation: u t

Hamilton- J acobi Equation: Explicit Formulas In this lecture we try to apply the method of characteristics to the Hamilton-Jacobi equation: u t M ah 5 2 7 Fall 2 0 0 9 L ecure 1 0 O c. 7, 2 0 0 9 Hamilon- J acobi Equaion: Explici Formulas In his lecure we ry o apply he mehod of characerisics o he Hamilon-Jacobi equaion: u + H D u, x = 0 in R n

More information

15. Vector Valued Functions

15. Vector Valued Functions 1. Vecor Valued Funcions Up o his poin, we have presened vecors wih consan componens, for example, 1, and,,4. However, we can allow he componens of a vecor o be funcions of a common variable. For example,

More information

R t. C t P t. + u t. C t = αp t + βr t + v t. + β + w t

R t. C t P t. + u t. C t = αp t + βr t + v t. + β + w t Exercise 7 C P = α + β R P + u C = αp + βr + v (a) (b) C R = α P R + β + w (c) Assumpions abou he disurbances u, v, w : Classical assumions on he disurbance of one of he equaions, eg. on (b): E(v v s P,

More information

Lecture Notes 2. The Hilbert Space Approach to Time Series

Lecture Notes 2. The Hilbert Space Approach to Time Series Time Series Seven N. Durlauf Universiy of Wisconsin. Basic ideas Lecure Noes. The Hilber Space Approach o Time Series The Hilber space framework provides a very powerful language for discussing he relaionship

More information

An Introduction to Malliavin calculus and its applications

An Introduction to Malliavin calculus and its applications An Inroducion o Malliavin calculus and is applicaions Lecure 5: Smoohness of he densiy and Hörmander s heorem David Nualar Deparmen of Mahemaics Kansas Universiy Universiy of Wyoming Summer School 214

More information

Math 334 Fall 2011 Homework 11 Solutions

Math 334 Fall 2011 Homework 11 Solutions Dec. 2, 2 Mah 334 Fall 2 Homework Soluions Basic Problem. Transform he following iniial value problem ino an iniial value problem for a sysem: u + p()u + q() u g(), u() u, u () v. () Soluion. Le v u. Then

More information

Econ107 Applied Econometrics Topic 7: Multicollinearity (Studenmund, Chapter 8)

Econ107 Applied Econometrics Topic 7: Multicollinearity (Studenmund, Chapter 8) I. Definiions and Problems A. Perfec Mulicollineariy Econ7 Applied Economerics Topic 7: Mulicollineariy (Sudenmund, Chaper 8) Definiion: Perfec mulicollineariy exiss in a following K-variable regression

More information

Robust estimation based on the first- and third-moment restrictions of the power transformation model

Robust estimation based on the first- and third-moment restrictions of the power transformation model h Inernaional Congress on Modelling and Simulaion, Adelaide, Ausralia, 6 December 3 www.mssanz.org.au/modsim3 Robus esimaion based on he firs- and hird-momen resricions of he power ransformaion Nawaa,

More information

1 Review of Zero-Sum Games

1 Review of Zero-Sum Games COS 5: heoreical Machine Learning Lecurer: Rob Schapire Lecure #23 Scribe: Eugene Brevdo April 30, 2008 Review of Zero-Sum Games Las ime we inroduced a mahemaical model for wo player zero-sum games. Any

More information

In this chapter the model of free motion under gravity is extended to objects projected at an angle. When you have completed it, you should

In this chapter the model of free motion under gravity is extended to objects projected at an angle. When you have completed it, you should Cambridge Universiy Press 978--36-60033-7 Cambridge Inernaional AS and A Level Mahemaics: Mechanics Coursebook Excerp More Informaion Chaper The moion of projeciles In his chaper he model of free moion

More information

The expectation value of the field operator.

The expectation value of the field operator. The expecaion value of he field operaor. Dan Solomon Universiy of Illinois Chicago, IL dsolom@uic.edu June, 04 Absrac. Much of he mahemaical developmen of quanum field heory has been in suppor of deermining

More information

ACE 562 Fall Lecture 5: The Simple Linear Regression Model: Sampling Properties of the Least Squares Estimators. by Professor Scott H.

ACE 562 Fall Lecture 5: The Simple Linear Regression Model: Sampling Properties of the Least Squares Estimators. by Professor Scott H. ACE 56 Fall 005 Lecure 5: he Simple Linear Regression Model: Sampling Properies of he Leas Squares Esimaors by Professor Sco H. Irwin Required Reading: Griffihs, Hill and Judge. "Inference in he Simple

More information

Hamilton- J acobi Equation: Weak S olution We continue the study of the Hamilton-Jacobi equation:

Hamilton- J acobi Equation: Weak S olution We continue the study of the Hamilton-Jacobi equation: M ah 5 7 Fall 9 L ecure O c. 4, 9 ) Hamilon- J acobi Equaion: Weak S oluion We coninue he sudy of he Hamilon-Jacobi equaion: We have shown ha u + H D u) = R n, ) ; u = g R n { = }. ). In general we canno

More information

Solutions Problem Set 3 Macro II (14.452)

Solutions Problem Set 3 Macro II (14.452) Soluions Problem Se 3 Macro II (14.452) Francisco A. Gallego 04/27/2005 1 Q heory of invesmen in coninuous ime and no uncerainy Consider he in nie horizon model of a rm facing adjusmen coss o invesmen.

More information

Online Convex Optimization Example And Follow-The-Leader

Online Convex Optimization Example And Follow-The-Leader CSE599s, Spring 2014, Online Learning Lecure 2-04/03/2014 Online Convex Opimizaion Example And Follow-The-Leader Lecurer: Brendan McMahan Scribe: Sephen Joe Jonany 1 Review of Online Convex Opimizaion

More information

Matrix Versions of Some Refinements of the Arithmetic-Geometric Mean Inequality

Matrix Versions of Some Refinements of the Arithmetic-Geometric Mean Inequality Marix Versions of Some Refinemens of he Arihmeic-Geomeric Mean Inequaliy Bao Qi Feng and Andrew Tonge Absrac. We esablish marix versions of refinemens due o Alzer ], Carwrigh and Field 4], and Mercer 5]

More information

Matlab and Python programming: how to get started

Matlab and Python programming: how to get started Malab and Pyhon programming: how o ge sared Equipping readers he skills o wrie programs o explore complex sysems and discover ineresing paerns from big daa is one of he main goals of his book. In his chaper,

More information

Solutions from Chapter 9.1 and 9.2

Solutions from Chapter 9.1 and 9.2 Soluions from Chaper 9 and 92 Secion 9 Problem # This basically boils down o an exercise in he chain rule from calculus We are looking for soluions of he form: u( x) = f( k x c) where k x R 3 and k is

More information

Essential Microeconomics : OPTIMAL CONTROL 1. Consider the following class of optimization problems

Essential Microeconomics : OPTIMAL CONTROL 1. Consider the following class of optimization problems Essenial Microeconomics -- 6.5: OPIMAL CONROL Consider he following class of opimizaion problems Max{ U( k, x) + U+ ( k+ ) k+ k F( k, x)}. { x, k+ } = In he language of conrol heory, he vecor k is he vecor

More information

GMM - Generalized Method of Moments

GMM - Generalized Method of Moments GMM - Generalized Mehod of Momens Conens GMM esimaion, shor inroducion 2 GMM inuiion: Maching momens 2 3 General overview of GMM esimaion. 3 3. Weighing marix...........................................

More information

EXERCISES FOR SECTION 1.5

EXERCISES FOR SECTION 1.5 1.5 Exisence and Uniqueness of Soluions 43 20. 1 v c 21. 1 v c 1 2 4 6 8 10 1 2 2 4 6 8 10 Graph of approximae soluion obained using Euler s mehod wih = 0.1. Graph of approximae soluion obained using Euler

More information

Guest Lectures for Dr. MacFarlane s EE3350 Part Deux

Guest Lectures for Dr. MacFarlane s EE3350 Part Deux Gues Lecures for Dr. MacFarlane s EE3350 Par Deux Michael Plane Mon., 08-30-2010 Wrie name in corner. Poin ou his is a review, so I will go faser. Remind hem o go lisen o online lecure abou geing an A

More information

MATH 5720: Gradient Methods Hung Phan, UMass Lowell October 4, 2018

MATH 5720: Gradient Methods Hung Phan, UMass Lowell October 4, 2018 MATH 5720: Gradien Mehods Hung Phan, UMass Lowell Ocober 4, 208 Descen Direcion Mehods Consider he problem min { f(x) x R n}. The general descen direcions mehod is x k+ = x k + k d k where x k is he curren

More information

5. Stochastic processes (1)

5. Stochastic processes (1) Lec05.pp S-38.45 - Inroducion o Teleraffic Theory Spring 2005 Conens Basic conceps Poisson process 2 Sochasic processes () Consider some quaniy in a eleraffic (or any) sysem I ypically evolves in ime randomly

More information

Multi-scale 2D acoustic full waveform inversion with high frequency impulsive source

Multi-scale 2D acoustic full waveform inversion with high frequency impulsive source Muli-scale D acousic full waveform inversion wih high frequency impulsive source Vladimir N Zubov*, Universiy of Calgary, Calgary AB vzubov@ucalgaryca and Michael P Lamoureux, Universiy of Calgary, Calgary

More information

6. Stochastic calculus with jump processes

6. Stochastic calculus with jump processes A) Trading sraegies (1/3) Marke wih d asses S = (S 1,, S d ) A rading sraegy can be modelled wih a vecor φ describing he quaniies invesed in each asse a each insan : φ = (φ 1,, φ d ) The value a of a porfolio

More information

Course Notes for EE227C (Spring 2018): Convex Optimization and Approximation

Course Notes for EE227C (Spring 2018): Convex Optimization and Approximation Course Noes for EE7C Spring 018: Convex Opimizaion and Approximaion Insrucor: Moriz Hard Email: hard+ee7c@berkeley.edu Graduae Insrucor: Max Simchowiz Email: msimchow+ee7c@berkeley.edu Ocober 15, 018 3

More information

0.1 MAXIMUM LIKELIHOOD ESTIMATION EXPLAINED

0.1 MAXIMUM LIKELIHOOD ESTIMATION EXPLAINED 0.1 MAXIMUM LIKELIHOOD ESTIMATIO EXPLAIED Maximum likelihood esimaion is a bes-fi saisical mehod for he esimaion of he values of he parameers of a sysem, based on a se of observaions of a random variable

More information

O Q L N. Discrete-Time Stochastic Dynamic Programming. I. Notation and basic assumptions. ε t : a px1 random vector of disturbances at time t.

O Q L N. Discrete-Time Stochastic Dynamic Programming. I. Notation and basic assumptions. ε t : a px1 random vector of disturbances at time t. Econ. 5b Spring 999 C. Sims Discree-Time Sochasic Dynamic Programming 995, 996 by Chrisopher Sims. This maerial may be freely reproduced for educaional and research purposes, so long as i is no alered,

More information

Failure of the work-hamiltonian connection for free energy calculations. Abstract

Failure of the work-hamiltonian connection for free energy calculations. Abstract Failure of he work-hamilonian connecion for free energy calculaions Jose M. G. Vilar 1 and J. Miguel Rubi 1 Compuaional Biology Program, Memorial Sloan-Keering Cancer Cener, 175 York Avenue, New York,

More information

) were both constant and we brought them from under the integral.

) were both constant and we brought them from under the integral. YIELD-PER-RECRUIT (coninued The yield-per-recrui model applies o a cohor, bu we saw in he Age Disribuions lecure ha he properies of a cohor do no apply in general o a collecion of cohors, which is wha

More information

ACE 562 Fall Lecture 4: Simple Linear Regression Model: Specification and Estimation. by Professor Scott H. Irwin

ACE 562 Fall Lecture 4: Simple Linear Regression Model: Specification and Estimation. by Professor Scott H. Irwin ACE 56 Fall 005 Lecure 4: Simple Linear Regression Model: Specificaion and Esimaion by Professor Sco H. Irwin Required Reading: Griffihs, Hill and Judge. "Simple Regression: Economic and Saisical Model

More information

Applying Genetic Algorithms for Inventory Lot-Sizing Problem with Supplier Selection under Storage Capacity Constraints

Applying Genetic Algorithms for Inventory Lot-Sizing Problem with Supplier Selection under Storage Capacity Constraints IJCSI Inernaional Journal of Compuer Science Issues, Vol 9, Issue 1, No 1, January 2012 wwwijcsiorg 18 Applying Geneic Algorihms for Invenory Lo-Sizing Problem wih Supplier Selecion under Sorage Capaciy

More information

SUPPLEMENTARY INFORMATION

SUPPLEMENTARY INFORMATION SUPPLEMENTARY INFORMATION DOI: 0.038/NCLIMATE893 Temporal resoluion and DICE * Supplemenal Informaion Alex L. Maren and Sephen C. Newbold Naional Cener for Environmenal Economics, US Environmenal Proecion

More information

Chapter 3 Boundary Value Problem

Chapter 3 Boundary Value Problem Chaper 3 Boundary Value Problem A boundary value problem (BVP) is a problem, ypically an ODE or a PDE, which has values assigned on he physical boundary of he domain in which he problem is specified. Le

More information

Expert Advice for Amateurs

Expert Advice for Amateurs Exper Advice for Amaeurs Ernes K. Lai Online Appendix - Exisence of Equilibria The analysis in his secion is performed under more general payoff funcions. Wihou aking an explici form, he payoffs of he

More information

Math 333 Problem Set #2 Solution 14 February 2003

Math 333 Problem Set #2 Solution 14 February 2003 Mah 333 Problem Se #2 Soluion 14 February 2003 A1. Solve he iniial value problem dy dx = x2 + e 3x ; 2y 4 y(0) = 1. Soluion: This is separable; we wrie 2y 4 dy = x 2 + e x dx and inegrae o ge The iniial

More information

( ) = b n ( t) n " (2.111) or a system with many states to be considered, solving these equations isn t. = k U I ( t,t 0 )! ( t 0 ) (2.

( ) = b n ( t) n  (2.111) or a system with many states to be considered, solving these equations isn t. = k U I ( t,t 0 )! ( t 0 ) (2. Andrei Tokmakoff, MIT Deparmen of Chemisry, 3/14/007-6.4 PERTURBATION THEORY Given a Hamilonian H = H 0 + V where we know he eigenkes for H 0 : H 0 n = E n n, we can calculae he evoluion of he wavefuncion

More information

Seminar 4: Hotelling 2

Seminar 4: Hotelling 2 Seminar 4: Hoelling 2 November 3, 211 1 Exercise Par 1 Iso-elasic demand A non renewable resource of a known sock S can be exraced a zero cos. Demand for he resource is of he form: D(p ) = p ε ε > A a

More information

t 2 B F x,t n dsdt t u x,t dxdt

t 2 B F x,t n dsdt t u x,t dxdt Evoluion Equaions For 0, fixed, le U U0, where U denoes a bounded open se in R n.suppose ha U is filled wih a maerial in which a conaminan is being ranspored by various means including diffusion and convecion.

More information

Basic Circuit Elements Professor J R Lucas November 2001

Basic Circuit Elements Professor J R Lucas November 2001 Basic Circui Elemens - J ucas An elecrical circui is an inerconnecion of circui elemens. These circui elemens can be caegorised ino wo ypes, namely acive and passive elemens. Some Definiions/explanaions

More information

References are appeared in the last slide. Last update: (1393/08/19)

References are appeared in the last slide. Last update: (1393/08/19) SYSEM IDEIFICAIO Ali Karimpour Associae Professor Ferdowsi Universi of Mashhad References are appeared in he las slide. Las updae: 0..204 393/08/9 Lecure 5 lecure 5 Parameer Esimaion Mehods opics o be

More information

Georey E. Hinton. University oftoronto. Technical Report CRG-TR February 22, Abstract

Georey E. Hinton. University oftoronto.   Technical Report CRG-TR February 22, Abstract Parameer Esimaion for Linear Dynamical Sysems Zoubin Ghahramani Georey E. Hinon Deparmen of Compuer Science Universiy oftorono 6 King's College Road Torono, Canada M5S A4 Email: zoubin@cs.orono.edu Technical

More information

We just finished the Erdős-Stone Theorem, and ex(n, F ) (1 1/(χ(F ) 1)) ( n

We just finished the Erdős-Stone Theorem, and ex(n, F ) (1 1/(χ(F ) 1)) ( n Lecure 3 - Kövari-Sós-Turán Theorem Jacques Versraëe jacques@ucsd.edu We jus finished he Erdős-Sone Theorem, and ex(n, F ) ( /(χ(f ) )) ( n 2). So we have asympoics when χ(f ) 3 bu no when χ(f ) = 2 i.e.

More information

Essential Maps and Coincidence Principles for General Classes of Maps

Essential Maps and Coincidence Principles for General Classes of Maps Filoma 31:11 (2017), 3553 3558 hps://doi.org/10.2298/fil1711553o Published by Faculy of Sciences Mahemaics, Universiy of Niš, Serbia Available a: hp://www.pmf.ni.ac.rs/filoma Essenial Maps Coincidence

More information

Lecture 33: November 29

Lecture 33: November 29 36-705: Inermediae Saisics Fall 2017 Lecurer: Siva Balakrishnan Lecure 33: November 29 Today we will coninue discussing he boosrap, and hen ry o undersand why i works in a simple case. In he las lecure

More information

1. An introduction to dynamic optimization -- Optimal Control and Dynamic Programming AGEC

1. An introduction to dynamic optimization -- Optimal Control and Dynamic Programming AGEC This documen was generaed a :37 PM, 1/11/018 Copyrigh 018 Richard T. Woodward 1. An inroducion o dynamic opimiaion -- Opimal Conrol and Dynamic Programming AGEC 64-018 I. Overview of opimiaion Opimiaion

More information

On a Discrete-In-Time Order Level Inventory Model for Items with Random Deterioration

On a Discrete-In-Time Order Level Inventory Model for Items with Random Deterioration Journal of Agriculure and Life Sciences Vol., No. ; June 4 On a Discree-In-Time Order Level Invenory Model for Iems wih Random Deerioraion Dr Biswaranjan Mandal Associae Professor of Mahemaics Acharya

More information

Macroeconomic Theory Ph.D. Qualifying Examination Fall 2005 ANSWER EACH PART IN A SEPARATE BLUE BOOK. PART ONE: ANSWER IN BOOK 1 WEIGHT 1/3

Macroeconomic Theory Ph.D. Qualifying Examination Fall 2005 ANSWER EACH PART IN A SEPARATE BLUE BOOK. PART ONE: ANSWER IN BOOK 1 WEIGHT 1/3 Macroeconomic Theory Ph.D. Qualifying Examinaion Fall 2005 Comprehensive Examinaion UCLA Dep. of Economics You have 4 hours o complee he exam. There are hree pars o he exam. Answer all pars. Each par has

More information

Random Walk with Anti-Correlated Steps

Random Walk with Anti-Correlated Steps Random Walk wih Ani-Correlaed Seps John Noga Dirk Wagner 2 Absrac We conjecure he expeced value of random walks wih ani-correlaed seps o be exacly. We suppor his conjecure wih 2 plausibiliy argumens and

More information