Robust Shortest Path Planning and Semicontractive Dynamic Programming

Size: px
Start display at page:

Download "Robust Shortest Path Planning and Semicontractive Dynamic Programming"

Transcription

1 ebrary 4 (Revised Agst 4, ne 6 Report LIDS - 95 Robst Shortest Path Planning Semicontractive Dynamic Programg Dimitri P. Bertseas Abstract In this paper we consider shortest path in a directed graph where transitions between nodes sbject to ncertainty. We se a ima ormlation, where objective is to garantee that a destination state is reached with a imm cost path nder worst possible instance o ncertainty. s o this type arise, among ors, in planning prsit-evasion contets, in model predictive control. Or analysis maes se o recently developed ory o abstract semicontractive dynamic programg models. We investigate qestions o eistence niqeness o soltion o optimality eqation, eistence o optimal paths, validity o varios algorithms patterned ater classical s o vale policy iteration, as well as a Dijstra-lie algorithm or with nonnegative arc lengths. Dimitri Bertseas is with Dept. o Electr. Engineering Comp. Science, Laboratory or Inormation Decision Systems, M.I.T., Cambridge, Mass., 9.

2 . INTRODUCTION In this paper, we discss shortest path that embody a worst-case view o ncertainty. These relate to several or types o arising in stochastic ima control, model predictive control, Marovian decision processes, planning, seqential games, robst combinatorial optimization, soltion o discretized large-scale dierential eqations. Conseqently, or analysis algorithms relate to a large body o eisting ory. However, in this paper we rely on a recently developed abstract dynamic programg ory o semicontractive, capitalize on general reslts developed in contet o this ory [Ber]. We irst discss inormally se connections we srvey related literatre. Relations with Or s Literatre Review The closest connection to or wor is classical shortest path problem where objective is to reach a destination node with a imm length path rom every or node in a directed graph. This is a ndamental problem that has an enormos range o applications has been stdied etensively (see e.g., srveys [Dre69], [GaP88], many tetboos, inclding [Roc84], [AMO89], [Ber98], [Ber5]. The assmption is that at any node, we may detere a sccessor node y rom a given set o possible sccessors, deined arcs, y o graph that otgoing rom. In some, however, ollowing decision at a given node, re is inherent ncertainty abot sccessor node. In a stochastic ormlation, ncertainty is modeled a probability distribtion over set o sccessors, or decision is n to choose at each node one distribtion ot o a given set o distribtions. The reslting problem, nown as stochastic shortest path problem (also nown as transient programg problem, is a total cost ininite horizon Marovian decision problem, with a sbstantial analytical algorithmic ology, which inds etensive applications in o motion planning, robotics, or where aim is to reach a goal state with probability nder stochastic ncertainty (see [Pal67], [Der7], [Pli78], [Whi8], [BeT89], [BeT9], [Pt94], [HCP99], [HiW5], [ac6], [LaV6], [Bon7], [Ber], [BeY6], [CCV4], [YBa]. Anor important a o application is largescale comptation or discretized versions o dierential eqations (sch as Hamilton-acobi-Bellman eqation, eional eqation; see [GoR85], [al87], [KD9], [BGM95], [Tsi95], [PBT98], [Set99a], [Set99b], [Vla8], [AlM], [ChV], [AnV4], [Mir4]. In this paper, we introdce a seqential ima ormlation o shortest path problem, where ncertainty is modeled set membership: at a given node, we may choose one sbset ot o a given collection o sbsets o nodes, sccessor node on path is chosen rom this sbset an antagonistic opponent. Or principal o analysis is dynamic programg (DP or short. Related have been stdied or a long time, in contet o control o ncertain discrete-time dynamic systems with a set membership description o ncertainty (starting with ses [Wit66] [Ber7], ollowed p many or wors; see e.g., monographs [BaB9], [KV97], [BlM8], srvey [Bla99], reerences given re. These relevant or eample in contet o model predictive control nder ncertainty, a sbject o great importance in crrent practice o control ory (see e.g., srveys [MoL99], [MRR], boos [Mac], [CaB4], [RaM9]; model predictive control with set membership distrbances is discssed in sis [Ker] tet [Ber5], Section Seqential ima have also been stdied in contet o seqential games (starting with paper [Sha5], ollowed p many or wors, inclding boos [BaB9], [iv96], reerences given re. Seqential games that involve shortest paths particlarly relevant; see wors [PaB99], [Gr8], [Y], [BaL5]. An important dierence with some o wors on seqential

3 games is that in or ima ormlation, we assme that antagonistic opponent nows decision corresponding sbset o sccessor nodes chosen at each node. Ths in or problem, it wold mae a dierence i decisions at each node were made with advance nowledge o opponent s choice ( -ma is typically not eqal to ma- in or contet. Generally shortest path games admit a simpler analysis when arc lengths assmed nonnegative (as is done or eample in recent wors [Gr8], [BaL5], when problem inherits strctre o negative DP (see [Str66], or tets [Pt94], [Ber] or abstract monotone increasing abstract DP models (see [Ber77], [BeS78], [Ber]. However, or ormlation line o analysis is based on recently introdced abstract semicontractive DP model o [Ber], allows negative as well as nonnegative arc lengths. s with negative arc lengths arise in applications when we want to ind longest path in a networ with nonnegative arc lengths, sch as critical path analysis. s with both positive negative arc lengths inclde searching a networ or objects o vale with positive search costs (c. Eample 4., inancial o maimization o total reward when re transaction or costs. An important application o or shortest path is in prsit-evasion (or search resce contets, where a team o prsers aig to reach one or more evaders that move npredictably. s o this ind have been stdied etensively rom dierent s o view (see e.g., [Par76], [MHG88], [BaS9], [HKR9], [BS94], [BB95], [GLL99], [VKS], [LaV6], [AHK7], [BBH8], [BaL5]. or or shortest path ormlation to be applicable to sch a problem, prsers evaders mst have perect inormation abot each ors positions, Cartesian prodct o ir positions ( state o system mst be restricted to inite set o nodes o a given graph, with nown transition costs (i.e., a terrain map that is nown a priori. We may deal with prsit-evasion with imperect state inormation set-membership ncertainty means o a redction to perect state inormation, which is based on set membership estimators notion o a siciently inormative nction, introdced in sis [Ber7] in sbseqent paper [BeR7]. In this contet, original imperect state inormation problem is reormlated as a problem o perect state inormation, where states correspond to sbsets o nodes o original graph ( set o states that consistent with observation history o system, in terology o set membership estimation [BeR7], [Ber7], [KV97]. Ths, since X has a inite nmber o nodes, reormlated problem still involves a inite (bt mch larger nmber o states, may be dealt with sing ology o this paper. Note that problem reormlation jst described is also applicable to general ima control with imperect state inormation, not jst to prsit-evasion. Or wor is also related to sbject o robst optimization (see e.g., boo [BGN9] recent srvey [BBC], which incldes ima ormlations o general optimization with set membership ncertainty. However, or emphasis here is placed on presence o destination node reqirement or teration, which is salient eatre essential strctre o shortest path. Moreover, a dierence with or wors on robst shortest path selection (see e.g., [YY98], [BeS], [MoG4] is that in or wor ncertainty abot transitions or arc cost data at a given node is decopled rom corresponding ncertainty at or nodes. This allows a DP ormlation o or problem. Becase or contet diers in essential respects rom preceding wors, reslts o present paper new to a great etent. The line o analysis is also new, is based on connection with ory o abstract semicontractive DP mentioned earlier. In addition to simpler proos, a major beneit o this abstract line o treatment is deeper insight into strctre o or problem, natre o or analytical comptational reslts. Several related, involving or eample an additional stochastic type o ncertainty, admit a similar treatment. Some o se described in last

4 section, ir analysis associated algorithms sbjects or rr research. Robst Shortest Path ormlation To ormally describe or problem, we consider a graph with a inite set o nodes X {t} a inite set o directed arcs A {, y, y X {t} }, where t is a node called destination. At each node X we may choose a control or action rom a nonempty set U, which is a sbset o a inite set U. Then a sccessor node y is selected an antagonistic opponent rom a nonempty set Y, X {t}, sch that, y A or all y Y,, a cost g,, y is incrred. The destination node t is absorbing cost-ree, in sense that only otgoing arc rom t is (t, t we have g(t,, t or all U(t. A policy is deined to be a nction that assigns to each node X a control U. We denote inite set o all policies M. A possible path nder a policy starting at node X is an arc seqence o orm p {,,,,... }, sch that Y (, or all. The set o all possible paths nder starting at is denoted P, ; it is set o paths that antagonistic opponent may generate starting rom, once policy has been chosen. The length o a path p P, is deined L (p g (,,, i series above is convergent, more generally L (p lim sp m m g (,,, i it is not. or completeness, we also deine length o a portion { i, i, i, i,..., m, m } o a path p P,, consisting o a inite nmber o consective arcs, m g (,,. i When consion cannot arise we will also reer to sch a inite-arc portion as a path. O interest cycles, that is, paths o orm { i, i, i, i,..., im, i }. Paths that do not contain any cycle or than sel-cycle (t, t called simple. or a given policy t, a path p P, is said to be terating i it has orm p {,,,,..., m, t, (t, t,... }, (. where m is a positive integer,,..., m distinct nondestination nodes. Since g(t,, t or all U(t, length o a terating path p o orm (., corresponding to, is given L (p g ( m, m, t m g (,,, 4

5 t t Improper policy Proper policy igre.. A robst shortest path problem with X {, }, two controls at node, one control at node. There two policies,, corresponding to two controls at node. The igre shows sbgraphs o arcs A A. The policy is improper becase A contains cycle (,, (sel-cycle (,. is eqal to inite length o its initial portion that consists o irst m arcs. An important characterization o a policy is provided sbset o arcs A X {, y y Y (, }. Ths A, toger with sel-arc (t, t, consists o set o paths X P,, in sense that it contains this set o paths no or paths. We say that A is destination-connected i or each X re eists a terating path in P,. We say that is proper i sbgraph o arcs A is acyclic (i.e., contains no cycles. Ths is proper i only i all paths in X P, simple hence terating (eqivalently is proper i only i A is destination-connected has no cycles. The term proper is consistent with a similar term in stochastic shortest path, where it indicates a policy nder which destination is reached with probability, see e.g., [Pal67], [BeT89], [BeT9]. I is not proper, it is called improper, in which case sbgraph o arcs A mst contain a cycle; see eamples o ig... or a proper, we associate with every X worst-case path length over inite set o possible paths starting rom, which is denoted ma L (p, X. (. p P, Ths is length o longest path rom to t in acyclic sbgraph o arcs A. Since re initely many paths in this acyclic graph, may be ond eir enmeration comparison o se paths (in simple, or solving shortest path problem obtained when signs o arc lengths g (,, y,, y A, reversed. Or problem is to ind an optimal proper policy, i.e., one that imizes over all proper, simltaneosly or all X, nder assmptions that parallel those or classical shortest path problem. We reer to this as problem o robst shortest path selection (RSP or short. Note that in or problem, reaching destination starting rom every node is a reqirement, regardless o choices o hypotical antagonistic opponent. In or words imization in RSP is over proper policies only. 5

6 O corse or problem to have a easible soltion ths be meaningl, re mst eist at least one proper policy, this may be restrictive or a given problem. One may deal with where easibility is not nown to hold introdcing or every an artiicial teration action into U [i.e., a with Y, {t}], associated with very large length [i.e., g,, t g >> ]. Then policy that selects teration action at each is proper has cost nction g. In problem ths reormlated optimal cost over proper policies will be naected or all nodes or which re eists a proper policy with < g. Since or a proper, cost is bonded above nmber o nodes in X times largest arc length, a sitable vale o g is readily available. In Section we will ormlate RSP in a way that semicontractive DP ramewor can be applied. In Section, we will describe briely this ramewor we will qote reslts that will be sel to s. In Section 4, we will develop or main analytical reslts or RSP. In Section 5, we will discss algorithms o vale policy iteration type, izing corresponding algorithms o semicontractive DP, adapting available algorithms or stochastic shortest path. Among ors, we will give a Dijstralie algorithm or with nonnegative arc lengths, which terates in a nmber o iterations eqal to nmber o nodes in graph, has low order polynomial compleity. Related Dijstra-lie algorithms were proposed recently, in contet o dynamic games with an abbreviated convergence analysis, [Gr8] [BaL5].. MINIMAX ORMULATION In this section we will reormlate RSP into a ima problem, where given a policy, an antagonistic opponent selects a sccessor node y Y (, or each X, with aim o maimizing lengths o reslting paths. The essential dierence between RSP associated ima problem is that in RSP only proper policies admissible, while in ima problem all policies will be admissible. Or analysis will be based in part on assmptions nder which improper policies cannot be optimal or ima problem, implying that optimal policies or ima problem will be optimal or original RSP problem. One sch assmption is ollowing. Assmption.: (a There eists at least one proper policy. (b or every improper policy, all cycles in sbgraph o arcs A have positive length. The preceding assmption parallels generalizes typical assmptions in classical deteristic shortest path problem, i.e., case where Y, consists o a single node. Then condition (a is eqivalent to assg that each node is connected to destination with a path, while condition (b is eqivalent to assg that all directed cycles in graph have positive length. Later in Section 4, in addition to To veriy eistence o a proper policy [condition (a] one may apply a reachability algorithm, which constrcts seqence {N } o sets N N { X {t} re eists U with Y, N }, 6

7 Assmption., we will consider anor weaer assmption, where positive length is replaced with nonnegative length in condition (b above. This assmption will hold in case where all arc lengths g,, y nonnegative, bt re may eist a zero length cycle. As a irst step, we etend deinition o nction to case o an improper policy. Recall that or a proper policy, has been deined Eq. (., as length o longest path p P,, ma L (p, X. (. p P, We etend this deinition to any policy, proper or improper, deining as lim sp sp p P, L p(, (. where L p( is sm o lengths o irst arcs in path p. When is proper, this deinition coincides with one given earlier [c. Eq. (.]. Ths or a proper, is real-valed, it is niqe soltion o optimality eqation (or Bellman eqation or longest path problem associated with proper policy acyclic sbgraph o arcs A : where we denote nction given ma [ ( g,, y (y ] X, (. y Y, (y { (y i y X, i y t. (.4 Any shortest path algorithm may be sed to solve this longest path problem or a proper. However, when is improper, we may have, soltion o corresponding longest path problem may be problematic. We will consider problem o inding * M, X, (.5 a policy attaining imm above, simltaneosly or all X. Note that imization is over all policies, in contrast with RSP problem, where imization is over jst proper policies. Embedding Within an Abstract DP Model We will now reormlate ima problem o Eq. (.5 more abstractly, epressing it in terms o mapping that appears in Bellman s eqation (.-(.4, re bringing to bear ory o abstract DP. We denote E(X set o nctions : X [, ], R(X set o nctions starting with N {t} (see [Ber7], [BeR7]. A proper policy eists i only i this algorithm stops with a inal set N eqal to X {t}. I re is no proper policy, this algorithm will stop with N eqal to a strict sbset o X {t} o nodes starting rom which re eists a terating path nder some policy. The problem may n be reormlated over redced graph consisting o node set N, so re will eist a proper policy in this redced problem. 7

8 : X (,. Note that since X is inite, R(X can be viewed as a inite-dimensional Eclidean space. We introdce mapping H : X U E(X [, ] given H,, where or any E(X we denote nction given ma [ ] g,, y (y, (.6 y Y, (y { (y i y X, i y t. (.7 We consider or each policy, mapping T : E(X E(X, deined (T H (,,, X, (.8 we note that ied eqation T is identical to Bellman eqation (.. We also consider mapping T : E(X E(X deined (T H,,, X, (.9 U also eqivalently written as (T M (T, X. (. We denote T T -old compositions o mappings T T with mselves, respectively. Let s consider zero nction, which we denote :, X. Using Eqs. (.6-(.8, we see that or any M X, (T is reslt o -stage DP algorithm that comptes sp p P, L p(, length o longest path nder that starts at consists o arcs, so that (T sp p P, L p(, X. Ths deinition (. o can be written in alternative eqivalent orm lim sp (T, X. (. We ocsing on optimization over stationary policies becase nder assmptions o this paper (both Assmption. alternative assmptions o Section 4 optimal cost nction wold not be improved allowing nonstationary policies, as shown in [Ber], Chapter. In more general ramewor o [Ber], nonstationary Marov policies o orm π {,,...}, with M,,,..., allowed, ir cost nction is deined π lim sp (T T, X, where T T is composition o mappings T,..., T. Moreover, is deined as inimm o π over all sch π. However, nder assmptions o present paper, this inimm is attained a stationary policy (in act one that is proper. Hence, attention may be restricted to stationary policies withot loss o optimality withot aecting reslts rom [Ber] that will be sed. 8

9 The reslts that we will show nder Assmption. generalize main analytical reslts or classical deteristic shortest path problem, stated in abstract orm, ollowing: (a * is niqe ied o T within R(X, we have T * or all R(X. (b Only proper policies can be optimal, re eists an optimal proper policy. (c A policy is optimal i only i it attains imm or all X in Eq. (. when *. Proos o se reslts rom irst principles qite comple. However, airly easy proos can be obtained embedding problem o imizing nction o Eq. (. over M, within abstract semicontractive DP ramewor introdced in [Ber]. In particlar, we will se general reslts or this ramewor, which we will smmarize in net section.. SEMICONTRACTIVE DP ANALYSIS We will now view problem o imizing over M cost nction, given in abstract orm (., as a case o a semicontractive DP model. We irst provide a brie review o this model, with a notation that corresponds to one sed in preceding section. The starting is a set o states X, a set o controls U, a control constraint set U U or each X. or general ramewor o this section, X U arbitrary sets; we contine to se some o notation o preceding section in order to indicate relevant associations. A policy is a mapping : X U with U or all X, set o all policies is denoted M. or each policy, we given a mapping T : E(X E(X that is monotone in sense that or any two, E(X, T T. We deine mapping T : E(X E(X The cost nction o is deined as (T in M (T, X. lim sp (T, X, m where is some given nction in E(X. The objective is to ind * in M or each X, a policy sch that *, i one eists. Based on correspondences with Eqs. (.6-(., it can be seen that ima problem o preceding section is case o problem o this section, where X U inite sets, T is deined Eq. (.8, is zero nction. Since set o policies is inite, re eists a policy imizing over set o proper policies, or each X. However, assertion here is stronger, namely that re eists a proper imizing over all M simltaneosly or all X, i.e., a proper with. 9

10 In contractive models, mappings T assmed to be contractions, with respect to a weighted sp-norm with a contraction modls, in sbspace o nctions in E(X that bonded with respect to weighted sp-norm. These models have a strong analytical algorithmic ory, which dates to [Den67]; see also [BeS78], Ch., recent etensive treatments given in Chapters - o [Ber], Ch. o [Ber]. In semicontractive models, only some policies have a contraction-lie property. This property is captred notion o S-reglarity o a policy introdced in [Ber] deined as ollows. Deinition.: Given a set o nctions S E(X, we say that a policy is S-reglar i: (a S T. (b lim T or all S. A policy that is not S-reglar is called S-irreglar. Roghly, is S-reglar i is an asymptotically stable eqilibrim o T within S. An important case o an S-reglar is when S is a complete sbset o a metric space T maps S to S, when restricted to S, is a contraction with respect to metric o that space. There several dierent choices o S, which may be sel depending on contet, sch as or eample R(X, E(X, { R(X }, { E(X }, ors. There also several sets o assmptions corresponding reslts, which given in [Ber] will be sed to prove or analytical reslts or RSP problem. In this paper, we will se S R(X, bt or ease o reerence, we will qote reslts rom [Ber] with S being an arbitrary sbset o R(X. We give below an assmption relating to semicontractive models, which is Assmption.. o [Ber]. A ey part o this assmption is part (c, which implies that S-irreglar policies have ininite cost or at least one state, so y cannot be optimal. This part will provide a connection to Assmption.(b. Assmption.: hold: In semicontractive model o this section with a set S R(X ollowing (a S contains, has property that i, two nctions in S, n S contains all nctions with. (b The nction Ĵ given Ĵ in : S-reglar, X, belongs to S. (c or each S-irreglar policy each S, re is at least one state X sch that lim sp (T.

11 (d The control set U is a metric space, set { (T λ } is compact or every S, X, λ R. (e or each seqence { m } S with m or some S we have lim (T m (T, X, M. m ( or each nction S, re eists a nction S sch that T. The ollowing two propositions given in [Ber] as Prop... Lemma..4, respectively. Or analysis will be based on se two propositions. Proposition.: Let Assmption. hold. Then: (a The optimal cost nction * is niqe ied o T within set S. (b A policy is optimal i only i T * T *. Moreover, re eists an optimal S-reglar policy. (c We have T * or all S. (d or any S, i T we have *, i T we have *. Proposition.: Let Assmption.(b,(c,(d hold. Then: (a The nction Ĵ o Assmption.(b is niqe ied o T within S. (b Every policy satisying T Ĵ T Ĵ is optimal within set o S-reglar policies, i.e., is S-reglar Ĵ. Moreover, re eists at least one sch policy. The second proposition is sel or sitations where only some o conditions o Assmption. satisied, will be sel in proo o an important part o Prop. 4. in net section. As noted in preceding section, a more general problem is deined in [Ber], where nonstationary Marov policies allowed, is deined as inimm over se policies. However, nder or assmptions, attention may be restricted to stationary policies withot loss o optimality withot aecting validity o two propositions.

12 4. SEMICONTRACTIVE MODELS AND SHORTEST PATH PROBLEMS We will now apply preceding two propositions to ima ormlation o RSP problem: imizing over all M shortest path cost as given Eq. (. or both proper improper policies. We will irst derive some preliary reslts. The ollowing proposition clariies properties o when is improper. Proposition 4.: Let be an improper policy let be its cost nction as given Eq. (.. (a I all cycles in sbgraph o arcs A have nonpositive length, < or all X. (b I all cycles in sbgraph o arcs A have nonnegative length, > or all X. (c I all cycles in sbgraph o arcs A have zero length, is real-valed. (d I re is a positive length cycle in sbgraph o arcs A, we have or at least one node X. More generally, or each R(X, we have lim sp (T or at least one X. Proo: Any path with a inite nmber o arcs, can be decomposed into a simple path, a inite nmber o cycles (see e.g., path decomposition orem o [Ber98], Prop.., Eercise.4. Since re is only a inite nmber o simple paths nder, ir length is bonded above below. Ths in part (a length o all paths with a inite nmber o arcs is bonded above, in part (b it is bonded below, implying that < or all X or > or all X, respectively. Part (c ollows combining parts (a (b. To show part (d, consider a path p, which consists o an ininite repetition o positive length cycle that is assmed to eist. Let C(p be length o path that consists o irst cycles in p. Then C(p C(p or all [c. Eq. (.], where is irst node in cycle, ths implying that. Moreover or every R(X all, (T is maimm over lengths o -arc paths that start at, pls a teral cost that is eqal to eir (y (i teral node o -arc path is y X, or (i teral node o -arc path is destination. Ths we have, { } (T, (T. X Since lim sp (T as shown earlier, it ollows that lim sp (T or all R(X..E.D. Note that i re is a negative length cycle in sbgraph o arcs A, it is not necessarily tre that or some X we have. Even or on negative length cycle, vale o is detered longest path in P,, which may be simple in which case is a real nmber, or contain an ininite repetition o a positive length cycle in which case. A ey act in or analysis is ollowing characterization o notion o R(X-reglarity its connection to notion o properness. It shows that proper policies R(X-reglar, bt set o R(X- reglar policies may contain some improper policies, which characterized in terms o sign o lengths o ir associated cycles.

13 Proposition 4.: Consider ima ormlation o RSP problem, viewed as a case o abstract semicontractive DP model o Section. with T given Eqs. (.6-(.8, being zero nction. The ollowing eqivalent or a policy : (i is R(X-reglar. (ii The sbgraph o arcs A is destination-connected all its cycles have negative length. (iii is eir proper or else, i it is improper, all cycles o sbgraph o arcs A have negative length, R(X. Proo: To show that (i implies (ii, let be R(X-reglar to arrive at a contradiction, assme that A contains a nonnegative length cycle. Let be a node on cycle, consider path p that starts at consists o an ininite repetition o this cycle, let L (p be length o irst arcs o that path. Let also be a nonzero constant nction, r, where r is a scalar. Then we have L (p r (T, since rom deinition o T, we have that (T is maimm over lengths o all -arc paths nder starting at, pls r, i last node in path is not destination. Since is R(X-reglar, we have lim sp (T <, so that or all scalars r, lim sp ( L (p r <. Taing inimm over r R, it ollows that lim sp L (p, which contradicts nonnegativity o cycle o p. Ths all cycles o A have negative length. To show that A is destination-connected, assme contrary. Then re eists some node X sch that all paths in P, contain an ininite nmber o cycles. Since length o all cycles is negative, as jst shown, it ollows that, which contradicts R(X-reglarity o. To show that (ii implies (iii, we assme that is improper show that R(X. By (ii A is destination-connected, so set P, contains a simple path or all X. Moreover, since (ii cycles o A have negative length, each path in P, that is not simple has smaller length than some simple path in P,. This implies that is eqal to largest path length among simple paths in P,, so is a real nmber or all X. To show that (iii implies (i, we note that i is proper, it is R(X-reglar, so we ocs on case where is improper. Then (iii, R(X, so to show R(X-reglarity o, we mst show that (T or all X R(X, that T. Indeed, rom deinition o T, we have (T sp [ L (p p ], (4. p P, where p is node reached ater arcs along path p, (t is deined to be eqal to. Ths as, or every path p that contains an ininite nmber o cycles (each necessarily having negative length,

14 a t igre 4.. The sbgraph o arcs A corresponding to an improper policy, or case o a single node a destination node t. The arcs lengths shown in igre. seqence L p( p approaches. It ollows that or siciently large, spremm in Eq. (4. is attained one o simple paths in P,, so p t p. Ths limit o (T does not depend on, is eqal to limit o (T, i.e.,. To show that T, we note that preceding argment, is length o longest path among paths that start at terate at t. Moreover, we have [ (T ma g,, y (y ], y Y, where we denote (t. Ths (T is also length o longest path among paths that start at terate at t, hence it is eqal to..e.d. We illstrate preceding proposition with a two-node eample involving an improper policy with a cycle that may have positive, zero, or negative length. Eample 4.: Let X {}, consider policy where at state, antagonistic opponent may orce eir staying at or terating, i.e., Y (, ( {, t}. Then is improper since its sbgraph o arcs A contains sel-cycle (, ; c. ig. 4.. Let Then, g (, (, a, g (, (, t. (T ( ma [, a ( ], ( Consistently with Prop. 4., ollowing hold: { i a >, i a. (a or a >, cycle (, has positive length, is R(X-irreglar becase (. (b or a, cycle (, has zero length, is R(X-irreglar becase or a nction R(X with ( >, lim sp(t ( > (. (c or a <, cycle (, has negative length, is R(X-reglar becase (, we have R(X, ( ma [, a (] (T (, or all R(X, lim sp(t ( (. 4

15 We now show one o or main reslts. Proposition 4.: Let Assmption. hold. Then: (a The optimal cost nction * o RSP is niqe ied o T within R(X. (b A policy is optimal or RSP i only i T * T *. Moreover, re eists an optimal proper policy. (c We have T * or all R(X. (d or any R(X, i T we have *, i T we have *. Proo: We veriy parts (a-( o Assmption. with S R(X. The reslt n will be proved sing Prop... To this end we arge as ollows: ( Part (a is satisied since S R(X. ( Part (b is satisied since Assmption.(a, re eists at least one proper policy, which Prop. 4. is R(X-reglar. Moreover, or each R(X-reglar policy, we have R(X. Since nmber o all policies is inite, it ollows that Ĵ R(X. ( To show that part (c is satisied, note that since Prop. 4. every R(X-irreglar policy mst be improper, it ollows rom Assmption.(b that sbgraph o arcs A contains a cycle o positive length. By Prop. 4.(d, this implies that or each R(X, we have lim sp (T or at least one X. (4 Part (d is satisied since U is a inite set. (5 Part (e is satisied since X is inite T is a continos nction mapping inite-dimensional space R(X into itsel. (6 To show that part ( is satisied, we note that applying Prop.. with S R(X, we have that Ĵ is niqe ied o T within R(X. It ollows that or each R(X, re eists a siciently large scalar r > sch that nction given where e is nit nction, e, satisies as well as Ĵ re, X, (4. Ĵ re T Ĵ re T (Ĵ re T, (4. where ineqality holds in view o Eqs. (.6 (.9, act r >. Ths all parts o Assmption. with S R(X satisied, Prop.. applies with S R(X. Since nder Assmption., improper policies R(X-irreglar [c. Prop. 4.(d] so cannot be optimal, ima ormlation o Section is eqivalent to RSP, conclsions o Prop.. precisely reslts we want to prove..e.d. 5

16 a t t Improper policy Proper policy igre 4.. A contereample involving a single node in addition to destination t. There two policies,, with corresponding sbgraphs o arcs A A, arc lengths shown in igre. The improper policy is optimal when a. It is R(X-irreglar i a, it is R(X-reglar i a <. The ollowing variant o two-node Eample 4. illstrates what may happen in absence o Assmption.(b, when re may eist improper policies that involve a nonpositive length cycle. Eample 4.: Let X {}, consider improper policy with Y (, ( {, t} proper policy with Y (, ( {t} (c. ig. 4.. Let g (, (, a, g (, (, t, g (, (, t. Then it can be seen that nder both policies, longest path rom to t consists o arc (, t. Ths, (, (, so improper policy is optimal or ima problem (.5, strictly doates proper policy (which is optimal or RSP version o problem. To eplain what is happening here, we consider two dierent : ( a : In this case, optimal policy is both improper R(X-irreglar, bt with ( <. Ths conditions o both Props.. 4. do not hold becase Assmptions.(c Assmption.(b violated. ( a < : In this case, is improper bt R(X-reglar, so re no R(X-irreglar policies. Then all conditions o Assmption. satisied, Prop.. applies. Consistent with this proposition, re eists an optimal R(X-reglar policy (i.e., optimal over both proper improper policies, which however is improper hence not an optimal soltion or RSP. We will net discss modiications o Prop. 4., which address diiclties illstrated in two o preceding eample. The Case o Improper Policies with Negative Length Cycles We note that Prop. 4., set o R(X-reglar policies incldes not jst proper policies, bt also some improper ones (those or which A is destination-connected all its cycles have negative length. As 6

17 a reslt we can weaen Assmption. as long as it still implies Assmption. so we can se Prop.. to obtain corresponding versions o or main reslt o Prop. 4.. Here two sch weaer versions o Assmption.. Assmption 4.: Every policy is eir proper or else it is improper its sbgraph o arcs A is destination-connected with all cycles having negative length. rom Prop. 4., it ollows that preceding assmption is eqivalent to all policies being R(X-reglar. The net assmption is weaer in that it allows policies that R(X-irreglar, as long as some cycle o A has positive length. Assmption 4.: (a There eists at least one R(X-reglar policy. (b or every R(X-irreglar policy, some cycle in sbgraph o arcs A has positive length. Now essentially repeating proo o Prop. 4., we see that Assmption 4. implies Assmption., so that Prop.. applies. Then we obtain ollowing variant o Prop. 4.. Proposition 4.4: Let eir Assmption 4. or (more generally Assmption 4. hold. Then: (a The nction * o Eq. (. is niqe ied o T within R(X. (b A policy satisies *, where * is imm o over all M [c. Eq. (.5], i only i T * T *. Moreover, re eists an optimal R(X-reglar policy. (c We have T * or all R(X. (d or any R(X, i T we have *, i T we have *. It is important to note that optimal R(X-reglar policy o part (b above may not be proper, hence needs to be checed to ensre that it solves RSP problem (c. Eample 4. with a <. Ths one wold have to additionally prove that at least one o optimal R(X-reglar policies is proper in order or proposition to lly apply to RSP. The Case o Improper Policies with Zero Length Cycles In some, it may be easier to garantee nonnegativity rar than positivity o lengths o cycles corresponding to improper policies, which is reqired Assmption.(b. This is tre or eample in 7

18 important case where all arc lengths nonnegative, i.e., g,, y or all X, U, y Y,, as in case ( o Eample 4.. Let s consider ollowing relaation o Assmption.. Assmption 4.: (a There eists at least one proper policy. (b or every improper policy, all cycles in sbgraph o arcs A have nonnegative length. Note that similar to case o Assmption., we may garantee that part (a o preceding assmption is satisied introdcing a high cost teration action at each node. Then policy that terates at each state is proper. or an analysis nder preceding assmption, we will se a pertrbation approach that was introdced in Section.. o [Ber]. The idea is to consider a scalar δ > a δ-pertrbed problem, where each arc length g,, y with X is replaced g,, y δ. As a reslt, a nonnegative cycle length corresponding to an improper policy as per Assmption 4.(b becomes strictly positive, so Assmption. is satisied or δ-pertrbed problem, Prop. 4. applies. We ths see that δ *, optimal cost nction o δ-pertrbed problem, is niqe ied o mapping T δ given (T δ U H δ,,, X, where H δ,, is given H δ,, H,, δ. Moreover re eists an optimal proper policy δ or δ-pertrbed problem, which Prop. 4.(b, satisies optimality eqation T δ,δ δ * T δ δ *, where T,δ is mapping that corresponds to a policy in δ-pertrbed problem: ( (T,δ H δ,,, X. We have ollowing proposition. Proposition 4.5: policies only, Then: Let Assmption 4. hold, let Ĵ be optimal cost nction over proper Ĵ, X. : proper (a Ĵ lim δ * δ. 8

19 (b Ĵ is niqe ied o T within set { R(X Ĵ}. (c We have T Ĵ or every R(X with Ĵ. (d Let be a proper policy. Then is optimal within class o proper policies (i.e., Ĵ i only i T Ĵ T Ĵ. (e There eists δ > sch that or all δ (, δ], an optimal policy or δ-pertrbed problem is an optimal proper policy or original RSP. Proo: (a or all δ >, consider an optimal proper policy δ o δ-pertrbed problem, i.e., one with cost δ,δ δ *. We have Ĵ δ δ,δ * δ,δ Nδ, : proper, where N is nmber o nodes o X (since an etra δ cost is incrred in δ-pertrbed problem every time a path goes throgh a node t, any path nder a proper contains at most N nodes t. By taing limit as δ n imm over all that proper, it ollows that so lim δ * δ Ĵ. Ĵ lim δ * δ Ĵ, (b or all proper, we have T T Ĵ T Ĵ. Taing imm over proper, we obtain Ĵ T Ĵ. Conversely, or all δ > M, we have * δ T * δ δe T * δ δe. Taing limit as δ, sing part (a, we obtain Ĵ T Ĵ or all M. Taing imm over M, it ollows that Ĵ T Ĵ. Ths Ĵ is a ied o T. The niqeness o Ĵ will ollow once we prove part (c. (c or all R(X with Ĵ proper policies, we have sing relation Ĵ T Ĵ jst shown in part (b, Ĵ lim T Ĵ lim T lim T. Taing imm over all proper, we obtain Ĵ lim T Ĵ, Ĵ. (d I is a proper policy with Ĵ, we have Ĵ T T Ĵ, so, sing also relation Ĵ T Ĵ [c. part (a], we obtain T Ĵ T Ĵ. Conversely, i satisies T Ĵ T Ĵ, n rom part (a, we have T Ĵ Ĵ hence lim T Ĵ Ĵ. Since is proper, we have lim T Ĵ, so Ĵ. (e or every proper policy we have lim δ,δ. Hence i a proper is not optimal or RSP, it is also nonoptimal or δ-pertrbed problem or all δ [, δ ], where δ is some positive scalar. Let δ be imm δ over nonoptimal proper policies. Then or δ (, δ], an optimal policy or δ-pertrbed problem cannot be nonoptimal or RSP..E.D. 9

20 Note that we may have * < Ĵ or some, bt in RSP only proper policies admissible, so letting δ we approach optimal soltion o interest. This happens or instance in Eample 4. when a. or eample Ĵ (not * can be obtained as limit o T, starting rom Ĵ [c. part (c]. The ollowing eample describes an interesting problem, where Prop. 4.5 applies. Eample 4.: (Minima Search s Consider searching a graph with node set X {t}, looing or an optimal node X at which to stop. At each X we have two options: ( stopping at a cost s, which will stop search moving to t, or ( contining search choosing a control U, in which case we will move to a node y, chosen rom within a given set o nodes Y, an antagonistic opponent, at a cost g,, y. Then Assmption 4. holds, since re eists a proper policy ( one that stops at every. An interesting case is when stopping costs s all nonnegative, while searching is cost-ree [i.e., g,, y ], bt may lead in tre to nodes where a higher stopping cost will become inevitable. Then a policy that never stops is optimal bt improper, bt i we introdce a small pertrbation δ > to costs g,, y, we will mae lengths o all cycles positive, Prop. 4.5 may be sed to ind an optimal policy within class o proper policies. Note that this is an eample where we really interested in solving RSP problem (where only proper policies admissible, not its ima version (where all policies admissible. 5. COMPUTATIONAL METHODS We will now discss comptational s that patterned ater classical DP algorithms o vale iteration policy iteration (VI PI or short, respectively. In particlar, s o this section motivated ized stochastic shortest path algorithms. 5.. Vale Iteration Algorithms We have already shown as part o Prop. 4. that nder Assmption., VI algorithm, which seqentially generates T or, converges to optimal cost nction * or any starting nction R(X. We have also shown as part o Prop. 4.5 that nder Assmption 4., VI seqence T or, converges to Ĵ, optimal cost nction over proper policies only, or any starting nction Ĵ. We can etend se convergence properties to asynchronos versions o VI based on monotonicity ied properties o mapping T. This has been nown since paper [Ber8] (see also [Ber8], [BeT89], we reer to discssions in Sections.6.,.. o [Ber], which apply in ir entirety when ized to RSP problem o this paper. It trns ot that or or problem, nder Assmption. or Assmption 4., VI algorithm also terates initely when initialized with or all X [it can be seen that in view o orm (.9 o mapping T, VI algorithm is well-deined with this initialization]. In act nmber o iterations or teration is no more than N, where N is nmber o nodes in X, leading to polynomial compleity. This is consistent with a similar reslt or stochastic shortest path ([Ber], Section.4., which relies on assmption o acyclicity o graph o possible transitions nder an optimal policy. Becase this assmption is restrictive, inite teration o VI algorithm is an eceptional property in stochastic shortest path. However, in ima case o this paper, an optimal policy eists is

21 proper [c. Prop. 4.(b or Prop. 4.5(e], so graph o possible transitions nder is acyclic, it trns ot that inite teration o VI is garanteed to occr. Note that in deteristic shortest path initialization or all X, leads to polynomial compleity, generally wors better in practice that or initializations (sch as < *, or which compleity is only psedopolynomial, c. [BeT89], Section 4., Prop... To show inite teration property jst described, let be an optimal proper policy, consider sets o nodes X, X,..., deined X {t}, { X / m X m y m X m or all y Y (, },,,..., (5. let X be last o se sets that is nonempty. Then in view o acyclicity o sbgraph o arcs A, we have m X m X {t}. We will now show indction that starting rom or all X, iterates T o VI satisy (T *, m X m,,...,. (5. Indeed, it can be seen that this is so or. Assme that (T * i m X m. Then, since T T is monotone, (T is monotonically nonincreasing, so that * (T, X. (5. Moreover, indction hyposis, deinition o sets X, optimality o, we have (T H (,, T H (,, * *, m X m, (5.4 where irst eqality ollows rom orm (.6 o H act that or all m X m, we have y m X m or all y Y (, deinition (5. o X. The two relations (5. (5.4 complete indction proo. Ths nder Assmption., VI when started with or all X, will ind optimal costs o all nodes in set m X m ater iterations; c. Eq. (5.. The is tre nder Assmption 4., ecept that will ind corresponding optimal costs over proper policies. In particlar, all optimal costs will be ond ater N iterations, where N is nmber o nodes in X. This indicates that behavior o VI algorithm, when initialized with or all X, is similar to one o Bellman-ord algorithm or deteristic shortest path. Still each iteration o VI algorithm reqires as many as N applications o mapping T at every node. Ths it is liely that perormance o VI algorithm can be improved with a sitable choice o initial nction, with an asynchronos implementation that ses a avorable order o selecting nodes or iteration, one-nodeat-a-time similar to Gass-Seidel. This is consistent with deteristic shortest path case, where re VI-type algorithms, within class o label-correcting s, which aster than Bellman-ord algorithm even aster than eicient implementations o Dijstra algorithm or some types o ; see e.g., [Ber98]. or RSP problem, it can be seen that node selection order is based on sets X deined Eq. (5., i.e., iterate on nodes in set X, n on nodes in X, so on. In this case, only one iteration per node will be needed. While sets X not nown, an algorithm that tries to approimate optimal order cold be mch more eicient that stard all-nodes-at-once VI that comptes seqence T, or (or an eample o an algorithm o this type or stochastic shortest path, see [PBT98]. The development o sch more eicient VI algorithms is an interesting sbject or rr research, which, however, is beyond scope o present paper.

22 a 4a 4 a a4 4 a4 4 RSP 4 t b c 4 4 a t b c 4 a RSP RSP t 4b c Optimal 4 Proper 4 t baac t ba ca RSP 5 RSP 5Optimal a 5 4 Proper 4Optimal 4Proper Proper t b c Optimal t b c Optimal 4Proper 4 t b c b a 5a t b crsp RSP t γ γ c γγ c RSP!RSP " t bc Optimal Proper Proper( 4 Optimal c, arsp ( Optimal Proper Optimal Proper! " "!!c, a "( " "! Proper 4 Optimal (! ( a( ( b c at! arsp "c, c, a ( ( c, ( a ( c, a ( bc,a ( ( ( t b c a t( b Optimal Proper 4 b ( ( ( b ( a 4!b " (b ""( (! (!! " ( t ( ( ( ( a c,aab ( ( ( RSP c,( ( c, ( c, ab b( c! " 4 Proper! " ( c, a ( Optimal (bb ( ( ( 4 ( 4 tbc ( a RSP b c, ab! ( ( a θ a ( ( " θ 4 ( c, a ( θ b( Proper t b c t b c ( θ ( t θbcoptimal θ ( b RSP θ θ θ θ ( θ θ Proper θ a ( b! ( " θ Optimal t b Evalation ( c, a ( θ θ θ θ "!! " θ θ θ! θ ( c," a ( ( c, ( a ( ( c, a θ a t θ ( b ( a t b θc b θ! " ( Evalation ( Evalation ( b ( b ( c, a ( Evalation Ctting Evalation ( Improper policy θ θ b ( Evalation Evalation ( b ( policy " ν Proper ν! Evalation ( c, a ( ν VI ν Evalation ν Iteration # All-Nodes-at-Once Method One-Node-at-a-Time Method Evalation VI Evalation ν θ θ ν ν ν Ctting θ θctting ( θevalation ( b ν Evalation ν ν νθ Ctting Ctting ν (,,ν,, (, ν, ν θ θ Evalation Ctting Ctting Evalation T (, (,, 5 θ,, θ Ctting, ν ν ν Ctting ν N Ctting Ctting Evalation T (, 5, 6, ν (,ν,, Ctting Ctting T θ (, θ, o N. 5,, (,, ν ν Evalation Ctting 4 N Evalation Ctting 4 4 T policy (,, Evalation 4,, Evalation 4 (, Improper 4, N 99s N N Ctting Proper policy Evalation o N. ν ν N N ν ν ν N o N. Evalation N N N o N. o N. igre 5.: An eample RSP νproblem its optimal policy. At each X {,,, 4} re Ctting ν 99s Ctting o N. o N. o N. Ctting Ctting o N. 99s ν ν two controls: one (shown a solid line where Y, consists o a single element, anor s as. N o N. N o N. 99s Y (shown a broen line where, 99s has two elements. Arc lengths shown net to arcs. Ctting N 99s N 99s 99s 99s Both all-nodes-at-once one-node-at-a-time versions o VI terate in or iterations, Ctting Nondierentiability eatres. o99s N. o N. 99s reqires or N bt latter version times less comptation per iteration. s as. o N. o N. s as. 99s 99s. s as. s as N o N. s as. Nondierentiability eatres. s as. 99s 99s N Eample 5.: s as. s as. N N Dal Nondifferentiability eatres. s as. s as. o N. Nondifferentiability eatres. Nondifferentiability Nondifferentiability eatres. 99s eatres. Nondifferentiability eatres. N o Nondifferentiability eatres. Nondifferentiability eatres. Let s illstrate VI or problem o ig. 5.. The optimal policy is shown inn. this igre, o N. o N. Vertical Distances N s as. Nondifferentiability eatres. s as. Nondifferentiability eatres. 99s it is proper; this is consistent with act that Assmption 4. is satisied. table gives iteration The Dal o N. as. s ass. 99s Crossing Point Dierentials 99s 99s seqence o two VI Dal s, starting with (,,,. The irst is all-nodes-at-once o N. Nondifferentiability eatres. Nondifferentiability eatres. Dal Dal Dal Dal as. T, which inds in or iterations. In 99s this eample, we have X {t}, X {}, Xs {4}, Vertical Distances Nondierentiability eatres. Nondifferentiability eatres. Dal Dal Vales Crossing s (y X {}, X4 {},Vertical assertion o Eq. (5. may be veriied. The second Distances 99s is asynchronos Dal DalVertical s as. Vertical Distances Distances Nondifferentiability Crossing Point Dierentials VI, which iterates one-node-at-a-time in (most avorable order, 4,,. The second also eatres. y s as. (y (y ( y ( as s. s as. Dal inds in or iterations with or times less comptation. Dal Nondierentiability eatres. CrossingVales Point Differentials Crossing Point Differentials Crossing s (y Dal ys as. Dal Slope y Slope Nondifferentiability eatres.. eatres. Nondifferentiability Nondifferentiability eatres. o s We inally note that in absence Assmption. or Assmption 4., it isas possible that VI Dal (y (y Vertical ( y ( y can eatres.. Distances seqence {T } will not converge to Nondifferentiability starting rom any with 6 This be seen with a simple Nondifferentiability eatres. Dal deteristic shortest path problem involving a zero length version o Eample 4.. Here Slope ycycle, SlopeaCrossing y simplerpoint Dierentials Dal Dal Dal Vertical Distances Crossing s (y Vales Dal Distances Vertical Distances Dal Vertical Crossing Point Dierentials (y (y ( y ( y CrossingVales PointDifferentials Crossing Point Differentials Vertical Distances Crossing s (y Slope y Slope y

23 re is a single node, aside rom destination t, two choices at : stay at at cost, move to t at cost. Then we have, while T is given T {, }. It can be seen that set o ied s o T is (, ], contains in its interior. Starting with, VI seqence converges to in a single step, while starting at it stays at. This is consistent with Prop. 4.5(c, since in this eample Assmption 4. holds, we have Ĵ. In case o Eample 4. with a, sitation is somewhat dierent bt qalitatively similar. There it can be veriied that, set o ied s is [, ], {T } will converge to starting rom, will converge to starting rom, will stay at starting rom [, ]. 5.. Iteration Algorithms The development o PI algorithms or RSP problem is straightorward given connection with semicontractive models. Briely, nder Assmption., based on analysis o Section.. o [Ber], re two types o PI algorithms. The irst is a natral orm o PI that generates proper policies eclsively. Let be an initial proper policy (re eists one assmption. At typical iteration, we have a proper policy, irst compte solving a longest path problem over corresponding acyclic sbgraph o arcs A. We n compte a policy sch that T T, imizing over U epression H,, o Eq. (.6, or all X. We have T T T lim m T m, (5.5 where second ineqality ollows rom monotonicity o T, last eqality is jstiied becase is proper hence R(X, so net policy cannot be improper [in view o Assmption.(b Prop. 4.(d]. In conclsion mst be proper has improved cost over. Ths seqence o policies { } is well-deined proper, corresponding seqence { } is nonincreasing. It n ollows that converges to * in a inite nmber o iterations. The reason is that rom Eq. (5.5, we have that at th iteration, eir strict improvement > (T is obtained or at least one node X, or else T, which implies that * [since * is niqe ied o T within R(X, Prop. 4.(a] is an optimal proper policy. Unortnately, when re improper policies, preceding PI algorithm is somewhat limited, becase an initial proper policy may not be nown, also becase when asynchronos versions o algorithm implemented, it is diiclt to garantee that all generated policies proper. There is anor algorithm, combining vale policy iterations, which has been developed in [BeY], [BeY], [YBa], [YBb] or a variety o DP models, inclding disconted, stochastic shortest path, abstract, is described in Sections.6... o [Ber]. This algorithm pdates a cost nction a policy, bt it also maintains an additional nction V, which acts as a threshold to eep bonded algorithm convergent. The algorithm not only can tolerate presence o improper policies, bt can also be operated in asynchronos mode, where vale iterations, policy evalation operations, policy improvement iterations perormed one-node-at-a-time withot any reglarity. The algorithm is

Complex Variables. For ECON 397 Macroeconometrics Steve Cunningham

Complex Variables. For ECON 397 Macroeconometrics Steve Cunningham Comple Variables For ECON 397 Macroeconometrics Steve Cnningham Open Disks or Neighborhoods Deinition. The set o all points which satis the ineqalit

More information

Integration of Basic Functions. Session 7 : 9/23 1

Integration of Basic Functions. Session 7 : 9/23 1 Integration o Basic Fnctions Session 7 : 9/3 Antiderivation Integration Deinition: Taking the antiderivative, or integral, o some nction F(), reslts in the nction () i ()F() Pt simply: i yo take the integral

More information

Gradient Projection Anti-windup Scheme on Constrained Planar LTI Systems. Justin Teo and Jonathan P. How

Gradient Projection Anti-windup Scheme on Constrained Planar LTI Systems. Justin Teo and Jonathan P. How 1 Gradient Projection Anti-windp Scheme on Constrained Planar LTI Systems Jstin Teo and Jonathan P. How Technical Report ACL1 1 Aerospace Controls Laboratory Department of Aeronatics and Astronatics Massachsetts

More information

Worst-case analysis of the LPT algorithm for single processor scheduling with time restrictions

Worst-case analysis of the LPT algorithm for single processor scheduling with time restrictions OR Spectrm 06 38:53 540 DOI 0.007/s009-06-043-5 REGULAR ARTICLE Worst-case analysis of the LPT algorithm for single processor schedling with time restrictions Oliver ran Fan Chng Ron Graham Received: Janary

More information

Classify by number of ports and examine the possible structures that result. Using only one-port elements, no more than two elements can be assembled.

Classify by number of ports and examine the possible structures that result. Using only one-port elements, no more than two elements can be assembled. Jnction elements in network models. Classify by nmber of ports and examine the possible strctres that reslt. Using only one-port elements, no more than two elements can be assembled. Combining two two-ports

More information

m = Average Rate of Change (Secant Slope) Example:

m = Average Rate of Change (Secant Slope) Example: Average Rate o Change Secant Slope Deinition: The average change secant slope o a nction over a particlar interval [a, b] or [a, ]. Eample: What is the average rate o change o the nction over the interval

More information

TOWARD THE FORMAL THEORY OF (, n)-categories

TOWARD THE FORMAL THEORY OF (, n)-categories TOWRD THE FORML THEORY OF (, n)-ctegories EMILY RIEHL (JOINT WORK WITH DOMINIC VERITY) bstract. Formal category theory reers to a commonly applicable ramework (i) or deining standard categorical strctres

More information

1. Tractable and Intractable Computational Problems So far in the course we have seen many problems that have polynomial-time solutions; that is, on

1. Tractable and Intractable Computational Problems So far in the course we have seen many problems that have polynomial-time solutions; that is, on . Tractable and Intractable Comptational Problems So far in the corse we have seen many problems that have polynomial-time soltions; that is, on a problem instance of size n, the rnning time T (n) = O(n

More information

LIGHTWEIGHT STRUCTURES in CIVIL ENGINEERING - CONTEMPORARY PROBLEMS

LIGHTWEIGHT STRUCTURES in CIVIL ENGINEERING - CONTEMPORARY PROBLEMS ITERATIOAL SEMIAR Organized by Polish Chapter o International Association or Shell and Spatial Strctres LIGHTWEIGHT STRUCTURES in CIVIL EGIEERIG - COTEMPORARY PROBLEMS STOCHASTIC CORROSIO EFFECTS O RELIABILITY

More information

Chapter 2 Difficulties associated with corners

Chapter 2 Difficulties associated with corners Chapter Difficlties associated with corners This chapter is aimed at resolving the problems revealed in Chapter, which are cased b corners and/or discontinos bondar conditions. The first section introdces

More information

Figure 1 Probability density function of Wedge copula for c = (best fit to Nominal skew of DRAM case study).

Figure 1 Probability density function of Wedge copula for c = (best fit to Nominal skew of DRAM case study). Wedge Copla This docment explains the constrction and properties o a particlar geometrical copla sed to it dependency data rom the edram case stdy done at Portland State University. The probability density

More information

Elements of Coordinate System Transformations

Elements of Coordinate System Transformations B Elements of Coordinate System Transformations Coordinate system transformation is a powerfl tool for solving many geometrical and kinematic problems that pertain to the design of gear ctting tools and

More information

Higher Maths A1.3 Recurrence Relations - Revision

Higher Maths A1.3 Recurrence Relations - Revision Higher Maths A Recrrence Relations - Revision This revision pack covers the skills at Unit Assessment exam level or Recrrence Relations so yo can evalate yor learning o this otcome It is important that

More information

Stochastic Shortest Path Problems

Stochastic Shortest Path Problems Chapter 8 Stochastic Shortest Path Problems 1 In this chapter, we study a stochastic version of the shortest path problem of chapter 2, where only probabilities of transitions along different arcs can

More information

1 Undiscounted Problem (Deterministic)

1 Undiscounted Problem (Deterministic) Lectre 9: Linear Qadratic Control Problems 1 Undisconted Problem (Deterministic) Choose ( t ) 0 to Minimize (x trx t + tq t ) t=0 sbject to x t+1 = Ax t + B t, x 0 given. x t is an n-vector state, t a

More information

REINFORCEMENT LEARNING AND OPTIMAL ADAPTIVE CONTROL

REINFORCEMENT LEARNING AND OPTIMAL ADAPTIVE CONTROL Lewis c11.tex V1-10/19/2011 4:10pm Page 461 11 REINFORCEMENT LEARNING AND OPTIMAL ADAPTIVE CONTROL In this book we have presented a variety of methods for the analysis and design of optimal control systems.

More information

Subcritical bifurcation to innitely many rotating waves. Arnd Scheel. Freie Universitat Berlin. Arnimallee Berlin, Germany

Subcritical bifurcation to innitely many rotating waves. Arnd Scheel. Freie Universitat Berlin. Arnimallee Berlin, Germany Sbcritical bifrcation to innitely many rotating waves Arnd Scheel Institt fr Mathematik I Freie Universitat Berlin Arnimallee 2-6 14195 Berlin, Germany 1 Abstract We consider the eqation 00 + 1 r 0 k2

More information

Chapter 9 Flow over Immersed Bodies

Chapter 9 Flow over Immersed Bodies 57:00 Mechanics o Flids and Transport Processes Chapter 9 Proessor Fred Stern Fall 01 1 Chapter 9 Flow over Immersed Bodies Flid lows are broadly categorized: 1. Internal lows sch as dcts/pipes, trbomachinery,

More information

Bertrand s Theorem. October 8, µr 2 + V (r) 0 = dv eff dr. 3 + dv. f (r 0 )

Bertrand s Theorem. October 8, µr 2 + V (r) 0 = dv eff dr. 3 + dv. f (r 0 ) Bertrand s Theorem October 8, Circlar orbits The eective potential, V e = has a minimm or maximm at r i and only i so we mst have = dv e L µr + V r = L µ 3 + dv = L µ 3 r r = L µ 3 At this radis, there

More information

Bridging the Gap Between Multigrid, Hierarchical, and Receding-Horizon Control

Bridging the Gap Between Multigrid, Hierarchical, and Receding-Horizon Control Bridging the Gap Between Mltigrid, Hierarchical, and Receding-Horizon Control Victor M. Zavala Mathematics and Compter Science Division Argonne National Laboratory Argonne, IL 60439 USA (e-mail: vzavala@mcs.anl.gov).

More information

Section 7.4: Integration of Rational Functions by Partial Fractions

Section 7.4: Integration of Rational Functions by Partial Fractions Section 7.4: Integration of Rational Fnctions by Partial Fractions This is abot as complicated as it gets. The Method of Partial Fractions Ecept for a few very special cases, crrently we have no way to

More information

3. Several Random Variables

3. Several Random Variables . Several Random Variables. To Random Variables. Conditional Probabilit--Revisited. Statistical Independence.4 Correlation beteen Random Variables Standardied (or ero mean normalied) random variables.5

More information

The Linear Quadratic Regulator

The Linear Quadratic Regulator 10 The Linear Qadratic Reglator 10.1 Problem formlation This chapter concerns optimal control of dynamical systems. Most of this development concerns linear models with a particlarly simple notion of optimality.

More information

Conditions for Approaching the Origin without Intersecting the x-axis in the Liénard Plane

Conditions for Approaching the Origin without Intersecting the x-axis in the Liénard Plane Filomat 3:2 (27), 376 377 https://doi.org/.2298/fil7276a Pblished by Faclty of Sciences and Mathematics, University of Niš, Serbia Available at: http://www.pmf.ni.ac.rs/filomat Conditions for Approaching

More information

Universal Scheme for Optimal Search and Stop

Universal Scheme for Optimal Search and Stop Universal Scheme for Optimal Search and Stop Sirin Nitinawarat Qalcomm Technologies, Inc. 5775 Morehose Drive San Diego, CA 92121, USA Email: sirin.nitinawarat@gmail.com Vengopal V. Veeravalli Coordinated

More information

Robust Tracking and Regulation Control of Uncertain Piecewise Linear Hybrid Systems

Robust Tracking and Regulation Control of Uncertain Piecewise Linear Hybrid Systems ISIS Tech. Rept. - 2003-005 Robst Tracking and Reglation Control of Uncertain Piecewise Linear Hybrid Systems Hai Lin Panos J. Antsaklis Department of Electrical Engineering, University of Notre Dame,

More information

1 Differential Equations for Solid Mechanics

1 Differential Equations for Solid Mechanics 1 Differential Eqations for Solid Mechanics Simple problems involving homogeneos stress states have been considered so far, wherein the stress is the same throghot the component nder std. An eception to

More information

Quasi Steady State Modelling of an Evaporator

Quasi Steady State Modelling of an Evaporator Qasi Steady State Modelling o an Evaporator Ed. Eitelberg NOY Bsiness 58 Baines Road, Drban 400, RSA controle@pixie.dw.ac.za Ed. Boje Electrical, Electronic & Compter Eng. University o Natal, Drban 404,

More information

Chapter 3. Preferences and Utility

Chapter 3. Preferences and Utility Chapter 3 Preferences and Utilit Microeconomics stdies how individals make choices; different individals make different choices n important factor in making choices is individal s tastes or preferences

More information

Lecture 8: September 26

Lecture 8: September 26 10-704: Information Processing and Learning Fall 2016 Lectrer: Aarti Singh Lectre 8: September 26 Note: These notes are based on scribed notes from Spring15 offering of this corse. LaTeX template cortesy

More information

Move Blocking Strategies in Receding Horizon Control

Move Blocking Strategies in Receding Horizon Control Move Blocking Strategies in Receding Horizon Control Raphael Cagienard, Pascal Grieder, Eric C. Kerrigan and Manfred Morari Abstract In order to deal with the comptational brden of optimal control, it

More information

A Macroscopic Traffic Data Assimilation Framework Based on Fourier-Galerkin Method and Minimax Estimation

A Macroscopic Traffic Data Assimilation Framework Based on Fourier-Galerkin Method and Minimax Estimation A Macroscopic Traffic Data Assimilation Framework Based on Forier-Galerkin Method and Minima Estimation Tigran T. Tchrakian and Sergiy Zhk Abstract In this paper, we propose a new framework for macroscopic

More information

Optimization via the Hamilton-Jacobi-Bellman Method: Theory and Applications

Optimization via the Hamilton-Jacobi-Bellman Method: Theory and Applications Optimization via the Hamilton-Jacobi-Bellman Method: Theory and Applications Navin Khaneja lectre notes taken by Christiane Koch Jne 24, 29 1 Variation yields a classical Hamiltonian system Sppose that

More information

B-469 Simplified Copositive and Lagrangian Relaxations for Linearly Constrained Quadratic Optimization Problems in Continuous and Binary Variables

B-469 Simplified Copositive and Lagrangian Relaxations for Linearly Constrained Quadratic Optimization Problems in Continuous and Binary Variables B-469 Simplified Copositive and Lagrangian Relaxations for Linearly Constrained Qadratic Optimization Problems in Continos and Binary Variables Naohiko Arima, Snyong Kim and Masakaz Kojima October 2012,

More information

Discontinuous Fluctuation Distribution for Time-Dependent Problems

Discontinuous Fluctuation Distribution for Time-Dependent Problems Discontinos Flctation Distribtion for Time-Dependent Problems Matthew Hbbard School of Compting, University of Leeds, Leeds, LS2 9JT, UK meh@comp.leeds.ac.k Introdction For some years now, the flctation

More information

4.2 First-Order Logic

4.2 First-Order Logic 64 First-Order Logic and Type Theory The problem can be seen in the two qestionable rles In the existential introdction, the term a has not yet been introdced into the derivation and its se can therefore

More information

Lecture Notes On THEORY OF COMPUTATION MODULE - 2 UNIT - 2

Lecture Notes On THEORY OF COMPUTATION MODULE - 2 UNIT - 2 BIJU PATNAIK UNIVERSITY OF TECHNOLOGY, ODISHA Lectre Notes On THEORY OF COMPUTATION MODULE - 2 UNIT - 2 Prepared by, Dr. Sbhend Kmar Rath, BPUT, Odisha. Tring Machine- Miscellany UNIT 2 TURING MACHINE

More information

Formal Methods for Deriving Element Equations

Formal Methods for Deriving Element Equations Formal Methods for Deriving Element Eqations And the importance of Shape Fnctions Formal Methods In previos lectres we obtained a bar element s stiffness eqations sing the Direct Method to obtain eact

More information

Optimal Control of a Heterogeneous Two Server System with Consideration for Power and Performance

Optimal Control of a Heterogeneous Two Server System with Consideration for Power and Performance Optimal Control of a Heterogeneos Two Server System with Consideration for Power and Performance by Jiazheng Li A thesis presented to the University of Waterloo in flfilment of the thesis reqirement for

More information

APPENDIX B MATRIX NOTATION. The Definition of Matrix Notation is the Definition of Matrix Multiplication B.1 INTRODUCTION

APPENDIX B MATRIX NOTATION. The Definition of Matrix Notation is the Definition of Matrix Multiplication B.1 INTRODUCTION APPENDIX B MAIX NOAION he Deinition o Matrix Notation is the Deinition o Matrix Mltiplication B. INODUCION { XE "Matrix Mltiplication" }{ XE "Matrix Notation" }he se o matrix notations is not necessary

More information

Weighted Sup-Norm Contractions in Dynamic Programming: A Review and Some New Applications

Weighted Sup-Norm Contractions in Dynamic Programming: A Review and Some New Applications May 2012 Report LIDS - 2884 Weighted Sup-Norm Contractions in Dynamic Programming: A Review and Some New Applications Dimitri P. Bertsekas Abstract We consider a class of generalized dynamic programming

More information

Second-Order Wave Equation

Second-Order Wave Equation Second-Order Wave Eqation A. Salih Department of Aerospace Engineering Indian Institte of Space Science and Technology, Thirvananthapram 3 December 016 1 Introdction The classical wave eqation is a second-order

More information

FEA Solution Procedure

FEA Solution Procedure EA Soltion Procedre (demonstrated with a -D bar element problem) EA Procedre for Static Analysis. Prepare the E model a. discretize (mesh) the strctre b. prescribe loads c. prescribe spports. Perform calclations

More information

J. Basic. Appl. Sci. Res., 3(2s) , , TextRoad Publication

J. Basic. Appl. Sci. Res., 3(2s) , , TextRoad Publication , TetRoad Pblication ISSN 9-44 Jornal o Basic and Applied Scientiic Research www.tetroad.com A Comparison among Homotopy Pertrbation Method and the Decomposition Method with the Variational Iteration Method

More information

Robust Shortest Path Planning and Semicontractive Dynamic Programming

Robust Shortest Path Planning and Semicontractive Dynamic Programming erry 4 (Revised nry 5 Report LIDS - 95 Rost Shortest Pth Plnning Semicontrctive Dynmic Progrmg Dimitri P. Bertses Astrct In this pper we consider shortest pth in directed grph where trnsitions etween nodes

More information

A Theory of Markovian Time Inconsistent Stochastic Control in Discrete Time

A Theory of Markovian Time Inconsistent Stochastic Control in Discrete Time A Theory of Markovian Time Inconsistent Stochastic Control in Discrete Time Tomas Björk Department of Finance, Stockholm School of Economics tomas.bjork@hhs.se Agatha Mrgoci Department of Economics Aarhs

More information

BLOOM S TAXONOMY. Following Bloom s Taxonomy to Assess Students

BLOOM S TAXONOMY. Following Bloom s Taxonomy to Assess Students BLOOM S TAXONOMY Topic Following Bloom s Taonomy to Assess Stdents Smmary A handot for stdents to eplain Bloom s taonomy that is sed for item writing and test constrction to test stdents to see if they

More information

Cubic graphs have bounded slope parameter

Cubic graphs have bounded slope parameter Cbic graphs have bonded slope parameter B. Keszegh, J. Pach, D. Pálvölgyi, and G. Tóth Agst 25, 2009 Abstract We show that every finite connected graph G with maximm degree three and with at least one

More information

RESOLUTION OF INDECOMPOSABLE INTEGRAL FLOWS ON A SIGNED GRAPH

RESOLUTION OF INDECOMPOSABLE INTEGRAL FLOWS ON A SIGNED GRAPH RESOLUTION OF INDECOMPOSABLE INTEGRAL FLOWS ON A SIGNED GRAPH BEIFANG CHEN, JUE WANG, AND THOMAS ZASLAVSKY Abstract. It is well-known that each nonnegative integral flow of a directed graph can be decomposed

More information

Chapter 3 MATHEMATICAL MODELING OF DYNAMIC SYSTEMS

Chapter 3 MATHEMATICAL MODELING OF DYNAMIC SYSTEMS Chapter 3 MATHEMATICAL MODELING OF DYNAMIC SYSTEMS 3. System Modeling Mathematical Modeling In designing control systems we mst be able to model engineered system dynamics. The model of a dynamic system

More information

FREQUENCY DOMAIN FLUTTER SOLUTION TECHNIQUE USING COMPLEX MU-ANALYSIS

FREQUENCY DOMAIN FLUTTER SOLUTION TECHNIQUE USING COMPLEX MU-ANALYSIS 7 TH INTERNATIONAL CONGRESS O THE AERONAUTICAL SCIENCES REQUENCY DOMAIN LUTTER SOLUTION TECHNIQUE USING COMPLEX MU-ANALYSIS Yingsong G, Zhichn Yang Northwestern Polytechnical University, Xi an, P. R. China,

More information

Essentials of optimal control theory in ECON 4140

Essentials of optimal control theory in ECON 4140 Essentials of optimal control theory in ECON 4140 Things yo need to know (and a detail yo need not care abot). A few words abot dynamic optimization in general. Dynamic optimization can be thoght of as

More information

Formulas for stopped diffusion processes with stopping times based on drawdowns and drawups

Formulas for stopped diffusion processes with stopping times based on drawdowns and drawups Stochastic Processes and their Applications 119 (009) 563 578 www.elsevier.com/locate/spa Formlas for stopped diffsion processes with stopping times based on drawdowns and drawps Libor Pospisil, Jan Vecer,

More information

Input selection in observer design for non-uniformly observable systems

Input selection in observer design for non-uniformly observable systems 9th IAC Symposim on Nonlinear Control Systems Tolose, rance, September -6, ra. selection in observer design or non-niormly observable systems Gildas Besançon, Ignacio Rbio Scola Didier Georges Control

More information

Structural Reliability Assessment with Fuzzy Probabilities

Structural Reliability Assessment with Fuzzy Probabilities 7th International Symposim on Imprecise Probability: Theories and Applications, Innsbrck, Astria, 2011 Strctral Reliability Assessment with Fzzy Probabilities Michael Beer Centre or Engineering Sstainability,

More information

Stability of Model Predictive Control using Markov Chain Monte Carlo Optimisation

Stability of Model Predictive Control using Markov Chain Monte Carlo Optimisation Stability of Model Predictive Control sing Markov Chain Monte Carlo Optimisation Elilini Siva, Pal Golart, Jan Maciejowski and Nikolas Kantas Abstract We apply stochastic Lyapnov theory to perform stability

More information

Pulses on a Struck String

Pulses on a Struck String 8.03 at ESG Spplemental Notes Plses on a Strck String These notes investigate specific eamples of transverse motion on a stretched string in cases where the string is at some time ndisplaced, bt with a

More information

Graph-Modeled Data Clustering: Fixed-Parameter Algorithms for Clique Generation

Graph-Modeled Data Clustering: Fixed-Parameter Algorithms for Clique Generation Graph-Modeled Data Clstering: Fied-Parameter Algorithms for Cliqe Generation Jens Gramm Jiong Go Falk Hüffner Rolf Niedermeier Wilhelm-Schickard-Institt für Informatik, Universität Tübingen, Sand 13, D-72076

More information

Regular Policies in Abstract Dynamic Programming

Regular Policies in Abstract Dynamic Programming August 2016 (Revised January 2017) Report LIDS-P-3173 Regular Policies in Abstract Dynamic Programming Dimitri P. Bertsekas Abstract We consider challenging dynamic programming models where the associated

More information

Typed Kleene Algebra with Products and Iteration Theories

Typed Kleene Algebra with Products and Iteration Theories Typed Kleene Algebra with Prodcts and Iteration Theories Dexter Kozen and Konstantinos Mamoras Compter Science Department Cornell University Ithaca, NY 14853-7501, USA {kozen,mamoras}@cs.cornell.ed Abstract

More information

CHANNEL SELECTION WITH RAYLEIGH FADING: A MULTI-ARMED BANDIT FRAMEWORK. Wassim Jouini and Christophe Moy

CHANNEL SELECTION WITH RAYLEIGH FADING: A MULTI-ARMED BANDIT FRAMEWORK. Wassim Jouini and Christophe Moy CHANNEL SELECTION WITH RAYLEIGH FADING: A MULTI-ARMED BANDIT FRAMEWORK Wassim Joini and Christophe Moy SUPELEC, IETR, SCEE, Avene de la Bolaie, CS 47601, 5576 Cesson Sévigné, France. INSERM U96 - IFR140-

More information

On relative errors of floating-point operations: optimal bounds and applications

On relative errors of floating-point operations: optimal bounds and applications On relative errors of floating-point operations: optimal bonds and applications Clade-Pierre Jeannerod, Siegfried M. Rmp To cite this version: Clade-Pierre Jeannerod, Siegfried M. Rmp. On relative errors

More information

Krauskopf, B., Lee, CM., & Osinga, HM. (2008). Codimension-one tangency bifurcations of global Poincaré maps of four-dimensional vector fields.

Krauskopf, B., Lee, CM., & Osinga, HM. (2008). Codimension-one tangency bifurcations of global Poincaré maps of four-dimensional vector fields. Kraskopf, B, Lee,, & Osinga, H (28) odimension-one tangency bifrcations of global Poincaré maps of for-dimensional vector fields Early version, also known as pre-print Link to pblication record in Explore

More information

FEA Solution Procedure

FEA Solution Procedure EA Soltion Procedre (demonstrated with a -D bar element problem) MAE 5 - inite Element Analysis Several slides from this set are adapted from B.S. Altan, Michigan Technological University EA Procedre for

More information

Discussion of The Forward Search: Theory and Data Analysis by Anthony C. Atkinson, Marco Riani, and Andrea Ceroli

Discussion of The Forward Search: Theory and Data Analysis by Anthony C. Atkinson, Marco Riani, and Andrea Ceroli 1 Introdction Discssion of The Forward Search: Theory and Data Analysis by Anthony C. Atkinson, Marco Riani, and Andrea Ceroli Søren Johansen Department of Economics, University of Copenhagen and CREATES,

More information

An Investigation into Estimating Type B Degrees of Freedom

An Investigation into Estimating Type B Degrees of Freedom An Investigation into Estimating Type B Degrees of H. Castrp President, Integrated Sciences Grop Jne, 00 Backgrond The degrees of freedom associated with an ncertainty estimate qantifies the amont of information

More information

Linear Strain Triangle and other types of 2D elements. By S. Ziaei Rad

Linear Strain Triangle and other types of 2D elements. By S. Ziaei Rad Linear Strain Triangle and other tpes o D elements B S. Ziaei Rad Linear Strain Triangle (LST or T6 This element is also called qadratic trianglar element. Qadratic Trianglar Element Linear Strain Triangle

More information

The Replenishment Policy for an Inventory System with a Fixed Ordering Cost and a Proportional Penalty Cost under Poisson Arrival Demands

The Replenishment Policy for an Inventory System with a Fixed Ordering Cost and a Proportional Penalty Cost under Poisson Arrival Demands Scientiae Mathematicae Japonicae Online, e-211, 161 167 161 The Replenishment Policy for an Inventory System with a Fixed Ordering Cost and a Proportional Penalty Cost nder Poisson Arrival Demands Hitoshi

More information

Hybrid modelling and model reduction for control & optimisation

Hybrid modelling and model reduction for control & optimisation Hybrid modelling and model redction for control & optimisation based on research done by RWTH-Aachen and TU Delft presented by Johan Grievink Models for control and optimiation market and environmental

More information

Active Flux Schemes for Advection Diffusion

Active Flux Schemes for Advection Diffusion AIAA Aviation - Jne, Dallas, TX nd AIAA Comptational Flid Dynamics Conference AIAA - Active Fl Schemes for Advection Diffsion Hiroaki Nishikawa National Institte of Aerospace, Hampton, VA 3, USA Downloaded

More information

Model Discrimination of Polynomial Systems via Stochastic Inputs

Model Discrimination of Polynomial Systems via Stochastic Inputs Model Discrimination of Polynomial Systems via Stochastic Inpts D. Georgiev and E. Klavins Abstract Systems biologists are often faced with competing models for a given experimental system. Unfortnately,

More information

Introdction In the three papers [NS97], [SG96], [SGN97], the combined setp or both eedback and alt detection lter design problem has been considered.

Introdction In the three papers [NS97], [SG96], [SGN97], the combined setp or both eedback and alt detection lter design problem has been considered. Robst Falt Detection in Open Loop s. losed Loop Henrik Niemann Jakob Stostrp z Version: Robst_FDI4.tex { Printed 5h 47m, Febrar 9, 998 Abstract The robstness aspects o alt detection and isolation (FDI)

More information

(One Dimension) Problem: for a function f(x), find x 0 such that f(x 0 ) = 0. f(x)

(One Dimension) Problem: for a function f(x), find x 0 such that f(x 0 ) = 0. f(x) Solving Nonlinear Equations & Optimization One Dimension Problem: or a unction, ind 0 such that 0 = 0. 0 One Root: The Bisection Method This one s guaranteed to converge at least to a singularity, i not

More information

Unknown Input High Gain Observer for Parametric Fault Detection and Isolation of Dynamical Systems

Unknown Input High Gain Observer for Parametric Fault Detection and Isolation of Dynamical Systems Proceedings o the International MltiConerence o Engineers Compter Scientists 009 Vol II IMECS 009, March 8-0, 009, Hong Kong Unknown Inpt High Gain Observer or Parametric Falt Detection Isolation o Dynamical

More information

A Model-Free Adaptive Control of Pulsed GTAW

A Model-Free Adaptive Control of Pulsed GTAW A Model-Free Adaptive Control of Plsed GTAW F.L. Lv 1, S.B. Chen 1, and S.W. Dai 1 Institte of Welding Technology, Shanghai Jiao Tong University, Shanghai 00030, P.R. China Department of Atomatic Control,

More information

Discrete Applied Mathematics. The induced path function, monotonicity and betweenness

Discrete Applied Mathematics. The induced path function, monotonicity and betweenness Discrete Applied Mathematics 158 (2010) 426 433 Contents lists available at ScienceDirect Discrete Applied Mathematics jornal homepage: www.elsevier.com/locate/dam The indced path fnction, monotonicity

More information

Lecture Notes: Finite Element Analysis, J.E. Akin, Rice University

Lecture Notes: Finite Element Analysis, J.E. Akin, Rice University 9. TRUSS ANALYSIS... 1 9.1 PLANAR TRUSS... 1 9. SPACE TRUSS... 11 9.3 SUMMARY... 1 9.4 EXERCISES... 15 9. Trss analysis 9.1 Planar trss: The differential eqation for the eqilibrim of an elastic bar (above)

More information

ON THE SHAPES OF BILATERAL GAMMA DENSITIES

ON THE SHAPES OF BILATERAL GAMMA DENSITIES ON THE SHAPES OF BILATERAL GAMMA DENSITIES UWE KÜCHLER, STEFAN TAPPE Abstract. We investigate the for parameter family of bilateral Gamma distribtions. The goal of this paper is to provide a thorogh treatment

More information

A Proposed Method for Reliability Analysis in Higher Dimension

A Proposed Method for Reliability Analysis in Higher Dimension A Proposed Method or Reliabilit Analsis in Higher Dimension S Kadr Abstract In this paper a new method is proposed to ealate the reliabilit o stochastic mechanical sstems This techniqe is based on the

More information

Sources of Non Stationarity in the Semivariogram

Sources of Non Stationarity in the Semivariogram Sorces of Non Stationarity in the Semivariogram Migel A. Cba and Oy Leangthong Traditional ncertainty characterization techniqes sch as Simple Kriging or Seqential Gassian Simlation rely on stationary

More information

3 2D Elastostatic Problems in Cartesian Coordinates

3 2D Elastostatic Problems in Cartesian Coordinates D lastostatic Problems in Cartesian Coordinates Two dimensional elastostatic problems are discssed in this Chapter, that is, static problems of either plane stress or plane strain. Cartesian coordinates

More information

9. Tensor product and Hom

9. Tensor product and Hom 9. Tensor prodct and Hom Starting from two R-modles we can define two other R-modles, namely M R N and Hom R (M, N), that are very mch related. The defining properties of these modles are simple, bt those

More information

A Note on Johnson, Minkoff and Phillips Algorithm for the Prize-Collecting Steiner Tree Problem

A Note on Johnson, Minkoff and Phillips Algorithm for the Prize-Collecting Steiner Tree Problem A Note on Johnson, Minkoff and Phillips Algorithm for the Prize-Collecting Steiner Tree Problem Palo Feofiloff Cristina G. Fernandes Carlos E. Ferreira José Coelho de Pina September 04 Abstract The primal-dal

More information

denition of distributed computations with synchronous than originally stated. Key words: Distributed computation { Message passing Communication

denition of distributed computations with synchronous than originally stated. Key words: Distributed computation { Message passing Communication Synchronos, asynchronos, and casally ordered commnication Bernadette Charron{Bost Laboratoire d'informatiqe, Ecole Polytechniqe, 91128 Palaisea Cedex, France Friedemann Mattern Department of Compter Science,

More information

A Computational Study with Finite Element Method and Finite Difference Method for 2D Elliptic Partial Differential Equations

A Computational Study with Finite Element Method and Finite Difference Method for 2D Elliptic Partial Differential Equations Applied Mathematics, 05, 6, 04-4 Pblished Online November 05 in SciRes. http://www.scirp.org/jornal/am http://d.doi.org/0.46/am.05.685 A Comptational Stdy with Finite Element Method and Finite Difference

More information

FEA Solution Procedure

FEA Solution Procedure EA Soltion rocedre (demonstrated with a -D bar element problem) MAE - inite Element Analysis Many slides from this set are originally from B.S. Altan, Michigan Technological U. EA rocedre for Static Analysis.

More information

Mixed Type Second-order Duality for a Nondifferentiable Continuous Programming Problem

Mixed Type Second-order Duality for a Nondifferentiable Continuous Programming Problem heoretical Mathematics & Applications, vol.3, no.1, 13, 13-144 SSN: 179-9687 (print), 179-979 (online) Scienpress Ltd, 13 Mied ype Second-order Dality for a Nondifferentiable Continos Programming Problem.

More information

L 1 -smoothing for the Ornstein-Uhlenbeck semigroup

L 1 -smoothing for the Ornstein-Uhlenbeck semigroup L -smoothing for the Ornstein-Uhlenbeck semigrop K. Ball, F. Barthe, W. Bednorz, K. Oleszkiewicz and P. Wolff September, 00 Abstract Given a probability density, we estimate the rate of decay of the measre

More information

4 Exact laminar boundary layer solutions

4 Exact laminar boundary layer solutions 4 Eact laminar bondary layer soltions 4.1 Bondary layer on a flat plate (Blasis 1908 In Sec. 3, we derived the bondary layer eqations for 2D incompressible flow of constant viscosity past a weakly crved

More information

A generalized Alon-Boppana bound and weak Ramanujan graphs

A generalized Alon-Boppana bound and weak Ramanujan graphs A generalized Alon-Boppana bond and weak Ramanjan graphs Fan Chng Abstract A basic eigenvale bond de to Alon and Boppana holds only for reglar graphs. In this paper we give a generalized Alon-Boppana bond

More information

Formules relatives aux probabilités qui dépendent de très grands nombers

Formules relatives aux probabilités qui dépendent de très grands nombers Formles relatives ax probabilités qi dépendent de très grands nombers M. Poisson Comptes rends II (836) pp. 603-63 In the most important applications of the theory of probabilities, the chances of events

More information

CRITERIA FOR TOEPLITZ OPERATORS ON THE SPHERE. Jingbo Xia

CRITERIA FOR TOEPLITZ OPERATORS ON THE SPHERE. Jingbo Xia CRITERIA FOR TOEPLITZ OPERATORS ON THE SPHERE Jingbo Xia Abstract. Let H 2 (S) be the Hardy space on the nit sphere S in C n. We show that a set of inner fnctions Λ is sfficient for the prpose of determining

More information

The Scalar Conservation Law

The Scalar Conservation Law The Scalar Conservation Law t + f() = 0 = conserved qantity, f() =fl d dt Z b a (t, ) d = Z b a t (t, ) d = Z b a f (t, ) d = f (t, a) f (t, b) = [inflow at a] [otflow at b] f((a)) f((b)) a b Alberto Bressan

More information

Simplified Identification Scheme for Structures on a Flexible Base

Simplified Identification Scheme for Structures on a Flexible Base Simplified Identification Scheme for Strctres on a Flexible Base L.M. Star California State University, Long Beach G. Mylonais University of Patras, Greece J.P. Stewart University of California, Los Angeles

More information

Symmetric Range Assignment with Disjoint MST Constraints

Symmetric Range Assignment with Disjoint MST Constraints Symmetric Range Assignment with Disjoint MST Constraints Eric Schmtz Drexel University Philadelphia, Pa. 19104,USA Eric.Jonathan.Schmtz@drexel.ed ABSTRACT If V is a set of n points in the nit sqare [0,

More information

Joint Transfer of Energy and Information in a Two-hop Relay Channel

Joint Transfer of Energy and Information in a Two-hop Relay Channel Joint Transfer of Energy and Information in a Two-hop Relay Channel Ali H. Abdollahi Bafghi, Mahtab Mirmohseni, and Mohammad Reza Aref Information Systems and Secrity Lab (ISSL Department of Electrical

More information

A New Approach to Direct Sequential Simulation that Accounts for the Proportional Effect: Direct Lognormal Simulation

A New Approach to Direct Sequential Simulation that Accounts for the Proportional Effect: Direct Lognormal Simulation A ew Approach to Direct eqential imlation that Acconts for the Proportional ffect: Direct ognormal imlation John Manchk, Oy eangthong and Clayton Detsch Department of Civil & nvironmental ngineering University

More information

3.1 The Basic Two-Level Model - The Formulas

3.1 The Basic Two-Level Model - The Formulas CHAPTER 3 3 THE BASIC MULTILEVEL MODEL AND EXTENSIONS In the previos Chapter we introdced a nmber of models and we cleared ot the advantages of Mltilevel Models in the analysis of hierarchically nested

More information

Imprecise Continuous-Time Markov Chains

Imprecise Continuous-Time Markov Chains Imprecise Continos-Time Markov Chains Thomas Krak *, Jasper De Bock, and Arno Siebes t.e.krak@.nl, a.p.j.m.siebes@.nl Utrecht University, Department of Information and Compting Sciences, Princetonplein

More information

Convergence analysis of ant colony learning

Convergence analysis of ant colony learning Delft University of Technology Delft Center for Systems and Control Technical report 11-012 Convergence analysis of ant colony learning J van Ast R Babška and B De Schtter If yo want to cite this report

More information