Optimal Decentralized State-Feedback Control with Sparsity and Delays

Size: px
Start display at page:

Download "Optimal Decentralized State-Feedback Control with Sparsity and Delays"

Transcription

1 Opimal Decenralized Sae-Feedback Conrol wih Sparsiy and Delays Andrew Lamperski Lauren Lessard Submied o Auomaica Absrac This work presens he soluion o a class of decenralized linear quadraic sae-feedback conrol problems, in which he plan and conroller mus saisfy he same combinaion of delay and sparsiy consrains. Using a novel decomposiion of he noise hisory, he conrol problem is spli ino independen subproblems ha are solved using dynamic programming. The approach presened herein boh unifies and generalizes many exising resuls. I Inroducion While opimal decenralized conroller synhesis is difficul in general [25, 28], much progress has been made oward idenifying racable subclasses of problems. Two closely relaed condiions, parial nesedness and quadraic invariance, guaranee, respecively, ha he opimal soluion for LQG conrol problems is linear [3], and ha opimal synhesis can be cas as convex opimizaion [6, 9]. These resuls alone do no necessarily guaranee ha he opimal conroller can be efficienly compued, since he associaed convex programs are large. Indeed, he problem of [3] reduces o a sysem of linear equaions ha grows wih he ime horizon. In [6], a sequence of convex programs of increasing size converges o he opimal soluion, and [9] uses vecorizaion o reduce decenralized problems o much larger cenralized ones. More efficien compuaional ools have been developed for linear quadraic problems, in paricular linear marix inequaliies have been used o solve he sae-feedback [7] and oupu-feedback [, 8] cases. Oupu-feedback problems wih delays can also be solved using a combinaion of specral facorizaion and quadraic programming [7]. A general drawback of purely compuaional approaches is ha hey give lile insigh ino he srucure of he opimal conroller. When finding he bes linear conroller, for example, we mus ofen resric our search o conrollers of a fixed dimension, and here is no inuiion for he physical meaning of he conroller saes. Explici soluions, which provide efficien compuaion in addiion o a physical inerpreaion for he saes of he conroller, have been found for he specific consrain classes of sparsiy and delays. In he delay case, i is assumed all conrollers evenually measure he global sae, bu no necessarily simulaneously. Problems wih a one-imesep delay beween conrollers were found by exending classical dynamic programming argumens [4, 20, 29]. In he linear quadraic seing, he sae-feedback problem wih delays characerized by a graph is solved in [6]. The soluion is similar in spiri and complexiy o compuing he opimal cenralized LQR conroller, bu he opimal policy urns ou o be dynamic raher han saic. Conroller saes have inerpreaions as delayed esimaes of he global sae. In he sparsiy case, all measuremens are ransmied insanly, bu some conrollers never gain access o cerain measuremens. This amouns o a delayed sysem where each delay is eiher zero or infinie. The firs explici soluion solved he wo-player case using a specral facorizaion approach [24]. The resuls were laer exended o a general class of quadraically invarian, also called pose-causal, sparsiy paerns [2, 22]. A dynamic programming argumen for he wo-player problem was also given in [23]. Again, he conroller saes can be inerpreed as esimaes of he global saes condiioned on he paricular subses of he available informaion. This paper unifies he reamen of sparsiy and delay consrains by considering an informaion flow characerized by a direced graph. Each edge may be labeled wih eiher a 0 for insananeous informaion ransfer, or wih a for one-imesep delayed ransfer. We use a generalizaion of he dynamic programming approach of [6] ha does no require he graph o be srongly conneced. Therefore, his work gives a general mehod for solving boh he delay and sparsiy problems described in he previous wo paragraphs. In addiion, our framework can rea problems ha conain a mixure of sparsiy and delay consrains. A preliminary version of his work appeared in he conference paper [8]. The presen work differs subsanially from [8]. Specifically, he presen work includes complee proofs o all resuls, new illusraive examples, and exensive discussion in Secions IV and V. We also have a new resul, Theorem 3, which gives a disribued messagepassing implemenaion of he opimal conroller. In he remainder of his secion, we explain he noaions and oher convenions used in he paper. In Secions II, we sae he main problem, and we presen our soluion in Secion III. We discuss how our work unifies exising resuls in Secion IV and we discuss limiaions

2 and possible exensions in Secion V. We provide wo numerical examples in Secion VI. The proof o our main resul appears in Secion VII. Finally, we conclude in Secion VIII. I-A Noaion Lower-case leers are random vecors unless oherwise indicaed. Subscrips are ime indices and superscrips are subsysem labels. For example, he sae of subsysem 2 a ime is x 2. The global sae of all subsysems is indicaed by omiing he subsysem, and ime sequences are denoed using he colon operaor. For example, x = x. x N and x 0: = x 0. x. For marices, subscrips denoe subblocks, while superscrips are used o selec ses of subblocks. For example, a block 2 3 marix M is wrien as [ ] M M M = 2 M 3. M 2 M 22 M 23 To selec row r = {2} and columns s = {, 3}, we wrie M rs = [ ] M 2 M 23. The symbol I denoes a blockideniy marix whose dimensions are o be inferred by conex. In he example above, we could have wrien M rs = I r,{,2} MI {,2,3},s. If Y = {y,..., y M } is a se of random vecors (possibly of differen sizes), we say ha z lin Y if here are appropriaely sized real marices C,..., C M such ha z = M i= Ci y i. In general, all marices in his paper are ime-varying bu we will ofen omi ime indices if hey are clear from conex. We use a new noaion o indicae a family of equaions. When we wrie x + = Ax + Bu + w, we mean ha x + = A x + B u + w for = 0,..., T. We will use he same ime horizon T hroughou his paper, so here is no ambiguiy. Noe ha he subscrip + means ha he ime index of he corresponding symbol is incremened by. We use a similar noaion for oher binary relaions such as inequaliies, by wriing a over he symbol. Finally, a similar noaion is used o modify summaions. For example, we wrie II x T Qx insead of wriing Problem saemen T x T Q x. =0 We begin wih some basic definiions. A nework graph G(V, E) is a direced graph where each edge is labeled wih a 0 if he associaed link is delay-free, or a if i has a one-imesep delay. The verices are V = {,..., N}. If here is an edge from j o i, we wrie (j, i) E, or simply j i. When delays are perinen, hey are denoed as j 0 i or j i. Associaed wih he nework graph G(V, E) is he delay marix D. Each enry D ij is he sum of he delays along he direced pah from j o i wih he shores aggregae delay. We assume D ii = 0 for all i, and if no direced pah exiss, we se D ij =. Delays are assumed o be fixed for all ime. Direced cycles are permied in he nework graph, bu we assume here are no direced cycles wih a oal delay of zero. In our framework, all nodes belonging o such a delay-free cycle can be collapsed ino a single node and reaed as such wihou any loss of generaliy. See Fig. for an example of a nework graph. We now sae he general class of problems ha can be solved using he mehods developed in his paper. Problem. Le G(V, E) be a nework graph wih a imeinvarian delay marix D. Suppose he following imevarying sae-space equaions are given: x i + = ( Aij x j + B ij u j) + w i for all i V. () j V D ij Sacking he various vecors and marices, we obain he more compac represenaion x + = Ax + Bu + w. (2) All random disurbances are assumed o be joinly Gaussian and independen from one anoher. Specifically, he random vecors {x i 0, w0, i..., wt i } i V are muually independen. Their means and covariances are given by x i 0 N (µ i 0, Σ i 0) and w i N (0, W i ) for all 0. (3) The inpus u 0:T are conrolled using sae feedback subjec o an informaion consrain. The informaion se for conroller i a ime is as follows. I i = { x j k : j V, 0 k D ij}. (4) In oher words, I i is he se of saes belonging o nodes ha have had sufficien ime o reach node i by ime. Each decision measures is corresponding informaion se u i = γ i (I i ). (5) The goal is o choose he se of policies γ = {γ0:t i } i V ha minimize he expeced quadraic cos ( [ ] T [ ] [ ] ) min E γ x Q S x + x TT Q γ u R u f x T, (6) S T where he expecaion is aken wih respec o he join probabiliy measure on (x 0:T, u 0:T ) induced by he choice of γ. We make he sandard assumpions ha [ ] Q S 0, R > 0, Qf 0. (7) S T R In Problem, we assume ha all decision-makers know he sysem parameers A, B, Q, R, S, Q f, as well as he 2

3 opology of he underlying nework graph G(V, E). The problem is cooperaive in naure; we are o joinly design he se of policies γ o opimize he cos (6). As menioned in Secion I-A, he parameers of he sysem in Problem can be ime-varying. In fac, heir sizes can also vary wih ime. For example, he inpu u may change dimensions a every imesep, and he marices B, R, S would change size accordingly. We now illusrae how he general problem saemen given by () (7) specializes in a simple hree-node case. Example. Consider he nework graph of Fig Figure : Nework graph for Example. Each node represens a subsysem, and he edge labels indicae he propagaion delay from one subsysem o anoher. The delay marix for Example is given by 0 D = 0. (8) 0 0 While he delay marix for Example only conains 0 s, s, and s, a general delay marix may conain arbirarily large nonnegaive inegers, provided he associaed nework graph is sufficienly large. The sae-space equaions for Example are of he form x + x 2 + = x 3 + A A 2 0 A 2 A 22 0 A 3 A 32 A 33 + x x 2 x 3 B B 2 0 B 2 B 22 0 B 3 B 32 B 33 u u 2 + u 3 w w 2. (9) w 3 Noe ha whenever D ij >, we have A ij = 0 and B ij = 0. The policies of he hree decision-makers in Example are consrained as follows. u = γ ( x 0:, x 2 ) 0: u 2 = γ 2 ( x 0:, x 2 ) 0: for 0 T. (0) u 3 = γ 3 ( x 0:, x 2 0:, x 3 ) 0: There is a combinaion of sparsiy and delay consrains; some saes may never be available o a paricular conroller, while oher saes migh be available bu delayed. Our goal is o choose he policies γ such ha we minimize he cos given by (6). Noe ha according o our definiions, boh he plan () and he conroller (5) share he same sparsiy consrains. In Example for insance, x 3 does no influence x or x 2 via he dynamics, nor can i affec u or u 2 via he conroller. As explained in Subsecion VII-A, his condiion is sufficien o guaranee ha he opimal conrol policies γ are linear, a powerful fac. Based on our formulaion, he conroller may be any funcion of he pas informaion hisory, which grows wih he size of he ime horizon T. We will show by consrucion ha here exiss an opimal policy ha has a finie memory ha is independen of T. III Main resuls This secion presens he main resuls: explici saespace soluions for Problem. Two equivalen forms of he conroller are presened. The firs form has saes which are funcions of he primiive random variables x 0 and w 0:. The second form uses sae feedback only and implemens he conroller in a disribued fashion by message passing. To describe he conroller coordinaes, he nework graph is ransformed ino an alernae represenaion, he informaion graph. This graph racks he propagaion of he noise signals w i as hey propagae hrough he nework graph. Formally, we define he informaion graph as follows. Le s j k be he se of nodes reachable from node j wihin k seps: s j k = {i V : D ij k}. () The informaion graph Ĝ(U, F), is given by U = {s j k : k 0, j V} F = {(s j k, sj k+ ) : k 0, j V}. Consider Example, and rack each of he noise signals w, w 2, w 3 as hey are propagaed hrough he nework graph of Fig.. w {} {, 2, 3} {, 2, 3}... w 2 {2, 3} {, 2, 3} {, 2, 3}... w 3 {3} {3}... (2) Each chain in (2) is of he form w i s i 0 s i s i 2... and racks he nodes reached by w i afer some number of imeseps. Assembling he pahs in (2) and aggregaing duplicae nodes, we obain he informaion graph for Example, shown in Fig. 2. The addiional labels w i are no couned amongs he nodes of Ĝ as a maer of convenion, bu are shown as a reminder of which noise signal is being racked. We will ofen wrie expressions such as {s U : w i s} o denoe he se of roo nodes of Ĝ. The following proposiion gives some useful properies of he informaion graph. 3

4 {, 2, 3} {} {2, 3} {3} w w 2 w 3 Figure 2: Informaion graph for Example. The nodes are subses of nodes in he nework graph (see Fig. ) affeced by differen noises. For example, w 2 is injeced a node 2, affecs nodes {2, 3} immediaely, and affecs {, 2, 3} afer one imesep. Proposiion. Given an informaion graph Ĝ(U, F), he following properies hold. (i) Every node in Ĝ has exacly one descendan. In oher words, for every r U, here is a unique s U such ha r s. (ii) Every pah evenually his a node wih a self-loop. (iii) If he nework graph saisfies V = n, he number of nodes in Ĝ is bounded by n U n2 n +. The firs wo properies are immediae by consrucion. The lower bound on U is achieved by direced acyclic nework graphs wih zero delay on all edges. The upper bound on U is achieved by nework graph consising of one large cycle: 2 n, and each link has a one-imesep delay. Noe ha he informaion graph may have several conneced componens. This happens whenever he nework graph is no srongly conneced. For example, Fig. 2 has wo conneced componens because here is no pah 3 2 in Fig.. In (2), we characerized how noise injeced a he curren imesep propagaes o he oher nodes a fuure imeseps. Alernaively, one may consider he hisory of all pas injeced noises, and ask which of hese noise erms have affeced a paricular node a he presen ime. Define he noise hisory a ime o be he se of all noise erms injeced ino he sysem prior o ime, H = {x 0, w 0: }. (3) The noise pariion diagram is formed by arranging he elemens of H in a able and grouping hem based on which nodes a he curren ime hey can affec. We call hese groups label ses, and hey are discussed in greaer deail in Secion VII-B. The noise pariion diagram for Example is shown in Fig. 3. Noe ha as we raverse each row of he noise pariion diagram from righ o lef in Fig. 3, we recover (2). We are now ready o presen he main resul of his paper, which expresses he opimal conroller as a funcion of new coordinaes induced by he informaion graph. {, 2, 3} {} Node w 3 w 2 w {2, 3} Node 2 w 2 3 w 2 2 w 2 {3} Node 3 w 3 3 w 3 2 w Time Figure 3: Noise pariion diagram for Example (see Fig. and 2). All pas noise disurbances are placed ino groups depending on which decisions a ime hey can affec. Theorem 2. Consider Problem, and le Ĝ(U, F) be he associaed informaion graph. Define he marices {X0:T r } r U and {K0:T r } r U recursively as follows, X r T = Q rr f K r = ( R rr + B srt X s +B sr) ( S rr + A srt X s +B sr) T X r = (Q rr + A srt X+A s sr ) + ( S rr + A srt X+B s sr) K r. (4) where for each r U, we have defined s U o be he unique node such ha r s. The opimal conrol decisions saisfy he following sae-space equaions ζ0 s = I s,{i} x i 0 ζ s + u i = ( A sr + B sr K r) ζ r + I s,{i} w i r s = I {i},r K r ζ r. r i The corresponding opimal expeced cos is ( (µ i 0) T (X0) s {i},{i} (µ i 0) V 0 = i V (5) ( )) + race (X0) s {i},{i} Σ i 0 + ( race (X+) s {i},{i} W i). (6) i V Proof. See Secion VII. In he saemen of Theorem 2, marices wih ses used as superscrips, as in A sr or I s,{i}, specify submarices. This noaion is explained in Secion I-A. Noe ha (5) expresses he conroller as a map w u, generalizing noise-based conrollers from [5, 2, 22, 23]. (See [, 7] for oher uses of noise-based represenaions in decenralized conrol.) In he proof of Theorem 2, we show ha his conroller is feasible because he noise erms used in compuing each u i can be deduced from he associaed informaion se I i. Our second main resul, Theorem 3, gives an explici implemenaion of Theorem 2 by describing how each 4

5 node uses is locally available sae informaion o compue he opimal conrol decisions. To illusrae how his will work, recall he nework graph of Example and is corresponding informaion graph, Fig. 2. We implemen he opimal conroller of Theorem 2 by using a combinaion of local measuremen, local compuaion, and message passing, as follows. Assume ha a ime, each node has a copy of he conroller saes required for compuing is local u i using (5). Namely, : ζ {}, ζ {,2,3} 2 : ζ {2,3}, ζ {,2,3} 3 : ζ {2,3}, ζ {,2,3}, ζ {3}. (7) A ime, messages are passed along he delay- edges of he nework graph as follows: node sends ζ {} o node 2 and node 2 sends ζ {2,3} o node. These messages arrive one imesep laer, due o he delays associaed wih hese edges of he nework graph. Now advance o imesep +, such ha all hese messages have been received. Nodes and 2 now have copies of ζ {}, ζ {2,3}, and ζ {,2,3}. Thus, nodes and 2 can compue ζ {,2,3} + via (5) using heir curren conroller sae informaion. We canno use (5) o compue ζ {} +, ζ{2,3} +, or ζ{3} + since i would require knowledge of w, w 2, or w 3 respecively. However, we will see laer ha he ζ-coordinaes combine o form he global sae: x = s U IV,s ζ s, for = 0,..., T. The need o know he w i can hus be avoided. For example, once node measures x +, we can compue ζ {} + = x + (ζ {,2,3} + ) {} and similarly for ζ {2,3} +. Finally, node 2 sends ζ{2,3} + and ζ {,2,3} + o node 3. These messages arrive insanly because he associaed edge has delay 0. Node 3 hen measures x 3 + and compues ζ {3} + = x3 + (ζ {,2,3} + ) {3} (ζ {2,3} + ){3}. A his poin, each node has copies of he saes indicaed by (7) a he incremened ime +. Therefore, u + can be compued and we have come full-circle. Fig. 4 shows all he messages passed along he nework graph. : ζ {} 0 : ζ {2,3}, ζ {,2,3} 2 3 coming seps. Noe ha if = 0, here are no delay- messages o receive. 2) For non-roo nodes s, so w i s, if ζ s is available from a delay-0 message, do nohing. Oherwise compue i (or iniialize i if = 0) using (5). 3) For he roo node s such ha w i s, measure x i and updae as follows. (ζ s ) s\{i} = r s (ζ s ) {i} = x i r i r s I s\{i},s ( A sr + B sr K r ) ζ r I {i},r ζ r 4) Compue inpu u i according o (5). 5) Send all oubound messages M ij, defined as follows. If i 0 j : If i j : M ij M ij = {ζ s : s U, i, j s}. = {ζ s : s U, i s, j / s}. Proof. Proved in Subsecion VII-D. IV Exising resuls (8) In his secion, we explain how Theorem 2 specializes o he exising resuls menioned in Secion I. For each such case, we show he associaed nework graph and informaion graph. Cenralized case. The firs relaed resul is classical sae-feedback LQR conrol. This corresponds o he rivial case where he nework graph and informaion graphs are single nodes, as in Fig. 5. The recursion (4) becomes a sandard Riccai difference equaion, and K {} is he classical LQR gain. Furhermore, he algorihm of Theorem 3 reduces o he classical saefeedback policy. No messages are passed, none of he updae seps are execued, and ζ {} = x. : ζ {2,3} Figure 4: Nework graph for Example wih messages. Our nex main resul, given below, generalizes he above procedure o any insance of Problem. Theorem 3. A disribued implemenaion of he opimal conroller in Theorem 2 is given as follows. Each node i V is responsible for soring and updaing a local copy of he saes {ζ s } s i. Each node i V performs he following seps for = 0,..., T. ) Receive and sore all inbound delay-0 and delay- messages. This provides he ζ r and ζ r values respecively ha will be used for local updaes in he (a) nework graph {} w (b) informaion graph Figure 5: Classical cenralized sae-feedback conrol. Sparsiy consrains. For sparsiy consrains wih no delays, he simples problem, he wo-player problem, is shown in Fig. 6. The informaion graph consiss of wo disconneced self-loops, implying ha he opimal 5

6 conroller depends on he soluion of wo Riccai difference equaions and heir associaed K-gains. For sparsiy over a general graph wih N nodes, he informaion graph has N disconneced self-loops and so N Riccai difference equaions mus be solved. 0 2 (a) nework graph Figure 6: {, 2} {2} w w 2 (b) informaion graph The wo-player problem, sparsiy only. Delayed sharing. The simples problem wih delays bu no sparsiy, he one-sep informaion sharing paern, is shown in Fig. 7. For general srongly conneced graphs wih one-imesep delays on every edge, he informaion graph is conneced, and all edges evenually lead o he self-loop V V. Thus, a single Riccai difference equaion mus be solved, which corresponds o he cenralized Riccai difference equaion, and is soluion is propagaed according o (4). Figure 7: V 2 (a) nework graph {, 2} {} {2} w w 2 (b) informaion graph One-sep delay informaion sharing paern. Furher exensions We now discuss seleced opics exploring he limiaions of our work and direcions for possible fuure research. Infinie horizon. While he problem formulaion in his paper considers opimizaion over a finie ime horizon, our soluion exends naurally o an infinie horizon. To his end, we assume all sysem parameers are ime-invarian, and we seek a sabilizing conroller ha minimizes he average finie-horizon cos as he lengh of he horizon ends o infiniy. In his limi, he Riccai difference equaions (4) for he nodes of he informaion graph wih self-loops r r become Algebraic Riccai Equaions (AREs). Under classical condiions [2], each X rr approaches a seady-sae value as T, he associaed seady-sae gain K r will be sabilizing, and he corresponding ζ r sae will be sable. One can easily show ha he remaining X s and K s marices for he oher nodes of he informaion graph also approach seady-sae limis and heir corresponding saes are also sable. For a numerical example and furher discussion, see Example 2 in Secion VI. Oupu feedback. In oupu feedback problems, he decision-makers have access o noisy measuremens of saes raher han he saes hemselves. Soluions are known for wo-player sparsiy [2, 3], n-player broadcas [9], and one-imesep delays [4, 20, 29], bu i is no clear how o exend he resuls of his paper o oupu feedback over a general graph wih mixed delays. Difficulies arise because deermining appropriae sufficien saisics for dynamic programming becomes suble when he delay beween any wo nodes is a leas 2 imeseps [5, 27, 30]. This phenomenon occurs for all sparsiy paerns and mos complex delay paerns. For sysems in his paper, sufficien saisics are compued by projecing he sae ono orhogonal subspaces of he informaion ses. I is unclear if such a decomposiion is possible for oupu-feedback problems. We menion one promising excepion [4], in which sufficien saisics are derived along wih a soluion o he finie-horizon version of he wo-player problem. The oupu-feedback soluions found so far are significanly more complicaed ha heir sae-feedback counerpars. For insance, in Theorem 2, he noise covariances W i only appear in he expression for he opimal cos. In oupu-feedback however, he policy iself depends explicily on W i. Correlaed noise. We assume in Problem ha he noises injeced ino he various nodes are independen. This independence is used o show ha he ζ s saes are muually independen, hus enabling a criical simplificaion of he value funcion used in he dynamic programming argumen. If he noises are correlaed, for example E(w w 2 T ) 0, our approach fails. Such problems are sill parially nesed, as defined in Secion VII-A, so we expec he opimal conroller o be unique and linear, bu we do no know how many saes i will have. A simple version of his problem wih wo players and decoupled dynamics was recenly solved [0], and he soluion is surprisingly complex. If each player has n saes, he opimal conroller may have a number of saes proporional o n 2. Realizabiliy. In general, causal linear ime-invarian sysem may be equivalenly represened using eiher sae-space or ransfer funcions. However, he wo represenaions are no equivalen when we impose an underlying graph srucure and associaed sparsiy for he sae-space marices [, 26]. Specifically, every srucured sae-space realizaion ha is sabilizable and deecable corresponds o a srucured ransfer funcion, bu he opposie is no rue. A counerexample is given in []. We avoid he issue of realizabiliy in his paper by only considering plans for which a srucured sae-space 6

7 realizaion exiss, as implied by (). Under his assumpion, we produce a srucured sae-space realizaion for he opimal conroller. As discussed earlier in his secion, all inernal modes ζ s of his conroller are sable by consrucion, so our realizaion is sabilizing. For graphs wih pure sparsiy (no delays), here is no loss in assuming a realizable plan, because non-realizable plans can never be sabilized using srucured conrollers []. However, no analogous resul is known for sysems wih delays, as discussed in [26]. VI Examples In his secion, we give wo examples o illusrae boh he generaliy and versailiy of our resuls. Example 2. Consider he four-node sysem depiced in Fig. 8. We show he nework graph and is associaed messages from Theorem 3 and he informaion graph and noise pariion diagram derived in Secion III. : ζ {} : ζ {} 2 4 : ζ {,2,3}, ζ {2} (a) nework graph wih messages 3 ζ {,2,3,4} : ζ{,2,3} ζ {3} 0 : ζ {2,3,4} ζ {3,4} As a supplemen o he discussion on infinie ime horizon soluions in Secion V, we now presen a numerical simulaion of his four-node example ha shows convergence of he opimal conrol gains as he ime horizon grows. We use he following ime-invarian parameers, A = B = D = Q = R = 8 8 S = and noise covariance W = I. Noe ha he A and B marices have a sparsiy paern ha conforms o he delay marix D. As explained in Secion II, we mus have A ij = 0 and B ij = 0 whenever D ij 2. To guaranee ha he Riccai equaions for he selfloops converge o sabilizing soluions, i is sufficien ha (Q rr S rr R rr S rrt, A rr B rr R rr S rrt, B rr ) be sabilizable and deecable for r = {3, 4}, {2, 3, 4}, and {, 2, 3, 4}. A deailed discussion of he convergence of Riccai equaions may be found in [2]. A direc calculaion shows ha Q rr S rr R rr S rrt and B rr are inverible for all self-loops, and so sabilizabiliy and deecabiliy are guaraneed. I follows ha he X r marices, which are soluions o (4), should converge o seady-sae values as we ge farher from he erminal imesep. This fac is suppored by he plo of Fig. 9. {, 2, 3, 4} {, 2, 3} {2, 3, 4} 400 {} {2} {3} {3, 4} 200 w w 2 w 3 w 4 (b) informaion graph {, 2, 3, 4} {, 2, 3} {} Node w 3 w 2 w {2, 3, 4} {2} Node 2 w 2 3 w 2 2 w 2 {3, 4} {3} Node 3 w 3 3 w 3 2 w 3 Node 4 w 3 4 w 2 4 w Time (c) noise pariion diagram Figure 8: Graphical represenaions for Example Time Sep () Figure 9: Plo of race(x r ) as a funcion of ime for Example 2. A curve is shown for each r U. Example 3. To invesigae he effec of increased delays on oal cos, we consider he 0-node sysem defined by he linear nework graph of Fig. 0. Depending on he value of k, we can vary he end-o-end delay D 90 = k k 9 Figure 0: Nework graph for Example 3. 7

8 Suppose he sysem marices are bi-diagonal, as follows A = B = The cos marices are Q = R = I, S = 0, and he noise covariance is W = 0 3 I. Because of heir bi-diagonal srucure, hese marices are admissible for any choice of k. In Fig., he oal cos is ploed as a funcion of k for he fixed ime horizon of T = 20. As expeced, he cos increases monoonically as we increase he delay Toal delay (k) Figure : Plo of he opimal cos as a funcion of he oal delay for Example 3, wih a ime horizon of T = 20. VII Proof of main resuls This secion conains proofs of Theorems 2 and 3. The proof of Theorem 2 requires hree seps. ) Lineariy. We show ha he informaion consrain for Problem is parially nesed [3]. Thus he opimal policy is unique, and consiss of linear funcions of he informaion hisory. 2) Feasibiliy. In (5), u i depends on w i. In spie of his, we prove ha u i need only measure is associaed informaion se I i. So he policy is feasible. 3) Opimaliy. Finally, we mus show ha he proposed policy is opimal. Our approach uses dynamic programming in he ζ s coordinaes, wih an opimizaion over policies raher han acions. VII-A Lineariy Lineariy of he opimal policy follows from parial nesedness, a concep firs inroduced by Ho and Chu in [3]. We sae he main definiion and resul below. Definiion 4. A dynamical sysem (2) wih informaion srucure (5) is parially nesed if for every admissible policy γ, whenever u j τ affecs I i, hen I j τ I i. Lemma 5 (see [3]). Given a parially nesed srucure, he opimal conrol law ha minimizes a quadraic cos of he form (6) exiss, is unique, and is linear. In oher words, an informaion srucure is parially nesed if whenever he decision of Player j affecs he informaion used in Player i s decision, hen Player i mus have access o all he informaion available o Player j. When his is he case, he opimal policy is linear. Using parial nesedness, he following lemma shows ha he opimal sae and inpu may be expressed as linear funcions of erms from he informaion ses I i and he noise hisory H, which was defined in (3). Lemma 6. The informaion srucure described in Problem is parially nesed. Furhermore, he opimally conrolled saes and inpus belong o heir associaed informaion subspaces, and he global sae and inpu belong o he noise hisory subspace. Namely, x i, u i lin I i for all i V, and x, u lin H. (9) Proof. See Appendix B. VII-B Feasibiliy I is clear from Lemma 6 ha lin I i lin H. The inclusion is ypically sric, since he informaion consrains may preven some noise erms in H from having an immediae influence on all nodes of he nework. In his subsecion, we begin by characerizing he subspaces of H ha are associaed wih each of he I i. This will evenually lead o an inuiive definiion for he new ζ s coordinaes. Lemma 7. Consider an informaion graph Ĝ(U, F) and define he corresponding label ses {L s 0:T } s U recursively by L s 0 = {x i 0} L s + s U = s U {w i } L r. r s The following properies of he label ses hold. (20) (i) For every 0, he label ses are a pariion of he noise hisory: L r L s = when r s, and H = L s. (2) (ii) For all i V, lin I i Proof. See Appendix C. s U = lin L s. (22) The label ses specify he groupings in he noise pariion diagram, and hus we can use he noise pariion s i 8

9 diagram o find he noise erms available o a given inpu. Consider Example and Fig. 3 once again. The label ses ha conain 2 are {2, 3} and {, 2, 3}. Therefore, u 2 lin I 2 = lin{l {2,3}, L {,2,3} }. (23) Referring o Fig. 3, we may expand he conens of hese label ses and obain u 2 lin{x 0, x 2 0, w 0: 2, w 2 0: }. We now define he ζ s coordinaes used in he saemen of Theorem 2. Combining (9) and (22), we have ha Thus, we may wrie x = s U x i, u i lin s i L s for all i V. (24) I V,s ζ s and u = s U I V,s ϕ s. (25) where ζ s lin L s and ϕ s lin L s. As explained in Secion I-A, I V,s denoes a submarix of a large ideniy marix wih block rows and columns of appropriae dimensions. For example, he inpu in Example is u = I 0 0 ϕ {} I 0 0 I ϕ {2,3} I ϕ {3} + ϕ {,2,3}, where each of he ϕ s are muually independen. The reason for his srucure is due o (23), or more generally (24). For example, if we examine he second row of he above equaions, u 2 only depends on ϕ {2,3} and ϕ {,2,3}, which depend on L {2,3} and L {,2,3}, and his agrees wih wha we found in (23). Since we assumed in Problem ha he random vecors in H are muually independen and he {L s } s U are a pariion of H, i follows ha he vecors in he se {[ ]} ζ s are muually independen. (26) ϕ s s U Jus as (2) characerizes he evoluion of x, we may also derive sae equaions for he ζ coordinaes. We sae he resul in he following lemma. Lemma 8. The coordinaes {ζ s } s U and {ϕ s } s U saisfy he recursive equaions ζ0 s = I s,{i} x i 0 (27) ζ s + = ( A sr ζ r + B sr ϕ r) + r s Proof. See Appendix D. I s,{i} w i (28) Noe ha (27) (28) agrees wih he formula (5) given in he saemen of Theorem 2, provided ha we se ϕ s = K s ζ s. This choice of policy is feasible because u i = s i I {i},s K s ζ s lin s i L s = lin I i, where he inclusion follows by definiion, and he las equaliy follows from Lemma 7. This complees he proof of feasibiliy. Remark 9. We may inerpre ζ s and ϕ s as condiional esimaes of x and u, respecively. Namely, VII-C ζ s = I s,v E(x L s ) and ϕ s = I s,v E(u L s ). Opimaliy We now prove he conroller is opimal, and derive an expression for he corresponding minimal expeced cos. Our proof uses a dynamic programming argumen, and we opimize over policies raher han acions. Le γ = {γ i } i V be he se of policies a ime. By Lemma 5, we may assume he γ i are linear. Define he cos-o-go V (γ 0: ) = min E γ γ :T ( T [ xk k= u k ] T [ Qk S T k S k R k ] [ xk u k ] + x T T Q f x T ), where he expecaion is aken wih respec o he join probabiliy measure on (x :T, u :T ) induced by he choice of γ = γ 0:T. These funcions are he minimum expeced fuure cos from ime, given ha he policies up o ime have been fixed. We allow V o be a funcion of pas policies, bu i urns ou ha V will no depend on hem explicily. By causaliy, we may ierae he minimizaions and wrie a recursive formulaion for he cos-o-go, V (γ 0: ) = ( [x ] T [ min E γ Q γ u S T S R ] ) + V u + (γ 0:, γ ). ] [ x (29) Our objecive is o find he opimal cos (6), which is simply V 0. Consider he erminal imesep, and use he decomposiion (25), V T (γ 0:T ) = E γ( x T T Q f x T ) = E γ s U (ζ s T ) T Q ss f (ζ s T ). In he las sep, we used he fac ha he ζ s coordinaes are independen (26). Noe ha V T depends on he policies up o ime T because he disribuion of ζt s depends on pas policies implicily hrough (28). We will prove by inducion ha he value funcion always has a similar quadraic form. Specifically, suppose ha for some 0, we have V + (γ 0: ) = E γ s U(ζ s +) T X s +(ζ s +) + c +, where {X+} s s U is a se of marices and c + is a scalar. Now compue V (γ 0: ) using he recursion (29). Subsiuing he definiions for ζ s and ϕ s from (25) and using 9

10 he independence resul (26), we obain ( [ ] V (γ 0: ) = min E γ ζ s T [ Q ss γ s U ϕ s S ss S sst R ss ] [ ] ζ s ϕ s + (ζ s +) T X s +(ζ s +) + c + ). Subsiuing he sae equaions (28), using he independence resul once more and rearranging erms, we obain V (γ 0: ) = min E [ ] γ ζ r T [ ] γ ϕ r Γ r ζ r ϕ r + c, (30) r U where Γ r 0:T and c 0:T are given by: Γ r = [ Q rr S rr S rrt R rr c = c + + i V ] + [ A sr B sr] T X s + [ A sr B sr] (3) race ( (X s +) {i},{i} W i). (32) The erminal condiions are Γ r T = Qrr f and c T = 0, and s is he unique node in Ĝ(U, F) such ha r s, see Proposiion. In (30), he erms in he sum are independen, so hey may be opimized separaely. A lower bound on he cos-o-go is found by relaxing he informaion consrains and performing an unconsrained opimizaion over he acions ϕ = {ϕ s } s U, V (γ 0: ) E γ r U min γ [ ζ r ϕ r E [ γ ζ r min ϕ ϕ r r U ] T [ Γ r ζ r ϕ r ] T [ Γ r ζ r ϕ r ] + c ] + c, where he firs inequaliy follows from Faou s lemma applied o (30), and he second inequaliy follows from he relaxaion menioned above. Each minimizaion is a simple quadraic opimizaion, and he opimal cos and acion are given by (4). Subsiuion yields V (γ 0: ) E γ s U(ζ s ) T X s (ζ s ) + c. This lower-bound is in fac igh, because he opimal unconsrained acions are ϕ s = K s ζ s lin L s, which is precisely he admissible se for ϕ s. This complees he inducion argumen as well as he proof ha he specified policy is opimal. The opimal cos is given by V 0 = E s U(ζ s 0) T X s 0(ζ s 0) + c 0 = E (x i 0) T (X0) s {i},{i} (x i 0) + c 0. (33) i V where c 0 may be evaluaed by saring wih c T = 0 and recursing backwards using (32). Finally, (33) evaluaes o he desired expression (6) because x i 0 N (µ i 0, Σ i 0). This complees he proof of Theorem 2. VII-D Proof of Theorem 3 Firs we show ha he message passing algorihm execues wihou deadlock. A = 0, deadlock canno occur, since here are no delay-0 direced cycles. Therefore, he delay-0 edges induce a parial order on he nodes by i j if here is a direced delay-0 pah from i o j. Any execuion ordering ha saisfies he parial order will be feasible. Now for > 0, assuming ha all delay- messages were sen a ime, he same reasoning shows ha no deadlock could occur a ime. Each node compues u i using (5), so we mus check ha all ζ s saes wih s i are available o node i a ime. I is sraighforward o show ha he claim holds for = 0. Suppose i holds a imesep. Compuing ζ s in sep 2 or he firs par of sep 3 requires knowledge of {ζ } r r s. If i r, hen ζ r is already available, by assumpion. If i / r, hen here exiss j r wih j i. Then (8) implies ha ζ r M ji. Thus, ζ r is made available via an inbound message. For he second par of sep 3, we mus know all ζ r for which r i and r s. This happens as long as we evaluae he updaes while respecing he parial order explained above. Then all missing ζ r are provided by delay-0 messages, as defined in (8). This complees he proof of Theorem 3. VIII Conclusion This paper uses dynamic programming o derive opimal policies for a general class of decenralized linear quadraic sae feedback problems. As noed in Secion IV, he soluion generalizes many exising works on decenralized sae-feedback conrol [6, 2, 22]. The key concep in his paper is he informaion graph. Is nodes are a paricular subses of he se of all nodes in he nework and is edges show how available informaion evolves over ime. This urns ou o be closely relaed o he srucure of he opimal decenralized conroller. For example, he number of Riccai difference equaions ha mus be propagaed is equal o he number of conneced componens in he informaion graph. As discussed in Secion V, many possible avenues for fuure research remain open. For example, some special cases wih noisy measuremens or correlaed noise have been solved, bu exensions o general direced graphs wih mixed sparsiy and delays have ye o be found. IX Acknowledgemens The firs auhor hanks John Doyle for very helpful discussions. The second auhor would like o hank Ashuosh Nayyar for some very helpful discussions. 0

11 References [] A. Gaami. Generalized linear quadraic conrol. IEEE Transacions on Auomaic Conrol, 55():3 36, 200. [2] M. Green and D. J. N. Limebeer. Linear Robus Conrol. Prenice Hall, 995. [3] Y.-C. Ho and K.-C. Chu. Team decision heory and informaion srucures in opimal conrol problems Par I. IEEE Transacions on Auomaic Conrol, 7():5 22, 972. [4] B.-Z. Kuraran and R. Sivan. Linear-quadraic- Gaussian conrol wih one-sep-delay sharing paern. IEEE Transacions on Auomaic Conrol, 9(5):57 574, 974. [5] A. Lamperski and J. C. Doyle. On he srucure of saefeedback LQG conrollers for disribued sysems wih communicaion delays. In IEEE Conference on Decision and Conrol, pages , 20. [6] A. Lamperski and J. C. Doyle. Dynamic programming soluions for decenralized sae-feedback LQG problems wih communicaion delays. In American Conrol Conference, pages , 202. [7] A. Lamperski and J. C. Doyle. Oupu feedback H 2 model maching for decenralized sysems wih delays. In American Conrol Conference, 203. To appear. [8] A. Lamperski and L. Lessard. Opimal sae-feedback conrol under sparsiy and delay consrains. In 3rd IFAC Workshop on Disribued Esimaion and Conrol in Neworked Sysems, pages , 202. [9] L. Lessard. Decenralized LQG conrol of sysems wih a broadcas archiecure. In IEEE Conference on Decision and Conrol, volume , 202. [0] L. Lessard. Opimal conrol of a fully decenralized quadraic regulaor. In Alleron Conference on Communicaion, Conrol, and Compuing, pages IEEE, 202. [] L. Lessard, M. Krisalny, and A. Ranzer. On srucured realizabiliy and sabilizabiliy of linear sysems. In American Conrol Conference, 203. To appear. [2] L. Lessard and S. Lall. Opimal conroller synhesis for he decenralized wo-player problem wih oupu feedback. In American Conrol Conference, pages , 202. [3] L. Lessard and S. Lall. Opimal conrol of wo-player sysems wih oupu feedback. ArXiv e-prins, Mar [4] L. Lessard and A. Nayyar. Srucural resuls and explici soluion for wo-player LQG sysems on a finie ime horizon. ArXiv e-prins, Mar [5] A. Nayyar, A. Mahajan, and D. Tenekezis. Opimal conrol sraegies in delayed sharing informaion srucures. IEEE Transacions on Auomaic Conrol, 56(7): , 20. [6] X. Qi, M. V. Salapaka, P. G. Voulgaris, and M. Khammash. Srucured opimal and robus conrol wih muliple crieria: a convex soluion. IEEE Transacions on Auomaic Conrol, 49(0): , [7] A. Ranzer. Linear quadraic eam heory revisied. In American Conrol Conference, pages , [8] A. Ranzer. A separaion principle for disribued conrol. In IEEE Conference on Decision and Conrol, pages , [9] M. Rokowiz and S. Lall. A characerizaion of convex problems in decenralized conrol. IEEE Transacions on Auomaic Conrol, 5(2): , [20] N. R. Sandell and M. Ahans. Soluion of some nonclassical LQG sochasic decision problems. IEEE Transacions on Auomaic Conrol, 9(2):08 6, 974. [2] P. Shah and P. A. Parrilo. H 2-opimal decenralized conrol over poses: A sae space soluion for saefeedback. In IEEE Conference on Decision and Conrol, pages , 200. [22] J. Swigar. Opimal Conroller Synhesis for Decenralized Sysems. PhD hesis, Sanford Universiy, 200. [23] J. Swigar and S. Lall. An explici dynamic programming soluion for a decenralized wo-player opimal linearquadraic regulaor. In Symposium on he Mahemaical Theory of Neworks, pages , 200. [24] J. Swigar and S. Lall. An explici sae-space soluion for a decenralized wo-player opimal linear-quadraic regulaor. In American Conrol Conference, pages , 200. [25] J. Tsisiklis and M. Ahans. On he complexiy of decenralized decision making and deecion problems. IEEE Transacions on Auomaic Conrol, 30(5): , 985. [26] A. S. M. Vamsi and N. Elia. Nework realizabiliy for inerconneced sysems over arbirary one-sep delay neworks. In American Conrol Conference, pages , 202. [27] P. Varaiya and J. Walrand. On delayed sharing paerns. IEEE Transacions on Auomaic Conrol, 23(3): , 978. [28] H. S. Wisenhausen. A counerexample in sochasic opimum conrol. SIAM Journal on Conrol, 6():3 47, 968. [29] T. Yoshikawa. Dynamic programming approach o decenralized sochasic conrol problems. IEEE Transacions on Auomaic Conrol, 20(6): , 975. [30] T. Yoshikawa and H. Kobayashi. Separaion of esimaion and conrol for decenralized sochasic conrol sysems. Auomaica, 4(6): , 978.

12 A Informaion Se Properies We now prove some useful properies of informaion ses. Lemma 0. The informaion ses (4) may be expressed recursively as follows, I i = (34) I i = { x j : j V, D ij = 0 } I j for 0. j V D ij Furhermore, suppose i, j V and 0 k. The following are equivalen. (i) I j k Ii (ii) x j k Ii (iii) D ij k Proof. We firs prove ha (i) (iii) are equivalen. (i) = (ii): I is immediae from (4) ha x j k Ij k. So i follows ha x j k Ii. (ii) = (iii): I follows from (4) ha if x j k Ii, hen we mus have 0 k D ij. Therefore D ij k as required. (iii) = (i): By he riangle inequaliy, D iα D jα D ij for any α V. Therefore, if D ij k, hen k D jα D iα. So for any l ha saisfies 0 l k D jα, we mus also have 0 l D iα. I follows from (4) ha I j k Ii. We now derive he recursive expression for I. i wih (4), which we rewrie here for convenience, I i = { x j k : j V, 0 k D ij}. Sar Pariion ino wo cases; when k = (which implies D ij = 0), and when k. Then, pariion furher based on he value of D ij. I i = { x j : D ij = 0 } { x j k : 0 k D ij, k } = { x j : j V, D ij = 0 } { x j k : j V, 0 k D ij, D ij = 0 } { x j k : j V, 0 k D ij, D ij }. and D lj + = D ij. Therefore, I i = { x j : j V, D ij = 0 } { x j k : j V, 0 k D ij, D ij = 0 } { x j k : j V, 0 k D } lj l i D il = = { x j : D ij = 0 } l V D il = { x j : j V, D ij = 0 } as required. B Proof of Lemma 6 { x j k : 0 k D } lj j V D ij I j, Suppose ha u j τ affecs I i in he simples way possible; namely ha u j τ affecs x l σ a a fuure imesep σ > τ via recursive applicaions of he sae equaions (), and x l σ I. i Then we have u j τ affecs x l σ = D lj σ τ (35) x l σ I i = D il σ. (36) Adding (35) (36) ogeher and using he riangle inequaliy, we obain D ij τ. By Lemma 0, i follows ha Iτ j I, i as required. If u j τ affecs I i via a more complicaed pah, apply he above argumen o each consecuive pair of inpus along he pah o obain he chain of inclusions Iτ j I. i Wih parial nesedness esablished, Lemma 5 implies ha here is a unique linear opimal conroller. In paricular, he opimal u i is a linear funcion of I. i The same is rivially rue of x i since we have x i I i from (4). Thus he firs par of (9) is verified. We proceed by inducion o prove he second par of (9). A = 0, we clearly have x 0, u 0 lin H 0, since I i = {x 0,..., x N 0 } = H 0. i V Now suppose ha for some 0, we have x, u lin H. Applying he sae equaions (2), i follows ha x + lin { H {w } } = lin H + u + lin i V I i + = lin { {x + } H } = lin H+, and he proof is complee. In he las erm, when D ij, i means ha here is a pah j i wih an aggregae delay of a leas ; so here exiss an inermediae node l i where D il = 2

13 C Proof of Lemma 7 Par (i). We proceed by inducion. A = 0, we have H 0 = {x 0,..., x N 0 }. Since each w i poins o exacly one elemen s U, i is clear from (20) ha {L s 0} s U pariions H 0. Now suppose ha {L s } s U pariions H for some 0. By Proposiion, for each r U here exiss a unique s U such ha r s. Therefore each elemen w i k H is conained in exacly one label se L s +. I follows from (20) ha {L s +} s U mus pariion H + and he proof is complee. Par (ii). Again, by inducion. A = 0, I i 0 = {x j 0 : D ij = 0} = {x j 0 : sj 0 i} = {x j 0 : wj s, s i} = s i L s 0. So he ideniy holds a = 0. Now suppose i holds for some 0. By Lemma 0, I+ i = {x j + : j V, D ij = 0} I j. (37) j V D ij Applying he sae equaions (), we have: x j + = ( Ajl x l + B jl u l) + w j. l V D jl By Lemma 6, he x l and u l erms belong o lin I l where D jl. Therefore, aking he lin of boh sides of (37), D Proof of Lemma 8 Suppose ζ s saisfies (27) (28). The recursive formulaion of he label ses (20) implies ha ζ s lin L s for all 0 and all s U. All ha remains o be shown is ha ζ s indeed provides decomposiion of x as in (25). The decomposiion of x is saisfied a = 0, since for all i V here is a unique s U wih w i s. Now assume inducively ha (25) holds for some 0. Therefore, I V,s ζ+ s s U = ( I V,s ( A sr ζ r + B sr ϕ r) + s U r s = s U I s,{i} w i ) ( ( AI V,r ζ r + BI V,r ϕ r) + ) I V,{i} w i r s ( AI V,s ζ s + BI V,s ϕ s) + w i = s U = Ax + Bu + w = x +, where we have subsiued he dynamics (27) in he firs sep, and he inducion hypohesis in he fourh sep. In he second sep, we ook advanage of he paricular sparsiy srucures of A and B, ha imply AI V,r = I V,s A sr and BI V,r = I V,s B sr. I follows ha (25) holds for +. lin I i + = lin{w j : j V, D ij = 0} + j V D ij = lin{w j : j V, D ij = 0} + j V D ij = lin s i L s +, lin I j lin r j L r where he second equaliy follows from he inducion hypohesis and he hird equaliy follows from he label se recursion (20). 3

Structural results for partially nested LQG systems over graphs

Structural results for partially nested LQG systems over graphs Srucural resuls for parially nesed LQG sysems over graphs Ashuosh Nayyar 1 Lauren Lessard 2 American Conrol Conference, pp. 5457 5464, 2015 Absrac We idenify a broad class of decenralized oupufeedback

More information

Optimal Control for LQG Systems on Graphs Part I: Structural Results

Optimal Control for LQG Systems on Graphs Part I: Structural Results Opimal Conrol for LQG Sysems on Graphs Par I: Srucural Resuls Ashuosh Nayyar Lauren Lessard Absrac In his wo-par paper, we idenify a broad class of decenralized oupu-feedback LQG sysems for which he opimal

More information

Decentralized Stochastic Control with Partial History Sharing: A Common Information Approach

Decentralized Stochastic Control with Partial History Sharing: A Common Information Approach 1 Decenralized Sochasic Conrol wih Parial Hisory Sharing: A Common Informaion Approach Ashuosh Nayyar, Adiya Mahajan and Demoshenis Tenekezis arxiv:1209.1695v1 [cs.sy] 8 Sep 2012 Absrac A general model

More information

T L. t=1. Proof of Lemma 1. Using the marginal cost accounting in Equation(4) and standard arguments. t )+Π RB. t )+K 1(Q RB

T L. t=1. Proof of Lemma 1. Using the marginal cost accounting in Equation(4) and standard arguments. t )+Π RB. t )+K 1(Q RB Elecronic Companion EC.1. Proofs of Technical Lemmas and Theorems LEMMA 1. Le C(RB) be he oal cos incurred by he RB policy. Then we have, T L E[C(RB)] 3 E[Z RB ]. (EC.1) Proof of Lemma 1. Using he marginal

More information

Finish reading Chapter 2 of Spivak, rereading earlier sections as necessary. handout and fill in some missing details!

Finish reading Chapter 2 of Spivak, rereading earlier sections as necessary. handout and fill in some missing details! MAT 257, Handou 6: Ocober 7-2, 20. I. Assignmen. Finish reading Chaper 2 of Spiva, rereading earlier secions as necessary. handou and fill in some missing deails! II. Higher derivaives. Also, read his

More information

10. State Space Methods

10. State Space Methods . Sae Space Mehods. Inroducion Sae space modelling was briefly inroduced in chaper. Here more coverage is provided of sae space mehods before some of heir uses in conrol sysem design are covered in he

More information

STATE-SPACE MODELLING. A mass balance across the tank gives:

STATE-SPACE MODELLING. A mass balance across the tank gives: B. Lennox and N.F. Thornhill, 9, Sae Space Modelling, IChemE Process Managemen and Conrol Subjec Group Newsleer STE-SPACE MODELLING Inroducion: Over he pas decade or so here has been an ever increasing

More information

Physics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle

Physics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle Chaper 2 Newonian Mechanics Single Paricle In his Chaper we will review wha Newon s laws of mechanics ell us abou he moion of a single paricle. Newon s laws are only valid in suiable reference frames,

More information

SOLUTIONS TO ECE 3084

SOLUTIONS TO ECE 3084 SOLUTIONS TO ECE 384 PROBLEM 2.. For each sysem below, specify wheher or no i is: (i) memoryless; (ii) causal; (iii) inverible; (iv) linear; (v) ime invarian; Explain your reasoning. If he propery is no

More information

Chapter 2. First Order Scalar Equations

Chapter 2. First Order Scalar Equations Chaper. Firs Order Scalar Equaions We sar our sudy of differenial equaions in he same way he pioneers in his field did. We show paricular echniques o solve paricular ypes of firs order differenial equaions.

More information

Lecture 20: Riccati Equations and Least Squares Feedback Control

Lecture 20: Riccati Equations and Least Squares Feedback Control 34-5 LINEAR SYSTEMS Lecure : Riccai Equaions and Leas Squares Feedback Conrol 5.6.4 Sae Feedback via Riccai Equaions A recursive approach in generaing he marix-valued funcion W ( ) equaion for i for he

More information

Lecture Notes 2. The Hilbert Space Approach to Time Series

Lecture Notes 2. The Hilbert Space Approach to Time Series Time Series Seven N. Durlauf Universiy of Wisconsin. Basic ideas Lecure Noes. The Hilber Space Approach o Time Series The Hilber space framework provides a very powerful language for discussing he relaionship

More information

An introduction to the theory of SDDP algorithm

An introduction to the theory of SDDP algorithm An inroducion o he heory of SDDP algorihm V. Leclère (ENPC) Augus 1, 2014 V. Leclère Inroducion o SDDP Augus 1, 2014 1 / 21 Inroducion Large scale sochasic problem are hard o solve. Two ways of aacking

More information

On Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems

On Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems MATHEMATICS OF OPERATIONS RESEARCH Vol. 38, No. 2, May 2013, pp. 209 227 ISSN 0364-765X (prin) ISSN 1526-5471 (online) hp://dx.doi.org/10.1287/moor.1120.0562 2013 INFORMS On Boundedness of Q-Learning Ieraes

More information

1 Review of Zero-Sum Games

1 Review of Zero-Sum Games COS 5: heoreical Machine Learning Lecurer: Rob Schapire Lecure #23 Scribe: Eugene Brevdo April 30, 2008 Review of Zero-Sum Games Las ime we inroduced a mahemaical model for wo player zero-sum games. Any

More information

Matrix Versions of Some Refinements of the Arithmetic-Geometric Mean Inequality

Matrix Versions of Some Refinements of the Arithmetic-Geometric Mean Inequality Marix Versions of Some Refinemens of he Arihmeic-Geomeric Mean Inequaliy Bao Qi Feng and Andrew Tonge Absrac. We esablish marix versions of refinemens due o Alzer ], Carwrigh and Field 4], and Mercer 5]

More information

5. Stochastic processes (1)

5. Stochastic processes (1) Lec05.pp S-38.45 - Inroducion o Teleraffic Theory Spring 2005 Conens Basic conceps Poisson process 2 Sochasic processes () Consider some quaniy in a eleraffic (or any) sysem I ypically evolves in ime randomly

More information

Notes for Lecture 17-18

Notes for Lecture 17-18 U.C. Berkeley CS278: Compuaional Complexiy Handou N7-8 Professor Luca Trevisan April 3-8, 2008 Noes for Lecure 7-8 In hese wo lecures we prove he firs half of he PCP Theorem, he Amplificaion Lemma, up

More information

23.2. Representing Periodic Functions by Fourier Series. Introduction. Prerequisites. Learning Outcomes

23.2. Representing Periodic Functions by Fourier Series. Introduction. Prerequisites. Learning Outcomes Represening Periodic Funcions by Fourier Series 3. Inroducion In his Secion we show how a periodic funcion can be expressed as a series of sines and cosines. We begin by obaining some sandard inegrals

More information

Signaling equilibria for dynamic LQG games with. asymmetric information

Signaling equilibria for dynamic LQG games with. asymmetric information Signaling equilibria for dynamic LQG games wih asymmeric informaion Deepanshu Vasal and Achilleas Anasasopoulos Absrac We consider a finie horizon dynamic game wih wo players who observe heir ypes privaely

More information

Notes on Kalman Filtering

Notes on Kalman Filtering Noes on Kalman Filering Brian Borchers and Rick Aser November 7, Inroducion Daa Assimilaion is he problem of merging model predicions wih acual measuremens of a sysem o produce an opimal esimae of he curren

More information

Linear Response Theory: The connection between QFT and experiments

Linear Response Theory: The connection between QFT and experiments Phys540.nb 39 3 Linear Response Theory: The connecion beween QFT and experimens 3.1. Basic conceps and ideas Q: How do we measure he conduciviy of a meal? A: we firs inroduce a weak elecric field E, and

More information

Some Ramsey results for the n-cube

Some Ramsey results for the n-cube Some Ramsey resuls for he n-cube Ron Graham Universiy of California, San Diego Jozsef Solymosi Universiy of Briish Columbia, Vancouver, Canada Absrac In his noe we esablish a Ramsey-ype resul for cerain

More information

Expert Advice for Amateurs

Expert Advice for Amateurs Exper Advice for Amaeurs Ernes K. Lai Online Appendix - Exisence of Equilibria The analysis in his secion is performed under more general payoff funcions. Wihou aking an explici form, he payoffs of he

More information

3.1.3 INTRODUCTION TO DYNAMIC OPTIMIZATION: DISCRETE TIME PROBLEMS. A. The Hamiltonian and First-Order Conditions in a Finite Time Horizon

3.1.3 INTRODUCTION TO DYNAMIC OPTIMIZATION: DISCRETE TIME PROBLEMS. A. The Hamiltonian and First-Order Conditions in a Finite Time Horizon 3..3 INRODUCION O DYNAMIC OPIMIZAION: DISCREE IME PROBLEMS A. he Hamilonian and Firs-Order Condiions in a Finie ime Horizon Define a new funcion, he Hamilonian funcion, H. H he change in he oal value of

More information

O Q L N. Discrete-Time Stochastic Dynamic Programming. I. Notation and basic assumptions. ε t : a px1 random vector of disturbances at time t.

O Q L N. Discrete-Time Stochastic Dynamic Programming. I. Notation and basic assumptions. ε t : a px1 random vector of disturbances at time t. Econ. 5b Spring 999 C. Sims Discree-Time Sochasic Dynamic Programming 995, 996 by Chrisopher Sims. This maerial may be freely reproduced for educaional and research purposes, so long as i is no alered,

More information

Inventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions

Inventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions Muli-Period Sochasic Models: Opimali of (s, S) Polic for -Convex Objecive Funcions Consider a seing similar o he N-sage newsvendor problem excep ha now here is a fixed re-ordering cos (> 0) for each (re-)order.

More information

Vehicle Arrival Models : Headway

Vehicle Arrival Models : Headway Chaper 12 Vehicle Arrival Models : Headway 12.1 Inroducion Modelling arrival of vehicle a secion of road is an imporan sep in raffic flow modelling. I has imporan applicaion in raffic flow simulaion where

More information

Two Coupled Oscillators / Normal Modes

Two Coupled Oscillators / Normal Modes Lecure 3 Phys 3750 Two Coupled Oscillaors / Normal Modes Overview and Moivaion: Today we ake a small, bu significan, sep owards wave moion. We will no ye observe waves, bu his sep is imporan in is own

More information

14 Autoregressive Moving Average Models

14 Autoregressive Moving Average Models 14 Auoregressive Moving Average Models In his chaper an imporan parameric family of saionary ime series is inroduced, he family of he auoregressive moving average, or ARMA, processes. For a large class

More information

CONTROL SYSTEMS, ROBOTICS AND AUTOMATION Vol. XI Control of Stochastic Systems - P.R. Kumar

CONTROL SYSTEMS, ROBOTICS AND AUTOMATION Vol. XI Control of Stochastic Systems - P.R. Kumar CONROL OF SOCHASIC SYSEMS P.R. Kumar Deparmen of Elecrical and Compuer Engineering, and Coordinaed Science Laboraory, Universiy of Illinois, Urbana-Champaign, USA. Keywords: Markov chains, ransiion probabiliies,

More information

Echocardiography Project and Finite Fourier Series

Echocardiography Project and Finite Fourier Series Echocardiography Projec and Finie Fourier Series 1 U M An echocardiagram is a plo of how a porion of he hear moves as he funcion of ime over he one or more hearbea cycles If he hearbea repeas iself every

More information

Timed Circuits. Asynchronous Circuit Design. Timing Relationships. A Simple Example. Timed States. Timing Sequences. ({r 6 },t6 = 1.

Timed Circuits. Asynchronous Circuit Design. Timing Relationships. A Simple Example. Timed States. Timing Sequences. ({r 6 },t6 = 1. Timed Circuis Asynchronous Circui Design Chris J. Myers Lecure 7: Timed Circuis Chaper 7 Previous mehods only use limied knowledge of delays. Very robus sysems, bu exremely conservaive. Large funcional

More information

Two Popular Bayesian Estimators: Particle and Kalman Filters. McGill COMP 765 Sept 14 th, 2017

Two Popular Bayesian Estimators: Particle and Kalman Filters. McGill COMP 765 Sept 14 th, 2017 Two Popular Bayesian Esimaors: Paricle and Kalman Filers McGill COMP 765 Sep 14 h, 2017 1 1 1, dx x Bel x u x P x z P Recall: Bayes Filers,,,,,,, 1 1 1 1 u z u x P u z u x z P Bayes z = observaion u =

More information

A Primal-Dual Type Algorithm with the O(1/t) Convergence Rate for Large Scale Constrained Convex Programs

A Primal-Dual Type Algorithm with the O(1/t) Convergence Rate for Large Scale Constrained Convex Programs PROC. IEEE CONFERENCE ON DECISION AND CONTROL, 06 A Primal-Dual Type Algorihm wih he O(/) Convergence Rae for Large Scale Consrained Convex Programs Hao Yu and Michael J. Neely Absrac This paper considers

More information

RANDOM LAGRANGE MULTIPLIERS AND TRANSVERSALITY

RANDOM LAGRANGE MULTIPLIERS AND TRANSVERSALITY ECO 504 Spring 2006 Chris Sims RANDOM LAGRANGE MULTIPLIERS AND TRANSVERSALITY 1. INTRODUCTION Lagrange muliplier mehods are sandard fare in elemenary calculus courses, and hey play a cenral role in economic

More information

Designing Information Devices and Systems I Spring 2019 Lecture Notes Note 17

Designing Information Devices and Systems I Spring 2019 Lecture Notes Note 17 EES 16A Designing Informaion Devices and Sysems I Spring 019 Lecure Noes Noe 17 17.1 apaciive ouchscreen In he las noe, we saw ha a capacior consiss of wo pieces on conducive maerial separaed by a nonconducive

More information

ODEs II, Lecture 1: Homogeneous Linear Systems - I. Mike Raugh 1. March 8, 2004

ODEs II, Lecture 1: Homogeneous Linear Systems - I. Mike Raugh 1. March 8, 2004 ODEs II, Lecure : Homogeneous Linear Sysems - I Mike Raugh March 8, 4 Inroducion. In he firs lecure we discussed a sysem of linear ODEs for modeling he excreion of lead from he human body, saw how o ransform

More information

t is a basis for the solution space to this system, then the matrix having these solutions as columns, t x 1 t, x 2 t,... x n t x 2 t...

t is a basis for the solution space to this system, then the matrix having these solutions as columns, t x 1 t, x 2 t,... x n t x 2 t... Mah 228- Fri Mar 24 5.6 Marix exponenials and linear sysems: The analogy beween firs order sysems of linear differenial equaions (Chaper 5) and scalar linear differenial equaions (Chaper ) is much sronger

More information

Georey E. Hinton. University oftoronto. Technical Report CRG-TR February 22, Abstract

Georey E. Hinton. University oftoronto.   Technical Report CRG-TR February 22, Abstract Parameer Esimaion for Linear Dynamical Sysems Zoubin Ghahramani Georey E. Hinon Deparmen of Compuer Science Universiy oftorono 6 King's College Road Torono, Canada M5S A4 Email: zoubin@cs.orono.edu Technical

More information

11!Hí MATHEMATICS : ERDŐS AND ULAM PROC. N. A. S. of decomposiion, properly speaking) conradics he possibiliy of defining a counably addiive real-valu

11!Hí MATHEMATICS : ERDŐS AND ULAM PROC. N. A. S. of decomposiion, properly speaking) conradics he possibiliy of defining a counably addiive real-valu ON EQUATIONS WITH SETS AS UNKNOWNS BY PAUL ERDŐS AND S. ULAM DEPARTMENT OF MATHEMATICS, UNIVERSITY OF COLORADO, BOULDER Communicaed May 27, 1968 We shall presen here a number of resuls in se heory concerning

More information

R t. C t P t. + u t. C t = αp t + βr t + v t. + β + w t

R t. C t P t. + u t. C t = αp t + βr t + v t. + β + w t Exercise 7 C P = α + β R P + u C = αp + βr + v (a) (b) C R = α P R + β + w (c) Assumpions abou he disurbances u, v, w : Classical assumions on he disurbance of one of he equaions, eg. on (b): E(v v s P,

More information

Approximation Algorithms for Unique Games via Orthogonal Separators

Approximation Algorithms for Unique Games via Orthogonal Separators Approximaion Algorihms for Unique Games via Orhogonal Separaors Lecure noes by Konsanin Makarychev. Lecure noes are based on he papers [CMM06a, CMM06b, LM4]. Unique Games In hese lecure noes, we define

More information

EE363 homework 1 solutions

EE363 homework 1 solutions EE363 Prof. S. Boyd EE363 homework 1 soluions 1. LQR for a riple accumulaor. We consider he sysem x +1 = Ax + Bu, y = Cx, wih 1 1 A = 1 1, B =, C = [ 1 ]. 1 1 This sysem has ransfer funcion H(z) = (z 1)

More information

Modal identification of structures from roving input data by means of maximum likelihood estimation of the state space model

Modal identification of structures from roving input data by means of maximum likelihood estimation of the state space model Modal idenificaion of srucures from roving inpu daa by means of maximum likelihood esimaion of he sae space model J. Cara, J. Juan, E. Alarcón Absrac The usual way o perform a forced vibraion es is o fix

More information

GMM - Generalized Method of Moments

GMM - Generalized Method of Moments GMM - Generalized Mehod of Momens Conens GMM esimaion, shor inroducion 2 GMM inuiion: Maching momens 2 3 General overview of GMM esimaion. 3 3. Weighing marix...........................................

More information

Lecture 2-1 Kinematics in One Dimension Displacement, Velocity and Acceleration Everything in the world is moving. Nothing stays still.

Lecture 2-1 Kinematics in One Dimension Displacement, Velocity and Acceleration Everything in the world is moving. Nothing stays still. Lecure - Kinemaics in One Dimension Displacemen, Velociy and Acceleraion Everyhing in he world is moving. Nohing says sill. Moion occurs a all scales of he universe, saring from he moion of elecrons in

More information

Course Notes for EE227C (Spring 2018): Convex Optimization and Approximation

Course Notes for EE227C (Spring 2018): Convex Optimization and Approximation Course Noes for EE7C Spring 018: Convex Opimizaion and Approximaion Insrucor: Moriz Hard Email: hard+ee7c@berkeley.edu Graduae Insrucor: Max Simchowiz Email: msimchow+ee7c@berkeley.edu Ocober 15, 018 3

More information

Zürich. ETH Master Course: L Autonomous Mobile Robots Localization II

Zürich. ETH Master Course: L Autonomous Mobile Robots Localization II Roland Siegwar Margaria Chli Paul Furgale Marco Huer Marin Rufli Davide Scaramuzza ETH Maser Course: 151-0854-00L Auonomous Mobile Robos Localizaion II ACT and SEE For all do, (predicion updae / ACT),

More information

15. Vector Valued Functions

15. Vector Valued Functions 1. Vecor Valued Funcions Up o his poin, we have presened vecors wih consan componens, for example, 1, and,,4. However, we can allow he componens of a vecor o be funcions of a common variable. For example,

More information

Scheduling of Crude Oil Movements at Refinery Front-end

Scheduling of Crude Oil Movements at Refinery Front-end Scheduling of Crude Oil Movemens a Refinery Fron-end Ramkumar Karuppiah and Ignacio Grossmann Carnegie Mellon Universiy ExxonMobil Case Sudy: Dr. Kevin Furman Enerprise-wide Opimizaion Projec March 15,

More information

Let us start with a two dimensional case. We consider a vector ( x,

Let us start with a two dimensional case. We consider a vector ( x, Roaion marices We consider now roaion marices in wo and hree dimensions. We sar wih wo dimensions since wo dimensions are easier han hree o undersand, and one dimension is a lile oo simple. However, our

More information

Longest Common Prefixes

Longest Common Prefixes Longes Common Prefixes The sandard ordering for srings is he lexicographical order. I is induced by an order over he alphabe. We will use he same symbols (,

More information

Sliding Mode Extremum Seeking Control for Linear Quadratic Dynamic Game

Sliding Mode Extremum Seeking Control for Linear Quadratic Dynamic Game Sliding Mode Exremum Seeking Conrol for Linear Quadraic Dynamic Game Yaodong Pan and Ümi Özgüner ITS Research Group, AIST Tsukuba Eas Namiki --, Tsukuba-shi,Ibaraki-ken 5-856, Japan e-mail: pan.yaodong@ais.go.jp

More information

This document was generated at 1:04 PM, 09/10/13 Copyright 2013 Richard T. Woodward. 4. End points and transversality conditions AGEC

This document was generated at 1:04 PM, 09/10/13 Copyright 2013 Richard T. Woodward. 4. End points and transversality conditions AGEC his documen was generaed a 1:4 PM, 9/1/13 Copyrigh 213 Richard. Woodward 4. End poins and ransversaliy condiions AGEC 637-213 F z d Recall from Lecure 3 ha a ypical opimal conrol problem is o maimize (,,

More information

Chapter 3 Boundary Value Problem

Chapter 3 Boundary Value Problem Chaper 3 Boundary Value Problem A boundary value problem (BVP) is a problem, ypically an ODE or a PDE, which has values assigned on he physical boundary of he domain in which he problem is specified. Le

More information

GENERALIZATION OF THE FORMULA OF FAA DI BRUNO FOR A COMPOSITE FUNCTION WITH A VECTOR ARGUMENT

GENERALIZATION OF THE FORMULA OF FAA DI BRUNO FOR A COMPOSITE FUNCTION WITH A VECTOR ARGUMENT Inerna J Mah & Mah Sci Vol 4, No 7 000) 48 49 S0670000970 Hindawi Publishing Corp GENERALIZATION OF THE FORMULA OF FAA DI BRUNO FOR A COMPOSITE FUNCTION WITH A VECTOR ARGUMENT RUMEN L MISHKOV Received

More information

Algebraic Attacks on Summation Generators

Algebraic Attacks on Summation Generators Algebraic Aacks on Summaion Generaors Dong Hoon Lee, Jaeheon Kim, Jin Hong, Jae Woo Han, and Dukjae Moon Naional Securiy Research Insiue 161 Gajeong-dong, Yuseong-gu, Daejeon, 305-350, Korea {dlee,jaeheon,jinhong,jwhan,djmoon}@eri.re.kr

More information

Optimality Conditions for Unconstrained Problems

Optimality Conditions for Unconstrained Problems 62 CHAPTER 6 Opimaliy Condiions for Unconsrained Problems 1 Unconsrained Opimizaion 11 Exisence Consider he problem of minimizing he funcion f : R n R where f is coninuous on all of R n : P min f(x) x

More information

Adaptation and Synchronization over a Network: stabilization without a reference model

Adaptation and Synchronization over a Network: stabilization without a reference model Adapaion and Synchronizaion over a Nework: sabilizaion wihou a reference model Travis E. Gibson (gibson@mi.edu) Harvard Medical School Deparmen of Pahology, Brigham and Women s Hospial 55 h Conference

More information

PENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD

PENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD PENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD HAN XIAO 1. Penalized Leas Squares Lasso solves he following opimizaion problem, ˆβ lasso = arg max β R p+1 1 N y i β 0 N x ij β j β j (1.1) for some 0.

More information

Online Convex Optimization Example And Follow-The-Leader

Online Convex Optimization Example And Follow-The-Leader CSE599s, Spring 2014, Online Learning Lecure 2-04/03/2014 Online Convex Opimizaion Example And Follow-The-Leader Lecurer: Brendan McMahan Scribe: Sephen Joe Jonany 1 Review of Online Convex Opimizaion

More information

Technical Report Doc ID: TR March-2013 (Last revision: 23-February-2016) On formulating quadratic functions in optimization models.

Technical Report Doc ID: TR March-2013 (Last revision: 23-February-2016) On formulating quadratic functions in optimization models. Technical Repor Doc ID: TR--203 06-March-203 (Las revision: 23-Februar-206) On formulaing quadraic funcions in opimizaion models. Auhor: Erling D. Andersen Convex quadraic consrains quie frequenl appear

More information

Chapter 6. Systems of First Order Linear Differential Equations

Chapter 6. Systems of First Order Linear Differential Equations Chaper 6 Sysems of Firs Order Linear Differenial Equaions We will only discuss firs order sysems However higher order sysems may be made ino firs order sysems by a rick shown below We will have a sligh

More information

SZG Macro 2011 Lecture 3: Dynamic Programming. SZG macro 2011 lecture 3 1

SZG Macro 2011 Lecture 3: Dynamic Programming. SZG macro 2011 lecture 3 1 SZG Macro 2011 Lecure 3: Dynamic Programming SZG macro 2011 lecure 3 1 Background Our previous discussion of opimal consumpion over ime and of opimal capial accumulaion sugges sudying he general decision

More information

di Bernardo, M. (1995). A purely adaptive controller to synchronize and control chaotic systems.

di Bernardo, M. (1995). A purely adaptive controller to synchronize and control chaotic systems. di ernardo, M. (995). A purely adapive conroller o synchronize and conrol chaoic sysems. hps://doi.org/.6/375-96(96)8-x Early version, also known as pre-prin Link o published version (if available):.6/375-96(96)8-x

More information

1. An introduction to dynamic optimization -- Optimal Control and Dynamic Programming AGEC

1. An introduction to dynamic optimization -- Optimal Control and Dynamic Programming AGEC This documen was generaed a :45 PM 8/8/04 Copyrigh 04 Richard T. Woodward. An inroducion o dynamic opimizaion -- Opimal Conrol and Dynamic Programming AGEC 637-04 I. Overview of opimizaion Opimizaion is

More information

We just finished the Erdős-Stone Theorem, and ex(n, F ) (1 1/(χ(F ) 1)) ( n

We just finished the Erdős-Stone Theorem, and ex(n, F ) (1 1/(χ(F ) 1)) ( n Lecure 3 - Kövari-Sós-Turán Theorem Jacques Versraëe jacques@ucsd.edu We jus finished he Erdős-Sone Theorem, and ex(n, F ) ( /(χ(f ) )) ( n 2). So we have asympoics when χ(f ) 3 bu no when χ(f ) = 2 i.e.

More information

Online Appendix to Solution Methods for Models with Rare Disasters

Online Appendix to Solution Methods for Models with Rare Disasters Online Appendix o Soluion Mehods for Models wih Rare Disasers Jesús Fernández-Villaverde and Oren Levinal In his Online Appendix, we presen he Euler condiions of he model, we develop he pricing Calvo block,

More information

Math 10B: Mock Mid II. April 13, 2016

Math 10B: Mock Mid II. April 13, 2016 Name: Soluions Mah 10B: Mock Mid II April 13, 016 1. ( poins) Sae, wih jusificaion, wheher he following saemens are rue or false. (a) If a 3 3 marix A saisfies A 3 A = 0, hen i canno be inverible. True.

More information

Chapter 7: Solving Trig Equations

Chapter 7: Solving Trig Equations Haberman MTH Secion I: The Trigonomeric Funcions Chaper 7: Solving Trig Equaions Le s sar by solving a couple of equaions ha involve he sine funcion EXAMPLE a: Solve he equaion sin( ) The inverse funcions

More information

L07. KALMAN FILTERING FOR NON-LINEAR SYSTEMS. NA568 Mobile Robotics: Methods & Algorithms

L07. KALMAN FILTERING FOR NON-LINEAR SYSTEMS. NA568 Mobile Robotics: Methods & Algorithms L07. KALMAN FILTERING FOR NON-LINEAR SYSTEMS NA568 Mobile Roboics: Mehods & Algorihms Today s Topic Quick review on (Linear) Kalman Filer Kalman Filering for Non-Linear Sysems Exended Kalman Filer (EKF)

More information

I. Introduction to place/transition nets. Place/Transition Nets I. Example: a vending machine. Example: a vending machine

I. Introduction to place/transition nets. Place/Transition Nets I. Example: a vending machine. Example: a vending machine Inroducory Tuorial I. Inroducion o place/ransiion nes Place/Transiion Nes I Prepared by: Jörg Desel, Caholic Universiy in Eichsä and Karsen Schmid, Humbold-Universiä zu Berlin Speaker: Wolfgang Reisig,

More information

Hierarchical Design of Decentralized Receding Horizon Controllers for Decoupled Systems

Hierarchical Design of Decentralized Receding Horizon Controllers for Decoupled Systems Hierarchical Design of Decenralized Receding Horizon Conrollers for Decoupled Sysems Tamás Keviczy, Francesco Borrelli, Gary J. Balas* Absrac We consider a se of decoupled dynamical sysems and an opimal

More information

THE MATRIX-TREE THEOREM

THE MATRIX-TREE THEOREM THE MATRIX-TREE THEOREM 1 The Marix-Tree Theorem. The Marix-Tree Theorem is a formula for he number of spanning rees of a graph in erms of he deerminan of a cerain marix. We begin wih he necessary graph-heoreical

More information

Appendix 14.1 The optimal control problem and its solution using

Appendix 14.1 The optimal control problem and its solution using 1 Appendix 14.1 he opimal conrol problem and is soluion using he maximum principle NOE: Many occurrences of f, x, u, and in his file (in equaions or as whole words in ex) are purposefully in bold in order

More information

Signal and System (Chapter 3. Continuous-Time Systems)

Signal and System (Chapter 3. Continuous-Time Systems) Signal and Sysem (Chaper 3. Coninuous-Time Sysems) Prof. Kwang-Chun Ho kwangho@hansung.ac.kr Tel: 0-760-453 Fax:0-760-4435 1 Dep. Elecronics and Informaion Eng. 1 Nodes, Branches, Loops A nework wih b

More information

Guest Lectures for Dr. MacFarlane s EE3350 Part Deux

Guest Lectures for Dr. MacFarlane s EE3350 Part Deux Gues Lecures for Dr. MacFarlane s EE3350 Par Deux Michael Plane Mon., 08-30-2010 Wrie name in corner. Poin ou his is a review, so I will go faser. Remind hem o go lisen o online lecure abou geing an A

More information

Reading from Young & Freedman: For this topic, read sections 25.4 & 25.5, the introduction to chapter 26 and sections 26.1 to 26.2 & 26.4.

Reading from Young & Freedman: For this topic, read sections 25.4 & 25.5, the introduction to chapter 26 and sections 26.1 to 26.2 & 26.4. PHY1 Elecriciy Topic 7 (Lecures 1 & 11) Elecric Circuis n his opic, we will cover: 1) Elecromoive Force (EMF) ) Series and parallel resisor combinaions 3) Kirchhoff s rules for circuis 4) Time dependence

More information

ACE 562 Fall Lecture 5: The Simple Linear Regression Model: Sampling Properties of the Least Squares Estimators. by Professor Scott H.

ACE 562 Fall Lecture 5: The Simple Linear Regression Model: Sampling Properties of the Least Squares Estimators. by Professor Scott H. ACE 56 Fall 005 Lecure 5: he Simple Linear Regression Model: Sampling Properies of he Leas Squares Esimaors by Professor Sco H. Irwin Required Reading: Griffihs, Hill and Judge. "Inference in he Simple

More information

KINEMATICS IN ONE DIMENSION

KINEMATICS IN ONE DIMENSION KINEMATICS IN ONE DIMENSION PREVIEW Kinemaics is he sudy of how hings move how far (disance and displacemen), how fas (speed and velociy), and how fas ha how fas changes (acceleraion). We say ha an objec

More information

2.7. Some common engineering functions. Introduction. Prerequisites. Learning Outcomes

2.7. Some common engineering functions. Introduction. Prerequisites. Learning Outcomes Some common engineering funcions 2.7 Inroducion This secion provides a caalogue of some common funcions ofen used in Science and Engineering. These include polynomials, raional funcions, he modulus funcion

More information

The expectation value of the field operator.

The expectation value of the field operator. The expecaion value of he field operaor. Dan Solomon Universiy of Illinois Chicago, IL dsolom@uic.edu June, 04 Absrac. Much of he mahemaical developmen of quanum field heory has been in suppor of deermining

More information

Econ107 Applied Econometrics Topic 7: Multicollinearity (Studenmund, Chapter 8)

Econ107 Applied Econometrics Topic 7: Multicollinearity (Studenmund, Chapter 8) I. Definiions and Problems A. Perfec Mulicollineariy Econ7 Applied Economerics Topic 7: Mulicollineariy (Sudenmund, Chaper 8) Definiion: Perfec mulicollineariy exiss in a following K-variable regression

More information

The Arcsine Distribution

The Arcsine Distribution The Arcsine Disribuion Chris H. Rycrof Ocober 6, 006 A common heme of he class has been ha he saisics of single walker are ofen very differen from hose of an ensemble of walkers. On he firs homework, we

More information

Computer-Aided Analysis of Electronic Circuits Course Notes 3

Computer-Aided Analysis of Electronic Circuits Course Notes 3 Gheorghe Asachi Technical Universiy of Iasi Faculy of Elecronics, Telecommunicaions and Informaion Technologies Compuer-Aided Analysis of Elecronic Circuis Course Noes 3 Bachelor: Telecommunicaion Technologies

More information

MATH 5720: Gradient Methods Hung Phan, UMass Lowell October 4, 2018

MATH 5720: Gradient Methods Hung Phan, UMass Lowell October 4, 2018 MATH 5720: Gradien Mehods Hung Phan, UMass Lowell Ocober 4, 208 Descen Direcion Mehods Consider he problem min { f(x) x R n}. The general descen direcions mehod is x k+ = x k + k d k where x k is he curren

More information

Diebold, Chapter 7. Francis X. Diebold, Elements of Forecasting, 4th Edition (Mason, Ohio: Cengage Learning, 2006). Chapter 7. Characterizing Cycles

Diebold, Chapter 7. Francis X. Diebold, Elements of Forecasting, 4th Edition (Mason, Ohio: Cengage Learning, 2006). Chapter 7. Characterizing Cycles Diebold, Chaper 7 Francis X. Diebold, Elemens of Forecasing, 4h Ediion (Mason, Ohio: Cengage Learning, 006). Chaper 7. Characerizing Cycles Afer compleing his reading you should be able o: Define covariance

More information

Lecture 4 Notes (Little s Theorem)

Lecture 4 Notes (Little s Theorem) Lecure 4 Noes (Lile s Theorem) This lecure concerns one of he mos imporan (and simples) heorems in Queuing Theory, Lile s Theorem. More informaion can be found in he course book, Bersekas & Gallagher,

More information

EXERCISES FOR SECTION 1.5

EXERCISES FOR SECTION 1.5 1.5 Exisence and Uniqueness of Soluions 43 20. 1 v c 21. 1 v c 1 2 4 6 8 10 1 2 2 4 6 8 10 Graph of approximae soluion obained using Euler s mehod wih = 0.1. Graph of approximae soluion obained using Euler

More information

Random Walk with Anti-Correlated Steps

Random Walk with Anti-Correlated Steps Random Walk wih Ani-Correlaed Seps John Noga Dirk Wagner 2 Absrac We conjecure he expeced value of random walks wih ani-correlaed seps o be exacly. We suppor his conjecure wih 2 plausibiliy argumens and

More information

Optimal Server Assignment in Multi-Server

Optimal Server Assignment in Multi-Server Opimal Server Assignmen in Muli-Server 1 Queueing Sysems wih Random Conneciviies Hassan Halabian, Suden Member, IEEE, Ioannis Lambadaris, Member, IEEE, arxiv:1112.1178v2 [mah.oc] 21 Jun 2013 Yannis Viniois,

More information

Unsteady Flow Problems

Unsteady Flow Problems School of Mechanical Aerospace and Civil Engineering Unseady Flow Problems T. J. Craf George Begg Building, C41 TPFE MSc CFD-1 Reading: J. Ferziger, M. Peric, Compuaional Mehods for Fluid Dynamics H.K.

More information

Logic in computer science

Logic in computer science Logic in compuer science Logic plays an imporan role in compuer science Logic is ofen called he calculus of compuer science Logic plays a similar role in compuer science o ha played by calculus in he physical

More information

In this chapter the model of free motion under gravity is extended to objects projected at an angle. When you have completed it, you should

In this chapter the model of free motion under gravity is extended to objects projected at an angle. When you have completed it, you should Cambridge Universiy Press 978--36-60033-7 Cambridge Inernaional AS and A Level Mahemaics: Mechanics Coursebook Excerp More Informaion Chaper The moion of projeciles In his chaper he model of free moion

More information

2.160 System Identification, Estimation, and Learning. Lecture Notes No. 8. March 6, 2006

2.160 System Identification, Estimation, and Learning. Lecture Notes No. 8. March 6, 2006 2.160 Sysem Idenificaion, Esimaion, and Learning Lecure Noes No. 8 March 6, 2006 4.9 Eended Kalman Filer In many pracical problems, he process dynamics are nonlinear. w Process Dynamics v y u Model (Linearized)

More information

Ordinary Differential Equations

Ordinary Differential Equations Ordinary Differenial Equaions 5. Examples of linear differenial equaions and heir applicaions We consider some examples of sysems of linear differenial equaions wih consan coefficiens y = a y +... + a

More information

4 Sequences of measurable functions

4 Sequences of measurable functions 4 Sequences of measurable funcions 1. Le (Ω, A, µ) be a measure space (complee, afer a possible applicaion of he compleion heorem). In his chaper we invesigae relaions beween various (nonequivalen) convergences

More information

SPECTRAL EVOLUTION OF A ONE PARAMETER EXTENSION OF A REAL SYMMETRIC TOEPLITZ MATRIX* William F. Trench. SIAM J. Matrix Anal. Appl. 11 (1990),

SPECTRAL EVOLUTION OF A ONE PARAMETER EXTENSION OF A REAL SYMMETRIC TOEPLITZ MATRIX* William F. Trench. SIAM J. Matrix Anal. Appl. 11 (1990), SPECTRAL EVOLUTION OF A ONE PARAMETER EXTENSION OF A REAL SYMMETRIC TOEPLITZ MATRIX* William F Trench SIAM J Marix Anal Appl 11 (1990), 601-611 Absrac Le T n = ( i j ) n i,j=1 (n 3) be a real symmeric

More information

SUFFICIENT CONDITIONS FOR EXISTENCE SOLUTION OF LINEAR TWO-POINT BOUNDARY PROBLEM IN MINIMIZATION OF QUADRATIC FUNCTIONAL

SUFFICIENT CONDITIONS FOR EXISTENCE SOLUTION OF LINEAR TWO-POINT BOUNDARY PROBLEM IN MINIMIZATION OF QUADRATIC FUNCTIONAL HE PUBLISHING HOUSE PROCEEDINGS OF HE ROMANIAN ACADEMY, Series A, OF HE ROMANIAN ACADEMY Volume, Number 4/200, pp 287 293 SUFFICIEN CONDIIONS FOR EXISENCE SOLUION OF LINEAR WO-POIN BOUNDARY PROBLEM IN

More information