Network Utility Maximization over Partially Observable Markovian Channels

Size: px
Start display at page:

Download "Network Utility Maximization over Partially Observable Markovian Channels"

Transcription

1 WIOPT Nework Uiliy Maximizaion over Parially Observable Markovian Channels Chih-ping Li, Suden Member, IEEE and Michael J. Neely, Senior Member, IEEE Absrac This paper considers maximizing hroughpu uiliy in a muli-user nework wih parially observable Markov ON/OFF channels. Insananeous channel saes are never known, and all conrol decisions are based on informaion provided by ACK/NACK feedback from pas ransmissions. This sysem can be viewed as a resless muli-armed bandi problem wih a concave objecive funcion of he ime average reward vecor. Such problems are generally inracable. However, we provide an approximae soluion by opimizing he concave objecive over a non-rivial inner bound on he nework performance region, where he inner bound is consruced by randomizing welldesigned saionary policies. Using a new frame-based Lyapunov drif argumen, we design a policy of admission conrol and channel selecion ha sabilizes he nework wih hroughpu uiliy ha can be made arbirarily close o he opimal in he inner performance region. Our problem has applicaions in limied channel probing in wireless neworks, dynamic specrum access in cogniive radio neworks, and arge racking of unmanned aerial vehicles. Our analysis generalizes he MaxWeigh-ype scheduling policies in sochasic nework opimizaion heory from ime-sloed sysems o frame-based sysems ha have policy-dependen frame sizes. I. INTRODUCTION This paper sudies a muli-user wireless scheduling problem in a parially observable environmen. We consider a base saion serving N users via N independen Markov ON/OFF channels see Fig. 1). Time is sloed wih normalized slos Fig. 1. P n,10 P n,11 ON1) OFF0) P n,00 P n,01 The Markov ON/OFF model for channel n {1, 2,..., N}. Z +. Channel saes are fixed in every slo, and can only change a slo boundaries. Suppose he base saion has unlimied daa o send for all users. In every slo, he channel saes are unknown, and he base saion selecs a mos one user o which i blindly ransmis a packe. The ransmission succeeds if he used channel is ON, and fails oherwise. A he end of a slo, an error-free ACK is fed back from he Chih-ping Li web: hp://www-scf.usc.edu/ chihpinl) and Michael J. Neely web: hp://www-rcf.usc.edu/ mjneely) are wih he Deparmen of Elecrical Engineering, Universiy of Souhern California, Los Angeles, CA 90089, USA. This maerial is suppored in par by one or more of he following: he DARPA IT-MANET gran W911NF , he NSF Career gran CCF , and he Nework Science Collaboraive Technology Alliance sponsored by he U.S. Army Research Laboraory. served user o he base saion over an independen conrol channel absence of an ACK is considered as a NACK). Since channels are ON/OFF and correlaed over ime, he ACK/NACK feedback provides parial informaion of fuure channel saes, and can improve fuure scheduling decisions for beer performance. The goal is o design a conrol policy ha maximizes a concave uiliy funcion of he hroughpu vecor from all channels. Specifically, le y n ) be he number of packes delivered o user n {1,..., N} in slo. Define 1 E y nτ) as he hroughpu of user 1 y n lim n. Le Λ denoe he nework capaciy region, defined as he closure of he se of all achievable hroughpu vecors y y n ) N. The goal is o solve: maximize: gy), subjec o: y Λ, 1) where g ) is a concave, coninuous, nonnegaive, and nondecreasing funcion. The ineres in he above problem comes from is many applicaions. One applicaion is limied channel probing over wireless neworks. Consider he same wireless downlink as saed above, excep ha a mos one channel is explicily probed in every slo. A packe is served over he probed channel if he sae is ON. This seup is essenially he same as our original problem, excep ha channels are probed differenly implici probing by ACK/NACK feedback versus probing by explici signaling). The moivaion for sudying limied channel probing is ha, in a fas-changing environmen where full channel probing may be infeasible due o iming overhead, we shall probe channels wisely and exploi channel memory o improve nework performance. As an example of 1), we may addiionally provide fairness o all users, such as a varian of rae proporional fairness 1, 2 by solving: maximize: gy) = N log 1+y n ), subjec o: y Λ. 2) In cogniive radio neworks 3, 4, a secondary user has access o a collecion of Markov ON/OFF channels. Every channel reflecs he occupancy of a specrum by a primary user, and he secondary user opporunisically ransmis daa over unused specrums for beer specrum efficiency. In arge racking of unmanned aerial vehicles UAVs) 5, a UAV deecs one of he many arges in every slo. Every Markov channel reflecs he movemen of a arge; a channel is ON if is associaed arge moves o a hospo, and OFF oherwise. Delivering a packe over a channel represens gaining a reward by locaing a arge a is hospo. In he above wo applicaions, possible goals include maximizing a weighed

2 WIOPT sum gy) = N c ny n of hroughpus/ime-average rewards, where c n are given consans, or providing fairness o differen specrums/arges by solving 2). The problem 1) is challenging because he curren informaion available for each channel depends on he pas ransmission decisions. This problem belongs o he class of resless muli-armed bandi RMAB) problems 6, which are generally inracable 7. In addiion, he nework capaciy region Λ in 1) does no seem o have a closed form expression see 8 for more discussions). Therefore we mus resor o approximaion mehods o solve 1). In his paper, we propose an achievable region approach o consruc an approximae soluion o 1). There are wo seps: i) we consruc a good inner performance region Λ in Λ for he original problem, hen ii) we solve he consrained problem: maximize: gy), subjec o: y Λ in, 3) which serves as an approximaion o he original problem 1). In previous work 8, we have consruced a non-rivial inner performance region Λ in using he rich srucure of Markov channels see Secion III for deails). The inner performance region Λ in is rendered as a convex hull of performance vecors of a well-designed collecion of round robin policies. The ighness of he inner region Λ in see Fig. 2 for an example) is analyzed in 8 when channels are saisically idenical. In his special case we show ha he gap beween he boundary of he inner region Λ in and ha of he full performance region Λ decreases o zero geomerically fas as he reference direcion moves closer o he 45-degree angle. 1 The main conribuion of his paper is wih respec o he second sep of he achievable region approach: Given an inner performance region Λ in, we consruc a policy ha solves 3) using Lyapunov drif heory. Lyapunov drif heory is originally developed for hroughpu opimal conrol over ime-sloed wireless neworks 9, 10, laer exended o opimize various performance objecives such as average power 11 or rae uiliy funcions 1, 12, 13 in wireless neworks, and recenly generalized o opimize dynamic sysems ha have a renewal srucure The inuiion is he following. Since he performance region Λ in is a convex hull of performance vecors of he round robin policies we design, he problem 3) is solved by some random mixure of hese policies. Using Lyapunov drif heory see more deails in Secion IV), we greedily consruc a sequence of round robin policies whose long-erm ime sharing can approximae he opimal soluion as close as desired, wih some radeoffs discussed laer. Our conrol policy ha solves 3) has wo componens. To faciliae he soluion o 3), we keep an infinie-capaciy queue for every user a he base saion, and design an admission 1 We remark ha he ighness of he inner region Λ in is difficul o check in general cases, alhough he region is inuiively large by he naure of is consrucion. The boomline is, consrucing an inuiively good and easily achievable inner performance region is of pracical ineres, because saisfying performance ouside he inner region may ineviably involve solving much more complicaed parially observable Markov decision processes. From his view, in inracable RMAB problems, we may regard an inner performance region as an operaional performance region, which shall be gradually improved by a deeper invesigaion ino he problem srucure. conrol algorihm ha admis daa ino he queues for evenual ransmissions. In every slo, he amoun of daa admied for every user is decided by solving a simple convex program. 2 In addiion, he base saion deploys a sequence of round robin policies implemened frame by frame, where every frame is one round of execuion by a round robin policy. The round robin policy used in every frame is seleced by maximizing an average drif minus reward raio over he average frame size c.f. 18)). We emphasize ha his new raio rule generalizes he MaxWeigh-ype policies 1, 10 for sochasic nework opimizaion from ime-sloed wireless neworks o framebased sysems in which he disribuion of he random frame size is policy-dependen. We prove ha he above policy of admission conrol and channel scheduling yields a hroughpu vecor y saisfying gy) gy ) B V, 4) where gy ) is he opimal objecive of problem 3), B > 0 is a finie consan, V > 0 is a predefined conrol parameer, and we emporarily assume ha all limis exis. By choosing V sufficienly large in 4), he performance uiliy gy) can be made arbirarily close o he opimal gy ), wih he radeoff ha he average queue size a he base saion grows linearly wih V. We remark ha he proof of 4) does no require he knowledge of he opimal uiliy gy ). In he lieraure, sochasic uiliy maximizaion over wireless neworks is solved in 1, 12, assuming ha channel saes are i.i.d. over slos and are known perfecly and insanly. Limied channel probing in wireless neworks is sudied in differen conexs in 17 22, also assuming ha channel saes are i.i.d. over ime. This paper generalizes he framework in 1 o wireless neworks wih limied channel probing and ime-correlaed channels, and uses channel memory o improve performance. RMAB problems wih Markov ON/OFF projecs are previously sudied in for he maximizaion of sum of ime average or discouned rewards. In paricular, work shows ha greedy round robin policies are opimal in some special cases; we modify hese policies in 8 for he consrucion of a racable inner performance region Λ in. Index policies such While s index 6 are consruced in 26, 27, and are shown o have good performance by simulaions. A 2 + ɛ)-approximae algorihm is derived in 28 based on dualiy mehods. This paper provides a new mahemaical programming mehod for opimizing nonlinear objecive funcions of ime average rewards in RMAB problems. In he lieraure RMAB problems are mosly sudied wih linear objecive funcions. The wo popular mehods for linear RMABs While s index 6 and parially observable) Markov decision heory 29 seem difficul o apply o nonlinear RMABs because hey are based on dynamic programming ideas. Exensions of our new mehod in his paper o oher RMAB problems wih general projec sae space are lef for fuure research. 2 The admission conrol decision decouples ino separable one-dimensional convex programs ha are easily solved in real ime when he hroughpu uiliy gy) is a sum of one-dimensional uiliy funcions.

3 WIOPT In he res of he paper, he deailed nework model is in he nex secion. Secion III inroduces he inner performance region Λ in consruced in 8. Our dynamic conrol policy is moivaed and given in Secion IV, followed by performance analysis. II. DETAILED NETWORK MODEL Beside he nework model inroduced in he previous secion, we suppose ha every Markov ON/OFF channel n {1,..., N} changes saes a slo boundaries by he ransiion probabiliy marix Pn,00 P P n = n,01, P n,10 P n,11 where sae ON is represened by 1 and OFF by 0, and P n,ij denoes he ransiion probabiliy from sae i o j. Assume ha he marices P n are known. We assume ha every channel is posiively correlaed over ime, so ha an ON sae is more likely followed by he same sae. An equivalen mahemaical definiion is P n,01 + P n,10 < 1 for all n. We suppose ha every user has a higher-layer daa source of unlimied packes a he base saion. The base saion keeps a nework-layer queue Q n ) of infinie capaciy for every user n {1,..., N}, where Q n ) denoes he backlog for user n in slo. In every slo, he base saion admis r n ) 0, 1 packes for user n from is daa source ino queue Q n ). For simpliciy, we assume ha r n ) akes real values in 0, 1 for all n. 3 Le µ n ) {0, 1} denoe he service rae for user n in slo. The queueing process {Q n )} =0 of user n evolves as Q n + 1) = maxq n ) µ n ), 0 + r n ). 5) Iniially Q n 0) = 0 for all n. We say queue Q n ) is srongly) sable if is limiing average backlog is finie, i.e., lim sup 1 1 E Q n τ) <. The nework is sable if all queues Q 1 ),..., Q N )) are sable. Clearly a sufficien condiion for sabiliy is: lim sup 1 1 N E Q n τ) <. 6) Our goal is o design a policy ha admis he righ amoun of daa ino he nework and serves hem properly by channel scheduling, so ha he nework is sable wih hroughpu uiliy ha can be made arbirarily close o he opimal soluion o he problem 3). III. A PERFORMANCE INNER BOUND This secion presens an inner performance region Λ in consruced in previous work 8 using randomized round robin policies; see 8 for deails. For every channel n {1,..., N}, le P k) n,ij denoe he k-sep ransiion probabiliy from sae i o j, and π n,on be he saionary probabiliy of sae ON. We 3 We can accommodae he ineger-value assumpion of r n) by inroducing auxiliary queues; see 1 for an example. define he informaion sae for user n in slo, denoed by ω n ), as he condiional probabiliy ha channel n is ON in slo given all pas channel observaions. Namely, ω n ) Pr s n ) = ON channel observaion hisory, where s n ) denoes he sae of channel n in slo. Condiioning on he mos recen channel observaion, we observe ha ω n ) akes values in he counably infinie se W n {P k) n,01, Pk) n,11 : k N} {π n,on}. The informaion sae vecor ω n )) N is a sufficien saisic 29; i is opimal o ac based only on he ω n )) N informaion. Le n) denoe he channel observed in slo via ACK/NACK feedback. The probabiliy ω n ) on channel n {1,..., N} evolves as: P n,01, if n = n), s n ) = OFF ω n +1)= P n,11, if n = n), s n ) = ON ω n )P n, ω n ))P n,01, if n n). 7) A. Randomized round robin policy Le Φ denoe he se of all nonzero N-dimensional binary vecors. Every vecor φ φ n ) N Φ represens a collecion of acive channels, where we say channel n is acive if φ n = 1. Le Mφ) denoe he number of ones or acive channels) in φ. Consider he nex dynamic round robin policy RRφ) ha serves acive channels in φ, possibly wih differen order in differen rounds. This is he building block of randomized round robin policies ha we will inroduce shorly. Dynamic Round Robin Policy RRφ): 1) In every round, we assume an ordering of acive channels in φ is given. 2) When swiching o an acive channel n, Wih probabiliy P Mφ)) n,01 /ω n ), we keep ransmiing packes over channel n unil a NACK is received, afer which we swich o he nex acive channel according o he predefined ordering. Oherwise, we ransmi over channel n a dummy packe wih no informaion conen for one slo used for channel sensing), hen swich o he nex acive channel. 3) Updae probabiliies ω n )) N by 7) in every slo. These RRφ) policies are carefully designed o have good and, more imporanly, racable performance. Work 24 shows ha, when channels are saisically idenical, serving all channels by greedy round robin policies differen from he above) maximizes he sum hroughpu of he nework. Thus, inuiively, we ge a good achievable hroughpu region Λ in by randomly mixing round robin policies each of which serves a differen subse of channels. Randomized Round Robin Policy RandRR: 1) In every round, pick a binary vecor φ Φ {0} wih some probabiliy α φ, where α 0 + φ Φ α φ = 1. 2) If a vecor φ Φ is seleced, run policy RRφ) for one round using he channel ordering of leas recenly used

4 WIOPT firs. Oherwise, φ = 0, idle he sysem for one slo. A he end of eiher case, go o Sep 1. We noe ha, in every round of a RandRR policy, a RRφ) policy is feasible only if P Mφ)) n,01 ω n ) whenever an acive channel n sars service see Sep 2 of he RRφ) policy). This condiion is guaraneed by serving acive channels in he order of leas recenly used firs 8, Lemma 6. Thus all RandRR policies are feasible. 4 The following resuls presen he amoun of service opporuniies provided by a RandRR policy o every user n. Theorem 1 8). i) In every round of a RandRR policy, when policy RRφ) is randomly chosen for service, le L φ n denoe he ime duraion an acive channel n is accessed. The duraion L φ n has he probabiliy disribuion: and L φ n = { 1 wih prob. 1 P Mφ)) n,01 j 2 wih prob. P Mφ)) n,01 P n,11 ) j 2) P n,10 E L φ P Mφ)) n,01 n = ) P n,10 ii) In he duraion L φ n, channel n serves L φ n 1) packes. Theorem 1 shows ha he disribuion of L φ n is independen of he informaion sae vecor ω n )) N a he sar of a ransmission round; i only depends on he number of channels, Mφ), chosen for service in a round. This observaion implies ha he ransmission rounds in a RandRR policy have i.i.d. duraions. Moreover, for every fixed user n, he number of user-n packes served in a round is also i.i.d. over differen rounds. This leads o he following corollary. Corollary 1. i) Le T k denoe he duraion of he kh ransmission round in a RandRR policy. The random variables T k are i.i.d. over differen k wih E T k = α 0 + α φ E L φ n, φ Φ n:φ which is compued by condiioning on he policy chosen in a round. ii) Le N n,k denoe he number of packes served for user n in round T k. For each fixed n, he random variables N n,k are i.i.d. over differen k wih E N n,k = φ:φ α φ E L φ n 1, which is compued by condiioning on he RRφ) policy ha is chosen and uses channel n. iii) Because N n,k and T k are i.i.d. over k, he hroughpu of user n under a RandRR policy is equal o E N n,k /E T k. B. The inner performance region Λ in In his paper, we define he inner performance region Λ in in 3) as he se of all hroughpu vecors achieved by he class of RandRR policies. Equivalenly, he inner hroughpu region 4 The feasibiliy of RandRR policies is proved in 8 under he special case ha here are no idle operaions α 0 = 0). Using he monooniciy of he k- sep ransiion probabiliies {P k) n,01, Pk) n,11 }, he feasibiliy can be similarly proved for he exended RandRR policies considered here. Λ in can be viewed as a convex hull of he zero vecor and he performance vecors of he subse of RandRR policies, each of which execues a fixed RRφ) policy in every round. A closed form expression of he inner region Λ in is given in 8, Theorem 1. An example is given nex. Consider a wo-user sysem wih saisically idenical channels wih P 01 = P 10 = 0.2. Fig. 2 shows he ighness of he inner hroughpu region Λ in compared o he unknown) full nework capaciy region Λ. We noe ha poins B, A, and λ 2 B D Λ in A Λ Fig. 2. The closeness of he inner hroughpu region Λ in and he nework capaciy region Λ in a wo-user nework wih saisically idenical channels. C in Fig. 2 maximize he sum hroughpu of he nework in direcions 0, 1), 1, 1), and 1, 0), respecively 24. Thus he boundary of Λ is a concave curve connecing hese poins. C λ 1 IV. NETWORK UTILITY MAXIMIZATION A. The QRRNUM policy Following he above discussions, he problem 3) is a welldefined convex program. Ye, solving 3) is difficul because he performance region Λ in is represened as a convex hull of 2 N performance vecors. The following admission conrol and channel scheduling policy solves 3) in a dynamic manner wih low complexiy. Queue-dependen Round Robin for Nework Uiliy Maximizaion QRRNUM): Admission conrol) A he sar of every round, observe he curren queue backlog Q) = Q 1 ),..., Q N )) and solve he convex program maximize: V gr) N Q n ) r n 9) subjec o: r n 0, 1, n {1,..., N}, 10) where V > 0 is a predefined conrol parameer, and vecor r r n ) N. Le rn QRR ) N denoe he soluion o 9)-10). In every slo of he curren round, admi rn QRR packes ino queue Q n ) for every user n {1,..., N}. Channel scheduling) A he sar of every round, over all nonzero binary vecors φ = φ n ) N Φ, le φ QRR be he maximizer of he raio N Q n)e L φ n 1 φ n, 11) N E L φ n φ n

5 WIOPT where E L φ n is given in 8). If he maximum of 11) is posiive, run policy RRφ QRR ) for one round using he channel ordering of leas recenly used firs. Oherwise, idle he sysem for one slo. A he end of eiher case, sar a new round of admission conrol and channel scheduling. When he uiliy funcion g ) is a sum of individual uiliies, i.e., gr) = N g nr n ), problem 9)-10) decouples ino N one-dimensional convex programs, each of which maximizes he weighed difference V g n r n ) Q n )r n over r n 0, 1, which can be solved efficienly in real ime. The mos complex par of he QRRNUM policy is o maximize he raio 11). In he following we presen a bisecion algorihm 16, Secion ha searches for he maximum of 11) wih exponenially fas speed. This algorihm is moivaed by he nex lemma. Lemma 1. 16, Lemma 7.5) Le aφ) and bφ) denoe he numeraor and denominaor of 11), respecively. Define { } aφ) θ max, cθ) max aφ) θbφ). φ Φ bφ) φ Φ Then he following is rue: 1) If θ = θ, hen cθ) = 0. 2) If θ < θ, hen cθ) > 0. 3) If θ > θ, hen cθ) < 0. The value cθ) can be easily compued by noicing { } cθ) = max aφ) θbφ), 12) φ Φ k max k {1,...,N} where Φ k Φ denoes he se of binary vecors having k ones. The inner maximum of 12) is equal o { N } k) P n,01 max Q n ) θ) θ φ n, φ Φ k P n,10 P which is solved by soring he values k) n,01 P n,10 Q n ) θ) θ. Inuiion from Lemma 1: To search for he opimal raio θ, suppose iniially we know θ θ min, θ max for some θ min and θ max. We compue he midpoin θ mid = 1 2 θ min + θ max ) and evaluae cθ mid ). If cθ mid ) > 0, we have θ mid < θ and hus θ θ mid, θ max ; one such bisecion operaion reduces he feasible region of θ by half. By ieraing he bisecion, we can find θ quickly. Noice ha he maximizer of cθ ) is he desired policy φ QRR, since by definiion we have aφ) θ bφ) 0 for all φ Φ and aφ QRR ) θ bφ QRR ) = 0. The bisecion algorihm ha maximizes 11): Iniially, define θ min 0 and N ) Q n) θ max { } πn,on max n {1,...,N} P n,10 { } Pn, min n {1,...,N} P n,10 so ha θ min aφ)/bφ) θ max for all φ Φ. I follows ha θ θ min, θ max. 5 Compue θ mid = 1 2 θ min + θ max ) and cθ mid ). If cθ mid ) = 0, we have θ = θ mid and φ QRR is he maximizer of 5 The value θ max is creaed by noing ha, in a posiively correlaed channel, he k-sep ransiion probabiliies P k) k) n,01 and P n,10 increase and decrease wih k, respecively; boh sequences have he same limi π n,on. cθ ). When cθ mid ) < 0, updae he feasible region of θ as θ min, θ mid. If cθ mid ) > 0, updae he feasible region of θ as θ mid, θ max. In eiher case, repea he bisecion process. B. Lyapunov drif inequaliy The consrucion of he QRRNUM policy follows a new Lyapunov drif argumen. We sar wih consrucing a framebased Lyapunov drif inequaliy over a frame of size T, where T is possibly random bu has a finie second momen bounded by a consan C so ha C E T 2 Q) for all and all possible Q). Inuiion for consrucing such an inequaliy is shown laer. By ieraively applying 5), i is no hard o show T 1 T 1 Q n +T ) max Q n ) µ n + τ), 0 + r n +τ) 13) for every n {1,..., N}. We define he quadraic Lyapunov funcion LQ)) 1 2 N Q2 n) as a scalar measure of he queue size vecor Q). Define he T -slo Lyapunov drif T Q)) E LQ + T )) LQ)) Q) as a condiional expeced change of queue sizes across T slos, where he expecaion is wih respec o he randomness of he nework wihin he T slos, including he randomness of T. By aking square of 13) for every n, using inequaliies maxa b, 0 a a, b 0, maxa b, 0) 2 a b) 2, µ n ) 1, r n ) 1, o simplify erms, summing all resuling inequaliies, and aking condiional expecaion on Q), we can show T Q)) B N E Q n ) T 1 14) µ n + τ) r n + τ) Q) where B NC > 0 is a consan. Subracing from boh sides T 1 of 14) he weighed sum V E gr + τ)) Q), where V > 0 is a predefined conrol parameer and r + τ) an admied daa vecor, we ge he Lyapunov drif inequaliy T 1 T Q)) V E gr + τ)) Q) 15) where fq)) hq)) E B fq)) hq)), N Q n )E T 1 T 1 µ n + τ) Q) V gr + τ)) 16) N Q n )r n + τ) Q). 17) The inequaliy 15) holds for any scheduling policy over a frame of any size T.

6 WIOPT C. Inuiion behind he Lyapunov drif inequaliy The desired nework conrol policy shall sabilize all queues Q 1 ),..., Q N )) and maximize he hroughpu uiliy g ). For queue sabiliy, we wan o minimize he Lyapunov drif T Q)), because i capures he expeced growh of queue sizes over a duraion of ime. On he oher hand, o increase hroughpu uiliy, we wan o admi more daa ino he sysem for service, and maximize he expeced sum T 1 gr + τ)) Q). Minimizing Lyapunov uiliy E drif and maximizing hroughpu uiliy, however, conflic wih each oher, because queue sizes increase wih more daa admied ino he sysem. To capure his radeoff, i is naural o minimize a weighed difference of Lyapunov drif and hroughpu uiliy, which is he lef side of 15). The radeoff is conrolled by he posiive parameer V. Inuiively, a large V value pus more weighs on hroughpu uiliy, hus hroughpu uiliy is improved, a he expense of he growh of he queue size refleced in T Q)). The consrucion of he inequaliy 15) provides a useful bound on he weighed difference of Lyapunov drif and hroughpu uiliy. The QRRNUM policy ha we will consruc in he nex secion uses he above ideas wih wo modificaions. Firs, i suffices o minimize a bound on he weighed difference of Lyapunov drif and hroughpu uiliy, i.e., he righ side of 15). Second, since he weighed difference of Lyapunov drif and hroughpu uiliy in 15) is made over a frame of T slos, where he value T is random and depends on he policy implemened wihin he frame, i is naural o normalize he weighed difference by he average frame size, and we will minimize he resuling raio see 18)). This new raio rule is a generalizaion of he MaxWeigh policies for sochasic nework opimizaion over frame-based sysems. D. Consrucion of he QRRNUM policy We consider he policy ha, a he sar of every round, observes he curren queue backlog vecor Q) and maximizes over all feasible policies he expression: fq)) + hq)) E T Q) 18) over a frame of size T, where he numeraor is defined in 16) and 17). Every feasible policy here consiss of: 1) an admission policy ha admis packes ino queues Q) for all users in every slo; 2) a randomized round robin policy RandRR given in Secion III-A) for daa delivery. The frame size T in 18) is he lengh of one ransmission round under he candidae RandRR policy, and is disribuion depends on he backlog vecor Q) via he queue-dependen choice of policy RandRR. When he feasible policy ha maximizes 18) is chosen, i is execued for one round of ransmission, afer which a new policy is chosen for he nex round based on he updaed raio of 18), and so on. Nex we simplify he maximizaion of 18); he resul is he QRRNUM policy in Secion IV-A. In hq)), he opimal admied daa vecor r+τ) in every slo is independen of he frame size T and of he rae allocaions µ n + τ) in fq)). In addiion, i should be he same for all τ {0,..., T 1}, and is he soluion o 9)-10). These observaions lead o he admission conrol subrouine in he QRRNUM policy. Le Ψ Q)) denoe he opimal objecive of 9)-10). Since he opimal admied daa vecor is independen of he frame size T, we ge hq)) = E T Q) Ψ Q)), and 18) is equal o fq)) E T Q) + Ψ Q)). 19) I indicaes ha finding he opimal admission policy is independen of finding he opimal RandRR policy ha maximizes he firs erm of 19). Nex we evaluae he firs erm of 19) under a fixed RandRR policy wih parameers {α φ } φ Φ {0}. In he res of he secion, when we use a RRφ) policy for one round, he channel ordering of leas recenly used firs is always adoped. Condiioning on he choice of φ, we ge fq)) = α φ fq), RRφ)), φ Φ {0} where fq), RRφ)) denoes he erm fq)) in 16) evaluaed under he policy RRφ) for one round; for convenience we have denoed by RR0) he policy of idling he sysem for one slo. Similarly, by condiioning we can show 6 E T = E T Q) = α φ E T RRφ), φ Φ {0} where T RRφ) denoes he duraion of one ransmission round under he RRφ) policy. I follows ha fq)) E T Q) = φ Φ {0} α φ fq), RRφ)) φ Φ {0} α φ E. 20) T RRφ) The nex lemma shows ha here always exiss a RRφ) policy maximizing 20) over all RandRR policies for one round of ransmission. Therefore i suffices o focus on he class of RRφ) policies in every round of ransmission. Lemma 2. We index RRφ) policies for all φ Φ {0}. For he RRφ) policy wih index k, define f k fq), RRφ)), D k E T RRφ). Wihou loss of generaliy, assume f 1 D 1 f k D k, k {2, 3,..., 2 N }. Then for any probabiliy disribuion {α k } k {1,...,2N } wih α k 0 and 2 N k=1 α k = 1, we have 2 N f 1 k=1 α k f k D 2 N 1 k=1 α. k D k Proof of Lemma 2: Omied due o space consrain. By Lemma 2, nex we evaluae he firs erm of 19) under a given RRφ) policy. When φ = 0, we ge fq))/e T Q) = 0. Oherwise, fix some φ Φ. For each acive channel n in φ, we denoe by L φ n he amoun 6 Given a fixed RandRR policy, he frame size T no longer depends on he backlog vecor Q), and E T = E T Q).

7 WIOPT of ime he nework says wih channel n in one round of ransmission under policy RRφ). The probabiliy disribuion and he mean of L φ n are given in Theorem 1. I follows ha under he RRφ) policy we have E T = E T Q) = E L φ n, E T 1 n:φ µ n + τ) Q) = As a resul, fq)) E T Q) = { E L φ n 1 if φn = 1 0 if φ n = 0. N Q n)e L φ n 1 φ n. 21) N E L φ n φ n The above simplificaions lead o he channel scheduling subrouine of he QRRNUM policy. V. PERFORMANCE ANALYSIS Theorem 2. Le y n ) = minq n ), µ n ) be he number of packes delivered o user n in slo ; define y) y n )) N. For any conrol parameer V > 0, he QRRNUM policy sabilizes all queues Q 1 ),..., Q N )) and yields hroughpu uiliy saisfying lim inf g 1 1 E yτ) ) gy ) B V, 22) where gy ) is he opimal objecive of he consrained problem 3) and B > 0 is a finie consan. Proof of Theorem 2: In Appendix A. Theorem 2 shows ha he hroughpu uiliy under he QRRNUM policy is a mos B/V away from he opimal gy ). By choosing V sufficienly large, he hroughpu uiliy can be made arbirarily close o he opimal gy ) and he consrained problem 3) is solved. The radeoff of choosing a large V value is ha he average queue size in he nework grows linearly wih V. Such radeoff agrees wih he design principle of he QRRNUM policy discussed in Secion IV-C. VI. CONCLUSION This paper provides a heoreical framework for uiliy maximizaion over a wireless nework wih parially observable Markov ON/OFF channels. The performance and conrol in his nework are consrained by limiing channel probing and delayed/uncerain channel sae informaion, bu can be improved by exploiing channel memory. Overall, aacking such problems requires solving a leas approximaely) highdimensional resless muli-armed bandi problems wih nonlinear objecive funcions of ime average rewards, which are difficul o solve by exising ools such as While s index or Markov decision heory. This paper provides a new achievable region mehod for hese problems. The idea is o firs idenify a good inner performance region rendered by randomizing saionary policies, and hen solve he problem over he inner region, serving as an approximaion o he original problem. In his paper, wih an inner performance region consruced in 8, we provide a novel frame-based Lyapunov drif argumen ha solves he approximaion problem wih provably near-opimal performance. We generalize he classical MaxWeigh policies from ime-sloed wireless neworks o frame-based ones ha have policy-dependen random frame sizes. Exensions of his new achievable region mehod o oher open sochasic opimizaion problems are ineresing fuure research. REFERENCES 1 M. J. Neely, E. Modiano, and C.-P. Li, Fairness and opimal sochasic conrol for heerogeneous neworks, IEEE/ACM Trans. New., vol. 16, no. 2, pp , Apr F. P. Kelly, Charging and rae conrol for elasic raffic, European Trans. Telecommunicaions, vol. 8, pp , Online. Available: hp:// frank/elasic.hml 3 Q. Zhao and B. M. Sadler, A survey of dynamic specrum access, IEEE Signal Process. Mag., vol. 24, no. 3, pp , May Q. Zhao and A. Swami, A decision-heoreic framework for opporunisic specrum access, IEEE Wireless Commun. Mag., vol. 14, no. 4, pp , Aug J. L. Ny, M. Dahleh, and E. Feron, Muli-uav dynamic rouing wih parial observaions using resless bandi allocaion indices, in American Conrol Conference, Seale, WA, USA, Jun P. While, Resless bandis: Aciviy allocaion in a changing world, J. Appl. Probab., vol. 25, pp , C. H. Papadimiriou and J. N. Tsisiklis, The complexiy of opimal queueing nework conrol, Mah. of Oper. Res., vol. 24, pp , May C.-P. Li and M. J. Neely, Exploiing channel memory for muliuser wireless scheduling wihou channel measuremen: Capaciy regions and algorihms, Performance Evaluaion, 2011, acceped for publicaion. 9 L. Tassiulas and A. Ephremides, Sabiliy properies of consrained queueing sysems and scheduling policies for maximum hroughpu in mulihop radio neworks, IEEE Trans. Auom. Conrol, vol. 37, no. 12, pp , Dec , Dynamic server allocaion o parallel queues wih randomly varying conneciviy, IEEE Trans. Inf. Theory, vol. 39, no. 2, pp , Mar M. J. Neely, Energy opimal conrol for ime varying wireless neworks, IEEE Trans. Inf. Theory, vol. 52, no. 7, pp , Jul , Dynamic power allocaion and rouing for saellie and wireless neworks wih ime varying channels, Ph.D. disseraion, Massachuses Insiue of Technology, November A. Eryilmaz and R. Srikan, Fair resource allocaion in wireless neworks using queue-lengh-based scheduling and congesion conrol, IEEE/ACM Trans. New., vol. 15, no. 6, pp , Dec M. J. Neely, Sochasic opimizaion for markov modulaed neworks wih applicaion o delay consrained wireless scheduling, in IEEE Conf. Decision and Conrol CDC), , Dynamic opimizaion and learning for renewal sysems, in Asilomar Conf. Signals, Sysems, and Compuers, Nov. 2010, invied paper. 16, Sochasic Nework Opimizaion wih Applicaion o Communicaion and Queueing Sysems. Morgan & Claypool, C.-P. Li and M. J. Neely, Energy-opimal scheduling wih dynamic channel acquisiion in wireless downlinks, IEEE Trans. Mobile Compu., vol. 9, no. 4, pp , Apr P. Chaporkar, A. Prouiere, H. Asnani, and A. Karandikar, Scheduling wih limied informaion in wireless sysems, in ACM In. Symp. Mobile Ad Hoc Neworking and Compuing MobiHoc), New Orleans, LA, May N. B. Chang and M. Liu, Opimal channel probing and ransmission scheduling for opporunisic specrum access, in ACM In. Conf. Mobile Compuing and Neworking MobiCom), New York, NY, 2007, pp P. Chaporkar, A. Prouiere, and H. Asnani, Learning o opimally exploi muli-channel diversiy in wireless sysems, in IEEE Proc. INFOCOM, P. Chaporkar and A. Prouiere, Opimal join probing and ransmission sraegy for maximizing hroughpu in wireless sysems, IEEE J. Sel. Areas Commun., vol. 26, no. 8, pp , Oc S. Guha, K. Munagala, and S. Sarkar, Joinly opimal ransmission and probing sraegies for mulichannel wireless sysems, in Conf. Informaion Sciences and Sysems, Mar

8 WIOPT Q. Zhao, B. Krishnamachari, and K. Liu, On myopic sensing for mulichannel opporunisic access: Srucure, opimaliy, and preformance, IEEE Trans. Wireless Commun., vol. 7, no. 12, pp , Dec S. H. A. Ahmad, M. Liu, T. Javidi, Q. Zhao, and B. Krishnamachari, Opimaliy of myopic sensing in mulichannel opporunisic access, IEEE Trans. Inf. Theory, vol. 55, no. 9, pp , Sep S. H. A. Ahmad and M. Liu, Muli-channel opporunisic access: A case of resless bandis wih muliple plays, in Alleron Conf. Communicaion, Conrol, and Compuing, 2009, pp K. Liu and Q. Zhao, Indexabiliy of resless bandi problems and opimaliy of while s index for dynamic mulichannel access, IEEE Trans. Inf. Theory, vol. 56, no. 11, pp , Nov J. Nino-Mora, An index policy for dynamic fading-channel allocaion o heerogeneous mobile users wih parial observaions, in Nex Generaion Inerne Neworks, 2008, pp S. Guha, K. Munagala, and P. Shi, Approximaion algorihms for resless bandi problems, Tech. Rep., Feb D. P. Bersekas, Dynamic Programming and Opimal Conrol, 3rd ed. Ahena Scienific, 2005, vol. I. APPENDIX A Proof of Theorem 2: We need o show ha all queues Q n )) N are sable and ha 22) is achieved. Due o space consrain, we only prove 22) here. Under policy QRRNUM, le k 1 and T k be he beginning and he duraion of he kh ransmission round, respecively. We have T k = k k 1 and k = k i=1 T i for all k N. Every T k is he lengh of a ransmission round of some RRφ) policy. Assume 0 = 0. To show 22), we compare he QRRNUM policy wih he unknown) soluion o problem 3). By definiion of he hroughpu region Λ in in Secion III-B, here exiss a randomized round robin policy RandRR ha solves 3) and yields he opimal hroughpu vecor y = y n) N. Le T denoe he lengh of one ransmission round under policy RandRR. From Corollary 1, we have for every user n {1,..., N}: T 1 T 1 E µ n + τ) Q) = E µ n + τ) = y ne T. Combining RandRR wih he admission policy σ ha ses r n +τ) = y n for all users n and τ {0,..., T 1} yields 7 f Q)) = E T h Q)) = E T N Q n ) y n 23) V gy ) N Q n ) y n 24) where 23) and 24) are fq)) and hq)) see 16), 17)) evaluaed under policies RandRR and σ, respecively. Since he QRRNUM policy maximizes 18), comparing 18) under QRRNUM and he join policy RandRR, σ ) yields f QRRNUM Q k )) + h QRRNUM Q k )) E T k+1 Q k ) f Q k )) + h Q k )) E T a) = E T k+1 Q k ) V gy ), 25) where a) is from 23) and 24). The inequaliy 15) under he QRRNUM policy in he k +1)h round of ransmission yields T k+1 1 Tk+1 Q k )) V E gr k + τ)) Q k ) B f QRRNUM Q k )) h QRRNUM Q k )) b) B E T k+1 Q k ) V gy ), 26) where b) uses 25). Taking expecaion over Q k ) in 26) and summing i over k {0,..., K 1}, we ge K 1 E LQ K )) E LQ 0 )) V E grτ)) 27) BK V gy ) E K B V gy ) E K where he las inequaliy uses K = K k=1 T k K. Ignoring he firs erm, noing Q 0 ) = 0, and dividing by V yields E K 1 grτ)) gy ) B V ) E K. 28) Le K) denoe he number of ransmission rounds ending by ime ; we have K) < K)+1. I follows ha 1 E grτ)) c) K) 1 E grτ)) d) gy ) B ) E K) V = gy ) B gy ) B ) E K), V V 29) where c) uses he nonnegaiviy of g ) and K), and d) follows 28). Dividing 29) by, aking a lim inf as, and noing E K) E TK)+1 < yields lim inf 1 1 E grτ)) gy ) B V. 30) Using Jensen s inequaliy and he concaviy of g ), we ge lim inf g r )) gy ) B V, 31) 1 E r nτ). Since where r ) r ) n ) N and r ) n 1 all queues Q n )) N are sable, we can show ha he ime average hroughpu vecor y ) 1 E y nτ), saisfies 1 = y ) n ) N, where y ) n lim inf g y )) lim inf g r )). 32) Combining 31) and 32) finishes he proof. 7 The hroughpu y n is less han or equal o one, hus is a feasible choice of r n + τ).

5. Stochastic processes (1)

5. Stochastic processes (1) Lec05.pp S-38.45 - Inroducion o Teleraffic Theory Spring 2005 Conens Basic conceps Poisson process 2 Sochasic processes () Consider some quaniy in a eleraffic (or any) sysem I ypically evolves in ime randomly

More information

Resource Allocation in Visible Light Communication Networks NOMA vs. OFDMA Transmission Techniques

Resource Allocation in Visible Light Communication Networks NOMA vs. OFDMA Transmission Techniques Resource Allocaion in Visible Ligh Communicaion Neworks NOMA vs. OFDMA Transmission Techniques Eirini Eleni Tsiropoulou, Iakovos Gialagkolidis, Panagiois Vamvakas, and Symeon Papavassiliou Insiue of Communicaions

More information

T L. t=1. Proof of Lemma 1. Using the marginal cost accounting in Equation(4) and standard arguments. t )+Π RB. t )+K 1(Q RB

T L. t=1. Proof of Lemma 1. Using the marginal cost accounting in Equation(4) and standard arguments. t )+Π RB. t )+K 1(Q RB Elecronic Companion EC.1. Proofs of Technical Lemmas and Theorems LEMMA 1. Le C(RB) be he oal cos incurred by he RB policy. Then we have, T L E[C(RB)] 3 E[Z RB ]. (EC.1) Proof of Lemma 1. Using he marginal

More information

A Primal-Dual Type Algorithm with the O(1/t) Convergence Rate for Large Scale Constrained Convex Programs

A Primal-Dual Type Algorithm with the O(1/t) Convergence Rate for Large Scale Constrained Convex Programs PROC. IEEE CONFERENCE ON DECISION AND CONTROL, 06 A Primal-Dual Type Algorihm wih he O(/) Convergence Rae for Large Scale Consrained Convex Programs Hao Yu and Michael J. Neely Absrac This paper considers

More information

Opportunistic Scheduling with Reliability Guarantees in Cognitive Radio Networks

Opportunistic Scheduling with Reliability Guarantees in Cognitive Radio Networks PROC. IEEE INFOCO, PHOENIX, AZ, APRIL 008 Opporunisic Scheduling wih Reliabiliy Guaranees in Cogniive Radio Neworks Rahul Urgaonkar, ichael J. Neely Universiy of Souhern California, Los Angeles, CA 90089

More information

On the Stability Region of Multi-Queue Multi-Server Queueing Systems with Stationary Channel Distribution

On the Stability Region of Multi-Queue Multi-Server Queueing Systems with Stationary Channel Distribution On he Sabiliy Region of Muli-Queue Muli-Server Queueing Sysems wih Saionary Channel Disribuion Hassan Halabian, Ioannis Lambadaris, Chung-Horng Lung Deparmen of Sysems and Compuer Engineering, Carleon

More information

Vehicle Arrival Models : Headway

Vehicle Arrival Models : Headway Chaper 12 Vehicle Arrival Models : Headway 12.1 Inroducion Modelling arrival of vehicle a secion of road is an imporan sep in raffic flow modelling. I has imporan applicaion in raffic flow simulaion where

More information

Solutions for Assignment 2

Solutions for Assignment 2 Faculy of rs and Science Universiy of Torono CSC 358 - Inroducion o Compuer Neworks, Winer 218 Soluions for ssignmen 2 Quesion 1 (2 Poins): Go-ack n RQ In his quesion, we review how Go-ack n RQ can be

More information

Stable Scheduling Policies for Maximizing Throughput in Generalized Constrained Queueing Systems

Stable Scheduling Policies for Maximizing Throughput in Generalized Constrained Queueing Systems 1 Sable Scheduling Policies for Maximizing Throughpu in Generalized Consrained Queueing Sysems Prasanna Chaporar, Suden Member, IEEE, Saswai Sarar, Member, IEEE Absrac We consider a class of queueing newors

More information

Opportunism, Backpressure, and Stochastic Optimization with the Wireless Broadcast Advantage

Opportunism, Backpressure, and Stochastic Optimization with the Wireless Broadcast Advantage ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, PACIFIC GROVE, CA, OCTOBER 2008 1 Opporunism, Backpressure, and Sochasic Opimizaion wih he Wireless Broadcas Advanage Michael J. Neely, Rahul Urgaonkar

More information

Optimal Server Assignment in Multi-Server

Optimal Server Assignment in Multi-Server Opimal Server Assignmen in Muli-Server 1 Queueing Sysems wih Random Conneciviies Hassan Halabian, Suden Member, IEEE, Ioannis Lambadaris, Member, IEEE, arxiv:1112.1178v2 [mah.oc] 21 Jun 2013 Yannis Viniois,

More information

Decentralized Stochastic Control with Partial History Sharing: A Common Information Approach

Decentralized Stochastic Control with Partial History Sharing: A Common Information Approach 1 Decenralized Sochasic Conrol wih Parial Hisory Sharing: A Common Informaion Approach Ashuosh Nayyar, Adiya Mahajan and Demoshenis Tenekezis arxiv:1209.1695v1 [cs.sy] 8 Sep 2012 Absrac A general model

More information

An introduction to the theory of SDDP algorithm

An introduction to the theory of SDDP algorithm An inroducion o he heory of SDDP algorihm V. Leclère (ENPC) Augus 1, 2014 V. Leclère Inroducion o SDDP Augus 1, 2014 1 / 21 Inroducion Large scale sochasic problem are hard o solve. Two ways of aacking

More information

Inventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions

Inventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions Muli-Period Sochasic Models: Opimali of (s, S) Polic for -Convex Objecive Funcions Consider a seing similar o he N-sage newsvendor problem excep ha now here is a fixed re-ordering cos (> 0) for each (re-)order.

More information

On Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems

On Boundedness of Q-Learning Iterates for Stochastic Shortest Path Problems MATHEMATICS OF OPERATIONS RESEARCH Vol. 38, No. 2, May 2013, pp. 209 227 ISSN 0364-765X (prin) ISSN 1526-5471 (online) hp://dx.doi.org/10.1287/moor.1120.0562 2013 INFORMS On Boundedness of Q-Learning Ieraes

More information

Stability and Bifurcation in a Neural Network Model with Two Delays

Stability and Bifurcation in a Neural Network Model with Two Delays Inernaional Mahemaical Forum, Vol. 6, 11, no. 35, 175-1731 Sabiliy and Bifurcaion in a Neural Nework Model wih Two Delays GuangPing Hu and XiaoLing Li School of Mahemaics and Physics, Nanjing Universiy

More information

Lecture 4 Notes (Little s Theorem)

Lecture 4 Notes (Little s Theorem) Lecure 4 Noes (Lile s Theorem) This lecure concerns one of he mos imporan (and simples) heorems in Queuing Theory, Lile s Theorem. More informaion can be found in he course book, Bersekas & Gallagher,

More information

Application of a Stochastic-Fuzzy Approach to Modeling Optimal Discrete Time Dynamical Systems by Using Large Scale Data Processing

Application of a Stochastic-Fuzzy Approach to Modeling Optimal Discrete Time Dynamical Systems by Using Large Scale Data Processing Applicaion of a Sochasic-Fuzzy Approach o Modeling Opimal Discree Time Dynamical Sysems by Using Large Scale Daa Processing AA WALASZE-BABISZEWSA Deparmen of Compuer Engineering Opole Universiy of Technology

More information

1 Review of Zero-Sum Games

1 Review of Zero-Sum Games COS 5: heoreical Machine Learning Lecurer: Rob Schapire Lecure #23 Scribe: Eugene Brevdo April 30, 2008 Review of Zero-Sum Games Las ime we inroduced a mahemaical model for wo player zero-sum games. Any

More information

Random Walk with Anti-Correlated Steps

Random Walk with Anti-Correlated Steps Random Walk wih Ani-Correlaed Seps John Noga Dirk Wagner 2 Absrac We conjecure he expeced value of random walks wih ani-correlaed seps o be exacly. We suppor his conjecure wih 2 plausibiliy argumens and

More information

Chapter 2. First Order Scalar Equations

Chapter 2. First Order Scalar Equations Chaper. Firs Order Scalar Equaions We sar our sudy of differenial equaions in he same way he pioneers in his field did. We show paricular echniques o solve paricular ypes of firs order differenial equaions.

More information

International Journal of Scientific & Engineering Research, Volume 4, Issue 10, October ISSN

International Journal of Scientific & Engineering Research, Volume 4, Issue 10, October ISSN Inernaional Journal of Scienific & Engineering Research, Volume 4, Issue 10, Ocober-2013 900 FUZZY MEAN RESIDUAL LIFE ORDERING OF FUZZY RANDOM VARIABLES J. EARNEST LAZARUS PIRIYAKUMAR 1, A. YAMUNA 2 1.

More information

Energy Storage Benchmark Problems

Energy Storage Benchmark Problems Energy Sorage Benchmark Problems Daniel F. Salas 1,3, Warren B. Powell 2,3 1 Deparmen of Chemical & Biological Engineering 2 Deparmen of Operaions Research & Financial Engineering 3 Princeon Laboraory

More information

Lecture 20: Riccati Equations and Least Squares Feedback Control

Lecture 20: Riccati Equations and Least Squares Feedback Control 34-5 LINEAR SYSTEMS Lecure : Riccai Equaions and Leas Squares Feedback Conrol 5.6.4 Sae Feedback via Riccai Equaions A recursive approach in generaing he marix-valued funcion W ( ) equaion for i for he

More information

Intelligent Packet Dropping for Optimal Energy-Delay Tradeoffs in Wireless Downlinks

Intelligent Packet Dropping for Optimal Energy-Delay Tradeoffs in Wireless Downlinks PROC. OF 4TH INT. SYMPOSIUM ON MODELING AND OPTIMIZATION IN MOBILE, AD HOC, AND WIRELESS NETWORKS (WIOPT), APRIL 2006 1 Inelligen Packe Dropping for Opimal Energy-Delay Tradeoffs in Wireless Downlinks

More information

STATE-SPACE MODELLING. A mass balance across the tank gives:

STATE-SPACE MODELLING. A mass balance across the tank gives: B. Lennox and N.F. Thornhill, 9, Sae Space Modelling, IChemE Process Managemen and Conrol Subjec Group Newsleer STE-SPACE MODELLING Inroducion: Over he pas decade or so here has been an ever increasing

More information

CONTRIBUTION TO IMPULSIVE EQUATIONS

CONTRIBUTION TO IMPULSIVE EQUATIONS European Scienific Journal Sepember 214 /SPECIAL/ ediion Vol.3 ISSN: 1857 7881 (Prin) e - ISSN 1857-7431 CONTRIBUTION TO IMPULSIVE EQUATIONS Berrabah Faima Zohra, MA Universiy of sidi bel abbes/ Algeria

More information

Modal identification of structures from roving input data by means of maximum likelihood estimation of the state space model

Modal identification of structures from roving input data by means of maximum likelihood estimation of the state space model Modal idenificaion of srucures from roving inpu daa by means of maximum likelihood esimaion of he sae space model J. Cara, J. Juan, E. Alarcón Absrac The usual way o perform a forced vibraion es is o fix

More information

Object tracking: Using HMMs to estimate the geographical location of fish

Object tracking: Using HMMs to estimate the geographical location of fish Objec racking: Using HMMs o esimae he geographical locaion of fish 02433 - Hidden Markov Models Marin Wæver Pedersen, Henrik Madsen Course week 13 MWP, compiled June 8, 2011 Objecive: Locae fish from agging

More information

Simulation-Solving Dynamic Models ABE 5646 Week 2, Spring 2010

Simulation-Solving Dynamic Models ABE 5646 Week 2, Spring 2010 Simulaion-Solving Dynamic Models ABE 5646 Week 2, Spring 2010 Week Descripion Reading Maerial 2 Compuer Simulaion of Dynamic Models Finie Difference, coninuous saes, discree ime Simple Mehods Euler Trapezoid

More information

14 Autoregressive Moving Average Models

14 Autoregressive Moving Average Models 14 Auoregressive Moving Average Models In his chaper an imporan parameric family of saionary ime series is inroduced, he family of he auoregressive moving average, or ARMA, processes. For a large class

More information

Christos Papadimitriou & Luca Trevisan November 22, 2016

Christos Papadimitriou & Luca Trevisan November 22, 2016 U.C. Bereley CS170: Algorihms Handou LN-11-22 Chrisos Papadimiriou & Luca Trevisan November 22, 2016 Sreaming algorihms In his lecure and he nex one we sudy memory-efficien algorihms ha process a sream

More information

Some Ramsey results for the n-cube

Some Ramsey results for the n-cube Some Ramsey resuls for he n-cube Ron Graham Universiy of California, San Diego Jozsef Solymosi Universiy of Briish Columbia, Vancouver, Canada Absrac In his noe we esablish a Ramsey-ype resul for cerain

More information

The Asymptotic Behavior of Nonoscillatory Solutions of Some Nonlinear Dynamic Equations on Time Scales

The Asymptotic Behavior of Nonoscillatory Solutions of Some Nonlinear Dynamic Equations on Time Scales Advances in Dynamical Sysems and Applicaions. ISSN 0973-5321 Volume 1 Number 1 (2006, pp. 103 112 c Research India Publicaions hp://www.ripublicaion.com/adsa.hm The Asympoic Behavior of Nonoscillaory Soluions

More information

SZG Macro 2011 Lecture 3: Dynamic Programming. SZG macro 2011 lecture 3 1

SZG Macro 2011 Lecture 3: Dynamic Programming. SZG macro 2011 lecture 3 1 SZG Macro 2011 Lecure 3: Dynamic Programming SZG macro 2011 lecure 3 1 Background Our previous discussion of opimal consumpion over ime and of opimal capial accumulaion sugges sudying he general decision

More information

Supplement for Stochastic Convex Optimization: Faster Local Growth Implies Faster Global Convergence

Supplement for Stochastic Convex Optimization: Faster Local Growth Implies Faster Global Convergence Supplemen for Sochasic Convex Opimizaion: Faser Local Growh Implies Faser Global Convergence Yi Xu Qihang Lin ianbao Yang Proof of heorem heorem Suppose Assumpion holds and F (w) obeys he LGC (6) Given

More information

Intelligent Packet Dropping for Optimal Energy-Delay Tradeoffs in Wireless Downlinks

Intelligent Packet Dropping for Optimal Energy-Delay Tradeoffs in Wireless Downlinks IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. 54, NO. 3, PP. 565-579, MARCH 2009. 1 Inelligen Packe Dropping for Opimal Energy-Delay Tradeoffs in Wireless Downlinks Michael J. Neely Universiy of Souhern

More information

Lecture 2 October ε-approximation of 2-player zero-sum games

Lecture 2 October ε-approximation of 2-player zero-sum games Opimizaion II Winer 009/10 Lecurer: Khaled Elbassioni Lecure Ocober 19 1 ε-approximaion of -player zero-sum games In his lecure we give a randomized ficiious play algorihm for obaining an approximae soluion

More information

Matrix Versions of Some Refinements of the Arithmetic-Geometric Mean Inequality

Matrix Versions of Some Refinements of the Arithmetic-Geometric Mean Inequality Marix Versions of Some Refinemens of he Arihmeic-Geomeric Mean Inequaliy Bao Qi Feng and Andrew Tonge Absrac. We esablish marix versions of refinemens due o Alzer ], Carwrigh and Field 4], and Mercer 5]

More information

MATH 5720: Gradient Methods Hung Phan, UMass Lowell October 4, 2018

MATH 5720: Gradient Methods Hung Phan, UMass Lowell October 4, 2018 MATH 5720: Gradien Mehods Hung Phan, UMass Lowell Ocober 4, 208 Descen Direcion Mehods Consider he problem min { f(x) x R n}. The general descen direcions mehod is x k+ = x k + k d k where x k is he curren

More information

PENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD

PENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD PENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD HAN XIAO 1. Penalized Leas Squares Lasso solves he following opimizaion problem, ˆβ lasso = arg max β R p+1 1 N y i β 0 N x ij β j β j (1.1) for some 0.

More information

Finish reading Chapter 2 of Spivak, rereading earlier sections as necessary. handout and fill in some missing details!

Finish reading Chapter 2 of Spivak, rereading earlier sections as necessary. handout and fill in some missing details! MAT 257, Handou 6: Ocober 7-2, 20. I. Assignmen. Finish reading Chaper 2 of Spiva, rereading earlier secions as necessary. handou and fill in some missing deails! II. Higher derivaives. Also, read his

More information

Tom Heskes and Onno Zoeter. Presented by Mark Buller

Tom Heskes and Onno Zoeter. Presented by Mark Buller Tom Heskes and Onno Zoeer Presened by Mark Buller Dynamic Bayesian Neworks Direced graphical models of sochasic processes Represen hidden and observed variables wih differen dependencies Generalize Hidden

More information

Delay Reduction via Lagrange Multipliers in Stochastic Network Optimization

Delay Reduction via Lagrange Multipliers in Stochastic Network Optimization IEEE TRANS. ON AUTOMATIC CONTROL, OL. 56, ISSUE 4, PP. 842-857, APRIL 2011 1 Delay Reducion via Lagrange Mulipliers in Sochasic Nework Opimizaion Longbo Huang, Michael J. Neely Absrac In his paper, we

More information

Final Spring 2007

Final Spring 2007 .615 Final Spring 7 Overview The purpose of he final exam is o calculae he MHD β limi in a high-bea oroidal okamak agains he dangerous n = 1 exernal ballooning-kink mode. Effecively, his corresponds o

More information

Optimality Conditions for Unconstrained Problems

Optimality Conditions for Unconstrained Problems 62 CHAPTER 6 Opimaliy Condiions for Unconsrained Problems 1 Unconsrained Opimizaion 11 Exisence Consider he problem of minimizing he funcion f : R n R where f is coninuous on all of R n : P min f(x) x

More information

Diebold, Chapter 7. Francis X. Diebold, Elements of Forecasting, 4th Edition (Mason, Ohio: Cengage Learning, 2006). Chapter 7. Characterizing Cycles

Diebold, Chapter 7. Francis X. Diebold, Elements of Forecasting, 4th Edition (Mason, Ohio: Cengage Learning, 2006). Chapter 7. Characterizing Cycles Diebold, Chaper 7 Francis X. Diebold, Elemens of Forecasing, 4h Ediion (Mason, Ohio: Cengage Learning, 006). Chaper 7. Characerizing Cycles Afer compleing his reading you should be able o: Define covariance

More information

3.1.3 INTRODUCTION TO DYNAMIC OPTIMIZATION: DISCRETE TIME PROBLEMS. A. The Hamiltonian and First-Order Conditions in a Finite Time Horizon

3.1.3 INTRODUCTION TO DYNAMIC OPTIMIZATION: DISCRETE TIME PROBLEMS. A. The Hamiltonian and First-Order Conditions in a Finite Time Horizon 3..3 INRODUCION O DYNAMIC OPIMIZAION: DISCREE IME PROBLEMS A. he Hamilonian and Firs-Order Condiions in a Finie ime Horizon Define a new funcion, he Hamilonian funcion, H. H he change in he oal value of

More information

( ) ( ) if t = t. It must satisfy the identity. So, bulkiness of the unit impulse (hyper)function is equal to 1. The defining characteristic is

( ) ( ) if t = t. It must satisfy the identity. So, bulkiness of the unit impulse (hyper)function is equal to 1. The defining characteristic is UNIT IMPULSE RESPONSE, UNIT STEP RESPONSE, STABILITY. Uni impulse funcion (Dirac dela funcion, dela funcion) rigorously defined is no sricly a funcion, bu disribuion (or measure), precise reamen requires

More information

Physics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle

Physics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle Chaper 2 Newonian Mechanics Single Paricle In his Chaper we will review wha Newon s laws of mechanics ell us abou he moion of a single paricle. Newon s laws are only valid in suiable reference frames,

More information

Games Against Nature

Games Against Nature Advanced Course in Machine Learning Spring 2010 Games Agains Naure Handous are joinly prepared by Shie Mannor and Shai Shalev-Shwarz In he previous lecures we alked abou expers in differen seups and analyzed

More information

Expert Advice for Amateurs

Expert Advice for Amateurs Exper Advice for Amaeurs Ernes K. Lai Online Appendix - Exisence of Equilibria The analysis in his secion is performed under more general payoff funcions. Wihou aking an explici form, he payoffs of he

More information

Notes for Lecture 17-18

Notes for Lecture 17-18 U.C. Berkeley CS278: Compuaional Complexiy Handou N7-8 Professor Luca Trevisan April 3-8, 2008 Noes for Lecure 7-8 In hese wo lecures we prove he firs half of he PCP Theorem, he Amplificaion Lemma, up

More information

Mean-square Stability Control for Networked Systems with Stochastic Time Delay

Mean-square Stability Control for Networked Systems with Stochastic Time Delay JOURNAL OF SIMULAION VOL. 5 NO. May 7 Mean-square Sabiliy Conrol for Newored Sysems wih Sochasic ime Delay YAO Hejun YUAN Fushun School of Mahemaics and Saisics Anyang Normal Universiy Anyang Henan. 455

More information

Comments on Window-Constrained Scheduling

Comments on Window-Constrained Scheduling Commens on Window-Consrained Scheduling Richard Wes Member, IEEE and Yuing Zhang Absrac This shor repor clarifies he behavior of DWCS wih respec o Theorem 3 in our previously published paper [1], and describes

More information

Network Utility Maximization over Partially Observable Markovian Channels

Network Utility Maximization over Partially Observable Markovian Channels ELSEVIER PERFORMANCE EVALUATION, 2012 1 Network Utility Maximization over Partially Network Observable Utility Maximization Markovian Channels over Partially Observable Markovian Channels Chih-ping Li

More information

Stationary Distribution. Design and Analysis of Algorithms Andrei Bulatov

Stationary Distribution. Design and Analysis of Algorithms Andrei Bulatov Saionary Disribuion Design and Analysis of Algorihms Andrei Bulaov Algorihms Markov Chains 34-2 Classificaion of Saes k By P we denoe he (i,j)-enry of i, j Sae is accessible from sae if 0 for some k 0

More information

Orientation. Connections between network coding and stochastic network theory. Outline. Bruce Hajek. Multicast with lost packets

Orientation. Connections between network coding and stochastic network theory. Outline. Bruce Hajek. Multicast with lost packets Connecions beween nework coding and sochasic nework heory Bruce Hajek Orienaion On Thursday, Ralf Koeer discussed nework coding: coding wihin he nework Absrac: Randomly generaed coded informaion blocks

More information

Optima and Equilibria for Traffic Flow on a Network

Optima and Equilibria for Traffic Flow on a Network Opima and Equilibria for Traffic Flow on a Nework Albero Bressan Deparmen of Mahemaics, Penn Sae Universiy bressan@mah.psu.edu Albero Bressan (Penn Sae) Opima and equilibria for raffic flow 1 / 1 A Traffic

More information

Essential Microeconomics : OPTIMAL CONTROL 1. Consider the following class of optimization problems

Essential Microeconomics : OPTIMAL CONTROL 1. Consider the following class of optimization problems Essenial Microeconomics -- 6.5: OPIMAL CONROL Consider he following class of opimizaion problems Max{ U( k, x) + U+ ( k+ ) k+ k F( k, x)}. { x, k+ } = In he language of conrol heory, he vecor k is he vecor

More information

Navneet Saini, Mayank Goyal, Vishal Bansal (2013); Term Project AML310; Indian Institute of Technology Delhi

Navneet Saini, Mayank Goyal, Vishal Bansal (2013); Term Project AML310; Indian Institute of Technology Delhi Creep in Viscoelasic Subsances Numerical mehods o calculae he coefficiens of he Prony equaion using creep es daa and Herediary Inegrals Mehod Navnee Saini, Mayank Goyal, Vishal Bansal (23); Term Projec

More information

Particle Swarm Optimization

Particle Swarm Optimization Paricle Swarm Opimizaion Speaker: Jeng-Shyang Pan Deparmen of Elecronic Engineering, Kaohsiung Universiy of Applied Science, Taiwan Email: jspan@cc.kuas.edu.w 7/26/2004 ppso 1 Wha is he Paricle Swarm Opimizaion

More information

Hamilton- J acobi Equation: Weak S olution We continue the study of the Hamilton-Jacobi equation:

Hamilton- J acobi Equation: Weak S olution We continue the study of the Hamilton-Jacobi equation: M ah 5 7 Fall 9 L ecure O c. 4, 9 ) Hamilon- J acobi Equaion: Weak S oluion We coninue he sudy of he Hamilon-Jacobi equaion: We have shown ha u + H D u) = R n, ) ; u = g R n { = }. ). In general we canno

More information

Online Appendix to Solution Methods for Models with Rare Disasters

Online Appendix to Solution Methods for Models with Rare Disasters Online Appendix o Soluion Mehods for Models wih Rare Disasers Jesús Fernández-Villaverde and Oren Levinal In his Online Appendix, we presen he Euler condiions of he model, we develop he pricing Calvo block,

More information

BU Macro BU Macro Fall 2008, Lecture 4

BU Macro BU Macro Fall 2008, Lecture 4 Dynamic Programming BU Macro 2008 Lecure 4 1 Ouline 1. Cerainy opimizaion problem used o illusrae: a. Resricions on exogenous variables b. Value funcion c. Policy funcion d. The Bellman equaion and an

More information

Lecture Notes 2. The Hilbert Space Approach to Time Series

Lecture Notes 2. The Hilbert Space Approach to Time Series Time Series Seven N. Durlauf Universiy of Wisconsin. Basic ideas Lecure Noes. The Hilber Space Approach o Time Series The Hilber space framework provides a very powerful language for discussing he relaionship

More information

Utility maximization in incomplete markets

Utility maximization in incomplete markets Uiliy maximizaion in incomplee markes Marcel Ladkau 27.1.29 Conens 1 Inroducion and general seings 2 1.1 Marke model....................................... 2 1.2 Trading sraegy.....................................

More information

Inventory Control of Perishable Items in a Two-Echelon Supply Chain

Inventory Control of Perishable Items in a Two-Echelon Supply Chain Journal of Indusrial Engineering, Universiy of ehran, Special Issue,, PP. 69-77 69 Invenory Conrol of Perishable Iems in a wo-echelon Supply Chain Fariborz Jolai *, Elmira Gheisariha and Farnaz Nojavan

More information

Solutions Problem Set 3 Macro II (14.452)

Solutions Problem Set 3 Macro II (14.452) Soluions Problem Se 3 Macro II (14.452) Francisco A. Gallego 04/27/2005 1 Q heory of invesmen in coninuous ime and no uncerainy Consider he in nie horizon model of a rm facing adjusmen coss o invesmen.

More information

Notes on Kalman Filtering

Notes on Kalman Filtering Noes on Kalman Filering Brian Borchers and Rick Aser November 7, Inroducion Daa Assimilaion is he problem of merging model predicions wih acual measuremens of a sysem o produce an opimal esimae of he curren

More information

Learning to Take Concurrent Actions

Learning to Take Concurrent Actions Learning o Take Concurren Acions Khashayar Rohanimanesh Deparmen of Compuer Science Universiy of Massachuses Amhers, MA 0003 khash@cs.umass.edu Sridhar Mahadevan Deparmen of Compuer Science Universiy of

More information

Sliding Mode Extremum Seeking Control for Linear Quadratic Dynamic Game

Sliding Mode Extremum Seeking Control for Linear Quadratic Dynamic Game Sliding Mode Exremum Seeking Conrol for Linear Quadraic Dynamic Game Yaodong Pan and Ümi Özgüner ITS Research Group, AIST Tsukuba Eas Namiki --, Tsukuba-shi,Ibaraki-ken 5-856, Japan e-mail: pan.yaodong@ais.go.jp

More information

Course Notes for EE227C (Spring 2018): Convex Optimization and Approximation

Course Notes for EE227C (Spring 2018): Convex Optimization and Approximation Course Noes for EE7C Spring 018: Convex Opimizaion and Approximaion Insrucor: Moriz Hard Email: hard+ee7c@berkeley.edu Graduae Insrucor: Max Simchowiz Email: msimchow+ee7c@berkeley.edu Ocober 15, 018 3

More information

t is a basis for the solution space to this system, then the matrix having these solutions as columns, t x 1 t, x 2 t,... x n t x 2 t...

t is a basis for the solution space to this system, then the matrix having these solutions as columns, t x 1 t, x 2 t,... x n t x 2 t... Mah 228- Fri Mar 24 5.6 Marix exponenials and linear sysems: The analogy beween firs order sysems of linear differenial equaions (Chaper 5) and scalar linear differenial equaions (Chaper ) is much sronger

More information

SUFFICIENT CONDITIONS FOR EXISTENCE SOLUTION OF LINEAR TWO-POINT BOUNDARY PROBLEM IN MINIMIZATION OF QUADRATIC FUNCTIONAL

SUFFICIENT CONDITIONS FOR EXISTENCE SOLUTION OF LINEAR TWO-POINT BOUNDARY PROBLEM IN MINIMIZATION OF QUADRATIC FUNCTIONAL HE PUBLISHING HOUSE PROCEEDINGS OF HE ROMANIAN ACADEMY, Series A, OF HE ROMANIAN ACADEMY Volume, Number 4/200, pp 287 293 SUFFICIEN CONDIIONS FOR EXISENCE SOLUION OF LINEAR WO-POIN BOUNDARY PROBLEM IN

More information

MODULE 3 FUNCTION OF A RANDOM VARIABLE AND ITS DISTRIBUTION LECTURES PROBABILITY DISTRIBUTION OF A FUNCTION OF A RANDOM VARIABLE

MODULE 3 FUNCTION OF A RANDOM VARIABLE AND ITS DISTRIBUTION LECTURES PROBABILITY DISTRIBUTION OF A FUNCTION OF A RANDOM VARIABLE Topics MODULE 3 FUNCTION OF A RANDOM VARIABLE AND ITS DISTRIBUTION LECTURES 2-6 3. FUNCTION OF A RANDOM VARIABLE 3.2 PROBABILITY DISTRIBUTION OF A FUNCTION OF A RANDOM VARIABLE 3.3 EXPECTATION AND MOMENTS

More information

Ordinary Differential Equations

Ordinary Differential Equations Ordinary Differenial Equaions 5. Examples of linear differenial equaions and heir applicaions We consider some examples of sysems of linear differenial equaions wih consan coefficiens y = a y +... + a

More information

Timed Circuits. Asynchronous Circuit Design. Timing Relationships. A Simple Example. Timed States. Timing Sequences. ({r 6 },t6 = 1.

Timed Circuits. Asynchronous Circuit Design. Timing Relationships. A Simple Example. Timed States. Timing Sequences. ({r 6 },t6 = 1. Timed Circuis Asynchronous Circui Design Chris J. Myers Lecure 7: Timed Circuis Chaper 7 Previous mehods only use limied knowledge of delays. Very robus sysems, bu exremely conservaive. Large funcional

More information

Linear Response Theory: The connection between QFT and experiments

Linear Response Theory: The connection between QFT and experiments Phys540.nb 39 3 Linear Response Theory: The connecion beween QFT and experimens 3.1. Basic conceps and ideas Q: How do we measure he conduciviy of a meal? A: we firs inroduce a weak elecric field E, and

More information

Ensamble methods: Boosting

Ensamble methods: Boosting Lecure 21 Ensamble mehods: Boosing Milos Hauskrech milos@cs.pi.edu 5329 Senno Square Schedule Final exam: April 18: 1:00-2:15pm, in-class Term projecs April 23 & April 25: a 1:00-2:30pm in CS seminar room

More information

2. Nonlinear Conservation Law Equations

2. Nonlinear Conservation Law Equations . Nonlinear Conservaion Law Equaions One of he clear lessons learned over recen years in sudying nonlinear parial differenial equaions is ha i is generally no wise o ry o aack a general class of nonlinear

More information

Kriging Models Predicting Atrazine Concentrations in Surface Water Draining Agricultural Watersheds

Kriging Models Predicting Atrazine Concentrations in Surface Water Draining Agricultural Watersheds 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 Kriging Models Predicing Arazine Concenraions in Surface Waer Draining Agriculural Waersheds Paul L. Mosquin, Jeremy Aldworh, Wenlin Chen Supplemenal Maerial Number

More information

Martingales Stopping Time Processes

Martingales Stopping Time Processes IOSR Journal of Mahemaics (IOSR-JM) e-issn: 2278-5728, p-issn: 2319-765. Volume 11, Issue 1 Ver. II (Jan - Feb. 2015), PP 59-64 www.iosrjournals.org Maringales Sopping Time Processes I. Fulaan Deparmen

More information

GMM - Generalized Method of Moments

GMM - Generalized Method of Moments GMM - Generalized Mehod of Momens Conens GMM esimaion, shor inroducion 2 GMM inuiion: Maching momens 2 3 General overview of GMM esimaion. 3 3. Weighing marix...........................................

More information

Two Popular Bayesian Estimators: Particle and Kalman Filters. McGill COMP 765 Sept 14 th, 2017

Two Popular Bayesian Estimators: Particle and Kalman Filters. McGill COMP 765 Sept 14 th, 2017 Two Popular Bayesian Esimaors: Paricle and Kalman Filers McGill COMP 765 Sep 14 h, 2017 1 1 1, dx x Bel x u x P x z P Recall: Bayes Filers,,,,,,, 1 1 1 1 u z u x P u z u x z P Bayes z = observaion u =

More information

Optimal Path Planning for Flexible Redundant Robot Manipulators

Optimal Path Planning for Flexible Redundant Robot Manipulators 25 WSEAS In. Conf. on DYNAMICAL SYSEMS and CONROL, Venice, Ialy, November 2-4, 25 (pp363-368) Opimal Pah Planning for Flexible Redundan Robo Manipulaors H. HOMAEI, M. KESHMIRI Deparmen of Mechanical Engineering

More information

Stable approximations of optimal filters

Stable approximations of optimal filters Sable approximaions of opimal filers Joaquin Miguez Deparmen of Signal Theory & Communicaions, Universidad Carlos III de Madrid. E-mail: joaquin.miguez@uc3m.es Join work wih Dan Crisan (Imperial College

More information

. Now define y j = log x j, and solve the iteration.

. Now define y j = log x j, and solve the iteration. Problem 1: (Disribued Resource Allocaion (ALOHA!)) (Adaped from M& U, Problem 5.11) In his problem, we sudy a simple disribued proocol for allocaing agens o shared resources, wherein agens conend for resources

More information

Overview. COMP14112: Artificial Intelligence Fundamentals. Lecture 0 Very Brief Overview. Structure of this course

Overview. COMP14112: Artificial Intelligence Fundamentals. Lecture 0 Very Brief Overview. Structure of this course OMP: Arificial Inelligence Fundamenals Lecure 0 Very Brief Overview Lecurer: Email: Xiao-Jun Zeng x.zeng@mancheser.ac.uk Overview This course will focus mainly on probabilisic mehods in AI We shall presen

More information

INDEPENDENT SETS IN GRAPHS WITH GIVEN MINIMUM DEGREE

INDEPENDENT SETS IN GRAPHS WITH GIVEN MINIMUM DEGREE INDEPENDENT SETS IN GRAPHS WITH GIVEN MINIMUM DEGREE JAMES ALEXANDER, JONATHAN CUTLER, AND TIM MINK Absrac The enumeraion of independen ses in graphs wih various resricions has been a opic of much ineres

More information

arxiv: v1 [math.oc] 11 Sep 2017

arxiv: v1 [math.oc] 11 Sep 2017 Online Learning in Weakly Coupled Markov Decision Processes: A Convergence ime Sudy Xiaohan Wei, Hao Yu and Michael J. Neely arxiv:709.03465v [mah.oc] Sep 07. Inroducion Absrac: We consider muliple parallel

More information

Online Convex Optimization Example And Follow-The-Leader

Online Convex Optimization Example And Follow-The-Leader CSE599s, Spring 2014, Online Learning Lecure 2-04/03/2014 Online Convex Opimizaion Example And Follow-The-Leader Lecurer: Brendan McMahan Scribe: Sephen Joe Jonany 1 Review of Online Convex Opimizaion

More information

A DELAY-DEPENDENT STABILITY CRITERIA FOR T-S FUZZY SYSTEM WITH TIME-DELAYS

A DELAY-DEPENDENT STABILITY CRITERIA FOR T-S FUZZY SYSTEM WITH TIME-DELAYS A DELAY-DEPENDENT STABILITY CRITERIA FOR T-S FUZZY SYSTEM WITH TIME-DELAYS Xinping Guan ;1 Fenglei Li Cailian Chen Insiue of Elecrical Engineering, Yanshan Universiy, Qinhuangdao, 066004, China. Deparmen

More information

POSITIVE SOLUTIONS OF NEUTRAL DELAY DIFFERENTIAL EQUATION

POSITIVE SOLUTIONS OF NEUTRAL DELAY DIFFERENTIAL EQUATION Novi Sad J. Mah. Vol. 32, No. 2, 2002, 95-108 95 POSITIVE SOLUTIONS OF NEUTRAL DELAY DIFFERENTIAL EQUATION Hajnalka Péics 1, János Karsai 2 Absrac. We consider he scalar nonauonomous neural delay differenial

More information

Outline. lse-logo. Outline. Outline. 1 Wald Test. 2 The Likelihood Ratio Test. 3 Lagrange Multiplier Tests

Outline. lse-logo. Outline. Outline. 1 Wald Test. 2 The Likelihood Ratio Test. 3 Lagrange Multiplier Tests Ouline Ouline Hypohesis Tes wihin he Maximum Likelihood Framework There are hree main frequenis approaches o inference wihin he Maximum Likelihood framework: he Wald es, he Likelihood Raio es and he Lagrange

More information

) were both constant and we brought them from under the integral.

) were both constant and we brought them from under the integral. YIELD-PER-RECRUIT (coninued The yield-per-recrui model applies o a cohor, bu we saw in he Age Disribuions lecure ha he properies of a cohor do no apply in general o a collecion of cohors, which is wha

More information

CHAPTER 10 VALIDATION OF TEST WITH ARTIFICAL NEURAL NETWORK

CHAPTER 10 VALIDATION OF TEST WITH ARTIFICAL NEURAL NETWORK 175 CHAPTER 10 VALIDATION OF TEST WITH ARTIFICAL NEURAL NETWORK 10.1 INTRODUCTION Amongs he research work performed, he bes resuls of experimenal work are validaed wih Arificial Neural Nework. From he

More information

Západočeská Univerzita v Plzni, Czech Republic and Groupe ESIEE Paris, France

Západočeská Univerzita v Plzni, Czech Republic and Groupe ESIEE Paris, France ADAPTIVE SIGNAL PROCESSING USING MAXIMUM ENTROPY ON THE MEAN METHOD AND MONTE CARLO ANALYSIS Pavla Holejšovsá, Ing. *), Z. Peroua, Ing. **), J.-F. Bercher, Prof. Assis. ***) Západočesá Univerzia v Plzni,

More information

Ann. Funct. Anal. 2 (2011), no. 2, A nnals of F unctional A nalysis ISSN: (electronic) URL:

Ann. Funct. Anal. 2 (2011), no. 2, A nnals of F unctional A nalysis ISSN: (electronic) URL: Ann. Func. Anal. 2 2011, no. 2, 34 41 A nnals of F uncional A nalysis ISSN: 2008-8752 elecronic URL: www.emis.de/journals/afa/ CLASSIFICAION OF POSIIVE SOLUIONS OF NONLINEAR SYSEMS OF VOLERRA INEGRAL EQUAIONS

More information

Particle Swarm Optimization Combining Diversification and Intensification for Nonlinear Integer Programming Problems

Particle Swarm Optimization Combining Diversification and Intensification for Nonlinear Integer Programming Problems Paricle Swarm Opimizaion Combining Diversificaion and Inensificaion for Nonlinear Ineger Programming Problems Takeshi Masui, Masaoshi Sakawa, Kosuke Kao and Koichi Masumoo Hiroshima Universiy 1-4-1, Kagamiyama,

More information